Raw data is usually dirty , e.g., missing or inconsistent information, and needs substantial amounts of resources to clean. Data cleaning is a major obstacle in ML and inference on large data. In this project, we propose efficient methods to learn accurate models or infer accurate results over dirty data without cleaning.

 

Publications

People

  • Arash Termehchy
  • Cheng Zhen
  • Nischal Aryal
  • Jose Picado
  • Amandeep Singh Chabada
  • John Davis
  • Claire Lee