Publications
2024
Towards Consistent Language Models Using Controlled Prompting and Decoding
Jasmin Mousavi and Arash Termehchy
SIGMOD workshop on Data Management for End-to-End Machine Learning, June, 2024.
ShiftScope: Adapting Visualization Recommendations to Users’ Dynamic Data Focus [Tutorial_Video ]
Sanad Saha, Nischal Aryal, Leilani Battle and Arash Termehchy
The Proceedings of the ACM on Management of Data (SIGMOD) 2024, [Demonstration Track] (to appear)
User Learning In Interactive Data Exploration [BibTex ]
Sanad Saha, Nischal Aryal, Leilani Battle and Arash Termehchy
40th IEEE International Conference on Data Engineering (ICDE) [Lightning Talk], May, 2024.
Certain and Approximately Certain Models for Statistical Learning
Cheng Zhen, Nischal Aryal, Arash Termehchy and Amandeep Singh Chabada
The Proceedings of the ACM on Management of Data (SIGMOD), Article 126, 2024
Towards Consistent Language Models Using Controlled Prompting and Decoding [Slides ] [Poster ] [BibTeX ]
Jasmin Mousavi and Arash Termehchy
AAAI workshop on Neuro-Symbolic Learning and Reasoning in the Era of Large Language Models, February, 2024.
How Does User Behavior Evolve During Exploratory Visual Analysis? [Poster ] [BibTex ]
Sanad Saha, Nischal Aryal, Leilani Battle and Arash Termehchy
AAAI workshop on Collaborative AI and Modeling of Humans, February, 2024.
2023
Multi-Agent Join [BibTeX ]
Vahid Ghadakchi, Arash Termehchy, Mian Xie, Bakhtiyar Doskenov, Bharghav Srikhakollu, Summit Haque, Huazheng Wang
Technical Report (in arXiv): arXiv:2312.14291 [cs.DB], December 2023
Multi-Agent Join
Vahid Ghadakchi, Arash Termehchy, Mian Xie, Bakhtiyar Doskenov, Bharghav Srikhakollu, Summit Haque, Huazheng Wang
NeurIPS Workshop on ML for Systems, 2023
Towards Consistent Language Models Using Declarative Constraints [BibTeX ]
Jasmin Mousavi and Arash Termehchy
Technical Report (in arXiv): arXiv:2312.15472 [cs.DB], December 2023
Modeling and Analyzing User Behavior During Exploratory Visual Analysis
Sanad Saha, Nischal Aryal, Leilani Battle and Arash Termehchy
Technical Report (in arXiv): arXiv:2312.09407 [cs.HC], December 2023
Towards Consistent Large Language Models Using Declarative Constraints [Slides ] [BibTeX ]
Jasmin Mousavi and Arash Termehchy
The Proceedings of VLDB Workshop on Databases and Large Language Models (LLMDB), August, 2023.
Generating Data Integration Queries Using Large Language Models [Slides ] [Poster ] [Code+Data ] [BibTeX ]
Christopher Buss, Jasmin Mousavi, Mikhail Tokarev, Arash Termehchy, David Maier, and Stefan Lee
The Proceedings of VLDB Workshop on Databases and Large Language Models (LLMDB), August, 2023.
Also presented at the 2nd NeurIPS Table Representation Learning Workshop, December, 2023.
Effective Entity Augmentation By Querying External Data Sources [Slides ] [Poster ] [Video ] [Code+Data ] [BibTeX ]
Christopher Buss, Jasmin Mousavi, Mikhail Tokarev, Arash Termehchy, David Maier, and Stefan Lee
The Proceedings of the VLDB Endowment (PVLDB), Vol. 16, 2023.
Exploratory Training: When Annotators Learn About Data [Slides ]
Rajesh Shrestha, Omeed Habibelahian, Arash Termehchy, and Paolo Papotti
The Proceedings of the ACM on Management of Data (SIGMOD), Vol. 2, 2023.
When Can We Ignore Missing Data in Model Training? [Slides ]
Cheng Zhen, Amandeep Singh Chabada, Arash Termehchy
The Proceedings of SIGMOD Workshop on Data Management for End-to-End Machine Learning (DEEM), June 2023.
2022
Exploratory Training: When the Trainers Learn
Omeed Habibelahian, Rajesh Shrestha, Arash Termehchy, and Paolo Papotti
The Proceedings of SIGMOD Workshop on Human-In-the-Loop Data Analytics (HILDA), June, 2022
Also presented at NeurIPS Workshop on Human in the Loop Learning (HILL), December, 2022
RTX-KG2: A System for Building a Semantically Standardized Knowledge Graph for Translational Biomedicine
E. C. Wood, Amy K. Glen, Lindsey G. Kvarfordt, Finn Womack, Liliana Acevedo, Timothy Yoon, Chunyu Ma, Veronica Flores, Meghamala Sinha, Yodsawalai Chodpathumwan, Arash Termehchy, Jared C. Roach, Luis Mendoza, Andrew S. Hoffman, Eric Deutsch, David Koslicki, Stephen A. Ramsey
BMC Bioinformatics, 23:400, 2022.
https://doi.org/10.1186/s12859-022-04932-3
Effective Entity Augmentation By Querying External Data Sources
Christopher Buss, Mikhail Tokarev, Jasmin Mousavi, Arash Termehchy, David Maier, and Stefan Lee
Technical Report, 2022
2021
Structural Generalizability: The Case of Similarity Search
Yodsawalai Chodpathumwan, Arash Termehchy, Aayam Shrestha, Stephen Ramsey, Amy Glen, and Zheng Liu
The Proceedings of SIGMOD, 2021.
The full version with proofs
Scalable and Usable Relational Learning With Automatic Language Bias
Jose Picado, Arash Termehchy, Alan Fern, Sudhanshu Pathak, Praveen Ilango, and John Davis
The Proceedings of SIGMOD, 2021.
RTX-KG2: A System for Building a Semantically Standardized Knowledge Graph for Translational Biomedicine
E. C. Wood, Amy K. Glen, Lindsey G. Kvarfordt, Finn Womack, Liliana Acevedo, Timothy Yoon, Chunyu Ma, Veronica Flores, Meghamala Sinha, Yodsawalai Chodpathumwan, Arash Termehchy, Jared C. Roach, Luis Mendoza, Andrew S. Hoffman, Eric Deutsch, David Koslicki, Stephen A. Ramsey
bioRxiv, 2021.
2020
Learning Over Dirty Data Without Cleaning [Slides ]
Jose Picado, John Davis, Arash Termehchy, and Claire Lee
The Proceedings of SIGMOD, 2020.
Full version with proofs
Bandit Join: Preliminary Results
Vahid Ghadakchi, Mian Xie, Arash Termehchy
The Proceedings of SIGMOD Workshop on AI & Data Management (aiDM), 2020.
Usable & Scalable Learning Over Relational Data With Automatic Language Bias
Jose Picado, Arash Termehchy, Alan Fern, Sudhanshu Pathak, Praveen Ilango, and Yunqiao Cai
Technical Report: arXiv:1710.01420, April 2020.
2019
A Game-Theoretic Approach to Data Interaction
Ben McCamish, Vahid Ghadakchi, Arash Termehchy, Behrouz Touri and Liang Huang
The ACM Transactions on Database Systems (TODS)
How Do Users and Data Systems Establish a Common Query Language?
Ben McCamish, Vahid Ghadakchi, Arash Termehchy, Liang Huang and Behrouz Touri
SIGMOD Record, ACM SIGMOD Research Highlights, 48(1), 2019
Less Data Delivers Higher Effectiveness for Keyword Queries
Vahid Ghadakchi, Abtin Khodadad, and Arash Termehchy
In Proceedings of Statistical and Scientific Database Management (SSDBM), July 2019
Logically Scalable and Efficient Relational Learning
Jose Picado, Arash Termehchy, Alan Fern, and Parisa Ataei
The VLDB Journal, 2019.
Leveraging Human Learning in Interactive Data Exploration
Sanad Saha, Leilani Battle, Arash Termehchy
The Proceedings of VLDB Workshop on Conversational Access to Data (CAST), August, 2019.
2018
Progressive Interaction for Autonomous Entity Matching [Slides ]
Ben McCamish, Arash Termehchy
The Proceedings of VLDB Workshop on Polystore Systems (Poly), September 2018.
Managing Structurally Heterogeneous Databases in Software Product Lines
Parisa Ataei, Arash Termehchy, and Eric Walkingshaw
The Proceedings of VLDB Workshop on Polystore Systems (Poly), September 2018.
Learning Efficiently Over Heterogeneous Databases [Poster ]
Jose Picado, Sudhanshu Pathak, and Arash Termehchy
The Proceedings of the VLDB Endowment (Demonstration Track), August 2018.
Learning Over Heterogeneous Databases: Sampling and Constraints to the Rescue
Jose Picado, Sudhanshu Pathak, and Arash Termehchy
The Proceedings of SIGMOD Workshop on Data Management for End-to-End Machine Learning (DEEM), June 2018.
The Data Interaction Game [One-slide teaser ] [Slides ]
Ben McCamish, Vahid Ghadakchi, Arash Termehchy, Behrouz Touri and Liang Huang
The Proceedings of SIGMOD, June 2018.
Selected as one of the best papers in SIGMOD 2018
Cost-Effective Conceptual Design Using Taxonomies
Yodsawalai Chodpathumwan, Ali Vakilian, Arash Termehchy and Amir Nayyeri
The VLDB Journal, April 2018.
AutoMode: Relational Learning With Less Black Magic
Jose Picado, Sudhanshu Pathak, Arash Termehchy, and Alan Fern
The Proceedings of ICDE (Demonstration Track), April 2018.
There is no Dichotomy Between Effectiveness and Efficiency in Keyword Query Processing [Slides ]
Vahid Ghadakchi, Arash Termehchy
The Proceedings of ICDE (Lightning Talk), April 2018.
2017
Variational Databases
Parisa Ataei, Arash Termehchy, and Eric Walkingshaw
The Proceedings of International Symposium on Database Programming Languages (DBPL), September 2017.
Schema Independent Relational Learning [One-slide teaser ] [Slides ]
Jose Picado, Arash Termehchy, Alan Fern, and Parisa Ataei
The Proceedings of SIGMOD, May 2017.
Cost-Effective Conceptual Design Using Taxonomies
Yodsawalai Chodpathumwan, Ali Vakilian, Arash Termehchy and Amir Nayyeri
In Proceedings of SIGMOD Workshop on Web and Databases (WebDB), May 2017.
Towards Automatically Setting Language Bias in Relational Learning [Slides ]
Jose Picado, Arash Termehchy, Alan Fern, and Sudhanshu Pathak
In Proceedings of SIGMOD Workshop on Data Management for End-to-End Machine Learning (DEEM), May 2017
A Signaling Game Approach to Databases Querying - A Progress Report
Ben McCamish, Arash Termehchy, Behrouz Touri
The Proceedings of SIGMOD Workshop on Human-In-the-Loop Data Analytics (HILDA), May 2017.
Representational Scalability
Jose Picado
The Conference on Innovative Data Systems Research (CIDR), abstract, January 2017.
Reaching Mutual Understanding in a Society of Humans and Database Systems
Arash Termehchy
The Conference on Innovative Data Systems Research (CIDR), abstract, January 2017.
2016
Realizing Representation Independent Analytics
Yodsawalai Chodpathumwan, Jose Picado, Arash Termehchy, Alan Fern, and Yizhou Sun
The ICDM Workshop on Data Wrangling Automation, December 2016.
Towards Representation Independent Similarity Search Over Graph Databases
Yodsawalai Chodpathumwan, Amirhossein Aleyasin, Arash Termehchy, and Yizhou Sun
The Proceedings of CIKM, October 2016.
Schema Independent and Scalable Relational Learning By Castor
Jose Picado, Parisa Ataei, Arash Termehchy, and Alan Fern
The Proceedings of the VLDB Endowment (Demonstration Track), September 2016.
A Signaling Game Approach to Databases Querying and Interaction
Ben McCamish, Arash Termehchy, Behrouz Touri
Technical Report: arXiv:1603.04068, March 2016.
2015
Towards Schema Independent Relational Learning
Jose Picado, Arash Termehchy, and Alan Fern
The NIPS Workshop on Machine Learning Systems, December 2015.
A Signaling Game Approach to Database Querying
Arash Termehchy and Behrouz Touri
The Proceedings of SIGIR International Conference on Theory of Information Retrieval (ICTIR), September 2015.
Schema Independent Relational Learning
Jose Picado, Arash Termehchy, and Alan Fern
Technical Report: arXiv:1508.03846, August 2015.
Representation Independent Similarity and Proximity Search
Yodsawalai Chodpathumwan, Amirhossein Aleyasin, Arash Termehchy, and Yizhou Sun
Technical Report: arXiv:1508.03763, August 2015.
Universal-DB: Towards Representation Independent Graph Analytics
Yodsawalai Chodpathumwan, Amirhossein Aleyasin, Arash Termehchy and Yizhou Sun
The Proceedings of the VLDB Endowment (Demonstration Track), September 2015.
Cost-Effective Conceptual Design Using Taxonomies
Ali Vakilian, Yodsawalai Chodpathumwan, Arash Termehchy and Amir Nayyeri
Technical Report: arXiv:1503.05656, April 2015.
Cost Effective Conceptual Design for Information Extraction
Arash Termehchy, Ali Vakilian, Yodsawalai Chodpathumwan and Marianne Winslett
The ACM Transactions on Database Systems (TODS), June 2015.
2014
Representation Independent Analytics Over Structured Data
Yodsawalai Chodpathumwan, Jose Picado, Arash Termehchy, Alan Fern, and Yizhou Sun
Technical Report: arXiv:1409.2553, August 2014.
Which Concepts Are Worth Extracting? [Slides ]
Arash Termehchy, Ali Vakilian, Yodsawalai Chodpathumwan, and Marianne Winslett
The Proceedings of SIGMOD, June 2014.
Schema Independence of Relational Learning Algorithms
Jose Picado, Arash Termehchy, and Alan Fern
The SIGMOD Workshop on Big Uncertain Data (BUDA), June 2014.
Toward Representation Independent Similarity Search Over Graphs
Yodsawalai Chodpathumwan, Arash Termehchy, Yizhou Sun, Amirhossein Aleyasin, and Jose Picado
The SIGMOD Workshop on Graph Data Management Experiences and Systems (GRADES), June 2014.
Efficient Prediction of Difficult Keyword Queries over Databases
Chen Shiwen, Arash Termehchy, and Vagelis Hristidis
The IEEE Transactions on Knowledge and Data Engineering (TKDE), March 2014.
Before 2014
Predicting the Effectiveness of Keyword Queries on Databases
Chen Shiwen, Arash Termehchy, and Vagelis Hristidis
The Proceedings of ACM International Conference on Information and Knowledge Management (CIKM), October 2012 (13.5% acceptance).
Schema Independent Query Interfaces
Arash Termehchy, Marianne Winslett, Yodsawalai Chodpathumwan, and Austin Gibbons
The IEEE Transactions on Knowledge and Data Engineering (TKDE), Special Issue on the Best Papers of ICDE 2011, July 2012.
How Schema Independent are Schema Free Query Interfaces? [Teaser Slide ]
Arash Termehchy, Marianne Winslett, and Yodsawalai Chodpathumwan
The Proceedings of IEEE International Conference on Data Engineering (ICDE), April 2011 (19.8% acceptance).
Best Student Paper Award; Best of Conference Selection.
Using Structural Information in XML Keyword Search Effectively
Arash Termehchy and Marianne Winslett
The ACM Transactions on Database Systems (TODS), March 2011.
EXTRUCT: Using Deep Structural Information in XML Keyword Search
Arash Termehchy and Marianne Winslett
The Proceedings of the VLDB Endowment (PVLDB), September 2010.
Keyword Search over Key-Value Stores
Arash Termehchy and Marianne Winslett
The Proceedings of the World Wide Web Conference (WWW), April 2010 (poster paper).
Keyword Search for Data-Centric XML Collections with Long Text Fields
Arash Termehchy and Marianne Winslett
The Proceedings of the International Conference on Extending Database Technology (EDBT), 2010 (18% acceptance).
Effective, Design Independent XML Keyword Search
Arash Termehchy and Marianne Winslett
The ACM International Conference on Information and Knowledge Management (CIKM), October 2009 (14.5% acceptance).
Maitri: A Format-Independent Framework for Managing Large Scale Scientific Data
Rishi R. Sinha, Arash Termehchy, Marianne Winslett, Soumyadeb Mitra, and John Noris
The Conference on Innovative Data Systems Research (CIDR), January 2007.