The Data-AI Symbiosis (DAIS) group at Megagon Labs explores research problems at the intersection of data management and AI. At its core, the DAIS group is working toward building the next-generation data platform that enables self-serving data analytics at scale within compound AI systems involving multi-agent workflows.
Advances in large language models (LLMs), specifically deep language understanding capabilities, offer new opportunities to tackle classic data-management problems such as data integration, entity matching, and data discovery. Our work in the AI-for-data-management area focuses on exploiting language models and state-of-the-art machine-learning approaches for data discovery in data lakes, tabular data understanding, data augmentation for data management, and natural language to domain-specific query generation.
Conversely, as LLMs are increasingly adopted in enterprise systems — where accuracy, privacy, trust, governance, and explainability are of the utmost importance — it is necessary to develop systematic approaches toward enhancing knowledge-intensive query understanding, knowledge retrieval over heterogeneous data sources, optimization during retrieval and querying, robustness in fact-checking and verification, and flexibility in domain adaptation. Our work in the data-management-for-AI area focuses on enterprise data cataloging, fact-checking and verification, data lake usability, and benchmarking multi-agent systems to enable effective knowledge grounding and contextualization for knowledge-guided generation with LLMs.