Blog


Megagon Team Profile: Dan Zhang
Dan Zhang, Research Manager and Senior Research Engineer at Megagon Labs, gives us a recount

Weedle: Composable Dashboard for Data-Centric NLP in Computational Notebooks
To help NLP researchers and practitioners understand and improve their data, we introduce Weedle, an

Magneton: Transparent and Customizable Widget Framework for Jupyter Notebooks
Data practitioners have widely adopted computational notebooks such as Jupyter Notebooks due to the relative

Feature Stores: Deep Learning, NLP, and Knowledge Graphs
We will introduce feature stores and examine the implications of deep learning on feature stores

Sudowoodo: Contrastive Self-supervised Learning for Data Integration Applications
We introduce Sudowoodo, an end-to-end framework for a variety of data integration applications to resolve

The First Workshop on Matching: Introduction, Scope, and Highlights
In this workshop, we are interested in (but not restricted to) the dimensions of matching

MEGAnno: Exploratory Labeling for NLP in Jupyter Notebooks
In this blog post, we present MEGAnno, our flexible, exploratory, efficient, and seamless labeling

Highlights of 2022 at Megagon Labs
2022 was a very productive year for our research team to bring forth many ideas

Summarizing Community-based Question-Answer Pairs
Megagon Labs researchers proposed a new CQA Summarization task focused on summarizing QA pairs in

Hybrid Active Learning for Low-Resource LM Fine-tuning
We identified two key designs that can improve the effectiveness and efficiency of sample acquisition:

Paraphrase Generation for Long Text
In this blog post, we will define the problem of paraphrasing. We will explain the

ACM SIGKDD 2022 Conference Highlight
The ACM SIGKDD conference is the premier forum for the advancement, education, and adoption of