Blog

Hybrid Active Learning for Low-Resource LM Fine-tuning

We identified two key designs that can improve the effectiveness and efficiency of sample acquisition: random sampling reduces the unlabeled pool being considered for acquisition, and decouples the diversity and uncertainty objectives in hybrid acquisition. Based on an investigation of existing methods, we propose a novel active learning method: TYROGUE.

Read More »

Paraphrase Generation for Long Text

In this blog post, we will define the problem of paraphrasing. We will explain the challenges of document-level paraphrasing, especially in the business domain. These challenges include evaluation. Following this, we will briefly describe the results of the survey study, and identify key ideas.

Read More »

ACM SIGKDD 2022 Conference Highlight

The ACM SIGKDD conference is the premier forum for the advancement, education, and adoption of computer science, specifically for knowledge discovery and data mining. Get an inside view of what happened at this year’s conference.

Read More »

NAACL 2022 Highlights

NAACL is a key conference for our research work, as such 6 of our researchers attended the event and Megagon Labs was a Platinum level sponsor of the event. To share with you what we gathered from the conference, below we summarize papers, workshops, invited talks, and other conference events. We found the topics both interesting and relevant to the ongoing research at Megagon Labs.

Read More »