2nd WIT: Workshop On Deriving Insights From User-Generated Text @ACL2022

Workshop Overview

Every day, millions of people use Web platforms and online services, generating massive amounts of data about products and services. This data can be structured such as in tables; semi-structured such as in call-center records and dialogues; or unstructured such as in customer reviews and forums. The goal of this workshop is to advance research over user-generated text. Recent progress in natural language processing, machine learning, knowledge bases and database management have demonstrated promising results and far-reaching uses of text. However, there is tremendous untapped potential in exploring and exploiting advanced AI/ML/NLP techniques on user-generated text, which is rich in user insights and experiences. The goal of this workshop is to bring together researchers and practitioners in this area, to clarify impactful research problems, share findings from adaptation of existing approaches to user-generated data, and generate new ideas for future research. We seek papers that address challenges in harnessing user-generated data.  

Confirmed Invited Speakers

Confirmed Speaker (alphabetically ordered list):

Stanford University
University of Edinburgh
Carnegie Mellon University
University of Michigan, Ann Arbor


Friday, May 27th, 2022
9:20 – 9:30 am
Opening Remarks
9:30 – 10:30 am
Invited Talk I
10:30 – 11:00 am
Coffee Break
11:00 – 11:30 am
Session 1:
An Interactive Analysis of User-reported Long COVID Symptoms using Twitter Data
– Lin Miao, Mark Last, and Marina Litvak
Bi-Directional Recurrent Neural Ordinary Differential Equations for Social Media Text Classification
– Maunika Tamire, Srinivas Anumasa, and P. K. Srijith
11:30- 12:30 pm
Invited Talk II
12:30 – 2:00 pm
Poster Session (and Lunch Break)
2:00 – 3:00 pm
Invited Talk III
3:00 – 3:30 pm
Coffee Break
3:30 – 3:45 pm
Session 2
Unsupervised Abstractive Dialogue Summarization with Word Graphs and POV Conversion
– Seongmin Park and Jihwa Lee
3:45 – 4:45 pm
Invited Talk IV
4:45 – 5:45 pm

Call For Submissions

We invite submissions of long and short papers on original and unpublished work that address challenges around harnessing textual user-generated data, such as that found in online reviews, web forums, and crowdsourced data. The topics include but are not limited to:

  • Information extraction from user-generated text
  • Domain adaptation to user-generated text
  • Data cleaning
  • Entity matching in user-generated text
  • Semantic Search
  • Robustness to noise
  • Summarization of user-generated text
  • Language generation
  • (Commonsense) knowledge bases from user-generated text
  • Information Seeking QA/Dialogue


All machine learning, text mining, and natural language processing techniques are welcome. All regular papers and short papers should follow the ACL 2022 style guidelines and multiple submission policy. The maximum length of a regular paper is 8 pages plus unlimited number of pages for references. The maximum length of a short paper is 4 pages plus unlimited number of pages for references. At least one author of every accepted paper is expected to attend the workshop.

WIT is using a hybrid submission process, Authors can submit their papers using the OpenReview platform. Alternatively, authors can also commit papers and reviews from ARR.  We allow parallel commitment to the NAACL 2022 conference and our workshop, with the requirement that if the paper is accepted at NAACL 2022, it will be withdrawn from archival publication at the workshop. We ask the authors to notify WIT if they have also committed to NAACL or other workshops.

All accepted short and long papers must be presented as talk/poster/demo at the workshop, depending on the workshop schedule. At least one author of each accepted paper must register for ACL 2022 and attend the workshop.

Important Dates

  • Workshop Paper Due Date
    • submission via OpenReview: February 28, 2022 March 6, 2022
    • submission via ARR
      • final ARR deadline: January 15, 2022
      • commitment deadline: March 14, 2022
  • Notification of Acceptance: March 26, 2022
  • Camera-ready papers due: April 10, 2022
  • Workshop On Deriving Insights From User-Generated Text @ACL2022: May 27, 2022

Mentorship Program

We will be hosting a mentorship program to facilitate exchange between authors and experts working in areas relevant to the workshop. The goal of this program is to foster collaborations and increase the quality and impact of the submissions. Mentors are expected to guide mentees during the mentorship period as they prepare submissions for this workshop. Their interactions may include an in-depth discussion of related work to ensure submissions are well contextualized, discussions on writing and presentation of the submission, and discussions about possible extensions of the submissions. Mentees are expected to prepare the submissions by Feb. 7, 2022 and contact their assigned mentor. Mentors and mentees are encouraged to dedicate at least 4hrs over the course of the program to maximize the benefits of the program. They can meet virtually within the first week after the mentor-mentee matching is made and set expectations for subsequent meetings. Their efforts should culminate in a final version of the paper that should be submitted by the deadline.

Applications to the mentorship program are due by Feb. 1, 2022 now closed.

Application to be a mentor/mentee: link now closed  

Program Committee

Sara Abdali – Georgia Institute of Technology
Shabnam Behzad – Georgetown University
Arthur Brazinskas – University of Edinburgh
Brett Zhiyuan Chen – Google
Maisa Duarte – Bradesco Bank – Brazil
Nelson Ebecken – COPPE/UFRJ Federal University of Rio de Janeiro – Brazil
Jacob Eisenstein – Google
Joao Gama – University of Porto – Portugal
Tianyu Jiang – University of Utah
Hannah Kim – Megagon Labs
Aljaz Kosmerlj – Viaduct.ai
Thom Lake – Indeed.com
Yutong Li – Apple
Jun Ma – Amazon
Vagelis Papalexakis – UC Riverside
Jing Qian – University of California Santa Barbara
Sajjadur Rahman – Megagon Labs
Yutong Shao – UC San Diego
Evan Shieh – Amazon
Nedelina Teneva – Amazon
Xiaolan Wang – Megagon Labs
Xinyi(Cindy) Wang – Carnegie Mellon University
Yusuke Watanabe – Amazon
Chris Welty – Google
Natasha Zhang Foutz – University of Virginia


Megagon Labs
Carnegie Mellon University
Jozef Stefan Institute (JSI)
Jozef Stefan Institute (JSI)
Megagon Labs


If you have any questions or inquiries regarding the workshop or need further information, please do not hesitate to send an email to wit@megagon.ai.