Publications

Deep or Simple Models for Semantic Tagging? It Depends on your Data (PDF)
Jinfeng Li, Yuliang Li, Xiaolan Wang, Wang-Chiew Tan – VLDB 2020

Sato: Contextual Semantic Type Detection in Tables (PDF)
Dan Zhang, Yoshihiko Suhara, Jinfeng Li, Madelon Hulsebos, Çağatay Demiralp, Wang-Chiew Tan – VLDB 2020

 

Sampo: Unsupervised Knowledge Base Construction for Opinions and Implications (PDF)
Nikita Bhutani, Aaron Taylor, Chen Chen, Xiaolan Wang, Behzad Golshan, Wang-Chiew Tan – AKBC 2020

 

OpinionDigest: A Simple Framework for Opinion Summarization (PDF)
Yoshihiko Suhara*, Xiaolan Wang*, Stefanos Angelidis, Wang-Chiew Tan – ACL 2020 (short paper) (to appear)
* Equal contribution

 

Teddy: A System for Interactive Review Analysis (PDF)
Xiong Zhang, Jonathan Engel, Sara Evensen, Yuliang Li, Çağatay Demiralp, Wang-Chiew Tan – CHI 2020

 

Snippext: Semi-supervised Opinion Mining with Augmented Data (PDF)
Zhengjie Miao, Yuliang Li, Xiaolan Wang, Wang-Chiew Tan – WWW 2020

 

ExtremeReader: An Interactive Explorer For Customizable And Explainable Review Summarization (PDF)
Xiaolan Wang, Yoshihiko Suhara, Natalie Nuno, Yuliang Li, Jinfeng Li, Nofar Carmeli, Stefanos Angelidis, Eser Kandogan, Wang-Chiew Tan – WWW 2020 (demo)

 

Emu: Enhancing Multilingual Sentence Embeddings with Semantic Specialization (PDF)
Wataru Hirota, Yoshihiko Suhara, Behzad Golshan, Wang-Chiew Tan – AAAI 2020 

 

Happiness Entailment: Automating Suggestions for Well-Being (PDF)
Sara Evensen, Yoshihiko Suhara, Alon Halevy, Wang-Chiew Tan, Saran Mumick – Affective Computing & Intelligent Interaction (ACII) 2019

 

Building a Hotel Concierge Bot: an industrial case study (PDF)
Behzad Golshan, George Mihaila, Chen Chen, Jonathan Engel, Alon Halevy, Yoshihiko Suhara, Wang-Chiew Tan, Michael Matuschek (TrustYou) – CAST 2019

 

Subjective Databases (PDF)
Yuliang Li, Aaron Feng, Jinfeng Li, Saran Mumick, Alon Halevy, Vivian Li, Wang-Chiew Tan – VLDB 2019

 

Semantic Cross-lingual Sentence Embedding 
Wataru Hirota, Yoshihiko Suhara, Behzad Golshan, Wang-Chiew Tan – RepL4NLP@ACL 2019

 

Open Information Extraction from Question-Answer Pairs (PDF)
Nikita Bhutani, Yoshihiko Suhara, Wang-Chiew Tan, Alon Halevy, H. V. Jagadish – NAACL-HLT 2019

 

Essentia: Mining Domain-specific Paraphrases with Word-Alignment Graphs (PDF)
Danni Ma, Chen Chen, Behzad Golshan, Wang-Chiew Tan – TextGraphs 2019

 

Voyageur: An Experiential Travel Search Engine (PDF)
Sara Evensen, Aaron Feng, Alon Halevy, Jinfeng Li, Vivian Li, Yuliang Li, Huining Liu, George Mihaila, John Morales, Natalie Nuno, Ekaterina Pavlovic, Wang-Chiew Tan, Xiaolan Wang – WWW 2019 (Demonstration)

 

FrameIt: Ontology Discovery for Noisy User-Generated Text (PDF)
Dan Iter, Alon Y. Halevy, Wang-Chiew Tan – NUT@EMNLP 2018: 173-183

 

Scalable Semantic Querying of Text (PDF) 
Xiaolan Wang, Aaron Feng, Behzad Golshan, Alon Y. Halevy, George A. Mihaila, Hidekazu Oiwa, Wang-Chiew Tan – PVLDB 11(9): 961-974 (2018)

– arXiv version

 

Koko: A System for Scalable Semantic Querying of Text (PDF)
Xiaolan Wang, Jiyu Komiya, Yoshihiko Suhara, Aaron Feng, Behzad Golshan, Alon Y. Halevy, Wang-Chiew Tan – PVLDB 11(12): 2018-2021 (2018) (Demonstration)

 

BigGorilla: An Open-Source Ecosystem for Data Preparation and Integration (PDF)
Chen Chen, Behzad Golshan, Alon Y. Halevy, Wang-Chiew Tan, AnHai Doan – IEEE Data Eng. Bull. 41(2): 10-22 (2018)

 

HappyDB: A Corpus of 100,000 Crowdsourced Happy Moments (PDF)
Akari Asai, Sara Evensen, Behzad Golshan, Alon Y. Halevy, Vivian Li, Andrei Lopatenko, Daniela Stepanov, Yoshihiko Suhara, Wang-Chiew Tan, Yinzhan Xu – LREC 2018

– HappyDB corpus: 100,000 crowd-sourced happy moments

– HappyDB Kaggle task

 

A Lightweight Front-end Tool for Interactive Entity Population (PDF) 
Hidekazu Oiwa, Yoshihiko Suhara, Jiyu Komiya, Andrei Lopatenko – ICML Workshop on Interactive Machine Learning 2017

 

DeepMood: Forecasting Depressed Mood Based on Self-Reported Histories via Recurrent Neural Networks (PDF)
Yoshihiko Suhara, Yinzhan Xu (MIT), Alex 'Sandy' Pentland (MIT) – WWW 2017

 

Data Integration: After the Teenage Years (PDF)
Alon Halevy, Wang-Chiew Tan, George Mihaila, Behzad Golshan – SIGMOD/PODS Conference 2017

 

CoFE: A Collaborative Feature Engineering Framework for Data Science
Yoshihiko Suhara (MIT Media Lab), Hideki Awashima (Recruit Institute of Technology), Hidekazu Oiwa (Recruit Institute of Technology), Alex Pentland (MIT Media Lab) – HCOMP 2016

 

Managing Google’s data lake: an overview of the Goods system (PDF)
Alon Halevy, Flip Korn, Natalya F. Noy, Christopher Olston, Neoklis Polyzotis, Sudip Roy, Steven Euijong Whang – IEEE Data Eng. Bull. 39(3) (work done at Google)

 

Goods: Organizing Google’s Datasets (PDF)
Alon Halevy, Flip Korn, Natalya F. Noy, Christopher Olston, Neoklis Polyzotis, Sudip Roy, Steven Euijong Whang – SIGMOD Conference 2016 (work done at Google)

 

Discovering Structure in the Universe of Attribute Names (PDF)
Alon Halevy, Natalya F. Noy, Sunita Sarawagi, Steven Euijong Whang, Xiao Yu – WWW 2016: 939-949 (work done at Google)