Jordan Boyd-Graber: Pubs (Authors)

Sort by: Year Category Authors Venue Project

Aaron Gerow

Aaron Gerow, Yuening Hu, Jordan Boyd-Graber, David M. Blei, and James A. Evans. Measuring Discursive Influence Across Scholarship. Proceedings of the National Academies of Science, 2018. [Journal] [Bibtex]

@article{Gerow:Hu:Boyd-Graber:Blei:Evans-2018,
	Title = {Measuring Discursive Influence Across Scholarship},
	Author = {Aaron Gerow and Yuening Hu and Jordan Boyd-Graber and David M. Blei and James A. Evans},
	Journal = {Proceedings of the National Academies of Science},
	Year = {2018},
}

@article{Gerow:Hu:Boyd-Graber:Blei:Evans-2018,
	Title = {Measuring Discursive Influence Across Scholarship},
	Author = {Aaron Gerow and Yuening Hu and Jordan Boyd-Graber and David M. Blei and James A. Evans},
	Journal = {Proceedings of the National Academies of Science},
	Year = {2018},
}

Ahmed Elgohary

Denis Peskov, Benny Cheng, Ahmed Elgohary Ghoneim, Joe Barrow, Cristian Danescu-Niculescu-Mizil, and Jordan Boyd-Graber. It Takes Two to Lie: One to Lie and One to Listen. Association for Computational Linguistics, 2020. [Video] [Podcast] [Data and Code] [Bibtex]
```
@inproceedings{Peskov:Cheng:Elgohary:Barrow:Danescu-Niculescu-Mizil:Boyd-Graber-2020,
	Title = {It Takes Two to Lie: One to Lie and One to Listen},
	Author = {Denis Peskov and Benny Cheng and Ahmed Elgohary and Joe Barrow and Cristian Danescu-Niculescu-Mizil and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2020},
	Location = {The Cyberverse Simulacrum of Seattle},
	Url = {http://cs.umd.edu/~jbg//docs/2020_acl_diplomacy.pdf},
}
```
Accessible Abstract: Machine learning techniques to detect deception in online communications requires training and evaluation data. However, there is a dearth of data either because of uncertain gold labels or privacy concerns; we create a new, large deception-centered dataset in the online game of Diplomacy. We gathered 17,289 messages from 12 games (each of which took over a month) involving 84 players, the majority of which were unique users. This data was collected with a custom-made bot that allowed us to collect messages and annotations. The user pool was created from scratch: we varied participant demographics across gender, age, nationality, and past game experience. Some of our participants included the former president of the Diplomacy players' association, several top ranked players in the world, a board game shop owner, and scientists. We create machine learning models to detect lies using linguistic, context, and power-dynamic features. Our best model had similar lie detection accuracy to humans.

Ahmed Elgohary Ghoneim, Denis Peskov, and Jordan Boyd-Graber. Can You Unpack That? Learning to Rewrite Questions-in-Context. Empirical Methods in Natural Language Processing, 2019. [Data] [Bibtex]

@inproceedings{Elgohary:Peskov:Boyd-Graber-2019,
	Title = {Can You Unpack That? Learning to Rewrite Questions-in-Context},
	Author = {Ahmed Elgohary and Denis Peskov and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2019},
	Location = {Hong Kong, China},
	Url = {http://cs.umd.edu/~jbg//docs/2019_emnlp_sequentialqa.pdf},
}

Ahmed Elgohary Ghoneim, Chen Zhao, and Jordan Boyd-Graber. Dataset and Baselines for Sequential Open-Domain Question Answering. Empirical Methods in Natural Language Processing, 2018. [Data] [Bibtex]

@inproceedings{Elgohary:Zhao:Boyd-Graber-2018,
	Title = {Dataset and Baselines for Sequential Open-Domain Question Answering},
	Author = {Ahmed Elgohary and Chen Zhao and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2018},
	Location = {Brussels, Belgium},
	Url = {http://cs.umd.edu/~jbg//docs/2018_emnlp_linked.pdf},
}

Alexander Hoyle

Nishant Balepur, Matthew Shu, Alexander Hoyle, Alison Robey, Shi Feng, Seraphina Goldfarb-Tarrant, and Jordan Boyd-Graber. A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick. Empirical Methods in Natural Language Processing, 2024. [Code and Data] [Research Talk] [Bibtex]
```
@inproceedings{Balepur:Shu:Hoyle:Robey:Feng:Goldfarb-Tarrant:Boyd-Graber-2024,
	Title = {A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick},
	Author = {Nishant Balepur and Matthew Shu and Alexander Hoyle and Alison Robey and Shi Feng and Seraphina Goldfarb-Tarrant and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2024},
	Location = {Miami},
	Url = {http://cs.umd.edu/~jbg//docs/2024_emnlp_mnemonic.pdf},
}
```
Accessible Abstract: Learning vocabulary (e.g., benevolent) can be tedious, but using mnemonics (e.g., benevolent sounds like "benefits," and a kind boss gives benefits) makes it more engaging and effective. This paper introduces SMART, a large language model trained to produce mnemonics based on feedback from flashcard learners. Students struggle to predict which mnemonics will help them most. Still, by training SMART on both student preferences and learning outcomes, we can generate mnemonics as effectively as GPT-4, but at a much lower cost.
Pedro Rodriguez, Joe Barrow, Alexander Hoyle, John P. Lalor, Robin Jia, and Jordan Boyd-Graber. Evaluation Examples Are Not Equally Informative: How Should That Change NLP Leaderboards?. Association for Computational Linguistics, 2021. [Results and Code] [Paper Read Aloud] [Research Talk Video] [Code and Data] [Bibtex]
```
@inproceedings{Rodriguez:Barrow:Hoyle:Lalor:Jia:Boyd-Graber-2021,
	Title = {Evaluation Examples Are Not Equally Informative: How Should That Change NLP Leaderboards?},
	Author = {Pedro Rodriguez and Joe Barrow and Alexander Hoyle and John P. Lalor and Robin Jia and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2021},
	Url = {http://cs.umd.edu/~jbg//docs/2021_acl_leaderboard.pdf},
}
```
Accessible Abstract: When can we call an AI "intelligent"? Just like humans, a common approach is to ask them a bunch of questions. These questions posed to modern machine learning methods are collected in metrics called leaderboards to monitor progress, but beyond ranking approaches, this does not help us better understand our problems or our systems very well. This paper introduces probabilistic models inspired by psychometric approaches called item response theory models (think year-end standardized tests) to better understand how computers can answer questions and whether we are asking the right questions. This allows researchers to better compare what kinds of questions systems can answer, better compare human and machine ability, and discover problematic questions (e.g., questions that have incorrect answer keys, are vague, or "trick" those trying to answer the questions).
Alexander Hoyle, Pranav Goel, Denis Peskov, Andrew Hian-Cheong, Jordan Boyd-Graber, and Philip Resnik. Is Automated Topic Model Evaluation Broken?: The Incoherence of Coherence. Neural Information Processing Systems, 2021. [ArXiv] [Research Talk (students)] [Research Talk (Jordan)] [Code] [Bibtex]
```
@inproceedings{Hoyle:Goel:Peskov:Hian-Cheong:Boyd-Graber:Resnik-2021,
	Title = {Is Automated Topic Model Evaluation Broken?: The Incoherence of Coherence},
	Author = {Alexander Hoyle and Pranav Goel and Denis Peskov and Andrew Hian-Cheong and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Neural Information Processing Systems},
	Location = {Online},
	Year = {2021},
	Url = {http://cs.umd.edu/~jbg//docs/2021_neurips_incoherence.pdf},
}
```
Accessible Abstract: Topic models help historians, journalists, and analysts make sense of large text collections. But how do you know if you have a good one? The field has settled on using "Automatic Coherence", but this paper argues that maybe that isn't the right choice if you want to actually make real users happy. This paper builds on our 2009 that showed perplexity was not a good evaluation of interpretability for topic models; while the field adopted automatic topic coherence as a result of that 2009 paper, this paper argues that automatic topic coherence is not a good metric for neural topic models (even though it worked for probabilistic topic models).

Alison Smith

Alison Smith, Jordan Boyd-Graber, Ron Fan, Melissa Birchfield, Tongshuang Wu, Dan Weld, and Leah Findlater. No Explainability without Accountability: An Empirical Study of Explanations and Feedback in Interactive ML. Computer-Human Interaction, 2020. [Bibtex]

@inproceedings{Smith:Boyd-Graber:Fan:Birchfield:Wu:Weld:Findlater-2020,
	Author = {Alison Smith and Jordan Boyd-Graber and Ron Fan and Melissa Birchfield and Tongshuang Wu and Dan Weld and Leah Findlater},
	Booktitle = {Computer-Human Interaction},
	Year = {2020},
	Url = {http://cs.umd.edu/~jbg//docs/2020_chi_explanation.pdf},
	Title = {No Explainability without Accountability: An Empirical Study of Explanations and Feedback in Interactive ML},
}

Alison Smith, Varun Kumar, Jordan Boyd-Graber, Kevin Seppi, and Leah Findlater. Digging into User Control: Perceptions of Adherence and Instability in Transparent Models. Intelligent User Interfaces, 2020. [Bibtex]

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2020,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {Intelligent User Interfaces},
	Url = {http://cs.umd.edu/~jbg//docs/2020_iui_control.pdf},
	Year = {2020},
	Title = {Digging into User Control: Perceptions of Adherence and
Instability in Transparent Models},
}

Varun Kumar, Alison Smith, Leah Findlater, Kevin Seppi, and Jordan Boyd-Graber. Why Didn't You Listen to Me? Comparing User Control of Human-in-the-Loop Topic Models. Association for Computational Linguistics, 2019. [Bibtex]

@inproceedings{Kumar:Smith:Findlater:Seppi:Boyd-Graber-2019,
	Author = {Varun Kumar and Alison Smith and Leah Findlater and Kevin Seppi and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2019},
	Location = {Florence, Italy},
	Title = {Why Didn't You Listen to Me? Comparing User Control of Human-in-the-Loop Topic Models},
	Url = {http://cs.umd.edu/~jbg//docs/2019_acl_control.pdf},
}

Alison Smith, Varun Kumar, Jordan Boyd-Graber, Kevin Seppi, and Leah Findlater. User-Centered Design and Evaluation of a Human-in-the-Loop Topic Modeling System. Intelligent User Interfaces, 2018. [Bibtex]

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2018,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {Intelligent User Interfaces},
	Year = {2018},
	Title = {User-Centered Design and Evaluation of a Human-in-the-Loop Topic Modeling System},
	Url = {http://cs.umd.edu/~jbg//docs/2018_iui_itm.pdf},
}

Alison won a best student paper honorable mention (3 out of 300)

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2018,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {Intelligent User Interfaces},
	Year = {2018},
	Title = {User-Centered Design and Evaluation of a Human-in-the-Loop Topic Modeling System},
	Url = {http://cs.umd.edu/~jbg//docs/2018_iui_itm.pdf},
}

Alison won a best student paper honorable mention (3 out of 300)

Alison Smith, Varun Kumar, Jordan Boyd-Graber, Kevin Seppi, and Leah Findlater. Accounting for Input Uncertainty in Human-in-the-Loop Systems. CHI 2017 Designing for Uncertainty Workshop, 2017. [Bibtex]

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2017,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {CHI 2017 Designing for Uncertainty Workshop},
	Year = {2017},
	Location = {Denver, CO},
	Title = {Accounting for Input Uncertainty in Human-in-the-Loop Systems},
	Url = {http://visualization.ischool.uw.edu/hci_uncertainty/papers/Paper11.pdf},
}

Tak Yeon Lee, Alison Smith, Kevin Seppi, Niklas Elmqvist, Jordan Boyd-Graber, and Leah Findlater. The Human Touch: How Non-expert Users Perceive, Interpret, and Fix Topic Models. International Journal of Human-Computer Studies, 2017. [Journal] [Bibtex]

@article{Lee:Smith:Seppi:Elmqvist:Boyd-Graber:Findlater-2017,
	Author = {Tak Yeon Lee and Alison Smith and Kevin Seppi and Niklas Elmqvist and Jordan Boyd-Graber and Leah Findlater},
	Journal = {International Journal of Human-Computer Studies},
	Year = {2017},
	Title = {The Human Touch: How Non-expert Users Perceive, Interpret, and Fix Topic Models},
	Url = {http://cs.umd.edu/~jbg//docs/2017_ijhcs_human_touch.pdf},
}

Alison Smith, Tak Yeon Lee, Forough Poursabzi-Sangdeh, Jordan Boyd-Graber, Kevin Seppi, Niklas Elmqvist, and Leah Findlater. Evaluating Visual Representations for Topic Understanding and Their Effects on Manually Generated Labels. Transactions of the Association for Computational Linguistics, 2017. [Journal] [Data] [Bibtex]

@article{Smith:Lee:Poursabzi-Sangdeh:Boyd-Graber:Seppi:Elmqvist:Findlater-2017,
	Author = {Alison Smith and Tak Yeon Lee and Forough Poursabzi-Sangdeh and Jordan Boyd-Graber and Kevin Seppi and Niklas Elmqvist and Leah Findlater},
	Journal = {Transactions of the Association for Computational Linguistics},
	Year = {2017},
	Title = {Evaluating Visual Representations for Topic Understanding and Their Effects on Manually Generated Labels},
	Volume = {5},
	Url = {http://cs.umd.edu/~jbg//docs/2017_tacl_eval_tm_viz.pdf},
	Pages = {1--15},
}

Alison Smith, Tak Yeon Lee, Forough Poursabzi-Sangdeh, Jordan Boyd-Graber, Kevin Seppi, Niklas Elmqvist, and Leah Findlater. Human-Centered and Interactive: Expanding the Impact of Topic Models. CHI Human Centred Machine Learning Workshop, 2016. [Bibtex]

@inproceedings{Smith:Lee:Poursabzi-Sangdeh:Boyd-Graber:Seppi:Elmqvist:Findlater-2016,
	Author = {Alison Smith and Tak Yeon Lee and Forough Poursabzi-Sangdeh and Jordan Boyd-Graber and Kevin Seppi and Niklas Elmqvist and Leah Findlater},
	Booktitle = {CHI Human Centred Machine Learning Workshop},
	Year = {2016},
	Location = {San Jose, CA},
	Title = {Human-Centered and Interactive: Expanding the Impact of Topic Models},
}

Alison Smith, Jason Chuang, Yuening Hu, Jordan Boyd-Graber, and Leah Findlater. Concurrent Visualization of Relationships between Words and Topics in Topic Models. ACL Workshop on Workshop on Interactive Language Learning, Visualization, and Interfaces, 2014. [Bibtex]

@inproceedings{Smith:Chuang:Hu:Boyd-Graber:Findlater-2014,
	Title = {Concurrent Visualization of Relationships between Words and Topics in Topic Models},
	Author = {Alison Smith and Jason Chuang and Yuening Hu and Jordan Boyd-Graber and Leah Findlater},
	Booktitle = {ACL Workshop on Workshop on Interactive Language Learning, Visualization, and Interfaces},
	Year = {2014},
	Location = {Baltimore, Maryland},
}

Yuening Hu, Jordan Boyd-Graber, Brianna Satinoff, and Alison Smith. Interactive Topic Modeling. Machine Learning, 2014. [Journal] [Frontend Code] [Backend Code] [Bibtex]

@article{Hu:Boyd-Graber:Satinoff:Smith-2014,
	Title = {Interactive Topic Modeling},
	Author = {Yuening Hu and Jordan Boyd-Graber and Brianna Satinoff and Alison Smith},
	Journal = {Machine Learning},
	Url = {http://cs.umd.edu/~jbg//docs/2014_mlj_itm.pdf},
	Year = {2014},
	Volume = {95},
	Pages = {423--469},
	Publisher = {Springer},
}

Alvin Grissom II

Alvin Grissom II, Jo Shoemaker, Benjamin Goldman, Ruikang Shi, Craig Stewart, C. Anton Rytting, Leah Findlater, Jordan Boyd-Graber, Wenyan Li, Alvin Grissom II, and Jordan Boyd-Graber. Rapidly Piloting Real-time Linguistic Assistance for Simultaneous Interpreters with Untrained Bilingual Surrogates. Linguistic Resources and Evaluation Conference, 2024. [Bibtex]

@inproceedings{Grissom-II:Shoemaker:Goldman:Shi:Stewart:Rytting:Findlater:Boyd-Graber:Li:Grissom-II:Boyd-Graber-2024,
	Author = {Alvin {Grissom II} and Jo Shoemaker and Benjamin Goldman and Ruikang Shi and Craig Stewart and C. Anton Rytting and Leah Findlater and Jordan Boyd-Graber},
	Location = {Torino, Italy},
	Url = {http://cs.umd.edu/~jbg//docs/2024_lrec_siminthelp.pdf},
	Booktitle = {Linguistic Resources and Evaluation Conference},
	Year = {2024},
	Title = {Rapidly Piloting Real-time Linguistic Assistance for Simultaneous Interpreters with Untrained Bilingual Surrogates},
}

@inproceedings{Grissom-II:Shoemaker:Goldman:Shi:Stewart:Rytting:Findlater:Boyd-Graber:Li:Grissom-II:Boyd-Graber-2024,
	Author = {Alvin {Grissom II} and Jo Shoemaker and Benjamin Goldman and Ruikang Shi and Craig Stewart and C. Anton Rytting and Leah Findlater and Jordan Boyd-Graber},
	Location = {Torino, Italy},
	Url = {http://cs.umd.edu/~jbg//docs/2024_lrec_siminthelp.pdf},
	Booktitle = {Linguistic Resources and Evaluation Conference},
	Year = {2024},
	Title = {Rapidly Piloting Real-time Linguistic Assistance for Simultaneous Interpreters with Untrained Bilingual Surrogates},
}

Wenyan Li, Alvin Grissom II, and Jordan Boyd-Graber. An Attentive Recurrent Model for Incremental Prediction of Sentence-final Verbs. Findings of EMNLP, 2020. [Bibtex]

@article{Li:Grissom-II:Boyd-Graber-2020,
	Title = {An Attentive Recurrent Model for Incremental Prediction of Sentence-final Verbs},
	Author = {Wenyan Li and Alvin {Grissom II} and Jordan Boyd-Graber},
	Journal = {Findings of EMNLP},
	Year = {2020},
	Url = {http://cs.umd.edu/~jbg//docs/2020_findings_verbs.pdf},
}

Alvin Grissom II, Naho Orita, and Jordan Boyd-Graber. Incremental Prediction of Sentence-final Verbs. Conference on Computational Natural Language Learning, 2016. [Bibtex]

@inproceedings{Grissom-II:Orita:Boyd-Graber-2016,
	Title = {Incremental Prediction of Sentence-final Verbs},
	Author = {Alvin {Grissom II} and Naho Orita and Jordan Boyd-Graber},
	Booktitle = {Conference on Computational Natural Language Learning},
	Year = {2016},
	Location = {Berlin, Germany},
	Url = {http://cs.umd.edu/~jbg//docs/2016_conll_verbpred.pdf},
}

He He, Alvin Grissom II, Jordan Boyd-Graber, and Hal Daumé III. Syntax-based Rewriting for Simultaneous Machine Translation. Empirical Methods in Natural Language Processing, 2015. [Bibtex]

@inproceedings{He:Grissom-II:Boyd-Graber:Daume-III-2015,
	Title = {Syntax-based Rewriting for Simultaneous Machine Translation},
	Author = {He He and Alvin {Grissom II} and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2015},
	Location = {Lisbon, Portugal},
	Url = {http://cs.umd.edu/~jbg//docs/2015_emnlp_rewrite.pdf},
}

Alvin Grissom II, He He, Jordan Boyd-Graber, John Morgan, and Hal Daumé III. Don't Until the Final Verb Wait: Reinforcement Learning for Simultaneous Machine Translation. Empirical Methods in Natural Language Processing, 2014. [Talk] [Bibtex]

@inproceedings{Grissom-II:He:Boyd-Graber:Morgan:Daume-III-2014,
	Title = {Don't Until the Final Verb Wait: Reinforcement Learning for Simultaneous Machine Translation},
	Author = {Alvin {Grissom II} and He He and Jordan Boyd-Graber and John Morgan and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2014},
	Location = {Doha, Qatar},
	Url = {http://cs.umd.edu/~jbg//docs/2014_emnlp_simtrans.pdf},
}

Anandhavelu Natarajan

Ishani Mondal, Shwetha S, Anandhavelu Natarajan, Aparna Garimella, Sambaran Bandyopadhyay, and Jordan Boyd-Graber. Presentations by the People, for the People: Harnessing LLMs for Generating Persona-Aware Slides from Documents. European Association for Computational Linguistics, 2024. [Bibtex]

@inproceedings{Mondal:S:Natarajan:Garimella:Bandyopadhyay:Boyd-Graber-2024,
	Title = {Presentations by the People, for the People: Harnessing LLMs for Generating Persona-Aware Slides from Documents},
	Author = {Ishani Mondal and Shwetha S and Anandhavelu Natarajan and Aparna Garimella and Sambaran Bandyopadhyay and Jordan Boyd-Graber},
	Booktitle = {European Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_eacl_slides.pdf},
}

Ishani Mondal, Zongxia Li, Yufang Hou, Anandhavelu Natarajan, Aparna Garimella, Sambaran Bandyopadhyay, and Jordan Boyd-Graber. SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement. Findings of the Empirical Methods in Natural Language Processing, 2024. [Bibtex]

@article{Mondal:Li:Hou:Natarajan:Garimella:Bandyopadhyay:Boyd-Graber-2024,
	Title = {SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement},
	Author = {Ishani Mondal and Zongxia Li and Yufang Hou and Anandhavelu Natarajan and Aparna Garimella and Sambaran Bandyopadhyay and Jordan Boyd-Graber},
	Year = {2024},
	Journal = {Findings of the Empirical Methods in Natural Language Processing},
	Url = {http://cs.umd.edu/~jbg//docs/2024_emnlp_diagramgen.pdf},
}

Andrew Mao

Zongxia Li, Andrew Mao, Daniel Kofi Stephens, Pranav Goel, Emily Walpole, Juan Francisco Fung, Alden Dima, and Jordan Lee Boyd-Graber. TENOR: Topic Enabled Neural Organization and Recommendation: Evaluating Topic Models in Task Based Settings. European Association for Computational Linguistics, 2024. [Bibtex]

@inproceedings{Li:Mao:Stephens:Goel:Walpole:Fung:Dima:Boyd-Graber-2024,
	Title = {TENOR: Topic Enabled Neural Organization and Recommendation: Evaluating Topic Models in Task Based Settings},
	Author = {Zongxia Li and Andrew Mao and Daniel Kofi Stephens and Pranav Goel and Emily Walpole and Juan Francisco Fung and Alden Dima and Jordan Lee Boyd-Graber},
	Booktitle = {European Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_eacl_tenor.pdf},
}

Wanrong He, Andrew Mao, and Jordan Boyd-Graber. Cheater's Bowl: Human vs. Computer Search Strategies for Open-Domain QA. Findings of Empirical Methods in Natural Language Processing, 2022. [Code] [Data] [Research Talk] [Bibtex]
```
@article{He:Mao:Boyd-Graber-2022,
	Title = {Cheater's Bowl: Human vs. Computer Search Strategies for Open-Domain QA},
	Author = {Wanrong He and Andrew Mao and Jordan Boyd-Graber},
	Journal = {Findings of Empirical Methods in Natural Language Processing},
	Year = {2022},
	Location = {Abu Dhabi},
	Url = {http://cs.umd.edu/~jbg//docs/2022_emnlp_cheaters.pdf},
}
```
Accessible Abstract: When the Covid pandemic it, trivia games moved online. With it came cheating: people tried to quickly Google answers. This is bad for sportsmanship, but a good source of training data for helping teach computers how to find answers. We built an interface to harvest this training data from trivia players, fed these into retrieval-based QA systems, showing that these queries were better than the automatically generated queries used by the current state of the art.

Anupam Guha

Mohit Iyyer, Varun Manjunatha, Anupam Guha, Yogarshi Vyas, Jordan Boyd-Graber, Hal Daumé III, and Larry Davis. The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives. Computer Vision and Pattern Recognition, 2017. [Code/Data] [Bibtex]

@inproceedings{Iyyer:Manjunatha:Guha:Vyas:Boyd-Graber:Daume-III:Davis-2017,
	Title = {The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives},
	Author = {Mohit Iyyer and Varun Manjunatha and Anupam Guha and Yogarshi Vyas and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Larry Davis},
	Booktitle = {Computer Vision and Pattern Recognition},
	Year = {2017},
	Url = {http://cs.umd.edu/~jbg//docs/2017_cvpr_comics.pdf},
}

@inproceedings{Iyyer:Manjunatha:Guha:Vyas:Boyd-Graber:Daume-III:Davis-2017,
	Title = {The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives},
	Author = {Mohit Iyyer and Varun Manjunatha and Anupam Guha and Yogarshi Vyas and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Larry Davis},
	Booktitle = {Computer Vision and Pattern Recognition},
	Year = {2017},
	Url = {http://cs.umd.edu/~jbg//docs/2017_cvpr_comics.pdf},
}

Anupam Guha, Mohit Iyyer, and Jordan Boyd-Graber. A Distorted Skull Lies in the Bottom Center: Identifying Paintings from Text Descriptions. NAACL Human-Computer Question Answering Workshop, 2016. [Data] [Bibtex]

@inproceedings{Guha:Iyyer:Boyd-Graber-2016,
	Title = {A Distorted Skull Lies in the Bottom Center: Identifying Paintings from Text Descriptions},
	Author = {Anupam Guha and Mohit Iyyer and Jordan Boyd-Graber},
	Booktitle = {NAACL Human-Computer Question Answering Workshop},
	Year = {2016},
	Location = {San Diego, CA},
	Url = {http://cs.umd.edu/~jbg//docs/2016_naacl_paintings.pdf},
}

Mohit Iyyer, Anupam Guha, Snigdha Chaturvedi, Jordan Boyd-Graber, and Hal Daumé III. Feuding Families and Former Friends: Unsupervised Learning for Dynamic Fictional Relationships. North American Association for Computational Linguistics, 2016. [Code/Data] [Bibtex]

@inproceedings{Iyyer:Guha:Chaturvedi:Boyd-Graber:Daume-III-2016,
	Title = {Feuding Families and Former Friends: Unsupervised Learning for Dynamic Fictional Relationships},
	Author = {Mohit Iyyer and Anupam Guha and Snigdha Chaturvedi and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2016},
	Location = {San Diego, CA},
	Url = {http://cs.umd.edu/~jbg//docs/2016_naacl_relationships.pdf},
}

Best paper award (2 out of 1592)

Anupam Guha, Mohit Iyyer, Danny Bouman, and Jordan Boyd-Graber. Removing the Training Wheels: A Coreference Dataset that Entertains Humans and Challenges Computers. North American Association for Computational Linguistics, 2015. [Code/Data] [Slides] [Video] [LaTeX] [Bibtex]

@inproceedings{Guha:Iyyer:Bouman:Boyd-Graber-2015,
	Title = {Removing the Training Wheels: A Coreference Dataset that Entertains Humans and Challenges Computers},
	Author = {Anupam Guha and Mohit Iyyer and Danny Bouman and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2015},
	Location = {Denver, Colorado},
	Url = {http://cs.umd.edu/~jbg//docs/2015_naacl_qb_coref.pdf},
}

Aparicio, Elizabeth M.

Neha Punklik Srikanth, Rupak Sarkar, Mane, Heran Y., Aparicio, Elizabeth M., Nguyen, Quynh C., Rachel Rudinger, and Jordan Boyd-Graber. Pregnant Questions: The Importance of Pragmatic Awareness in Maternal Health Question Answering. North American Association for Computational Linguistics, 2024. [Code and Data] [Bibtex]

@inproceedings{Srikanth:Sarkar:Y.:M.:C.:Rudinger:Boyd-Graber-2024,
	Title = {Pregnant Questions: The Importance of Pragmatic Awareness in Maternal Health Question Answering},
	Author = {Neha Srikanth and Rupak Sarkar and Mane, Heran Y. and Aparicio, Elizabeth M. and Nguyen, Quynh C. and Rachel Rudinger and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_naacl_pregnant.pdf},
}

Mane, Heran Y., Channell Doig, Amara, Marin Gutierrez, Francia Ximena, Jasczynski, Michelle, Yue, Xiaohe, Neha Punklik Srikanth, Mane, Sourabh, Sun, Abby, Moats, Rachel Ann, Patel, Pragat, He, Xin, Jordan Boyd-Graber, Aparicio, Elizabeth M., and Nguyen, Quynh C.. Practical Guidance for the Development of Rosie, a Health Education Question-and-Answer Chatbot for New Mothers. Journal of Public Health Management and Practice, 2023. [Bibtex]

@article{Y.:Amara:Ximena:Michelle:Xiaohe:Srikanth:Sourabh:Abby:Ann:Pragat:Xin:Boyd-Graber:M.:C.-2023,
	Title = {Practical Guidance for the Development of Rosie, a Health Education Question-and-Answer Chatbot for New Mothers},
	Author = {Mane, Heran Y. and Channell Doig, Amara and Marin Gutierrez, Francia Ximena and Jasczynski, Michelle and Yue, Xiaohe and Neha Srikanth and Mane, Sourabh and Sun, Abby and Moats, Rachel Ann and Patel, Pragat and He, Xin and Jordan Boyd-Graber and Aparicio, Elizabeth M. and Nguyen, Quynh C.},
	Journal = {Journal of Public Health Management and Practice},
	Year = {2023},
	Url = {https://journals.lww.com/jphmp/fulltext/2023/09000/practical_guidance_for_the_development_of_rosie,_a.9.aspx},
}

Aparna Garimella

@inproceedings{Mondal:S:Natarajan:Garimella:Bandyopadhyay:Boyd-Graber-2024,
	Title = {Presentations by the People, for the People: Harnessing LLMs for Generating Persona-Aware Slides from Documents},
	Author = {Ishani Mondal and Shwetha S and Anandhavelu Natarajan and Aparna Garimella and Sambaran Bandyopadhyay and Jordan Boyd-Graber},
	Booktitle = {European Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_eacl_slides.pdf},
}

@article{Mondal:Li:Hou:Natarajan:Garimella:Bandyopadhyay:Boyd-Graber-2024,
	Title = {SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement},
	Author = {Ishani Mondal and Zongxia Li and Yufang Hou and Anandhavelu Natarajan and Aparna Garimella and Sambaran Bandyopadhyay and Jordan Boyd-Graber},
	Year = {2024},
	Journal = {Findings of the Empirical Methods in Natural Language Processing},
	Url = {http://cs.umd.edu/~jbg//docs/2024_emnlp_diagramgen.pdf},
}

Benjamin Börschinger

Benjamin Börschinger, Jordan Boyd-Graber, Christian Buck, Jannis Bulian, Massimiliano Ciaramita, Michelle Chen Huebscher, Wojciech Gajewski, Yannic Kilcher, Rodrigo Nogueira, and Lierni Sestorain Saralegu. Meta Answering for Machine Reading. ArXiv, Preprint. [Preprint] [Bibtex]

@article{B\"orschinger:Boyd-Graber:Buck:Bulian:Ciaramita:Huebscher:Gajewski:Kilcher:Nogueira:Saralegu-Preprint,
	Title = {Meta Answering for Machine Reading},
	Author = {Benjamin B\"orschinger and Jordan Boyd-Graber and Christian Buck and Jannis Bulian and Massimiliano Ciaramita and Michelle Chen Huebscher and Wojciech Gajewski and Yannic Kilcher and Rodrigo Nogueira and Lierni Sestorain Saralegu},
	Journal = {ArXiv},
	Year = {Preprint},
	Url = {https://arxiv.org/abs/1911.04156},
}

Julian Martin Eisenschlos, Bhuwan Dhingra, Jannis Bulian, Benjamin Börschinger, and Jordan Boyd-Graber. Fool Me Twice: Entailment from Wikipedia Gamification. North American Association for Computational Linguistics, 2021. [Preprint] [Research Talk] [Code and Data] [Paper Read Aloud] [Play] [Bibtex]
```
@inproceedings{Eisenschlos:Dhingra:Bulian:B\"orschinger:Boyd-Graber-2021,
	Title = {Fool Me Twice: Entailment from Wikipedia Gamification},
	Author = {Julian Martin Eisenschlos and Bhuwan Dhingra and Jannis Bulian and Benjamin B\"orschinger and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2021},
	Url = {http://cs.umd.edu/~jbg//docs/2021_naacl_fm2.pdf},
}
```
Accessible Abstract: Democracy and the free press depends on being able to recognize when facts online are true or not. For machine learning to help this critical problem, it needs good data identifying which statements are backed up by trusted sources and which are not. This research creates a game people can play online to craft difficult claims that can train computers to spot disinformation online.
Jordan Boyd-Graber and Benjamin Börschinger. What Question Answering can Learn from Trivia Nerds. Association for Computational Linguistics, 2020. [Preprint] [Audio of Paper Readthrough] [Video of Paper Readthrough] [ACL Presentation] [Bibtex]
```
@inproceedings{Boyd-Graber:B\"orschinger-2020,
	Title = {What Question Answering can Learn from Trivia Nerds},
	Author = {Jordan Boyd-Graber and Benjamin B\"orschinger},
	Year = {2020},
	Url = {http://cs.umd.edu/~jbg//docs/2020_acl_trivia.pdf},
	Location = {The Cyberverse Simulacrum of Seattle},
	Booktitle = {Association for Computational Linguistics},
}
```
Accessible Abstract: This paper reflects on the similarities between trivia competitions and computer question answering research. Modern machine learning requires large, quality datasets. The central thesis of this article argues that the same things that make trivia tournaments good (they're fun, fair, and consistently crown the best trivia players) can also improve question answering datasets. Concretely, we argue that question answering datasets should clearly specify what answers are requested, have systematic policies to deal with natural ambiguity and variation, have authors look at the data (and help others do the same), make sure questions separate the best from the rest, and ensure people can have fun. We draw on the authors' experience in the trivia community (including embarrassing episodes on Jeopardy!) to illustrate our arguments.

Benjamin Van Durme

Michelle Yuan, Mozhi Zhang, Benjamin Van Durme, Leah Findlater, and Jordan Boyd-Graber. Interactive Refinement of Cross-Lingual Word Embeddings. Empirical Methods in Natural Language Processing, 2020. [Audio of Paper Readthrough] [Video of Paper Readthrough] [Code] [Conference Talk] [Bibtex]
```
@inproceedings{Yuan:Zhang:Van-Durme:Findlater:Boyd-Graber-2020,
	Title = {Interactive Refinement of Cross-Lingual Word Embeddings},
	Author = {Michelle Yuan and Mozhi Zhang and Benjamin {Van Durme} and Leah Findlater and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2020},
	Location = {The Cyberverse Simulacrum of Punta Cana, Dominican Republic},
	Url = {http://cs.umd.edu/~jbg//docs/2020_emnlp_clime.pdf},
}
```
Accessible Abstract: Language technologies sometimes need to be quickly deployed in low-resource languages. For example, in the 2010 Haiti earthquake, researchers used machine learning models to analyze social media and text messages to gain situational awareness. We introduce CLIME, an interactive system that can help in these scenarios: users see which words related to the task the system thinks are similar, corrects the model to push similar words together and dissimilar words apart.

Michelle Yuan, Benjamin Van Durme, and Jordan Boyd-Graber. Multilingual Anchoring: Interactive Topic Modeling and Alignment Across Languages. Neural Information Processing Systems, 2018. [Code] [Bibtex]

@inproceedings{Yuan:Van-Durme:Boyd-Graber-2018,
	Title = {Multilingual Anchoring: Interactive Topic Modeling and Alignment  Across Languages},
	Author = {Michelle Yuan and Benjamin {Van Durme} and Jordan Boyd-Graber},
	Booktitle = {Neural Information Processing Systems},
	Year = {2018},
	Location = {Montreal, Quebec},
	Url = {http://cs.umd.edu/~jbg//docs/2018_neurips_mtanchor.pdf},
}

Brianna Satinoff

Yuening Hu, Jordan Boyd-Graber, Brianna Satinoff, and Alison Smith. Interactive Topic Modeling. Machine Learning, 2014. [Journal] [Frontend Code] [Backend Code] [Bibtex]

@article{Hu:Boyd-Graber:Satinoff:Smith-2014,
	Title = {Interactive Topic Modeling},
	Author = {Yuening Hu and Jordan Boyd-Graber and Brianna Satinoff and Alison Smith},
	Journal = {Machine Learning},
	Url = {http://cs.umd.edu/~jbg//docs/2014_mlj_itm.pdf},
	Year = {2014},
	Volume = {95},
	Pages = {423--469},
	Publisher = {Springer},
}

Jordan Boyd-Graber, Brianna Satinoff, He He, and Hal Daumé III. Besting the Quiz Master: Crowdsourcing Incremental Classification Games. Empirical Methods in Natural Language Processing, 2012. [Presentation] [Data] [Bibtex]

@inproceedings{Boyd-Graber:Satinoff:He:Daume-III-2012,
	Title = {Besting the Quiz Master: Crowdsourcing Incremental Classification Games},
	Author = {Jordan Boyd-Graber and Brianna Satinoff and He He and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2012},
	Url = {http://cs.umd.edu/~jbg//docs/qb_emnlp_2012.pdf},
	Location = {Jeju, South Korea},
}

Yuening Hu, Jordan Boyd-Graber, and Brianna Satinoff. Interactive Topic Modeling. Association for Computational Linguistics, 2011. [Slides] [Code] [Bibtex]

@inproceedings{Hu:Boyd-Graber:Satinoff-2011,
	Title = {Interactive Topic Modeling},
	Author = {Yuening Hu and Jordan Boyd-Graber and Brianna Satinoff},
	Booktitle = {Association for Computational Linguistics},
	Year = {2011},
	Location = {Portland, Oregon},
	Url = {http://cs.umd.edu/~jbg//docs/itm.pdf},
}

Brianna Satinoff and Jordan Boyd-Graber. Trivial Classification: What features do humans use for classification?. Workshop on Crowdsourcing Technologies for Language and Cognition Studies, 2011. [Bibtex]

@inproceedings{Satinoff:Boyd-Graber-2011,
	Title = {Trivial Classification: What features do humans use for classification?},
	Author = {Brianna Satinoff and Jordan Boyd-Graber},
	Booktitle = {Workshop on Crowdsourcing Technologies for Language and Cognition Studies},
	Year = {2011},
	Location = {Boulder, CO},
}

Chen Zhao

Chenglei Si, Navita Goyal, Tongshuang Wu, Chen Zhao, Shi Feng, Hal Daumé III, and Jordan Boyd-Graber. Large Language Models Help Humans Verify Truthfulness---Except When They Are Convincingly Wrong. North American Association for Computational Linguistics, 2024. [Bibtex]

@inproceedings{Si:Goyal:Wu:Zhao:Feng:III:Boyd-Graber-2024,
	Title = {Large Language Models Help Humans Verify Truthfulness---Except When They Are Convincingly Wrong},
	Author = {Chenglei Si and Navita Goyal and Tongshuang Wu and Chen Zhao and Shi Feng and Hal Daum\'{e} {III} and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_naacl_convincingly.pdf},
}

Chenglei Si, Weijia Shi, Chen Zhao, Luke Zettlemoyer, and Jordan Boyd-Graber. Getting MoRE out of Mixture of Language Model Reasoning Experts. Findings of Empirical Methods in Natural Language Processing, 2023. [Video] [Bibtex]
```
@article{Si:Shi:Zhao:Zettlemoyer:Boyd-Graber-2023,
	Title = {Getting \underline{MoRE} out of \underline{M}ixture \underline{o}f Language Model \underline{R}easoning \underline{E}xperts},
	Author = {Chenglei Si and Weijia Shi and Chen Zhao and Luke Zettlemoyer and Jordan Boyd-Graber},
	Journal = {Findings of Empirical Methods in Natural Language Processing},
	Year = {2023},
	Location = {Singapore},
	Url = {http://cs.umd.edu/~jbg//docs/2023_findings_more.pdf},
}
```
Accessible Abstract: There are many ways for a computer to answer a question: a general knowledge question, a common sense question, or a math question. Each of these types of questions can be answered by a particular kind of expert. This paper investigates if we can automatically detect what kind of expert is best suited to answer a question and route the question to the correct expert.
Chenglei Si, Chen Zhao, Sewon Min, and Jordan Boyd-Graber. Re-Examining Calibration: The Case of Question Answering. Findings of Empirical Methods in Natural Language Processing, 2022. [Code] [Research Talk] [Bibtex]
```
@article{Si:Zhao:Min:Boyd-Graber-2022,
	Title = {Re-Examining Calibration: The Case of Question Answering},
	Author = {Chenglei Si and Chen Zhao and Sewon Min and Jordan Boyd-Graber},
	Journal = {Findings of Empirical Methods in Natural Language Processing},
	Year = {2022},
	Location = {Abu Dhabi},
	Url = {http://cs.umd.edu/~jbg//docs/2022_emnlp_calibration.pdf},
}
```
Accessible Abstract: Calibration is an important problem in question answering: if a search engine or virtual assistant doesn't know the answer to a question, you should probably abstain from showing an answer (to save embarassment, as when Google said a horse had six legs). This EMNLP Findings paper shows that existing metrics to test how good a QA calibration push calibrated confidence toward the average confidence. We proposed an alternate method both for evaluation and to generate better calibration by looking how models change as they learn.
Chen Zhao, Chenyan Xiong, Jordan Boyd-Graber, and Hal Daumé III. Distantly-Supervised Dense Retrieval Enables Open-Domain Question Answering without Evidence Annotation. Empirical Methods in Natural Language Processing, 2021. [Code] [Research Talk] [Bibtex]
```
@inproceedings{Zhao:Xiong:Boyd-Graber:Daume-III-2021,
	Title = {Distantly-Supervised Dense Retrieval Enables Open-Domain Question Answering without Evidence Annotation},
	Author = {Chen Zhao and Chenyan Xiong and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2021},
	Location = {Punta Cana},
	Url = {http://cs.umd.edu/~jbg//docs/2021_emnlp_weak_dpr.pdf},
}
```
Accessible Abstract: Answering questions sometimes requires tying multiple pieces of information together. Previous datasets have required annotators to explicitly build these reasoning chains (e.g., to answer "where do I know the cop from Die Hard from", you need to figure out that the actor's name is "Reginald VelJohnson" and then find out that he's best known as the dad on Family Matters.). By exploring search queries that get to the right answer, we're able to answer these questions without expensive annotation.
Chenglei Si, Chen Zhao, and Jordan Boyd-Graber. What's in a Name? Answer Equivalence For Open-Domain Question Answering. Empirical Methods in Natural Language Processing, 2021. [Research Talk] [Bibtex]
```
@inproceedings{Si:Zhao:Boyd-Graber-2021,
	Title = {What's in a Name? Answer Equivalence For Open-Domain Question Answering},
	Author = {Chenglei Si and Chen Zhao and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2021},
	Location = {Punta Cana},
	Url = {http://cs.umd.edu/~jbg//docs/2021_emnlp_answer_equiv.pdf},
}
```
Accessible Abstract: Is Tim Cook the same person as Timothy Donald Cook? You might think so, but the way we train computers to answer questions would say they aren't. We show that keeping track of multiple names (and it's really simple) can create better question answering systems. Simply by adding alternate answers mined from knowledge bases, we can improve accuracy 1-2 points on major QA datasets.
Chen Zhao, Chenyan Xiong, Hal Daumé III, and Jordan Boyd-Graber. Multi-Step Reasoning Over Unstructured Text with Beam Dense Retrieval. North American Association for Computational Linguistics, 2021. [Paper Read Aloud] [Bibtex]
```
@inproceedings{Zhao:Xiong:Daume-III:Boyd-Graber-2021,
	Title = {Multi-Step Reasoning Over Unstructured Text with Beam Dense Retrieval},
	Author = {Chen Zhao and Chenyan Xiong and Hal {Daum\'{e} III} and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2021},
	Url = {http://cs.umd.edu/~jbg//docs/2021_naacl_multi_ance.pdf},
}
```
Accessible Abstract: For computers to answer complicated questions online, they often need to put together multiple pieces of information (Ronald Reagan was both governor of California and an actor in Bedtime for Bonzo). However, existing approaches use the links in Wikipedia to combine these clues. This research helps computers find connected information without using these explicit links.

Chen Zhao, Chenyan Xiong, Xin Qian, and Jordan Boyd-Graber. Complex Factoid Question Answering with a Free-Text Knowledge Graph. ACM International Conference on World Wide Web, 2020. [Video] [Bibtex]

@inproceedings{Zhao:Xiong:Qian:Boyd-Graber-2020,
	Title = {Complex Factoid Question Answering with a Free-Text Knowledge Graph},
	Author = {Chen Zhao and Chenyan Xiong and Xin Qian and Jordan Boyd-Graber},
	Booktitle = {ACM International Conference on World Wide Web},
	Year = {2020},
	Location = {Taipei, Taiwan},
	Url = {http://cs.umd.edu/~jbg//docs/2020_www_delft.pdf},
}

Tianze Shi, Chen Zhao, Jordan Boyd-Graber, Hal Daumé III, and Lillian Lee. On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries. Findings of EMNLP, 2020. [Data] [Preprint] [Bibtex]

@article{Shi:Zhao:Boyd-Graber:Daume-III:Lee-2020,
	Title = {On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries},
	Author = {Tianze Shi and Chen Zhao and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Lillian Lee},
	Journal = {Findings of EMNLP},
	Year = {2020},
	Url = {http://cs.umd.edu/~jbg//docs/2020_findings_qalign.pdf},
}

@inproceedings{Elgohary:Zhao:Boyd-Graber-2018,
	Title = {Dataset and Baselines for Sequential Open-Domain Question Answering},
	Author = {Ahmed Elgohary and Chen Zhao and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2018},
	Location = {Brussels, Belgium},
	Url = {http://cs.umd.edu/~jbg//docs/2018_emnlp_linked.pdf},
}

Chenglei Si

@inproceedings{Si:Goyal:Wu:Zhao:Feng:III:Boyd-Graber-2024,
	Title = {Large Language Models Help Humans Verify Truthfulness---Except When They Are Convincingly Wrong},
	Author = {Chenglei Si and Navita Goyal and Tongshuang Wu and Chen Zhao and Shi Feng and Hal Daum\'{e} {III} and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_naacl_convincingly.pdf},
}

Sander V Schulhoff, Jeremy Pinto, Anaum Khan, Louis-François Bouchard, Chenglei Si, Jordan Lee Boyd-Graber, Svetlina Anati, Valen Tagliabue, Anson Liu Kost, and Christopher R Carnahan. Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Prompt Hacking Competition. Empirical Methods in Natural Language Processing, 2023. [Prerecorded Video] [Data] [Award Video] [Bibtex]
```
@inproceedings{Schulhoff:Pinto:Khan:Bouchard:Si:Boyd-Graber:Anati:Tagliabue:Kost:Carnahan-2023,
	Title = {Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Prompt Hacking Competition},
	Author = {Sander V Schulhoff and Jeremy Pinto and Anaum Khan and Louis-François Bouchard and Chenglei Si and Jordan Lee Boyd-Graber and Svetlina Anati and Valen Tagliabue and Anson Liu Kost and Christopher R Carnahan},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2023},
	Location = {Singapore},
	Url = {http://cs.umd.edu/~jbg//docs/2023_emnlp_hackaprompt.pdf},
}
```
This paper was selected as the Best Theme Paper at EMNLP 2023 (1 of 4909)

Accessible Abstract: As more AI services online are provided by prompted language models, we need to be aware of the weaknesses and exploits of the models. We present the HackAPrompt competition to help elicit a broad array of exploits that get around large langauge models.
Chenglei Si, Weijia Shi, Chen Zhao, Luke Zettlemoyer, and Jordan Boyd-Graber. Getting MoRE out of Mixture of Language Model Reasoning Experts. Findings of Empirical Methods in Natural Language Processing, 2023. [Video] [Bibtex]
```
@article{Si:Shi:Zhao:Zettlemoyer:Boyd-Graber-2023,
	Title = {Getting \underline{MoRE} out of \underline{M}ixture \underline{o}f Language Model \underline{R}easoning \underline{E}xperts},
	Author = {Chenglei Si and Weijia Shi and Chen Zhao and Luke Zettlemoyer and Jordan Boyd-Graber},
	Journal = {Findings of Empirical Methods in Natural Language Processing},
	Year = {2023},
	Location = {Singapore},
	Url = {http://cs.umd.edu/~jbg//docs/2023_findings_more.pdf},
}
```
Accessible Abstract: There are many ways for a computer to answer a question: a general knowledge question, a common sense question, or a math question. Each of these types of questions can be answered by a particular kind of expert. This paper investigates if we can automatically detect what kind of expert is best suited to answer a question and route the question to the correct expert.

Chenglei Si, Zhe Gan, Zhengyuan Yang, Shuohang Wang, Jianfeng Wang, Jordan Boyd-Graber, and Lijuan Wang. Prompting GPT-3 To Be Reliable. International Conference on Learning Representations, 2023. [Code] [Bibtex]

@inproceedings{Si:Gan:Yang:Wang:Wang:Boyd-Graber:Wang-2023,
	Title = {Prompting GPT-3 To Be Reliable},
	Author = {Chenglei Si and Zhe Gan and Zhengyuan Yang and Shuohang Wang and Jianfeng Wang and Jordan Boyd-Graber and Lijuan Wang},
	Booktitle = {International Conference on Learning Representations},
	Year = {2023},
	Url = {http://cs.umd.edu/~jbg//docs/2023_iclr_reliable.pdf},
}

Chenglei Si, Chen Zhao, Sewon Min, and Jordan Boyd-Graber. Re-Examining Calibration: The Case of Question Answering. Findings of Empirical Methods in Natural Language Processing, 2022. [Code] [Research Talk] [Bibtex]
```
@article{Si:Zhao:Min:Boyd-Graber-2022,
	Title = {Re-Examining Calibration: The Case of Question Answering},
	Author = {Chenglei Si and Chen Zhao and Sewon Min and Jordan Boyd-Graber},
	Journal = {Findings of Empirical Methods in Natural Language Processing},
	Year = {2022},
	Location = {Abu Dhabi},
	Url = {http://cs.umd.edu/~jbg//docs/2022_emnlp_calibration.pdf},
}
```
Accessible Abstract: Calibration is an important problem in question answering: if a search engine or virtual assistant doesn't know the answer to a question, you should probably abstain from showing an answer (to save embarassment, as when Google said a horse had six legs). This EMNLP Findings paper shows that existing metrics to test how good a QA calibration push calibrated confidence toward the average confidence. We proposed an alternate method both for evaluation and to generate better calibration by looking how models change as they learn.
Chenglei Si, Chen Zhao, and Jordan Boyd-Graber. What's in a Name? Answer Equivalence For Open-Domain Question Answering. Empirical Methods in Natural Language Processing, 2021. [Research Talk] [Bibtex]
```
@inproceedings{Si:Zhao:Boyd-Graber-2021,
	Title = {What's in a Name? Answer Equivalence For Open-Domain Question Answering},
	Author = {Chenglei Si and Chen Zhao and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2021},
	Location = {Punta Cana},
	Url = {http://cs.umd.edu/~jbg//docs/2021_emnlp_answer_equiv.pdf},
}
```
Accessible Abstract: Is Tim Cook the same person as Timothy Donald Cook? You might think so, but the way we train computers to answer questions would say they aren't. We show that keeping track of multiple names (and it's really simple) can create better question answering systems. Simply by adding alternate answers mined from knowledge bases, we can improve accuracy 1-2 points on major QA datasets.

Chenyan Xiong

Chen Zhao, Chenyan Xiong, Jordan Boyd-Graber, and Hal Daumé III. Distantly-Supervised Dense Retrieval Enables Open-Domain Question Answering without Evidence Annotation. Empirical Methods in Natural Language Processing, 2021. [Code] [Research Talk] [Bibtex]
```
@inproceedings{Zhao:Xiong:Boyd-Graber:Daume-III-2021,
	Title = {Distantly-Supervised Dense Retrieval Enables Open-Domain Question Answering without Evidence Annotation},
	Author = {Chen Zhao and Chenyan Xiong and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2021},
	Location = {Punta Cana},
	Url = {http://cs.umd.edu/~jbg//docs/2021_emnlp_weak_dpr.pdf},
}
```
Accessible Abstract: Answering questions sometimes requires tying multiple pieces of information together. Previous datasets have required annotators to explicitly build these reasoning chains (e.g., to answer "where do I know the cop from Die Hard from", you need to figure out that the actor's name is "Reginald VelJohnson" and then find out that he's best known as the dad on Family Matters.). By exploring search queries that get to the right answer, we're able to answer these questions without expensive annotation.
Chen Zhao, Chenyan Xiong, Hal Daumé III, and Jordan Boyd-Graber. Multi-Step Reasoning Over Unstructured Text with Beam Dense Retrieval. North American Association for Computational Linguistics, 2021. [Paper Read Aloud] [Bibtex]
```
@inproceedings{Zhao:Xiong:Daume-III:Boyd-Graber-2021,
	Title = {Multi-Step Reasoning Over Unstructured Text with Beam Dense Retrieval},
	Author = {Chen Zhao and Chenyan Xiong and Hal {Daum\'{e} III} and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2021},
	Url = {http://cs.umd.edu/~jbg//docs/2021_naacl_multi_ance.pdf},
}
```
Accessible Abstract: For computers to answer complicated questions online, they often need to put together multiple pieces of information (Ronald Reagan was both governor of California and an actor in Bedtime for Bonzo). However, existing approaches use the links in Wikipedia to combine these clues. This research helps computers find connected information without using these explicit links.

@inproceedings{Zhao:Xiong:Qian:Boyd-Graber-2020,
	Title = {Complex Factoid Question Answering with a Free-Text Knowledge Graph},
	Author = {Chen Zhao and Chenyan Xiong and Xin Qian and Jordan Boyd-Graber},
	Booktitle = {ACM International Conference on World Wide Web},
	Year = {2020},
	Location = {Taipei, Taiwan},
	Url = {http://cs.umd.edu/~jbg//docs/2020_www_delft.pdf},
}

Christiane Fellbaum

Sonya S. Nikolova, Jordan Boyd-Graber, and Christiane Fellbaum. Collecting Semantic Similarity Ratings to Connect Concepts in Assistive Communication Tools. Modeling, Learning and Processing of Text Technological Data Structures, 2011. [Ratings] [Bibtex]

@inbook{Nikolova:Boyd-Graber:Fellbaum-2011,
	Author = {Sonya S. Nikolova and Jordan Boyd-Graber and Christiane Fellbaum},
	Title = {Collecting Semantic Similarity Ratings to Connect Concepts in Assistive Communication Tools},
	Editor = {Angelika Storrer},
	Booktitle = {Modeling, Learning and Processing of Text Technological Data Structures},
	Series = {Studies in Computational Intelligence},
	Address = {Heidelberg},
	Url = {http://cs.umd.edu/~jbg//docs/2011_book_chapter_evocation.pdf},
	Publisher = {Springer Verlag},
	Year = {2011},
}

Sonya S. Nikolova, Jordan Boyd-Graber, Christiane Fellbaum, and Perry Cook. Better Vocabularies for Assistive Communication Aids: Connecting Terms using Semantic Networks and Untrained Annotators. ACM Conference on Computers and Accessibility, 2009. [Ratings] [Slides] [Bibtex]

@inproceedings{Nikolova:Boyd-Graber:Fellbaum:Cook-2009,
	Author = {Sonya S. Nikolova and Jordan Boyd-Graber and Christiane Fellbaum and Perry Cook},
	Title = {Better Vocabularies for Assistive Communication Aids: Connecting Terms using Semantic Networks and Untrained Annotators},
	Booktitle = {ACM Conference on Computers and Accessibility},
	Year = {2009},
	Location = {Pittsburgh, PA},
	Url = {http://cs.umd.edu/~jbg//docs/evocation-viva.pdf},
}

Jordan Boyd-Graber, Christiane Fellbaum, Daniel Osherson, and Robert Schapire. Adding Dense, Weighted, Connections to WordNet. Proceedings of the Global WordNet Conference, 2006. [Presentation] [Data] [Bibtex]

@inproceedings{Boyd-Graber:Fellbaum:Osherson:Schapire-2006,
	Title = {Adding Dense, Weighted, Connections to {WordNet}},
	Author = {Jordan Boyd-Graber and Christiane Fellbaum and Daniel Osherson and Robert Schapire},
	Booktitle = {Proceedings of the Global {WordNet} Conference},
	Year = {2006},
	Location = {Jeju, South Korea},
	Url = {http://cs.umd.edu/~jbg//docs/jbg-jeju.pdf},
}

@inproceedings{Boyd-Graber:Fellbaum:Osherson:Schapire-2006,
	Title = {Adding Dense, Weighted, Connections to {WordNet}},
	Author = {Jordan Boyd-Graber and Christiane Fellbaum and Daniel Osherson and Robert Schapire},
	Booktitle = {Proceedings of the Global {WordNet} Conference},
	Year = {2006},
	Location = {Jeju, South Korea},
	Url = {http://cs.umd.edu/~jbg//docs/jbg-jeju.pdf},
}

Clay Templeton

Clay Templeton, Travis Brown, Sayan Battacharyya, and Jordan Boyd-Graber. Mining the Dispatch under Supervision: Using Casualty Counts to Guide Topics from the Richmond Daily Dispatch Corpus. Chicago Colloquium on Digital Humanities and Computer Science, 2011. [Bibtex]

@inproceedings{Templeton:Brown:Battacharyya:Boyd-Graber-2011,
	Title = {Mining the Dispatch under Supervision: Using Casualty Counts to Guide Topics from the Richmond Daily Dispatch Corpus},
	Author = {Clay Templeton and Travis Brown and Sayan Battacharyya and Jordan Boyd-Graber},
	Booktitle = {Chicago Colloquium on Digital Humanities and Computer Science},
	Year = {2011},
	Location = {Chicago, IL},
	Url = {http://cs.umd.edu/~jbg//docs/slda_civil_war.pdf},
}

Clay Templeton, Kenneth R. Fleischmann, and Jordan Boyd-Graber. Simulating Audiences: Automating Analysis of Values, Attitudes, and Sentiment. IEEE International Conference on Social Computing, 2011. [Bibtex]

@inproceedings{Templeton:Fleischmann:Boyd-Graber-2011,
	Title = {Simulating Audiences: Automating Analysis of Values, Attitudes, and Sentiment},
	Author = {Clay Templeton and Kenneth R. Fleischmann and Jordan Boyd-Graber},
	Booktitle = {IEEE International Conference on Social Computing},
	Year = {2011},
	Location = {Cambridge, MA},
	Url = {http://cs.umd.edu/~jbg//docs/simulating_audiences.pdf},
}

Clay Templeton, Kenneth R. Fleischmann, and Jordan Boyd-Graber. Comparing Values and Sentiment Using Mechanical Turk. iConference, 2011. [Bibtex]

@inproceedings{Templeton:Fleischmann:Boyd-Graber-2011,
	Title = {Comparing Values and Sentiment Using Mechanical Turk},
	Author = {Clay Templeton and Kenneth R. Fleischmann and Jordan Boyd-Graber},
	Booktitle = {iConference},
	Year = {2011},
	Location = {Seattle, Washington},
	Url = {http://cs.umd.edu/~jbg//docs/iconference-2011-comparing.pdf},
}

Kenneth R. Fleischmann, Clay Templeton, and Jordan Boyd-Graber. Modeling Diverse Standpoints in Text Classification: Learning to Be Human by Modeling Human Values. iConference, 2011. [Bibtex]

@inproceedings{Fleischmann:Templeton:Boyd-Graber-2011,
	Title = {Modeling Diverse Standpoints in Text Classification: Learning to Be Human by Modeling Human Values},
	Author = {Kenneth R. Fleischmann and Clay Templeton and Jordan Boyd-Graber},
	Booktitle = {iConference},
	Year = {2011},
	Location = {Seattle, Washington},
	Url = {http://cs.umd.edu/~jbg//docs/iconference-2011-learning.pdf},
}

Craig Stewart

@inproceedings{Grissom-II:Shoemaker:Goldman:Shi:Stewart:Rytting:Findlater:Boyd-Graber:Li:Grissom-II:Boyd-Graber-2024,
	Author = {Alvin {Grissom II} and Jo Shoemaker and Benjamin Goldman and Ruikang Shi and Craig Stewart and C. Anton Rytting and Leah Findlater and Jordan Boyd-Graber},
	Location = {Torino, Italy},
	Url = {http://cs.umd.edu/~jbg//docs/2024_lrec_siminthelp.pdf},
	Booktitle = {Linguistic Resources and Evaluation Conference},
	Year = {2024},
	Title = {Rapidly Piloting Real-time Linguistic Assistance for Simultaneous Interpreters with Untrained Bilingual Surrogates},
}

Craig Stewart, Nikolai Vogler, Junjie Hu, Jordan Boyd-Graber, and Graham Neubig. Automatic Estimation of Simultaneous Interpreter Performance. Association for Computational Linguistics, 2018. [Bibtex]

@inproceedings{Stewart:Vogler:Hu:Boyd-Graber:Neubig-2018,
	Title = {Automatic Estimation of Simultaneous Interpreter Performance},
	Author = {Craig Stewart and Nikolai Vogler and Junjie Hu and Jordan Boyd-Graber and Graham Neubig},
	Booktitle = {Association for Computational Linguistics},
	Year = {2018},
	Location = {Melbourne, Australia},
	Url = {http://cs.umd.edu/~jbg//docs/2018_acl_interpeval.pdf},
}

@inproceedings{Stewart:Vogler:Hu:Boyd-Graber:Neubig-2018,
	Title = {Automatic Estimation of Simultaneous Interpreter Performance},
	Author = {Craig Stewart and Nikolai Vogler and Junjie Hu and Jordan Boyd-Graber and Graham Neubig},
	Booktitle = {Association for Computational Linguistics},
	Year = {2018},
	Location = {Melbourne, Australia},
	Url = {http://cs.umd.edu/~jbg//docs/2018_acl_interpeval.pdf},
}

Cristian Danescu-Niculescu-Mizil

Denis Peskov, Benny Cheng, Ahmed Elgohary Ghoneim, Joe Barrow, Cristian Danescu-Niculescu-Mizil, and Jordan Boyd-Graber. It Takes Two to Lie: One to Lie and One to Listen. Association for Computational Linguistics, 2020. [Video] [Podcast] [Data and Code] [Bibtex]
```
@inproceedings{Peskov:Cheng:Elgohary:Barrow:Danescu-Niculescu-Mizil:Boyd-Graber-2020,
	Title = {It Takes Two to Lie: One to Lie and One to Listen},
	Author = {Denis Peskov and Benny Cheng and Ahmed Elgohary and Joe Barrow and Cristian Danescu-Niculescu-Mizil and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2020},
	Location = {The Cyberverse Simulacrum of Seattle},
	Url = {http://cs.umd.edu/~jbg//docs/2020_acl_diplomacy.pdf},
}
```
Accessible Abstract: Machine learning techniques to detect deception in online communications requires training and evaluation data. However, there is a dearth of data either because of uncertain gold labels or privacy concerns; we create a new, large deception-centered dataset in the online game of Diplomacy. We gathered 17,289 messages from 12 games (each of which took over a month) involving 84 players, the majority of which were unique users. This data was collected with a custom-made bot that allowed us to collect messages and annotations. The user pool was created from scratch: we varied participant demographics across gender, age, nationality, and past game experience. Some of our participants included the former president of the Diplomacy players' association, several top ranked players in the world, a board game shop owner, and scientists. We create machine learning models to detect lies using linguistic, context, and power-dynamic features. Our best model had similar lie detection accuracy to humans.
Vlad Niculae, Srijan Kumar, Jordan Boyd-Graber, and Cristian Danescu-Niculescu-Mizil. Linguistic Harbingers of Betrayal: A Case Study on an Online Strategy Game. Association for Computational Linguistics, 2015. [Code/Data] [Bibtex]
```
@inproceedings{Niculae:Kumar:Boyd-Graber:Danescu-Niculescu-Mizil-2015,
	Title = {Linguistic Harbingers of Betrayal: A Case Study on an Online Strategy Game},
	Author = {Vlad Niculae and Srijan Kumar and Jordan Boyd-Graber and Cristian Danescu-Niculescu-Mizil},
	Booktitle = {Association for Computational Linguistics},
	Year = {2015},
	Location = {Beijing, China},
	Url = {http://cs.umd.edu/~jbg//docs/2015_acl_diplomacy.pdf},
}
```
Accessible Abstract: This paper introduces the application of natural language processing techniques to understand the relationships (and their dissolution) in the game of Diplomacy. This popular board game simulates Europe at the eve of World War I and forces players to work with each other to forge alliances and make plans together. However, the game's setup also encourages players to turn against each other. This paper analyzes whether we can predict these betrayals (we can!) and the linguistic and social phenomena (demands, politeness, and planning) that can predict when a betrayal will happen.

Daniel Osherson

@inproceedings{Boyd-Graber:Fellbaum:Osherson:Schapire-2006,
	Title = {Adding Dense, Weighted, Connections to {WordNet}},
	Author = {Jordan Boyd-Graber and Christiane Fellbaum and Daniel Osherson and Robert Schapire},
	Booktitle = {Proceedings of the Global {WordNet} Conference},
	Year = {2006},
	Location = {Jeju, South Korea},
	Url = {http://cs.umd.edu/~jbg//docs/jbg-jeju.pdf},
}

@inproceedings{Boyd-Graber:Fellbaum:Osherson:Schapire-2006,
	Title = {Adding Dense, Weighted, Connections to {WordNet}},
	Author = {Jordan Boyd-Graber and Christiane Fellbaum and Daniel Osherson and Robert Schapire},
	Booktitle = {Proceedings of the Global {WordNet} Conference},
	Year = {2006},
	Location = {Jeju, South Korea},
	Url = {http://cs.umd.edu/~jbg//docs/jbg-jeju.pdf},
}

David M. Blei

@article{Gerow:Hu:Boyd-Graber:Blei:Evans-2018,
	Title = {Measuring Discursive Influence Across Scholarship},
	Author = {Aaron Gerow and Yuening Hu and Jordan Boyd-Graber and David M. Blei and James A. Evans},
	Journal = {Proceedings of the National Academies of Science},
	Year = {2018},
}

@article{Gerow:Hu:Boyd-Graber:Blei:Evans-2018,
	Title = {Measuring Discursive Influence Across Scholarship},
	Author = {Aaron Gerow and Yuening Hu and Jordan Boyd-Graber and David M. Blei and James A. Evans},
	Journal = {Proceedings of the National Academies of Science},
	Year = {2018},
}

Jonathan Chang, Jordan Boyd-Graber, and David M. Blei. Connections between the Lines: Augmenting Social Networks with Text. Knowledge Discovery and Data Mining, 2009. [Code] [Slides] [Video] [Pie Fight] [Bibtex]

@inproceedings{Chang:Boyd-Graber:Blei-2009,
	Title = {Connections between the Lines: Augmenting Social Networks with Text},
	Url = {http://cs.umd.edu/~jbg//docs/kdd2009.pdf},
	Author = {Jonathan Chang and Jordan Boyd-Graber and David M. Blei},
	Booktitle = {Knowledge Discovery and Data Mining},
	Year = {2009},
	Location = {Paris, France},
}

Jonathan Chang, Jordan Boyd-Graber, Chong Wang, Sean Gerrish, and David M. Blei. Reading Tea Leaves: How Humans Interpret Topic Models. Neural Information Processing Systems, 2009. [Data] [Presentation] [Video] [Bibtex]
```
@inproceedings{Chang:Boyd-Graber:Wang:Gerrish:Blei-2009,
	Title = {Reading Tea Leaves: How Humans Interpret Topic Models},
	Author = {Jonathan Chang and Jordan Boyd-Graber and Chong Wang and Sean Gerrish and David M. Blei},
	Booktitle = {Neural Information Processing Systems},
	Year = {2009},
	Location = {Vancouver, BC},
	Url = {http://cs.umd.edu/~jbg//docs/nips2009-rtl.pdf},
}
```
Jonathan Chang and I shared a NIPS student award honorable mention for this paper (5 out of 1105)

Accessible Abstract: Topic models are a tool that historians and social sciences use to explore large text corpora. But how do you know if you have a good topic model? Before this paper, the consensus was to use held-out likelihood to evaluate if you had a good model. This paper argues that this does not fit how people actually use topic models and proposes new human-centered metrics for evaluating topic models. This method inspired a rethinking of model evaluation and showed that the complexity of a model does not always correspond to what a user might want.
Jordan Boyd-Graber and David M. Blei. Multilingual Topic Models for Unaligned Text. Uncertainty in Artificial Intelligence, 2009. [More Recent Paper] [Bibtex]
```
@inproceedings{Boyd-Graber:Blei-2009,
	Title = {Multilingual Topic Models for Unaligned Text},
	Author = {Jordan Boyd-Graber and David M. Blei},
	Booktitle = {Uncertainty in Artificial Intelligence},
	Year = {2009},
	Location = {Montreal, Quebec},
	Url = {http://cs.umd.edu/~jbg//docs/uai2009.pdf},
}
```
For coverage of current state-of-the-art in cross-lingual topic models see: Yuening Hu, Ke Zhai, Vlad Eidelman, and Jordan Boyd-Graber. Polylingual Tree-Based Topic Models for Translation Domain Adaptation. Association for Computational Linguistics, 2014.
Jordan Boyd-Graber and David M. Blei. Multilingual Topic Models for Unaligned Text. Uncertainty in Artificial Intelligence, 2009. [More Recent Paper] [Bibtex]
```
@inproceedings{Boyd-Graber:Blei-2009,
	Title = {Multilingual Topic Models for Unaligned Text},
	Author = {Jordan Boyd-Graber and David M. Blei},
	Booktitle = {Uncertainty in Artificial Intelligence},
	Year = {2009},
	Location = {Montreal, Quebec},
	Url = {http://cs.umd.edu/~jbg//docs/uai2009.pdf},
}
```
For coverage of current state-of-the-art in cross-lingual topic models see: Yuening Hu, Ke Zhai, Vlad Eidelman, and Jordan Boyd-Graber. Polylingual Tree-Based Topic Models for Translation Domain Adaptation. Association for Computational Linguistics, 2014.

Jonathan Chang, Jordan Boyd-Graber, and David M. Blei. Discovering social networks from free text. 3rd Annual Machine Learning Symposium, 2008. [Bibtex]

@inproceedings{Chang:Boyd-Graber:Blei-2008,
	Title = {Discovering social networks from free text},
	Author = {Jonathan Chang and Jordan Boyd-Graber and David M. Blei},
	Booktitle = {3rd Annual Machine Learning Symposium},
	Year = {2008},
	Location = {New York, New York},
}

Jonathan Chang, Jordan Boyd-Graber, and David M. Blei. Discovering social networks from free text. 3rd Annual Machine Learning Symposium, 2008. [Bibtex]

@inproceedings{Chang:Boyd-Graber:Blei-2008,
	Title = {Discovering social networks from free text},
	Author = {Jonathan Chang and Jordan Boyd-Graber and David M. Blei},
	Booktitle = {3rd Annual Machine Learning Symposium},
	Year = {2008},
	Location = {New York, New York},
}

Jordan Boyd-Graber and David M. Blei. Multilingual Topic Models. NIPS Workshop on Unsupervised Latent Variable Models, 2008. [Bibtex]

@inproceedings{Boyd-Graber:Blei-2008,
	Title = {Multilingual Topic Models},
	Author = {Jordan Boyd-Graber and David M. Blei},
	Booktitle = {NIPS Workshop on Unsupervised Latent Variable Models},
	Year = {2008},
	Location = {Whistler, Canada},
}

Jordan Boyd-Graber and David M. Blei. Syntactic Topic Models. Neural Information Processing Systems, 2008. [Presentation] [Extended Version] [Bibtex]

@inproceedings{Boyd-Graber:Blei-2008,
	Title = {Syntactic Topic Models},
	Author = {Jordan Boyd-Graber and David M. Blei},
	Booktitle = {Neural Information Processing Systems},
	Year = {2008},
	Location = {Vancouver, British Columbia},
	Url = {http://cs.umd.edu/~jbg//docs/nips2008.pdf},
}

Jordan Boyd-Graber and David M. Blei. Syntactic Topic Models. Neural Information Processing Systems, 2008. [Presentation] [Extended Version] [Bibtex]

@inproceedings{Boyd-Graber:Blei-2008,
	Title = {Syntactic Topic Models},
	Author = {Jordan Boyd-Graber and David M. Blei},
	Booktitle = {Neural Information Processing Systems},
	Year = {2008},
	Location = {Vancouver, British Columbia},
	Url = {http://cs.umd.edu/~jbg//docs/nips2008.pdf},
}

Jordan Boyd-Graber and David M. Blei. PUTOP: Turning Predominant Senses into a Topic Model for WSD. 4th International Workshop on Semantic Evaluations, 2007. [Bibtex]

@inproceedings{Boyd-Graber:Blei-2007,
	Title = {{PUTOP}: {T}urning Predominant Senses into a Topic Model for {WSD}},
	Author = {Jordan Boyd-Graber and David M. Blei},
	Booktitle = {4th International Workshop on Semantic Evaluations},
	Year = {2007},
	Location = {Prague, Czech Republic},
	Url = {http://cs.umd.edu/~jbg//docs/jbg-SEMEVAL07.pdf},
}

Jordan Boyd-Graber, David M. Blei, and Xiaojin Zhu. A Topic Model for Word Sense Disambiguation. Empirical Methods in Natural Language Processing, 2007. [Presentation] [Code] [Bibtex]

@inproceedings{Boyd-Graber:Blei:Zhu-2007,
	Title = {A Topic Model for Word Sense Disambiguation},
	Author = {Jordan Boyd-Graber and David M. Blei and Xiaojin Zhu},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2007},
	Location = {Prague, Czech Republic},
	Url = {http://cs.umd.edu/~jbg//docs/jbg-EMNLP07.pdf},
}

David Mimno

Jordan Boyd-Graber, Yuening Hu, and David Mimno. Applications of Topic Models. 2017. [Preprint] [Bibtex]

@book{Boyd-Graber:Hu:Mimno-2017,
	Editor = {Doug Oard},
	Author = {Jordan Boyd-Graber and Yuening Hu and David Mimno},
	Title = {Applications of Topic Models},
	Publisher = {NOW Publishers},
	Series = {Foundations and Trends in Information Retrieval},
	Year = {2017},
	Volume = {11},
	Number = {2--3},
	Url = {http://www.nowpublishers.com/article/Details/INR-030},
}

Jordan Boyd-Graber, Yuening Hu, and David Mimno. Applications of Topic Models. 2017. [Preprint] [Bibtex]

@book{Boyd-Graber:Hu:Mimno-2017,
	Editor = {Doug Oard},
	Author = {Jordan Boyd-Graber and Yuening Hu and David Mimno},
	Title = {Applications of Topic Models},
	Publisher = {NOW Publishers},
	Series = {Foundations and Trends in Information Retrieval},
	Year = {2017},
	Volume = {11},
	Number = {2--3},
	Url = {http://www.nowpublishers.com/article/Details/INR-030},
}

Jordan Boyd-Graber, David Mimno, and David Newman. Care and Feeding of Topic Models: Problems, Diagnostics, and Improvements. Handbook of Mixed Membership Models and Their Applications, 2014. [Bibtex]

@inbook{Boyd-Graber:Mimno:Newman-2014,
	Author = {Jordan Boyd-Graber and David Mimno and David Newman},
	Title = {Care and Feeding of Topic Models: Problems, Diagnostics, and Improvements},
	Editor = {Edoardo M. Airoldi and David Blei and Elena A. Erosheva and Stephen E. Fienberg},
	Booktitle = {Handbook of Mixed Membership Models and Their Applications},
	Series = {CRC Handbooks of Modern Statistical Methods},
	Address = {Boca Raton, Florida},
	Publisher = {CRC Press},
	Year = {2014},
	Url = {http://cs.umd.edu/~jbg//docs/2014_book_chapter_care_and_feeding.pdf},
}

Denis Peskov

Denis Peskov, Viktor Hangya, Jordan Boyd-Graber, and Alexander Fraser. Adapting Entities across Languages and Cultures. Findings of Empirical Methods in Natural Language Processing, 2021. [Research Talk] [Code] [Bibtex]
```
@article{Peskov:Hangya:Boyd-Graber:Fraser-2021,
	Title = {Adapting Entities across Languages and Cultures},
	Author = {Denis Peskov and Viktor Hangya and Jordan Boyd-Graber and Alexander Fraser},
	Journal = {Findings of Empirical Methods in Natural Language Processing},
	Year = {2021},
	Location = {Punta Cana},
	Url = {http://cs.umd.edu/~jbg//docs/2021_emnlp_adaptation.pdf},
}
```
Accessible Abstract: If you ask who Germany's "Christian Drosten" is, a simple answer is that he's their "Anthony Fauci". We create a system to automatically generate these adaptations, which can help improve cross-cultural understanding and create new training data for tasks like question answering.
Alexander Hoyle, Pranav Goel, Denis Peskov, Andrew Hian-Cheong, Jordan Boyd-Graber, and Philip Resnik. Is Automated Topic Model Evaluation Broken?: The Incoherence of Coherence. Neural Information Processing Systems, 2021. [ArXiv] [Research Talk (students)] [Research Talk (Jordan)] [Code] [Bibtex]
```
@inproceedings{Hoyle:Goel:Peskov:Hian-Cheong:Boyd-Graber:Resnik-2021,
	Title = {Is Automated Topic Model Evaluation Broken?: The Incoherence of Coherence},
	Author = {Alexander Hoyle and Pranav Goel and Denis Peskov and Andrew Hian-Cheong and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Neural Information Processing Systems},
	Location = {Online},
	Year = {2021},
	Url = {http://cs.umd.edu/~jbg//docs/2021_neurips_incoherence.pdf},
}
```
Accessible Abstract: Topic models help historians, journalists, and analysts make sense of large text collections. But how do you know if you have a good one? The field has settled on using "Automatic Coherence", but this paper argues that maybe that isn't the right choice if you want to actually make real users happy. This paper builds on our 2009 that showed perplexity was not a good evaluation of interpretability for topic models; while the field adopted automatic topic coherence as a result of that 2009 paper, this paper argues that automatic topic coherence is not a good metric for neural topic models (even though it worked for probabilistic topic models).
Denis Peskov, Benny Cheng, Ahmed Elgohary Ghoneim, Joe Barrow, Cristian Danescu-Niculescu-Mizil, and Jordan Boyd-Graber. It Takes Two to Lie: One to Lie and One to Listen. Association for Computational Linguistics, 2020. [Video] [Podcast] [Data and Code] [Bibtex]
```
@inproceedings{Peskov:Cheng:Elgohary:Barrow:Danescu-Niculescu-Mizil:Boyd-Graber-2020,
	Title = {It Takes Two to Lie: One to Lie and One to Listen},
	Author = {Denis Peskov and Benny Cheng and Ahmed Elgohary and Joe Barrow and Cristian Danescu-Niculescu-Mizil and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2020},
	Location = {The Cyberverse Simulacrum of Seattle},
	Url = {http://cs.umd.edu/~jbg//docs/2020_acl_diplomacy.pdf},
}
```
Accessible Abstract: Machine learning techniques to detect deception in online communications requires training and evaluation data. However, there is a dearth of data either because of uncertain gold labels or privacy concerns; we create a new, large deception-centered dataset in the online game of Diplomacy. We gathered 17,289 messages from 12 games (each of which took over a month) involving 84 players, the majority of which were unique users. This data was collected with a custom-made bot that allowed us to collect messages and annotations. The user pool was created from scratch: we varied participant demographics across gender, age, nationality, and past game experience. Some of our participants included the former president of the Diplomacy players' association, several top ranked players in the world, a board game shop owner, and scientists. We create machine learning models to detect lies using linguistic, context, and power-dynamic features. Our best model had similar lie detection accuracy to humans.

Denis Peskov, Joe Barrow, Pedro Rodriguez, Graham Neubig, and Jordan Boyd-Graber. Mitigating Noisy Inputs for Question Answering. Conference of the International Speech Communication Association, 2019. [Bibtex]

@inproceedings{Peskov:Barrow:Rodriguez:Neubig:Boyd-Graber-2019,
	Title = {Mitigating Noisy Inputs for Question Answering},
	Author = {Denis Peskov and Joe Barrow and Pedro Rodriguez and Graham Neubig and Jordan Boyd-Graber},
	Booktitle = {Conference of the International Speech Communication Association},
	Year = {2019},
	Location = {Graz, Austria},
	Url = {http://cs.umd.edu/~jbg//docs/2019_interspeech_asr},
}

@inproceedings{Elgohary:Peskov:Boyd-Graber-2019,
	Title = {Can You Unpack That? Learning to Rewrite Questions-in-Context},
	Author = {Ahmed Elgohary and Denis Peskov and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2019},
	Location = {Hong Kong, China},
	Url = {http://cs.umd.edu/~jbg//docs/2019_emnlp_sequentialqa.pdf},
}

Eric Ringger

Paul Felt, Eric Ringger, Kevin Seppi, and Jordan Boyd-Graber. Learning from Measurements in Crowdsourcing Models: Inferring Ground Truth from Diverse Annotation Types. International Conference on Computational Linguistics, 2018. [Bibtex]

@inproceedings{Felt:Ringger:Seppi:Boyd-Graber-2018,
	Title = {Learning from Measurements in Crowdsourcing Models: Inferring Ground Truth from Diverse Annotation Types},
	Author = {Paul Felt and Eric Ringger and Kevin Seppi and Jordan Boyd-Graber},
	Booktitle = {International Conference on Computational Linguistics},
	Year = {2018},
	Location = {Santa Fe, New Mexico},
	Url = {http://cs.umd.edu/~jbg//docs/2018_coling_measurements.pdf},
}

Paul Felt, Eric Ringger, Jordan Boyd-Graber, and Kevin Seppi. Making the Most of Crowdsourced Document Annotations: Confused Supervised LDA. Conference on Computational Natural Language Learning, 2015. [Talk] [Bibtex]

@inproceedings{Felt:Ringger:Boyd-Graber:Seppi-2015,
	Title = {Making the Most of Crowdsourced Document Annotations: Confused Supervised {LDA}},
	Author = {Paul Felt and Eric Ringger and Jordan Boyd-Graber and Kevin Seppi},
	Booktitle = {Conference on Computational Natural Language Learning},
	Year = {2015},
	Location = {Beijing, China},
	Url = {http://cs.umd.edu/~jbg//docs/2015_conll_cslda.pdf},
}

This paper received the best paper award at CoNLL

Thang Nguyen, Jordan Boyd-Graber, Jeff Lund, Kevin Seppi, and Eric Ringger. Is your anchor going up or down? Fast and accurate supervised topic models. North American Association for Computational Linguistics, 2015. [Bibtex]

@inproceedings{Nguyen:Boyd-Graber:Lund:Seppi:Ringger-2015,
	Title = {Is your anchor going up or down?  {F}ast and accurate supervised topic models},
	Author = {Thang Nguyen and Jordan Boyd-Graber and Jeff Lund and Kevin Seppi and Eric Ringger},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2015},
	Location = {Denver, Colorado},
	Url = {http://cs.umd.edu/~jbg//docs/2015_naacl_supervised_anchor.pdf},
}

Eric Wallace

Eric Wallace, Shi Feng, and Jordan Boyd-Graber. Misleading Failures of Partial-input Baselines. Association for Computational Linguistics, 2019. [Bibtex]

@inproceedings{Wallace:Feng:Boyd-Graber-2019,
	Title = {Misleading Failures of Partial-input Baselines},
	Author = {Eric Wallace and Shi Feng and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2019},
	Location = {Florence, Italy},
	Url = {http://cs.umd.edu/~jbg//docs/2019_acl_flipside.pdf},
}

Eric Wallace, Pedro Rodriguez, Shi Feng, Ikuya Yamada, and Jordan Boyd-Graber. Trick Me If You Can: Human-in-the-loop Generation of Adversarial Question Answering Examples. Transactions of the Association for Computational Linguistics, 2019. [Code] [Videos] [Data] [Bibtex]

@article{Wallace:Rodriguez:Feng:Yamada:Boyd-Graber-2019,
	Title = {Trick Me If You Can: Human-in-the-loop Generation of Adversarial Question Answering Examples},
	Author = {Eric Wallace and Pedro Rodriguez and Shi Feng and Ikuya Yamada and Jordan Boyd-Graber},
	Booktitle = {Transactions of the Association for Computational Linguistics},
	Year = {2019},
	Volume = {10},
	Url = {http://cs.umd.edu/~jbg//docs/2019_tacl_trick.pdf},
}

Eric Wallace and Jordan Boyd-Graber. Trick Me If You Can: Adversarial Writing of Trivia Challenge Questions. ACL Student Research Workshop, 2018. [Bibtex]

@inproceedings{Wallace:Boyd-Graber-2018,
	Title = {Trick Me If You Can: Adversarial Writing of Trivia Challenge Questions},
	Author = {Eric Wallace and Jordan Boyd-Graber},
	Booktitle = {ACL Student Research Workshop},
	Year = {2018},
	Location = {Melbourne, Australia},
	Url = {http://aclweb.org/anthology/P18-3018},
}

Shi Feng, Eric Wallace, and Jordan Boyd-Graber. Interpreting Neural Networks with Nearest Neighbors. EMNLP Workshop on BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP, 2018. [Bibtex]

@inproceedings{Feng:Wallace:Boyd-Graber-2018,
	Title = {Interpreting Neural Networks with Nearest Neighbors},
	Author = {Shi Feng and Eric Wallace and Jordan Boyd-Graber},
	Booktitle = {EMNLP Workshop on BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP},
	Year = {2018},
	Location = {Brussels, Belgium},
	Url = {http://aclweb.org/anthology/W18-5416},
}

Shi Feng, Eric Wallace, Alvin Grissom II, Pedro Rodriguez, Mohit Iyyer, and Jordan Boyd-Graber. Pathologies of Neural Models Make Interpretation Difficult. Empirical Methods in Natural Language Processing, 2018. [Blog Post] [Bibtex]

@inproceedings{Feng:Wallace:II:Rodriguez:Iyyer:Boyd-Graber-2018,
	Title = {Pathologies of Neural Models Make Interpretation Difficult},
	Author = {Shi Feng and Eric Wallace and Alvin Grissom II and Pedro Rodriguez and Mohit Iyyer and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2018},
	Location = {Brussels, Belgium},
	Url = {http://cs.umd.edu/~jbg//docs/2018_emnlp_rs.pdf},
}

Eve Fleisig

Yoo Yeon Sung, Eve Fleisig, Yu Hope, Ishan Upadhyay, and Jordan Lee Boyd-Graber. GRACE: A Granular Benchmark for Evaluating Model Calibration against Human Calibration. ArXiv, Preprint. [Bibtex]

@article{Sung:Fleisig:Hope:Upadhyay:Boyd-Graber-Preprint,
	Title = {GRACE: A Granular Benchmark for Evaluating Model Calibration against Human Calibration},
	Author = {Yoo Yeon Sung and Eve Fleisig and Yu Hope and Ishan Upadhyay and Jordan Lee Boyd-Graber},
	Journal = {ArXiv},
	Year = {Preprint},
	Url = {https://arxiv.org/pdf/2502.19684},
}

Yoo Yeon Sung, Maharshi Gor, Eve Fleisig, Ishani Mondal, and Jordan Lee Boyd-Graber. ADVSCORE: A Metric for the Evaluation and Creation of Adversarial Benchmarks. North American Association for Computational Linguistics, 2025. [Bibtex]
```
@inproceedings{Sung:Gor:Fleisig:Mondal:Boyd-Graber-2025,
	Title = {ADVSCORE: A Metric for the Evaluation and Creation of Adversarial Benchmarks},
	Author = {Yoo Yeon Sung and Maharshi Gor and Eve Fleisig and Ishani Mondal and Jordan Lee Boyd-Graber},
	Journal = {North American Association for Computational Linguistics},
	Year = {2025},
	Url = {http://cs.umd.edu/~jbg//docs/2025_naacl_advscore.pdf},
}
```
Accessible Abstract: Adversarial datasets should validate AI robustness by presenting samples that humans handle well but models struggle with. However, as models advance, these datasets risk becoming obsolete. Assessing whether a dataset remains adversarial is challenging due to the absence of a standardized metric for adversarialness. To address this, we introduce AdvScore, a human-grounded evaluation metric that quantifies a dataset's adversarial nature by accounting for the differing abilities of models and humans while also identifying low-quality examples.

Evgeny Klochikhin

Evgeny Klochikhin and Jordan Boyd-Graber. Text Analysis. Big Data and Social Science Research: Theory and Practical Approaches, 2016. [Bibtex]

@inbook{Klochikhin:Boyd-Graber-2016,
	Editor = {Julia Lane and Ian Foster and Rayid Ghani and Ron Jarmin and Frauke Kreuter},
	Title = {Text Analysis},
	Author = {Evgeny Klochikhin and Jordan Boyd-Graber},
	Booktitle = {Big Data and Social Science Research: Theory and Practical Approaches},
	Publisher = {Taylor Francis},
	Year = {2016},
}

Evgeny Klochikhin and Jordan Boyd-Graber. Text Analysis. Big Data and Social Science Research: Theory and Practical Approaches, 2016. [Bibtex]

@inbook{Klochikhin:Boyd-Graber-2016,
	Editor = {Julia Lane and Ian Foster and Rayid Ghani and Ron Jarmin and Frauke Kreuter},
	Title = {Text Analysis},
	Author = {Evgeny Klochikhin and Jordan Boyd-Graber},
	Booktitle = {Big Data and Social Science Research: Theory and Practical Approaches},
	Publisher = {Taylor Francis},
	Year = {2016},
}

Fenfei Guo

Fenfei Guo, Chen Zhang, Zhirui Zhang, Qixin He, Kejun Zhang, Jun Xie, and Jordan Boyd-Graber. Automatic Song Translation for Tonal Languages. Findings of the Association for Computational Linguistics, 2022. [Translation Examples (with sound)] [Code] [Bibtex]

@article{Guo:Zhang:Zhang:He:Zhang:Xie:Boyd-Graber-2022,
	Author = {Fenfei Guo and Chen Zhang and Zhirui Zhang and Qixin He and Kejun Zhang and Jun Xie and Jordan Boyd-Graber},
	Title = {Automatic Song Translation for Tonal Languages},
	Journal = {Findings of the Association for Computational Linguistics},
	Year = {2022},
	Location = {Dublin},
	Url = {http://cs.umd.edu/~jbg//docs/2022_acl_ast.pdf},
}

Fenfei Guo, Jordan Boyd-Graber, Mohit Iyyer, and Leah Findlater. Which Evaluations Uncover Sense Representations that Actually Make Sense?. Linguistic Resources and Evaluation Conference, 2020. [Bibtex]

@inproceedings{Guo:Boyd-Graber:Iyyer:Findlater-2020,
	Author = {Fenfei Guo and Jordan Boyd-Graber and Mohit Iyyer and Leah Findlater},
	Location = {France (but only in dreams)},
	Booktitle = {Linguistic Resources and Evaluation Conference},
	Url = {http://cs.umd.edu/~jbg//docs/2020_lrec_sense.pdf},
	Year = {2020},
	Title = {Which Evaluations Uncover Sense Representations that Actually Make Sense?},
}

Feng Gu

Feng Gu, Wichayaporn Wongkamjan, Jonathan K. Kummerfeld, Denis Peskoff, Jonathan May, and Jordan Boyd-Graber. Personalized Help for Optimizing Low-Skilled Users' Strategy. Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, 2025. [Bibtex]
```
@inproceedings{Gu:Wongkamjan:Kummerfeld:Peskoff:May:Boyd-Graber-2025,
	Title = {Personalized Help for Optimizing Low-Skilled Users' Strategy},
	Author = {Feng Gu and Wichayaporn Wongkamjan and Jonathan K. Kummerfeld and Denis Peskoff and Jonathan May and Jordan Boyd-Graber},
	Booktitle = {Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics},
	Year = {2025},
	Location = {Albuquerque},
	Url = {http://cs.umd.edu/~jbg//docs/2024_arr_chiron-advisor.pdf},
}
```
Accessible Abstract: AIs can beat humans in game environments; however, how helpful those agents are to human remains understudied. We augment CICERO, a natural language agent that demonstrates superhuman performance in Diplomacy, to generate both move and message advice based on player intentions. A dozen Diplomacy games with novice and experienced players, with varying advice settings, show that some of the generated advice is beneficial. It helps novices compete with experienced players and in some instances even surpass them. The mere presence of advice can be advantageous, even if players do not follow it.
Nishant Balepur, Feng Gu, Abhilasha Ravichander, Shi Feng, Jordan Boyd-Graber, and Rachel Rudinger. Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer?. Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, 2025. [Bibtex]
```
@inproceedings{Balepur:Gu:Ravichander:Feng:Boyd-Graber:Rudinger-2025,
	Title = {Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer?},
	Author = {Nishant Balepur and Feng Gu and Abhilasha Ravichander and Shi Feng and Jordan Boyd-Graber and Rachel Rudinger},
	Booktitle = {Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics},
	Year = {2025},
	Location = {Albuquerque},
	Url = {http://cs.umd.edu/~jbg//docs/2025_naacl_reverseqa.pdf},
}
```
Accessible Abstract: Language models like ChatGPT are pretty good at answering questions (e.g. "What is 12 * 12?"), but we show they can surprisingly struggle when asked to do the reverse task: generating questions for answers (e.g. "Give me a question with the answer 144"). We study when these errors happen, what might be causing them, and how they can be addressed.

Forough Poursabzi-Sangdeh

@article{Smith:Lee:Poursabzi-Sangdeh:Boyd-Graber:Seppi:Elmqvist:Findlater-2017,
	Author = {Alison Smith and Tak Yeon Lee and Forough Poursabzi-Sangdeh and Jordan Boyd-Graber and Kevin Seppi and Niklas Elmqvist and Leah Findlater},
	Journal = {Transactions of the Association for Computational Linguistics},
	Year = {2017},
	Title = {Evaluating Visual Representations for Topic Understanding and Their Effects on Manually Generated Labels},
	Volume = {5},
	Url = {http://cs.umd.edu/~jbg//docs/2017_tacl_eval_tm_viz.pdf},
	Pages = {1--15},
}

Forough Poursabzi-Sangdeh, Jordan Boyd-Graber, Leah Findlater, and Kevin Seppi. ALTO: Active Learning with Topic Overviews for Speeding Label Induction and Document Labeling. Association for Computational Linguistics, 2016. [Code] [Bibtex]

@inproceedings{Poursabzi-Sangdeh:Boyd-Graber:Findlater:Seppi-2016,
	Title = {ALTO: Active Learning with Topic Overviews for Speeding Label Induction and Document Labeling},
	Author = {Forough Poursabzi-Sangdeh and Jordan Boyd-Graber and Leah Findlater and Kevin Seppi},
	Booktitle = {Association for Computational Linguistics},
	Year = {2016},
	Location = {Berlin, Brandenburg},
	Url = {http://cs.umd.edu/~jbg//docs/2016_acl_doclabel.pdf},
}

@inproceedings{Smith:Lee:Poursabzi-Sangdeh:Boyd-Graber:Seppi:Elmqvist:Findlater-2016,
	Author = {Alison Smith and Tak Yeon Lee and Forough Poursabzi-Sangdeh and Jordan Boyd-Graber and Kevin Seppi and Niklas Elmqvist and Leah Findlater},
	Booktitle = {CHI Human Centred Machine Learning Workshop},
	Year = {2016},
	Location = {San Jose, CA},
	Title = {Human-Centered and Interactive: Expanding the Impact of Topic Models},
}

Forough Poursabzi-Sangdeh and Jordan Boyd-Graber. Speeding Document Annotation with Topic Models. NAACL Student Research Workshop, 2015. [Bibtex]

@inproceedings{Poursabzi-Sangdeh:Boyd-Graber-2015,
	Title = {Speeding Document Annotation with Topic Models},
	Author = {Forough Poursabzi-Sangdeh and Jordan Boyd-Graber},
	Booktitle = {NAACL Student Research Workshop},
	Year = {2015},
	Location = {Denver, CO},
}

Jason Chuang, John D. Wilkerson, Rebecca Weiss, Dustin Tingley, Brandon M. Stewart, Margaret E. Roberts, Forough Poursabzi-Sangdeh, Justin Grimmer, Leah Findlater, Jordan Boyd-Graber, and Jeffrey Heer. Computer-Assisted Content Analysis: Topic Models for Exploring Multiple Subjective Interpretations. NIPS Workshop on Human-Propelled Machine Learning, 2014. [Bibtex]

@inproceedings{Chuang:Wilkerson:Weiss:Tingley:Stewart:Roberts:Poursabzi-Sangdeh:Grimmer:Findlater:Boyd-Graber:Heer-2014,
	Title = {Computer-Assisted Content Analysis: Topic Models for Exploring Multiple Subjective Interpretations},
	Author = {Jason Chuang and John D. Wilkerson and Rebecca Weiss and Dustin Tingley and Brandon M. Stewart and Margaret E. Roberts and Forough Poursabzi-Sangdeh and Justin Grimmer and Leah Findlater and Jordan Boyd-Graber and Jeffrey Heer},
	Booktitle = {NIPS Workshop on Human-Propelled Machine Learning},
	Year = {2014},
	Location = {Montreal, Canada},
}

Graham Neubig

@inproceedings{Peskov:Barrow:Rodriguez:Neubig:Boyd-Graber-2019,
	Title = {Mitigating Noisy Inputs for Question Answering},
	Author = {Denis Peskov and Joe Barrow and Pedro Rodriguez and Graham Neubig and Jordan Boyd-Graber},
	Booktitle = {Conference of the International Speech Communication Association},
	Year = {2019},
	Location = {Graz, Austria},
	Url = {http://cs.umd.edu/~jbg//docs/2019_interspeech_asr},
}

@inproceedings{Stewart:Vogler:Hu:Boyd-Graber:Neubig-2018,
	Title = {Automatic Estimation of Simultaneous Interpreter Performance},
	Author = {Craig Stewart and Nikolai Vogler and Junjie Hu and Jordan Boyd-Graber and Graham Neubig},
	Booktitle = {Association for Computational Linguistics},
	Year = {2018},
	Location = {Melbourne, Australia},
	Url = {http://cs.umd.edu/~jbg//docs/2018_acl_interpeval.pdf},
}

@inproceedings{Stewart:Vogler:Hu:Boyd-Graber:Neubig-2018,
	Title = {Automatic Estimation of Simultaneous Interpreter Performance},
	Author = {Craig Stewart and Nikolai Vogler and Junjie Hu and Jordan Boyd-Graber and Graham Neubig},
	Booktitle = {Association for Computational Linguistics},
	Year = {2018},
	Location = {Melbourne, Australia},
	Url = {http://cs.umd.edu/~jbg//docs/2018_acl_interpeval.pdf},
}

Hal Daumé III

Chen Zhao, Chenyan Xiong, Jordan Boyd-Graber, and Hal Daumé III. Distantly-Supervised Dense Retrieval Enables Open-Domain Question Answering without Evidence Annotation. Empirical Methods in Natural Language Processing, 2021. [Code] [Research Talk] [Bibtex]
```
@inproceedings{Zhao:Xiong:Boyd-Graber:Daume-III-2021,
	Title = {Distantly-Supervised Dense Retrieval Enables Open-Domain Question Answering without Evidence Annotation},
	Author = {Chen Zhao and Chenyan Xiong and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2021},
	Location = {Punta Cana},
	Url = {http://cs.umd.edu/~jbg//docs/2021_emnlp_weak_dpr.pdf},
}
```
Accessible Abstract: Answering questions sometimes requires tying multiple pieces of information together. Previous datasets have required annotators to explicitly build these reasoning chains (e.g., to answer "where do I know the cop from Die Hard from", you need to figure out that the actor's name is "Reginald VelJohnson" and then find out that he's best known as the dad on Family Matters.). By exploring search queries that get to the right answer, we're able to answer these questions without expensive annotation.
Chen Zhao, Chenyan Xiong, Hal Daumé III, and Jordan Boyd-Graber. Multi-Step Reasoning Over Unstructured Text with Beam Dense Retrieval. North American Association for Computational Linguistics, 2021. [Paper Read Aloud] [Bibtex]
```
@inproceedings{Zhao:Xiong:Daume-III:Boyd-Graber-2021,
	Title = {Multi-Step Reasoning Over Unstructured Text with Beam Dense Retrieval},
	Author = {Chen Zhao and Chenyan Xiong and Hal {Daum\'{e} III} and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2021},
	Url = {http://cs.umd.edu/~jbg//docs/2021_naacl_multi_ance.pdf},
}
```
Accessible Abstract: For computers to answer complicated questions online, they often need to put together multiple pieces of information (Ronald Reagan was both governor of California and an actor in Bedtime for Bonzo). However, existing approaches use the links in Wikipedia to combine these clues. This research helps computers find connected information without using these explicit links.

@article{Shi:Zhao:Boyd-Graber:Daume-III:Lee-2020,
	Title = {On the Potential of Lexico-logical Alignments for Semantic Parsing to SQL Queries},
	Author = {Tianze Shi and Chen Zhao and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Lillian Lee},
	Journal = {Findings of EMNLP},
	Year = {2020},
	Url = {http://cs.umd.edu/~jbg//docs/2020_findings_qalign.pdf},
}

@inproceedings{Iyyer:Manjunatha:Guha:Vyas:Boyd-Graber:Daume-III:Davis-2017,
	Title = {The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives},
	Author = {Mohit Iyyer and Varun Manjunatha and Anupam Guha and Yogarshi Vyas and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Larry Davis},
	Booktitle = {Computer Vision and Pattern Recognition},
	Year = {2017},
	Url = {http://cs.umd.edu/~jbg//docs/2017_cvpr_comics.pdf},
}

@inproceedings{Iyyer:Manjunatha:Guha:Vyas:Boyd-Graber:Daume-III:Davis-2017,
	Title = {The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives},
	Author = {Mohit Iyyer and Varun Manjunatha and Anupam Guha and Yogarshi Vyas and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Larry Davis},
	Booktitle = {Computer Vision and Pattern Recognition},
	Year = {2017},
	Url = {http://cs.umd.edu/~jbg//docs/2017_cvpr_comics.pdf},
}

Khanh Nguyen, Jordan Boyd-Graber, and Hal Daumé III. Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback. Empirical Methods in Natural Language Processing, 2017. [Code] [Bibtex]

@inproceedings{Nguyen:Boyd-Graber:Daume-III-2017,
	Title = {Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback},
	Author = {Khanh Nguyen and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2017},
	Location = {Copenhagen, Denmark},
	Url = {http://cs.umd.edu/~jbg//docs/2017_emnlp_bandit_mt.pdf},
}

Hadi Amiri, Philip Resnik, Jordan Boyd-Graber, and Hal Daumé III. Learning Text Pair Similarity with Context-sensitive Autoencoders. Association for Computational Linguistics, 2016. [Code] [Bibtex]

@inproceedings{Amiri:Resnik:Boyd-Graber:Daume-III-2016,
	Title = {Learning Text Pair Similarity with Context-sensitive Autoencoders},
	Author = {Hadi Amiri and Philip Resnik and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {Association for Computational Linguistics},
	Year = {2016},
	Location = {Berlin, Brandenburg},
	Url = {http://cs.umd.edu/~jbg//docs/2016_acl_context_ae.pdf},
}

He He, Jordan Boyd-Graber, Kevin Kwok, and Hal Daumé III. Opponent Modeling in Deep Reinforcement Learning. International Conference on Machine Learning, 2016. [Video] [Bibtex]

@inproceedings{He:Boyd-Graber:Kwok:Daume-III-2016,
	Title = {Opponent Modeling in Deep Reinforcement Learning},
	Author = {He He and Jordan Boyd-Graber and Kevin Kwok and Hal {Daum\'{e} III}},
	Booktitle = {International Conference on Machine Learning},
	Year = {2016},
	Location = {New York, NY},
	Url = {http://cs.umd.edu/~jbg//docs/2016_icml_opponent.pdf},
}

@inproceedings{Iyyer:Guha:Chaturvedi:Boyd-Graber:Daume-III-2016,
	Title = {Feuding Families and Former Friends: Unsupervised Learning for Dynamic Fictional Relationships},
	Author = {Mohit Iyyer and Anupam Guha and Snigdha Chaturvedi and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2016},
	Location = {San Diego, CA},
	Url = {http://cs.umd.edu/~jbg//docs/2016_naacl_relationships.pdf},
}

Best paper award (2 out of 1592)

He He, Jordan Boyd-Graber, and Hal Daumé III. Interpretese vs. Translationese: The Uniqueness of Human Strategies in Simultaneous Interpretation. North American Association for Computational Linguistics, 2016. [Talk] [Bibtex]

@inproceedings{He:Boyd-Graber:Daume-III-2016,
	Title = {Interpretese vs. Translationese: The Uniqueness of Human Strategies in Simultaneous Interpretation},
	Author = {He He and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2016},
	Location = {San Diego, CA},
	Url = {http://cs.umd.edu/~jbg//docs/2016_naacl_interpretese.pdf},
}

Mohit Iyyer, Varun Manjunatha, Jordan Boyd-Graber, and Hal Daumé III. Deep Unordered Composition Rivals Syntactic Methods for Text Classification. Association for Computational Linguistics, 2015. [Slides] [Code] [Talk] [Bibtex]

@inproceedings{Iyyer:Manjunatha:Boyd-Graber:Daume-III-2015,
	Title = {Deep Unordered Composition Rivals Syntactic Methods for Text Classification},
	Author = {Mohit Iyyer and Varun Manjunatha and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {Association for Computational Linguistics},
	Year = {2015},
	Location = {Beijing, China},
	Url = {http://cs.umd.edu/~jbg//docs/2015_acl_dan.pdf},
}

@inproceedings{He:Grissom-II:Boyd-Graber:Daume-III-2015,
	Title = {Syntax-based Rewriting for Simultaneous Machine Translation},
	Author = {He He and Alvin {Grissom II} and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2015},
	Location = {Lisbon, Portugal},
	Url = {http://cs.umd.edu/~jbg//docs/2015_emnlp_rewrite.pdf},
}

Jordan Boyd-Graber, Mohit Iyyer, He He, and Hal Daumé III. Interactive Incremental Question Answering. Neural Information Processing Systems, 2015. [Bibtex]

@inproceedings{Boyd-Graber:Iyyer:He:Daume-III-2015,
	Title = {Interactive Incremental Question Answering},
	Author = {Jordan Boyd-Graber and Mohit Iyyer and He He and Hal {Daum\'{e} III}},
	Booktitle = {Neural Information Processing Systems},
	Year = {2015},
	Location = {Montreal, Canada},
}

This won the best demonstration award at NIPS 2015

Mohit Iyyer, Jordan Boyd-Graber, Leonardo Claudino, Richard Socher, and Hal Daumé III. A Neural Network for Factoid Question Answering over Paragraphs. Empirical Methods in Natural Language Processing, 2014. [Code/Data] [Bibtex]

@inproceedings{Iyyer:Boyd-Graber:Claudino:Socher:Daume-III-2014,
	Title = {A Neural Network for Factoid Question Answering over Paragraphs},
	Author = {Mohit Iyyer and Jordan Boyd-Graber and Leonardo Claudino and Richard Socher and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2014},
	Location = {Doha, Qatar},
	Url = {http://cs.umd.edu/~jbg//docs/2014_emnlp_qb_rnn.pdf},
}

The partial derivatives of "C" and "J" with respect to the parameters should be switched in Equation 7.

@inproceedings{Grissom-II:He:Boyd-Graber:Morgan:Daume-III-2014,
	Title = {Don't Until the Final Verb Wait: Reinforcement Learning for Simultaneous Machine Translation},
	Author = {Alvin {Grissom II} and He He and Jordan Boyd-Graber and John Morgan and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2014},
	Location = {Doha, Qatar},
	Url = {http://cs.umd.edu/~jbg//docs/2014_emnlp_simtrans.pdf},
}

Mohit Iyyer, Jordan Boyd-Graber, and Hal Daumé III. Generating Sentences from Semantic Vector Space Representations. NIPS Workshop on Learning Semantics, 2014. [Bibtex]

@inproceedings{Iyyer:Boyd-Graber:Daume-III-2014,
	Title = {Generating Sentences from Semantic Vector Space Representations},
	Author = {Mohit Iyyer and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {NIPS Workshop on Learning Semantics},
	Year = {2014},
	Location = {Montreal, Canada},
}

Yuening Hu, Jordan Boyd-Graber, Hal Daumé III, and Z. Irene Ying. Binary to Bushy: Bayesian Hierarchical Clustering with the Beta Coalescent. Neural Information Processing Systems, 2013. [Supplement] [Data] [Bibtex]

@inproceedings{Hu:Boyd-Graber:Daume-III:Ying-2013,
	Url = {http://cs.umd.edu/~jbg//docs/2013_coalescent.pdf},
	Title = {Binary to Bushy: Bayesian Hierarchical Clustering with the Beta Coalescent},
	Author = {Yuening Hu and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Z. Irene Ying},
	Booktitle = {Neural Information Processing Systems},
	Year = {2013},
}

@inproceedings{Boyd-Graber:Satinoff:He:Daume-III-2012,
	Title = {Besting the Quiz Master: Crowdsourcing Incremental Classification Games},
	Author = {Jordan Boyd-Graber and Brianna Satinoff and He He and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2012},
	Url = {http://cs.umd.edu/~jbg//docs/qb_emnlp_2012.pdf},
	Location = {Jeju, South Korea},
}

He He

Pedro Rodriguez, Shi Feng, Mohit Iyyer, He He, and Jordan Boyd-Graber. Quizbowl: The Case for Incremental Question Answering. ArXiv, Preprint. [Webpage] [Bibtex]

@article{Rodriguez:Feng:Iyyer:He:Boyd-Graber-Preprint,
	Title = {Quizbowl: The Case for Incremental Question Answering},
	Author = {Pedro Rodriguez and Shi Feng and Mohit Iyyer and He He and Jordan Boyd-Graber},
	Journal = {ArXiv},
	Year = {Preprint},
	Url = {https://arxiv.org/abs/1904.04792},
}

He He, Jordan Boyd-Graber, Kevin Kwok, and Hal Daumé III. Opponent Modeling in Deep Reinforcement Learning. International Conference on Machine Learning, 2016. [Video] [Bibtex]

@inproceedings{He:Boyd-Graber:Kwok:Daume-III-2016,
	Title = {Opponent Modeling in Deep Reinforcement Learning},
	Author = {He He and Jordan Boyd-Graber and Kevin Kwok and Hal {Daum\'{e} III}},
	Booktitle = {International Conference on Machine Learning},
	Year = {2016},
	Location = {New York, NY},
	Url = {http://cs.umd.edu/~jbg//docs/2016_icml_opponent.pdf},
}

@inproceedings{He:Boyd-Graber:Daume-III-2016,
	Title = {Interpretese vs. Translationese: The Uniqueness of Human Strategies in Simultaneous Interpretation},
	Author = {He He and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2016},
	Location = {San Diego, CA},
	Url = {http://cs.umd.edu/~jbg//docs/2016_naacl_interpretese.pdf},
}

@inproceedings{He:Grissom-II:Boyd-Graber:Daume-III-2015,
	Title = {Syntax-based Rewriting for Simultaneous Machine Translation},
	Author = {He He and Alvin {Grissom II} and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2015},
	Location = {Lisbon, Portugal},
	Url = {http://cs.umd.edu/~jbg//docs/2015_emnlp_rewrite.pdf},
}

Jordan Boyd-Graber, Mohit Iyyer, He He, and Hal Daumé III. Interactive Incremental Question Answering. Neural Information Processing Systems, 2015. [Bibtex]

@inproceedings{Boyd-Graber:Iyyer:He:Daume-III-2015,
	Title = {Interactive Incremental Question Answering},
	Author = {Jordan Boyd-Graber and Mohit Iyyer and He He and Hal {Daum\'{e} III}},
	Booktitle = {Neural Information Processing Systems},
	Year = {2015},
	Location = {Montreal, Canada},
}

This won the best demonstration award at NIPS 2015

@inproceedings{Grissom-II:He:Boyd-Graber:Morgan:Daume-III-2014,
	Title = {Don't Until the Final Verb Wait: Reinforcement Learning for Simultaneous Machine Translation},
	Author = {Alvin {Grissom II} and He He and Jordan Boyd-Graber and John Morgan and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2014},
	Location = {Doha, Qatar},
	Url = {http://cs.umd.edu/~jbg//docs/2014_emnlp_simtrans.pdf},
}

@inproceedings{Boyd-Graber:Satinoff:He:Daume-III-2012,
	Title = {Besting the Quiz Master: Crowdsourcing Incremental Classification Games},
	Author = {Jordan Boyd-Graber and Brianna Satinoff and He He and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2012},
	Url = {http://cs.umd.edu/~jbg//docs/qb_emnlp_2012.pdf},
	Location = {Jeju, South Korea},
}

HyoJung Han

HyoJung Han, Marine Carpuat, and Jordan Boyd-Graber. Automatic Explicitation to Bridge the Background Knowledge Gap in Translation and its Evaluation with Multilingual QA. Empirical Methods in Natural Language Processing, 2023. [Video] [Bibtex]
```
@inproceedings{Han:Carpuat:Boyd-Graber-2023,
	Title = {Automatic Explicitation to Bridge the Background Knowledge Gap in Translation and its Evaluation with Multilingual QA},
	Author = {HyoJung Han and Marine Carpuat and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2023},
	Location = {Singapore},
	Url = {http://cs.umd.edu/~jbg//docs/2023_emnlp_explicitation.pdf},
}
```
Accessible Abstract: Sometimes when you a translating from one language to another, a literal translation is not enough. Sometimes to actually understand what is being said, you need additional context. Professional translators know this, and the process that they use to help a listener is called "explicitation" to capturing cultural differences between source and target audiences. We introduce techniques for automatically generating explicitations, motivated by WikiExpl(a dataset collected from Wikipedia and annotate with human translators), and evaluate the explicitation.
HyoJung Han, Marine Carpuat, and Jordan Boyd-Graber. SimQA: Detecting Simultaneous MT Errors through Word-by-Word Question Answering. Empirical Methods in Natural Language Processing, 2022. [Code] [Research Talk] [Bibtex]
```
@inproceedings{Han:Carpuat:Boyd-Graber-2022,
	Title = {SimQA: Detecting Simultaneous MT Errors through Word-by-Word Question Answering},
	Author = {HyoJung Han and Marine Carpuat and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2022},
	Location = {Abu Dhabi},
	Url = {http://cs.umd.edu/~jbg//docs/2022_emnlp_simqa.pdf},
}
```
Accessible Abstract: Simultaneous interpretation (where a translation happens word by word before the source sentence is finished) is difficult to evaluate. We created a new evaluation framework based on the following scenario: imagine that you're thrown into a trivia gameshow where you don't know the language. Specifically, it's a game format where you interrupt the question word by word as soon as possible. Our hypothesis is that a monolingual player (who doesn't speak the source language) will be able to do better in the game with a better simultaneous translation system. In this 2022 EMNLP publication, we show that this evaluation is not only cheaper (you just need to translate the answer) but can also detect hallucinations and undertranslations better than existing evaluation methods.

Ishani Mondal

Yoo Yeon Sung, Maharshi Gor, Eve Fleisig, Ishani Mondal, and Jordan Lee Boyd-Graber. ADVSCORE: A Metric for the Evaluation and Creation of Adversarial Benchmarks. North American Association for Computational Linguistics, 2025. [Bibtex]
```
@inproceedings{Sung:Gor:Fleisig:Mondal:Boyd-Graber-2025,
	Title = {ADVSCORE: A Metric for the Evaluation and Creation of Adversarial Benchmarks},
	Author = {Yoo Yeon Sung and Maharshi Gor and Eve Fleisig and Ishani Mondal and Jordan Lee Boyd-Graber},
	Journal = {North American Association for Computational Linguistics},
	Year = {2025},
	Url = {http://cs.umd.edu/~jbg//docs/2025_naacl_advscore.pdf},
}
```
Accessible Abstract: Adversarial datasets should validate AI robustness by presenting samples that humans handle well but models struggle with. However, as models advance, these datasets risk becoming obsolete. Assessing whether a dataset remains adversarial is challenging due to the absence of a standardized metric for adversarialness. To address this, we introduce AdvScore, a human-grounded evaluation metric that quantifies a dataset's adversarial nature by accounting for the differing abilities of models and humans while also identifying low-quality examples.

@inproceedings{Mondal:S:Natarajan:Garimella:Bandyopadhyay:Boyd-Graber-2024,
	Title = {Presentations by the People, for the People: Harnessing LLMs for Generating Persona-Aware Slides from Documents},
	Author = {Ishani Mondal and Shwetha S and Anandhavelu Natarajan and Aparna Garimella and Sambaran Bandyopadhyay and Jordan Boyd-Graber},
	Booktitle = {European Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_eacl_slides.pdf},
}

Zongxia Li, Ishani Mondal, Huy Nghiem, Yijun Liang, and Jordan Boyd-Graber. PEDANTS (Precise Evaluations of Diverse Answer Nominee Text for Skinflints): Use Evaluation Metrics Wisely---Efficient Evaluation Analysis and Benchmarking for Open-Domain Question Answering. Findings of the Empirical Methods in Natural Language Processing, 2024. [Bibtex]

@article{Li:Mondal:Nghiem:Liang:Boyd-Graber-2024,
	Title = {PEDANTS (Precise Evaluations of Diverse Answer Nominee Text for Skinflints): Use Evaluation Metrics Wisely---Efficient Evaluation Analysis and Benchmarking for Open-Domain Question Answering},
	Author = {Zongxia Li and Ishani Mondal and Huy Nghiem and Yijun Liang and Jordan Boyd-Graber},
	Journal = {Findings of the Empirical Methods in Natural Language Processing},
	Location = {Miami},
	Year = {2024},
	Url = {https://arxiv.org/abs/2402.11161},
}

@article{Mondal:Li:Hou:Natarajan:Garimella:Bandyopadhyay:Boyd-Graber-2024,
	Title = {SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement},
	Author = {Ishani Mondal and Zongxia Li and Yufang Hou and Anandhavelu Natarajan and Aparna Garimella and Sambaran Bandyopadhyay and Jordan Boyd-Graber},
	Year = {2024},
	Journal = {Findings of the Empirical Methods in Natural Language Processing},
	Url = {http://cs.umd.edu/~jbg//docs/2024_emnlp_diagramgen.pdf},
}

James A. Evans

@article{Gerow:Hu:Boyd-Graber:Blei:Evans-2018,
	Title = {Measuring Discursive Influence Across Scholarship},
	Author = {Aaron Gerow and Yuening Hu and Jordan Boyd-Graber and David M. Blei and James A. Evans},
	Journal = {Proceedings of the National Academies of Science},
	Year = {2018},
}

@article{Gerow:Hu:Boyd-Graber:Blei:Evans-2018,
	Title = {Measuring Discursive Influence Across Scholarship},
	Author = {Aaron Gerow and Yuening Hu and Jordan Boyd-Graber and David M. Blei and James A. Evans},
	Journal = {Proceedings of the National Academies of Science},
	Year = {2018},
}

Jannis Bulian

@article{B\"orschinger:Boyd-Graber:Buck:Bulian:Ciaramita:Huebscher:Gajewski:Kilcher:Nogueira:Saralegu-Preprint,
	Title = {Meta Answering for Machine Reading},
	Author = {Benjamin B\"orschinger and Jordan Boyd-Graber and Christian Buck and Jannis Bulian and Massimiliano Ciaramita and Michelle Chen Huebscher and Wojciech Gajewski and Yannic Kilcher and Rodrigo Nogueira and Lierni Sestorain Saralegu},
	Journal = {ArXiv},
	Year = {Preprint},
	Url = {https://arxiv.org/abs/1911.04156},
}

Julian Martin Eisenschlos, Bhuwan Dhingra, Jannis Bulian, Benjamin Börschinger, and Jordan Boyd-Graber. Fool Me Twice: Entailment from Wikipedia Gamification. North American Association for Computational Linguistics, 2021. [Preprint] [Research Talk] [Code and Data] [Paper Read Aloud] [Play] [Bibtex]
```
@inproceedings{Eisenschlos:Dhingra:Bulian:B\"orschinger:Boyd-Graber-2021,
	Title = {Fool Me Twice: Entailment from Wikipedia Gamification},
	Author = {Julian Martin Eisenschlos and Bhuwan Dhingra and Jannis Bulian and Benjamin B\"orschinger and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2021},
	Url = {http://cs.umd.edu/~jbg//docs/2021_naacl_fm2.pdf},
}
```
Accessible Abstract: Democracy and the free press depends on being able to recognize when facts online are true or not. For machine learning to help this critical problem, it needs good data identifying which statements are backed up by trusted sources and which are not. This research creates a game people can play online to craft difficult claims that can train computers to spot disinformation online.

Jason Chuang

@inproceedings{Smith:Chuang:Hu:Boyd-Graber:Findlater-2014,
	Title = {Concurrent Visualization of Relationships between Words and Topics in Topic Models},
	Author = {Alison Smith and Jason Chuang and Yuening Hu and Jordan Boyd-Graber and Leah Findlater},
	Booktitle = {ACL Workshop on Workshop on Interactive Language Learning, Visualization, and Interfaces},
	Year = {2014},
	Location = {Baltimore, Maryland},
}

@inproceedings{Chuang:Wilkerson:Weiss:Tingley:Stewart:Roberts:Poursabzi-Sangdeh:Grimmer:Findlater:Boyd-Graber:Heer-2014,
	Title = {Computer-Assisted Content Analysis: Topic Models for Exploring Multiple Subjective Interpretations},
	Author = {Jason Chuang and John D. Wilkerson and Rebecca Weiss and Dustin Tingley and Brandon M. Stewart and Margaret E. Roberts and Forough Poursabzi-Sangdeh and Justin Grimmer and Leah Findlater and Jordan Boyd-Graber and Jeffrey Heer},
	Booktitle = {NIPS Workshop on Human-Propelled Machine Learning},
	Year = {2014},
	Location = {Montreal, Canada},
}

Jeff Lund

Jeff Lund, Connor Cook, Kevin Seppi, and Jordan Boyd-Graber. Tandem Anchoring: A Multiword Anchor Approach for Interactive Topic Modeling. Association for Computational Linguistics, 2017. [Code] [Bibtex]

@inproceedings{Lund:Cook:Seppi:Boyd-Graber-2017,
	Title = {Tandem Anchoring: A Multiword Anchor Approach for Interactive Topic Modeling},
	Author = {Jeff Lund and Connor Cook and Kevin Seppi and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2017},
	Location = {Vancouver, British Columbia},
	Url = {http://cs.umd.edu/~jbg//docs/2017_acl_multiword_anchors.pdf},
}

You Lu, Jeff Lund, and Jordan Boyd-Graber. Why ADAGRAD Fails for Online Topic Modeling. Empirical Methods in Natural Language Processing, 2017. [Bibtex]

@inproceedings{Lu:Lund:Boyd-Graber-2017,
	Title = {Why ADAGRAD Fails for Online Topic Modeling},
	Author = {You Lu and Jeff Lund and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Url = {http://cs.umd.edu/~jbg//docs/2017_emnlp_adagrad_olda.pdf},
	Year = {2017},
	Location = {Copenhagen, Denmark},
}

@inproceedings{Nguyen:Boyd-Graber:Lund:Seppi:Ringger-2015,
	Title = {Is your anchor going up or down?  {F}ast and accurate supervised topic models},
	Author = {Thang Nguyen and Jordan Boyd-Graber and Jeff Lund and Kevin Seppi and Eric Ringger},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2015},
	Location = {Denver, Colorado},
	Url = {http://cs.umd.edu/~jbg//docs/2015_naacl_supervised_anchor.pdf},
}

Joe Barrow

Pedro Rodriguez, Joe Barrow, Alexander Hoyle, John P. Lalor, Robin Jia, and Jordan Boyd-Graber. Evaluation Examples Are Not Equally Informative: How Should That Change NLP Leaderboards?. Association for Computational Linguistics, 2021. [Results and Code] [Paper Read Aloud] [Research Talk Video] [Code and Data] [Bibtex]
```
@inproceedings{Rodriguez:Barrow:Hoyle:Lalor:Jia:Boyd-Graber-2021,
	Title = {Evaluation Examples Are Not Equally Informative: How Should That Change NLP Leaderboards?},
	Author = {Pedro Rodriguez and Joe Barrow and Alexander Hoyle and John P. Lalor and Robin Jia and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2021},
	Url = {http://cs.umd.edu/~jbg//docs/2021_acl_leaderboard.pdf},
}
```
Accessible Abstract: When can we call an AI "intelligent"? Just like humans, a common approach is to ask them a bunch of questions. These questions posed to modern machine learning methods are collected in metrics called leaderboards to monitor progress, but beyond ranking approaches, this does not help us better understand our problems or our systems very well. This paper introduces probabilistic models inspired by psychometric approaches called item response theory models (think year-end standardized tests) to better understand how computers can answer questions and whether we are asking the right questions. This allows researchers to better compare what kinds of questions systems can answer, better compare human and machine ability, and discover problematic questions (e.g., questions that have incorrect answer keys, are vague, or "trick" those trying to answer the questions).
Denis Peskov, Benny Cheng, Ahmed Elgohary Ghoneim, Joe Barrow, Cristian Danescu-Niculescu-Mizil, and Jordan Boyd-Graber. It Takes Two to Lie: One to Lie and One to Listen. Association for Computational Linguistics, 2020. [Video] [Podcast] [Data and Code] [Bibtex]
```
@inproceedings{Peskov:Cheng:Elgohary:Barrow:Danescu-Niculescu-Mizil:Boyd-Graber-2020,
	Title = {It Takes Two to Lie: One to Lie and One to Listen},
	Author = {Denis Peskov and Benny Cheng and Ahmed Elgohary and Joe Barrow and Cristian Danescu-Niculescu-Mizil and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2020},
	Location = {The Cyberverse Simulacrum of Seattle},
	Url = {http://cs.umd.edu/~jbg//docs/2020_acl_diplomacy.pdf},
}
```
Accessible Abstract: Machine learning techniques to detect deception in online communications requires training and evaluation data. However, there is a dearth of data either because of uncertain gold labels or privacy concerns; we create a new, large deception-centered dataset in the online game of Diplomacy. We gathered 17,289 messages from 12 games (each of which took over a month) involving 84 players, the majority of which were unique users. This data was collected with a custom-made bot that allowed us to collect messages and annotations. The user pool was created from scratch: we varied participant demographics across gender, age, nationality, and past game experience. Some of our participants included the former president of the Diplomacy players' association, several top ranked players in the world, a board game shop owner, and scientists. We create machine learning models to detect lies using linguistic, context, and power-dynamic features. Our best model had similar lie detection accuracy to humans.

@inproceedings{Peskov:Barrow:Rodriguez:Neubig:Boyd-Graber-2019,
	Title = {Mitigating Noisy Inputs for Question Answering},
	Author = {Denis Peskov and Joe Barrow and Pedro Rodriguez and Graham Neubig and Jordan Boyd-Graber},
	Booktitle = {Conference of the International Speech Communication Association},
	Year = {2019},
	Location = {Graz, Austria},
	Url = {http://cs.umd.edu/~jbg//docs/2019_interspeech_asr},
}

Jonathan Chang

Viet-An Nguyen, Jordan Boyd-Graber, Philip Resnik, and Jonathan Chang. Learning a Concept Hierarchy from Multi-labeled Documents. Neural Information Processing Systems, 2014. [Code] [Bibtex]

@inproceedings{Nguyen:Boyd-Graber:Resnik:Chang-2014,
	Title = {Learning a Concept Hierarchy from Multi-labeled Documents},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik and Jonathan Chang},
	Booktitle = {Neural Information Processing Systems},
	Year = {2014},
	Location = {Montreal, Canada},
	Url = {http://cs.umd.edu/~jbg//docs/2014_nips_l2h.pdf},
}

Viet-An Nguyen, Jordan Boyd-Graber, Jonathan Chang, and Philip Resnik. Tree-Based Label Dependency Topic Models. NIPS Workshop on Topic Models: Computation, Application, and Evaluation, 2013. [Bibtex]

@inproceedings{Nguyen:Boyd-Graber:Chang:Resnik-2013,
	Title = {Tree-Based Label Dependency Topic Models},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Jonathan Chang and Philip Resnik},
	Booktitle = {NIPS Workshop on Topic Models: Computation, Application, and Evaluation},
	Year = {2013},
	Location = {Lake Tahoe, Nevada},
}

@inproceedings{Chang:Boyd-Graber:Blei-2009,
	Title = {Connections between the Lines: Augmenting Social Networks with Text},
	Url = {http://cs.umd.edu/~jbg//docs/kdd2009.pdf},
	Author = {Jonathan Chang and Jordan Boyd-Graber and David M. Blei},
	Booktitle = {Knowledge Discovery and Data Mining},
	Year = {2009},
	Location = {Paris, France},
}

Jonathan Chang, Jordan Boyd-Graber, Chong Wang, Sean Gerrish, and David M. Blei. Reading Tea Leaves: How Humans Interpret Topic Models. Neural Information Processing Systems, 2009. [Data] [Presentation] [Video] [Bibtex]
```
@inproceedings{Chang:Boyd-Graber:Wang:Gerrish:Blei-2009,
	Title = {Reading Tea Leaves: How Humans Interpret Topic Models},
	Author = {Jonathan Chang and Jordan Boyd-Graber and Chong Wang and Sean Gerrish and David M. Blei},
	Booktitle = {Neural Information Processing Systems},
	Year = {2009},
	Location = {Vancouver, BC},
	Url = {http://cs.umd.edu/~jbg//docs/nips2009-rtl.pdf},
}
```
Jonathan Chang and I shared a NIPS student award honorable mention for this paper (5 out of 1105)

Accessible Abstract: Topic models are a tool that historians and social sciences use to explore large text corpora. But how do you know if you have a good topic model? Before this paper, the consensus was to use held-out likelihood to evaluate if you had a good model. This paper argues that this does not fit how people actually use topic models and proposes new human-centered metrics for evaluating topic models. This method inspired a rethinking of model evaluation and showed that the complexity of a model does not always correspond to what a user might want.

Jonathan Chang, Jordan Boyd-Graber, and David M. Blei. Discovering social networks from free text. 3rd Annual Machine Learning Symposium, 2008. [Bibtex]

@inproceedings{Chang:Boyd-Graber:Blei-2008,
	Title = {Discovering social networks from free text},
	Author = {Jonathan Chang and Jordan Boyd-Graber and David M. Blei},
	Booktitle = {3rd Annual Machine Learning Symposium},
	Year = {2008},
	Location = {New York, New York},
}

Jonathan Chang, Jordan Boyd-Graber, and David M. Blei. Discovering social networks from free text. 3rd Annual Machine Learning Symposium, 2008. [Bibtex]

@inproceedings{Chang:Boyd-Graber:Blei-2008,
	Title = {Discovering social networks from free text},
	Author = {Jonathan Chang and Jordan Boyd-Graber and David M. Blei},
	Booktitle = {3rd Annual Machine Learning Symposium},
	Year = {2008},
	Location = {New York, New York},
}

Jordan Lee Boyd-Graber

@article{Sung:Fleisig:Hope:Upadhyay:Boyd-Graber-Preprint,
	Title = {GRACE: A Granular Benchmark for Evaluating Model Calibration against Human Calibration},
	Author = {Yoo Yeon Sung and Eve Fleisig and Yu Hope and Ishan Upadhyay and Jordan Lee Boyd-Graber},
	Journal = {ArXiv},
	Year = {Preprint},
	Url = {https://arxiv.org/pdf/2502.19684},
}

Nishant Balepur, Alexa Siu, Nedim Lipka, Franck Dernoncourt, Tong Sun, Jordan Lee Boyd-Graber, and Puneet Mathur. MoDS: Moderating a Mixture of Document Speakers to Summarize Debatable Queries in Document Collections. Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, 2025. [Bibtex]
```
@inproceedings{Balepur:Siu:Lipka:Dernoncourt:Sun:Boyd-Graber:Mathur-2025,
	Title = {MoDS: Moderating a Mixture of Document Speakers to Summarize Debatable Queries in Document Collections},
	Author = {Nishant Balepur and Alexa Siu and Nedim Lipka and Franck Dernoncourt and Tong Sun and Jordan Lee Boyd-Graber and Puneet Mathur},
	Booktitle = {Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics},
	Year = {2025},
	Location = {Albuquerque},
	Url = {http://cs.umd.edu/~jbg//docs/2025_naacl_mods.pdf},
}
```
Accessible Abstract: When you ask ChatGPT for advice on questions with multiple perspectives (e.g. "Is pineapple good on pizza?"), you likely want a response that fairly represents all viewpoints. We formulate this task, collect a dataset to test it, and develop MoDS—a system where multiple ChatGPT's debate like a panel discussion—to generate balanced answers for questions based on multiple sources.
Yoo Yeon Sung, Maharshi Gor, Eve Fleisig, Ishani Mondal, and Jordan Lee Boyd-Graber. ADVSCORE: A Metric for the Evaluation and Creation of Adversarial Benchmarks. North American Association for Computational Linguistics, 2025. [Bibtex]
```
@inproceedings{Sung:Gor:Fleisig:Mondal:Boyd-Graber-2025,
	Title = {ADVSCORE: A Metric for the Evaluation and Creation of Adversarial Benchmarks},
	Author = {Yoo Yeon Sung and Maharshi Gor and Eve Fleisig and Ishani Mondal and Jordan Lee Boyd-Graber},
	Journal = {North American Association for Computational Linguistics},
	Year = {2025},
	Url = {http://cs.umd.edu/~jbg//docs/2025_naacl_advscore.pdf},
}
```
Accessible Abstract: Adversarial datasets should validate AI robustness by presenting samples that humans handle well but models struggle with. However, as models advance, these datasets risk becoming obsolete. Assessing whether a dataset remains adversarial is challenging due to the absence of a standardized metric for adversarialness. To address this, we introduce AdvScore, a human-grounded evaluation metric that quantifies a dataset's adversarial nature by accounting for the differing abilities of models and humans while also identifying low-quality examples.
Tasnim Kabir, Yoo Yeon Sung, Saptarashmi Bandyopadhyay, Hao Zou, Abhranil Chandra, and Jordan Lee Boyd-Graber. You Make me Feel like a Natural Question: Training QA Systems on Transformed Trivia Questions. Empirical Methods in Natural Language Processing, 2024. [ArXiv] [Research Talk] [Code] [Bibtex]
```
@inproceedings{Kabir:Sung:Bandyopadhyay:Zou:Chandra:Boyd-Graber-2024,
	Title = {You Make me Feel like a Natural Question: Training QA Systems on Transformed Trivia Questions},
	Author = {Tasnim Kabir and Yoo Yeon Sung and Saptarashmi Bandyopadhyay and Hao Zou and Abhranil Chandra and Jordan Lee Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Location = {Miami},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_emnlp_natural.pdf},
}
```
Accessible Abstract: Many of the questions for training AIs how to answer questions come from the queries users type into search engines (like Google's Natural Questions). Is there a cheaper---perhaps even better---way? We propose a "naturalization" technique to turn high-quality, rigorously edited trivia questions into examples that resembles Natural Questions. Training on our naturalized questions and testing on natural questions comes close to the results with using Natural Questions, and we can improve results on MMLU (a standard modern evaluation set) by using our data.

@inproceedings{Li:Mao:Stephens:Goel:Walpole:Fung:Dima:Boyd-Graber-2024,
	Title = {TENOR: Topic Enabled Neural Organization and Recommendation: Evaluating Topic Models in Task Based Settings},
	Author = {Zongxia Li and Andrew Mao and Daniel Kofi Stephens and Pranav Goel and Emily Walpole and Juan Francisco Fung and Alden Dima and Jordan Lee Boyd-Graber},
	Booktitle = {European Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_eacl_tenor.pdf},
}

Sander V Schulhoff, Jeremy Pinto, Anaum Khan, Louis-François Bouchard, Chenglei Si, Jordan Lee Boyd-Graber, Svetlina Anati, Valen Tagliabue, Anson Liu Kost, and Christopher R Carnahan. Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Prompt Hacking Competition. Empirical Methods in Natural Language Processing, 2023. [Prerecorded Video] [Data] [Award Video] [Bibtex]
```
@inproceedings{Schulhoff:Pinto:Khan:Bouchard:Si:Boyd-Graber:Anati:Tagliabue:Kost:Carnahan-2023,
	Title = {Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs Through a Global Prompt Hacking Competition},
	Author = {Sander V Schulhoff and Jeremy Pinto and Anaum Khan and Louis-François Bouchard and Chenglei Si and Jordan Lee Boyd-Graber and Svetlina Anati and Valen Tagliabue and Anson Liu Kost and Christopher R Carnahan},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2023},
	Location = {Singapore},
	Url = {http://cs.umd.edu/~jbg//docs/2023_emnlp_hackaprompt.pdf},
}
```
This paper was selected as the Best Theme Paper at EMNLP 2023 (1 of 4909)

Accessible Abstract: As more AI services online are provided by prompted language models, we need to be aware of the weaknesses and exploits of the models. We present the HackAPrompt competition to help elicit a broad array of exploits that get around large langauge models.

Junjie Hu

@inproceedings{Stewart:Vogler:Hu:Boyd-Graber:Neubig-2018,
	Title = {Automatic Estimation of Simultaneous Interpreter Performance},
	Author = {Craig Stewart and Nikolai Vogler and Junjie Hu and Jordan Boyd-Graber and Graham Neubig},
	Booktitle = {Association for Computational Linguistics},
	Year = {2018},
	Location = {Melbourne, Australia},
	Url = {http://cs.umd.edu/~jbg//docs/2018_acl_interpeval.pdf},
}

@inproceedings{Stewart:Vogler:Hu:Boyd-Graber:Neubig-2018,
	Title = {Automatic Estimation of Simultaneous Interpreter Performance},
	Author = {Craig Stewart and Nikolai Vogler and Junjie Hu and Jordan Boyd-Graber and Graham Neubig},
	Booktitle = {Association for Computational Linguistics},
	Year = {2018},
	Location = {Melbourne, Australia},
	Url = {http://cs.umd.edu/~jbg//docs/2018_acl_interpeval.pdf},
}

Ke Zhai

Yuening Hu, Ke Zhai, Vlad Eidelman, and Jordan Boyd-Graber. Polylingual Tree-Based Topic Models for Translation Domain Adaptation. Association for Computational Linguistics, 2014. [Code] [Bibtex]

@inproceedings{Hu:Zhai:Eidelman:Boyd-Graber-2014,
	Title = {Polylingual Tree-Based Topic Models for Translation Domain Adaptation},
	Author = {Yuening Hu and Ke Zhai and Vlad Eidelman and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2014},
	Location = {Baltimore, MD},
	Url = {http://cs.umd.edu/~jbg//docs/2014_acl_ptlda_mt.pdf},
}

Ke Zhai, Jordan Boyd-Graber, and Shay B. Cohen. Hybrid Online Inference with Adaptor Grammars. NIPS Workshop on Advances in Variational Inference, 2014. [Bibtex]

@article{Zhai:Boyd-Graber:Cohen-2014,
	Title = {Hybrid Online Inference with Adaptor Grammars},
	Author = {Ke Zhai and Jordan Boyd-Graber and Shay B. Cohen},
	Booktitle = {NIPS Workshop on Advances in Variational Inference},
	Year = {2014},
}

Ke Zhai, Jordan Boyd-Graber, and Shay B. Cohen. Online Adaptor Grammars with Hybrid Inference. Transactions of the Association for Computational Linguistics, 2014. [Code] [Bibtex]

@article{Zhai:Boyd-Graber:Cohen-2014,
	Title = {Online Adaptor Grammars with Hybrid Inference},
	Author = {Ke Zhai and Jordan Boyd-Graber and Shay B. Cohen},
	Journal = {Transactions of the Association for Computational Linguistics},
	Year = {2014},
	Publisher = {Association for Computational Linguistics},
	Url = {http://cs.umd.edu/~jbg//docs/2014_tacl_ag_vb_online.pdf},
}

Ke Zhai and Jordan Boyd-Graber. Online Topic Models with Infinite Vocabulary. International Conference on Machine Learning, 2013. [Poster] [Talk] [Code] [Bibtex]

@inproceedings{Zhai:Boyd-Graber-2013,
	Title = {Online Topic Models with Infinite Vocabulary},
	Author = {Ke Zhai and Jordan Boyd-Graber},
	Booktitle = {International Conference on Machine Learning},
	Year = {2013},
	Url = {http://cs.umd.edu/~jbg//docs/2013_icml_infvoc.pdf},
}

Yuening Hu, Ke Zhai, Vlad Edelman, and Jordan Boyd-Graber. Topic Models for Translation Domain Adaptation. NIPS Workshop on Topic Models: Computation, Application, and Evaluation, 2013. [Bibtex]

@inproceedings{Hu:Zhai:Edelman:Boyd-Graber-2013,
	Title = {Topic Models for Translation Domain Adaptation},
	Author = {Yuening Hu and Ke Zhai and Vlad Edelman and Jordan Boyd-Graber},
	Booktitle = {NIPS Workshop on Topic Models: Computation, Application, and Evaluation},
	Year = {2013},
	Location = {Lake Tahoe, Nevada},
}

Ke Zhai, Jordan Boyd-Graber, Nima Asadi, and Mohamad (Jude) Alkhouja. Mr. LDA: A Flexible Large Scale Topic Modeling Package using Variational Inference in MapReduce. ACM International Conference on World Wide Web, 2012. [Code] [Slides] [Bibtex]

@inproceedings{Zhai:Boyd-Graber:Asadi:Alkhouja-2012,
	Title = {{Mr. LDA}: A Flexible Large Scale Topic Modeling Package using Variational Inference in MapReduce},
	Url = {http://cs.umd.edu/~jbg//docs/2012_www_mrlda.pdf},
	Author = {Ke Zhai and Jordan Boyd-Graber and Nima Asadi and Mohamad Alkhouja},
	Booktitle = {ACM International Conference on World Wide Web},
	Year = {2012},
	Location = {Lyon, France},
}

Yuening Hu, Ke Zhai, Sinead Williamson, and Jordan Boyd-Graber. Modeling Images using Transformed Indian Buffet Processes. International Conference on Machine Learning, 2012. [Code] [Data] [Research Talk] [Bibtex]

@inproceedings{Hu:Zhai:Williamson:Boyd-Graber-2012,
	Title = {Modeling Images using Transformed {I}ndian Buffet Processes},
	Author = {Yuening Hu and Ke Zhai and Sinead Williamson and Jordan Boyd-Graber},
	Booktitle = {International Conference on Machine Learning},
	Year = {2012},
	Url = {http://cs.umd.edu/~jbg//docs/mtibp_icml_2012.pdf},
	Location = {Edinburgh, Scotland},
}

Ke Zhai and Jordan Boyd-Graber. Online Topic Model with Infinite Vocabulary. Mid-Atlantic Student Colloquium on Speech, Language, and Learning, 2012. [Bibtex]

@inproceedings{Zhai:Boyd-Graber-2012,
	Title = {Online Topic Model with Infinite Vocabulary},
	Author = {Ke Zhai and Jordan Boyd-Graber},
	Booktitle = {Mid-Atlantic Student Colloquium on Speech, Language, and Learning},
	Year = {2012},
}

Kenneth R. Fleischmann

@inproceedings{Templeton:Fleischmann:Boyd-Graber-2011,
	Title = {Simulating Audiences: Automating Analysis of Values, Attitudes, and Sentiment},
	Author = {Clay Templeton and Kenneth R. Fleischmann and Jordan Boyd-Graber},
	Booktitle = {IEEE International Conference on Social Computing},
	Year = {2011},
	Location = {Cambridge, MA},
	Url = {http://cs.umd.edu/~jbg//docs/simulating_audiences.pdf},
}

Clay Templeton, Kenneth R. Fleischmann, and Jordan Boyd-Graber. Comparing Values and Sentiment Using Mechanical Turk. iConference, 2011. [Bibtex]

@inproceedings{Templeton:Fleischmann:Boyd-Graber-2011,
	Title = {Comparing Values and Sentiment Using Mechanical Turk},
	Author = {Clay Templeton and Kenneth R. Fleischmann and Jordan Boyd-Graber},
	Booktitle = {iConference},
	Year = {2011},
	Location = {Seattle, Washington},
	Url = {http://cs.umd.edu/~jbg//docs/iconference-2011-comparing.pdf},
}

@inproceedings{Fleischmann:Templeton:Boyd-Graber-2011,
	Title = {Modeling Diverse Standpoints in Text Classification: Learning to Be Human by Modeling Human Values},
	Author = {Kenneth R. Fleischmann and Clay Templeton and Jordan Boyd-Graber},
	Booktitle = {iConference},
	Year = {2011},
	Location = {Seattle, Washington},
	Url = {http://cs.umd.edu/~jbg//docs/iconference-2011-learning.pdf},
}

Kevin Seppi

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2020,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {Intelligent User Interfaces},
	Url = {http://cs.umd.edu/~jbg//docs/2020_iui_control.pdf},
	Year = {2020},
	Title = {Digging into User Control: Perceptions of Adherence and
Instability in Transparent Models},
}

Jeffrey Lund, Piper Armstrong, Wilson Fearn, Stephen Cowley, Courtni Byun, Jordan Boyd-Graber, and Kevin Seppi. Automatic and Human Evaluation of Local Topic Quality. Association for Computational Linguistics, 2019. [Code] [Bibtex]

@inproceedings{Lund:Armstrong:Fearn:Cowley:Byun:Boyd-Graber:Seppi-2019,
	Title = {Automatic and Human Evaluation of Local Topic Quality},
	Author = {Jeffrey Lund and Piper Armstrong and Wilson Fearn and Stephen Cowley and Courtni Byun and Jordan Boyd-Graber and Kevin Seppi},
	Booktitle = {Association for Computational Linguistics},
	Year = {2019},
	Location = {Florence, Italy},
	Url = {http://cs.umd.edu/~jbg//docs/2019_acl_local.pdf},
}

@inproceedings{Kumar:Smith:Findlater:Seppi:Boyd-Graber-2019,
	Author = {Varun Kumar and Alison Smith and Leah Findlater and Kevin Seppi and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2019},
	Location = {Florence, Italy},
	Title = {Why Didn't You Listen to Me? Comparing User Control of Human-in-the-Loop Topic Models},
	Url = {http://cs.umd.edu/~jbg//docs/2019_acl_control.pdf},
}

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2018,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {Intelligent User Interfaces},
	Year = {2018},
	Title = {User-Centered Design and Evaluation of a Human-in-the-Loop Topic Modeling System},
	Url = {http://cs.umd.edu/~jbg//docs/2018_iui_itm.pdf},
}

Alison won a best student paper honorable mention (3 out of 300)

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2018,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {Intelligent User Interfaces},
	Year = {2018},
	Title = {User-Centered Design and Evaluation of a Human-in-the-Loop Topic Modeling System},
	Url = {http://cs.umd.edu/~jbg//docs/2018_iui_itm.pdf},
}

Alison won a best student paper honorable mention (3 out of 300)

@inproceedings{Felt:Ringger:Seppi:Boyd-Graber-2018,
	Title = {Learning from Measurements in Crowdsourcing Models: Inferring Ground Truth from Diverse Annotation Types},
	Author = {Paul Felt and Eric Ringger and Kevin Seppi and Jordan Boyd-Graber},
	Booktitle = {International Conference on Computational Linguistics},
	Year = {2018},
	Location = {Santa Fe, New Mexico},
	Url = {http://cs.umd.edu/~jbg//docs/2018_coling_measurements.pdf},
}

@inproceedings{Lund:Cook:Seppi:Boyd-Graber-2017,
	Title = {Tandem Anchoring: A Multiword Anchor Approach for Interactive Topic Modeling},
	Author = {Jeff Lund and Connor Cook and Kevin Seppi and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2017},
	Location = {Vancouver, British Columbia},
	Url = {http://cs.umd.edu/~jbg//docs/2017_acl_multiword_anchors.pdf},
}

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2017,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {CHI 2017 Designing for Uncertainty Workshop},
	Year = {2017},
	Location = {Denver, CO},
	Title = {Accounting for Input Uncertainty in Human-in-the-Loop Systems},
	Url = {http://visualization.ischool.uw.edu/hci_uncertainty/papers/Paper11.pdf},
}

@article{Lee:Smith:Seppi:Elmqvist:Boyd-Graber:Findlater-2017,
	Author = {Tak Yeon Lee and Alison Smith and Kevin Seppi and Niklas Elmqvist and Jordan Boyd-Graber and Leah Findlater},
	Journal = {International Journal of Human-Computer Studies},
	Year = {2017},
	Title = {The Human Touch: How Non-expert Users Perceive, Interpret, and Fix Topic Models},
	Url = {http://cs.umd.edu/~jbg//docs/2017_ijhcs_human_touch.pdf},
}

@article{Smith:Lee:Poursabzi-Sangdeh:Boyd-Graber:Seppi:Elmqvist:Findlater-2017,
	Author = {Alison Smith and Tak Yeon Lee and Forough Poursabzi-Sangdeh and Jordan Boyd-Graber and Kevin Seppi and Niklas Elmqvist and Leah Findlater},
	Journal = {Transactions of the Association for Computational Linguistics},
	Year = {2017},
	Title = {Evaluating Visual Representations for Topic Understanding and Their Effects on Manually Generated Labels},
	Volume = {5},
	Url = {http://cs.umd.edu/~jbg//docs/2017_tacl_eval_tm_viz.pdf},
	Pages = {1--15},
}

@inproceedings{Poursabzi-Sangdeh:Boyd-Graber:Findlater:Seppi-2016,
	Title = {ALTO: Active Learning with Topic Overviews for Speeding Label Induction and Document Labeling},
	Author = {Forough Poursabzi-Sangdeh and Jordan Boyd-Graber and Leah Findlater and Kevin Seppi},
	Booktitle = {Association for Computational Linguistics},
	Year = {2016},
	Location = {Berlin, Brandenburg},
	Url = {http://cs.umd.edu/~jbg//docs/2016_acl_doclabel.pdf},
}

@inproceedings{Smith:Lee:Poursabzi-Sangdeh:Boyd-Graber:Seppi:Elmqvist:Findlater-2016,
	Author = {Alison Smith and Tak Yeon Lee and Forough Poursabzi-Sangdeh and Jordan Boyd-Graber and Kevin Seppi and Niklas Elmqvist and Leah Findlater},
	Booktitle = {CHI Human Centred Machine Learning Workshop},
	Year = {2016},
	Location = {San Jose, CA},
	Title = {Human-Centered and Interactive: Expanding the Impact of Topic Models},
}

@inproceedings{Felt:Ringger:Boyd-Graber:Seppi-2015,
	Title = {Making the Most of Crowdsourced Document Annotations: Confused Supervised {LDA}},
	Author = {Paul Felt and Eric Ringger and Jordan Boyd-Graber and Kevin Seppi},
	Booktitle = {Conference on Computational Natural Language Learning},
	Year = {2015},
	Location = {Beijing, China},
	Url = {http://cs.umd.edu/~jbg//docs/2015_conll_cslda.pdf},
}

This paper received the best paper award at CoNLL

@inproceedings{Nguyen:Boyd-Graber:Lund:Seppi:Ringger-2015,
	Title = {Is your anchor going up or down?  {F}ast and accurate supervised topic models},
	Author = {Thang Nguyen and Jordan Boyd-Graber and Jeff Lund and Kevin Seppi and Eric Ringger},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2015},
	Location = {Denver, Colorado},
	Url = {http://cs.umd.edu/~jbg//docs/2015_naacl_supervised_anchor.pdf},
}

Kimberly Glasgow

Kimberly Glasgow, Clay Fink, and Jordan Boyd-Graber. Our grief is unspeakable: Measuring the community impact of a tragedy. The International AAAI Conference on Weblogs and Social Media, 2014. [Bibtex]

@inproceedings{Glasgow:Fink:Boyd-Graber-2014,
	Title = {Our grief is unspeakable: Measuring the community impact of a tragedy},
	Author = {Kimberly Glasgow and Clay Fink and Jordan Boyd-Graber},
	Booktitle = {The International AAAI Conference on Weblogs and Social Media},
	Year = {2014},
	Location = {Ann Arbor},
	Url = {http://cs.umd.edu/~jbg//docs/2014_icwsm_grief.pdf},
}

Jordan Boyd-Graber, Kimberly Glasgow, and Jackie Sauter Zajac. Spoiler Alert: Machine Learning Approaches to Detect Social Media Posts with Revelatory Information. ASIST 2013: The 76th Annual Meeting of the American Society for Information Science and Technology, 2013. [Data] [Bibtex]

@inproceedings{Boyd-Graber:Glasgow:Zajac-2013,
	Title = {Spoiler Alert: Machine Learning Approaches to Detect Social Media Posts with Revelatory Information},
	Author = {Jordan Boyd-Graber and Kimberly Glasgow and Jackie Sauter Zajac},
	Booktitle = {ASIST 2013: The 76th Annual Meeting of the American Society for Information Science and Technology},
	Year = {2013},
	Location = {Montreal, Canada},
	Url = {http://cs.umd.edu/~jbg//docs/2013_spoiler.pdf},
}

Larry Davis

Mohit Iyyer, Varun Manjunatha, Jordan Boyd-Graber, and Larry Davis. Learning to Color from Language. North American Association for Computational Linguistics, 2018. [Bibtex]

@inproceedings{Iyyer:Manjunatha:Boyd-Graber:Davis-2018,
	Title = {Learning to Color from Language},
	Author = {Mohit Iyyer and Varun Manjunatha and Jordan Boyd-Graber and Larry Davis},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2018},
	Url = {http://cs.umd.edu/~jbg//docs/2018_naacl_colorization.pdf},
}

Mohit Iyyer, Varun Manjunatha, Jordan Boyd-Graber, and Larry Davis. Learning to Color from Language. North American Association for Computational Linguistics, 2018. [Bibtex]

@inproceedings{Iyyer:Manjunatha:Boyd-Graber:Davis-2018,
	Title = {Learning to Color from Language},
	Author = {Mohit Iyyer and Varun Manjunatha and Jordan Boyd-Graber and Larry Davis},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2018},
	Url = {http://cs.umd.edu/~jbg//docs/2018_naacl_colorization.pdf},
}

@inproceedings{Iyyer:Manjunatha:Guha:Vyas:Boyd-Graber:Daume-III:Davis-2017,
	Title = {The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives},
	Author = {Mohit Iyyer and Varun Manjunatha and Anupam Guha and Yogarshi Vyas and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Larry Davis},
	Booktitle = {Computer Vision and Pattern Recognition},
	Year = {2017},
	Url = {http://cs.umd.edu/~jbg//docs/2017_cvpr_comics.pdf},
}

@inproceedings{Iyyer:Manjunatha:Guha:Vyas:Boyd-Graber:Daume-III:Davis-2017,
	Title = {The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives},
	Author = {Mohit Iyyer and Varun Manjunatha and Anupam Guha and Yogarshi Vyas and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Larry Davis},
	Booktitle = {Computer Vision and Pattern Recognition},
	Year = {2017},
	Url = {http://cs.umd.edu/~jbg//docs/2017_cvpr_comics.pdf},
}

Leah Findlater

@inproceedings{Grissom-II:Shoemaker:Goldman:Shi:Stewart:Rytting:Findlater:Boyd-Graber:Li:Grissom-II:Boyd-Graber-2024,
	Author = {Alvin {Grissom II} and Jo Shoemaker and Benjamin Goldman and Ruikang Shi and Craig Stewart and C. Anton Rytting and Leah Findlater and Jordan Boyd-Graber},
	Location = {Torino, Italy},
	Url = {http://cs.umd.edu/~jbg//docs/2024_lrec_siminthelp.pdf},
	Booktitle = {Linguistic Resources and Evaluation Conference},
	Year = {2024},
	Title = {Rapidly Piloting Real-time Linguistic Assistance for Simultaneous Interpreters with Untrained Bilingual Surrogates},
}

@inproceedings{Smith:Boyd-Graber:Fan:Birchfield:Wu:Weld:Findlater-2020,
	Author = {Alison Smith and Jordan Boyd-Graber and Ron Fan and Melissa Birchfield and Tongshuang Wu and Dan Weld and Leah Findlater},
	Booktitle = {Computer-Human Interaction},
	Year = {2020},
	Url = {http://cs.umd.edu/~jbg//docs/2020_chi_explanation.pdf},
	Title = {No Explainability without Accountability: An Empirical Study of Explanations and Feedback in Interactive ML},
}

Michelle Yuan, Mozhi Zhang, Benjamin Van Durme, Leah Findlater, and Jordan Boyd-Graber. Interactive Refinement of Cross-Lingual Word Embeddings. Empirical Methods in Natural Language Processing, 2020. [Audio of Paper Readthrough] [Video of Paper Readthrough] [Code] [Conference Talk] [Bibtex]
```
@inproceedings{Yuan:Zhang:Van-Durme:Findlater:Boyd-Graber-2020,
	Title = {Interactive Refinement of Cross-Lingual Word Embeddings},
	Author = {Michelle Yuan and Mozhi Zhang and Benjamin {Van Durme} and Leah Findlater and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2020},
	Location = {The Cyberverse Simulacrum of Punta Cana, Dominican Republic},
	Url = {http://cs.umd.edu/~jbg//docs/2020_emnlp_clime.pdf},
}
```
Accessible Abstract: Language technologies sometimes need to be quickly deployed in low-resource languages. For example, in the 2010 Haiti earthquake, researchers used machine learning models to analyze social media and text messages to gain situational awareness. We introduce CLIME, an interactive system that can help in these scenarios: users see which words related to the task the system thinks are similar, corrects the model to push similar words together and dissimilar words apart.

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2020,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {Intelligent User Interfaces},
	Url = {http://cs.umd.edu/~jbg//docs/2020_iui_control.pdf},
	Year = {2020},
	Title = {Digging into User Control: Perceptions of Adherence and
Instability in Transparent Models},
}

@inproceedings{Guo:Boyd-Graber:Iyyer:Findlater-2020,
	Author = {Fenfei Guo and Jordan Boyd-Graber and Mohit Iyyer and Leah Findlater},
	Location = {France (but only in dreams)},
	Booktitle = {Linguistic Resources and Evaluation Conference},
	Url = {http://cs.umd.edu/~jbg//docs/2020_lrec_sense.pdf},
	Year = {2020},
	Title = {Which Evaluations Uncover Sense Representations that Actually Make Sense?},
}

@inproceedings{Kumar:Smith:Findlater:Seppi:Boyd-Graber-2019,
	Author = {Varun Kumar and Alison Smith and Leah Findlater and Kevin Seppi and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2019},
	Location = {Florence, Italy},
	Title = {Why Didn't You Listen to Me? Comparing User Control of Human-in-the-Loop Topic Models},
	Url = {http://cs.umd.edu/~jbg//docs/2019_acl_control.pdf},
}

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2018,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {Intelligent User Interfaces},
	Year = {2018},
	Title = {User-Centered Design and Evaluation of a Human-in-the-Loop Topic Modeling System},
	Url = {http://cs.umd.edu/~jbg//docs/2018_iui_itm.pdf},
}

Alison won a best student paper honorable mention (3 out of 300)

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2018,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {Intelligent User Interfaces},
	Year = {2018},
	Title = {User-Centered Design and Evaluation of a Human-in-the-Loop Topic Modeling System},
	Url = {http://cs.umd.edu/~jbg//docs/2018_iui_itm.pdf},
}

Alison won a best student paper honorable mention (3 out of 300)

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2017,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {CHI 2017 Designing for Uncertainty Workshop},
	Year = {2017},
	Location = {Denver, CO},
	Title = {Accounting for Input Uncertainty in Human-in-the-Loop Systems},
	Url = {http://visualization.ischool.uw.edu/hci_uncertainty/papers/Paper11.pdf},
}

@article{Lee:Smith:Seppi:Elmqvist:Boyd-Graber:Findlater-2017,
	Author = {Tak Yeon Lee and Alison Smith and Kevin Seppi and Niklas Elmqvist and Jordan Boyd-Graber and Leah Findlater},
	Journal = {International Journal of Human-Computer Studies},
	Year = {2017},
	Title = {The Human Touch: How Non-expert Users Perceive, Interpret, and Fix Topic Models},
	Url = {http://cs.umd.edu/~jbg//docs/2017_ijhcs_human_touch.pdf},
}

@article{Smith:Lee:Poursabzi-Sangdeh:Boyd-Graber:Seppi:Elmqvist:Findlater-2017,
	Author = {Alison Smith and Tak Yeon Lee and Forough Poursabzi-Sangdeh and Jordan Boyd-Graber and Kevin Seppi and Niklas Elmqvist and Leah Findlater},
	Journal = {Transactions of the Association for Computational Linguistics},
	Year = {2017},
	Title = {Evaluating Visual Representations for Topic Understanding and Their Effects on Manually Generated Labels},
	Volume = {5},
	Url = {http://cs.umd.edu/~jbg//docs/2017_tacl_eval_tm_viz.pdf},
	Pages = {1--15},
}

@inproceedings{Poursabzi-Sangdeh:Boyd-Graber:Findlater:Seppi-2016,
	Title = {ALTO: Active Learning with Topic Overviews for Speeding Label Induction and Document Labeling},
	Author = {Forough Poursabzi-Sangdeh and Jordan Boyd-Graber and Leah Findlater and Kevin Seppi},
	Booktitle = {Association for Computational Linguistics},
	Year = {2016},
	Location = {Berlin, Brandenburg},
	Url = {http://cs.umd.edu/~jbg//docs/2016_acl_doclabel.pdf},
}

@inproceedings{Smith:Lee:Poursabzi-Sangdeh:Boyd-Graber:Seppi:Elmqvist:Findlater-2016,
	Author = {Alison Smith and Tak Yeon Lee and Forough Poursabzi-Sangdeh and Jordan Boyd-Graber and Kevin Seppi and Niklas Elmqvist and Leah Findlater},
	Booktitle = {CHI Human Centred Machine Learning Workshop},
	Year = {2016},
	Location = {San Jose, CA},
	Title = {Human-Centered and Interactive: Expanding the Impact of Topic Models},
}

@inproceedings{Smith:Chuang:Hu:Boyd-Graber:Findlater-2014,
	Title = {Concurrent Visualization of Relationships between Words and Topics in Topic Models},
	Author = {Alison Smith and Jason Chuang and Yuening Hu and Jordan Boyd-Graber and Leah Findlater},
	Booktitle = {ACL Workshop on Workshop on Interactive Language Learning, Visualization, and Interfaces},
	Year = {2014},
	Location = {Baltimore, Maryland},
}

@inproceedings{Chuang:Wilkerson:Weiss:Tingley:Stewart:Roberts:Poursabzi-Sangdeh:Grimmer:Findlater:Boyd-Graber:Heer-2014,
	Title = {Computer-Assisted Content Analysis: Topic Models for Exploring Multiple Subjective Interpretations},
	Author = {Jason Chuang and John D. Wilkerson and Rebecca Weiss and Dustin Tingley and Brandon M. Stewart and Margaret E. Roberts and Forough Poursabzi-Sangdeh and Justin Grimmer and Leah Findlater and Jordan Boyd-Graber and Jeffrey Heer},
	Booktitle = {NIPS Workshop on Human-Propelled Machine Learning},
	Year = {2014},
	Location = {Montreal, Canada},
}

Leonardo Claudino

Philip Resnik, William Armstrong, Leonardo Claudino, Thang Nguyen, Viet-An Nguyen, and Jordan Boyd-Graber. Beyond LDA: Exploring Supervised Topic Modeling for Depression-Related Language in Twitter. NAACL Workshop on Cognitive Modeling and Computational Linguistics, 2015. [Bibtex]

@inproceedings{Resnik:Armstrong:Claudino:Nguyen:Nguyen:Boyd-Graber-2015,
	Title = {Beyond {LDA}: Exploring Supervised Topic Modeling for Depression-Related Language in Twitter},
	Author = {Philip Resnik and William Armstrong and Leonardo Claudino and Thang Nguyen and Viet-An Nguyen and Jordan Boyd-Graber},
	Booktitle = {NAACL Workshop on Cognitive Modeling and Computational Linguistics},
	Year = {2015},
	Location = {Denver, Colorado},
}

@inproceedings{Iyyer:Boyd-Graber:Claudino:Socher:Daume-III-2014,
	Title = {A Neural Network for Factoid Question Answering over Paragraphs},
	Author = {Mohit Iyyer and Jordan Boyd-Graber and Leonardo Claudino and Richard Socher and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2014},
	Location = {Doha, Qatar},
	Url = {http://cs.umd.edu/~jbg//docs/2014_emnlp_qb_rnn.pdf},
}

The partial derivatives of "C" and "J" with respect to the parameters should be switched in Equation 7.

Maharshi Gor

Yoo Yeon Sung, Maharshi Gor, Eve Fleisig, Ishani Mondal, and Jordan Lee Boyd-Graber. ADVSCORE: A Metric for the Evaluation and Creation of Adversarial Benchmarks. North American Association for Computational Linguistics, 2025. [Bibtex]
```
@inproceedings{Sung:Gor:Fleisig:Mondal:Boyd-Graber-2025,
	Title = {ADVSCORE: A Metric for the Evaluation and Creation of Adversarial Benchmarks},
	Author = {Yoo Yeon Sung and Maharshi Gor and Eve Fleisig and Ishani Mondal and Jordan Lee Boyd-Graber},
	Journal = {North American Association for Computational Linguistics},
	Year = {2025},
	Url = {http://cs.umd.edu/~jbg//docs/2025_naacl_advscore.pdf},
}
```
Accessible Abstract: Adversarial datasets should validate AI robustness by presenting samples that humans handle well but models struggle with. However, as models advance, these datasets risk becoming obsolete. Assessing whether a dataset remains adversarial is challenging due to the absence of a standardized metric for adversarialness. To address this, we introduce AdvScore, a human-grounded evaluation metric that quantifies a dataset's adversarial nature by accounting for the differing abilities of models and humans while also identifying low-quality examples.
Maharshi Gor, Hal Daumé III Tianyi Zhou, and Jordan Boyd-Graber. Do great minds think alike? Investigating Human-AI Complementarity in Question Answering with CAIMIRA. Empirical Methods in Natural Language Processing, 2024. [Talk] [Code] [Data] [Bibtex]
```
@inproceedings{Gor:Daume-III:Boyd-Graber-2024,
	Title = {Do great minds think alike? Investigating Human-AI Complementarity in Question Answering with CAIMIRA},
	Author = {Maharshi Gor and Hal {Daum\'{e} III} Tianyi Zhou and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2024},
	Location = {Miami},
	Url = {http://cs.umd.edu/~jbg//docs/2024_emnlp_caimira.pdf},
}
```
Accessible Abstract: CAIMIRA discovers the skills that humans and AIs use to answer questions. By scraping websites where trivia nerds answer really difficult questions and posing those questions to AI models like GPT-4 and LLaMA-3-70B, while humans excel in knowledge-based abductive reasoning, AI outperforms on fact-based historical recall. This research suggests future challenges should focus on more complex reasoning and nuanced language tasks to better align AI development with human cognitive strengths.
Maharshi Gor, Kellie Webster, and Jordan Boyd-Graber. Toward Deconfounding the Influence of Subject's Demographic Characteristics in Question Answering. Empirical Methods in Natural Language Processing, 2021. [Research Talk] [Bibtex]
```
@inproceedings{Gor:Webster:Boyd-Graber-2021,
	Title = {Toward Deconfounding the Influence of Subject's Demographic Characteristics in Question Answering},
	Author = {Maharshi Gor and Kellie Webster and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2021},
	Location = {Punta Cana},
	Pages = {6},
	Url = {http://cs.umd.edu/~jbg//docs/2021_emnlp_qa_fairness.pdf},
}
```
Accessible Abstract: The data used to train computer question answering systems have three times as many men as women. This paper examines whether this is a problem for question answering accuracy. After a thorough investigation, we do not find evidence of serious accuracy discrepancies between languages. However, an absence of evidence is not evidence of absence, and we would argue that we need more diverse datasets to better represent the world's population.

Mane, Heran Y.

@inproceedings{Srikanth:Sarkar:Y.:M.:C.:Rudinger:Boyd-Graber-2024,
	Title = {Pregnant Questions: The Importance of Pragmatic Awareness in Maternal Health Question Answering},
	Author = {Neha Srikanth and Rupak Sarkar and Mane, Heran Y. and Aparicio, Elizabeth M. and Nguyen, Quynh C. and Rachel Rudinger and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_naacl_pregnant.pdf},
}

@article{Y.:Amara:Ximena:Michelle:Xiaohe:Srikanth:Sourabh:Abby:Ann:Pragat:Xin:Boyd-Graber:M.:C.-2023,
	Title = {Practical Guidance for the Development of Rosie, a Health Education Question-and-Answer Chatbot for New Mothers},
	Author = {Mane, Heran Y. and Channell Doig, Amara and Marin Gutierrez, Francia Ximena and Jasczynski, Michelle and Yue, Xiaohe and Neha Srikanth and Mane, Sourabh and Sun, Abby and Moats, Rachel Ann and Patel, Pragat and He, Xin and Jordan Boyd-Graber and Aparicio, Elizabeth M. and Nguyen, Quynh C.},
	Journal = {Journal of Public Health Management and Practice},
	Year = {2023},
	Url = {https://journals.lww.com/jphmp/fulltext/2023/09000/practical_guidance_for_the_development_of_rosie,_a.9.aspx},
}

Marine Carpuat

HyoJung Han, Marine Carpuat, and Jordan Boyd-Graber. Automatic Explicitation to Bridge the Background Knowledge Gap in Translation and its Evaluation with Multilingual QA. Empirical Methods in Natural Language Processing, 2023. [Video] [Bibtex]
```
@inproceedings{Han:Carpuat:Boyd-Graber-2023,
	Title = {Automatic Explicitation to Bridge the Background Knowledge Gap in Translation and its Evaluation with Multilingual QA},
	Author = {HyoJung Han and Marine Carpuat and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2023},
	Location = {Singapore},
	Url = {http://cs.umd.edu/~jbg//docs/2023_emnlp_explicitation.pdf},
}
```
Accessible Abstract: Sometimes when you a translating from one language to another, a literal translation is not enough. Sometimes to actually understand what is being said, you need additional context. Professional translators know this, and the process that they use to help a listener is called "explicitation" to capturing cultural differences between source and target audiences. We introduce techniques for automatically generating explicitations, motivated by WikiExpl(a dataset collected from Wikipedia and annotate with human translators), and evaluate the explicitation.
HyoJung Han, Marine Carpuat, and Jordan Boyd-Graber. SimQA: Detecting Simultaneous MT Errors through Word-by-Word Question Answering. Empirical Methods in Natural Language Processing, 2022. [Code] [Research Talk] [Bibtex]
```
@inproceedings{Han:Carpuat:Boyd-Graber-2022,
	Title = {SimQA: Detecting Simultaneous MT Errors through Word-by-Word Question Answering},
	Author = {HyoJung Han and Marine Carpuat and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2022},
	Location = {Abu Dhabi},
	Url = {http://cs.umd.edu/~jbg//docs/2022_emnlp_simqa.pdf},
}
```
Accessible Abstract: Simultaneous interpretation (where a translation happens word by word before the source sentence is finished) is difficult to evaluate. We created a new evaluation framework based on the following scenario: imagine that you're thrown into a trivia gameshow where you don't know the language. Specifically, it's a game format where you interrupt the question word by word as soon as possible. Our hypothesis is that a monolingual player (who doesn't speak the source language) will be able to do better in the game with a better simultaneous translation system. In this 2022 EMNLP publication, we show that this evaluation is not only cheaper (you just need to translate the answer) but can also detect hallucinations and undertranslations better than existing evaluation methods.

Massimiliano Ciaramita

@article{B\"orschinger:Boyd-Graber:Buck:Bulian:Ciaramita:Huebscher:Gajewski:Kilcher:Nogueira:Saralegu-Preprint,
	Title = {Meta Answering for Machine Reading},
	Author = {Benjamin B\"orschinger and Jordan Boyd-Graber and Christian Buck and Jannis Bulian and Massimiliano Ciaramita and Michelle Chen Huebscher and Wojciech Gajewski and Yannic Kilcher and Rodrigo Nogueira and Lierni Sestorain Saralegu},
	Journal = {ArXiv},
	Year = {Preprint},
	Url = {https://arxiv.org/abs/1911.04156},
}

Francesco Saverio Varini, Jordan Boyd-Graber, Massimiliano Ciaramita, and Markus Leippold. ClimaText: A Dataset for Climate Change Topic Detection. NeurIPS Workshop on Tackling Climate Change with Machine Learning, 2020. [Bibtex]

@inproceedings{Varini:Boyd-Graber:Ciaramita:Leippold-2020,
	Title = {ClimaText: A Dataset for Climate Change Topic Detection},
	Author = {Francesco Saverio Varini and Jordan Boyd-Graber and Massimiliano Ciaramita and Markus Leippold},
	Booktitle = {NeurIPS Workshop on Tackling Climate Change with Machine Learning},
	Year = {2020},
	Location = {Virtual Simulacrum of Vancouver},
}

Matthew Shu

Nishant Balepur, Matthew Shu, Alexander Hoyle, Alison Robey, Shi Feng, Seraphina Goldfarb-Tarrant, and Jordan Boyd-Graber. A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick. Empirical Methods in Natural Language Processing, 2024. [Code and Data] [Research Talk] [Bibtex]
```
@inproceedings{Balepur:Shu:Hoyle:Robey:Feng:Goldfarb-Tarrant:Boyd-Graber-2024,
	Title = {A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick},
	Author = {Nishant Balepur and Matthew Shu and Alexander Hoyle and Alison Robey and Shi Feng and Seraphina Goldfarb-Tarrant and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2024},
	Location = {Miami},
	Url = {http://cs.umd.edu/~jbg//docs/2024_emnlp_mnemonic.pdf},
}
```
Accessible Abstract: Learning vocabulary (e.g., benevolent) can be tedious, but using mnemonics (e.g., benevolent sounds like "benefits," and a kind boss gives benefits) makes it more engaging and effective. This paper introduces SMART, a large language model trained to produce mnemonics based on feedback from flashcard learners. Students struggle to predict which mnemonics will help them most. Still, by training SMART on both student preferences and learning outcomes, we can generate mnemonics as effectively as GPT-4, but at a much lower cost.
Matthew Shu, Nishant Balepur, Shi Feng, and Jordan Boyd-Graber. KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students. Empirical Methods in Natural Language Processing, 2024. [Code and Data] [Research Talk] [Bibtex]
```
@inproceedings{Shu:Balepur:Feng:Boyd-Graber-2024,
	Title = {KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students},
	Author = {Matthew Shu and Nishant Balepur and Shi Feng and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Location = {Miami},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_emnlp_karl.pdf},
}
```
Accessible Abstract: Flashcard help students study by figuring out which flashcards to show students and when. However, current systems do not pay attention to what information (the actual text of the flashcards) to make these predictions. This paper introduces KARL, a new flashcard scheduler that uses language models to encode the text of flashcards. We host KARL in our own flashcard app for 500+ learners and show that students using KARL learn more efficiently than when they use other traditional systems that only know, for example, that a student has studied Flashcard \#24601 on Monday and got it wrong.

Michael J. Paul

Mozhi Zhang, Yoshinari Fujinuma, Michael J. Paul, and Jordan Boyd-Graber. Why Overfitting Isn't Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries. Association for Computational Linguistics, 2020. [Preprint] [Video] [Code] [Bibtex]
```
@inproceedings{Zhang:Fujinuma:Paul:Boyd-Graber-2020,
	Author = {Mozhi Zhang and Yoshinari Fujinuma and Michael J. Paul and Jordan Boyd-Graber},
	Title = {Why Overfitting Isn't Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries},
	Booktitle = {Association for Computational Linguistics},
	Year = {2020},
	Location = {The Cyberverse Simulacrum of Seattle},
	Url = {http://cs.umd.edu/~jbg//docs/2020_acl_refine.pdf},
}
```
Accessible Abstract: Computers need to represent words in a computer-readable way. This work talks about how slightly moving these representations for words in different languages to be closer to a small list of translations (like from a dictionary) after doing fancy machine learning works better on downstream tasks (e.g., guessing grammatical category of a word) but hurts on asking the algorithm for translations of unseen words.

Shudong Hao, Michael J. Paul, and Jordan Boyd-Graber. Lessons from the Bible on Modern Topics: Multilingual Topic Model Evaluation on Low-Resource Languages. North American Association for Computational Linguistics, 2018. [Bibtex]

@inproceedings{Hao:Paul:Boyd-Graber-2018,
	Title = {Lessons from the Bible on Modern Topics: Multilingual Topic Model Evaluation on Low-Resource Languages},
	Author = {Shudong Hao and Michael J. Paul and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2018},
	Location = {New Orleans, LA},
	Url = {http://cs.umd.edu/~jbg//docs/2018_naacl_mltm_eval.pdf},
}

Michael Paul

Yoshinari Fujinuma, Michael Paul, and Jordan Boyd-Graber. A Resource-Free Evaluation Metric for Cross-Lingual Word Embeddings Based on Graph Modularity. Association for Computational Linguistics, 2019. [Bibtex]

@inproceedings{Fujinuma:Paul:Boyd-Graber-2019,
	Title = {A Resource-Free Evaluation Metric for Cross-Lingual Word Embeddings Based on Graph Modularity},
	Author = {Yoshinari Fujinuma and Michael Paul and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2019},
	Location = {Florence, Italy},
	Url = {http://cs.umd.edu/~jbg//docs/2019_acl_modularity.pdf},
}

Dasha Pruss, Yoshinari Fujinuma, Ashlynn Daughton, Michael Paul, Brad Arnot, Danielle Szafir, and Jordan Boyd-Graber. Zika discourse in the Americas: A multilingual topic analysis of Twitter. PlosOne, 2019. [Data] [Bibtex]

@article{Pruss:Fujinuma:Daughton:Paul:Arnot:Szafir:Boyd-Graber-2019,
	Author = {Dasha Pruss and Yoshinari Fujinuma and Ashlynn Daughton and Michael Paul and Brad Arnot and Danielle Szafir and Jordan Boyd-Graber},
	Journal = {PlosOne},
	Year = {2019},
	Title = {Zika discourse in the Americas: A multilingual topic analysis of {Twitter}},
	Url = {https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0216922},
}

Michelle Yuan

Michelle Yuan, Patrick Xia, Chandler May, Benjamin Van Durme, and Jordan Boyd-Graber. Adapting Coreference Resolution Models through Active Learning. Association for Computational Linguistics, 2022. [Code] [Bibtex]

@inproceedings{Yuan:Xia:May:Durme:Boyd-Graber-2022,
	Author = {Michelle Yuan and Patrick Xia and Chandler May and Benjamin Van Durme and Jordan Boyd-Graber},
	Title = {Adapting Coreference Resolution Models through Active Learning},
	Booktitle = {Association for Computational Linguistics},
	Year = {2022},
	Location = {Dublin},
	Url = {http://cs.umd.edu/~jbg//docs/2022_acl_alcoref.pdf},
}

Michelle Yuan, Hsuan-Tien Lin, and Jordan Boyd-Graber. Cold-start Active Learning through Self-Supervised Language Modeling. Empirical Methods in Natural Language Processing, 2020. [Video] [Code] [Bibtex]
```
@inproceedings{Yuan:Lin:Boyd-Graber-2020,
	Title = {Cold-start Active Learning through Self-Supervised Language Modeling},
	Author = {Michelle Yuan and Hsuan-Tien Lin and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2020},
	Location = {The Cyberverse Simulacrum of Punta Cana, Dominican Republic},
	Url = {http://cs.umd.edu/~jbg//docs/2020_emnlp_alps.pdf},
}
```
Accessible Abstract: Labeling data is a fundamental bottleneck in machine learning, especially for NLP, due to annotation cost and time. For medical text, obtaining labeled data is challenging because of privacy issues or shortage in expertise. Thus, active learning can be employed to recognize the most relevant examples and then query labels from an oracle. However, developing a strategy for selecting examples to label is non-trivial. Active learning is difficult to use in cold-start; all examples confuse the model because it has not trained on enough data. Fortunately, modern NLP provides an additional source of information: pre-trained language models. In our paper, we propose an active learning strategy called ALPS to find sentences that perplex the language model. We evaluate our approach on sentence classification datasets spanning across different domains. Results show that ALPS is an efficient active learning strategy that is competitive with state-of-the-art approaches.
Michelle Yuan, Mozhi Zhang, Benjamin Van Durme, Leah Findlater, and Jordan Boyd-Graber. Interactive Refinement of Cross-Lingual Word Embeddings. Empirical Methods in Natural Language Processing, 2020. [Audio of Paper Readthrough] [Video of Paper Readthrough] [Code] [Conference Talk] [Bibtex]
```
@inproceedings{Yuan:Zhang:Van-Durme:Findlater:Boyd-Graber-2020,
	Title = {Interactive Refinement of Cross-Lingual Word Embeddings},
	Author = {Michelle Yuan and Mozhi Zhang and Benjamin {Van Durme} and Leah Findlater and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2020},
	Location = {The Cyberverse Simulacrum of Punta Cana, Dominican Republic},
	Url = {http://cs.umd.edu/~jbg//docs/2020_emnlp_clime.pdf},
}
```
Accessible Abstract: Language technologies sometimes need to be quickly deployed in low-resource languages. For example, in the 2010 Haiti earthquake, researchers used machine learning models to analyze social media and text messages to gain situational awareness. We introduce CLIME, an interactive system that can help in these scenarios: users see which words related to the task the system thinks are similar, corrects the model to push similar words together and dissimilar words apart.

@inproceedings{Yuan:Van-Durme:Boyd-Graber-2018,
	Title = {Multilingual Anchoring: Interactive Topic Modeling and Alignment  Across Languages},
	Author = {Michelle Yuan and Benjamin {Van Durme} and Jordan Boyd-Graber},
	Booktitle = {Neural Information Processing Systems},
	Year = {2018},
	Location = {Montreal, Quebec},
	Url = {http://cs.umd.edu/~jbg//docs/2018_neurips_mtanchor.pdf},
}

Mohit Iyyer

Pedro Rodriguez, Shi Feng, Mohit Iyyer, He He, and Jordan Boyd-Graber. Quizbowl: The Case for Incremental Question Answering. ArXiv, Preprint. [Webpage] [Bibtex]

@article{Rodriguez:Feng:Iyyer:He:Boyd-Graber-Preprint,
	Title = {Quizbowl: The Case for Incremental Question Answering},
	Author = {Pedro Rodriguez and Shi Feng and Mohit Iyyer and He He and Jordan Boyd-Graber},
	Journal = {ArXiv},
	Year = {Preprint},
	Url = {https://arxiv.org/abs/1904.04792},
}

@inproceedings{Guo:Boyd-Graber:Iyyer:Findlater-2020,
	Author = {Fenfei Guo and Jordan Boyd-Graber and Mohit Iyyer and Leah Findlater},
	Location = {France (but only in dreams)},
	Booktitle = {Linguistic Resources and Evaluation Conference},
	Url = {http://cs.umd.edu/~jbg//docs/2020_lrec_sense.pdf},
	Year = {2020},
	Title = {Which Evaluations Uncover Sense Representations that Actually Make Sense?},
}

@inproceedings{Feng:Wallace:II:Rodriguez:Iyyer:Boyd-Graber-2018,
	Title = {Pathologies of Neural Models Make Interpretation Difficult},
	Author = {Shi Feng and Eric Wallace and Alvin Grissom II and Pedro Rodriguez and Mohit Iyyer and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2018},
	Location = {Brussels, Belgium},
	Url = {http://cs.umd.edu/~jbg//docs/2018_emnlp_rs.pdf},
}

Mohit Iyyer, Varun Manjunatha, Jordan Boyd-Graber, and Larry Davis. Learning to Color from Language. North American Association for Computational Linguistics, 2018. [Bibtex]

@inproceedings{Iyyer:Manjunatha:Boyd-Graber:Davis-2018,
	Title = {Learning to Color from Language},
	Author = {Mohit Iyyer and Varun Manjunatha and Jordan Boyd-Graber and Larry Davis},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2018},
	Url = {http://cs.umd.edu/~jbg//docs/2018_naacl_colorization.pdf},
}

Mohit Iyyer, Varun Manjunatha, Jordan Boyd-Graber, and Larry Davis. Learning to Color from Language. North American Association for Computational Linguistics, 2018. [Bibtex]

@inproceedings{Iyyer:Manjunatha:Boyd-Graber:Davis-2018,
	Title = {Learning to Color from Language},
	Author = {Mohit Iyyer and Varun Manjunatha and Jordan Boyd-Graber and Larry Davis},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2018},
	Url = {http://cs.umd.edu/~jbg//docs/2018_naacl_colorization.pdf},
}

@inproceedings{Iyyer:Manjunatha:Guha:Vyas:Boyd-Graber:Daume-III:Davis-2017,
	Title = {The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives},
	Author = {Mohit Iyyer and Varun Manjunatha and Anupam Guha and Yogarshi Vyas and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Larry Davis},
	Booktitle = {Computer Vision and Pattern Recognition},
	Year = {2017},
	Url = {http://cs.umd.edu/~jbg//docs/2017_cvpr_comics.pdf},
}

@inproceedings{Iyyer:Manjunatha:Guha:Vyas:Boyd-Graber:Daume-III:Davis-2017,
	Title = {The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives},
	Author = {Mohit Iyyer and Varun Manjunatha and Anupam Guha and Yogarshi Vyas and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Larry Davis},
	Booktitle = {Computer Vision and Pattern Recognition},
	Year = {2017},
	Url = {http://cs.umd.edu/~jbg//docs/2017_cvpr_comics.pdf},
}

@inproceedings{Guha:Iyyer:Boyd-Graber-2016,
	Title = {A Distorted Skull Lies in the Bottom Center: Identifying Paintings from Text Descriptions},
	Author = {Anupam Guha and Mohit Iyyer and Jordan Boyd-Graber},
	Booktitle = {NAACL Human-Computer Question Answering Workshop},
	Year = {2016},
	Location = {San Diego, CA},
	Url = {http://cs.umd.edu/~jbg//docs/2016_naacl_paintings.pdf},
}

@inproceedings{Iyyer:Guha:Chaturvedi:Boyd-Graber:Daume-III-2016,
	Title = {Feuding Families and Former Friends: Unsupervised Learning for Dynamic Fictional Relationships},
	Author = {Mohit Iyyer and Anupam Guha and Snigdha Chaturvedi and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2016},
	Location = {San Diego, CA},
	Url = {http://cs.umd.edu/~jbg//docs/2016_naacl_relationships.pdf},
}

Best paper award (2 out of 1592)

@inproceedings{Iyyer:Manjunatha:Boyd-Graber:Daume-III-2015,
	Title = {Deep Unordered Composition Rivals Syntactic Methods for Text Classification},
	Author = {Mohit Iyyer and Varun Manjunatha and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {Association for Computational Linguistics},
	Year = {2015},
	Location = {Beijing, China},
	Url = {http://cs.umd.edu/~jbg//docs/2015_acl_dan.pdf},
}

Jordan Boyd-Graber, Mohit Iyyer, He He, and Hal Daumé III. Interactive Incremental Question Answering. Neural Information Processing Systems, 2015. [Bibtex]

@inproceedings{Boyd-Graber:Iyyer:He:Daume-III-2015,
	Title = {Interactive Incremental Question Answering},
	Author = {Jordan Boyd-Graber and Mohit Iyyer and He He and Hal {Daum\'{e} III}},
	Booktitle = {Neural Information Processing Systems},
	Year = {2015},
	Location = {Montreal, Canada},
}

This won the best demonstration award at NIPS 2015

@inproceedings{Guha:Iyyer:Bouman:Boyd-Graber-2015,
	Title = {Removing the Training Wheels: A Coreference Dataset that Entertains Humans and Challenges Computers},
	Author = {Anupam Guha and Mohit Iyyer and Danny Bouman and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2015},
	Location = {Denver, Colorado},
	Url = {http://cs.umd.edu/~jbg//docs/2015_naacl_qb_coref.pdf},
}

Mohit Iyyer, Peter Enns, Jordan Boyd-Graber, and Philip Resnik. Political Ideology Detection Using Recursive Neural Networks. Association for Computational Linguistics, 2014. [Data] [Bibtex]

@inproceedings{Iyyer:Enns:Boyd-Graber:Resnik-2014,
	Title = {Political Ideology Detection Using Recursive Neural Networks},
	Author = {Mohit Iyyer and Peter Enns and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Association for Computational Linguistics},
	Year = {2014},
	Location = {Baltimore, MD},
	Url = {http://cs.umd.edu/~jbg//docs/2014_acl_rnn_ideology.pdf},
}

@inproceedings{Iyyer:Boyd-Graber:Claudino:Socher:Daume-III-2014,
	Title = {A Neural Network for Factoid Question Answering over Paragraphs},
	Author = {Mohit Iyyer and Jordan Boyd-Graber and Leonardo Claudino and Richard Socher and Hal {Daum\'{e} III}},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2014},
	Location = {Doha, Qatar},
	Url = {http://cs.umd.edu/~jbg//docs/2014_emnlp_qb_rnn.pdf},
}

The partial derivatives of "C" and "J" with respect to the parameters should be switched in Equation 7.

Mohit Iyyer, Jordan Boyd-Graber, and Hal Daumé III. Generating Sentences from Semantic Vector Space Representations. NIPS Workshop on Learning Semantics, 2014. [Bibtex]

@inproceedings{Iyyer:Boyd-Graber:Daume-III-2014,
	Title = {Generating Sentences from Semantic Vector Space Representations},
	Author = {Mohit Iyyer and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {NIPS Workshop on Learning Semantics},
	Year = {2014},
	Location = {Montreal, Canada},
}

Mozhi Zhang

Mozhi Zhang, Yoshinari Fujinuma, Michael J. Paul, and Jordan Boyd-Graber. Why Overfitting Isn't Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries. Association for Computational Linguistics, 2020. [Preprint] [Video] [Code] [Bibtex]
```
@inproceedings{Zhang:Fujinuma:Paul:Boyd-Graber-2020,
	Author = {Mozhi Zhang and Yoshinari Fujinuma and Michael J. Paul and Jordan Boyd-Graber},
	Title = {Why Overfitting Isn't Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries},
	Booktitle = {Association for Computational Linguistics},
	Year = {2020},
	Location = {The Cyberverse Simulacrum of Seattle},
	Url = {http://cs.umd.edu/~jbg//docs/2020_acl_refine.pdf},
}
```
Accessible Abstract: Computers need to represent words in a computer-readable way. This work talks about how slightly moving these representations for words in different languages to be closer to a small list of translations (like from a dictionary) after doing fancy machine learning works better on downstream tasks (e.g., guessing grammatical category of a word) but hurts on asking the algorithm for translations of unseen words.

Mozhi Zhang, Yoshinari Fujinuma, and Jordan Boyd-Graber. Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification. Association for the Advancement of Artificial Intelligence, 2020. [Bibtex]

@inproceedings{Zhang:Fujinuma:Boyd-Graber-2020,
	Title = {Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification},
	Author = {Mozhi Zhang and Yoshinari Fujinuma and Jordan Boyd-Graber},
	Booktitle = {Association for the Advancement of Artificial Intelligence},
	Year = {2020},
	Location = {New York, New York},
	Url = {https://arxiv.org/abs/1812.09617},
}

Michelle Yuan, Mozhi Zhang, Benjamin Van Durme, Leah Findlater, and Jordan Boyd-Graber. Interactive Refinement of Cross-Lingual Word Embeddings. Empirical Methods in Natural Language Processing, 2020. [Audio of Paper Readthrough] [Video of Paper Readthrough] [Code] [Conference Talk] [Bibtex]
```
@inproceedings{Yuan:Zhang:Van-Durme:Findlater:Boyd-Graber-2020,
	Title = {Interactive Refinement of Cross-Lingual Word Embeddings},
	Author = {Michelle Yuan and Mozhi Zhang and Benjamin {Van Durme} and Leah Findlater and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2020},
	Location = {The Cyberverse Simulacrum of Punta Cana, Dominican Republic},
	Url = {http://cs.umd.edu/~jbg//docs/2020_emnlp_clime.pdf},
}
```
Accessible Abstract: Language technologies sometimes need to be quickly deployed in low-resource languages. For example, in the 2010 Haiti earthquake, researchers used machine learning models to analyze social media and text messages to gain situational awareness. We introduce CLIME, an interactive system that can help in these scenarios: users see which words related to the task the system thinks are similar, corrects the model to push similar words together and dissimilar words apart.

Mozhi Zhang, Keyulu Xu, Ken-ichi Kawarabayashi, Stefanie Jegelka, and Jordan Boyd-Graber. Are Girls Neko or Shōjo? Cross-Lingual Alignment of Non-Isomorphic Embeddings with Iterative Normalization. Association for Computational Linguistics, 2019. [Preprint] [Bibtex]

@inproceedings{Zhang:Xu:Kawarabayashi:Jegelka:Boyd-Graber-2019,
	Author = {Mozhi Zhang and Keyulu Xu and Ken-ichi Kawarabayashi and Stefanie Jegelka and Jordan Boyd-Graber},
	Title = {Are Girls Neko or Sh{\=o}jo? {C}ross-Lingual Alignment of Non-Isomorphic Embeddings with Iterative Normalization},
	Booktitle = {Association for Computational Linguistics},
	Year = {2019},
	Location = {Florence, Italy},
	Url = {http://cs.umd.edu/~jbg//docs/2019_acl_clwe.pdf},
}

Mozhi Zhang, Yoshinari Fujinuma, and Jordan Boyd-Graber. Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification. ACL Workshop on Deep Learning Approaches for Low-Resource Natural Language Processing, 2018. [Bibtex]

@inproceedings{Zhang:Fujinuma:Boyd-Graber-2018,
	Title = {Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification},
	Author = {Mozhi Zhang and Yoshinari Fujinuma and Jordan Boyd-Graber},
	Booktitle = {ACL Workshop on Deep Learning Approaches for Low-Resource Natural Language Processing},
	Year = {2018},
	Location = {Melbourne, Australia},
}

@inproceedings{Zhang:Fujinuma:Boyd-Graber-2018,
	Title = {Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification},
	Author = {Mozhi Zhang and Yoshinari Fujinuma and Jordan Boyd-Graber},
	Booktitle = {ACL Workshop on Deep Learning Approaches for Low-Resource Natural Language Processing},
	Year = {2018},
	Location = {Melbourne, Australia},
}

Naho Orita

Alvin Grissom II, Naho Orita, and Jordan Boyd-Graber. Incremental Prediction of Sentence-final Verbs. Conference on Computational Natural Language Learning, 2016. [Bibtex]

@inproceedings{Grissom-II:Orita:Boyd-Graber-2016,
	Title = {Incremental Prediction of Sentence-final Verbs},
	Author = {Alvin {Grissom II} and Naho Orita and Jordan Boyd-Graber},
	Booktitle = {Conference on Computational Natural Language Learning},
	Year = {2016},
	Location = {Berlin, Germany},
	Url = {http://cs.umd.edu/~jbg//docs/2016_conll_verbpred.pdf},
}

Naho Orita, Naomi Feldman, and Jordan Boyd-Graber. Quantifying the role of discourse topicality in speakers' choices of referring expressions. ACL Workshop on Cognitive Modeling and Computational Linguistics, 2014. [Bibtex]

@inproceedings{Orita:Feldman:Boyd-Graber-2014,
	Title = {Quantifying the role of discourse topicality in  speakers' choices of referring expressions},
	Author = {Naho Orita and Naomi Feldman and Jordan Boyd-Graber},
	Booktitle = {ACL Workshop on Cognitive Modeling and Computational Linguistics},
	Year = {2014},
	Location = {Baltimore, Maryland},
}

@inproceedings{Orita:Feldman:Boyd-Graber-2014,
	Title = {Quantifying the role of discourse topicality in  speakers' choices of referring expressions},
	Author = {Naho Orita and Naomi Feldman and Jordan Boyd-Graber},
	Booktitle = {ACL Workshop on Cognitive Modeling and Computational Linguistics},
	Year = {2014},
	Location = {Baltimore, Maryland},
}

Naho Orita, Rebecca McKeown, Naomi H. Feldman, Jeffrey Lidz, and Jordan Boyd-Graber. Discovering Pronoun Categories using Discourse Information. Proceedings of the Cognitive Science Society, 2013. [Bibtex]

@inproceedings{Orita:McKeown:Feldman:Lidz:Boyd-Graber-2013,
	Title = {Discovering Pronoun Categories using Discourse Information},
	Author = {Naho Orita and Rebecca McKeown and Naomi H. Feldman and Jeffrey Lidz and Jordan Boyd-Graber},
	Booktitle = {Proceedings of the Cognitive Science Society},
	Year = {2013},
	Location = {Berlin, Germany},
	Url = {http://cs.umd.edu/~jbg//docs/2013_cogsci_pronoun.pdf},
}

Naomi Feldman

@inproceedings{Orita:Feldman:Boyd-Graber-2014,
	Title = {Quantifying the role of discourse topicality in  speakers' choices of referring expressions},
	Author = {Naho Orita and Naomi Feldman and Jordan Boyd-Graber},
	Booktitle = {ACL Workshop on Cognitive Modeling and Computational Linguistics},
	Year = {2014},
	Location = {Baltimore, Maryland},
}

@inproceedings{Orita:Feldman:Boyd-Graber-2014,
	Title = {Quantifying the role of discourse topicality in  speakers' choices of referring expressions},
	Author = {Naho Orita and Naomi Feldman and Jordan Boyd-Graber},
	Booktitle = {ACL Workshop on Cognitive Modeling and Computational Linguistics},
	Year = {2014},
	Location = {Baltimore, Maryland},
}

Neha Srikanth

Quynh C. Nguyen, Elizabeth M. Aparicio, Michelle Jasczynski, Amara Channell Doig, Xiaohe Yue, Heran Mane, Neha Punklik Srikanth, Francia Ximena Marin Gutierrez, Nataly Delcid, Xin He, and Jordan Boyd-Graber. Randomized Pilot of Rosie, a Health Education Question-and-Answer Chatbot for New Mothers. Journal of Medical Internet Research: Journal of Formative Research, 2024. [Bibtex]

@article{Nguyen:Aparicio:Jasczynski:Doig:Yue:Mane:Srikanth:Gutierrez:Delcid:He:Boyd-Graber-2024,
	Title = {Randomized Pilot of Rosie, a Health Education Question-and-Answer Chatbot for New Mothers},
	Author = {Quynh C. Nguyen and  Elizabeth M. Aparicio and  Michelle Jasczynski and  Amara Channell Doig and  Xiaohe Yue and  Heran Mane and Neha Srikanth and Francia Ximena Marin Gutierrez and Nataly Delcid and  Xin He and Jordan Boyd-Graber},
	Journal = {Journal of Medical Internet Research: Journal of Formative Research},
	Year = {2024},
	Url = {https://formative.jmir.org/2024/1/e51361},
}

@inproceedings{Srikanth:Sarkar:Y.:M.:C.:Rudinger:Boyd-Graber-2024,
	Title = {Pregnant Questions: The Importance of Pragmatic Awareness in Maternal Health Question Answering},
	Author = {Neha Srikanth and Rupak Sarkar and Mane, Heran Y. and Aparicio, Elizabeth M. and Nguyen, Quynh C. and Rachel Rudinger and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_naacl_pregnant.pdf},
}

@article{Y.:Amara:Ximena:Michelle:Xiaohe:Srikanth:Sourabh:Abby:Ann:Pragat:Xin:Boyd-Graber:M.:C.-2023,
	Title = {Practical Guidance for the Development of Rosie, a Health Education Question-and-Answer Chatbot for New Mothers},
	Author = {Mane, Heran Y. and Channell Doig, Amara and Marin Gutierrez, Francia Ximena and Jasczynski, Michelle and Yue, Xiaohe and Neha Srikanth and Mane, Sourabh and Sun, Abby and Moats, Rachel Ann and Patel, Pragat and He, Xin and Jordan Boyd-Graber and Aparicio, Elizabeth M. and Nguyen, Quynh C.},
	Journal = {Journal of Public Health Management and Practice},
	Year = {2023},
	Url = {https://journals.lww.com/jphmp/fulltext/2023/09000/practical_guidance_for_the_development_of_rosie,_a.9.aspx},
}

Nguyen, Quynh C.

@inproceedings{Srikanth:Sarkar:Y.:M.:C.:Rudinger:Boyd-Graber-2024,
	Title = {Pregnant Questions: The Importance of Pragmatic Awareness in Maternal Health Question Answering},
	Author = {Neha Srikanth and Rupak Sarkar and Mane, Heran Y. and Aparicio, Elizabeth M. and Nguyen, Quynh C. and Rachel Rudinger and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_naacl_pregnant.pdf},
}

@article{Y.:Amara:Ximena:Michelle:Xiaohe:Srikanth:Sourabh:Abby:Ann:Pragat:Xin:Boyd-Graber:M.:C.-2023,
	Title = {Practical Guidance for the Development of Rosie, a Health Education Question-and-Answer Chatbot for New Mothers},
	Author = {Mane, Heran Y. and Channell Doig, Amara and Marin Gutierrez, Francia Ximena and Jasczynski, Michelle and Yue, Xiaohe and Neha Srikanth and Mane, Sourabh and Sun, Abby and Moats, Rachel Ann and Patel, Pragat and He, Xin and Jordan Boyd-Graber and Aparicio, Elizabeth M. and Nguyen, Quynh C.},
	Journal = {Journal of Public Health Management and Practice},
	Year = {2023},
	Url = {https://journals.lww.com/jphmp/fulltext/2023/09000/practical_guidance_for_the_development_of_rosie,_a.9.aspx},
}

Niklas Elmqvist

@article{Lee:Smith:Seppi:Elmqvist:Boyd-Graber:Findlater-2017,
	Author = {Tak Yeon Lee and Alison Smith and Kevin Seppi and Niklas Elmqvist and Jordan Boyd-Graber and Leah Findlater},
	Journal = {International Journal of Human-Computer Studies},
	Year = {2017},
	Title = {The Human Touch: How Non-expert Users Perceive, Interpret, and Fix Topic Models},
	Url = {http://cs.umd.edu/~jbg//docs/2017_ijhcs_human_touch.pdf},
}

@article{Smith:Lee:Poursabzi-Sangdeh:Boyd-Graber:Seppi:Elmqvist:Findlater-2017,
	Author = {Alison Smith and Tak Yeon Lee and Forough Poursabzi-Sangdeh and Jordan Boyd-Graber and Kevin Seppi and Niklas Elmqvist and Leah Findlater},
	Journal = {Transactions of the Association for Computational Linguistics},
	Year = {2017},
	Title = {Evaluating Visual Representations for Topic Understanding and Their Effects on Manually Generated Labels},
	Volume = {5},
	Url = {http://cs.umd.edu/~jbg//docs/2017_tacl_eval_tm_viz.pdf},
	Pages = {1--15},
}

@inproceedings{Smith:Lee:Poursabzi-Sangdeh:Boyd-Graber:Seppi:Elmqvist:Findlater-2016,
	Author = {Alison Smith and Tak Yeon Lee and Forough Poursabzi-Sangdeh and Jordan Boyd-Graber and Kevin Seppi and Niklas Elmqvist and Leah Findlater},
	Booktitle = {CHI Human Centred Machine Learning Workshop},
	Year = {2016},
	Location = {San Jose, CA},
	Title = {Human-Centered and Interactive: Expanding the Impact of Topic Models},
}

Nikolai Vogler

@inproceedings{Stewart:Vogler:Hu:Boyd-Graber:Neubig-2018,
	Title = {Automatic Estimation of Simultaneous Interpreter Performance},
	Author = {Craig Stewart and Nikolai Vogler and Junjie Hu and Jordan Boyd-Graber and Graham Neubig},
	Booktitle = {Association for Computational Linguistics},
	Year = {2018},
	Location = {Melbourne, Australia},
	Url = {http://cs.umd.edu/~jbg//docs/2018_acl_interpeval.pdf},
}

@inproceedings{Stewart:Vogler:Hu:Boyd-Graber:Neubig-2018,
	Title = {Automatic Estimation of Simultaneous Interpreter Performance},
	Author = {Craig Stewart and Nikolai Vogler and Junjie Hu and Jordan Boyd-Graber and Graham Neubig},
	Booktitle = {Association for Computational Linguistics},
	Year = {2018},
	Location = {Melbourne, Australia},
	Url = {http://cs.umd.edu/~jbg//docs/2018_acl_interpeval.pdf},
}

Nishant Balepur

Nishant Balepur, Alexa Siu, Nedim Lipka, Franck Dernoncourt, Tong Sun, Jordan Lee Boyd-Graber, and Puneet Mathur. MoDS: Moderating a Mixture of Document Speakers to Summarize Debatable Queries in Document Collections. Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, 2025. [Bibtex]
```
@inproceedings{Balepur:Siu:Lipka:Dernoncourt:Sun:Boyd-Graber:Mathur-2025,
	Title = {MoDS: Moderating a Mixture of Document Speakers to Summarize Debatable Queries in Document Collections},
	Author = {Nishant Balepur and Alexa Siu and Nedim Lipka and Franck Dernoncourt and Tong Sun and Jordan Lee Boyd-Graber and Puneet Mathur},
	Booktitle = {Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics},
	Year = {2025},
	Location = {Albuquerque},
	Url = {http://cs.umd.edu/~jbg//docs/2025_naacl_mods.pdf},
}
```
Accessible Abstract: When you ask ChatGPT for advice on questions with multiple perspectives (e.g. "Is pineapple good on pizza?"), you likely want a response that fairly represents all viewpoints. We formulate this task, collect a dataset to test it, and develop MoDS—a system where multiple ChatGPT's debate like a panel discussion—to generate balanced answers for questions based on multiple sources.
Nishant Balepur, Feng Gu, Abhilasha Ravichander, Shi Feng, Jordan Boyd-Graber, and Rachel Rudinger. Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer?. Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, 2025. [Bibtex]
```
@inproceedings{Balepur:Gu:Ravichander:Feng:Boyd-Graber:Rudinger-2025,
	Title = {Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer?},
	Author = {Nishant Balepur and Feng Gu and Abhilasha Ravichander and Shi Feng and Jordan Boyd-Graber and Rachel Rudinger},
	Booktitle = {Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics},
	Year = {2025},
	Location = {Albuquerque},
	Url = {http://cs.umd.edu/~jbg//docs/2025_naacl_reverseqa.pdf},
}
```
Accessible Abstract: Language models like ChatGPT are pretty good at answering questions (e.g. "What is 12 * 12?"), but we show they can surprisingly struggle when asked to do the reverse task: generating questions for answers (e.g. "Give me a question with the answer 144"). We study when these errors happen, what might be causing them, and how they can be addressed.
Nishant Balepur, Matthew Shu, Alexander Hoyle, Alison Robey, Shi Feng, Seraphina Goldfarb-Tarrant, and Jordan Boyd-Graber. A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick. Empirical Methods in Natural Language Processing, 2024. [Code and Data] [Research Talk] [Bibtex]
```
@inproceedings{Balepur:Shu:Hoyle:Robey:Feng:Goldfarb-Tarrant:Boyd-Graber-2024,
	Title = {A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick},
	Author = {Nishant Balepur and Matthew Shu and Alexander Hoyle and Alison Robey and Shi Feng and Seraphina Goldfarb-Tarrant and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2024},
	Location = {Miami},
	Url = {http://cs.umd.edu/~jbg//docs/2024_emnlp_mnemonic.pdf},
}
```
Accessible Abstract: Learning vocabulary (e.g., benevolent) can be tedious, but using mnemonics (e.g., benevolent sounds like "benefits," and a kind boss gives benefits) makes it more engaging and effective. This paper introduces SMART, a large language model trained to produce mnemonics based on feedback from flashcard learners. Students struggle to predict which mnemonics will help them most. Still, by training SMART on both student preferences and learning outcomes, we can generate mnemonics as effectively as GPT-4, but at a much lower cost.
Matthew Shu, Nishant Balepur, Shi Feng, and Jordan Boyd-Graber. KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students. Empirical Methods in Natural Language Processing, 2024. [Code and Data] [Research Talk] [Bibtex]
```
@inproceedings{Shu:Balepur:Feng:Boyd-Graber-2024,
	Title = {KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students},
	Author = {Matthew Shu and Nishant Balepur and Shi Feng and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Location = {Miami},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_emnlp_karl.pdf},
}
```
Accessible Abstract: Flashcard help students study by figuring out which flashcards to show students and when. However, current systems do not pay attention to what information (the actual text of the flashcards) to make these predictions. This paper introduces KARL, a new flashcard scheduler that uses language models to encode the text of flashcards. We host KARL in our own flashcard app for 500+ learners and show that students using KARL learn more efficiently than when they use other traditional systems that only know, for example, that a student has studied Flashcard \#24601 on Monday and got it wrong.

Paul Felt

@inproceedings{Felt:Ringger:Seppi:Boyd-Graber-2018,
	Title = {Learning from Measurements in Crowdsourcing Models: Inferring Ground Truth from Diverse Annotation Types},
	Author = {Paul Felt and Eric Ringger and Kevin Seppi and Jordan Boyd-Graber},
	Booktitle = {International Conference on Computational Linguistics},
	Year = {2018},
	Location = {Santa Fe, New Mexico},
	Url = {http://cs.umd.edu/~jbg//docs/2018_coling_measurements.pdf},
}

@inproceedings{Felt:Ringger:Boyd-Graber:Seppi-2015,
	Title = {Making the Most of Crowdsourced Document Annotations: Confused Supervised {LDA}},
	Author = {Paul Felt and Eric Ringger and Jordan Boyd-Graber and Kevin Seppi},
	Booktitle = {Conference on Computational Natural Language Learning},
	Year = {2015},
	Location = {Beijing, China},
	Url = {http://cs.umd.edu/~jbg//docs/2015_conll_cslda.pdf},
}

This paper received the best paper award at CoNLL

Pedro Rodriguez

Pedro Rodriguez, Shi Feng, Mohit Iyyer, He He, and Jordan Boyd-Graber. Quizbowl: The Case for Incremental Question Answering. ArXiv, Preprint. [Webpage] [Bibtex]

@article{Rodriguez:Feng:Iyyer:He:Boyd-Graber-Preprint,
	Title = {Quizbowl: The Case for Incremental Question Answering},
	Author = {Pedro Rodriguez and Shi Feng and Mohit Iyyer and He He and Jordan Boyd-Graber},
	Journal = {ArXiv},
	Year = {Preprint},
	Url = {https://arxiv.org/abs/1904.04792},
}

Pedro Rodriguez, Joe Barrow, Alexander Hoyle, John P. Lalor, Robin Jia, and Jordan Boyd-Graber. Evaluation Examples Are Not Equally Informative: How Should That Change NLP Leaderboards?. Association for Computational Linguistics, 2021. [Results and Code] [Paper Read Aloud] [Research Talk Video] [Code and Data] [Bibtex]
```
@inproceedings{Rodriguez:Barrow:Hoyle:Lalor:Jia:Boyd-Graber-2021,
	Title = {Evaluation Examples Are Not Equally Informative: How Should That Change NLP Leaderboards?},
	Author = {Pedro Rodriguez and Joe Barrow and Alexander Hoyle and John P. Lalor and Robin Jia and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2021},
	Url = {http://cs.umd.edu/~jbg//docs/2021_acl_leaderboard.pdf},
}
```
Accessible Abstract: When can we call an AI "intelligent"? Just like humans, a common approach is to ask them a bunch of questions. These questions posed to modern machine learning methods are collected in metrics called leaderboards to monitor progress, but beyond ranking approaches, this does not help us better understand our problems or our systems very well. This paper introduces probabilistic models inspired by psychometric approaches called item response theory models (think year-end standardized tests) to better understand how computers can answer questions and whether we are asking the right questions. This allows researchers to better compare what kinds of questions systems can answer, better compare human and machine ability, and discover problematic questions (e.g., questions that have incorrect answer keys, are vague, or "trick" those trying to answer the questions).
Pedro Rodriguez and Jordan Boyd-Graber. Evaluation Paradigms in Question Answering. Empirical Methods in Natural Language Processing, 2021. [Research Talk] [Bibtex]
```
@inproceedings{Rodriguez:Boyd-Graber-2021,
	Title = {Evaluation Paradigms in Question Answering},
	Author = {Pedro Rodriguez and Jordan Boyd-Graber},
	Location = {Punta Cana},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2021},
	Url = {http://cs.umd.edu/~jbg//docs/2021_emnlp_paradigms.pdf},
}
```
Accessible Abstract: Why do we answer questions? Sometimes it's to provide information, which has been the interpretation of the computer science community. But sometimes it's to probe or test intelligence. This paper argues we should think more about that application of question answering and its connection to the foundations of artificial intelligence: The Turing Test. We thus argue that in addition to the long-standing Cranfield paradigm popularized by information retrieval, this paper proposes an alternative "Manchester paradigm" closer to the Turing test, trivia games, and education.

@inproceedings{Peskov:Barrow:Rodriguez:Neubig:Boyd-Graber-2019,
	Title = {Mitigating Noisy Inputs for Question Answering},
	Author = {Denis Peskov and Joe Barrow and Pedro Rodriguez and Graham Neubig and Jordan Boyd-Graber},
	Booktitle = {Conference of the International Speech Communication Association},
	Year = {2019},
	Location = {Graz, Austria},
	Url = {http://cs.umd.edu/~jbg//docs/2019_interspeech_asr},
}

@article{Wallace:Rodriguez:Feng:Yamada:Boyd-Graber-2019,
	Title = {Trick Me If You Can: Human-in-the-loop Generation of Adversarial Question Answering Examples},
	Author = {Eric Wallace and Pedro Rodriguez and Shi Feng and Ikuya Yamada and Jordan Boyd-Graber},
	Booktitle = {Transactions of the Association for Computational Linguistics},
	Year = {2019},
	Volume = {10},
	Url = {http://cs.umd.edu/~jbg//docs/2019_tacl_trick.pdf},
}

@inproceedings{Feng:Wallace:II:Rodriguez:Iyyer:Boyd-Graber-2018,
	Title = {Pathologies of Neural Models Make Interpretation Difficult},
	Author = {Shi Feng and Eric Wallace and Alvin Grissom II and Pedro Rodriguez and Mohit Iyyer and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2018},
	Location = {Brussels, Belgium},
	Url = {http://cs.umd.edu/~jbg//docs/2018_emnlp_rs.pdf},
}

Jordan Boyd-Graber, Shi Feng, and Pedro Rodriguez. Human-Computer Question Answering: The Case for Quizbowl. The NIPS '17 Competition: Building Intelligent Systems, 2018. [Bibtex]

@inbook{Boyd-Graber:Feng:Rodriguez-2018,
	Editor = {Sergio Escalera and Markus Weimer},
	Title = {Human-Computer Question Answering: The Case for Quizbowl},
	Author = {Jordan Boyd-Graber and Shi Feng and Pedro Rodriguez},
	Booktitle = {The NIPS '17 Competition: Building Intelligent Systems},
	Publisher = {Springer Verlag},
	Year = {2018},
	Url = {http://cs.umd.edu/~jbg//docs/2018_nips_qbcomp.pdf},
}

Perry Cook

@inproceedings{Nikolova:Boyd-Graber:Fellbaum:Cook-2009,
	Author = {Sonya S. Nikolova and Jordan Boyd-Graber and Christiane Fellbaum and Perry Cook},
	Title = {Better Vocabularies for Assistive Communication Aids: Connecting Terms using Semantic Networks and Untrained Annotators},
	Booktitle = {ACM Conference on Computers and Accessibility},
	Year = {2009},
	Location = {Pittsburgh, PA},
	Url = {http://cs.umd.edu/~jbg//docs/evocation-viva.pdf},
}

Xiaojuan Ma, Jordan Boyd-Graber, Sonya S. Nikolova, and Perry Cook. Speaking Through Pictures: Images vs. Icons. ACM Conference on Computers and Accessibility, 2009. [Slides] [Bibtex]

@inproceedings{Ma:Boyd-Graber:Nikolova:Cook-2009,
	Title = {Speaking Through Pictures: Images vs. Icons},
	Author = {Xiaojuan Ma and Jordan Boyd-Graber and Sonya S. Nikolova and Perry Cook},
	Booktitle = {ACM Conference on Computers and Accessibility},
	Year = {2009},
	Location = {Pittsburgh, PA},
	Url = {http://cs.umd.edu/~jbg//docs/image_icon.pdf},
}

Sonya S. Nikolova, Jordan Boyd-Graber, and Perry Cook. The Design of ViVA: A Mixed-initiative Visual Vocabulary for Aphasia. Proceedings of the 27th international conference extended abstracts on Human factors in computing systems, 2009. [Bibtex]

@inproceedings{Nikolova:Boyd-Graber:Cook-2009,
	Author = {Sonya S. Nikolova and Jordan Boyd-Graber and Perry Cook},
	Title = {The Design of ViVA: A Mixed-initiative Visual Vocabulary for Aphasia},
	Booktitle = {Proceedings of the 27th international conference extended abstracts on Human factors in computing systems},
	Url = {http://cs.umd.edu/~jbg//docs/viva.pdf},
	Year = {2009},
	Isbn = {978-1-60558-247-4},
	Pages = {4015--4020},
	Location = {Boston, MA, USA},
	Publisher = {ACM},
	Address = {New York, NY, USA},
}

Philip Resnik

Alexander Hoyle, Pranav Goel, Denis Peskov, Andrew Hian-Cheong, Jordan Boyd-Graber, and Philip Resnik. Is Automated Topic Model Evaluation Broken?: The Incoherence of Coherence. Neural Information Processing Systems, 2021. [ArXiv] [Research Talk (students)] [Research Talk (Jordan)] [Code] [Bibtex]
```
@inproceedings{Hoyle:Goel:Peskov:Hian-Cheong:Boyd-Graber:Resnik-2021,
	Title = {Is Automated Topic Model Evaluation Broken?: The Incoherence of Coherence},
	Author = {Alexander Hoyle and Pranav Goel and Denis Peskov and Andrew Hian-Cheong and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Neural Information Processing Systems},
	Location = {Online},
	Year = {2021},
	Url = {http://cs.umd.edu/~jbg//docs/2021_neurips_incoherence.pdf},
}
```
Accessible Abstract: Topic models help historians, journalists, and analysts make sense of large text collections. But how do you know if you have a good one? The field has settled on using "Automatic Coherence", but this paper argues that maybe that isn't the right choice if you want to actually make real users happy. This paper builds on our 2009 that showed perplexity was not a good evaluation of interpretability for topic models; while the field adopted automatic topic coherence as a result of that 2009 paper, this paper argues that automatic topic coherence is not a good metric for neural topic models (even though it worked for probabilistic topic models).

Weiwei Yang, Jordan Boyd-Graber, and Philip Resnik. A Multilingual Topic Model for Learning Weighted Topic Links Across Incomparable Corpora. Empirical Methods in Natural Language Processing, 2019. [Bibtex]

@inproceedings{Yang:Boyd-Graber:Resnik-2019,
	Title = {A Multilingual Topic Model for Learning Weighted Topic Links Across Incomparable Corpora},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2019},
	Location = {Hong Kong, China},
	Author = {Weiwei Yang and Jordan Boyd-Graber and Philip Resnik},
	Url = {http://cs.umd.edu/~jbg//docs/2019_emnlp_mtm.pdf},
}

@inproceedings{Yang:Boyd-Graber:Resnik-2019,
	Title = {A Multilingual Topic Model for Learning Weighted Topic Links Across Incomparable Corpora},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2019},
	Location = {Hong Kong, China},
	Author = {Weiwei Yang and Jordan Boyd-Graber and Philip Resnik},
	Url = {http://cs.umd.edu/~jbg//docs/2019_emnlp_mtm.pdf},
}

Weiwei Yang, Jordan Boyd-Graber, and Philip Resnik. Adapting Topic Models using Lexical Associations with Tree Priors. Empirical Methods in Natural Language Processing, 2017. [Bibtex]

@inproceedings{Yang:Boyd-Graber:Resnik-2017,
	Title = {Adapting Topic Models using Lexical Associations with Tree Priors},
	Author = {Weiwei Yang and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2017},
	Location = {Copenhagen, Denmark},
	Url = {http://cs.umd.edu/~jbg//docs/2017_emnlp_tree_prior.pdf},
}

Weiwei Yang, Jordan Boyd-Graber, and Philip Resnik. A Discriminative Topic Model using Document Network Structure. Association for Computational Linguistics, 2016. [Supplement] [Bibtex]

@inproceedings{Yang:Boyd-Graber:Resnik-2016,
	Title = {A Discriminative Topic Model using Document Network Structure},
	Author = {Weiwei Yang and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Association for Computational Linguistics},
	Year = {2016},
	Location = {Berlin, Brandenburg},
	Url = {http://cs.umd.edu/~jbg//docs/2016_acl_docblock.pdf},
}

@inproceedings{Amiri:Resnik:Boyd-Graber:Daume-III-2016,
	Title = {Learning Text Pair Similarity with Context-sensitive Autoencoders},
	Author = {Hadi Amiri and Philip Resnik and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {Association for Computational Linguistics},
	Year = {2016},
	Location = {Berlin, Brandenburg},
	Url = {http://cs.umd.edu/~jbg//docs/2016_acl_context_ae.pdf},
}

Weiwei Yang, Jordan Boyd-Graber, and Philip Resnik. Birds of a Feather in the Same Nest: A Discriminative Topic Model using Block-based Priors. Mid-Atlantic Student Colloquium on Speech, Language, and Learning, 2016. [Bibtex]

@inproceedings{Yang:Boyd-Graber:Resnik-2016,
	Title = {Birds of a Feather in the Same Nest: A Discriminative Topic Model using Block-based Priors},
	Author = {Weiwei Yang and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Mid-Atlantic Student Colloquium on Speech, Language, and Learning},
	Year = {2016},
	Location = {Philadephia},
}

@inproceedings{Yang:Boyd-Graber:Resnik-2016,
	Title = {Birds of a Feather in the Same Nest: A Discriminative Topic Model using Block-based Priors},
	Author = {Weiwei Yang and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Mid-Atlantic Student Colloquium on Speech, Language, and Learning},
	Year = {2016},
	Location = {Philadephia},
}

Viet-An Nguyen, Jordan Boyd-Graber, Philip Resnik, and Kristina Miler. Tea Party in the House: A Hierarchical Ideal Point Topic Model and Its Application to Republican Legislators in the 112th Congress. Association for Computational Linguistics, 2015. [Talk] [Code] [LaTeX] [Bibtex]
```
@inproceedings{Nguyen:Boyd-Graber:Resnik:Miler-2015,
	Title = {Tea Party in the House: A Hierarchical Ideal Point Topic Model and Its Application to Republican Legislators in the 112th Congress},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik and Kristina Miler},
	Booktitle = {Association for Computational Linguistics},
	Year = {2015},
	Location = {Beijing, China},
	Url = {http://cs.umd.edu/~jbg//docs/2015_acl_teaparty.pdf},
}
```
Accessible Abstract: In the mid 2010s, the Republican party in the United States diverged: mainstream conservatives split from the so-called "tea party" caucus. However, the primary statistical tool for analyzing political factions in legislative bodies (ideal point models) fail to account for these changes. This is because the schism is not fully reflected in voting patterns but rather in how politicians present themselves: thus we need to extend these models to capture not just how politicians vote but also how they frame particular issues. This paper proposes a new model to capture framing differences within a voting block to start explaining the new subcoalitions of the republican caucus.

Weiwei Yang, Jordan Boyd-Graber, and Philip Resnik. Birds of a Feather Linked Together: A Discriminative Topic Model using Link-based Priors. Empirical Methods in Natural Language Processing, 2015. [Bibtex]

@inproceedings{Yang:Boyd-Graber:Resnik-2015,
	Title = {Birds of a Feather Linked Together: A Discriminative Topic Model using Link-based Priors},
	Url = {http://cs.umd.edu/~jbg//docs/2015_emnlp_hinge_link.pdf},
	Author = {Weiwei Yang and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2015},
	Location = {Lisbon, Portugal},
}

@inproceedings{Resnik:Armstrong:Claudino:Nguyen:Nguyen:Boyd-Graber-2015,
	Title = {Beyond {LDA}: Exploring Supervised Topic Modeling for Depression-Related Language in Twitter},
	Author = {Philip Resnik and William Armstrong and Leonardo Claudino and Thang Nguyen and Viet-An Nguyen and Jordan Boyd-Graber},
	Booktitle = {NAACL Workshop on Cognitive Modeling and Computational Linguistics},
	Year = {2015},
	Location = {Denver, Colorado},
}

@inproceedings{Iyyer:Enns:Boyd-Graber:Resnik-2014,
	Title = {Political Ideology Detection Using Recursive Neural Networks},
	Author = {Mohit Iyyer and Peter Enns and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Association for Computational Linguistics},
	Year = {2014},
	Location = {Baltimore, MD},
	Url = {http://cs.umd.edu/~jbg//docs/2014_acl_rnn_ideology.pdf},
}

Viet-An Nguyen, Jordan Boyd-Graber, and Philip Resnik. Sometimes Average is Best: The Importance of Averaging for Prediction using MCMC Inference in Topic Modeling. Empirical Methods in Natural Language Processing, 2014. [Bibtex]

@inproceedings{Nguyen:Boyd-Graber:Resnik-2014,
	Title = {Sometimes Average is Best: The Importance of Averaging for Prediction using MCMC Inference in Topic Modeling},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2014},
	Location = {Doha, Qatar},
	Url = {http://cs.umd.edu/~jbg//docs/2014_emnlp_howto_gibbs.pdf},
}

@inproceedings{Nguyen:Boyd-Graber:Resnik-2014,
	Title = {Sometimes Average is Best: The Importance of Averaging for Prediction using MCMC Inference in Topic Modeling},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2014},
	Location = {Doha, Qatar},
	Url = {http://cs.umd.edu/~jbg//docs/2014_emnlp_howto_gibbs.pdf},
}

Viet-An Nguyen, Jordan Boyd-Graber, Philip Resnik, Deborah Cai, Jennifer Midberry, and Yuanxin Wang. Modeling Topic Control to Detect Influence in Conversations using Nonparametric Topic Models. Machine Learning, 2014. [Journal] [Code] [Data] [Bibtex]

@article{Nguyen:Boyd-Graber:Resnik:Cai:Midberry:Wang-2014,
	Title = {Modeling Topic Control to Detect Influence in Conversations using Nonparametric Topic Models},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik and Deborah Cai and Jennifer Midberry and Yuanxin Wang},
	Url = {http://cs.umd.edu/~jbg//docs/2014_mlj_influencer.pdf},
	Journal = {Machine Learning},
	Year = {2014},
	Volume = {95},
	Pages = {381--421},
	Publisher = {Springer},
}

@inproceedings{Nguyen:Boyd-Graber:Resnik:Chang-2014,
	Title = {Learning a Concept Hierarchy from Multi-labeled Documents},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik and Jonathan Chang},
	Booktitle = {Neural Information Processing Systems},
	Year = {2014},
	Location = {Montreal, Canada},
	Url = {http://cs.umd.edu/~jbg//docs/2014_nips_l2h.pdf},
}

@inproceedings{Nguyen:Boyd-Graber:Chang:Resnik-2013,
	Title = {Tree-Based Label Dependency Topic Models},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Jonathan Chang and Philip Resnik},
	Booktitle = {NIPS Workshop on Topic Models: Computation, Application, and Evaluation},
	Year = {2013},
	Location = {Lake Tahoe, Nevada},
}

Viet-An Nguyen, Jordan Boyd-Graber, and Philip Resnik. Lexical and Hierarchical Topic Regression. Neural Information Processing Systems, 2013. [Supplement] [Bibtex]

@inproceedings{Nguyen:Boyd-Graber:Resnik-2013,
	Url = {http://cs.umd.edu/~jbg//docs/2013_shlda.pdf},
	Title = {Lexical and Hierarchical Topic Regression},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Neural Information Processing Systems},
	Year = {2013},
	Location = {Lake Tahoe, Nevada},
}

Viet-An Nguyen, Yuening Hu, Jordan Boyd-Graber, and Philip Resnik. Argviz: Interactive Visualization of Topic Dynamics in Multi-party Conversations. North American Association for Computational Linguistics, 2013. [Bibtex]

@inproceedings{Nguyen:Hu:Boyd-Graber:Resnik-2013,
	Title = {Argviz: Interactive Visualization of Topic Dynamics in Multi-party Conversations},
	Author = {Viet-An Nguyen and Yuening Hu and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2013},
	Location = {Atlanta Georgia},
	Url = {http://cs.umd.edu/~jbg//docs/2013_argviz.pdf},
}

Vladimir Eidelman, Jordan Boyd-Graber, and Philip Resnik. Topic Models for Dynamic Translation Model Adaptation. Association for Computational Linguistics, 2012. [Presentation] [More Recent Paper] [Bibtex]
```
@inproceedings{Eidelman:Boyd-Graber:Resnik-2012,
	Title = {Topic Models for Dynamic Translation Model Adaptation},
	Author = {Vladimir Eidelman and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Association for Computational Linguistics},
	Year = {2012},
	Location = {Jeju, South Korea},
	Url = {http://cs.umd.edu/~jbg//docs/acl_2012_tm_for_mt.pdf},
}
```
For a more thorough evaluation and an exploration of more advanced topic models for machine translation, see: Yuening Hu, Ke Zhai, Vlad Eidelman, and Jordan Boyd-Graber. Polylingual Tree-Based Topic Models for Translation Domain Adaptation. Association for Computational Linguistics, 2014.

Viet-An Nguyen, Jordan Boyd-Graber, and Philip Resnik. SITS: A Hierarchical Nonparametric Model using Speaker Identity for Topic Segmentation in Multiparty Conversations. Association for Computational Linguistics, 2012. [Data] [Code] [Slides] [Appendix] [Bibtex]

@inproceedings{Nguyen:Boyd-Graber:Resnik-2012,
	Title = {{SITS}: A Hierarchical Nonparametric Model using Speaker Identity for Topic Segmentation in Multiparty Conversations},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Association for Computational Linguistics},
	Year = {2012},
	Location = {Jeju, South Korea},
	Url = {http://cs.umd.edu/~jbg//docs/acl_2012_sits.pdf},
}

Viet-An Nguyen, Jordan Boyd-Graber, and Philip Resnik. "I Want to Talk About, Again, My Record On Energy …'': Modeling Topic Control in Conversations using Speaker-centric Nonparametric Topic Models. Mid-Atlantic Student Colloquium on Speech, Language, and Learning, 2012. [Bibtex]

@inproceedings{Nguyen:Boyd-Graber:Resnik-2012,
	Title = {``I Want to Talk About, Again, My Record On Energy~\dots'':  Modeling
  Topic Control in Conversations using Speaker-centric Nonparametric Topic Models},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Mid-Atlantic Student Colloquium on Speech, Language, and Learning},
	Year = {2012},
}

Pranav Anand, Joseph King, Jordan Boyd-Graber, Earl Wagner, Craig Martell, Douglas W. Oard, and Philip Resnik. Believe Me: We Can Do This!. The AAAI 2011 workshop on Computational Models of Natural Argument, 2011. [Data] [Presentation] [Bibtex]

@inproceedings{Anand:King:Boyd-Graber:Wagner:Martell:Oard:Resnik-2011,
	Title = {Believe Me: We Can Do This!},
	Author = {Pranav Anand and Joseph King and Jordan Boyd-Graber and Earl Wagner and Craig Martell and Douglas W. Oard and Philip Resnik},
	Booktitle = {The AAAI 2011 workshop on Computational Models of Natural Argument},
	Year = {2011},
	Location = {San Francisco, CA},
	Url = {http://cs.umd.edu/~jbg//docs/persuasion.pdf},
}

Nitin Madnani, Jordan Boyd-Graber, and Philip Resnik. Measuring Transitivity Using Untrained Annotators. Creating Speech and Language Data With Amazon's Mechanical Turk, 2010. [Data] [Bibtex]

@inproceedings{Madnani:Boyd-Graber:Resnik-2010,
	Title = {Measuring Transitivity Using Untrained Annotators},
	Author = {Nitin Madnani and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Creating Speech and Language Data With Amazon's Mechanical Turk},
	Year = {2010},
	Location = {Los Angeles, CA},
	Url = {http://cs.umd.edu/~jbg//docs/madnani-boyd-graber-turk-workshop.pdf},
}

Jordan Boyd-Graber and Philip Resnik. Holistic Sentiment Analysis Across Languages: Multilingual Supervised Latent Dirichlet Allocation. Empirical Methods in Natural Language Processing, 2010. [Data] [Bibtex]

@inproceedings{Boyd-Graber:Resnik-2010,
	Title = {Holistic Sentiment Analysis Across Languages: Multilingual Supervised Latent Dirichlet Allocation},
	Author = {Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2010},
	Location = {Cambridge, MA},
	Url = {http://cs.umd.edu/~jbg//docs/jbg-mlslda-2010.pdf},
}

Eric Hardisty, Jordan Boyd-Graber, and Philip Resnik. Modeling Perspective using Adaptor Grammars. Empirical Methods in Natural Language Processing, 2010. [Bibtex]

@inproceedings{Hardisty:Boyd-Graber:Resnik-2010,
	Title = {Modeling Perspective using Adaptor Grammars},
	Author = {Eric Hardisty and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2010},
	Location = {Cambridge, MA},
	Url = {http://cs.umd.edu/~jbg//docs/adapted_naive_bayes.pdf},
}

Pranav Goel

@inproceedings{Li:Mao:Stephens:Goel:Walpole:Fung:Dima:Boyd-Graber-2024,
	Title = {TENOR: Topic Enabled Neural Organization and Recommendation: Evaluating Topic Models in Task Based Settings},
	Author = {Zongxia Li and Andrew Mao and Daniel Kofi Stephens and Pranav Goel and Emily Walpole and Juan Francisco Fung and Alden Dima and Jordan Lee Boyd-Graber},
	Booktitle = {European Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_eacl_tenor.pdf},
}

Alexander Hoyle, Pranav Goel, Denis Peskov, Andrew Hian-Cheong, Jordan Boyd-Graber, and Philip Resnik. Is Automated Topic Model Evaluation Broken?: The Incoherence of Coherence. Neural Information Processing Systems, 2021. [ArXiv] [Research Talk (students)] [Research Talk (Jordan)] [Code] [Bibtex]
```
@inproceedings{Hoyle:Goel:Peskov:Hian-Cheong:Boyd-Graber:Resnik-2021,
	Title = {Is Automated Topic Model Evaluation Broken?: The Incoherence of Coherence},
	Author = {Alexander Hoyle and Pranav Goel and Denis Peskov and Andrew Hian-Cheong and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Neural Information Processing Systems},
	Location = {Online},
	Year = {2021},
	Url = {http://cs.umd.edu/~jbg//docs/2021_neurips_incoherence.pdf},
}
```
Accessible Abstract: Topic models help historians, journalists, and analysts make sense of large text collections. But how do you know if you have a good one? The field has settled on using "Automatic Coherence", but this paper argues that maybe that isn't the right choice if you want to actually make real users happy. This paper builds on our 2009 that showed perplexity was not a good evaluation of interpretability for topic models; while the field adopted automatic topic coherence as a result of that 2009 paper, this paper argues that automatic topic coherence is not a good metric for neural topic models (even though it worked for probabilistic topic models).

Rachel Rudinger

Nishant Balepur, Feng Gu, Abhilasha Ravichander, Shi Feng, Jordan Boyd-Graber, and Rachel Rudinger. Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer?. Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, 2025. [Bibtex]
```
@inproceedings{Balepur:Gu:Ravichander:Feng:Boyd-Graber:Rudinger-2025,
	Title = {Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer?},
	Author = {Nishant Balepur and Feng Gu and Abhilasha Ravichander and Shi Feng and Jordan Boyd-Graber and Rachel Rudinger},
	Booktitle = {Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics},
	Year = {2025},
	Location = {Albuquerque},
	Url = {http://cs.umd.edu/~jbg//docs/2025_naacl_reverseqa.pdf},
}
```
Accessible Abstract: Language models like ChatGPT are pretty good at answering questions (e.g. "What is 12 * 12?"), but we show they can surprisingly struggle when asked to do the reverse task: generating questions for answers (e.g. "Give me a question with the answer 144"). We study when these errors happen, what might be causing them, and how they can be addressed.

@inproceedings{Srikanth:Sarkar:Y.:M.:C.:Rudinger:Boyd-Graber-2024,
	Title = {Pregnant Questions: The Importance of Pragmatic Awareness in Maternal Health Question Answering},
	Author = {Neha Srikanth and Rupak Sarkar and Mane, Heran Y. and Aparicio, Elizabeth M. and Nguyen, Quynh C. and Rachel Rudinger and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_naacl_pregnant.pdf},
}

Robert Schapire

@inproceedings{Boyd-Graber:Fellbaum:Osherson:Schapire-2006,
	Title = {Adding Dense, Weighted, Connections to {WordNet}},
	Author = {Jordan Boyd-Graber and Christiane Fellbaum and Daniel Osherson and Robert Schapire},
	Booktitle = {Proceedings of the Global {WordNet} Conference},
	Year = {2006},
	Location = {Jeju, South Korea},
	Url = {http://cs.umd.edu/~jbg//docs/jbg-jeju.pdf},
}

@inproceedings{Boyd-Graber:Fellbaum:Osherson:Schapire-2006,
	Title = {Adding Dense, Weighted, Connections to {WordNet}},
	Author = {Jordan Boyd-Graber and Christiane Fellbaum and Daniel Osherson and Robert Schapire},
	Booktitle = {Proceedings of the Global {WordNet} Conference},
	Year = {2006},
	Location = {Jeju, South Korea},
	Url = {http://cs.umd.edu/~jbg//docs/jbg-jeju.pdf},
}

Sambaran Bandyopadhyay

@inproceedings{Mondal:S:Natarajan:Garimella:Bandyopadhyay:Boyd-Graber-2024,
	Title = {Presentations by the People, for the People: Harnessing LLMs for Generating Persona-Aware Slides from Documents},
	Author = {Ishani Mondal and Shwetha S and Anandhavelu Natarajan and Aparna Garimella and Sambaran Bandyopadhyay and Jordan Boyd-Graber},
	Booktitle = {European Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_eacl_slides.pdf},
}

@article{Mondal:Li:Hou:Natarajan:Garimella:Bandyopadhyay:Boyd-Graber-2024,
	Title = {SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement},
	Author = {Ishani Mondal and Zongxia Li and Yufang Hou and Anandhavelu Natarajan and Aparna Garimella and Sambaran Bandyopadhyay and Jordan Boyd-Graber},
	Year = {2024},
	Journal = {Findings of the Empirical Methods in Natural Language Processing},
	Url = {http://cs.umd.edu/~jbg//docs/2024_emnlp_diagramgen.pdf},
}

Shay B. Cohen

Ke Zhai, Jordan Boyd-Graber, and Shay B. Cohen. Hybrid Online Inference with Adaptor Grammars. NIPS Workshop on Advances in Variational Inference, 2014. [Bibtex]

@article{Zhai:Boyd-Graber:Cohen-2014,
	Title = {Hybrid Online Inference with Adaptor Grammars},
	Author = {Ke Zhai and Jordan Boyd-Graber and Shay B. Cohen},
	Booktitle = {NIPS Workshop on Advances in Variational Inference},
	Year = {2014},
}

Ke Zhai, Jordan Boyd-Graber, and Shay B. Cohen. Online Adaptor Grammars with Hybrid Inference. Transactions of the Association for Computational Linguistics, 2014. [Code] [Bibtex]

@article{Zhai:Boyd-Graber:Cohen-2014,
	Title = {Online Adaptor Grammars with Hybrid Inference},
	Author = {Ke Zhai and Jordan Boyd-Graber and Shay B. Cohen},
	Journal = {Transactions of the Association for Computational Linguistics},
	Year = {2014},
	Publisher = {Association for Computational Linguistics},
	Url = {http://cs.umd.edu/~jbg//docs/2014_tacl_ag_vb_online.pdf},
}

Shi Feng

Pedro Rodriguez, Shi Feng, Mohit Iyyer, He He, and Jordan Boyd-Graber. Quizbowl: The Case for Incremental Question Answering. ArXiv, Preprint. [Webpage] [Bibtex]

@article{Rodriguez:Feng:Iyyer:He:Boyd-Graber-Preprint,
	Title = {Quizbowl: The Case for Incremental Question Answering},
	Author = {Pedro Rodriguez and Shi Feng and Mohit Iyyer and He He and Jordan Boyd-Graber},
	Journal = {ArXiv},
	Year = {Preprint},
	Url = {https://arxiv.org/abs/1904.04792},
}

Nishant Balepur, Feng Gu, Abhilasha Ravichander, Shi Feng, Jordan Boyd-Graber, and Rachel Rudinger. Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer?. Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics, 2025. [Bibtex]
```
@inproceedings{Balepur:Gu:Ravichander:Feng:Boyd-Graber:Rudinger-2025,
	Title = {Reverse Question Answering: Can an LLM Write a Question so Hard (or Bad) that it Can't Answer?},
	Author = {Nishant Balepur and Feng Gu and Abhilasha Ravichander and Shi Feng and Jordan Boyd-Graber and Rachel Rudinger},
	Booktitle = {Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics},
	Year = {2025},
	Location = {Albuquerque},
	Url = {http://cs.umd.edu/~jbg//docs/2025_naacl_reverseqa.pdf},
}
```
Accessible Abstract: Language models like ChatGPT are pretty good at answering questions (e.g. "What is 12 * 12?"), but we show they can surprisingly struggle when asked to do the reverse task: generating questions for answers (e.g. "Give me a question with the answer 144"). We study when these errors happen, what might be causing them, and how they can be addressed.
Nishant Balepur, Matthew Shu, Alexander Hoyle, Alison Robey, Shi Feng, Seraphina Goldfarb-Tarrant, and Jordan Boyd-Graber. A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick. Empirical Methods in Natural Language Processing, 2024. [Code and Data] [Research Talk] [Bibtex]
```
@inproceedings{Balepur:Shu:Hoyle:Robey:Feng:Goldfarb-Tarrant:Boyd-Graber-2024,
	Title = {A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick},
	Author = {Nishant Balepur and Matthew Shu and Alexander Hoyle and Alison Robey and Shi Feng and Seraphina Goldfarb-Tarrant and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2024},
	Location = {Miami},
	Url = {http://cs.umd.edu/~jbg//docs/2024_emnlp_mnemonic.pdf},
}
```
Accessible Abstract: Learning vocabulary (e.g., benevolent) can be tedious, but using mnemonics (e.g., benevolent sounds like "benefits," and a kind boss gives benefits) makes it more engaging and effective. This paper introduces SMART, a large language model trained to produce mnemonics based on feedback from flashcard learners. Students struggle to predict which mnemonics will help them most. Still, by training SMART on both student preferences and learning outcomes, we can generate mnemonics as effectively as GPT-4, but at a much lower cost.
Matthew Shu, Nishant Balepur, Shi Feng, and Jordan Boyd-Graber. KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students. Empirical Methods in Natural Language Processing, 2024. [Code and Data] [Research Talk] [Bibtex]
```
@inproceedings{Shu:Balepur:Feng:Boyd-Graber-2024,
	Title = {KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students},
	Author = {Matthew Shu and Nishant Balepur and Shi Feng and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Location = {Miami},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_emnlp_karl.pdf},
}
```
Accessible Abstract: Flashcard help students study by figuring out which flashcards to show students and when. However, current systems do not pay attention to what information (the actual text of the flashcards) to make these predictions. This paper introduces KARL, a new flashcard scheduler that uses language models to encode the text of flashcards. We host KARL in our own flashcard app for 500+ learners and show that students using KARL learn more efficiently than when they use other traditional systems that only know, for example, that a student has studied Flashcard \#24601 on Monday and got it wrong.

@inproceedings{Si:Goyal:Wu:Zhao:Feng:III:Boyd-Graber-2024,
	Title = {Large Language Models Help Humans Verify Truthfulness---Except When They Are Convincingly Wrong},
	Author = {Chenglei Si and Navita Goyal and Tongshuang Wu and Chen Zhao and Shi Feng and Hal Daum\'{e} {III} and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_naacl_convincingly.pdf},
}

Shi Feng and Jordan Boyd-Graber. Learning to Explain Selectively: A Case Study on Question Answering. Empirical Methods in Natural Language Processing, 2022. [Research Teaser] [Code and Data] [Bibtex]
```
@inproceedings{Feng:Boyd-Graber-2022,
	Title = {Learning to Explain Selectively: A Case Study on Question Answering},
	Author = {Shi Feng and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2022},
	Location = {Abu Dhabi},
	Url = {http://cs.umd.edu/~jbg//docs/2022_emnlp_augment.pdf},
}
```
Accessible Abstract: Many AI methods are a black box: input goes in, predictions come out. While there are many AI explanation tools that you can add to these predictions, how do you know if they are any good. In this work presented at EMNLP, if you put a human in front of a AI that's trying to answer questions, our hypothesis is that you can measure how good the underlying explanations are by how much the human's score goes up. This 2022 EMNLP publication not just measures which combinations of explanations are most effective for an individual. We use bandit exploration to quickly figure out what set of explanations best help a specific user.

Eric Wallace, Shi Feng, and Jordan Boyd-Graber. Misleading Failures of Partial-input Baselines. Association for Computational Linguistics, 2019. [Bibtex]

@inproceedings{Wallace:Feng:Boyd-Graber-2019,
	Title = {Misleading Failures of Partial-input Baselines},
	Author = {Eric Wallace and Shi Feng and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2019},
	Location = {Florence, Italy},
	Url = {http://cs.umd.edu/~jbg//docs/2019_acl_flipside.pdf},
}

Shi Feng and Jordan Boyd-Graber. What AI can do for me: Evaluating Machine Learning Interpretations in Cooperative Play. Intelligent User Interfaces, 2019. [Bibtex]

@inproceedings{Feng:Boyd-Graber-2019,
	Title = {What AI can do for me: Evaluating Machine Learning Interpretations in Cooperative Play},
	Author = {Shi Feng and Jordan Boyd-Graber},
	Booktitle = {Intelligent User Interfaces},
	Year = {2019},
	Location = {Los Angeles, CA},
	Url = {http://cs.umd.edu/~jbg//docs/2019_iui_augment.pdf},
}

Shi Feng and Jordan Boyd-Graber. What AI can do for me: Evaluating Machine Learning Interpretations in Cooperative Play. Intelligent User Interfaces, 2019. [Bibtex]

@inproceedings{Feng:Boyd-Graber-2019,
	Title = {What AI can do for me: Evaluating Machine Learning Interpretations in Cooperative Play},
	Author = {Shi Feng and Jordan Boyd-Graber},
	Booktitle = {Intelligent User Interfaces},
	Year = {2019},
	Location = {Los Angeles, CA},
	Url = {http://cs.umd.edu/~jbg//docs/2019_iui_augment.pdf},
}

@article{Wallace:Rodriguez:Feng:Yamada:Boyd-Graber-2019,
	Title = {Trick Me If You Can: Human-in-the-loop Generation of Adversarial Question Answering Examples},
	Author = {Eric Wallace and Pedro Rodriguez and Shi Feng and Ikuya Yamada and Jordan Boyd-Graber},
	Booktitle = {Transactions of the Association for Computational Linguistics},
	Year = {2019},
	Volume = {10},
	Url = {http://cs.umd.edu/~jbg//docs/2019_tacl_trick.pdf},
}

@inproceedings{Feng:Wallace:Boyd-Graber-2018,
	Title = {Interpreting Neural Networks with Nearest Neighbors},
	Author = {Shi Feng and Eric Wallace and Jordan Boyd-Graber},
	Booktitle = {EMNLP Workshop on BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP},
	Year = {2018},
	Location = {Brussels, Belgium},
	Url = {http://aclweb.org/anthology/W18-5416},
}

@inproceedings{Feng:Wallace:II:Rodriguez:Iyyer:Boyd-Graber-2018,
	Title = {Pathologies of Neural Models Make Interpretation Difficult},
	Author = {Shi Feng and Eric Wallace and Alvin Grissom II and Pedro Rodriguez and Mohit Iyyer and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2018},
	Location = {Brussels, Belgium},
	Url = {http://cs.umd.edu/~jbg//docs/2018_emnlp_rs.pdf},
}

Jordan Boyd-Graber, Shi Feng, and Pedro Rodriguez. Human-Computer Question Answering: The Case for Quizbowl. The NIPS '17 Competition: Building Intelligent Systems, 2018. [Bibtex]

@inbook{Boyd-Graber:Feng:Rodriguez-2018,
	Editor = {Sergio Escalera and Markus Weimer},
	Title = {Human-Computer Question Answering: The Case for Quizbowl},
	Author = {Jordan Boyd-Graber and Shi Feng and Pedro Rodriguez},
	Booktitle = {The NIPS '17 Competition: Building Intelligent Systems},
	Publisher = {Springer Verlag},
	Year = {2018},
	Url = {http://cs.umd.edu/~jbg//docs/2018_nips_qbcomp.pdf},
}

Sonya S. Nikolova

@inbook{Nikolova:Boyd-Graber:Fellbaum-2011,
	Author = {Sonya S. Nikolova and Jordan Boyd-Graber and Christiane Fellbaum},
	Title = {Collecting Semantic Similarity Ratings to Connect Concepts in Assistive Communication Tools},
	Editor = {Angelika Storrer},
	Booktitle = {Modeling, Learning and Processing of Text Technological Data Structures},
	Series = {Studies in Computational Intelligence},
	Address = {Heidelberg},
	Url = {http://cs.umd.edu/~jbg//docs/2011_book_chapter_evocation.pdf},
	Publisher = {Springer Verlag},
	Year = {2011},
}

@inproceedings{Nikolova:Boyd-Graber:Fellbaum:Cook-2009,
	Author = {Sonya S. Nikolova and Jordan Boyd-Graber and Christiane Fellbaum and Perry Cook},
	Title = {Better Vocabularies for Assistive Communication Aids: Connecting Terms using Semantic Networks and Untrained Annotators},
	Booktitle = {ACM Conference on Computers and Accessibility},
	Year = {2009},
	Location = {Pittsburgh, PA},
	Url = {http://cs.umd.edu/~jbg//docs/evocation-viva.pdf},
}

Xiaojuan Ma, Jordan Boyd-Graber, Sonya S. Nikolova, and Perry Cook. Speaking Through Pictures: Images vs. Icons. ACM Conference on Computers and Accessibility, 2009. [Slides] [Bibtex]

@inproceedings{Ma:Boyd-Graber:Nikolova:Cook-2009,
	Title = {Speaking Through Pictures: Images vs. Icons},
	Author = {Xiaojuan Ma and Jordan Boyd-Graber and Sonya S. Nikolova and Perry Cook},
	Booktitle = {ACM Conference on Computers and Accessibility},
	Year = {2009},
	Location = {Pittsburgh, PA},
	Url = {http://cs.umd.edu/~jbg//docs/image_icon.pdf},
}

@inproceedings{Nikolova:Boyd-Graber:Cook-2009,
	Author = {Sonya S. Nikolova and Jordan Boyd-Graber and Perry Cook},
	Title = {The Design of ViVA: A Mixed-initiative Visual Vocabulary for Aphasia},
	Booktitle = {Proceedings of the 27th international conference extended abstracts on Human factors in computing systems},
	Url = {http://cs.umd.edu/~jbg//docs/viva.pdf},
	Year = {2009},
	Isbn = {978-1-60558-247-4},
	Pages = {4015--4020},
	Location = {Boston, MA, USA},
	Publisher = {ACM},
	Address = {New York, NY, USA},
}

Jordan Boyd-Graber, Sonya S. Nikolova, Karyn A. Moffatt, Kenrick C. Kin, Joshua Y. Lee, Lester W. Mackey, Marilyn M. Tremaine, and Maria M. Klawe. Participatory design with proxies: Developing a desktop-PDA system to support people with aphasia. Computer-Human Interaction, 2006. [Presentation] [Bibtex]

@inproceedings{Boyd-Graber:Nikolova:Moffatt:Kin:Lee:Mackey:Tremaine:Klawe-2006,
	Title = {Participatory design with proxies: {D}eveloping a desktop-{PDA} system to support people with aphasia},
	Author = {Jordan Boyd-Graber and Sonya S. Nikolova and Karyn A. Moffatt and Kenrick C. Kin and Joshua Y. Lee and Lester W. Mackey and Marilyn M. Tremaine and Maria M. Klawe},
	Booktitle = {Computer-Human Interaction},
	Year = {2006},
	Location = {Montreal, Quebec},
	Url = {http://cs.umd.edu/~jbg//docs/paper673-boyd-graber.pdf},
}

Tak Yeon Lee

@article{Lee:Smith:Seppi:Elmqvist:Boyd-Graber:Findlater-2017,
	Author = {Tak Yeon Lee and Alison Smith and Kevin Seppi and Niklas Elmqvist and Jordan Boyd-Graber and Leah Findlater},
	Journal = {International Journal of Human-Computer Studies},
	Year = {2017},
	Title = {The Human Touch: How Non-expert Users Perceive, Interpret, and Fix Topic Models},
	Url = {http://cs.umd.edu/~jbg//docs/2017_ijhcs_human_touch.pdf},
}

@article{Smith:Lee:Poursabzi-Sangdeh:Boyd-Graber:Seppi:Elmqvist:Findlater-2017,
	Author = {Alison Smith and Tak Yeon Lee and Forough Poursabzi-Sangdeh and Jordan Boyd-Graber and Kevin Seppi and Niklas Elmqvist and Leah Findlater},
	Journal = {Transactions of the Association for Computational Linguistics},
	Year = {2017},
	Title = {Evaluating Visual Representations for Topic Understanding and Their Effects on Manually Generated Labels},
	Volume = {5},
	Url = {http://cs.umd.edu/~jbg//docs/2017_tacl_eval_tm_viz.pdf},
	Pages = {1--15},
}

@inproceedings{Smith:Lee:Poursabzi-Sangdeh:Boyd-Graber:Seppi:Elmqvist:Findlater-2016,
	Author = {Alison Smith and Tak Yeon Lee and Forough Poursabzi-Sangdeh and Jordan Boyd-Graber and Kevin Seppi and Niklas Elmqvist and Leah Findlater},
	Booktitle = {CHI Human Centred Machine Learning Workshop},
	Year = {2016},
	Location = {San Jose, CA},
	Title = {Human-Centered and Interactive: Expanding the Impact of Topic Models},
}

Thang Nguyen

@inproceedings{Resnik:Armstrong:Claudino:Nguyen:Nguyen:Boyd-Graber-2015,
	Title = {Beyond {LDA}: Exploring Supervised Topic Modeling for Depression-Related Language in Twitter},
	Author = {Philip Resnik and William Armstrong and Leonardo Claudino and Thang Nguyen and Viet-An Nguyen and Jordan Boyd-Graber},
	Booktitle = {NAACL Workshop on Cognitive Modeling and Computational Linguistics},
	Year = {2015},
	Location = {Denver, Colorado},
}

@inproceedings{Nguyen:Boyd-Graber:Lund:Seppi:Ringger-2015,
	Title = {Is your anchor going up or down?  {F}ast and accurate supervised topic models},
	Author = {Thang Nguyen and Jordan Boyd-Graber and Jeff Lund and Kevin Seppi and Eric Ringger},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2015},
	Location = {Denver, Colorado},
	Url = {http://cs.umd.edu/~jbg//docs/2015_naacl_supervised_anchor.pdf},
}

Thang Nguyen, Yuening Hu, and Jordan Boyd-Graber. Anchors Regularized: Adding Robustness and Extensibility to Scalable Topic-Modeling Algorithms. Association for Computational Linguistics, 2014. [Talk] [Bibtex]

@inproceedings{Nguyen:Hu:Boyd-Graber-2014,
	Title = {Anchors Regularized: Adding Robustness and Extensibility to Scalable Topic-Modeling Algorithms},
	Author = {Thang Nguyen and Yuening Hu and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2014},
	Location = {Baltimore, MD},
	Url = {http://cs.umd.edu/~jbg//docs/2014_acl_anchor_reg.pdf},
}

@inproceedings{Nguyen:Hu:Boyd-Graber-2014,
	Title = {Anchors Regularized: Adding Robustness and Extensibility to Scalable Topic-Modeling Algorithms},
	Author = {Thang Nguyen and Yuening Hu and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2014},
	Location = {Baltimore, MD},
	Url = {http://cs.umd.edu/~jbg//docs/2014_acl_anchor_reg.pdf},
}

Thang Nguyen, Yuening Hu, and Jordan Boyd-Graber. Evaluating Regularized Anchor Words. NIPS Workshop on Topic Models: Computation, Application, and Evaluation, 2013. [Bibtex]

@inproceedings{Nguyen:Hu:Boyd-Graber-2013,
	Title = {Evaluating Regularized Anchor Words},
	Author = {Thang Nguyen and Yuening Hu and Jordan Boyd-Graber},
	Booktitle = {NIPS Workshop on Topic Models: Computation, Application, and Evaluation},
	Year = {2013},
	Location = {Lake Tahoe, Nevada},
}

Tongshuang Wu

@inproceedings{Si:Goyal:Wu:Zhao:Feng:III:Boyd-Graber-2024,
	Title = {Large Language Models Help Humans Verify Truthfulness---Except When They Are Convincingly Wrong},
	Author = {Chenglei Si and Navita Goyal and Tongshuang Wu and Chen Zhao and Shi Feng and Hal Daum\'{e} {III} and Jordan Boyd-Graber},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_naacl_convincingly.pdf},
}

@inproceedings{Smith:Boyd-Graber:Fan:Birchfield:Wu:Weld:Findlater-2020,
	Author = {Alison Smith and Jordan Boyd-Graber and Ron Fan and Melissa Birchfield and Tongshuang Wu and Dan Weld and Leah Findlater},
	Booktitle = {Computer-Human Interaction},
	Year = {2020},
	Url = {http://cs.umd.edu/~jbg//docs/2020_chi_explanation.pdf},
	Title = {No Explainability without Accountability: An Empirical Study of Explanations and Feedback in Interactive ML},
}

Varun Kumar

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2020,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {Intelligent User Interfaces},
	Url = {http://cs.umd.edu/~jbg//docs/2020_iui_control.pdf},
	Year = {2020},
	Title = {Digging into User Control: Perceptions of Adherence and
Instability in Transparent Models},
}

@inproceedings{Kumar:Smith:Findlater:Seppi:Boyd-Graber-2019,
	Author = {Varun Kumar and Alison Smith and Leah Findlater and Kevin Seppi and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2019},
	Location = {Florence, Italy},
	Title = {Why Didn't You Listen to Me? Comparing User Control of Human-in-the-Loop Topic Models},
	Url = {http://cs.umd.edu/~jbg//docs/2019_acl_control.pdf},
}

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2018,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {Intelligent User Interfaces},
	Year = {2018},
	Title = {User-Centered Design and Evaluation of a Human-in-the-Loop Topic Modeling System},
	Url = {http://cs.umd.edu/~jbg//docs/2018_iui_itm.pdf},
}

Alison won a best student paper honorable mention (3 out of 300)

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2018,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {Intelligent User Interfaces},
	Year = {2018},
	Title = {User-Centered Design and Evaluation of a Human-in-the-Loop Topic Modeling System},
	Url = {http://cs.umd.edu/~jbg//docs/2018_iui_itm.pdf},
}

Alison won a best student paper honorable mention (3 out of 300)

@inproceedings{Smith:Kumar:Boyd-Graber:Seppi:Findlater-2017,
	Author = {Alison Smith and Varun Kumar and Jordan Boyd-Graber and Kevin Seppi and Leah Findlater},
	Booktitle = {CHI 2017 Designing for Uncertainty Workshop},
	Year = {2017},
	Location = {Denver, CO},
	Title = {Accounting for Input Uncertainty in Human-in-the-Loop Systems},
	Url = {http://visualization.ischool.uw.edu/hci_uncertainty/papers/Paper11.pdf},
}

Varun Manjunatha

Mohit Iyyer, Varun Manjunatha, Jordan Boyd-Graber, and Larry Davis. Learning to Color from Language. North American Association for Computational Linguistics, 2018. [Bibtex]

@inproceedings{Iyyer:Manjunatha:Boyd-Graber:Davis-2018,
	Title = {Learning to Color from Language},
	Author = {Mohit Iyyer and Varun Manjunatha and Jordan Boyd-Graber and Larry Davis},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2018},
	Url = {http://cs.umd.edu/~jbg//docs/2018_naacl_colorization.pdf},
}

Mohit Iyyer, Varun Manjunatha, Jordan Boyd-Graber, and Larry Davis. Learning to Color from Language. North American Association for Computational Linguistics, 2018. [Bibtex]

@inproceedings{Iyyer:Manjunatha:Boyd-Graber:Davis-2018,
	Title = {Learning to Color from Language},
	Author = {Mohit Iyyer and Varun Manjunatha and Jordan Boyd-Graber and Larry Davis},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2018},
	Url = {http://cs.umd.edu/~jbg//docs/2018_naacl_colorization.pdf},
}

@inproceedings{Iyyer:Manjunatha:Guha:Vyas:Boyd-Graber:Daume-III:Davis-2017,
	Title = {The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives},
	Author = {Mohit Iyyer and Varun Manjunatha and Anupam Guha and Yogarshi Vyas and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Larry Davis},
	Booktitle = {Computer Vision and Pattern Recognition},
	Year = {2017},
	Url = {http://cs.umd.edu/~jbg//docs/2017_cvpr_comics.pdf},
}

@inproceedings{Iyyer:Manjunatha:Guha:Vyas:Boyd-Graber:Daume-III:Davis-2017,
	Title = {The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives},
	Author = {Mohit Iyyer and Varun Manjunatha and Anupam Guha and Yogarshi Vyas and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Larry Davis},
	Booktitle = {Computer Vision and Pattern Recognition},
	Year = {2017},
	Url = {http://cs.umd.edu/~jbg//docs/2017_cvpr_comics.pdf},
}

@inproceedings{Iyyer:Manjunatha:Boyd-Graber:Daume-III-2015,
	Title = {Deep Unordered Composition Rivals Syntactic Methods for Text Classification},
	Author = {Mohit Iyyer and Varun Manjunatha and Jordan Boyd-Graber and Hal {Daum\'{e} III}},
	Booktitle = {Association for Computational Linguistics},
	Year = {2015},
	Location = {Beijing, China},
	Url = {http://cs.umd.edu/~jbg//docs/2015_acl_dan.pdf},
}

Viet-An Nguyen

Viet-An Nguyen, Jordan Boyd-Graber, Philip Resnik, and Kristina Miler. Tea Party in the House: A Hierarchical Ideal Point Topic Model and Its Application to Republican Legislators in the 112th Congress. Association for Computational Linguistics, 2015. [Talk] [Code] [LaTeX] [Bibtex]
```
@inproceedings{Nguyen:Boyd-Graber:Resnik:Miler-2015,
	Title = {Tea Party in the House: A Hierarchical Ideal Point Topic Model and Its Application to Republican Legislators in the 112th Congress},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik and Kristina Miler},
	Booktitle = {Association for Computational Linguistics},
	Year = {2015},
	Location = {Beijing, China},
	Url = {http://cs.umd.edu/~jbg//docs/2015_acl_teaparty.pdf},
}
```
Accessible Abstract: In the mid 2010s, the Republican party in the United States diverged: mainstream conservatives split from the so-called "tea party" caucus. However, the primary statistical tool for analyzing political factions in legislative bodies (ideal point models) fail to account for these changes. This is because the schism is not fully reflected in voting patterns but rather in how politicians present themselves: thus we need to extend these models to capture not just how politicians vote but also how they frame particular issues. This paper proposes a new model to capture framing differences within a voting block to start explaining the new subcoalitions of the republican caucus.

@inproceedings{Resnik:Armstrong:Claudino:Nguyen:Nguyen:Boyd-Graber-2015,
	Title = {Beyond {LDA}: Exploring Supervised Topic Modeling for Depression-Related Language in Twitter},
	Author = {Philip Resnik and William Armstrong and Leonardo Claudino and Thang Nguyen and Viet-An Nguyen and Jordan Boyd-Graber},
	Booktitle = {NAACL Workshop on Cognitive Modeling and Computational Linguistics},
	Year = {2015},
	Location = {Denver, Colorado},
}

@inproceedings{Nguyen:Boyd-Graber:Resnik-2014,
	Title = {Sometimes Average is Best: The Importance of Averaging for Prediction using MCMC Inference in Topic Modeling},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2014},
	Location = {Doha, Qatar},
	Url = {http://cs.umd.edu/~jbg//docs/2014_emnlp_howto_gibbs.pdf},
}

@inproceedings{Nguyen:Boyd-Graber:Resnik-2014,
	Title = {Sometimes Average is Best: The Importance of Averaging for Prediction using MCMC Inference in Topic Modeling},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2014},
	Location = {Doha, Qatar},
	Url = {http://cs.umd.edu/~jbg//docs/2014_emnlp_howto_gibbs.pdf},
}

@article{Nguyen:Boyd-Graber:Resnik:Cai:Midberry:Wang-2014,
	Title = {Modeling Topic Control to Detect Influence in Conversations using Nonparametric Topic Models},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik and Deborah Cai and Jennifer Midberry and Yuanxin Wang},
	Url = {http://cs.umd.edu/~jbg//docs/2014_mlj_influencer.pdf},
	Journal = {Machine Learning},
	Year = {2014},
	Volume = {95},
	Pages = {381--421},
	Publisher = {Springer},
}

@inproceedings{Nguyen:Boyd-Graber:Resnik:Chang-2014,
	Title = {Learning a Concept Hierarchy from Multi-labeled Documents},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik and Jonathan Chang},
	Booktitle = {Neural Information Processing Systems},
	Year = {2014},
	Location = {Montreal, Canada},
	Url = {http://cs.umd.edu/~jbg//docs/2014_nips_l2h.pdf},
}

Viet-An Nguyen, Jordan Boyd-Graber, and Stephen Altschul. Dirichlet Mixtures, the Dirichlet Process, and the Structure of Protein Space. Journal of Computational Biology, 2013. [Journal] [Bibtex]

@article{Nguyen:Boyd-Graber:Altschul-2013,
	Title = {Dirichlet Mixtures, the Dirichlet Process, and the Structure of Protein Space},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Stephen Altschul},
	Journal = {Journal of Computational Biology},
	Year = {2013},
	Volume = {20},
	Number = {1},
	Url = {http://cs.umd.edu/~jbg//docs/2013_dp_protein.pdf},
}

@inproceedings{Nguyen:Boyd-Graber:Chang:Resnik-2013,
	Title = {Tree-Based Label Dependency Topic Models},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Jonathan Chang and Philip Resnik},
	Booktitle = {NIPS Workshop on Topic Models: Computation, Application, and Evaluation},
	Year = {2013},
	Location = {Lake Tahoe, Nevada},
}

Viet-An Nguyen, Jordan Boyd-Graber, and Philip Resnik. Lexical and Hierarchical Topic Regression. Neural Information Processing Systems, 2013. [Supplement] [Bibtex]

@inproceedings{Nguyen:Boyd-Graber:Resnik-2013,
	Url = {http://cs.umd.edu/~jbg//docs/2013_shlda.pdf},
	Title = {Lexical and Hierarchical Topic Regression},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Neural Information Processing Systems},
	Year = {2013},
	Location = {Lake Tahoe, Nevada},
}

@inproceedings{Nguyen:Hu:Boyd-Graber:Resnik-2013,
	Title = {Argviz: Interactive Visualization of Topic Dynamics in Multi-party Conversations},
	Author = {Viet-An Nguyen and Yuening Hu and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2013},
	Location = {Atlanta Georgia},
	Url = {http://cs.umd.edu/~jbg//docs/2013_argviz.pdf},
}

@inproceedings{Nguyen:Boyd-Graber:Resnik-2012,
	Title = {{SITS}: A Hierarchical Nonparametric Model using Speaker Identity for Topic Segmentation in Multiparty Conversations},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Association for Computational Linguistics},
	Year = {2012},
	Location = {Jeju, South Korea},
	Url = {http://cs.umd.edu/~jbg//docs/acl_2012_sits.pdf},
}

@inproceedings{Nguyen:Boyd-Graber:Resnik-2012,
	Title = {``I Want to Talk About, Again, My Record On Energy~\dots'':  Modeling
  Topic Control in Conversations using Speaker-centric Nonparametric Topic Models},
	Author = {Viet-An Nguyen and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Mid-Atlantic Student Colloquium on Speech, Language, and Learning},
	Year = {2012},
}

Weiwei Yang

@inproceedings{Yang:Boyd-Graber:Resnik-2019,
	Title = {A Multilingual Topic Model for Learning Weighted Topic Links Across Incomparable Corpora},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2019},
	Location = {Hong Kong, China},
	Author = {Weiwei Yang and Jordan Boyd-Graber and Philip Resnik},
	Url = {http://cs.umd.edu/~jbg//docs/2019_emnlp_mtm.pdf},
}

@inproceedings{Yang:Boyd-Graber:Resnik-2019,
	Title = {A Multilingual Topic Model for Learning Weighted Topic Links Across Incomparable Corpora},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2019},
	Location = {Hong Kong, China},
	Author = {Weiwei Yang and Jordan Boyd-Graber and Philip Resnik},
	Url = {http://cs.umd.edu/~jbg//docs/2019_emnlp_mtm.pdf},
}

Weiwei Yang, Jordan Boyd-Graber, and Philip Resnik. Adapting Topic Models using Lexical Associations with Tree Priors. Empirical Methods in Natural Language Processing, 2017. [Bibtex]

@inproceedings{Yang:Boyd-Graber:Resnik-2017,
	Title = {Adapting Topic Models using Lexical Associations with Tree Priors},
	Author = {Weiwei Yang and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2017},
	Location = {Copenhagen, Denmark},
	Url = {http://cs.umd.edu/~jbg//docs/2017_emnlp_tree_prior.pdf},
}

Weiwei Yang, Jordan Boyd-Graber, and Philip Resnik. A Discriminative Topic Model using Document Network Structure. Association for Computational Linguistics, 2016. [Supplement] [Bibtex]

@inproceedings{Yang:Boyd-Graber:Resnik-2016,
	Title = {A Discriminative Topic Model using Document Network Structure},
	Author = {Weiwei Yang and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Association for Computational Linguistics},
	Year = {2016},
	Location = {Berlin, Brandenburg},
	Url = {http://cs.umd.edu/~jbg//docs/2016_acl_docblock.pdf},
}

@inproceedings{Yang:Boyd-Graber:Resnik-2016,
	Title = {Birds of a Feather in the Same Nest: A Discriminative Topic Model using Block-based Priors},
	Author = {Weiwei Yang and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Mid-Atlantic Student Colloquium on Speech, Language, and Learning},
	Year = {2016},
	Location = {Philadephia},
}

@inproceedings{Yang:Boyd-Graber:Resnik-2016,
	Title = {Birds of a Feather in the Same Nest: A Discriminative Topic Model using Block-based Priors},
	Author = {Weiwei Yang and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Mid-Atlantic Student Colloquium on Speech, Language, and Learning},
	Year = {2016},
	Location = {Philadephia},
}

@inproceedings{Yang:Boyd-Graber:Resnik-2015,
	Title = {Birds of a Feather Linked Together: A Discriminative Topic Model using Link-based Priors},
	Url = {http://cs.umd.edu/~jbg//docs/2015_emnlp_hinge_link.pdf},
	Author = {Weiwei Yang and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2015},
	Location = {Lisbon, Portugal},
}

Wenyan Li

@inproceedings{Grissom-II:Shoemaker:Goldman:Shi:Stewart:Rytting:Findlater:Boyd-Graber:Li:Grissom-II:Boyd-Graber-2024,
	Author = {Alvin {Grissom II} and Jo Shoemaker and Benjamin Goldman and Ruikang Shi and Craig Stewart and C. Anton Rytting and Leah Findlater and Jordan Boyd-Graber},
	Location = {Torino, Italy},
	Url = {http://cs.umd.edu/~jbg//docs/2024_lrec_siminthelp.pdf},
	Booktitle = {Linguistic Resources and Evaluation Conference},
	Year = {2024},
	Title = {Rapidly Piloting Real-time Linguistic Assistance for Simultaneous Interpreters with Untrained Bilingual Surrogates},
}

Wenyan Li, Alvin Grissom II, and Jordan Boyd-Graber. An Attentive Recurrent Model for Incremental Prediction of Sentence-final Verbs. Findings of EMNLP, 2020. [Bibtex]

@article{Li:Grissom-II:Boyd-Graber-2020,
	Title = {An Attentive Recurrent Model for Incremental Prediction of Sentence-final Verbs},
	Author = {Wenyan Li and Alvin {Grissom II} and Jordan Boyd-Graber},
	Journal = {Findings of EMNLP},
	Year = {2020},
	Url = {http://cs.umd.edu/~jbg//docs/2020_findings_verbs.pdf},
}

Yogarshi Vyas

@inproceedings{Iyyer:Manjunatha:Guha:Vyas:Boyd-Graber:Daume-III:Davis-2017,
	Title = {The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives},
	Author = {Mohit Iyyer and Varun Manjunatha and Anupam Guha and Yogarshi Vyas and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Larry Davis},
	Booktitle = {Computer Vision and Pattern Recognition},
	Year = {2017},
	Url = {http://cs.umd.edu/~jbg//docs/2017_cvpr_comics.pdf},
}

@inproceedings{Iyyer:Manjunatha:Guha:Vyas:Boyd-Graber:Daume-III:Davis-2017,
	Title = {The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives},
	Author = {Mohit Iyyer and Varun Manjunatha and Anupam Guha and Yogarshi Vyas and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Larry Davis},
	Booktitle = {Computer Vision and Pattern Recognition},
	Year = {2017},
	Url = {http://cs.umd.edu/~jbg//docs/2017_cvpr_comics.pdf},
}

Yoo Yeon Sung

@article{Sung:Fleisig:Hope:Upadhyay:Boyd-Graber-Preprint,
	Title = {GRACE: A Granular Benchmark for Evaluating Model Calibration against Human Calibration},
	Author = {Yoo Yeon Sung and Eve Fleisig and Yu Hope and Ishan Upadhyay and Jordan Lee Boyd-Graber},
	Journal = {ArXiv},
	Year = {Preprint},
	Url = {https://arxiv.org/pdf/2502.19684},
}

Yoo Yeon Sung, Maharshi Gor, Eve Fleisig, Ishani Mondal, and Jordan Lee Boyd-Graber. ADVSCORE: A Metric for the Evaluation and Creation of Adversarial Benchmarks. North American Association for Computational Linguistics, 2025. [Bibtex]
```
@inproceedings{Sung:Gor:Fleisig:Mondal:Boyd-Graber-2025,
	Title = {ADVSCORE: A Metric for the Evaluation and Creation of Adversarial Benchmarks},
	Author = {Yoo Yeon Sung and Maharshi Gor and Eve Fleisig and Ishani Mondal and Jordan Lee Boyd-Graber},
	Journal = {North American Association for Computational Linguistics},
	Year = {2025},
	Url = {http://cs.umd.edu/~jbg//docs/2025_naacl_advscore.pdf},
}
```
Accessible Abstract: Adversarial datasets should validate AI robustness by presenting samples that humans handle well but models struggle with. However, as models advance, these datasets risk becoming obsolete. Assessing whether a dataset remains adversarial is challenging due to the absence of a standardized metric for adversarialness. To address this, we introduce AdvScore, a human-grounded evaluation metric that quantifies a dataset's adversarial nature by accounting for the differing abilities of models and humans while also identifying low-quality examples.
Tasnim Kabir, Yoo Yeon Sung, Saptarashmi Bandyopadhyay, Hao Zou, Abhranil Chandra, and Jordan Lee Boyd-Graber. You Make me Feel like a Natural Question: Training QA Systems on Transformed Trivia Questions. Empirical Methods in Natural Language Processing, 2024. [ArXiv] [Research Talk] [Code] [Bibtex]
```
@inproceedings{Kabir:Sung:Bandyopadhyay:Zou:Chandra:Boyd-Graber-2024,
	Title = {You Make me Feel like a Natural Question: Training QA Systems on Transformed Trivia Questions},
	Author = {Tasnim Kabir and Yoo Yeon Sung and Saptarashmi Bandyopadhyay and Hao Zou and Abhranil Chandra and Jordan Lee Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Location = {Miami},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_emnlp_natural.pdf},
}
```
Accessible Abstract: Many of the questions for training AIs how to answer questions come from the queries users type into search engines (like Google's Natural Questions). Is there a cheaper---perhaps even better---way? We propose a "naturalization" technique to turn high-quality, rigorously edited trivia questions into examples that resembles Natural Questions. Training on our naturalized questions and testing on natural questions comes close to the results with using Natural Questions, and we can improve results on MMLU (a standard modern evaluation set) by using our data.
Yoo Yeon Sung, Naeemul Hassan, and Jordan Boyd-Graber. Not all Fake News is Written: A Dataset and Analysis of Misleading Video Headlines. Empirical Methods in Natural Language Processing, 2023. [Video] [Bibtex]
```
@inproceedings{Sung:Hassan:Boyd-Graber-2023,
	Title = {Not all Fake News is Written: A Dataset and Analysis of Misleading Video Headlines},
	Author = {Yoo Yeon Sung and Naeemul Hassan and Jordan Boyd-Graber},
	Booktitle = {Empirical Methods in Natural Language Processing},
	Year = {2023},
	Location = {Singapore},
	Url = {http://cs.umd.edu/~jbg//docs/2023_emnlp_videoheadline.pdf},
}
```
Accessible Abstract: Misinformation online is not all text-based. More information is being consumed in video form, and both social media companies and external monitors need to know when misleading videos are being shared online. We create a new dataset of misleading videos and describe what makes the problem so challenging.

Yoshinari Fujinuma

Yoshinari Fujinuma, Jordan Boyd-Graber, and Katharina Kann. How Does Multilingual Pretraining Affect Cross-Lingual Transferability?. Association for Computational Linguistics, 2022. [Code] [Bibtex]

@inproceedings{Fujinuma:Boyd-Graber:Kann-2022,
	Author = {Yoshinari Fujinuma and Jordan Boyd-Graber and Katharina Kann},
	Title = {How Does Multilingual Pretraining Affect Cross-Lingual Transferability?},
	Booktitle = {Association for Computational Linguistics},
	Year = {2022},
	Location = {Dublin},
	Url = {http://cs.umd.edu/~jbg//docs/2022_acl_multilingbert.pdf},
}

Mozhi Zhang, Yoshinari Fujinuma, Michael J. Paul, and Jordan Boyd-Graber. Why Overfitting Isn't Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries. Association for Computational Linguistics, 2020. [Preprint] [Video] [Code] [Bibtex]
```
@inproceedings{Zhang:Fujinuma:Paul:Boyd-Graber-2020,
	Author = {Mozhi Zhang and Yoshinari Fujinuma and Michael J. Paul and Jordan Boyd-Graber},
	Title = {Why Overfitting Isn't Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries},
	Booktitle = {Association for Computational Linguistics},
	Year = {2020},
	Location = {The Cyberverse Simulacrum of Seattle},
	Url = {http://cs.umd.edu/~jbg//docs/2020_acl_refine.pdf},
}
```
Accessible Abstract: Computers need to represent words in a computer-readable way. This work talks about how slightly moving these representations for words in different languages to be closer to a small list of translations (like from a dictionary) after doing fancy machine learning works better on downstream tasks (e.g., guessing grammatical category of a word) but hurts on asking the algorithm for translations of unseen words.

@inproceedings{Zhang:Fujinuma:Boyd-Graber-2020,
	Title = {Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification},
	Author = {Mozhi Zhang and Yoshinari Fujinuma and Jordan Boyd-Graber},
	Booktitle = {Association for the Advancement of Artificial Intelligence},
	Year = {2020},
	Location = {New York, New York},
	Url = {https://arxiv.org/abs/1812.09617},
}

@inproceedings{Fujinuma:Paul:Boyd-Graber-2019,
	Title = {A Resource-Free Evaluation Metric for Cross-Lingual Word Embeddings Based on Graph Modularity},
	Author = {Yoshinari Fujinuma and Michael Paul and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2019},
	Location = {Florence, Italy},
	Url = {http://cs.umd.edu/~jbg//docs/2019_acl_modularity.pdf},
}

@article{Pruss:Fujinuma:Daughton:Paul:Arnot:Szafir:Boyd-Graber-2019,
	Author = {Dasha Pruss and Yoshinari Fujinuma and Ashlynn Daughton and Michael Paul and Brad Arnot and Danielle Szafir and Jordan Boyd-Graber},
	Journal = {PlosOne},
	Year = {2019},
	Title = {Zika discourse in the Americas: A multilingual topic analysis of {Twitter}},
	Url = {https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0216922},
}

@inproceedings{Zhang:Fujinuma:Boyd-Graber-2018,
	Title = {Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification},
	Author = {Mozhi Zhang and Yoshinari Fujinuma and Jordan Boyd-Graber},
	Booktitle = {ACL Workshop on Deep Learning Approaches for Low-Resource Natural Language Processing},
	Year = {2018},
	Location = {Melbourne, Australia},
}

@inproceedings{Zhang:Fujinuma:Boyd-Graber-2018,
	Title = {Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification},
	Author = {Mozhi Zhang and Yoshinari Fujinuma and Jordan Boyd-Graber},
	Booktitle = {ACL Workshop on Deep Learning Approaches for Low-Resource Natural Language Processing},
	Year = {2018},
	Location = {Melbourne, Australia},
}

Yuening Hu

@article{Gerow:Hu:Boyd-Graber:Blei:Evans-2018,
	Title = {Measuring Discursive Influence Across Scholarship},
	Author = {Aaron Gerow and Yuening Hu and Jordan Boyd-Graber and David M. Blei and James A. Evans},
	Journal = {Proceedings of the National Academies of Science},
	Year = {2018},
}

@article{Gerow:Hu:Boyd-Graber:Blei:Evans-2018,
	Title = {Measuring Discursive Influence Across Scholarship},
	Author = {Aaron Gerow and Yuening Hu and Jordan Boyd-Graber and David M. Blei and James A. Evans},
	Journal = {Proceedings of the National Academies of Science},
	Year = {2018},
}

Jordan Boyd-Graber, Yuening Hu, and David Mimno. Applications of Topic Models. 2017. [Preprint] [Bibtex]

@book{Boyd-Graber:Hu:Mimno-2017,
	Editor = {Doug Oard},
	Author = {Jordan Boyd-Graber and Yuening Hu and David Mimno},
	Title = {Applications of Topic Models},
	Publisher = {NOW Publishers},
	Series = {Foundations and Trends in Information Retrieval},
	Year = {2017},
	Volume = {11},
	Number = {2--3},
	Url = {http://www.nowpublishers.com/article/Details/INR-030},
}

Jordan Boyd-Graber, Yuening Hu, and David Mimno. Applications of Topic Models. 2017. [Preprint] [Bibtex]

@book{Boyd-Graber:Hu:Mimno-2017,
	Editor = {Doug Oard},
	Author = {Jordan Boyd-Graber and Yuening Hu and David Mimno},
	Title = {Applications of Topic Models},
	Publisher = {NOW Publishers},
	Series = {Foundations and Trends in Information Retrieval},
	Year = {2017},
	Volume = {11},
	Number = {2--3},
	Url = {http://www.nowpublishers.com/article/Details/INR-030},
}

@inproceedings{Smith:Chuang:Hu:Boyd-Graber:Findlater-2014,
	Title = {Concurrent Visualization of Relationships between Words and Topics in Topic Models},
	Author = {Alison Smith and Jason Chuang and Yuening Hu and Jordan Boyd-Graber and Leah Findlater},
	Booktitle = {ACL Workshop on Workshop on Interactive Language Learning, Visualization, and Interfaces},
	Year = {2014},
	Location = {Baltimore, Maryland},
}

@inproceedings{Nguyen:Hu:Boyd-Graber-2014,
	Title = {Anchors Regularized: Adding Robustness and Extensibility to Scalable Topic-Modeling Algorithms},
	Author = {Thang Nguyen and Yuening Hu and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2014},
	Location = {Baltimore, MD},
	Url = {http://cs.umd.edu/~jbg//docs/2014_acl_anchor_reg.pdf},
}

@inproceedings{Nguyen:Hu:Boyd-Graber-2014,
	Title = {Anchors Regularized: Adding Robustness and Extensibility to Scalable Topic-Modeling Algorithms},
	Author = {Thang Nguyen and Yuening Hu and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2014},
	Location = {Baltimore, MD},
	Url = {http://cs.umd.edu/~jbg//docs/2014_acl_anchor_reg.pdf},
}

@inproceedings{Hu:Zhai:Eidelman:Boyd-Graber-2014,
	Title = {Polylingual Tree-Based Topic Models for Translation Domain Adaptation},
	Author = {Yuening Hu and Ke Zhai and Vlad Eidelman and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2014},
	Location = {Baltimore, MD},
	Url = {http://cs.umd.edu/~jbg//docs/2014_acl_ptlda_mt.pdf},
}

Yuening Hu, Jordan Boyd-Graber, Brianna Satinoff, and Alison Smith. Interactive Topic Modeling. Machine Learning, 2014. [Journal] [Frontend Code] [Backend Code] [Bibtex]

@article{Hu:Boyd-Graber:Satinoff:Smith-2014,
	Title = {Interactive Topic Modeling},
	Author = {Yuening Hu and Jordan Boyd-Graber and Brianna Satinoff and Alison Smith},
	Journal = {Machine Learning},
	Url = {http://cs.umd.edu/~jbg//docs/2014_mlj_itm.pdf},
	Year = {2014},
	Volume = {95},
	Pages = {423--469},
	Publisher = {Springer},
}

Thang Nguyen, Yuening Hu, and Jordan Boyd-Graber. Evaluating Regularized Anchor Words. NIPS Workshop on Topic Models: Computation, Application, and Evaluation, 2013. [Bibtex]

@inproceedings{Nguyen:Hu:Boyd-Graber-2013,
	Title = {Evaluating Regularized Anchor Words},
	Author = {Thang Nguyen and Yuening Hu and Jordan Boyd-Graber},
	Booktitle = {NIPS Workshop on Topic Models: Computation, Application, and Evaluation},
	Year = {2013},
	Location = {Lake Tahoe, Nevada},
}

@inproceedings{Hu:Zhai:Edelman:Boyd-Graber-2013,
	Title = {Topic Models for Translation Domain Adaptation},
	Author = {Yuening Hu and Ke Zhai and Vlad Edelman and Jordan Boyd-Graber},
	Booktitle = {NIPS Workshop on Topic Models: Computation, Application, and Evaluation},
	Year = {2013},
	Location = {Lake Tahoe, Nevada},
}

@inproceedings{Hu:Boyd-Graber:Daume-III:Ying-2013,
	Url = {http://cs.umd.edu/~jbg//docs/2013_coalescent.pdf},
	Title = {Binary to Bushy: Bayesian Hierarchical Clustering with the Beta Coalescent},
	Author = {Yuening Hu and Jordan Boyd-Graber and Hal {Daum\'{e} III} and Z. Irene Ying},
	Booktitle = {Neural Information Processing Systems},
	Year = {2013},
}

@inproceedings{Nguyen:Hu:Boyd-Graber:Resnik-2013,
	Title = {Argviz: Interactive Visualization of Topic Dynamics in Multi-party Conversations},
	Author = {Viet-An Nguyen and Yuening Hu and Jordan Boyd-Graber and Philip Resnik},
	Booktitle = {North American Association for Computational Linguistics},
	Year = {2013},
	Location = {Atlanta Georgia},
	Url = {http://cs.umd.edu/~jbg//docs/2013_argviz.pdf},
}

Yuening Hu and Jordan Boyd-Graber. Efficient Tree-Based Topic Modeling. Association for Computational Linguistics, 2012. [Bibtex]

@inproceedings{Hu:Boyd-Graber-2012,
	Title = {Efficient Tree-Based Topic Modeling},
	Author = {Yuening Hu and Jordan Boyd-Graber},
	Booktitle = {Association for Computational Linguistics},
	Year = {2012},
	Location = {Jeju, South Korea},
	Url = {http://cs.umd.edu/~jbg//docs/acl_2012_fttm.pdf},
}

Yuening Hu and Jordan Boyd-Graber. Suggesting Constraints for Interactive Topic Modeling. ICML Workshop on Machine Learning in Human Computation and Crowdsourcing, 2012. [Bibtex]

@inproceedings{Hu:Boyd-Graber-2012,
	Title = {Suggesting Constraints for Interactive Topic Modeling},
	Author = {Yuening Hu and Jordan Boyd-Graber},
	Booktitle = {ICML Workshop on Machine Learning in Human Computation and Crowdsourcing},
	Year = {2012},
	Location = {Edinburgh, Scotland},
}

@inproceedings{Hu:Zhai:Williamson:Boyd-Graber-2012,
	Title = {Modeling Images using Transformed {I}ndian Buffet Processes},
	Author = {Yuening Hu and Ke Zhai and Sinead Williamson and Jordan Boyd-Graber},
	Booktitle = {International Conference on Machine Learning},
	Year = {2012},
	Url = {http://cs.umd.edu/~jbg//docs/mtibp_icml_2012.pdf},
	Location = {Edinburgh, Scotland},
}

Yuening Hu and Jordan Boyd-Graber. Bayesian Hierarchical Clustering with Beta Coalescents. Mid-Atlantic Student Colloquium on Speech, Language, and Learning, 2012. [Bibtex]

@inproceedings{Hu:Boyd-Graber-2012,
	Title = {Bayesian Hierarchical Clustering with Beta Coalescents},
	Author = {Yuening Hu and Jordan Boyd-Graber},
	Booktitle = {Mid-Atlantic Student Colloquium on Speech, Language, and Learning},
	Year = {2012},
}

Yuening Hu, Jordan Boyd-Graber, and Brianna Satinoff. Interactive Topic Modeling. Association for Computational Linguistics, 2011. [Slides] [Code] [Bibtex]

@inproceedings{Hu:Boyd-Graber:Satinoff-2011,
	Title = {Interactive Topic Modeling},
	Author = {Yuening Hu and Jordan Boyd-Graber and Brianna Satinoff},
	Booktitle = {Association for Computational Linguistics},
	Year = {2011},
	Location = {Portland, Oregon},
	Url = {http://cs.umd.edu/~jbg//docs/itm.pdf},
}

Zongxia Li

@inproceedings{Li:Mao:Stephens:Goel:Walpole:Fung:Dima:Boyd-Graber-2024,
	Title = {TENOR: Topic Enabled Neural Organization and Recommendation: Evaluating Topic Models in Task Based Settings},
	Author = {Zongxia Li and Andrew Mao and Daniel Kofi Stephens and Pranav Goel and Emily Walpole and Juan Francisco Fung and Alden Dima and Jordan Lee Boyd-Graber},
	Booktitle = {European Association for Computational Linguistics},
	Year = {2024},
	Url = {http://cs.umd.edu/~jbg//docs/2024_eacl_tenor.pdf},
}

@article{Li:Mondal:Nghiem:Liang:Boyd-Graber-2024,
	Title = {PEDANTS (Precise Evaluations of Diverse Answer Nominee Text for Skinflints): Use Evaluation Metrics Wisely---Efficient Evaluation Analysis and Benchmarking for Open-Domain Question Answering},
	Author = {Zongxia Li and Ishani Mondal and Huy Nghiem and Yijun Liang and Jordan Boyd-Graber},
	Journal = {Findings of the Empirical Methods in Natural Language Processing},
	Location = {Miami},
	Year = {2024},
	Url = {https://arxiv.org/abs/2402.11161},
}

@article{Mondal:Li:Hou:Natarajan:Garimella:Bandyopadhyay:Boyd-Graber-2024,
	Title = {SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement},
	Author = {Ishani Mondal and Zongxia Li and Yufang Hou and Anandhavelu Natarajan and Aparna Garimella and Sambaran Bandyopadhyay and Jordan Boyd-Graber},
	Year = {2024},
	Journal = {Findings of the Empirical Methods in Natural Language Processing},
	Url = {http://cs.umd.edu/~jbg//docs/2024_emnlp_diagramgen.pdf},
}

LaTeX Version BibTex Version Text Version