Graham Neubig

Associate Professor, Language Technology Institute, Carnegie Mellon University
Affiliated Faculty, Machine Learning Department, Carnegie Mellon University
Chief Scientist, All Hands AI
My research is concerned with language and its role in human communication. In particular, my long-term research goal is to break down barriers in human-human or human-machine communication through the development of natural language processing (NLP) technologies. This includes the development of technology for machine translation, which helps break down barriers in communication for people who speak different languages, and natural language understanding, which helps computers understand and respond to human language. Within this overall goal of breaking down barriers to human communication, I have focused on several aspects of language that both make it interesting as a scientific subject, and hold potential for the construction of practical systems. Specific areas of interest include:
- Multilingual Language Processing
- Machine Translation
- Syntactic and Semantic Analysis
- Cross-lingual Learning
- Natural Language Interfaces to Computers
- Natural Language to Code Generation
- Question Answering and Information Extraction
- Modeling Human-Computer or Human-Human Interaction
- Machine Learning for NLP
- Explainability and Interpretable Evaluation
- Neural Network Models for NLP
- Unsupervised and Semi-supervised Learning
Academic/Career History
- 7/2020-onward Carnegie Mellon University (CMU): Associate Professor
- 9/2016-7/2020 Carnegie Mellon University (CMU): Assistant Professor
- 4/2012-8/2016 Nara Institute of Science and Technology (NAIST): Assistant Professor
- 4/2010-3/2012 Kyoto University: Doctoral course in Intelligent Information Systems
- 4/2008-3/2010 Kyoto University: Master's course in Intelligent Information Systems
- 8/2006-3/2008 Hyogo Prefectural Government: Coordinator for International Relations
- 9/2005-7/2006 Tajima Agricultural High School: Assistant Language Teacher
- 8/2001-5/2005 University of Illinois, Urbana-Champaign: B.S. Computer Science
Papers
Here is a list of a few of my current favorite papers:
- Zora Zhiruo Wang, Apurva Gandhi, Graham Neubig, Daniel Fried.
Inducing Programmatic Skills for Agentic Tasks (BibTex, Code/Data)
Conference on Language Modeling (COLM). Vienna, Austria. October 2025 (To Appear). - José Pombal, Dongkeun Yoon, Patrick Fernandes, Ian Wu, Seungone Kim, Ricardo Rei, Graham Neubig, Andre Martins.
M-Prometheus: A Suite of Open Multilingual LLM Judges (BibTex, Code/Data)
Conference on Language Modeling (COLM). Vienna, Austria. October 2025 (To Appear). - Patrick Fernandes, Sweta Agrawal, Emmanouil Zaranis, Andre Martins, Graham Neubig.
Do LLMs Understand Your Translations? Evaluating Paragraph-level MT with Question Answering (BibTex, Code/Data)
Conference on Language Modeling (COLM). Vienna, Austria. October 2025 (To Appear). - Ethan Chern, Steffi Chern, Shiqi Chen, Weizhe Yuan, Kehua Feng, Chunting Zhou, Junxian He, Graham Neubig, Pengfei Liu.
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios (BibTex, Code/Data)
Conference on Language Modeling (COLM). Montreal, Canada. October 2025 (To Appear). - Lindia Tjuatja, Graham Neubig.
BehaviorBox: Automated Discovery of Fine-Grained Performance Differences Between Language Models (BibTex, Code/Data)
Annual Meeting of the Association for Computational Linguistics (ACL). Vienna, Austria. July 2025.
All of my publications can be found on my publications page, and my most highly cited papers can be found on Google Scholar.
Other Links
- Slides for tutorials and classes can be found on my teaching page
- Software and resources that I've developed can be found on my software page.
- Tools for Natural Language Processing
- The Kyoto Free Translation Task: A task that can be used for evaluation of English-Japanese translation systems
- Japanese Parallel Data: A list of various data that can be used to create machine translation systms to/from Japanese