Graham Neubig's Research

I am currently a masters student at the Kyoto University Graduate School of Informatics, affiliated with the Media Archiving Research Laboratory run by Professor Tatsuya Kawahara. Some of my research interests include:

Academic/Career History

Software I've Developed

Kylm ("KYoto Language Modeling toolkit"): A language modeling toolkit written in Java. It currently is able to train n-gram models with a variety of smoothing techniques. Eventually it will have the ability to perform detailed comparisons of a number of different types of language models, and simply model unknown words using sub-word structure (characters).

Kyfd ("KYoto Fst Decoder"): A beam-search decoder for FST models written in C++. It features the ability to keep track of separate component weights for log-linear tuning, use hierarchical failure transitions, and handle lattice input.

KyTea ("KYoto Text Analysis toolkit"): A toolkit for text analysis including word (morpheme) segmentation and pronunciation estimation.

Links

Research Papers

Conference Papers

For a list of papers in Japanese see the Japanese page.