NAACL HLT 2009

Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics

Tutorial Abstracts

Ciprian Chelba, Paul Kantor, Brian Roark
Tutorial Chairs

May 31, 2009
Boulder, Colorado

Table of Contents

Data Intensive Text Processing with MapReduce
    Jimmy Lin and Chris Dyer, p. 1

Distributed Language Models
    Thorsten Brants and Peng Xu, p. 3

Dynamic Programming-based Search Algorithms in NLP
    Liang Huang, p. 5

Extracting World and Linguistic Knowledge from Wikipedia
    Simone Paolo Ponzetto and Michael Strube, p. 7

OpenFst: An Open-Source, Weighted Finite-State Transducer Library and its Applications to Speech and Language
    Michael Riley, Cyril Allauzen and Martin Jansche, p. 9

OntoNotes: The 90% Solution
    Sameer S. Pradhan and Nianwen Xue, p. 11

VerbNet overview, extensions, mappings and applications
    Karin Kipper Schuler, Anna Korhonen and Susan Brown, p. 13

Writing Systems, Transliteration and Decipherment
    Kevin Knight and Richard Sproat, p. 15