ACL-08: HLT 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies Proceedings of the Student Research Workshop June 16, 2008 The Ohio State University Columbus, Ohio, USA Production and Manufacturing by Omnipress Inc. 2600 Anderson Street Madison, WI 53707 USA Sponsored by The National Science Foundation c 2008 The Association for Computational Linguistics Order copies of this and other ACL proceedings from: Association for Computational Linguistics (ACL) 209 N. Eighth Street Stroudsburg, PA 18360 USA Tel: +1-570-476-8006 Fax: +1-570-476-0860 acl@aclweb.org ii Introduction Welcome to the ACL-08: HLT Student Research Workshop! The Student Research Workshop is now an established tradition at ACL conferences and provides a venue for student researchers investigating topics in Computational Linguistics and Natural Language Processing to present their work and receive feedback. New this year was encouraging researchers in Information Retrieval to be involved, both reviewers and authors. We received a total of 27 submissions coming from 11 different countries, and accepted 12 of them. 5 will be presented orally, and 7 as posters, during a common poster session with the main conference. A total of 61 students and senior researchers agreed to serve on the program committee, which allowed us to assign 5 reviewers per paper. We would like to thank the reviewers for understanding the spirit of the Student Research Workshop and giving careful and constructive reviews. We hope their comments will be helpful to all the students who submitted their work. All presenters received financial support from the U.S. National Science Foundation to assist them in their travel to Columbus. We are very grateful to Jan Wiebe, our faculty advisor, for her advice, constant support, and obtaining funding. Finally, we would like to thank the general chair of ACL-08: HLT, Kathleen McKeown, the program chairs, Johanna Moore, Simone Teufel, James Allan, and Sadaoki Furui, the publications chairs Joakim Nivre and Noah Smith, Chris Brew and the local organization committee, and Priscilla Rasmussen. The ACL-08: HLT Student Research Workshop co-chairs: Ebru Arisoy, Keisuke Inoue and Wolfgang Maier iii Co-chairs: ¸ Ebru Arisoy, Bogazici University, Turkey ¨ Wolfgang Maier, University of Tubingen, Germany Keisuke Inoue, University of Syracuse, USA Faculty advisor: Jan Wiebe, University of Pittsburgh, USA Program committee: Murat Akbacak, SRI International, USA ¨ Tanel Alumae, Tallinn University of Technology, Estonia Michiel Bacchiani, Google Inc., USA Timothy Baldwin, University of Melbourne, Australia Chris Bartels, University of Washington, USA ¨ Tilman Becker, DFKI Saarbrucken, Germany Marine Carpuat, Hong Kong University of Science and Technology, Hong Kong ¨¸ Gulsen Cebiroglu Eryigit, Istanbul Technical University, Turkey ¨ Ozlem Cetinoglu, Sabanci University, Turkey ¸ Mathias Creutz, Nokia Research Center, Finland Montse Cuadros, Polytechnic University of Catalonia, Spain Anne Diekema, Syracuse University, USA Markus Dreyer, Johns Hopkins University, USA Kevin Duh, University of Washington, USA Koji Eguchi, Kobe University, Japan Hakan Erdogan, Sabanci University, Turkey Katja Filippova, EML Research, Germany Seeger Fisher, OGI School of Science and Engineering, USA ¨ Dilek Hakkani-Tur, ICSI, USA Dustin Hillard, University of Washington, USA ¨ Teemu Hirsimaki, Helsinki University of Technology, Finland Tatsuya Kawahara, Kyoto University, Japan Eric Kow, University of Brighton, UK ¨ Sandra Kubler, Indiana University, USA Giridhar Kumaran, University of Massachusetts Amherst, USA Mikko Kurimo, Helsinki University of Technology, Finland ¨ Staffan Larsson, Goteborg University, Sweden Lin-Shan Lee, National Taiwan University, Taiwan Kyung-Soon Lee, Chonbuk National University, Korea ¨ Timm Lichte, University of Tubingen, Germany Andrej Ljolje, AT&T Labs - Research, USA v Berenike Loos, German National Library, Germany Robert Luk, The Hong Kong Polytechnic University, Hong Kong Lambert Mathias, Johns Hopkins University, USA Olena Medelyan, University of Waikato, New Zealand Quiozhu Mei, University of Illinois at Urbana-Champaign, USA Simon Mille, Pompeu Fabra University, Spain ¨ Mathias Mohl, Saarland University, Germany Kemal Oflazer, Sabanci University, Turkey Paul Ogilvie, mSpoke, Inc., USA Constantin Orasan, University of Wolverhampton, UK ¨ Yannick Parmentier, University of Tubingen, Germany Thomas Pellegrini, LIMSI, France ´ Adam Przepiorkowski, Institute of Computer Science, Polish Academy of Sciences, Poland ¨ Janne Pylkkonen, Helsinki University of Technology, Finland ¨ Georg Rehm, University of Tubingen, Germany Brian Roark, OGI School of Science and Engineering, USA ¸ Hasim Sak, Bogazici University, Turkey ¸ ¸ Murat Saraclar, Bogazici University, Turkey ¸ Ruhi Sarikaya, IBM Watson Research Center, USA Oliver Schonefeld, University of Bielefeld, Germany Izak Shafran, OGI School of Science and Engineering, USA Anders Søgaard, University of Potsdam, Germany Richard Sproat, University of Illinois at Urbana-Champaign, USA Yael Sygal, University of Haifa, Israel ¨ Cuneyd Tantug, Istanbul Technical University, Turkey ¨ ¨ Gokhan Tur, SRI International, USA Suzan Verberne, University of Nijmegen, The Netherlands Ellen Voorhees, National Institute of Standards and Technology, USA Christopher White, Johns Hopkins University, USA Hans Friedrich Witschel, University of Leipzig, Germany vi Table of Contents A Supervised Learning Approach to Automatic Synonym Identification Based on Distributional Features Masato Hagiwara . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 An Integraged Architecture for Generating Parenthetical Constructions Eva Banik . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 Inferring Activity Time in News through Event Modeling Vladimir Eidelman . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 Combining Source and Target Language Information for Name Tagging of Machine Translation Output Shasha Liao . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19 A Re-examination on Features in Regression Based Approach to Automatic MT Evaluation Shuqi Sun, Yin Chen and Jufeng Li . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 The Role of Positive Feedback in Intelligent Tutoring Systems Davide Fossati . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31 Arabic Language Modeling with Finite State Transducers Ilana Heintz . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37 Impact of Initiative on Collaborative Problem Solving Cynthia Kersey . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43 An Unsupervised Vector Approach to Biomedical Term Disambiguation: Integrating UMLS and Medline Bridget McInnes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49 A Subcategorization Acquisition System for French Verbs ´ Cedric Messiant . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55 Adaptive Language Modeling for Word Prediction Keith Trnka . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61 A Hierarchical Approach to Encoding Medical Concepts for Clinical Notes Yitao Zhang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 67 vii Workshop Program Monday, June 16, 2008 Oral Session 3:45­4:10 A Supervised Learning Approach to Automatic Synonym Identification Based on Distributional Features Masato Hagiwara An Integraged Architecture for Generating Parenthetical Constructions Eva Banik Inferring Activity Time in News through Event Modeling Vladimir Eidelman Combining Source and Target Language Information for Name Tagging of Machine Translation Output Shasha Liao A Re-examination on Features in Regression Based Approach to Automatic MT Evaluation Shuqi Sun, Yin Chen and Jufeng Li Poster Session 6:00­8:30 The Role of Positive Feedback in Intelligent Tutoring Systems Davide Fossati Arabic Language Modeling with Finite State Transducers Ilana Heintz Impact of Initiative on Collaborative Problem Solving Cynthia Kersey An Unsupervised Vector Approach to Biomedical Term Disambiguation: Integrating UMLS and Medline Bridget McInnes A Subcategorization Acquisition System for French Verbs ´ Cedric Messiant 4:10­4:35 4:35­5:00 5:00­5:25 5:25­5:50 ix Monday, June 16, 2008 (continued) Adaptive Language Modeling for Word Prediction Keith Trnka A Hierarchical Approach to Encoding Medical Concepts for Clinical Notes Yitao Zhang x