 
LBSC 796/INFM 718R 
Information Retrieval Systems
Fall 2007 
Recommended Readings
Downloading readings from the Web may require Microsoft Word or
Acrobat Reader, depending on the format.  
Some other books on information retrieval:
  -  Ricardo Baeza-Yates and Berthier Rubiero-Neto, Modern
       Information Retrieval, Addison Wesley, 1999.
  
-  Ian H. Witten, Alaitair Moffat, and Timmothy C. Bell,
       Managing Gigabytes, Morgan Kaufmann, Second Edition,
       1999.
  
-  David A. Grossman and Ophir Frieder, Information Retrieval:
       Algorithms and Heuristics, Kluwer Academic, 2004.
  
-  William B. Frakes and Ricardo Baeza-Yates, ed., Information
       Retrieval: Data Structures and Algorithms, Prentice-Hall,
       1992.
  
-  Tomek Strzalkowski, ed., Natural Language Information
       Retrieval, Kluwer, 1999.
  
-  Christopher D. Manning and Heinrich Schuetze, Statistical
       Natural Language Processing, MIT Press, 2000.
  
-  Karen Sparck-Jones and Peter Willet, ed., Readings in
       Information Retrieval, Morgan-Kaufmann, 1997.
  
-  David C. Blair, Language and Representation in
       Information Retrieval, Elsevier Science, 1990.
Recommended Reading for Week 1 (Overview)
  -  Tefko Saracevic, (1999) Information
       Science. Journal of the American Society for Information
       Science, 50(12)1051-1063.
  
-  David C. Blair, Language and Representation in
       Information Retrieval, Elsevier Science, 1990.  Chapter 1,
       pages 1-10.
Recommended Reading for Week 2 (Evidence from Content)
  -  Christopher Manning and Heinrich Schuetze, Foundations of
       Statistical Natural Language Processing, Chapter 5
       (Collocations), MIT Press, 1999.  Available from the
       book's Web site.
  
  
-  Sparck-Jones, Karen, "What is the Role of NLP in Text Retrieval?," in
       Tomek Strzalkowski (ed.), Natural Language Information
       Retrieval,
       Kluwer, 1999, Chapter 1, pp. 1-24.  On reserve in the Paul
       Wasserman Library.
       
  
-  Jacquemin, C. and E. Tzoukermann.  "NLP for Term Variant Extraction:
       Synergy between Morphology, Lexicon, and Syntax," in
       T. Strzalkowski (ed.), Natural Language Information Retrieval, 
       Kluwer, 1999, Chapter 2, pp. 25-70.  On reserve in the Paul
       Wasserman Library.
       
  
-  Prager, John, Eric Brown, Anni Coden and Dragomir Radev.
       "Question-Answering by Predictive Annotation," in
       Proceedings of the 23rd Annual International ACM SIGIR Conference on
       Research and Development in Information Retrieval 
       July 24-28, 2000, Athens Greece, pp. 184-191.  Available on
       campus from the
       ACM
       Digital Library.
  
-  Donna Harman "Inverted Files," in William B. Frakes and
       Ricardo Baeza-Yates, Information Retrieval: Data Structures and
       Algorithms, Prentice Hall, 1992, Chapter 3.
  
-  George A. Miller. (1995) WordNet:
       A Lexical Database for English. Communications of the ACM,
       38(11)39-41. Available on campus from the ACM Digital Library
  
-  John Prager, Eric Brown, and Anni Coden. (2000)
       Question-Answering by Predictive Annotation. Proceedings of
       the 23rd Annual International ACM SIGIR Conference on Research
       and Development in Information Retrieval (SIGIR 2000).
Recommended Reading for Week 3 (Ranked retrieval)
  -  Donna Harman "Ranking Algorithms," in William B. Frakes and
       Ricardo Baeza-Yates, Information Retrieval: Data Structures and
       Algorithms, Prentice Hall, 1992, Chapter 14.
  
-  Amit Singhal, "Pivoted Document Length Normalization," SIGIR
       1996.  Available on campus through the ACM
       Digital Library.
       
  
-  S.E. Robertson et al, "Okapi at TREC-3," Proceedings of the
       Third Text Retrieval Conference, 1994.  Available on the TREC
       Web site.
  
-  James Allan, ed. "Challlenges in Information Retrieval and
       Language Modeling", SIGIR Forum, 37(1)31-47, Spring, 2003.
       Available from SIGIR.
  
-  W. B. Croft and J. Lafferty, ed., Language Modeling for
       Information Retrieval, Kluwer, 2003.
  
-  David R. H. Miller, Tim Leek, and Richard M. Schwartz,
       "A Hidden Markov Model Information Retrieval System,"
       SIGIR 99.  Available on campus from the
       ACM
       Digital Library.
Recommended Reading for Week 4 (Interaction)
  -  Robert S. Taylor, "The Process of Asking Questions,"
       American Documentation, 13(4)391-396, 1962.
  
-  Peter Pirolli and Stuart Card, "Information Foraging," 
       Psychological Review. 106(4)643-675, 1999.  May be
       available on campus through Science
       Direct
  
-  Efthimis N. Efthimiadis and Stephen E. Robertson. (1989)
       Feedback and Interaction in Information Retrieval. In
       Charles Oppenheim, ed., Perspectives in Information
       Management. London: Butterworth.
Recommended Readings for Week 5 (Evaluation)
  -  Ellen M. Voorhees, "Variations in Relevance Judgments and the
       Measurement of Retrieval Effectiveness," Information
       Processing and Management, 36(5)697-716.  Available on
       campus from Science
       Direct
  
-  Chris Buckley and Ellen M. Voorhees, "Evaluating Evaluation
       Measure Stability", SIGIR 2000.  Available on campus through the ACM
       Digital Library
  
-  Ellen M. Voorhees and Chris Buckley, "The Effect of Topic Set
       Size on Retrieval Experiment Error," SIGIR 2002, Available on
       campus through the ACM
       Digital Library
  
-  R. Mamantha, Ao Feng and James Allan, "A Critical Evaluation of
       TDT's Cost Function," SIGIR 2002.  Available on campus from the ACM
       Digital Library
  
-  Stefano Mizzaro. (1999) How Many Relevances in Information
       Retrieval? Interacting With Computers, 10(3)305-322.
  
-  Andrew H. Turpin and William Hersh, "Why Batch and User
       Evaluations Do Not Give the Same Results," SIGIR 2001.
       Available on campus from the ACM
       Digital Library.
Recommended Reading for Week 6 (Web Search)
  -  Eric Brill, Jimmy Lin, Michele Banko, Susan Dumais, and Andrew
       Ng. Data-Intensive Question Answering. Proceedings of the Tenth
       Text REtrieval Conference (TREC 2001).
Recommended Reading for Week 7 (Evidence from Behavior)
  -  Larry Page, Sergey Brin, Rajeev Motwani and Terry Winograd, "Page
       Rank Citation Ranking: Bringing
       Order to the Web," Stanford Digital Library Working Paper
       SIDL-WP-1999-0120, 1998.  Available from CiteSeer.
  
-  Diane Kelly and Jamie Teevan, "Implicit Feedback for Inferring
       User Preference: A Bibliography," SIGIR Forum, 37(2)18-28, Fall
       2003.  Available from the SIGIR
       Forum Web site.
  
-  Jon M. Kleinberg, "Authoratative Sources in a Hyperlinked
       Environment," Journal of the ACM, 46(5)604-632.  Available on
       campus from the ACM
       Digital Library.
  
-  Douglas W. Oard and Jinmook Kim, "Modeling Information Content
       Using Observable Behavior," in Proceedings of the 2001
       Annual Meeting of the American Society for Information Science and
       Technology, Washington, November, 2001.  Available from Doug Oard's Web
       site
       
Recommended Reading for Week 8 (Scanned Documents)
  -  David Doermann, "The Indexing and Retrieval of Document Images:
       A Survey",
       Computer Vision and Image Understanding, 70(3)287-298,
       1998.  Available on campus from Science Direct.
  
-  Tseng, Y.-H. and Oard, D. W., Document Image Retrieval
       Techniques for Chinese.  In Proceedings of the 2001 Symposium
       on Document Image Understanding Technology, Columbia, MD, 2001.
       Available from Doug Oard's
       Web site
Recommended Reading for Week 9 (Evidence from Metadata)
  -  Carl Lagoze and Herbert Van de Stomple, "The Open Archives
       Initiative: Building a Low-Barrier Interoperability Framework,"
       Proceedings of the First ACM/IEEE-CS Joint Conference on Digital
       Libraries, Roanoke, VA, June 2001, pp. 54-62.  Available on
       campus from the ACM
       Digital Library.
  
-  Diane Hillman, "National Science Digital Library (NSDL)
       Metadata Primer," Web publication, 2003.  Available
       from the Open Archives
       Initiative Web site.
Recommended Reading for Week 10 (Filtering)
  -  Douglas W. Oard, "The State of the Art in Text Filtering," User
       Modeling and User-Adapted Interaction, 2007.
Recommended Reading for Week 11 (Audio)
  -  Jonathan Foote, "An Overview of Audio Information Retrieval,"
       ACM-Springer Multimedia Systems, 7(1)2-10,
       1999. Available from CiteSeer
  
-  John S. Garofolo, Cedric G. P. Auzanne and Ellen M. Voorhees,
       "The TREC Spoken Document Retrieval Track: A success story,"
       in Proceedings of the Eighth Text Retrieval Conference,
       1999, pp. 107-130.  Available from the TREC
       Web site
  
-  Rodger J. McNabb, Lloyd A. Smith, Ian H. Witten, and Clare
       L. Henderson, "Tune
       Retrieval in the Multimedia Library," Multimedia Tools and
       Applications, 10(2-3)113-132, 2000.  Available from the
       New
       Zealand Digital Library Web site.
Recommended Reading for Week 12 (Cross-Language Search)
  -  Daqing He, et al., "Making MIRACLEs: Interactive Translingual
       Search for Cebuano and Hindi," ACM Transactions on Asian
       Language Information Retrieval, 2(2-3).  Available from the
       ACM Digital Library.
Recommended Reading for Week 13 (Photographs and Video)
  -  Vekant N. Gudivada and Vijay V. Raghavan, "Modeling and
       Retrieving Images by Content," Information Processing and
       Management, 33(4)427-452, 1997.  Available on campus from
       Science Direct.
  
-  Howard Wactlar et al., "Complementary Audio and Video Analysis
       for Broadcast News Archives," Communicatuions of the
       ACM, 43(2)42-47, 2000.  Available on campus from the ACM
       Digital Library.
       
Doug Oard
Last modified: Aug 19 2007