LBSC 796/INFM 718R
Information Retrieval Systems
Fall 2007
Recommended Readings
Downloading readings from the Web may require Microsoft Word or
Acrobat Reader, depending on the format.
Some other books on information retrieval:
- Ricardo Baeza-Yates and Berthier Rubiero-Neto, Modern
Information Retrieval, Addison Wesley, 1999.
- Ian H. Witten, Alaitair Moffat, and Timmothy C. Bell,
Managing Gigabytes, Morgan Kaufmann, Second Edition,
1999.
- David A. Grossman and Ophir Frieder, Information Retrieval:
Algorithms and Heuristics, Kluwer Academic, 2004.
- William B. Frakes and Ricardo Baeza-Yates, ed., Information
Retrieval: Data Structures and Algorithms, Prentice-Hall,
1992.
- Tomek Strzalkowski, ed., Natural Language Information
Retrieval, Kluwer, 1999.
- Christopher D. Manning and Heinrich Schuetze, Statistical
Natural Language Processing, MIT Press, 2000.
- Karen Sparck-Jones and Peter Willet, ed., Readings in
Information Retrieval, Morgan-Kaufmann, 1997.
- David C. Blair, Language and Representation in
Information Retrieval, Elsevier Science, 1990.
Recommended Reading for Week 1 (Overview)
- Tefko Saracevic, (1999) Information
Science. Journal of the American Society for Information
Science, 50(12)1051-1063.
- David C. Blair, Language and Representation in
Information Retrieval, Elsevier Science, 1990. Chapter 1,
pages 1-10.
Recommended Reading for Week 2 (Evidence from Content)
- Christopher Manning and Heinrich Schuetze, Foundations of
Statistical Natural Language Processing, Chapter 5
(Collocations), MIT Press, 1999. Available from the
book's Web site.
- Sparck-Jones, Karen, "What is the Role of NLP in Text Retrieval?," in
Tomek Strzalkowski (ed.), Natural Language Information
Retrieval,
Kluwer, 1999, Chapter 1, pp. 1-24. On reserve in the Paul
Wasserman Library.
- Jacquemin, C. and E. Tzoukermann. "NLP for Term Variant Extraction:
Synergy between Morphology, Lexicon, and Syntax," in
T. Strzalkowski (ed.), Natural Language Information Retrieval,
Kluwer, 1999, Chapter 2, pp. 25-70. On reserve in the Paul
Wasserman Library.
- Prager, John, Eric Brown, Anni Coden and Dragomir Radev.
"Question-Answering by Predictive Annotation," in
Proceedings of the 23rd Annual International ACM SIGIR Conference on
Research and Development in Information Retrieval
July 24-28, 2000, Athens Greece, pp. 184-191. Available on
campus from the
ACM
Digital Library.
- Donna Harman "Inverted Files," in William B. Frakes and
Ricardo Baeza-Yates, Information Retrieval: Data Structures and
Algorithms, Prentice Hall, 1992, Chapter 3.
- George A. Miller. (1995) WordNet:
A Lexical Database for English. Communications of the ACM,
38(11)39-41. Available on campus from the ACM Digital Library
- John Prager, Eric Brown, and Anni Coden. (2000)
Question-Answering by Predictive Annotation. Proceedings of
the 23rd Annual International ACM SIGIR Conference on Research
and Development in Information Retrieval (SIGIR 2000).
Recommended Reading for Week 3 (Ranked retrieval)
- Donna Harman "Ranking Algorithms," in William B. Frakes and
Ricardo Baeza-Yates, Information Retrieval: Data Structures and
Algorithms, Prentice Hall, 1992, Chapter 14.
- Amit Singhal, "Pivoted Document Length Normalization," SIGIR
1996. Available on campus through the ACM
Digital Library.
- S.E. Robertson et al, "Okapi at TREC-3," Proceedings of the
Third Text Retrieval Conference, 1994. Available on the TREC
Web site.
- James Allan, ed. "Challlenges in Information Retrieval and
Language Modeling", SIGIR Forum, 37(1)31-47, Spring, 2003.
Available from SIGIR.
- W. B. Croft and J. Lafferty, ed., Language Modeling for
Information Retrieval, Kluwer, 2003.
- David R. H. Miller, Tim Leek, and Richard M. Schwartz,
"A Hidden Markov Model Information Retrieval System,"
SIGIR 99. Available on campus from the
ACM
Digital Library.
Recommended Reading for Week 4 (Interaction)
- Robert S. Taylor, "The Process of Asking Questions,"
American Documentation, 13(4)391-396, 1962.
- Peter Pirolli and Stuart Card, "Information Foraging,"
Psychological Review. 106(4)643-675, 1999. May be
available on campus through Science
Direct
- Efthimis N. Efthimiadis and Stephen E. Robertson. (1989)
Feedback and Interaction in Information Retrieval. In
Charles Oppenheim, ed., Perspectives in Information
Management. London: Butterworth.
Recommended Readings for Week 5 (Evaluation)
- Ellen M. Voorhees, "Variations in Relevance Judgments and the
Measurement of Retrieval Effectiveness," Information
Processing and Management, 36(5)697-716. Available on
campus from Science
Direct
- Chris Buckley and Ellen M. Voorhees, "Evaluating Evaluation
Measure Stability", SIGIR 2000. Available on campus through the ACM
Digital Library
- Ellen M. Voorhees and Chris Buckley, "The Effect of Topic Set
Size on Retrieval Experiment Error," SIGIR 2002, Available on
campus through the ACM
Digital Library
- R. Mamantha, Ao Feng and James Allan, "A Critical Evaluation of
TDT's Cost Function," SIGIR 2002. Available on campus from the ACM
Digital Library
- Stefano Mizzaro. (1999) How Many Relevances in Information
Retrieval? Interacting With Computers, 10(3)305-322.
- Andrew H. Turpin and William Hersh, "Why Batch and User
Evaluations Do Not Give the Same Results," SIGIR 2001.
Available on campus from the ACM
Digital Library.
Recommended Reading for Week 6 (Web Search)
- Eric Brill, Jimmy Lin, Michele Banko, Susan Dumais, and Andrew
Ng. Data-Intensive Question Answering. Proceedings of the Tenth
Text REtrieval Conference (TREC 2001).
Recommended Reading for Week 7 (Evidence from Behavior)
- Larry Page, Sergey Brin, Rajeev Motwani and Terry Winograd, "Page
Rank Citation Ranking: Bringing
Order to the Web," Stanford Digital Library Working Paper
SIDL-WP-1999-0120, 1998. Available from CiteSeer.
- Diane Kelly and Jamie Teevan, "Implicit Feedback for Inferring
User Preference: A Bibliography," SIGIR Forum, 37(2)18-28, Fall
2003. Available from the SIGIR
Forum Web site.
- Jon M. Kleinberg, "Authoratative Sources in a Hyperlinked
Environment," Journal of the ACM, 46(5)604-632. Available on
campus from the ACM
Digital Library.
- Douglas W. Oard and Jinmook Kim, "Modeling Information Content
Using Observable Behavior," in Proceedings of the 2001
Annual Meeting of the American Society for Information Science and
Technology, Washington, November, 2001. Available from Doug Oard's Web
site
Recommended Reading for Week 8 (Scanned Documents)
- David Doermann, "The Indexing and Retrieval of Document Images:
A Survey",
Computer Vision and Image Understanding, 70(3)287-298,
1998. Available on campus from Science Direct.
- Tseng, Y.-H. and Oard, D. W., Document Image Retrieval
Techniques for Chinese. In Proceedings of the 2001 Symposium
on Document Image Understanding Technology, Columbia, MD, 2001.
Available from Doug Oard's
Web site
Recommended Reading for Week 9 (Evidence from Metadata)
- Carl Lagoze and Herbert Van de Stomple, "The Open Archives
Initiative: Building a Low-Barrier Interoperability Framework,"
Proceedings of the First ACM/IEEE-CS Joint Conference on Digital
Libraries, Roanoke, VA, June 2001, pp. 54-62. Available on
campus from the ACM
Digital Library.
- Diane Hillman, "National Science Digital Library (NSDL)
Metadata Primer," Web publication, 2003. Available
from the Open Archives
Initiative Web site.
Recommended Reading for Week 10 (Filtering)
- Douglas W. Oard, "The State of the Art in Text Filtering," User
Modeling and User-Adapted Interaction, 2007.
Recommended Reading for Week 11 (Audio)
- Jonathan Foote, "An Overview of Audio Information Retrieval,"
ACM-Springer Multimedia Systems, 7(1)2-10,
1999. Available from CiteSeer
- John S. Garofolo, Cedric G. P. Auzanne and Ellen M. Voorhees,
"The TREC Spoken Document Retrieval Track: A success story,"
in Proceedings of the Eighth Text Retrieval Conference,
1999, pp. 107-130. Available from the TREC
Web site
- Rodger J. McNabb, Lloyd A. Smith, Ian H. Witten, and Clare
L. Henderson, "Tune
Retrieval in the Multimedia Library," Multimedia Tools and
Applications, 10(2-3)113-132, 2000. Available from the
New
Zealand Digital Library Web site.
Recommended Reading for Week 12 (Cross-Language Search)
- Daqing He, et al., "Making MIRACLEs: Interactive Translingual
Search for Cebuano and Hindi," ACM Transactions on Asian
Language Information Retrieval, 2(2-3). Available from the
ACM Digital Library.
Recommended Reading for Week 13 (Photographs and Video)
- Vekant N. Gudivada and Vijay V. Raghavan, "Modeling and
Retrieving Images by Content," Information Processing and
Management, 33(4)427-452, 1997. Available on campus from
Science Direct.
- Howard Wactlar et al., "Complementary Audio and Video Analysis
for Broadcast News Archives," Communicatuions of the
ACM, 43(2)42-47, 2000. Available on campus from the ACM
Digital Library.
Doug Oard
Last modified: Aug 19 2007