LBSC 796/INFM 718R
Information Retrieval Systems
Fall 2007
Required Readings
The principal text for this course (referred to below as "MRS" for the
authors' initials) is Christopher D. Manning, Prabhakar Raghavan and
Heinrich Schuetze, An
Introduction to Information Retrieval, Draft as of July 1, 2007
(or later). This book is available only on the Web at this point.
The lecture notes are available on a supplimental password protected readings page. Lecture notes
for a specific topic will be ready at least two weeks before that
session.
Downloading readings from the Web may require Microsoft Word or
Acrobat Reader, depending on the format.
Required Readings for Week 1 (Overview)
- Lecture Notes: A Process Model for Information Retrieval
- MRS Chapter 1: IR Using the Boolean Model
Required Readings for Week 2 (Evidence from Content)
- MRS Chapter 2: The dictionary and postings lists
- MRS Chapter 3: Tolerant retrieval
Required Readings for Week 3 (Ranked Retrieval)
- MRS Chapter 6. Scoring and Term Weighting
- MRS Chapter 7. Vector Space Retrieval
- Djoerd Hiemstra and Arjen P. de Vries, "Relating the New
Language Models of Information Retrieval to the Traditional Retrieval
Models," Technical Report TR-CTIT-00-09. Available from CiteSeer.
Required Readings for Week 4 (Interaction)
Required Readings for Week 5 (Evaluation)
- Lecture Notes: Evaluation (Chapter 4)
- MRS Chapter 8: Evaluation in Information Retrieva
- Ellen M. Voorhees, "Variations in Relevance Judgments and the
Measurement of Retrieval Effectiveness," Information
Processing and Management, 36(5)697-716. Available on
campus from Science
Direct
Required Readings for Week 6
- MRS Chapter 19. Web Search Basics
- MRS Chapter 20. Web Crawling and Indexing
Required Readings for Week 7 (Evidence from Behavior)
Required Readings for Week 8 (Scanned Documents)
- David Doermann, "The Indexing and Retrieval of Document Images:
A Survey",
Computer Vision and Image Understanding, 70(3)287-298,
1998. Available on campus from Science Direct.
- Toni M. Rath, R. Manmatha, and Victor Lavrenko, "A Search
Engine for Historical Manuscripts," SIGIR 2004. Available from
CIIR<
Required Readings for Week 9 (Evidence from Metadata)
- Nigel Shadbolt, Wendy Hall and Tim Berners-Lee, "The Semantic
Web Revisited," IEEE Intelligent Systems, 12(3)96-101, 2006.
- Diane Hillman, "National Science Digital Library (NSDL)
Metadata Primer," Web publication, 2003. Available
from the Open Archives
Initiative Web site.
Required Readings for Week 10 (Filtering)
- MRS Chapter 9: Relevance Feedback and Query Expansion
- Joshua Goodman, Gordon V. Comack and David Heckerman, Spam and
the Ongoing Battle for the Inbox, Communications of the ACM,
50(2)24-33, 2007. (available on campus from the ACM Digital
Library)
Required Readings for Week 11 (Audio)
- William Byrne et al, "Automatic Recognition of Spontaneous
Speech for Access to Multilingual Oral History Archives," IEEE
Transations on Audio and Speech Processing, 2004. Available on
the password protected readings page.
- Elias Pampalk, Simon Dixon and Gerhard Widmer, "Exploring Music
Collections by Browsing Different Views," in International
Conference on Music Information Retrieval, 2003. Available on
the ISMIR 2003
Web site.
Required Readings for Week 12 (CLIR)
- Lecture Notes: Cross-Language IR
- Gina-Anne Levow, Douglas W. Oard, Philip Resnik,
"Dictionary-Based Techniques for Cross-Language Information
Retrieval," Information Processing and Management, 2005.
Available from the password protected
readings page.
Required Readings for Week 13 (Images and Video)
- Chad Carson, Serge Belongie, Hayit Greenspan and Jitendra
Malik, "Blobworld: Image Segmentation Using
Expectation-Maximization and Its Application to Image Querying,"
IEEE Transactions on Pattern Analysis and Machine Intelligence,
24(8)1026-1038, 2002. Available on campus from IEEE
Explore.
- Alan Smeaton, Wessel Kraaij and Paul Over, "TRECVID-2003 Video
Retrieval Evaluation Overview," Powerpoint slides, 2003.
Available from the TRECVID
Web site.
Doug Oard
Last modified: Aug 19 2007