Tentative Course Schedule & Lecture Notes

Week of Lectures & Reading Assignments & Exams
Jan 16 Syllabus & Introduction
(Chapter 1)
Reading: Roll Your Own Search Engine
A History of Search Engines
Jan 23 Basic IR Models
Text Characteristics: Zipf's Law
(Chapter 2, pp. 19-30)
Jan 30 A Guest Presentation by Robert Bland on Library IR Systems
Basic Tokenizing, Indexing, & Vector-Space Retrieval
(Chapter 7, pp. 163- 169 & Chapter 8, pp. 191-199)
Feb 6 Project 1: Code & Description Project 1 Assigned
Feb 13 Tries, Query Languages & Review for Exam
(Chapter 4)
Feb 20 Review of Exam 1 & Experimental Evaluation of IR
(Chapter 3)
Exam 1
Feb 27 Experimental Evaluation of IR & Query Operations
(Chapters 3, & 5)
March 6 Text Representation (Chapter 6)
Additional XML Resources:
Material from Stanford University
XML presentation by Stair & Shurtleff
Project 1 Due
Search Engine Case Studies Assigned
March 13 Spring Break: No Class
March 20 Intro to Web Search & Spidering
(Chapter 13)
March 27 Presentation of Search Engine Case Studies
Interfaces & Link Analysis
(Chapter 13)
April 3 Project 2: Code & Description Project 2 Assigned
April 10 Advanced IR Models (Chapter 2, pp. 30-69)
Vector Models (courtesy of Bruce Croft at UMass)
Probability Models (courtesy of Bruce Croft at UMass)
Bayesian Networks (courtesy of Ami Motro @ George Mason Univ.)
April 17 April Advance: No Class
April 24 Text Categorization & Clustering
May 1 Information Extraction & Review for Exam
May 8 Review of Exam Exam 2; Project 2 is due at mid-night
May 15 Final Meeting: a class discussion

Attendance Policy:

Roll is not taken. Students are expected to attend all class lectures. Failure to do so will be considered a lack of interest in success on the part of the student.