| Class | Date | Topic | Slides | Readings | Homework | Project |
|---|---|---|---|---|---|---|
| 1 | Jan 14 | Course overview, history of IR | As We May Think, What is a browser? | HW0 out | ||
| 2 | Jan 16 | Basics of IR, boolean retrieval | IIR 1 | |||
| 3 | Jan 18 | Text representation | IIR 2.0-2.2 | |||
| Jan 21 | Martin Luther King, Jr. Day (no class) | |||||
| 4 | Jan 23 | Index construction | (see below) | IIR 2.3-2.4 | ||
| 5 | Jan 25 | Index construction | IIR 3 | |||
| 6 | Jan 28 | Spell correction; Index compression | IIR3; IIR 5.1-5.3 | |||
| 7 | Jan 30 | Large-scale indexing: MapReduce | DITP 1, 2, 4.1-4.3, MMS 2.1-2.2 | |||
| 8 | Feb 1 | Ranking | IIR 6 | |||
| 9 | Feb 4 | Vector space retrieval | IIR 6 | |||
| 10 | Feb 6 | Vector space retrieval | IIR 6 | |||
| 11 | Feb 8 | Vector space retrieval | IIR 7 | HW1 due (Feb 10) | ||
| 12 | Feb 11 | Statistical language models | IIR 12, Optional: Statistical Language Models for Information Retrieval: A Critical Review | |||
| 13 | Feb 13 | Web retrieval | NCM 13, IIR 19, The Dirty Little Secrets of Search (NYTimes), Google Penalizes Overstock for Search Tactics (WSJ) | |||
| 14 | Feb 15 | Link Analysis: Overview | IIR 21, NCM 14.1-14.4, MMS 5.1-5.3, 5.5 | |||
| 15 | Feb 18 | Link Analysis: PageRank | Original PageRank paper; Optional: Deeper Inside PageRank, PageRank Overview | |||
| 16 | Feb 20 | Link Analysis: Hubs and Authorities | Kleinberg | |||
| 17 | Feb 22 | Link Analysis: Topic-Sensitive PageRank, TrustRank, and Spam | Haveliwala, MMS 5.4, Web Spam Taxonomy, TrustRank | |||
| 18 | Feb 25 | Evaluation | IIR 8 | |||
| 19 | Feb 27 | Evaluation | IIR 8 | |||
| 20 | Mar 1 | Recommender systems | MMS 9, Adomavicius and Tuzhilin | HW2 due | ||
| 21 | Mar 4 | Recommender systems | MMS 9, Adomavicius and Tuzhilin | |||
| 22 | Mar 6 | Midterm review | Quiz 1 (2012), Quiz 2 (2012) | |||
| 23 | Mar 8 | Midterm | ||||
| Mar 11 | Spring Break (no class) | |||||
| Mar 13 | Spring Break (no class) | |||||
| Mar 15 | Spring Break (no class) | |||||
| 24 | Mar 18 | Text clustering: overview | MMS 7.1 - 7.5 | |||
| 25 | Mar 20 | Text clustering: flat | IIR 16 | |||
| 26 | Mar 22 | Text clustering: hierarchical | IIR 17 | |||
| 27 | Mar 25 | Text classification: Naive Bayes | IIR 13.1 - 13.4 | |||
| 28 | Mar 27 | Project overview | ||||
| Mar 29 | Reading Day (no class) | |||||
| 29 | Apr 1 | Text classification: Rocchio, kNN | IIR 14 | Proposals due | ||
| 30 | Apr 3 | Text classification: SVM, Feature selection | IIR 13.5, IIR 15 | |||
| 31 | Apr 5 | Text classification: Wrap up | HW3 due (Apr 7) | |||
| 32 | Apr 8 | Learning to rank | IIR 6.1, IIR 15.4 | |||
| 33 | Apr 10 | Learning to rank | Richardson Optional: Liu's tutorial, Microsoft LETOR | |||
| 34 | Apr 12 | Learning to rank for spatio-temporal search | Shaw et al. | |||
| 35 | Apr 15 | Project workday (no class) | ||||
| 36 | Apr 17 | Location + Geo: Mining Query Logs | Backstrom et al. | |||
| 37 | Apr 19 | Location + Geo: Clustering Checkins | Cranshaw et al. | |||
| 38 | Apr 22 | Crowdsourcing: Topical Experts on Twitter | Ghosh et al. | |||
| 39 | Apr 24 | Sentiment Analysis | Pang and Lee | |||
| 40 | Apr 26 | Wrap up | ||||
| 41 | Apr 29 | No Class |
One slide due April 28 by 11:59pm | |||
| 42 | Apr 30 | Super Project Workshop -- All Demos (note this is Tuesday); 1:50-3:30 in ETB 2005 | Report due (May 5) | |||
| May 7 | Final exam, 3:30pm-5:30pm | Final Exam (2012) |