Lecture 4: Text Classification, 6 Sept.
Presentation
Recordings
Mandatory reading
Jurafsky and Martin, Speech and Language Processing, 3rd ed. (edition of Dec. 2020)
- Ch. 4, "Naive Bayes and Sentiment Classification"
- Except (for now) section 4.9 Statistical significance testing
Recommended reading
Manning, Raghavan, Schütze, Introduction to Information Retrieval
- Ch. 13, "Text Classification and Naive Bayes", Sec. 13.0-13.3
Lab session 2, Thursday 9 September at Sed
- Exercises
- The text file, crisis.txt