
Ling 3000Q/5000:
Introduction to Computational Linguistics

Schedule

Note: This schedule is subject to change, but not without notice. Any changes will be announced in class and reflected on these pages. Be sure to check regularly.


Dates Topics Methods Readings

Jan 22, 24
[W,F]
Introduction
Computers in linguistics and Natural Language Processing
The nature and use of text corpora
Some programming basics
Jurafsky & Martin 3rd edition Ch. 1
Jan 27 – 31
[1 week]
Regular expressions and regular languages
Pattern matching
Corpus search, concordances, counting
Finite-state automata
File handling, control structures
Regular expressions
Some set theory
J&M 3 Ch. 2
Feb 3 – 14
[2 weeks]
N-grams
Language modeling
Smoothing
Evaluation
Probability theory
Object-oriented programming
J&M 3 Ch. 3
Feb 17 – 28
[2 weeks]
Sequence classification
Part-of-Speech Tagging
Word classes and tagsets
Rule-based and stochastic POS tagging
Hidden Markov Models
Evaluation
More Python
J&M 3 Ch. 8
Mar 3 – 7
[1 week]
Context-free languages
Syntactic ambiguity
Context-free grammars
Push-down automata
Chomsky Hierarchy
Language theory
J&M 3 Ch. 17
Mar 10 – 21
[2 weeks]
Parsing
Search and dynamic programming
Chart parsing (CKY, Earley)
Chunking
Evaluation
More Python
J&M 2 Ch. 13
[J&M 3 Ch. 18]
Mar 24 – 28
[1 week]
Sentiment classification
Bag of words
Naive Bayes
More probability
J&M 3 Ch. 4
Mar 31 – Apr 18
[3 weeks]
Computational lexical semantics
Measures of collocational strength
(Un)supervised word sense disambiguation
Classifiers
Vector-space semantics
NLTK, infomap
J&M 3 Ch. 6
[J&M Ch. 20]
Apr 21 – May 3
[2 weeks]
Loose ends, advanced topics
May 5 – May 10
Finals Week

Last updated: January 22, 2025