Concepts That Need Coding
Generic NLP Pipeline, Text Processing, Information Retrieval, Text Categorization, and core Evaluation Metrics.
Morphology & N-Grams
Inflectional Morphology, Regular Expressions, Finite Automata, Finite State Transducers, and N-Gram Language Models.
POS Tagging & CFG
Rule-Based & Stochastic POS, Hidden Markov Models, Context Free Grammars, Probabilistic CFG, and Constituency Parsing.
Semantics & WSD
Phrase Attachment, WordNet Lexical Relations, Homonymy & Polysemy detection, Lesk/Yarowsky Algorithms, and ML-based Word Sense Disambiguation.
Discourse & Coherence
Reference Phenomena, Hobbs' Algorithm for Pronouns, Centering Theory, Text Coherence Modeling (Entity Grids), and Rhetorical Structure Theory (RST).