MTech NLP · Module 1

Concepts That Need Coding

An interactive map of all topics in Module 1 that involve programming — with what to code, difficulty, and tools.

8
Coding Topics
6
Applications
5
Categories
NLP Pipeline
ML Role
Applications
Text Processing
Evaluation
Generic NLP Pipeline
NLP Pipeline
⬛⬜⬜⬜ Beginner
Build an end-to-end text processing pipeline: tokenize → normalize → tag → parse → output. Shows all stages in sequence.
NLTK spaCy Python
Ambiguity Detection
Text Processing
⬛⬛⬜⬜ Intermediate
Write code to identify and illustrate lexical, syntactic, and semantic ambiguity in example sentences using parsing and word sense tools.
spaCy NLTK WordNet
ML vs Rule-Based Comparison
ML Role
⬛⬛⬜⬜ Intermediate
Implement a simple task (e.g., spam detection or POS tagging) using both hand-written rules AND an ML model. Compare accuracy side by side.
scikit-learn NLTK pandas
Machine Translation (Demo)
Application
⬛⬛⬜⬜ Intermediate
Implement a simple word-level translation using a bilingual dictionary, then contrast with a pretrained neural MT model (Helsinki-NLP).
transformers HuggingFace Python dict
Question Answering System
Application
⬛⬛⬛⬜ Advanced
Build a simple extractive QA system: given a paragraph and a question, extract the answer span. Use a retrieval-based pipeline or pretrained model.
transformers BERT/DistilBERT HuggingFace
Information Retrieval (TF-IDF)
Application
⬛⬛⬜⬜ Intermediate
Build a document retrieval engine: index a set of documents, accept a query, and return ranked results using TF-IDF similarity.
scikit-learn NumPy NLTK
Text Categorization
Application
⬛⬛⬜⬜ Intermediate
Train a text classifier on a labeled dataset (e.g., 20 Newsgroups or AG News). Implement preprocessing, feature extraction, and evaluation.
scikit-learn NLTK datasets (HF)
Text Summarization
Application
⬛⬛⬜⬜ Intermediate
Implement extractive summarization (sentence scoring with TF-IDF) and contrast with abstractive summarization using a pretrained T5/BART model.
NLTK transformers rouge-score
Sentiment Analysis
Application
⬛⬜⬜⬜ Beginner
Classify text as positive/negative/neutral. Start with a lexicon-based approach (VADER), then move to a trained ML model.
NLTK VADER scikit-learn transformers
NLP Evaluation Metrics
Evaluation
⬛⬜⬜⬜ Beginner
Implement and understand core NLP evaluation metrics: accuracy, precision, recall, F1, BLEU, ROUGE — used across all Module 1 applications.
scikit-learn nltk.translate rouge-score