Lectures

Course Overview (Jan 8)

Content

  • Course logistics
  • What is natural language processing?
  • What are the features of natural language?
  • What do we want to do with NLP?
  • What makes it hard?

Slides

Course Oveview

Reading Material

Text Classification (Aug 29)

Content

  • Defining features
  • Building a rule-based classifier
  • Training a logistic regression based classifier
  • Evaluating classification

Slides

Text Classification

Reading Material

Neural Network Basics (Sept 3)

Content

  • Cross Entropy Loss
  • Gradient Descent
  • Components of a feedforward neural network

Slides

Neural Network Basics

Reading Material

Word Vectors (Sept 5)

Content

  • Deep Averaging Network for Text CLassification
  • Lexical Semantics
  • Distributional Semantics
  • Evaluating Word Vectors

Slides

Word Vectors

Reading Material

Language Modeling (Sept 10)

Content

  • What is a language model
  • How to evaluate a language model
  • How to build a language model - N-gram language model, a simple feedforward neural LM

Slides

Language Modeling

Reading Material

[Eisenstein 6.1-6.2, 6.4]

Language Modeling (Sept 12)

Content

  • Feedforward Language Model
  • Recurrent Neural LM, Attention
  • Building blocks of a transformer

Slides

Neural LM

Reading Material

[J&M Chapter 8, 9]

[Eisenstein 6.3]

[Luong15]

[Illustrated Transformer]

Transformers (Sept 17)

Content

  • Self attention
  • Transformer Encoder
  • Transformer Decoder (Cross Attention, Masked Self Attention)
  • Impact of transformers

Slides

Transformers

Reading Material

[Illustrated Transformer]

[J&M Chapter 9]

[Attention is all you need]

Tokenization (Sept 19)

Tokenization Contd. / Pretraining I (September 24)

Content

  • Unigram tokenizer
  • Pretraining / finetuning paradigm
  • Masked LMs - BERT, RoBERTa, ELECTRA

Slides

Masked LMs

Reading Material

[Illustrated BERT)]

[BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding]

[RoBERTa]

[ELECTRA]

Pretraining II (September 26)

Content

  • T5 / BART / UL2 / GPT2
  • Decoding strategies

Slides

Pretraining II

Reading Material

[What happend to BERT/T5]

[Decoding strategies]

Pretraining II (October 1)

Content

  • Scaling
  • Prompting
  • In-context learning
  • CoT

Slides

Pretraining II

Reading Material

Instruction Following (October 3)

Content

  • Instruction Tuning (T0, FLAN)
  • Evaluating Instruction Tuned LMs
  • Basics of RLHF

Slides

Instruction Following

Reading Material

Instruction Following / Preference Optimization (Oct 8)

Content

  • Supervised Finetuning
  • Reward Modeling
  • Basics of RLHF
  • Direct Preference Optimization

Slides

Learning from Preferences

Reading Material

Learning from Preferences (Oct 10)

Evaluation (Oct 15)

Content

  • What is Benchmarking
  • Open and close ended evaluation
  • LLM Evaluation Challenges

Slides

benchmarking

Reading Material

[The Evolving Landscape of LLM Evaluation (for Quiz)]

Parameter Efficient Finetuning (October 22)

Content

  • LoRA
  • QLoRA

Slides

(q)lora

Reading Material

[LoRA]

[QLoRA]

Efficiency (October 24)

Content

  • Speculative Decoding, Flash Attention
  • Quantization, Pruning, Distillation

Slides

Efficiency

Reading Material

TBA

Interpretability (Oct 28)

Content

  • Global vs Local Explanation
  • Post hoc explanations (LIME, Gradient-based)
  • Probing

Slides

Interpret

Reading Material

TBA

Multilinguality (Oct 31)

Content

  • Linguistic Diversity
  • Cross Lingual Transfer
  • Multilingual Pretraining and Alignment

Slides

Multilingual

Reading Material

TBD

Multimodality (Nov 5)

Content

  • ViT
  • CLIP
  • Image + Text -> Text

Slides

Multimodal

Reading Material

Multimodality II (Nov 7)

Content

  • Image + Text -> Text
  • Audio LMs

Slides

Multimodal

Reading Material

TBA

Ethics (Nov 12)

Content

  • Background on Ethics in AI/NLP
  • Bias and Fairness
  • Toxicity and Other Harmful Content

Slides

Ethics

Reading Material

See Canvas

Sequence Tagging (Nov 14)

Content

  • Why sequence tagging
  • HMMs
  • Viterbi

Slides

sequence tagging

Reading Material

TBA

Sequence Tagging / Parsing (Nov 19)

Content

  • Viterbi
  • Syntatic Parsing, PCFGs
  • CKY
  • Semantic Parsing

Slides

sequence tagging

Reading Material

TBA

Retrieval (Nov 21)

Content

  • QA
  • Retrieval
  • Retrieval augmentation

Slides

retrieval

Reading Material

TBA

No class (Thanksgiving break)

Language Agents (December 3)

Content

  • What are language agents
  • Agents vs workflows
  • Examples: Web Agents, Science Agents

Slides

agents

Reading Material

Language Agents