MicroGPT Visualized

Building a GPT from scratch — an interactive visual guide

Step 0

Counting

A bigram language model trained by counting — the simplest possible language model.

  0.1 The Dataset
  0.2 The Tokenizer
  0.3 The Count Table
  0.4 The Model
  0.5 Training
  0.6 Loss
  0.7 Inference
The big idea: counting IS learning. For this simple model, counting bigrams in the training data gives the exact maximum-likelihood answer, with no gradient descent required.
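The whole pipeline above (tokenize, count, normalize, score, sample) can be sketched in a few lines. This is a minimal illustration of the counting idea, not the guide's actual code: the toy corpus and the use of "." as a start/end marker are assumptions made here for the example.

```python
import math
import random
from collections import defaultdict

# Toy corpus (hypothetical stand-in for the guide's dataset, sections 0.1-0.2).
words = ["emma", "olivia", "ava"]

# 0.3 The count table: counts[a][b] = how often token b follows token a.
# "." marks both the start and the end of a word (an assumed convention).
counts = defaultdict(lambda: defaultdict(int))
for w in words:
    tokens = ["."] + list(w) + ["."]
    for a, b in zip(tokens, tokens[1:]):
        counts[a][b] += 1

# 0.4-0.5 The model / "training": normalize each row into a distribution.
probs = {a: {b: c / sum(row.values()) for b, c in row.items()}
         for a, row in counts.items()}

# 0.6 Loss: average negative log-likelihood of the training data.
nll, n = 0.0, 0
for w in words:
    tokens = ["."] + list(w) + ["."]
    for a, b in zip(tokens, tokens[1:]):
        nll -= math.log(probs[a][b])
        n += 1
print(f"loss: {nll / n:.4f}")

# 0.7 Inference: sample one token at a time until "." is produced.
random.seed(0)
out, tok = [], "."
while True:
    cands, weights = zip(*probs[tok].items())
    tok = random.choices(cands, weights=weights)[0]
    if tok == ".":
        break
    out.append(tok)
print("sampled:", "".join(out))
```

Note that "training" here is just filling in the count table: the normalized counts are already the maximum-likelihood bigram probabilities, so no iterative optimization is needed.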
Step 1: Gradient Descent →