Stochastic Parrots
Implement Llama 2 inference in C from scratch and then run a small LLM on a regular laptop. A nice excuse to look under the hood of the systems everyone keeps talking about, and to see firsthand why a 1 GB parrot says the things it says.
- Matrix multiplication
- Quantization
- Sampling and temperature
→ github.com/aaleta/2026_1_stochastic_parrots
TinyStories 110M (left) and TinyLlama (right), generating text token by token on a laptop CPU. The students quickly discovered the parrots have opinions about whether they will pass the exam.
In the exam, we had the students ask the LLM whether they would pass. Apparently none of them had tried this even once beforehand, and the parrots were not optimistic about their chances.








