Stanford CS336 is a 5-unit course on building language models from scratch, covering data collection, transformer construction, training, evaluation, and deployment.
It requires strong Python, deep learning, systems, calculus, probability, and machine learning background. The course includes five implementation-heavy assignments on basics, systems, scaling, data, and alignment/RL, plus GPU compute guidance, late-day rules, and an honor code.