Kun Yuan

PKU Class 2024 Fall: Optimizaiton for Deep Learning

Instructor: Kun Yuan (kunyuan@pku.edu.cn)

Teaching assistants:

Classroom: 3pm - 6pm Tuesday, 三教403

Office hour: 4pm - 5pm Thursday, 静园六院220

References

Martin Jaggi and Nicolas Flammarion, Optimization for Machine Learning, EPFL Class CS-439
Chris De Sa, Advanced Machine Learning Systems, Cornell CS6787
Kun Yuan, Introduction to LLM, PKU

Materials

Lecture 1: Introduction

Lecture 2: Basics in Machine Learning and Langugae Models

Lecture 3: Transformers

Lecture 4: Parameters and Computations in Decoder-only LLMs

Lecture 5: Stochastic Gradient Descent

Lecture 6: Advances in SGD

Lecture 7: Momentum SGD

Lecture 8: Adaptive SGD

Midterm Exam: Good Luck :)

Lecture 9: Mixed-Precision Training

Lecture 10: Block Coordinate Descent

Lecture 11: Subspace Optimization

Lecture 12: Zeroth-order Optimization

Lecture 13: Distributed Optimization

Lecture 14: Flash Attention