-
SPARKLE: A Unified Single-Loop Primal-Dual Framework for Decentralized Bilevel Optimization
Great Bay University, Dongguan, China
August, 2025
-
A Memory Efficient Subspace Optimization Method for Training Large Language Models
PKU Workshop on Optimization Theory and Methods
Peking University, Beijing, China
June, 2025
-
Subspace Optimization for Large Language Models with Convergence Guarantees
The NUS-PKU-SJTU Workshop on Data Science and Machine Learning
National University of Singapore, Singapore
November, 2024
-
A Mathematics-Inspired Learning-to-Optimize Framework for Decentralized Optimization
ORSC 2024
Guiyang, China
October, 2024
-
Efficient Optimization for Deep Learning: Part I
Adaptive SGD
Fudan University, Shanghai, China
June, 2024
-
Efficient Optimization for Deep Learning: Part II
Fudan University, Shanghai, China
June, 2024
-
Efficient Optimization for Deep Learning: Part III
Mixed-Precision Training
Fudan University, Shanghai, China
June, 2024