Building Advanced Learning Algorithms and AI Systems

Through rigorous theoretical foundations and principled algorithm design

My research explores the overlap between practical machine-learning methods and their underlying theory - a space where recent progress in training deep networks still depends largely on heuristics and has little theoretical backing. In my research, I aim to narrow the gap between theory and practice by

developing practical theory that explains why specific phenomena arise and why certain heuristics succeed during deep-learning training,
advancing existing algorithms by theoretically inspired modifications, and
creating new methods with both strong convergence guarantees and practical performance.

```

Large-Scale Optimization

Large-Scale Optimization

LLMs Robustness Hyperparameter transfer

Federated Learning

Federated Learning

Communication Efficiency Privacy Local Methods

Loss Landscape

Loss Landscape of Neural Networks

Sharpness Edge of Stability Landscape Characterization

```