My recent research centers around reinforcement learning (RL) and LLM post-training.
With backgrounds in probability and statistics, my past research includes mathematical theory and algorithm design of RL, training dynamics and generalization properties of deep learning, and RL for operations and economics.