Title: Reinforcement Learning With Constraints: From Theory to Reasoning in LLM

Abstract: In this talk, I will explore reinforcement learning with constraints, focusing on both theoretical foundations and practical applications. I will first present recent advances in the sample complexity of constrained Markov decision processes (CMDPs), covering both offline and online settings. Our results establish near-optimal upper and lower bounds under relaxed and strict feasibility regimes, revealing that constraint satisfaction—while generally harder—can match the sample efficiency of unconstrained MDPs under certain conditions. These insights are grounded in primal-dual algorithms and generative model frameworks. Inspired by this theory, I will discuss how CMDPs can be applied to impose behavior in large language models (LLMs), such as controlling reasoning length or enforcing budgeted constraints during fine-tuning. By treating response generation as a CMDP and incorporating online dual updates, we show that LLMs can be optimized to meet constraints with minimal degradation in performance. 

Bio: Dr. Lin Yang (杨林) is an Associate Professor in the Electrical and Computer Engineering and Computer Science Departments at UCLA. His research centers on the foundations of modern machine learning and data science, with a focus on fast algorithms with provable guarantees in areas such as reinforcement learning, large language model acceleration, non-convex optimization, and streaming algorithms. Dr. Yang received dual Ph.D. degrees in Computer Science and Physics & Astronomy from Johns Hopkins University and was a postdoctoral researcher at Princeton University. His honors include the Amazon Faculty Award, Simons Research Fellowship, Dean Robert H. Roy Fellowship, and the JHU MINDS Best Dissertation Award.


UBC Crest The official logo of the University of British Columbia. Urgent Message An exclamation mark in a speech bubble. Caret An arrowhead indicating direction. Arrow An arrow indicating direction. Arrow in Circle An arrow indicating direction. Arrow in Circle An arrow indicating direction. Bluesky The logo for the Bluesky social media service. Chats Two speech clouds. Facebook The logo for the Facebook social media service. Information The letter 'i' in a circle. Instagram The logo for the Instagram social media service. External Link An arrow entering a square. Linkedin The logo for the LinkedIn social media service. Location Pin A map location pin. Mail An envelope. Menu Three horizontal lines indicating a menu. Minus A minus sign. Telephone An antique telephone. Plus A plus symbol indicating more or the ability to add. Search A magnifying glass. Twitter The logo for the Twitter social media service. Youtube The logo for the YouTube video sharing service.