About me

Hi! I'm a third-year Ph.D. student in the Department of Electrical and Computer Engineering at Carnegie Mellon University, where I am fortunate to be advised by Prof. Yuejie Chi. Previously, I spent one year at Peking University in a data science master’s program. Before that, I received my B.S. in Mathematics (Honors Program) from Xi'an Jiaotong University. Here is my CV.

Research Interests

I'm interested in machine learning, deep learning, reinforcement learning, optimization, and game theory. Specifically, I am interested in developing sample-efficient and computationally efficient algorithms for fundamental machine learning problems.

Selected Publications

  • Exploration from a Primal-Dual Lens: Value-Incentivized Actor-Critic Methods for Sample-Efficient Online RL
    NeurIPS 2025
    Tong Yang, Bo Dai, Lin Xiao, Yuejie Chi
    [PDF] [BibTex]

  • Multi-head Transformers Provably Learn Symbolic Multi-step Reasoning via Gradient Descent
    NeurIPS 2025
    Tong Yang, Yu Huang, Yingbin Liang, Yuejie Chi
    [PDF] [BibTex]

  • Incentivize without Bonus: Provably Efficient Model-based Online Multi-agent RL for Markov Games
    ICML 2025
    Tong Yang, Bo Dai, Lin Xiao, Yuejie Chi
    [PDF] [BibTex]

  • Faster WIND: Accelerating Iterative Best-of-N Distillation for LLM Alignment
    AISTATS 2025
    Tong Yang, Jincheng Mei, Hanjun Dai, Zixin Wen, Shicong Cen, Dale Schuurmans, Yuejie Chi, Bo Dai
    [PDF] [BibTex]

  • In-Context Learning with Representations: Contextual Generalization of Trained Transformers
    NeurIPS 2024
    Tong Yang, Yu Huang, Yingbin Liang, Yuejie Chi
    [PDF] [BibTex]

  • Federated Natural Policy Gradient and Actor Critic Methods for Multi-task Reinforcement Learning
    NeurIPS 2024
    Tong Yang, Shicong Cen, Yuting Wei, Yuxin Chen, Yuejie Chi
    [PDF] [BibTex]

  • A Primal-Dual Approach to Solving Variational Inequalities with General Constraints
    ICLR 2024
    Tatjana Chavdarova*, Tong Yang*, Matteo Pagliardini, Michael I. Jordan (*equal contribution; order is alphabetical)
    [PDF] [Poster (OPT@NeurIPS '22)] [BibTex]

  • Solving Constrained Variational Inequalities via an Interior Point Method
    ICLR 2023 Spotlight!
    Tong Yang*, Michael I. Jordan*, Tatjana Chavdarova* (*equal contribution)
    [PDF] [Poster (WiML@ICML '22)] [Code] [BibTex]

  • Optimization for Amortized Inverse Problems
    ICML 2023
    Tianci Liu*, Tong Yang*, Quan Zhang, Qi Lei (*equal contribution)
    [PDF] [BibTex]

  • Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF
    ICLR 2025
    Shicong Cen, Jincheng Mei, Katayoon Goshvadi, Hanjun Dai, Tong Yang, Sherry Yang, Dale Schuurmans, Yuejie Chi, Bo Dai
    [PDF] [BibTex]