About me

Hi! I'm a second-year Ph.D. student at the Electrical and Computer Engineering Department of Carnegie Mellon University. I am fortunate to be advised by Prof. Yuejie Chi. Previously I spent one year at Peking University for a data science master’s program. Before that, I received my B.S. in Mathematics (Honors Program) from Xi'an Jiaotong University. Here is my CV.

Research Interests

I have eclectic interests in machine learning, deep learning, reinforcement learning, optimization and game theory. Specifically, I am interested in developing sample and computationally efficient algorithms for some fundamental machine learning problems.

Publications

  • Faster WIND: Accelerating Iterative Best-of-N Distillation for LLM Alignment
    Preprint, 2024
    Tong Yang, Jincheng Mei, Hanjun Dai, Zixin Wen, Shicong Cen, Dale Schuurmans, Yuejie Chi, Bo Dai
    [PDF] [BibTex]

  • In-Context Learning with Representations: Contextual Generalization of Trained Transformers
    NeurIPS 2024
    Tong Yang, Yu Huang, Yingbin Liang, Yuejie Chi
    [PDF] [BibTex]

  • Federated Natural Policy Gradient and Actor Critic Methods for Multi-task Reinforcement Learning
    NeurIPS 2024
    Tong Yang, Shicong Cen, Yuting Wei, Yuxin Chen, Yuejie Chi
    [PDF] [BibTex]

  • A Primal-Dual Approach to Solving Variational Inequalities with General Constraints
    ICLR 2024
    Tatjana Chavdarova*, Tong Yang*, Matteo Pagliardini, Michael I. Jordan (*equal contribution, order is alphabetical.)
    [PDF] [Poster (OPT@NeurIPS '22)] [BibTex]

  • Solving Constrained Variational Inequalities via an Interior Point Method
    ICLR 2023 Spotlight!
    Tong Yang*, Michael I. Jrodan*, Tatjana Chavdarova* (*equal contribution)
    [PDF] [Poster (WiML@ICML '22)] [Code] [BibTex]

  • Optimization for Amortized Inverse Problems
    ICML 2023
    Tianci Liu*, Tong Yang*, Quan Zhang, Qi Lei (*equal contribution)
    [PDF] [BibTex]

  • Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF
    Preprint, 2024
    Shicong Cen, Jincheng Mei, Katayoon Goshvadi, Hanjun Dai, Tong Yang, Sherry Yang, Dale Schuurmans, Yuejie Chi, Bo Dai
    [PDF] [BibTex]