Boyi Liu

Affiliations:
  • Northwestern University, IL, USA


According to our database1, Boyi Liu authored at least 19 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2025
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

BRiTE: Bootstrapping Reinforced Thinking Process to Enhance Language Model Reasoning.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

2024
Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Reason for Future, Act for Now: A Principled Architecture for Autonomous LLM Agents.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Let Models Speak Ciphers: Multiagent Debate through Embeddings.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning.
J. Mach. Learn. Res., 2023

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency.
CoRR, 2023

Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Achieving Hierarchy-Free Approximation for Bilevel Programs with Equilibrium Constraints.
Proceedings of the International Conference on Machine Learning, 2023

Differentiable Arbitrating in Zero-sum Markov Games.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022
An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models.
CoRR, 2022

Differentiable Bilevel Programming for Stackelberg Congestion Games.
CoRR, 2022

Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Inducing Equilibria via Incentives: Simultaneous Design-and-Play Ensures Global Convergence.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021
Inducing Equilibria via Incentives: Simultaneous Design-and-Play Finds Global Optima.
CoRR, 2021

BooVI: Provably Efficient Bootstrapped Value Iteration.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2019
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy.
CoRR, 2019

Neural Trust Region/Proximal Policy Optimization Attains Globally Optimal Policy.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Off-Policy Evaluation and Learning from Logged Bandit Feedback: Error Reduction via Surrogate Policy.
Proceedings of the 7th International Conference on Learning Representations, 2019


  Loading...