Lunjun Zhang

According to our database¹, Lunjun Zhang authored at least 14 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Evolutionary System Prompt Learning for Reinforcement Learning in LLMs.

[BibT_eX]

[DOI]

Lunjun Zhang

Ryan Chen

Bradly C. Stadie

CoRR, February, 2026

EMA Policy Gradient: Taming Reinforcement Learning for LLMs with EMA Anchor and Top-k KL.

[BibT_eX]

[DOI]

Lunjun Zhang

Jimmy Ba

CoRR, February, 2026

2025

Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction.

[BibT_eX]

[DOI]

CoRR, June, 2025

D2 Actor Critic: Diffusion Actor Meets Distributional Critic.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2025

Thinking vs. Doing: Improving Agent Reasoning by Scaling Test-Time Interaction.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Generative Verifiers: Reward Modeling as Next-Token Prediction.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning to Drive via Asymmetric Self-Play.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion.

[BibT_eX]

[DOI]

CoRR, 2023

Towards Unsupervised Object Detection from LiDAR Point Clouds.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Realistic Traffic Agents in Closed-loop.

[BibT_eX]

[DOI]

Proceedings of the Conference on Robot Learning, 2023

2022

Understanding Hindsight Goal Relabeling Requires Rethinking Divergence Minimization.

[BibT_eX]

[DOI]

Lunjun Zhang

Bradly C. Stadie

CoRR, 2022

2021

World Model as a Graph: Learning Latent Landmarks for Planning.

[BibT_eX]

[DOI]

Lunjun Zhang

Ge Yang

Bradly C. Stadie

Proceedings of the 38th International Conference on Machine Learning, 2021

2020

Learning Intrinsic Rewards as a Bi-Level Optimization Problem.

[BibT_eX]

[DOI]

Bradly C. Stadie

Lunjun Zhang

Jimmy Ba

Proceedings of the Thirty-Sixth Conference on Uncertainty in Artificial Intelligence, 2020

Lunjun Zhang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...