Lunjun Zhang

According to our database1, Lunjun Zhang authored at least 13 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Evolutionary System Prompt Learning for Reinforcement Learning in LLMs.
CoRR, February, 2026

EMA Policy Gradient: Taming Reinforcement Learning for LLMs with EMA Anchor and Top-k KL.
CoRR, February, 2026

2025
Thinking vs. Doing: Agents that Reason by Scaling Test-Time Interaction.
CoRR, June, 2025

D2 Actor Critic: Diffusion Actor Meets Distributional Critic.
Trans. Mach. Learn. Res., 2025

Generative Verifiers: Reward Modeling as Next-Token Prediction.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Learning to Drive via Asymmetric Self-Play.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion.
CoRR, 2023

Towards Unsupervised Object Detection from LiDAR Point Clouds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Learning Realistic Traffic Agents in Closed-loop.
Proceedings of the Conference on Robot Learning, 2023

2022
Understanding Hindsight Goal Relabeling Requires Rethinking Divergence Minimization.
CoRR, 2022

2021
World Model as a Graph: Learning Latent Landmarks for Planning.
Proceedings of the 38th International Conference on Machine Learning, 2021

2020
Learning Intrinsic Rewards as a Bi-Level Optimization Problem.
Proceedings of the Thirty-Sixth Conference on Uncertainty in Artificial Intelligence, 2020


  Loading...