We stand with Ukraine

We stand with Ukraine

Xuekai Zhu

According to our database¹, Xuekai Zhu authored at least 23 papers between 2023 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

FlowRL: Matching Reward Distributions for LLM Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Shanghang Zhang

,

,

,

,

CoRR, September, 2025

A Survey of Reinforcement Learning for Large Reasoning Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

Towards a Unified View of Large Language Model Post-Training.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

SSRL: Self-Search Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, August, 2025

Reasoning with Exploration: An Entropy Perspective.

[BibT_eX]

[DOI]

,

,

,

,

,

Zhenliang Zhang

,

CoRR, June, 2025

DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, May, 2025

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Eric Hanchen Jiang

,

,

,

,

CoRR, May, 2025

TTRL: Test-Time Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, April, 2025

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, March, 2025

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

How to Synthesize Text Data without Model Collapse?

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

OpenPRM: Building Open-domain Process-based Reward Models with Preference Trees.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Critical Data Size of Language Models from a Grokking Perspective.

[BibT_eX]

[DOI]

,

,

,

CoRR, 2024

Advancing Drug-Target Interaction prediction with BERT and subsequence embedding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Comput. Biol. Chem., 2024

UltraMedical: Building Specialized Generalists in Biomedicine.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

2023

FragDPI: a novel drug-protein interaction prediction model based on fragment understanding and unified coding.

[BibT_eX]

[DOI]

,

,

,

,

,

Frontiers Comput. Sci., October, 2023

FingerDTA: A Fingerprint-Embedding Framework for Drug-Target Binding Affinity Prediction.

[BibT_eX]

[DOI]

,

,

,

,

,

Big Data Min. Anal., March, 2023

PaD: Program-aided Distillation Specializes Large Models in Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

StoryTrans: Non-Parallel Story Author-Style Transfer with Discourse Representations and Content Enhancing.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Loading...