Xuekai Zhu

According to our database1, Xuekai Zhu authored at least 18 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Reasoning with Exploration: An Entropy Perspective.
CoRR, June, 2025

DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving.
CoRR, May, 2025

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space.
CoRR, May, 2025

TTRL: Test-Time Reinforcement Learning.
CoRR, April, 2025

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models.
CoRR, March, 2025

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding.
CoRR, January, 2025

OpenPRM: Building Open-domain Process-based Reward Models with Preference Trees.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization.
CoRR, 2024

How to Synthesize Text Data without Model Collapse?
CoRR, 2024

Critical Data Size of Language Models from a Grokking Perspective.
CoRR, 2024

Advancing Drug-Target Interaction prediction with BERT and subsequence embedding.
Comput. Biol. Chem., 2024

UltraMedical: Building Specialized Generalists in Biomedicine.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

2023
FragDPI: a novel drug-protein interaction prediction model based on fragment understanding and unified coding.
Frontiers Comput. Sci., October, 2023

FingerDTA: A Fingerprint-Embedding Framework for Drug-Target Binding Affinity Prediction.
Big Data Min. Anal., March, 2023

PaD: Program-aided Distillation Specializes Large Models in Reasoning.
CoRR, 2023

CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

StoryTrans: Non-Parallel Story Author-Style Transfer with Discourse Representations and Content Enhancing.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023


  Loading...