We stand with Ukraine

We stand with Ukraine

Kaiyan Zhang

Orcid: 0000-0002-8059-1124

According to our database¹, Kaiyan Zhang authored at least 47 papers between 2021 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Efficient Diffusion Models: A Comprehensive Survey From Principles to Practices.

[BibT_eX]

[DOI]

,

,

,

Liangliang Zhao

,

,

,

,

,

,

,

IEEE Trans. Pattern Anal. Mach. Intell., September, 2025

Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

FlowRL: Matching Reward Distributions for LLM Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Shanghang Zhang

,

,

,

,

CoRR, September, 2025

A Survey of Reinforcement Learning for Large Reasoning Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

AdsQA: Towards Advertisement Video Understanding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

Towards a Unified View of Large Language Model Post-Training.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

SSRL: Self-Search Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, August, 2025

ReviewRL: Towards Automated Scientific Review with RL.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, August, 2025

SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Joseph Chee Chang

,

,

,

,

Charles McGrady

,

,

,

,

Hannaneh Hajishirzi

,

,

CoRR, July, 2025

Automating Exploratory Multiomics Research via Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, June, 2025

Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, May, 2025

Semantic Correspondence: Unified Benchmarking and a Strong Baseline.

[BibT_eX]

[DOI]

,

,

,

CoRR, May, 2025

TTRL: Test-Time Reinforcement Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, April, 2025

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, April, 2025

Video-T1: Test-Time Scaling for Video Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, March, 2025

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, March, 2025

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, February, 2025

Process Reinforcement through Implicit Rewards.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, February, 2025

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, January, 2025

OpenPRM: Building Open-domain Process-based Reward Models with Preference Trees.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Fusing Highly Specialized Language Models for Comprehensive Expertise.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Retrieval-Augmented Visual Question Answering via Built-in Autoregressive Search Engines.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2024

How to Synthesize Text Data without Model Collapse?

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, 2024

Evolution of Thought: Diverse and High-Quality Reasoning via Multi-Objective Optimization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

Automating Exploratory Proteomics Research via Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, 2024

Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices.

[BibT_eX]

[DOI]

,

,

,

Liangliang Zhao

,

,

,

,

,

,

CoRR, 2024

Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2024

Towards Building Specialized Generalist AI with System 1 and System 2 Fusion.

[BibT_eX]

[DOI]

,

,

CoRR, 2024

Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2024

Online DPO: Online Direct Preference Optimization with Fast-Slow Chasing.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2024

UltraMedical: Building Specialized Generalists in Biomedicine.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SMR: State Memory Replay for Long Sequence Modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Generative Multi-Modal Knowledge Retrieval with Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

A Static and Dynamic Attention Framework for Multi Turn Dialogue Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

ACM Trans. Inf. Syst., January, 2023

A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation.

[BibT_eX]

[DOI]

,

,

,

ACM Trans. Inf. Syst., 2023

Large Language Models are Zero Shot Hypothesis Proposers.

[BibT_eX]

[DOI]

,

,

,

,

,

,

CoRR, 2023

PaD: Program-aided Distillation Specializes Large Models in Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2023

Demo: Domino: A High-Precision Performance Monitoring and Analysis Platform for Client Applications.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the 21st Annual International Conference on Mobile Systems, 2023

CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2021

BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Loading...