Kaiyan Zhang

ORCID: 0000-0002-1014-8442

According to our database, Kaiyan Zhang authored at least 39 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
Efficient Diffusion Models: A Comprehensive Survey From Principles to Practices.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2025

SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks.
CoRR, July, 2025

Automating Exploratory Multiomics Research via Language Models.
CoRR, June, 2025

Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation.
CoRR, May, 2025

Semantic Correspondence: Unified Benchmarking and a Strong Baseline.
CoRR, May, 2025

TTRL: Test-Time Reinforcement Learning.
CoRR, April, 2025

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning.
CoRR, April, 2025

Video-T1: Test-Time Scaling for Video Generation.
CoRR, March, 2025

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models.
CoRR, March, 2025

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling.
CoRR, February, 2025

Process Reinforcement through Implicit Rewards.
CoRR, February, 2025

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding.
CoRR, January, 2025

OpenPRM: Building Open-domain Process-based Reward Models with Preference Trees.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Fusing Highly Specialized Language Models for Comprehensive Expertise.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Retrieval-Augmented Visual Question Answering via Built-in Autoregressive Search Engines.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization.
CoRR, 2024

How to Synthesize Text Data without Model Collapse?
CoRR, 2024

Evolution of Thought: Diverse and High-Quality Reasoning via Multi-Objective Optimization.
CoRR, 2024

Automating Exploratory Proteomics Research via Language Models.
CoRR, 2024

Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices.
CoRR, 2024

Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation.
CoRR, 2024

Towards Building Specialized Generalist AI with System 1 and System 2 Fusion.
CoRR, 2024

Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.
CoRR, 2024

Online DPO: Online Direct Preference Optimization with Fast-Slow Chasing.
CoRR, 2024

Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process.
CoRR, 2024

UltraMedical: Building Specialized Generalists in Biomedicine.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

SMR: State Memory Replay for Long Sequence Modeling.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Generative Multi-Modal Knowledge Retrieval with Large Language Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
A Static and Dynamic Attention Framework for Multi Turn Dialogue Generation.
ACM Trans. Inf. Syst., January, 2023

A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation.
ACM Trans. Inf. Syst., 2023

Large Language Models are Zero Shot Hypothesis Proposers.
CoRR, 2023

PaD: Program-aided Distillation Specializes Large Models in Reasoning.
CoRR, 2023

Demo: Domino: A High-Precision Performance Monitoring and Analysis Platform for Client Applications.
Proceedings of the 21st Annual International Conference on Mobile Systems, 2023

CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2021
BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

