Kaiyan Zhang
Orcid: 0000-0002-8059-1124
According to our database1,
Kaiyan Zhang
authored at least 48 papers
between 2021 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
IEEE Trans. Pattern Anal. Mach. Intell., September, 2025
Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models.
CoRR, September, 2025
SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks.
CoRR, July, 2025
Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation.
CoRR, May, 2025
CoRR, April, 2025
CoRR, March, 2025
CoRR, February, 2025
Proceedings of the Forty-second International Conference on Machine Learning, 2025
Proceedings of the Forty-second International Conference on Machine Learning, 2025
Proceedings of the Forty-second International Conference on Machine Learning, 2025
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization.
Proceedings of the Forty-second International Conference on Machine Learning, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Retrieval-Augmented Visual Question Answering via Built-in Autoregressive Search Engines.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Evolution of Thought: Diverse and High-Quality Reasoning via Multi-Objective Optimization.
CoRR, 2024
CoRR, 2024
Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation.
CoRR, 2024
CoRR, 2024
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.
CoRR, 2024
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
ACM Trans. Inf. Syst., January, 2023
ACM Trans. Inf. Syst., 2023
Demo: Domino: A High-Precision Performance Monitoring and Analysis Platform for Client Applications.
Proceedings of the 21st Annual International Conference on Mobile Systems, 2023
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
2021
BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021