Kaiyan Zhang
Orcid: 0000-0002-1014-8442
According to our database, Kaiyan Zhang authored at least 39 papers between 2021 and 2025.
Bibliography
2025
IEEE Trans. Pattern Anal. Mach. Intell., September, 2025
SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks.
CoRR, July, 2025
Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation.
CoRR, May, 2025
CoRR, April, 2025
CoRR, March, 2025
CoRR, February, 2025
CoRR, January, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
Retrieval-Augmented Visual Question Answering via Built-in Autoregressive Search Engines.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization.
CoRR, 2024
Evolution of Thought: Diverse and High-Quality Reasoning via Multi-Objective Optimization.
CoRR, 2024
CoRR, 2024
Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation.
CoRR, 2024
CoRR, 2024
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.
CoRR, 2024
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
CoGenesis: A Framework Collaborating Large and Small Language Models for Secure Context-Aware Instruction Following.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
ACM Trans. Inf. Syst., January, 2023
ACM Trans. Inf. Syst., 2023
Demo: Domino: A High-Precision Performance Monitoring and Analysis Platform for Client Applications.
Proceedings of the 21st Annual International Conference on Mobile Systems, 2023
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
2021
BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized Data.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021