Kaihang Pan
Orcid: 0009-0001-2967-4573
According to our database1,
Kaihang Pan
authored at least 22 papers
between 2023 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities.
CoRR, June, 2025
FocusDiff: Advancing Fine-Grained Text-Image Alignment for Autoregressive Visual Generation through RL.
CoRR, June, 2025
Unlocking Aha Moments via Reinforcement Learning: Advancing Collaborative Visual Comprehension and Generation.
CoRR, June, 2025
CoRR, May, 2025
Reasoning Physical Video Generation with Diffusion Timestep Tokens via Reinforcement Learning.
CoRR, April, 2025
IEEE Trans. Multim., 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
STEP: Enhancing Video-LLMs' Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
RustGraph: Robust Anomaly Detection in Dynamic Graphs by Jointly Learning Structural-Temporal Dependency.
IEEE Trans. Knowl. Data Eng., July, 2024
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Unified Generative and Discriminative Training for Multi-modal Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
CoRR, 2023
CoRR, 2023
Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions.
CoRR, 2023
Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization for Few-shot Generalization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023