Kaichen Zhang

Orcid: 0000-0002-3353-9307

According to our database, Kaichen Zhang authored at least 25 papers between 2013 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling.
CoRR, April, 2026

UniME-V2: MLLM-as-a-Judge for Universal Multimodal Embedding Learning.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
DINO-RotateMatch: A Rotation-Aware Deep Framework for Robust Image Matching in Large-Scale 3D Reconstruction.
CoRR, December, 2025

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling.
CoRR, November, 2025

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe.
CoRR, November, 2025

RSPO: Risk-Seeking Policy Optimization for Pass@k and Max@k Metrics in Large Language Models.
CoRR, August, 2025

GVPO: Group Variance Policy Optimization for Large Language Model Post-Training.
CoRR, April, 2025

Long Context Transfer from Language to Vision.
Trans. Mach. Learn. Res., 2025

LLaVA-OneVision: Easy Visual Task Transfer.
Trans. Mach. Learn. Res., 2025

LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Streaming Multi-agent Pathfinding.
Proceedings of the Thirty-Fourth International Joint Conference on Artificial Intelligence, 2025

MixEval-X: Any-to-any Evaluations from Real-world Data Mixture.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024
Large Multi-modal Models Can Interpret Features in Large Multi-modal Models.
CoRR, 2024

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures.
CoRR, 2024

LLaVA-OneVision: Easy Visual Task Transfer.
CoRR, 2024

WorldQA: Multimodal World Knowledge in Videos through Long-Chain Reasoning.
CoRR, 2024

The Impact of Generative Artificial Intelligence on Market Equilibrium: Evidence from a Natural Experiment.
Proceedings of the Web and Internet Economics - 20th International Conference, 2024

Optimized Cost Per Click in Online Advertising: A Theoretical Analysis.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Robust Reward Placement under Uncertainty.
Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, 2024

2023
The Impact of Generative Artificial Intelligence.
CoRR, 2023

2021
Application of 3D Technology in Garment Design Template.
Proceedings of the 2021 International Conference on Machine Learning and Big Data Analytics for IoT Security and Privacy, 2021

2020
Geodemographic Influence Maximization.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

2016
Outage performance of cognitive AF relay networks with direct link and heterogeneous non-identical constraints.
Wirel. Commun. Mob. Comput., 2016

2013
A 0.13-µm CMOS 0.1-12GHz active balun-LNA for multi-standard applications.
IEICE Electron. Express, 2013

