Changyao Tian

ORCID: 0000-0002-3285-4671

According to our database, Changyao Tian authored at least 18 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2025

MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning.
CoRR, October, 2025

MetaCaptioner: Towards Generalist Visual Captioning with Open-source Suites.
CoRR, October, 2025

NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints.
CoRR, October, 2025

Sequential Diffusion Language Models.
CoRR, September, 2025

GenExam: A Multidisciplinary Text-to-Image Exam.
CoRR, September, 2025

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency.
CoRR, August, 2025

Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models.
CoRR, July, 2025

Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space.
CoRR, May, 2025

LangBridge: Interpreting Image as a Combination of Language Embeddings.
CoRR, March, 2025

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text.
CoRR, 2024

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer.
CoRR, 2024

Relation-aware deep neural network enables more efficient biomedical knowledge acquisition from massive literature.
AI Open, 2024

Learning 1D Causal Visual Representation with De-focus Attention Networks.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2022
VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition.
CoRR, 2021

