Zhaochong An

Orcid: 0009-0007-5985-7470

According to our database¹, Zhaochong An authored at least 24 papers between 2020 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

RGB-D Indiscernible Object Counting in Underwater Scenes.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., May, 2026

Stitched Value Model for Diffusion Alignment.

[BibT_eX]

[DOI]

CoRR, May, 2026

VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward.

[BibT_eX]

[DOI]

Marta Tintore Gazulla

CoRR, March, 2026

Video Understanding: From Geometry and Semantics to Unified Models.

[BibT_eX]

[DOI]

CoRR, March, 2026

Revisiting the Perception-Distortion Trade-off with Spatial-Semantic Guided Super-Resolution.

[BibT_eX]

[DOI]

CoRR, March, 2026

SCOPE: Scene-Contextualized Incremental Few-Shot 3D Segmentation.

[BibT_eX]

[DOI]

Abdesselam Bouzerdoum

Lu Yin

Na Zhao

Xiatian Zhu

CoRR, March, 2026

VecGlypher: Unified Vector Glyph Generation with Language Models.

[BibT_eX]

[DOI]

CoRR, February, 2026

Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning.

[BibT_eX]

[DOI]

CoRR, January, 2026

2025

HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming.

[BibT_eX]

[DOI]

Juan-Manuel Pérez-Rúa

CoRR, December, 2025

OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory.

[BibT_eX]

[DOI]

CoRR, December, 2025

Scaling Zero-Shot Reference-to-Video Generation.

[BibT_eX]

[DOI]

CoRR, December, 2025

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models.

[BibT_eX]

[DOI]

Juan-Manuel Pérez-Rúa

CoRR, December, 2025

Cultural Evaluations of Vision-Language Models Have a Lot to Learn from Cultural Theory.

[BibT_eX]

[DOI]

CoRR, May, 2025

ChatMotion: A Multimodal Multi-Agent for Human Motion Analysis.

[BibT_eX]

[DOI]

CoRR, February, 2025

Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

What You Have is What You Track: Adaptive and Robust Multimodal Tracking.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer.

[BibT_eX]

[DOI]

Proceedings of the International Conference on 3D Vision, 2025

2024

kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2024

Rethinking Few-shot 3D Point Cloud Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

Object Segmentation by Mining Cross-Modal Semantics.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

Indiscernible Object Counting in Underwater Scenes.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the 34th British Machine Vision Conference 2023, 2023

2020

EM-RBR: a reinforced framework for knowledge graph completion from reasoning perspective.

[BibT_eX]

[DOI]

CoRR, 2020

Zhaochong An

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...