Zhaochong An

ORCID: 0009-0007-5985-7470

According to our database, Zhaochong An authored at least 24 papers between 2020 and 2026.

Bibliography

2026
RGB-D Indiscernible Object Counting in Underwater Scenes.
Int. J. Comput. Vis., May, 2026

Stitched Value Model for Diffusion Alignment.
CoRR, May, 2026

VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward.
CoRR, March, 2026

Video Understanding: From Geometry and Semantics to Unified Models.
CoRR, March, 2026

Revisiting the Perception-Distortion Trade-off with Spatial-Semantic Guided Super-Resolution.
CoRR, March, 2026

SCOPE: Scene-Contextualized Incremental Few-Shot 3D Segmentation.
CoRR, March, 2026

VecGlypher: Unified Vector Glyph Generation with Language Models.
CoRR, February, 2026

Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning.
CoRR, January, 2026

2025
HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming.
CoRR, December, 2025

OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory.
CoRR, December, 2025

Scaling Zero-Shot Reference-to-Video Generation.
CoRR, December, 2025

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models.
CoRR, December, 2025

Cultural Evaluations of Vision-Language Models Have a Lot to Learn from Cultural Theory.
CoRR, May, 2025

ChatMotion: A Multimodal Multi-Agent for Human Motion Analysis.
CoRR, February, 2025

Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

What You Have is What You Track: Adaptive and Robust Multimodal Tracking.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer.
Proceedings of the International Conference on 3D Vision, 2025

2024
kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies.
Trans. Mach. Learn. Res., 2024

Rethinking Few-shot 3D Point Cloud Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Object Segmentation by Mining Cross-Modal Semantics.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Indiscernible Object Counting in Underwater Scenes.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Temporal-aware Hierarchical Mask Classification for Video Semantic Segmentation.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2020
EM-RBR: a reinforced framework for knowledge graph completion from reasoning perspective.
CoRR, 2020
