Zhenheng Yang
Orcid: 0000-0003-0303-5885
According to our database, Zhenheng Yang authored at least 49 papers between 2016 and 2026.
Bibliography
2026
VirtueBench: Evaluating Trustworthiness under Uncertainty in Long Video Understanding.
CoRR, March, 2026
UniWeTok: An Unified Binary Tokenizer with Codebook Size 2^128 for Unified Multimodal Large Language Model.
CoRR, February, 2026
CoRR, February, 2026
CoRR, January, 2026
Dynamic Content Moderation in Livestreams: Combining Supervised Classification with MLLM-Boosted Similarity Matching.
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
CoRR, December, 2025
CoRR, December, 2025
The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation.
CoRR, November, 2025
CoRR, October, 2025
CoRR, October, 2025
UniCode^2: Cascaded Large-scale Codebooks for Unified Multimodal Understanding and Generation.
CoRR, June, 2025
CoRR, June, 2025
UniRL: Self-Improving Unified Multimodal Models via Supervised and Reinforcement Learning.
CoRR, May, 2025
CoRR, May, 2025
Cream of the Crop: Harvesting Rich, Scalable and Transferable Multi-Modal Data for Instruction Fine-Tuning.
CoRR, March, 2025
CoRR, February, 2025
COEF-VQ: Cost-Efficient Video Quality Understanding through a Cascaded Multimodal LLM Framework.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Star: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
Every Pixel Counts ++: Joint Learning of Geometry and Motion with 3D Holistic Understanding.
IEEE Trans. Pattern Anal. Mach. Intell., 2020
IEEE Micro, 2020
CoRR, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
UnOS: Unified Unsupervised Optical-Flow and Stereo-Depth Estimation by Watching Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
2018
CoRR, 2018
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018
Every Pixel Counts: Unsupervised Geometry Learning with Holistic 3D Motion Understanding.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
Unsupervised Learning of Geometry From Videos With Edge-Aware Depth-Normal Consistency.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2017
CoRR, 2017
Proceedings of the IEEE International Conference on Computer Vision, 2017
Proceedings of the IEEE International Conference on Computer Vision, 2017
Proceedings of the British Machine Vision Conference 2017, 2017
Proceedings of the British Machine Vision Conference 2017, 2017
Proceedings of the British Machine Vision Conference 2017, 2017
2016
Proceedings of the 23rd International Conference on Pattern Recognition, 2016