Yang Zhao

Affiliations:
  • Zhejiang University, College of Computer Science, China


According to our database1, Yang Zhao authored at least 21 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Data-Efficiently Learn Large Language Model for Universal 3D Scene Perception.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

2024
Extending Multi-modal Contrastive Representations.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Multi-Modal Domain Adaptation Across Video Scenes for Temporal Video Grounding.
CoRR, 2023

Chat-3D v2: Bridging 3D Scene and Large Language Models with Object Identifiers.
CoRR, 2023

Extending Multi-modal Contrastive Representations.
CoRR, 2023

Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes.
CoRR, 2023

Connecting Multi-modal Contrastive Representations.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

DATE: Domain Adaptive Product Seeker for E-Commerce.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Scene-robust Natural Language Video Localization via Learning Domain-invariant Representations.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
AntCritic: Argument Mining for Free-Form and Visually-Rich Financial Comments.
CoRR, 2022

AntPivot: Livestream Highlight Detection via Hierarchical Attention Mechanism.
CoRR, 2022

Towards Effective Multi-Modal Interchanges in Zero-Resource Sounding Object Localization.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Video-Guided Curriculum Learning for Spoken Video Grounding.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021
Cascaded Prediction Network via Segment Tree for Temporal Video Grounding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Learning From Multi-Dimensional Partial Labels.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Where Does It Exist: Spatio-Temporal Video Grounding for Multi-Form Sentences.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Discriminative and Correlative Partial Multi-Label Learning.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019


  Loading...