Bin Huang

Orcid: 0009-0000-2504-3689

Affiliations:
  • Tsinghua University, Beijing, China


According to our database, Bin Huang authored at least 9 papers between 2020 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Multi-Modal Generative AI: Multi-Modal LLMs, Diffusions, and the Unification.
IEEE Trans. Circuits Syst. Video Technol., April, 2026

Reflective Cross-Granularity Grounding with Preference Optimization for Long Video Understanding.
Proceedings of the 2026 International Conference on Multimedia Retrieval, 2026

2025
Identity-Text Video Corpus Grounding.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Multi-Modal Generative AI: Multi-modal LLM, Diffusion and Beyond.
CoRR, 2024

VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding.
Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Neighbor Does Matter: Curriculum Global Positive-Negative Sampling for Vision-Language Pre-training.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

VTimeLLM: Empower LLM to Grasp Video Moments.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Global-Local GraphFormer: Towards Better Understanding of User Intentions in Sequential Recommendation.
Proceedings of the ACM Multimedia Asia 2023, 2023

2020
Commonsense Learning: An Indispensable Path towards Human-centric Multimedia.
HuMA'20: Proceedings of the 1st International Workshop on Human-centric Multimedia Analysis, 2020

