Bin Zhu
ORCID: 0000-0002-9213-2611
Affiliations:
- Singapore Management University, School of Computing and Information Systems, Singapore
- University of Bristol, UK (former)
- City University of Hong Kong, Department of Computer Science, Kowloon Tong, Hong Kong (PhD 2021)
According to our database, Bin Zhu authored at least 50 papers between 2019 and 2026.
Links
Online presence:
- on orcid.org
- on dl.acm.org
Bibliography
2026
Spatiotemporal Sycophancy: Negation-Based Gaslighting in Video Large Language Models.
CoRR, April, 2026
CoRR, April, 2026
Enhancing Action and Ingredient Modeling for Semantically Grounded Recipe Generation.
CoRR, February, 2026
ACM Trans. Multim. Comput. Commun. Appl., January, 2026
ThinkMatter: Panoramic-Aware Instructional Semantics for Monocular Vision-and-Language Navigation.
IEEE Trans. Image Process., 2026
Proceedings of the 2026 International Conference on Multimedia Retrieval, 2026
SAM3-LiteText: An Anatomical Study of the SAM3 Text Encoder for Efficient Vision-Language Segmentation.
Proceedings of the 2026 International Conference on Multimedia Retrieval, 2026
Proceedings of the 2026 International Conference on Multimedia Retrieval, 2026
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026
Actor-Critic for Continuous Action Chunks: A Reinforcement Learning Framework for Long-Horizon Robotic Manipulation with Sparse Reward.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026
2025
CVLP-NaVD: Contrastive Visual-language Pre-training Models for Non-annotated Visual Description.
ACM Trans. Multim. Comput. Commun. Appl., November, 2025
CoRR, November, 2025
CoRR, April, 2025
CoRR, January, 2025
IEEE Trans. Multim., 2025
From Canteen Food to Daily Meals: Generalizing Food Recognition to More Practical Scenarios.
IEEE Trans. Multim., 2025
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2025
Look Before You Decide: Prompting Active Deduction of MLLMs for Assumptive Reasoning.
Proceedings of the 33rd ACM International Conference on Multimedia, 2025
Proceedings of the 2025 International Conference on Multimedia Retrieval, 2025
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025
From Holistic to Localized: Local Enhanced Adapters for Efficient Visual Instruction Fine-Tuning.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
OSCAR: Object Status and Contextual Awareness for Recipes to Support Non-Visual Cooking.
Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2025
Exploring Object Status Recognition for Recipe Progress Tracking in Non-Visual Cooking.
Proceedings of the 27th International ACM SIGACCESS Conference on Computers and Accessibility, 2025
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025
2024
Efficient Unsupervised Video Hashing With Contextual Modeling and Structural Controlling.
IEEE Trans. Multim., 2024
Visual Cue Enhancement and Dual Low-Rank Adaptation for Efficient Visual Instruction Fine-Tuning.
CoRR, 2024
CoRR, 2024
Proceedings of the 6th ACM International Conference on Multimedia in Asia, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Proceedings of the Computer Vision - ECCV 2024 Workshops, 2024
Proceedings of the Computer Vision - ECCV 2024, 2024
2023
CoRR, 2023
Proceedings of the 31st ACM International Conference on Multimedia, 2023
2022
Learning From Web Recipe-Image Pairs for Food Recognition: Problem, Baselines and Performance.
IEEE Trans. Multim., 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Unsupervised Video Hashing with Multi-granularity Contextualization and Multi-structure Preservation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022
2021
Learning to Match Anchor-Target Video Pairs With Dual Attentional Holographic Networks.
IEEE Trans. Image Process., 2021
IEEE Trans. Image Process., 2021
2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019