Yiyi Zhou
Orcid: 0000-0002-5110-4526
According to our database1,
Yiyi Zhou
authored at least 86 papers
between 2015 and 2026.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2026
Pattern Recognit., 2026
2025
IEEE Trans. Pattern Anal. Mach. Intell., July, 2025
IEEE Trans. Neural Networks Learn. Syst., April, 2025
CycleTrans: Learning Neutral Yet Discriminative Features via Cycle Construction for Visible- Infrared Person Re-Identification.
IEEE Trans. Neural Networks Learn. Syst., March, 2025
AdaFlow: Efficient Long Video Editing via Adaptive Attention Slimming And Keyframe Selection.
CoRR, February, 2025
Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy.
CoRR, February, 2025
Secure Service Function Chain Provisioning for Task Offloading in Device-Edge-Cloud Computing.
IEEE Trans. Inf. Forensics Secur., 2025
Pattern Recognit., 2025
Optical remote sensing image salient object detection via bidirectional cross-attention and attention restoration.
Pattern Recognit., 2025
IEEE Access, 2025
DDoS Attack Detection in SDN-Assisted Federated Learning Environment Based on Contrastive Learning.
IEEE Access, 2025
Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
DViN: Dynamic Visual Routing Network for Weakly Supervised Referring Expression Comprehension.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
What Kind of Visual Tokens Do We Need? Training-Free Visual Token Pruning for Multi-Modal Large Language Models from the Perspective of Graph.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025
2024
Int. J. Comput. Vis., January, 2024
A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression Comprehension.
IEEE Trans. Multim., 2024
Deep hybrid transformer network for robust modulation classification in wireless communications.
Knowl. Based Syst., 2024
Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings.
CoRR, 2024
Routing Experts: Learning to Route Dynamic Experts in Multi-modal Large Language Models.
CoRR, 2024
Not All Attention is Needed: Parameter and Computation Efficient Transfer Learning for Multi-modal Large Language Models.
CoRR, 2024
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
QueryMatch: A Query-based Contrastive Learning Framework for Weakly Supervised Visual Grounding.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024
Adapting Pre-trained Language Models to Vision-Language Tasksvia Dynamic Visual Prompting.
Proceedings of the International Joint Conference on Neural Networks, 2024
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization.
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
A Real-Time Global Inference Network for One-Stage Referring Expression Comprehension.
IEEE Trans. Neural Networks Learn. Syst., 2023
IEEE Trans. Multim., 2023
IEEE Trans. Multim., 2023
NICE: Improving Panoptic Narrative Detection and Segmentation with Cascading Collaborative Learning.
CoRR, 2023
M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce.
CoRR, 2023
CoRR, 2023
Adapting Pre-trained Language Models to Vision-Language Tasks via Dynamic Visual Prompting.
CoRR, 2023
CoRR, 2023
IEEE Access, 2023
Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
PixelFace+: Towards Controllable Face Generation and Manipulation with Text Descriptions and Segmentation Masks.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 8th International Conference on Information Systems Engineering, 2023
RefTeacher: A Strong Baseline for Semi-Supervised Referring Expression Comprehension.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
RefCLIP: A Universal Teacher for Weakly Supervised Referring Expression Comprehension.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
IEEE Trans. Multim., 2022
Towards Lightweight Transformer Via Group-Wise Transformation for Vision-and-Language Tasks.
IEEE Trans. Image Process., 2022
IEEE Trans. Image Process., 2022
IEEE Trans. Pattern Anal. Mach. Intell., 2022
CycleTrans: Learning Neutral yet Discriminative Features for Visible-Infrared Person Re-Identification.
CoRR, 2022
What Goes beyond Multi-modal Fusion in One-stage Referring Expression Comprehension: An Empirical Study.
CoRR, 2022
CoRR, 2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
ACM Trans. Intell. Syst. Technol., 2021
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
K-armed Bandit based Multi-Modal Network Architecture Search for Visual Question Answering.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020
Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
Social Media Based Topic Modeling for Smart Campus: A Deep Topical Correlation Analysis Method.
IEEE Access, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019
2017
Bayesian Estimation of a Dynamic Model of Two-Sided Markets: Application to the U.S. Video Game Industry.
Manag. Sci., 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
2016
Frontiers Comput. Sci., 2016
2015
Proceedings of the Data Science - Second International Conference, 2015