Guolong Wang

Xun Wu

Junchi Yan

Inf. Sci., January, 2024

Element-Centered Multi-granularity Network for Dense Video Captioning.

Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Boosting Text-to-Video Generative Model with MLLMs Feedback.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Multimodal Large Language Models Make Text-to-Image Generative Models Align Better.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Text-guided Multi-Task Image Aesthetic Quality Assessment.

Proceedings of the 2nd International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice, 2024

Routing Evidence for Unseen Actions in Video Moment Retrieval.

Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Improving Image Reconstruction and Synthesis by Balancing the Optimization from Frequency Perspective.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Keep Knowledge in Perception: Zero-Shot Image Aesthetic Assessment.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

2023

Reducing 0s bias in video moment retrieval with a circular competence-based captioner.

Inf. Process. Manag., 2023

Instance-Aware Hierarchical Structured Policy for Prompt Learning in Vision-Language Models.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Self-Supervised Graph Convolution for Video Moment Retrieval.

Proceedings of the Artificial Neural Networks and Machine Learning, 2023

2022

Prompt-based Zero-shot Video Moment Retrieval.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021

Dense Video Captioning for Incomplete Videos.

Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2021, 2021

2020

Learning to Select Elements for Graphic Design.

Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

Towards Personalized Aesthetic Image Caption.

Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

2019

Delving into Precise Attention in Image Captioning.

Proceedings of the Neural Information Processing - 26th International Conference, 2019

2018

Multi-focus Image Fusion using Fully Convolutional Two-stream Network for Visual Sensors.

KSII Trans. Internet Inf. Syst., 2018

Collision-Free LSTM for Human Trajectory Prediction.

Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Collaborative and Attentive Learning for Personalized Image Aesthetic Assessment.

Junchi Yan

Kouemo Ngayo Anatoli Dimitrov

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Bridge Video and Text with Cascade Syntactic Structure.

Proceedings of the 27th International Conference on Computational Linguistics, 2018

Fundamentals of Software Culture

Wenhui Yu

Springer, ISBN: 978-981-13-0700-3, 2018

2017

Semantic R-CNN for Natural Language Object Detection.

Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Semantic Sequence Analysis for Human Activity Prediction.

Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Multi-modality Fusion Network for Action Recognition.

Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Recognizing Emotions Based on Human Actions in Videos.

Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

2016

Recognize human activities from multi-part missing videos.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Human activities prediction by learning combinatorial sparse representations.
