Guolong Wang

Orcid: 0000-0003-4874-2639

Affiliations:
  • University of International Business and Economics, Beijing, China
  • Tsinghua University, School of Software, TNList, Beijing, China (former)


According to our database, Guolong Wang authored at least 29 papers between 2015 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Unsupervised Video Moment Retrieval with Knowledge-Based Pseudo-Supervision Construction.
ACM Trans. Inf. Syst., January, 2025

2024
Progressive reinforcement learning for video summarization.
Inf. Sci., January, 2024

Element-Centered Multi-granularity Network for Dense Video Captioning.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Boosting Text-to-Video Generative Model with MLLMs Feedback.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Multimodal Large Language Models Make Text-to-Image Generative Models Align Better.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Text-guided Multi-Task Image Aesthetic Quality Assessment.
Proceedings of the 2nd International Workshop on Multimedia Content Generation and Evaluation: New Methods and Practice, 2024

Routing Evidence for Unseen Actions in Video Moment Retrieval.
Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024

Improving Image Reconstruction and Synthesis by Balancing the Optimization from Frequency Perspective.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Keep Knowledge in Perception: Zero-Shot Image Aesthetic Assessment.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

2023
Reducing 0s bias in video moment retrieval with a circular competence-based captioner.
Inf. Process. Manag., 2023

Instance-Aware Hierarchical Structured Policy for Prompt Learning in Vision-Language Models.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Self-Supervised Graph Convolution for Video Moment Retrieval.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2023, 2023

2022
Prompt-based Zero-shot Video Moment Retrieval.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

2021
Dense Video Captioning for Incomplete Videos.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2021, 2021

2020
Learning to Select Elements for Graphic Design.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

Towards Personalized Aesthetic Image Caption.
Proceedings of the 2020 International Joint Conference on Neural Networks, 2020

2019
Delving into Precise Attention in Image Captioning.
Proceedings of the Neural Information Processing - 26th International Conference, 2019

2018
Multi-focus Image Fusion using Fully Convolutional Two-stream Network for Visual Sensors.
KSII Trans. Internet Inf. Syst., 2018

Collision-Free LSTM for Human Trajectory Prediction.
Proceedings of the MultiMedia Modeling - 24th International Conference, 2018

Collaborative and Attentive Learning for Personalized Image Aesthetic Assessment.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Bridge Video and Text with Cascade Syntactic Structure.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Fundamentals of Software Culture.
Springer, ISBN: 978-981-13-0700-3, 2018

2017
Semantic R-CNN for Natural Language Object Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Semantic Sequence Analysis for Human Activity Prediction.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Multi-modality Fusion Network for Action Recognition.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Recognizing Emotions Based on Human Actions in Videos.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017

2016
Recognize human activities from multi-part missing videos.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Human activities prediction by learning combinatorial sparse representations.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

2015
Beyond HOG: Learning Local Parts for Object Detection.
Proceedings of the Advances in Multimedia Information Processing - PCM 2015, 2015

