Guorong Li

Orcid: 0000-0003-3954-2387

According to our database1, Guorong Li authored at least 108 papers between 2007 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Toward Unified Token Learning for Vision-Language Tracking.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

Learning Hierarchical Modular Networks for Video Captioning.
IEEE Trans. Pattern Anal. Mach. Intell., February, 2024

A Novel Intercalibration Method for Fengyun(FY)-3 VIRR Using MERSI Onboard the Same Satellite Based on Pseudo-Invariant Pixels.
IEEE Trans. Geosci. Remote. Sens., 2024

CPR++: Object Localization via Single Coarse Point Supervision.
CoRR, 2024

2023
Multi-Modal Multi-Grained Embedding Learning for Generalized Zero-Shot Video Classification.
IEEE Trans. Circuits Syst. Video Technol., October, 2023

Robust Tracking via Uncertainty-Aware Semantic Consistency.
IEEE Trans. Circuits Syst. Video Technol., April, 2023

Vis-NIR Spectroscopy Combined with GAN Data Augmentation for Predicting Soil Nutrients in Degraded Alpine Meadows on the Qinghai-Tibet Plateau.
Sensors, April, 2023

A Tale of HodgeRank and Spectral Method: Target Attack Against Rank Aggregation is the Fixed Point of Adversarial Game.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

SiamBAN: Target-Aware Tracking With Siamese Box Adaptive Network.
IEEE Trans. Pattern Anal. Mach. Intell., April, 2023

Weakly Supervised Text-based Actor-Action Video Segmentation by Clip-level Multi-instance Learning.
ACM Trans. Multim. Comput. Commun. Appl., January, 2023

Anti-UAV: A Large-Scale Benchmark for Vision-Based UAV Tracking.
IEEE Trans. Multim., 2023

Rethinking Sampling Strategies for Unsupervised Person Re-Identification.
IEEE Trans. Image Process., 2023

Semantic-aware SAM for Point-Prompted Instance Segmentation.
CoRR, 2023

Subject-Oriented Video Captioning.
CoRR, 2023

Weakly Supervised Video Individual CountingWeakly Supervised Video Individual Counting.
CoRR, 2023

Dynamic Erasing Network Based on Multi-Scale Temporal Features for Weakly Supervised Video Anomaly Detection.
CoRR, 2023

P2RBox: A Single Point is All You Need for Oriented Object Detection.
CoRR, 2023

Towards Unified Token Learning for Vision-Language Tracking.
CoRR, 2023

Spatial Self-Distillation for Object Detection with Inaccurate Bounding Boxes.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
A Multi-Frequency DQ Impedance Measurement Algorithm for Single-Phase Vehicle-Grid System in Electrified Railways.
IEEE Trans. Veh. Technol., 2022

Introduction to the Special Issue on Fine-Grained Visual Recognition and Re-Identification.
ACM Trans. Multim. Comput. Commun. Appl., 2022

Weakly Supervised Anomaly Detection in Videos Considering the Openness of Events.
IEEE Trans. Intell. Transp. Syst., 2022

Progressive Multi-resolution Loss for Crowd Counting.
CoRR, 2022

Consistency-Aware Anchor Pyramid Network for Crowd Localization.
CoRR, 2022

Multi-Attention Network for Compressed Video Referring Object Segmentation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

CRNet: Collaborative Refinement Network for Self-Supervised Video Object Segmentation.
Proceedings of the 5th IEEE International Conference on Multimedia Information Processing and Retrieval, 2022

Enhanced Semantic Head for Cascade Instance Segmentation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

Object Localization under Single Coarse Point Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Hierarchical Modular Network for Video Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Self-Supervised Deep TripleNet for Video Object Segmentation.
IEEE Trans. Multim., 2021

Embedding Perspective Analysis Into Multi-Column Convolutional Neural Network for Crowd Counting.
IEEE Trans. Image Process., 2021

Learning Self-Supervised Space-Time CNN for Fast Video Style Transfer.
IEEE Trans. Image Process., 2021

Group Sampling for Unsupervised Person Re-identification.
CoRR, 2021

Anti-UAV: A Large Multi-Modal Benchmark for UAV Tracking.
CoRR, 2021

Cascade Cross-modal Attention Network for Video Actor and Action Segmentation from a Sentence.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

One-Shot Example Videos Localization Network for Weakly-Supervised Temporal Action Localization.
Proceedings of the 4th IEEE International Conference on Multimedia Information Processing and Retrieval, 2021

Action Category and Phase Consistency Regularization for High-Quality Temporal Action Proposal Generation.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

DBAM: Dense Boundary and Actionness Map for Action Localization in Videos via Sentence Query.
Proceedings of the Image and Graphics - 11th International Conference, 2021

Exploiting sample correlation for crowd counting with multi-expert network.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Learning To Filter: Siamese Relation Network for Robust Tracking.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Conditional GAN based individual and global motion fusion for multiple object tracking in UAV videos.
Pattern Recognit. Lett., 2020

The Unmanned Aerial Vehicle Benchmark: Object Detection, Tracking and Baseline.
Int. J. Comput. Vis., 2020

Multi-Label Learning via Feature and Label Space Dimension Reduction.
IEEE Access, 2020

Multi-Label Learning With Hidden Labels.
IEEE Access, 2020

Real-time Visual Object Tracking with Natural Language Description.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

CSCNet: A Shallow Single Column Network for Crowd Counting.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Video Anomaly Detection Using Open Data Filter and Domain Adaptation.
Proceedings of the 2020 IEEE International Conference on Visual Communications and Image Processing, 2020

Generalized Zero-Shot Video Classification via Generative Adversarial Networks.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Siamese Dynamic Mask Estimation Network for Fast Video Object Segmentation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Weakly-Supervised Crowd Counting Learns from Sorting Rather Than Locations.
Proceedings of the Computer Vision - ECCV 2020, 2020

Reverse Perspective Network for Perspective-Aware Object Counting.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Siamese Box Adaptive Network for Visual Tracking.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Release the Power of Online-Training for Robust Visual Tracking.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Deep Constrained Low-Rank Subspace Learning for Multi-View Semi-Supervised Classification.
IEEE Signal Process. Lett., 2019

Beyond global fusion: A group-aware fusion approach for multi-view image clustering.
Inf. Sci., 2019

Tell Me What to Track.
CoRR, 2019

Label Correlation Guided Deep Multi-View Image Annotation.
IEEE Access, 2019

Localization-Aware Meta Tracker Guided With Adversarial Features.
IEEE Access, 2019

Multi-View Multi-Label Learning With View-Label-Specific Features.
IEEE Access, 2019

Self-balance Motion and Appearance Model for Multi-object Tracking in UAV.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Training Efficient Saliency Prediction Models with Knowledge Distillation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019


Spatiotemporal CNN for Video Object Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Generalized Semi-supervised and Structured Subspace Learning for Cross-Modal Retrieval.
IEEE Trans. Multim., 2018

Joint Feature Selection and Classification for Multilabel Learning.
IEEE Trans. Cybern., 2018

Bilevel Multiview Latent Space Learning.
IEEE Trans. Circuits Syst. Video Technol., 2018

Joint multi-view representation and image annotation via optimal predictive subspace learning.
Inf. Sci., 2018

Vehicle Detection in UAV Traffic Video Based on Convolution Neural Network.
Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval, 2018

Edge Guided Generation Network for Video Prediction.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018


The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Cross-Modal Retrieval Using Multiordered Discriminative Structured Subspace Learning.
IEEE Trans. Multim., 2017

Multi-label classification by exploiting local positive and negative pairwise label correlation.
Neurocomputing, 2017

Multi-Networks Joint Learning for Large-Scale Cross-Modal Retrieval.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

Adaptively Unified Semi-supervised Learning for Cross-Modal Retrieval.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Metric based on multi-order spaces for cross-modal retrieval.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

2016
Learning Label-Specific Features and Class-Dependent Labels for Multi-Label Classification.
IEEE Trans. Knowl. Data Eng., 2016

Effective Multimodality Fusion Framework for Cross-Media Topic Detection.
IEEE Trans. Circuits Syst. Video Technol., 2016

Online web video topic detection and tracking with semi-supervised learning.
Multim. Syst., 2016

Beyond appearance model: Learning appearance variations for object tracking.
Neurocomputing, 2016

PL-ranking: A Novel Ranking Method for Cross-Modal Retrieval.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Video saliency prediction with optimized optical flow and gravity center bias.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Joint Multi-View Representation Learning and Image Tagging.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Fusing cross-media for topic detection by dense keyword groups.
Neurocomputing, 2015

Online learning affinity measure with CovBoost for multi-target tracking.
Neurocomputing, 2015

GOMES: A group-aware multi-view fusion approach towards real-world image clustering.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Group sensitive Classifier Chains for multi-label classification.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

Learning Label Specific Features for Multi-label Classification.
Proceedings of the 2015 IEEE International Conference on Data Mining, 2015

2014
Face Distortion Recovery Based on Online Learning Database for Conversational Video.
IEEE Trans. Multim., 2014

Web video thumbnail recommendation with content-aware analysis and query-sensitive matching.
Multim. Tools Appl., 2014

Topic detection in cross-media: a semi-supervised co-clustering approach.
Int. J. Multim. Inf. Retr., 2014

Web topic detection using a ranked clustering-like pattern across similarity cascades.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

Categorizing Social Multimedia by Neighborhood Decision Using Local Pairwise Label Correlation.
Proceedings of the 2014 IEEE International Conference on Data Mining Workshops, 2014

2013
SSOCBT: A Robust Semisupervised Online CovBoost Tracker That Uses Samples Differently.
IEEE Trans. Circuits Syst. Video Technol., 2013

Cross-media topic detection: A multi-modality fusion framework.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

An efficient occlusion detection method to improve object trackers.
Proceedings of the IEEE International Conference on Image Processing, 2013

Cross-media topic detection associated with hot search queries.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2013

Online Learning Based Face Distortion Recovery for Conversational Video Coding.
Proceedings of the 2013 Data Compression Conference, 2013

2012
A Multiple Targets Appearance Tracker Based on Object Interaction Models.
IEEE Trans. Circuits Syst. Video Technol., 2012

Online selection of the best k-feature subset for object tracking.
J. Vis. Commun. Image Represent., 2012

A New Method of Computing Chinese Word Similarity Based on Statistics.
Proceedings of the Fifth International Conference on Business Intelligence and Financial Engineering, 2012

A Collaborative Filtering Algorithm Based on User Activity Level.
Proceedings of the Fifth International Conference on Business Intelligence and Financial Engineering, 2012

2011
Treat samples differently: Object tracking with semi-supervised online CovBoost.
Proceedings of the IEEE International Conference on Computer Vision, 2011

2010
Memory matrix: a novel user experience for home video.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Real-time interactive multi-target tracking using kernel-based trackers.
Proceedings of the International Conference on Image Processing, 2010

2008
Object tracking using incremental 2D-LDA learning and Bayes inference.
Proceedings of the International Conference on Image Processing, 2008

2007
QoS-Based Services Selecting and Optimizing Algorithms on Grid.
Proceedings of the Advances in Web and Network Technologies, and Information Management, 2007


  Loading...