Bing Li

Orcid: 0000-0001-6114-1411

Affiliations:
  • Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Beijing, China
  • Beijing Jiaotong University, Institute of Computer Science and Engineering, China (former)


According to our database1, Bing Li authored at least 160 papers between 2006 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Joint Learning of Audio-Visual Saliency Prediction and Sound Source Localization on Multi-face Videos.
Int. J. Comput. Vis., June, 2024

DARTScore: DuAl-Reconstruction Transformer for Video Captioning Evaluation.
IEEE Trans. Circuits Syst. Video Technol., April, 2024

Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

IterDepth: Iterative Residual Refinement for Outdoor Self-Supervised Multi-Frame Monocular Depth Estimation.
IEEE Trans. Circuits Syst. Video Technol., January, 2024

LG-DBNet: Local and Global Dual-Branch Network for SAR Image Denoising.
IEEE Trans. Geosci. Remote. Sens., 2024

Chinese Title Generation for Short Videos: Dataset, Metric and Algorithm.
IEEE Trans. Pattern Anal. Mach. Intell., 2024

PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts.
CoRR, 2024

NFT1000: A Visual Text Dataset For Non-Fungible Token Retrieval.
CoRR, 2024

Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval.
CoRR, 2024

GMC-IQA: Exploiting Global-correlation and Mean-opinion Consistency for No-reference Image Quality Assessment.
CoRR, 2024

Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Set Prediction Guided by Semantic Concepts for Diverse Video Captioning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Cosine Multilinear Principal Component Analysis for Recognition.
IEEE Trans. Big Data, December, 2023

Self-Prior Guided Pixel Adversarial Networks for Blind Image Inpainting.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

Ranking-Based Color Constancy With Limited Training Samples.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2023

A unified end-to-end classification model for focal liver lesions.
Biomed. Signal Process. Control., September, 2023

TranSkeleton: Hierarchical Spatial-Temporal Transformer for Skeleton-Based Action Recognition.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

Jointing Recurrent Across-Channel and Spatial Attention for Multi-Object Tracking With Block-Erasing Data Augmentation.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

β-divergence NMF with biorthogonal regularization for data representation.
Eng. Appl. Artif. Intell., May, 2023

Face swapping detection based on identity spatial constraints with weighted frequency division.
Multim. Syst., April, 2023

Dynamic adjustment of hyperparameters for anchor-based detection of objects with large image size differences.
Pattern Recognit. Lett., March, 2023

Learning to Explore Distillability and Sparsability: A Joint Framework for Model Compression.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2023

Multi-scale self-attention-based feature enhancement for detection of targets with small image sizes.
Pattern Recognit. Lett., February, 2023

Learning Video-Text Aligned Representations for Video Captioning.
ACM Trans. Multim. Comput. Commun. Appl., 2023

MFAENet: A Multiscale Feature Adaptive Enhancement Network for SAR Image Despeckling.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2023

Circle-Net: An Unsupervised Lightweight-Attention Cyclic Network for Hyperspectral and Multispectral Image Fusion.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2023

Robust dual-graph discriminative NMF for data classification.
Knowl. Based Syst., 2023

Hierarchical Curriculum Learning for No-Reference Image Quality Assessment.
Int. J. Comput. Vis., 2023

ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking.
CoRR, 2023

RT-MonoDepth: Real-time Monocular Depth Estimation on Embedded Systems.
CoRR, 2023

Exploiting Contextual Objects and Relations for 3D Visual Grounding.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Learning Semantics-Grounded Vocabulary Representation for Video-Text Retrieval.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

SMM: Self-supervised Multi-Illumination Color Constancy Model with Multiple Pretext Tasks.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Order-Prompted Tag Sequence Generation for Video Tagging.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Learning from the Raw Domain: Cross Modality Distillation for Compressed Video Action Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Learning to Exploit the Sequence-Specific Prior Knowledge for Image Processing Pipelines Optimization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ViLEM: Visual-Language Error Modeling for Image-Text Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

AUNet: Learning Relations Between Action Units for Face Forgery Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
3DCANN: A Spatio-Temporal Convolution Attention Neural Network for EEG Emotion Recognition.
IEEE J. Biomed. Health Informatics, 2022

PDNet: Toward Better One-Stage Object Detection With Prediction Decoupling.
IEEE Trans. Image Process., 2022

Narrowing the Gap: Improved Detector Training With Noisy Location Annotations.
IEEE Trans. Image Process., 2022

Rethinking the Competition Between Detection and ReID in Multiobject Tracking.
IEEE Trans. Image Process., 2022

SSAU-Net: A Spectral-Spatial Attention-Based U-Net for Hyperspectral Image Fusion.
IEEE Trans. Geosci. Remote. Sens., 2022

MRDDANet: A Multiscale Residual Dense Dual Attention Network for SAR Image Denoising.
IEEE Trans. Geosci. Remote. Sens., 2022

A Simple and Strong Baseline for Universal Targeted Attacks on Siamese Visual Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2022

SDTP: Semantic-Aware Decoupled Transformer Pyramid for Dense Image Prediction.
IEEE Trans. Circuits Syst. Video Technol., 2022

Interaction-Aware Spatio-Temporal Pyramid Attention Networks for Action Classification.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

PIC 4th Challenge: Semantic-Assisted Multi-Feature Encoding and Multi-Head Decoding for Dense Video Captioning.
CoRR, 2022

CREATE: A Benchmark for Chinese Short Video Retrieval and Title Generation.
CoRR, 2022

Attention deficit/hyperactivity disorder Classification based on deep spatio-temporal features of functional Magnetic Resonance Imaging.
Biomed. Signal Process. Control., 2022

Long-Short Term Cross-Transformer in Compressed Domain for Few-Shot Video Classification.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Learning Target-aware Representation for Visual Tracking via Informative Interactions.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Inter-Intra Cross-Modality Self-Supervised Video Representation Learning by Contrastive Clustering.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Learnable Pixel Clustering Via Structure and Semantic Dual Constraints for Unsupervised Image Segmentation.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Attention-Aware Learning for Hyperparameter Prediction in Image Processing Pipelines.
Proceedings of the Computer Vision - ECCV 2022, 2022

Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Exploring Motion Information for Distractor Suppression in Visual Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Cross-Architecture Knowledge Distillation.
Proceedings of the Computer Vision - ACCV 2022, 2022

Teacher-Guided Learning for Blind Image Quality Assessment.
Proceedings of the Computer Vision - ACCV 2022, 2022

One More Check: Making "Fake Background" Be Tracked Again.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
EDP: An Efficient Decomposition and Pruning Scheme for Convolutional Neural Network Compression.
IEEE Trans. Neural Networks Learn. Syst., 2021

Toward Accurate Pixelwise Object Tracking via Attention Retrieval.
IEEE Trans. Image Process., 2021

Multi-Scale Low-Discriminative Feature Reactivation for Weakly Supervised Object Localization.
IEEE Trans. Image Process., 2021

Robust Texture-Aware Computer-Generated Image Forensic: Benchmark and Algorithm.
IEEE Trans. Image Process., 2021

Web Objectionable Video Recognition Based on Deep Multi-Instance Learning With Representative Prototypes Selection.
IEEE Trans. Circuits Syst. Video Technol., 2021

UMAG-Net: A New Unsupervised Multiattention-Guided Network for Hyperspectral and Multispectral Image Fusion.
IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., 2021

Design and Application of Mixed Natural Gas Monitoring System Using Artificial Neural Networks.
Sensors, 2021

Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking.
CoRR, 2021

Joint Learning of Visual-Audio Saliency Prediction and Sound Source Localization on Multi-face Videos.
CoRR, 2021

SDTP: Semantic-aware Decoupled Transformer Pyramid for Dense Image Prediction.
CoRR, 2021

PDNet: Towards Better One-stage Object Detection with Prediction Decoupling.
CoRR, 2021

One More Check: Making "Fake Background" Be Tracked Again.
CoRR, 2021

Adaptive Coarse-to-Fine Interactor for Multi-Scale Object Detection.
Proceedings of the International Joint Conference on Neural Networks, 2021

DSIC: Dynamic Sample-Individualized Connector for Multi-Scale Object Detection.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

Learn to Match: Automatic Matching Network Design for Visual Tracking.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Practical Face Swapping Detection Based on Identity Spatial Constraints.
Proceedings of the International IEEE Joint Conference on Biometrics, 2021

Open-Book Video Captioning With Retrieve-Copy-Generate Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

DPFPS: Dynamic and Progressive Filter Pruning for Compressing Convolutional Neural Networks from Scratch.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Anomaly Detection Using Local Kernel Density Estimation and Context-Based Regression.
IEEE Trans. Knowl. Data Eng., 2020

Anisotropic Convolution for Image Classification.
IEEE Trans. Image Process., 2020

Multi-Cue Semi-Supervised Color Constancy With Limited Training Samples.
IEEE Trans. Image Process., 2020

Manipulating Template Pixels for Model Adaptation of Siamese Visual Tracking.
IEEE Signal Process. Lett., 2020

Graph convolutional network with structure pooling and joint-wise channel attention for action recognition.
Pattern Recognit., 2020

DSIC: Dynamic Sample-Individualized Connector for Multi-Scale Object Detection.
CoRR, 2020

Rethinking the competition between detection and ReID in Multi-Object Tracking.
CoRR, 2020

Globally Spatial-Temporal Perception: a Long-Term Tracking System.
Proceedings of the IEEE International Conference on Image Processing, 2020

End-to-End Temporal Feature Aggregation for Siamese Trackers.
Proceedings of the IEEE International Conference on Image Processing, 2020

Ocean: Object-Aware Anchor-Free Tracking.
Proceedings of the Computer Vision - ECCV 2020, 2020

Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model.
Proceedings of the Computer Vision - ECCV 2020, 2020

The Eighth Visual Object Tracking VOT2020 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Object Relational Graph With Teacher-Recommended Learning for Video Captioning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Asymmetric 3D Convolutional Neural Networks for action recognition.
Pattern Recognit., 2019

VATEX Captioning Challenge 2019: Multi-modal Information Fusion and Multi-stage Training Strategy for Video Captioning.
CoRR, 2019

Multimodal Semantic Attention Network for Video Captioning.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

The Seventh Visual Object Tracking VOT2019 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Knowledge Distillation via Instance Relationship Graph.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Interaction-Aware Spatio-Temporal Pyramid Attention Networks for Action Classification.
Proceedings of the Computer Vision - ECCV 2018, 2018

Hierarchical Nonlinear Orthogonal Adaptive-Subspace Self-Organizing Map Based Feature Extraction for Human Action Recognition.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Salient Object Detection via Structured Matrix Decomposition.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Multi-View Multi-Instance Learning Based on Joint Sparse Representation and Multi-View Dictionary Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Spatio-Temporal Self-Organizing Map Deep Network for Dynamic Object Detection from Videos.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Multimodal Web Aesthetics Assessment Based on Structural SVM and Multitask Fusion Learning.
IEEE Trans. Multim., 2016

Multi-Perspective Cost-Sensitive Context-Aware Multi-Instance Sparse Coding and Its Application to Sensitive Video Recognition.
IEEE Trans. Multim., 2016

Multi-Instance Multi-Label Learning Combining Hierarchical Context and its Application to Image Annotation.
IEEE Trans. Multim., 2016

SubMIL: Discriminative subspaces for multi-instance learning.
Neurocomputing, 2016

Multi-Cue Illumination Estimation via a Tree-Structured Group Joint Sparse Representation.
Int. J. Comput. Vis., 2016

A Novel Emotional Saliency Map to Model Emotional Attention Mechanism.
Proceedings of the MultiMedia Modeling - 22nd International Conference, 2016

Bootstrapping deep feature hierarchy for pornographic image recognition.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Graph Based Skeleton Motion Representation and Similarity Measurement for Action Recognition.
Proceedings of the Computer Vision - ECCV 2016, 2016

Image Copy Detection Based on Convolutional Neural Networks.
Proceedings of the Pattern Recognition - 7th Chinese Conference, 2016

2015
Horror Image Recognition Based on Context-Aware Multi-Instance Learning.
IEEE Trans. Image Process., 2015

Predicting Image Memorability by Multi-view Adaptive Regression.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Fast Film Genres Classification Combining Poster and Synopsis.
Proceedings of the Intelligence Science and Big Data Engineering. Image and Video Data Engineering, 2015

2014
Evaluating Combinational Illumination Estimation Methods on Real-World Images.
IEEE Trans. Image Process., 2014

Hierarchical sparse representation based Multi-Instance Semi-Supervised Learning with application to image categorization.
Signal Process., 2014

RGBD Salient Object Detection: A Benchmark and Algorithms.
Proceedings of the Computer Vision - ECCV 2014, 2014

2013
Horror Text Recognition Based on Generalized Expectation Criteria.
Proceedings of the Intelligence Science and Big Data Engineering, 2013

Spatio-temporal Features for Efficient Video Copy Detection.
Proceedings of the Intelligence Science and Big Data Engineering, 2013

Robust Object Tracking with Online Multi-lifespan Dictionary Learning.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Illumination Estimation Based on Bilayer Sparse Coding.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

Salient Object Detection via Low-Rank and Structured Sparse Matrix Decomposition.
Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, 2013

2012
Efficient Clustering Aggregation Based on Data Fragments.
IEEE Trans. Syst. Man Cybern. Part B, 2012

Color Constant Descriptors Combining Image Derivative Structures.
Comput. Informatics, 2012

Context-aware affective images classification based on bilayer sparse representation.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Scaring or pleasing: exploit emotional impact of an image.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Towards relevance and saliency ranking of image tags.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

An Efficient Video Copy Detection Method Combining Vocabulary Tree and Inverted File.
Proceedings of the Intelligent Science and Intelligent Data Engineering, 2012

Context-aware horror video scene recognition via cost-sensitive sparse coding.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Horror Video Scene Recognition Based on Multi-view Multi-instance Learning.
Proceedings of the Computer Vision - ACCV 2012, 2012

Visual Saliency Map from Tensor Analysis.
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

2011
Exposure Fusino Based on Shift-Invariant Discrete Wavelet Transform.
J. Inf. Sci. Eng., 2011

Evaluating the visual quality of web pages using a computational aesthetic approach.
Proceedings of the Forth International Conference on Web Search and Web Data Mining, 2011

Web Horror Image Recognition Based on Context-Aware Multi-instance Learning.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Context-Aware Multi-instance Learning Based on Hierarchical Sparse Representation.
Proceedings of the 11th IEEE International Conference on Data Mining, 2011

Horror video scene recognition via Multiple-Instance learning.
Proceedings of the IEEE International Conference on Acoustics, 2011

Evaluating combinational color constancy methods on real-world images.
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011

2010
A supervised combination strategy for illumination chromaticity estimation.
ACM Trans. Appl. Percept., 2010

An Adaptive Tone Mapping Method for Displaying High Dynamic Range Images.
J. Inf. Sci. Eng., 2010

Color Constancy Using Ridge Regression.
J. Inf. Sci. Eng., 2010

Learning to evaluate the visual quality of web pages.
Proceedings of the 19th International Conference on World Wide Web, 2010

Identifying Multi-instance Outliers.
Proceedings of the SIAM International Conference on Data Mining, 2010

Event Recognition Based on Top-Down Motion Attention.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Horror movie scene recognition based on emotional perception.
Proceedings of the International Conference on Image Processing, 2010

Group ranking with application to image retrieval.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Top-Down Cues for Event Recognition.
Proceedings of the Computer Vision - ACCV 2010, 2010

Occlusion Handling with ℓ<sub>1</sub>-Regularized Sparse Reconstruction.
Proceedings of the Computer Vision - ACCV 2010, 2010

Horror Image Recognition Based on Emotional Attention.
Proceedings of the Computer Vision - ACCV 2010, 2010

2009
A Novel Tone Mapping Based on Double-Anchoring Theory for Displaying HDR Images.
IEICE Trans. Inf. Syst., 2009

Edge-Based Color Constancy via Support Vector Regression.
IEICE Trans. Inf. Syst., 2009

2008
Color Constancy Based on Effective Regions.
IEICE Trans. Inf. Syst., 2008

Color Constancy Based on Image Similarity.
IEICE Trans. Inf. Syst., 2008

Combining Attention Model with Hierarchical Graph Representation for Region-Based Image Retrieval.
IEICE Trans. Inf. Syst., 2008

An Illumination Independent Descriptor using Moment Invariants.
Proceedings of the 16th Color and Imaging Conference, 2008

2007
A Multi-Scale Adaptive Grey World Algorithm.
IEICE Trans. Inf. Syst., 2007

Visual Perception Theory Guided Depth Motion Estimation.
Proceedings of the Advances in Multimedia Modeling, 2007

2006
Perceptual Depth Estimation from a Single 2D Image Based on Visual Perception Theory.
Proceedings of the Advances in Multimedia Information Processing, 2006


  Loading...