Mingtao Pei

Proc. ACM Comput. Graph. Interact. Tech., May, 2026

Investigating the Effects of Physical Space Memory on User Performance in Virtual Reality.

[BibT_eX]

[DOI]

Bing Ning

Proc. ACM Comput. Graph. Interact. Tech., May, 2026

2025

A method of embedding a high-resolution image into a large field-of-view image.

[BibT_eX]

[DOI]

Yanmei Dong

Multim. Tools Appl., April, 2025

Integrating clinical knowledge and imaging for medical report generation.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2025

Let storytelling tell vivid stories: A multi-modal-agent-based unified storytelling framework.

[BibT_eX]

[DOI]

Neurocomputing, 2025

Align Modalities: Advancing Medical Report Generation with Unified Encoder and Inter-Case Contrastive Learning.

[BibT_eX]

[DOI]

Haoquan Chen

Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2025

Retrieval from Dynamic Phrases: Generating Radiograph Reports with Phrase-Level Template and Dynamic Memory Bank.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2025

Trace3D: Consistent Segmentation Lifting via Gaussian Instance Tracing.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

PrimHOI: Compositional Human-Object Interaction via Reusable Primitives.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

2024

Self-trained multi-cues model for video anomaly detection.

[BibT_eX]

[DOI]

Multim. Tools Appl., July, 2024

Task and Environment-Aware Virtual Scene Rearrangement for Enhanced Safety in Virtual Reality.

[BibT_eX]

[DOI]

Bing Ning

IEEE Trans. Vis. Comput. Graph., May, 2024

Token labeling-guided multi-scale medical image classification.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2024

Let Storytelling Tell Vivid Stories: An Expressive and Fluent Multimodal Storyteller.

[BibT_eX]

[DOI]

CoRR, 2024

Helmet Detection in Mines Using Two-Branch YOLOv5 Network with Adaptive Weight Adjustment.

[BibT_eX]

[DOI]

Zhongyan Sui

Proceedings of the 2024 7th International Conference on Sensors, 2024

Spatio-Temporal Contrastive Learning for Compositional Action Recognition.

[BibT_eX]

[DOI]

Yezi Gong

Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Foreign Object Classification for Coal Conveyor Belts Based on Deep Learning.

[BibT_eX]

[DOI]

Siyu Chen

Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Ship Detection in SAR Images Based on Oriented Bounding Box and Supervised Contrastive Learning.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Image and Graphics Processing, 2024

Mitigating Data Imbalance in Medical Report Generation Through Visual Data Resampling.

[BibT_eX]

[DOI]

Haoquan Chen

Proceedings of the Advanced Intelligent Computing in Bioinformatics, 2024

Automatic Radiology Reports Generation via Memory Alignment Network.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Person-Specific Face Spoofing Detection Based on a Siamese Network.

[BibT_eX]

[DOI]

Pattern Recognit., 2023

A VR Enabled Visualization System for Race Suit Design.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2023

Dual Transformer Encoder Model for Medical Image Classification.

[BibT_eX]

[DOI]

Fangyuan Yan

Proceedings of the IEEE International Conference on Image Processing, 2023

Face Anti-spoofing Based on Client Identity Information and Depth Map.

[BibT_eX]

[DOI]

Proceedings of the Image and Graphics - 12th International Conference, 2023

Discovering the Real Association: Multimodal Causal Reasoning in Video Question Answering.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Prior Guided Transformer for Accurate Radiology Reports Generation.

[BibT_eX]

[DOI]

IEEE J. Biomed. Health Informatics, 2022

Stitching images from a conventional camera and a fisheye camera based on nonrigid warping.

[BibT_eX]

[DOI]

Multim. Tools Appl., 2022

Stitching Videos from a Fisheye Lens Camera and a Wide-Angle Lens Camera for Telepresence Robots.

[BibT_eX]

[DOI]

Int. J. Soc. Robotics, 2022

Few-shot human motion prediction using deformable spatio-temporal CNN with parameter generation.

[BibT_eX]

[DOI]

Chuanqi Zang

Menghao Li

Neurocomputing, 2022

Predicting Human Motion Using Key Subsequences.

[BibT_eX]

[DOI]

Menghao Li

Wei Liang

Proceedings of the IEEE International Conference on Acoustics, 2022

Do You Live a Healthy Life? Analyzing Lifestyle by Visual Life Logging.

[BibT_eX]

[DOI]

Qing Gao

Hongyu Shen

Proceedings of the IEEE International Conference on Acoustics, 2022

Clinical-BERT: Vision-Language Pre-training for Radiograph Diagnosis and Reports Generation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Laryngoscope8: Laryngeal image dataset and classification of laryngeal disease based on attention mechanism.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2021

Self-Trained Video Anomaly Detection Based on Teacher-Student Model.

[BibT_eX]

[DOI]

Xusheng Wang

Proceedings of the 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021

Lightweight Forest Fire Detection Based on Deep Learning.

[BibT_eX]

[DOI]

Ruixian Fan

Proceedings of the 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021

Recognizing Activities from Egocentric Images with Appearance and Motion Features.

[BibT_eX]

[DOI]

Yanhua Chen

Proceedings of the 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021

2020

Scene-Specific Multiple Cues Integration for Multiperson Tracking.

[BibT_eX]

[DOI]

IEEE Trans. Cogn. Dev. Syst., 2020

Online maximum a posteriori tracking of multiple objects using sequential trajectory prior.

[BibT_eX]

[DOI]

Min Yang

Image Vis. Comput., 2020

Visual-Semantic Graph Matching for Visual Grounding.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Few-shot Human Motion Prediction via Learning Novel Motion Dynamics.

[BibT_eX]

[DOI]

Chuanqi Zang

Yu Kong

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

2019

Heterogeneous Hashing Network for Face Retrieval Across Image and Video Domains.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2019

Diffusion-based kernel matrix model for face liveness detection.

[BibT_eX]

[DOI]

Image Vis. Comput., 2019

Face Liveness Detection Based on Client Identity Using Siamese Network.

[BibT_eX]

[DOI]

Huiling Hao

CoRR, 2019

Face Liveness Detection Based on Client Identity Using Siamese Network.

[BibT_eX]

[DOI]

Huiling Hao

Meng Zhao

Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019

Attributes Preserving Face De-Identification.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

2018

Deep CNN based binary hash video representations for face retrieval.

[BibT_eX]

[DOI]

Pattern Recognit., 2018

License Plate Detection with Shallow and Deep CNNs in Complex Environments.

[BibT_eX]

[DOI]

Complex., 2018

Vehicle Re-Identification by Deep Feature Fusion Based on Joint Bayesian Criterion.

[BibT_eX]

[DOI]

Siyu Li

Leyi Zhu

Proceedings of the 24th International Conference on Pattern Recognition, 2018

2017

Vehicle Verification Based on Deep Siamese Network with Similarity Metric.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Fusing Appearance Features and Correlation Features for Face Video Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Deep Manifold Learning of Symmetric Positive Definite Matrices with Application to Face Recognition.

[BibT_eX]

[DOI]

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Online Discriminative Tracking With Active Example Selection.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2016

Tracking Pedestrian with Multi-Component Online Deformable Part-Based Model.

[BibT_eX]

[DOI]

J. Inf. Sci. Eng., 2016

Orthonormal dictionary learning and its application to face recognition.

[BibT_eX]

[DOI]

Zhen Dong

Image Vis. Comput., 2016

A Tele-Presence Wheelchair for Elderly People.

[BibT_eX]

[DOI]

CoRR, 2016

Input Aggregated Network for Face Video Representation.

[BibT_eX]

[DOI]

CoRR, 2016

Nonnegative correlation coding for image classification.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2016

Online Multi-Person Tracking Based on Metric Learning.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Jointly Learning a Multi-class Discriminative Dictionary for Robust Visual Tracking.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

A low-cost tele-presence wheelchair system.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Driver Face Detection Based on Aggregate Channel Features and Deformable Part-Based Model in Traffic Camera.

[BibT_eX]

[DOI]

Yang Wang

Xiaoma Xu

Proceedings of the Neural Information Processing - 23rd International Conference, 2016

Pedestrian Detection Using Deep Channel Features in Monocular Image Sequences.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 23rd International Conference, 2016

Attention Estimation for Input Switch in Scalable Multi-display Environments.

[BibT_eX]

[DOI]

Xingyuan Bu

Proceedings of the Neural Information Processing - 23rd International Conference, 2016

Pose-indexed based multi-view method for face alignment.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

3D head pose estimation with convolutional neural network trained on synthetic images.

[BibT_eX]

[DOI]

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Visual tracking with sparse correlation filters.

[BibT_eX]

[DOI]

Yanmei Dong

Min Yang

Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Face Video Retrieval via Deep Learning of Binary Hash Representations.

[BibT_eX]

[DOI]

Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015

Vehicle Type Classification Using a Semisupervised Convolutional Neural Network.

[BibT_eX]

[DOI]

IEEE Trans. Intell. Transp. Syst., 2015

Robust Discriminative Tracking via Landmark-Based Label Propagation.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2015

A Unified Probabilistic Framework for Real-Time Depth Map Fusion.

[BibT_eX]

[DOI]

J. Inf. Sci. Eng., 2015

Online visual tracking by integrating spatio-temporal cues.

[BibT_eX]

[DOI]

IET Comput. Vis., 2015

Telepresence Interaction by Touching Live Video Images.

[BibT_eX]

[DOI]

CoRR, 2015

Learning online structural appearance model for robust object tracking.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2015

Non-linear Metric Learning Using Metric Tensor.

[BibT_eX]

[DOI]

Liangying Yin

Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Robust Online Multi-object Tracking by Maximum a Posteriori Estimation with Sequential Trajectory Prior.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Vehicle Detection Using Appearance and Shape Constrained Active Basis Model.

[BibT_eX]

[DOI]

Sai Liu

Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Discriminative Orthonormal Dictionary Learning for Fast Low-Rank Representation.

[BibT_eX]

[DOI]

Zhen Dong

Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Discriminative Neighborhood Preserving Dictionary Learning for Image Classification.

[BibT_eX]

[DOI]

Proceedings of the Image and Graphics - 8th International Conference, 2015

Fusion of Skeletal and STIP-Based Features for Action Recognition with RGB-D Devices.

[BibT_eX]

[DOI]

Ting Liu

Proceedings of the Image and Graphics - 8th International Conference, 2015

2014

Coupling-and-decoupling: A hierarchical model for occlusion-free object detection.

[BibT_eX]

[DOI]

Pattern Recognit., 2014

Tracking Pedestrian with Incrementally Learned Representation and Classification Model.

[BibT_eX]

[DOI]

J. Inf. Sci. Eng., 2014

Learning a discriminative mid-level feature for action recognition.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2014

Visual Tracking Using Multi-stage Random Simple Features.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Vehicle Type Classification Using Unsupervised Convolutional Neural Network.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Stereovision-Only Based Interactive Mobile Robot for Human-Robot Face-to-Face Interaction.

[BibT_eX]

[DOI]

Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Detecting Driver Use of Mobile Phone Based on In-Car Camera.

[BibT_eX]

[DOI]

Dan Wang

Lan Zhu

Proceedings of the Tenth International Conference on Computational Intelligence and Security, 2014

Vehicle Color Recognition Based on License Plate Color.

[BibT_eX]

[DOI]

Yanmei Dong

Xiameng Qin

Proceedings of the Tenth International Conference on Computational Intelligence and Security, 2014

Coupling Semi-supervised Learning and Example Selection for Online Object Tracking.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2014, 2014

Landmark-Based Inductive Model for Robust Discriminative Tracking.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2014, 2014

2013

Learning and parsing video events with goal and intent prediction.

[BibT_eX]

[DOI]

Comput. Vis. Image Underst., 2013

Online-Learning Structural Appearance Model for Robust Visual Tracking.

[BibT_eX]

[DOI]

Proceedings of the Intelligence Science and Big Data Engineering, 2013

Robust object tracking via online multiple instance metric learning.

[BibT_eX]

[DOI]

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Event recognition based-on social roles in continuous video.

[BibT_eX]

[DOI]

Zhen Dong

Meng Zhao

Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

2012

Stereo camera calibration with an embedded calibration device and scene features.

[BibT_eX]

[DOI]

Proceedings of the 2012 IEEE International Conference on Robotics and Biomimetics, 2012

Probabilistic depth map fusion of Kinect and stereo in real-time.

[BibT_eX]

[DOI]

Yong Duan

Yucheng Wang

Proceedings of the 2012 IEEE International Conference on Robotics and Biomimetics, 2012

Robust tracking by accounting for hard negatives explicitly.

[BibT_eX]

[DOI]

Proceedings of the 21st International Conference on Pattern Recognition, 2012

Probabilistic depth map fusion for real-time multi-view stereo.

[BibT_eX]

[DOI]

Yong Duan

Proceedings of the 21st International Conference on Pattern Recognition, 2012

Tracking Pedestrian with Multi-component Online Deformable Part-Based Model.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2012, 2012

Coupling-and-Decoupling: A Hierarchical Model for Occlusion-Free Car Detection.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ACCV 2012, 2012

2011

Tracking pedestrians with incremental learned intensity and contour templates for PTZ camera visual surveillance.

[BibT_eX]

[DOI]

Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Unsupervised learning of event AND-OR grammar and semantics from video.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Computer Vision, 2011

Parsing video events with goal inference and intent prediction.

[BibT_eX]

[DOI]

Song-Chun Zhu

Proceedings of the IEEE International Conference on Computer Vision, 2011

2010

Multi-scale matching for data association in vision-based SLAM.

[BibT_eX]

[DOI]

Lei Chen