Mingtao Pei

Orcid: 0000-0003-4949-7997

According to our database1, Mingtao Pei authored at least 86 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Let Storytelling Tell Vivid Stories: An Expressive and Fluent Multimodal Storyteller.
CoRR, 2024

Automatic Radiology Reports Generation via Memory Alignment Network.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Person-Specific Face Spoofing Detection Based on a Siamese Network.
Pattern Recognit., 2023

A VR Enabled Visualization System for Race Suit Design.
Proceedings of the IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2023

Dual Transformer Encoder Model for Medical Image Classification.
Proceedings of the IEEE International Conference on Image Processing, 2023

Face Anti-spoofing Based on Client Identity Information and Depth Map.
Proceedings of the Image and Graphics - 12th International Conference, 2023

Discovering the Real Association: Multimodal Causal Reasoning in Video Question Answering.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Prior Guided Transformer for Accurate Radiology Reports Generation.
IEEE J. Biomed. Health Informatics, 2022

Stitching images from a conventional camera and a fisheye camera based on nonrigid warping.
Multim. Tools Appl., 2022

Stitching Videos from a Fisheye Lens Camera and a Wide-Angle Lens Camera for Telepresence Robots.
Int. J. Soc. Robotics, 2022

Few-shot human motion prediction using deformable spatio-temporal CNN with parameter generation.
Neurocomputing, 2022

Predicting Human Motion Using Key Subsequences.
Proceedings of the IEEE International Conference on Acoustics, 2022

Do You Live a Healthy Life? Analyzing Lifestyle by Visual Life Logging.
Proceedings of the IEEE International Conference on Acoustics, 2022

Clinical-BERT: Vision-Language Pre-training for Radiograph Diagnosis and Reports Generation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Laryngoscope8: Laryngeal image dataset and classification of laryngeal disease based on attention mechanism.
Pattern Recognit. Lett., 2021

Self-Trained Video Anomaly Detection Based on Teacher-Student Model.
Proceedings of the 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021

Lightweight Forest Fire Detection Based on Deep Learning.
Proceedings of the 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021

Recognizing Activities from Egocentric Images with Appearance and Motion Features.
Proceedings of the 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021

2020
Scene-Specific Multiple Cues Integration for Multiperson Tracking.
IEEE Trans. Cogn. Dev. Syst., 2020

Online maximum a posteriori tracking of multiple objects using sequential trajectory prior.
Image Vis. Comput., 2020

Visual-Semantic Graph Matching for Visual Grounding.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Few-shot Human Motion Prediction via Learning Novel Motion Dynamics.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

2019
Heterogeneous Hashing Network for Face Retrieval Across Image and Video Domains.
IEEE Trans. Multim., 2019

Diffusion-based kernel matrix model for face liveness detection.
Image Vis. Comput., 2019

Face Liveness Detection Based on Client Identity Using Siamese Network.
CoRR, 2019

Face Liveness Detection Based on Client Identity Using Siamese Network.
Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019

Attributes Preserving Face De-Identification.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

2018
Deep CNN based binary hash video representations for face retrieval.
Pattern Recognit., 2018

License Plate Detection with Shallow and Deep CNNs in Complex Environments.
Complex., 2018

Vehicle Re-Identification by Deep Feature Fusion Based on Joint Bayesian Criterion.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

2017
Vehicle Verification Based on Deep Siamese Network with Similarity Metric.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Fusing Appearance Features and Correlation Features for Face Video Retrieval.
Proceedings of the Advances in Multimedia Information Processing - PCM 2017, 2017

Deep Manifold Learning of Symmetric Positive Definite Matrices with Application to Face Recognition.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Online Discriminative Tracking With Active Example Selection.
IEEE Trans. Circuits Syst. Video Technol., 2016

Tracking Pedestrian with Multi-Component Online Deformable Part-Based Model.
J. Inf. Sci. Eng., 2016

Orthonormal dictionary learning and its application to face recognition.
Image Vis. Comput., 2016

A Tele-Presence Wheelchair for Elderly People.
CoRR, 2016

Input Aggregated Network for Face Video Representation.
CoRR, 2016

Nonnegative correlation coding for image classification.
Sci. China Inf. Sci., 2016

Online Multi-Person Tracking Based on Metric Learning.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

Jointly Learning a Multi-class Discriminative Dictionary for Robust Visual Tracking.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

A low-cost tele-presence wheelchair system.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Driver Face Detection Based on Aggregate Channel Features and Deformable Part-Based Model in Traffic Camera.
Proceedings of the Neural Information Processing - 23rd International Conference, 2016

Pedestrian Detection Using Deep Channel Features in Monocular Image Sequences.
Proceedings of the Neural Information Processing - 23rd International Conference, 2016

Attention Estimation for Input Switch in Scalable Multi-display Environments.
Proceedings of the Neural Information Processing - 23rd International Conference, 2016

Pose-indexed based multi-view method for face alignment.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

3D head pose estimation with convolutional neural network trained on synthetic images.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Visual tracking with sparse correlation filters.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Face Video Retrieval via Deep Learning of Binary Hash Representations.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Vehicle Type Classification Using a Semisupervised Convolutional Neural Network.
IEEE Trans. Intell. Transp. Syst., 2015

Robust Discriminative Tracking via Landmark-Based Label Propagation.
IEEE Trans. Image Process., 2015

A Unified Probabilistic Framework for Real-Time Depth Map Fusion.
J. Inf. Sci. Eng., 2015

Online visual tracking by integrating spatio-temporal cues.
IET Comput. Vis., 2015

Telepresence Interaction by Touching Live Video Images.
CoRR, 2015

Learning online structural appearance model for robust object tracking.
Sci. China Inf. Sci., 2015

Non-linear Metric Learning Using Metric Tensor.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Robust Online Multi-object Tracking by Maximum a Posteriori Estimation with Sequential Trajectory Prior.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Vehicle Detection Using Appearance and Shape Constrained Active Basis Model.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Discriminative Orthonormal Dictionary Learning for Fast Low-Rank Representation.
Proceedings of the Neural Information Processing - 22nd International Conference, 2015

Discriminative Neighborhood Preserving Dictionary Learning for Image Classification.
Proceedings of the Image and Graphics - 8th International Conference, 2015

Fusion of Skeletal and STIP-Based Features for Action Recognition with RGB-D Devices.
Proceedings of the Image and Graphics - 8th International Conference, 2015

2014
Coupling-and-decoupling: A hierarchical model for occlusion-free object detection.
Pattern Recognit., 2014

Tracking Pedestrian with Incrementally Learned Representation and Classification Model.
J. Inf. Sci. Eng., 2014

Learning a discriminative mid-level feature for action recognition.
Sci. China Inf. Sci., 2014

Visual Tracking Using Multi-stage Random Simple Features.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Vehicle Type Classification Using Unsupervised Convolutional Neural Network.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Stereovision-Only Based Interactive Mobile Robot for Human-Robot Face-to-Face Interaction.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Detecting Driver Use of Mobile Phone Based on In-Car Camera.
Proceedings of the Tenth International Conference on Computational Intelligence and Security, 2014

Vehicle Color Recognition Based on License Plate Color.
Proceedings of the Tenth International Conference on Computational Intelligence and Security, 2014

Coupling Semi-supervised Learning and Example Selection for Online Object Tracking.
Proceedings of the Computer Vision - ACCV 2014, 2014

Landmark-Based Inductive Model for Robust Discriminative Tracking.
Proceedings of the Computer Vision - ACCV 2014, 2014

2013
Learning and parsing video events with goal and intent prediction.
Comput. Vis. Image Underst., 2013

Online-Learning Structural Appearance Model for Robust Visual Tracking.
Proceedings of the Intelligence Science and Big Data Engineering, 2013

Robust object tracking via online multiple instance metric learning.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

Event recognition based-on social roles in continuous video.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

2012
Stereo camera calibration with an embedded calibration device and scene features.
Proceedings of the 2012 IEEE International Conference on Robotics and Biomimetics, 2012

Probabilistic depth map fusion of Kinect and stereo in real-time.
Proceedings of the 2012 IEEE International Conference on Robotics and Biomimetics, 2012

Robust tracking by accounting for hard negatives explicitly.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Probabilistic depth map fusion for real-time multi-view stereo.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Tracking Pedestrian with Multi-component Online Deformable Part-Based Model.
Proceedings of the Computer Vision - ACCV 2012, 2012

Coupling-and-Decoupling: A Hierarchical Model for Occlusion-Free Car Detection.
Proceedings of the Computer Vision - ACCV 2012, 2012

2011
Tracking pedestrians with incremental learned intensity and contour templates for PTZ camera visual surveillance.
Proceedings of the 2011 IEEE International Conference on Multimedia and Expo, 2011

Unsupervised learning of event AND-OR grammar and semantics from video.
Proceedings of the IEEE International Conference on Computer Vision, 2011

Parsing video events with goal inference and intent prediction.
Proceedings of the IEEE International Conference on Computer Vision, 2011

2010
Multi-scale matching for data association in vision-based SLAM.
Proceedings of the 2010 IEEE International Conference on Robotics and Biomimetics, 2010

2006
Precise Shape Measurement of Dynamic Surface via Single Camera Stereo.
Int. J. Inf. Acquis., 2006


  Loading...