Hong Liu

Orcid: 0000-0002-7498-6541

Affiliations:
  • Peking University, Shenzhen Graduate School, Key Laboratory of Machine Perception, Engineering Lab on Intelligent Perception for Internet of Things, Beijing, China
  • Harbin Institute of Technology, China (PhD 1996)


According to our database1, Hong Liu authored at least 251 papers between 2004 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Audio-visual keyword transformer for unconstrained sentence-level keyword spotting.
CAAI Trans. Intell. Technol., February, 2024

Style-Agnostic Representation Learning for Visible-Infrared Person Re-Identification.
IEEE Trans. Multim., 2024

2023
On-device audio-visual multi-person wake word spotting.
CAAI Trans. Intell. Technol., December, 2023

AO2-DETR: Arbitrary-Oriented Object Detection Transformer.
IEEE Trans. Circuits Syst. Video Technol., May, 2023

AttentionGAN: Unpaired Image-to-Image Translation Using Attention-Guided Generative Adversarial Networks.
IEEE Trans. Neural Networks Learn. Syst., April, 2023

Achieving domain generalization for underwater object detection by domain mixup and contrastive learning.
Neurocomputing, April, 2023

Mitigating robust overfitting via self-residual-calibration regularization.
Artif. Intell., April, 2023

Exploiting Temporal Contexts With Strided Transformer for 3D Human Pose Estimation.
IEEE Trans. Multim., 2023

Weakly-Supervised 3D Human Pose Estimation With Cross-View U-Shaped Graph Convolutional Network.
IEEE Trans. Multim., 2023

Multi-Dimensional Attention With Similarity Constraint for Weakly-Supervised Temporal Action Localization.
IEEE Trans. Multim., 2023

Edge-guided Representation Learning for Underwater Object Detection.
CoRR, 2023

Cross-Modal Retrieval for Motion and Text via DropTriple Loss.
Proceedings of the ACM Multimedia Asia 2023, 2023

Semantic-aware Consistency Network for Cloth-changing Person Re-Identification.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Learning Concordant Attention via Target-aware Alignment for Visible-Infrared Person Re-identification.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FSAR: Federated Skeleton-based Action Recognition with Adaptive Topology Structure and Knowledge Distillation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
A Robust Pixel-Aware Gyro-Aided KLT Feature Tracker for Large Camera Motions.
IEEE Trans. Instrum. Meas., 2022

Regularizing Visual Semantic Embedding With Contrastive Learning for Image-Text Matching.
IEEE Signal Process. Lett., 2022

Image-to-video person re-identification using three-dimensional semantic appearance alignment and cross-modal interactive learning.
Pattern Recognit., 2022

IRANet: Identity-relevance aware representation for cloth-changing person re-identification.
Image Vis. Comput., 2022

Contrastive Learning from Spatio-Temporal Mixed Skeleton Sequences for Self-Supervised Skeleton-Based Action Recognition.
CoRR, 2022

Boosting R-CNN: Reweighting R-CNN Samples by RPN's Error for Underwater Object Detection.
CoRR, 2022

GraphMLP: A Graph MLP-Like Architecture for 3D Human Pose Estimation.
CoRR, 2022

Enhancing direct-path relative transfer function using deep neural network for robust sound source localization.
CAAI Trans. Intell. Technol., 2022

Head-related transfer function-reserved time-frequency masking for robust binaural sound source localization.
CAAI Trans. Intell. Technol., 2022

Integrating Point and Line Features for Visual-Inertial Initialization.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Identity-Sensitive Knowledge Propagation for Cloth-Changing Person Re-Identification.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Unsupervised Domain Adaptation Person Re-Identification by Camera-Aware Style Decoupling and Uncertainty Modeling.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization.
Proceedings of the IEEE International Conference on Acoustics, 2022

Adaptive Weighted Network With Edge Enhancement Module For Monocular Self-Supervised Depth Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2022

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Audio-Visual Multi-person Keyword Spotting via Hybrid Fusion.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Audio-Visual Fusion Network Based on Conformer for Multimodal Emotion Recognition.
Proceedings of the Artificial Intelligence - Second CAAI International Conference, 2022

Pose-Guided Feature Disentangling for Occluded Person Re-identification Based on Transformer.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Multi-Modal Perception Attention Network with Self-Supervised Learning for Audio-Visual Speaker Tracking.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-Supervised Action Recognition.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
When Dictionary Learning Meets Deep Learning: Deep Dictionary Learning and Coding Network for Image Recognition With Limited Data.
IEEE Trans. Neural Networks Learn. Syst., 2021

Bi-Directional Exponential Angular Triplet Loss for RGB-Infrared Person Re-Identification.
IEEE Trans. Image Process., 2021

Learning Deep Direct-Path Relative Transfer Function for Binaural Sound Source Localization.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Optimization-Based Online Initialization and Calibration of Monocular Visual-Inertial Odometry Considering Spatial-Temporal Constraints.
Sensors, 2021

PCLoss: Fashion Landmark Estimation with Position Constraint Loss.
Pattern Recognit., 2021

Weakly-supervised Cross-view 3D Human Pose Estimation.
CoRR, 2021

Achieving Domain Generalization in Underwater Object Detection by Image Stylization and Domain Mixup.
CoRR, 2021

Lifting Transformer for 3D Human Pose Estimation in Video.
CoRR, 2021

Adversarial Feature Disentanglement for Long-Term Person Re-identification.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Modality-aware Style Adaptation for RGB-Infrared Person Re-Identification.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Attend, Correct And Focus: A Bidirectional Correct Attention Network For Image-Text Matching.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Supervised Direct-Path Relative Transfer Function Learning for Binaural Sound Source Localization.
Proceedings of the IEEE International Conference on Acoustics, 2021

Object Goal Visual Navigation Using Semantic Spatial Relationships.
Proceedings of the Artificial Intelligence - First CAAI International Conference, 2021

Learning to Attack Real-World Models for Person Re-identification via Virtual-Guided Meta-Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
An Online Initialization and Self-Calibration Method for Stereo Visual-Inertial Odometry.
IEEE Trans. Robotics, 2020

Unified Generative Adversarial Networks for Controllable Image-to-Image Translation.
IEEE Trans. Image Process., 2020

Incomplete Multiview Spectral Clustering With Adaptive Graph Learning.
IEEE Trans. Cybern., 2020

Attention-guided CNN for image denoising.
Neural Networks, 2020

Identity-sensitive loss guided and instance feature boosted deep embedding for person search.
Neurocomputing, 2020

Online Initialization and Extrinsic Spatial-Temporal Calibration for Monocular Visual-Inertial Odometry.
CoRR, 2020

A Survey on 3D Skeleton-Based Action Recognition Using Learning Method.
CoRR, 2020

An Adaptive Method Based on Multiscale Dilated Convolutional Network for Binaural Speech Source Localization.
Complex., 2020

Deep Metric Learning-Assisted 3D Audio-Visual Speaker Tracking via Two-Layer Particle Filter.
Complex., 2020

Part-Based Lipreading for Audio-Visual Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Systems, Man, and Cybernetics, 2020

Lip Graph Assisted Audio-Visual Speech Recognition Using Bidirectional Synchronous Fusion.
Proceedings of the Interspeech 2020, 2020

Unsupervised Monocular Visual-inertial Odometry Network.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Audio-Visual Speech Recognition Using A Two-Step Feature Fusion Strategy.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Mutual Alignment between Audiovisual Features for End-to-End Audiovisual Speech Recognition.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

3D Audio-Visual Speaker Tracking with A Novel Particle Filter.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

A Base-Derivative Framework for Cross-Modality RGB-Infrared Person Re-Identification.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Robust Audio-Visual Speech Recognition Based on Hybrid Fusion.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Efficient High-Resolution High-Level-Semantic Representation Learning for Human Pose Estimation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Grouped Temporal Enhancement Module for Human Action Recognition.
Proceedings of the IEEE International Conference on Image Processing, 2020

Motion Rectification Network for Unsupervised Learning of Monocular Depth and Camera Motion.
Proceedings of the IEEE International Conference on Image Processing, 2020

Robust Audio-Visual Mandarin Speech Recognition Based On Adaptive Decision Fusion And Tone Features.
Proceedings of the IEEE International Conference on Image Processing, 2020

Spatio-Temporal and Geometry Constrained Network for Automobile Visual Odometry.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Position Constraint Loss For Fashion Landmark Estimation.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

GFNet: A Lightweight Group Frame Network for Efficient Human Action Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Fast and Accurate Super-Resolution Network Using Progressive Residual Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Spatial Pyramid Based Graph Reasoning for Semantic Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Sample Fusion Network: An End-to-End Data Augmentation Network for Skeleton-Based Human Action Recognition.
IEEE Trans. Image Process., 2019

Multiple Sound Source Counting and Localization Based on TF-Wise Spatial Spectrum Clustering.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Regrasp Planning Using Stable Object Poses Supported by Complex Structures.
IEEE Trans. Cogn. Dev. Syst., 2019

Fast and robust dynamic hand gesture recognition via key frames extraction and feature fusion.
Neurocomputing, 2019

Asymmetric Generative Adversarial Networks for Image-to-Image Translation.
CoRR, 2019

Multi-view Vector-valued Manifold Regularization for Multi-label Image Classification.
CoRR, 2019

TDD-net: a tiny defect detection network for printed circuit boards.
CAAI Trans. Intell. Technol., 2019

Multitask Learning of Time-Frequency CNN for Sound Source Localization.
IEEE Access, 2019

Combining Adaptive Hierarchical Depth Motion Maps With Skeletal Joints for Human Action Recognition.
IEEE Access, 2019

Robust Interaural Time Difference Estimation Based on Convolutional Neural Network.
Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics, 2019

Synergistic Optimization based Binaural Time-Frequency Masking for Speech Source Localization.
Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics, 2019

An Effective 3D Human Pose Estimation Method Based on Dilated Convolutions for Videos.
Proceedings of the 2019 IEEE International Conference on Robotics and Biomimetics, 2019

3D Audio-Visual Speaker Tracking with A Two-Layer Particle Filter.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Self-Refining Deep Symmetry Enhanced Network for Rain Removal.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Expectation-Maximization Attention Networks for Semantic Segmentation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

A Weight-shared Dual-branch Convolutional Neural Network for Unsupervised Dense Depth Prediction and Camera Motion Estimation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Unified Embedding Alignment with Missing Views Inferring for Incomplete Multi-View Clustering.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Robust 3D Action Recognition Through Sampling Local Appearances and Global Distributions.
IEEE Trans. Multim., 2018

3D Action Recognition Using Multiscale Energy-Based Global Ternary Image.
IEEE Trans. Circuits Syst. Video Technol., 2018

Sensor-based complete coverage path planning in dynamic environment for cleaning robot.
CAAI Trans. Intell. Technol., 2018

An Attention-Aware Model for Human Action Recognition on Tree-Based Skeleton Sequences.
Proceedings of the Social Robotics - 10th International Conference, 2018

Multiple Concurrent Sound Source Tracking Based on Observation-Guided Adaptive Particle Filter.
Proceedings of the Interspeech 2018, 2018

Online Initialization and Automatic Camera-IMU Extrinsic Calibration for Monocular Visual-Inertial SLAM.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Skeleton-Based Human Action Recognition Using Spatial Temporal 3D Convolutional Neural Networks.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Hierarchical Dropped Convolutional Neural Network for Speed Insensitive Human Action Recognition.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Spatial-Temporal Data Augmentation Based on LSTM Autoencoder Network for Skeleton-Based Human Action Recognition.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Instance Enhancing Loss: Deep Identity-Sensitive Feature Embedding for Person Search.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Audio-Visual Keyword Spotting Based on Multidimensional Convolutional Neural Network.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

An End-To-End Siamese Convolutional Neural Network for Loop Closure Detection in Visual Slam System.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Learning Explicit Shape and Motion Evolution Maps for Skeleton-Based Human Action Recognition.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Discriminatively Learned Feature Embedding Based on Multi-Loss Fusion For Person Search.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Recurrent Squeeze-and-Excitation Context Aggregation Net for Single Image Deraining.
Proceedings of the Computer Vision - ECCV 2018, 2018

Structured Attention Guided Convolutional Neural Fields for Monocular Depth Estimation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Binaural Sound Localization Based on Reverberation Weighting and Generalized Parametric Mapping.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Online growing neural gas for anomaly detection in changing surveillance scenes.
Pattern Recognit., 2017

Enhanced skeleton visualization for view invariant human action recognition.
Pattern Recognit., 2017

Spontaneous versus posed smile recognition via region-specific texture descriptor and geometric facial dynamics.
Frontiers Inf. Technol. Electron. Eng., 2017

How do you smile? Towards a comprehensive smile analysis system.
Neurocomputing, 2017

A Bidirectional Adaptive Bandwidth Mean Shift Strategy for Clustering.
CoRR, 2017

Two-Stream 3D Convolutional Neural Network for Skeleton-Based Action Recognition.
CoRR, 2017

Online RGB-D person re-identification based on metric model update.
CAAI Trans. Intell. Technol., 2017

Multi-Temporal Depth Motion Maps-Based Local Binary Patterns for 3-D Human Action Recognition.
IEEE Access, 2017

Multiple Sound Source Counting and Localization Based on Spatial Principal Eigenvector.
Proceedings of the Interspeech 2017, 2017

3D action recognition using data visualization and convolutional neural networks.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Learning informative pairwise joints with energy-based temporal pyramid for 3D action recognition.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

Time-ordered spatial-temporal interest points for human action classification.
Proceedings of the 2017 IEEE International Conference on Multimedia and Expo, 2017

3D action recognition using multi-temporal skeleton visualization.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

A bidirectional adaptive bandwidth mean shift strategy for clustering.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Fusing shape and motion matrices for view invariant action recognition using 3D skeletons.
Proceedings of the 2017 IEEE International Conference on Image Processing, 2017

Multiple sound source localization based on TDOA clustering and multi-path matching pursuit.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

A novel re-tracking strategy for monocular SLAM.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Human action recognition using Adaptive Hierarchical Depth Motion Maps and Gabor filter.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
A Novel Lip Descriptor for Audio-Visual Keyword Spotting Based on Adaptive Decision Fusion.
IEEE Trans. Multim., 2016

Violence detection using Oriented VIolent Flows.
Image Vis. Comput., 2016

A novel hierarchical Bag-of-Words model for compact action representation.
Neurocomputing, 2016

Human activity prediction by mapping grouplets to recurrent Self-Organizing Map.
Neurocomputing, 2016

Depth Context: a new descriptor for human activity recognition by using sole depth sequences.
Neurocomputing, 2016

A new descriptor of gradients Self-Similarity for smile detection in unconstrained scenarios.
Neurocomputing, 2016

Bi-Direction Interaural Matching Filter and Decision Weighting Fusion for Sound Source Localization in Noisy Environments.
IEICE Trans. Inf. Syst., 2016

Orientation Driven Bag of Appearances for Person Re-identification.
CoRR, 2016

Scene-adaptive hierarchical data association and depth-invariant part-based appearance model for indoor multiple objects tracking.
CAAI Trans. Intell. Technol., 2016

Sequential Bag-of-Words model for human action classification.
CAAI Trans. Intell. Technol., 2016

Salient pairwise spatio-temporal interest points for real-time activity recognition.
CAAI Trans. Intell. Technol., 2016

A novel multi-cue integration system for efficient human fall detection.
Proceedings of the 2016 IEEE International Conference on Robotics and Biomimetics, 2016

Probabilistic binaural multiple sources localization based on time-delay compensation estimator and clustering analysis.
Proceedings of the 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2016

Multi-Channel Linear Prediction Based on Binaural Coherence for Speech Dereverberation.
Proceedings of the Interspeech 2016, 2016

A Novel Feature Matching Strategy for Large Scale Image Retrieval.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

3D Action Recognition Using Multi-Temporal Depth Motion Maps and Fisher Vector.
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

Energy-Based Global Ternary Image for Action Recognition Using Sole Depth Sequences.
Proceedings of the Fourth International Conference on 3D Vision, 2016

2015
Robust Acoustic Localization Via Time-Delay Compensation and Interaural Matching Filter.
IEEE Trans. Signal Process., 2015

Exploring Spatial Correlation for Visual Object Retrieval.
ACM Trans. Intell. Syst. Technol., 2015

Noise-free representation based classification and face recognition experiments.
Neurocomputing, 2015

Binaural cues estimates based on Interaural Matching Filter for sound source localization.
Proceedings of the 2015 IEEE International Conference on Robotics and Biomimetics, 2015

Gender Classification Using Pyramid Segmentation for Unconstrained Back-facing Video Sequences.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Direction of arrival estimation based on reverberation weighting and noise error estimator.
Proceedings of the INTERSPEECH 2015, 2015

A predictive model for narrow passage path planner by using Support Vector Machine in changing environments.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Binaural sound source localization based on generalized parametric model and two-layer matching strategy in complex environments.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Two-level multi-task metric learning with application to multi-classification.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Regression based landmark estimation and multi-feature fusion for visual speech recognition.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

SDM-BSM: A fusing depth scheme for human action recognition.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Online person orientation estimation based on classifier update.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Body-structure based feature representation for person re-identification.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Two-Layers Local Coordinate Coding.
Proceedings of the Computer Vision - CCF Chinese Conference, 2015

2014
Data Uncertainty in Face Recognition.
IEEE Trans. Cybern., 2014

Depth Motion Detection - A Novel RS-Trigger Temporal Logic based Method.
IEEE Signal Process. Lett., 2014

Scene-Adaptive Hierarchical Data Association for Multiple Objects Tracking.
IEEE Signal Process. Lett., 2014

Contact-free and pose-invariant hand-biometric-based personal identification system using RGB and depth data.
J. Zhejiang Univ. Sci. C, 2014

Modified minimum squared error algorithm for robust classification and face recognition experiments.
Neurocomputing, 2014

Locality and similarity preserving embedding for feature selection.
Neurocomputing, 2014

Online learning on incremental distance metric for person re-identification.
Proceedings of the 2014 IEEE International Conference on Robotics and Biomimetics, 2014

Adaptive scene correlation learning based on scale-invariant appearance co-occurrence model for camera network.
Proceedings of the 2014 IEEE International Conference on Robotics and Biomimetics, 2014

Scene correlation learning by event co-occurrence modeling for camera network.
Proceedings of the 2014 IEEE International Conference on Robotics and Biomimetics, 2014

Human action classification based on sequential bag-of-words model.
Proceedings of the 2014 IEEE International Conference on Robotics and Biomimetics, 2014

A new hierarchical binaural sound source localization method based on Interaural Matching Filter.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Audio-visual keyword spotting based on adaptive decision fusion under noisy conditions for human-robot interaction.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Audio-visual Keyword Spotting for Mandarin Based on Discriminative Local Spatial-Temporal Descriptors.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Action classification by exploring directional co-occurrence of weighted stips.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Gender identification in unconstrained scenarios using Self-Similarity of Gradients features.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Smile detection in unconstrained scenarios using self-similarity of gradients features.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Improved Voice Activity Detection based on support vector machine with high separable speech feature vectors.
Proceedings of the 19th International Conference on Digital Signal Processing, 2014

Spontaneous versus posed smile recognition using discriminative local spatial-temporal descriptors.
Proceedings of the IEEE International Conference on Acoustics, 2014

A binaural sound source localization model based on time-delay compensation and interaural coherence.
Proceedings of the IEEE International Conference on Acoustics, 2014

Learning directional co-occurrence for human action classification.
Proceedings of the IEEE International Conference on Acoustics, 2014

Speaker age recognition based on isolated words by using SVM.
Proceedings of the IEEE 3rd International Conference on Cloud Computing and Intelligence Systems, 2014

Facial age estimation using bio-inspired features and cost-sensitive ordinal hyperplane rank.
Proceedings of the IEEE 3rd International Conference on Cloud Computing and Intelligence Systems, 2014

Cross-domain sentiment classification using deep learning approach.
Proceedings of the IEEE 3rd International Conference on Cloud Computing and Intelligence Systems, 2014

2013
Multiview Vector-Valued Manifold Regularization for Multilabel Image Classification.
IEEE Trans. Neural Networks Learn. Syst., 2013

Sound Source Localization for HRI Using FOC-Based Time Difference Feature and Spatial Grid Matching.
IEEE Trans. Cybern., 2013

A comprehensive study on learning to rank for content-based image retrieval.
Signal Process., 2013

Coarse to fine K nearest neighbor classifier.
Pattern Recognit. Lett., 2013

Using the original and 'symmetrical face' training samples to perform representation based two-step face recognition.
Pattern Recognit., 2013

Infrared and visible imagery fusion based on region saliency detection for 24-hour-surveillance systems.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2013

Simulated capacitor method for difficult region dynamic boosting in changing environments.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2013

Obstacle guided RRT path planner with region classification for changing environments.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2013

On-line sound event detection and recognition based on adaptive background model for robot audition.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2013

Robust abandoned object detection and analysis based on online learning.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2013

A two-layer probabilistic model based on time-delay compensation for binaural sound localization.
Proceedings of the 2013 IEEE International Conference on Robotics and Automation, 2013

Maximally stable curvature regions for 3D hand tracking.
Proceedings of the IEEE International Conference on Image Processing, 2013

Unusual events detection based on multi-dictionary sparse representation using kinect.
Proceedings of the IEEE International Conference on Image Processing, 2013

Learning spatio-temporal co-occurrence correlograms for efficient human action classification.
Proceedings of the IEEE International Conference on Image Processing, 2013

Hierarchical data association and depth-invariant appearance model for indoor multiple objects tracking.
Proceedings of the IEEE International Conference on Image Processing, 2013

Salient-motion-heuristic scheme for fast 3D optical flow estimation using RGB-D data.
Proceedings of the IEEE International Conference on Acoustics, 2013

Inferring Ongoing Human Activities Based on Recurrent Self-Organizing Map Trajectory.
Proceedings of the British Machine Vision Conference, 2013

2012
Robust visual tracking based on adaptive depth-color-cue integration using range sensor.
Proceedings of the IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, 2012

Robust Hand Tracking with Hough Forest and Multi-cue Flocks of Features.
Proceedings of the Advances in Visual Computing - 8th International Symposium, 2012

A "capacitor" bridge builder based safe path planner for difficult regions identification in changing environments.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Hierarchical RRT for humanoid robot footstep planning with multiple constraints in complex environments.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Time Delay Estimation for Speech Signal Based on FOC-Spectrum.
Proceedings of the INTERSPEECH 2012, 2012

A Probabilistic Method of Bearing-only Localization by Using Omnidirectional Vision Signal Processing.
Proceedings of the Eighth International Conference on Intelligent Information Hiding and Multimedia Signal Processing, 2012

Comparison of methods for smile deceit detection by training AU6 and AU12 simultaneously.
Proceedings of the 19th IEEE International Conference on Image Processing, 2012

Action Disambiguation Analysis Using Normalized Google-Like Distance Correlogram.
Proceedings of the Computer Vision - ACCV 2012, 2012

2011
Sound source localization for mobile robot based on time difference feature and space grid matching.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Real-time human tracking based on switching linear dynamic system combined with adaptive Meanshift tracker.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

2010
A selection method of speech vocabulary for human-robot speech interaction.
Proceedings of the IEEE International Conference on Systems, 2010

Omnidirectional vision for mobile robot human body detection and localization.
Proceedings of the IEEE International Conference on Systems, 2010

Adaptive replanning in hard changing environments.
Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

Continuous sound source localization based on microphone array for mobile robots.
Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010

A dynamic subgoal path planner for unpredictable environments.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Visual attention & multi-cue fusion based human motion tracking method.
Proceedings of the Sixth International Conference on Natural Computation, 2010

2009
Robust human tracking based on multi-cue integration and mean-shift.
Pattern Recognit. Lett., 2009

Motion Planning for Human-Robot Interaction Based on Stereo Vision and SIFT.
Proceedings of the IEEE International Conference on Systems, 2009

Combining Color Histogram and Gradient Orientation Histogram for Vision Based Global Localization.
Proceedings of the IEEE International Conference on Systems, 2009

A Modified Cross Power-Spectrum Phase Method Based on Microphone Array for Acoustic Source Localization.
Proceedings of the IEEE International Conference on Systems, 2009

A Method to Restore Chinese Warped Document Images Based on Binding Characters and Building Curved Lines.
Proceedings of the IEEE International Conference on Systems, 2009

Image Restoration of Warped Complex Chinese Documents Based on Text Boundary Lines.
Proceedings of the IEEE International Conference on Systems, 2009

An Effective Background Reconstruction Method for Complicated Traffic Crossroads.
Proceedings of the IEEE International Conference on Systems, 2009

Visual Tracking Algorithm Based on CAMSHIFT and Multi-cue Fusion for Human Motion Analysis.
Proceedings of the IEEE International Conference on Systems, 2009

A Slope K method for image based localization.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2009

A random local-DRM path planning algorithm for dual manipulator mobile robots in changing environments.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2009

Hierarchical roadmap based rapid path planning for high-DOF mobile manipulators in complex environments.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2009

Detection of hands-raising gestures using shape and edge features.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2009

Collaboration of spatial and feature attention for visual tracking.
Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Path planning in changing environments by using optimal path segment search.
Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Robust visual tracking based on selective attention shift.
Proceedings of the IEEE International Conference on Control Applications, 2009

2008
Skew detection for complex document images using robust borderlines in both text and non-text regions.
Pattern Recognit. Lett., 2008

Detection of hand-raising gestures based on body silhouette analysis.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2008

Predictive model for path planning by using k-near dynamic bridge builder and Inner Parzen Window.
Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2008

Adaptive feature-spatial representation for Mean-shift tracker.
Proceedings of the International Conference on Image Processing, 2008

2007
Automatic seal image retrieval method by using shape features of Chinese characters.
Proceedings of the IEEE International Conference on Systems, 2007

A hybrid HMM/SVM classifier for motion recognition using μIMU data.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2007

A dynamic bridge builder to identify difficult regions for path planning in changing environments.
Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

Collaborative Mean Shift Tracking Based on Multi-Cue Integration and Auxiliary Objects.
Proceedings of the International Conference on Image Processing, 2007

Fuzzy Decision Method for Motion Deadlock Resolving in Robot Soccer Games.
Proceedings of the Advanced Intelligent Computing Theories and Applications. With Aspects of Theoretical and Methodological Issues, 2007

2006
Robust Mean Shift Tracking Based on Multi-Cue Integration.
Proceedings of the IEEE International Conference on Systems, 2006

A Path Planner in Changing Environments by Using W-C Nodes Mapping Coupled with Lazy Edges Evaluation.
Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006

2005
Omni-directional vision based human motion detection for autonomous mobile robots.
Proceedings of the IEEE International Conference on Systems, 2005

Modeling facial expression space for recognition.
Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

A planning method for safe interaction between human arms and robot manipulators.
Proceedings of the 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2005

Real-Time and Distributed AV Content Analysis System for Consumer Electronics Networks.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Document Image Retrieval Based on Density Distribution Feature and Key Block Feature.
Proceedings of the Eighth International Conference on Document Analysis and Recognition (ICDAR 2005), 29 August, 2005

2004
A new method of detecting human eyelids based on deformable templates.
Proceedings of the IEEE International Conference on Systems, 2004

Competition analysis system for soccer robots based on global vision and trajectory restrictions.
Proceedings of the IEEE International Conference on Systems, 2004

Real-time Motion Planning for Interaction between Human Arm and Robot Manipulator.
Proceedings of the 2004 IEEE International Conference on Robotics and Biomimetics, 2004

3D model based head pose tracking by using weighted depth and brightness constraints.
Proceedings of the Third International Conference on Image and Graphics, 2004

Affine Correspondence Based Head Pose Estimation for a Sequence of Images by Using a 3D Model.
Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004


  Loading...