Peng Zhang

Orcid: 0000-0001-9690-7026

Affiliations:
  • Northwestern Polytechnical University, School of Computer Science, Xi'an, China
  • Nanyang Technological University, Singapore (PhD 2011)


According to our database, Peng Zhang authored at least 98 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
How does Layer Normalization improve Batch Normalization in self-supervised sound source localization?
Neurocomputing, January, 2024

'Parallel-Circuitized' distillation for dense object detection.
Displays, January, 2024

2023
A novel evolutionary algorithm inspired from triangle search and its applications on parameters identification of photovoltaic models.
Soft Comput., October, 2023

Conditional invertible image re-scaling.
Pattern Recognit., July, 2023

Object detection based on cortex hierarchical activation in border sensitive mechanism and classification-GIoU joint representation.
Pattern Recognit., May, 2023

Look&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement.
IEEE Trans. Multim., 2023

Facial Expression Guided Diagnosis of Parkinson's Disease via High-Quality Data Augmentation.
IEEE Trans. Multim., 2023

Reversible Modal Conversion Model for Thermal Infrared Tracking.
IEEE Multim., 2023

FTFDNet: Learning to Detect Talking Face Video Manipulation with Tri-Modality Interaction.
CoRR, 2023

CASP-Net: Rethinking Video Saliency Prediction from an Audio-Visual Consistency Perceptual Perspective.
CoRR, 2023

Induction Network: Audio-Visual Modality Gap-Bridging for Self-Supervised Sound Source Localization.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Semi-Supervised Multimodal Emotion Recognition with Class-Balanced Pseudo-labeling.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

CASP-Net: Rethinking Video Saliency Prediction from an Audio-Visual Consistency Perceptual Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Identity-Aware Facial Expression Recognition Via Deep Metric Learning Based on Synthesized Images.
IEEE Trans. Multim., 2022

A novel locally-constrained GAN-based ensemble to synthesize arterial spin labeling images.
Inf. Sci., 2022

One-shot Video Graph Generation for Explainable Action Reasoning.
Neurocomputing, 2022

Semantic-aware spatial regularization correlation filter for visual tracking.
IET Comput. Vis., 2022

Self-Supervised Cross-Modal Distillation for Thermal Infrared Tracking.
IEEE Multim., 2022

An Audio-Visual Attention Based Multimodal Network for Fake Talking Face Videos Detection.
CoRR, 2022

Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild.
CoRR, 2022

Audio-visual speech separation based on joint feature representation with cross-modal attention.
CoRR, 2022

Look&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement.
CoRR, 2022

Information Lossless Multi-modal Image Generation for RGB-T Tracking.
Proceedings of the Pattern Recognition and Computer Vision - 5th Chinese Conference, 2022

2021
Multiple Instance Models Regression for Robust Visual Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2021

A novel multi-loss-based deep adversarial network for handling challenging cases in semi-supervised image semantic segmentation.
Pattern Recognit. Lett., 2021

Full-scaled deep metric learning for pedestrian re-identification.
Multim. Tools Appl., 2021

Learning spatial-channel regularization jointly with correlation filter for visual tracking.
Neurocomputing, 2021

Multiple object tracking based on multi-task learning with strip attention.
IET Image Process., 2021

A new VAE-GAN model to synthesize arterial spin labeling images from structural MRI.
Displays, 2021

Tracking based on scale-estimated deep networks with hierarchical correlation ensembling for cross-media understanding.
Displays, 2021

A Novel Multi-Feature Joint Learning Ensemble Framework for Multi-Label Facial Expression Recognition.
IEEE Access, 2021

Unsupervised Cross-Modal Distillation for Thermal Infrared Tracking.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

2020
Deep Position-Sensitive Tracking.
IEEE Trans. Multim., 2020

Ensemble Tracking Based on Diverse Collaborative Framework With Multi-Cue Dynamic Fusion.
IEEE Trans. Multim., 2020

Unsupervised Online Video Object Segmentation With Motion Property Understanding.
IEEE Trans. Image Process., 2020

Online Semantic Subspace Learning with Siamese Network for UAV Tracking.
Remote. Sens., 2020

Robust Visual Tracking based on Adversarial Unlabeled Instance Generation with Label Smoothing Loss Regularization.
Pattern Recognit., 2020

Unsupervised Ensemble Hashing: Boosting Minimum Hamming Distance.
IEEE Access, 2020

Arterial Spin Labeling Image Synthesis From Structural MRI Using Improved Capsule-Based Networks.
IEEE Access, 2020

2019
Arterial Spin Labeling Images Synthesis From sMRI Using Unbalanced Deep Discriminant Learning.
IEEE Trans. Medical Imaging, 2019

A Novel Analysis Dictionary Learning Model Based Hyperspectral Image Classification Method.
Remote. Sens., 2019

Robust Hyperspectral Image Domain Adaptation With Noisy Labels.
IEEE Geosci. Remote. Sens. Lett., 2019

ACFT: adversarial correlation filter for robust tracking.
IET Image Process., 2019

Distractor-Aware Visual Tracking by Online Siamese Network.
IEEE Access, 2019

Video-Based Abnormal Driving Behavior Detection via Deep Learning Fusions.
IEEE Access, 2019

Explainable Video Action Reasoning via Prior Knowledge and State Transitions.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Novel Bi-directional Images Synthesis Based on WGAN-GP with GMM-Based Noise Generation.
Proceedings of the Machine Learning in Medical Imaging - 10th International Workshop, 2019

Arterial Spin Labeling Images Synthesis via Locally-Constrained WGAN-GP Ensemble.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019

Deep Audio-visual System for Closed-set Word-level Speech Recognition.
Proceedings of the International Conference on Multimodal Interaction, 2019

Design of UAV Longitudinal Control law Based on Small Disturbance and Motion Characteristics.
Proceedings of the 6th International Conference on Dependable Systems and Their Applications, 2019

Deep Learning-based Pavement Cracks Detection via Wireless Visible Light Camera-based Network.
Proceedings of the Computing, Communications and IoT Applications, ComComAp 2019, Shenzhen, 2019

Deep Discriminant Learning-based Asphalt Road Cracks Detection via Wireless Camera Network.
Proceedings of the Computing, Communications and IoT Applications, ComComAp 2019, Shenzhen, 2019

2018
Saliency flow based video segmentation via motion guided contour refinement.
Signal Process., 2018

Going deeper with two-stream ConvNets for action recognition in video surveillance.
Pattern Recognit. Lett., 2018

Robust tracking based on H-CNN with low-resource sampling and scaling by frame-wise motion localization.
Multim. Tools Appl., 2018

Single-target localization in video sequences using offline deep-ranked metric learning and online learned models updating.
Multim. Tools Appl., 2018

Pixel-wise partial volume effects correction on arterial spin labeling magnetic resonance images.
Multim. Tools Appl., 2018

A Novel Framework to Localize Moving Targets in Video Surveillance Systems via Spectral Clustering.
Proceedings of the 2018 International Conference on Identification, 2018

2017
Online object tracking based on BLSTM-RNN with contextual-sequential labeling.
J. Ambient Intell. Humaniz. Comput., 2017

Object coding based video authentication for privacy protection in immersive communication.
J. Ambient Intell. Humaniz. Comput., 2017

Online object tracking based on CNN with spatial-temporal saliency guided sampling.
Neurocomputing, 2017

Video Action Recognition Based on Deeper Convolution Networks with Pair-Wise Frame Motion Concatenation.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

2016
Real-time tracking-by-learning with high-order regularization fusion for big video abstraction.
Signal Process., 2016

Bayesian tracking fusion framework with online classifier ensemble for immersive visual applications.
Multim. Tools Appl., 2016

Guest Editorial: Immersive Audio/Visual Systems.
Multim. Tools Appl., 2016

A novel dementia diagnosis strategy on arterial spin labeling magnetic resonance images via pixel-wise partial volume correction and ranking.
Multim. Tools Appl., 2016

Online tracking based on efficient transductive learning with sample matching costs.
Neurocomputing, 2016

Deformable object tracking with spatiotemporal segmentation in big vision surveillance.
Neurocomputing, 2016

Autonomous Wheeled Robot Navigation with Uncalibrated Spherical Images.
Proceedings of the Intelligent Visual Surveillance - 4th Chinese Conference, 2016

How does human interest modeling help in computer vision: Tracking-by-saliency in unconstrained social videos.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

A novel online clustering-based object localization strategy using learnings and human interest priors.
Proceedings of the 2016 IEEE International Conference on Multimedia & Expo Workshops, 2016

A multi-label Hyperspectral image classification method with deep learning features.
Proceedings of the International Conference on Internet Multimedia Computing and Service, 2016

2015
Multiple pedestrian tracking based on couple-states Markov chain with semantic topic learning for video surveillance.
Soft Comput., 2015

A novel marker-less lung tumor localization strategy on low-rank fluoroscopic images with similarity learning.
Multim. Tools Appl., 2015

Empirical mode decomposition based blind audio watermarking.
Multim. Tools Appl., 2015

Online Object Tracking Based on CNN with Metropolis-Hasting Re-Sampling.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Superframe segmentation based on content-motion correspondence for social video summarization.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014
Moving people tracking with detection by latent semantic analysis for visual surveillance applications.
Multim. Tools Appl., 2014

Coverage enhancement by using the mobility of mobile sensor nodes.
Multim. Tools Appl., 2014

Deformable object tracking with spatiotemporal segmentation in big vision surveillance.
Proceedings of the IEEE International Conference on Security, 2014

Medical social media analytics via ranking and big learning: An image-based disease prediction study.
Proceedings of the IEEE International Conference on Security, 2014

Object Tracking using Reformative Transductive Learning with Sample Variational Correspondence.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

An ensemble of deep neural networks for object tracking.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

2013
Multi-covered path in wireless sensor networks.
Telecommun. Syst., 2013

A Novel Similarity Learning Method via Relative Comparison for Content-Based Medical Image Retrieval.
J. Digit. Imaging, 2013

Non-rigid target tracking based on 'flow-cut' in pair-wise frames with online hough forests.
Proceedings of the ACM Multimedia Conference, 2013

A novel marker-less tumor tracking strategy on low-rank fluoroscopic images for image-guided lung cancer radiotherapy.
Proceedings of the IEEE International Conference on Image Processing, 2013

2012
Privacy enabled video surveillance using a two state Markov tracking algorithm.
Multim. Syst., 2012

2011
Pedestrian detection and tracking with application in visual surveillance.
PhD thesis, 2011

Auto-scaled ISL tracking for region based control infrastructure and applications in video surveillance.
Comput. Syst. Sci. Eng., 2011

Pedestrian Tracking Based on Hidden-Latent Temporal Markov Chain.
Proceedings of the Advances in Multimedia Modeling, 2011

2010
Moving People Segmentation from Zoomed Dynamic Scenes Containing a Door.
New Gener. Comput., 2010

Privacy preserving video surveillance using pedestrian tracking mechanism.
Proceedings of the 2nd ACM workshop on Multimedia in forensics, security and intelligence, 2010

An Authentication Mechanism Using Chinese Remainder Theorem for Efficient Surveillance Video Transmission.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010

2009
Spatiotemporal latent semantic cues for moving people tracking.
Proceedings of the IEEE International Conference on Acoustics, 2009

Auto-scaled Incremental Tensor Subspace Learning for Region Based Rate Control Application.
Proceedings of the Computer Vision, 2009

2008
Spatial and temporal sampling control for visual surveillance application.
Proceedings of the IEEE International Conference on Systems, 2008

Zoomed Object Segmentation from Dynamic Scene Containing a Door.
Proceedings of the 10th IEEE International Conference on High Performance Computing and Communications, 2008

