Peng Wang

Orcid: 0000-0001-7689-3405

Affiliations:
  • Northwestern Polytechnical University, School of Computer Science, Xi'an, China (since 2017)
  • University of Adelaide, School of Computer Science, SA, Australia (former)
  • Beihang University, Beijing, China (PhD 2011)


According to our database1, Peng Wang authored at least 106 papers between 2009 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Vehicle Re-Identification in Aerial Images and Videos: Dataset and Approach.
IEEE Trans. Circuits Syst. Video Technol., March, 2024

Human Cognition-Based Consistency Inference Networks for Multi-Modal Fake News Detection.
IEEE Trans. Knowl. Data Eng., January, 2024

Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model.
IEEE Trans. Multim., 2024

Toward Video Anomaly Retrieval From Video Anomaly Detection: New Benchmarks and Model.
IEEE Trans. Image Process., 2024

VadCLIP: Adapting Vision-Language Models for Weakly Supervised Video Anomaly Detection.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Addressing Information Inequality for Text-Based Person Search via Pedestrian-Centric Visual Denoising and Bias-Aware Alignments.
IEEE Trans. Circuits Syst. Video Technol., December, 2023

Searching sharing relationship for instance segmentation decoder.
Appl. Intell., September, 2023

HOP+: History-Enhanced and Order-Aware Pre-Training for Vision-and-Language Navigation.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

A Proposal-Free One-Stage Framework for Referring Expression Comprehension and Generation via Dense Cross-Attention.
IEEE Trans. Multim., 2023

Rethinking and Improving Feature Pyramids for One-Stage Referring Expression Comprehension.
IEEE Trans. Image Process., 2023

Visible and Infrared Object Tracking via Convolution-Transformer Network With Joint Multimodal Feature Learning.
IEEE Geosci. Remote. Sens. Lett., 2023

A Dynamic Feature Interaction Framework for Multi-task Visual Perception.
Int. J. Comput. Vis., 2023

Open-Vocabulary Video Anomaly Detection.
CoRR, 2023

Zero-Shot Object Goal Visual Navigation With Class-Independent Relationship Network.
CoRR, 2023

Human-centric Behavior Description in Videos: New Benchmark and Model.
CoRR, 2023

S3C: Semi-Supervised VQA Natural Language Explanation via Self-Critical Learning.
CoRR, 2023

AerialVLN: Vision-and-Language Navigation for UAVs.
CoRR, 2023

Towards Video Anomaly Retrieval from Video Anomaly Detection: New Benchmarks and Model.
CoRR, 2023

Pre-train, Adapt and Detect: Multi-Task Adapter Tuning for Camouflaged Object Detection.
CoRR, 2023

A Dynamic Feature Interaction Framework for Multi-task Visual Perception.
CoRR, 2023

Toward Re-Identifying Any Animal.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Ground-to-Aerial Person Search: Benchmark Dataset and Approach.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Weakly Supervised Video Anomaly Detection Based on Cross-Batch Clustering Guidance.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

AerialVLN: Vision-and-Language Navigation for UAVs.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

S<sup>3</sup>C: Semi-Supervised VQA Natural Language Explanation via Self-Critical Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

A New Comprehensive Benchmark for Semi-supervised Video Anomaly Detection and Anticipation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Scale Adaptive Network for Partial Person Re-identification: Counteracting Scale Variance.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

Stop-Gradient Softmax Loss for Deep Metric Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Visual Question Answering - From Theory to Application
Advances in Computer Vision and Pattern Recognition, Springer, ISBN: 978-981-19-0963-4, 2022

Adaptive Graph Convolutional Networks for Weakly Supervised Anomaly Detection in Videos.
IEEE Signal Process. Lett., 2022

Center Prediction Loss for Re-identification.
Pattern Recognit., 2022

Towards End-to-End Text Spotting in Natural Scenes.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Structured Multimodal Attentions for TextVQA.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Generalizable Person Re-Identification via Viewpoint Alignment and Fusion.
CoRR, 2022

Multi-Domain Joint Training for Person Re-Identification.
CoRR, 2022

Cross-modal Co-occurrence Attributes Alignments for Person Search by Language.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Pluggable Weakly-Supervised Cross-View Learning for Accurate Vehicle Re-Identification.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Temporal-Consistent Visual Clue Attentive Network for Video-Based Person Re-Identification.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Improving Image Captioning via Enhancing Dual-Side Context Awareness.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Dual-Level Decoupled Transformer for Video Captioning.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

CapOnImage: Context-driven Dense-Captioning on Image.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

A Simple and Robust Correlation Filtering Method for Text-Based Person Search.
Proceedings of the Computer Vision - ECCV 2022, 2022

Dynamically Transformed Instance Normalization Network for Generalizable Person Re-Identification.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Person Re-Identification in Aerial Imagery.
IEEE Trans. Multim., 2021

A Robust Attentional Framework for License Plate Recognition in the Wild.
IEEE Trans. Intell. Transp. Syst., 2021

Attend to the Difference: Cross-Modality Person Re-Identification via Contrastive Correlation.
IEEE Trans. Image Process., 2021

Where to Look and How to Describe: Fashion Image Retrieval With an Attentional Heterogeneous Bilinear Network.
IEEE Trans. Circuits Syst. Video Technol., 2021

An adversarial human pose estimation network injected with graph structure.
Pattern Recognit., 2021

A Performance Evaluation of Correspondence Grouping Methods for 3D Rigid Data Matching.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

NAS-FCOS: Efficient Search for Object Detection Architectures.
Int. J. Comput. Vis., 2021

Few-shot action recognition with implicit temporal alignment and pair similarity optimization.
Comput. Vis. Image Underst., 2021

Center Prediction Loss for Re-identification.
CoRR, 2021

Instance and Pair-Aware Dynamic Networks for Re-Identification.
CoRR, 2021

Text-Guided Visual Feature Refinement for Text-Based Person Search.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Chop Chop BERT: Visual Question Answering by Chopping VisualBERT's Heads.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Evaluating Local Geometric Feature Representations for 3D Rigid Data Matching.
IEEE Trans. Image Process., 2020

A holistic representation guided attention network for scene text recognition.
Neurocomputing, 2020

MobileCount: An efficient encoder-decoder framework for real-time crowd counting.
Neurocomputing, 2020

Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020


NAS-FCOS: Fast Neural Architecture Search for Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Say As You Wish: Fine-Grained Control of Image Caption Generation With Abstract Scene Graphs.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Discriminative and Robust Online Learning for Siamese Visual Tracking.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Toward End-to-End Car License Plate Detection and Recognition With Deep Neural Networks.
IEEE Trans. Intell. Transp. Syst., 2019

Hyperspectral Classification Based on Lightweight 3-D-CNN With Transfer Learning.
IEEE Trans. Geosci. Remote. Sens., 2019

Attend to the Difference: Cross-Modality Person Re-identification via Contrastive Correlation.
CoRR, 2019

Person Re-identification in Aerial Imagery.
CoRR, 2019

NAS-FCOS: Fast Neural Architecture Search for Object Detection.
CoRR, 2019

A Simple and Robust Convolutional-Attention Network for Irregular Text Recognition.
CoRR, 2019

A Simple and Robust Attentional Encoder-Decoder Model for License Plate Recognition.
Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019

Person Re-identification with Neural Architecture Search.
Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019

MobileCount: An Efficient Encoder-Decoder Framework for Real-Time Crowd Counting.
Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019


Vehicle Re-Identification in Aerial Imagery: Dataset and Approach.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Visual Question Answering as Reading Comprehension.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Pushing the Limits of Deep CNNs for Pedestrian Detection.
IEEE Trans. Circuits Syst. Video Technol., 2018

Image Captioning and Visual Question Answering Based on Attributes and External Knowledge.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

FVQA: Fact-Based Visual Question Answering.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Reading car license plates using deep neural networks.
Image Vis. Comput., 2018

Neighbourhood Watch: Referring Expression Comprehension via Language-guided Graph Attention Networks.
CoRR, 2018

RGB-D Based Action Recognition with Light-weight 3D Convolutional Networks.
CoRR, 2018

Are You Talking to Me? Reasoned Visual Dialog Generation Through Adversarial Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Large-Scale Binary Quadratic Optimization Using Semidefinite Relaxation and Applications.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Visual question answering: A survey of methods and datasets.
Comput. Vis. Image Underst., 2017

Towards End-to-End Car License Plates Detection and Recognition with Deep Neural Networks.
CoRR, 2017

Explicit Knowledge-based Reasoning for Visual Question Answering.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Towards End-to-End Text Spotting with Convolutional Recurrent Neural Networks.
Proceedings of the IEEE International Conference on Computer Vision, 2017

The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Efficient Semidefinite Branch-and-Cut for MAP-MRF Inference.
Int. J. Comput. Vis., 2016

Image Captioning and Visual Question Answering Based on Attributes and Their Related External Knowledge.
CoRR, 2016

Ask Me Anything: Free-Form Visual Question Answering Based on Knowledge from External Sources.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Efficient SDP inference for fully-connected CRFs based on low-rank decomposition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
Large-scale Binary Quadratic Optimization Using Semidefinite Relaxation and Applications.
CoRR, 2014

2013
Training Effective Node Classifiers for Cascade Classification.
Int. J. Comput. Vis., 2013

A Fast Semidefinite Approach to Solving Binary Quadratic Problems.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012
Fast and Robust Object Detection Using Asymmetric Totally Corrective Boosting.
IEEE Trans. Neural Networks Learn. Syst., 2012

UBoost: Boosting with the Universum.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

2010
Optimally Training a Cascade Classifier
CoRR, 2010

Training a multi-exit cascade with linear asymmetric classification for efficient object detection.
Proceedings of the International Conference on Image Processing, 2010

LACBoost and FisherBoost: Optimally Building Cascade Classifiers.
Proceedings of the Computer Vision, 2010

Robust Face Recognition via Accurate Face Alignment and Sparse Representation.
Proceedings of the International Conference on Digital Image Computing: Techniques and Applications, 2010

Asymmetric Totally-Corrective Boosting for Real-Time Object Detection.
Proceedings of the Computer Vision - ACCV 2010, 2010

2009
A Variant of the Trace Quotient Formulation for Dimensionality Reduction.
Proceedings of the Computer Vision, 2009


  Loading...