Peng Wang

Yanning Zhang

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Visual Question Answering - From Theory to Application

Advances in Computer Vision and Pattern Recognition, Springer, ISBN: 978-981-19-0963-4, 2022

Adaptive Graph Convolutional Networks for Weakly Supervised Anomaly Detection in Videos.

IEEE Signal Process. Lett., 2022

Center Prediction Loss for Re-identification.

Pattern Recognit., 2022

Towards End-to-End Text Spotting in Natural Scenes.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

Structured Multimodal Attentions for TextVQA.

IEEE Trans. Pattern Anal. Mach. Intell., 2022

A time-incorporated SOFA score-based machine learning model for predicting mortality in critically ill patients: A multicenter, real-world study.

Int. J. Medical Informatics, 2022

Generalizable Person Re-Identification via Viewpoint Alignment and Fusion.

CoRR, 2022

Attract me to Buy: Advertisement Copywriting Generation with Multimodal Multi-structured Information.

CoRR, 2022

HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation.

CoRR, 2022

Multi-Domain Joint Training for Person Re-Identification.

CoRR, 2022

Cross-modal Co-occurrence Attributes Alignments for Person Search by Language.

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Pluggable Weakly-Supervised Cross-View Learning for Accurate Vehicle Re-Identification.

Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Temporal-Consistent Visual Clue Attentive Network for Video-Based Person Re-Identification.

Bingliang Jiao

Liying Gao

Seyed Mojtaba Marvasti-Zadeh

Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Improving Image Captioning via Enhancing Dual-Side Context Awareness.

Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Dual-Level Decoupled Transformer for Video Captioning.

Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

CapOnImage: Context-driven Dense-Captioning on Image.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

A Simple and Robust Correlation Filtering Method for Text-Based Person Search.

Proceedings of the Computer Vision - ECCV 2022, 2022

Dynamically Transformed Instance Normalization Network for Generalizable Person Re-Identification.

Proceedings of the Computer Vision - ECCV 2022, 2022

HOP: History-and-Order Aware Pretraining for Vision-and-Language Navigation.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Person Re-Identification in Aerial Imagery.

IEEE Trans. Multim., 2021

A Robust Attentional Framework for License Plate Recognition in the Wild.

IEEE Trans. Intell. Transp. Syst., 2021

Attend to the Difference: Cross-Modality Person Re-Identification via Contrastive Correlation.

IEEE Trans. Image Process., 2021

Where to Look and How to Describe: Fashion Image Retrieval With an Attentional Heterogeneous Bilinear Network.

IEEE Trans. Circuits Syst. Video Technol., 2021

An adversarial human pose estimation network injected with graph structure.

Pattern Recognit., 2021

A Performance Evaluation of Correspondence Grouping Methods for 3D Rigid Data Matching.

IEEE Trans. Pattern Anal. Mach. Intell., 2021

NAS-FCOS: Efficient Search for Object Detection Architectures.

Int. J. Comput. Vis., 2021

Few-shot action recognition with implicit temporal alignment and pair similarity optimization.

Comput. Vis. Image Underst., 2021

CAT: Cross-Attention Transformer for One-Shot Object Detection.

CoRR, 2021

Center Prediction Loss for Re-identification.

CoRR, 2021

Instance and Pair-Aware Dynamic Networks for Re-Identification.

CoRR, 2021

Text-Guided Visual Feature Refinement for Text-Based Person Search.

Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention.

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Chop Chop BERT: Visual Question Answering by Chopping VisualBERT's Heads.

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Evaluating Local Geometric Feature Representations for 3D Rigid Data Matching.

IEEE Trans. Image Process., 2020

A holistic representation guided attention network for scene text recognition.

Neurocomputing, 2020

MobileCount: An efficient encoder-decoder framework for real-time crowd counting.

Neurocomputing, 2020

Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge.

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

VisDrone-SOT2020: The Vision Meets Drone Single Object Tracking Challenge Results.

Hossein Ghanei-Yakhdan

Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

NAS-FCOS: Fast Neural Architecture Search for Object Detection.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Say As You Wish: Fine-Grained Control of Image Caption Generation With Abstract Scene Graphs.

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Discriminative and Robust Online Learning for Siamese Visual Tracking.

Jinghao Zhou

Haoyang Sun

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Toward End-to-End Car License Plate Detection and Recognition With Deep Neural Networks.

IEEE Trans. Intell. Transp. Syst., 2019

Hyperspectral Classification Based on Lightweight 3-D-CNN With Transfer Learning.

IEEE Trans. Geosci. Remote. Sens., 2019

Attend to the Difference: Cross-Modality Person Re-identification via Contrastive Correlation.

CoRR, 2019

Person Re-identification in Aerial Imagery.

CoRR, 2019

NAS-FCOS: Fast Neural Architecture Search for Object Detection.

CoRR, 2019

A Simple and Robust Convolutional-Attention Network for Irregular Text Recognition.

CoRR, 2019

A Simple and Robust Attentional Encoder-Decoder Model for License Plate Recognition.

Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019

Person Re-identification with Neural Architecture Search.

Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019

MobileCount: An Efficient Encoder-Decoder Framework for Real-Time Crowd Counting.

Chenyu Gao

Ye Gao

Proceedings of the Pattern Recognition and Computer Vision - Second Chinese Conference, 2019

VisDrone-SOT2019: The Vision Meets Drone Single Object Tracking Challenge Results.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Vehicle Re-Identification in Aerial Imagery: Dataset and Approach.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Visual Question Answering as Reading Comprehension.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition.

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Pushing the Limits of Deep CNNs for Pedestrian Detection.

IEEE Trans. Circuits Syst. Video Technol., 2018

Image Captioning and Visual Question Answering Based on Attributes and External Knowledge.

IEEE Trans. Pattern Anal. Mach. Intell., 2018

FVQA: Fact-Based Visual Question Answering.

IEEE Trans. Pattern Anal. Mach. Intell., 2018

Reading car license plates using deep neural networks.

Image Vis. Comput., 2018

Neighbourhood Watch: Referring Expression Comprehension via Language-guided Graph Attention Networks.

CoRR, 2018

RGB-D Based Action Recognition with Light-weight 3D Convolutional Networks.

CoRR, 2018

Are You Talking to Me? Reasoned Visual Dialog Generation Through Adversarial Learning.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Large-Scale Binary Quadratic Optimization Using Semidefinite Relaxation and Applications.

IEEE Trans. Pattern Anal. Mach. Intell., 2017

Visual question answering: A survey of methods and datasets.

Comput. Vis. Image Underst., 2017

Towards End-to-End Car License Plates Detection and Recognition with Deep Neural Networks.

CoRR, 2017

Explicit Knowledge-based Reasoning for Visual Question Answering.

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Towards End-to-End Text Spotting with Convolutional Recurrent Neural Networks.

Proceedings of the IEEE International Conference on Computer Vision, 2017

The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Efficient Semidefinite Branch-and-Cut for MAP-MRF Inference.

Int. J. Comput. Vis., 2016

Image Captioning and Visual Question Answering Based on Attributes and Their Related External Knowledge.

CoRR, 2016

Ask Me Anything: Free-Form Visual Question Answering Based on Knowledge from External Sources.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

A multi-agent genetic algorithm for local community detection by extending the tightest nodes.

Jing Liu

Proceedings of the IEEE Congress on Evolutionary Computation, 2016

2015

Efficient SDP inference for fully-connected CRFs based on low-rank decomposition.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014

Large-scale Binary Quadratic Optimization Using Semidefinite Relaxation and Applications.

CoRR, 2014

2013

Training Effective Node Classifiers for Cascade Classification.

Sakrapee Paisitkriangkrai

Int. J. Comput. Vis., 2013

A Fast Semidefinite Approach to Solving Binary Quadratic Problems.

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2012

Fast and Robust Object Detection Using Asymmetric Totally Corrective Boosting.

IEEE Trans. Neural Networks Learn. Syst., 2012

UBoost: Boosting with the Universum.

IEEE Trans. Pattern Anal. Mach. Intell., 2012

2010

Optimally Training a Cascade Classifier.

CoRR, 2010

Training a multi-exit cascade with linear asymmetric classification for efficient object detection.

Proceedings of the International Conference on Image Processing, 2010

LACBoost and FisherBoost: Optimally Building Cascade Classifiers.

Hanxi Li

Proceedings of the Computer Vision - ECCV 2010, 2010

Robust Face Recognition via Accurate Face Alignment and Sparse Representation.

Hanxi Li