Yaowei Wang

Orcid: 0000-0003-2197-9038

Affiliations:
  • Peng Cheng Laboratory, Shenzhen, China


According to our database1, Yaowei Wang authored at least 168 papers between 2003 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Universal Object Detection with Large Vision Model.
Int. J. Comput. Vis., April, 2024

VisEvent: Reliable Object Tracking via Collaboration of Frame and Event Flows.
IEEE Trans. Cybern., March, 2024

Exploring and Exploiting High-Order Spatial-Temporal Dynamics for Long-Term Frame Prediction.
IEEE Trans. Circuits Syst. Video Technol., March, 2024

Towards Bridged Vision and Language: Learning Cross-Modal Knowledge Representation for Relation Extraction.
IEEE Trans. Circuits Syst. Video Technol., January, 2024

Prompt-Based Learning for Unpaired Image Captioning.
IEEE Trans. Multim., 2024

CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding.
IEEE Trans. Multim., 2024

Recovering Generalization via Pre-Training-Like Knowledge Distillation for Out-of-Distribution Visual Question Answering.
IEEE Trans. Multim., 2024

SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification.
IEEE Trans. Multim., 2024

CRADA: Cross Domain Object Detection With Cyclic Reconstruction and Decoupling Adaptation.
IEEE Trans. Multim., 2024

Fine-Grained Accident Detection: Database and Algorithm.
IEEE Trans. Image Process., 2024

Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization.
CoRR, 2024

Towards Robust and Efficient Cloud-Edge Elastic Model Adaptation via Selective Entropy Distillation.
CoRR, 2024

VMamba: Visual State Space Model.
CoRR, 2024

HARDVS: Revisiting Human Activity Recognition with Dynamic Vision Sensors.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Generative Data Free Model Quantization With Knowledge Matching for Classification.
IEEE Trans. Circuits Syst. Video Technol., December, 2023

Self-Supervised Attentive Generative Adversarial Networks for Video Anomaly Detection.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

Multi-proxy feature learning for robust fine-grained visual recognition.
Pattern Recognit., November, 2023

Entity-Graph Enhanced Cross-Modal Pretraining for Instance-Level Product Retrieval.
IEEE Trans. Pattern Anal. Mach. Intell., November, 2023

Egocentric Early Action Prediction via Multimodal Transformer-Based Dual Action Prediction.
IEEE Trans. Circuits Syst. Video Technol., September, 2023

DCR-ReID: Deep Component Reconstruction for Cloth-Changing Person Re-Identification.
IEEE Trans. Circuits Syst. Video Technol., August, 2023

Conformer: Local Features Coupling Global Representations for Recognition and Detection.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey.
Mach. Intell. Res., August, 2023

DRAKE: Deep Pair-Wise Relation Alignment for Knowledge-Enhanced Multimodal Scene Graph Generation in Social Media Posts.
IEEE Trans. Circuits Syst. Video Technol., July, 2023

WDMNet: Modeling diverse variations of regional wind speed for multi-step predictions.
Neural Networks, May, 2023

Robust and Hierarchical Spatial Relation Analysis for Traffic Forecasting.
IEEE Trans. Intell. Transp. Syst., January, 2023

Unpaired Image Captioning by Image-Level Weakly-Supervised Visual Concept Recognition.
IEEE Trans. Multim., 2023

MFGNet: Dynamic Modality-Aware Filter Generation for RGB-T Tracking.
IEEE Trans. Multim., 2023

DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition.
IEEE Trans. Multim., 2023

PolarPose: Single-Stage Multi-Person Pose Estimation in Polar Coordinates.
IEEE Trans. Image Process., 2023

TransWeaver: Weave Image Pairs for Class Agnostic Common Object Detection.
IEEE Trans. Image Process., 2023

Spatial-Temporal Graph Network for Video Crowd Counting.
IEEE Trans. Circuits Syst. Video Technol., 2023

Classification of single-view object point clouds.
Pattern Recognit., 2023

Regressor-Segmenter Mutual Prompt Learning for Crowd Counting.
CoRR, 2023

Recognizing Conditional Causal Relationships about Emotions and Their Corresponding Conditions.
CoRR, 2023

Uncovering Hidden Connections: Iterative Tracking and Reasoning for Video-grounded Dialog.
CoRR, 2023

MixBCT: Towards Self-Adapting Backward-Compatible Training.
CoRR, 2023

ShuffleMix: Improving Representations via Channel-Wise Shuffle of Interpolated Hidden States.
CoRR, 2023

Improving Deep Representation Learning via Auxiliary Learnable Target Coding.
CoRR, 2023

CLIP-VG: Self-paced Curriculum Adapting of CLIP via Exploiting Pseudo-Language Labels for Visual Grounding.
CoRR, 2023

Towards Efficient Task-Driven Model Reprogramming with Foundation Models.
CoRR, 2023

Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey.
CoRR, 2023

DilateFormer: Multi-Scale Dilated Transformer for Visual Recognition.
CoRR, 2023

Learning Mask-aware CLIP Representations for Zero-Shot Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Benign Shortcut for Debiasing: Fair Visual Recognition via Intervention with Shortcut Features.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Client-Adaptive Cross-Model Reconstruction Network for Modality-Incomplete Multimodal Federated Learning.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

HumVis: Human-Centric Visual Analysis System.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Manifold-Aware Self-Training for Unsupervised Domain Adaptation on Regressing 6D Object Pose.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Gradual Study Advising with Course Knowledge Graphs.
Proceedings of the Advances in Web-Based Learning - ICWL 2023, 2023

Spikformer: When Spiking Neural Network Meets Transformer.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Integrally Pre-Trained Transformer Pyramid Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Adaptive Graph Neural Diffusion for Traffic Demand Forecasting.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Digging out Discrimination Information from Generated Samples for Robust Visual Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Optimized separable convolution: Yet another efficient convolution operator.
AI Open, January, 2022

Tracking by Joint Local and Global Search: A Target-Aware Attention-Based Approach.
IEEE Trans. Neural Networks Learn. Syst., 2022

Attribute-Aware Feature Encoding for Object Recognition and Segmentation.
IEEE Trans. Multim., 2022

Bidirectional Posture-Appearance Interaction Network for Driver Behavior Recognition.
IEEE Trans. Intell. Transp. Syst., 2022

Abnormal Event Detection Using Deep Contrastive Learning for Intelligent Video Surveillance System.
IEEE Trans. Ind. Informatics, 2022

Adaptive Spatial Pyramid Constraint for Hyperspectral Image Classification With Limited Training Samples.
IEEE Trans. Geosci. Remote. Sens., 2022

Self-Supervision-Augmented Deep Autoencoder for Unsupervised Visual Anomaly Detection.
IEEE Trans. Cybern., 2022

Multi-attribute object detection benchmark for smart city.
Multim. Syst., 2022

Million-scale Object Detection with Large Vision Model.
CoRR, 2022

Integrally Pre-Trained Transformer Pyramid Networks.
CoRR, 2022

Revisiting Color-Event based Tracking: A Unified Network, Dataset, and Metric.
CoRR, 2022

HARDVS: Revisiting Human Activity Recognition with Dynamic Vision Sensors.
CoRR, 2022

Prompt-based Learning for Unpaired Image Captioning.
CoRR, 2022

Boost Test-Time Performance with Closed-Loop Inference.
CoRR, 2022

Peng Cheng Object Detection Benchmark for Smart City.
CoRR, 2022

Conceptor Learning for Class Activation Mapping.
CoRR, 2022

Identifying the kind behind SMILES - anatomical therapeutic chemical classification using structure-only representations.
Briefings Bioinform., 2022

Learning to Share in Networked Multi-Agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Span-based Audio-Visual Localization.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Hierarchical Graph Embedded Pose Regularity Learning via Spatio-Temporal Transformer for Abnormal Behavior Detection.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Intelligent Instructional Design via Interactive Knowledge Graph Editing.
Proceedings of the Learning Technologies and Systems, 2022

Downscaling and Overflow-aware Model Compression for Efficient Vision Processors.
Proceedings of the 42nd IEEE International Conference on Distributed Computing Systems, 2022

KCUBE: A Knowledge Graph University Curriculum Framework for Student Advising and Career Planning.
Proceedings of the Blended Learning: Engaging Students in the New Normal Era, 2022

DAS: Densely-Anchored Sampling for Deep Metric Learning.
Proceedings of the Computer Vision - ECCV 2022, 2022

Fine-Grained Object Classification via Self-Supervised Pose Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Towards End-to-End Image Compression and Analysis with Transformers.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Progressive Feature Enhancement for Person Re-Identification.
IEEE Trans. Image Process., 2021

Dynamic Attention Guided Multi-Trajectory Analysis for Single Object Tracking.
IEEE Trans. Circuits Syst. Video Technol., 2021

Digital Retina: A Way to Make the City Brain More Efficient by Visual Coding.
IEEE Trans. Circuits Syst. Video Technol., 2021

Diverse part attentive network for video-based person re-identification.
Pattern Recognit. Lett., 2021

Towards effective deep transfer via attentive feature alignment.
Neural Networks, 2021

Learning to Share in Multi-Agent Reinforcement Learning.
CoRR, 2021

VisEvent: Reliable Object Tracking via Collaboration of Frame and Event Flows.
CoRR, 2021

PanGu-α: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel Computation.
CoRR, 2021

AAformer: Auto-Aligned Transformer for Person Re-Identification.
CoRR, 2021

Anomaly Detection with Prototype-Guided Discriminative Latent Embeddings.
Proceedings of the IEEE International Conference on Data Mining, 2021

Conformer: Local Features Coupling Global Representations for Visual Recognition.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Reducing Image Compression Artifacts for Deep Neural Networks.
Proceedings of the 31st Data Compression Conference, 2021

Towards More Flexible and Accurate Object Tracking With Natural Language: Algorithms and Benchmark.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Contrastive Neural Architecture Search With Neural Architecture Comparators.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Hierarchically and Cooperatively Learning Traffic Signal Control.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Adaptation-Oriented Feature Projection for One-Shot Action Recognition.
IEEE Trans. Multim., 2020

Compositional Few-Shot Recognition with Primitive Discovery and Enhancing.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Anonymous Model Pruning for Compressing Deep Neural Networks.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Learning Compact Networks via Similarity-Aware Channel Pruning.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Prune it Yourself: Automated Pruning by Multiple Level Sensitivity.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

End-Edge-Cloud Collaborative System: A Video Big Data Processing and Analysis Architecture.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

R-SiamNet: ROI-Align Pooling Baesd Siamese Network for Object Tracking.
Proceedings of the 3rd IEEE Conference on Multimedia Information Processing and Retrieval, 2020

Large Batch Optimization for Object Detection: Training COCO in 12 minutes.
Proceedings of the Computer Vision - ECCV 2020, 2020

An Asymmetric Modeling for Action Assessment.
Proceedings of the Computer Vision - ECCV 2020, 2020

Modular Graph Attention Network for Complex Visual Relational Reasoning.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Towards Accurate Low Bit-Width Quantization with Multiple Phase Adaptations.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Learning from Multi-annotator Data: A Noise-aware Classification Framework.
ACM Trans. Inf. Syst., 2019

Can Categories and Attributes Be Learned in a Multi-Task Way?
IEEE Trans. Multim., 2019

P-ODN: Prototype based Open Deep Network for Open Set Recognition.
CoRR, 2019

EAN: Event Attention Network for Stock Price Trend Prediction based on Sentimental Embedding.
Proceedings of the 11th ACM Conference on Web Science, 2019

Bi-directional Re-ranking for Person Re-identification.
Proceedings of the 2nd IEEE Conference on Multimedia Information Processing and Retrieval, 2019

Transductive Episodic-Wise Adaptive Metric for Few-Shot Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Efficient and Fast Coefficient Sign Inference for Video Coding.
Proceedings of the Data Compression Conference, 2019

2018
Joint Semantic and Latent Attribute Modelling for Cross-Class Transfer Learning.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

Cross-Domain Adversarial Feature Learning for Sketch Re-identification.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Fast Compressed Domain Copy Detection with Motion Vector Imaging.
Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval, 2018

Multi-Pose Learning based Head-Shoulder Re-identification.
Proceedings of the IEEE 1st Conference on Multimedia Information Processing and Retrieval, 2018

Hierarchical Temporal Memory Enhanced One-Shot Distance Learning for Action Recognition.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Attribute Driven Zero-Shot Classification and Segmentation.
Proceedings of the 2018 IEEE International Conference on Multimedia & Expo Workshops, 2018

SFCM: Learn a Pooling Kernel for Weakly Supervised Object Localization.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

ODN: Opening the Deep Network for Open-Set Action Recognition.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Temporal Attentive Network for Action Recognition.
Proceedings of the 2018 IEEE International Conference on Multimedia and Expo, 2018

Toward Efficient Simultaneous Detection and Segmentation.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

Deep Transfer Learning for Person Re-Identification.
Proceedings of the Fourth IEEE International Conference on Multimedia Big Data, 2018

2017
Sequential Deep Trajectory Descriptor for Action Recognition With Three-Stream CNN.
IEEE Trans. Multim., 2017

Rate-Performance-Loss Optimization for Inter-Frame Deep Feature Coding From Videos.
IEEE Trans. Image Process., 2017

A fast skip and direction adaptive search algorithm for Sub-Pixel Motion Estimation on HEVC.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Deep hashing with mixed supervised losses for image search.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Deep hashing with multi-task learning for large-scale instance-level vehicle search.
Proceedings of the 2017 IEEE International Conference on Multimedia & Expo Workshops, 2017

Exploiting Multi-grain Ranking Constraints for Precisely Searching Visually-similar Vehicles.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Learning Long-Term Dependencies for Action Recognition with a Biologically-Inspired Deep Network.
Proceedings of the IEEE International Conference on Computer Vision, 2017

A Network Framework for Noisy Label Aggregation in Social Media.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Fixed-point Gaussian Mixture Model for analysis-friendly surveillance video coding.
Comput. Vis. Image Underst., 2016

shuttleNet: A biologically-inspired RNN with loop connection and parameter sharing.
CoRR, 2016

Joint Network based Attention for Action Recognition.
CoRR, 2016

Deep Transfer Learning for Person Re-identification.
CoRR, 2016

CNN vs. SIFT for Image Retrieval: Alternative or Complementary?
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Joint Learning of Semantic and Latent Attributes.
Proceedings of the Computer Vision - ECCV 2016, 2016

Unsupervised Cross-Dataset Transfer Learning for Person Re-identification.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Deep Relative Distance Learning: Tell the Difference between Similar Vehicles.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

CNUSVM: Hybrid CNN-Uneven SVM Model for Imbalanced Visual Learning.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

High-Efficiency Coding for Shaking Surveillance Videos Based on Global Motion Compensation.
Proceedings of the IEEE Second International Conference on Multimedia Big Data, 2016

2015
Robust multiple cameras pedestrian detection with multi-view Bayesian network.
Pattern Recognit., 2015

Quality-progressive coding for high bit-rate background frames on surveillance videos.
Proceedings of the 2015 IEEE International Symposium on Circuits and Systems, 2015

Learning Deep Trajectory Descriptor for action recognition in videos using deep neural networks.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015

CNN Based Vehicle Counting with Virtual Coil in Traffic Surveillance Video.
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

Detecting Rare Actions and Events from Surveillance Big Data with Bag of Dynamic Trajectories.
Proceedings of the 2015 IEEE International Conference on Multimedia Big Data, BigMM 2015, 2015

Swiss-System Based Cascade Ranking for Gait-Based Person Re-Identification.
Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015

2014
A refined object detection method based on HTM.
Proceedings of the 2014 IEEE Visual Communications and Image Processing Conference, 2014

Multi-view gait recognition with incomplete training data.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2014

2013
Selective Eigenbackground for Background Modeling and Subtraction in Crowded Scenes.
IEEE Trans. Circuits Syst. Video Technol., 2013

A coding unit classification based AVC-to-HEVC transcoding with background modeling for surveillance videos.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Wavelet based smoke detection method with RGB Contrast-image and shape constrain.
Proceedings of the 2013 Visual Communications and Image Processing, 2013

Pair-wise event detection using cubic features and sequence discriminant learning.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

A system based on sequence learning for event detection in surveillance video.
Proceedings of the IEEE International Conference on Image Processing, 2013

2012
PKU-NEC @TRECVID2012 SED : Uneven-Sequence Based Event Detection in Surveillance Video.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Multi-camera Pedestrian Detection with Multi-view Bayesian Network Model.
Proceedings of the British Machine Vision Conference, 2012

Single and Multiple View Detection, Tracking and Video Analysis in Crowded Environments.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

Automatic Webcam-Based Human Heart Rate Measurements Using Laplacian Eigenmap.
Proceedings of the Computer Vision, 2012

2011
PKU-NEC @TRECVID2011 SED: Sequence-Based Event Detection in Surveillance Video.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

Selective eigenbackgrounds method for background subtraction in crowed scenes.
Proceedings of the 18th IEEE International Conference on Image Processing, 2011

2010
PKU@TRECVID2010: Pair-Wise Event Detection in Surveillance Video.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Dynamic multi-cue tracking with detection responses association.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

ESUR: A system for Events detection in SURveillance video.
Proceedings of the International Conference on Image Processing, 2010

2009
PKU@TRECVID2009: Single-Actor and Pair-Activity Event Detection in surveillance Video.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

2007
A Robust Caption Detecting Algorithm on MPEG Compressed Video.
Proceedings of the Multimedia Content Analysis and Mining, International Workshop, 2007

2003
A regularized simultaneous autoregressive model for texture classification.
Proceedings of the 2003 International Symposium on Circuits and Systems, 2003

A new algorithm for remotely sensed image texture classification and segmentation.
Proceedings of the 2003 IEEE International Geoscience and Remote Sensing Symposium, 2003


  Loading...