Wei Zhang

Orcid: 0000-0002-1492-8286

Affiliations:

JD AI Research, Beijing, China
Chinese Academy of Sciences, Institute of Information Engineering, State Key Laboratory of Information Security, Beijing, China
City University of Hong Kong (PhD 2015)
Tianjin University, School of Computer Science & Technology, China (former)

According to our database¹, Wei Zhang authored at least 75 papers between 2005 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

Robust facial marker tracking based on a synthetic analysis of optical flows and the YOLO network.

[BibT_eX]

[DOI]

Vis. Comput., April, 2024

2023

Deep Person Generation: A Survey from the Perspective of Face, Pose, and Cloth Synthesis.

[BibT_eX]

[DOI]

ACM Comput. Surv., December, 2023

Augmentation Pathways Network for Visual Recognition.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Boosting Generic Visual-Linguistic Representation With Dynamic Contexts.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Interactive Conversational Head Generation.

[BibT_eX]

[DOI]

CoRR, 2023

Deep Equilibrium Multimodal Fusion.

[BibT_eX]

[DOI]

CoRR, 2023

Visual-Aware Text-to-Speech.

[BibT_eX]

[DOI]

CoRR, 2023

Learning and Evaluating Human Preferences for Conversational Head Generation.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Visual-Aware Text-to-Speech<sup>*</sup>.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Video2StyleGAN: Encoding Video in Latent Space for Manipulation.

[BibT_eX]

[DOI]

CoRR, 2022

Visualizing and Understanding Patch Interactions in Vision Transformer.

[BibT_eX]

[DOI]

CoRR, 2022

Freeform Body Motion Generation from Speech.

[BibT_eX]

[DOI]

CoRR, 2022

Responsive Listening Head Generation: A Benchmark Dataset and Baseline.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Directional Self-supervised Learning for Heavy Image Augmentations.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Unpaired Person Image Generation With Semantic Parsing Transformation.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., 2021

The Classification and Segmentation of Fetal Anatomies Ultrasound Image: A Survey.

[BibT_eX]

[DOI]

J. Medical Imaging Health Informatics, 2021

Responsive Listening Head Generation: A Benchmark Dataset and Baseline.

[BibT_eX]

[DOI]

CoRR, 2021

Directional Self-supervised Learning for Risky Image Augmentations.

[BibT_eX]

[DOI]

CoRR, 2021

Augmentation Pathways Network for Visual Recognition.

[BibT_eX]

[DOI]

CoRR, 2021

Flat and Shallow: Understanding Fake Image Detection Models by Architecture Profiling.

[BibT_eX]

[DOI]

Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

ViDA-MAN: Visual Dialog with Digital Humans.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Trustworthy AI'21: 1st International Workshop on Trustworthy AI for Multimedia Computing.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

ARShoe: Real-Time Augmented Reality Shoe Try-on System on Smartphones.

[BibT_eX]

[DOI]

Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Exploiting Relationship for Complex-scene Image Generation.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Products-10K: A Large-scale Product Recognition Dataset.

[BibT_eX]

[DOI]

CoRR, 2020

AI-SAS: Automated In-match Soccer Analysis System.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Down to the Last Detail: Virtual Try-on with Fine-grained Details.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

SketchMan: Learning to Create Professional Sketches.

[BibT_eX]

[DOI]

Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Classes Matter: A Fine-Grained Adversarial Approach to Cross-Domain Semantic Segmentation.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Look-Into-Object: Self-Supervised Structure Modeling for Object Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Deep Learning-Based Multimedia Analytics: A Review.

[BibT_eX]

[DOI]

Wei Zhang

Ting Yao

Shiai Zhu

Abdulmotaleb El-Saddik

ACM Trans. Multim. Comput. Commun. Appl., 2019

Editorial to Special Issue on Deep Learning for Intelligent Multimedia Analytics.

[BibT_eX]

[DOI]

Wei Zhang

Ting Yao

Shiai Zhu

Abdulmotaleb El-Saddik

ACM Trans. Multim. Comput. Commun. Appl., 2019

Pyrboxes: An efficient multi-scale scene text detector with feature pyramids.

[BibT_eX]

[DOI]

Pattern Recognit. Lett., 2019

Name-face association with web facial image supervision.

[BibT_eX]

[DOI]

Multim. Syst., 2019

An efficient privacy protection scheme for data security in video surveillance.

[BibT_eX]

[DOI]

J. Vis. Commun. Image Represent., 2019

Vision and Language: from Visual Perception to Content Creation.

[BibT_eX]

[DOI]

Tao Mei

Wei Zhang

Ting Yao

CoRR, 2019

Zooming into Face Forensics: A Pixel-level Analysis.

[BibT_eX]

[DOI]

CoRR, 2019

Regularizing Proxies with Multi-Adversarial Training for Unsupervised Domain-Adaptive Semantic Segmentation.

[BibT_eX]

[DOI]

CoRR, 2019

Hard-Aware Fashion Attribute Classification.

[BibT_eX]

[DOI]

CoRR, 2019

Predictive Ensemble Learning with Application to Scene Text Detection.

[BibT_eX]

[DOI]

CoRR, 2019

Rethinking Visual Relationships for High-level Image Understanding.

[BibT_eX]

[DOI]

CoRR, 2019

Everyone is a Cartoonist: Selfie Cartoonization with Attentive Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Sampling Wisely: Deep Image Embedding by Top-K Precision Optimization.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

VrR-VG: Refocusing Visually-Relevant Relationships.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Unsupervised Person Image Generation With Semantic Parsing Transformation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Destruction and Construction Learning for Fine-Grained Image Recognition.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Fake Colorized Image Detection.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Forensics Secur., 2018

Consistent and Specific Multi-View Subspace Clustering.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017

Retrieving Objects by Partitioning.

[BibT_eX]

[DOI]

IEEE Trans. Big Data, 2017

Binarized Mode Seeking for Scalable Visual Pattern Discovery.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016

Hyperlink-Aware Object Retrieval.

[BibT_eX]

[DOI]

Wei Zhang

Chong-Wah Ngo

Xiaochun Cao

IEEE Trans. Image Process., 2016

Semi-fragile watermarking for image authentication based on compressive sensing.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2016

Context-Oriented Name-Face Association in Web Videos.

[BibT_eX]

[DOI]

Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

MatchDR: Image Correspondence by Leveraging Distance Ratio Constraint.

[BibT_eX]

[DOI]

Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Research on Micro-blog New Word Recognition Based on MapReduce.

[BibT_eX]

[DOI]

Proceedings of the Bio-inspired Computing - Theories and Applications, 2016

2015

Topological Spatial Verification for Instance Search.

[BibT_eX]

[DOI]

Wei Zhang

Chong-Wah Ngo

IEEE Trans. Multim., 2015

Image composite authentication using a single shadow observation.

[BibT_eX]

[DOI]

Sci. China Inf. Sci., 2015

2014

Visual Typo Correction by Collocative Optimization: A Case Study on Merchandize Images.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2014

Name-Face Association in Web Videos: A Large-Scale Dataset, Baselines, and Open Issues.

[BibT_eX]

[DOI]

J. Comput. Sci. Technol., 2014

VIREO @ TRECVID 2014: Instance Search and Semantic Indexing.

[BibT_eX]

[DOI]

Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Scalable Visual Instance Mining with Threads of Features.

[BibT_eX]

[DOI]

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

2013

VIREO/ECNU @ TRECVID 2013: A Video Dance of Detection, Recounting and Search with Motion Relativity and Concept Learning from Wild.

[BibT_eX]

[DOI]

Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

Searching visual instances with topology checking and context modeling.

[BibT_eX]

[DOI]

Wei Zhang

Chong-Wah Ngo

Proceedings of the International Conference on Multimedia Retrieval, 2013

2012

Detecting image forgeries using metrology.

[BibT_eX]

[DOI]

Mach. Vis. Appl., 2012

VIREO @ TRECVID 2012: Searching with Topology, Recounting will Small Concepts, Learning with Free Examples.

[BibT_eX]

[DOI]

Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

FashionAsk: pushing community answers to your fingertips.

[BibT_eX]

[DOI]

Wei Zhang

Lei Pang

Chong-Wah Ngo

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Snap-and-ask: answering multimodal question by naming visual instance.

[BibT_eX]

[DOI]

Wei Zhang

Lei Pang

Chong-Wah Ngo

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Video hyperlinking: libraries and tools for threading and visualizing large video collection.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Community as a connector: associating faces with celebrity names in web videos.

[BibT_eX]

[DOI]

Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

2011

VIREO @ TRECVID 2011: Instance Search, Semantic Indexing, Multimedia Event Detection and Known-Item Search.

[BibT_eX]

[DOI]

Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

2010

Detecting and extracting the photo composites using planar homography and graph cut.

[BibT_eX]

[DOI]

IEEE Trans. Inf. Forensics Secur., 2010

2009

Detecting photographic composites using shadows.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Detecting photographic composites using two-view geometrical constraints.

[BibT_eX]

[DOI]

Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

2005

A Hybrid GMM and Codebook Mapping Method for Spectral Conversion.

[BibT_eX]

[DOI]

Proceedings of the Affective Computing and Intelligent Interaction, 2005

Wei Zhang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...