Wei Zhang

Orcid: 0000-0002-1492-8286

Affiliations:
  • JD AI Research, Beijing, China
  • Chinese Academy of Sciences, Institute of Information Engineering, State Key Laboratory of Information Security, Beijing, China
  • City University of Hong Kong (PhD 2015)
  • Tianjin University, School of Computer Science & Technology, China (former)


According to our database1, Wei Zhang authored at least 75 papers between 2005 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Robust facial marker tracking based on a synthetic analysis of optical flows and the YOLO network.
Vis. Comput., April, 2024

2023
Deep Person Generation: A Survey from the Perspective of Face, Pose, and Cloth Synthesis.
ACM Comput. Surv., December, 2023

Augmentation Pathways Network for Visual Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., August, 2023

Boosting Generic Visual-Linguistic Representation With Dynamic Contexts.
IEEE Trans. Multim., 2023

Interactive Conversational Head Generation.
CoRR, 2023

Deep Equilibrium Multimodal Fusion.
CoRR, 2023

Visual-Aware Text-to-Speech.
CoRR, 2023

Learning and Evaluating Human Preferences for Conversational Head Generation.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Visual-Aware Text-to-Speech<sup>*</sup>.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Video2StyleGAN: Encoding Video in Latent Space for Manipulation.
CoRR, 2022

Visualizing and Understanding Patch Interactions in Vision Transformer.
CoRR, 2022

Freeform Body Motion Generation from Speech.
CoRR, 2022

Responsive Listening Head Generation: A Benchmark Dataset and Baseline.
Proceedings of the Computer Vision - ECCV 2022, 2022

Directional Self-supervised Learning for Heavy Image Augmentations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Unpaired Person Image Generation With Semantic Parsing Transformation.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

The Classification and Segmentation of Fetal Anatomies Ultrasound Image: A Survey.
J. Medical Imaging Health Informatics, 2021

Responsive Listening Head Generation: A Benchmark Dataset and Baseline.
CoRR, 2021

Directional Self-supervised Learning for Risky Image Augmentations.
CoRR, 2021

Augmentation Pathways Network for Visual Recognition.
CoRR, 2021

Flat and Shallow: Understanding Fake Image Detection Models by Architecture Profiling.
Proceedings of the MMAsia '21: ACM Multimedia Asia, Gold Coast, Australia, December 1, 2021

ViDA-MAN: Visual Dialog with Digital Humans.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Trustworthy AI'21: 1st International Workshop on Trustworthy AI for Multimedia Computing.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

ARShoe: Real-Time Augmented Reality Shoe Try-on System on Smartphones.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Exploiting Relationship for Complex-scene Image Generation.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Products-10K: A Large-scale Product Recognition Dataset.
CoRR, 2020

AI-SAS: Automated In-match Soccer Analysis System.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Down to the Last Detail: Virtual Try-on with Fine-grained Details.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

SketchMan: Learning to Create Professional Sketches.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Classes Matter: A Fine-Grained Adversarial Approach to Cross-Domain Semantic Segmentation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Look-Into-Object: Self-Supervised Structure Modeling for Object Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Deep Learning-Based Multimedia Analytics: A Review.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Editorial to Special Issue on Deep Learning for Intelligent Multimedia Analytics.
ACM Trans. Multim. Comput. Commun. Appl., 2019

Pyrboxes: An efficient multi-scale scene text detector with feature pyramids.
Pattern Recognit. Lett., 2019

Name-face association with web facial image supervision.
Multim. Syst., 2019

An efficient privacy protection scheme for data security in video surveillance.
J. Vis. Commun. Image Represent., 2019

Vision and Language: from Visual Perception to Content Creation.
CoRR, 2019

Zooming into Face Forensics: A Pixel-level Analysis.
CoRR, 2019

Regularizing Proxies with Multi-Adversarial Training for Unsupervised Domain-Adaptive Semantic Segmentation.
CoRR, 2019

Hard-Aware Fashion Attribute Classification.
CoRR, 2019

Predictive Ensemble Learning with Application to Scene Text Detection.
CoRR, 2019

Rethinking Visual Relationships for High-level Image Understanding.
CoRR, 2019

Everyone is a Cartoonist: Selfie Cartoonization with Attentive Adversarial Networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

Sampling Wisely: Deep Image Embedding by Top-K Precision Optimization.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

VrR-VG: Refocusing Visually-Relevant Relationships.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Unsupervised Person Image Generation With Semantic Parsing Transformation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Destruction and Construction Learning for Fine-Grained Image Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Fake Colorized Image Detection.
IEEE Trans. Inf. Forensics Secur., 2018

Consistent and Specific Multi-View Subspace Clustering.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Retrieving Objects by Partitioning.
IEEE Trans. Big Data, 2017

Binarized Mode Seeking for Scalable Visual Pattern Discovery.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Hyperlink-Aware Object Retrieval.
IEEE Trans. Image Process., 2016

Semi-fragile watermarking for image authentication based on compressive sensing.
Sci. China Inf. Sci., 2016

Context-Oriented Name-Face Association in Web Videos.
Proceedings of the Advances in Multimedia Information Processing - PCM 2016, 2016

MatchDR: Image Correspondence by Leveraging Distance Ratio Constraint.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016

Research on Micro-blog New Word Recognition Based on MapReduce.
Proceedings of the Bio-inspired Computing - Theories and Applications, 2016

2015
Topological Spatial Verification for Instance Search.
IEEE Trans. Multim., 2015

Image composite authentication using a single shadow observation.
Sci. China Inf. Sci., 2015

2014
Visual Typo Correction by Collocative Optimization: A Case Study on Merchandize Images.
IEEE Trans. Image Process., 2014

Name-Face Association in Web Videos: A Large-Scale Dataset, Baselines, and Open Issues.
J. Comput. Sci. Technol., 2014

VIREO @ TRECVID 2014: Instance Search and Semantic Indexing.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

Scalable Visual Instance Mining with Threads of Features.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

2013
VIREO/ECNU @ TRECVID 2013: A Video Dance of Detection, Recounting and Search with Motion Relativity and Concept Learning from Wild.
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013

Searching visual instances with topology checking and context modeling.
Proceedings of the International Conference on Multimedia Retrieval, 2013

2012
Detecting image forgeries using metrology.
Mach. Vis. Appl., 2012

VIREO @ TRECVID 2012: Searching with Topology, Recounting will Small Concepts, Learning with Free Examples.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

FashionAsk: pushing community answers to your fingertips.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Snap-and-ask: answering multimodal question by naming visual instance.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Video hyperlinking: libraries and tools for threading and visualizing large video collection.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

Community as a connector: associating faces with celebrity names in web videos.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

2011
VIREO @ TRECVID 2011: Instance Search, Semantic Indexing, Multimedia Event Detection and Known-Item Search.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

2010
Detecting and extracting the photo composites using planar homography and graph cut.
IEEE Trans. Inf. Forensics Secur., 2010

2009
Detecting photographic composites using shadows.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

Detecting photographic composites using two-view geometrical constraints.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

2005
A Hybrid GMM and Codebook Mapping Method for Spectral Conversion.
Proceedings of the Affective Computing and Intelligent Interaction, 2005


  Loading...