Yong Zhang

Orcid: 0000-0003-0066-3448

Affiliations:
  • Tencent AI Lab, Shenzhen, China
  • Chinese Academy of Sciences, Institute of Automation, Beijing, China (PhD 2018)


According to our database1, Yong Zhang authored at least 73 papers between 2014 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Effective and Robust Detection of Adversarial Examples via Benford-Fourier Coefficients.
Mach. Intell. Res., October, 2023

W-Net: Structure and Texture Interaction for Image Inpainting.
IEEE Trans. Multim., 2023

Fast Adversarial Training With Adaptive Step Size.
IEEE Trans. Image Process., 2023

VDTR: Video Deblurring With Transformer.
IEEE Trans. Circuits Syst. Video Technol., 2023

Robust Physical-World Attacks on Face Recognition.
Pattern Recognit., 2023

Towards harmonized regional style transfer and manipulation for facial images.
Comput. Vis. Media, 2023

Domain Generalization via Rationale Invariance.
CoRR, 2023

Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation.
CoRR, 2023

NOFA: NeRF-based One-shot Facial Avatar Reconstruction.
CoRR, 2023

Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance.
CoRR, 2023

Inserting Anybody in Diffusion Models via Celeb Basis.
CoRR, 2023

Robust Backdoor Attack with Visible, Semantic, Sample-Specific, and Compatible Triggers.
CoRR, 2023

TaleCrafter: Interactive Story Visualization with Multiple Characters.
CoRR, 2023

UCF: Uncovering Common Features for Generalizable Deepfake Detection.
CoRR, 2023

Improving Fast Adversarial Training with Prior-Guided Knowledge.
CoRR, 2023

FateZero: Fusing Attentions for Zero-shot Text-based Video Editing.
CoRR, 2023

T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations.
CoRR, 2023

Generalizable Black-Box Adversarial Attack with Meta Learning.
CoRR, 2023

Interactive Story Visualization with Multiple Characters.
Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

NOFA: NeRF-based One-shot Facial Avatar Reconstruction.
Proceedings of the ACM SIGGRAPH 2023 Conference Proceedings, 2023

DeepfakeBench: A Comprehensive Benchmark of Deepfake Detection.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

3D GAN Inversion with Facial Symmetry Prior.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DPE: Disentanglement of Pose and Expression for General Video Portrait Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

High-Fidelity Clothed Avatar Reconstruction from a Single Image.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Improved Test-Time Adaptation for Domain Generalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Image Inpainting With Local and Global Refinement.
IEEE Trans. Image Process., 2022

Boosting Fast Adversarial Training With Learnable Adversarial Initialization.
IEEE Trans. Image Process., 2022

Fine-Grained Face Swapping via Regional GAN Inversion.
CoRR, 2022

Latent Video Diffusion Models for High-Fidelity Video Generation with Arbitrary Lengths.
CoRR, 2022

VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild.
Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

Boosting the Transferability of Adversarial Attacks with Reverse Adversarial Perturbation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

OST: Improving Generalization of DeepFake Detection via One-Shot Test-Time Training.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Cross-subject Action Unit Detection with Meta Learning and Transformer-based Relation Modeling.
Proceedings of the International Joint Conference on Neural Networks, 2022

StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN.
Proceedings of the Computer Vision - ECCV 2022, 2022

Prior-Guided Adversarial Initialization for Fast Adversarial Training.
Proceedings of the Computer Vision - ECCV 2022, 2022

Towards Real-World Video Deblurring by Exploring Blur Formation Process.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

High-Fidelity GAN Inversion for Image Attribute Editing.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

FENeRF: Face Editing in Neural Radiance Fields.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

LAS-AT: Adversarial Training with Learnable Attack Strategy.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Aesthetic-guided outward image cropping.
ACM Trans. Graph., 2021

Learning to assess visual aesthetics of food images.
Comput. Vis. Media, 2021

Targeted Attack against Deep Neural Networks via Flipping Limited Weight Bits.
Proceedings of the 9th International Conference on Learning Representations, 2021

Semi-Autoregressive Transformer for Image Captioning.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

DAE-GAN: Dynamic Aspect-aware GAN for Text-to-Image Synthesis.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Meta-Attack: Class-agnostic and Model-agnostic Physical Adversarial Attack.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Generalizing Face Forgery Detection With High-Frequency Features.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Dual ResGCN for Balanced Scene GraphGeneration.
CoRR, 2020

Effective and Robust Detection of Adversarial Examples via Benford-Fourier Coefficients.
CoRR, 2020

Controllable Descendant Face Synthesis.
CoRR, 2020

Context-Aware Cross-Attention for Skeleton-Based Human Action Recognition.
IEEE Access, 2020

Groupwise Ranking Loss for Multi-Label Learning.
IEEE Access, 2020

Sparse Adversarial Attack via Perturbation Factorization.
Proceedings of the Computer Vision - ECCV 2020, 2020

Label Error Correction and Generation through Label Relationships.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning.
CoRR, 2019

Semi-Supervised Deep Neural Network for Joint Intensity Estimation of Multiple Facial Action Units.
IEEE Access, 2019

Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning.
IEEE Access, 2019

Food Photo Enhancer of One Sample Generative Adversarial Network.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Context-Aware Feature and Label Fusion for Facial Action Unit Intensity Estimation With Partially Labeled Data.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Joint Representation and Estimator Learning for Facial Action Unit Intensity Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Compressing Convolutional Neural Networks via Factorized Convolutional Filters.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Exact Adversarial Attack to Image Captioning via Structured Output Learning With Latent Variables.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Bilateral Ordinal Relevance Multi-Instance Regression for Facial Action Unit Intensity Estimation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Classifier Learning With Prior Probabilities for Facial Action Unit Recognition.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Weakly-Supervised Deep Convolutional Neural Network Learning for Facial Action Unit Intensity Estimation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Data-Driven Synthesis of Cartoon Faces Using Different Styles.
IEEE Trans. Image Process., 2017

2014
Data-driven face cartoon stylization.
Proceedings of the SIGGRAPH Asia 2014 Technical Briefs, 2014


  Loading...