Yong Zhang

Orcid: 0000-0003-0066-3448

Affiliations:

Tencent AI Lab, Shenzhen, China
Chinese Academy of Sciences, Institute of Automation, Beijing, China (PhD 2018)

According to our database¹, Yong Zhang authored at least 93 papers between 2002 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

Improving Fast Adversarial Training With Prior-Guided Knowledge.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., September, 2024

Generalizable Black-Box Adversarial Attack With Meta Learning.

[BibT_eX]

[DOI]

IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation.

[BibT_eX]

[DOI]

CoRR, 2024

CV-VAE: A Compatible Video VAE for Latent Generative Video Models.

[BibT_eX]

[DOI]

CoRR, 2024

LLMs Meet Multimodal Generation and Editing: A Survey.

[BibT_eX]

[DOI]

CoRR, 2024

ToonCrafter: Generative Cartoon Interpolation.

[BibT_eX]

[DOI]

CoRR, 2024

Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework.

[BibT_eX]

[DOI]

CoRR, 2024

OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2024

Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation.

[BibT_eX]

[DOI]

CoRR, 2024

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models.

[BibT_eX]

[DOI]

CoRR, 2024

FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Effective and Robust Detection of Adversarial Examples via Benford-Fourier Coefficients.

[BibT_eX]

[DOI]

Mach. Intell. Res., October, 2023

W-Net: Structure and Texture Interaction for Image Inpainting.

[BibT_eX]

[DOI]

IEEE Trans. Multim., 2023

Fast Adversarial Training With Adaptive Step Size.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2023

VDTR: Video Deblurring With Transformer.

[BibT_eX]

[DOI]

IEEE Trans. Circuits Syst. Video Technol., 2023

Robust Physical-World Attacks on Face Recognition.

[BibT_eX]

[DOI]

Pattern Recognit., 2023

Towards harmonized regional style transfer and manipulation for facial images.

[BibT_eX]

[DOI]

Comput. Vis. Media, 2023

AnimateZero: Video Diffusion Models are Zero-Shot Image Animators.

[BibT_eX]

[DOI]

CoRR, 2023

StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter.

[BibT_eX]

[DOI]

CoRR, 2023

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation.

[BibT_eX]

[DOI]

CoRR, 2023

E4S: Fine-grained Face Swapping via Editing With Regional GAN Inversion.

[BibT_eX]

[DOI]

CoRR, 2023

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors.

[BibT_eX]

[DOI]

CoRR, 2023

EvalCrafter: Benchmarking and Evaluating Large Video Generation Models.

[BibT_eX]

[DOI]

CoRR, 2023

Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation.

[BibT_eX]

[DOI]

CoRR, 2023

On the Cultural Gap in Text-to-Image Generation.

[BibT_eX]

[DOI]

CoRR, 2023

Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance.

[BibT_eX]

[DOI]

CoRR, 2023

Robust Backdoor Attack with Visible, Semantic, Sample-Specific, and Compatible Triggers.

[BibT_eX]

[DOI]

CoRR, 2023

TaleCrafter: Interactive Story Visualization with Multiple Characters.

[BibT_eX]

[DOI]

CoRR, 2023

T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations.

[BibT_eX]

[DOI]

CoRR, 2023

Interactive Story Visualization with Multiple Characters.

[BibT_eX]

[DOI]

Proceedings of the SIGGRAPH Asia 2023 Conference Papers, 2023

NOFA: NeRF-based One-shot Facial Avatar Reconstruction.

[BibT_eX]

[DOI]

Proceedings of the ACM SIGGRAPH 2023 Conference Proceedings, 2023

Inserting Anybody in Diffusion Models via Celeb Basis.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DeepfakeBench: A Comprehensive Benchmark of Deepfake Detection.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

FateZero: Fusing Attentions for Zero-shot Text-based Video Editing.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

ToonTalker: Cross-Domain Face Reenactment.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Domain Generalization via Rationale Invariance.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

UCF: Uncovering Common Features for Generalizable Deepfake Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Generating Human Motion from Textual Descriptions with Discrete Representations.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

3D GAN Inversion with Facial Symmetry Prior.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Next3D: Generative Neural Texture Rasterization for 3D-Aware Head Avatars.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

DPE: Disentanglement of Pose and Expression for General Video Portrait Editing.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Fine-Grained Face Swapping Via Regional GAN Inversion.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

High-Fidelity Clothed Avatar Reconstruction from a Single Image.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Improved Test-Time Adaptation for Domain Generalization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

High-fidelity Facial Avatar Reconstruction from Monocular Video with Generative Priors.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CoordFill: Efficient High-Resolution Image Inpainting via Parameterized Coordinate Querying.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Image Inpainting With Local and Global Refinement.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Boosting Fast Adversarial Training With Learnable Adversarial Initialization.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2022

Latent Video Diffusion Models for High-Fidelity Video Generation with Arbitrary Lengths.

[BibT_eX]

[DOI]

CoRR, 2022

VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild.

[BibT_eX]

[DOI]

Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022

Boosting the Transferability of Adversarial Attacks with Reverse Adversarial Perturbation.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

OST: Improving Generalization of DeepFake Detection via One-Shot Test-Time Training.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Cross-subject Action Unit Detection with Meta Learning and Transformer-based Relation Modeling.

[BibT_eX]

[DOI]

Jiyuan Cao

Zhilei Liu

Yong Zhang

Proceedings of the International Joint Conference on Neural Networks, 2022

StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Prior-Guided Adversarial Initialization for Fast Adversarial Training.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

Towards Real-World Video Deblurring by Exploring Blur Formation Process.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

High-Fidelity GAN Inversion for Image Attribute Editing.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

FENeRF: Face Editing in Neural Radiance Fields.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

LAS-AT: Adversarial Training with Learnable Attack Strategy.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021

Aesthetic-guided outward image cropping.

[BibT_eX]

[DOI]

ACM Trans. Graph., 2021

Learning to assess visual aesthetics of food images.

[BibT_eX]

[DOI]

Comput. Vis. Media, 2021

An Effective and Robust Detector for Logo Detection.

[BibT_eX]

[DOI]

CoRR, 2021

Targeted Attack against Deep Neural Networks via Flipping Limited Weight Bits.

[BibT_eX]

[DOI]

Proceedings of the 9th International Conference on Learning Representations, 2021

Semi-Autoregressive Transformer for Image Captioning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

DAE-GAN: Dynamic Aspect-aware GAN for Text-to-Image Synthesis.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Meta-Attack: Class-agnostic and Model-agnostic Physical Adversarial Attack.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Generalizing Face Forgery Detection With High-Frequency Features.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Dual ResGCN for Balanced Scene GraphGeneration.

[BibT_eX]

[DOI]

CoRR, 2020

Effective and Robust Detection of Adversarial Examples via Benford-Fourier Coefficients.

[BibT_eX]

[DOI]

CoRR, 2020

Controllable Descendant Face Synthesis.

[BibT_eX]

[DOI]

CoRR, 2020

Context-Aware Cross-Attention for Skeleton-Based Human Action Recognition.

[BibT_eX]

[DOI]

IEEE Access, 2020

Groupwise Ranking Loss for Multi-Label Learning.

[BibT_eX]

[DOI]

IEEE Access, 2020

Sparse Adversarial Attack via Perturbation Factorization.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Label Error Correction and Generation through Label Relationships.

[BibT_eX]

[DOI]

Zijun Cui

Yong Zhang

Qiang Ji

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning.

[BibT_eX]

[DOI]

CoRR, 2019

Semi-Supervised Deep Neural Network for Joint Intensity Estimation of Multiple Facial Action Units.

[BibT_eX]

[DOI]

IEEE Access, 2019

Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning.

[BibT_eX]

[DOI]

IEEE Access, 2019

Food Photo Enhancer of One Sample Generative Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Context-Aware Feature and Label Fusion for Facial Action Unit Intensity Estimation With Partially Labeled Data.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Joint Representation and Estimator Learning for Facial Action Unit Intensity Estimation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Compressing Convolutional Neural Networks via Factorized Convolutional Filters.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Exact Adversarial Attack to Image Captioning via Structured Output Learning With Latent Variables.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Bilateral Ordinal Relevance Multi-Instance Regression for Facial Action Unit Intensity Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Classifier Learning With Prior Probabilities for Facial Action Unit Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Weakly-Supervised Deep Convolutional Neural Network Learning for Facial Action Unit Intensity Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Data-Driven Synthesis of Cartoon Faces Using Different Styles.

[BibT_eX]

[DOI]

IEEE Trans. Image Process., 2017

2014

Data-driven face cartoon stylization.

[BibT_eX]

[DOI]

Proceedings of the SIGGRAPH Asia 2014 Technical Briefs, 2014

2002

Automatic Scientific Text Classification Using Local Patterns: KDD Cup 2002 (Task 1).

[BibT_eX]

[DOI]

SIGKDD Explor., 2002

Yong Zhang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...