Jie Cao

Orcid: 0000-0001-6368-4495

Affiliations:
  • Chinese Academy of Sciences, Institute of Automation, National Laboratory of Pattern Recognition, Center for Research on Intelligent Perception and Computing, Center for Excellence in Brain Science and Intelligence Technology, Beijing, China
  • University of Chinese Academy of Sciences, School of Artificial Intelligence, Beijing, China


According to our database1, Jie Cao authored at least 57 papers between 2016 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Towards Robust Defense against Customization via Protective Perturbation Resistant to Diffusion-based Purification.
CoRR, September, 2025

HAD: Hybrid Architecture Distillation Outperforms Teacher in Genomic Sequence Modeling.
CoRR, May, 2025

Marmot: Multi-Agent Reasoning for Multi-Object Self-Correcting in Improving Image-Text Alignment.
CoRR, April, 2025

Test-time Forgery Detection with Spatial-Frequency Prompt Learning.
Int. J. Comput. Vis., February, 2025

MTSD: Simple Yet Effective Self-Distillation for Generalizable Deepfake Detection.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Degradation-Aware Multi-Task Image Restoration with State Space Models.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Breaking Mental Set to Improve Reasoning through Diverse Multi-Agent Debate.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Dual-PST: Dual-Branch SpatioTemporal-Planar Network for Video Forgery Detection.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
r-FACE: Reference guided face component editing.
Pattern Recognit., 2024

TT-DF: A Large-Scale Diffusion-Based Dataset and Benchmark for Human Body Forgery Detection.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Learning Fine-Grained and Semantically Aware Mamba Representations for Tampered Text Detection in Images.
Proceedings of the Pattern Recognition and Computer Vision - 7th Chinese Conference, 2024

Hallo3D: Multi-Modal Hallucination Detection and Mitigation for Consistent 3D Content Generation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

ZePo: Zero-Shot Portrait Stylization with Faster Sampling.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Semantic-Aware Detail Enhancement for Blind Face Restoration.
Proceedings of the 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

PortraitDAE: Line-Drawing Portraits Style Transfer from Photos via Diffusion Autoencoder with Meaningful Encoded Noise.
Proceedings of the 18th IEEE International Conference on Automatic Face and Gesture Recognition, 2024

2023
ScoreMix: A Scalable Augmentation Strategy for Training GANs With Limited Data.
IEEE Trans. Pattern Anal. Mach. Intell., July, 2023

Semantic-Aware Noise Driven Portrait Synthesis and Manipulation.
IEEE Trans. Multim., 2023

Exploring Straighter Trajectories of Flow Matching with Diffusion Guidance.
CoRR, 2023

Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion.
CoRR, 2023

AUTO: Adaptive Outlier Optimization for Online Test-Time OOD Detection.
CoRR, 2023

Where to Focus: Central Attention-Based Face Forgery Detection.
Proceedings of the Pattern Recognition and Computer Vision - 6th Chinese Conference, 2023

Modify: Model-Driven Face Stylization Without Style Images.
Proceedings of the IEEE International Conference on Acoustics, 2023

Fine-Grained Face Sketch-Photo Synthesis with Text-Guided Diffusion Models.
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

2022
Learning 3D Human Shape and Pose From Dense Body Parts.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Vision Transformer with Super Token Sampling.
CoRR, 2022

Styleverse: Towards Identity Stylization across Heterogeneous Domains.
CoRR, 2022

Order-aware Human Interaction Manipulation.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Adaptive Transformer-Based Conditioned Variational Autoencoder for Incomplete Social Event Classification.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Improving Subgraph Recognition with Variational Graph Information Bottleneck.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Partial NIR-VIS Heterogeneous Face Recognition With Automatic Saliency Search.
IEEE Trans. Inf. Forensics Secur., 2021

FA-GAN: Face Augmentation GAN for Deformation-Invariant Face Recognition.
IEEE Trans. Inf. Forensics Secur., 2021

Controllable Multi-Attribute Editing of High-Resolution Face Images.
IEEE Trans. Inf. Forensics Secur., 2021

Global Relation-Aware Attention Network for Image-Text Retrieval.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

FaceInpainter: High Fidelity Face Adaptation to Heterogeneous Domains.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

ReMix: Towards Image-to-Image Translation With Limited Data.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Adversarial Cross-Spectral Face Completion for NIR-VIS Face Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Disentangled Representation Learning of Makeup Portraits in the Wild.
Int. J. Comput. Vis., 2020

Towards High Fidelity Face Frontalization in the Wild.
Int. J. Comput. Vis., 2020

P<sup>2</sup> Net: Augmented Parallel-Pyramid Net for Attention Guided Pose Estimation.
CoRR, 2020

Augmented Parallel-Pyramid Net for Attention Guided Pose-Estimation.
CoRR, 2020

InteractGAN: Learning to Generate Human-Object Interaction.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

Reference Guided Face Component Editing.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Exemplar Guided Cross-Spectral Face Hallucination via Mutual Information Disentanglement.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

$P^{2}$ Net: Augmented Parallel-Pyramid Net for Attention Guided Pose Estimation.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Informative Sample Mining Network for Multi-domain Image-to-Image Translation.
Proceedings of the Computer Vision - ECCV 2020, 2020

PSGAN: Pose and Expression Robust Spatial-Aware GAN for Customizable Makeup Transfer.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
3D Aided Duet GANs for Multi-View Face Image Synthesis.
IEEE Trans. Inf. Forensics Secur., 2019

PSGAN: Pose-Robust Spatial-Aware GAN for Customizable Makeup Transfer.
CoRR, 2019

Biphasic Learning of GANs for High-Resolution Image-to-Image Translation.
CoRR, 2019

Cross-spectral Face Completion for NIR-VIS Heterogeneous Face Recognition.
CoRR, 2019

DaNet: Decompose-and-aggregate Network for 3D Human Shape and Pose Estimation.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Pose-preserving Cross Spectral Face Hallucination.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Geometry-Aware Face Completion and Editing.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Load Balanced GANs for Multi-view Face Image Synthesis.
CoRR, 2018

Learning a High Fidelity Pose Invariant Model for High-resolution Face Frontalization.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017
Recent Progress of Face Image Synthesis.
Proceedings of the 4th IAPR Asian Conference on Pattern Recognition, 2017

2016
Accurate mouth state estimation via convolutional neural networks.
Proceedings of the 2016 IEEE International Conference on Digital Signal Processing, 2016


  Loading...