Yaohui Wang
ORCID: 0009-0002-9487-6187
Affiliations:
- Shanghai Artificial Intelligence Laboratory, China
- University of Côte d'Azur, Nice, France (PhD 2021)
- INRIA, STARS, Sophia-Antipolis, France (former)
According to our database, Yaohui Wang authored at least 59 papers between 2018 and 2025.
Bibliography
2025
Vinci: A Real-time Smart Assistant Based on Egocentric Vision-language Model for Portable Devices.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., September, 2025
CineTrans: Learning to Generate Videos with Cinematic Transitions via Masked Diffusion Models.
CoRR, August, 2025
Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers.
CoRR, August, 2025
GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects.
CoRR, June, 2025
Int. J. Comput. Vis., May, 2025
Int. J. Comput. Vis., March, 2025
CoRR, March, 2025
TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision.
CoRR, March, 2025
CoRR, March, 2025
CoRR, February, 2025
CoRR, January, 2025
Trans. Mach. Learn. Res., 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
Int. J. Comput. Vis., July, 2024
Expert Syst. Appl., January, 2024
Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model.
CoRR, 2024
CoRR, 2024
CoRR, 2024
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Learning Invariance From Generated Variance for Unsupervised Person Re-Identification.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation.
CoRR, 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning.
CoRR, 2023
Proceedings of the International Conference on Machine Learning, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
CoRR, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
2021
PhD thesis, 2021
InMoDeGAN: Interpretable Motion Decomposition Generative Adversarial Network for Video Generation.
CoRR, 2021
Selective Spatio-Temporal Aggregation Based Pose Refinement System: Towards Understanding Human Activities in Real-World Videos.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021
Self-Supervised Video Pose Representation Learning for Occlusion-Robust Action Recognition.
Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021
Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
Proceedings of the 32nd British Machine Vision Conference 2021, 2021
2020
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020
A video is worth more than 1000 lies. Comparing 3DCNN approaches for detecting deepfakes.
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
G³AN: This video does not exist. Disentangling motion and appearance for video generation.
CoRR, 2019
2018
Comparing Methods for Assessment of Facial Dynamics in Patients with Major Neurocognitive Disorders.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
From Attribute-Labels to Faces: Face Generation Using a Conditional Generative Adversarial Network.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018
Proceedings of the 2018 International Conference of the Biometrics Special Interest Group, 2018