Kaisiyuan Wang
Orcid: 0000-0002-2120-8383
According to our database, Kaisiyuan Wang authored at least 31 papers between 2020 and 2026.
Bibliography
2026
ONE-SHOT: Compositional Human-Environment Video Synthesis via Spatial-Decoupled Motion Injection and Hybrid Context Integration.
CoRR, April, 2026
Cosh-DiT: Co-Speech Gesture Video Synthesis via Hybrid Audio-Visual Diffusion Transformers.
Int. J. Comput. Vis., March, 2026
InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance.
CoRR, March, 2026
MVHOI: Bridge Multi-view Condition to Complex Human-Object Interaction Video Reenactment via 3D Foundation Model.
CoRR, March, 2026
DISPLAY: Directable Human-Object Interaction Video Generation via Sparse Motion Guidance and Multi-Task Auxiliary.
CoRR, March, 2026
GenHOI: Towards Object-Consistent Hand-Object Interaction with Temporally Balanced and Spatially Selective Object Injection.
CoRR, March, 2026
2025
Improving CXR Bone Suppression by Exploiting Domain-Level and Instance-Level Information.
IEEE Trans. Medical Imaging, November, 2025
Real-Time Neural Radiance Talking Portrait Synthesis via Audio-Spatial Decomposition.
Int. J. Comput. Vis., September, 2025
iDiT-HOI: Inpainting-based Hand Object Interaction Reenactment via Video Diffusion Transformer.
CoRR, June, 2025
GestureHYDRA: Semantic Co-Speech Gesture Synthesis via Hybrid Modality Diffusion Transformer and Cascaded-Synchronized Retrieval-Augmented Generation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
2024
AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation.
IEEE Access, 2024
TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model.
Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024
ShowMaker: Creating High-Fidelity 2D Human Video via Fine-Grained Diffusion Modeling.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024
ReSyncer: Rewiring Style-Based Generator for Unified Audio-Visually Synced Facial Performer.
Proceedings of the Computer Vision - ECCV 2024, 2024
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation.
CoRR, 2023
Proceedings of the ACM SIGGRAPH 2023 Conference Proceedings, 2023
Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation.
Proceedings of the 25th International Conference on Multimodal Interaction, 2023
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-Based Generator.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
IEEE Trans. Image Process., 2022
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition.
CoRR, 2022
Proceedings of the SIGGRAPH Asia 2022 Conference Papers, 2022
Proceedings of the SIGGRAPH '22: Special Interest Group on Computer Graphics and Interactive Techniques Conference, Vancouver, BC, Canada, August 7, 2022
2021
IEEE Trans. Circuits Syst. Video Technol., 2021
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
Proceedings of the Computer Vision - ECCV 2020, 2020