Zhe Kong

Orcid: 0000-0001-8680-5376

According to our database, Zhe Kong authored at least 18 papers between 2014 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
InfiniteTalk: Audio-driven Video Generation for Sparse-Frame Video Dubbing.
CoRR, August, 2025

FineMotion: A Dataset and Benchmark with both Spatial and Temporal Annotation for Fine-grained Motion Generation and Editing.
CoRR, July, 2025

DAM-VSR: Disentanglement of Appearance and Motion for Video Super-Resolution.
CoRR, July, 2025

Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation.
CoRR, May, 2025

MG-MotionLLM: A Unified Framework for Motion Comprehension and Generation across Multiple Granularities.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
Dual Teacher Knowledge Distillation With Domain Alignment for Face Anti-Spoofing.
IEEE Trans. Circuits Syst. Video Technol., December, 2024

Taming Self-Supervised Learning for Presentation Attack Detection: De-Folding and De-Mixing.
IEEE Trans. Neural Networks Learn. Syst., August, 2024

StereoCrafter: Diffusion-based Generation of Long and High-fidelity Stereoscopic 3D from Monocular Videos.
CoRR, 2024

Enhancing Generative Generalized Zero Shot Learning via Multi-Space Constraints and Adaptive Integration.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024

Enhancing Document-Level Event Extraction via Structure-Aware Heterogeneous Graph with Multi-Granularity Subsentences.
Proceedings of the IEEE International Conference on Acoustics, 2024

OMG: Occlusion-Friendly Personalized Multi-concept Generation in Diffusion Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

2022
Fingerprint Presentation Attack Detection by Channel-Wise Feature Denoising.
IEEE Trans. Inf. Forensics Secur., 2022

Multi-level Fusion of Multi-modal Semantic Embeddings for Zero Shot Learning.
Proceedings of the International Conference on Multimodal Interaction, 2022

Searching Models with Nested Attention for Blind Super-Resolution.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

A Noise-Aware Framework for Blind Image Super-Resolution.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

2021
Taming Self-Supervised Learning for Presentation Attack Detection: In-Image De-Folding and Out-of-Image De-Mixing.
CoRR, 2021

Entity Extraction of Electrical Equipment Malfunction Text by a Hybrid Natural Language Processing Algorithm.
IEEE Access, 2021

2014
A New Design of Video Streaming Capture and Transmission System.
J. Multim., 2014

