Kai Li

Orcid: 0000-0001-8452-620X

Affiliations:
  • Tsinghua University, Institute for Artificial Intelligence, Department of Computer Science and Technology, BNRist, Beijing, China


According to our database1, Kai Li authored at least 15 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Enhancing Spectrogram Realism in Singing Voice Synthesis via Explicit Bandwidth Extension Prior to Vocoder.
CoRR, August, 2025

Mastering Visual Reinforcement Learning via Positive Unlabeled Policy-Guided Contrast.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

2024
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion.
CoRR, 2024

CMIX: Causal Value Decomposition for Cooperative Multi-Agent Reinforcement Learning.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

SafeEar: Content Privacy-Preserving Audio Deepfake Detection.
Proceedings of the 2024 on ACM SIGSAC Conference on Computer and Communications Security, 2024

2023
SCANet: A Self- and Cross-Attention Network for Audio-Visual Speech Separation.
CoRR, 2023

Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

An efficient encoder-decoder architecture with top-down attention for speech separation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Inferring Mechanisms of Auditory Attentional Modulation with Deep Neural Networks.
Neural Comput., 2022

On the Use of Deep Mask Estimation Module for Neural Source Separation Systems.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2018
Multi-USVs Coordinated Detection in Marine Environment with Deep Reinforcement Learning.
Proceedings of the Benchmarking, Measuring, and Optimizing, 2018


  Loading...