Kai Li

Orcid: 0000-0002-4960-3019

Affiliations:
  • Tsinghua University, Institute for Artificial Intelligence, Department of Computer Science and Technology, BNRist, Beijing, China


According to our database1, Kai Li authored at least 26 papers between 2014 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Enhancing Spectrogram Realism in Singing Voice Synthesis via Explicit Bandwidth Extension Prior to Vocoder.
CoRR, August, 2025

EAGLE: An Efficient Global Attention Lesion Segmentation Model for Hepatic Echinococcosis.
CoRR, June, 2025

A Fast and Lightweight Model for Causal Audio-Visual Speech Separation.
CoRR, June, 2025

GDSR: Global-Detail Integration Through Dual-Branch Network With Wavelet Losses for Remote Sensing Image Super-Resolution.
IEEE Trans. Geosci. Remote. Sens., 2025

SPMamba: Leveraging Long-Sequence Modeling with State Space Models for Speech Separation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2025

Mastering Visual Reinforcement Learning via Positive Unlabeled Policy-Guided Contrast.
Proceedings of the Advanced Intelligent Computing Technology and Applications, 2025

2024
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits.
IEEE Trans. Pattern Anal. Mach. Intell., October, 2024

HES-UNet: A U-Net for Hepatic Echinococcosis Lesion Segmentation.
CoRR, 2024

TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion.
CoRR, 2024

CMIX: Causal Value Decomposition for Cooperative Multi-Agent Reinforcement Learning.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2024

IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

SafeEar: Content Privacy-Preserving Audio Deepfake Detection.
Proceedings of the 2024 on ACM SIGSAC Conference on Computer and Communications Security, 2024

2023
SCANet: A Self- and Cross-Attention Network for Audio-Visual Speech Separation.
CoRR, 2023

Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

An efficient encoder-decoder architecture with top-down attention for speech separation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Inferring Mechanisms of Auditory Attentional Modulation with Deep Neural Networks.
Neural Comput., 2022

On the Use of Deep Mask Estimation Module for Neural Source Separation Systems.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Speech Separation Using an Asynchronous Fully Recurrent Convolutional Neural Network.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Survey of single image super-resolution reconstruction.
IET Image Process., 2020

Dynamic Spatio-Temporal Graph-Based CNNs for Traffic Flow Prediction.
IEEE Access, 2020

An Algorithm-optimized Car-following Model Based on Chengdu Ring Expressway Traffic Flow Characteristics.
Proceedings of the ICBDC 2020: 5th International Conference on Big Data and Computing, 2020

2019
Single Image Super-Resolution Reconstruction of Enhanced Loss Function with Multi-GPU Training.
Proceedings of the 2019 IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2019

2018
Multi-USVs Coordinated Detection in Marine Environment with Deep Reinforcement Learning.
Proceedings of the Benchmarking, Measuring, and Optimizing, 2018

2015
Accurate kidney surface reconstruction from 3D ultrasonography for volume assessment: First clinical evaluation.
Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2015

2014
Augmenting interventional ultrasound using statistical shape model for guiding percutaneous nephrolithotomy: Initial evaluation in pigs.
Neurocomputing, 2014


  Loading...