Rongzhi Gu

Orcid: 0000-0003-1861-9170

According to our database, Rongzhi Gu authored at least 34 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
SECap: Speech Emotion Captioning with Large Language Model.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

ReZero: Region-customizable Sound Extraction.
CoRR, 2023

Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression.
CoRR, 2023

The Sound Demixing Challenge 2023 - Cinematic Demixing Track.
CoRR, 2023

Fast Random Approximation of Multi-channel Room Impulse Response.
CoRR, 2023

3D Neural Beamforming for Multi-channel Speech Separation Against Location Uncertainty.
CoRR, 2023

TSpeech-AI System Description to the 5th Deep Noise Suppression (DNS) Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

Parameter-Efficient Transfer Learning of Pre-Trained Transformer Models for Speaker Verification Using Adapters.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches.
Proceedings of the Interspeech 2022, 2022

Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction.
Proceedings of the Interspeech 2022, 2022

Learnable Sparse Filterbank for Speaker Verification.
Proceedings of the Interspeech 2022, 2022

Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention.
Proceedings of the IEEE International Conference on Acoustics, 2022

Learning Decoupling Features Through Orthogonality Regularization.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Complex Neural Spatial Filter: Enhancing Multi-Channel Target Speech Separation in Complex Domain.
IEEE Signal Process. Lett., 2021

Layer Reduction: Accelerating Conformer-Based Self-Supervised Model via Layer Consistency.
CoRR, 2021

Text Anchor Based Metric Learning for Small-Footprint Keyword Spotting.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

ICSpk: Interpretable Complex Speaker Embedding Extractor from Raw Waveform.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Effective Phase Encoding for End-To-End Speaker Verification.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

3D Spatial Features for Multi-Channel Target Speech Separation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Multi-Modal Multi-Channel Target Speech Separation.
IEEE J. Sel. Top. Signal Process., 2020

Temporal-Spatial Neural Filter: Direction Informed End-to-End Multi-channel Target Speech Separation.
CoRR, 2020

Audio-Visual Multi-Channel Recognition of Overlapped Speech.
Proceedings of the Interspeech 2020, 2020

Deep Speaker Embedding with Long Short Term Centroid Learning for Text-Independent Speaker Verification.
Proceedings of the Interspeech 2020, 2020

Enhancing End-to-End Multi-Channel Speech Separation Via Spatial Feature Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Context-adaptive Gaussian Attention for Text-independent Speaker Verification.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
End-to-End Multi-Channel Speech Separation.
CoRR, 2019

Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information.
Proceedings of the Interspeech 2019, 2019

A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation.
Proceedings of the Interspeech 2019, 2019

Logistic Similarity Metric Learning via Affinity Matrix for Text-Independent Speaker Verification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Speaker-discriminative Embedding Learning via Affinity Matrix for Short Utterance Speaker Verification.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Alleviate Cross-chunk Permutation through Chunk-level Speaker Embedding for Blind Speech Separation.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2017
Interaction Data Detection System to Upgrade Brick and Mortar Shops: Metrics Allow Offline Shops to Compete with Online Retailers.
IEEE Consumer Electron. Mag., 2017

Learning a robust DOA estimation model with acoustic vector sensor cues.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

