Hangting Chen

Orcid: 0000-0002-4085-4364

According to our database1, Hangting Chen authored at least 27 papers between 2018 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Continuous Target Speech Extraction: Enhancing Personalized Diarization and Extraction on Complex Recordings.
CoRR, 2024

SECap: Speech Emotion Captioning with Large Language Model.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
How to make embeddings suitable for PLDA.
Comput. Speech Lang., June, 2023

First coarse, fine afterward: A lightweight two-stage complex approach for monaural speech enhancement.
Speech Commun., 2023

Consistent and Relevant: Rethink the Query Embedding in General Sound Separation.
CoRR, 2023

AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data.
CoRR, 2023

Complexity Scaling for Speech Denoising.
CoRR, 2023

Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression.
CoRR, 2023

Bayes Risk Transducer: Transducer with Controllable Alignment Prediction.
CoRR, 2023

TSpeech-AI System Description to the 5th Deep Noise Suppression (DNS) Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Master-Teacher-Student: A Weakly Labelled Semi-Supervised Framework for Audio Tagging and Sound Event Detection.
IEICE Trans. Inf. Syst., 2022

The HCCL System for the NIST SRE21.
Proceedings of the Interspeech 2022, 2022

Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output.
Proceedings of the Interspeech 2022, 2022

DPT-FSNet: Dual-Path Transformer Based Full-Band and Sub-Band Fusion Network for Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
A dual-stream deep attractor network with multi-domain learning for speech dereverberation and separation.
Neural Networks, 2021

Improved Speech Enhancement Using a Complex-Domain GAN with Fused Time-Domain and Time-Frequency Domain Constraints.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Power Pooling: An Adaptive Pooling Function for Weakly Labelled Sound Event Detection.
Proceedings of the International Joint Conference on Neural Networks, 2021

2020
Power pooling: An adaptive pooling function for weakly labelled sound event detection.
CoRR, 2020

Exploring the time-domain deep attractor network with two-stream architectures in a reverberant environment.
CoRR, 2020

ACGAN-based Data Augmentation Integrated with Long-term Scalogram for Acoustic Scene Classification.
CoRR, 2020

Power Pooling Operators and Confidence Learning for Semi-Supervised Sound Event Detection.
CoRR, 2020

Improved Guided Source Separation Integrated with a Strong Back-End for the CHiME-6 Dinner Party Scenario.
Proceedings of the Interspeech 2020, 2020

2019
Integrating the Data Augmentation Scheme with Various Classifiers for Acoustic Scene Modeling.
CoRR, 2019

Speaker-Invariant Feature-Mapping for Distant Speech Recognition via Adversarial Teacher-Student Learning.
Proceedings of the Interspeech 2019, 2019

Audio Scene Classification with Discriminatively-Trained Segment-Level Features.
Proceedings of the IEEE International Conference on Multimedia & Expo Workshops, 2019

An Audio Scene Classification Framework with Embedded Filters and a DCT-based Temporal Module.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Deep Convolutional Neural Network with Scalogram for Audio Scene Modeling.
Proceedings of the Interspeech 2018, 2018


  Loading...