Aswin Shanmugam Subramanian

Orcid: 0000-0003-4446-001X

According to our database, Aswin Shanmugam Subramanian authored at least 23 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

2023
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Hyperbolic Audio Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Reverberation as Supervision For Speech Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition.
Comput. Speech Lang., 2022

Towards End-to-end Speaker Diarization in the Wild.
CoRR, 2022

Heterogeneous Target Speech Separation.
Proceedings of the Interspeech 2022, 2022

Improved Domain Generalization via Disentangled Multi-Task Learning in Unsupervised Anomalous Sound Detection.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021
ESPnet-SE: End-To-End Speech Enhancement and Separation Toolkit Designed for ASR Integration.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization.
Proceedings of the IEEE International Conference on Acoustics, 2021

An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Significance of spectral cues in automatic speech segmentation for Indian language speech synthesizers.
Speech Commun., 2020

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans.
CoRR, 2020

The JHU Multi-Microphone Multi-Speaker ASR System for the CHiME-6 Challenge.
CoRR, 2020

End-to-End Far-Field Speech Recognition with Unified Dereverberation and Beamforming.
Proceedings of the Interspeech 2020, 2020

End-to-End ASR with Adaptive Span Self-Attention.
Proceedings of the Interspeech 2020, 2020

Far-Field Location Guided Target Speech Extraction Using End-to-End Speech Recognition Objectives.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Attention-Based ASR with Lightweight and Dynamic Convolutions.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Dry, Focus, and Transcribe: End-to-End Integration of Dereverberation, Beamforming, and ASR.
CoRR, 2019

Generalized Weighted-Prediction-Error Dereverberation with Varying Source Priors For Reverberant Speech Recognition.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Speech Enhancement Using End-to-End Speech Recognition Objectives.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

2018
Student-Teacher Learning for BLSTM Mask-based Speech Enhancement.
Proceedings of the Interspeech 2018, 2018

Building State-of-the-art Distant Speech Recognition Using the CHiME-4 Challenge with a Setup of Speech Enhancement Baseline.
Proceedings of the Interspeech 2018, 2018

