Nabarun Goswami
Orcid: 0000-0002-3960-5627
According to our database1,
Nabarun Goswami
authored at least 12 papers
between 2018 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
Video Forgery Detection with Optical Flow Residuals and Spatial-Temporal Consistency.
CoRR, August, 2025
FUSE: Universal Speech Enhancement using Multi-Stage Fusion of Sparse Compression and Token Generation Models for the URGENT 2025 Challenge.
CoRR, June, 2025
CoRR, February, 2025
EDM-TTS: Efficient Dual-Stage Masked Modeling for Alignment-Free Text-to-Speech Synthesis.
Trans. Mach. Learn. Res., 2025
Trans. Mach. Learn. Res., 2025
T2V2: A Unified Non-Autoregressive Model for Speech Recognition and Synthesis via Multitask Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
2024
Trans. Int. Soc. Music. Inf. Retr., January, 2024
Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation.
CoRR, 2024
2022
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022
2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
2018
Mmdenselstm: An Efficient Combination of Convolutional and Recurrent Neural Networks for Audio Source Separation.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018
PhaseNet: Discretized Phase Modeling with Deep Neural Networks for Audio Source Separation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018