Nabarun Goswami

Orcid: 0000-0002-3960-5627

According to our database, Nabarun Goswami authored at least 12 papers between 2018 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
Video Forgery Detection with Optical Flow Residuals and Spatial-Temporal Consistency.
CoRR, August, 2025

FUSE: Universal Speech Enhancement using Multi-Stage Fusion of Sparse Compression and Token Generation Models for the URGENT 2025 Challenge.
CoRR, June, 2025

ARTalk: Speech-Driven 3D Head Animation via Autoregressive Model.
CoRR, February, 2025

EDM-TTS: Efficient Dual-Stage Masked Modeling for Alignment-Free Text-to-Speech Synthesis.
Trans. Mach. Learn. Res., 2025

HyperVQ: MLR-based Vector Quantization in Hyperbolic Space.
Trans. Mach. Learn. Res., 2025

T2V2: A Unified Non-Autoregressive Model for Speech Recognition and Synthesis via Multitask Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
The Sound Demixing Challenge 2023 - Music Demixing Track.
Trans. Int. Soc. Music. Inf. Retr., January, 2024

Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation.
CoRR, 2024

2022
SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2019
Recursive Speech Separation for Unknown Number of Speakers.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018
MMDenseLSTM: An Efficient Combination of Convolutional and Recurrent Neural Networks for Audio Source Separation.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

PhaseNet: Discretized Phase Modeling with Deep Neural Networks for Audio Source Separation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

