Nabarun Goswami

Orcid: 0000-0002-3960-5627

According to our database¹, Nabarun Goswami authored at least 12 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of five.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

Video Forgery Detection with Optical Flow Residuals and Spatial-Temporal Consistency.

[BibT_eX]

[DOI]

CoRR, August, 2025

ARTalk: Speech-Driven 3D Head Animation via Autoregressive Model.

[BibT_eX]

[DOI]

CoRR, February, 2025

EDM-TTS: Efficient Dual-Stage Masked Modeling for Alignment-Free Text-to-Speech Synthesis.

[BibT_eX]

[DOI]

Nabarun Goswami

Hanqin Wang

Tatsuya Harada

Trans. Mach. Learn. Res., 2025

HyperVQ: MLR-based Vector Quantization in Hyperbolic Space.

[BibT_eX]

[DOI]

Nabarun Goswami

Yusuke Mukuta

Tatsuya Harada

Trans. Mach. Learn. Res., 2025

FUSE: Universal Speech Enhancement using Multi-Stage Fusion of Sparse Compression and Token Generation Models for the URGENT 2025 Challenge.

[BibT_eX]

[DOI]

Nabarun Goswami

Tatsuya Harada

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

T2V2: A Unified Non-Autoregressive Model for Speech Recognition and Synthesis via Multitask Learning.

[BibT_eX]

[DOI]

Nabarun Goswami

Hanqin Wang

Tatsuya Harada

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

The Sound Demixing Challenge 2023 - Music Demixing Track.

[BibT_eX]

[DOI]

Trans. Int. Soc. Music. Inf. Retr., January, 2024

Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation.

[BibT_eX]

[DOI]

CoRR, 2024

2022

SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate.

[BibT_eX]

[DOI]

Nabarun Goswami

Tatsuya Harada

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2019

Recursive Speech Separation for Unknown Number of Speakers.

[BibT_eX]

[DOI]

Naoya Takahashi

Sudarsanam Parthasaarathy

Nabarun Goswami

Yuki Mitsufuji

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018

Mmdenselstm: An Efficient Combination of Convolutional and Recurrent Neural Networks for Audio Source Separation.

[BibT_eX]

[DOI]

Naoya Takahashi

Nabarun Goswami

Yuki Mitsufuji

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

PhaseNet: Discretized Phase Modeling with Deep Neural Networks for Audio Source Separation.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Nabarun Goswami

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...