Siddharth Gururani

Orcid: 0009-0000-8511-6528

According to our database¹, Siddharth Gururani authored at least 27 papers between 2016 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

Benchmarking Single-Factor Physical Video-to-Audio Generation.

Gopala Anumanchipalli

Ming-Yu Liu

CoRR, May, 2026

Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music.

CoRR, April, 2026

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos.

CoRR, March, 2026

2025

Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning.

CoRR, March, 2025

Cosmos World Foundation Model Platform for Physical AI.

Prithvijit Chattopadhyay

Vasanth Rao Naik Sabavat

CoRR, January, 2025

Fugatto 1: Foundational Generative Audio Transformer Opus 1.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models.

CoRR, 2024

ExpressiveSinger: Multilingual and Multi-Style Score-based Singing Voice Synthesis with Expressive Performance Control.

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion.

Chandramouli Shama Sastry

Siddharth Gururani

Sageev Oore

Yisong Yue

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Multilingual Multiaccented Multispeaker TTS with RADTTS.

CoRR, 2023

RAD-MMM: Multilingual Multiaccented Multispeaker Text To Speech.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

SPACE: Speech-driven Portrait Animation with Controllable Expression.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022

SPACEx: Speech-driven Portrait Animation with Controllable Expression.

CoRR, 2022

Anomalous behaviour in loss-gradient based interpretability methods.

CoRR, 2022

2021

Weakly Supervised Learning for Musical Instrument Classification.

Siddharth Kumar Gururani

PhD thesis, 2021

Semi-Supervised Audio Classification with Partially Labeled Data.

Siddharth Gururani

Alexander Lerch

Proceedings of the IEEE International Symposium on Multimedia, 2021

2020

An Interdisciplinary Review of Music Performance Analysis.

Trans. Int. Soc. Music. Inf. Retr., 2020

Visual Attention for Musical Instrument Recognition.

Karn Watcharasupat

Siddharth Gururani

Alexander Lerch

CoRR, 2020

dMelodies: A Music Dataset for Disentanglement Learning.

Ashis Pati

Siddharth Kumar Gururani

Alexander Lerch

Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

Score-informed Networks for Music Performance Assessment.

Jiawen Huang

Yun-Ning Hung

Ashis Pati

Siddharth Kumar Gururani

Alexander Lerch

Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020

2019

Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features.

CoRR, 2019

Music Performance Analysis: A Survey.

Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

An Attention Mechanism for Musical Instrument Recognition.

Siddharth Gururani

Mohit Sharma

Alexander Lerch

Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

2018

Instrument Activity Detection in Polyphonic Music using Deep Neural Networks.

Siddharth Gururani

Cameron Summers

Alexander Lerch

Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

2017

Objective Descriptors for the Assessment of Student Music Performances.

Proceedings of the AES International Conference Semantic Audio 2017, 2017

Automatic Sample Detection in Polyphonic Music.

Siddharth Gururani

Alexander Lerch

Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

2016

Automatic Practice Logging: Introduction, Dataset & Preliminary Study.

R. Michael Winters

Siddharth Gururani

Alexander Lerch

Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016
