Siddharth Gururani

Orcid: 0009-0000-8511-6528

According to our database1, Siddharth Gururani authored at least 27 papers between 2016 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Benchmarking Single-Factor Physical Video-to-Audio Generation.
CoRR, May, 2026

Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music.
CoRR, April, 2026

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos.
CoRR, March, 2026

2025
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning.
CoRR, March, 2025

Cosmos World Foundation Model Platform for Physical AI.
CoRR, January, 2025

Fugatto 1: Foundational Generative Audio Transformer Opus 1.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Edify Image: High-Quality Image Generation with Pixel Space Laplacian Diffusion Models.
CoRR, 2024

ExpressiveSinger: Multilingual and Multi-Style Score-based Singing Voice Synthesis with Expressive Performance Control.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Multilingual Multiaccented Multispeaker TTS with RADTTS.
CoRR, 2023

RAD-MMM: Multilingual Multiaccented Multispeaker Text To Speech.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

SPACE: Speech-driven Portrait Animation with Controllable Expression.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
SPACEx: Speech-driven Portrait Animation with Controllable Expression.
CoRR, 2022

Anomalous behaviour in loss-gradient based interpretability methods.
CoRR, 2022

2021
Weakly Supervised Learning for Musical Instrument Classification.
PhD thesis, 2021

Semi-Supervised Audio Classification with Partially Labeled Data.
Proceedings of the IEEE International Symposium on Multimedia, 2021

2020
An Interdisciplinary Review of Music Performance Analysis.
Trans. Int. Soc. Music. Inf. Retr., 2020

Visual Attention for Musical Instrument Recognition.
CoRR, 2020

dMelodies: A Music Dataset for Disentanglement Learning.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Score-informed Networks for Music Performance Assessment.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

2019
Prosody Transfer in Neural Text to Speech Using Global Pitch and Loudness Features.
CoRR, 2019

Music Performance Analysis: A Survey.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

An Attention Mechanism for Musical Instrument Recognition.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

2018
Instrument Activity Detection in Polyphonic Music using Deep Neural Networks.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

2017
Objective Descriptors for the Assessment of Student Music Performances.
Proceedings of the AES International Conference Semantic Audio 2017, 2017

Automatic Sample Detection in Polyphonic Music.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

2016
Automatic Practice Logging: Introduction, Dataset & Preliminary Study.
Proceedings of the 17th International Society for Music Information Retrieval Conference, 2016


  Loading...