Rif A. Saurous

According to our database, Rif A. Saurous authored at least 34 papers between 2016 and 2024.

Bibliography

2024
Scalable Spatiotemporal Prediction with Bayesian Neural Fields.
CoRR, 2024

Robust Inverse Graphics via Probabilistic Inference.
CoRR, 2024

2023
Grammar Prompting for Domain-Specific Language Generation with Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Training Chain-of-Thought via Latent-Variable Inference.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Sequential Monte Carlo Learning for Time Series Structure Discovery.
Proceedings of the International Conference on Machine Learning, 2023

ProbNeRF: Uncertainty-Aware Inference of 3D Shapes from 2D Images.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Language Model Cascades.
CoRR, 2022

2020
tfp.mcmc: Modern Markov Chain Monte Carlo Tools Built for Modern Hardware.
CoRR, 2020

Automatically batching control-intensive programs for modern accelerators.
Proceedings of Machine Learning and Systems 2020, 2020

Coincidence, Categorization, and Consolidation: Learning to Recognize Sounds with Minimal Supervision.
Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020

Large-Scale Weakly-Supervised Content Embeddings for Music Recommendation and Tagging.
Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020

2019
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking.
Proceedings of the Interspeech 2019, 2019

Differentiable Consistency Constraints for Improved Deep Speech Enhancement.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2019

2018
Ultrasonic Communication Using Consumer Hardware.
IEEE Trans. Multim., 2018

Simple, Distributed, and Accelerated Probabilistic Programming.
CoRR, 2018

Exploring Tradeoffs in Models for Low-Latency Speech Enhancement.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Emotion Recognition from Human Speech Using Temporal Information and Deep Learning.
Proceedings of the Interspeech 2018, 2018

Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis.
Proceedings of the 35th International Conference on Machine Learning, 2018

Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron.
Proceedings of the 35th International Conference on Machine Learning, 2018

Fixing a Broken ELBO.
Proceedings of the 35th International Conference on Machine Learning, 2018

Neumann Optimizer: A Practical Optimization Algorithm for Deep Neural Networks.
Proceedings of the 6th International Conference on Learning Representations, 2018

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions.
Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

Unsupervised Learning of Semantic Audio Representations.
Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

On Using Backpropagation for Speech Texture Generation and Voice Conversion.
Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing, 2018

2017
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions.
CoRR, 2017

TensorFlow Distributions.
CoRR, 2017

Uncovering Latent Style Factors for Expressive Speech Synthesis.
CoRR, 2017

An Information-Theoretic Analysis of Deep Latent-Variable Models.
CoRR, 2017

Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model.
CoRR, 2017


Deep Probabilistic Programming.
Proceedings of the 5th International Conference on Learning Representations, 2017

Trainable frontend for robust and far-field keyword spotting.
Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, 2017

CNN architectures for large-scale audio classification.
Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, 2017

2016
AutoMOS: Learning a non-intrusive assessor of naturalness-of-speech.
CoRR, 2016

