Santiago Pascual

According to our database1, Santiago Pascual authored at least 35 papers between 2016 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
GASS: Generalizing Audio Source Separation with Large-scale Data.
CoRR, 2023

CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Mono-to-Stereo Through Parametric Stereo Generation.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Adversarial Permutation Invariant Training for Universal Sound Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Full-Band General Audio Synthesis with Score-Based Diffusion.
Proceedings of the IEEE International Conference on Acoustics, 2023

Upsampling Layers for Music Source Separation.
Proceedings of the 31st European Signal Processing Conference, 2023

2022
Universal Speech Enhancement with Score-based Diffusion.
CoRR, 2022

On Loss Functions and Evaluation Metrics for Music Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
On tuning consistent annealed sampling for denoising score matching.
CoRR, 2021

Adversarial Auto-Encoding for Packet Loss Concealment.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Automatic Multitrack Mixing With A Differentiable Mixing Console Of Neural Audio Effects.
Proceedings of the IEEE International Conference on Acoustics, 2021

SESQA: Semi-Supervised Learning for Speech Quality Assessment.
Proceedings of the IEEE International Conference on Acoustics, 2021

Upsampling Artifacts in Neural Audio Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
CATOTRON - A Neural Text-to-Speech System in Catalan.
Proceedings of the Interspeech 2020, 2020

Multi-Task Self-Supervised Learning for Robust Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Sample drop detection for asynchronous devices distributed in space.
Proceedings of the 28th European Signal Processing Conference, 2020

2019
Time-domain speech enhancement using generative adversarial networks.
Speech Commun., 2019

Sample Drop Detection for Distant-speech Recognition with Asynchronous Devices Distributed in Space.
CoRR, 2019

Problem-Agnostic Speech Embeddings for Multi-Speaker Text-to-Speech with SampleRNN.
CoRR, 2019

Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Towards Generalized Speech Enhancement with Generative Adversarial Networks.
Proceedings of the Interspeech 2019, 2019

Learning Problem-Agnostic Speech Representations from Multiple Self-Supervised Tasks.
Proceedings of the Interspeech 2019, 2019

Wav2Pix: Speech-conditioned Face Generation Using Generative Adversarial Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Spanish Statistical Parametric Speech Synthesis Using a Neural Vocoder.
Proceedings of the Interspeech 2018, 2018

Language and Noise Transfer in Speech Enhancement Generative Adversarial Network.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks.
Proceedings of the Fourth International Conference, 2018

Self-Attention Linguistic-Acoustic Decoder.
Proceedings of the Fourth International Conference, 2018

Multi-Speaker Neural Vocoder.
Proceedings of the Fourth International Conference, 2018

Towards a Universal Neural Network Encoder for Time Series.
Proceedings of the Artificial Intelligence Research and Development, 2018

2017
SEGAN: Speech Enhancement Generative Adversarial Network.
Proceedings of the Interspeech 2017, 2017

2016
Multi-output RNN-LSTM for multiple speaker speech synthesis with α-interpolation model.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Prosodic Break Prediction with RNNs.
Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016

Multi-output RNN-LSTM for multiple speaker speech synthesis and adaptation.
Proceedings of the 24th European Signal Processing Conference, 2016

Acoustic feature prediction from semantic features for expressive speech using deep neural networks.
Proceedings of the 24th European Signal Processing Conference, 2016


  Loading...