Santiago Pascual

CoRR, 2021

Adversarial Auto-Encoding for Packet Loss Concealment.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Automatic Multitrack Mixing With A Differentiable Mixing Console Of Neural Audio Effects.

[BibT_eX]

[DOI]

Christian J. Steinmetz

Proceedings of the IEEE International Conference on Acoustics, 2021

SESQA: Semi-Supervised Learning for Speech Quality Assessment.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Upsampling Artifacts in Neural Audio Synthesis.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

CATOTRON - A Neural Text-to-Speech System in Catalan.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Multi-Task Self-Supervised Learning for Robust Speech Recognition.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Sample drop detection for asynchronous devices distributed in space.

[BibT_eX]

[DOI]

Tina Raissi

Maurizio Omologo

Proceedings of the 28th European Signal Processing Conference, 2020

2019

Time-domain speech enhancement using generative adversarial networks.

[BibT_eX]

[DOI]

Speech Commun., 2019

Sample Drop Detection for Distant-speech Recognition with Asynchronous Devices Distributed in Space.

[BibT_eX]

[DOI]

Tina Raissi

Maurizio Omologo

CoRR, 2019

Problem-Agnostic Speech Embeddings for Multi-Speaker Text-to-Speech with SampleRNN.

[BibT_eX]

[DOI]

David Álvarez

Proceedings of the 10th ISCA Speech Synthesis Workshop, 2019

Blow: a single-scale hyperconditioned flow for non-parallel raw-audio voice conversion.

[BibT_eX]

[DOI]

Carlos Segura

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Towards Generalized Speech Enhancement with Generative Adversarial Networks.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Learning Problem-Agnostic Speech Representations from Multiple Self-Supervised Tasks.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Wav2Pix: Speech-conditioned Face Generation Using Generative Adversarial Networks.

[BibT_eX]

[DOI]

Amanda Cardoso Duarte

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Spanish Statistical Parametric Speech Synthesis Using a Neural Vocoder.

[BibT_eX]

[DOI]

Georgina Dorca

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Language and Noise Transfer in Speech Enhancement Generative Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Whispered-to-voiced Alaryngeal Speech Conversion with Generative Adversarial Networks.

[BibT_eX]

[DOI]

José Andrés González López

Proceedings of the Fourth International Conference, 2018

Self-Attention Linguistic-Acoustic Decoder.

[BibT_eX]

[DOI]

Proceedings of the Fourth International Conference, 2018

Multi-Speaker Neural Vocoder.

[BibT_eX]

[DOI]

Oriol Barbany

Proceedings of the Fourth International Conference, 2018

Towards a Universal Neural Network Encoder for Time Series.

[BibT_eX]

[DOI]

Alexandros Karatzoglou

Proceedings of the Artificial Intelligence Research and Development, 2018

2017

SEGAN: Speech Enhancement Generative Adversarial Network.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

Multi-output RNN-LSTM for multiple speaker speech synthesis with α-interpolation model.

[BibT_eX]

[DOI]

Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

Prosodic Break Prediction with RNNs.

[BibT_eX]

[DOI]

Proceedings of the Advances in Speech and Language Technologies for Iberian Languages, 2016

Multi-output RNN-LSTM for multiple speaker speech synthesis and adaptation.

[BibT_eX]

[DOI]

Proceedings of the 24th European Signal Processing Conference, 2016

Acoustic feature prediction from semantic features for expressive speech using deep neural networks.

[BibT_eX]

[DOI]

Igor Jauk