Jordi Pons

Orcid: 0000-0001-9603-0869

According to our database1, Jordi Pons authored at least 38 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Fast Timing-Conditioned Latent Audio Diffusion.
CoRR, 2024

2023
GASS: Generalizing Audio Source Separation with Large-scale Data.
CoRR, 2023

Towards Robust Image-in-Audio Deep Steganography.
CoRR, 2023

CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Mono-to-Stereo Through Parametric Stereo Generation.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Adversarial Permutation Invariant Training for Universal Sound Separation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Full-Band General Audio Synthesis with Score-Based Diffusion.
Proceedings of the IEEE International Conference on Acoustics, 2023

Upsampling Layers for Music Source Separation.
Proceedings of the 31st European Signal Processing Conference, 2023

2022
FSD50K: An Open Dataset of Human-Labeled Sound Events.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Universal Speech Enhancement with Score-based Diffusion.
CoRR, 2022

PodcastMix: A dataset for separating music and speech in podcasts.
Proceedings of the Interspeech 2022, 2022

On Loss Functions and Evaluation Metrics for Music Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2022

Pixinwav: Residual Steganography for Hiding Pixels in Audio.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
On tuning consistent annealed sampling for denoising score matching.
CoRR, 2021

Adversarial Auto-Encoding for Packet Loss Concealment.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2021

Automatic Multitrack Mixing With A Differentiable Mixing Console Of Neural Audio Effects.
Proceedings of the IEEE International Conference on Acoustics, 2021

SESQA: Semi-Supervised Learning for Speech Quality Assessment.
Proceedings of the IEEE International Conference on Acoustics, 2021

Upsampling Artifacts in Neural Audio Synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2021

On Permutation Invariant Training For Speech Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Multichannel-based Learning for Audio Object Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
An Empirical Study of Conv-Tasnet.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Tensorflow Audio Models in Essentia.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Deep neural networks for music and audio tagging.
PhD thesis, 2019

musicnn: Pre-trained convolutional neural networks for music audio tagging.
CoRR, 2019

End-to-End Music Source Separation: Is it Possible in the Waveform Domain?
Proceedings of the Interspeech 2019, 2019

Training Neural Audio Classifiers with Few Data.
Proceedings of the IEEE International Conference on Acoustics, 2019

Randomly Weighted CNNs for (Music) Audio Classification.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
End-to-end Learning for Music Audio Tagging at Scale.
Proceedings of the 19th International Society for Music Information Retrieval Conference, 2018

A Wavenet for Speech Denoising.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

General-purpose tagging of Freesound audio with AudioSet labels: task description, dataset, and baseline.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2018

2017
Score-Informed Syllable Segmentation for A Cappella Singing Voice with Convolutional Neural Networks.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Audio to Score Matching by Combining Phonetic and Duration Information.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Freesound Datasets: A Platform for the Creation of Open Audio Datasets.
Proceedings of the 18th International Society for Music Information Retrieval Conference, 2017

Designing efficient architectures for modeling temporal features with convolutional neural networks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Timbre analysis of music audio signals with convolutional neural networks.
Proceedings of the 25th European Signal Processing Conference, 2017

2016
Experimenting with musically motivated convolutional neural networks.
Proceedings of the 14th International Workshop on Content-Based Multimedia Indexing, 2016

2015
On automatic drum transcription using non-negative matrix deconvolution and itakura saito divergence.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2010
La Evaluación de Competencias en los Trabajos Fin de Estudios.
Rev. Iberoam. de Tecnol. del Aprendiz., 2010


  Loading...