Tom Bäckström

CoRR, January, 2026

2025

Introduction to Speech Processing v2.01.

[BibT_eX]

[DOI]

Okko Räsänen

Abraham Woubie Zewoudie

Daniel Ramos

Sudarsana Reddy Kadiri

Dataset, October, 2025

DiVeQ: Differentiable Vector Quantization Using the Reparameterization Trick.

[BibT_eX]

[DOI]

Arno Solin

CoRR, September, 2025

Privacy in Speech Technology.

[BibT_eX]

[DOI]

Proc. IEEE, July, 2025

Unsupervised Panoptic Interpretation of Latent Spaces in GANs Using Space-Filling Vector Quantization.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2025

2024

Good practices for evaluation of machine learning systems.

[BibT_eX]

[DOI]

Luciana Ferrer

Odette Scharenborg

CoRR, 2024

Evaluating privacy, security, and trust perceptions in conversational AI: A systematic review.

[BibT_eX]

[DOI]

Comput. Hum. Behav., 2024

Real-Time Joint Noise Suppression and Bandwidth Extension of Noisy Reverberant Wideband Speech.

[BibT_eX]

[DOI]

Esteban Gómez

Proceedings of the 18th International Workshop on Acoustic Signal Enhancement, 2024

Privacy PORCUPINE: Anonymization of Speaker Attributes Using Occurrence Normalization for Space-Filling Vector Quantization.

[BibT_eX]

[DOI]

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

2023

The Internet of Sounds: Convergent Trends, Insights, and Future Directions.

[BibT_eX]

[DOI]

IEEE Internet Things J., July, 2023

Interpretable Latent Space Using Space-Filling Curves for Phonetic Analysis in Voice Conversion.

[BibT_eX]

[DOI]

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Optimizing the Performance of Text Classification Models by Improving the Isotropy of the Embeddings Using a Joint Loss Function.

[BibT_eX]

[DOI]

Joseph Attieh

Abraham Woubie Zewoudie

Vladimir Vlassov

Adrian Flanagan

Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Stochastic Optimization of Vector Quantization Methods in Application to Speech and Image Processing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Low-Complexity Real-Time Neural Network for Blind Bandwidth Extension of Wideband Speech.

[BibT_eX]

[DOI]

Esteban Gómez

Proceedings of the 31st European Signal Processing Conference, 2023

2022

NSVQ: Noise Substitution in Vector Quantization for Machine Learning.

[BibT_eX]

[DOI]

IEEE Access, 2022

Voice Quality Features for Replay Attack Detection.

[BibT_eX]

[DOI]

Proceedings of the 30th European Signal Processing Conference, 2022

2021

Federated Learning for Privacy-Preserving Speaker Recognition.

[BibT_eX]

[DOI]

IEEE Access, 2021

Cancellation of Local Competing Speaker with Near-Field Localization for Distributed ad-hoc Sensor Network.

[BibT_eX]

[DOI]

Zied Lachiri

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

End-to-End Optimized Multi-Stage Vector Quantization of Spectral Envelopes for Speech and Audio Coding.

[BibT_eX]

[DOI]

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Voice-quality Features for Deep Neural Network Based Speaker Verification Systems.

[BibT_eX]

[DOI]

Lauri Koivisto

Proceedings of the 29th European Signal Processing Conference, 2021

PyAWNeS-Codec: Speech and audio codec for ad-hoc acoustic wireless sensor networks.

[BibT_eX]

[DOI]

Proceedings of the 29th European Signal Processing Conference, 2021

2020

Enhancement by postfiltering for speech and audio coding in ad-hoc sensor networks.

[BibT_eX]

[DOI]

CoRR, 2020

Users Perceptions about Teleconferencing Applications Collected through Twitter.

[BibT_eX]

[DOI]

CoRR, 2020

Acoustic Fingerprints for Access Management in Ad-Hoc Sensor Networks.

[BibT_eX]

[DOI]

Stephan Sigg

IEEE Access, 2020

Provable Consent for Voice User Interfaces.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Pervasive Computing and Communications Workshops, 2020

Perception of Privacy Measured in the Crowd - Paired Comparison on the Effect of Background Noises.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Fundamental Frequency Model for Postfiltering at Low Bitrates in a Transform-Domain Speech and Audio Codec.

[BibT_eX]

[DOI]

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Evaluation of Zero Frequency Filtering based Method for Multi-pitch Streaming of Concurrent Speech Signals.

[BibT_eX]

[DOI]

Zied Lachiri

Proceedings of the 28th European Signal Processing Conference, 2020

2019

Sound Privacy: A Conversational Speech Corpus for Quantifying the Experience of Privacy.

[BibT_eX]

[DOI]

Vishnu Vidyadhara Raju Vegesna

Anil Kumar Vuppala

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Super-Wideband Spectral Envelope Modeling for Speech Coding.

[BibT_eX]

[DOI]

Chamran Ashour

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

End-to-End Optimization of Source Models for Speech and Audio Coding Using a Machine Learning Framework.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Overlap-add Windows with Maximum Energy Concentration for Speech and Audio Processing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Robust and Responsive Acoustic Pairing of Devices Using Decorrelating Time-Frequency Modelling.

[BibT_eX]

[DOI]

Stephan Sigg

Proceedings of the 27th European Signal Processing Conference, 2019

2018

Fast Randomization for Distributed Low-Bitrate Coding of Speech and Audio.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Postfiltering Using Log-Magnitude Spectrum for Speech and Audio Coding.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Postfiltering with Complex Spectral Correlations for Speech and Audio Coding.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Dithered Quantization for Frequency-Domain Speech and Audio Coding.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

GMM-Based Iterative Entropy Coding for Spectral Envelopes of Speech and Audio.

[BibT_eX]

[DOI]

Srikanth Korse

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Speech Coding, Speech Interfaces and IOT - Opportunities and Challenges.

[BibT_eX]

[DOI]

Proceedings of the 52nd Asilomar Conference on Signals, Systems, and Computers, 2018

Optimal temporal dynamics of MFCCs for low-complexity VAD systems - a case study.

[BibT_eX]

[DOI]

Alexandra Craciun

Proceedings of the 13th ITG Symposium on Speech Communication, 2018

2017

Quadratic Programming Approach to Glottal Inverse Filtering by Joint Norm-1 and Norm-2 Optimization.

[BibT_eX]

[DOI]

Manu Airaksinen

IEEE ACM Trans. Audio Speech Lang. Process., 2017

An Unsupervised Hybrid Approach for Online Detection of Sound Scene Changes in Broadcast Content.

[BibT_eX]

[DOI]

Gökhan Sevkin

Alexandra Craciun

Proceedings of the AES International Conference Semantic Audio 2017, 2017

Estimation of the Probability Distribution of Spectral Fine Structure in the Speech Source.

[BibT_eX]

[DOI]

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Modeling formant dynamics in speech spectral envelopes.

[BibT_eX]

[DOI]

Proceedings of the 25th European Signal Processing Conference, 2017

2016

Feature Extraction Using Power-Law Adjusted Linear Prediction With Application to Speaker Recognition Under Severe Vocal Effort Mismatch.

[BibT_eX]

[DOI]

Rahim Saeidi

IEEE ACM Trans. Audio Speech Lang. Process., 2016

Entropy Coding of Spectral Envelopes for Speech and Audio Coding Using Distribution Quantization.

[BibT_eX]

[DOI]

Srikanth Korse

Tobias Jähnel

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Joint Enhancement and Coding of Speech by Incorporating Wiener Filtering in a CELP Codec.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Blind Recovery of Perceptual Models in Distributed Speech and Audio Coding.

[BibT_eX]

[DOI]

Florin Ghido

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Automatic Glottal Inverse Filtering with Non-Negative Matrix Factorization.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Noise-adaptive perceptual weighting in the AMR-WB encoder for increased speech loudness in adverse far-end noise conditions.

[BibT_eX]

[DOI]

Emma Jokinen

Proceedings of the 24th European Signal Processing Conference, 2016

Spectral Envelope Statistics for Source Modeling in Speech Enhancement.

[BibT_eX]

[DOI]

Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Coding of Parametric Models with Randomized Quantization in a Distributed Speech and Audio Codec.

[BibT_eX]

[DOI]

Proceedings of the 12th ITG Symposium on Speech Communication, 2016

2015

Decorrelating MVDR Filterbanks Using the Non-Uniform Discrete Fourier Transform.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2015

Glottal inverse filtering based on quadratic programming.

[BibT_eX]

[DOI]

Manu Airaksinen

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Intelligibility evaluation of speech coding standards in severe background noise and packet loss conditions.

[BibT_eX]

[DOI]

Emma Jokinen

Jérémie Lecomte

Nadja Schinkel-Bielefeld

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Finding line spectral frequencies using the fast fourier transform.

[BibT_eX]

[DOI]

Christian Fischer Pedersen

Grzegorz Pietrzyk

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Arithmetic coding of speech and audio spectra using tcx based on linear predictive spectral envelopes.

[BibT_eX]

[DOI]

Christian R. Helmrich

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Harmonic model for MDCT based audio coding with LPC envelope.

[BibT_eX]

[DOI]

Christian R. Helmrich

Proceedings of the 23rd European Signal Processing Conference, 2015

Envelope modeling for speech and audio processing using distribution quantization.

[BibT_eX]

[DOI]

Tobias Jähnel

Benjamin Schubert

Proceedings of the 23rd European Signal Processing Conference, 2015

Comparison of windowing schemes for speech coding.

[BibT_eX]

[DOI]

Proceedings of the 23rd European Signal Processing Conference, 2015

An evaluation of stereo speech enhancement methods for different audio-visual scenarios.

[BibT_eX]

[DOI]

Alexandra Craciun

Christian Uhle

Christian Fischer Pedersen

Proceedings of the 23rd European Signal Processing Conference, 2015

2014

Sparse time-frequency representation of speech by the vandermonde transform.

[BibT_eX]

[DOI]

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Decorrelated innovative codebooks for ACELP using factorization of autocorrelation matrix.

[BibT_eX]

[DOI]

Christian R. Helmrich

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Automatic estimation of the lip radiation effect in glottal inverse filtering.

[BibT_eX]

[DOI]

Manu Airaksinen

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Implementation and evaluation of the Vandermonde transform.

[BibT_eX]

[DOI]

Daniel Boley

Proceedings of the 22nd European Signal Processing Conference, 2014

2013

Vandermonde Factorization of Toeplitz Matrices and Applications in Filtering and Warping.

[BibT_eX]

[DOI]

IEEE Trans. Signal Process., 2013

Comparison of windowing in speech and audio coding.

[BibT_eX]

[DOI]

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Computationally efficient objective function for algebraic codebook optimization in ACELP.

[BibT_eX]

[DOI]

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

2012

Enumerative Algebraic Coding for ACELP.

[BibT_eX]

[DOI]

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2009

Stabilised weighted linear prediction.

[BibT_eX]

[DOI]

Speech Commun., 2009

Parametric AM/FM decomposition for speech and audio coding.

[BibT_eX]

[DOI]

Sascha Disch

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009

Pitch variation estimation.

[BibT_eX]

[DOI]

Stefan Bayer

Sascha Disch

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

2008

Simple proofs of root locations of two symmetric linear prediction models.

[BibT_eX]

[DOI]

Signal Process., 2008

DC-constrained linear prediction for glottal inverse filtering.

[BibT_eX]

[DOI]

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007

Minimum Separation of Line Spectral Frequencies.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2007

Effect of White-Noise Correction on Linear Predictive Coding.

[BibT_eX]

[DOI]

IEEE Signal Process. Lett., 2007

Stabilised weighted linear prediction - a robust all-pole method for speech processing.

[BibT_eX]

[DOI]

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

2006

Properties of line spectrum pair polynomials - A review.

[BibT_eX]

[DOI]

Signal Process., 2006

2005

Group delay function as a means to assess quality of glottal inverse filtering.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

A toolkit for voice inverse filtering and parametrisation.

[BibT_eX]

[DOI]

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Objective Quality Measures for Glottal Inverse Filtering of Speech Pressure Signals.

[BibT_eX]

[DOI]

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Line spectral properties of quadratic models.

[BibT_eX]

[DOI]

Proceedings of the 13th European Signal Processing Conference, 2005

2004

A time-domain interpretation for the LSP decomposition.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2004

Linear predictive method for improved spectral modeling of lower frequencies of speech with small prediction orders.

[BibT_eX]

[DOI]

IEEE Trans. Speech Audio Process., 2004

2003

On line spectral frequencies.

[BibT_eX]

[DOI]

W. Bastiaan Kleijn

IEEE Signal Process. Lett., 2003

A constrained linear predictive model with the minimum-phase property.

[BibT_eX]

[DOI]

Signal Process., 2003

Linear predictive method with low-frequency emphasis.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

On the stability of constrained linear predictive models.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

All-pole modeling of wide-band speech with symmetric linear prediction.

[BibT_eX]

[DOI]

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

Time-domain parameterization of the closing phase of glottal airflow waveform from voices over a large intensity range.

[BibT_eX]

[DOI]

Erkki Vilkman

IEEE Trans. Speech Audio Process., 2002

All-pole modeling of wide-band speech using weighted sum of the LSP polynomials.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

A time domain reformulation of linear prediction equivalent to the LSP decomposition.

[BibT_eX]

[DOI]

W. Bastiaan Kleijn

Proceedings of the IEEE International Conference on Acoustics, 2002

All-pole modeling technique based on the Weighted Sum of the LSP polynomials.

[BibT_eX]

[DOI]