We stand with Ukraine

We stand with Ukraine

Jan Skoglund

Orcid: 0009-0008-0167-4628

According to our database¹, Jan Skoglund authored at least 68 papers between 1995 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

Online presence:

on orcid.org

On csauthors.net:

Bibliography

2025

Systematic Evaluation of Time-Frequency Features for Binaural Sound Source Localization.

[DOI]

Davoud Shariat Panah

,

Alessandro Ragano

,

,

,

CoRR, November, 2025

Binaspect - A Python Library for Binaural Audio Analysis, Visualization & Feature Generation.

[DOI]

,

Davoud Shariat Panah

,

Alessandro Ragano

,

,

CoRR, October, 2025

BINAQUAL: A Full-Reference Objective Localization Similarity Metric for Binaural Audio.

[DOI]

Davoud Shariat Panah

,

,

Alessandro Ragano

,

,

CoRR, May, 2025

Binamix - A Python Library for Generating Binaural Audio Datasets.

[DOI]

,

Davoud Shariat Panah

,

Alessandro Ragano

,

,

CoRR, May, 2025

Perceptual Audio Coding: A 40-Year Historical Perspective.

[DOI]

,

Schuyler Quackenbush

,

,

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024

Editorial JSTSP NSAC Editorial.

[DOI]

,

,

,

Lars F. Villemoes

IEEE J. Sel. Top. Signal Process., December, 2024

Neural Speech and Audio Coding: Modern AI technology meets traditional codecs [Special Issue On Model-Based and Data-Driven Audio Signal Processing].

[DOI]

,

IEEE Signal Process. Mag., November, 2024

Neural Speech and Audio Coding.

[DOI]

,

CoRR, 2024

SCOREQ: Speech Quality Assessment with Contrastive Regression.

[DOI]

Alessandro Ragano

,

,

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

NOMAD: Unsupervised Learning of Perceptual Embeddings For Speech Enhancement and Non-Matching Reference Audio Quality Assessment.

[DOI]

Alessandro Ragano

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Twenty-Five Years of Evolution in Speech and Language Processing.

[DOI]

,

,

Michael A. Picheny

,

Bhuvana Ramabhadran

,

Dilek Hakkani-Tür

,

,

,

,

Jan Honza Cernocký

,

,

Abdelrahman Mohamed

IEEE Signal Process. Mag., July, 2023

A High-Rate Extension to Soundstream.

[DOI]

,

,

W. Bastiaan Kleijn

,

,

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Multi-Channel Audio Signal Generation.

[DOI]

W. Bastiaan Kleijn

,

,

Felicia S. C. Lim

,

Proceedings of the IEEE International Conference on Acoustics, 2023

LMCodec: A Low Bitrate Speech Codec with Causal Transformer Models.

[DOI]

Teerapat Jenrungrot

,

,

W. Bastiaan Kleijn

,

,

,

,

Marco Tagliasacchi

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

Speech quality assessment with WARP-Q: From similarity to subsequence dynamic time warp cost.

[DOI]

Wissam A. Jassim

,

,

,

IET Signal Process., December, 2022

SoundStream: An End-to-End Neural Audio Codec.

[DOI]

,

Alejandro Luebs

,

,

,

Marco Tagliasacchi

IEEE ACM Trans. Audio Speech Lang. Process., 2022

Ultra-Low-Bitrate Speech Coding with Pretrained Transformers.

[DOI]

,

,

,

W. Bastiaan Kleijn

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset.

[DOI]

,

,

Chandan K. A. Reddy

,

Alessandro Ragano

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021

Handling Background Noise in Neural Speech Generation.

[DOI]

,

Alejandro Luebs

,

Felicia S. C. Lim

,

,

,

W. Bastiaan Kleijn

,

CoRR, 2021

Generative Speech Coding with Predictive Variance Regularization.

[DOI]

W. Bastiaan Kleijn

,

,

,

,

Felicia S. C. Lim

,

Alejandro Luebs

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

Warp-Q: Quality Prediction for Generative Neural Speech Codecs.

[DOI]

Wissam A. Jassim

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Speech Quality Factors for Traditional and Neural-Based Low Bit Rate Vocoders.

[DOI]

Wissam A. Jassim

,

,

,

Proceedings of the Twelfth International Conference on Quality of Multimedia Experience, 2020

ViSQOL v3: An Open Source Production Ready Objective Speech and Audio Metric.

[DOI]

,

Felicia S. C. Lim

,

,

,

Feargus O'Gorman

,

Proceedings of the Twelfth International Conference on Quality of Multimedia Experience, 2020

Improving Opus Low Bit Rate Quality with Neural Speech Synthesis.

[DOI]

,

Jean-Marc Valin

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Robust Low Rate Speech Coding Based on Cloned Networks and Wavenet.

[DOI]

Felicia S. C. Lim

,

W. Bastiaan Kleijn

,

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Handling Background Noise in Neural Speech Generation.

[DOI]

,

Alejandro Luebs

,

,

Felicia S. C. Lim

,

,

,

W. Bastiaan Kleijn

,

Proceedings of the 54th Asilomar Conference on Signals, Systems, and Computers, 2020

2019

A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet.

[DOI]

Jean-Marc Valin

,

CoRR, 2019

Generative Speech Enhancement Based on Cloned Networks.

[DOI]

,

W. Bastiaan Kleijn

,

Felicia S. C. Lim

,

Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

A Real-Time Wideband Neural Vocoder at 1.6kb/s Using LPCNet.

[DOI]

Jean-Marc Valin

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Salient Speech Representations Based on Cloned Networks.

[DOI]

W. Bastiaan Kleijn

,

Felicia S. C. Lim

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

LPCNET: Improving Neural Speech Synthesis through Linear Prediction.

[DOI]

Jean-Marc Valin

,

Proceedings of the IEEE International Conference on Acoustics, 2019

2018

Ambisonics in an Ogg Opus Container.

[DOI]

,

Michael Graczyk

RFC, October, 2018

Phase-Sensitive Joint Learning Algorithms for Deep Learning-Based Speech Enhancement.

[DOI]

,

,

Turaj Shabestary

,

IEEE Signal Process. Lett., 2018

AMBIQUAL - a full reference objective quality metric for ambisonic spatial audio.

[DOI]

Miroslaw Narbutt

,

,

,

,

Proceedings of the Tenth International Conference on Quality of Multimedia Experience, 2018

Beamforming with Partial Knowledge of the Acoustic Scenario.

[DOI]

W. Bastiaan Kleijn

,

Christopher Laguna

,

Alejandro Luebs

,

Andrew MacDonald

,

Proceedings of the 20th IEEE International Workshop on Multimedia Signal Processing, 2018

Spatial Audio on the Web - Create, Compress, and Render.

[DOI]

Proceedings of the 2018 Workshop on Audio-Visual Scene Understanding for Immersive Multimedia, 2018

Exploring Tradeoffs in Models for Low-Latency Speech Enhancement.

[DOI]

Kevin W. Wilson

,

,

,

,

John R. Hershey

,

,

,

Richard F. Lyon

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Wavenet Based Low Rate Speech Coding.

[DOI]

W. Bastiaan Kleijn

,

Felicia S. C. Lim

,

Alejandro Luebs

,

,

Florian Stimberg

,

,

Thomas C. Walters

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017

Joint wideband source localization and acquisition based on a grid-shift approach.

[DOI]

Christos Tzagkarakis

,

W. Bastiaan Kleijn

,

Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Incoherent idempotent ambisonics rendering.

[DOI]

W. Bastiaan Kleijn

,

,

,

Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Streaming VR for immersion: Quality aspects of compressed spatial audio.

[DOI]

Miroslaw Narbutt

,

,

,

,

Proceedings of the 23rd International Conference on Virtual System & Multimedia, 2017

Practically efficient nonlinear acoustic echo cancellers using cascaded block RLS and FLMS adaptive filters.

[DOI]

,

,

Alejandro Luebs

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

On pre-filtering strategies for the GCC-PHAT algorithm.

[DOI]

,

Michael Graczyk

,

Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Bi-magnitude processing framework for nonlinear acoustic echo cancellation on Android devices.

[DOI]

Yiteng Arden Huang

,

,

Alejandro Luebs

Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Globally optimized least-squares post-filtering for microphone array speech enhancement.

[DOI]

Yiteng Arden Huang

,

Alejandro Luebs

,

,

W. Bastiaan Kleijn

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

An acoustic keystroke transient canceler for speech communication terminals using a semi-blind adaptive filter model.

[DOI]

Herbert Buchner

,

,

Simon J. Godsill

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

ViSQOL: an objective speech quality model.

[DOI]

,

,

Anil C. Kokaram

,

EURASIP J. Audio Speech Music. Process., 2015

Detection and suppression of keyboard transient noise in audio streams with auxiliary keybed microphone.

[DOI]

Simon J. Godsill

,

Herbert Buchner

,

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Direct-to-Reverberant Ratio estimation using a null-steered beamformer.

[DOI]

,

Alastair H. Moore

,

Patrick A. Naylor

,

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014

Perceived Audio Quality for Streaming Stereo Music.

[DOI]

,

,

,

,

Anil C. Kokaram

,

Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Sinusoidal interpolation across missing data.

[DOI]

W. Bastiaan Kleijn

,

Turaj Zakizadeh Shabestary

,

Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014

An analysis of the effect of larynx-synchronous averaging on dereverberation of voiced speech.

[DOI]

Alastair H. Moore

,

Patrick A. Naylor

,

Proceedings of the 22nd European Signal Processing Conference, 2014

2013

Rate-distortion optimization for multichannel audio compression.

[DOI]

,

,

W. Bastiaan Kleijn

Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Monitoring the effects of temporal clipping on voIP speech quality.

[DOI]

,

,

Anil C. Kokaram

,

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA.

[DOI]

,

,

Anil C. Kokaram

,

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Improved Prediction of Nearly-Periodic Signals.

[DOI]

W. Bastiaan Kleijn

,

Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, Proceedings, RWTH Aachen University, Germany, September 4th, 2012

ViSQOL: The Virtual Speech Quality Objective Listener.

[DOI]

,

,

Anil C. Kokaram

,

Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, Proceedings, RWTH Aachen University, Germany, September 4th, 2012

2000

On time-frequency masking in voiced speech.

[DOI]

,

W. Bastiaan Kleijn

IEEE Trans. Speech Audio Process., 2000

Vector quantization based on Gaussian mixture models.

[DOI]

,

IEEE Trans. Speech Audio Process., 2000

A combined WI and MELP coder at 5.2 kbps.

[DOI]

,

,

John S. Collura

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

Interframe LSF quantization for noisy channels.

[DOI]

Thomas Eriksson

,

,

IEEE Trans. Speech Audio Process., 1999

Performance bounds for LPC spectrum quantization.

[DOI]

,

,

Jonas Samuelsson

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

Analysis and quantization of glottal pulse shapes.

[DOI]

Speech Commun., 1998

On the significance of temporal masking in speech coding.

[DOI]

,

W. Bastiaan Kleijn

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

On nonlinear utilization of intervector dependency in vector quantization.

[DOI]

Mikael Skoglund

,

Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997

Predictive VQ for noisy channel spectrum coding: AR or MA?

[DOI]

,

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996

Exploiting interframe correlation in spectral quantization: a study of different memory VQ schemes.

[DOI]

Thomas Eriksson

,

,

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995

Vector quantization of glottal pulses.

[DOI]

Thomas Eriksson

,

,

Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Loading...