Jan Skoglund

Orcid: 0009-0008-0167-4628

According to our database1, Jan Skoglund authored at least 59 papers between 1995 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Twenty-Five Years of Evolution in Speech and Language Processing.
IEEE Signal Process. Mag., July, 2023

NOMAD: Unsupervised Learning of Perceptual Embeddings for Speech Enhancement and Non-matching Reference Audio Quality Assessment.
CoRR, 2023

A High-Rate Extension to Soundstream.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Multi-Channel Audio Signal Generation.
Proceedings of the IEEE International Conference on Acoustics, 2023

LMCodec: A Low Bitrate Speech Codec with Causal Transformer Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Speech quality assessment with WARP-Q: From similarity to subsequence dynamic time warp cost.
IET Signal Process., December, 2022

SoundStream: An End-to-End Neural Audio Codec.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Ultra-Low-Bitrate Speech Coding with Pretrained Transformers.
Proceedings of the Interspeech 2022, 2022

Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset.
Proceedings of the Interspeech 2022, 2022

2021
Handling Background Noise in Neural Speech Generation.
CoRR, 2021

Generative Speech Coding with Predictive Variance Regularization.
Proceedings of the IEEE International Conference on Acoustics, 2021

Warp-Q: Quality Prediction for Generative Neural Speech Codecs.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Speech Quality Factors for Traditional and Neural-Based Low Bit Rate Vocoders.
Proceedings of the Twelfth International Conference on Quality of Multimedia Experience, 2020

ViSQOL v3: An Open Source Production Ready Objective Speech and Audio Metric.
Proceedings of the Twelfth International Conference on Quality of Multimedia Experience, 2020

Improving Opus Low Bit Rate Quality with Neural Speech Synthesis.
Proceedings of the Interspeech 2020, 2020

Robust Low Rate Speech Coding Based on Cloned Networks and Wavenet.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Handling Background Noise in Neural Speech Generation.
Proceedings of the 54th Asilomar Conference on Signals, Systems, and Computers, 2020

2019
A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet.
CoRR, 2019

Generative Speech Enhancement Based on Cloned Networks.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

A Real-Time Wideband Neural Vocoder at 1.6kb/s Using LPCNet.
Proceedings of the Interspeech 2019, 2019

Salient Speech Representations Based on Cloned Networks.
Proceedings of the Interspeech 2019, 2019

LPCNET: Improving Neural Speech Synthesis through Linear Prediction.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Ambisonics in an Ogg Opus Container.
RFC, October, 2018

Phase-Sensitive Joint Learning Algorithms for Deep Learning-Based Speech Enhancement.
IEEE Signal Process. Lett., 2018

AMBIQUAL - a full reference objective quality metric for ambisonic spatial audio.
Proceedings of the Tenth International Conference on Quality of Multimedia Experience, 2018

Beamforming with Partial Knowledge of the Acoustic Scenario.
Proceedings of the 20th IEEE International Workshop on Multimedia Signal Processing, 2018

Spatial Audio on the Web - Create, Compress, and Render.
Proceedings of the 2018 Workshop on Audio-Visual Scene Understanding for Immersive Multimedia, 2018

Exploring Tradeoffs in Models for Low-Latency Speech Enhancement.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Wavenet Based Low Rate Speech Coding.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Joint wideband source localization and acquisition based on a grid-shift approach.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Incoherent idempotent ambisonics rendering.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Streaming VR for immersion: Quality aspects of compressed spatial audio.
Proceedings of the 23rd International Conference on Virtual System & Multimedia, 2017

Practically efficient nonlinear acoustic echo cancellers using cascaded block RLS and FLMS adaptive filters.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
On pre-filtering strategies for the GCC-PHAT algorithm.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Bi-magnitude processing framework for nonlinear acoustic echo cancellation on Android devices.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Globally optimized least-squares post-filtering for microphone array speech enhancement.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

An acoustic keystroke transient canceler for speech communication terminals using a semi-blind adaptive filter model.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
ViSQOL: an objective speech quality model.
EURASIP J. Audio Speech Music. Process., 2015

Detection and suppression of keyboard transient noise in audio streams with auxiliary keybed microphone.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Direct-to-Reverberant Ratio estimation using a null-steered beamformer.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Perceived Audio Quality for Streaming Stereo Music.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014

Sinusoidal interpolation across missing data.
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014

An analysis of the effect of larynx-synchronous averaging on dereverberation of voiced speech.
Proceedings of the 22nd European Signal Processing Conference, 2014

2013
Rate-distortion optimization for multichannel audio compression.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2013

Monitoring the effects of temporal clipping on voIP speech quality.
Proceedings of the INTERSPEECH 2013, 2013

Robustness of speech quality metrics to background noise and network degradations: Comparing ViSQOL, PESQ and POLQA.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Improved Prediction of Nearly-Periodic Signals.
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, Proceedings, RWTH Aachen University, Germany, September 4th, 2012

ViSQOL: The Virtual Speech Quality Objective Listener.
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, Proceedings, RWTH Aachen University, Germany, September 4th, 2012

2000
On time-frequency masking in voiced speech.
IEEE Trans. Speech Audio Process., 2000

Vector quantization based on Gaussian mixture models.
IEEE Trans. Speech Audio Process., 2000

A combined WI and MELP coder at 5.2 kbps.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Interframe LSF quantization for noisy channels.
IEEE Trans. Speech Audio Process., 1999

Performance bounds for LPC spectrum quantization.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Analysis and quantization of glottal pulse shapes.
Speech Commun., 1998

On the significance of temporal masking in speech coding.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

On nonlinear utilization of intervector dependency in vector quantization.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Predictive VQ for noisy channel spectrum coding: AR or MA?
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996
Exploiting interframe correlation in spectral quantization: a study of different memory VQ schemes.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Vector quantization of glottal pulses.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995


  Loading...