Torbjørn Svendsen

Orcid: 0000-0003-0578-7941

According to our database1, Torbjørn Svendsen authored at least 74 papers between 1984 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
On the Predictive Power of Objective Intelligibility Metrics for the Subjective Performance of Deep Complex Convolutional Recurrent Speech Enhancement Networks.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

2023
Developing an AI-Assisted Low-Resource Spoken Language Learning App for Children.
IEEE Access, 2023

Improving Generalization of Norwegian ASR with Limited Linguistic Resources.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

A character-based analysis of impacts of dialects on end-to-end Norwegian ASR.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

Using Modified Adult Speech as Data Augmentation for Child Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Semantically Meaningful Metrics for Norwegian ASR Systems.
Proceedings of the Interspeech 2022, 2022

wav2vec2-based Speech Rating System for Children with Speech Sound Disorder.
Proceedings of the Interspeech 2022, 2022

2021
A DNN Based Speech Enhancement Approach to Noise Robust Acoustic-to-Articulatory Inversion.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2021

Raw Speech-to-Articulatory Inversion by Temporal Filtering and Decimation.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

A Two-Stage Deep Modeling Approach to Articulatory Inversion.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Sequence-to-Sequence Articulatory Inversion Through Time Convolution of Sub-Band Frequency Signals.
Proceedings of the Interspeech 2020, 2020

Transfer Learning of Articulatory Information Through Phone Information.
Proceedings of the Interspeech 2020, 2020

2019
A Comparative Study of Deep Learning Techniques on Frame-Level Speech Data Classification.
Circuits Syst. Signal Process., 2019

A Phonetic-Level Analysis of Different Input Features for Articulatory Inversion.
Proceedings of the Interspeech 2019, 2019

Text-Independent Speaker ID Employing 2D-CNN for Automatic Video Lecture Categorization in a MOOC Setting.
Proceedings of the 31st IEEE International Conference on Tools with Artificial Intelligence, 2019

A Study on the Performance Evaluation of Machine Learning Models for Phoneme Classification.
Proceedings of the 2019 11th International Conference on Machine Learning and Computing, 2019

Evaluating Acoustic Feature Maps in 2D-CNN for Speaker Identification.
Proceedings of the 2019 11th International Conference on Machine Learning and Computing, 2019

Text-Independent Speaker ID for Automatic Video Lecture Classification Using Deep Learning.
Proceedings of the 2019 5th International Conference on Computing and Artificial Intelligence, 2019

2018
Acoustic Feature Comparison for Different Speaking Rates.
Proceedings of the Human-Computer Interaction. Interaction Technologies, 2018

2015
Combining NDHMM and phonetic feature detection for speech recognition.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
An artificial neural network approach to automatic speech processing.
Neurocomputing, 2014

2013
A Bottom-Up Modular Search Approach to Large Vocabulary Continuous Speech Recognition.
IEEE Trans. Speech Audio Process., 2013

Universal attribute characterization of spoken languages for automatic spoken language recognition.
Comput. Speech Lang., 2013

Non-negative durational HMM.
Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

Synthetic speaker models using VTLN to improve the performance of children in mismatched speaker conditions for ASR.
Proceedings of the INTERSPEECH 2013, 2013

2012
Experiments on Cross-Language Attribute Detection and Phone Recognition With Minimal Target-Specific Training Data.
IEEE Trans. Speech Audio Process., 2012

2011
iVector Approach to Phonotactic Language Recognition.
Proceedings of the INTERSPEECH 2011, 2011

Frequency-Warped and Stabilized Time-Varying Cepstral Coefficients.
Proceedings of the INTERSPEECH 2011, 2011

A Bottom-Up Stepwise Knowledge-Integration Approach to Large Vocabulary Continuous Speech Recognition Using Weighted Finite State Machines.
Proceedings of the INTERSPEECH 2011, 2011

Pronunciation variation modeling of non-native proper names by discriminative tree search.
Proceedings of the IEEE International Conference on Acoustics, 2011


2010
On the use of discriminative and non-discriminative pronunciation priors in pronunciation variation modeling of non-native proper names.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Spontal-N: A Corpus of Interactional Spoken Norwegian.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

NameDat: A Database of English Proper Names Spoken by Native Norwegians.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

A survey on recent progress in the ASAT/SIRKUS paradigm.
Proceedings of the 7th International Symposium on Chinese Spoken Language Processing, 2010

Intra-frame variability as a predictor of frame classifiability.
Proceedings of the INTERSPEECH 2010, 2010

Exploiting context-dependency and acoustic resolution of universal speech attribute models in spoken language recognition.
Proceedings of the INTERSPEECH 2010, 2010

A minimum classification error approach to pronunciation variation modeling of non-native proper names.
Proceedings of the INTERSPEECH 2010, 2010

Experimental studies on continuous speech recognition using neural architectures with "adaptive" hidden activation functions.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Exploring universal attribute characterization of spoken languages for spoken language recognition.
Proceedings of the INTERSPEECH 2009, 2009

A phonetic feature based lattice rescoring approach to LVCSR.
Proceedings of the IEEE International Conference on Acoustics, 2009

Lexicon adaptation for subword speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
RUNDKAST: an Annotated Norwegian Broadcast News Speech Corpus.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

A penalized logistic regression approach to detection based phone classification.
Proceedings of the INTERSPEECH 2008, 2008

Toward a detector-based universal phone recognizer.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Towards bottom-up continuous phone recognition.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
FonDat1: A Speech Synthesis Corpus for Norwegian.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

2005
Distributed ASR using speech coder data for efficient feature vector representation.
Proceedings of the INTERSPEECH 2005, 2005

Comparing spectral distance measures for join cost optimization in concatenative speech synthesis.
Proceedings of the INTERSPEECH 2005, 2005

Unit selection synthesis database development using utterance verification.
Proceedings of the INTERSPEECH 2005, 2005

2003
Multilingual phone clustering for recognition of spontaneous indonesian speech utilising pronunciation modelling techniques.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Cross-lingual pronunciation modelling for indonesian speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Evaluation of Pronunciation Variants in the ASR Lexicon for Different Speaking Styles.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

2001
Fast adaptation using constrained affine transformations with hierarchical priors.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
TABOR - a norwegian spoken dialogue system for bus travel information.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Stochastic modeling of semantic content for use IN a spoken dialogue system.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

ASR-based subtitling of live TV-programs for the hearing impaired.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1999
Maximum likelihood modelling of pronunciation variation.
Speech Commun., 1999

On-line captioning of TV-programs for the hearing impaired.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1997
Incorporating linguistic knowledge and automatic baseform generation in acoustic subword unit based speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996
Combined Optimisation of Baseforms and Subword Models for an Hmm Based Speech Recogniser.
Proceedings of the Fourth International Symposium on Signal Processing and Its Applications, 1996

1995
Optimizing baseforms for HMM-based speech recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1994
Segmental quantization of speech spectral information.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
Efficient quantization of speech spectral information.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Cost232: speech recognition over the telephone line.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

A time-frequency segmental neural network for phoneme recognition.
Proceedings of the IEEE International Conference on Acoustics, 1993

1991
ANN-based speech recognition using a preprocessor for non-linear time compression.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

1990
Automatic alignment of phonemic labels with continuous speech.
Proceedings of the First International Conference on Spoken Language Processing, 1990

1989
An improved sub-word based speech recognizer.
Proceedings of the IEEE International Conference on Acoustics, 1989

1987
On the automatic segmentation of speech signals.
Proceedings of the IEEE International Conference on Acoustics, 1987

1986
Multi-dimensional quantization applied to predictive coding of speech.
Proceedings of the IEEE International Conference on Acoustics, 1986

1985
A study of three coders (sub-band, RELP and MPE) for speech with additive white noise.
Proceedings of the IEEE International Conference on Acoustics, 1985

1984
Tree encoding of the LPC residual.
Proceedings of the IEEE International Conference on Acoustics, 1984


  Loading...