Tim Fingscheidt

Orcid: 0000-0002-8895-5041

According to our database, Tim Fingscheidt authored at least 215 papers between 1995 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
Generalization by Adaptation: Diffusion-Based Domain Extension for Domain-Generalized Semantic Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

2023
Coded Speech Quality Measurement by a Non-Intrusive PESQ-DNN.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Employing Real Training Data for Deep Noise Suppression.
CoRR, 2023

Survey on Unsupervised Domain Adaptation for Semantic Segmentation for Visual Perception in Automated Driving.
IEEE Access, 2023

Efficient Deep Acoustic Echo Suppression with Condition-Aware Training.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

Relaxed Attention for Transformer Models.
Proceedings of the International Joint Conference on Neural Networks, 2023

A Re-Parameterized Vision Transformer (ReVT) for Domain-Generalized Semantic Segmentation.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

What Does Really Count? Estimating Relevance of Corner Cases for Semantic Segmentation in Automated Driving.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

A Super-Resolution Training Paradigm Based on Low-Resolution Data Only to Surpass the Technical Limits of STEM and STM Microscopy.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

A Novel Benchmark for Refinement of Noisy Localization Labels in Autolabeled Datasets for Object Detection.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Improvements to Image Reconstruction-Based Performance Prediction for Semantic Segmentation in Highly Automated Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

End-to-end Amodal Video Instance Segmentation.
Proceedings of the 34th British Machine Vision Conference Workshop Proceedings, 2023

Parameter-Efficient Cross-Language Transfer Learning for a Language-Modular Audiovisual Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

Deep Long Term Prediction for Semantic Segmentation in Autonomous Driving.
Proceedings of the Advanced Analytics and Learning on Temporal Data, 2023

2022
SVDistNet: Self-Supervised Near-Field Distance Estimation on Surround View Fisheye Cameras.
IEEE Trans. Intell. Transp. Syst., 2022

Continual BatchNorm Adaptation (CBNA) for Semantic Segmentation.
IEEE Trans. Intell. Transp. Syst., 2022

Deep Noise Suppression Maximizing Non-Differentiable PESQ Mediated by a Non-Intrusive PESQNet.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Improving Performance of Semantic Segmentation CycleGANs by Noise Injection into the Latent Segmentation Space.
CoRR, 2022

Reconfigurable Intelligent Surface Enabled Spatial Multiplexing with Fully Convolutional Network.
CoRR, 2022

Unsupervised BatchNorm Adaptation (UBNA): A Domain Adaptation Method for Semantic Segmentation Without Using Source Domain Representations.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2022

Transformer-Based Lip-Reading with Regularized Dropout and Relaxed Attention.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Does a PESQNet (Loss) Require a Clean Reference Input? The Original PESQ Does, But ACR Listening Tests Don't.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

Self-Attention With Restricted Time Context and Resolution in DNN Speech Enhancement.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

Bandwidth-Scalable Fully Mask-Based Deep FCRN Acoustic Echo Cancellation and Postfiltering.
Proceedings of the 17th International Workshop on Acoustic Signal Enhancement, 2022

Amodal Cityscapes: A New Dataset, its Generation, and an Amodal Semantic Segmentation Challenge Baseline.
Proceedings of the 2022 IEEE Intelligent Vehicles Symposium, 2022

3DHD CityScenes: High-Definition Maps in High-Density Point Clouds.
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2022

Detecting Adversarial Perturbations in Multi-Task Perception.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Deep Residual Echo Suppression and Noise Reduction: A Multi-Input FCRN Approach in a Hybrid Speech Enhancement System.
Proceedings of the IEEE International Conference on Acoustics, 2022

Joint Prediction of Amodal and Visible Semantic Segmentation for Automated Driving.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Adaptive Bitrate Quantization Scheme Without Codebook for Learned Image Compression.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

On the Choice of Data for Efficient Training and Validation of End-to-End Driving Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Performance Prediction for Semantic Segmentation by a Self-Supervised Image Reconstruction Decoder.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
Online Performance Prediction of Perception DNNs by Multi-Task Learning With Depth Estimation.
IEEE Trans. Intell. Transp. Syst., 2021

The Vulnerability of Semantic Segmentation Networks to Adversarial Attacks in Autonomous Driving: Enhancing Extensive Environment Sensing.
IEEE Signal Process. Mag., 2021

Components loss for neural networks in mask-based speech enhancement.
EURASIP J. Audio Speech Music. Process., 2021

Inspect, Understand, Overcome: A Survey of Practical Methods for AI Safety.
CoRR, 2021

Y²-Net FCRN for Acoustic Echo and Noise Suppression.
CoRR, 2021

Corner Cases for Visual Perception in Automated Driving: Some Guidance on Detection Approaches.
CoRR, 2021

SynDistNet: Self-Supervised Monocular Fisheye Camera Distance Estimation Synergized with Semantic Segmentation for Autonomous Driving.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Improving Convolutional Recurrent Neural Networks for Speech Emotion Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

An Application-Driven Conceptualization of Corner Cases for Perception in Highly Automated Driving.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2021

Detection of Collective Anomalies in Images for Automated Driving Using an Earth Mover's Deviation (EMDEV) Measure.
Proceedings of the IEEE Intelligent Vehicles Symposium Workshops, 2021

Continual Unsupervised Domain Adaptation for Semantic Segmentation by Online Frequency Domain Style Transfer.
Proceedings of the 24th IEEE International Intelligent Transportation Systems Conference, 2021

DNN-Based Recognition of Pole-Like Objects in LiDAR Point Clouds.
Proceedings of the 24th IEEE International Intelligent Transportation Systems Conference, 2021

Quo Vadis? Meaningful Multiple Trajectory Hypotheses Prediction in Autonomous Driving.
Proceedings of the 24th IEEE International Intelligent Transportation Systems Conference, 2021

Deep Noise Suppression with Non-Intrusive PESQNet Supervision Enabling the Use of Real Training Data.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Y²-Net FCRN for Acoustic Echo and Noise Suppression.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Multi-Encoder Learning and Stream Fusion for Transformer-Based End-to-End Automatic Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

From a Fourier-Domain Perspective on Adversarial Examples to a Wiener Filter Defense for Semantic Segmentation.
Proceedings of the International Joint Conference on Neural Networks, 2021

Description of Corner Cases in Automated Driving: Goals and Challenges.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

AEC in a Netshell: On Target and Topology Choices for FCRN Acoustic Echo Cancellation.
Proceedings of the IEEE International Conference on Acoustics, 2021

A New DCASE 2017 Rare Sound Event Detection Benchmark Under Equal Training Data: CRNN With Multi-Width Kernels.
Proceedings of the IEEE International Conference on Acoustics, 2021

An Unsupervised Temporal Consistency (TC) Loss To Improve the Performance of Semantic Segmentation Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Improving Online Performance Prediction for Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Multi-Head Fusion Attention for Transformer-Based End-to-End Automatic Speech Recognition.
Proceedings of the 14th ITG Conference on Speech Communication, online, September 29, 2021

2020
Speech enhancement by LSTM-based noise suppression followed by CNN-based speech restoration.
EURASIP J. Adv. Signal Process., 2020

Multichannel speaker interference reduction using frequency domain adaptive filtering.
EURASIP J. Audio Speech Music. Process., 2020

Transferable Universal Adversarial Perturbations Using Generative Models.
CoRR, 2020

Terminology and Analysis of Map Deviations in Urban Domains: Towards Dependability for HD Maps in Automated Vehicles.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

Focussing Learned Image Compression to Semantic Classes for V2X Applications.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

Systematization of Corner Cases for Visual Perception in Automated Driving.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

Scalar and Vector Quantization for Learned Image Compression: A Study on the Effects of MSE and GAN Loss in Various Spaces.
Proceedings of the 23rd IEEE International Conference on Intelligent Transportation Systems, 2020

Class-Incremental Learning for Semantic Segmentation Re-Using Neither Old Data Nor Old Labels.
Proceedings of the 23rd IEEE International Conference on Intelligent Transportation Systems, 2020

openDD: A Large-Scale Roundabout Drone Dataset.
Proceedings of the 23rd IEEE International Conference on Intelligent Transportation Systems, 2020

INTERSPEECH 2020 Deep Noise Suppression Challenge: A Fully Convolutional Recurrent Network (FCRN) for Joint Dereverberation and Denoising.
Proceedings of the Interspeech 2020, 2020

BLSTM-Driven Stream Fusion for Automatic Speech Recognition: Novel Methods and a Multi-Size Window Fusion Example.
Proceedings of the Interspeech 2020, 2020

Using Separate Losses for Speech and Noise in Mask-Based Speech Enhancement.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Fully Convolutional Recurrent Networks for Speech Enhancement.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Multichannel Kalman-Based Wiener Filter Approach for Speaker Interference Reduction in Meetings.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Beyond the DCASE 2017 Challenge on Rare Sound Event Detection: A Proposal for a More Realistic Training and Test Framework.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Multichannel Acoustic Echo Cancellation Applied to Microphone Leakage Reduction in Meetings.
Proceedings of the 28th European Signal Processing Conference, 2020

Self-supervised Monocular Depth Estimation: Solving the Dynamic Object Problem by Semantic Guidance.
Proceedings of the Computer Vision - ECCV 2020, 2020

Unsupervised Temporal Consistency Metric for Video Segmentation in Highly-Automated Driving.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Self-Supervised Domain Mismatch Estimation for Autonomous Perception.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Improved Noise and Attack Robustness for Semantic Segmentation by Using Multi-Task Training with Self-Supervised Depth Estimation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Robust Semantic Segmentation by Redundant Networks With a Layer-Specific Loss Contribution and Majority Vote.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

A Self-Supervised Feature Map Augmentation (FMA) Loss and Combined Augmentations Finetuning to Efficiently Improve the Robustness of CNNs.
Proceedings of the CSCS '20: Computer Science in Cars Symposium, 2020

2019
Convolutional Neural Networks to Enhance Coded Speech.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

DNN-Based Cepstral Excitation Manipulation for Speech Enhancement.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Sinusoidal-Based Lowband Synthesis for Artificial Speech Bandwidth Extension.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

GAN- vs. JPEG2000 Image Compression for Distributed Automotive Perception: Higher Peak SNR Does Not Mean Better Semantic Segmentation.
CoRR, 2019

A Perceptual Weighting Filter Loss for DNN Training in Speech Enhancement.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Separated Noise Suppression and Speech Restoration: LSTM-Based Speech Enhancement in Two Stages.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

Improvement of Speech Residuals for Speech Enhancement.
Proceedings of the 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2019

On Low-Bitrate Image Compression for Distributed Automotive Perception: Higher Peak SNR Does Not Mean Better Semantic Segmentation.
Proceedings of the 2019 IEEE Intelligent Vehicles Symposium, 2019

Towards Tactical Maneuver Detection for Autonomous Driving Based on Vision Only.
Proceedings of the 2019 IEEE Intelligent Vehicles Symposium, 2019

Towards Corner Case Detection for Autonomous Driving.
Proceedings of the 2019 IEEE Intelligent Vehicles Symposium, 2019

Analysis of the Effect of Various Input Representations for LSTM-Based Trajectory Prediction.
Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019

Improved Measurement Noise Covariance Estimation for N-channel Feedback Cancellation Based on the Frequency Domain Adaptive Kalman Filter.
Proceedings of the IEEE International Conference on Acoustics, 2019

Learning to Dequantize Speech Signals by Primal-dual Networks: an Approach for Acoustic Sensor Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

Concatenated Identical DNN (CI-DNN) to Reduce Noise-Type Dependence in DNN-Based Speech Enhancement.
Proceedings of the 27th European Signal Processing Conference, 2019

Unsupervised Domain Adaptation to Improve Image Segmentation Quality Both in the Source and Target Domain.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

On the Robustness of Redundant Teacher-Student Frameworks for Semantic Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

On Temporal Context Information for Hybrid BLSTM-Based Phoneme Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
DNN-Supported Speech Enhancement With Cepstral Estimation of Both Excitation and Envelope.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Artificial Speech Bandwidth Extension Using Deep Neural Networks for Wideband Spectral Envelope Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

DenseNet BLSTM for Acoustic Modeling in Robust ASR.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

A New TIMIT Benchmark for Context-Independent Phone Recognition Using Turbo Fusion.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

A CNN Postprocessor to Enhance Coded Speech.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

A Priori SNR Computation for Speech Enhancement Based on Cepstral Envelope Estimation.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Enhancing the EVS Codec in Wideband Mode by Blind Artificial Bandwidth Extension to Superwideband.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

What Do Classifiers Actually Learn? A Case Study on Emotion Recognition Datasets.
Proceedings of the Interspeech 2018, 2018

A Priori SNR Estimation Using Discriminative Non-Negative Matrix Factorization.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multichannel Speaker Activity Detection for Meetings.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An Efficient Residual Echo Suppression for Multi-Channel Acoustic Echo Cancellation Based on the Frequency-Domain Adaptive Kalman Filter.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

A Simple Cepstral Domain DNN Approach to Artificial Speech Bandwidth Extension.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Nonlinear Prediction of Speech by Echo State Networks.
Proceedings of the 26th European Signal Processing Conference, 2018

Enhancement of G.711-Coded Speech Providing Quality Higher Than Uncoded.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

On the Effects of Speaker Gender in Emotion Recognition Training Data.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

DNN/CNN Acoustic Model Turbo Fusion for Phoneme Recognition.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

On the Benefit of a Stereo Acoustic Echo Cancellation in an In-Car Communication System.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

2017
Instantaneous A Priori SNR Estimation by Cepstral Excitation Manipulation.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

An Instrumental Quality Measure for Artificially Bandwidth-Extended Speech Signals.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

A DNN regression approach to speech enhancement by artificial bandwidth extension.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

A Delay-Flexible Stereo Acoustic Echo Cancellation for DFT-Based In-Car Communication (ICC) Systems.
Proceedings of the Interspeech 2017, 2017

Two-stage speech enhancement with manipulation of the cepstral excitation.
Proceedings of the Hands-free Speech Communications and Microphone Arrays, 2017

Turbo fusion of magnitude and phase information for DNN-based phoneme recognition.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Turbo Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Artificial bandwidth extension using deep neural networks for spectral envelope estimation.
Proceedings of the IEEE International Workshop on Acoustic Signal Enhancement, 2016

Soft linear discriminant analysis (SLDA) for pattern recognition with ambiguous reference labels: Application to social signal processing.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

System-compatible robustness improvement for new generation DECT decoders by G.722 soft-decision decoding.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Evaluating instrumental measures of speech quality using Bayesian model selection: Correlations can be misleading!
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

A subjective listening test of six different artificial bandwidth extension approaches in English, Chinese, German, and Korean.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Improving Vector Quantization-Based Decoders for Correlated Processes in Error-Free Transmission.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Introducing Block-Wise Processing into Turbo Viterbi ASR.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

EXIT Charts for Turbo Automatic Speech Recognition: A Case Study.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

Objective Assessment of Artificial Speech Bandwidth Extension Approaches.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

2015
A Priori SNR Estimation Using Air- and Bone-Conduction Microphones.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

A computational analysis of the neural bases of Bayesian inference.
NeuroImage, 2015

Linking speech enhancement and error concealment based on recursive MMSE estimation.
EURASIP J. Adv. Signal Process., 2015

An acoustic event detection framework and evaluation metric for surveillance in cars.
Proceedings of the INTERSPEECH 2015, 2015

An iterative speech model-based a priori SNR estimator.
Proceedings of the INTERSPEECH 2015, 2015

Acoustic event source localization for surveillance in reverberant environments supported by an event onset detection.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

An improved ADPCM decoder by adaptively controlled quantization interval centroids.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
A Turbo-Decoding Weighted Forward-Backward Algorithm for Multimodal Speech Recognition.
Proceedings of the Situated Dialog in Speech-Based Human-Computer Interaction, 2014

HMM-based artificial bandwidth extension supported by neural networks.
Proceedings of the 14th International Workshop on Acoustic Signal Enhancement, 2014

Writer Identification for Historical Arabic Documents.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

An Historical Handwritten Arabic Dataset for Segmentation-Free Word Spotting - HADARA80P.
Proceedings of the 14th International Conference on Frontiers in Handwriting Recognition, 2014

Document Writer Analysis with Rejection for Historical Arabic Manuscripts.
Proceedings of the 14th International Conference on Frontiers in Handwriting Recognition, 2014

A compact formulation of turbo audio-visual speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2014

Variable-length versus fixed-length coding: On tradeoffs for soft-decision decoding.
Proceedings of the IEEE International Conference on Acoustics, 2014

On speech quality assessment of artificial bandwidth extension.
Proceedings of the IEEE International Conference on Acoustics, 2014

An automotive wideband stereo acoustic echo canceler using frequency-domain adaptive filtering.
Proceedings of the 22nd European Signal Processing Conference, 2014

Improving scalar quantization for correlated processes using adaptive codebooks only at the receiver.
Proceedings of the 22nd European Signal Processing Conference, 2014

Automatic recognition of wideband telephone speech with limited amount of matched training data.
Proceedings of the 22nd European Signal Processing Conference, 2014

Towards Acoustic Event Detection for Surveillance in Cars.
Proceedings of the 11th ITG Symposium on Speech Communication, 2014

Multimodal ASR by Turbo Decoding vs. Feature Concatenation: Where to Perform Information Integration?
Proceedings of the 11th ITG Symposium on Speech Communication, 2014

A New Evaluation Methodology for Speech Emotion Recognition With Confidence Output.
Proceedings of the 11th ITG Symposium on Speech Communication, 2014

Scalar Quantization With Optimized Receiver-Sided Adaptive Codebook Reconstruction Levels Controlled by a Predictor.
Proceedings of the 11th ITG Symposium on Speech Communication, 2014

A Wideband Automotive Hands-Free System for Mobile HD Voice Services.
Proceedings of the Smart Mobile In-Vehicle Systems, Next Generation Advancements, 2014

2013
Robust Ultra-Low Latency Soft-Decision Decoding of Linear PCM Audio.
IEEE Trans. Speech Audio Process., 2013

A dynamic multi-channel speech enhancement system for distributed microphones in a car environment.
EURASIP J. Adv. Signal Process., 2013

Speech quality prediction for artificial bandwidth extension algorithms.
Proceedings of the INTERSPEECH 2013, 2013

On Evaluation of Segmentation-Free Word Spotting Approaches without Hard Decisions.
Proceedings of the 12th International Conference on Document Analysis and Recognition, 2013

On the use of explicit redundancy for delayless soft-decision audio decoding.
Proceedings of the IEEE International Conference on Acoustics, 2013

Towards reproducible evaluation of automotive hands-free systems in dynamic conditions.
Proceedings of the IEEE International Conference on Acoustics, 2013

Impact of hearing impairment on fricative intelligibility for artificially bandwidth-extended telephone speech in noise.
Proceedings of the IEEE International Conference on Acoustics, 2013

Improved AMR wideband error concealment for mobile communications.
Proceedings of the 21st European Signal Processing Conference, 2013

Density-induced oversampling for highly imbalanced datasets.
Proceedings of the Image Processing: Machine Vision Applications VI, 2013

2012
MMSE Log-Spectral Amplitude Estimation Under Speech Presence Uncertainty Using Generalized Γ Speech Priors.
Proceedings of the IWAENC 2012 - International Workshop on Acoustic Signal Enhancement, Proceedings, RWTH Aachen University, Germany, September 4th, 2012

Black box measurement of musical tones produced by noise reduction systems.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

MMSE speech enhancement under speech presence uncertainty assuming (generalized) gamma speech priors throughout.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Training of Classifiers for Quality Control of On-Line Laser Brazing Processes with Highly Imbalanced Datasets.
Proceedings of the Pattern Recognition, 2012

A Weighted Log Kurtosis Ratio Measure for Instrumental Musical Tones Assessment in Wideband Speech.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

A Beamformer Post-Filter with Hybrid Noise Coherence Functions Instrumentally Optimized Using a Figure of Merit.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

Survey of Speech Enhancement Supported by a Bone Conduction Microphone.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

On Iterative Exchange of Soft State Information in Two-Channel Automatic Speech Recognition.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

A Measurement Methodology for Automotive Teleconferencing.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

NLMS-Supported Decoding of High-Quality Speech for Burst Channels.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

A Multi-Channel Quality Assessment Setup Applied to a Distributed Microphone Speech Enhancement System With Spectral Boosting.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

Reference-free SNR Measurement for Narrowband and Wideband Speech Signals in Car Noise.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

Comparison and Signal-Component-Wise Instrumental Evaluation of MMSE Log-Spectral Amplitude Estimation Under Speech Presence Uncertainty.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

On Improving Telephone Speech Intelligibility for Hearing Impaired Persons.
Proceedings of the 10th ITG Conference on Speech Communication, 2012

2011
A Two-Dimensional Channel Model for Digital Data Storage on Microfilm.
IEEE Trans. Commun., 2011

A Data-Driven Approach to A Priori SNR Estimation.
IEEE Trans. Speech Audio Process., 2011

On-line Detection of Imperfections in Laser-brazed Joints.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2011), 2011

A data-driven post-filter design based on spatially and temporally smoothed a priori SNR.
Proceedings of the IEEE International Conference on Acoustics, 2011

Delayless soft-decision decoding of high-quality audio transmitted over AWGN channels.
Proceedings of the IEEE International Conference on Acoustics, 2011

Speech enhancement using a joint MAP estimator with Gaussian mixture model for (non-)stationary noise.
Proceedings of the IEEE International Conference on Acoustics, 2011

Delayless soft-decision decoding of high-quality audio with adaptively shaped priors.
Proceedings of the 19th European Signal Processing Conference, 2011

Robust acoustic speaker localization with distributed microphones.
Proceedings of the 19th European Signal Processing Conference, 2011

MMSE speech spectral amplitude estimation assuming non-Gaussian noise.
Proceedings of the 19th European Signal Processing Conference, 2011

2010
Performance Evaluation of Iterative Channel Codes for Digital Data Storage on Microfilm.
Proceedings of the Global Communications Conference, 2010

A New Hybrid Post-Filter using a Multichannel Decision-Directed Approach for A Priori SNR Estimation.
Proceedings of the 9. ITG-Fachtagung Sprachkommunikation 2010, 2010

A Priori SNR Estimation Using an Artificial Neural Network.
Proceedings of the 9. ITG-Fachtagung Sprachkommunikation 2010, 2010

WTIMIT: The TIMIT Speech Corpus Transmitted Over the 3G AMR Wideband Mobile Network.
Proceedings of the 9. ITG-Fachtagung Sprachkommunikation 2010, 2010

Investigations on Offline Artificial Bandwidth Extension of Telephone Speech Databases.
Proceedings of the 9. ITG-Fachtagung Sprachkommunikation 2010, 2010

2009
Entropy-based feature analysis for speech recognition.
Proceedings of the INTERSPEECH 2009, 2009

A statistical framework for artificial bandwidth extension exploiting speech waveform and phonetic transcription.
Proceedings of the 17th European Signal Processing Conference, 2009

2008
Environment-Optimized Speech Enhancement.
IEEE Trans. Speech Audio Process., 2008

Hands-free system with low-delay subband acoustic echo control and noise reduction.
Proceedings of the IEEE International Conference on Acoustics, 2008

Towards objective quality assessment of speech enhancement systems in a black box approach.
Proceedings of the IEEE International Conference on Acoustics, 2008

An HMM-based artificial bandwidth extension evaluated by cross-language training and test.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Joint source and channel coding: from the beginning until the 'EXIT'.
Eur. Trans. Telecommun., 2007

A Particle Filtering Algorithm for Audiovisual Speaker Localisation.
Proceedings of the 4th Workshop on Positioning, Navigation and Communication, 2007

Speech enhancement with improved a posteriori SNR computation.
Proceedings of the INTERSPEECH 2007, 2007

Quality assessment of speech enhancement systems by separation of enhanced speech, noise, and echo.
Proceedings of the INTERSPEECH 2007, 2007

2006
A novel environment-dependent speech enhancement method with optimized memory footprint.
Proceedings of the INTERSPEECH 2006, 2006

2005
Robust speech recognition for mobile devices in car noise.
Proceedings of the INTERSPEECH 2005, 2005

Overcoming the Statistical Independence Assumption w.r.t. Frequency in Speech Enhancement.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
Evaluation of a small-footprint text and language independent speaker recognition system on forensic data.
Proceedings of the ODYSSEY 2004 - The Speaker and Language Recognition Workshop, Toledo, Spain, May 31, 2004

Revisiting some model-based and data-driven denoising algorithms in Aurora 2 context.
Proceedings of the INTERSPEECH 2004, 2004

Generalized stochastic principle for microphone array speech enhancement and applications to car environments.
Proceedings of the 2004 12th European Signal Processing Conference, 2004

2003
An evaluation of VTS and IMM for speaker verification in noise.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Joint source-channel (de-)coding for mobile communications.
IEEE Trans. Commun., 2002

Network-based vs. distributed speech recognition in adaptive multi-rate wireless systems.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Softbit speech decoding: a new approach to error concealment.
IEEE Trans. Speech Audio Process., 2001

A candidate proposal for a 3GPP adaptive multi-rate wideband speech codec.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Combined Source/Channel (De-)Coding: Can a Priori Information be Used Twice?
Proceedings of the 2000 IEEE International Conference on Communications: Global Convergence Through Communications, 2000

1999
Von der Soft-Decision-Kanaldecodierung zur Softbit-Sprachdecodierung.
Informationstechnik Tech. Inform., 1999

1998
Robust speech decoding: can error concealment be better than error correction?
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Robust GSM speech decoding using the channel decoder's soft output.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Robust speech decoding: a universal approach to bit error concealment.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1995
Implementation aspects of the GSM half-rate speech codec.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

