Tanja Schultz

Orcid: 0000-0002-9809-7028

Affiliations:
  • University of Bremen, Cognitive Systems Lab, Germany
  • Carnegie Mellon University, Pittsburgh, USA (former)


According to our database1, Tanja Schultz authored at least 389 papers between 1992 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
MS2OD: outlier detection using minimum spanning tree and medoid selection.
Mach. Learn. Sci. Technol., March, 2024

Brain Topology Modeling With EEG-Graphs for Auditory Spatial Attention Detection.
IEEE Trans. Biomed. Eng., January, 2024

STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition.
CoRR, 2024

Uncovering the Full Potential of Visual Grounding Methods in VQA.
CoRR, 2024

Really Can't Hold On Anymore? Physiological Indicators Versus Self-Reported Motivation Drop During Jogging.
Proceedings of the 17th International Joint Conference on Biomedical Engineering Systems and Technologies, 2024

Can Electromyography Alone Reveal Facial Action Units? A Pilot EMG-Based Action Unit Recognition Study with Real-Time Validation.
Proceedings of the 17th International Joint Conference on Biomedical Engineering Systems and Technologies, 2024

Gait Parameter Estimation from a Single Privacy Preserving Depth Sensor.
Proceedings of the 17th International Joint Conference on Biomedical Engineering Systems and Technologies, 2024

2023
The nested hierarchy of overt, mouthed, and imagined speech activity evident in intracranial recordings.
NeuroImage, April, 2023

EEG Correlates of Distractions and Hesitations in Human-Robot Interaction: A LabLinking Pilot Study.
Multimodal Technol. Interact., March, 2023

Sensor-Based Human Activity and Behavior Research: Where Advanced Sensing and Recognition Technologies Meet.
Sensors, 2023

Enhancing Subject-Independent EEG-Based Auditory Attention Decoding with WGAN and Pearson Correlation Coefficient.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2023

Data augmentation strategies for low resource conversational code-switching.
Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023

Few-shot meta multilabel classifier for low resource accented code-switched speech.
Proceedings of the 26th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2023

Discrimination of Overt, Mouthed, and Imagined Speech Activity using Stereotactic EEG.
Proceedings of the 11th International IEEE/EMBS Conference on Neural Engineering, 2023

XAnet: Cross-Attention Between EEG of Left and Right Brain for Auditory Attention Decoding.
Proceedings of the 11th International IEEE/EMBS Conference on Neural Engineering, 2023

An Overview of the ICASSP Special Session on AI Security and Privacy in Speech and Audio Processing.
Proceedings of the ACM Multimedia Asia Workshops, 2023

Multi-Speaker Speech Synthesis from Electromyographic Signals by Soft Speech Unit Prediction.
Proceedings of the IEEE International Conference on Acoustics, 2023

Multi-Head Attention and GRU for Improved Match-Mismatch Classification of Speech Stimulus and EEG Response.
Proceedings of the IEEE International Conference on Acoustics, 2023

Measuring Faithful and Plausible Visual Grounding in VQA.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

On a Real Real-Time Wearable Human Activity Recognition System.
Proceedings of the 16th International Joint Conference on Biomedical Engineering Systems and Technologies, 2023

2022
STAnet: A Spatiotemporal Attention Network for Decoding Auditory Spatial Attention From EEG.
IEEE Trans. Biomed. Eng., 2022

Multilingual speech recognition for GlobalPhone languages.
Speech Commun., 2022

TSSEARCH: Time Series Subsequence Search Library.
SoftwareX, 2022

Visually Grounded VQA by Lattice-based Retrieval.
CoRR, 2022

Self-Supervised Learning of Neural Speech Representations From Unlabeled Intracranial Signals.
IEEE Access, 2022

Evaluation of an Engagement-Aware Recommender System for People with Dementia.
Proceedings of the UMAP '22: 30th ACM Conference on User Modeling, Adaptation and Personalization, Barcelona, Spain, July 4, 2022

SmartHelm: User Studies from Lab to Field for Attention Modeling.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2022

Merged Pitch Histograms and Pitch-duration Histograms.
Proceedings of the 19th International Conference on Signal Processing and Multimedia Applications, 2022

Interactive and Interpretable Online Human Activity Recognition.
Proceedings of the 2022 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events, 2022

Normalization of code-switched text for speech synthesis.
Proceedings of the Interspeech 2022, 2022

Challenges of using longitudinal and cross-domain corpora on studies of pathological speech.
Proceedings of the Interspeech 2022, 2022

Blind Language Separation: Disentangling Multilingual Cocktail Party Voices by Language.
Proceedings of the Interspeech 2022, 2022

Deep Learning Approaches for Detecting Alzheimer's Dementia from Conversational Speech of ILSE Study.
Proceedings of the Interspeech 2022, 2022

An Overview of the FIRST ICASSP Special Session on Computer Audition for Healthcare.
Proceedings of the IEEE International Conference on Acoustics, 2022

Hybrid sub-word segmentation for handling long tail in morphologically rich low resource languages.
Proceedings of the IEEE International Conference on Acoustics, 2022

Experts Versus All-Rounders: Target Language Extraction for Multiple Target Languages.
Proceedings of the IEEE International Conference on Acoustics, 2022

Towards Closed-Loop Speech Synthesis from Stereotactic EEG: A Unit Selection Approach.
Proceedings of the IEEE International Conference on Acoustics, 2022

Exploring Dementia Detection from Speech: Cross Corpus Analysis.
Proceedings of the IEEE International Conference on Acoustics, 2022

Contributions of Stereotactic EEG Electrodes in Grey and White Matter to Speech Activity Detection.
Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2022

How Long Are Various Types of Daily Activities? Statistical Analysis of a Multimodal Wearable Sensor-based Human Activity Dataset.
Proceedings of the 15th International Joint Conference on Biomedical Engineering Systems and Technologies, 2022

A Practical Wearable Sensor-based Human Activity Recognition Research Pipeline.
Proceedings of the 15th International Joint Conference on Biomedical Engineering Systems and Technologies, 2022

Interpretable High-level Features for Human Activity Recognition.
Proceedings of the 15th International Joint Conference on Biomedical Engineering Systems and Technologies, 2022

High-Level Features for Human Activity Recognition and Modeling.
Proceedings of the Biomedical Engineering Systems and Technologies, 2022

2021
Predicting Activation Liking of People With Dementia.
Frontiers Comput. Sci., 2021

Corrigendum: CSL-SHARE: A Multimodal Wearable Sensor-Based Human Activity Dataset.
Frontiers Comput. Sci., 2021

CSL-SHARE: A Multimodal Wearable Sensor-Based Human Activity Dataset.
Frontiers Comput. Sci., 2021

Verbal fluency in normal aging and cognitive decline: Results of a longitudinal study.
Comput. Speech Lang., 2021

Adventurer's Treasure Hunt: A Transparent System for Visually Grounded Compositional Visual Question Answering based on Scene Graphs.
CoRR, 2021

Linking Labs: Interconnecting Experimental Environments.
CoRR, 2021

Speech Activity Detection from Stereotactic EEG.
Proceedings of the 2021 IEEE International Conference on Systems, Man, and Cybernetics, 2021

Audio-Visual Recognition of Emotional Engagement of People with Dementia.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Visual Speech for Obstructive Sleep Apnea Detection.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

GlobalPhone Mix-To-Separate Out of 2: A Multilingual 2000 Speakers Mixtures Database for Speech Separation.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Universal Speaker Extraction in the Presence and Absence of Target Speakers for Speech of One and Two Talkers.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

3rd Workshop on Modeling Socio-Emotional and Cognitive Processes from Multimodal Data in the Wild.
Proceedings of the ICMI '21: International Conference on Multimodal Interaction, 2021

End-to-End Multilingual Automatic Speech Recognition for Less-Resourced Languages: The Case of Four Ethiopian Languages.
Proceedings of the IEEE International Conference on Acoustics, 2021


Motion Units: Generalized Sequence Modeling of Human Activities for Sensor-Based Activity Recognition.
Proceedings of the 29th European Signal Processing Conference, 2021

Low-Latency Auditory Spatial Attention Detection Based on Spectro-Spatial Features from EEG.
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021

Speech Synthesis from Stereotactic EEG using an Electrode Shaft Dependent Multi-Input Convolutional Neural Network Approach.
Proceedings of the 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2021

Feature Space Reduction for Human Activity Recognition based on Multi-channel Biosignals.
Proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies, 2021

Target Language Extraction at Multilingual Cocktail Parties.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

Automatic Speech Recognition for Dementia Screening using ILSE-Interviews.
Proceedings of the 14th ITG Conference on Speech Communication, online, September 29, 2021

2020
TSFEL: Time Series Feature Extraction Library.
SoftwareX, 2020

DNN-Based Multilingual Automatic Speech Recognition for Wolaytta using Oromo Speech.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Building Language Models for Morphological Rich Low-Resource Languages using Data from Related Donor Languages: the Case of Uyghur.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Analysis of GlobalPhone and Ethiopian Languages Speech Corpora for Multilingual ASR.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Automatic Speech Recognition for Uyghur through Multilingual Acoustic Modeling.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

From Human to Robot Everyday Activity.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Development of Multilingual ASR Using GlobalPhone for Less-Resourced Languages: The Case of Ethiopian Languages.
Proceedings of the Interspeech 2020, 2020

Malayalam-English Code-Switched: Grapheme to Phoneme System.
Proceedings of the Interspeech 2020, 2020

CSL-EMG_Array: An Open Access Corpus for EMG-to-Speech Conversion.
Proceedings of the Interspeech 2020, 2020

Towards Silent Paralinguistics: Deriving Speaking Mode and Speaker ID from Electromyographic Signals.
Proceedings of the Interspeech 2020, 2020

Toward Silent Paralinguistics: Speech-to-EMG - Retrieving Articulatory Muscle Activity from Speech.
Proceedings of the Interspeech 2020, 2020

Speech Spectrogram Estimation from Intracranial Brain Activity Using a Quantization Approach.
Proceedings of the Interspeech 2020, 2020

Automatic Speech Recognition for ILSE-Interviews: Longitudinal Conversational Speech Recordings Covering Aging and Cognitive Decline.
Proceedings of the Interspeech 2020, 2020

Multilingual Acoustic and Language Modeling for Ethio-Semitic Languages.
Proceedings of the Interspeech 2020, 2020

Towards Engagement Recognition of People with Dementia in Care Settings.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020

Model-based Prediction of Exogeneous and Endogeneous Attention Shifts During an Everyday Activity.
Proceedings of the Companion Publication of the 2020 International Conference on Multimodal Interaction, 2020

Modeling Socio-Emotional and Cognitive Processes from Multimodal Data in the Wild.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020

SmartHelm: Towards Multimodal Detection of Attention in an Outdoor Augmented Reality Biking Scenario.
Proceedings of the Companion Publication of the 2020 International Conference on Multimodal Interaction, 2020

DNN-Based Speech Recognition for Globalphone Languages.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Deep Neural Networks Based Automatic Speech Recognition for Four Ethiopian Languages.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Platform for Studying Self-Repairing Auto-Corrections in Mobile Text Entry based on Brain Activity, Gaze, and Context.
Proceedings of the CHI '20: CHI Conference on Human Factors in Computing Systems, 2020

Feature Space Reduction for Multimodal Human Activity Recognition.
Proceedings of the 13th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2020), 2020

2019
Interpretation of convolutional neural networks for speech spectrogram regression from intracranial recordings.
Neurocomputing, 2019

Visual and Memory-based HCI Obstacles: Behaviour-based Detection and User Interface Adaptations Analysis.
Proceedings of the 2019 IEEE International Conference on Systems, Man and Cybernetics, 2019

Augmented Reality Interface for Smart Home Control using SSVEP-BCI and Eye Gaze.
Proceedings of the 2019 IEEE International Conference on Systems, Man and Cybernetics, 2019

Decoding Lip Movements During Continuous Speech using Electrocorticography.
Proceedings of the 2019 9th International IEEE/EMBS Conference on Neural Engineering (NER), 2019

Biosignal Processing for Human-Machine Interaction.
Proceedings of the Interspeech 2019, 2019

Comparative Analysis of Think-Aloud Methods for Everyday Activities in the Context of Cognitive Robotics.
Proceedings of the Interspeech 2019, 2019

Adaptation of an EMG-Based Speech Recognizer via Meta-Learning.
Proceedings of the 2019 IEEE Global Conference on Signal and Information Processing, 2019

Towards Restoration of Articulatory Movements: Functional Electrical Stimulation of Orofacial Muscles.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

Decoding Mental Workload in Virtual Environments: A fNIRS Study using an Immersive n-back Task.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

A Wearable Real-time Human Activity Recognition System using Biosensors Integrated into a Knee Bandage.
Proceedings of the 12th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2019), 2019

Speech Reveals Future Risk of Developing Dementia: Predictive Dementia Screening from Biographic Interviews.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Improving Fundamental Frequency Generation in EMG-to-Speech Conversion Using a Quantization Approach.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Selecting Features for Automatic Screening for Dementia Based on Speech.
Proceedings of the Speech and Computer - 20th International Conference, 2018

Detecting Memory-Based Interaction Obstacles with a Recurrent Neural Model of User Behavior.
Proceedings of the 23rd International Conference on Intelligent User Interfaces, 2018

Investigating the Effect of Audio Duration on Dementia Detection Using Acoustic Features.
Proceedings of the Interspeech 2018, 2018

Domain-Adversarial Training for Session Independent EMG-based Speech Recognition.
Proceedings of the Interspeech 2018, 2018

Investigating Objective Intelligibility in Real-Time EMG-to-Speech Conversion.
Proceedings of the Interspeech 2018, 2018

Investigating static and sequential models for intervention-free selection using multimodal data of EEG and eye tracking.
Proceedings of the Workshop on Modeling Cognitive Processes from Multimodal Data, 2018

Modeling Cognitive Processes from Multimodal Signals.
Proceedings of the 2018 on International Conference on Multimodal Interaction, 2018

Bio signal-based Spoken Communication.
Proceedings of the Fourth International Conference, 2018

interpretation of convolutional neural networks for speech regression from electrocorticography.
Proceedings of the 26th European Symposium on Artificial Neural Networks, 2018

ASK: A Framework for Data Acquisition and Activity Recognition.
Proceedings of the 11th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2018), 2018

Automatic Screening for Transition into Dementia using Speech.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

Session-Independent Array-Based EMG-to-Speech Conversion using Convolutional Neural Networks.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

A comparison of EMG-to-Speech Conversion for Isolated and Continuous Speech.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

2017
Towards Continuous Speech Recognition for BCI.
Proceedings of the Brain-Computer Interface Research - A State-of-the-Art Summary 5, 2017

Biosignal-Based Spoken Communication: A Survey.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Introduction to the Special Issue on Biosignal-Based Spoken Communication.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Bremen Big Data Challenge 2017: Predicting University Cafeteria Load.
Proceedings of the KI 2017: Advances in Artificial Intelligence, 2017

Manual and Automatic Transcriptions in Dementia Detection from Speech.
Proceedings of the Interspeech 2017, 2017

Automatic classification of auto-correction errors in predictive text entry based on EEG and context information.
Proceedings of the 19th ACM International Conference on Multimodal Interaction, 2017

2016
Increased gamma band power during movement planning coincides with motor memory retrieval.
NeuroImage, 2016

Word segmentation and pronunciation extraction from phoneme sequences through cross-lingual word-to-phoneme alignment.
Comput. Speech Lang., 2016

Biosignal-based Cognitive Systems.
Proceedings of the 3rd International Conference on Physiological Computing Systems (PhyCS 2016), 2016

Starring into the void?: Classifying Internal vs. External Attention from EEG.
Proceedings of the 9th Nordic Conference on Human-Computer Interaction, Gothenburg, Sweden, October 23, 2016

Towards Automatic Transcription of ILSE ― an Interdisciplinary Longitudinal Study of Adult Development and Aging.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Speech-Based Detection of Alzheimer's Disease in Conversational German.
Proceedings of the Interspeech 2016, 2016

To be Defined.
Proceedings of the 5th International Conference on Pattern Recognition Applications and Methods, 2016

Intervention-free selection using EEG and eye tracking.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

Towards direct speech synthesis from ECoG: A pilot study.
Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2016

An initial investigation into the real-time conversion of facial surface EMG signals to audible speech.
Proceedings of the 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2016

Detection of Intra-Personal Development of Cognitive Impairment From Conversational Speech.
Proceedings of the 12th ITG Symposium on Speech Communication, 2016

2015
Syntactic and Semantic Features For Code-Switching Factored Language Models.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Dummy Model Based Workload Modeling.
Proceedings of the 2015 IEEE International Conference on Systems, 2015

Model-Based Evaluation of Playing Strategies in a Memo Game for Elderly Users.
Proceedings of the 2015 IEEE International Conference on Systems, 2015

Hybrid fNIRS-EEG based discrimination of 5 levels of memory load.
Proceedings of the 7th International IEEE/EMBS Conference on Neural Engineering, 2015

Joint optimization for discriminative, compact and robust Brain-Computer Interfacing.
Proceedings of the 7th International IEEE/EMBS Conference on Neural Engineering, 2015

Telemanipulation with force-based display of proximity fields.
Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2015

Continuous speech recognition from ECoG.
Proceedings of the INTERSPEECH 2015, 2015

Codebook clustering for unit selection based EMG-to-speech conversion.
Proceedings of the INTERSPEECH 2015, 2015

Direct conversion from facial myoelectric signals to speech using Deep Neural Networks.
Proceedings of the 2015 International Joint Conference on Neural Networks, 2015

Cross-lingual lexical language discovery from audio data using multiple translations.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Investigating deep learning for fNIRS based BCI.
Proceedings of the 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2015

Design and Evaluation of a Self-Correcting Gesture Interface based on Error Potentials from EEG.
Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, 2015

Advancing Muscle-Computer Interfaces with High-Density Electromyography.
Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, 2015

Fusion and Comparison of IMU and EMG Signals for Wearable Gesture Recognition.
Proceedings of the Biomedical Engineering Systems and Technologies, 2015

Recognizing Hand and Finger Gestures with IMU based Motion and EMG based Muscle Activity Sensing.
Proceedings of the BIOSIGNALS 2015, 2015

2014
Tackling Speaking Mode Varieties in EMG-Based Speech Recognition.
IEEE Trans. Biomed. Eng., 2014

Web-based tools and methods for rapid pronunciation dictionary creation.
Speech Commun., 2014

Automatic speech recognition for under-resourced languages: A survey.
Speech Commun., 2014

Introduction to the special issue on processing under-resourced languages.
Speech Commun., 2014

Airwriting: a wearable handwriting recognition system.
Pers. Ubiquitous Comput., 2014

Towards automatic speech recognition without pronunciation dictionary, transcribed speech and text resources in the target language using cross-lingual word-to-phoneme alignment.
Proceedings of the 4th Workshop on Spoken Language Technologies for Under-resourced Languages, 2014

Combining grapheme-to-phoneme converter outputs for enhanced pronunciation generation in low-resource scenarios.
Proceedings of the 4th Workshop on Spoken Language Technologies for Under-resourced Languages, 2014

Automatic detection of anglicisms for the pronunciation dictionary generation: a case study on our German IT corpus.
Proceedings of the 4th Workshop on Spoken Language Technologies for Under-resourced Languages, 2014

Features for factored language models for code-Switching speech.
Proceedings of the 4th Workshop on Spoken Language Technologies for Under-resourced Languages, 2014

GlobalPhone: Pronunciation Dictionaries in 20 Languages.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Conversion from facial myoelectric signals to speech: a unit selection approach.
Proceedings of the INTERSPEECH 2014, 2014

Towards real-life application of EMG-based speech recognition by using unsupervised adaptation.
Proceedings of the INTERSPEECH 2014, 2014

The EMG-UKA corpus for electromyographic speech processing.
Proceedings of the INTERSPEECH 2014, 2014

Investigating the learning effect of multilingual bottle-neck features for ASR.
Proceedings of the INTERSPEECH 2014, 2014

Improving ASR performance on non-native speech using multilingual and crosslingual information.
Proceedings of the INTERSPEECH 2014, 2014

BioKIT - real-time decoder for biosignal processing.
Proceedings of the INTERSPEECH 2014, 2014

Methods for efficient semi-automatic pronunciation dictionary bootstrapping.
Proceedings of the INTERSPEECH 2014, 2014

Combining recurrent neural networks and factored language models during decoding of code-Switching speech.
Proceedings of the INTERSPEECH 2014, 2014

Comparing approaches to convert recurrent neural networks into backoff language models for efficient decoding.
Proceedings of the INTERSPEECH 2014, 2014

Investigating Intrusiveness of Workload Adaptation.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Spatio-Temporal Event Selection in Basic Surveillance Tasks using Eye Tracking and EEG.
Proceedings of the 7th Workshop on Eye Gaze in Intelligent Human Machine Interaction: Eye-Gaze & Multimodality, 2014

Compensation of recording position shifts for a myoelectric Silent Speech Recognizer.
Proceedings of the IEEE International Conference on Acoustics, 2014

Multilingual deep neural network based acoustic modeling for rapid language adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2014

Fundamental frequency generation for whisper-to-audible speech conversion.
Proceedings of the IEEE International Conference on Acoustics, 2014

Connectivity based feature-level filtering for single-trial EEG BCIs.
Proceedings of the IEEE International Conference on Acoustics, 2014

Model-Based Identification of EEG Markers for Learning Opportunities in an Associative Learning Task with Delayed Feedback.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2014, 2014

Pattern learning with deep neural networks in EMG-based speech recognition.
Proceedings of the 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2014

Combining feature extraction and classification for fNIRS BCIs by regularized least squares optimization.
Proceedings of the 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2014

Spatial Artifact Detection for Multi-channel EMG-based Speech Recognition.
Proceedings of the BIOSIGNALS 2014, 2014

Enhancement of EMG-based Thai number words classification using frame-based time domain features with stacking filter.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Exploration of the Impact of Maximum Entropy in Recurrent Neural Network Language Models for Code-Switching Speech.
Proceedings of the First Workshop on Computational Approaches to Code Switching@EMNLP 2014, 2014

2013
Airwriting: bringing text entry to wearable computers.
XRDS, 2013

Biosignale-basierte Mensch-Maschine Schnittstellen.
Autom., 2013

An Investigation of Code-Switching Attitude Dependent Language Modeling.
Proceedings of the Statistical Language and Speech Processing, 2013

Pronunciation Extraction from Phoneme Sequences through Cross-Lingual Word-to-Phoneme Alignment.
Proceedings of the Statistical Language and Speech Processing, 2013

Locating user attention using eye tracking and EEG for spatio-temporal event selection.
Proceedings of the 18th International Conference on Intelligent User Interfaces, 2013

Multilingual multilayer perceptron for rapid language adaptation between and across language families.
Proceedings of the INTERSPEECH 2013, 2013

Unsupervised language model adaptation for automatic speech recognition of broadcast news using web 2.0.
Proceedings of the INTERSPEECH 2013, 2013

Experiments towards a better LVCSR system for tamil.
Proceedings of the INTERSPEECH 2013, 2013

GlobalPhone: A multilingual text & speech database in 20 languages.
Proceedings of the IEEE International Conference on Acoustics, 2013

Statistical machine translation based text normalization with crowdsourcing.
Proceedings of the IEEE International Conference on Acoustics, 2013

Rapid bootstrapping of a Ukrainian large vocabulary continuous speech recognition system.
Proceedings of the IEEE International Conference on Acoustics, 2013

Recurrent neural network language modeling for code switching conversational speech.
Proceedings of the IEEE International Conference on Acoustics, 2013

Compressed signal representation for inertial sensor signals.
Proceedings of the 2013 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2013

Artifact removal algorithm for an EMG-based Silent Speech Interface.
Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2013

Classification of mental tasks in the prefrontal cortex using fNIRS.
Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2013

Subject-to-subject transfer for CSP based BCIs: Feature space transformation and decision-level fusion.
Proceedings of the 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2013

Array-based Electromyographic Silent Speech Interface.
Proceedings of the BIOSIGNALS 2013, 2013

Application of Electrode Arrays for Artifact Removal in an Electromyographic Silent Speech Interface.
Proceedings of the Biomedical Engineering Systems and Technologies, 2013

Human Activity Recognition for an Intelligent Knee Orthosis.
Proceedings of the BIOSIGNALS 2013, 2013

Session-independent EEG-based Workload Recognition.
Proceedings of the BIOSIGNALS 2013, 2013

Profiling Arousal in Response to Complex Stimuli using Biosignals.
Proceedings of the BIOSIGNALS 2013, 2013

Neighbour selection and adaptation for rapid speaker-dependent ASR.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Combination of Recurrent Neural Networks and Factored Language Models for Code-Switching Language Modeling.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Continuous Recognition of Affective States by Functional Near Infrared Spectroscopy Signals.
Proceedings of the 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, 2013

2012
On-line Action Recognition from Sparse Feature Flow.
Proceedings of the VISAPP 2012, 2012

Integration of language identification into a recognition system for spoken conversations containing code-Switches.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012

Multilingual bottle-neck features and its application for under-resourced languages.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012

Hausa large vocabulary continuous speech recognition.
Proceedings of the Third Workshop on Spoken Language Technologies for Under-resourced Languages, 2012

Word segmentation through cross-lingual word-to-phoneme alignment.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Active learning for accent adaptation in Automatic Speech Recognition.
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012

Semi-supervised learning for speech recognition in the context of accent adaptation.
Proceedings of the 2012 Symposium on Machine Learning in Speech and Language Processing, 2012

Airwriting: demonstrating mobile text input by 3D-space handwriting.
Proceedings of the 17th International Conference on Intelligent User Interfaces, 2012

Airwriting: Hands-Free Mobile Text Input by Spotting and Continuous Recognition of 3d-Space Handwriting with Inertial Sensors.
Proceedings of the 16th International Symposium on Wearable Computers, 2012

Initialization Schemes for Multilayer Perceptron Training and their Impact on ASR Performance using Multilingual Data.
Proceedings of the INTERSPEECH 2012, 2012

Automatic Error Recovery for Pronunciation Dictionaries.
Proceedings of the INTERSPEECH 2012, 2012

Enhanced Polyphone Decision Tree Adaptation for Accented Speech Recognition.
Proceedings of the INTERSPEECH 2012, 2012

Cross-Subject Classification of Speaking Modes Using fNIRS.
Proceedings of the Neural Information Processing - 19th International Conference, 2012

Vision-based handwriting recognition for unrestricted text input in mid-air.
Proceedings of the International Conference on Multimodal Interaction, 2012

Modeling gender dependency in the Subspace GMM framework.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A first speech recognition system for Mandarin-English code-switch conversational speech.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Grapheme-to-phoneme model generation for Indo-European languages.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Further investigations on EMG-to-speech conversion.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Towards single pass discriminative training for speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Initial Experiments with Tamil LVCSR.
Proceedings of the 2012 International Conference on Asian Language Processing, 2012

Speaking mode recognition from functional Near Infrared Spectroscopy.
Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2012

Filling a glass of water: Continuously decoding the speed of 3D hand movements from EEG signals.
Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2012

Decision-tree based Analysis of Speaking Mode Discrepancies in EMG-based Speech Recognition.
Proceedings of the BIOSIGNALS 2012, 2012

2011
An EEG Adaptive Information System for an Empathic Robot.
Int. J. Soc. Robotics, 2011


JAM: Java-based Associative Memory.
Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems, 2011

Combined intention, activity, and motion recognition for a humanoid household robot.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Investigation of Cross-Show Speaker Diarization.
Proceedings of the INTERSPEECH 2011, 2011

Investigations on Speaking Mode Discrepancies in EMG-Based Speech Recognition.
Proceedings of the INTERSPEECH 2011, 2011

Rapid Building of an ASR System for Under-Resourced Languages Based on Multilingual Unsupervised Training.
Proceedings of the INTERSPEECH 2011, 2011

Tue-SeA Real-Time Speech Command Detector for a Smart Control Room.
Proceedings of the INTERSPEECH 2011, 2011

Analysis of Dialectal Influence in Pan-Arabic ASR.
Proceedings of the INTERSPEECH 2011, 2011

Generalized Baum-Welch Algorithm and its Implication to a New Extended Baum-Welch Algorithm.
Proceedings of the INTERSPEECH 2011, 2011

Impact of Different Feedback Mechanisms in EMG-Based Speech Recognition.
Proceedings of the INTERSPEECH 2011, 2011

Multimodal person independent recognition of workload related biosignal patterns.
Proceedings of the 13th International Conference on Multimodal Interfaces, 2011

Analysis of phone confusion in EMG-based speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2011

Cross-language bootstrapping based on completely unsupervised training using multilingual A-stabil.
Proceedings of the IEEE International Conference on Acoustics, 2011

Estimation of fundamental frequency from surface electromyographic data: EMG-to-F0.
Proceedings of the IEEE International Conference on Acoustics, 2011

Session-independent EMG-based Speech Recognition.
Proceedings of the BIOSIGNALS 2011, 2011

Biosignals and Interfaces.
Proceedings of the BIODEVICES 2011, 2011

Online Recognition of Facial Actions for Natural EEG-Based BCI Applications.
Proceedings of the Affective Computing and Intelligent Interaction, 2011

2010
Modeling coarticulation in EMG-based continuous speech recognition.
Speech Commun., 2010

Silent speech interfaces.
Speech Commun., 2010

Guest Editorial.
Speech Commun., 2010

An Adaptive Information System for an Empathic Robot Using EEG Data.
Proceedings of the Social Robotics - Second International Conference on Social Robotics, 2010

Optimization on Vietnamese large vocabulary speech recognition.
Proceedings of the 2nd Workshop on Spoken Language Technologies for Under-Resourced Languages, 2010

Multilingual a-stabil: A new confidence score for multilingual unsupervised training.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Online Workload Recognition from EEG Data during Cognitive Tests and Human-Machine Interaction.
Proceedings of the KI 2010: Advances in Artificial Intelligence, 2010

BiosignalsStudio: A Flexible Framework for Biosignal Capturing and Processing.
Proceedings of the KI 2010: Advances in Artificial Intelligence, 2010

Towards Semantic Segmentation of Human Motion Sequences.
Proceedings of the KI 2010: Advances in Artificial Intelligence, 2010

Rapid bootstrapping of five eastern european languages using the rapid language adaptation toolkit.
Proceedings of the INTERSPEECH 2010, 2010

Text normalization based on statistical machine translation and internet user support.
Proceedings of the INTERSPEECH 2010, 2010

Wiktionary as a source for automatic pronunciation extraction.
Proceedings of the INTERSPEECH 2010, 2010

Utterance selection for speech acts in a cognitive tourguide scenario.
Proceedings of the INTERSPEECH 2010, 2010

The 2010 CMU GALE speech-to-text system.
Proceedings of the INTERSPEECH 2010, 2010

Impact of lack of acoustic feedback in EMG-based silent speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Improvements to generalized discriminative feature transformation for speech recognition.
Proceedings of the INTERSPEECH 2010, 2010

Multimodal Recognition of Cognitive Workload for Multitasking in the Car.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

ICCHP Keynote: Recognizing Silent and Weak Speech Based on Electromyography.
Proceedings of the Computers Helping People with Special Needs, 2010

Speaker identification with distant microphone speech.
Proceedings of the IEEE International Conference on Acoustics, 2010

A Spectral Mapping Method for EMG-based Recognition of Silent Speech.
Proceedings of the B-Interface 2010, 2010

Airwriting recognition using wearable motion sensors.
Proceedings of the 1st Augmented Human International Conference, 2010

2009
Introduction to the Special Issue on Processing Morphologically Rich Languages.
IEEE Trans. Speech Audio Process., 2009

Towards an EEG-based emotion recognizer for humanoid robots.
Proceedings of the 18th IEEE International Symposium on Robot and Human Interactive Communication, 2009

Enhancement of human computer interaction with facial electromyographic sensors.
Proceedings of the 21st Australasian Computer-Human Interaction Conference, 2009

Incremental Adaptation of Speech-to-Speech Translation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Speaker identification using warped MVDR cepstral features.
Proceedings of the INTERSPEECH 2009, 2009

Impact of different speaking modes on EMG-based speech recognition.
Proceedings of the INTERSPEECH 2009, 2009

Synthesizing speech from electromyography using voice transformation techniques.
Proceedings of the INTERSPEECH 2009, 2009

Improving speaker segmentation via speaker identification and text segmentation.
Proceedings of the INTERSPEECH 2009, 2009

Generalized discriminative feature transformation for speech recognition.
Proceedings of the INTERSPEECH 2009, 2009

Incorporating monolingual corpora into bilingual latent semantic analysis for crosslingual LM adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2009


Voice convergin: Speaker de-identification by voice transformation.
Proceedings of the IEEE International Conference on Acoustics, 2009

Generalized Baum-Welch algorithm for discriminative training on large vocabulary continuous speech recognition system.
Proceedings of the IEEE International Conference on Acoustics, 2009

Detecting bandlimited audio in broadcast television shows.
Proceedings of the IEEE International Conference on Acoustics, 2009

HMM-based human motion recognition with optical flow data.
Proceedings of the 9th IEEE-RAS International Conference on Humanoid Robots, 2009

Joint Learning of Preposition Senses and Semantic Roles of Prepositional Phrases.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Speaker-Adaptive Speech Recognition Based on Surface Electromyography.
Proceedings of the Biomedical Engineering Systems and Technologies, 2009

Towards Speaker-adaptive Speech Recognition based on Surface Electromyography.
Proceedings of the BIOSIGNALS 2009, 2009

EEG-based Speech Recognition - Impact of Temporal Effects.
Proceedings of the BIOSIGNALS 2009, 2009

Vietnamese large vocabulary continuous speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

A multiplatform speech recognition decoder based on weighted finite-state transducers.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Rapid language adaptation tools for multilingual speech processing.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Speaker de-identification via voice transformation.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Towards emotion recognition from electroencephalographic signals.
Proceedings of the Affective Computing and Intelligent Interaction, 2009

2008
Multilingual spoken language processing.
IEEE Signal Process. Mag., 2008

Rapid language adaptation tools and technologies for multilingual speech processing systems.
Proceedings of the First International Workshop on Spoken Languages Technologies for Under-Resourced Languages, 2008

Synthesizer voice quality of new languages calibrated with mean mel cepstral distortion.
Proceedings of the First International Workshop on Spoken Languages Technologies for Under-Resourced Languages, 2008

Improving word segmentation for Thai speech translation.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Modeling Vocal Interaction for Text-Independent Participant Characterization in Multi-Party Conversation.
Proceedings of the SIGDIAL 2008 Workshop, 2008

Correlated Bigram LSA for Unsupervised Language Model Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 21, 2008

Detection of Laughter-in-Interaction in Multichannel Close-Talk Microphone Recordings of Meetings.
Proceedings of the Machine Learning for Multimodal Interaction, 5th International Workshop, 2008

NineOneOne: Recognizing and Classifying Speech for Handling Minority Language Emergency Calls.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Recovering participant identities in meetings from a probabilistic description of vocal interaction.
Proceedings of the INTERSPEECH 2008, 2008

Improving speech systems built from very little data.
Proceedings of the INTERSPEECH 2008, 2008

Robust far-field speaker identification under mismatched conditions.
Proceedings of the INTERSPEECH 2008, 2008

The CMU-interACT 2008 Mandarin transcription system.
Proceedings of the INTERSPEECH 2008, 2008

Selecting relevant features for human motion recognition.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Sentence segmentation and punctuation recovery for spoken language translation.
Proceedings of the IEEE International Conference on Acoustics, 2008

Is voice transformation a threat to speaker identification?
Proceedings of the IEEE International Conference on Acoustics, 2008

Speech Translation for Triage of Emergency Phonecalls in Minority Languages.
Proceedings of the workshop on Speech Processing for Safety Critical Translation and Pervasive Applications@COLING 2008, 2008

Automatic Speech Recognition Based on Electromyographic Biosignals.
Proceedings of the Biomedical Engineering Systems and Technologies, 2008

EARS: Electromyographical Automatic Recognition of Speech.
Proceedings of the First International Conference on Biomedical Electronics and Devices, 2008

Determine Task Demand from Brain Activity.
Proceedings of the First International Conference on Biomedical Electronics and Devices, 2008

2007
Far-Field Speaker Recognition.
IEEE Trans. Speech Audio Process., 2007

Bilingual LSA-based adaptation for statistical machine translation.
Mach. Transl., 2007

Voice building from insufficient data - classroom experiences with web-based language development tools.
Proceedings of the Sixth ISCA Workshop on Speech Synthesis, 2007

Speaker Characteristics.
Proceedings of the Speaker Classification I: Fundamentals, Features, and Methods, 2007

Modeling Vocal Interaction for Text-Independent Classification of Conversation Type.
Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue, 2007

Advances in the CMU/Interact Arabic GALE Transcription System.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

A Geometric Interpretation of Non-Target-Normalized Maximum Cross-Channel Correlation for Vocal Activity Detection in Meetings.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Improving spoken language translation by automatic disfluency removal: evidence from conversational speech transcripts.
Proceedings of Machine Translation Summit XI: Papers, 2007

Modeling Vocal Interaction for Segmentation in Meeting Recognition.
Proceedings of the Machine Learning for Multimodal Interaction , 2007

Wavelet-based front-end for electromyographic speech recognition.
Proceedings of the INTERSPEECH 2007, 2007

Bilingual LSA-based translation lexicon adaptation for spoken language translation.
Proceedings of the INTERSPEECH 2007, 2007

SPICE: web-based tools for rapid language adaptation in speech processing systems.
Proceedings of the INTERSPEECH 2007, 2007

Optimizing sentence segmentation for spoken language translation.
Proceedings of the INTERSPEECH 2007, 2007

Handling OOV words in Arabic ASR via flexible morphological constraints.
Proceedings of the INTERSPEECH 2007, 2007

Whispering Speaker Identification.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Correlated Latent Semantic Model for Unsupervised LM Adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Continuous Electromyographic Speech Recognition with a Multi-Stream Decoding Architecture.
Proceedings of the IEEE International Conference on Acoustics, 2007

Simultaneous multispeaker segmentation for automatic meeting recognition.
Proceedings of the 15th European Signal Processing Conference, 2007

Bilingual-LSA Based LM Adaptation for Spoken Language Translation.
Proceedings of the ACL 2007, 2007

2006
Flexible speech translation systems.
IEEE Trans. Speech Audio Process., 2006

Thai Grapheme-Based Speech Recognition.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Spontaneous Thai speech recognition.
Proceedings of the INTERSPEECH 2006, 2006

Sub-word unit based non-audible speech recognition using surface electromyography.
Proceedings of the INTERSPEECH 2006, 2006

Unsupervised language model adaptation using latent semantic marginals.
Proceedings of the INTERSPEECH 2006, 2006

Towards continuous speech recognition using surface electromyography.
Proceedings of the INTERSPEECH 2006, 2006

Optimizing components for handheld two-way speech translation for an English-iraqi Arabic system.
Proceedings of the INTERSPEECH 2006, 2006

Example-based grapheme-to-phoneme conversion for Thai.
Proceedings of the INTERSPEECH 2006, 2006

Challenges with Rapid Adaptation of Speech Translation Systems to New Language Pairs.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Acoustic-Phonetic Unit Similarities For Context Dependent Acoustic Model Portability.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Unsupervised Learning of Overlapped Speech Model Parameters For Multichannel Speech Activity Detection in Meetings.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Articulatory Feature Classification using Surface Electromyography.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Far-Field Speaker Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Rapid development of an Afrikaans-English speech-to-speech translator.
Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005

Dynamic language model adaptation using variational Bayes inference.
Proceedings of the INTERSPEECH 2005, 2005

Document driven machine translation enhanced ASR.
Proceedings of the INTERSPEECH 2005, 2005

Thai Automatic Speech Recognition.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Whispery Speech Recognition using Adapted Articulatory Features.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Automatic Disfluency Removal on Recognized Spontaneous Speech - Rapid Adaptation to Speaker Dependent Disfluencies.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004
A Thai Speech Translation System for Medical Dialogs.
Proceedings of the Demonstration Papers at HLT-NAACL 2004, 2004

Using word latice information for a tighter coupling in speech translation systems.
Proceedings of the INTERSPEECH 2004, 2004

Issues in meeting transcription - the ISL meeting transcription system.
Proceedings of the INTERSPEECH 2004, 2004

Crosscorrelation-based multispeaker speech activity detection.
Proceedings of the INTERSPEECH 2004, 2004

Adaptation for soft whisper recognition using a throat microphone.
Proceedings of the INTERSPEECH 2004, 2004

Speaker segmentation and clustering in meetings.
Proceedings of the INTERSPEECH 2004, 2004

Identifying the addressee in human-human-robot interactions based on head pose and speech.
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004

Towards language portability in statistical speech translation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003
Implicit Trajectory Modeling through Gaussian Transition Models for Speech Recognition.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Speechalator: Two-Way Speech-to-Speech Translation in Your Hand.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Enhanced tree clustering with single pronunciation dictionary for conversational speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Non-native spontaneous speech recognition through polyphone decision tree specialization.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speechalator: two-way speech-to-speech translation on a consumer PDA.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Integrating multilingual articulatory features into speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Grapheme based speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Correction of disfluencies in spontaneous speech using a noisy-channel approach.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Comparison of acoustic model adaptation techniques on non-native speech.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

SMaRT: the Smart Meeting Room Task at ISL.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Multilingual articulatory features.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Globalphone: a multilingual speech and text database developed at karlsruhe university.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Phonetic speaker identification.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Towards Universal Speech Recognition.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Toward robust parametric trajectory segmental model for vowel recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002

Speaker identification using multilingual phone strings.
Proceedings of the IEEE International Conference on Acoustics, 2002

Improvements in Non-Verbal Cue Identification Using Multilingual Phone Strings.
Proceedings of the Workshop on Speech-to-Speech Translation: Algorithms and Systems@ACL 2002, 2002

2001
Multilinguale Spracherkennung: Kombination akustischer Modelle zur Portierung auf neue Sprachen.
PhD thesis, 2001

Language-independent and language-adaptive acoustic modeling for speech recognition.
Speech Commun., 2001

Advances in meeting recognition.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Domain Portability in Speech-to-Speech Translation.
Proceedings of the First International Conference on Human Language Technology Research, 2001

LingWear: A Mobile Tourist Information System.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Experiments on cross-language acoustic modeling.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Advances in automatic meeting record creation and access.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Multilinguality in speech and spoken language systems.
Proc. IEEE, 2000

VERBMOBIL dialogues: multifaced analysis.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Polyphone decision tree specialization for language adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2000

Confidence measure based language identification.
Proceedings of the IEEE International Conference on Acoustics, 2000

Turkish LVCSR: towards better speech recognition for agglutinative languages.
Proceedings of the IEEE International Conference on Acoustics, 2000

1999
Mandarin large vocabulary speech recognition using the globalphone database.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Linear discriminant - a new criterion for speaker normalization.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Language independent and language adaptive large vocabulary speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Recognition of music types.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Fast bootstrapping of LVCSR systems with multilingual phoneme sets.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Japanese LVCSR on the spontaneous scheduling task with JANUS-3.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996
Automatische Identifizierung spontan gesprochener Sprachen mit neuronalen Netzen.
Proceedings of the Natural Language Processing and Speech Technology, 1996

LVCSR-based language identification.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

1995
Integrating different learning approaches into a multilingual spoken language translation system.
Proceedings of the Connectionist, 1995

Acoustic and language modeling of human and nonhuman noises for human-to-human spontaneous speech recognition.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
JANUS 93: towards spontaneous speech translation.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1992
Stochastic modeling of syllable-based units for continuous speech recognition.
Proceedings of the Second International Conference on Spoken Language Processing, 1992


  Loading...