Alex Waibel

According to our database1, Alex Waibel authored at least 432 papers between 1982 and 2018.

Collaborative distances:

Awards

IEEE Fellow

IEEE Fellow 2015, "For contributions to neural network based speech recognition and translation and multimodal interfaces".

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

Homepages:

On csauthors.net:

Bibliography

2018
Towards one-shot learning for rare-word translation with external experts.
CoRR, 2018

Paraphrases as Foreign Languages in Multilingual Neural Machine Translation.
CoRR, 2018

Low-Latency Neural Speech Translation.
CoRR, 2018

A Hierarchical Approach to Neural Context-Aware Modeling.
CoRR, 2018

Robust and Scalable Differentiable Neural Computer for Question Answering.
CoRR, 2018

Neural Language Codes for Multilingual Acoustic Models.
CoRR, 2018

Massively Parallel Cross-Lingual Learning in Low-Resource Target Language Translation.
CoRR, 2018

Self-Attentional Acoustic Models.
CoRR, 2018

Automated Evaluation of Out-of-Context Errors.
CoRR, 2018

An End-to-End Goal-Oriented Dialog System with a Generative Natural Language Response Generation.
CoRR, 2018

Massively Parallel Cross-Lingual Learning in Low-Resource Target Language Translation.
Proceedings of the Third Conference on Machine Translation: Research Papers, 2018

The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2018.
Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Building Real-Time Speech Recognition Without CMVN.
Proceedings of the Speech and Computer - 20th International Conference, 2018

Automated Evaluation of Out-of-Context Errors.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

BULBasaa: A Bilingual Basaa-French Speech Corpus for the Evaluation of Language Documentation Tools.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

KIT-Multi: A Translation-Oriented Multilingual Embedding Corpus.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Subword and Crossword Units for CTC Acoustic Models.
Proceedings of the Interspeech 2018, 2018

Self-Attentional Acoustic Models.
Proceedings of the Interspeech 2018, 2018

Low-Latency Neural Speech Translation.
Proceedings of the Interspeech 2018, 2018

Term Extraction via Neural Sequence Labeling a Comparative Evaluation of Strategies Using Recurrent Neural Networks.
Proceedings of the Interspeech 2018, 2018

Neural Language Codes for Multilingual Acoustic Models.
Proceedings of the Interspeech 2018, 2018

Exploring Ctc-Network Derived Features with Conventional Hybrid System.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multilingual Adaptation of RNN Based ASR Systems.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

KIT Lecture Translator: Multilingual Speech Translation with One-Shot Learning.
Proceedings of the COLING 2018, 2018

Towards one-shot learning for rare-word translation with external experts.
Proceedings of the 2nd Workshop on Neural Machine Translation and Generation, 2018

Robust and Scalable Differentiable Neural Computer for Question Answering.
Proceedings of the Workshop on Machine Reading for Question Answering@ACL 2018, 2018

2017
Transcribing against time.
Speech Communication, 2017

Subword and Crossword Units for CTC Acoustic Models.
CoRR, 2017

Effective Strategies in Zero-Shot Neural Machine Translation.
CoRR, 2017

Multilingual Adaptation of RNN Based ASR Systems.
CoRR, 2017

Phonemic and Graphemic Multilingual CTC Based Speech Recognition.
CoRR, 2017

Transcribing Against Time.
CoRR, 2017

Comparison of Decoding Strategies for CTC Acoustic Models.
CoRR, 2017

Analyzing Neural MT Search and Model Performance.
CoRR, 2017

Neural Lattice-to-Sequence Models for Uncertain Inputs.
CoRR, 2017

Yeah, Right, Uh-Huh: A Deep Learning Backchannel Predictor.
CoRR, 2017

The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2017.
Proceedings of the Second Conference on Machine Translation, 2017

The QT21 Combined Machine Translation System for English to Latvian.
Proceedings of the Second Conference on Machine Translation, 2017

Improved Speaker Adaptation by Combining I-vector and fMLLR with Deep Bottleneck Networks.
Proceedings of the Speech and Computer - 19th International Conference, 2017

Language Adaptive Multilingual CTC Speech Recognition.
Proceedings of the Speech and Computer - 19th International Conference, 2017

Yeah, Right, Uh-Huh: A Deep Learning Backchannel Predictor.
Proceedings of the Advanced Social Interaction with Agents, 2017

Comparison of Decoding Strategies for CTC Acoustic Models.
Proceedings of the Interspeech 2017, 2017

Enhancing Backchannel Prediction Using Word Embeddings.
Proceedings of the Interspeech 2017, 2017

NMT-Based Segmentation and Punctuation Insertion for Real-Time Spoken Language Translation.
Proceedings of the Interspeech 2017, 2017

Towards phoneme inventory discovery for documentation of unwritten languages.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Keynote Talk.
Proceedings of the 5th International Conference on Human Agent Interaction, 2017

Neural Lattice-to-Sequence Models for Uncertain Inputs.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

DBLSTM based multilingual articulatory feature extraction for language documentation.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

Analyzing Neural MT Search and Model Performance.
Proceedings of the First Workshop on Neural Machine Translation, 2017

2016
Pre-Translation for Neural Machine Translation.
CoRR, 2016

Toward Multilingual Neural Machine Translation with Universal Encoder and Decoder.
CoRR, 2016


Using Factored Word Representation in Neural Network Language Models.
Proceedings of the First Conference on Machine Translation, 2016

The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2016.
Proceedings of the First Conference on Machine Translation, 2016

Lecture Translator - Speech translation framework for simultaneous lecture translation.
Proceedings of the Demonstrations Session, 2016

Optimizing Computer-Assisted Transcription Quality with Iterative User Interfaces.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Evaluation of the KIT Lecture Translation System.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Towards an Open-Domain Social Dialog System.
Proceedings of the Dialogues with Social Robots, 2016

Unsupervised Phoneme Segmentation of Previously Unseen Languages.
Proceedings of the Interspeech 2016, 2016

Dynamic Transcription for Low-Latency Speech Translation.
Proceedings of the Interspeech 2016, 2016

Language Adaptive DNNs for Improved Low Resource Speech Recognition.
Proceedings of the Interspeech 2016, 2016

An empirical exploration of CTC acoustic models.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Lightly Supervised Quality Estimation.
Proceedings of the COLING 2016, 2016

Pre-Translation for Neural Machine Translation.
Proceedings of the COLING 2016, 2016

Training Deep Neural Networks for Reverberation Robust Speech Recognition.
Proceedings of the 12. ITG Symposium on Speech Communication, 2016

Language Feature Vectors for Resource Constraint Speech Recognition.
Proceedings of the 12. ITG Symposium on Speech Communication, 2016

Growing a Deep Neural Network Acoustic Model with Singular Value Decomposition.
Proceedings of the 12. ITG Symposium on Speech Communication, 2016

Phoneme Boundary Detection using Deep Bidirectional LSTMs.
Proceedings of the 12. ITG Symposium on Speech Communication, 2016

Personalized News Event Retrieval for Small Talk in Social Dialog Systems.
Proceedings of the 12. ITG Symposium on Speech Communication, 2016

Using Tweets as "Ice-Breaking" Sentences in a Social Dialog System.
Proceedings of the 12. ITG Symposium on Speech Communication, 2016

2015
Lexical Translation Model Using a Deep Neural Network Architecture.
CoRR, 2015

ListNet-based MT Rescoring.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

The KIT-LIMSI Translation System for WMT 2015.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

The Karlsruhe Institute of Technology Translation Systems for the WMT 2015.
Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

Evaluation of Crowdsourced User Input Data for Spoken Dialog Systems.
Proceedings of the SIGDIAL 2015 Conference, 2015

Gaussian free cluster tree construction using deep neural network.
Proceedings of the INTERSPEECH 2015, 2015

Combination of NN and CRF models for joint detection of punctuation and disfluencies.
Proceedings of the INTERSPEECH 2015, 2015

Using Neural Networks for Data-Driven Backchannel Prediction: A Survey on Input Features and Training Techniques.
Proceedings of the Human-Computer Interaction: Interaction Technologies, 2015

Stripping Adjectives: Integration Techniques for Selective Stemming in SMT Systems.
Proceedings of the 18th Annual Conference of the European Association for Machine Translation, 2015

2014
Segmentation for Efficient Supervised Language Annotation with an Explicit Cost-Utility Tradeoff.
TACL, 2014

The Karlsruhe Institute of Technology Translation Systems for the WMT 2014.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

EU-BRIDGE MT: Combined Machine Translation.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

The KIT-LIMSI Translation System for WMT 2014.
Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

A Neural Network Keyword Search System for Telephone Speech.
Proceedings of the Speech and Computer - 16th International Conference, 2014

On-the-fly user modeling for cost-sensitive correction of speech transcripts.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Manual Analysis of Structurally Informed Reordering in German-English Machine Translation.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

A Corpus of Spontaneous Speech in Lectures: The KIT Lecture Corpus for Spoken Language Processing and Translation.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

A World without Barriers: Connecting the World across Languages, Distances and Media.
Proceedings of the 16th International Conference on Multimodal Interaction, 2014

Training time reduction and performance improvements from multilingual techniques on the BABEL ASR task.
Proceedings of the IEEE International Conference on Acoustics, 2014

Multilingual shifting deep bottleneck features for low-resource ASR.
Proceedings of the IEEE International Conference on Acoustics, 2014

Optimization of Neural Network Language Models for keyword search.
Proceedings of the IEEE International Conference on Acoustics, 2014

Tight Integration of Speech Disfluency Removal into SMT.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

2013
Training speech translation from audio recordings of interpreter-mediated communication.
Computer Speech & Language, 2013

Joint WMT 2013 Submission of the QUAERO Project.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

An MT Error-Driven Discriminative Word Lexicon using Sentence Structure Features.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

The Karlsruhe Institute of Technology Translation Systems for the WMT 2013.
Proceedings of the Eighth Workshop on Statistical Machine Translation, 2013

Combining Word Reordering Methods on different Linguistic Abstraction Levels for Statistical Machine Translation.
Proceedings of the Seventh Workshop on Syntax, 2013

Segmentation of Telephone Speech Based on Speech and Non-speech Models.
Proceedings of the Speech and Computer - 15th International Conference, 2013

Optimizing deep bottleneck feature extraction.
Proceedings of the 2013 IEEE RIVF International Conference on Computing and Communication Technologies, 2013

Measuring the Structural Importance through Rhetorical Structure Index.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Efficient speech transcription through respeaking.
Proceedings of the INTERSPEECH 2013, 2013

Slightly Supervised Adaptation of Acoustic Models on Captioned BBC Weather Forecasts.
Proceedings of the First Workshop on Speech, 2013

Modular combination of deep neural networks for acoustic modeling.
Proceedings of the INTERSPEECH 2013, 2013

A real-world system for simultaneous translation of German lectures.
Proceedings of the INTERSPEECH 2013, 2013

Learning discriminative basis coefficients for eigenspace MLLR unsupervised adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2013

Subspace mixture model for low-resource speech recognition in cross-lingual settings.
Proceedings of the IEEE International Conference on Acoustics, 2013

Warped Minimum Variance Distortionless Response based bottle neck features for LVCSR.
Proceedings of the IEEE International Conference on Acoustics, 2013

Extracting deep bottleneck features using stacked auto-encoders.
Proceedings of the IEEE International Conference on Acoustics, 2013

Models of tone for tonal and non-tonal languages.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

DNN acoustic modeling with modular multi-lingual feature extraction networks.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

Letter N-Gram-based Input Encoding for Continuous Space Language Models.
Proceedings of the Workshop on Continuous Vector Space Models and their Compositionality, 2013

2012
Parallel Phrase Scoring for Extra-large Corpora.
Prague Bull. Math. Linguistics, 2012

The Karlsruhe Institute of Technology Translation Systems for the WMT 2012.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Joint WMT 2012 Submission of the QUAERO Project.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

The KIT Lecture Corpus for Speech Translation.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

The 2012 KIT and KIT-NAIST English ASR systems for the IWSLT evaluation.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Continuous space language models using restricted Boltzmann machines.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

The KIT translation systems for IWSLT 2012.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Evaluation of interactive user corrections for lecture transcription.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

The KIT-NAIST (contrastive) English ASR system for IWSLT 2012.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Segmentation and punctuation prediction in speech language translation using a monolingual translation system.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Unsupervised vocabulary selection for real-time speech recognition of lectures.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Blind dereverberation of sinusoid signals using PLL-based combined phase and amplitude analysis.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A hybrid phonotactic language identification system with an SVM back-end for simultaneous lecture translation.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Wider Context by Using Bilingual Language Models in Machine Translation.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

The Karlsruhe Institute of Technology Translation Systems for the WMT 2011.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

Joint WMT Submission of the QUAERO Project.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

The 2011 KIT English ASR system for the IWSLT evaluation.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Using Wikipedia to translate domain-specific terms in SMT.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

The KIT English-French translation systems for IWSLT 2011.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Unsupervised vocabulary selection for simultaneous lecture translation.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011


The 2011 KIT QUAERO speech-to-text system for Spanish.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Advances on spoken language translation in the Quaero program.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

TriS: A Statistical Sentence Simplifier with Log-linear Models and Margin-based Discriminative Training.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

2010
The Karlsruhe Institute for Technology Translation System for the ACL-WMT 2010.
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

Speech translators for humanitarian projects.
Proceedings of the 2nd Workshop on Spoken Language Technologies for Under-Resourced Languages, 2010

Jibbigo: Speech-to-speech translation on mobile devices.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Tools for Collecting Speech Corpora via Mechanical-Turk.
Proceedings of the 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, 2010

The KIT translation system for IWSLT 2010.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

Real-time spoken language identification and recognition for speech-to-speech translation.
Proceedings of the 2010 International Workshop on Spoken Language Translation, 2010

Rapid development of speech translation using consecutive interpretation.
Proceedings of the INTERSPEECH 2010, 2010

Named-entity projection and data-driven morphological decomposition for field maintainable speech-to-speech translation systems.
Proceedings of the INTERSPEECH 2010, 2010

Spoken language translation from parallel speech audio: Simultaneous interpretation as SLT training data.
Proceedings of the IEEE International Conference on Acoustics, 2010

Towards social integration of humanoid robots by conversational concept learning.
Proceedings of the 10th IEEE-RAS International Conference on Humanoid Robots, 2010


2009
Computers in the Human Interaction Loop.
Proceedings of the Computers in the Human Interaction Loop, 2009

Beyond CHIL.
Proceedings of the Computers in the Human Interaction Loop, 2009

Consolidation-Based Speech Translation and Evaluation Approach.
IEICE Transactions, 2009

The Universität Karlsruhe Translation System for the EACL-WMT 2009.
Proceedings of the Fourth Workshop on Statistical Machine Translation, 2009

Incremental Adaptation of Speech-to-Speech Translation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Human translations guided language discovery for ASR systems.
Proceedings of the INTERSPEECH 2009, 2009

Multimodal Interfaces in Support of Human-Human Interaction.
Proceedings of the Gesture in Embodied Communication and Human-Computer Interaction, 2009

End-to-End Evaluation in Simultaneous Translation.
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009

Automatic translation from parallel speech: Simultaneous interpretation as MT training data.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Pronunciation modeling for dialectal arabic speech recognition.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

2008
A dialogue approach to learning object descriptions and semantic categories.
Robotics and Autonomous Systems, 2008

Towards human translations guided language discovery for ASR systems.
Proceedings of the First International Workshop on Spoken Languages Technologies for Under-Resourced Languages, 2008

Simultaneous machine translation of german lectures into english: Investigating research challenges for the future.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Modelling multimodal user ID in dialogue.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Confidence based multimodal fusion for person identification.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Probabilistic integration of sparse audio-visual cues for identity tracking.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Communicating Unknown Words in Machine Translation.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Simultaneous German-English lecture translation.
Proceedings of the 2008 International Workshop on Spoken Language Translation, 2008

Speech Processing in Support of Human-Human Communication (Invited Paper).
Proceedings of the ISUC 2008, 2008

Lightly supervised acoustic model training on EPPS recordings.
Proceedings of the INTERSPEECH 2008, 2008

Class-based statistical machine translation for field maintainable speech-to-speech translation.
Proceedings of the INTERSPEECH 2008, 2008

Stream decoding for simultaneous spoken language translation.
Proceedings of the INTERSPEECH 2008, 2008

Extracting clues from human interpreter speech for spoken language translation.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Enabling Multimodal Human-Robot Interaction for the Karlsruhe Humanoid Robot.
IEEE Trans. Robotics, 2007

Far-Field Speaker Recognition.
IEEE Trans. Audio, Speech & Language Processing, 2007

Simultaneous translation of lectures and speeches.
Machine Translation, 2007

Translation Model Pruning via Usage Statistics for Statistical Machine Translation.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

The CMU-UKA statistical machine translation systems for IWSLT 2007.
Proceedings of the 2007 International Workshop on Spoken Language Translation, 2007

Computer-supported human-human multilingual communication.
Proceedings of the INTERSPEECH 2007, 2007

Behavior models for learning and receptionist dialogs.
Proceedings of the INTERSPEECH 2007, 2007

Speech Translation Enhanced ASR for European Parliament Speeches - On the Influence of ASR Performance on Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2007

Continuous Electromyographic Speech Recognition with a Multi-Stream Decoding Architecture.
Proceedings of the IEEE International Conference on Acoustics, 2007

Consolidation based speech translation.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
A Pattern Learning Approach to Question Answering Within the Ephyra Framework.
Proceedings of the Text, Speech and Dialogue, 9th International Conference, 2006

Speech-to-Speech Translation Services for the Olympic Games 2008.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

A Robot Learns to Know People - First Contacts of a Robot.
Proceedings of the KI 2006: Advances in Artificial Intelligence, 2006

The CMU-UKA syntax augmented machine translation system for IWSLT-06.
Proceedings of the 2006 International Workshop on Spoken Language Translation, 2006

The UKA/CMU statistical machine translation system for IWSLT 2006.
Proceedings of the 2006 International Workshop on Spoken Language Translation, 2006

Sub-word unit based non-audible speech recognition using surface electromyography.
Proceedings of the INTERSPEECH 2006, 2006

Rapid simulation-driven reinforcement learning of multimodal dialog strategies in human-robot interaction.
Proceedings of the INTERSPEECH 2006, 2006

Towards continuous speech recognition using surface electromyography.
Proceedings of the INTERSPEECH 2006, 2006

Optimizing components for handheld two-way speech translation for an English-iraqi Arabic system.
Proceedings of the INTERSPEECH 2006, 2006

A multilingual expectations model for contextual utterances in mixed-initiative spoken dialogue.
Proceedings of the INTERSPEECH 2006, 2006

Dynamic extension of a grammar-based dialogue system: constructing an all-recipes knowing robot.
Proceedings of the INTERSPEECH 2006, 2006

Multimodal estimation of user interruptibility for smart mobile telephones.
Proceedings of the 8th International Conference on Multimodal Interfaces, 2006

Directing Attention in Online Aggregate Sensor Streams via Auditory Blind Value Assignment.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Articulatory Feature Classification using Surface Electromyography.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Open Domain Speech Recognition & Translation: Lectures and Speeches.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Computer-Supported Human-Human Multilingual Communication.
Proceedings of the 50 Years of Artificial Intelligence, 2006

2005
CHIL - Computers in the Human Interaction Loop.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2005), 2005


The CMU statistical machine translation system for IWSLT 2005.
Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005

Low cost Portability for statistical machine translation based on n-gram frequency and TF-IDF.
Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005

Document driven machine translation enhanced ASR.
Proceedings of the INTERSPEECH 2005, 2005

Clarification questions to improve dialogue flow and speech recognition in spoken dialogue systems.
Proceedings of the INTERSPEECH 2005, 2005

Temporal ICA for classification of acoustic events i a kitchen environment.
Proceedings of the INTERSPEECH 2005, 2005

Rapid porting of ASR-systems to mobile devices.
Proceedings of the INTERSPEECH 2005, 2005

Spontaneous speech consolidation for spoken language applications.
Proceedings of the INTERSPEECH 2005, 2005

The connector: facilitating context-aware communication.
Proceedings of the 7th International Conference on Multimodal Interfaces, 2005

Automatically Transcribing Meetings using Distant Microphones.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Classifying user environment for mobile applications using linear autoencoding of ambient audio.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Whispery Speech Recognition using Adapted Articulatory Features.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Bilingual Word Spectral Clustering for Statistical Machine Translation.
Proceedings of the Workshop on Building and Using Parallel Texts@ACL 2005, 2005

Training and Evaluating Error Minimization Decision Rules for Statistical Machine Translation.
Proceedings of the Workshop on Building and Using Parallel Texts@ACL 2005, 2005

Learning a Log-Linear Model with Bilingual Phrase-Pair Features for Statistical Machine Translation.
Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing, 2005

Clustering and Classifying Person Names by Origin.
Proceedings of the Proceedings, 2005

2004
Automatic detection and recognition of signs from natural scenes.
IEEE Trans. Image Processing, 2004

Speaker adaptation with all-pass transforms.
Speech Communication, 2004

A Thai Speech Translation System for Medical Dialogs.
Proceedings of the Demonstration Papers at HLT-NAACL 2004, 2004

Improving Named Entity Translation Combining Phonetic and Semantic Similarities.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004

Interpreting BLEU/NIST Scores: How Much Improvement do We Need to Have a Better System?
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Language Model Adaptation for Statistical Machine Translation Based on Information Retrieval.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

The ISL statistical translation system for spoken language translation.
Proceedings of the 2004 International Workshop on Spoken Language Translation, 2004

The ISL EDTRL system.
Proceedings of the 2004 International Workshop on Spoken Language Translation, 2004

Towards named entity extraction and translation in spoken language translation.
Proceedings of the 2004 International Workshop on Spoken Language Translation, 2004

Natural human-robot interaction using speech, head pose and gestures.
Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004

Speech translation: past, present and future.
Proceedings of the INTERSPEECH 2004, 2004

Worldwide ongoing activities on multilingual speech to speech translation.
Proceedings of the INTERSPEECH 2004, 2004

Adaptation for soft whisper recognition using a throat microphone.
Proceedings of the INTERSPEECH 2004, 2004

Tight coupling of speech recognition and dialog management - dialog-context dependent grammar weighting for speech recognition.
Proceedings of the INTERSPEECH 2004, 2004

Integrating thumbnail features for speech recognition using conditional exponential models.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Towards language portability in statistical speech translation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Minimum Kullback-Leibler distance based multivariate Gaussian feature adaptation for distant-talking speech recognition.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Performance comparisons of all-pass transform adaptation with maximum likelihood linear regression.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

Phrase Pair Rescoring with Term Weighting for Statistical Machine Translatio.
Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing , 2004

Large Vocabulary Audio-Visual Speech Recognition Using the Janus Speech Recognition Toolkit.
Proceedings of the Pattern Recognition, 26th DAGM Symposium, August 30, 2004

Improving Statistical Machine Translation in the Medical Domain using the Unified Medical Language system.
Proceedings of the COLING 2004, 2004

2003
Extracting named entity translingual equivalence with limited resources.
ACM Trans. Asian Lang. Inf. Process., 2003

A Statistical Approach to Automatic Speech Summarization.
EURASIP J. Adv. Sig. Proc., 2003

Efficient Optimization for Bilingual Sentence Alignment Based on Linear Regression.
Proceedings of the HLT-NAACL 2003 Workshop on Building and Using Parallel Texts: Data Driven Machine Translation and Beyond, 2003

Speechalator: Two-Way Speech-to-Speech Translation in Your Hand.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Minimum variance distortionless response on a warped frequency scale.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speechalator: two-way speech-to-speech translation on a consumer PDA.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Integrating multilingual articulatory features into speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Calibration of a Hybrid Camera Network.
Proceedings of the 9th IEEE International Conference on Computer Vision (ICCV 2003), 2003

Comparison of acoustic model adaptation techniques on non-native speech.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

SMaRT: the Smart Meeting Room Task at ISL.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Multilingual articulatory features.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Maximum mutual information speaker adapted training with semi-tied covariance matrices.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Effective Phrase Translation Extraction from Alignment Models.
Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, 2003

Automatic Extraction of Named Entity Translingual Equivalence Based on Multi-Feature Cost Minimization.
Proceedings of the Workshop on Multilingual and Mixed-language Named Entity Recognition, 2003

2002
Modeling focus of attention for meeting indexing based on multiple cues.
IEEE Trans. Neural Networks, 2002

Automatic Detection of Signs with Affine Transformation.
Proceedings of the 6th IEEE Workshop on Applications of Computer Vision (WACV 2002), 2002

Automatic sign translation.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Compensating for hyperarticulation by modeling articulatory properties.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

A flexible stream architecture for ASR using articulatory features.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Interlingua based statistical machine translation.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Phonetic speaker identification.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

A Robust Approach for Recognition of Text Embedded in Natural Scenes.
Proceedings of the 16th International Conference on Pattern Recognition, 2002

A PDA-Based Sign Translator.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Towards Universal Speech Recognition.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Flexi-Modal and Multi-Machine User Interfaces.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Integrating Emotional Cues into a Framework for Dialogue Management.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Automatic detection and translation of text from natural scenes.
Proceedings of the IEEE International Conference on Acoustics, 2002

Efficient language model lookahead through polymorphic linguistic context assignment.
Proceedings of the IEEE International Conference on Acoustics, 2002

Experiments on distant-talking speech recognition in meeting room using extended MAM.
Proceedings of the IEEE International Conference on Acoustics, 2002

On maximum mutual information speaker-adapted training.
Proceedings of the IEEE International Conference on Acoustics, 2002

Speaker identification using multilingual phone strings.
Proceedings of the IEEE International Conference on Acoustics, 2002

Automatic speech summarization applied to English broadcast news speech.
Proceedings of the IEEE International Conference on Acoustics, 2002

Improvements in Non-Verbal Cue Identification Using Multilingual Phone Strings.
Proceedings of the Workshop on Speech-to-Speech Translation: Algorithms and Systems@ACL 2002, 2002

2001
Multimodal error correction for speech user interfaces.
ACM Trans. Comput.-Hum. Interact., 2001

Language-independent and language-adaptive acoustic modeling for speech recognition.
Speech Communication, 2001

The ISL View4You Broadcast News Transcription System.
I. J. Speech Technology, 2001

Online handwriting recognition: the NPen++ recognizer.
IJDAR, 2001

Towards Automatic Sign Translation.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Advances in meeting recognition.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Activity detection for information access to oral communication.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Architecture and Design Considerations in NESPOLE!: a Speech Translation System for E-commerce Applications.
Proceedings of the First International Conference on Human Language Technology Research, 2001

LingWear: A Mobile Tourist Information System.
Proceedings of the First International Conference on Human Language Technology Research, 2001

Experiments on cross-language acoustic modeling.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Model-combination-based acoustic mapping.
Proceedings of the IEEE International Conference on Acoustics, 2001

Advances in automatic meeting record creation and access.
Proceedings of the IEEE International Conference on Acoustics, 2001

The ISL evaluation system for Verbmobil-II.
Proceedings of the IEEE International Conference on Acoustics, 2001

Speaker compensation with sine-log all-pass transforms.
Proceedings of the IEEE International Conference on Acoustics, 2001

Estimating focus of attention based on gaze and sound.
Proceedings of the Auditory-Visual Speech Processing, 2001

2000
The Janus-III Translation System: Speech-to-Speech Translation in Multiple Domains.
Machine Translation, 2000

Towards Unrestricted Lip Reading.
IJPRAI, 2000

End to end evaluation of the ISL View4You broadcast news transcription and retrieval system.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 2000

Multimodal Meeting Tracker.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 2000

Shallow Discourse Genre Annotation in CallHome Spanish.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Streamlining the front end of a speech recognizer.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

New developments in automatic meeting transcription.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Phone dependent modeling of hyperarticulated effects#.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

The effects of room acoustics on MFCC speech parameter.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

A na ve de-lambing method for speaker identification.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Application of LDA to speaker recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Dialogue management for multimodal user registration.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Simultaneous Tracking of Head Poses in a Panoramic View.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

Growing Gaussian Mixture Models for Pose Invariant Face Recognition.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

Towards a Multimodal Meeting Record.
Proceedings of the 2000 IEEE International Conference on Multimedia and Expo, 2000

Specialized acoustic models for hyperarticulated speech.
Proceedings of the IEEE International Conference on Acoustics, 2000

Polyphone decision tree specialization for language adaptation.
Proceedings of the IEEE International Conference on Acoustics, 2000

Strategies for automatic segmentation of audio data.
Proceedings of the IEEE International Conference on Acoustics, 2000

Segmenting Hands of Arbitrary Color.
Proceedings of the 4th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2000), 2000

Face Recognition in a Meeting Room.
Proceedings of the 4th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2000), 2000

DIASUMM: Flexible Summarization of Spontaneous Dialogues in Unrestricted Domains.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

Minimizing Word Error Rate in Textual Summaries of Spoken Language.
Proceedings of the 6th Applied Natural Language Processing Conference, 2000

1999
Stochastically-based semantic analysis for machine translation.
Computer Speech & Language, 1999

From Gaze to Focus of Attention.
Proceedings of the Visual Information and Information Systems, 1999

Multimodal people ID for a multimedia meeting browser.
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

Modeling focus of attention for meeting indexing.
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

Smart Sight: A Tourist Assistant System.
Proceedings of the Third International Symposium on Wearable Computers (ISWC 1999), 1999

Progress in automatic meeting transcription.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Towards spontaneous speech recognition for on-board car navigation and information systems.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Mandarin large vocabulary speech recognition using the globalphone database.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Unsupervised training of a speech recognizer: recent experiments.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Navigating German cities by spontaneous French queries.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Modeling and efficient decoding of large vocabulary conversational speech.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Selection criteria for hypothesis driven lexical adaptation.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Model-Based and Empirical Evaluation of Multimodal Interactive Error Correction.
Proceedings of the Proceeding of the CHI '99 Conference on Human Factors in Computing Systems: The CHI is the Limit, 1999

Face translation: A multimodal translation agent.
Proceedings of the Auditory-Visual Speech Processing, 1999

1998
Linear discriminant - a new criterion for speaker normalization.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Fast decoding for statistical machine translation.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

On the influence of hyperarticulated speech on recognition performance.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Language independent and language adaptive large vocabulary speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

An interlingua based on domain actions for machine translation of task-oriented dialogues.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Unsupervised training of a speech recognizer using TV broadcasts.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Reducing the OOV rate in broadcast news speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

The interactive systems labs view4you video indexing system.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Phonetic-distance-based hypothesis driven lexical adaptation for transcribing multlingual broadcast news.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Conversational speech systems for on-board car navigation and assistance.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Probabilistic dialogue act extraction for concept based multilingual translation systems.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Effective structural adaptation of LVCSR systems to unseen domains using hierarchical connectionist acoustic models.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Experiments in automatic meeting transcription using JRTK.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Recognition of music types.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Serbo-Croatian LVCSR on the dictation and broadcast news domain.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Hierarchies of neural networks for connectionist speech recognition.
Proceedings of the ESANN 1998, 1998

Visual Tracking for Multimodal Human Computer Interaction.
Proceedings of the Proceeding of the CHI '98 Conference on Human Factors in Computing Systems, 1998

Interactive error repair for an online handwriting interface.
Proceedings of the CHI 98 Conference Summary on Human Factors in Computing Systems, 1998

Real-Time Face and Facial Feature Tracking and Applications.
Proceedings of the Auditory-Visual Speech Processing, 1998

A Modular Approach to Spoken Language Translation for Large Domains.
Proceedings of the Machine Translation and the Information Soup, 1998

Using Chunk Based Partial Parsing of Spontaneous Speech in Unrestricted Domains for Reducing Word Error Rate in Speech Recognition.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

Modeling with Structures in Statistical Machine Translation.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

Growing Semantic Grammars.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

Skin-Color Modeling and Adaptation.
Proceedings of the Computer Vision, 1998

1997
Janus: A System for Translation of Conversational Speech.
KI, 1997

A Model-Based Gaze Tracking System.
International Journal on Artificial Intelligence Tools, 1997

Speaker normalization and speaker adaptation - a combination for conversational speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Statistical analysis of dialogue structure.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Exploiting repair context in interactive error recovery.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Fast bootstrapping of LVCSR systems with multilingual phoneme sets.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Japanese LVCSR on the spontaneous scheduling task with JANUS-3.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Dialogue strategies guiding users to their communicative goals.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Recognition of conversational telephone speech using the JANUS speech engine.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Multimodal interfaces for multimedia information agents.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Janus-III: speech-to-speech translation in multiple languages.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Context-dependent hybrid HME/HMM speech recognition using polyphone clustering decision trees.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Verbmobil: the combination of deep and shallow processing for spontaneous speech translation.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Decoding Algorithm in Statistical Machine Translation.
Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics, 1997

1996
Interactive Translation of Conversational Speech.
IEEE Computer, 1996

Multimodal Interfaces.
Artif. Intell. Rev., 1996

A real-time face tracker.
Proceedings of ThirdIEEE Workshop on Applications of Computer Vision, 1996

Adaptively Growing Hierarchical Mixtures of Experts.
Proceedings of the Advances in Neural Information Processing Systems 9, 1996

JANUS-II: towards spontaneous Spanish speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Word clustering with parallel spoken language corpora.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Interactive recovery from speech recognition errors in speech user interfaces.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Dictionary learning for spontaneous speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Class phrase models for language modelling.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Translation of conversational speech with JANUS-II.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Dialogue processing in a conversational speech translation system.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Recognition of spelled names over the telephone.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Recognizing emotion in speech.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Learning to parse spontaneous speech.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Focus of attention: Towards low bitrate video tele-conferencing.
Proceedings of the Proceedings 1996 International Conference on Image Processing, 1996

JANUS-II-translation of spontaneous conversational speech.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

LVCSR-based language identification.
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

End-to-End Evaluation in JANUS: A Speech-to-speech Translation System.
Proceedings of the Dialogue Processing in Spoken Language Systems, 1996

Search in a Learnable Spoken Language Parser.
Proceedings of the 12th European Conference on Artificial Intelligence, 1996

Multi-lingual Translation of Spontaneously Spoken Language in a Limited Domain.
Proceedings of the 16th International Conference on Computational Linguistics, 1996

FeasPar - A Feature Structure Parser Learning to Parse Spoken Language.
Proceedings of the 16th International Conference on Computational Linguistics, 1996

1995
The challenge of spoken language systems: research directions for the nineties.
IEEE Trans. Speech and Audio Processing, 1995

Integrating spelling into spoken dialogue recognition.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Speeding up the score computation of HMM speech regognizers with the bucket voronoi intersection algorithm.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

Integrating different learning approaches into a multilingual spoken language translation system.
Proceedings of the Connectionist, 1995

NPen/sup ++/: a writer independent, large vocabulary on-line cursive handwriting recognition system.
Proceedings of the Third International Conference on Document Analysis and Recognition, 1995

Concept-based speech translation.
Proceedings of the 1995 International Conference on Acoustics, 1995

Toward movement-invariant automatic lip-reading and speech recognition.
Proceedings of the 1995 International Conference on Acoustics, 1995

Knowing who to listen to in speech recognition: visually guided beamforming.
Proceedings of the 1995 International Conference on Acoustics, 1995

1994
Introduction Structured Connectionist Systems.
Machine Learning, 1994

Recovering From Parser Failures: A Hybrid Statistical/Symbolic Approach.
CoRR, 1994

The Use of Dynamic Writing Information in a Connectionist On-Line Cursive Handwriting Recognition System.
Proceedings of the Advances in Neural Information Processing Systems 7, 1994

Inferring linguistic structure in spoken language.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Towards better language models for spontaneous speech.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Improving recognizer acceptance through robust, natural speech repair.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

See me, hear me: integrating automatic speech recognition and lip-reading.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Combining bitmaps with dynamic writing information for on-line handwriting recognition.
Proceedings of the 12th IAPR International Conference on Pattern Recognition, 1994

JANUS 93: towards spontaneous speech translation.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Learning state-dependent stream weights for multi-codebook HMM speech recognition systems.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Learning complex output representations in connectionist parsing of spoken language.
Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993
A neural fuzzy training approach for improving speech recognition.
Systems and Computers in Japan, 1993

Recent advances in JANUS: a speech translation system.
Proceedings of the Human Language Technology: Proceedings of a Workshop Held at Plainsboro, 1993

Machine Translation.
Proceedings of the Human Language Technology: Proceedings of a Workshop Held at Plainsboro, 1993

Recent advances in JANUS: a speech translation system.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Detection and transcription of new words.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Speaker-independent connected letter recognition with a multi-state time delay neural network.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Tuning by doing: flexibility through automatic structure optimization.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Multi-modal HCI: combination of gesture and speech recognition.
Proceedings of the Human-Computer Interaction, 1993

1992
Integrated phoneme and function word architecture of hidden control neural networks for continuous speech recognition.
Speech Communication, 1992

The Meta-Pi Network: Building Distributed Knowledge Representations for Robust Multisource Pattern Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., 1992

Performance Through Consistency: MS-TDNN's for Large Vocabulary Continuous Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 5, [NIPS Conference, Denver, Colorado, USA, November 30, 1992

Connected Letter Recognition with a Multi-State Time Delay Neural Network.
Proceedings of the Advances in Neural Information Processing Systems 5, [NIPS Conference, Denver, Colorado, USA, November 30, 1992

1991
JANUS: Speech-to-Speech Translation Using Connectionist and Non-Connectionist Techniques.
Proceedings of the Advances in Neural Information Processing Systems 4, 1991

Multi-State Time Delay Networks for Continuous Speech Recognition.
Proceedings of the Advances in Neural Information Processing Systems 4, 1991

Continuous Speech Recognition with the Connectionist Viterbi Training Procedure: A Summary of Recent Work.
Proceedings of the Artificial Neural Networks, 1991

Integrated phoneme-function word architecture of hidden control neural networks for continuous speech recognition.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Evaluation of speaker-independent phoneme recognition on TIMIT database using TDNNs.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Time-delay neural networks embedding time alignment: a performance analysis.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Recent work in continuous speech recognition using the connectionist viterbi training procedure.
Proceedings of the Second European Conference on Speech Communication and Technology, 1991

1990
A novel objective function for improved phoneme recognition using time-delay neural networks.
IEEE Trans. Neural Networks, 1990

Spotting Phonemes and Syllables for Continuous Speech Recognition Using Time-Delay Neural Networks.
Systems and Computers in Japan, 1990

A time-delay neural network architecture for isolated word recognition.
Neural Networks, 1990

Continuous Speech Recognition by Linked Predictive Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 3, 1990

The Tempo 2 Algorithm: Adjusting Time-Delays By Supervised Learning.
Proceedings of the Advances in Neural Information Processing Systems 3, 1990

Speech recognition using sub-phoneme recognition neural network.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Speaker-independent phoneme recognition on TIMIT database using integrated time-delay neural networks (TDNNs).
Proceedings of the IJCNN 1990, 1990

1989
Modularity and scaling in large phonemic neural networks.
IEEE Trans. Acoustics, Speech, and Signal Processing, 1989

Phoneme recognition using time-delay neural networks.
IEEE Trans. Acoustics, Speech, and Signal Processing, 1989

Modular Construction of Time-Delay Neural Networks for Speech Recognition.
Neural Computation, 1989

Incremental Parsing by Modular Recurrent Connectionist Networks.
Proceedings of the Advances in Neural Information Processing Systems 2, 1989

Connectionist Architectures for Multi-Speaker Phoneme Recognition.
Proceedings of the Advances in Neural Information Processing Systems 2, 1989

Fast back-propagation learning methods for large phonemic neural networks.
Proceedings of the First European Conference on Speech Communication and Technology, 1989

1988
Consonant Recognition by Modular Construction of Large Phonemic Time-Delay Neural Networks.
Proceedings of the Advances in Neural Information Processing Systems 1, 1988

1987
Learned phonetic discrimination using connectionist networks.
Proceedings of the European Conference on Speech Technology, 1987

1984
Suprasegmentals in very large vocabulary isolated word recognition.
Proceedings of the IEEE International Conference on Acoustics, 1984

1982
Performance trade-offs in search techniques for isolated word speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 1982


  Loading...