Tetsunori Kobayashi

Proceedings of the 2017 TREC Video Retrieval Evaluation, 2017

Incorporating visual features into word embeddings: A bimodal autoencoder-based approach.

Mika Hasegawa

Yoshihiko Hayashi

Proceedings of the IWCS 2017 - 12th International Conference on Computational Semantics - Short papers, Montpellier, France, September 19, 2017

Prosody Control of Utterance Sequence for Information Delivering.

Ishin Fukuoka

Proceedings of the Interspeech 2017, 2017

Exploiting end of sentences and speaker alternations in language modeling for multiparty conversations.

Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016

Waseda at TRECVID 2016: Ad-hoc Video Search.

Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016

Evaluation of Collaborative Video Surveillance Platform: Prototype Development of Abandoned Object Detection.

Proceedings of the 10th International Conference on Distributed Smart Camera, 2016

Improving semantic video indexing: Efforts in Waseda TRECVID 2015 SIN system.

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Image retrieval under very noisy annotations.

Proceedings of the 24th European Signal Processing Conference, 2016

Video semantic indexing using object detection-derived features.

Proceedings of the 24th European Signal Processing Conference, 2016

Towards a Framework for Collaborative Video Surveillance System Using Crowdsourcing.

Susumu Saito

Teppei Nakano

Proceedings of the 19th ACM Conference on Computer Supported Cooperative Work and Social Computing, 2016

A Spoken Dialog System for Coordinating Information Consumption and Exploration.

Proceedings of the 2016 ACM Conference on Human Information Interaction and Retrieval, 2016

Multi-feature based fast depth decision in HEVC inter prediction for VLSI implementation.

Proceedings of the 9th International Congress on Image and Signal Processing, 2016

2015

Automatic Expressive Opinion Sentence Generation for Enjoyable Conversational Systems.

IEEE ACM Trans. Audio Speech Lang. Process., 2015

Four-participant group conversation: A facilitation robot controlling engagement density as the fourth participant.

Comput. Speech Lang., 2015

Waseda @ TRECVID 2015: Semantic Indexing.

Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015

Multi-layer feature extractions for image classification - Knowledge from deep CNNs.

Proceedings of the International Conference on Systems, Signals and Image Processing, 2015

Bilinear map of filter-bank outputs for DNN-based speech recognition.

Proceedings of the INTERSPEECH 2015, 2015

Multiscale recurrent neural network based language model.

Proceedings of the INTERSPEECH 2015, 2015

A comparative study of spectral clustering for i-vector-based speaker clustering under noisy conditions.

Naohiro Tawara

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Separation matrix optimization using associative memory model for blind source separation.

Proceedings of the 23rd European Signal Processing Conference, 2015

Towards a Computational Model of Small Group Facilitation.

Yoichi Matsuyama

Proceedings of the 2015 AAAI Spring Symposia, 2015

2014

Effect of frequency weighting on MLP-based speaker canonicalization.

Proceedings of the INTERSPEECH 2014, 2014

2013

Expression of speaker's intentions through sentence-final particle/intonation combinations in Japanese conversational speech synthesis.

Proceedings of the Eighth ISCA Tutorial and Research Workshop on Speech Synthesis, 2013

A Four-Participant Group Facilitation Framework for Conversational Robots.

Proceedings of the SIGDIAL 2013 Conference, 2013

Blocked Gibbs sampling based multi-scale mixture model for speaker clustering on noisy data.

Proceedings of the IEEE International Workshop on Machine Learning for Signal Processing, 2013

Speaker's intentions conveyed to listeners by sentence-final particles and their intonations in Japanese conversational speech.

Proceedings of the IEEE International Conference on Acoustics, 2013

2012

Fully Bayesian speaker clustering based on hierarchically structured utterance-oriented Dirichlet process mixture model.

Proceedings of the INTERSPEECH 2012, 2012

Expressing Speaker's Intentions through Sentence-Final Intonations for Japanese Conversational Speech Synthesis.

Proceedings of the INTERSPEECH 2012, 2012

Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering.

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

AAM fitting using shape parameter distribution.

Youhei Shiraishi

Proceedings of the 20th European Signal Processing Conference, 2012

2011

Class-Distance-Based Discriminant Analysis and Its Application to Supervised Automatic Age Estimation.

IEICE Trans. Inf. Syst., 2011

Multiparty Conversation Facilitation Strategy Using Combination of Question Answering and Spontaneous Utterances.

Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems, 2011

Conversational Speech Synthesis System with Communication Situation Dependent HMMs.

Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems, 2011

Speaker Clustering Based on Utterance-Oriented Dirichlet Process Mixture Model.

Proceedings of the INTERSPEECH 2011, 2011

Spatial Filter Calibration Based on Minimization of Modified LSD.

Nobuaki Tanaka

Proceedings of the INTERSPEECH 2011, 2011

Speaker Verification Robust to Talking Style Variation Using Multiple Kernel Learning Based on Conditional Entropy Minimization.

Proceedings of the INTERSPEECH 2011, 2011

Speaker recognition using multiple kernel learning based on conditional entropy minimization.

Proceedings of the IEEE International Conference on Acoustics, 2011

Subspace pursuit method for kernel-log-linear models.

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

A Sequential Pattern Classifier Based on Hidden Markov Kernel Machine and Its Application to Phoneme Classification.

IEEE J. Sel. Top. Signal Process., 2010

Speech Enhancement Using a Square Microphone Array in the Presence of Directional and Diffuse Noise.

IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2010

Psychological evaluation of a group communication activation robot in a party game.

Proceedings of the INTERSPEECH 2010, 2010

A regularized discriminative training method of acoustic models derived by minimum relative entropy discrimination.

Proceedings of the INTERSPEECH 2010, 2010

Development of zonal beamformer and its application to robot audition.

Proceedings of the 18th European Signal Processing Conference, 2010

Robot as a multimodal human interface device.

Proceedings of the Auditory-Visual Speech Processing, 2010

Framework of Communication Activation Robot Participating in Multiparty Conversation.

Proceedings of the Dialog with Robots, 2010

2009

Influence of Lombard Effect: Accuracy Analysis of Simulation-Based Assessments of Noisy Speech Recognition Systems for Various Recognition Conditions.

IEICE Trans. Inf. Syst., 2009

SCHEMA: multi-party interaction-oriented humanoid robot.

Proceedings of the International Conference on Computer Graphics and Interactive Techniques, 2009

Upper-Body Contour Extraction Using Face and Body Shape Variance Information.

Kazuki Hoshiai

Proceedings of the Advances in Image and Video Technology, Third Pacific Rim Symposium, 2009

Robot auditory system using head-mounted square microphone array.

Kosuke Hosoya

Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Conversation robot participating in and activating a group communication.

Proceedings of the INTERSPEECH 2009, 2009

System design of group communication activator: an entertainment task for elderly care.

Proceedings of the 4th ACM/IEEE International Conference on Human Robot Interaction, 2009

Direction-of-arrival estimation under noisy condition using four-line omni-directional microphones mounted on a robot head.

Proceedings of the 17th European Signal Processing Conference, 2009

2008

Social Robots that Interact with People.

Cynthia Breazeal

Atsuo Takanishi

Proceedings of the Springer Handbook of Robotics, 2008

Mutual Information Based Dynamic Integration of Multiple Feature Streams for Robust Real-Time LVCSR.

IEICE Trans. Inf. Syst., 2008

Ears of the Robot: Direction of Arrival Estimation Based on Pattern Recognition Using Robot-Mounted Microphones.

Naoya Mochiki

IEICE Trans. Inf. Syst., 2008

Design and formulation for speech interface based on flexible shortcuts.

Proceedings of the INTERSPEECH 2008, 2008

Speech enhancement using square microphone array for mobile devices.

Proceedings of the IEEE International Conference on Acoustics, 2008

Incorporation of phrase intonation to context clustering for average voice models in HMM-based Thai speech synthesis.

Suphattharachai Chomphan

Proceedings of the IEEE International Conference on Acoustics, 2008

Designing communication activation system in group communication.

Proceedings of the 8th IEEE-RAS International Conference on Humanoid Robots, 2008

Upper-body contour extraction and tracking using face and body shape variance information.

Kazuki Hoshiai

Proceedings of the 8th IEEE-RAS International Conference on Humanoid Robots, 2008

Multi-modal integration for personalized conversation: Towards a humanoid in daily life.

Proceedings of the 8th IEEE-RAS International Conference on Humanoid Robots, 2008

An ASM fitting method based on machine learning that provides a robust parameter initialization for AAM fitting.

Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

Ears of the robot: Noise reduction using four-line ultra-micro omni-directional microphones mounted on a robot head.

Proceedings of the 2008 16th European Signal Processing Conference, 2008

2007

Spectrum conversion using prosodic information.

Tadashi Okubo

Syst. Comput. Jpn., 2007

Fusion-Based Age-Group Classification Method Using Multiple Two-Dimensional Feature Extraction Algorithms.

IEICE Trans. Inf. Syst., 2007

Ears of the Robot: Three Simultaneous Speech Segregation and Recognition Using Robot-Mounted Microphones.

Naoya Mochiki

IEICE Trans. Inf. Syst., 2007

Dynamic integration of multiple feature streams for robust real-time LVCSR.

Proceedings of the INTERSPEECH 2007, 2007

Adequacy Analysis of Simulation-Based Assessment of Speech Recognition System.

Satoshi Kanba

Proceedings of the IEEE International Conference on Acoustics, 2007

Extensible speech recognition system using proxy-agent.

Teppei Nakano

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

Introduction of the METI project "development of fundamental speech recognition technology".

Sadaoki Furui

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006

Adaptive understanding of proposal-requesting expressions for conversational information retrieval system.

Kenichiro Hosokawa

Syst. Comput. Jpn., 2006

Recognition of positive/negative attitude and its application to a spoken dialogue system.

Syst. Comput. Jpn., 2006

Hybrid Voice Conversion of Unit Selection and Generation Using Prosody Dependent HMM.

Tadashi Okubo

IEICE Trans. Inf. Syst., 2006

Genetic Algorithm Based Optimization of Partly-Hidden Markov Model Structure Using Discriminative Criterion.

IEICE Trans. Inf. Syst., 2006

Manifold HLDA and its application to robust speech recognition.

Toshiaki Kubo

Proceedings of the INTERSPEECH 2006, 2006

MONEA: Message-oriented Networked-robot Architecture.

Teppei Nakano

Proceedings of the 2006 IEEE International Conference on Robotics and Automation, 2006

Two-dimensional Heteroscedastic Linear Discriminant Analysis for Age-group Classification.

Teruhide Hayashida

Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Conversation Robot with the Function of Gaze Recognition.

Toshihiko Yamahata

Proceedings of the 2006 6th IEEE-RAS International Conference on Humanoid Robots, 2006

Subspace-based Age-group Classification Using Facial Images under Various Lighting Conditions.

Teruhide Hayashida

Proceedings of the Seventh IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2006), 2006

A method for solving the permutation problem of frequency-domain BSS using reference signal.

Proceedings of the 14th European Signal Processing Conference, 2006

Source separation using multiple directivity patterns produced by ICA-based BSS.

Proceedings of the 14th European Signal Processing Conference, 2006

2005

An extension of the state-observation dependency in partly hidden Markov models and its application to continuous speech recognition.

Syst. Comput. Jpn., 2005

Extension of Hidden Markov Models for Multiple Candidates and Its Application to Gesture Recognition.

Yosuke Sato

IEICE Trans. Inf. Syst., 2005

Optimizing the structure of partly-hidden Markov models using weighted likelihood-ratio maximization criterion.

Proceedings of the INTERSPEECH 2005, 2005

Back-channel feedback generation using linguistic and nonlinguistic information and its application to spoken dialogue system.

Kenta Fukushima

Proceedings of the INTERSPEECH 2005, 2005

Speech recognition in the blind condition based on multiple directivity patterns using a microphone array.

Toshiyuki Sekiya

Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

2004

Design and implementation of data sharing architecture for multifunctional robot development.

Yosuke Matsusaka

Kentaro Oku

Syst. Comput. Jpn., 2004

A Low-Band Spectrum Envelope Reconstruction Method for PSOLA-Based F0 Modification.

IEICE Trans. Inf. Syst., 2004

Speech-Recognition Interfaces for Music Information Retrieval: 'Speech Completion' and 'Speech Spotter'.

Proceedings of the ISMIR 2004, 2004

Recognition of three simultaneous utterance of speech by four-line directivity microphone mounted on head of robot.

Proceedings of the INTERSPEECH 2004, 2004

Speech spotter: on-demand speech recognition in human-human conversation on the telephone or in face-to-face situations.

Proceedings of the INTERSPEECH 2004, 2004

Prosody based attitude recognition with feature selection and its application to spoken dialog system as para-linguistic information.

Proceedings of the INTERSPEECH 2004, 2004

A Method of Gender Classification by Integrating Facial, Hairstyle, and Clothing Images.

Proceedings of the 17th International Conference on Pattern Recognition, 2004

Speech enhancement based on multiple directivity patterns using a microphone array.

Toshiyuki Sekiya

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

A low-band spectrum envelope modeling for high quality pitch modification.

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Robust language modeling for a small corpus of target tasks using class-combined word statistics and selective use of a general corpus.

Yosuke Wada

Norihiko Kobayashi

Syst. Comput. Jpn., 2003

Dictation of multiparty conversation considering speaker individuality and turn taking.

Noriyuki Murai

Syst. Comput. Jpn., 2003

Speech recognition of double talk using SAFIA-based audio segregation.

Toshiyuki Sekiya

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speech starter: noise-robust endpoint detection by using filled pauses.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Speech shift: direct speech-input-mode switching through intentional control of voice pitch.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Hybrid modeling of PHMM and HMM for speech recognition.

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002

Humanoid Robots in Waseda University-Hadaly-2 and WABIAN.

Auton. Robots, 2002

Inter-module cooperation architecture for interactive robot.

KyeongJu Kim

Yosuke Matsusaka

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, Lausanne, Switzerland, September 30, 2002

Generalization of state-observation-dependency in partly hidden Markov models.

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Media-Integrated Biometric Person Recognition Based on the Dempster-Shafer Theory.

Yoshiaki Sugie

Proceedings of the 16th International Conference on Pattern Recognition, 2002

Extension of Hidden Markov Models to Deal with Multiple Candidates of Observations and its Application to Mobile-Robot-Oriented Gesture Recognition.

Yosuke Sato

Proceedings of the 16th International Conference on Pattern Recognition, 2002

Design and collection of acoustic sound data for hands-free speech recognition and sound scene understanding.

Proceedings of the 2002 IEEE International Conference on Multimedia and Expo, 2002

2001

Modeling of conversational strategy for the robot participating in the group conversation.

Yosuke Matsusaka

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Estimating positions of multiple adjacent speakers based on MUSIC spectra correlation using a microphone array.

Hidetomo Tanaka

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

A conversational robot utilizing facial and body expressions.

Proceedings of the IEEE International Conference on Systems, 2000

IPA Japanese Dictation Free Software Project.

Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Free software toolkit for Japanese large vocabulary continuous speech recognition.

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Dictation of multiparty conversation using statistical turn taking model and speaker model.

Noriyuki Murai

Proceedings of the IEEE International Conference on Acoustics, 2000

1999

Multi-person conversation via multi-modal interface - a robot who communicate with multi-user -.

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Class-combined word n-gram for robust language modeling.

Norihiko Kobayashi

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Partly hidden Markov model and its application to speech recognition.

Junko Furuyama

Ken Masumitsu

Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998

Controlling gaze of humanoid in communication with human.

Proceedings of the 1998 IEEE/RSJ International Conference on Intelligent Robots and Systems. Innovations in Theory, 1998

Source-extended language model for large vocabulary continuous speech recognition.

Yosuke Wada

Norihiko Kobayashi

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Sharable software repository for Japanese large vocabulary continuous speech recognition.

The design of the newspaper-based Japanese large vocabulary continuous speech recognition corpus.

1997

Partly-hidden Markov model and its application to gesture recognition.

Satoshi Haruyama

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

1996

Speech recognition in nonstationary noise based on parallel HMMs and spectral subtraction.

Ryuji Mine

Syst. Comput. Jpn., 1996

ALICE: acquisition of language in conversational environment - an approach to weakly supervised training of spoken language system for language porting.

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1995

Handling of user interruption to achieve timing-free utterances for spoken dialogue interface.

Syst. Comput. Jpn., 1995

1994

Generation of prosody in speech synthesis using large speech data-base.

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Phoneme recognition in various styles of utterance based on mutual information criterion.

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Multimodal drawing tool using speech, mouse and key-board.

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Automatic training of phoneme dictionary based on mutual information criterion.

Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

Markov model based noise modeling and its application to noisy speech recognition using dynamical features of speech.

Ryuji Mine

Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1993

Word spotting in conversational speech based on phonemic unit likelihood by mutual information criterion.

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

Speech recognition under the unstationary noise based on the noise Markov model and spectral-subtraction.

Ryuji Mine

Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1992

Phoneme recognition in continuous speech based on mutual information considering phonemic duration and connectivity.

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Spectral mapping onto probabilistic domain using neural networks and its application to speaker adaptive phoneme recognition.

Proceedings of the Second International Conference on Spoken Language Processing, 1992

Speaker adaptive phoneme recognition based on feature mapping from spectral domain to probabilistic domain.

Proceedings of the 1992 IEEE International Conference on Acoustics, 1992

1991

Text-to-speech synthesizer using superposition of sinusoidal waves generated by synchronized oscillators.

Kazuo Hashimoto

Proceedings of the Second European Conference on Speech Communication and Technology, 1991

Application of neural networks to articulatory motion estimation.

Masayuki Yagyu

Proceedings of the 1991 International Conference on Acoustics, 1991

1990

Dependence of phonemic feature on context.

Kazuhiro Watanabe

Proceedings of the 1990 International Conference on Acoustics, 1990

Statistical properties of fluctuation of pitch intervals and its modeling for natural synthetic speech.

Hidetoshi Sekine

Proceedings of the 1990 International Conference on Acoustics, 1990

1989

Contextual factor analysis of vowel distribution.

Toshiyuki Matsuda

Kazuhiro Watanabe

Proceedings of the First European Conference on Speech Communication and Technology, 1989

1987

The robot musician 'wabot-2' (waseda robot-2).

Robotics, 1987

Description of task dependent knowledge for speech understanding system.

Proceedings of the European Conference on Speech Technology, 1987

1986

Estimating articulatory motion from speech wave.

Speech Commun., 1986

Estimation of articulatory parameters by table look-up method and its application for speaker independent phoneme recognition.

J. Yazawa

Proceedings of the IEEE International Conference on Acoustics, 1986

A network model dealing with focus of conversation for speech understanding system.

Proceedings of the IEEE International Conference on Acoustics, 1986

1984

Phrase speech recognition of large vocabulary using feature in articulatory domain.

Proceedings of the IEEE International Conference on Acoustics, 1984

1983

Considerations on articulatory dynamics for continuous speech recognition.

Proceedings of the IEEE International Conference on Acoustics, 1983

1982

Recognition of semivowels and consonants in continuous speech using articulatory parameters.
