Chiori Hori

Orcid: 0000-0002-4201-7578

According to our database, Chiori Hori authored at least 165 papers between 2000 and 2026.

Collaborative distances:

Dijkstra number of three.
Erdős number of four.

Bibliography

2026

Velocity Potential Neural Field for Efficient Ambisonics Impulse Response Modeling.

[DOI]

Yoshiki Masuyama

,

François G. Germain

,

,

,

Jonathan Le Roux

CoRR, March, 2026

Embedding Morphology into Transformers for Cross-Robot Policy Learning.

[DOI]

,

,

,

,

,

,

Toshiaki Koike-Akino

CoRR, March, 2026

2025

SpinBench: Perspective and Rotation as a Lens on Spatial Reasoning in VLMs.

[DOI]

,

,

,

,

CoRR, September, 2025

Factorized RVQ-GAN For Disentangled Speech Tokenization.

[DOI]

,

Dominik Klement

,

Antoine Laurent

,

,

,

,

,

,

,

,

Yoshiki Masuyama

,

,

,

François G. Germain

,

,

Jonathan Le Roux

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

Interactive Robot Action Replanning using Multimodal LLM Trained from Human Demonstration Videos.

[DOI]

,

Motonari Kambara

,

,

,

,

,

,

,

,

Jonathan Le Roux

Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Robot Confirmation Generation and Action Planning Using Long-context Q-Former Integrated with Multimodal LLM.

[DOI]

,

Yoshiki Masuyama

,

,

,

,

,

Jonathan Le Roux

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2025

2024

Overview of the Tenth Dialog System Technology Challenge: DSTC10.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2024

ZeroST: Zero-Shot Speech Translation.

[DOI]

,

,

Antoine Laurent

,

,

Jonathan Le Roux

Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Interactive Planning Using Large Language Models for Partially Observable Robotic Tasks.

[DOI]

,

,

,

,

,

,

Masayoshi Tomizuka

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization.

[DOI]

Yoshiki Masuyama

,

,

François G. Germain

,

,

,

,

Jonathan Le Roux

Proceedings of the IEEE International Conference on Acoustics, 2024

WI-FI based Indoor Monitoring Enhanced by Multimodal Fusion.

[DOI]

,

,

,

Cristian J. Vaca-Rubio

,

,

,

Jonathan Le Roux

Proceedings of the IEEE International Conference on Acoustics, 2024

Generation or Replication: Auscultating Audio Latent Diffusion Models.

[DOI]

Dimitrios Bralios

,

,

François G. Germain

,

,

,

,

Jonathan Le Roux

Proceedings of the IEEE International Conference on Acoustics, 2024

2023

Interactive Planning Using Large Language Models for Partially Observable Robotics Tasks.

[DOI]

,

,

,

,

,

,

Masayoshi Tomizuka

,

CoRR, 2023

Style-transfer based Speech and Audio-visual Scene understanding for Robot Action Sequence Acquisition from Videos.

[DOI]

,

,

,

,

,

,

,

,

,

Jonathan Le Roux

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Scenario-Aware Audio-Visual TF-Gridnet for Target Speech Extraction.

[DOI]

,

,

Yoshiki Masuyama

,

François G. Germain

,

,

,

Jonathan Le Roux

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022

Low-Latency Online Streaming VideoQA Using Audio-Visual Transformers.

[DOI]

,

,

Jonathan Le Roux

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Audio-Visual Scene-Aware Dialog and Reasoning Using Audio-Visual Transformers with Joint Student-Teacher Learning.

[DOI]

,

,

,

,

,

,

Jonathan Le Roux

,

Proceedings of the IEEE International Conference on Acoustics, 2022

(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering.

[DOI]

,

,

,

Jonathan Le Roux

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Editorial: Special Issue on the Eighth Dialog System Technology Challenge.

[DOI]

,

,

R. Chulaka Gunasekara

,

,

Abhinav Rastogi

,

Luis Fernando D'Haro

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Overview of the Eighth Dialog System Technology Challenge: DSTC8.

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Optimizing Latency for Online Video Captioning Using Audio-Visual Transformers.

[DOI]

,

,

Jonathan Le Roux

CoRR, 2021

Advanced Long-Context End-to-End Speech Recognition Using Context-Expanded Transformers.

[DOI]

,

,

,

Jonathan Le Roux

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Optimizing Latency for Online Video Captioning Using Audio-Visual Transformers.

[DOI]

,

,

Jonathan Le Roux

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers.

[DOI]

,

,

Moitreya Chatterjee

,

,

Jonathan Le Roux

,

,

,

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Overview of the seventh Dialog System Technology Challenge: DSTC7.

[DOI]

Luis Fernando D'Haro

,

Koichiro Yoshino

,

,

,

Lazaros Polymenakos

,

Jonathan K. Kummerfeld

,

,

Comput. Speech Lang., 2020

Multi-Pass Transformer for Machine Translation.

[DOI]

,

,

,

,

Jonathan Le Roux

CoRR, 2020

Spatio-Temporal Scene Graphs for Video Dialog.

[DOI]

,

,

,

Jonathan Le Roux

,

CoRR, 2020

Spatio-Temporal Ranked-Attention Networks for Video Captioning.

[DOI]

,

,

,

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Transformer-Based Long-Context End-to-End Speech Recognition.

[DOI]

,

,

,

Jonathan Le Roux

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Multi-Layer Content Interaction Through Quaternion Product for Visual Question Answering.

[DOI]

,

,

,

,

,

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Analysis of Malicious Email Detection using Cialdini's Principles.

[DOI]

Hiroki Nishikawa

,

Takumi Yamamoto

,

Bret A. Harsham

,

,

,

,

,

Kiyoto Kawauchi

,

Masakatsu Nishigaki

Proceedings of the 15th Asia Joint Conference on Information Security, 2020

2019

Adversarial training and decoding strategies for end-to-end neural conversation models.

[DOI]

,

,

,

,

,

John R. Hershey

Comput. Speech Lang., 2019

Overview of the sixth dialog system technology challenge: DSTC6.

[DOI]

,

,

Ryuichiro Higashinaka

,

,

,

Michimasa Inaba

,

Yuiko Tsunomori

,

Tetsuro Takahashi

,

Koichiro Yoshino

,

Comput. Speech Lang., 2019

Automatic evaluation of end-to-end dialog systems with adequacy-fluency metrics.

[DOI]

Luis Fernando D'Haro

,

Rafael E. Banchs

,

,

Comput. Speech Lang., 2019

The Eighth Dialog System Technology Challenge.

[DOI]

CoRR, 2019

Dialog System Technology Challenge 7.

[DOI]

Koichiro Yoshino

,

,

,

Luis Fernando D'Haro

,

Lazaros Polymenakos

,

R. Chulaka Gunasekara

,

Walter S. Lasecki

,

Jonathan K. Kummerfeld

,

,

,

,

,

,

,

,

,

CoRR, 2019

Reports of the Workshops Held at the 2019 AAAI Conference on Artificial Intelligence.

[DOI]

AI Mag., 2019

Joint Student-Teacher Learning for Audio-Visual Scene-Aware Dialog.

[DOI]

,

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features.

[DOI]

,

,

,

,

,

,

,

Vincent Cartillier

,

Raphael Gontijo Lopes

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2019

Audio Visual Scene-Aware Dialog.

[DOI]

,

Vincent Cartillier

,

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018

Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7.

[DOI]

,

Vincent Cartillier

,

Raphael Gontijo Lopes

,

,

,

,

,

,

,

,

CoRR, 2018

Multimodal Attention for Fusion of Audio and Spatiotemporal Features for Video Description.

[DOI]

,

,

,

,

,

,

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017

Attention-Based Multimodal Fusion for Video Description.

[DOI]

,

,

,

,

John R. Hershey

,

CoRR, 2017

End-to-end Conversation Modeling Track in DSTC6.

[DOI]

,

CoRR, 2017

Attention-Based Multimodal Fusion for Video Description.

[DOI]

,

,

,

,

,

John R. Hershey

,

,

Proceedings of the IEEE International Conference on Computer Vision, 2017

Early and late integration of audio features for automatic video description.

[DOI]

,

,

,

John R. Hershey

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

Combination of multiple acoustic models with unsupervised adaptation for lecture speech transcription.

[DOI]

,

,

,

,

,

,

Speech Commun., 2016

Speaker Adaptive Training Localizing Speaker Modules in DNN for Hybrid DNN-HMM Speech Recognizers.

[DOI]

,

Shigeki Matsuda

,

Hideyuki Watanabe

,

,

,

,

Shigeru Katagiri

IEICE Trans. Inf. Syst., 2016

Dialog state tracking with attention-based sequence-to-sequence learning.

[DOI]

,

,

,

Shinji Watanabe

,

,

Jonathan Le Roux

,

John R. Hershey

,

,

,

,

Takeyuki Aikawa

Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Context-Sensitive and Role-Dependent Spoken Language Understanding Using Bidirectional and Attention LSTMs.

[DOI]

,

,

Shinji Watanabe

,

John R. Hershey

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Driver confusion status detection using recurrent neural networks.

[DOI]

,

Shinji Watanabe

,

,

Bret A. Harsham

,

John R. Hershey

,

,

,

Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Minimum word error training of long short-term memory recurrent neural network language models for speech recognition.

[DOI]

,

,

Shinji Watanabe

,

John R. Hershey

Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015

Leveraging social Q&A collections for improving complex question answering.

[DOI]

,

,

Hideki Kashioka

,

Comput. Speech Lang., 2015

A cloud robotics approach towards dialogue-oriented robot speech.

[DOI]

,

Yoshinori Shiga

,

,

,

Adv. Robotics, 2015

The Application of Phrase Based Statistical Machine Translation Techniques to Myanmar Grapheme to Phoneme Conversion.

[DOI]

,

,

Andrew M. Finch

,

,

Eiichiro Sumita

,

Proceedings of the Computational Linguistics, 2015

HMM based Myanmar text to speech system.

[DOI]

,

,

,

Yoshinori Shiga

,

Andrew M. Finch

,

,

,

Eiichiro Sumita

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Sparse representation with temporal max-smoothing for acoustic event detection.

[DOI]

,

,

,

,

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Speaker adaptive training for deep neural networks embedding linear transformation networks.

[DOI]

,

Shigeki Matsuda

,

Hideyuki Watanabe

,

,

,

Shigeru Katagiri

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Extraction of pitch register from expressive speech in Japanese.

[DOI]

,

Yoshinori Shiga

,

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A Myanmar large vocabulary continuous speech recognition system.

[DOI]

Hay Mar Soe Naing

,

,

,

,

,

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014

A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment Modeling.

[DOI]

,

Shigeki Matsuda

,

,

Hideki Kashioka

,

IEEE ACM Trans. Audio Speech Lang. Process., 2014

Incorporating local information of the acoustic environments to MAP-based feature compensation and acoustic model adaptation.

[DOI]

,

,

,

,

Shigeki Matsuda

,

Comput. Speech Lang., 2014

Efficient multi-lingual unsupervised acoustic model training under mismatch conditions.

[DOI]

,

Hitoshi Yamamoto

,

Ryosuke Isotani

,

Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

The NICT ASR system for IWSLT 2014.

[DOI]

,

,

,

,

,

Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

Superpositional HMM-based intonation synthesis using a functional F0 model.

[DOI]

,

Yoshinori Shiga

,

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Spectral patch based sparse coding for acoustic event detection.

[DOI]

,

,

,

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Mandarin speech recognition using convolution neural network with augmented tone features.

[DOI]

,

,

Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Ensemble modeling of denoising autoencoder for speech spectrum restoration.

[DOI]

,

,

Shigeki Matsuda

,

Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014

Non-monologue HMM-based speech synthesis for service robots: A cloud robotics approach.

[DOI]

,

Yoshinori Shiga

,

,

,

Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Translating TED speeches by recurrent neural network based translation model.

[DOI]

,

,

Proceedings of the IEEE International Conference on Acoustics, 2014

Speaker Adaptive Training using Deep Neural Networks.

[DOI]

,

Shigeki Matsuda

,

,

,

Shigeru Katagiri

Proceedings of the IEEE International Conference on Acoustics, 2014

Sparse representation based on a bag of spectral exemplars for acoustic event detection.

[DOI]

,

,

Shigeki Matsuda

,

Proceedings of the IEEE International Conference on Acoustics, 2014

Semantic context inference for spoken document retrieval using term association matrices.

[DOI]

Chien-Lin Huang

,

Proceedings of the IEEE International Conference on Acoustics, 2014

Recurrent Neural Network-based Tuple Sequence Model for Machine Translation.

[DOI]

,

,

Proceedings of the COLING 2014, 2014

Tuning intonation with pitch accent decomposition for HMM-based expressive speech synthesis.

[DOI]

,

Yoshinori Shiga

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Incorporating tone features to convolutional neural network to improve Mandarin/Thai speech recognition.

[DOI]

,

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013

Controlling Tradeoff Between Approximation Accuracy and Complexity of a Smooth Function in a Reproducing Kernel Hilbert Space for Noise Reduction.

[DOI]

,

,

Shigeki Matsuda

,

,

Hideki Kashioka

IEEE Trans. Signal Process., 2013

Collecting Colloquial and Spontaneous-like Sentences from Web Resources for Constructing Chinese Language Models of Speech Recognition.

[DOI]

,

Shigeki Matsuda

,

,

Hideki Kashioka

Inf. Media Technol., 2013

A-STAR: Toward translating Asian spoken languages.

[DOI]

,

,

Andrew M. Finch

,

,

,

Noriyuki Kimura

,

,

Eiichiro Sumita

,

Satoshi Nakamura

,

,

Chai Wutiwiwatchai

,

,

,

,

,

Comput. Speech Lang., 2013

WFST-Based Spoken Dialogue System on Smartphones - Its Development and Implementation for Field Use.

[DOI]

,

,

Proceedings of the 2013 IEEE 14th International Conference on Mobile Data Management, Milan, Italy, June 3-6, 2013, 2013

Multilingual Speech-to-Speech Translation System: VoiceTra.

[DOI]

Shigeki Matsuda

,

,

Yoshinori Shiga

,

Hideki Kashioka

,

,

,

,

,

Eiichiro Sumita

,

,

Satoshi Nakamura

Proceedings of the 2013 IEEE 14th International Conference on Mobile Data Management, Milan, Italy, June 3-6, 2013, 2013

The NICT ASR system for IWSLT 2013.

[DOI]

Chien-Lin Huang

,

,

Shigeki Matsuda

,

,

,

,

Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

Cross-lingual acoustic model adaptation based on transfer vector field smoothing with MAP.

[DOI]

,

Shigeki Matsuda

,

,

Ryosuke Isotani

,

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A targets-based superpositional model of fundamental frequency contours applied to HMM-based speech synthesis.

[DOI]

,

Yoshinori Shiga

,

,

Yutaka Kidawara

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Speech enhancement based on deep denoising autoencoder.

[DOI]

,

,

Shigeki Matsuda

,

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Speech spectrum restoration based on conditional restricted Boltzmann machine.

[DOI]

,

Shigeki Matsuda

,

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A lecture transcription system combining neural network acoustic and language models.

[DOI]

,

Hitoshi Yamamoto

,

Pawel Swietojanski

,

,

,

,

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Joint analysis of vocal tract length and temporal information for robust speech recognition.

[DOI]

Chien-Lin Huang

,

,

Hideki Kashioka

,

Proceedings of the IEEE International Conference on Acoustics, 2013

Speaker clustering using vector representation with long-term feature for lecture speech recognition.

[DOI]

Chien-Lin Huang

,

,

Hideki Kashioka

,

Proceedings of the IEEE International Conference on Acoustics, 2013

Semantic inference based on neural probabilistic language modeling for speech indexing.

[DOI]

Chien-Lin Huang

,

,

Hideki Kashioka

Proceedings of the IEEE International Conference on Acoustics, 2013

Feature normalization using MVAW processing for spoken language recognition.

[DOI]

Chien-Lin Huang

,

Shigeki Matsuda

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Classification of children with voice impairments using deep neural networks.

[DOI]

Chien-Lin Huang

,

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012

Distributed speech translation technologies for multiparty multilingual communication.

[DOI]

,

,

Andrew M. Finch

,

,

,

Noriyuki Kimura

,

Shigeki Matsuda

,

,

Yutaka Ashikari

,

,

Hideki Kashioka

,

Eiichiro Sumita

,

Satoshi Nakamura

ACM Trans. Speech Lang. Process., 2012

Sequence-Based Pronunciation Variation Modeling for Spontaneous ASR Using a Noisy Channel Approach.

[DOI]

Hansjörg Hofmann

,

,

,

Hideki Kashioka

,

Satoshi Nakamura

,

Wolfgang Minker

IEICE Trans. Inf. Syst., 2012

The NICT ASR system for IWSLT2012.

[DOI]

Hitoshi Yamamoto

,

,

Chien-Lin Huang

,

,

,

Shigeki Matsuda

,

,

Hideki Kashioka

Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Factored recurrent neural network language model in TED lecture transcription.

[DOI]

,

Hitoshi Yamamoto

,

,

Shigeki Matsuda

,

,

Hideki Kashioka

Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Controlling the tradeoff property in a regularization framework for noise reduction.

[DOI]

,

,

Shigeki Matsuda

,

,

Hideki Kashioka

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Acoustic space partition based on broad phonetic class for ensemble acoustic modeling.

[DOI]

,

,

Shigeki Matsuda

,

,

Hideki Kashioka

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Collecting sentences from web resources for constructing spontaneous Chinese language model.

[DOI]

,

,

Shigeki Matsuda

,

,

Hideki Kashioka

Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Tied-State Mixture Language Model for WFST-based Speech Recognition.

[DOI]

Hitoshi Yamamoto

,

,

Shigeki Matsuda

,

,

Hideki Kashioka

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Leveraging Social Annotation for Topic Language Model Adaptation.

[DOI]

,

,

,

,

Hideki Kashioka

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Improving WFST-based G2P Conversion with Alignment Constraints and RNNLM N-best Rescoring.

[DOI]

,

Nobuaki Minematsu

,

Keikichi Hirose

,

,

Hideki Kashioka

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Speech restoration based on deep learning autoencoder with layer-wised pretraining.

[DOI]

,

Shigeki Matsuda

,

,

Hideki Kashioka

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Ensemble Classifiers Using Unsupervised Data Selection for Speaker Recognition.

[DOI]

Chien-Lin Huang

,

,

Hideki Kashioka

,

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A Specialized WFST Approach for Class Models and Dynamic Vocabulary.

[DOI]

,

,

Hideki Kashioka

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

A linear projection approach to environment modeling for robust speech recognition.

[DOI]

,

Chien-Lin Huang

,

Shigeki Matsuda

,

,

Hideki Kashioka

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A comparison of dynamic WFST decoding approaches.

[DOI]

,

,

Hideki Kashioka

Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Factored Language Model based on Recurrent Neural Network.

[DOI]

,

,

Hitoshi Yamamoto

,

Shigeki Matsuda

,

,

Hideki Kashioka

Proceedings of the COLING 2012, 2012

2011

Modeling spoken decision support dialogue and optimization of its dialogue strategy.

[DOI]

,

,

Tatsuya Kawahara

,

Kiyonori Ohtake

,

,

Hideki Kashioka

,

,

Satoshi Nakamura

ACM Trans. Speech Lang. Process., 2011

Investigation on the effects of ASR tuning on speech translation performance.

[DOI]

,

Andrew M. Finch

,

,

Hideki Kashioka

Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

The NICT ASR system for IWSLT2011.

[DOI]

,

,

Chien-Lin Huang

,

,

Shigeki Matsuda

,

,

Hideki Kashioka

Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Conditional Random Fields for Modeling Korean Pronunciation Variation.

[DOI]

,

Andrew M. Finch

,

,

Hideki Kashioka

,

Satoshi Nakamura

Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems, 2011

Incorporating Regional Information to Enhance MAP-Based Stochastic Feature Compensation for Robust Speech Recognition.

[DOI]

,

,

,

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

User Study of Spoken Decision Support System.

[DOI]

,

Kiyonori Ohtake

,

,

,

Satoshi Nakamura

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Answering Complex Questions via Exploiting Social Q&A Collection.

[DOI]

,

,

,

Hideki Kashioka

Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Improving Related Entity Finding via Incorporating Homepages and Recognizing Fine-grained Entities.

[DOI]

,

,

,

Hideki Kashioka

Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Dialogue Acts Annotation to Construct Dialogue Systems for Consulting.

[DOI]

Kiyonori Ohtake

,

,

,

Hideki Kashioka

,

Satoshi Nakamura

Proceedings of the Spoken Dialogue Systems Technology and Design, 2011

Online Learning of Bayes Risk-Based Optimization of Dialogue Management for Document Retrieval Systems with Speech Interface.

[DOI]

,

,

Tatsuya Kawahara

,

Kiyonori Ohtake

,

,

Hideki Kashioka

,

Satoshi Nakamura

Proceedings of the Spoken Dialogue Systems Technology and Design, 2011

2010

NiCT at TREC 2010: Related Entity Finding.

[DOI]

,

,

Proceedings of The Nineteenth Text REtrieval Conference, 2010

Dialogue strategy optimization to assist user's decision for spoken consulting dialogue systems.

[DOI]

,

,

Kiyonori Ohtake

,

,

Hideki Kashioka

,

,

Satoshi Nakamura

Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Modeling Spoken Decision Making Dialogue and Optimization of its Dialogue Strategy.

[DOI]

,

,

Kiyonori Ohtake

,

,

Hideki Kashioka

,

,

Satoshi Nakamura

Proceedings of the SIGDIAL 2010 Conference, 2010

Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems.

[DOI]

Kiyonori Ohtake

,

,

,

Hideki Kashioka

,

Satoshi Nakamura

Proceedings of the International Conference on Language Resources and Evaluation, 2010

Sightseeing Guidance Systems Based on WFST-Based Dialogue Manager.

[DOI]

,

,

Kiyonori Ohtake

,

,

Akihiro Kobayashi

,

,

,

Hideki Kashioka

,

,

Satoshi Nakamura

Proceedings of the Spoken Dialogue Systems for Ambient Environments, 2010

Construction and Experiment of a Spoken Consulting Dialogue System.

[DOI]

,

,

Kiyonori Ohtake

,

Hideki Kashioka

,

,

Satoshi Nakamura

Proceedings of the Spoken Dialogue Systems for Ambient Environments, 2010

Expansion of WFST-Based Dialog Management for Handling Multiple ASR Hypotheses.

[DOI]

,

,

,

Kiyonori Ohtake

,

,

Satoshi Nakamura

Proceedings of the Spoken Dialogue Systems for Ambient Environments, 2010

Web text classification for response generation in spoken decision support dialogue systems.

[DOI]

,

Kiyonori Ohtake

,

,

Hideki Kashioka

,

,

Satoshi Nakamura

Proceedings of the 4th International Universal Communication Symposium, 2010

2009

Consolidation-Based Speech Translation and Evaluation Approach.

[DOI]

,

,

,

,

Hideki Kashioka

,

Satoshi Nakamura

IEICE Trans. Inf. Syst., 2009

Network-based speech-to-speech translation.

[DOI]

,

,

,

Noriyuki Kimura

,

Yutaka Ashikari

,

Ryosuke Isotani

,

Eiichiro Sumita

,

Satoshi Nakamura

Proceedings of the 2009 International Workshop on Spoken Language Translation, 2009

Dialogue act annotation for consulting dialogue corpus.

[DOI]

Kiyonori Ohtake

,

,

,

Hideki Kashioka

,

Satoshi Nakamura

Proceedings of the 3rd International Universal Communication Symposium, 2009

Evaluation for WFST-based dialog management.

[DOI]

,

Kiyonori Ohtake

,

,

Hideki Kashioka

,

Satoshi Nakamura

Proceedings of the 3rd International Universal Communication Symposium, 2009

Annotating communicative function and semantic content in dialogue act for construction of consulting dialogue systems.

[DOI]

,

Kiyonori Ohtake

,

,

Hideki Kashioka

,

Satoshi Nakamura

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Recent advances in WFST-based dialog system.

[DOI]

,

Kiyonori Ohtake

,

,

Hideki Kashioka

,

Satoshi Nakamura

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Statistical dialog management applied to WFST-based dialog systems.

[DOI]

,

Kiyonori Ohtake

,

,

Hideki Kashioka

,

Satoshi Nakamura

Proceedings of the IEEE International Conference on Acoustics, 2009

The Asian network-based speech-to-speech translation system.

[DOI]

,

Noriyuki Kimura

,

,

,

Eiichiro Sumita

,

Satoshi Nakamura

,

,

Chai Wutiwiwatchai

,

,

,

,

,

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Weighted finite state transducer based statistical dialog management.

[DOI]

,

Kiyonori Ohtake

,

,

Hideki Kashioka

,

Satoshi Nakamura

Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Annotating Dialogue Acts to Construct Dialogue Systems for Consulting.

[DOI]

Kiyonori Ohtake

,

,

,

Hideki Kashioka

,

Satoshi Nakamura

Proceedings of the 7th Workshop on Asian Language Resources, 2009

2008

Dialogue Act Annotation for Statistically Managed Spoken Dialogue Systems.

[DOI]

Kiyonori Ohtake

,

,

,

Hideki Kashioka

,

Satoshi Nakamura

Proceedings of the ISUC 2008, 2008

A Statistical Approach to Expandable Spoken Dialog Systems using WFSTs.

[DOI]

,

Kiyonori Ohtake

,

,

Hideki Kashioka

,

Satoshi Nakamura

Proceedings of the ISUC 2008, 2008

Detection of feeling through back-channels in spoken dialogue.

[DOI]

Tatsuya Kawahara

,

Masayoshi Toyokura

,

,

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Dialog management using weighted finite-state transducers.

[DOI]

,

Kiyonori Ohtake

,

,

Hideki Kashioka

,

Satoshi Nakamura

Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

2007

Efficient WFST-Based One-Pass Decoding With On-The-Fly Hypothesis Rescoring in Extremely Large Vocabulary Continuous Speech Recognition.

[DOI]

,

,

Yasuhiro Minami

,

Atsushi Nakamura

IEEE Trans. Speech Audio Process., 2007

Consolidation based speech translation.

[DOI]

,

,

,

Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2005

The CMU statistical machine translation system for IWSLT 2005.

[DOI]

Sanjika Hewavitharana

,

,

Almut Silja Hildebrand

,

,

,

,

Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005

Machine translation evaluation inside QARLA.

[DOI]

Jesús Giménez

,

,

Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005

Overview of the IWSLT 2005 evaluation campaign.

[DOI]

,

Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005

Spontaneous speech consolidation for spoken language applications.

[DOI]

,

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004

Speech-to-text and speech-to-speech summarization of spontaneous speech.

[DOI]

,

Tomonori Kikuchi

,

Yosuke Shinnaka

,

IEEE Trans. Speech Audio Process., 2004

Speech Summarization: An Approach through Word Extraction and a Method for Evaluation.

[DOI]

,

IEICE Trans. Inf. Syst., 2004

Fast on-the-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition.

[DOI]

,

,

Yasuhiro Minami

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

2003

A new approach to automatic speech summarization.

[DOI]

,

IEEE Trans. Multim., 2003

A Statistical Approach to Automatic Speech Summarization.

[DOI]

,

,

Robert G. Malkin

,

,

EURASIP J. Adv. Signal Process., 2003

Speech summarization using weighted finite-state transducers.

[DOI]

,

,

Yasuhiro Minami

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Evaluation method for automatic speech summarization.

[DOI]

,

,

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Automatic speech summarization based on sentence extraction and compaction.

[DOI]

Tomonori Kikuchi

,

,

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Deriving disambiguous queries in a spoken interactive ODQA system.

[DOI]

,

,

,

,

Shigeru Katagiri

,

Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Spoken Interactive ODQA System: SPIQA.

[DOI]

,

,

,

,

,

Proceedings of the ACL 2003, 2003

2002

Erratum: Language modeling by stochastic dependency grammar for Japanese speech recognition.

[DOI]

,

,

,

Syst. Comput. Jpn., 2002

Construction and evaluation of language models based on stochastic context-free grammar for speech recognition.

[DOI]

Chiori Hori

,

Masaharu Katoh

,

Akinori Ito

,

Masaki Koh

Syst. Comput. Jpn., 2002

Automatic speech summarization applied to English broadcast news speech.

[DOI]

,

,

Robert G. Malkin

,

,

Proceedings of the IEEE International Conference on Acoustics, 2002

2001

Towards automatic transcription of spontaneous presentations.

[DOI]

Takahiro Shinozaki

,

,

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Advances in automatic speech summarization.

[DOI]

,

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Ubiquitous speech processing.

[DOI]

,

,

,

Takahiro Shinozaki

,

,

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

Language modeling by stochastic dependency grammar for Japanese speech recognition.

[DOI]

,

,

,

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Improvements in automatic speech summarization and evaluation methods.

[DOI]

,

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Automatic speech summarization based on word significance and linguistic likelihood.

[DOI]

,

Proceedings of the IEEE International Conference on Acoustics, 2000
