Chiori Hori

Orcid: 0000-0002-4201-7578

According to our database1, Chiori Hori authored at least 156 papers between 2000 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Overview of the Tenth Dialog System Technology Challenge: DSTC10.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

NIIRF: Neural IIR Filter Field for HRTF Upsampling and Personalization.
CoRR, 2024

2023
Interactive Planning Using Large Language Models for Partially Observable Robotics Tasks.
CoRR, 2023

Generation or Replication: Auscultating Audio Latent Diffusion Models.
CoRR, 2023

Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos.
CoRR, 2023

Scenario-Aware Audio-Visual TF-Gridnet for Target Speech Extraction.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Low-Latency Online Streaming VideoQA Using Audio-Visual Transformers.
Proceedings of the Interspeech 2022, 2022

Audio-Visual Scene-Aware Dialog and Reasoning Using Audio-Visual Transformers with Joint Student-Teacher Learning.
Proceedings of the IEEE International Conference on Acoustics, 2022

(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Editorial: Special Issue on the Eighth Dialog System Technology Challenge.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Overview of the Eighth Dialog System Technology Challenge: DSTC8.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Optimizing Latency for Online Video CaptioningUsing Audio-Visual Transformers.
CoRR, 2021

Advanced Long-Context End-to-End Speech Recognition Using Context-Expanded Transformers.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Optimizing Latency for Online Video Captioning Using Audio-Visual Transformers.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Overview of the seventh Dialog System Technology Challenge: DSTC7.
Comput. Speech Lang., 2020

Multi-Pass Transformer for Machine Translation.
CoRR, 2020

Spatio-Temporal Scene Graphs for Video Dialog.
CoRR, 2020

Spatio-Temporal Ranked-Attention Networks for Video Captioning.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Transformer-Based Long-Context End-to-End Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Multi-Layer Content Interaction Through Quaternion Product for Visual Question Answering.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Analysis of Malicious Email Detection using Cialdini's Principles.
Proceedings of the 15th Asia Joint Conference on Information Security, 2020

2019
Adversarial training and decoding strategies for end-to-end neural conversation models.
Comput. Speech Lang., 2019

Overview of the sixth dialog system technology challenge: DSTC6.
Comput. Speech Lang., 2019

Automatic evaluation of end-to-end dialog systems with adequacy-fluency metrics.
Comput. Speech Lang., 2019

The Eighth Dialog System Technology Challenge.
CoRR, 2019

Dialog System Technology Challenge 7.
CoRR, 2019

Reports of the Workshops Held at the 2019 AAAI Conference on Artificial Intelligence.
AI Mag., 2019

Joint Student-Teacher Learning for Audio-Visual Scene-Aware Dialog.
Proceedings of the Interspeech 2019, 2019

End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features.
Proceedings of the IEEE International Conference on Acoustics, 2019

Audio Visual Scene-Aware Dialog.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7.
CoRR, 2018

Multimodal Attention for Fusion of Audio and Spatiotemporal Features for Video Description.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

2017
Attention-Based Multimodal Fusion for Video Description.
CoRR, 2017

End-to-end Conversation Modeling Track in DSTC6.
CoRR, 2017

Attention-Based Multimodal Fusion for Video Description.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Early and late integration of audio features for automatic video description.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016
Superpositional HMM-Based Intonation Synthesis Using a Functional F0 Model.
J. Signal Process. Syst., 2016

Combination of multiple acoustic models with unsupervised adaptation for lecture speech transcription.
Speech Commun., 2016

Speaker Adaptive Training Localizing Speaker Modules in DNN for Hybrid DNN-HMM Speech Recognizers.
IEICE Trans. Inf. Syst., 2016

Dialog state tracking with attention-based sequence-to-sequence learning.
Proceedings of the 2016 IEEE Spoken Language Technology Workshop, 2016

Context-Sensitive and Role-Dependent Spoken Language Understanding Using Bidirectional and Attention LSTMs.
Proceedings of the Interspeech 2016, 2016

Driver confusion status detection using recurrent neural networks.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2016

Minimum word error training of long short-term memory recurrent neural network language models for speech recognition.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Leveraging social Q&A collections for improving complex question answering.
Comput. Speech Lang., 2015

A cloud robotics approach towards dialogue-oriented robot speech.
Adv. Robotics, 2015

The Application of Phrase Based Statistical Machine Translation Techniques to Myanmar Grapheme to Phoneme Conversion.
Proceedings of the Computational Linguistics, 2015

HMM based myanmar text to speech system.
Proceedings of the INTERSPEECH 2015, 2015

Sparse representation with temporal max-smoothing for acoustic event detection.
Proceedings of the INTERSPEECH 2015, 2015

Speaker adaptive training for deep neural networks embedding linear transformation networks.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Extraction of pitch register from expressive speech in Japanese.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

A Myanmar large vocabulary continuous speech recognition system.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
A MAP-based Online Estimation Approach to Ensemble Speaker and Speaking Environment Modeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2014

Incorporating local information of the acoustic environments to MAP-based feature compensation and acoustic model adaptation.
Comput. Speech Lang., 2014

Efficient multi-lingual unsupervised acoustic model training under mismatch conditions.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

The NCT ASR system for IWSLT 2014.
Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2014, 2014

Spectral patch based sparse coding for acoustic event detection.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Mandarin speech recognition using convolution neural network with augmented tone features.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Ensemble modeling of denoising autoencoder for speech spectrum restoration.
Proceedings of the INTERSPEECH 2014, 2014

Non-monologue HMM-based speech synthesis for service robots: A cloud robotics approach.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Translating TED speeches by recurrent neural network based translation model.
Proceedings of the IEEE International Conference on Acoustics, 2014

Speaker Adaptive Training using Deep Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2014

Sparse representation based on a bag of spectral exemplars for acoustic event detection.
Proceedings of the IEEE International Conference on Acoustics, 2014

Semantic context inference for spoken document retrieval using term association matrices.
Proceedings of the IEEE International Conference on Acoustics, 2014

Recurrent Neural Network-based Tuple Sequence Model for Machine Translation.
Proceedings of the COLING 2014, 2014

Tuning intonation with pitch accent decomposition for HMM-based expressive speech synthesis.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Incorporating tone features to convolutional neural network to improve Mandarin/Thai speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

2013
Controlling Tradeoff Between Approximation Accuracy and Complexity of a Smooth Function in a Reproducing Kernel Hilbert Space for Noise Reduction.
IEEE Trans. Signal Process., 2013

Collecting Colloquial and Spontaneous-like Sentences from Web Resources for Constructing Chinese Language Models of Speech Recognition.
J. Inf. Process., 2013

A-STAR: Toward translating Asian spoken languages.
Comput. Speech Lang., 2013

WFST-Based Spoken Dialogue System on Smartphones - Its Development and Implementation for Field Use.
Proceedings of the 2013 IEEE 14th International Conference on Mobile Data Management, Milan, Italy, June 3-6, 2013, 2013

Multilingual Speech-to-Speech Translation System: VoiceTra.
Proceedings of the 2013 IEEE 14th International Conference on Mobile Data Management, Milan, Italy, June 3-6, 2013, 2013

The NICT ASR system for IWSLT 2013.
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2013, 2013

Cross-lingual acoustic model adaptation based on transfer vector field smoothing with MAP.
Proceedings of the INTERSPEECH 2013, 2013

A targets-based superpositional model of fundamental frequency contours applied to HMM-based speech synthesis.
Proceedings of the INTERSPEECH 2013, 2013

Speech enhancement based on deep denoising autoencoder.
Proceedings of the INTERSPEECH 2013, 2013

Speech spectrum restoration based on conditional restricted boltzmann machine.
Proceedings of the INTERSPEECH 2013, 2013

A lecture transcription system combining neural network acoustic and language models.
Proceedings of the INTERSPEECH 2013, 2013

Joint analysis of vocal tract length and temporal information for robust speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Speaker clustering using vector representation with long-term feature for lecture speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2013

Semantic inference based on neural probabilistic language modeling for speech indexing.
Proceedings of the IEEE International Conference on Acoustics, 2013

Feature normalization using MVAW processing for spoken language recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

Classification of children with voice impairments using deep neural networks.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Distributed speech translation technologies for multiparty multilingual communication.
ACM Trans. Speech Lang. Process., 2012

Sequence-Based Pronunciation Variation Modeling for Spontaneous ASR Using a Noisy Channel Approach.
IEICE Trans. Inf. Syst., 2012

The NICT ASR system for IWSLT2012.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Factored recurrent neural network language model in TED lecture transcription.
Proceedings of the 2012 International Workshop on Spoken Language Translation, 2012

Controlling the tradeoff property in a regularization framework for noise reduction.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Acoustic space partition based on broad phonetic class for ensemble acoustic modeling.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Collecting sentences from web resources for constructing spontaneous Chinese language model.
Proceedings of the 8th International Symposium on Chinese Spoken Language Processing, 2012

Tied-State Mixture Language Model for WFST-based Speech Recognition.
Proceedings of the INTERSPEECH 2012, 2012

Leveraging Social Annotation for Topic Language Model Adaptation.
Proceedings of the INTERSPEECH 2012, 2012

Improving WFST-based G2P Conversion with Alignment Constraints and RNNLM N-best Rescoring.
Proceedings of the INTERSPEECH 2012, 2012

Speech restoration based on deep learning autoencoder with layer-wised pretraining.
Proceedings of the INTERSPEECH 2012, 2012

Ensemble Classifiers Using Unsupervised Data Selection for Speaker Recognition.
Proceedings of the INTERSPEECH 2012, 2012

A Specialized WFST Approach for Class Models and Dynamic Vocabulary.
Proceedings of the INTERSPEECH 2012, 2012

A linear projection approach to environment modeling for robust speech recognition.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

A comparison of dynamic WFST decoding approaches.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Factored Language Model based on Recurrent Neural Network.
Proceedings of the COLING 2012, 2012

2011
Modeling spoken decision support dialogue and optimization of its dialogue strategy.
ACM Trans. Speech Lang. Process., 2011

Investigation on the effects of ASR tuning on speech translation performance.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

The NICT ASR system for IWSLT2011.
Proceedings of the 2011 International Workshop on Spoken Language Translation, 2011

Conditional Random Fields for Modeling Korean Pronunciation Variation.
Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems, 2011

Incorporating Regional Information to Enhance MAP-Based Stochastic Feature Compensation for Robust Speech Recognition.
Proceedings of the INTERSPEECH 2011, 2011

User Study of Spoken Decision Support System.
Proceedings of the INTERSPEECH 2011, 2011

Answering Complex Questions via Exploiting Social Q&A Collection.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Improving Related Entity Finding via Incorporating Homepages and Recognizing Fine-grained Entities.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Dialogue Acts Annotation to Construct Dialogue Systems for Consulting.
Proceedings of the Spoken Dialogue Systems Technology and Design, 2011

Online Learning of Bayes Risk-Based Optimization of Dialogue Management for Document Retrieval Systems with Speech Interface.
Proceedings of the Spoken Dialogue Systems Technology and Design, 2011

2010
NiCT at TREC 2010: Related Entity Finding.
Proceedings of The Nineteenth Text REtrieval Conference, 2010

Dialogue strategy optimization to assist user's decision for spoken consulting dialogue systems.
Proceedings of the 2010 IEEE Spoken Language Technology Workshop, 2010

Modeling Spoken Decision Making Dialogue and Optimization of its Dialogue Strategy.
Proceedings of the SIGDIAL 2010 Conference, 2010

Dialogue Acts Annotation for NICT Kyoto Tour Dialogue Corpus to Construct Statistical Dialogue Systems.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Sightseeing Guidance Systems Based on WFST-Based Dialogue Manager.
Proceedings of the Spoken Dialogue Systems for Ambient Environments, 2010

Construction and Experiment of a Spoken Consulting Dialogue System.
Proceedings of the Spoken Dialogue Systems for Ambient Environments, 2010

Expansion of WFST-Based Dialog Management for Handling Multiple ASR Hypotheses.
Proceedings of the Spoken Dialogue Systems for Ambient Environments, 2010

Web text classification for response generation in spoken decision support dialogue systems.
Proceedings of the 4th International Universal Communication Symposium, 2010

2009
Consolidation-Based Speech Translation and Evaluation Approach.
IEICE Trans. Inf. Syst., 2009

Network-based speech-to-speech translation.
Proceedings of the 2009 International Workshop on Spoken Language Translation, 2009

Dialogue act annotation for consulting dialogue corpus.
Proceedings of the 3rd International Universal Communication Symposium, 2009

Evaluation for WFST-based dialog management.
Proceedings of the 3rd International Universal Communication Symposium, 2009

Annotating communicative function and semantic content in dialogue act for construction of consulting dialogue systems.
Proceedings of the INTERSPEECH 2009, 2009

Recent advances in WFST-based dialog system.
Proceedings of the INTERSPEECH 2009, 2009

Statistical dialog management applied to WFST-based dialog systems.
Proceedings of the IEEE International Conference on Acoustics, 2009

The Asian network-based speech-to-speech translation system.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Weighted finite state transducer based statistical dialog management.
Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition & Understanding, 2009

Annotating Dialogue Acts to Construct Dialogue Systems for Consulting.
Proceedings of the 7th Workshop on Asian Language Resources, 2009

2008
Dialogue Act Annotation for Statistically Managed Spoken Dialogue Systems.
Proceedings of the ISUC 2008, 2008

A Statistical Approach to Expandable Spoken Dialog Systems using WFSTs.
Proceedings of the ISUC 2008, 2008

Detection of feeling through back-channels in spoken dialogue.
Proceedings of the INTERSPEECH 2008, 2008

Dialog management using weighted finite-state transducers.
Proceedings of the INTERSPEECH 2008, 2008

2007
Efficient WFST-Based One-Pass Decoding With On-The-Fly Hypothesis Rescoring in Extremely Large Vocabulary Continuous Speech Recognition.
IEEE Trans. Speech Audio Process., 2007

Consolidation based speech translation.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2005
The CMU statistical machine translation system for IWSLT 2005.
Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005

Machine translation evaluation inside QARLA.
Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005

Overview of the IWSLT 2005 evaluation campaign.
Proceedings of the 2005 International Workshop on Spoken Language Translation, 2005

Spontaneous speech consolidation for spoken language applications.
Proceedings of the INTERSPEECH 2005, 2005

2004
Speech-to-text and speech-to-speech summarization of spontaneous speech.
IEEE Trans. Speech Audio Process., 2004

Speech Summarization: An Approach through Word Extraction and a Method for Evaluation.
IEICE Trans. Inf. Syst., 2004

Fast on-the-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition.
Proceedings of the INTERSPEECH 2004, 2004

2003
A new approach to automatic speech summarization.
IEEE Trans. Multim., 2003

A Statistical Approach to Automatic Speech Summarization.
EURASIP J. Adv. Signal Process., 2003

Speech summarization using weighted finite-state transducers.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Evaluation method for automatic speech summarization.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Automatic speech summarization based on sentence extraction and compaction.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Deriving disambiguous queries in a spoken interactive ODQA system.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Spoken Interactive ODQA System: SPIQA.
Proceedings of the ACL 2003, 2003

2002
Erratum: Language modeling by stochastic dependency grammer for Japanese speech recognition.
Syst. Comput. Jpn., 2002

Construction and evaluation of language models based on stochastic context-free grammar for speech recognition Chiori Hori, Masaharu Katoh, Akinori Ito, Masaki Koh.
Syst. Comput. Jpn., 2002

Automatic speech summarization applied to English broadcast news speech.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Language modeling by stochastic dependency grammar for Japanese speech recognition.
Syst. Comput. Jpn., 2001

Towards automatic transcription of spontaneous presentations.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Advances in automatic speech summarization.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Ubiquitous speech processing.
Proceedings of the IEEE International Conference on Acoustics, 2001

2000
Improvements in automatic speech summarization and evaluation methods.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Automatic speech summarization based on word significance and linguistic likelihood.
Proceedings of the IEEE International Conference on Acoustics, 2000


  Loading...