Proceedings of the 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2022

2021

Toward Human-Friendly ASR Systems: Recovering Capitalization and Punctuation for Vietnamese Text.

IEICE Trans. Inf. Syst., 2021

Improving Speaker Verification in Noisy Environment Using DNN Classifier.

Proceedings of the RIVF International Conference on Computing and Communication Technologies, 2021

A Study on Neural-Network-Based Text-to-Speech Adaptation Techniques for Vietnamese.

Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021

2020

Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models.

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

2019

VAIS Hate Speech Detection System: A Deep Learning based Approach for System Combination.

CoRR, 2019

A high quality and phonetic balanced speech corpus for Vietnamese.

Pham Ngoc Phuong

Quoc Truong Do

Luong Chi Mai

CoRR, 2019

Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging.

Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019

Recovering Capitalization for Automatic Speech Recognition of Vietnamese using Transformer and Chunk Merging.

Huyen Nguyen Thi Minh

Proceedings of the 11th International Conference on Knowledge and Systems Engineering, 2019

2018

VAIS-1000: A Vietnamese Speech Synthesis Corpus.

Quoc Truong Do

Chi Mai Luong

Dataset, November, 2018

Sequence-to-Sequence Models for Emphasis Speech Translation.

Quoc Truong Do

Sakriani Sakti

Satoshi Nakamura

IEEE ACM Trans. Audio Speech Lang. Process., 2018

Toward Multi-Features Emphasis Speech Translation: Assessment of Human Emphasis Production and Perception with Speech and Text Clues.

Quoc Truong Do

Sakriani Sakti

Satoshi Nakamura

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Multi-Modal Multi-Task Deep Learning For Speaker And Emotion Recognition Of TV-Series Data.

Proceedings of the 2018 Oriental COCOSDA, 2018

Japanese-English Code-Switching Speech Data Construction.

Proceedings of the 2018 Oriental COCOSDA, 2018

Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas.

Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

2017

Preserving Word-Level Emphasis in Speech-to-Speech Translation.

IEEE ACM Trans. Audio Speech Lang. Process., 2017

Toward Expressive Speech Translation: A Unified Sequence-to-Sequence LSTMs Approach for Translating Words and Emphasis.

Quoc Truong Do

Sakriani Sakti

Satoshi Nakamura

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

A Hybrid System for Continuous Word-Level Emphasis Modeling Based on HMM State Clustering and Adaptive Training.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Transferring Emphasis in Speech Translation Using Hard-Attentional Neural Network Models.

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Learning a Lexicon and Translation Model from Phoneme Lattices.

Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015

The NAIST English speech recognition system for IWSLT 2015.

Proceedings of the 12th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2015, 2015

Improving translation of emphasis with pause prediction in speech-to-speech translation systems.

Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015

Preserving word-level emphasis in speech-to-speech translation using linear regression HSMMs.

Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

WFST-based structural classification integrating DNN acoustic features and RNN language features for speech recognition.

Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

The NAIST ASR system for the 2015 Multi-Genre Broadcast challenge: On combination of deep learning systems using a rank-score function.

Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014

Collection and analysis of a Japanese-English emphasized speech corpora.

Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014

Quoc Truong Do
