Quoc Truong Do

Orcid: 0000-0003-1472-1370

According to our database1, Quoc Truong Do authored at least 27 papers between 2014 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
AdapITN: A Fast, Reliable, and Dynamic Adaptive Inverse Text Normalization.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
End-to-end named entity recognition for Vietnamese speech.
Proceedings of the 25th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2022

2021
Toward Human-Friendly ASR Systems: Recovering Capitalization and Punctuation for Vietnamese Text.
IEICE Trans. Inf. Syst., 2021

Improving Speaker Verification in Noisy Environment Using DNN Classifier.
Proceedings of the RIVF International Conference on Computing and Communication Technologies, 2021

A Study on Neural-Network-Based Text-to-Speech Adaptation Techniques for Vietnamese.
Proceedings of the 24th Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2021

2020
Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models.
Proceedings of the Interspeech 2020, 2020

2019
VAIS Hate Speech Detection System: A Deep Learning based Approach for System Combination.
CoRR, 2019

A high quality and phonetic balanced speech corpus for Vietnamese.
CoRR, 2019

Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging.
Proceedings of the 22nd Conference of the Oriental COCOSDA International Committee for the Co-ordination and Standardisation of Speech Databases and Assessment Techniques, 2019

Recovering Capitalization for Automatic Speech Recognition of Vietnamese using Transformer and Chunk Merging.
Proceedings of the 11th International Conference on Knowledge and Systems Engineering, 2019

2018
VAIS-1000: A Vietnamese Speech Synthesis Corpus.
Dataset, November, 2018

Sequence-to-Sequence Models for Emphasis Speech Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Toward Multi-Features Emphasis Speech Translation: Assessment of Human Emphasis Production and Perception with Speech and Text Clues.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Multi-Modal Multi-Task Deep Learning For Speaker And Emotion Recognition Of TV-Series Data.
Proceedings of the 2018 Oriental COCOSDA, 2018

Japanese-English Code-Switching Speech Data Construction.
Proceedings of the 2018 Oriental COCOSDA, 2018

Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

2017
Preserving Word-Level Emphasis in Speech-to-Speech Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Toward Expressive Speech Translation: A Unified Sequence-to-Sequence LSTMs Approach for Translating Words and Emphasis.
Proceedings of the Interspeech 2017, 2017

2016
A Hybrid System for Continuous Word-Level Emphasis Modeling Based on HMM State Clustering and Adaptive Training.
Proceedings of the Interspeech 2016, 2016

Transferring Emphasis in Speech Translation Using Hard-Attentional Neural Network Models.
Proceedings of the Interspeech 2016, 2016

Learning a Lexicon and Translation Model from Phoneme Lattices.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

2015
The NAIST English speech recognition system for IWSLT 2015.
Proceedings of the 12th International Workshop on Spoken Language Translation: Evaluation Campaign@IWSLT 2015, 2015

Improving translation of emphasis with pause prediction in speech-to-speech translation systems.
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015

Preserving word-level emphasis in speech-to-speech translation using linear regression HSMMs.
Proceedings of the INTERSPEECH 2015, 2015

WFST-based structural classification integrating dnn acoustic features and RNN language features for speech recognition.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

The NAIST ASR system for the 2015 Multi-Genre Broadcast challenge: On combination of deep learning systems using a rank-score function.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Collection and analysis of a Japanese-English emphasized speech corpora.
Proceedings of the 2014 17th Oriental Chapter of the International Committee for the Co-ordination and Standardization of Speech Databases and Assessment Techniques (COCOSDA), 2014


  Loading...