Duc Le

Orcid: 0000-0001-7771-7729

According to our database1, Duc Le authored at least 63 papers between 2005 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
sCL-ST: Supervised Contrastive Learning With Semantic Transformations for Multiple Lead ECG Arrhythmia Classification.
IEEE J. Biomed. Health Informatics, June, 2023

Seq2seq for Automatic Paraphasia Detection in Aphasic Speech.
CoRR, 2023

StemGen: A music generation model that listens.
CoRR, 2023

A Foundation Model for Music Informatics.
CoRR, 2023

Scaling Up Music Information Retrieval Training with Semi-Supervised Learning.
CoRR, 2023

Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding.
CoRR, 2023

Text Generation with Speech Synthesis for ASR Data Augmentation.
CoRR, 2023

Learning ASR Pathways: A Sparse Multilingual ASR Model.
Proceedings of the IEEE International Conference on Acoustics, 2023

Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities.
Proceedings of the IEEE International Conference on Acoustics, 2023

ICASSP 2023 Spoken Language Understanding Grand Challenge.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improving fast-slow Encoder based Transducer with Streaming Deliberation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Factorized Blank Thresholding for Improved Runtime Efficiency of Neural Transducers.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition.
CoRR, 2022

Learning ASR pathways: A sparse multilingual ASR model.
CoRR, 2022

STOP: A dataset for Spoken Task Oriented Semantic Parsing.
CoRR, 2022

Stop: A Dataset for Spoken Task Oriented Semantic Parsing.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Scaling ASR Improves Zero and Few Shot Learning.
Proceedings of the Interspeech 2022, 2022

Streaming parallel transducer beam search with fast slow cascaded encoders.
Proceedings of the Interspeech 2022, 2022

Deliberation Model for On-Device Spoken Language Understanding.
Proceedings of the Interspeech 2022, 2022

Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric.
Proceedings of the Interspeech 2022, 2022

Neural-FST Class Language Model for End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

Joint Audio/Text Training for Transformer Rescorer of Streaming Speech Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Multimodality Multi-Lead ECG Arrhythmia Classification using Self-Supervised Learning.
Proceedings of the IEEE-EMBS International Conference on Biomedical and Health Informatics, 2022

2021
Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios.
CoRR, 2021

Wearable Metasurface-Enabled Quasi-Yagi Antenna for UHF RFID Reader With End-Fire Radiation Along the Forearm.
IEEE Access, 2021

Alignment Restricted Streaming Recurrent Neural Network Transducer.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Improving RNN Transducer Based ASR with Auxiliary Tasks.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Deep Shallow Fusion for RNN-T Personalization.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Dissecting User-Perceived Latency of On-Device E2E Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Emformer: Efficient Memory Transformer Based Acoustic Model for Low Latency Streaming Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Emformer: Efficient Memory Transformer Based Acoustic Model For Low Latency Streaming Speech Recognition.
CoRR, 2020

Weak-Attention Suppression for Transformer Based Speech Recognition.
Proceedings of the Interspeech 2020, 2020

Transformer-Based Acoustic Modeling for Hybrid Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

ML-Assisted Monitoring and Characterization of IoT Sensor Networks.
Proceedings of the 2020 IEEE Conference on Evolving and Adaptive Intelligent Systems, 2020

2019
Neural network modeling of monthly salinity variations in oyster reef in Apalachicola Bay in response to freshwater inflow and winds.
Neural Comput. Appl., 2019

Transformer-Transducer: End-to-End Speech Recognition with Self-Attention.
CoRR, 2019

The integrated National NeuroAIDS Tissue Consortium database: a rich platform for neuroHIV research.
Database J. Biol. Databases Curation, 2019

From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

2018
Automatic quantitative analysis of spontaneous aphasic speech.
Speech Commun., 2018

Real-time Air Pollution prediction model based on Spatiotemporal Big data.
CoRR, 2018

Classification of Huntington Disease Using Acoustic and Lexical Features.
Proceedings of the Interspeech 2018, 2018

2017
Towards Automatic Speech-Language Assessment for Aphasia Rehabilitation.
PhD thesis, 2017

Automatic Paraphasia Detection from Aphasic Speech: A Preliminary Study.
Proceedings of the Interspeech 2017, 2017

Discretized Continuous Speech Emotion Recognition with Multi-Task Deep Recurrent Neural Network.
Proceedings of the Interspeech 2017, 2017

2016
Automatic Assessment of Speech Intelligibility for Individuals With Aphasia.
IEEE ACM Trans. Audio Speech Lang. Process., 2016

Improving Automatic Recognition of Aphasic Speech with AphasiaBank.
Proceedings of the Interspeech 2016, 2016

Wild wild emotion: a multimodal ensemble approach.
Proceedings of the 18th ACM International Conference on Multimodal Interaction, 2016

2015
Data selection for acoustic emotion recognition: Analyzing and comparing utterance and sub-utterance selection strategies.
Proceedings of the 2015 International Conference on Affective Computing and Intelligent Interaction, 2015

2014
MuCheck: an extensible tool for mutation testing of haskell programs.
Proceedings of the International Symposium on Software Testing and Analysis, 2014

Modeling pronunciation, rhythm, and intonation for automatic assessment of speech quality in aphasia rehabilitation.
Proceedings of the INTERSPEECH 2014, 2014

Automatic analysis of speech quality for aphasia treatment.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
A preliminary study of cross-lingual emotion recognition from speech: automatic classification versus human perception.
Proceedings of the INTERSPEECH 2013, 2013

Emotion recognition from spontaneous speech using Hidden Markov models with deep belief networks.
Proceedings of the 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013

2011
#ifdef confirmed harmful: Promoting understandable software variation.
Proceedings of the 2011 IEEE Symposium on Visual Languages and Human-Centric Computing, 2011

Support for software variation editing.
Proceedings of the 2011 IEEE Symposium on Visual Languages and Human-Centric Computing, 2011

2005
A 10.24GSPS photonic sampled bandpass ΔΣ modulator direct-sampling at 12GHz.
Proceedings of the IEEE 2005 Custom Integrated Circuits Conference, 2005


  Loading...