Yonghui Wu

According to our database, Yonghui Wu authored at least 183 papers between 2000 and 2022.

Bibliography

2022
Distributed Event-Triggered Consensus of General Linear Multiagent Systems Under Directed Graphs.
IEEE Trans. Cybern., 2022

Biases in using social media data for public health surveillance: A scoping review.
Int. J. Medical Informatics, 2022

N-Grammer: Augmenting Transformers with latent n-grams.
CoRR, 2022

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation.
CoRR, 2022

Building Machine Translation Systems for the Next Thousand Languages.
CoRR, 2022

CoCa: Contrastive Captioners are Image-Text Foundation Models.
CoRR, 2022

GatorTron: A Large Clinical Language Model to Unlock Patient Information from Unstructured Electronic Health Records.
CoRR, 2022

Description-Driven Task-Oriented Dialog Modeling.
CoRR, 2022

Show, Don't Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Self-supervised learning with random-projection quantizer for speech recognition.
Proceedings of the International Conference on Machine Learning, 2022

SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Extracting social determinants of health from electronic health records using natural language processing: a systematic review.
J. Am. Medical Informatics Assoc., 2021

Gendered linguistic structures and the innovation performance of new ventures in emerging countries: the moderating effects of digitalisation and the entrepreneurial ecosystem.
Int. J. Technol. Manag., 2021

Vector-quantized Image Modeling with Improved VQGAN.
CoRR, 2021

BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition.
CoRR, 2021

A Study of Social and Behavioral Determinants of Health in Lung Cancer Patients Using Transformers-based Natural Language Processing Models.
CoRR, 2021

Clinical Relation Extraction Using Transformer-based Models.
CoRR, 2021

GSPMD: General and Scalable Parallelization for ML Computation Graphs.
CoRR, 2021

PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS.
CoRR, 2021

Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling.
CoRR, 2021

Improving Longer-range Dialogue State Tracking.
CoRR, 2021

Distilling Interpretable Models into Human-Readable Code.
CoRR, 2021

Interpretable Ranking with Generalized Additive Models.
Proceedings of the WSDM '21, 2021

RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Dual-mode ASR: Unify and Improve Streaming ASR with Full-context Modeling.
Proceedings of the 9th International Conference on Learning Representations, 2021

Identify Diabetic Retinopathy-related Clinical Concepts Using Transformer-based Natural Language Processing Methods.
Proceedings of the 9th IEEE International Conference on Healthcare Informatics, 2021

FastEmit: Low-Latency Streaming ASR with Sequence-Level Emission Regularization.
Proceedings of the IEEE International Conference on Acoustics, 2021

A Better and Faster end-to-end Model for Streaming ASR.
Proceedings of the IEEE International Conference on Acoustics, 2021

Parallel Tacotron: Non-Autoregressive and Controllable TTS.
Proceedings of the IEEE International Conference on Acoustics, 2021

Effective Sequence-to-Sequence Dialogue State Tracking.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Transformer-based named entity recognition for parsing clinical trial eligibility criteria.
Proceedings of the BCB '21: 12th ACM International Conference on Bioinformatics, 2021

w2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Output-based event-triggered consensus of general linear multi-agent systems with communication delay under directed graphs.
J. Frankl. Inst., 2020

Clinical concept extraction using transformers.
J. Am. Medical Informatics Assoc., 2020

Identifying relations of medications with adverse drug events using recurrent convolutional neural networks and gradient boosting.
J. Am. Medical Informatics Assoc., 2020

Assessing the practice of data quality evaluation in a national clinical data research network through a systematic scoping review in the era of real-world data.
J. Am. Medical Informatics Assoc., 2020

Identification of important factors in an inpatient fall risk prediction model to improve the quality of care using EHR and electronic administrative data: A machine-learning approach.
Int. J. Medical Informatics, 2020

Assessing mental health signals among sexual and gender minorities using Twitter data.
Health Informatics J., 2020

Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition.
CoRR, 2020

Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling.
CoRR, 2020

Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling.
CoRR, 2020

Interpretable Learning-to-Rank with Generalized Additive Models.
CoRR, 2020

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency.
CoRR, 2020

Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior.
CoRR, 2020

Improved Noisy Student Training for Automatic Speech Recognition.
Proceedings of the Interspeech 2020, 2020

ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context.
Proceedings of the Interspeech 2020, 2020

Conformer: Convolution-augmented Transformer for Speech Recognition.
Proceedings of the Interspeech 2020, 2020

A Natural Language Processing Tool to Extract Quantitative Smoking Status from Clinical Narratives.
Proceedings of the 8th IEEE International Conference on Healthcare Informatics, 2020

A 26.5GHz Wideband Gilbert-Cell Mixer MMIC Based on InP DHBT Technology.
Proceedings of the 20th IEEE International Conference on Communication Technology, 2020

Event-Triggered Consensus of Linear Multi-agent Systems with Input Time Delay.
Proceedings of the 16th IEEE International Conference on Control & Automation, 2020

Improving Speech Recognition Using Consistent Predictions on Synthesized Speech.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Fully-Hierarchical Fine-Grained Prosody Modeling For Interpretable Speech Synthesis.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Generating Diverse and Natural Text-to-Speech Samples Using a Quantized Fine-Grained VAE and Autoregressive Prosody Prior.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020


Specaugment on Large Scale Datasets.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Towards Fast and Accurate Streaming End-To-End ASR.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Developing and Validating a Computable Phenotype for the Identification of Transgender and Gender Nonconforming Individuals and Subgroups.
Proceedings of the AMIA 2020, 2020

Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Input Time Delay Margin in Event-Triggered Consensus of Multiagent Systems.
IEEE Trans. Cybern., 2019

A Stereo-Vision System for Measuring the Ram Speed of Steam Hammers in an Environment with a Large Field of View and Strong Vibrations.
Sensors, 2019

A study of deep learning methods for de-identification of clinical notes in cross-institute settings.
BMC Medical Informatics Decis. Mak., 2019

Applying a deep learning-based sequence labeling approach to detect attributes of medical concepts in clinical text.
BMC Medical Informatics Decis. Mak., 2019

Time-sensitive clinical concept embeddings learned from large electronic health records.
BMC Medical Informatics Decis. Mak., 2019

Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges.
CoRR, 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.
CoRR, 2019

GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Gmail Smart Compose: Real-Time Assisted Writing.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning.
Proceedings of the Interspeech 2019, 2019

LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech.
Proceedings of the Interspeech 2019, 2019


Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model.
Proceedings of the Interspeech 2019, 2019

Direct Speech-to-Speech Translation with a Sequence-to-Sequence Model.
Proceedings of the Interspeech 2019, 2019

Hierarchical Generative Modeling for Controllable Speech Synthesis.
Proceedings of the 7th International Conference on Learning Representations, 2019

A Study of Deep Learning Methods for De-identification of Clinical Notes at Cross Institute Settings.
Proceedings of the 2019 IEEE International Conference on Healthcare Informatics, 2019

Detect Attributes of Medical Concepts via Sequence Labeling.
Proceedings of the 2019 IEEE International Conference on Healthcare Informatics, 2019

Bytes Are All You Need: End-to-end Multilingual Speech Recognition and Synthesis with Bytes.
Proceedings of the IEEE International Conference on Acoustics, 2019

Leveraging Weakly Supervised Data to Improve End-to-end Speech-to-text Translation.
Proceedings of the IEEE International Conference on Acoustics, 2019

Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization.
Proceedings of the IEEE International Conference on Acoustics, 2019


Speech Recognition with Augmented Synthesized Speech.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

A Comparison of End-to-End Models for Long-Form Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

A 20GS/s Track-and-Hold Amplifier based on InP DHBT Process.
Proceedings of the 13th IEEE International Conference on ASIC, 2019

Identifying Cancer Patients at Risk for Heart Failure Using Machine Learning Methods.
Proceedings of the AMIA 2019, 2019

2018
A study of generalizability of recurrent neural network-based predictive models for heart failure onset risk using a large and heterogeneous EHR data set.
J. Biomed. Informatics, 2018

CLAMP - a toolkit for efficiently building customized clinical natural language processing pipelines.
J. Am. Medical Informatics Assoc., 2018

PIE: A prior knowledge guided integrated likelihood estimation method for bias reduction in association studies using electronic health records data.
J. Am. Medical Informatics Assoc., 2018

Extraction of BI-RADS findings from breast ultrasound reports in Chinese using deep learning approaches.
Int. J. Medical Informatics, 2018

The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation.
CoRR, 2018

A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Detecting Medications and Adverse Drug Events in Clinical Notes Using Recurrent Neural Networks.
Proceedings of the 1st International Workshop on Medication and Adverse Drug Event Detection, 2018

Event-Triggered Consensus of General Linear Multi-agent System with Time Delay.
Proceedings of the Advances in Neural Networks - ISNN 2018, 2018

Compression of End-to-End Models.
Proceedings of the Interspeech 2018, 2018


Assessing Mental Health Signals Among Sexual and Gender Minorities using Twitter Data.
Proceedings of the IEEE International Conference on Healthcare Informatics Workshops, 2018

Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Improving the Performance of Online Neural Transducer Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Minimum Word Error Rate Training for Attention-Based Sequence-to-Sequence Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An Analysis of Incorporating an External Language Model into a Sequence-to-Sequence Model.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

State-of-the-Art Speech Recognition with Sequence-to-Sequence Models.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

CaLcs: Continuously Approximating Longest Common Subsequence for Sequence Level Optimization.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Training Deeper Neural Machine Translation Models with Transparent Attention.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Computable Eligibility Criteria through Ontology-driven Data Access: A Case Study of Hepatitis C Virus Trials.
Proceedings of the AMIA 2018, 2018

Combine Factual Medical Knowledge and Distributed Word Representation to Improve Clinical Named Entity Recognition.
Proceedings of the AMIA 2018, 2018

The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation.
Trans. Assoc. Comput. Linguistics, 2017

A long journey to short abbreviations: developing an open-source framework for clinical abbreviation recognition and disambiguation (CARD).
J. Am. Medical Informatics Assoc., 2017

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions.
CoRR, 2017

Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model.
CoRR, 2017

Sequence-to-Sequence Models Can Directly Transcribe Foreign Speech.
CoRR, 2017

Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model.
CoRR, 2017

Comparing Cancer Information Needs for Consumers in the US and China.
Proceedings of the MEDINFO 2017: Precision Healthcare through Informatics, 2017

Sequence-to-Sequence Models Can Directly Translate Foreign Speech.
Proceedings of the Interspeech 2017, 2017


A comparative study of different methods for automatic identification of clopidogrel-induced bleedings in electronic health records.
Proceedings of the Summit on Clinical Research Informatics, 2017

Evaluating Word Embeddings from Multiple Domains for Symptom Recognition in Psychiatric Notes.
Proceedings of the AMIA 2017, 2017

Detecting Body Location Modifiers of Disorders in Clinical Texts via Sequence Labeling.
Proceedings of the AMIA 2017, 2017

Detecting Contradictory and Consistent Citations in Biomedical Literature.
Proceedings of the AMIA 2017, 2017

CLAMP - A User-Centric Clinical Natural Language Processing Toolkit.
Proceedings of the AMIA 2017, 2017

Clinical Named Entity Recognition Using Deep Learning Models.
Proceedings of the AMIA 2017, 2017

2016
Extracting genetic alteration information for personalized cancer therapy from ClinicalTrials.gov.
J. Am. Medical Informatics Assoc., 2016

Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation.
CoRR, 2016

Exploring the Limits of Language Modeling.
CoRR, 2016

Chemical named entity recognition in patents by domain knowledge and unsupervised feature learning.
Database J. Biol. Databases Curation, 2016

CD-REST: a system for extracting chemical-induced disease relation in literature.
Database J. Biol. Databases Curation, 2016

UTHealth at SemEval-2016 Task 12: an End-to-End System for Temporal Information Extraction from Clinical Notes.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Reward Augmented Maximum Likelihood for Neural Structured Prediction.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

What Can Neural Networks Learn from Unlabeled Clinical Narratives?
Proceedings of the AMIA 2016, 2016

An Empirical Study for Impacts of Measurement Errors on EHR based Association Studies.
Proceedings of the AMIA 2016, 2016

2015
Scattering Mechanism Extraction by a Modified Cloude-Pottier Decomposition for Dual Polarization SAR.
Remote. Sens., 2015

Deciphering Signaling Pathway Networks to Understand the Molecular Mechanisms of Metformin Action.
PLoS Comput. Biol., 2015

A comparison of conditional random fields and structured support vector machines for chemical entity recognition in biomedical literature.
J. Cheminformatics, 2015

UTH-CCB: The Participation of the SemEval 2015 Challenge - Task 14.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

Named Entity Recognition in Chinese Clinical Text Using Deep Neural Network.
Proceedings of the MEDINFO 2015: eHealth-enabled Health, 2015

Clinical Abbreviation Disambiguation Using Neural Word Embeddings.
Proceedings of the Workshop on Biomedical Natural Language Processing, BioNLP@IJCNLP 2015, 2015

A Study of Neural Word Embeddings for Named Entity Recognition in Clinical Text.
Proceedings of the AMIA 2015, 2015

Recognizing Disjoint Clinical Concepts in Clinical Text Using Machine Learning-based Methods.
Proceedings of the AMIA 2015, 2015

Clinical Language Annotation, Modeling, and Processing Toolkit (CLAMP) - a user-centric NLP system.
Proceedings of the AMIA 2015, 2015

Citation Sentiment Analysis in Clinical Trial Papers.
Proceedings of the AMIA 2015, 2015

2014
UTH_CCB: A report for SemEval 2014 - Task 7 Analysis of Clinical Text.
Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

Domain Adaptation for Semantic Role Labeling of Clinical Text.
Proceedings of the AMIA 2014, 2014

Development of a Unified Computable Problem-Medication Knowledge base.
Proceedings of the AMIA 2014, 2014

2013
Combinatorial Pooling Enables Selective Sequencing of the Barley Gene Space.
PLoS Comput. Biol., 2013

Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features.
BMC Medical Informatics Decis. Mak., 2013

A hybrid system for temporal information extraction from clinical text.
J. Am. Medical Informatics Assoc., 2013

Analyzing Differences between Chinese and English Clinical Text: A Cross-Institution Comparison of Discharge Summaries in Two Languages.
Proceedings of the MEDINFO 2013, 2013

Clinical Acronym/Abbreviation Normalization using a Hybrid Approach.
Proceedings of the Working Notes for CLEF 2013 Conference, 2013

Recognizing and Encoding Discorder Concepts in Clinical Text using Machine Learning and Vector Space Model.
Proceedings of the Working Notes for CLEF 2013 Conference, 2013

A prototype application for real-time recognition and disambiguation of clinical abbreviations.
Proceedings of the 7th International Workshop on Data and Text Mining in Bioinformatics, 2013

Building a Large Clinical Abbreviation Sense Inventory from Discharge Summaries.
Proceedings of the AMIA 2013, 2013

2012
A new clustering method for detecting rare senses of abbreviations in clinical notes.
J. Biomed. Informatics, 2012

Large-scale prediction of adverse drug reactions using chemical, biological, and phenotypic properties of drugs.
J. Am. Medical Informatics Assoc., 2012

DTome: a web-based tool for drug-target interactome construction.
BMC Bioinform., 2012

Ranking Gene-Drug Relationships in Biomedical Literature Using Latent Dirichlet Allocation.
Proceedings of the Biocomputing 2012: Proceedings of the Pacific Symposium, 2012

Detecting Adverse Drug Reactions Using Inpatient Medication Orders and Laboratory Tests Data.
Proceedings of the 2012 IEEE Second International Conference on Healthcare Informatics, 2012

Clinical entity recognition using structural support vector machines with rich features.
Proceedings of the ACM sixth international workshop on Data and text mining in biomedical informatics, 2012

A comparative study of current clinical natural language processing systems on handling abbreviations in discharge summaries.
Proceedings of the AMIA 2012, 2012

Clinical Entity Recognition Using Structural Support Vector Machines.
Proceedings of the AMIA 2012, 2012

MedEx-UIMA - An Open-Source System for Medication Information Extraction from Clinical Text.
Proceedings of the AMIA 2012, 2012

2011
Accurate Construction of Consensus Genetic Maps via Integer Linear Programming.
IEEE ACM Trans. Comput. Biol. Bioinform., 2011

Barcoding-free BAC Pooling Enables Combinatorial Selective Sequencing of the Barley Gene Space.
CoRR, 2011

2010
On-line Hot Topic Recommendation Using Tolerance Rough Set Based Topic Clustering.
J. Comput., 2010

Efficient Genome-Wide TagSNP Selection Across Populations via the Linkage Disequilibrium Criterion.
J. Comput. Biol., 2010

Topic Detection by Topic Model Induced Distance Using Biased Initiation.
Proceedings of the Advances in Computer Science and Information Technology, 2010

Topic based automatic news recommendation using topic model and affinity propagation.
Proceedings of the International Conference on Machine Learning and Cybernetics, 2010

2009
STRank: A SiteRank Algorithm Using Semantic Relevance and Time Frequency.
Proceedings of the IEEE International Conference on Systems, 2009

2008
Region-Based Classification of Polarimetric SAR Images Using Wishart MRF.
IEEE Geosci. Remote. Sens. Lett., 2008

Deconvoluting BAC-Gene Relationships Using a Physical Map.
J. Bioinform. Comput. Biol., 2008

A Linear-Time Algorithm for Predicting Functional Annotations from PPI Networks.
J. Bioinform. Comput. Biol., 2008

Genre identification of Chinese finance text using machine learning method.
Proceedings of the IEEE International Conference on Systems, 2008

2007
Efficient and Accurate Construction of Genetic Linkage Maps from Noisy and Missing Genotyping Data.
Proceedings of the Algorithms in Bioinformatics, 7th International Workshop, 2007

Clock-frequency assignment for multiple clock domain systems-on-a-chip.
Proceedings of the 2007 Design, Automation and Test in Europe Conference and Exposition, 2007

Two-level microprocessor-accelerator partitioning.
Proceedings of the 2007 Design, Automation and Test in Europe Conference and Exposition, 2007

2006
Error-Resilient LZW Data Compression.
Proceedings of the 2006 Data Compression Conference (DCC 2006), 2006

2005
Effective statistical features for coding and non-coding DNA sequence classification for yeast, C. elegans and human.
Int. J. Bioinform. Res. Appl., 2005

2004
Selection of Statistical Features Based on Mutual Information for Classification of Human Coding and Non-coding DNA Sequences.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

2000
Implementation and Proof for Normalization Design of Object-Oriented Data Schemes.
Proceedings of the TOOLS Asia 2000: 36th International Conference on Technology of Object-Oriented Languages and Systems, Xi'an, China, 30 October, 2000

