We stand with Ukraine

We stand with Ukraine

Yonghui Wu

This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.

Known people with the same name:

Bibliography

2025

Seedream 4.0: Toward Next-generation Multimodal Image Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

Agentic AutoSurvey: Let LLMs Survey LLMs.

[BibT_eX]

[DOI]

,

,

,

CoRR, September, 2025

Balanced Actor Initialization: Stable RLHF Training of Distillation-Based Reasoning Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, September, 2025

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, August, 2025

Reliable Indoor Localization in Multibuilding Environments: Leveraging Environment-Invariant and Position-Related Features.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

IEEE Internet Things J., July, 2025

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Thomas Hanwen Zhu

CoRR, July, 2025

Seed LiveInterpret 2.0: End-to-end Simultaneous Speech-to-speech Translation with Your Voice.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, July, 2025

Seed-X: Building Strong Multilingual Translation LLM with 7B Parameters.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, July, 2025

Truncated Proximal Policy Optimization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, June, 2025

Seed-Coder: Let the Code Model Curate Data for Itself.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, June, 2025

EvaLearn: Quantifying the Learning Capability and Efficiency of LLMs via Sequential Problem Solving.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Zongzhang Zhang

,

,

,

,

,

,

,

CoRR, June, 2025

Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, May, 2025

Model Merging in Pre-training of Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2025

Natural Language Generation in Healthcare: A Review of Methods and Applications.

[BibT_eX]

[DOI]

,

,

,

,

,

Sankalp Talankar

,

CoRR, May, 2025

VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Cheng-Xiang Wang

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, April, 2025

A Unified Pairwise Framework for RLHF: Bridging Generative Reward Modeling and Policy Optimization.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, April, 2025

DAPO: An Open-Source LLM Reinforcement Learning System at Scale.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

Guangming Sheng

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, March, 2025

FSLNet: Filter sensitivity-based lightweight network for rice leaf disease recognition.

[BibT_eX]

[DOI]

,

,

,

,

,

Comput. Electron. Agric., 2025

Optimization of motion strategy for a micro multi-functional chassis based on RBF neural network in intercropping mode.

[BibT_eX]

[DOI]

,

,

,

,

,

Comput. Electron. Agric., 2025

2024

Automatic Summarization of Doctor-Patient Encounter Dialogues Using Large Language Model through Prompt Tuning.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Llama-TCR: Generate De Novo TCR with Large Language Model.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Rick Siow Mong Goh

Proceedings of the IEEE Conference on Artificial Intelligence, 2024

TTCR: Accurate TCR-Epitope Binding Affinity Prediction Using Transformers.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Joyjit Chattoraj

,

,

,

,

Rick Siow Mong Goh

Proceedings of the IEEE Conference on Artificial Intelligence, 2024

Constructions of Teaching Materials, Curriculums, and the Teaching System Cross-Region for "Solving Problems by Programming".

[BibT_eX]

[DOI]

,

Proceedings of the Computing and Combinatorics - 30th International Conference, 2024

2023

Combined scaling for zero-shot transfer learning.

[BibT_eX]

[DOI]

,

,

,

Kenji Kawaguchi

,

,

,

,

,

Minh-Thang Luong

,

,

,

Neurocomputing, October, 2023

SLM: Bridge the thin gap between speech and text foundation models.

[BibT_eX]

[DOI]

,

,

,

,

Chung-Cheng Chiu

,

,

,

,

,

,

Paul K. Rubenstein

,

,

,

,

,

Nikhil Siddhartha

,

Johan Schalkwyk

,

CoRR, 2023

Efficient Adapters for Giant Speech Models.

[BibT_eX]

[DOI]

,

,

,

Chung-Cheng Chiu

,

,

,

CoRR, 2023

Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages.

[BibT_eX]

[DOI]

CoRR, 2023

AnyTOD: A Programmable Task-Oriented Dialog System.

[BibT_eX]

[DOI]

,

,

,

,

Abhinav Rastogi

,

,

,

,

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining.

[BibT_eX]

[DOI]

,

,

,

,

Peyman Milanfar

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SLM: Bridge the Thin Gap Between Speech and Text Foundation Models.

[BibT_eX]

[DOI]

,

,

,

,

Chung-Cheng Chiu

,

,

,

,

,

Paul K. Rubenstein

,

,

,

,

Nikhil Siddhartha

,

Johan Schalkwyk

,

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

A High Precision Capacitive Isolation Amplifier for Current Sensing Applications.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 15th IEEE International Conference on ASIC, 2023

2022

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Vijay Vasudevan

,

,

,

Burcu Karagol Ayan

,

,

,

,

,

,

Jason Baldridge

,

Trans. Mach. Learn. Res., 2022

CoCa: Contrastive Captioners are Image-Text Foundation Models.

[BibT_eX]

[DOI]

,

,

Vijay Vasudevan

,

,

Mojtaba Seyedhosseini

,

Trans. Mach. Learn. Res., 2022

BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition.

[BibT_eX]

[DOI]

IEEE J. Sel. Top. Signal Process., 2022

Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2022

N-Grammer: Augmenting Transformers with latent n-grams.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

Christopher Fifty

,

,

CoRR, 2022

Building Machine Translation Systems for the Next Thousand Languages.

[BibT_eX]

[DOI]

CoRR, 2022

Description-Driven Task-Oriented Dialog Modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Abhinav Rastogi

,

,

CoRR, 2022

Confusing Traffic against Intra-domain Webpage Fingerprinting Attacks.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Trust, 2022

Show, Don't Tell: Demonstrations Outperform Descriptions for Schema-Guided Task-Oriented Dialogue.

[BibT_eX]

[DOI]

,

,

,

,

Abhinav Rastogi

,

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Pathways: Asynchronous Distributed Dataflow for ML.

[BibT_eX]

[DOI]

,

Aakanksha Chowdhery

,

,

Sanjay Ghemawat

,

,

,

,

,

,

,

,

,

,

Laurent El Shafey

,

Chandramohan A. Thekkath

,

Proceedings of the Fifth Conference on Machine Learning and Systems, 2022

Training Text-To-Speech Systems From Synthetic Data: A Practical Approach For Accent Transfer Tasks.

[BibT_eX]

[DOI]

Lev Finkelstein

,

,

Norman Casagrande

,

,

,

,

,

,

,

,

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

GLaM: Efficient Scaling of Language Models with Mixture-of-Experts.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Machine Learning, 2022

Self-supervised learning with random-projection quantizer for speech recognition.

[BibT_eX]

[DOI]

Chung-Cheng Chiu

,

,

,

,

Proceedings of the International Conference on Machine Learning, 2022

Vector-quantized Image Modeling with Improved VQGAN.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Jason Baldridge

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

Improving The Latency And Quality Of Cascaded Encoders.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems.

[BibT_eX]

[DOI]

,

,

Abhinav Rastogi

,

,

,

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

GSPMD: General and Scalable Parallelization for ML Computation Graphs.

[BibT_eX]

[DOI]

,

,

,

Blake A. Hechtman

,

,

,

,

Dmitry Lepikhin

,

,

Marcello Maggioni

,

,

,

,

,

,

CoRR, 2021

Improving Longer-range Dialogue State Tracking.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2021

Distilling Interpretable Models into Human-Readable Code.

[BibT_eX]

[DOI]

,

,

Olexiy Oryeshko

,

,

,

,

,

Alexander Grushetsky

CoRR, 2021

Interpretable Ranking with Generalized Additive Models.

[BibT_eX]

[DOI]

,

,

Michael Bendersky

,

Alexander Grushetsky

,

,

,

,

,

,

Proceedings of the WSDM '21, 2021

RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions.

[BibT_eX]

[DOI]

Chung-Cheng Chiu

,

,

,

Rohit Prabhavalkar

,

,

,

,

Tara N. Sainath

,

,

,

Proceedings of the IEEE Spoken Language Technology Workshop, 2021

PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

R. J. Skerry-Ryan

,

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Dual-mode ASR: Unify and Improve Streaming ASR with Full-context Modeling.

[BibT_eX]

[DOI]

,

,

,

Chung-Cheng Chiu

,

,

Tara N. Sainath

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

FastEmit: Low-Latency Streaming ASR with Sequence-Level Emission Regularization.

[BibT_eX]

[DOI]

,

Chung-Cheng Chiu

,

,

Shuo-Yiin Chang

,

Tara N. Sainath

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

A Better and Faster end-to-end Model for Streaming ASR.

[BibT_eX]

[DOI]

,

,

,

Tara N. Sainath

,

Chung-Cheng Chiu

,

,

Shuo-Yiin Chang

,

,

,

,

,

,

,

Trevor Strohman

,

Proceedings of the IEEE International Conference on Acoustics, 2021

Parallel Tacotron: Non-Autoregressive and Controllable TTS.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

Effective Sequence-to-Sequence Dialogue State Tracking.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

w2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training.

[BibT_eX]

[DOI]

,

,

,

Chung-Cheng Chiu

,

,

,

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020

Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition.

[BibT_eX]

[DOI]

,

,

,

,

Chung-Cheng Chiu

,

,

,

CoRR, 2020

Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling.

[BibT_eX]

[DOI]

,

,

,

Chung-Cheng Chiu

,

,

Tara N. Sainath

,

,

CoRR, 2020

Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling.

[BibT_eX]

[DOI]

,

,

Mike Chrzanowski

,

,

,

,

CoRR, 2020

Interpretable Learning-to-Rank with Generalized Additive Models.

[BibT_eX]

[DOI]

,

,

Michael Bendersky

,

Alexander Grushetsky

,

,

,

,

,

,

CoRR, 2020

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency.

[BibT_eX]

[DOI]

CoRR, 2020

Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior.

[BibT_eX]

[DOI]

,

,

,

,

,

Andrew Rosenberg

,

Bhuvana Ramabhadran

,

CoRR, 2020

Improved Noisy Student Training for Automatic Speech Recognition.

[BibT_eX]

[DOI]

,

,

,

,

Chung-Cheng Chiu

,

,

,

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context.

[BibT_eX]

[DOI]

,

Zhengdong Zhang

,

,

,

Chung-Cheng Chiu

,

,

,

,

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Conformer: Convolution-augmented Transformer for Speech Recognition.

[BibT_eX]

[DOI]

,

,

Chung-Cheng Chiu

,

,

,

,

,

,

Zhengdong Zhang

,

,

Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

A 26.5GHz Wideband Gilbert-Cell Mixer MMIC Based on InP DHBT Technology.

[BibT_eX]

[DOI]

,

Proceedings of the 20th IEEE International Conference on Communication Technology, 2020

Improving Speech Recognition Using Consistent Predictions on Synthesized Speech.

[BibT_eX]

[DOI]

,

Andrew Rosenberg

,

,

,

Bhuvana Ramabhadran

,

,

Pedro J. Moreno

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Fully-Hierarchical Fine-Grained Prosody Modeling For Interpretable Speech Synthesis.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Generating Diverse and Natural Text-to-Speech Samples Using a Quantized Fine-Grained VAE and Autoregressive Prosody Prior.

[BibT_eX]

[DOI]

,

,

,

,

,

Andrew Rosenberg

,

Bhuvana Ramabhadran

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Streaming On-Device End-To-End Model Surpassing Server-Side Conventional Model Quality and Latency.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Specaugment on Large Scale Datasets.

[BibT_eX]

[DOI]

,

,

Chung-Cheng Chiu

,

,

,

,

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Towards Fast and Accurate Streaming End-To-End ASR.

[BibT_eX]

[DOI]

,

Shuo-Yiin Chang

,

Tara N. Sainath

,

,

,

Trevor Strohman

,

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Leveraging Monolingual Data with Self-Supervision for Multilingual Neural Machine Translation.

[BibT_eX]

[DOI]

Aditya Siddhant

,

,

,

,

,

Sneha Reddy Kudugunta

,

Naveen Arivazhagan

,

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

A Stereo-Vision System for Measuring the Ram Speed of Steam Hammers in an Environment with a Large Field of View and Strong Vibrations.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Sensors, 2019

Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges.

[BibT_eX]

[DOI]

Naveen Arivazhagan

,

,

,

Dmitry Lepikhin

,

,

,

,

,

George F. Foster

,

,

Wolfgang Macherey

,

,

CoRR, 2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Tara N. Sainath

,

,

Chung-Cheng Chiu

,

,

,

,

Stella Laurenzo

,

,

,

Wolfgang Macherey

,

,

,

,

,

,

Rohit Prabhavalkar

,

,

,

,

,

,

Sébastien Jean

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Kuan-Chieh Wang

,

Ekaterina Gonina

,

,

,

,

,

,

,

,

,

George F. Foster

,

John Richardson

,

,

Antoine Bruguier

,

,

,

,

,

,

,

Vijayaditya Peddinti

,

,

Michiel Bacchiani

,

Thomas B. Jablin

,

Robert Suderman

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Dmitry Lepikhin

,

,

,

,

Shubham Toshniwal

,

,

Michael Nirschl

,

CoRR, 2019

GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Gmail Smart Compose: Real-Time Assisted Writing.

[BibT_eX]

[DOI]

,

Benjamin N. Lee

,

,

,

,

,

,

,

,

,

,

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning.

[BibT_eX]

[DOI]

,

,

,

,

,

R. J. Skerry-Ryan

,

,

Andrew Rosenberg

,

Bhuvana Ramabhadran

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Two-Pass End-to-End Speech Recognition.

[BibT_eX]

[DOI]

Tara N. Sainath

,

,

,

,

Rohit Prabhavalkar

,

,

Mirkó Visontai

,

,

Trevor Strohman

,

,

,

Chung-Cheng Chiu

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model.

[BibT_eX]

[DOI]

,

Arindrima Datta

,

Tara N. Sainath

,

Eugene Weinstein

,

Bhuvana Ramabhadran

,

,

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Direct Speech-to-Speech Translation with a Sequence-to-Sequence Model.

[BibT_eX]

[DOI]

,

,

,

Wolfgang Macherey

,

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Hierarchical Generative Modeling for Controllable Speech Synthesis.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the 7th International Conference on Learning Representations, 2019

Bytes Are All You Need: End-to-end Multilingual Speech Recognition and Synthesis with Bytes.

[BibT_eX]

[DOI]

,

,

Tara N. Sainath

,

,

Proceedings of the IEEE International Conference on Acoustics, 2019

Leveraging Weakly Supervised Data to Improve End-to-end Speech-to-text Translation.

[BibT_eX]

[DOI]

,

,

Wolfgang Macherey

,

,

,

Chung-Cheng Chiu

,

,

Stella Laurenzo

,

Proceedings of the IEEE International Conference on Acoustics, 2019

Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2019

Streaming End-to-end Speech Recognition for Mobile Devices.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2019

Speech Recognition with Augmented Synthesized Speech.

[BibT_eX]

[DOI]

Andrew Rosenberg

,

,

Bhuvana Ramabhadran

,

,

Pedro J. Moreno

,

,

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

A Comparison of End-to-End Models for Long-Form Speech Recognition.

[BibT_eX]

[DOI]

Chung-Cheng Chiu

,

,

Rohit Prabhavalkar

,

,

Tara N. Sainath

,

,

,

,

,

Sergey Kishchenko

,

,

,

,

Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

A 20GS/s Track-and-Hold Amplifier based on InP DHBT Process.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 13th IEEE International Conference on ASIC, 2019

2018

The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

Wolfgang Macherey

,

George F. Foster

,

,

,

,

,

,

CoRR, 2018

A Comparison of Techniques for Language Model Integration in Encoder-Decoder Speech Recognition.

[BibT_eX]

[DOI]

Shubham Toshniwal

,

,

Chung-Cheng Chiu

,

,

Tara N. Sainath

,

Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

Ignacio López-Moreno

,

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Event-Triggered Consensus of General Linear Multi-agent System with Time Delay.

[BibT_eX]

[DOI]

,

Proceedings of the Advances in Neural Networks - ISNN 2018, 2018

Compression of End-to-End Models.

[BibT_eX]

[DOI]

,

Tara N. Sainath

,

Rohit Prabhavalkar

,

,

,

,

Chung-Cheng Chiu

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Speech Recognition for Medical Conversations.

[BibT_eX]

[DOI]

Chung-Cheng Chiu

,

Anshuman Tripathi

,

,

,

,

Diana Jaunzeikare

,

,

,

,

,

Justin Tansuwan

,

,

,

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

,

R. J. Skerry-Ryan

,

,

Yannis Agiomyrgiannakis

,

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

No Need for a Lexicon? Evaluating the Value of the Pronunciation Lexica in End-to-End Models.

[BibT_eX]

[DOI]

Tara N. Sainath

,

Rohit Prabhavalkar

,

,

,

,

,

,

,

,

,

,

Chung-Cheng Chiu

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Improving the Performance of Online Neural Transducer Models.

[BibT_eX]

[DOI]

Tara N. Sainath

,

Chung-Cheng Chiu

,

Rohit Prabhavalkar

,

,

,

,

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Minimum Word Error Rate Training for Attention-Based Sequence-to-Sequence Models.

[BibT_eX]

[DOI]

Rohit Prabhavalkar

,

Tara N. Sainath

,

,

,

,

Chung-Cheng Chiu

,

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

An Analysis of Incorporating an External Language Model into a Sequence-to-Sequence Model.

[BibT_eX]

[DOI]

,

,

,

Tara N. Sainath

,

,

Rohit Prabhavalkar

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

State-of-the-Art Speech Recognition with Sequence-to-Sequence Models.

[BibT_eX]

[DOI]

Chung-Cheng Chiu

,

Tara N. Sainath

,

,

Rohit Prabhavalkar

,

,

,

,

,

,

Ekaterina Gonina

,

,

,

,

Michiel Bacchiani

Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

CaLcs: Continuously Approximating Longest Common Subsequence for Sequence Level Optimization.

[BibT_eX]

[DOI]

,

Chung-Cheng Chiu

,

,

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Training Deeper Neural Machine Translation Models with Transparent Attention.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation.

[BibT_eX]

[DOI]

,

,

,

,

Wolfgang Macherey

,

George F. Foster

,

,

,

,

,

,

Jakob Uszkoreit

,

,

,

,

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017

Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Fernanda B. Viégas

,

Martin Wattenberg

,

,

,

Trans. Assoc. Comput. Linguistics, 2017

Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model.

[BibT_eX]

[DOI]

,

Tara N. Sainath

,

,

Michiel Bacchiani

,

Eugene Weinstein

,

,

,

,

CoRR, 2017

Sequence-to-Sequence Models Can Directly Transcribe Foreign Speech.

[BibT_eX]

[DOI]

,

,

,

,

CoRR, 2017

Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model.

[BibT_eX]

[DOI]

,

R. J. Skerry-Ryan

,

,

,

,

,

,

,

,

,

,

Yannis Agiomyrgiannakis

,

,

CoRR, 2017

Sequence-to-Sequence Models Can Directly Translate Foreign Speech.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Tacotron: Towards End-to-End Speech Synthesis.

[BibT_eX]

[DOI]

,

R. J. Skerry-Ryan

,

,

,

,

,

,

,

,

,

,

Yannis Agiomyrgiannakis

,

,

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2016

Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation.

[BibT_eX]

[DOI]

CoRR, 2016

Exploring the Limits of Language Modeling.

[BibT_eX]

[DOI]

Rafal Józefowicz

,

,

,

,

CoRR, 2016

Reward Augmented Maximum Likelihood for Neural Structured Prediction.

[BibT_eX]

[DOI]

Mohammad Norouzi

,

,

,

,

,

,

Dale Schuurmans

Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

2015

Scattering Mechanism Extraction by a Modified Cloude-Pottier Decomposition for Dual Polarization SAR.

[BibT_eX]

[DOI]

,

Remote. Sens., 2015

2013

Combinatorial Pooling Enables Selective Sequencing of the Barley Gene Space.

[BibT_eX]

[DOI]

Stefano Lonardi

,

,

,

Francesca Cordero

,

,

Prasanna R. Bhat

,

,

Gianfranco Ciardo

,

Burair Alsaihati

,

,

Steve Wanamaker

,

,

,

,

Timothy J. Close

PLoS Comput. Biol., 2013

2011

Accurate Construction of Consensus Genetic Maps via Integer Linear Programming.

[BibT_eX]

[DOI]

,

Timothy J. Close

,

Stefano Lonardi

IEEE ACM Trans. Comput. Biol. Bioinform., 2011

Barcoding-free BAC Pooling Enables Combinatorial Selective Sequencing of the Barley Gene Space

[BibT_eX]

[DOI]

Stefano Lonardi

,

,

,

Francesca Cordero

,

,

,

,

Gianfranco Ciardo

,

Burair Alsaihati

,

,

Steve Wanamaker

,

,

Timothy J. Close

CoRR, 2011

2010

Efficient Genome-Wide TagSNP Selection Across Populations via the Linkage Disequilibrium Criterion.

[BibT_eX]

[DOI]

,

,

Stefano Lonardi

,

J. Comput. Biol., 2010

2008

Region-Based Classification of Polarimetric SAR Images Using Wishart MRF.

[BibT_eX]

[DOI]

,

,

,

IEEE Geosci. Remote. Sens. Lett., 2008

Deconvoluting BAC-Gene Relationships Using a Physical Map.

[BibT_eX]

[DOI]

,

,

Timothy J. Close

,

Stefano Lonardi

J. Bioinform. Comput. Biol., 2008

A Linear-Time Algorithm for Predicting Functional Annotations from PPI Networks.

[BibT_eX]

[DOI]

,

Stefano Lonardi

J. Bioinform. Comput. Biol., 2008

2007

Efficient and Accurate Construction of Genetic Linkage Maps from Noisy and Missing Genotyping Data.

[BibT_eX]

[DOI]

,

,

Timothy J. Close

,

Stefano Lonardi

Proceedings of the Algorithms in Bioinformatics, 7th International Workshop, 2007

Clock-frequency assignment for multiple clock domain systems-on-a-chip.

[BibT_eX]

[DOI]

,

,

Stefano Lonardi

,

Proceedings of the 2007 Design, Automation and Test in Europe Conference and Exposition, 2007

Two-level microprocessor-accelerator partitioning.

[BibT_eX]

[DOI]

,

,

Stefano Lonardi

,

Proceedings of the 2007 Design, Automation and Test in Europe Conference and Exposition, 2007

2006

Error-Resilient LZW Data Compression.

[BibT_eX]

[DOI]

,

Stefano Lonardi

,

Wojciech Szpankowski

Proceedings of the 2006 Data Compression Conference (DCC 2006), 2006

2000

Implementation and Proof for Normalization Design of Object-Oriented Data Schemes.

[BibT_eX]

[DOI]

,

,

Proceedings of the TOOLS Asia 2000: 36th International Conference on Technology of Object-Oriented Languages and Systems, Xi'an, China, 30 October, 2000

Loading...