Xu Tan

Affiliations:
  • Microsoft Research Asia, Beijing, China


According to our database, Xu Tan authored at least 71 papers between 2018 and 2021.

Bibliography

2021
Transformer-S2A: Robust and Efficient Speech-to-Animation.
CoRR, 2021

DelightfulTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2021.
CoRR, 2021

A study on the efficacy of model pre-training in developing neural text-to-speech system.
CoRR, 2021

FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition.
CoRR, 2021

TeleMelody: Lyric-to-Melody Generation with a Template-Based Two-Stage Method.
CoRR, 2021

PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription.
CoRR, 2021

Analyzing and Mitigating Interference in Neural Architecture Search.
CoRR, 2021

AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style.
CoRR, 2021

A Survey on Neural Speech Synthesis.
CoRR, 2021

PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Driven Adaptive Prior.
CoRR, 2021

FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition.
CoRR, 2021

Cross-domain Speech Recognition with Unsupervised Character-level Distribution Matching.
CoRR, 2021

Improving Long-Tailed Classification from Instance Level.
CoRR, 2021

NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural Architecture Search.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

A Survey on Low-Resource Neural Machine Translation.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

AdaSpeech: Adaptive Text to Speech for Custom Voice.
Proceedings of the 9th International Conference on Learning Representations, 2021

FastSpeech 2: Fast and High-Quality End-to-End Text to Speech.
Proceedings of the 9th International Conference on Learning Representations, 2021

DenoiSpeech: Denoising Text to Speech with Frame-Level Noise Modeling.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

MixSpeech: Data Augmentation for Low-Resource Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

MBNet: MOS Prediction for Synthesized Speech with Mean-Bias Network.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2021

FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

MusicBERT: Symbolic Music Understanding with Large-Scale Pre-Training.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

DeepRapper: Neural Rap Generation with Rhyme and Rhythm Modeling.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

UWSpeech: Speech to Speech Translation for Unwritten Languages.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

SongMASS: Automatic Song Writing with Pre-training and Alignment Constraint.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
HiFiSinger: Towards High-Fidelity Neural Singing Voice Synthesis.
CoRR, 2020

Neural Architecture Search with GBDT.
CoRR, 2020

LightPAFF: A Two-Stage Distillation Framework for Pre-training and Fine-tuning.
CoRR, 2020

VESR-Net: The Winning Solution to Youku Video Enhancement and Super-Resolution Challenge.
CoRR, 2020

MPNet: Masked and Permuted Pre-training for Language Understanding.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Semi-Supervised Neural Architecture Search.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

PopMAG: Pop Music Accompaniment Generation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

DualLip: A System for Joint Lip Reading and Generation.
Proceedings of the MM '20: The 28th ACM International Conference on Multimedia, 2020

LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

DeepSinger: Singing Voice Synthesis with Data Mined From the Web.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

XiaoiceSing: A High-Quality and Integrated Singing Voice Synthesis System.
Proceedings of the Interspeech 2020, 2020

MultiSpeech: Multi-Speaker Text to Speech with Transformer.
Proceedings of the Interspeech 2020, 2020

Neural Machine Translation with Error Correction.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

A Study of Non-autoregressive Model for Sequence Generation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

SimulSpeech: End-to-End Simultaneous Speech to Text Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Fine-Tuning by Curriculum Learning for Non-Autoregressive Neural Machine Translation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Beyond Error Propagation: Language Branching Also Affects the Accuracy of Sequence Generation.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

A Study of Multilingual Neural Machine Translation.
CoRR, 2019

Microsoft Research Asia's Systems for WMT19.
CoRR, 2019

Efficient Bidirectional Neural Machine Translation.
CoRR, 2019

Hard but Robust, Easy but Sensitive: How Encoder and Decoder Perform in Neural Machine Translation.
CoRR, 2019

Language Graph Distillation for Low-Resource Machine Translation.
CoRR, 2019

Microsoft Research Asia's Systems for WMT19.
Proceedings of the Fourth Conference on Machine Translation, 2019

FastSpeech: Fast, Robust and Controllable Text to Speech.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion.
Proceedings of the Interspeech 2019, 2019

Deliberation Learning for Image-to-Image Translation.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

MASS: Masked Sequence to Sequence Pre-training for Language Generation.
Proceedings of the 36th International Conference on Machine Learning, 2019

Almost Unsupervised Text to Speech and Automatic Speech Recognition.
Proceedings of the 36th International Conference on Machine Learning, 2019

Multilingual Neural Machine Translation with Knowledge Distillation.
Proceedings of the 7th International Conference on Learning Representations, 2019

Representation Degeneration Problem in Training Natural Language Generation Models.
Proceedings of the 7th International Conference on Learning Representations, 2019

Multilingual Neural Machine Translation with Language Clustering.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Knowledge Distillation from BERT in Pre-Training and Fine-Tuning for Polyphone Disambiguation.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Unsupervised Pivot Translation for Distant Languages.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Tied Transformers: Neural Machine Translation with Shared Encoder and Decoder.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Non-Autoregressive Neural Machine Translation with Enhanced Decoder Input.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Sentence-Wise Smooth Regularization for Sequence to Sequence Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Achieving Human Parity on Automatic Chinese to English News Translation.
CoRR, 2018

Layer-Wise Coordination between Encoder and Decoder for Neural Machine Translation.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

FRAGE: Frequency-Agnostic Word Representation.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Dense Information Flow for Neural Machine Translation.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Model-Level Dual Learning.
Proceedings of the 35th International Conference on Machine Learning, 2018

Beyond Error Propagation in Neural Machine Translation: Characteristics of Language Also Matter.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Double Path Networks for Sequence to Sequence Learning.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

