Jian Luan

Orcid: 0000-0002-2383-226X

Affiliations:
  • Xiaoice, Software Technology Center Asia, Microsoft


According to our database, Jian Luan authored at least 26 papers between 2006 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
CBSiMT: Mitigating Hallucination in Simultaneous Machine Translation with Weighted Prefix-to-Prefix Training.
CoRR, 2023

The Xiaomi AI Lab's Speech Translation Systems for IWSLT 2023 Offline Task, Simultaneous Task and Speech-to-Speech Task.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Rethinking the Reasonability of the Test Set for Simultaneous Machine Translation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Joint Training and Decoding for Multilingual End-to-End Simultaneous Speech Translation.
Proceedings of the IEEE International Conference on Acoustics, 2023

Exploring All-In-One Knowledge Distillation Framework for Neural Machine Translation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Exploring Better Text Image Translation with Multimodal Codebook.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

BERT-ERC: Fine-Tuning BERT Is Enough for Emotion Recognition in Conversation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Improve Bilingual TTS Using Dynamic Language and Phonology Embedding.
CoRR, 2022

J-TranPSP: A Joint Transition-based Model for Prosodic Structure Prediction, Word Segmentation and PoS Tagging.
Proceedings of the 13th International Symposium on Chinese Spoken Language Processing, 2022

Improving Emotional Speech Synthesis by Using SUS-Constrained VAE and Text Encoder Aggregation.
Proceedings of the IEEE International Conference on Acoustics, 2022

MSDTRON: A High-Capability Multi-Speaker Speech Synthesis System for Diverse Data Using Characteristic Information.
Proceedings of the IEEE International Conference on Acoustics, 2022

PAMA-TTS: Progression-Aware Monotonic Attention for Stable SEQ2SEQ TTS with Accurate Phoneme Duration Control.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Effective and Differentiated Use of Control Information for Multi-speaker Speech Synthesis.
CoRR, 2021

Noise Robust Singing Voice Synthesis Using Gaussian Mixture Variational Autoencoder.
Proceedings of the ICMI '21 Companion: Companion Publication of the 2021 International Conference on Multimodal Interaction, Montreal, QC, Canada, October 18, 2021

2020
HiFiSinger: Towards High-Fidelity Neural Singing Voice Synthesis.
CoRR, 2020

PPSpeech: Phrase based Parallel End-to-End TTS System.
CoRR, 2020

DeepSinger: Singing Voice Synthesis with Data Mined From the Web.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Re-Weighted Interval Loss for Handling Data Imbalance Problem of End-to-End Keyword Spotting.
Proceedings of the Interspeech 2020, 2020

Adversarially Trained Multi-Singer Sequence-to-Sequence Singing Synthesizer.
Proceedings of the Interspeech 2020, 2020

XiaoiceSing: A High-Quality and Integrated Singing Voice Synthesis System.
Proceedings of the Interspeech 2020, 2020

Transfer Learning for Improving Singing-Voice Detection in Polyphonic Instrumental Music.
Proceedings of the Interspeech 2020, 2020

2019
Vocal Pitch Extraction in Polyphonic Music Using Convolutional Residual Network.
Proceedings of the Interspeech 2019, 2019

2012
Expand CRF to Model Long Distance Dependencies in Prosodic Break Prediction.
Proceedings of the INTERSPEECH 2012, 2012

2010
Improvement on plural unit selection and fusion.
Proceedings of the INTERSPEECH 2010, 2010

2007
Codebook-Based Pseudo-Impostor Data Generation and Template Compression for Text-Dependent Speaker Verification.
IEICE Trans. Inf. Syst., 2007

2006
Template Compression and Distance Normalization for Reliable Text-dependent Speaker Verification.
Proceedings of the Odyssey 2006, 2006

