Xiaohai Tian

According to our database1, Xiaohai Tian authored at least 42 papers between 2010 and 2021.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Other 

Links

On csauthors.net:

Bibliography

2021
Factorized WaveNet for voice conversion with limited data.
Speech Commun., 2021

Optimizing Voice Conversion Network with Cycle Consistency Loss of Speaker Identity.
Proceedings of the IEEE Spoken Language Technology Workshop, 2021

The Multi-Speaker Multi-Style Voice Cloning Challenge 2021.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Multi-Task WaveRNN With an Integrated Architecture for Cross-Lingual Voice Conversion.
IEEE Signal Process. Lett., 2020

NHSS: A Speech and Singing Parallel Database.
CoRR, 2020

Predictions of Subjective Ratings and Spoofing Assessments of Voice Conversion Challenge 2020 Submissions.
CoRR, 2020

Voice Conversion Challenge 2020: Intra-lingual semi-parallel and cross-lingual voice conversion.
CoRR, 2020

Black-box Attacks on Automatic Speaker Verification using Feedback-controlled Voice Conversion.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Personalized Singing Voice Generation Using WaveRNN.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

The Attacker's Perspective on Automatic Speaker Verification: An Overview.
Proceedings of the Interspeech 2020, 2020

End-to-End Code-Switching TTS with Cross-Lingual Language Model.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Effective Wavenet Adaptation for Voice Conversion with Limited Data.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
A Vocoder-free WaveNet Voice Conversion with Non-Parallel Data.
CoRR, 2019

A Speaker-Dependent WaveNet for Voice Conversion with Non-Parallel Data.
Proceedings of the Interspeech 2019, 2019

Cross-lingual Voice Conversion with Bilingual Phonetic Posteriorgram and Average Modeling.
Proceedings of the IEEE International Conference on Acoustics, 2019

A Modularized Neural Network with Language-Specific Output Layers for Cross-Lingual Voice Conversion.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

WaveNet Factorization with Singular Value Decomposition for Voice Conversion.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Many-to-many Cross-lingual Voice Conversion with a Jointly Trained Speaker Embedding Network.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

Speaker-independent Spectral Mapping for Speech-to-Singing Conversion.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Average Modeling Approach to Voice Conversion with Non-Parallel Data.
Proceedings of the Odyssey 2018: The Speaker and Language Recognition Workshop, 2018

Usability Analysis of the Novel Functions to Assist the Senior Customers in Online Shopping.
Proceedings of the Social Computing and Social Media. User Experience and Behavior, 2018

2017
An Exemplar-Based Approach to Frequency Warping for Voice Conversion.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Towards Age-friendly E-commerce Through Crowd-Improved Speech Recognition, Multimodal Search, and Personalized Speech Feedback.
Proceedings of the 2nd International Conference on Crowd Science and Engineering, 2017

Improving air traffic control speech intelligibility by reducing speaking rate effectively.
Proceedings of the 2017 International Conference on Asian Language Processing, 2017

Novel Functional Technologies for Age-Friendly E-commerce.
Proceedings of the Human Aspects of IT for the Aged Population. Applications, Services and Contexts, 2017

An investigation of spectral feature partitioning for replay attacks detection.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
High quality voice conversion using prosodic and high-resolution spectral features.
Multim. Tools Appl., 2016

Spoofing detection under noisy conditions: a preliminary investigation and an initial database.
CoRR, 2016

An Automatic Voice Conversion Evaluation Strategy Based on Perceptual Background Noise Distortion and Speaker Similarity.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

An Investigation of Spoofing Speech Detection Under Additive Noise and Reverberant Conditions.
Proceedings of the Interspeech 2016, 2016

Spoofing detection from a feature representation perspective.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Spoofing speech detection using temporal convolutional neural network.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016

2015
Spoofing speech detection using high dimensional magnitude and phase features: the NTU approach for ASVspoof 2015 challenge.
Proceedings of the INTERSPEECH 2015, 2015

System fusion for high-performance voice conversion.
Proceedings of the INTERSPEECH 2015, 2015

Personalized synthetic voices for speaking impaired: website and app.
Proceedings of the INTERSPEECH 2015, 2015

Sparse representation for frequency warping based voice conversion.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Detecting synthetic speech using long term magnitude and phase information.
Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing, 2015

A waveform representation framework for high-quality statistical parametric speech synthesis.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
Correlation-based frequency warping for voice conversion.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

A comparative study of spectral transformation techniques for singing voice synthesis.
Proceedings of the INTERSPEECH 2014, 2014

2013
Local partial least square regression for spectral mapping in voice conversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2010
Speech and Auditory Interfaces for Ubiquitous, Immersive and Personalized Applications.
Proceedings of the Symposia and Workshops on Ubiquitous, 2010


  Loading...