Bajibabu Bollepalli

Orcid: 0000-0003-1268-0579

According to our database1, Bajibabu Bollepalli authored at least 30 papers between 2011 and 2022.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of five.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2022
Distribution Augmentation for Low-Resource Expressive Text-To-Speech.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech.
CoRR, 2021

Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear Prediction Analysis and Deep Neural Networks.
IEEE Access, 2021

2020
Multiscale System for Alzheimer's Dementia Recognition Through Spontaneous Speech.
Proceedings of the Interspeech 2020, 2020

2019
GlotNet - A Raw Waveform Model for the Glottal Excitation in Statistical Parametric Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Normal-to-Lombard adaptation of speech synthesis using long short-term memory recurrent neural networks.
Speech Commun., 2019

GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-Spectrogram.
Proceedings of the Interspeech 2019, 2019

Lombard Speech Synthesis Using Transfer Learning in a Tacotron Text-to-Speech System.
Proceedings of the Interspeech 2019, 2019

Waveform Generation for Text-to-speech Synthesis Using Pitch-synchronous Multi-scale Generative Adversarial Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
A Comparison Between STRAIGHT, Glottal, and Sinusoidal Vocoding in Statistical Parametric Speech Synthesis.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Speaking style adaptation in Text-To-Speech synthesis using Sequence-to-sequence models with attention.
CoRR, 2018

Speaker-independent Raw Waveform Model for Glottal Excitation.
Proceedings of the Interspeech 2018, 2018

Speech Waveform Synthesis from MFCC Sequences with Generative Adversarial Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
Glottal Vocoding With Frequency-Warped Time-Weighted Linear Prediction.
IEEE Signal Process. Lett., 2017

Reducing Mismatch in Training of DNN-Based Glottal Excitation Models in a Statistical Parametric Text-to-Speech System.
Proceedings of the Interspeech 2017, 2017

Generative Adversarial Network-Based Glottal Waveform Model for Statistical Parametric Speech Synthesis.
Proceedings of the Interspeech 2017, 2017

Lombard speech synthesis using long short-term memory recurrent neural networks.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Frequency-warped time-weighted linear prediction for glottal vocoding.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
DNN-based Speech Synthesis for Indian Languages from ASCII text.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

GlottDNN - A Full-Band Glottal Vocoder for Statistical Parametric Speech Synthesis.
Proceedings of the Interspeech 2016, 2016

High-pitched excitation generation for glottal vocoding in statistical parametric speech synthesis using a deep neural network.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2014
The Tutorbot Corpus ― A Corpus for Studying Tutoring Behaviour in Multiparty Face-to-Face Spoken Dialogue.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

A comparative evaluation of vocoding techniques for HMM-based laughter synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014

Human-robot collaborative tutoring using multiparty multimodal spoken dialogue.
Proceedings of the ACM/IEEE International Conference on Human-Robot Interaction, 2014

Effect of MPEG audio compression on vocoders used in statistical parametric speech synthesis.
Proceedings of the 22nd European Signal Processing Conference, 2014

2013
Non-linear Pitch Modification in Voice Conversion Using Artificial Neural Networks.
Proceedings of the Advances in Nonlinear Speech Processing - 6th International Conference, 2013

Effect of MPEG audio compression on HMM-based speech synthesis.
Proceedings of the INTERSPEECH 2013, 2013

Tutoring Robots - Multiparty Multimodal Social Dialogue with an Embodied Tutor.
Proceedings of the Innovative and Creative Developments in Multimodal Interaction Systems, 2013

2012
Modelling a Noisy-channel for Voice Conversion Using Articulatory Features.
Proceedings of the INTERSPEECH 2012, 2012

2011
SWS task: Articulatory phonetic units and sliding DTW.
Proceedings of the Working Notes Proceedings of the MediaEval 2011 Workshop, 2011


  Loading...