Nirmesh J. Shah

Orcid: 0000-0002-7294-6757

Affiliations:
  • Sony Research India


According to our database1, Nirmesh J. Shah authored at least 32 papers between 2013 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning.
CoRR, 2024

2023
Nonparallel Emotional Voice Conversion for Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Semi-supervised Acoustic and Language Modeling for Hindi ASR.
Proceedings of the Interspeech 2022, 2022

M2FNet: Multi-modal Fusion Network for Emotion Recognition in Conversation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
Exploiting Phase-based Features for Whisper vs. Speech Classification.
Proceedings of the 29th European Signal Processing Conference, 2021

2020
Intelligibility Improvement of Dysarthric Speech using MMSE DiscoGAN.
Proceedings of the International Conference on Signal Processing and Communications, 2020

Query-By-Example Spoken Term Detection Using Generative Adversarial Network.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
A novel approach to remove outliers for parallel voice conversion.
Comput. Speech Lang., 2019

Whether to Pretrain DNN or not?: An Empirical Analysis for Voice Conversion.
Proceedings of the Interspeech 2019, 2019

Phone Aware Nearest Neighbor Technique Using Spectral Transition Measure for Non-Parallel Voice Conversion.
Proceedings of the Interspeech 2019, 2019

Novel Metric Learning for Non-parallel Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2019

Effectiveness of Cross-Domain Architectures for Whisper-to-Normal Speech Conversion.
Proceedings of the 27th European Signal Processing Conference, 2019

Novel Adaptive Generative Adversarial Network for Voice Conversion.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Effectiveness of Generative Adversarial Network for Non-Audible Murmur-to-Whisper Speech Conversion.
Proceedings of the Interspeech 2018, 2018

Effectiveness of Dynamic Features in INCA and Temporal Context-INCA.
Proceedings of the Interspeech 2018, 2018

Unsupervised Vocal Tract Length Warped Posterior Features for Non-Parallel Voice Conversion.
Proceedings of the Interspeech 2018, 2018

Novel Inter Mixture Weighted GMM Posteriorgram for DNN and GAN-based Voice Conversion.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2017
Analysis of Features and Metrics for Alignment in Text-Dependent Voice Conversion.
Proceedings of the Pattern Recognition and Machine Intelligence, 2017

Novel Amplitude Scaling method for bilinear frequency Warping-based Voice Conversion.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Quality assessment of voice converted speech using articulatory features.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

On the convergence of INCA algorithm.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

A novel filtering-based F0 estimation algorithm with an application to voice conversion.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Novel Pre-processing using Outlier Removal in Voice Conversion.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

2015
Effectiveness of multiscale fractal dimension for improvement of frame classification rate.
Proceedings of the 23rd European Signal Processing Conference, 2015

2014
Effectiveness of fractal dimension for ASR in low resource language.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Deterministic annealing EM algorithm for developing TTS system in Gujarati.
Proceedings of the 9th International Symposium on Chinese Spoken Language Processing, 2014

Effectiveness of PLP-based phonetic segmentation for speech synthesis.
Proceedings of the IEEE International Conference on Acoustics, 2014

Effectiveness of multiscale fractal dimension-based phonetic segmentation in speech synthesis for low resource language.
Proceedings of the 2014 International Conference on Asian Language Processing, 2014

Influence of various asymmetrical contextual factors for TTS in a low resource language.
Proceedings of the 2014 International Conference on Asian Language Processing, 2014

2013
Algorithms for speech segmentation at syllable-level for text-to-speech synthesis system in Gujarati.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

A syllable-based framework for unit selection synthesis in 13 Indian languages.
Proceedings of the 2013 International Conference Oriental COCOSDA held jointly with 2013 Conference on Asian Spoken Language Research and Evaluation (O-COCOSDA/CASLRE), 2013

A Novel Gaussian Filter-Based Automatic Labeling of Speech Data for TTS System in Gujarati Language.
Proceedings of the 2013 International Conference on Asian Language Processing, 2013


  Loading...