Alexander Sorin

According to our database1, Alexander Sorin authored at least 28 papers between 1996 and 2022.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2022
Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis.
Proceedings of the Interspeech 2022, 2022

2021
Synthesis of Expressive Speaking Styles with Limited Training Data in a Multi-Speaker, Prosody-Controllable Sequence-to-Sequence Architecture.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

2020
Principal Style Components: Expressive Style Control and Cross-Speaker Transfer in Neural TTS.
Proceedings of the Interspeech 2020, 2020

2019
Sequence to Sequence Neural Speech Synthesis with Prosody Modification Capabilities.
CoRR, 2019

High Quality, Lightweight and Adaptable TTS Using LPCNet.
Proceedings of the Interspeech 2019, 2019

2018
Neural TTS Voice Conversion.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

The IBM Virtual Voice Creator.
Proceedings of the Interspeech 2018, 2018

Data Augmentation Improves Recognition of Foreign Accented Speech.
Proceedings of the Interspeech 2018, 2018

2017
Semi Parametric Concatenative TTS with Instant Voice Modification Capabilities.
Proceedings of the Interspeech 2017, 2017

Voice-transformation-based data augmentation for prosodic classification.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Wideband Harmonic Model: Alignment and Noise Modeling for High Quality Speech Synthesis.
Proceedings of the 9th ISCA Speech Synthesis Workshop, 2016

2015
Coherent modification of pitch and energy for expressive prosody implantation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
Refined inter-segment joining in multi-form speech synthesis.
Proceedings of the INTERSPEECH 2014, 2014

Exploring modulation spectrum features for speech-based depression level classification.
Proceedings of the INTERSPEECH 2014, 2014

2013
Evaluation of speech-based protocol for detection of early-stage dementia.
Proceedings of the INTERSPEECH 2013, 2013

Voice-based sadness and anger recognition with cross-corpora evaluation.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Psychoacoustic Segment Scoring for Multi-Form Speech Synthesis.
Proceedings of the INTERSPEECH 2012, 2012

Towards automatic phonetic segmentation for TTS.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2011
Uniform Speech Parameterization for Multi-Form Segment Synthesis.
Proceedings of the INTERSPEECH 2011, 2011

Speech processing and retrieval in a personal memory aid system for the elderly.
Proceedings of the IEEE International Conference on Acoustics, 2011

2010
Sinusoidal model parameterization for HMM-based TTS system.
Proceedings of the INTERSPEECH 2010, 2010

2008
Using robust audio and video processing technologies to alleviate the elderly cognitive decline.
Proceedings of the 1st ACM International Conference on Pervasive Technologies Related to Assistive Environments, 2008

2006
High Quality Sinusoidal Modeling of Wideband Speech for the Purposes of Speech Synthesis and Modification.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Small footprint concatenative text-to-speech synthesis system using complex spectral envelope modeling.
Proceedings of the INTERSPEECH 2005, 2005

2004
The ETSI extended distributed speech recognition (DSR) standards: client side processing and tonal language recognition evaluation.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

The ETSI extended distributed speech recognition (DSR) standards: server-side speech reconstruction.
Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2002
Reducing the footprint of the IBM trainable speech synthesis system.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

1996
Automated forms-processing software and services.
IBM J. Res. Dev., 1996


  Loading...