Nicolas Obin
Orcid: 0000-0002-5236-5306
According to our database1,
Nicolas Obin
authored at least 52 papers
between 2008 and 2023.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2023
Zero-shot style transfer for gesture animation driven by text and speech using adversarial disentanglement of multimodal style encoding.
Frontiers Artif. Intell., February, 2023
Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations.
Entropy, February, 2023
META4: Semantically-Aligned Generation of Metaphoric Gestures Using Self-Supervised Text and Speech Representation.
CoRR, 2023
TranSTYLer: Multimodal Behavioral Style Transfer for Facial and Body Gestures Generation.
CoRR, 2023
ZS-MSTM: Zero-Shot Style Transfer for Gesture Animation driven by Text and Speech using Adversarial Disentanglement of Multimodal Style Encoding.
CoRR, 2023
I-Brow: Hierarchical and Multimodal Transformer Model for Eyebrows Animation Synthesis.
Proceedings of the Artificial Intelligence in HCI, 2023
Proceedings of the 17th IEEE International Conference on Automatic Face and Gesture Recognition, 2023
From signal representation to representation learning: structured modeling of speech signals. (De la représentation du signal à l'apprentissage de représentation : modélisation structurée de signaux de parole).
, 2023
2022
Rookognise: Acoustic detection and identification of individual rooks in field recordings using multi-task neural networks.
Ecol. Informatics, 2022
Zero-Shot Style Transfer for Gesture Animation driven by Text and Speech using Adversarial Disentanglement of Multimodal Style Encoding.
CoRR, 2022
Proceedings of the 30th European Signal Processing Conference, 2022
Voice Reenactment with F0 and timing constraints and adversarial learning of conversions.
Proceedings of the 30th European Signal Processing Conference, 2022
2021
Sequence-To-Sequence Voice Conversion using F0 and Time Conditioning and Adversarial Learning.
CoRR, 2021
Beyond Voice Identity Conversion: Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations.
CoRR, 2021
Towards end-to-end F0 voice conversion based on Dual-GAN with convolutional wavelet kernels.
CoRR, 2021
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021
Towards end-to-end F0 voice conversion based on Dual-GAN with convolutional wavelet kernels.
Proceedings of the 29th European Signal Processing Conference, 2021
2020
La voix actée : pratiques, enjeux, applications (Acted voice : practices, challenges, applications).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020
Proceedings of the 28th European Signal Processing Conference, 2020
2019
SoftGAN: Learning generative models efficiently with application to CycleGAN Voice Conversion.
CoRR, 2019
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Binaural Localization of Multiple Sound Sources by Non-Negative Tensor Factorization.
IEEE ACM Trans. Audio Speech Lang. Process., 2018
2016
IEEE ACM Trans. Audio Speech Lang. Process., 2016
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016
2015
IEEE ACM Trans. Audio Speech Lang. Process., 2015
Real-time audio-to-score alignment of singing voice based on melody and lyric information.
Proceedings of the INTERSPEECH 2015, 2015
The role of glottal source parameters for high-quality transformation of perceptual age.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015
2014
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014
Phase distortion statistics as a representation of the glottal source: application to the classification of voice qualities.
Proceedings of the INTERSPEECH 2014, 2014
On automatic voice casting for expressive speech: Speaker recognition vs. speech classification.
Proceedings of the IEEE International Conference on Acoustics, 2014
2013
Syll-O-Matic: An adaptive time-frequency representation for the automatic segmentation of speech into syllables.
Proceedings of the IEEE International Conference on Acoustics, 2013
2012
A la recherche des temps perdus : Variations sur le rythme en français (Regional Variations of Speech Rhythm in French: In Search of Lost Times) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012
La variation prosodique dialectale en français. Données et hypothèses (Speech Prosody of Dialectal French: Data and Hypotheses) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012
Proceedings of the 2012 IEEE Spoken Language Technology Workshop (SLT), 2012
Proceedings of the INTERSPEECH 2012, 2012
Proceedings of the INTERSPEECH 2012, 2012
Proceedings of the INTERSPEECH 2012, 2012
2011
Stylization and Trajectory Modelling of Short and Long Term Speech Prosody Variations.
Proceedings of the INTERSPEECH 2011, 2011
Discrete/Continuous Modelling of Speaking Style in HMM-Based Speech Synthesis: Design and Evaluation.
Proceedings of the INTERSPEECH 2011, 2011
Proceedings of the INTERSPEECH 2011, 2011
Toward a Continuous Modeling of French Prosodic Structure: Using Acoustic Features to Predict Prominence Location and Prominence Degree.
Proceedings of the INTERSPEECH 2011, 2011
Proceedings of the 17th International Congress of Phonetic Sciences, 2011
2010
Proceedings of the INTERSPEECH 2010, 2010
Proceedings of the INTERSPEECH 2010, 2010
Design and Evaluation of Shared Prosodic Annotation for Spontaneous French Speech: From Expert Knowledge to Non-Expert Annotation.
Proceedings of the Fourth Linguistic Annotation Workshop, 2010
2009
Proceedings of the INTERSPEECH 2009, 2009
2008
A method for automatic and dynamic estimation of discourse genre typology with prosodic features.
Proceedings of the INTERSPEECH 2008, 2008
Proceedings of the IEEE International Conference on Acoustics, 2008