Mickael Rouvier

Orcid: 0000-0003-3541-3385

Affiliations:
  • Aix Marseille Universite, France


According to our database1, Mickael Rouvier authored at least 81 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Asymmetric and trial-dependent modeling: the contribution of LIA to SdSV Challenge Task 2.
CoRR, 2024

Probing the Information Encoded in Neural-based Acoustic Models of Automatic Speech Recognition Systems.
CoRR, 2024

How Important Is Tokenization in French Medical Masked Language Models?
CoRR, 2024

DrBenchmark: A Large Language Understanding Evaluation Benchmark for French Biomedical Domain.
CoRR, 2024

BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains.
CoRR, 2024

2023
SynVox2: Towards a privacy-friendly VoxCeleb2 dataset.
CoRR, 2023

LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech.
CoRR, 2023

A Zero-shot and Few-shot Study of Instruction-Finetuned Large Language Models Applied to Clinical and Biomedical Tasks.
CoRR, 2023

HATS: An Open Data Set Integrating Human Perception Applied to the Evaluation of Automatic Speech Recognition Metrics.
Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

HATS : Un jeu de données intégrant la perception humaine appliquée à l'évaluation des métriques de transcription de la parole.
Proceedings of the Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles, TALN 2023, 2023

MORFITT : Un corpus multi-labels d'articles scientifiques français dans le domaine biomédical.
Proceedings of the Actes de CORIA-TALN 2023. Actes de l'atelier "Analyse et Recherche de Textes Scientifiques", 2023

DrBERT: Un modèle robuste pré-entraîné en français pour les domaines biomédical et clinique.
Proceedings of the Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles, TALN 2023, 2023

Jeffreys Divergence-Based Regularization of Neural Network Output Distribution Applied to Speaker Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

DrBERT: A Robust Pre-trained Model in French for Biomedical and Clinical domains.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
I4U System Description for NIST SRE'20 CTS Challenge.
CoRR, 2022

Mesures linguistiques automatiques pour l'évaluation des systèmes de Reconnaissance Automatique de la Parole (Automated linguistic measures for automatic speech recognition systems' evaluation).
Proceedings of the Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2022

On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding.
Proceedings of the IEEE Spoken Language Technology Workshop, 2022

Far-Field Speaker Recognition Benchmark Derived From The DiPCo Corpus.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Speech Resources in the Tamasheq Language.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Qualitative Evaluation of Language Model Rescoring in Automatic Speech Recognition.
Proceedings of the Interspeech 2022, 2022

Reliability criterion based on learning-phase entropy for speaker recognition with neural network.
Proceedings of the Interspeech 2022, 2022

FrenchMedMCQA: A French Multiple-Choice Question Answering Dataset for Medical domain.
Proceedings of the 13th International Workshop on Health Text Mining and Information Analysis, 2022

2021
Influence of Speaker Pre-training on Character Voice Representation.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Language Adaptation for Speaker Recognition Systems Using Contrastive Learning.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Study On the Temporal Pooling Used In Deep Neural Networks For Speaker Verification.
Proceedings of the 29th European Signal Processing Conference, 2021

Studying Squeeze-and-Excitation Used in CNN for Speaker Verification.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021

2020
Adaptation Strategy and Clustering from Scratch for New Domains of Speaker Recognition.
Proceedings of the Odyssey 2020: The Speaker and Language Recognition Workshop, 2020

Review of different robust x-vector extractors for speaker verification.
Proceedings of the 28th European Signal Processing Conference, 2020

2019
ON-TRAC Consortium End-to-End Speech Translation Systems for the IWSLT 2019 Shared Task.
CoRR, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences.
CoRR, 2019


On Robustness of Unsupervised Domain Adaptation for Speaker Recognition.
Proceedings of the Interspeech 2019, 2019

2018
LIA@CLEF 2018: Mining Events Opinion Argumentation from Raw Unlabeled Twitter Data using Convolutional Neural Network.
Proceedings of the Working Notes of CLEF 2018, 2018

2017
LIA at SemEval-2017 Task 4: An Ensemble of Neural Networks for Sentiment Classification.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017


Acoustic Pairing of Original and Dubbed Voices in the Context of Video Game Localization.
Proceedings of the Interspeech 2017, 2017

Duration Mismatch Compensation Using Four-Covariance Model and Deep Neural Network for Speaker Verification.
Proceedings of the Interspeech 2017, 2017

2016
Building a robust sentiment lexicon with (almost) no resource.
CoRR, 2016

LIA system description for NIST SRE 2016.
CoRR, 2016

Fusion d'espaces de représentations multimodaux pour la reconnaissance du rôle du locuteur dans des documents télévisuels (Multimodal embedding fusion for robust speaker role recognition in video broadcast ).
Proceedings of the Actes de la conférence conjointe JEP-TALN-RECITAL 2016. Volume 1 : JEP, 2016

SENSEI-LIF at SemEval-2016 Task 4: Polarity embedding fusion for robust sentiment analysis.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Investigation of speaker embeddings for cross-show speaker diarization.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

2015
Audio-Based Video Genre Identification.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

PERCOLATTE : A Multimodal Person Discovery System in TV Broadcast for the Medieval 2015 Evaluation Campaign.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

"speech is silver, but silence is golden": improving speech-to-speech translation performance by slashing users input.
Proceedings of the INTERSPEECH 2015, 2015

Speaker diarization through speaker embeddings.
Proceedings of the 23rd European Signal Processing Conference, 2015

Identification de personnes dans des flux multimédia.
Proceedings of the CORIA 2015 - Conférence en Recherche d'Infomations et Applications, 2015

Multimodal embedding fusion for robust speaker role recognition in video broadcast.
Proceedings of the 2015 IEEE Workshop on Automatic Speech Recognition and Understanding, 2015

2014
Joint decoding of complementary utterances.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Speaker adaptation of DNN-based ASR with i-vectors: does it actually adapt models to speakers?
Proceedings of the INTERSPEECH 2014, 2014


Reranked aligners for interactive transcript correction.
Proceedings of the IEEE International Conference on Acoustics, 2014

Scene understanding for identifying persons in TV shows: Beyond face authentication.
Proceedings of the 12th International Workshop on Content-Based Multimedia Indexing, 2014

2013
Searching segments of interest in single story web-videos.
Proceedings of the 14th International Workshop on Image Analysis for Multimedia Interactive Services, 2013

LIUM ASR System for ETAPE French Evaluation Campaign: Experiments on System Combination Using Open-Source Recognizers.
Proceedings of the Text, Speech, and Dialogue - 16th International Conference, 2013

An Investigation of Single-Pass ASR System Combination for Spoken Language Understanding.
Proceedings of the Statistical Language and Speech Processing, 2013

An open-source state-of-the-art toolbox for broadcast news diarization.
Proceedings of the INTERSPEECH 2013, 2013

Semi-Supervised and Unsupervised Data Extraction Targeting Speakers: From Speaker Roles to Fame?
Proceedings of the First Workshop on Speech, 2013

2012
Nouvelle approche pour le regroupement des locuteurs dans des émissions radiophoniques et télévisuelles (New approach for speaker clustering of broadcast news) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Segmentation et Regroupement en Locuteurs d'une collection de documents audio (Cross-show speaker diarization) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Avancées dans le domaine de la transcription automatique par décodage guidé (Improvements on driven decoding system combination) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

A global optimization framework for speaker diarization.
Proceedings of the Odyssey 2012: The Speaker and Language Recognition Workshop, 2012

I-vectors and ILP clustering adapted to cross-show speaker diarization.
Proceedings of the INTERSPEECH 2012, 2012

Subspace Gaussian Mixture Models Based on Noise Compensation for Speech Recognition.
Proceedings of the INTERSPEECH 2012, 2012

Low latency combination of parallelized single-pass LVCSR systems.
Proceedings of the INTERSPEECH 2012, 2012

2011
Structuration de contenus audio-visuel pour le résumé automatique. (Audio-visual content structuring for automatic summarization).
PhD thesis, 2011

Modeling nuisance variabilities with factor analysis for GMM-based audio pattern classification.
Comput. Speech Lang., 2011

Qui êtes-vous ? Catégoriser les questions pour déterminer le rôle des locuteurs dans des conversations orales (Who are you? Categorize questions to determine the role of speakers in oral conversations).
Proceedings of the Actes de la 18e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2011

Static and dynamic video summaries.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

LIA @ MediaEval 2011: Compact representation of heterogeneous descriptors for video genre classification.
Proceedings of the Working Notes Proceedings of the MediaEval 2011 Workshop, 2011

Speaker Role Recognition Using Question Detection and Characterization.
Proceedings of the INTERSPEECH 2011, 2011

Factor analysis based session variability compensation for Automatic Speech Recognition.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

Subspace Gaussian Mixture Models for vectorial HMM-states representation.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Query-Driven Strategy for On-the-Fly Term Spotting in Spontaneous Speech.
EURASIP J. Audio Speech Music. Process., 2010

Classification du genre vidéo reposant sur des transcriptions automatiques.
Proceedings of the Actes de la 17e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2010

A language-identification inspired method for spontaneous speech detection.
Proceedings of the INTERSPEECH 2010, 2010

On-the-fly video genre classification by combination of audio features.
Proceedings of the IEEE International Conference on Acoustics, 2010

Transcription-based video genre classification.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Factor analysis for audio-based video genre classification.
Proceedings of the INTERSPEECH 2009, 2009

Robust audio-based classification of video genre.
Proceedings of the INTERSPEECH 2009, 2009

2008
On-the-fly term spotting by phonetic filtering and request-driven decoding.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008


  Loading...