Cyril Grouin

Orcid: 0000-0001-5809-188X

According to our database, Cyril Grouin authored at least 86 papers between 2002 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
A Dataset for Pharmacovigilance in German, French, and Japanese: Annotating Adverse Drug Reactions across Languages.
CoRR, 2024

2023
La pré-annotation automatique de textes cliniques comme support au dialogue avec les experts du domaine lors de la mise au point d'un schéma d'annotation.
Proceedings of the Actes de CORIA-TALN 2023. Actes de l'atelier "Analyse et Recherche de Textes Scientifiques", 2023

Étude de méthodes d'augmentation de données pour la reconnaissance d'entités nommées en astrophysique.
Proceedings of the Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles, TALN 2023 - Volume 1 : travaux de recherche originaux, 2023

Le traitement automatique des langues face à l'évolution des usages de la langue. (Natural Language Processing Facing the Language Uses Evolution).
2023

2022
Impact du français inclusif sur les outils du TAL (Impact of French Inclusive Language on NLP Tools).
Proceedings of the Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2022

Etude des stéréotypes genrés dans le théâtre français du XVIe au XIXe siècle à travers des plongements lexicaux (Studying gender stereotypes in French theater from XVIth to XIXth century through the use of lexical embeddings).
Proceedings of the Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2022

Evaluating Tokenizers Impact on OOVs Representation with Transformers Models.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2021
Classification de cas cliniques et évaluation automatique de réponses d'étudiants : présentation de la campagne DEFT 2021 (Clinical cases classification and automatic evaluation of student answers : Presentation of the DEFT 2021 Challenge).
Proceedings of the Actes de la 28e Conférence sur le Traitement Automatique des Langues Naturelles. Atelier DÉfi Fouille de Textes, DEFT@TALN 2021, Lille, France, June 28, 2021

Differential Evaluation: a Qualitative Analysis of Natural Language Processing System Behavior Based Upon Data Resistance to Processing.
Proceedings of the 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021

Easy-to-use Combination of POS and BERT Model for Domain-Specific and Misspelled Terms.
Proceedings of the Fifth Workshop on Natural Language for Artificial Intelligence (NL4AI 2021) co-located with 20th International Conference of the Italian Association for Artificial Intelligence (AI*IA 2021), 2021

2020
Présentation de la campagne d'évaluation DEFT 2020 : similarité textuelle en domaine ouvert et extraction d'information précise dans des cas cliniques (Presentation of the DEFT 2020 Challenge: open domain textual similarity and precise information extraction from clinical cases).
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Inference Annotation of a Chinese Corpus for Opinion Mining.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Experiments from LIMSI at the French Named Entity Recognition Coarse-grained Task.
Proceedings of the Working Notes of CLEF 2020, 2020

2019
Automatic classification of free-text medical causes from death certificates for reactive mortality surveillance in France.
Int. J. Medical Informatics, 2019

Recherche et extraction d'information dans des cas cliniques. Présentation de la campagne d'évaluation DEFT 2019 (Information Retrieval and Information Extraction from Clinical Cases).
Proceedings of the Actes de la Conférence sur le Traitement Automatique des Langues Naturelles (TALN) PFIA 2019. Défi Fouille de Textes (atelier TALN-RECITAL), 2019

Corpus annoté de cas cliniques en français (Annotated corpus with clinical cases in French).
Proceedings of the Actes de la Conférence sur le Traitement Automatique des Langues Naturelles (TALN) PFIA 2019. Volume I : Articles longs, 2019

Community Perspective on Replicability in Natural Language Processing.
Proceedings of the International Conference on Recent Advances in Natural Language Processing, 2019

Initial Experiments for Pharmacovigilance Analysis in Social Media Using Summaries of Product Characteristics.
Proceedings of the MEDINFO 2019: Health and Wellbeing e-Networks for All, 2019

A New Approach to Compare the Performance of Two Classification Methods of Causes of Death for Timely Surveillance in France.
Proceedings of the MEDINFO 2019: Health and Wellbeing e-Networks for All, 2019

Clinical Case Reports for NLP.
Proceedings of the 18th BioNLP Workshop and Shared Task, 2019

2018
A French clinical corpus with comprehensive semantic annotations: development of the Medical Entity and Relation LIMSI annOtated Text corpus (MERLOT).
Lang. Resour. Evaluation, 2018

DEFT2018 : recherche d'information et analyse de sentiments dans des tweets concernant les transports en Île de France (DEFT2018: Information Retrieval and Sentiment Analysis in Tweets about Public Transportation in Île de France Region).
Proceedings of the Actes de la Conférence CORIA-TALN-RJC - Volume 2, 2018

Simplification de schémas d'annotation : un aller sans retour ? (Annotation scheme simplification: a one way trip with no return?).
Proceedings of the Actes de la Conférence TALN. CORIA-TALN-RJC 2018 - Volume 1, 2018

Three Dimensions of Reproducibility in Natural Language Processing.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

2017
Traitement automatique de la langue biomédicale au LIMSI (Biomedical language processing at LIMSI).
Proceedings of the Actes des 24ème Conférence sur le Traitement Automatique des Langues Naturelles. Orléans, France, June 26-30, 2017 - Volume 3, 2017

Generating a Training Corpus for OCR Post-Correction Using Encoder-Decoder Model.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

CLEF eHealth 2017 Multilingual Information Extraction task Overview: ICD10 Coding of Death Certificates in English and French.
Proceedings of the Working Notes of CLEF 2017, 2017

Reproducibility in Biomedical Natural Language Processing.
Proceedings of the AMIA 2017, 2017

2016
Une catégorisation de fins de lignes non-supervisée (End-of-line classification with no supervision).
Proceedings of the Actes de la conférence conjointe JEP-TALN-RECITAL 2016. Volume 2 : TALN (Posters), 2016

LIMSI at SemEval-2016 Task 12: machine-learning and temporal information to identify clinical events and time expressions.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Identification of Drug-Related Medical Conditions in Social Media.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Controlled Propagation of Concept Annotations in Textual Corpora.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Text Segmentation of Digitized Clinical Texts.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Supervised classification of end-of-lines in clinical text with no manual annotation.
Proceedings of the Fifth Workshop on Building and Evaluating Resources for Biomedical Text Mining, 2016

A Dataset for ICD-10 Coding of Death Certificates: Creation and Usage.
Proceedings of the Fifth Workshop on Building and Evaluating Resources for Biomedical Text Mining, 2016

Detection of Text Reuse in French Medical Corpora.
Proceedings of the Fifth Workshop on Building and Evaluating Resources for Biomedical Text Mining, 2016

Clinical Information Extraction at the CLEF eHealth Evaluation lab 2016.
Proceedings of the Working Notes of CLEF 2016, 2016

Identification of Mentions and Relations between Bacteria and Biotope from PubMed Abstracts.
Proceedings of the 4th BioNLP Shared Task Workshop, BioNLP 2016, 2016

Replicability of Research in Biomedical Natural Language Processing: a pilot evaluation for a coding task.
Proceedings of the Seventh International Workshop on Health Text Mining and Information Analysis, 2016

Low-resource OCR error detection and correction in French Clinical Texts.
Proceedings of the Seventh International Workshop on Health Text Mining and Information Analysis, 2016

2015
The contribution of co-reference resolution to supervised relation detection between bacteria and biotopes entities.
BMC Bioinform., December, 2015

Combining glass box and black box evaluations in the identification of heart disease risk factors and their temporal relations from clinical records.
J. Biomed. Informatics, 2015

Médicaments qui soignent, médicaments qui rendent malades : étude des relations causales pour identifier les effets secondaires.
Proceedings of the Actes de la 22e conférence sur le Traitement Automatique des Langues Naturelles. Articles courts, 2015

Étude des verbes introducteurs de noms de médicaments dans les forums de santé.
Proceedings of the Actes de la 22e conférence sur le Traitement Automatique des Langues Naturelles. Articles courts, 2015

Identification de facteurs de risque pour des patients diabétiques à partir de comptes-rendus cliniques par des approches hybrides.
Proceedings of the Actes de la 22e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2015

CLEF eHealth Evaluation Lab 2015 Task 1b: Clinical Named Entity Recognition.
Proceedings of the Working Notes of CLEF 2015, 2015

Overview of the CLEF eHealth Evaluation Lab 2015.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2015

Is it possible to recover personal health information from an automatically de-identified corpus of French EHRs?
Proceedings of the Sixth International Workshop on Health Text Mining and Information Analysis, 2015

2014
De-identification of clinical notes in French: towards a protocol for reference corpus development.
J. Biomed. Informatics, 2014

Automatic Analysis of Scientific and Literary Texts. Presentation and Results of the DEFT2014 Text Mining Challenge (Analyse automatique de textes littéraires et scientifiques : présentation et résultats du défi fouille de texte DEFT2014) [in French].
Proceedings of the TALN-RECITAL 2014 Workshop: DÉfi Fouille de Textes (Text Mining Challenge), 2014

Human annotation of ASR error regions: Is "gravity" a sharable concept for human annotators?
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Biomedical entity extraction using machine-learning based approaches.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Morpho-Syntactic Study of Errors from Speech Recognition System.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Annotation of specialized corpora using a comprehensive entity and relation scheme.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Use of unsupervised word classes for entity recognition: Application to the detection of disorders in clinical reports.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Disease and Disorder Template Filling using Rule-based and Statistical Approaches.
Proceedings of the Working Notes for CLEF 2014 Conference, 2014

How to de-identify a large clinical corpus in 10 days.
Proceedings of the AMIA 2014, 2014

Automatic Content Extraction for Designing a French Clinical Corpus.
Proceedings of the AMIA 2014, 2014

Optimizing annotation efforts to build reliable annotated corpora for training statistical models.
Proceedings of the 8th Linguistic Annotation Workshop, 2014

2013
Anonymisation de documents cliniques : performances et limites des méthodes symboliques et par apprentissage statistique. (Clinical Records De-Identification: Performances and Limits of Rule-based and Machine-Learning based Approaches).
PhD thesis, 2013

Eventual situations for timeline extraction from clinical reports.
J. Am. Medical Informatics Assoc., 2013

Studying frequency-based approaches to process lexical simplification (Approches à base de fréquences pour la simplification lexicale) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2013

Automatic De-Identification of French Clinical Records: Comparison of Rule-Based and Machine-Learning Approaches.
Proceedings of the MEDINFO 2013, 2013

Building A Contrasting Taxa Extractor for Relation Identification from Assertions: BIOlogical Taxonomy & Ontology Phrase Extraction System.
Proceedings of the BioNLP Shared Task 2013 Workshop, Sofia, 2013

Automatic Named Entity Pre-annotation for Out-of-domain Human Annotation.
Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse, 2013

2012
Indexation libre et contrôlée d'articles scientifiques. Présentation et résultats du défi fouille de textes DEFT2012 (Controlled and free indexing of scientific papers. Presentation and results of the DEFT2012 text-mining challenge) [in French].
Proceedings of the JEP-TALN-RECITAL 2012, 2012

ANNLOR: A Naïve Notation-system for Lexical Outputs Ranking.
Proceedings of the 6th International Workshop on Semantic Evaluation, 2012

Extended Named Entities Annotation on OCRed Documents: From Corpus Constitution to Evaluation Campaign.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Detecting negation of medical problems in French clinical notes.
Proceedings of the ACM International Health Informatics Symposium, 2012

Manual Corpus Annotation: Giving Meaning to the Evaluation Metrics.
Proceedings of the COLING 2012, 2012

Structured Named Entities in two distinct press corpora: Contemporary Broadcast News and Old Newspapers.
Proceedings of the Sixth Linguistic Annotation Workshop, 2012

2011
Une approche à plusieurs étapes pour anonymiser des documents médicaux.
Rev. d'Intelligence Artif., 2011

Hybrid methods for improving information access in clinical documents: concept, assertion, and relation identification.
J. Am. Medical Informatics Assoc., 2011

Extraction d'informations médicales au LIMSI (Medical information extraction at LIMSI).
Proceedings of the Actes de la 18e conférence sur le Traitement Automatique des Langues Naturelles. Démonstrations, 2011

Accès au contenu sémantique en langue de spécialité : extraction des prescriptions et concepts médicaux (Accessing the semantic content in a specialized language: extracting prescriptions and medical concepts).
Proceedings of the Actes de la 18e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2011

Structured and Extended Named Entity Evaluation in Automatic Speech Transcriptions.
Proceedings of the Fifth International Joint Conference on Natural Language Processing, 2011

Handling Outlandish Occurrences: Using Rules and Lexicons for Correcting NLP Articles.
Proceedings of the ENLG 2011, 2011

Proposal for an Extension of Traditional Named Entities: From Guidelines to Evaluation, an Overview.
Proceedings of the Fifth Linguistic Annotation Workshop, 2011

2010
Extracting medical information from narrative patient records: the case of medication-related information.
J. Am. Medical Informatics Assoc., 2010

Extracting Medication Information from French Clinical Texts.
Proceedings of the MEDINFO 2010, 2010

A Corpus for Studying Full Answer Justification.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

2009
DEFT'07 : une campagne d'évaluation en fouille d'opinion.
Proceedings of the Fouille de Données d'Opinions, 2009

Testing Tactics to Localize De-Identification.
Proceedings of the Medical Informatics in a United and Healthy Europe - Proceedings of MIE 2009, The XXIInd International Congress of the European Federation for Medical Informatics, Sarajevo, Bosnia and Herzegovina, August 30, 2009

2008
Certification and Cleaning up of a Text Corpus: Towards an Evaluation of the "Grammatical" Quality of a Corpus.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

2002
Recycling an Information Extraction System to Automatically Produce Semantic Annotations for the Web.
Proceedings of the ECAI 2002 Workshop on Semantic Authoring, Annotation & Knowledge Markup, 2002

