Karën Fort

Orcid: 0000-0002-0723-8850

Affiliations:
  • Université de Lorraine, LORIA, Nancy, France
  • Sorbonne Université, Paris, France


According to our database1, Karën Fort authored at least 65 papers between 2007 and 2023.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
From the ground up: developing a practical ethical methodology for integrating AI into industry.
AI Soc., April, 2023

Les textes cliniques français générés sont-ils dangereusement similaires à leur source ? Analyse par plongements de phrases.
Proceedings of the Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles, TALN 2023 - Volume 2 : travaux de recherche originaux, 2023

Des ressources lexicales du français et de leur utilisation en TAL : étude des actes de TALN.
Proceedings of the Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles, TALN 2023 - Volume 2 : travaux de recherche originaux, 2023

Can Synthetic Text Help Clinical Named Entity Recognition? A Study of Electronic Health Records in French.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

The Elephant in the Room: Analyzing the Presence of Big Tech in Natural Language Processing Research.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
French CrowS-Pairs: Extension à une langue autre que l'anglais d'un corpus de mesure des biais sociétaux dans les modèles de langue masqués (French CrowS-Pairs : Extending a challenge dataset for measuring social bias in masked language models to a language other than English).
Proceedings of the Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2022

FENEC : un corpus équilibré pour l'évaluation des entités nommées en français (FENEC : a balanced sample corpus for French named entity recognition ).
Proceedings of the Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2022

CLISTER : Un corpus pour la similarité sémantique textuelle dans des cas cliniques en français (CLISTER : A Corpus for Semantic Textual Similarity in French Clinical Narratives).
Proceedings of the Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2022

Langues par défaut? Analyse contrastive et diachronique des langues non citées dans les articles de TALN et d'ACL (Contrastive and diachronic study of unmentioned (by default ?) languages in TALN and ACL We study the application of the #BenderRule in natural language processing articles, taking into account a contrastive and a diachronic dimensions, by examining the proceedings of two NLP conferences, TALN and ACL, over time).
Proceedings of the Actes de la 29e Conférence sur le Traitement Automatique des Langues Naturelles. Volume 1 : conférence principale, 2022

Ethical Internal Logistics 4.0: Observations and Suggestions from a Working Internal Logistics Case.
Proceedings of the Service Oriented, Holonic and Multi-Agent Manufacturing Systems for Industry of the Future, 2022

CLISTER : A Corpus for Semantic Textual Similarity in French Clinical Narratives.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Do we Name the Languages we Study? The #BenderRule in LREC and ACL articles.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Quantification Annotation in ISO 24617-12, Second Draft.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

French CrowS-Pairs: Extending a challenge dataset for measuring social bias in masked language models to a language other than English.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Myriadisation et éthique pour le traitement automatique des langues. (Crowdsourcing and ethics for Natural Language Processing).
, 2022

2021
Investigating Dominant Word Order on Universal Dependencies with Graph Rewriting.
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), 2021

Reviewing Natural Language Processing Research.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Tutorial Abstracts, 2021

2020
Répliquer et étendre pour l'alsacien "Étiquetage en parties du discours de langues peu dotées par spécialisation des plongements lexicaux" (Replicating and extending for Alsatian : "POS tagging for low-resource languages by adapting word embeddings").
Proceedings of the Actes de la 6e conférence conjointe Journées d'Études sur la Parole (JEP, 2020

Text Corpora and the Challenge of Newly Written Languages.
Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

Creating Expert Knowledge by Relying on Language Learners: a Generic Approach for Mass-Producing Language Resources by Combining Implicit Crowdsourcing and Language Learning.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Rigor Mortis: Annotating MWEs with a Gamified Platform.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Reviewing Natural Language Processing Research.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2020

2019
Unsupervised Data Augmentation for Less-Resourced Languages with no Standardized Spelling.
Proceedings of the International Conference on Recent Advances in Natural Language Processing, 2019

Community Perspective on Replicability in Natural Language Processing.
Proceedings of the International Conference on Recent Advances in Natural Language Processing, 2019

2018
Toward a Lightweight Solution for Less-resourced Languages: Creating a POS Tagger for Alsatian Using Voluntary Crowdsourcing.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

"Fingers in the Nose": Evaluating Speakers' Identification of Multi-Word Expressions Using a Slightly Gamified Crowdsourcing Platform.
Proceedings of the Joint Workshop on Linguistic Annotation, 2018

2017
Vers une solution légère de production de données pour le TAL : création d'un tagger de l'alsacien par crowdsourcing bénévole (Toward a lightweight solution to the language resources bottleneck issue: creating a POS tagger for Alsatian using voluntary crowdsourcing).
Proceedings of the Actes des 24ème Conférence sur le Traitement Automatique des Langues Naturelles, 2017

Vers l'annotation par le jeu de corpus (plus) complexes : le cas de la langue de spécialité (Towards (more) complex corpora annotation using a game with a purpose : the case of scientific language).
Proceedings of the Actes des 24ème Conférence sur le Traitement Automatique des Langues Naturelles. Orléans, France, June 26-30, 2017, Volume 2, 2017

2016
Introduction.
Trait. Autom. des Langues, 2016

Crowdsourcing and curation: perspectives from biology and natural language processing.
Database J. Biol. Databases Curation, 2016

Yes, We Care! Results of the Ethics and Natural Language Processing Surveys.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Crowdsourcing Complex Language Resources: Playing to Annotate Dependency Syntax.
Proceedings of the COLING 2016, 2016

2014
Analyse lexicale outillée de la parole transcrite de patients schizophrènes.
Trait. Autom. des Langues, 2014

Annotation scheme for deep dependency syntax of French (Un schéma d'annotation en dépendances syntaxiques profondes pour le français) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2014

ZOMBILINGO: eating heads to perform dependency syntax annotation (ZOMBILINGO : manger des têtes pour annoter en syntaxe de dépendances) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2014

Quantitative study of disfluencies in schizophrenics' speech: Automatize to limit biases (Étude quantitative des disfluences dans le discours de schizophrènes : automatiser pour limiter les biais) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2014

Propa-L: a semantic filtering service from a lexical network created using Games With A Purpose.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Mapping the Lexique des Verbes du Français (Lexicon of French Verbs) to a NLP lexicon using examples.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Evaluating corpora documentation with regards to the Ethics and Big Data Charter.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Deep Syntax Annotation of the Sequoia French Treebank.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Creating <i>Zombilingo</i>, a game with a purpose for dependency syntax annotation.
Proceedings of the First International Workshop on Gamification for Information Retrieval, 2014

2013
Using Games to Create Language Resources: Successes and Limitations of the Approach.
Proceedings of the People's Web Meets NLP, Collaboratively Constructed Language Resources, 2013

Formalizing an annotation guide : some experiments towards assisted agile annotation (Expériences de formalisation d'un guide d'annotation : vers l'annotation agile assistée) [in French].
Proceedings of the Traitement Automatique des Langues Naturelles, 2013

2012
Les ressources annotées, un enjeu pour l'analyse de contenu : vers une méthodologie de l'annotation manuelle de corpus. (Annotated resources, a key issue in content analysis : towards a methodology for manual corpus annotation).
PhD thesis, 2012

Annotation manuelle de matchs de foot : Oh la la la ! l'accord inter-annotateurs ! et c'est le but ! (Manual Annotation of Football Matches : Inter-annotator Agreement ! Gooooal !) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

TCOF-POS : un corpus libre de français parlé annoté en morphosyntaxe (TCOF-POS : A Freely Available POS-Tagged Corpus of Spoken French) [in French].
Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, 2012

Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Annotating Football Matches: Influence of the Source Medium on Manual Annotation.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Towards a (Better) Definition of the Description of Annotated MIR Corpora.
Proceedings of the 13th International Society for Music Information Retrieval Conference, 2012

Manual Corpus Annotation: Giving Meaning to the Evaluation Metrics.
Proceedings of the COLING 2012, 2012

Modeling the Complexity of Manual Annotation Tasks: a Grid of Analysis.
Proceedings of the COLING 2012, 2012

Structured Named Entities in two distinct press corpora: Contemporary Broadcast News and Old Newspapers.
Proceedings of the Sixth Linguistic Annotation Workshop, 2012

2011
Amazon Mechanical Turk: Gold Mine or Coal Mine?
Comput. Linguistics, 2011

Un turc mécanique pour les ressources linguistiques : critique de la myriadisation du travail parcellisé (Mechanical Turk for linguistic resources: review of the crowdsourcing of parceled work).
Proceedings of the Actes de la 18e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2011

Crowdsourcing for Language Resource Development: Criticisms About Amazon Mechanical Turk Overpowering Use.
Proceedings of the Human Language Technology Challenges for Computer Science and Linguistics, 2011

BioNLP Shared Task 2011 - Bacteria Gene Interactions and Renaming.
Proceedings of BioNLP Shared Task 2011 Workshop, Portland, Oregon, USA, June 24, 2011, 2011

Proposal for an Extension of Traditional Named Entities: From Guidelines to Evaluation, an Overview.
Proceedings of the Fifth Linguistic Annotation Workshop, 2011

2010
Évaluer des annotations manuelles dispersées : les coefficients sont-ils suffisants pour estimer l'accord inter-annotateurs ?
Proceedings of the Actes de la 17e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2010

FastKwic, an "Intelligent" Concordancer Using FASTR.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Influence of Pre-Annotation on POS-Tagged Corpus Development.
Proceedings of the Fourth Linguistic Annotation Workshop, 2010

2009
Vers une méthodologie d'annotation des entités nommées en corpus ?
Proceedings of the Actes de la 16ème conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2009

Towards a Methodology for Named Entities Annotation.
Proceedings of the Third Linguistic Annotation Workshop, 2009

2008
Sylva : plate-forme de validation multi-niveaux de lexiques.
Proceedings of the Actes de la 15ème conférence sur le Traitement Automatique des Langues Naturelles. Articles courts, 2008

A Toolchain for Grammarians.
Proceedings of the COLING 2008, 2008

2007
PrepLex : un lexique des prépositions du français pour l'analyse syntaxique.
Proceedings of the Actes de la 14ème conférence sur le Traitement Automatique des Langues Naturelles. Articles longs, 2007


  Loading...