Dirk Hovy

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Divine LLaMAs: Bias, Stereotypes, Stigmatization, and Emotion Representation of Religion in Large Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Explaining Speech Classification Models via Word-Level Audio Segments and Paralinguistic Features.

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

DADIT: A Dataset for Demographic Classification of Italian Twitter Users and a Comparison of Prediction Methods.

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Impoverished Language Technology: The Lack of (Social) Class in NLP.

Zeerak Talat

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Emotion Analysis in NLP: Trends, Gaps and Roadmap for Future Directions.

Alba Curry

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts.

Donya Rooein

Paul Röttger

Anastassia Shaitarova

Proceedings of the 19th Workshop on Innovative Use of NLP for Building Educational Applications, 2024

Narratives at Conflict: Computational Analysis of News Framing in Multilingual Disinformation Campaigns.

Antonina Sinelnik

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), 2024

Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Compromesso! Italian Many-Shot Jailbreaks undermine the safety of Large Language Models.

Fabio Pernisi

Paul Röttger

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 4: Student Research Workshop), 2024

Classist Tools: Social Class Correlates with Performance in NLP.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Angry Men, Sad Women: Large Language Models Reflect Gendered Stereotypes in Emotion Attribution.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

"My Answer is C": First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Viewpoint: Artificial Intelligence Accidents Waiting to Happen?

J. Artif. Intell. Res., 2023

Know Your Audience: Do LLMs Adapt to Different Age and Education Levels?

Donya Rooein

CoRR, 2023

How to Use Large Language Models for Text Coding: The Case of Fatherhood Roles in Public Policy Documents.

CoRR, 2023

Leveraging Label Variation in Large Language Models for Zero-Shot Text Classification.

CoRR, 2023

The Ecological Fallacy in Annotation: Modelling Human Label Variation goes beyond Sociodemographics.

CoRR, 2023

Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP.

CoRR, 2023

Leveraging Social Interactions to Detect Misinformation on Social Media.

CoRR, 2023

Consistency is Key: Disentangling Label Variation in Natural Language Processing with Intra-Annotator Agreement.

Gavin Abercrombie

Verena Rieser

CoRR, 2023

Beyond Digital "Echo Chambers": The Role of Viewpoint Diversity in Political Discussion.

Patrícia G. C. Rossini

Rebekah Tromble

Nava Tintarev

Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

MilaNLP at SemEval-2023 Task 10: Ensembling Domain-Adapted and Regularized Pretrained Language Models for Robust Sexism Detection.

Proceedings of the 17th International Workshop on Semantic Evaluation, 2023

Top-Down Influence? Predicting CEO Personality and Risk Impact from Speech Transcripts.

Kilian Theil

Heiner Stuckenschmidt

Proceedings of the Seventeenth International AAAI Conference on Web and Social Media, 2023

Can Demographic Factors Improve Text Classification? Revisiting Demographic Adaptation in the Age of Transformers.

Chia-Chien Hung

Anne Lauscher

Simone Paolo Ponzetto

Goran Glavaš

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

The Ecological Fallacy in Annotation: Modeling Human Label Variation goes beyond Sociodemographics.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

The State of Profanity Obfuscation in Natural Language Processing Scientific Publications.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

What about "em"? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Gender and Age Bias in Commercial Machine Translation.

Proceedings of the Towards Responsible Machine Translation, 2023

2022

ProSiT! Latent Variable Discovery with PROgressive SImilarity Thresholds.

CoRR, 2022

The State of Profanity Obfuscation in Natural Language Processing.

CoRR, 2022

Is It Worth the (Environmental) Cost? Limited Evidence for the Benefits of Diachronic Continuous Training.

CoRR, 2022

On the Limitations of Sociodemographic Adaptation with Transformers.

Chia-Chien Hung

Anne Lauscher

Simone Paolo Ponzetto

Goran Glavaš

CoRR, 2022

XLM-EMO: Multilingual Emotion Prediction in Social Media Text.

Proceedings of the 12th Workshop on Computational Approaches to Subjectivity, 2022

Guiding the Release of Safer E2E Conversational AI through Value Sensitive Design.

Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2022

Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks.

Paul Röttger

Bertie Vidgen

Janet B. Pierrehumbert

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Measuring Harmful Sentence Completion in Language Models for LGBTQIA+ Individuals.

Proceedings of the Second Workshop on Language Technology for Equality, 2022

Data-Efficient Strategies for Expanding Hate Speech Detection into Under-Resourced Languages.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

SocioProbe: What, When, and Where Language Models Learn about Sociodemographics.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Bridging Fairness and Environmental Sustainability in Natural Language Processing.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

"It's Not Just Hate": A Multi-Dimensional Perspective on Detecting Harmful Speech Online.

Stefanie Anja Hills

Patrícia G. C. Rossini

Rebekah Tromble

Nava Tintarev

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Twitter-Demographer: A Flow-based Tool to Enrich Twitter Data.

Vincenzo Cutrona

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Welcome to the Modern World of Pronouns: Identity-Inclusive Natural Language Processing beyond Gender.

Anne Lauscher

Archie Crowley

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Hard and Soft Evaluation of NLP models with BOOtSTrap SAmpling - BooStSa.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

SafetyKit: First Aid for Measuring Safety in Open-domain Conversational Systems.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021

Universal Joy A Data Set and Results for Classifying Emotions Across Languages.

Dataset, March, 2021

Five sources of bias in natural language processing.

Shrimai Prabhumoye

Lang. Linguistics Compass, 2021

Learning from Disagreement: A Survey.

J. Artif. Intell. Res., 2021

Language Invariant Properties in Natural Language Processing.

CoRR, 2021

Anticipating Safety Issues in E2E Conversational AI: Framework and Tooling.

CoRR, 2021

Universal Joy A Data Set and Results for Classifying Emotions Across Languages.

Proceedings of the Eleventh Workshop on Computational Approaches to Subjectivity, 2021

MilaNLP @ WASSA: Does BERT Feel Sad When You Cry?

Proceedings of the Eleventh Workshop on Computational Approaches to Subjectivity, 2021

FEEL-IT: Emotion and Sentiment Classification for the Italian Language.

Proceedings of the Eleventh Workshop on Computational Approaches to Subjectivity, 2021

HONEST: Measuring Hurtful Sentence Completion in Language Models.

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

The Importance of Modeling Social Factors of Language: Theory and Practice.

Diyi Yang

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Beyond Black & White: Leveraging Annotator Disagreement via Soft-Label Multi-Task Learning.

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

BERTective: Language Models and Contextual Information for Deception Detection.

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Cross-lingual Contextualized Topic Models with Zero-shot Learning.

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

"We will Reduce Taxes" - Identifying Election Pledges with Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Pre-training is a Hot Topic: Contextualized Document Embeddings Improve Topic Coherence.

Silvia Terragni

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

On the Gap between Adoption and Understanding in NLP.

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

What the [MASK]? Making Sense of Language-Specific BERT Models.

CoRR, 2020

A Report on the VarDial Evaluation Campaign 2020.

Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects, 2020

A Case for Soft Loss Functions.

Proceedings of the Eighth AAAI Conference on Human Computation and Crowdsourcing, 2020

Helpful or Hierarchical? Predicting the Communicative Strategies of Chat Participants, and their Impact on Success.

Fernando Vega-Redondo

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Predictive Biases in Natural Language Processing Models: A Conceptual Framework and Overview.

Deven Shah

H. Andrew Schwartz

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

"You Sound Just Like Your Father" Commercial Machine Translation Systems Include Stylistic Biases.

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Integrating Ethics into the NLP Curriculum.

Emily M. Bender

Alexandra Schofield

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts, 2020

2019

Recognizing and Reducing Bias in NLP Applications.

Proceedings of the Sixth Italian Conference on Computational Linguistics, 2019

Hey Siri. Ok Google. Alexa: A topic modeling of user reviews for smart speakers.

Hanh Nguyen

Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

Identifying Linguistic Areas for Geolocation.

Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

Dense Node Representation for Geolocation.

Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

Geolocation with Attention-Based Multitask Learning Models.

Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

2018

Comparing Bayesian Models of Annotation.

Trans. Assoc. Comput. Linguistics, 2018

Predicting News Headline Popularity with Syntactic and Semantic Knowledge Using Multi-Task Learning.

Sotiris Lamprinidis

Daniel Hardt

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Capturing Regional Variation with Distributed Place Representations and Geographic Retrofitting.

Christoph Purschke

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Improving Author Attribute Prediction by Retrofitting Linguistic Representations with Homophily.

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

The Social and the Neural Network: How to Make Natural Language Processing about People again.

Proceedings of the Second Workshop on Computational Modeling of People's Opinions, 2018

2017

Multi-Task Learning for Mental Health using Social Media Text.

Adrian Benton

Margaret Mitchell

CoRR, 2017

End-to-End Information Extraction without Token-Level Supervision.

Proceedings of the Workshop on Speech-Centric Natural Language Processing, 2017

Multitask Learning for Mental Health Conditions with Limited Social Media Data.

Adrian Benton

Margaret Mitchell

Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Huntsville, hospitals, and hockey teams: Names can reveal your location.

Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017

2016

SemEval-2016 Task 10: Detecting Minimal Semantic Units and their Meanings (DiMSUM).

Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter.

Zeerak Waseem

Proceedings of the Student Research Workshop, 2016

Learning a POS tagger for AAVE-like language.

Anna Jørgensen

Proceedings of the NAACL HLT 2016, 2016

Exploring Language Variation Across Europe - A Web-based Tool for Computational Sociolinguistics.

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

The Social Impact of Natural Language Processing.

Shannon L. Spruit

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

The Enemy in Your Own Camp: How Well Can We Detect Statistically-Generated Fake Reviews - An Adversarial Study.

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Putting Sarcasm Detection into Context: The Effects of Class Imbalance and Manual Labelling on Supervised Machine Classification of Twitter Conversations.

Gavin Abercrombie

Proceedings of the ACL 2016 Student Research Workshop, Berlin, Germany, August 7-12, 2016

2015

User Review Sites as a Resource for Large-Scale Sociolinguistic Studies.

Proceedings of the 24th International Conference on World Wide Web, 2015

Personality Traits on Twitter - or - How to Get 1,500 Personality Tests in a Week.

Proceedings of the 6th Workshop on Computational Approaches to Subjectivity, 2015

Mining for unambiguous instances to adapt part-of-speech taggers to new domains.

Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

The Rating Game: Sentiment Rating Reproducibility from Text.

Lasse Borgholt

Peter Simonsen

Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Cross-lingual syntactic variation over age and gender.

Proceedings of the 19th Conference on Computational Natural Language Learning, 2015

Challenges of studying and processing dialects in social media.

Anna Jørgensen

Proceedings of the Workshop on Noisy User-generated Text, 2015

Tagging Performance Correlates with Author Age.

Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Demographic Factors Improve Classification Performance.

If all you have is a bit of the Bible: Learning POS taggers for truly low-resource languages.

Željko Agić

2014

Robust Cross-Domain Sentiment Analysis for Low-Resource Languages.

Jakob Elming

Proceedings of the 5th Workshop on Computational Approaches to Subjectivity, 2014

More or less supervised supersense tagging of Twitter.

Proceedings of the Third Joint Conference on Lexical and Computational Semantics, 2014

Copenhagen-Malmö: Tree Approximations of Semantic Parsing Problems.

Sigrid Klerke

Proceedings of the 8th International Workshop on Semantic Evaluation, 2014

Augmenting English Adjective Senses with Supersenses.

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

When POS data sets don't add up: Combatting sample bias.

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Crowdsourcing and annotating NER for Twitter #drift.

Hege Fromreide

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Learning part-of-speech taggers with inter-annotator agreement loss.

Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

What's in a p-value in NLP?

Proceedings of the Eighteenth Conference on Computational Natural Language Learning, 2014

Selection Bias, Label Bias, and Bias in Ground Truth.

Proceedings of the COLING 2014, 2014

Adapting taggers to Twitter with not-so-distant supervision.

Proceedings of the COLING 2014, 2014

Linguistically debatable or just plain wrong?

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Experiments with crowdsourced re-annotation of a POS tagging data set.

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

How Well can We Learn Interpretable Entity Types from Text?

Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013

Solving electrical networks to incorporate supervision in random walks.

Mrinmaya Sachan

Proceedings of the 22nd International World Wide Web Conference, 2013

Learning Whom to Trust with MACE.

Taylor Berg-Kirkpatrick

Ashish Vaswani

Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Analysis and modeling of "focus" in context.

Gopala Krishna Anumanchipalli

Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

A Walk-Based Semantically Enriched Tree Kernel Over Distributed Word Representations.

Shashank Srivastava

Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

2012

When Did that Happen? - Linking Events and Relations to Timestamps.

Alfio Massimiliano Gliozzo

James Fan

Siddharth Patwardhan

Christopher A. Welty

Proceedings of the EACL 2012, 2012

2011

Unsupervised Discovery of Domain-Specific Knowledge from Text.

Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Models and Training for Unsupervised Preposition Sense Disambiguation.

Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

2010

What's in a Preposition? Dimensions of Sense Disambiguation for an Interesting Word Class.

Stephen Tratz

Proceedings of the COLING 2010, 2010

2009

Disambiguation of Preposition Sense Using Linguistically Motivated Features.

Stephen Tratz