Carolina Scarton

2025

It's All About In-Context Learning! Teaching Extremely Low-Resource Languages to LLMs.

Zhixue Zhao

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024

WPextract.

Dataset, August, 2024

WPextract.

Dataset, July, 2024

Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification.

Dataset, May, 2024

NILC-Metrix: assessing the complexity of written and spoken language in Brazilian Portuguese.

Sidney Evaldo Leal

Magali Sanches Duran

Nathan Siegle Hartmann

Lang. Resour. Evaluation, March, 2024

EUvsDisinfo: a Dataset for Multilingual Detection of Pro-Kremlin Disinformation in News Articles (Dataset).

Dataset, January, 2024

EUvsDisinfo: a Dataset for Multilingual Detection of Pro-Kremlin Disinformation in News Articles (Software).

Dataset, January, 2024

Accelerating discoveries in medicine using distributed vector representations of words.

Expert Syst. Appl., 2024

Investigating Idiomaticity in Word Representations.

CoRR, 2024

A Survey on Automatic Credibility Assessment of Textual Credibility Signals in the Era of Large Language Models.

Ivan Srba

Francisco Moreno García

Santiago Barrio Lottmann

CoRR, 2024

Word Boundary Information Isn't Useful for Encoder Language Models.

Dylan Phelps

CoRR, 2024

A Case Study on Contextual Machine Translation in a Professional Scenario of Subtitling.

Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 1), 2024

ExU: AI Models for Examining Multilingual Disinformation Narratives and Understanding their Spread.

Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 2), 2024

Multilinguality in the VIGILANT project.

Proceedings of the 25th Annual Conference of the European Association for Machine Translation (Volume 2), 2024

Reference-less Analysis of Context Specificity in Translation with Personalised Language Models.

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science.

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Can We Identify Stance without Target Arguments? A Study for Rumour Stance Classification.

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

EUvsDisinfo: A Dataset for Multilingual Detection of Pro-Kremlin Disinformation in News Articles.

Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

Overview of the BioLaySumm 2024 Shared Task on the Lay Summarization of Biomedical Research Articles.

Proceedings of the 23rd Workshop on Biomedical Natural Language Processing, 2024

A Lightweight Approach for User and Keyword Classification in Controversial Topics.

Ahmad Zareie

Proceedings of the Social Networks Analysis and Mining - 16th International Conference, 2024

ATLAS: Improving Lay Summarisation with Attribute-based Control.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2024

Enhancing Idiomatic Representation in Multiple Languages via an Adaptive Contrastive Triplet Loss.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

UTDRM: unsupervised method for training debunked-narrative retrieval models.

Iknoor Singh

EPJ Data Sci., December, 2023

Analysing state-backed propaganda websites: a new dataset and linguistic study (complete dataset).

Dataset, October, 2023

Analysing state-backed propaganda websites: a new dataset and linguistic study (software).

Dataset, October, 2023

Analysing state-backed propaganda websites: a new dataset and linguistic study (public dataset).

Dataset, October, 2023

Overview of the BioLaySumm 2023 Shared Task on Lay Summarization of Biomedical Research Articles.

CoRR, 2023

Detecting Misinformation with LLM-Predicted Credibility Signals and Weak Supervision.

CoRR, 2023

Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification.

CoRR, 2023

Finding Already Debunked Narratives via Multistage Retrieval: Enabling Cross-Lingual, Cross-Dataset and Zero-Shot Learning.

CoRR, 2023

A Large-Scale Comparative Study of Accurate COVID-19 Information versus Misinformation.

CoRR, 2023

Personalised Language Modelling of Screen Characters Using Rich Metadata Annotations.

CoRR, 2023

Evaluating the Role of Target Arguments in Rumour Stance Classification.

CoRR, 2023

Team SheffieldVeraAI at SemEval-2023 Task 3: Mono and multilingual approaches for news genre, topic and persuasion technique classification.

Ben Wu

CoRR, 2023

SheffieldVeraAI at SemEval-2023 Task 3: Mono and Multilingual Approaches for News Genre, Topic and Persuasion Technique Classification.

Ben Wu

Proceedings of the 17th International Workshop on Semantic Evaluation, 2023

Classifying COVID-19 Vaccine Narratives.

Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 2023

Noisy Self-Training with Data Augmentations for Offensive and Hate Speech Detection Tasks.

Diego F. Silva

Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 2023

Categorising Fine-to-Coarse Grained Misinformation: An Empirical Study of the COVID-19 Infodemic.

Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 2023

VaxxHesitancy: A Dataset for Studying Hesitancy towards COVID-19 Vaccination on Twitter.

Proceedings of the Seventeenth International AAAI Conference on Web and Social Media, 2023

Don't waste a single annotation: improving single-label classifiers through soft labels.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Analysing State-Backed Propaganda Websites: a New Dataset and Linguistic Study.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Enhancing Biomedical Lay Summarisation with External Knowledge Graphs.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Domain-Driven and Discourse-Guided Scientific Summarisation.

Proceedings of the Advances in Information Retrieval, 2023

BioLaySumm 2023 Shared Task: Lay Summarisation of Biomedical Research Articles.

Proceedings of the 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks, 2023

MTCue: Learning Zero-Shot Control of Extra-Textual Attributes by Leveraging Unstructured Context in Neural Machine Translation.

Sebastian T. Vincent

Robert Flynn

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

Comparative Analysis of Engagement, Themes, and Causality of Ukraine-Related Debunks and Disinformation.

Proceedings of the Social Informatics - 13th International Conference, 2022

GateNLP-UShef at SemEval-2022 Task 8: Entity-Enriched Siamese Transformer for Multilingual News Article Similarity.

Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding.

Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

Sample Efficient Approaches for Idiomaticity Detection.

Dylan Phelps

Xuan-Rui Fan

Proceedings of the 18th Workshop on Multiword Expressions, 2022

Controlling Formality in Low-Resource NMT with Domain Adaptation and Re-Ranking: SLT-CDT-UoS at IWSLT2022.

Sebastian T. Vincent

Loïc Barrault

Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Improving Tokenisation by Alternative Treatment of Spaces.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Making Science Simple: Corpora for the Lay Summarisation of Scientific Literature.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Controlling Extra-Textual Attributes about Dialogue Participants: A Case Study of English-to-Polish Neural Machine Translation.

Sebastian T. Vincent

Loïc Barrault

Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

2021

Special Issue on Disinformation, Hoaxes and Propaganda within Online Social Networks and Media.

Yelena Mejova

Marinella Petrocchi

Online Soc. Networks Media, 2021

The False COVID-19 Narratives That Keep Being Debunked: A Spatiotemporal Analysis.

Iknoor Singh

CoRR, 2021

Categorising Fine-to-Coarse Grained Misinformation: An Empirical Study of COVID-19 Infodemic.

CoRR, 2021

Multistage BiCross Encoder: Team GATE Entry for MLIA Multilingual Semantic Search Task 2.

Iknoor Singh

CoRR, 2021

The (Un)Suitability of Automatic Evaluation Metrics for Text Simplification.

Comput. Linguistics, 2021

Cross-lingual Rumour Stance Classification: a First Study with BERT and Machine Translation.

Proceedings of the 2021 Truth and Trust Online Conference (TTO 2021), 2021

AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Probing for idiomaticity in vector space models.

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Assessing the Representations of Idiomaticity in Vector Models with a Noun Compound Dataset Labeled at Type and Token Levels.

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

Horacio Saggion, Automatic Text Simplification. Synthesis lectures on human language technologies, April 2017.

Nat. Lang. Eng., 2020

Data-Driven Sentence Simplification: Survey and Benchmark.

Comput. Linguistics, 2020

Linguistic Analysis Model for Monitoring User Reaction on Satirical News for Brazilian Portuguese.

Proceedings of the Computational Processing of the Portuguese Language, 2020

Measuring the Impact of Readability Features in Fake News Detection.

Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Measuring What Counts: The Case of Rumour Stance Classification.

Diego F. Silva

Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis.

Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing, 2020

Deciding When, How and for Whom to Simplify.

Pranava Madhyastha

Proceedings of ECAI 2020 - 24th European Conference on Artificial Intelligence, Santiago de Compostela, Spain, August 29 - September 8, 2020

ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations.

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Estimating post-editing effort: a study on human judgements, task-based and reference-based metrics of MT quality.

Proceedings of the 16th International Conference on Spoken Language Translation, 2019

EASSE: Easier Automatic Sentence Simplification Evaluation.

Louis Martin

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Cross-Sentence Transformations in Text Simplification.

Proceedings of the 2019 Workshop on Widening NLP@ACL 2019, Florence, Italy, July 28, 2019

2018

Quality Estimation for Machine Translation

Gustavo Henrique Paetzold

Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers, ISBN: 978-3-031-02168-8, 2018

Sheffield Submissions for WMT18 Multimodal Translation Shared Task.

Chiraag Lala

Pranava Swaroop Madhyastha

Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Sheffield Submissions for the WMT18 Quality Estimation Shared Task.

Proceedings of the Third Conference on Machine Translation: Shared Task Papers, 2018

Exploring gap filling as a cheaper alternative to reading comprehension questionnaires when evaluating machine translation for gisting.

Proceedings of the Third Conference on Machine Translation: Research Papers, 2018

SimPA: A Sentence-Level Simplification Corpus for the Public Administration Domain.

Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Text Simplification from Professionally Produced Corpora.

Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Learning Simplifications for Specific Target Audiences.

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017

Bilexical Embeddings for Quality Estimation.

Frédéric Blain

Proceedings of the Second Conference on Machine Translation, 2017

MUSST: A Multilingual Syntactic Simplification Tool.

Alessio Palmero Aprosio

Sara Tonelli

Tamara Martín-Wanton

Proceedings of the IJCNLP 2017, Taipei, Taiwan, November 27, 2017

Learning How to Simplify From Explicit Labeling of Complex-Simplified Text Pairs.

Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Improving Evaluation of Document-level Machine Translation Quality Estimation.

Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

2016

Document-level machine translation quality estimation.

PhD thesis, 2016

Word embeddings and discourse information for Quality Estimation.

Proceedings of the First Conference on Machine Translation, 2016

Findings of the 2016 Conference on Machine Translation.

Proceedings of the First Conference on Machine Translation, 2016

SAARSHEFF at SemEval-2016 Task 1: Semantic Textual Similarity with Machine Translation Evaluation Metrics and (eXtreme) Boosted Tree Ensembles.

Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Evaluating Progression of Alzheimer's Disease by Regression and Classification Methods in a Narrative Language Test in Portuguese.

Andre Cunha

Proceedings of the Computational Processing of the Portuguese Language, 2016

A Reading Comprehension Corpus for Machine Translation Evaluation.

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Quality Estimation for Language Output Applications.

Proceedings of the COLING 2016, 2016

2015

USHEF and USAAR-USHEF participation in the WMT15 QE shared task.

Liling Tan

Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

Findings of the 2015 Workshop on Statistical Machine Translation.

Proceedings of the Tenth Workshop on Statistical Machine Translation, 2015

USAAR-SHEFFIELD: Semantic Textual Similarity with Deep Regression and Machine Translation Evaluation Metrics.

Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

Discourse and Document-level Information for Evaluating Language Output Tasks.

Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Searching for Context: a Study on Document-Level Labels for Translation Quality Estimation.

Proceedings of the 18th Annual Conference of the European Association for Machine Translation, 2015

Multi-level Translation Quality Prediction with QuEst++.

Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014

Exploring Consensus in Machine Translation for Quality Estimation.

Proceedings of the Ninth Workshop on Statistical Machine Translation, 2014

Using Cross-Linguistic Knowledge to Build VerbNet-Style Lexicons: Results for a (Brazilian) Portuguese VerbNet.

Magali Sanches Duran

Proceedings of the Computational Processing of the Portuguese Language, 2014

Document-level translation quality estimation: exploring discourse and pseudo-references.

Proceedings of the 17th Annual conference of the European Association for Machine Translation, 2014

Verb Clustering for Brazilian Portuguese.

Proceedings of the Computational Linguistics and Intelligent Text Processing, 2014

2013

Identifying Pronominal Verbs: Towards Automatic Disambiguation of the Clitic 'se' in Portuguese.

Magali Sanches Duran

Carlos Ramisch

Proceedings of the 9th Workshop on Multiword Expressions, 2013

2011

VerbNet.Br: construção semiautomática de um léxico computacional de verbos para o português do Brasil (VerbNet.Br: semiautomatic construction of a computational verb lexicon for Brazilian Portuguese) [in Portuguese].

Proceedings of the 8th Brazilian Symposium in Information and Human Language Technology, 2011

Comparando Avaliações de Inteligibilidade Textual entre Originais e Traduções de Textos Literários (Comparing Textual Intelligibility Evaluations among Literary Source Texts and their Translations) [in Portuguese].

Bianca Franco Pasqualini

Maria José Bocorny Finatto

Proceedings of the 8th Brazilian Symposium in Information and Human Language Technology, 2011

Características do jornalismo popular: avaliação da inteligibilidade e auxílio à descrição do gênero (Characteristics of Popular News: the Evaluation of Intelligibility and Support to the Genre Description) [in Portuguese].

Maria José Bocorny Finatto

Amanda Rocha

Proceedings of the 8th Brazilian Symposium in Information and Human Language Technology, 2011

2010

Análise da Inteligibilidade de textos via ferramentas de Processamento de Língua Natural: adaptando as métricas do Coh-Metrix para o Português.

Linguamática, 2010

SIMPLIFICA: a tool for authoring simplified texts in Brazilian Portuguese guided by readability assessments.

Matheus de Oliveira

Arnaldo Cândido Júnior

Caroline Gasperin

Proceedings of Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, June 2, 2010, Los Angeles, California, USA

Revisiting the Readability Assessment of Texts in Portuguese.

Caroline Gasperin