Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Overview of PAN 2025: Generative AI Detection, Multilingual Text Detoxification, Multi-author Writing Style Analysis, and Generative Plagiarism Detection - Extended Abstract.

[BibT_eX]

[DOI]

Janek Bevendorff

Efstathios Stamatatos

Proceedings of the Advances in Information Retrieval, 2025

OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

GenAI Content Detection Task 1: English and Multilingual Machine-Generated Text Detection: AI vs. Human.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Proceedings of the 1st Workshop on GenAI Content Detection (GenAIDetect).

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Loki: An Open-Source Tool for Fact Verification.

[BibT_eX]

[DOI]

Proceedings of the 31st International Conference on Computational Linguistics, 2025

Overview of PAN 2025: Voight-Kampff Generative AI Detection, Multilingual Text Detoxification, Multi-author Writing Style Analysis, and Generative Plagiarism Detection.

[BibT_eX]

[DOI]

Efstathios Stamatatos

Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2025

KazMMLU: Evaluating Language Models on Kazakh, Russian, and Regional Knowledge of Kazakhstan.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Explicit and Implicit Data Augmentation for Social Event Detection.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Instruction Tuning on Public Government and Cultural Data for Low-Resource Language: a Case Study in Kazakh.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Qorǵau: Evaluating Safety in Kazakh-Russian Bilingual Contexts.

[BibT_eX]

[DOI]

Zain Muhammad Mujahid

Fajri Koto

Timothy Baldwin

Preslav Nakov

Proceedings of the Findings of the Association for Computational Linguistics, 2025

VSCBench: Bridging the Gap in Vision-Language Model Safety Calibration.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

HD-NDEs: Neural Differential Equations for Hallucination Detection in LLMs.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Black-Box Machine-Generated Text Detection.

[BibT_eX]

[DOI]

Dataset, April, 2024

Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability.

[BibT_eX]

[DOI]

CoRR, 2024

OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs.

[BibT_eX]

[DOI]

CoRR, 2024

LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection.

[BibT_eX]

[DOI]

CoRR, 2024

OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs.

[BibT_eX]

[DOI]

CoRR, 2024

SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection.

[BibT_eX]

[DOI]

CoRR, 2024

A Chinese Dataset for Evaluating the Safeguards in Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection.

[BibT_eX]

[DOI]

CoRR, 2024

Factuality of Large Language Models in the Year 2024.

[BibT_eX]

[DOI]

Yuxia Wang

Minghan Wang

Muhammad Arslan Manzoor

CoRR, 2024

SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection.

[BibT_eX]

[DOI]

Proceedings of the 18th International Workshop on Semantic Evaluation, 2024

A Survey of Confidence Estimation and Calibration in Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Factuality of Large Language Models: A Survey.

[BibT_eX]

[DOI]

Yuxia Wang

Minghan Wang

Muhammad Arslan Manzoor

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers.

[BibT_eX]

[DOI]

Yuxia Wang

Revanth Gangi Reddy

Zain Muhammad Mujahid

Arnav Arora

Aleksandr Rubashevskii

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Can Machines Resonate with Humans? Evaluating the Emotional and Empathic Comprehension of LMs.

[BibT_eX]

[DOI]

Muhammad Arslan Manzoor

Yuxia Wang

Minghan Wang

Preslav Nakov

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Rethinking STS and NLI in Large Language Models.

[BibT_eX]

[DOI]

Yuxia Wang

Minghan Wang

Preslav Nakov

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection.

[BibT_eX]

[DOI]

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Do-Not-Answer: Evaluating Safeguards in LLMs.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

A Chinese Dataset for Evaluating the Safeguards in Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Demystifying Instruction Mixing for Fine-tuning Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

2023

Collective Human Opinions in Semantic Textual Similarity.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2023

Understanding the Instruction Mixture for Large Language Model Fine-tuning.

[BibT_eX]

[DOI]

CoRR, 2023

Factcheck-GPT: End-to-End Fine-Grained Document-Level Fact-Checking and Correction of LLM Output.

[BibT_eX]

[DOI]

Yuxia Wang

Revanth Gangi Reddy

Zain Muhammad Mujahid

Arnav Arora

Aleksandr Rubashevskii

CoRR, 2023

A Survey of Language Model Confidence Estimation and Calibration.

[BibT_eX]

[DOI]

CoRR, 2023

Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs.

[BibT_eX]

[DOI]

CoRR, 2023

M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection.

[BibT_eX]

[DOI]

CoRR, 2023

2022

Uncertainty Estimation and Reduction of Pre-trained Models for Text Regression.

[BibT_eX]

[DOI]

Trans. Assoc. Comput. Linguistics, 2022

The HW-TSC's Offline Speech Translation System for IWSLT 2022 Evaluation.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Spoken Language Translation, 2022

The HW-TSC's Simultaneous Speech Translation System for IWSLT 2022 Evaluation.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Spoken Language Translation, 2022

The HW-TSC's Speech to Speech Translation System for IWSLT 2022 Evaluation.

[BibT_eX]

[DOI]

Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Diformer: Directional Transformer for Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

Noisy Label Regularisation for Textual Regression.

[BibT_eX]

[DOI]

Yuxia Wang

Timothy Baldwin

Karin Verspoor

Proceedings of the 29th International Conference on Computational Linguistics, 2022

Capture Human Disagreement Distributions by Calibrated Networks for Natural Language Inference.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021

Joint-training on Symbiosis Networks for Deep Nueral Machine Translation models.

[BibT_eX]

[DOI]

CoRR, 2021

Self-Distillation Mixup Training for Non-autoregressive Neural Machine Translation.

[BibT_eX]

[DOI]

CoRR, 2021

The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation.

[BibT_eX]

[DOI]

CoRR, 2021

HW-TSC's Participation at WMT 2021 Quality Estimation Shared Task.

[BibT_eX]

[DOI]

Proceedings of the Sixth Conference on Machine Translation, 2021

HI-CMLM: Improve CMLM with Hybrid Decoder Input.

[BibT_eX]

[DOI]

Proceedings of the 14th International Conference on Natural Language Generation, 2021

Incorporating Complete Syntactical Knowledge for Spoken Language Understanding.

[BibT_eX]

[DOI]

Proceedings of the Knowledge Graph and Semantic Computing: Knowledge Graph Empowers New Infrastructure Construction, 2021

How Length Prediction Influence the Performance of Non-Autoregressive Translation?

[BibT_eX]

[DOI]

Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021

2020

Evaluating the Utility of Model Configurations and Data Augmentation on Clinical Semantic Textual Similarity.

[BibT_eX]

[DOI]

Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing, 2020

Learning from Unlabelled Data for Clinical Semantic Textual Similarity.

[BibT_eX]

[DOI]

Yuxia Wang

Karin Verspoor

Timothy Baldwin

Proceedings of the 3rd Clinical Natural Language Processing Workshop, 2020

Yuxia Wang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...