Yuxia Wang

Affiliations:
  • INSAIT, Sofia, Bulgaria


According to our database1, Yuxia Wang authored at least 68 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
A Fano-Style Accuracy Upper Bound for LLM Single-Pass Reasoning in Multi-Hop QA.
CoRR, September, 2025

MuDRiC: Multi-Dialect Reasoning for Arabic Commonsense Validation.
CoRR, August, 2025

UnsafeChain: Enhancing Reasoning Model Safety via Hard Cases.
CoRR, July, 2025

FRaN-X: FRaming and Narratives-eXplorer.
CoRR, July, 2025

FinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial Reasoning.
CoRR, June, 2025

UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking.
CoRR, May, 2025

FAID: Fine-grained AI-generated Text Detection using Multi-task Auxiliary and Multi-level Contrastive Learning.
CoRR, May, 2025

A Comprehensive Survey of Machine Unlearning Techniques for Large Language Models.
CoRR, March, 2025

Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh.
CoRR, March, 2025

Qorgau: Evaluating LLM Safety in Kazakh-Russian Bilingual Contexts.
CoRR, February, 2025

Is Human-Like Text Liked by Humans? Multilingual Human Detection and Preference Against AI.
CoRR, February, 2025

Against The Achilles' Heel: A Survey on Red Teaming for Generative Models.
J. Artif. Intell. Res., 2025

FIRE: Fact-checking with Iterative Retrieval and Verification.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Arabic Dataset for LLM Safeguard Evaluation.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Overview of PAN 2025: Generative AI Detection, Multilingual Text Detoxification, Multi-author Writing Style Analysis, and Generative Plagiarism Detection - Extended Abstract.
Proceedings of the Advances in Information Retrieval, 2025

OpenFactCheck: Building, Benchmarking Customized Fact-Checking Systems and Evaluating the Factuality of Claims and LLMs.
Proceedings of the 31st International Conference on Computational Linguistics, 2025


Proceedings of the 1st Workshop on GenAI Content Detection (GenAIDetect).
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Loki: An Open-Source Tool for Fact Verification.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Overview of PAN 2025: Voight-Kampff Generative AI Detection, Multilingual Text Detoxification, Multi-author Writing Style Analysis, and Generative Plagiarism Detection.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2025

KazMMLU: Evaluating Language Models on Kazakh, Russian, and Regional Knowledge of Kazakhstan.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Explicit and Implicit Data Augmentation for Social Event Detection.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Instruction Tuning on Public Government and Cultural Data for Low-Resource Language: a Case Study in Kazakh.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Qorǵau: Evaluating Safety in Kazakh-Russian Bilingual Contexts.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

VSCBench: Bridging the Gap in Vision-Language Model Safety Calibration.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

HD-NDEs: Neural Differential Equations for Hallucination Detection in LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Black-Box Machine-Generated Text Detection.
Dataset, April, 2024

Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability.
CoRR, 2024

OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs.
CoRR, 2024

LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection.
CoRR, 2024

OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs.
CoRR, 2024

SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection.
CoRR, 2024

A Chinese Dataset for Evaluating the Safeguards in Large Language Models.
CoRR, 2024

M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection.
CoRR, 2024

Factuality of Large Language Models in the Year 2024.
CoRR, 2024

SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection.
Proceedings of the 18th International Workshop on Semantic Evaluation, 2024

A Survey of Confidence Estimation and Calibration in Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Factuality of Large Language Models: A Survey.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Factcheck-Bench: Fine-Grained Evaluation Benchmark for Automatic Fact-checkers.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Can Machines Resonate with Humans? Evaluating the Emotional and Empathic Comprehension of LMs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Rethinking STS and NLI in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Do-Not-Answer: Evaluating Safeguards in LLMs.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

A Chinese Dataset for Evaluating the Safeguards in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Demystifying Instruction Mixing for Fine-tuning Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

2023
Collective Human Opinions in Semantic Textual Similarity.
Trans. Assoc. Comput. Linguistics, 2023

Understanding the Instruction Mixture for Large Language Model Fine-tuning.
CoRR, 2023

Factcheck-GPT: End-to-End Fine-Grained Document-Level Fact-Checking and Correction of LLM Output.
CoRR, 2023

A Survey of Language Model Confidence Estimation and Calibration.
CoRR, 2023

Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs.
CoRR, 2023

M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection.
CoRR, 2023

2022
Uncertainty Estimation and Reduction of Pre-trained Models for Text Regression.
Trans. Assoc. Comput. Linguistics, 2022

The HW-TSC's Offline Speech Translation System for IWSLT 2022 Evaluation.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

The HW-TSC's Simultaneous Speech Translation System for IWSLT 2022 Evaluation.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

The HW-TSC's Speech to Speech Translation System for IWSLT 2022 Evaluation.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

Diformer: Directional Transformer for Neural Machine Translation.
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

Noisy Label Regularisation for Textual Regression.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Capture Human Disagreement Distributions by Calibrated Networks for Natural Language Inference.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
Joint-training on Symbiosis Networks for Deep Nueral Machine Translation models.
CoRR, 2021

Self-Distillation Mixup Training for Non-autoregressive Neural Machine Translation.
CoRR, 2021

The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation.
CoRR, 2021

HW-TSC's Participation at WMT 2021 Quality Estimation Shared Task.
Proceedings of the Sixth Conference on Machine Translation, 2021

HI-CMLM: Improve CMLM with Hybrid Decoder Input.
Proceedings of the 14th International Conference on Natural Language Generation, 2021

Incorporating Complete Syntactical Knowledge for Spoken Language Understanding.
Proceedings of the Knowledge Graph and Semantic Computing: Knowledge Graph Empowers New Infrastructure Construction, 2021

How Length Prediction Influence the Performance of Non-Autoregressive Translation?
Proceedings of the Fourth BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, 2021

2020
Evaluating the Utility of Model Configurations and Data Augmentation on Clinical Semantic Textual Similarity.
Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing, 2020

Learning from Unlabelled Data for Clinical Semantic Textual Similarity.
Proceedings of the 3rd Clinical Natural Language Processing Workshop, 2020


  Loading...