Yixiao Song

According to our database1, Yixiao Song authored at least 14 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Does quantization affect models' performance on long-context tasks?
CoRR, May, 2025

BEARCUBS: A benchmark for computer-using web agents.
CoRR, March, 2025

Enhancing Human Evaluation in Machine Translation with Comparative Judgment.
CoRR, February, 2025

Enhancing Human Evaluation in Machine Translation with Comparative Judgement.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Localizing and Mitigating Errors in Long-form Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Synthetic-to-realistic domain adaptation for cold-start of rail inspection systems.
Comput. Aided Civ. Infrastructure Eng., February, 2024

Fine-grained Hallucination Detection and Mitigation in Long-form Question Answering.
CoRR, 2024

GEE! Grammar Error Explanation with Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

VeriScore: Evaluating the factuality of verifiable claims in long-form text generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

kNN-LM Does Not Improve Open-ended Text Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

A Critical Evaluation of Evaluations for Long-form Question Answering.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
SLING: Sino Linguistic Evaluation of Large Language Models.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

DEMETR: Diagnosing Evaluation Metrics for Translation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022


  Loading...