Zongxia Li

According to our database1, Zongxia Li authored at least 19 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
R-Zero: Self-Evolving Reasoning LLM from Zero Data.
CoRR, August, 2025

Semantically-Aware Rewards for Open-Ended R1 Training in Free-Form Generation.
CoRR, June, 2025

VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding.
CoRR, May, 2025

Large Language Models Are Effective Human Annotation Assistants, But Not Good Independent Annotators.
CoRR, March, 2025

Large Language Models Struggle to Describe the Haystack without Human Help: Human-in-the-loop Evaluation of LLMs.
CoRR, February, 2025

Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey.
CoRR, January, 2025

A Survey of State of the Art Large Vision Language Models: Benchmark Evaluations and Challenges.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Large Language Models Struggle to Describe the Haystack without Human Help: A Social Science-Inspired Evaluation of Topic Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
PANDA (Pedantic ANswer-correctness Determination and Adjudication): Improving Automatic Evaluation for Question Answering and Text Generation.
CoRR, 2024

Beyond Automated Evaluation Metrics: Evaluating Topic Models On Practical Social Science Content Analysis Tasks.
CoRR, 2024

SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

PEDANTS: Cheap but Effective and Interpretable Answer Equivalence.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Improving the TENOR of Labeling: Re-evaluating Topic Models for Content Analysis.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Hallusionbench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

2023
HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models.
CoRR, 2023

Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps.
CoRR, 2023

SODAPOP: Open-Ended Discovery of Social Biases in Social Commonsense Reasoning Models.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2021
An Empirical Comparison of the Quadratic Sieve Factoring Algorithm and the Pollard Rho Factoring Algorithm.
CoRR, 2021


  Loading...