Zongxia Li

According to our database1, Zongxia Li authored at least 29 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks.
CoRR, April, 2026

Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills.
CoRR, April, 2026

SABER: A Stealthy Agentic Black-Box Attack Framework for Vision-Language-Action Models.
CoRR, March, 2026

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data.
CoRR, March, 2026

2025
From Inpainting to Layer Decomposition: Repurposing Generative Inpainting Models for Image Layer Decomposition.
CoRR, November, 2025

MASS: Motion-Aware Spatial-Temporal Grounding for Physics Reasoning and Comprehension in Vision-Language Models.
CoRR, November, 2025

First Frame Is the Place to Go for Video Content Customization.
CoRR, November, 2025

VisPlay: Self-Evolving Vision-Language Models from Images.
CoRR, November, 2025

VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning.
CoRR, October, 2025

Self-Rewarding Vision-Language Model via Reasoning Decomposition.
CoRR, August, 2025

R-Zero: Self-Evolving Reasoning LLM from Zero Data.
CoRR, August, 2025

Semantically-Aware Rewards for Open-Ended R1 Training in Free-Form Generation.
CoRR, June, 2025

VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding.
CoRR, May, 2025

Large Language Models Are Effective Human Annotation Assistants, But Not Good Independent Annotators.
CoRR, March, 2025

Large Language Models Struggle to Describe the Haystack without Human Help: Human-in-the-loop Evaluation of LLMs.
CoRR, February, 2025

Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey.
CoRR, January, 2025

A Survey of State of the Art Large Vision Language Models: Benchmark Evaluations and Challenges.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Large Language Models Struggle to Describe the Haystack without Human Help: A Social Science-Inspired Evaluation of Topic Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
PANDA (Pedantic ANswer-correctness Determination and Adjudication): Improving Automatic Evaluation for Question Answering and Text Generation.
CoRR, 2024

Beyond Automated Evaluation Metrics: Evaluating Topic Models On Practical Social Science Content Analysis Tasks.
CoRR, 2024

SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

PEDANTS: Cheap but Effective and Interpretable Answer Equivalence.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Improving the TENOR of Labeling: Re-evaluating Topic Models for Content Analysis.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Hallusionbench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2024

2023
HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models.
CoRR, 2023

Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps.
CoRR, 2023

SODAPOP: Open-Ended Discovery of Social Biases in Social Commonsense Reasoning Models.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2021
An Empirical Comparison of the Quadratic Sieve Factoring Algorithm and the Pollard Rho Factoring Algorithm.
CoRR, 2021


  Loading...