We stand with Ukraine

We stand with Ukraine

Zongxia Li

According to our database¹, Zongxia Li authored at least 30 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

G-Zero: Self-Play for Open-Ended Generation from Zero Data.

[DOI]

Chengsong Huang

,

,

,

,

,

,

,

,

,

CoRR, May, 2026

Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks.

[DOI]

,

,

,

Alexander Duffy

,

,

Matthew Lyle Olson

,

,

CoRR, April, 2026

Graph of Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills.

[DOI]

,

,

,

,

,

,

CoRR, April, 2026

SABER: A Stealthy Agentic Black-Box Attack Framework for Vision-Language-Action Models.

[DOI]

,

,

,

,

Amrit Singh Bedi

,

CoRR, March, 2026

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data.

[DOI]

,

,

Chengsong Huang

,

,

,

,

,

,

,

,

CoRR, March, 2026

2025

From Inpainting to Layer Decomposition: Repurposing Generative Inpainting Models for Image Layer Decomposition.

[DOI]

,

,

,

,

Cornelia Fermüller

,

,

Yiannis Aloimonos

CoRR, November, 2025

MASS: Motion-Aware Spatial-Temporal Grounding for Physics Reasoning and Comprehension in Vision-Language Models.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, November, 2025

First Frame Is the Place to Go for Video Content Customization.

[DOI]

,

,

,

,

,

,

Cornelia Fermüller

,

Brandon Y. Feng

,

Yiannis Aloimonos

CoRR, November, 2025

VisPlay: Self-Evolving Vision-Language Models from Images.

[DOI]

,

Chengsong Huang

,

,

,

CoRR, November, 2025

VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning.

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, October, 2025

Self-Rewarding Vision-Language Model via Reasoning Decomposition.

[DOI]

,

,

Chengsong Huang

,

,

,

,

,

,

Jordan L. Boyd-Graber

,

,

CoRR, August, 2025

R-Zero: Self-Evolving Reasoning LLM from Zero Data.

[DOI]

Chengsong Huang

,

,

,

,

,

,

,

,

CoRR, August, 2025

Semantically-Aware Rewards for Open-Ended R1 Training in Free-Form Generation.

[DOI]

,

,

,

,

,

,

Jordan Lee Boyd-Graber

CoRR, June, 2025

Large Language Models Are Effective Human Annotation Assistants, But Not Good Independent Annotators.

[DOI]

,

,

Carlos Rafael Colon

,

,

,

Jordan Lee Boyd-Graber

CoRR, March, 2025

Large Language Models Struggle to Describe the Haystack without Human Help: Human-in-the-loop Evaluation of LLMs.

[DOI]

,

Lorena Calvo-Bartolomé

,

Alexander Miserlis Hoyle

,

,

,

Juan Francisco Fung

,

Jordan L. Boyd-Graber

CoRR, February, 2025

Benchmark Evaluations, Applications, and Challenges of Large Vision Language Models: A Survey.

[DOI]

,

,

,

,

CoRR, January, 2025

VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations on Synthetic Video Understanding.

[DOI]

,

,

,

,

,

,

,

Jordan L. Boyd-Graber

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

A Survey of State of the Art Large Vision Language Models: Benchmark Evaluations and Challenges.

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Large Language Models Struggle to Describe the Haystack without Human Help: A Social Science-Inspired Evaluation of Topic Models.

[DOI]

,

Lorena Calvo-Bartolomé

,

Alexander Miserlis Hoyle

,

,

Daniel Kofi Stephens

,

Juan Francisco Fung

,

,

Jordan Lee Boyd-Graber

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

PANDA (Pedantic ANswer-correctness Determination and Adjudication): Improving Automatic Evaluation for Question Answering and Text Generation.

[DOI]

,

,

,

,

Jordan Lee Boyd-Graber

CoRR, 2024

Beyond Automated Evaluation Metrics: Evaluating Topic Models On Practical Social Science Content Analysis Tasks.

[DOI]

,

,

Daniel Kofi Stephens

,

,

,

,

,

Jordan L. Boyd-Graber

CoRR, 2024

SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement.

[DOI]

,

,

,

Anandhavelu Natarajan

,

Aparna Garimella

,

Jordan L. Boyd-Graber

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

PEDANTS: Cheap but Effective and Interpretable Answer Equivalence.

[DOI]

,

,

,

,

Jordan L. Boyd-Graber

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Improving the TENOR of Labeling: Re-evaluating Topic Models for Content Analysis.

[DOI]

,

,

Daniel Kofi Stephens

,

,

,

,

,

Jordan L. Boyd-Graber

Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

Hallusionbench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Do Large Language Models Discriminate in Hiring Decisions on the Basis of Race, Ethnicity, and Gender?

[DOI]

,

Christabel Acquaye

,

,

,

Rachel Rudinger

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2024

2023

HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models.

[DOI]

,

,

,

,

,

,

CoRR, 2023

Towards Understanding In-Context Learning with Contrastive Demonstrations and Saliency Maps.

[DOI]

,

,

,

CoRR, 2023

SODAPOP: Open-Ended Discovery of Social Biases in Social Commonsense Reasoning Models.

[DOI]

,

,

,

Rachel Rudinger

Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2021

An Empirical Comparison of the Quadratic Sieve Factoring Algorithm and the Pollard Rho Factoring Algorithm.

[DOI]

,

William Gasarch

CoRR, 2021

Loading...