We stand with Ukraine

We stand with Ukraine

Zixian Ma

Orcid: 0000-0002-5369-6430

According to our database¹, Zixian Ma authored at least 32 papers between 2021 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

JobBench: Aligning Agent Work With Human Will.

[DOI]

CoRR, May, 2026

CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows?

[DOI]

,

,

,

,

,

,

,

,

T. Y. Alvin Liu

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2026

You Only Judge Once: Multi-response Reward Modeling in a Single Forward Pass.

[DOI]

,

,

,

,

CoRR, April, 2026

MolmoWeb: Open Visual Web Agent and Open Data for the Open Web.

[DOI]

,

,

,

,

Rock Yuren Pang

,

,

,

,

,

,

,

,

Caleb Ouellette

,

,

,

CoRR, April, 2026

MolmoPoint: Better Pointing for VLMs with Grounding Tokens.

[DOI]

Christopher Clark

,

,

,

,

,

,

Mohammadreza Salehi

,

,

,

,

CoRR, March, 2026

VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models.

[DOI]

,

,

,

,

,

,

,

,

CoRR, March, 2026

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding.

[DOI]

CoRR, January, 2026

2025

SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning.

[DOI]

,

,

,

,

Chris Dongjoo Kim

,

,

,

,

Christopher Clark

,

CoRR, December, 2025

Completion ≠ Collaboration: Scaling Collaborative Effort with Agents.

[DOI]

Shannon Zejiang Shen

,

,

,

,

,

,

,

,

,

,

Jocelyn J. Shen

,

Ameet Talwalkar

,

,

David A. Sontag

CoRR, October, 2025

Rethinking Human Preference Evaluation of LLM Rationales.

[DOI]

,

,

,

Helena Vasconcelos

,

,

CoRR, September, 2025

Reinforced Visual Perception with Tools.

[DOI]

,

,

,

,

,

,

,

,

CoRR, September, 2025

Explain Before You Answer: A Survey on Compositional Visual Reasoning.

[DOI]

,

,

,

,

,

,

,

,

Pari Delir Haghighi

,

Gholamreza Haffari

,

,

,

Hamid Rezatofighi

CoRR, August, 2025

Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations.

[DOI]

,

Mahtab Bigverdi

,

,

,

,

,

,

CoRR, June, 2025

Biological Sequence with Language Model Prompting: A Survey.

[DOI]

,

,

,

,

,

,

,

CoRR, March, 2025

Benchmarking Large Language Models on Multiple Tasks in Bioinformatics NLP with Prompting.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, March, 2025

LATTE: Learning to Think with Vision Specialists.

[DOI]

,

,

,

,

,

,

Juan Carlos Niebles

,

Shelby Heinecke

,

,

,

,

Silvio Savarese

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Synthetic Visual Genome.

[DOI]

,

,

,

,

,

,

Khyathi Raghavi Chandu

,

,

Norimasa Kobori

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models.

[DOI]

,

,

,

,

,

,

,

,

Juan Carlos Niebles

,

Silvio Savarese

,

,

,

,

CoRR, 2024

TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action.

[DOI]

,

,

,

,

,

,

Juan Carlos Niebles

,

Shelby Heinecke

,

,

,

,

Silvio Savarese

CoRR, 2024

Task Me Anything.

[DOI]

,

,

,

,

,

,

,

,

Aniruddha Kembhavi

,

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples.

[DOI]

,

,

,

Jean de Dieu Nyandwi

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

m &m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks.

[DOI]

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality.

[DOI]

,

,

,

Aniruddha Kembhavi

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

@ CREPE: Can Vision-Language Foundation Models Reason Compositionally?

[DOI]

,

,

Mustafa Omer Gul

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Model Sketching: Centering Concepts in Early-Stage Machine Learning Model Design.

[DOI]

Michelle S. Lam

,

,

,

Izequiel Freitas

,

,

James A. Landay

,

Michael S. Bernstein

Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022

A Lightweight Deep Learning-Based Algorithm for Array Imperfection Correction and DOA Estimation.

[DOI]

,

,

,

,

,

,

,

J. Commun. Inf. Networks, September, 2022

DOA Estimation Based on Root Sparse Bayesian Learning Under Gain and Phase Error.

[DOI]

,

,

,

,

,

,

J. Commun. Inf. Networks, June, 2022

MobilePhys: Personalized Mobile Camera-Based Contactless Physiological Sensing.

[DOI]

,

,

,

,

,

,

Shwetak N. Patel

Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2022

ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward.

[DOI]

,

,

,

Michael S. Bernstein

,

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Multi-Satellite Tracking For The LEO Satellite Communication Network.

[DOI]

,

,

,

,

Proceedings of the IEEE International Conference on Communications, 2022

2021

OpenAttack: An Open-source Textual Adversarial Attack Toolkit.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Loading...