Zixian Ma

Orcid: 0000-0002-5369-6430

According to our database1, Zixian Ma authored at least 30 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
You Only Judge Once: Multi-response Reward Modeling in a Single Forward Pass.
CoRR, April, 2026

MolmoWeb: Open Visual Web Agent and Open Data for the Open Web.
CoRR, April, 2026

MolmoPoint: Better Pointing for VLMs with Grounding Tokens.
CoRR, March, 2026

VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models.
CoRR, March, 2026

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding.
CoRR, January, 2026

2025
SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning.
CoRR, December, 2025

Completion ≠ Collaboration: Scaling Collaborative Effort with Agents.
CoRR, October, 2025

Rethinking Human Preference Evaluation of LLM Rationales.
CoRR, September, 2025

Reinforced Visual Perception with Tools.
CoRR, September, 2025

Explain Before You Answer: A Survey on Compositional Visual Reasoning.
CoRR, August, 2025

Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations.
CoRR, June, 2025

Biological Sequence with Language Model Prompting: A Survey.
CoRR, March, 2025

Benchmarking Large Language Models on Multiple Tasks in Bioinformatics NLP with Prompting.
CoRR, March, 2025

LATTE: Learning to Think with Vision Specialists.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Synthetic Visual Genome.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024
ProVision: Programmatically Scaling Vision-centric Instruction Data for Multimodal Language Models.
CoRR, 2024

TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action.
CoRR, 2024

Task Me Anything.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

m &m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
SugarCrepe: Fixing Hackable Benchmarks for Vision-Language Compositionality.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

@ CREPE: Can Vision-Language Foundation Models Reason Compositionally?
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Model Sketching: Centering Concepts in Early-Stage Machine Learning Model Design.
Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, 2023

2022
A Lightweight Deep Learning-Based Algorithm for Array Imperfection Correction and DOA Estimation.
J. Commun. Inf. Networks, September, 2022

DOA Estimation Based on Root Sparse Bayesian Learning Under Gain and Phase Error.
J. Commun. Inf. Networks, June, 2022

MobilePhys: Personalized Mobile Camera-Based Contactless Physiological Sensing.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2022

ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Multi-Satellite Tracking For The LEO Satellite Communication Network.
Proceedings of the IEEE International Conference on Communications, 2022

2021
OpenAttack: An Open-source Textual Adversarial Attack Toolkit.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021


  Loading...