Ziqiao Ma

ORCID: 0000-0002-0760-4638

Affiliations:
  • University of Michigan, USA


According to our database, Ziqiao Ma authored at least 27 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry.
CoRR, October, 2025

The Mechanistic Emergence of Symbol Grounding in Language Models.
CoRR, October, 2025

From Behavioral Performance to Internal Competence: Interpreting Vision-Language Models with VLM-Lens.
CoRR, October, 2025

AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies.
CoRR, August, 2025

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time.
CoRR, June, 2025

Can Vision Language Models Infer Human Gaze Direction? A Controlled Study.
CoRR, June, 2025

Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation.
CoRR, April, 2025

VEGGIE: Instructional Editing and Reasoning of Video Concepts with Grounded Generation.
CoRR, March, 2025

Babysit A Language Model From Scratch: Interactive Language Learning by Trials and Demonstrations.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference under Ambiguities.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models.
Trans. Mach. Learn. Res., 2024

Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions.
CoRR, 2024

Multi-Object Hallucination in Vision Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

Groundhog: Grounding Large Language Models to Holistic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Inversion-Free Image Editing with Language-Guided Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Partition-Based Active Learning for Graph Neural Networks.
Trans. Mach. Learn. Res., 2023

Inversion-Free Image Editing with Natural Language.
CoRR, 2023

CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Towards Collaborative Plan Acquisition through Theory of Mind Modeling in Situated Dialogue.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

NLP Reproducibility For All: Understanding Experiences of Beginners.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
DANLI: Deliberative Agent for Following Natural Language Instructions.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

