Ziqiao Ma

Orcid: 0000-0002-0760-4638

Affiliations:
  • University of Michigan, USA


According to our database1, Ziqiao Ma authored at least 23 papers between 2022 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time.
CoRR, June, 2025

Can Vision Language Models Infer Human Gaze Direction? A Controlled Study.
CoRR, June, 2025

Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation.
CoRR, April, 2025

VEGGIE: Instructional Editing and Reasoning of Video Concepts with Grounded Generation.
CoRR, March, 2025

Babysit A Language Model From Scratch: Interactive Language Learning by Trials and Demonstrations.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference under Ambiguities.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models.
Trans. Mach. Learn. Res., 2024

Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions.
CoRR, 2024

Multi-Object Hallucination in Vision Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2024

Groundhog Grounding Large Language Models to Holistic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Inversion-Free Image Editing with Language-Guided Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Partition-Based Active Learning for Graph Neural Networks.
Trans. Mach. Learn. Res., 2023

Inversion-Free Image Editing with Natural Language.
CoRR, 2023

CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Towards Collaborative Plan Acquisition through Theory of Mind Modeling in Situated Dialogue.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

NLP Reproducibility For All: Understanding Experiences of Beginners.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
DANLI: Deliberative Agent for Following Natural Language Instructions.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

DOROTHIE: Spoken Dialogue for Handling Unexpected Situations in Interactive Autonomous Driving Agents.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022


  Loading...