Jiarui Zhang

ORCID: 0009-0002-7294-541X

Affiliations:
  • University of Southern California (USC), Information Sciences Institute, Los Angeles, CA, USA
  • Tsinghua University, Department of Electronic Engineering, Beijing, China


According to our database, Jiarui Zhang authored at least 17 papers between 2021 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Exploring Perceptual Limitations of Multimodal LLMs on Small Visual Objects.
Trans. Mach. Learn. Res., 2026

2025
MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions.
CoRR, 2024

Guided Profile Generation Improves Personalization with LLMs.
CoRR, 2024

Exploring Perceptual Limitation of Multimodal Large Language Models.
CoRR, 2024

The Curious Case of Nonverbal Abstract Reasoning with Multi-Modal Large Language Models.
CoRR, 2024

MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Guided Profile Generation Improves Personalization with Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Visual Cropping Improves Zero-Shot Question Answering of Multimodal Large Language Models.
CoRR, 2023

Using Visual Cropping to Enhance Fine-Detail Question Answering of BLIP-Family Models.
CoRR, 2023

A Study of Situational Reasoning for Traffic Understanding.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Knowledge-enhanced Agents for Interactive Text Games.
Proceedings of the 12th Knowledge Capture Conference 2023, 2023

2022
Utilizing Background Knowledge for Robust Reasoning over Traffic Situations.
CoRR, 2022

An Empirical Investigation of Commonsense Self-Supervision with Knowledge Graphs.
CoRR, 2022

Clickbait Detection via Contrastive Variational Modelling of Text and Label.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

A Study of Zero-shot Adaptation with Commonsense Knowledge.
Proceedings of the 4th Conference on Automated Knowledge Base Construction, 2022

2021
CCPM: A Chinese Classical Poetry Matching Dataset.
CoRR, 2021

