Chaoya Jiang

Orcid: 0009-0009-7282-159X

According to our database¹, Chaoya Jiang authored at least 22 papers between 2020 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2025

OSWorld-MCP: Benchmarking MCP Tool Invocation In Computer-Use Agents.

[BibT_eX]

[DOI]

CoRR, October, 2025

Decoupling Reasoning and Perception: An LLM-LMM Framework for Faithful Visual Reasoning.

[BibT_eX]

[DOI]

CoRR, September, 2025

HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models.

[BibT_eX]

[DOI]

CoRR, June, 2025

VLM-R<sup>3</sup>: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought.

[BibT_eX]

[DOI]

CoRR, May, 2025

SymDPO: Boosting In-Context Learning of Large Multimodal Models with Symbol Demonstration Direct Preference Optimization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection.

[BibT_eX]

[DOI]

CoRR, 2024

MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models.

[BibT_eX]

[DOI]

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization.

[BibT_eX]

[DOI]

Proceedings of the Twelfth International Conference on Learning Representations, 2024

MIBench: Evaluating Multimodal Large Language Models over Multiple Images.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Hallucination Augmented Contrastive Learning for Multimodal Large Language Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Enhancing In-Context Learning via Implicit Demonstration Augmentation.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization.

[BibT_eX]

[DOI]

CoRR, 2023

Vision Langauge Pre-training by Contrastive Learning with Cross-Modal Similarity Regulation.

[BibT_eX]

[DOI]

CoRR, 2023

COPA : Efficient Vision-Language Pre-training through Collaborative Object- and Patch-Text Alignment.

[BibT_eX]

[DOI]

Proceedings of the 31st ACM International Conference on Multimedia, 2023

BUS : Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Vision Language Pre-training by Contrastive Learning with Cross-Modal Similarity Regulation.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Exploiting Pseudo Image Captions for Multimodal Summarization.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022

TRIPS: Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2020

Learn A Robust Representation For Cover Song Identification Via Aggregating Local And Global Music Temporal Context.

[BibT_eX]

[DOI]

Chaoya Jiang

Deshun Yang

Xiaoou Chen

Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

Similarity Learning For Cover Song Identification Using Cross-Similarity Matrices of Multi-Level Deep Sequences.

[BibT_eX]

[DOI]

Chaoya Jiang

Deshun Yang

Xiaoou Chen

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Chaoya Jiang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...