We stand with Ukraine

We stand with Ukraine

Ziyue Wang

Orcid: 0009-0004-1433-0681

Affiliations:

Tsinghua University, Institute for AI, Department of Computer Science and Technology, Beijing, China
iFLYTEK Research, Beijing, China

According to our database¹, Ziyue Wang authored at least 16 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

Online presence:

on orcid.org
on scholar.google.com

On csauthors.net:

Bibliography

2026

Evaluating Time Awareness and Cross-modal Active Perception of Large Models via 4D Escape Room Task.

[DOI]

,

,

,

,

,

,

,

CoRR, March, 2026

MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025

Bridging Vision, Language, and Mathematics: Pictographic Character Reconstruction with Bézier Curves.

[DOI]

,

Pau Tong Lin Xu

,

,

,

,

CoRR, November, 2025

Perspective Transition of Large Language Models for Solving Subjective Tasks.

[DOI]

,

,

,

,

,

,

,

CoRR, January, 2025

How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in an Extensible Escape Game.

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Browse and Concentrate: Comprehending Multimodal Content via Prior-LLM Context Fusion.

[DOI]

,

,

,

,

,

,

,

,

,

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

CODIS: Benchmarking Context-dependent Visual Comprehension for Multimodal Large Language Models.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Model Composition for Multimodal Large Language Models.

[DOI]

,

,

,

,

,

,

,

,

,

,

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

TiBERT: A Non-autoregressive Pre-trained Model for Text Editing.

[DOI]

,

,

,

,

,

,

Proceedings of the Natural Language Processing and Chinese Computing, 2023

Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions.

[DOI]

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022

Augmented and challenging datasets with multi-step reasoning and multi-span questions for Chinese judicial reading comprehension.

[DOI]

,

,

,

,

,

,

,

,

,

AI Open, January, 2022

InterHT: Knowledge Graph Embeddings by Interaction between Head and Tail Entities.

[DOI]

,

,

,

,

,

,

,

CoRR, 2022

2021

Various Legal Factors Extraction Based on Machine Reading Comprehension.

[DOI]

,

,

,

,

,

,

Proceedings of the Information Retrieval - 27th China Conference, 2021

2019

IFlyLegal: A Chinese Legal System for Consultation, Law Searching, and Document Analysis.

[DOI]

,

,

,

,

,

,

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

CJRC: A Reliable Human-Annotated Benchmark DataSet for Chinese Judicial Reading Comprehension.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

Loading...