Ziyue Wang

ORCID: 0009-0004-1433-0681

Affiliations:
  • Tsinghua University, Institute for AI, Department of Computer Science and Technology, Beijing, China
  • iFLYTEK Research, Beijing, China


According to our database, Ziyue Wang authored at least 16 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Evaluating Time Awareness and Cross-modal Active Perception of Large Models via 4D Escape Room Task.
CoRR, March, 2026

MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
Bridging Vision, Language, and Mathematics: Pictographic Character Reconstruction with Bézier Curves.
CoRR, November, 2025

Perspective Transition of Large Language Models for Solving Subjective Tasks.
CoRR, January, 2025

How Do Multimodal Large Language Models Handle Complex Multimodal Reasoning? Placing Them in an Extensible Escape Game.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Browse and Concentrate: Comprehending Multimodal Content via Prior-LLM Context Fusion.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

CODIS: Benchmarking Context-dependent Visual Comprehension for Multimodal Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Model Composition for Multimodal Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
TiBERT: A Non-autoregressive Pre-trained Model for Text Editing.
Proceedings of the Natural Language Processing and Chinese Computing, 2023

Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

2022
Augmented and challenging datasets with multi-step reasoning and multi-span questions for Chinese judicial reading comprehension.
AI Open, January, 2022

InterHT: Knowledge Graph Embeddings by Interaction between Head and Tail Entities.
CoRR, 2022

2021
Various Legal Factors Extraction Based on Machine Reading Comprehension.
Proceedings of the Information Retrieval - 27th China Conference, 2021

2019
IFlyLegal: A Chinese Legal System for Consultation, Law Searching, and Document Analysis.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

CJRC: A Reliable Human-Annotated Benchmark DataSet for Chinese Judicial Reading Comprehension.
Proceedings of the Chinese Computational Linguistics - 18th China National Conference, 2019

