We stand with Ukraine

We stand with Ukraine

Qingyi Si

Orcid: 0000-0001-8433-0215

According to our database¹, Qingyi Si authored at least 52 papers between 2018 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

GUI Agents for Continual Game Generation.

[DOI]

,

,

,

,

,

,

,

,

,

,

CoRR, May, 2026

Online Self-Calibration Against Hallucination in Vision-Language Models.

[DOI]

,

,

,

,

,

CoRR, May, 2026

Co-Evolving Policy Distillation.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, April, 2026

Near-Future Policy Optimization.

[DOI]

,

,

,

,

,

,

,

,

CoRR, April, 2026

EasyVideoR1: Easier RL for Video Understanding.

[DOI]

,

,

,

,

,

,

,

,

CoRR, April, 2026

Self-Distilled RLVR.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, April, 2026

Diagnosing and Repairing Unsafe Channels in Vision-Language Models via Causal Discovery and Dual-Modal Safety Subspace Projection.

[DOI]

,

,

,

,

,

CoRR, March, 2026

Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models.

[DOI]

,

,

,

,

,

CoRR, March, 2026

A Closer Look into LLMs for Table Understanding.

[DOI]

,

,

,

,

,

CoRR, March, 2026

HiMemVLN: Enhancing Reliability of Open-Source Zero-Shot Vision-and-Language Navigation with Hierarchical Memory System.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

CoRR, March, 2026

System 1&2 Synergy via Dynamic Model Interpolation.

[DOI]

,

,

,

,

,

,

,

,

CoRR, January, 2026

Outcome-Grounded Advantage Reshaping for Fine-Grained Credit Assignment in Mathematical Reasoning.

[DOI]

,

,

,

,

,

,

,

,

,

CoRR, January, 2026

IRPM: Intergroup Relative Preference Modeling for Pointwise Generative Reward Models.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

CoRR, January, 2026

Breaking the Trade-Off Between Faithfulness and Expressiveness for Large Language Models.

[DOI]

,

,

,

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Test-time Prompt Intervention.

[DOI]

,

,

,

,

,

,

,

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Sparse Attention Across Multiple-Context KV Cache.

[DOI]

,

,

,

Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025

LouisKV: Efficient KV Cache Retrieval for Long Input-Output Sequences.

[DOI]

,

,

,

,

CoRR, October, 2025

HANDO: Hierarchical Autonomous Navigation and Dexterous Omni-loco-manipulation.

[DOI]

,

,

,

,

,

,

,

,

CoRR, October, 2025

Breaking the Trade-Off Between Faithfulness and Expressiveness for Large Language Models.

[DOI]

,

,

CoRR, August, 2025

Stable Reinforcement Learning for Efficient Reasoning.

[DOI]

,

,

CoRR, May, 2025

S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models.

[DOI]

,

,

CoRR, May, 2025

Dynamic Early Exit in Reasoning Models.

[DOI]

,

,

,

,

,

,

,

CoRR, April, 2025

Towards Realistic Generation: A Multi-Task Agent for Imitating Diverse Character Linguistic Styles.

[DOI]

,

,

,

,

,

,

,

Proceedings of the International Joint Conference on Neural Networks, 2025

LEAP: An LLM-Based Evidence Augmented Pipeline for Table-Based Fact Verification.

[DOI]

,

,

,

,

,

Proceedings of the Advanced Data Mining and Applications - 21st International Conference, 2025

AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding.

[DOI]

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics, 2025

Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering.

[DOI]

,

,

,

,

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

Cross-modality Multiple Relations Learning for Knowledge-based Visual Question Answering.

[DOI]

,

,

,

,

,

,

ACM Trans. Multim. Comput. Commun. Appl., March, 2024

ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding.

[DOI]

,

,

,

,

,

CoRR, 2024

A Multi-Task Role-Playing Agent Capable of Imitating Character Linguistic Styles.

[DOI]

,

,

,

,

,

,

CoRR, 2024

Think out Loud: Emotion Deducing Explanation in Dialogues.

[DOI]

,

,

,

,

,

,

,

,

CoRR, 2024

Are Large Language Models Table-based Fact-Checkers?

[DOI]

,

,

,

,

CoRR, 2024

Towards Flexible Evaluation for Generative Visual Question Answering.

[DOI]

,

,

,

Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Towards Unified Interactive Visual Grounding in The Wild.

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Towards One-to-Many Visual Question Answering.

[DOI]

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Are Large Language Models Table-based Fact-Checkers?

[DOI]

,

,

,

,

Proceedings of the 27th International Conference on Computer Supported Cooperative Work in Design, 2024

Multimodal Table Understanding.

[DOI]

,

,

,

,

,

,

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Object Attribute Matters in Visual Question Answering.

[DOI]

,

,

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Schema Item Matters in Knowledge Base Question Answering.

[DOI]

,

,

,

,

,

Proceedings of the International Joint Conference on Neural Networks, 2023

An Empirical Study of Instruction-tuning Large Language Models in Chinese.

[DOI]

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Compressing and Debiasing Vision-Language Pre-Trained Models for Visual Question Answering.

[DOI]

,

,

,

,

,

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Combo of Thinking and Observing for Outside-Knowledge VQA.

[DOI]

,

,

,

,

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Compressing And Debiasing Vision-Language Pre-Trained Models for Visual Question Answering.

[DOI]

,

,

,

,

CoRR, 2022

Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene.

[DOI]

,

,

,

,

,

,

,

CoRR, 2022

Visual Dialog for Spotting the Differences between Pairs of Similar Images.

[DOI]

,

,

,

,

,

,

,

Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning.

[DOI]

,

,

,

,

,

,

,

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021

Learning Class-Transductive Intent Representations for Zero-shot Intent Detection.

[DOI]

,

,

,

,

,

Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Check It Again: Progressive Visual Question Answering via Visual Entailment.

[DOI]

,

,

,

,

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

A Hierarchical Transformer with Speaker Modeling for Emotion Recognition in Conversation.

[DOI]

,

,

,

,

CoRR, 2020

Learning Disentangled Intent Representations for Zero-shot Intent Detection.

[DOI]

,

,

,

,

,

CoRR, 2020

2019

A Multi-channel Neural Network for Imbalanced Emotion Recognition.

[DOI]

,

,

,

,

,

Proceedings of the 31st IEEE International Conference on Tools with Artificial Intelligence, 2019

2018

Multi-Perspective Fusion Network for Commonsense Reading Comprehension.

[DOI]

,

,

,

,

,

Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2018

Loading...