Qingyi Si

Orcid: 0000-0001-8433-0215

According to our database1, Qingyi Si authored at least 49 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Near-Future Policy Optimization.
CoRR, April, 2026

EasyVideoR1: Easier RL for Video Understanding.
CoRR, April, 2026

Self-Distilled RLVR.
CoRR, April, 2026

Diagnosing and Repairing Unsafe Channels in Vision-Language Models via Causal Discovery and Dual-Modal Safety Subspace Projection.
CoRR, March, 2026

Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models.
CoRR, March, 2026

A Closer Look into LLMs for Table Understanding.
CoRR, March, 2026

HiMemVLN: Enhancing Reliability of Open-Source Zero-Shot Vision-and-Language Navigation with Hierarchical Memory System.
CoRR, March, 2026

System 1&2 Synergy via Dynamic Model Interpolation.
CoRR, January, 2026

Outcome-Grounded Advantage Reshaping for Fine-Grained Credit Assignment in Mathematical Reasoning.
CoRR, January, 2026

IRPM: Intergroup Relative Preference Modeling for Pointwise Generative Reward Models.
CoRR, January, 2026

Breaking the Trade-Off Between Faithfulness and Expressiveness for Large Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Test-time Prompt Intervention.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

Sparse Attention Across Multiple-Context KV Cache.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
LouisKV: Efficient KV Cache Retrieval for Long Input-Output Sequences.
CoRR, October, 2025

HANDO: Hierarchical Autonomous Navigation and Dexterous Omni-loco-manipulation.
CoRR, October, 2025

Breaking the Trade-Off Between Faithfulness and Expressiveness for Large Language Models.
CoRR, August, 2025

Stable Reinforcement Learning for Efficient Reasoning.
CoRR, May, 2025

S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models.
CoRR, May, 2025

Dynamic Early Exit in Reasoning Models.
CoRR, April, 2025

Towards Realistic Generation: A Multi-Task Agent for Imitating Diverse Character Linguistic Styles.
Proceedings of the International Joint Conference on Neural Networks, 2025

LEAP: An LLM-Based Evidence Augmented Pipeline for Table-Based Fact Verification.
Proceedings of the Advanced Data Mining and Applications - 21st International Conference, 2025

AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Cross-modality Multiple Relations Learning for Knowledge-based Visual Question Answering.
ACM Trans. Multim. Comput. Commun. Appl., March, 2024

ReTaKe: Reducing Temporal and Knowledge Redundancy for Long Video Understanding.
CoRR, 2024

A Multi-Task Role-Playing Agent Capable of Imitating Character Linguistic Styles.
CoRR, 2024

Think out Loud: Emotion Deducing Explanation in Dialogues.
CoRR, 2024

Are Large Language Models Table-based Fact-Checkers?
CoRR, 2024

Towards Flexible Evaluation for Generative Visual Question Answering.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Towards Unified Interactive Visual Grounding in The Wild.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

Towards One-to-Many Visual Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Are Large Language Models Table-based Fact-Checkers?
Proceedings of the 27th International Conference on Computer Supported Cooperative Work in Design, 2024

Multimodal Table Understanding.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Object Attribute Matters in Visual Question Answering.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Schema Item Matters in Knowledge Base Question Answering.
Proceedings of the International Joint Conference on Neural Networks, 2023

An Empirical Study of Instruction-tuning Large Language Models in Chinese.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Compressing and Debiasing Vision-Language Pre-Trained Models for Visual Question Answering.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Combo of Thinking and Observing for Outside-Knowledge VQA.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Compressing And Debiasing Vision-Language Pre-Trained Models for Visual Question Answering.
CoRR, 2022

Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene.
CoRR, 2022

Visual Dialog for Spotting the Differences between Pairs of Similar Images.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Towards Robust Visual Question Answering: Making the Most of Biased Samples via Contrastive Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Learning Class-Transductive Intent Representations for Zero-shot Intent Detection.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Check It Again: Progressive Visual Question Answering via Visual Entailment.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
A Hierarchical Transformer with Speaker Modeling for Emotion Recognition in Conversation.
CoRR, 2020

Learning Disentangled Intent Representations for Zero-shot Intent Detection.
CoRR, 2020

2019
A Multi-channel Neural Network for Imbalanced Emotion Recognition.
Proceedings of the 31st IEEE International Conference on Tools with Artificial Intelligence, 2019

2018
Multi-Perspective Fusion Network for Commonsense Reading Comprehension.
Proceedings of the Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data, 2018


  Loading...