Qipeng Guo

ORCID: 0000-0002-8805-8789

According to our database, Qipeng Guo authored at least 104 papers between 2015 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2025

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency.

CoRR, August, 2025

Intern-S1: A Scientific Multimodal Foundation Model.

CoRR, August, 2025

Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models.

CoRR, August, 2025

InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling.

CoRR, August, 2025

IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards.

CoRR, August, 2025

Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction.

CoRR, August, 2025

MLP Memory: Language Modeling with Retriever-pretrained External Memory.

CoRR, August, 2025

CodeEvo: Interaction-Driven Synthesis of Code-centric Data through Hybrid and Iterative Feedback.

CoRR, July, 2025

Pre-Trained Policy Discriminators are General Reward Models.

CoRR, July, 2025

Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections.

CoRR, July, 2025

World-aware Planning Narratives Enhance Large Vision-Language Model Planner.

CoRR, June, 2025

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs.

CoRR, June, 2025

Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache.

CoRR, June, 2025

GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization.

CoRR, June, 2025

Code-Driven Inductive Synthesis: Enhancing Reasoning Abilities of Large Language Models with Sequences.

CoRR, March, 2025

DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting.

CoRR, March, 2025

CritiQ: Mining Data Quality Criteria from Human Preferences.

CoRR, February, 2025

Thus Spake Long-Context Large Language Model.

CoRR, February, 2025

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs.

CoRR, February, 2025

UnitCoder: Scalable Iterative Code Synthesis with Unit Test Guidance.

CoRR, February, 2025

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

CoRR, February, 2025

NovelQA: Benchmarking Question Answering on Documents Exceeding 200K Tokens.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ReAttention: Training-Free Infinite Context with Finite Attention Scope.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Case2Code: Scalable Synthetic Data for Code Generation.

Proceedings of the 31st International Conference on Computational Linguistics, 2025

VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

How to Mitigate Overfitting in Weak-to-strong Generalization?

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

PerSphere: A Comprehensive Framework for Multi-Faceted Perspective Retrieval and Summarization.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

FastMCTS: A Simple Sampling Strategy for Data Synthesis.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs.

Co-authors include Shenlixing Shenlixing and Chenzhan Chenzhan.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

CritiQ: Mining Data Quality Criteria from Human Preferences.

Co-authors include Qiuyinzhe Zhang.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective.

CoRR, 2024

GAOKAO-Eval: Does high scores truly reflect strong capabilities in LLMs?

CoRR, 2024

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions.

Co-authors include Xingcheng Zhang.

CoRR, 2024

LongSafetyBench: Long-Context LLMs Struggle with Safety Issues.

CoRR, 2024

Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders.

CoRR, 2024

DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels.

CoRR, 2024

What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices.

CoRR, 2024

OriGen: Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection.

Co-authors include Xingcheng Zhang.

CoRR, 2024

Farewell to Length Extrapolation, a Training-Free Infinite Context with Finite Attention Scope.

CoRR, 2024

Case2Code: Learning Inductive Reasoning with Synthetic Data.

CoRR, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output.

Co-authors include Xingcheng Zhang.

CoRR, 2024

RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models.

CoRR, 2024

InternLM2 Technical Report.

CoRR, 2024

A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond.

CoRR, 2024

NovelQA: A Benchmark for Long-Range Novel Question Answering.

CoRR, 2024

Data-free Weight Compress and Denoise for Large Language Models.

CoRR, 2024

Identifying Semantic Induction Heads to Understand In-Context Learning.

CoRR, 2024

F-Eval: Assessing Fundamental Abilities with Refined Evaluation Methods.

CoRR, 2024

AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Can Language Models Learn to Skip Steps?

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

On Affine Homotopy between Language Encoders.

Co-authors include Reda Boumasmoud, Shauli Ravfogel, Mrinmaya Sachan, Bernhard Schölkopf, and Mennatallah El-Assady.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

OriGen: Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection.

Co-authors include Xingcheng Zhang.

Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Turn Waste into Worth: Rectifying Top-k Router of MoE.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Explicit Memory Learning with Expectation Maximization.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

LongWanjuan: Towards Systematic Measurement for Long Text Quality.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Knowledge-Centric Hallucination Detection.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models.

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Benchmarking Hallucination in Large Language Models Based on Unanswerable Math Word Problem.

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Reasoning in Flux: Enhancing Large Language Models Reasoning through Uncertainty-aware Adaptive Guidance.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

F-Eval: Assessing Fundamental Abilities with Refined Evaluation Methods.

Co-authors include Keyuchen Keyuchen.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Code Needs Comments: Enhancing Code LLMs with Comment Augmentation.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Identifying Semantic Induction Heads to Understand In-Context Learning.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Language Models.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Full Parameter Fine-tuning for Large Language Models with Limited Resources.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

AdaLomo: Low-memory Optimization with Adaptive Learning Rate.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Full Parameter Fine-tuning for Large Language Models with Limited Resources.

CoRR, 2023

All Roads Lead to Rome? Exploring the Invariance of Transformers' Representations.

Co-authors include Shauli Ravfogel, Mrinmaya Sachan, and Bernhard Schölkopf.

CoRR, 2023

Evaluating Open-QA Evaluation.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Enhancing Uncertainty-Based Hallucination Detection with Stronger Focus.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CoLLiE: Collaborative Training of Large Language Models in an Efficient Way.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding.

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Do Large Language Models Know What They Don't Know?

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Exploiting Abstract Meaning Representation for Open-Domain Question Answering.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Dual Cache for Long Document Neural Coreference Resolution.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

An AMR-based Link Prediction Approach for Document-level Event Argument Extraction.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

Word-Level Representation From Bytes For Language Modeling.

CoRR, 2022

What Dense Graph Do You Need for Self-Attention?

CoRR, 2022

Towards Collaborative Question Answering: A Preliminary Study.

CoRR, 2022

BART-Reader: Predicting Relations Between Entities via Reading Their Document-Level Context Information.

Proceedings of the Natural Language Processing and Chinese Computing, 2022

What Dense Graph Do You Need for Self-Attention?

Proceedings of the International Conference on Machine Learning, 2022

RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees.

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Dialogue Meaning Representation for Task-Oriented Dialogue Systems.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

DORE: Document Ordered Relation Extraction based on Generative Framework.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021

Syntax-guided text generation via graph neural network.

Sci. China Inf. Sci., 2021

Text information aggregation with centrality attention.

Sci. China Inf. Sci., 2021

Fork or Fail: Cycle-Consistent Training with Many-to-One Mappings.

Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

A Unified Generative Framework for Various NER Subtasks.

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020

CycleGT: Unsupervised Graph-to-Text and Text-to-Graph Generation via Cycle Training.

CoRR, 2020

BERT-ATTACK: Adversarial Attack Against BERT Using BERT.

Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

CoLAKE: Contextualized Language and Knowledge Embedding.

Proceedings of the 28th International Conference on Computational Linguistics, 2020

GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation.

Proceedings of the 28th International Conference on Computational Linguistics, 2020

Joint Parsing and Generation for Abstractive Summarization.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Multi-Scale Self-Attention for Text Classification.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

Low-Rank and Locality Constrained Self-Attention for Sequence Modeling.

IEEE ACM Trans. Audio Speech Lang. Process., 2019

BP-Transformer: Modelling Long-Range Context via Binary Partitioning.

CoRR, 2019

Deep Graph Library: Towards Efficient and Scalable Deep Learning on Graphs.

Co-authors include Alexander J. Smola.

CoRR, 2019

Star-Transformer.

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

2018

Top-Down Tree Structured Text Generation.

CoRR, 2018

2015

First Step toward Model-Free, Anonymous Object Tracking with Recurrent Neural Networks.

CoRR, 2015
