Qipeng Guo

Orcid: 0000-0002-8805-8789

According to our database1, Qipeng Guo authored at least 103 papers between 2015 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Intern-S1: A Scientific Multimodal Foundation Model.
CoRR, August, 2025

Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models.
CoRR, August, 2025

InternBootcamp Technical Report: Boosting LLM Reasoning with Verifiable Task Scaling.
CoRR, August, 2025

IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards.
CoRR, August, 2025

Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction.
CoRR, August, 2025

MLP Memory: Language Modeling with Retriever-pretrained External Memory.
CoRR, August, 2025

CodeEvo: Interaction-Driven Synthesis of Code-centric Data through Hybrid and Iterative Feedback.
CoRR, July, 2025

Pre-Trained Policy Discriminators are General Reward Models.
CoRR, July, 2025

Implicit Reward as the Bridge: A Unified View of SFT and DPO Connections.
CoRR, July, 2025

World-aware Planning Narratives Enhance Large Vision-Language Model Planner.
CoRR, June, 2025

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs.
CoRR, June, 2025

Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache.
CoRR, June, 2025

GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization.
CoRR, June, 2025

Code-Driven Inductive Synthesis: Enhancing Reasoning Abilities of Large Language Models with Sequences.
CoRR, March, 2025

DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting.
CoRR, March, 2025

CritiQ: Mining Data Quality Criteria from Human Preferences.
CoRR, February, 2025

Thus Spake Long-Context Large Language Model.
CoRR, February, 2025

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs.
CoRR, February, 2025

UnitCoder: Scalable Iterative Code Synthesis with Unit Test Guidance.
CoRR, February, 2025

VideoRoPE: What Makes for Good Video Rotary Position Embedding?
CoRR, February, 2025

NovelQA: Benchmarking Question Answering on Documents Exceeding 200K Tokens.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

ReAttention: Training-Free Infinite Context with Finite Attention Scope.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Case2Code: Scalable Synthetic Data for Code Generation.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

How to Mitigate Overfitting in Weak-to-strong Generalization?
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

PerSphere: A Comprehensive Framework for Multi-Faceted Perspective Retrieval and Summarization.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

FastMCTS: A Simple Sampling Strategy for Data Synthesis.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

CritiQ: Mining Data Quality Criteria from Human Preferences.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Capability Salience Vector: Fine-grained Alignment of Loss and Capabilities for Downstream Task Scaling Law.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Scaling of Search and Learning: A Roadmap to Reproduce o1 from Reinforcement Learning Perspective.
CoRR, 2024

GAOKAO-Eval: Does high scores truly reflect strong capabilities in LLMs?
CoRR, 2024

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions.
CoRR, 2024

LongSafetyBench: Long-Context LLMs Struggle with Safety Issues.
CoRR, 2024

Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders.
CoRR, 2024

DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels.
CoRR, 2024

What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices.
CoRR, 2024

OriGen:Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection.
CoRR, 2024

Farewell to Length Extrapolation, a Training-Free Infinite Context with Finite Attention Scope.
CoRR, 2024

Case2Code: Learning Inductive Reasoning with Synthetic Data.
CoRR, 2024

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output.
CoRR, 2024

RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models.
CoRR, 2024

InternLM2 Technical Report.
CoRR, 2024

A Survey of Neural Code Intelligence: Paradigms, Advances and Beyond.
CoRR, 2024

NovelQA: A Benchmark for Long-Range Novel Question Answering.
CoRR, 2024

Data-freeWeight Compress and Denoise for Large Language Models.
CoRR, 2024

Identifying Semantic Induction Heads to Understand In-Context Learning.
CoRR, 2024

F-Eval: Asssessing Fundamental Abilities with Refined Evaluation Methods.
CoRR, 2024

AlchemistCoder: Harmonizing and Eliciting Code Capability by Hindsight Tuning on Multi-source Data.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Can Language Models Learn to Skip Steps?
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

On Affine Homotopy between Language Encoders.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

OriGen: Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection.
Proceedings of the 43rd IEEE/ACM International Conference on Computer-Aided Design, 2024

Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Turn Waste into Worth: Rectifying Top-k Router of MoE.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Explicit Memory Learning with Expectation Maximization.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

LongWanjuan: Towards Systematic Measurement for Long Text Quality.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Knowledge-Centric Hallucination Detection.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Benchmarking Hallucination in Large Language Models Based on Unanswerable Math Word Problem.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Reasoning in Flux: Enhancing Large Language Models Reasoning through Uncertainty-aware Adaptive Guidance.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

F-Eval: Asssessing Fundamental Abilities with Refined Evaluation Methods.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Code Needs Comments: Enhancing Code LLMs with Comment Augmentation.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Identifying Semantic Induction Heads to Understand In-Context Learning.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Synergetic Event Understanding: A Collaborative Approach to Cross-Document Event Coreference Resolution with Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Full Parameter Fine-tuning for Large Language Models with Limited Resources.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

AdaLomo: Low-memory Optimization with Adaptive Learning Rate.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Full Parameter Fine-tuning for Large Language Models with Limited Resources.
CoRR, 2023

All Roads Lead to Rome? Exploring the Invariance of Transformers' Representations.
CoRR, 2023

Evaluating Open-QA Evaluation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Enhancing Uncertainty-Based Hallucination Detection with Stronger Focus.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

CoLLiE: Collaborative Training of Large Language Models in an Efficient Way.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

StoryAnalogy: Deriving Story-level Analogies from Large Language Models to Unlock Analogical Understanding.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Do Large Language Models Know What They Don't Know?
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Exploiting Abstract Meaning Representation for Open-Domain Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Dual Cache for Long Document Neural Coreference Resolution.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

An AMR-based Link Prediction Approach for Document-level Event Argument Extraction.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Word-Level Representation From Bytes For Language Modeling.
CoRR, 2022

What Dense Graph Do You Need for Self-Attention?
CoRR, 2022

Towards Collaborative Question Answering: A Preliminary Study.
CoRR, 2022

BART-Reader: Predicting Relations Between Entities via Reading Their Document-Level Context Information.
Proceedings of the Natural Language Processing and Chinese Computing, 2022

What Dense Graph Do You Need for Self-Attention?
Proceedings of the International Conference on Machine Learning, 2022

RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Dialogue Meaning Representation for Task-Oriented Dialogue Systems.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

DORE: Document Ordered Relation Extraction based on Generative Framework.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Syntax-guided text generation via graph neural network.
Sci. China Inf. Sci., 2021

Text information aggregation with centrality attention.
Sci. China Inf. Sci., 2021

Fork or Fail: Cycle-Consistent Training with Many-to-One Mappings.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

A Unified Generative Framework for Various NER Subtasks.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
CycleGT: Unsupervised Graph-to-Text and Text-to-Graph Generation via Cycle Training.
CoRR, 2020

BERT-ATTACK: Adversarial Attack Against BERT Using BERT.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

CoLAKE: Contextualized Language and Knowledge Embedding.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

GenWiki: A Dataset of 1.3 Million Content-Sharing Text and Graphs for Unsupervised Graph-to-Text Generation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Joint Parsing and Generation for Abstractive Summarization.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Multi-Scale Self-Attention for Text Classification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Low-Rank and Locality Constrained Self-Attention for Sequence Modeling.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

BP-Transformer: Modelling Long-Range Context via Binary Partitioning.
CoRR, 2019

Deep Graph Library: Towards Efficient and Scalable Deep Learning on Graphs.
CoRR, 2019

Star-Transformer.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

2018
Top-Down Tree Structured Text Generation.
CoRR, 2018

2015
First Step toward Model-Free, Anonymous Object Tracking with Recurrent Neural Networks.
CoRR, 2015


  Loading...