Qian Liu

ORCID: 0009-0004-1230-130X

Affiliations:
  • ByteDance, TikTok AI Innovation Center, Singapore
  • Microsoft Research Asia, China (former)
  • Beihang University, Beijing, China (former, PhD)


According to our database, Qian Liu authored at least 105 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
The Optimal Token Baseline: Variance Reduction for Long-Horizon LLM-RL.
CoRR, February, 2026

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations.
CoRR, February, 2026

History Doesn't Repeat Itself but Rollouts Rhyme: Accelerating Reinforcement Learning with RhymeRL.
Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026

How Can Synthetic Data Improve Multilingual Language Model Pretraining? A Data Quality Perspective.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
Taming the Tail: Stable LLM Reinforcement Learning via Dynamic Vocabulary Pruning.
CoRR, December, 2025

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents.
CoRR, December, 2025

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle.
CoRR, December, 2025

Diffusion Language Models are Super Data Learners.
CoRR, November, 2025

MPFToD: a modularized pre-training framework for consistency identification in task-oriented dialogue.
Frontiers Comput. Sci., October, 2025

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution.
CoRR, October, 2025

Training Optimal Large Diffusion Language Models.
CoRR, October, 2025

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning.
CoRR, September, 2025

History Rhymes: Accelerating LLM Reinforcement Learning with RhymeRL.
CoRR, August, 2025

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling.
CoRR, August, 2025

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?
CoRR, July, 2025

First Return, Entropy-Eliciting Explore.
CoRR, July, 2025

Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization.
CoRR, May, 2025

General-Reasoner: Advancing LLM Reasoning Across All Domains.
CoRR, May, 2025

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild.
CoRR, March, 2025

CodeArena: A Collective Evaluation Platform for LLM Code Generation.
CoRR, March, 2025

Tutorial Proposal: Speculative Decoding for Efficient LLM Inference.
CoRR, March, 2025

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs.
CoRR, February, 2025

When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training.
Trans. Mach. Learn. Res., 2025

SkyLadder: Better and Faster Pretraining via Context Window Scheduling.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

ZeCO: Zero-Communication Overhead Sequence Parallelism for Linear Attention.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Predictive Data Selection: The Data That Predicts Is the Data That Teaches.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Improving Your Model Ranking on Chatbot Arena by Vote Rigging.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Unnatural Languages Are Not Bugs but Features for LLMs.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Scaling up Masked Diffusion Models on Text.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RegMix: Data Mixture as Regression for Language Model Pre-training.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

When Attention Sink Emerges in Language Models: An Empirical View.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Bootstrapping Language Models with DPO Implicit Rewards.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

NLP+Code: Code Intelligence in Language Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Sparse-to-Dense: A Free Lunch for Lossless Acceleration of Video Understanding in LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

CodeArena: A Collective Evaluation Platform for LLM Code Generation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), 2025

2024
Mantis: Interleaved Multi-Image Instruction Tuning.
Trans. Mach. Learn. Res., 2024

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models.
CoRR, 2024

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions.
CoRR, 2024

Sailor: Open Language Models for South-East Asia.
CoRR, 2024

StarCoder 2 and The Stack v2: The Next Generation.
CoRR, 2024

Purifying Large Language Models by Ensembling a Small Language Model.
CoRR, 2024

ARKS: Active Retrieval in Knowledge Soup for Code Generation.
CoRR, 2024

Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One.
CoRR, 2024

Test-Time Backdoor Attacks on Multimodal Large Language Models.
CoRR, 2024

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models.
CoRR, 2024

Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Mercury: A Code Efficiency Benchmark for Code Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Model.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Lemur: Harmonizing Natural Language and Code for Language Agents.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

OctoPack: Instruction Tuning Code Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

DataTales: A Benchmark for Real-World Intelligent Data Narration.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

EvoR: Evolving Retrieval for Code Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Sailor: Open Language Models for South-East Asia.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

Beyond Memorization: The Challenge of Random Memory Access in Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Self-Distillation Bridges Distribution Gap in Language Model Fine-Tuning.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
StarCoder: may the source be with you!
Trans. Mach. Learn. Res., 2023

S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Models.
CoRR, 2023

OpenAgents: An Open Platform for Language Agents in the Wild.
CoRR, 2023

SimTeG: A Frustratingly Simple Approach Improves Textual Graph Learning.
CoRR, 2023

LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition.
CoRR, 2023

Generative Table Pre-training Empowers Models for Tabular Prediction.
CoRR, 2023

From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning.
CoRR, 2023

SantaCoder: don't reach for the stars!
CoRR, 2023

OpenFE: Automated Feature Generation with Expert-level Performance.
Proceedings of the International Conference on Machine Learning, 2023

Bag of Tricks for Training Data Extraction from Language Models.
Proceedings of the International Conference on Machine Learning, 2023

Learning on Large-scale Text-attributed Graphs via Variational Inference.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Generative Table Pre-training Empowers Models for Tabular Prediction.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Active Retrieval Augmented Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Reasoning Implicit Sentiment with Chain-of-Thought Prompting.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

On Grounded Planning for Embodied Tasks with Language Models.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Reflection of Thought: Inversely Eliciting Numerical Reasoning in Language Models via Solving Linear Systems.
CoRR, 2022

Input-Tuning: Adapting Unfamiliar Inputs to Frozen Pretrained Models.
CoRR, 2022

LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-training.
CoRR, 2022

Reasoning over Hybrid Chain for Table-and-Text Open Domain QA.
CoRR, 2022

Reasoning over Hybrid Chain for Table-and-Text Open Domain Question Answering.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

TAPEX: Table Pre-training via Learning a Neural SQL Executor.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Reasoning Like Program Executors.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Exploring the Secrets Behind the Learning Difficulty of Meaning Representations for Semantic Parsing.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Mixed-modality Representation Learning and Pre-training for Joint Table-and-Text Retrieval in OpenQA.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

LEMON: Language-Based Environment Manipulation via Execution-Guided Pre-training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

CGIM: A Cycle Guided Interactive Learning Model for Consistency Identification in Task-oriented Dialogue.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
Keep the Structure: A Latent Shift-Reduce Parser for Semantic Parsing.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

Awakening Latent Grounding from Pretrained Language Models for Semantic Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Learning Algebraic Recombination for Compositional Generalization.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Chase: A Large-Scale and Pragmatic Chinese Dataset for Cross-Database Context-Dependent Text-to-SQL.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

ReTraCk: A Flexible and Efficient Framework for Knowledge Base Question Answering.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

2020
Compositional Generalization by Learning Analytical Expressions.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

RECPARSER: A Recursive Semantic Parsing Framework for Text-to-SQL Task.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

How Far are We from Effective Context Modeling? An Exploratory Study on Semantic Parsing in Context.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Incomplete Utterance Rewriting as Semantic Segmentation.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

"What Do You Mean by That?" A Parser-Independent Interactive Approach for Enhancing Text-to-SQL.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Benchmarking Meaning Representations in Neural Semantic Parsing.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

You Impress Me: Dialogue Generation via Mutual Persona Perception.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Leveraging Adjective-Noun Phrasing Knowledge for Comparison Relation Prediction in Text-to-SQL.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

A Split-and-Recombine Approach for Follow-up Query Analysis.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

FANDA: A Novel Approach to Perform Follow-Up Query Analysis.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

