Shuo Wang

Orcid: 0000-0001-5408-3145

Affiliations:
  • Tsinghua University, Department of Computer Science and Technology, State Key Laboratory of Intelligent Technology and Systems, Institute for Artificial Intelligence, Beijing, China


According to our database1, Shuo Wang authored at least 65 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Chunks as Arms: Multi-Armed Bandit-Guided Sampling for Long-Context LLM Preference Optimization.
CoRR, August, 2025

PC-Sampler: Position-Aware Calibration of Decoding Bias in Masked Diffusion Models.
CoRR, August, 2025

LegalΔ: Enhancing Legal Reasoning in LLMs via Reinforcement Learning with Chain-of-Thought Guided Information Gain.
CoRR, August, 2025

ADAMIX: Adaptive Mixed-Precision Delta-Compression with Quantization Error Optimization for Large Language Models.
CoRR, June, 2025

ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization.
CoRR, June, 2025

MiniCPM4: Ultra-Efficient LLMs on End Devices.
CoRR, June, 2025

A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings.
CoRR, May, 2025

ConsRec: Denoising Sequential Recommendation through User-Consistent Preference Modeling.
CoRR, May, 2025

Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning.
CoRR, May, 2025

AutoReproduce: Automatic AI Experiment Reproduction with Paper Lineage.
CoRR, May, 2025

Monocle: Hybrid Local-Global In-Context Evaluation for Long-Text Generation with Uncertainty-Based Active Learning.
CoRR, May, 2025

A Survey of LLM ⨉ DATA.
CoRR, May, 2025

From Unaligned to Aligned: Scaling Multilingual LLMs with Multi-Way Parallel Corpora.
CoRR, May, 2025

LLM⨉MapReduce-V2: Entropy-Driven Convolutional Test-Time Scaling for Generating Long-Form Articles from Extremely Long Resources.
CoRR, April, 2025

Building a Coding Assistant via the Retrieval-Augmented Language Model.
ACM Trans. Inf. Syst., March, 2025

Exploring the Impact of Personality Traits on LLM Bias and Toxicity.
CoRR, February, 2025

DCAD-2000: A Multilingual Dataset across 2000+ Languages with Data Cleaning as Anomaly Detection.
CoRR, February, 2025

Process Reinforcement through Implicit Rewards.
CoRR, February, 2025

ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation.
CoRR, January, 2025

COAST: Enhancing the Code Debugging Ability of LLMs through Communicative Agent Based Data Synthesis.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

LLM×MapReduce: Simplified Long-Sequence Processing using Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Understanding and Mitigating the Uncertainty in Zero-Shot Translation.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

KBAlign: Efficient Self Adaptation on Specific Knowledge Bases.
CoRR, 2024

LLM⨉MapReduce: Simplified Long-Sequence Processing using Large Language Models.
CoRR, 2024

Retriever-and-Memory: Towards Adaptive Note-Enhanced Retrieval-Augmented Generation.
CoRR, 2024

Enabling Real-Time Conversations with Minimal Training Costs.
CoRR, 2024

Configurable Foundation Models: Building LLMs from a Modular Perspective.
CoRR, 2024

Enhancing the Code Debugging Ability of LLMs via Communicative Agent Based Data Refinement.
CoRR, 2024

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework.
CoRR, 2024

MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning.
CoRR, 2024

Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression.
CoRR, 2024

From Text to CQL: Bridging Natural Language and Corpus Search Engine.
CoRR, 2024

∞Bench: Extending Long Context Evaluation Beyond 100K Tokens.
CoRR, 2024

ActiveRAG: Revealing the Treasures of Knowledge via Active Learning.
CoRR, 2024

OMGEval: An Open Multilingual Generative Evaluation Benchmark for Large Language Models.
CoRR, 2024

OneBit: Towards Extremely Low-bit Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Pluggable Neural Machine Translation Models via Memory-augmented Adapters.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

MCTS: A Multi-Reference Chinese Text Simplification Dataset.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ınftyBench: Extending Long Context Evaluation Beyond 100K Tokens.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

INTERVENOR: Prompting the Coding Ability of Large Language Models with the Interactive Chain of Repair.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
INTERVENOR: Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing.
CoRR, 2023

Exploring Large Language Models for Communication Games: An Empirical Study on Werewolf.
CoRR, 2023

TemplateGEC: Improving Grammatical Error Correction with Detection Template.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
A Template-based Method for Constrained Neural Machine Translation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Integrating Vectorized Lexical Constraints for Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better Translators.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Language Models are Good Translators.
CoRR, 2021

On the Language Coverage Bias for Neural Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Neural machine translation: A review of methods, resources, and tools.
AI Open, 2020

Tsinghua University Neural Machine Translation Systems for CCMT 2020.
Proceedings of the Machine Translation - 16th China Conference, 2020

THUMT: An Open-Source Toolkit for Neural Machine Translation.
Proceedings of the 14th Conference of the Association for Machine Translation in the Americas, 2020

On the Inference Calibration of Neural Machine Translation.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Improving Back-Translation with Uncertainty-based Confidence Estimation.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019


  Loading...