Shuo Wang

Orcid: 0000-0001-5408-3145

Affiliations:

Tsinghua University, Department of Computer Science and Technology, State Key Laboratory of Intelligent Technology and Systems, Institute for Artificial Intelligence, Beijing, China

According to our database¹, Shuo Wang authored at least 71 papers between 2019 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2025

LLM⨉MapReduce-V3: Enabling Interactive In-Depth Survey Generation through a MCP-Driven Hierarchically Modular Agent System.

[BibT_eX]

[DOI]

CoRR, October, 2025

SurveyBench: Can LLM(-Agents) Write Academic Surveys that Align with Reader Needs?

[BibT_eX]

[DOI]

CoRR, October, 2025

On LLM-Based Scientific Inductive Reasoning Beyond Equations.

[BibT_eX]

[DOI]

CoRR, September, 2025

Chunks as Arms: Multi-Armed Bandit-Guided Sampling for Long-Context LLM Preference Optimization.

[BibT_eX]

[DOI]

CoRR, August, 2025

PC-Sampler: Position-Aware Calibration of Decoding Bias in Masked Diffusion Models.

[BibT_eX]

[DOI]

CoRR, August, 2025

LegalΔ: Enhancing Legal Reasoning in LLMs via Reinforcement Learning with Chain-of-Thought Guided Information Gain.

[BibT_eX]

[DOI]

CoRR, August, 2025

ADAMIX: Adaptive Mixed-Precision Delta-Compression with Quantization Error Optimization for Large Language Models.

[BibT_eX]

[DOI]

CoRR, June, 2025

ReCUT: Balancing Reasoning Length and Accuracy in LLMs via Stepwise Trails and Preference Optimization.

[BibT_eX]

[DOI]

CoRR, June, 2025

MiniCPM4: Ultra-Efficient LLMs on End Devices.

[BibT_eX]

[DOI]

CoRR, June, 2025

A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings.

[BibT_eX]

[DOI]

CoRR, May, 2025

ConsRec: Denoising Sequential Recommendation through User-Consistent Preference Modeling.

[BibT_eX]

[DOI]

CoRR, May, 2025

Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning.

[BibT_eX]

[DOI]

CoRR, May, 2025

AutoReproduce: Automatic AI Experiment Reproduction with Paper Lineage.

[BibT_eX]

[DOI]

CoRR, May, 2025

Monocle: Hybrid Local-Global In-Context Evaluation for Long-Text Generation with Uncertainty-Based Active Learning.

[BibT_eX]

[DOI]

CoRR, May, 2025

A Survey of LLM ⨉ DATA.

[BibT_eX]

[DOI]

CoRR, May, 2025

From Unaligned to Aligned: Scaling Multilingual LLMs with Multi-Way Parallel Corpora.

[BibT_eX]

[DOI]

CoRR, May, 2025

LLM⨉MapReduce-V2: Entropy-Driven Convolutional Test-Time Scaling for Generating Long-Form Articles from Extremely Long Resources.

[BibT_eX]

[DOI]

CoRR, April, 2025

Building a Coding Assistant via the Retrieval-Augmented Language Model.

[BibT_eX]

[DOI]

ACM Trans. Inf. Syst., March, 2025

Exploring the Impact of Personality Traits on LLM Bias and Toxicity.

[BibT_eX]

[DOI]

CoRR, February, 2025

DCAD-2000: A Multilingual Dataset across 2000+ Languages with Data Cleaning as Anomaly Detection.

[BibT_eX]

[DOI]

CoRR, February, 2025

Process Reinforcement through Implicit Rewards.

[BibT_eX]

[DOI]

CoRR, February, 2025

ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation.

[BibT_eX]

[DOI]

CoRR, January, 2025

COAST: Enhancing the Code Debugging Ability of LLMs through Communicative Agent Based Data Synthesis.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning.

[BibT_eX]

[DOI]

Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards.

[BibT_eX]

[DOI]

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Enabling Real-Time Conversations with Minimal Training Costs.

[BibT_eX]

[DOI]

Proceedings of the Chinese Computational Linguistics - 24th China National Conference, 2025

LegalDuet: Learning Fine-Grained Representations for Legal Judgment Prediction via a Dual-View Contrastive Learning.

[BibT_eX]

[DOI]

Proceedings of the Advanced Data Mining and Applications - 21st International Conference, 2025

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

LLM×MapReduce: Simplified Long-Sequence Processing using Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation.

[BibT_eX]

[DOI]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024

Understanding and Mitigating the Uncertainty in Zero-Shot Translation.

[BibT_eX]

[DOI]

IEEE ACM Trans. Audio Speech Lang. Process., 2024

KBAlign: Efficient Self Adaptation on Specific Knowledge Bases.

[BibT_eX]

[DOI]

CoRR, 2024

LLM⨉MapReduce: Simplified Long-Sequence Processing using Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

Retriever-and-Memory: Towards Adaptive Note-Enhanced Retrieval-Augmented Generation.

[BibT_eX]

[DOI]

CoRR, 2024

Enabling Real-Time Conversations with Minimal Training Costs.

[BibT_eX]

[DOI]

CoRR, 2024

Configurable Foundation Models: Building LLMs from a Modular Perspective.

[BibT_eX]

[DOI]

CoRR, 2024

Enhancing the Code Debugging Ability of LLMs via Communicative Agent Based Data Refinement.

[BibT_eX]

[DOI]

CoRR, 2024

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework.

[BibT_eX]

[DOI]

CoRR, 2024

MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning.

[BibT_eX]

[DOI]

CoRR, 2024

Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression.

[BibT_eX]

[DOI]

CoRR, 2024

From Text to CQL: Bridging Natural Language and Corpus Search Engine.

[BibT_eX]

[DOI]

CoRR, 2024

∞Bench: Extending Long Context Evaluation Beyond 100K Tokens.

[BibT_eX]

[DOI]

CoRR, 2024

ActiveRAG: Revealing the Treasures of Knowledge via Active Learning.

[BibT_eX]

[DOI]

CoRR, 2024

OMGEval: An Open Multilingual Generative Evaluation Benchmark for Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

OneBit: Towards Extremely Low-bit Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Pluggable Neural Machine Translation Models via Memory-augmented Adapters.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

MCTS: A Multi-Reference Chinese Text Simplification Dataset.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

ınftyBench: Extending Long Context Evaluation Beyond 100K Tokens.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MatPlotAgent: Method and Evaluation for LLM-Based Agentic Scientific Data Visualization.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks.

[BibT_eX]

[DOI]

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

INTERVENOR: Prompting the Coding Ability of Large Language Models with the Interactive Chain of Repair.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

INTERVENOR: Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing.

[BibT_eX]

[DOI]

CoRR, 2023

Exploring Large Language Models for Communication Games: An Empirical Study on Werewolf.

[BibT_eX]

[DOI]

CoRR, 2023

TemplateGEC: Improving Grammatical Error Correction with Detection Template.

[BibT_eX]

[DOI]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

A Template-based Method for Constrained Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Integrating Vectorized Lexical Constraints for Neural Machine Translation.

[BibT_eX]

[DOI]

Shuo Wang

Zhixing Tan

Yang Liu

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better Translators.

[BibT_eX]

[DOI]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Language Models are Good Translators.

[BibT_eX]

[DOI]

CoRR, 2021

On the Language Coverage Bias for Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

Neural machine translation: A review of methods, resources, and tools.

[BibT_eX]

[DOI]

AI Open, 2020

Tsinghua University Neural Machine Translation Systems for CCMT 2020.

[BibT_eX]

[DOI]

Proceedings of the Machine Translation - 16th China Conference, 2020

THUMT: An Open-Source Toolkit for Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 14th Conference of the Association for Machine Translation in the Americas, 2020

On the Inference Calibration of Neural Machine Translation.

[BibT_eX]

[DOI]

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019

Improving Back-Translation with Uncertainty-based Confidence Estimation.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Shuo Wang

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...