Weipeng Chen

Orcid: 0009-0006-5124-0241

According to our database1, Weipeng Chen authored at least 70 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2026
Towards enhanced LLM pretraining: Dynamic checkpoint merging via generation quality.
Inf. Fusion, 2026

2025
PQCache: Product Quantization-based KVCache for Long Context LLM Inference.
Proc. ACM Manag. Data, June, 2025

S2SBench: A Benchmark for Quantifying Intelligence Degradation in Speech-to-Speech Large Language Models.
CoRR, May, 2025

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning.
CoRR, March, 2025

DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies.
CoRR, March, 2025

Efficient Motion-Aware Video MLLM.
CoRR, March, 2025

Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction.
CoRR, February, 2025

Baichuan-M1: Pushing the Medical Capability of Large Language Models.
CoRR, February, 2025

LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation.
CoRR, February, 2025

Ocean-OCR: Towards General OCR Application via a Vision-Language Model.
CoRR, January, 2025

Baichuan-Omni-1.5 Technical Report.
CoRR, January, 2025

Med-R<sup>2</sup>: Crafting Trustworthy LLM Physicians through Retrieval and Reasoning of Evidence-Based Medicine.
CoRR, January, 2025

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM.
CoRR, January, 2025

RK-VQA: Rational knowledge-aware fusion-in-decoder for knowledge-based visual question answering.
Inf. Fusion, 2025

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems.
Proceedings of the ACM on Web Conference 2025, 2025

Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SysBench: Can LLMs Follow System Message?
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Exploring the Design Space of Visual Context Representation in Video MLLMs.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

DataSculpt: A Holistic Data Management Framework for Long-Context LLMs Training.
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

Beyond Sight: Towards Cognitive Alignment in LVLM via Enriched Visual Knowledge.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Efficient Motion-Aware Video MLLM.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Improving Accuracy and Calibration via Differentiated Deep Mutual Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

RichRAG: Crafting Rich Responses for Multi-faceted Queries in Retrieval-Augmented Generation.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

CFBench: A Comprehensive Constraints-Following Benchmark for LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback.
CoRR, 2024

KV Shifting Attention Enhances Language Modeling.
CoRR, 2024

VersaTune: An Efficient Data Composition Framework for Training Multi-Capability LLMs.
CoRR, 2024

From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning.
CoRR, 2024

Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining.
CoRR, 2024

Baichuan Alignment Technical Report.
CoRR, 2024

FB-Bench: A Fine-Grained Multi-Task Benchmark for Evaluating LLMs' Responsiveness to Human Feedback.
CoRR, 2024

Baichuan-Omni Technical Report.
CoRR, 2024

Extracting and Transferring Abilities For Building Multi-lingual Ability-enhanced Large Language Models.
CoRR, 2024

DataSculpt: Crafting Data Landscapes for LLM Post-Training through Multi-objective Partitioning.
CoRR, 2024

Boosting Lossless Speculative Decoding via Feature Sampling and Partial Alignment Distillation.
CoRR, 2024

BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline.
CoRR, 2024

SysBench: Can Large Language Models Follow System Messages?
CoRR, 2024

MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark.
CoRR, 2024

CFBench: A Comprehensive Constraints-Following Benchmark for LLMs.
CoRR, 2024

Clover-2: Accurate Inference for Regressive Lightweight Speculative Decoding.
CoRR, 2024

PAS: Data-Efficient Plug-and-Play Prompt Augmentation System.
CoRR, 2024

Towards Event-oriented Long Video Understanding.
CoRR, 2024

Full-ECE: A Metric For Token-level Calibration on Large Language Models.
CoRR, 2024

Needle In A Video Haystack: A Scalable Synthetic Framework for Benchmarking Video MLLMs.
CoRR, 2024

Exploring Context Window of Large Language Models via Decomposed Positional Vectors.
CoRR, 2024

Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge.
CoRR, 2024

Checkpoint Merging via Bayesian Optimization in LLM Pretraining.
CoRR, 2024

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect.
CoRR, 2024

Base of RoPE Bounds Context Length.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Exploring Context Window of Large Language Models via Decomposed Positional Vectors.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

HGSVerb: Improving Zero-shot Text Classification via Hierarchical Generative Semantic-Aware Verbalizer.
Proceedings of the International Joint Conference on Neural Networks, 2024

MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

KEEP CHATTING! An Attractive Dataset for Continuous Conversation Agents.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Baichuan 2: Open Large-scale Language Models.
CoRR, 2023

2022
Clouds in the Vicinity of the Stratopause Observed with Lidars at Midlatitudes (40.5-41°N) in China.
Remote. Sens., 2022

CMSuG: Competitive mechanism-based superpixel generation method for image segmentation.
J. Intell. Fuzzy Syst., 2022

VMEKNet: Visual Memory and External Knowledge Based Network for Medical Report Generation.
Proceedings of the PRICAI 2022: Trends in Artificial Intelligence, 2022

Automatic Report Generation Method based on Multiscale Feature Extraction and Word Attention Network.
Proceedings of the Web and Big Data - 6th International Joint Conference, 2022

2021
ComQA: Compositional Question Answering via Hierarchical Graph Neural Networks.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Sentiments Affect Stock Returns: Evidence Based on Big Data.
Proceedings of the 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, 2021

Multi-Lingual Question Generation with Language Agnostic Language Model.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Incorporating Knowledge and Content Information to Boost News Recommendation.
Proceedings of the Natural Language Processing and Chinese Computing, 2020

2018
Automated comprehensive evaluation approach for user interface satisfaction based on concurrent think-aloud method.
Univers. Access Inf. Soc., 2018

基于改进的BP神经网络的网络空间态势感知系统安全评估 (Research on Cyberspace Situation Awareness Security Assessment Based on Improved BP Neural Network).
计算机科学, 2018

Utilizing soft constraints to enhance medical relation extraction from the history of present illness in electronic medical records.
J. Biomed. Informatics, 2018

2017
Automatic Generation and Recommendation for API Mashups.
Proceedings of the 16th IEEE International Conference on Machine Learning and Applications, 2017


  Loading...