Weipeng Chen

Orcid: 0009-0006-5124-0241

According to our database, Weipeng Chen authored at least 72 papers between 2017 and 2026.

Collaborative distances:

Dijkstra number of four.
Erdős number of four.

Bibliography

2026

Towards enhanced LLM pretraining: Dynamic checkpoint merging via generation quality.

Inf. Fusion, 2026

2025

PQCache: Product Quantization-based KVCache for Long Context LLM Inference.

Proc. ACM Manag. Data, June, 2025

S2SBench: A Benchmark for Quantifying Intelligence Degradation in Speech-to-Speech Large Language Models.

CoRR, May, 2025

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning.

CoRR, March, 2025

DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies.

CoRR, March, 2025

Efficient Motion-Aware Video MLLM.

CoRR, March, 2025

Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction.

CoRR, February, 2025

Baichuan-M1: Pushing the Medical Capability of Large Language Models.

CoRR, February, 2025

LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation.

CoRR, February, 2025

Ocean-OCR: Towards General OCR Application via a Vision-Language Model.

CoRR, January, 2025

Baichuan-Omni-1.5 Technical Report.

Jianqiang Zhang

CoRR, January, 2025

Med-R²: Crafting Trustworthy LLM Physicians through Retrieval and Reasoning of Evidence-Based Medicine.

CoRR, January, 2025

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM.

CoRR, January, 2025

RK-VQA: Rational knowledge-aware fusion-in-decoder for knowledge-based visual question answering.

Inf. Fusion, 2025

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems.

Proceedings of the ACM on Web Conference 2025, 2025

KV Shifting Attention Enhances Language Modeling.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Maximizing Intermediate Checkpoint Value in LLM Pretraining with Bayesian Optimization.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Needle In A Video Haystack: A Scalable Synthetic Evaluator for Video MLLMs.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SysBench: Can LLMs Follow System Message?

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Exploring the Design Space of Visual Context Representation in Video MLLMs.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

DataSculpt: A Holistic Data Management Framework for Long-Context LLMs Training.

Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

Beyond Sight: Towards Cognitive Alignment in LVLM via Enriched Visual Knowledge.

Victor Shea-Jay Huang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Efficient Motion-Aware Video MLLM.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Improving Accuracy and Calibration via Differentiated Deep Mutual Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

RichRAG: Crafting Rich Responses for Multi-faceted Queries in Retrieval-Augmented Generation.

Proceedings of the 31st International Conference on Computational Linguistics, 2025

CFBench: A Comprehensive Constraints-Following Benchmark for LLMs.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect.

Proceedings of the Findings of the Association for Computational Linguistics, 2025

LongReD: Mitigating Short-Text Degradation of Long-Context Large Language Models via Restoration Distillation.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback.

CoRR, 2024

KV Shifting Attention Enhances Language Modeling.

CoRR, 2024

VersaTune: An Efficient Data Composition Framework for Training Multi-Capability LLMs.

CoRR, 2024

From Novice to Expert: LLM Agent Policy Optimization via Step-wise Reinforcement Learning.

CoRR, 2024

Beyond Filtering: Adaptive Image-Text Quality Enhancement for MLLM Pretraining.

CoRR, 2024

Baichuan Alignment Technical Report.

CoRR, 2024

FB-Bench: A Fine-Grained Multi-Task Benchmark for Evaluating LLMs' Responsiveness to Human Feedback.

CoRR, 2024

Baichuan-Omni Technical Report.

CoRR, 2024

Extracting and Transferring Abilities For Building Multi-lingual Ability-enhanced Large Language Models.

CoRR, 2024

DataSculpt: Crafting Data Landscapes for LLM Post-Training through Multi-objective Partitioning.

CoRR, 2024

Boosting Lossless Speculative Decoding via Feature Sampling and Partial Alignment Distillation.

CoRR, 2024

BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline.

CoRR, 2024

SysBench: Can Large Language Models Follow System Messages?

CoRR, 2024

MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark.

CoRR, 2024

CFBench: A Comprehensive Constraints-Following Benchmark for LLMs.

CoRR, 2024

Clover-2: Accurate Inference for Regressive Lightweight Speculative Decoding.

CoRR, 2024

PAS: Data-Efficient Plug-and-Play Prompt Augmentation System.

CoRR, 2024

Towards Event-oriented Long Video Understanding.

CoRR, 2024

Full-ECE: A Metric For Token-level Calibration on Large Language Models.

CoRR, 2024

Needle In A Video Haystack: A Scalable Synthetic Framework for Benchmarking Video MLLMs.

CoRR, 2024

Exploring Context Window of Large Language Models via Decomposed Positional Vectors.

CoRR, 2024

Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge.

CoRR, 2024

Checkpoint Merging via Bayesian Optimization in LLM Pretraining.

CoRR, 2024

ShortGPT: Layers in Large Language Models are More Redundant Than You Expect.

CoRR, 2024

Base of RoPE Bounds Context Length.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Exploring Context Window of Large Language Models via Decomposed Positional Vectors.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

HGSVerb: Improving Zero-shot Text Classification via Hierarchical Generative Semantic-Aware Verbalizer.

Proceedings of the International Joint Conference on Neural Networks, 2024

MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic.

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

KEEP CHATTING! An Attractive Dataset for Continuous Conversation Agents.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023

Baichuan 2: Open Large-scale Language Models.

CoRR, 2023

2022

Clouds in the Vicinity of the Stratopause Observed with Lidars at Midlatitudes (40.5-41°N) in China.

Remote. Sens., 2022

CMSuG: Competitive mechanism-based superpixel generation method for image segmentation.

J. Intell. Fuzzy Syst., 2022

VMEKNet: Visual Memory and External Knowledge Based Network for Medical Report Generation.

Proceedings of the PRICAI 2022: Trends in Artificial Intelligence, 2022

Automatic Report Generation Method based on Multiscale Feature Extraction and Word Attention Network.

Proceedings of the Web and Big Data - 6th International Joint Conference, 2022

2021

ComQA: Compositional Question Answering via Hierarchical Graph Neural Networks.

Proceedings of the WWW '21: The Web Conference 2021, 2021

Sentiments Affect Stock Returns: Evidence Based on Big Data.

Proceedings of the 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, 2021

Multi-Lingual Question Generation with Language Agnostic Language Model.

Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020

Incorporating Knowledge and Content Information to Boost News Recommendation.

Proceedings of the Natural Language Processing and Chinese Computing, 2020

2018

Automated comprehensive evaluation approach for user interface satisfaction based on concurrent think-aloud method.

Univers. Access Inf. Soc., 2018

基于改进的BP神经网络的网络空间态势感知系统安全评估 (Research on Cyberspace Situation Awareness Security Assessment Based on Improved BP Neural Network).

计算机科学 (Computer Science), 2018

Utilizing soft constraints to enhance medical relation extraction from the history of present illness in electronic medical records.

J. Biomed. Informatics, 2018

2017

Automatic Generation and Recommendation for API Mashups.

Mooi Choo Chuah

Proceedings of the 16th IEEE International Conference on Machine Learning and Applications, 2017
