Benyou Wang

Orcid: 0000-0002-1501-9914

Affiliations:
  • Chinese University of Hong Kong-Shenzhen (CUHK-SZ), Shenzhen, China


According to our database, Benyou Wang authored at least 164 papers between 2015 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
Towards Assessing Medical Ethics from Knowledge to Practice.
CoRR, August, 2025

MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos.
CoRR, July, 2025

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation.
CoRR, June, 2025

QFFT, Question-Free Fine-Tuning for Adaptive Reasoning.
CoRR, June, 2025

SRLAgent: Enhancing Self-Regulated Learning Skills through Gamification and LLM Assistance.
CoRR, June, 2025

CoRT: Code-integrated Reasoning within Thinking.
CoRR, June, 2025

FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion.
CoRR, June, 2025

Model Unlearning via Sparse Autoencoder Subspace Guided Projections.
CoRR, May, 2025

From Word to World: Evaluate and Mitigate Culture Bias via Word Association Test.
CoRR, May, 2025

Real-Time Verification of Embodied Reasoning for Generative Skill Acquisition.
CoRR, May, 2025

Learning from Peers in Reasoning Models.
CoRR, May, 2025

Video-R1: Reinforcing Video Reasoning in MLLMs.
CoRR, March, 2025

Large Language Models for Outpatient Referral: Problem Definition, Benchmarking and Challenges.
CoRR, March, 2025

S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information.
CoRR, March, 2025

The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models.
CoRR, March, 2025

NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants.
CoRR, February, 2025

Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis.
CoRR, February, 2025

Uncertainty-Aware Search and Value Models: Mitigating Search Scaling Flaws in LLMs.
CoRR, February, 2025

TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets.
CoRR, February, 2025

Federated Linear Dueling Bandits.
CoRR, February, 2025

Scaling Flaws of Verifier-Guided Search in Mathematical Reasoning.
CoRR, February, 2025

RealCritic: Towards Effectiveness-Driven Evaluation of Language Model Critiques.
CoRR, January, 2025

Enabling Scalable Oversight via Self-Evolving Critic.
CoRR, January, 2025

RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions.
CoRR, January, 2025

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models.
Trans. Mach. Learn. Res., 2025

DialogueLLM: Context and emotion knowledge-tuned large language models for emotion recognition in conversations.
Neural Networks, 2025

Beyond Binary: Towards Fine-Grained LLM-Generated Text Detection via Role Recognition and Involvement Measurement.
Proceedings of the ACM on Web Conference 2025, 2025

Periodical Moving Average Accelerates Gradient Accumulation for Post-Training.
Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2025

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Huatuo-26M, a Large-scale Chinese Medical QA Dataset.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

LLMs for Mathematical Modeling: Towards Bridging the Gap between Natural and Mathematical Languages.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

UCL-Bench: A Chinese User-Centric Legal Benchmark for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Is Your LLM Outdated? A Deep Look at Temporal Generalization.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Smurfs: Multi-Agent System using Context-Efficient DFSDT for Tool Planning.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Less is More: A Simple yet Effective Token Reduction Method for Efficient Multi-modal LLMs.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Leveraging Unit Language Guidance to Advance Speech Modeling in Textless Speech-to-Speech Translation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Soundwave: Less is More for Speech-Text Alignment in LLMs.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Unlocking LLMs' Self-Improvement Capacity with Autonomous Learning for Domain Adaptation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Towards Medical Complex Reasoning with LLMs through Medical Verifiable Problems.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Natural Language Reasoning, A Survey.
ACM Comput. Surv., December, 2024

On Elastic Language Models.
ACM Trans. Inf. Syst., November, 2024

Few-Shot Class-Incremental Learning for Medical Time Series Classification.
IEEE J. Biomed. Health Informatics, April, 2024

Spatio-temporal Contrastive Learning-enhanced GNNs for Session-based Recommendation.
ACM Trans. Inf. Syst., March, 2024

Pre-trained Language Models in Biomedical Domain: A Systematic Survey.
ACM Comput. Surv., March, 2024

A Survey of Quantum-cognitively Inspired Sentiment Analysis Models.
ACM Comput. Surv., January, 2024

Mixture of Latent Experts Using Tensor Products.
Trans. Mach. Learn. Res., 2024

Document-level Relation Extraction with Relation Correlations.
Neural Networks, 2024

On the Compositional Generalization of Multimodal LLMs for Medical Imaging.
CoRR, 2024

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs.
CoRR, 2024

BlenderLLM: Training Large Language Models for Computer-Aided Design with Self-improvement.
CoRR, 2024

Second Language (Arabic) Acquisition of LLMs via Progressive Vocabulary Expansion.
CoRR, 2024

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?
CoRR, 2024

Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination.
CoRR, 2024

Roadmap towards Superhuman Speech Understanding using Large Language Models.
CoRR, 2024

LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture.
CoRR, 2024

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications.
CoRR, 2024

HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale.
CoRR, 2024

LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace Them.
CoRR, 2024

LLMs Could Autonomously Learn Without External Supervision.
CoRR, 2024

MotionLLM: Understanding Human Behaviors from Human Motions and Videos.
CoRR, 2024

ORLM: Training Large Language Models for Optimization Modeling.
CoRR, 2024

Mixture of Experts Using Tensor Products.
CoRR, 2024

Mamo: a Mathematical Modeling Benchmark with Solvers.
CoRR, 2024

Evaluating LLMs at Evaluating Temporal Generalization.
CoRR, 2024

Smurfs: Leveraging Multiple Proficiency Agents with Context-Efficiency for Tool Planning.
CoRR, 2024

MileBench: Benchmarking MLLMs in Long Context.
CoRR, 2024

No Language is an Island: Unifying Chinese and English in Financial Large Language Models, Instruction Data, and Benchmarks.
CoRR, 2024

Online Training of Large Language Models: Learn while chatting.
CoRR, 2024

Apollo: An Lightweight Multilingual Medical LLM towards Democratizing Medical AI to 6B People.
CoRR, 2024

The FinBen: An Holistic Financial Benchmark for Large Language Models.
CoRR, 2024

ALLaVA: Harnessing GPT4V-synthesized Data for A Lite Vision-Language Model.
CoRR, 2024

Humans or LLMs as the Judge? A Study on Judgement Biases.
CoRR, 2024

Pushing The Limit of LLM Capacity for Text Classification.
CoRR, 2024

MedBench: A Comprehensive, Standardized, and Reliable Benchmarking System for Evaluating Chinese Medical Large Language Models.
Big Data Min. Anal., 2024

Boosting Protein Language Models with Negative Sample Mining.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track, 2024

FinBen: A Holistic Financial Benchmark for Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Alignment at Pre-training! Towards Native Alignment for Arabic LLMs.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

OVM, Outcome-supervised Value Models for Planning in Mathematical Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

CMB: A Comprehensive Medical Benchmark in Chinese.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

AceGPT, Localizing Large Language Models in Arabic.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

MathScale: Scaling Instruction Tuning for Mathematical Reasoning.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Rethinking the Uniformity Metric in Self-Supervised Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Humans or LLMs as the Judge? A Study on Judgement Bias.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

PlatoLM: Teaching LLMs in Multi-Round Dialogue via a User Simulator.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Exploring the Potential of Dense Information in Multimodal Alignment.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Silkie: Preference Distillation for Large Visual Language Models.
CoRR, 2023

MLLM-Bench, Evaluating Multi-modal LLMs using GPT-4V.
CoRR, 2023

HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs.
CoRR, 2023

Outcome-supervised Verifiers for Planning in Mathematical Reasoning.
CoRR, 2023

DialogueLLM: Context and Emotion Knowledge-Tuned LLaMA Models for Emotion Recognition in Conversations.
CoRR, 2023

AceGPT, Localizing Large Language Models in Arabic.
CoRR, 2023

Large Language Model as a User Simulator.
CoRR, 2023

CMB: A Comprehensive Medical Benchmark in Chinese.
CoRR, 2023

On the Difference of BERT-style and CLIP-style Text Encoders.
CoRR, 2023

Injecting Knowledge into Biomedical Pre-trained Models via Polymorphism and Synonymous Substitution.
CoRR, 2023

Is Information Extraction Solved by ChatGPT? An Analysis of Performance, Evaluation Criteria, Robustness and Errors.
CoRR, 2023

Word Grounded Graph Convolutional Network.
CoRR, 2023

Huatuo-26M, a Large-scale Chinese Medical QA Dataset.
CoRR, 2023

Phoenix: Democratizing ChatGPT across Languages.
CoRR, 2023

A Survey on Biomedical Text Summarization with Pre-trained Language Model.
CoRR, 2023

Nature Language Reasoning, A Survey.
CoRR, 2023

Modular Retrieval for Generalization and Interpretation.
CoRR, 2023

Adapting Pre-trained Language Models for Quantum Natural Language Processing.
CoRR, 2023

CMMA: Benchmarking Multi-Affection Detection in Chinese Multi-Modal Conversations.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

HuatuoGPT, Towards Taming Language Model to Be a Doctor.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Lifting the Curse of Capacity Gap in Distilling Language Models.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

One Cannot Stand for Everyone! Leveraging Multiple User Simulators to train Task-oriented Dialogue Systems.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

On the Difference of BERT-style and CLIP-style Text Encoders.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Effective Open Intent Classification with K-center Contrastive Learning and Adjustable Decision Boundary.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Complex-valued Neural Network-based Quantum Language Models.
ACM Trans. Inf. Syst., 2022

Document-level Relation Extraction with Relation Correlations.
CoRR, 2022

Doge Tickets: Uncovering Domain-General Language Models by Playing Lottery Tickets.
Proceedings of the Natural Language Processing and Chinese Computing, 2022

MorphTE: Injecting Morphology in Tensorized Embeddings.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Exploring extreme parameter compression for pre-trained language models.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Hypoformer: Hybrid Decomposition Transformer for Edge-friendly Neural Machine Translation.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

DPTDR: Deep Prompt Tuning for Dense Passage Retrieval.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
CFN: A Complex-Valued Fuzzy Network for Sarcasm Detection in Conversations.
IEEE Trans. Fuzzy Syst., 2021

Pre-trained Language Models in Biomedical Domain: A Systematic Survey.
CoRR, 2021

Word2Fun: Modelling Words as Functions for Diachronic Word Representation.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Sequential Modeling in Vector Space.
Proceedings of the 11th Italian Information Retrieval Workshop 2021, 2021

On Position Embeddings in BERT.
Proceedings of the 9th International Conference on Learning Representations, 2021

What Does Your Smile Mean? Jointly Detecting Multi-Modal Sarcasm and Sentiment Using Quantum Probability.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Exploration of University Students' Mobility Behavior on Campus.
Proceedings of the CONF-CDS 2021: The 2nd International Conference on Computing and Data Science, 2021

2020
Leveraging Long and Short-Term Information in Content-Aware Movie Recommendation via Adversarial Training.
IEEE Trans. Cybern., 2020

A distant supervision method based on paradigmatic relations for learning word embeddings.
Neural Comput. Appl., 2020

Encoding word order in complex embeddings.
Proceedings of the 8th International Conference on Learning Representations, 2020

University of Padova @ DIACR-Ita (short paper).
Proceedings of the Seventh Evaluation Campaign of Natural Language Processing and Speech Tools for Italian. Final Workshop (EVALITA 2020), 2020

A Multi-task Learning Framework for Opinion Triplet Extraction.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Meta-Learning for Neural Relation Classification with Distant Supervision.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

2019
Detection of Network Intrusion Threat Based on the Probabilistic Neural Network Model.
Inf. Technol. Control., December, 2019

Semantic Hilbert Space for Text Representation Learning.
Proceedings of the World Wide Web Conference, 2019

Dynamic Content Monitoring and Exploration using Vector Spaces.
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019

CNM: An Interpretable Complex-valued Network for Matching.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

2018
A quantum-inspired multimodal sentiment analysis framework.
Theor. Comput. Sci., 2018

TextZoo, a New Benchmark for Reconsidering Text Classification.
CoRR, 2018

Detection of subtype blood cells using deep learning.
Cogn. Syst. Res., 2018

Quantum-Inspired Complex Word Embedding.
Proceedings of The Third Workshop on Representation Learning for NLP, 2018

A Multi-task Learning Approach for Image Captioning.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

PLASTIC: Prioritize Long and Short-term Information in Top-n Recommendation using Adversarial Training.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

A Quantum Many-body Wave Function Inspired Language Modeling Approach.
Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

Full Cycle Campus Life of College Students: A Big Data Case in China.
Proceedings of the 2018 IEEE International Conference on Big Data and Smart Computing, 2018

End-to-End Quantum-like Language Models with Application to Question Answering.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Learning to diversify web search results with a Document Repulsion Model.
Inf. Sci., 2017

Regularizing Neural Networks via Retaining Confident Connections.
Entropy, 2017

Leveraging Long and Short-term Information in Content-aware Movie Recommendation.
CoRR, 2017

IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models.
Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2017

Enhanced Embedding Based Attentive Pooling Network for Answer Selection.
Proceedings of the Natural Language Processing and Chinese Computing, 2017

How Users Select Query Suggestions Under Different Satisfaction States?
Proceedings of the Information Retrieval - 23rd China conference, 2017

2016
A Quantum Query Expansion Approach for Session Search.
Entropy, 2016

Exploration of Quantum Interference in Document Relevance Judgement Discrepancy.
Entropy, 2016

A Chinese Question Answering Approach Integrating Count-Based and Embedding-Based Features.
Proceedings of the Natural Language Understanding and Intelligent Applications, 2016

2015
A Real-Time Eye Tracking Based Query Expansion Approach via Latent Topic Modeling.
Proceedings of the 24th ACM International Conference on Information and Knowledge Management, 2015

