Ben He

Orcid: 0000-0002-2699-9209

Affiliations:
  • Institute of Software, Chinese Academy of Sciences, Beijing, China


According to our database1, Ben He authored at least 98 papers between 2011 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
CoCoNUTS: Concentrating on Content while Neglecting Uninformative Textual Styles for AI-Generated Peer Review Detection.
CoRR, September, 2025

TFRank: Think-Free Reasoning Enables Practical Pointwise LLM Ranking.
CoRR, August, 2025

LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?
CoRR, August, 2025

Beyond Isolated Dots: Benchmarking Structured Table Construction as Deep Knowledge Extraction.
CoRR, July, 2025

Influence of External Information on Large Language Models Mirrors Social Cognitive Patterns.
IEEE Trans. Comput. Soc. Syst., June, 2025

EmbedAgent: Benchmarking Large Language Models in Embedded System Development.
CoRR, June, 2025

ConsistentChat: Building Skeleton-Guided Consistent Dialogues for Large Language Models from Scratch.
CoRR, June, 2025

Expanding the Boundaries of Vision Prior Knowledge in Multi-modal Large Language Models.
CoRR, March, 2025

Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch.
CoRR, February, 2025

SAISA: Towards Multimodal Large Language Models with Both Training and Inference Efficiency.
CoRR, February, 2025

PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides.
CoRR, January, 2025

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models.
CoRR, January, 2025

Editorial.
Inf. Retr. Res. J., 2025

Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Can LLMs Clarify? Investigation and Enhancement of Large Language Models on Argument Claim Optimization.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Memorizing is Not Enough: Deep Knowledge Injection Through Reasoning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

CRUXEVAL-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Self-Steering Optimization: Autonomous Preference Optimization for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

On-Policy Self-Alignment with Fine-grained Knowledge Feedback for Hallucination Mitigation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Code-SPA: Style Preference Alignment to Large Language Models for Effective and Robust Code Debugging.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Not All Terms Matter: Recall-Oriented Adaptive Learning for PLM-aided Query Expansion in Open-Domain Question Answering.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
PARADE: Passage Representation Aggregation forDocument Reranking.
ACM Trans. Inf. Syst., March, 2024

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering.
CoRR, 2024

Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?
CoRR, 2024

CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution.
CoRR, 2024

On-Policy Fine-grained Knowledge Feedback for Hallucination Mitigation.
CoRR, 2024

Towards Scalable Automated Alignment of LLMs: A Survey.
CoRR, 2024

Spiral of Silences: How is Large Language Model Killing Information Retrieval? - A Case Study on Open Domain Question Answering.
CoRR, 2024

Self-Retrieval: Building an Information Retrieval System with One Large Language Model.
CoRR, 2024

Self-Retrieval: End-to-End Information Retrieval with One Large Language Model.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

ChatGPT Is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content Detectors.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

PRP-Graph: Pairwise Ranking Prompting to LLMs with Graph Aggregation for Effective Text Re-ranking.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

XMC-Agent : Dynamic Navigation over Scalable Hierarchical Index for Incremental Extreme Multi-label Classification.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Spiral of Silence: How is Large Language Model Killing Information Retrieval? - A Case Study on Open Domain Question Answering.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Analyze, Generate and Refine: Query Expansion with LLMs for Zero-Shot Open-Domain QA.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Rule or Story, Which is a Better Commonsense Expression for Talking with Large Language Models?
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Mitigating Large Language Model Hallucinations via Autonomous Knowledge Graph-Based Retrofitting.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Dealing with textual noise for robust and effective BERT re-ranking.
Inf. Process. Manag., 2023

Defense of Adversarial Ranking Attack in Text Retrieval: Benchmark and Baseline via Detection.
CoRR, 2023

A Drop of Ink Makes a Million Think: The Spread of False Information in Large Language Models.
CoRR, 2023

ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models.
CoRR, 2023

CIP at TREC Deep Learning Track 2023.
Proceedings of the Thirty-Second Text REtrieval Conference Proceedings (TREC 2023), 2023

Offline Pseudo Relevance Feedback for Efficient and Effective Single-pass Dense Retrieval.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Contextual Interaction for Argument Post Quality Assessment.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Hidding the Ghostwriters: An Adversarial Evaluation of AI-Generated Student Essay Detection.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Contrastive Distant Supervision for Debiased and Denoised Machine Reading Comprehension.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Understanding Differential Search Index for Text Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Towards Imperceptible Document Manipulations against Neural Ranking Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
CIP at TREC 2022 Deep Learning Track.
Proceedings of the Thirty-First Text REtrieval Conference, 2022

Re-thinking Knowledge Graph Completion Evaluation from an Information Retrieval Perspective.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

Towards Robust Dense Retrieval via Local Ranking Alignment.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Groupwise Query Performance Prediction with BERT.
Proceedings of the Advances in Information Retrieval, 2022

Incorporating Ranking Context for End-to-End BERT Re-ranking.
Proceedings of the Advances in Information Retrieval, 2022

2021
Contextualized query expansion via unsupervised chunk selection for text retrieval.
Inf. Process. Manag., 2021

Bridging the Gap between Language Model and Reading Comprehension: Unsupervised MRC via Self-Supervision.
CoRR, 2021

Co-BERT: A Context-Aware BERT Retrieval Model Incorporating Local and Query-specific Context.
CoRR, 2021

CIP at TREC 2021 Deep Learning Track.
Proceedings of the Thirtieth Text REtrieval Conference, 2021

Contextualized Offline Relevance Weighting for Efficient and Effective Neural Retrieval.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Simplified TinyBERT: Knowledge Distillation for Document Retrieval.
Proceedings of the Advances in Information Retrieval, 2021

2020
An end-to-end pseudo relevance feedback framework for neural document retrieval.
Inf. Process. Manag., 2020

PARADE: Passage Representation Aggregation for Document Reranking.
CoRR, 2020

ICIP at TREC-2020 Deep Learning Track.
Proceedings of the Twenty-Ninth Text REtrieval Conference, 2020

End-to-End Multi-task Learning for Allusion Detection in Ancient Chinese Poems.
Proceedings of the Knowledge Science, Engineering and Management, 2020

BERT-QE: Contextualized Query Expansion for Document Re-ranking.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Global Bootstrapping Neural Network for Entity Set Expansion.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

End-to-End Bootstrapping Neural Network for Entity Set Expansion.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Learning to Map Frequent Phrases to Sub-Structures of Meaning Representation for Neural Semantic Parsing.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Learning to Bootstrap for Entity Set Expansion.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Deep Sequence-to-Sequence Entity Matching for Heterogeneous Entity Resolution.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

2018
Neural relevance model using similarities with elite documents for effective clinical decision support.
Int. J. Data Min. Bioinform., 2018

NPRF: A Neural Pseudo Relevance Feedback Framework for Ad-hoc Information Retrieval.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Neural Precision Medicine by Mining Implicit Treatment Concepts.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

TDNN: A Two-stage Deep Neural Network for Prompt-independent Automated Essay Scoring.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
A Feedback-Based Approach to Utilizing Embeddings for Clinical Decision Support.
Data Sci. Eng., 2017

A document-based neural relevance model for effective clinical decision support.
Proceedings of the 2017 IEEE International Conference on Bioinformatics and Biomedicine, 2017

2016
Training query filtering for semi-supervised learning to rank with pseudo labels.
World Wide Web, 2016

A Set-Based Training Query Classification Approach for Twitter Search.
Proceedings of the Web-Age Information Management - 17th International Conference, 2016

2015
Selecting Training Data for Learning-Based Twitter Search.
Proceedings of the Advances in Information Retrieval, 2015

2013
Utilizing term proximity for blog post retrieval.
J. Assoc. Inf. Sci. Technol., 2013

UCAS at TREC-2013 Microblog Track.
Proceedings of The Twenty-Second Text REtrieval Conference, 2013

Learn to Rank Tweets by Integrating Query-Specific Characteristics.
Proceedings of the Behavior and Social Computing, 2013

On Evaluating Query Performance Predictors.
Proceedings of the Pervasive Computing and the Networked World, 2013

Sponsored Search Ad Selection by Keyword Structure Analysis.
Proceedings of the Advances in Information Retrieval, 2013

Clustering-based transduction for learning a ranking model with limited human labels.
Proceedings of the 22nd ACM International Conference on Information and Knowledge Management, 2013

2012
UCAS at TREC 2012 Microblog Track.
Proceedings of The Twenty-First Text REtrieval Conference, 2012

Transductive Learning for Real-Time Twitter Search.
Proceedings of the Sixth International Conference on Weblogs and Social Media, 2012

A Survey of Learning to Rank for Real-Time Twitter Search.
Proceedings of the Pervasive Computing and the Networked World, 2012

Query-biased learning to rank for real-time twitter search.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

Question-answer topic model for question retrieval in community question answering.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

A Ranked-Based Learning Approach to Automated Essay Scoring.
Proceedings of the 2012 Second International Conference on Cloud and Green Computing, 2012

2011
GUCAS at TREC 2011 Microblog Track.
Proceedings of The Twentieth Text REtrieval Conference, 2011

A Comparative Study of Pseudo Relevance Feedback for Ad-hoc Retrieval.
Proceedings of the Advances in Information Retrieval Theory, 2011

Exploring categorization property of social annotations for information retrieval.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Relevance weighting using within-document term statistics.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011


  Loading...