Zhouhong Gu

According to our database1, Zhouhong Gu authored at least 29 papers between 2022 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
AgentGroupChat-V2: Divide-and-Conquer Is What LLM-Based Multi-Agent System Need.
CoRR, June, 2025

CompBench: Benchmarking Complex Instruction-guided Image Editing.
CoRR, May, 2025

LITE: LLM-Impelled efficient Taxonomy Evaluation.
CoRR, April, 2025

RECKON: Large-scale Reference-based Efficient Knowledge Evaluation for Large Language Model.
CoRR, April, 2025

ToReMi: Topic-Aware Data Reweighting for Dynamic Pre-Training Data Selection.
CoRR, April, 2025

PII-Bench: Evaluating Query-Aware Privacy Protection Systems.
CoRR, February, 2025

MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments.
CoRR, January, 2025

The Missing Piece in Model Editing: A Deep Dive into the Hidden Damage Brought By Model Editing.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

LLM-GAN: Constructing Generative Adversarial Network Through Large Language Models for Explainable Fake News Detection.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

StrucText-Eval: Evaluating Large Language Model's Reasoning Ability in Structure-Rich Text.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

GAPO: Learning Preferential Prompt through Generative Adversarial Policy Optimization.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
LLM-GAN: Construct Generative Adversarial Network Through Large Language Models For Explainable Fake News Detection.
CoRR, 2024

VCEval: Rethinking What is a Good Educational Video and How to Automatically Evaluate It.
CoRR, 2024

StrucText-Eval: An Autogenerated Benchmark for Evaluating Large Language Model's Ability in Structure-Rich Text Understanding.
CoRR, 2024

Agent Group Chat: An Interactive Group Chat Simulacra For Better Eliciting Collective Emergent Behavior.
CoRR, 2024

The Missing Piece in Model Editing: A Deep Dive into the Hidden Damage Brought By Model Editing.
CoRR, 2024

ConcEPT: Concept-Enhanced Pre-Training for Language Models.
CoRR, 2024

AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

DetectBench: Can Large Language Model Detect and Piece Together Implicit Evidence?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Can Large Language Models Understand Real-World Complex Instructions?
CoRR, 2023

KnowledGPT: Enhancing Large Language Models with Retrieval and Storage Access on Knowledge Bases.
CoRR, 2023

Beyond the Obvious: Evaluating the Reasoning Ability In Real-life Scenarios of Language Models on Life Scapes Reasoning Benchmark~(LSR-Benchmark).
CoRR, 2023

Xiezhi: An Ever-Updating Benchmark for Holistic Domain Knowledge Evaluation.
CoRR, 2023

GANTEE: Generative Adversatial Network for Taxonomy Entering Evaluation.
CoRR, 2023

Sem4SAP: Synonymous Expression Mining From Open Knowledge Graph For Language Model Synonym-Aware Pretraining.
CoRR, 2023

GANTEE: Generative Adversarial Network for Taxonomy Enterance Evaluation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Learning What You Need from What You Did: Product Taxonomy Expansion with User Behaviors Supervision.
Proceedings of the 38th IEEE International Conference on Data Engineering, 2022

Parsing Natural Language into Propositional and First-Order Logic with Dual Reinforcement Learning.
Proceedings of the 29th International Conference on Computational Linguistics, 2022


  Loading...