Wanggui He

According to our database1, Wanggui He authored at least 24 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
CoAR: Concept Injection into Autoregressive Models for Personalized Text-to-Image Generation.
CoRR, August, 2025

Fast-Slow Thinking for Large Vision-Language Model Reasoning.
CoRR, April, 2025

Boosting MLLM Reasoning with Text-Debiased Hint-GRPO.
CoRR, March, 2025

CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation.
CoRR, March, 2025

MINT: Multi-modal Chain of Thought in Unified Generative Models for Enhanced Image Generation.
CoRR, March, 2025

HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation.
CoRR, February, 2025

MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Streaming Video Question-Answering with In-context Video KV-Cache Retrieval.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

TFCustom: Customized Image Generation with Time-Aware Frequency Feature Guidance.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts.
CoRR, 2024

A Comprehensive Survey of Datasets, Theories, Variants, and Applications in Direct Preference Optimization.
CoRR, 2024

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation.
CoRR, 2024

TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition.
CoRR, 2024

MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance.
CoRR, 2024

Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback.
CoRR, 2024

HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models.
CoRR, 2024

2023
TrainerAgent: Customizable and Efficient Model Training through LLM-Powered Multi-Agent System.
CoRR, 2023

2021
Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021


  Loading...