Xuming Hu

ORCID: 0000-0001-6075-4224

Affiliations:
  • Hong Kong University of Science and Technology Guangzhou (HKUST-GZ), Guangzhou, China
  • Hong Kong University of Science and Technology (HKUST), Tai Po Tsai, Hong Kong
  • Tsinghua University, School of Software, Beijing, China (PhD)


According to our database, Xuming Hu authored at least 156 papers between 2020 and 2025.

Bibliography

2025
A Survey on Parallel Text Generation: From Parallel Decoding to Diffusion Language Models.
CoRR, August, 2025

GM-PRM: A Generative Multimodal Process Reward Model for Multimodal Mathematical Reasoning.
CoRR, August, 2025

WakenLLM: Evaluating Reasoning Potential and Stability in LLMs via Fine-Grained Benchmarking.
CoRR, July, 2025

VLA-Mark: A cross modal watermark for large vision-language alignment model.
CoRR, July, 2025

A Survey of AIOps in the Era of Large Language Models.
CoRR, July, 2025

A Survey on Proactive Defense Strategies Against Misinformation in Large Language Models.
CoRR, July, 2025

Da Yu: Towards USV-Based Image Captioning for Waterway Surveillance and Scene Understanding.
CoRR, June, 2025

Video Signature: In-generation Watermarking for Latent Video Diffusion Models.
CoRR, June, 2025

WaterVG: Waterway Visual Grounding Based on Text-Guided Vision and mmWave Radar.
IEEE Trans. Intell. Transp. Syst., May, 2025

Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities.
CoRR, May, 2025

Compliance-to-Code: Enhancing Financial Compliance Checking via Code Generation.
CoRR, May, 2025

Shifting AI Efficiency From Model-Centric to Data-Centric Compression.
CoRR, May, 2025

LoTA-QAF: Lossless Ternary Adaptation for Quantization-Aware Fine-Tuning.
CoRR, May, 2025

MLLMs are Deeply Affected by Modality Bias.
CoRR, May, 2025

Do BERT-Like Bidirectional Models Still Perform Better on Text Classification in the Era of LLMs?
CoRR, May, 2025

PhysicsArena: The First Multimodal Physics Reasoning Benchmark Exploring Variable, Process, and Solution Dimensions.
CoRR, May, 2025

RePPL: Recalibrating Perplexity by Uncertainty in Semantic Propagation and Language Generation for Explainable QA Hallucination Detection.
CoRR, May, 2025

FlowKV: Enhancing Multi-Turn Conversational Coherence in LLMs via Isolated Key-Value Cache Management.
CoRR, May, 2025

SSR: Speculative Parallel Scaling Reasoning in Test-time.
CoRR, May, 2025

UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models.
CoRR, May, 2025

Internal Chain-of-Thought: Empirical Evidence for Layer-wise Subtask Scheduling in LLMs.
CoRR, May, 2025

Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis.
CoRR, May, 2025

Building a Stable Planner: An Extended Finite State Machine Based Planning Module for Mobile GUI Agent.
CoRR, May, 2025

CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring.
CoRR, May, 2025

Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning?
CoRR, May, 2025

Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization.
CoRR, May, 2025

RTV-Bench: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video.
CoRR, May, 2025

TreeHop: Generate and Filter Next Query Embeddings Efficiently for Multi-hop Question Answering.
CoRR, April, 2025

CoheMark: A Novel Sentence-Level Watermark for Enhanced Text Quality.
CoRR, April, 2025

VideoMark: A Distortion-Free Robust Watermarking Framework for Video Diffusion Models.
CoRR, April, 2025

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment.
CoRR, April, 2025

MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection.
CoRR, March, 2025

Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook.
CoRR, March, 2025

LazyMAR: Accelerating Masked Autoregressive Models via Feature Caching.
CoRR, March, 2025

LLM Agents for Education: Advances and Applications.
CoRR, March, 2025

OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation.
CoRR, March, 2025

MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation.
CoRR, March, 2025

Mark Your LLM: Detecting the Misuse of Open-Source Large Language Models via Watermarking.
CoRR, March, 2025

A Survey of Text Watermarking in the Era of Large Language Models.
ACM Comput. Surv., February, 2025

HyperG: Hypergraph-Enhanced LLMs for Structured Knowledge.
CoRR, February, 2025

Understanding Dynamic Diffusion Process of LLM-based Agents under Information Asymmetry.
CoRR, February, 2025

EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models.
CoRR, February, 2025

Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning.
CoRR, February, 2025

Can LLMs Maintain Fundamental Abilities under KV Cache Compression?
CoRR, February, 2025

RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning.
CoRR, February, 2025

OneForecast: A Universal Framework for Global and Regional Weather Forecasting.
CoRR, February, 2025

ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference.
CoRR, February, 2025

Token Pruning for Caching Better: 9 Times Acceleration on Stable Diffusion for Free.
CoRR, January, 2025

HyperG: Hypergraph-Enhanced LLMs for Structured Knowledge.
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025

DiscoverGPT: Multi-task Fine-tuning Large Language Model for Related Table Discovery.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

PolyJoin: Semantic Multi-key Joinable Table Search in Data Lakes.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Can Watermarked LLMs be Identified by Users via Crafted Prompts?
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Transformers.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

UltraWiki: Ultra-Fine-Grained Entity Set Expansion with Negative Seed Entities.
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025

Multi-view Hypergraph-based Contrastive Learning Model for Cold-Start Micro-video Recommendation.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

Benchmarking Multi-modal Semantic Segmentation under Sensor Failures: Missing and Noisy Modality Robustness.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Dataset Distillation with Neural Characteristic Function: A Minmax Perspective.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

A Survey on Proactive Defense Strategies Against Misinformation in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Decoding Knowledge Attribution in Mixture-of-Experts: A Framework of Basic-Refinement Collaboration and Efficiency Analysis.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

StructFact: Reasoning Factual Knowledge from Structured Data with Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Unlocking Speech Instruction Data Potential with Query Rewriting.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Capturing Nuanced Preferences: Preference-Aligned Distillation for Small Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

SafeEraser: Enhancing Safety in Multimodal Large Language Models through Multimodal Machine Unlearning.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Reading Broadly to Open Your Mind: Improving Open Relation Extraction With Search Documents Under Self-Supervisions.
IEEE Trans. Knowl. Data Eng., May, 2024

Graph Neural Network with curriculum learning for imbalanced node classification.
Neurocomputing, March, 2024

Accelerating Diffusion Transformers with Dual Feature Caching.
CoRR, 2024

MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection.
CoRR, 2024

Review-Then-Refine: A Dynamic Framework for Multi-Hop Question Answering with Temporal Adaptability.
CoRR, 2024

A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges.
CoRR, 2024

Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey.
CoRR, 2024

Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation.
CoRR, 2024

Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios.
CoRR, 2024

Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models.
CoRR, 2024

NeuGPT: Unified multi-modal Neural GPT.
CoRR, 2024

MINER: Mining the Underlying Pattern of Modality-Specific Neurons in Multimodal Large Language Models.
CoRR, 2024

Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality.
CoRR, 2024

ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection.
CoRR, 2024

Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models.
CoRR, 2024

Interpretable Contrastive Monte Carlo Tree Search Reasoning.
CoRR, 2024

DRUPI: Dataset Reduction Using Privileged Information.
CoRR, 2024

Reasoning Factual Knowledge in Structured Data with Large Language Models.
CoRR, 2024

Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language Models.
CoRR, 2024

LLaSA: Large Language and E-Commerce Shopping Assistant.
CoRR, 2024

Refiner: Restructure Retrieval Content Efficiently to Advance Question-Answering Capabilities.
CoRR, 2024

A Survey of AIOps for Failure Management in the Era of Large Language Models.
CoRR, 2024

MarkLLM: An Open-Source Toolkit for LLM Watermarking.
CoRR, 2024

WaterVG: Waterway Visual Grounding based on Text-Guided Vision and mmWave Radar.
CoRR, 2024

Evaluating Robustness of Generative Search Engine on Adversarial Factual Questions.
CoRR, 2024

UltraWiki: Ultra-fine-grained Entity Set Expansion with Negative Seed Entities.
CoRR, 2024

Unraveling Babel: Exploring Multilingual Activation Patterns within Large Language Models.
CoRR, 2024

Rethinking the Roles of Large Language Models in Chinese Grammatical Error Correction.
CoRR, 2024

When LLMs Meet Cunning Questions: A Fallacy Understanding Benchmark for Large Language Models.
CoRR, 2024

Preventing and Detecting Misinformation Generated by Large Language Models.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

When LLMs Meet Cunning Texts: A Fallacy Understanding Benchmark for Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Mind's Mirror: Distilling Self-Evaluation Capability and Comprehensive Thinking from Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

A Semantic Invariant Robust Watermark for Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

An Unforgeable Publicly Verifiable Watermark for Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Towards Understanding Factual Knowledge of Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Unraveling Babel: Exploring Multilingual Activation Patterns of LLMs and Their Applications.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

LongGenBench: Long-context Generation Benchmark.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Refiner: Restructure Retrieved Content Efficiently to Advance Question-Answering Capabilities.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Evaluating Robustness of Generative Search Engine on Adversarial Factoid Questions.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Three Heads Are Better than One: Improving Cross-Domain NER with Progressive Decomposed Network.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Towards Natural Language Interfaces for Data Visualization: A Survey.
IEEE Trans. Vis. Comput. Graph., June, 2023

A Multi-Level Supervised Contrastive Learning Framework for Low-Resource Natural Language Inference.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

A Survey of Text Watermarking in the Era of Large Language Models.
CoRR, 2023

Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity.
CoRR, 2023

Do Large Language Models Know about Facts?
CoRR, 2023

A Private Watermark for Large Language Models.
CoRR, 2023

Give Me More Details: Improving Fact-Checking with Latent Retrieval.
CoRR, 2023

A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capability.
CoRR, 2023

Think Rationally about What You See: Continuous Rationale Extraction for Relation Extraction.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Read it Twice: Towards Faithfully Interpretable Fact Verification by Revisiting Evidence.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

MR2: A Benchmark for Multimodal Retrieval-Augmented Rumor Detection in Social Media.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

SelfLRE: Self-refining Representation Learning for Low-resource Relation Extraction.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Multimodal Named Entity Recognition and Relation Extraction with Retrieval-Augmented Strategy.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Prompt Me Up: Unleashing the Power of Alignments for Multimodal Entity and Relation Extraction.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Gaussian Prior Reinforcement Learning for Nested Named Entity Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

RAPL: A Relation-Aware Prototype Learning Approach for Few-Shot Document-Level Relation Extraction.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

AMR-based Network for Aspect-based Sentiment Analysis.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Exploring the Compositional Generalization in Context Dependent Text-to-SQL Parsing.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Enhancing Cross-lingual Natural Language Inference by Soft Prompting with Multilingual Verbalizer.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Automatic Table Union Search with Tabular Representation Learning.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

GDA: Generative Data Augmentation Techniques for Relation Extraction Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Entity-to-Text based Data Augmentation for various Named Entity Recognition Tasks.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Multimodal Relation Extraction with Cross-Modal Retrieval and Synthesis.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Graph Component Contrastive Learning for Concept Relatedness Estimation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
MespaConfig: Memory-Sparing Configuration Auto-Tuning for Co-Located In-Memory Cluster Computing Jobs.
IEEE Trans. Serv. Comput., 2022

Gradient Imitation Reinforcement Learning for General Low-Resource Information Extraction.
CoRR, 2022

EnTDA: Entity-to-Text based Data Augmentation Approach for Named Entity Recognition Tasks.
CoRR, 2022

Graph Neural Network with Curriculum Learning for Imbalanced Node Classification.
CoRR, 2022

Inferring Commonsense Explanations as Prompts for Future Event Generation.
CoRR, 2022

What Makes the Story Forward?: Inferring Commonsense Explanations as Prompts for Future Event Generation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

HiURE: Hierarchical Exemplar Contrastive Learning for Unsupervised Relation Extraction.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

CHEF: A Pilot Chinese Dataset for Evidence-Based Fact-Checking.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Semantic Enhanced Text-to-SQL Parsing via Iteratively Learning Schema Linking Graph.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

Pair-Level Supervised Contrastive Learning for Natural Language Inference.
Proceedings of the IEEE International Conference on Acoustics, 2022

Query-based Instance Discrimination Network for Relational Triple Extraction.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Character-level White-Box Adversarial Attacks against Transformers via Attachable Subwords Substitution.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Domain-Specific NER via Retrieving Correlated Samples.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Scene Graph Modification as Incremental Structure Expanding.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
Gradient Imitation Reinforcement Learning for Low Resource Relation Extraction.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Semi-supervised Relation Extraction via Incremental Meta Self-Training.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

2020
SelfORE: Self-supervised Relational Feature Learning for Open Relation Extraction.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

