Xuming Hu
ORCID: 0000-0001-6075-4224
Affiliations:
- Hong Kong University of Science and Technology Guangzhou (HKUST-GZ), Guangzhou, China
- Hong Kong University of Science and Technology (HKUST), Tai Po Tsai, Hong Kong
- Tsinghua University, School of Software, Beijing, China (PhD)
According to our database, Xuming Hu authored at least 156 papers between 2020 and 2025.
Bibliography
2025
A Survey on Parallel Text Generation: From Parallel Decoding to Diffusion Language Models.
CoRR, August, 2025
GM-PRM: A Generative Multimodal Process Reward Model for Multimodal Mathematical Reasoning.
CoRR, August, 2025
WakenLLM: Evaluating Reasoning Potential and Stability in LLMs via Fine-Grained Benchmarking.
CoRR, July, 2025
CoRR, July, 2025
A Survey on Proactive Defense Strategies Against Misinformation in Large Language Models.
CoRR, July, 2025
Da Yu: Towards USV-Based Image Captioning for Waterway Surveillance and Scene Understanding.
CoRR, June, 2025
CoRR, June, 2025
IEEE Trans. Intell. Transp. Syst., May, 2025
Unveiling Instruction-Specific Neurons & Experts: An Analytical Framework for LLM's Instruction-Following Capabilities.
CoRR, May, 2025
CoRR, May, 2025
CoRR, May, 2025
CoRR, May, 2025
Do BERT-Like Bidirectional Models Still Perform Better on Text Classification in the Era of LLMs?
CoRR, May, 2025
PhysicsArena: The First Multimodal Physics Reasoning Benchmark Exploring Variable, Process, and Solution Dimensions.
CoRR, May, 2025
RePPL: Recalibrating Perplexity by Uncertainty in Semantic Propagation and Language Generation for Explainable QA Hallucination Detection.
CoRR, May, 2025
FlowKV: Enhancing Multi-Turn Conversational Coherence in LLMs via Isolated Key-Value Cache Management.
CoRR, May, 2025
UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models.
CoRR, May, 2025
Internal Chain-of-Thought: Empirical Evidence for Layer-wise Subtask Scheduling in LLMs.
CoRR, May, 2025
Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis.
CoRR, May, 2025
Building a Stable Planner: An Extended Finite State Machine Based Planning Module for Mobile GUI Agent.
CoRR, May, 2025
CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring.
CoRR, May, 2025
CoRR, May, 2025
Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization.
CoRR, May, 2025
RTV-Bench: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video.
CoRR, May, 2025
TreeHop: Generate and Filter Next Query Embeddings Efficiently for Multi-hop Question Answering.
CoRR, April, 2025
CoRR, April, 2025
VideoMark: A Distortion-Free Robust Watermarking Framework for Video Diffusion Models.
CoRR, April, 2025
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment.
CoRR, April, 2025
MathAgent: Leveraging a Mixture-of-Math-Agent Framework for Real-World Multimodal Mathematical Error Detection.
CoRR, March, 2025
Retrieval Augmented Generation and Understanding in Vision: A Survey and New Outlook.
CoRR, March, 2025
CoRR, March, 2025
OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation.
CoRR, March, 2025
MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation.
CoRR, March, 2025
Mark Your LLM: Detecting the Misuse of Open-Source Large Language Models via Watermarking.
CoRR, March, 2025
ACM Comput. Surv., February, 2025
Understanding Dynamic Diffusion Process of LLM-based Agents under Information Asymmetry.
CoRR, February, 2025
EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models.
CoRR, February, 2025
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning.
CoRR, February, 2025
CoRR, February, 2025
RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning.
CoRR, February, 2025
CoRR, February, 2025
ChunkKV: Semantic-Preserving KV Cache Compression for Efficient Long-Context LLM Inference.
CoRR, February, 2025
CoRR, January, 2025
Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2025
DiscoverGPT: Multi-task Fine-tuning Large Language Model for Related Table Discovery.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Proceedings of the 41st IEEE International Conference on Data Engineering, 2025
Multi-view Hypergraph-based Contrastive Learning Model for Cold-Start Micro-video Recommendation.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025
Benchmarking Multi-modal Semantic Segmentation under Sensor Failures: Missing and Noisy Modality Robustness.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
ICT: Image-Object Cross-Level Trusted Intervention for Mitigating Object Hallucination in Large Vision-Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025
Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
A Survey on Proactive Defense Strategies Against Misinformation in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Decoding Knowledge Attribution in Mixture-of-Experts: A Framework of Basic-Refinement Collaboration and Efficiency Analysis.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
StructFact: Reasoning Factual Knowledge from Structured Data with Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Capturing Nuanced Preferences: Preference-Aligned Distillation for Small Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
SafeEraser: Enhancing Safety in Multimodal Large Language Models through Multimodal Machine Unlearning.
Proceedings of the Findings of the Association for Computational Linguistics, 2025
Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025
2024
Reading Broadly to Open Your Mind: Improving Open Relation Extraction With Search Documents Under Self-Supervisions.
IEEE Trans. Knowl. Data Eng., May, 2024
Neurocomputing, March, 2024
MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection.
CoRR, 2024
Review-Then-Refine: A Dynamic Framework for Multi-Hop Question Answering with Temporal Adaptability.
CoRR, 2024
A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges.
CoRR, 2024
Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey.
CoRR, 2024
CoRR, 2024
Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios.
CoRR, 2024
CoRR, 2024
MINER: Mining the Underlying Pattern of Modality-Specific Neurons in Multimodal Large Language Models.
CoRR, 2024
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality.
CoRR, 2024
ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection.
CoRR, 2024
Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models.
CoRR, 2024
CoRR, 2024
Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language Models.
CoRR, 2024
Refiner: Restructure Retrieval Content Efficiently to Advance Question-Answering Capabilities.
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
CoRR, 2024
Unraveling Babel: Exploring Multilingual Activation Patterns within Large Language Models.
CoRR, 2024
Rethinking the Roles of Large Language Models in Chinese Grammatical Error Correction.
CoRR, 2024
When LLMs Meet Cunning Questions: A Fallacy Understanding Benchmark for Large Language Models.
CoRR, 2024
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024
When LLMs Meet Cunning Texts: A Fallacy Understanding Benchmark for Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Mind's Mirror: Distilling Self-Evaluation Capability and Comprehensive Thinking from Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Unraveling Babel: Exploring Multilingual Activation Patterns of LLMs and Their Applications.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Refiner: Restructure Retrieved Content Efficiently to Advance Question-Answering Capabilities.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
On the Robustness of Document-Level Relation Extraction Models to Entity Name Variations.
Proceedings of the Findings of the Association for Computational Linguistics, 2024
Proceedings of the Findings of the Association for Computational Linguistics, 2024
LLMArena: Assessing Capabilities of Large Language Models in Dynamic Multi-Agent Environments.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
Three Heads Are Better than One: Improving Cross-Domain NER with Progressive Decomposed Network.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
IEEE Trans. Vis. Comput. Graph., June, 2023
A Multi-Level Supervised Contrastive Learning Framework for Low-Resource Natural Language Inference.
IEEE ACM Trans. Audio Speech Lang. Process., 2023
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity.
CoRR, 2023
Think Rationally about What You See: Continuous Rationale Extraction for Relation Extraction.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Read it Twice: Towards Faithfully Interpretable Fact Verification by Revisiting Evidence.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Multimodal Named Entity Recognition and Relation Extraction with Retrieval-Augmented Strategy.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023
Prompt Me Up: Unleashing the Power of Alignments for Multimodal Entity and Relation Extraction.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the IEEE International Conference on Acoustics, 2023
RAPL: A Relation-Aware Prototype Learning Approach for Few-Shot Document-Level Relation Extraction.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Enhancing Cross-lingual Natural Language Inference by Soft Prompting with Multilingual Verbalizer.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023
2022
MespaConfig: Memory-Sparing Configuration Auto-Tuning for Co-Located In-Memory Cluster Computing Jobs.
IEEE Trans. Serv. Comput., 2022
Gradient Imitation Reinforcement Learning for General Low-Resource Information Extraction.
CoRR, 2022
EnTDA: Entity-to-Text based Data Augmentation Approach for Named Entity Recognition Tasks.
CoRR, 2022
CoRR, 2022
CoRR, 2022
What Makes the Story Forward?: Inferring Commonsense Explanations as Prompts for Future Event Generation.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022
HiURE: Hierarchical Exemplar Contrastive Learning for Unsupervised Relation Extraction.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Character-level White-Box Adversarial Attacks against Transformers via Attachable Subwords Substitution.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
Proceedings of the 29th International Conference on Computational Linguistics, 2022
Proceedings of the 29th International Conference on Computational Linguistics, 2022
2021
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021
2020
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020