Xuezhi Cao

Orcid: 0000-0002-7044-1341

According to our database1, Xuezhi Cao authored at least 56 papers between 2012 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Friend or Foe: How LLMs' Safety Mind Gets Fooled by Intent Shift Attack.
CoRR, November, 2025

CATArena: Evaluation of LLM Agents through Iterative Tournament Competitions.
CoRR, October, 2025

AMO-Bench: Large Language Models Still Struggle in High School Math Competitions.
CoRR, October, 2025

Automatically Benchmarking LLM Code Agents through Agent-Driven Annotation and Evaluation.
CoRR, October, 2025

UNO-Bench: A Unified Benchmark for Exploring the Compositional Law Between Uni-modal and Omni-modal in Omni Models.
CoRR, October, 2025

SOP-Maze: Evaluating Large Language Models on Complicated Business Standard Operating Procedures.
CoRR, October, 2025

R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
CoRR, October, 2025

Making Mathematical Reasoning Adaptive.
CoRR, October, 2025

LongCat-Flash-Thinking Technical Report.
CoRR, September, 2025

MUSE: MCTS-Driven Red Teaming Framework for Enhanced Multi-Turn Dialogue Safety in Large Language Models.
CoRR, September, 2025

Instance-level Randomization: Toward More Stable LLM Evaluations.
CoRR, September, 2025

KG-o1: Enhancing Multi-hop Question Answering in Large Language Models via Knowledge Graph Integration.
CoRR, August, 2025

I2CR: Intra- and Inter-modal Collaborative Reflections for Multimodal Entity Linking.
CoRR, August, 2025

CoreCodeBench: A Configurable Multi-Scenario Repository-Level Benchmark.
CoRR, July, 2025

HKD4VLM: A Progressive Hybrid Knowledge Distillation Framework for Robust Multimodal Hallucination and Factuality Detection in VLMs.
CoRR, June, 2025

OIBench: Benchmarking Strong Reasoning Models with Olympiad in Informatics.
CoRR, June, 2025

NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment.
CoRR, May, 2025

ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations.
CoRR, May, 2025

Why Not Act on What You Know? Unleashing Safety Potential of LLMs via Self-Aware Guard Enhancement.
CoRR, May, 2025

Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese.
CoRR, May, 2025

Who's the MVP? A Game-Theoretic Evaluation Benchmark for Modular Attribution in LLM Agents.
CoRR, February, 2025

TokenFocus-VQA: Enhancing Text-to-Image Alignment with Position-Aware Focus and Multi-Perspective Aggregations on LVLMs.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025

Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Why Not Act on What You Know? Unleashing Safety Potential of LLMs via Self-Aware Guard Enhancement.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Length Desensitization in Directed Preference Optimization.
CoRR, 2024

A Wolf in Sheep's Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Conjoin after Decompose: Improving Few-Shot Performance of Named Entity Recognition.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Popularity Bias is not Always Evil: Disentangling Benign and Harmful Bias for Recommendation.
IEEE Trans. Knowl. Data Eng., October, 2023

Entity-Aspect-Opinion-Sentiment Quadruple Extraction for Fine-grained Sentiment Analysis.
CoRR, 2023

Exchanging-based Multimodal Fusion with Transformer.
CoRR, 2023

Meta-Learning Triplet Network with Adaptive Margins for Few-Shot Named Entity Recognition.
CoRR, 2023

Adap-tau: Adaptively Modulating Embedding Magnitude for Recommendation.
CoRR, 2023

FFHR: Fully and Flexible Hyperbolic Representation for Knowledge Graph Completion.
CoRR, 2023

Adap-τ : Adaptively Modulating Embedding Magnitude for Recommendation.
Proceedings of the ACM Web Conference 2023, 2023

Transferable and Efficient: Unifying Dynamic Multi-Domain Product Categorization.
Proceedings of the The 61st Annual Meeting of the Association for Computational Linguistics: Industry Track, 2023

2021
DisenKGAT: Knowledge Graph Embedding with Disentangled Graph Attention Network.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

2020
Iterative Strategy for Named Entity Recognition with Imperfect Annotations.
Proceedings of the Natural Language Processing and Chinese Computing, 2020

Table Fact Verification with Structure-Aware Transformer.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Multi-modal Knowledge Graphs for Recommender Systems.
Proceedings of the CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, 2020

2018
Collaborative Filtering with Graph-based Implicit Feedback.
CoRR, 2018

A Bootstrapping Framework With Interactive Information Modeling for Network Alignment.
IEEE Access, 2018

A Machine Learning Approach to Prevent Malicious Calls over Telephony Networks.
Proceedings of the 2018 IEEE Symposium on Security and Privacy, 2018

Neural Link Prediction over Aligned Networks.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Joint User Modeling Across Aligned Heterogeneous Sites Using Neural Networks.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2017

IMAP: An iterative method for aligning protein-protein interaction networks.
Proceedings of the 2017 IEEE International Conference on Bioinformatics and Biomedicine, 2017

2016
A Complete & Comprehensive Movie Review Dataset (CCMR).
Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016

Are You Influenced by Others When Rating?: Improve Rating Prediction by Conformity Modeling.
Proceedings of the 10th ACM Conference on Recommender Systems, 2016

Joint User Modeling across Aligned Heterogeneous Sites.
Proceedings of the 10th ACM Conference on Recommender Systems, 2016

BASS: A Bootstrapping Approach for Aligning Heterogenous Social Networks.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2016

ASNets: A Benchmark Dataset of Aligned Social Networks for Cross-Platform User Modeling.
Proceedings of the 25th ACM International Conference on Information and Knowledge Management, 2016

2015
Revealing Multiple Layers of Hidden Community Structure in Networks.
CoRR, 2015

Recovering Cross-Device Connections via Mining IP Footprints with Ensemble Learning.
Proceedings of the IEEE International Conference on Data Mining Workshop, 2015

2012
News comments generation via mining microblogs.
Proceedings of the 21st World Wide Web Conference, 2012


  Loading...