Xuansheng Wu

Orcid: 0000-0002-7816-7658

According to our database1, Xuansheng Wu authored at least 38 papers between 2021 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Learnable Assessment Skills for LLM-based Automated Scoring: Rubric Construction via Iterative Optimization.
CoRR, May, 2026

Safactory: A Scalable Agentic Infrastructure for Training Trustworthy Autonomous Intelligence.
CoRR, May, 2026

BRIDGE the Gap: Mitigating Bias Amplification in Automated Scoring of English Language Learners via Inter-group Data Augmentation.
CoRR, February, 2026

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs.
CoRR, February, 2026

Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2026, 2026

BRIDGE the Gap: Mitigating Bias Amplification in Automated Scoring of English Language Learners via Inter-group Data Augmentation.
Proceedings of the Artificial Intelligence in Education - 27th International Conference, 2026

AutoSCORE: Enhancing Automated Scoring with Multi-Agent Large Language Models via Structured Component Recognition.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Investigating CoT Monitorability in Large Reasoning Models.
CoRR, November, 2025

Soundness-Aware Level: A Microscopic Signature that Predicts LLM Reasoning Potential.
CoRR, October, 2025

LMOD+: A Comprehensive Multimodal Dataset and Benchmark for Developing and Evaluating Multimodal Large Language Models in Ophthalmology.
CoRR, September, 2025

Enhancing LLM Steering through Sparse Autoencoder-Based Vector Refinement.
CoRR, September, 2025

Is Long-to-Short a Free Lunch? Investigating Inconsistency and Reasoning Efficiency in LRMs.
CoRR, June, 2025

Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders.
CoRR, February, 2025

Self-Regularization with Latent Space Explanations for Controllable LLM-based Classification.
CoRR, February, 2025

LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Self-Regularization with Sparse Autoencoders for Controllable LLM-based Classification.
Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, V.2, 2025

Concept-Centric Token Interpretation for Vector-Quantized Generative Models.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Beyond Input Activations: Identifying Influential Latents by Gradient Sparse Autoencoders.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Artificial Intelligence Bias on English Language Learners in Automatic Scoring.
Proceedings of the Artificial Intelligence in Education - 26th International Conference, 2025

2024
DIRECT: Dual Interpretable Recommendation with Multi-aspect Word Attribution.
ACM Trans. Intell. Syst. Technol., October, 2024

Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic Scoring.
CoRR, 2024

Retrieval-Enhanced Knowledge Editing for Multi-Hop Question Answering in Language Models.
CoRR, 2024

Usable XAI: 10 Strategies Towards Exploiting Explainability in the LLM Era.
CoRR, 2024

Applying large language models and chain-of-thought for automatic scoring.
Comput. Educ. Artif. Intell., 2024

Could Small Language Models Serve as Recommenders? Towards Data-centric Cold-start Recommendation.
Proceedings of the ACM on Web Conference 2024, 2024

From Language Modeling to Instruction Following: Understanding the Behavior Shift in LLMs after Instruction Tuning.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question Answering.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

InFoBench: Evaluating Instruction Following Ability in Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
Towards Personalized Cold-Start Recommendation with Prompts.
CoRR, 2023

Artificial General Intelligence (AGI) for Education.
CoRR, 2023

Black-box Backdoor Defense via Zero-shot Image Purification.
CoRR, 2023

A Survey of Graph Prompting Methods: Techniques, Applications, and Challenges.
CoRR, 2023

NoPPA: Non-Parametric Pairwise Attention Random Walk Model for Sentence Representation.
CoRR, 2023

Matching Exemplar as Next Sentence Prediction (MeNSP): Zero-shot Prompt Learning for Automatic Scoring in Science Education.
CoRR, 2023

Black-box Backdoor Defense via Zero-shot Image Purification.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Matching Exemplar as Next Sentence Prediction (MeNSP): Zero-Shot Prompt Learning for Automatic Scoring in Science Education.
Proceedings of the Artificial Intelligence in Education - 24th International Conference, 2023

2021
Rethinking the Impacts of Overfitting and Feature Quality on Small-scale Video Classification.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021


  Loading...