Yu Xia

ORCID: 0009-0003-9800-1051

Affiliations:
  • University of California San Diego, CA, USA


According to our database, Yu Xia authored at least 34 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Learning to Hint for Reinforcement Learning.
CoRR, April, 2026

Evaluation on Entity Matching in Recommender Systems.
CoRR, January, 2026

Multi-Agent Collaborative Filtering: Orchestrating Users and Items for Agentic Recommendations.
Proceedings of the ACM Web Conference 2026, 2026


2025
Simultaneous Multi-objective Alignment Across Verifiable and Non-verifiable Rewards.
CoRR, October, 2025

Pluralistic Off-policy Evaluation and Alignment.
CoRR, September, 2025

DICE: Dynamic In-Context Example Selection in LLM Agents via Efficient Knowledge Transfer.
CoRR, July, 2025

CachePrune: Neural-Based Attribution Defense Against Indirect Prompt Injection Attacks.
CoRR, April, 2025

In-context Ranking Preference Optimization.
CoRR, April, 2025

From Reviews to Dialogues: Active Synthesis for Zero-Shot LLM-based Conversational Recommender System.
CoRR, April, 2025

A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models.
CoRR, April, 2025

Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models.
CoRR, March, 2025


Knowledge-Aware Query Expansion with Large Language Models for Textual and Relational Retrieval.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

OCEAN: Offline Chain-of-thought Evaluation and Alignment in Large Language Models.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

SAND: Boosting LLM Agents with Self-Taught Action Deliberation.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

Mitigating Visual Knowledge Forgetting in MLLM Instruction-tuning via Modality-decoupled Gradient Descent.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2025, 2025

Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs.
Proceedings of the 31st International Conference on Computational Linguistics, 2025

Embedding-Informed Adaptive Retrieval-Augmented Generation of Large Language Models.
Proceedings of the 31st International Conference on Computational Linguistics, 2025


Doc-React: Multi-page Heterogeneous Document Question-answering.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2025


2024
Toward joint utilization of absolute and relative bandit feedback for conversational recommendation.
User Model. User Adapt. Interact., November, 2024

GUI Agents: A Survey.
CoRR, 2024

Personalized Multimodal Large Language Models: A Survey.
CoRR, 2024

A Survey of Small Language Models.
CoRR, 2024

Federated Large Language Models: Current Progress and Future Directions.
CoRR, 2024

Visual Prompting in Multimodal Large Language Models: A Survey.
CoRR, 2024

Which LLM to Play? Convergence-Aware Online Model Selection with Time-Increasing Bandits.
Proceedings of the ACM on Web Conference 2024, 2024

The Closeness of In-Context Learning and Weight Shifting for Softmax Regression.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Hallucination Diversity-Aware Active Learning for Text Summarization.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Aligning as Debiasing: Causality-Aware Alignment via Reinforcement Learning with Interventional Feedback.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

2023
The Closeness of In-Context Learning and Weight Shifting for Softmax Regression.
CoRR, 2023

User-Regulation Deconfounded Conversational Recommender System with Bandit Feedback.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

