Yu Fu

Orcid: 0009-0000-2943-2411

Affiliations:
  • University of California Riverside (UCR), Department of Computer Science and Engineering, CA, USA


According to our database1, Yu Fu authored at least 13 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
Reducing the Safety Tax in LLM Safety Alignment with On-Policy Self-Distillation.
CoRR, May, 2026

VLMs Need Words: Vision Language Models Ignore Visual Detail In Favor of Semantic Anchors.
CoRR, April, 2026

Is Reasoning Capability Enough for Safety in Long-Context Language Models?
CoRR, February, 2026

Thinking Out of Order: When Output Order Stops Reflecting Reasoning Order in Diffusion Language Models.
CoRR, January, 2026

Harnessing the Unseen: The Hidden Influence of Intrinsic Knowledge in Long-Context Language Models.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
TRAWL: Tensor Reduced and Approximated Weights for Large Language Models.
Proceedings of the Data Science: Foundations and Applications, 2025

Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasoning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Cross-Task Defense: Instruction-Tuning LLMs for Content Safety.
CoRR, 2024

Safety Alignment in NLP Tasks: Weakly Aligned Summarization as an In-Context Attack.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Vulnerabilities of Large Language Models to Adversarial Attacks.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 5: Tutorial Abstracts), 2024

Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Survey of Vulnerabilities in Large Language Models Revealed by Adversarial Attacks.
CoRR, 2023

Inverse Reinforcement Learning for Text Summarization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023


  Loading...