Yinglun Xu

According to our database1, Yinglun Xu authored at least 10 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Learning a Pessimistic Reward Model in RLHF.
CoRR, May, 2025

Improving Assembly Code Performance with Large Language Models via Reinforcement Learning.
CoRR, May, 2025

2024
Robust Thompson Sampling Algorithms Against Reward Poisoning Attacks.
CoRR, 2024

Optimal Reward Labeling: Bridging Offline Preference and Reward-Based Reinforcement Learning.
CoRR, 2024

Reward Poisoning Attack Against Offline Reinforcement Learning.
CoRR, 2024

Efficient Two-Phase Offline Deep Reinforcement Learning from Preference Feedback.
CoRR, 2024

2023
Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning.
Trans. Mach. Learn. Res., 2023

On the Robustness of Epoch-Greedy in Multi-Agent Contextual Bandit Mechanisms.
CoRR, 2023

Black-Box Targeted Reward Poisoning Attack Against Online Deep Reinforcement Learning.
CoRR, 2023

2021
Observation-Free Attacks on Stochastic Bandits.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021


  Loading...