Yinglun Xu

According to our database1, Yinglun Xu authored at least 12 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Learning a Pessimistic Reward Model in RLHF.
CoRR, May, 2025

Improving Assembly Code Performance with Large Language Models via Reinforcement Learning.
CoRR, May, 2025

Two-Step Offline Preference-Based Reinforcement Learning on Explicitly Constrained Policies.
Trans. Mach. Learn. Res., 2025

Universal Black-Box Targeted Reward Poisoning Attack Against Online Deep Reinforcement Learning.
Trans. Mach. Learn. Res., 2025

2024
Robust Thompson Sampling Algorithms Against Reward Poisoning Attacks.
CoRR, 2024

Optimal Reward Labeling: Bridging Offline Preference and Reward-Based Reinforcement Learning.
CoRR, 2024

Reward Poisoning Attack Against Offline Reinforcement Learning.
CoRR, 2024

Efficient Two-Phase Offline Deep Reinforcement Learning from Preference Feedback.
CoRR, 2024

2023
Efficient Reward Poisoning Attacks on Online Deep Reinforcement Learning.
Trans. Mach. Learn. Res., 2023

On the Robustness of Epoch-Greedy in Multi-Agent Contextual Bandit Mechanisms.
CoRR, 2023

Black-Box Targeted Reward Poisoning Attack Against Online Deep Reinforcement Learning.
CoRR, 2023

2021
Observation-Free Attacks on Stochastic Bandits.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021


  Loading...