Ping-Chun Hsieh

Orcid: 0000-0002-2072-8950

According to our database1, Ping-Chun Hsieh authored at least 39 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Image Deraining via Self-supervised Reinforcement Learning.
CoRR, 2024

Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion.
CoRR, 2024

PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Accelerated Policy Gradient: On the Nesterov Momentum for Reinforcement Learning.
CoRR, 2023

Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement Learning in Discounted Linear MDPs.
CoRR, 2023

Towards Human-Like RL: Taming Non-Naturalistic Behavior in Deep RL via Adaptive Behavioral Costs in 3D Games.
CoRR, 2023

Revisiting Domain Randomization via Relaxed State-Adversarial Policy Optimization.
Proceedings of the International Conference on Machine Learning, 2023

Q-Pensieve: Boosting Sample Efficiency of Multi-Objective RL Through Memory Sharing of Q-Snapshots.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Reward-Biased Maximum Likelihood Estimation for Neural Contextual Bandits: A Distributional Learning Perspective.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Neural Contextual Bandits via Reward-Biased Maximum Likelihood Estimation.
CoRR, 2022

Neural Frank-Wolfe Policy Optimization for Region-of-Interest Intra-Frame Coding with HEVC/H.265.
Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

Deterministic Bandwidth-Based Packet-Level Traffic Splitting for Datacenter Networks.
Proceedings of the 23rd Asia-Pacific Network Operations and Management Symposium, 2022

2021
Hinge Policy Optimization: Rethinking Policy Improvement and Reinterpreting PPO.
CoRR, 2021

Escaping from zero gradient: Revisiting action-constrained reinforcement learning via Frank-Wolfe policy optimization.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Reinforced Few-Shot Acquisition Function Learning for Bayesian Optimization.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Optimal Wireless Scheduling for Remote Sensing through Brownian Approximation.
Proceedings of the 40th IEEE Conference on Computer Communications, 2021

Reward-Biased Maximum Likelihood Estimation for Linear Stochastic Bandits.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Developing Multi-Task Recommendations with Long-Term Rewards via Policy Distilled Reinforcement Learning.
CoRR, 2020

Fresher content or smoother playback?: a brownian-approximation framework for scheduling real-time wireless video streams.
Proceedings of the Mobihoc '20: The Twenty-first ACM International Symposium on Theory, 2020

Exploration Through Reward Biasing: Reward-Biased Maximum Likelihood Estimation for Stochastic Multi-Armed Bandits.
Proceedings of the 37th International Conference on Machine Learning, 2020

2019
Fresher Content or Smoother Playback? A Brownian-Approximation Framework for Real-Time Video Delivery in Wireless Networks.
CoRR, 2019

Bandit Learning Through Biased Maximum Likelihood Estimation.
CoRR, 2019

Real-Time Streaming Graph Embedding Through Local Actions.
Proceedings of the Companion of The 2019 World Wide Web Conference, 2019

Stay With Me: Lifetime Maximization Through Heteroscedastic Linear Bandits With Reneging.
Proceedings of the 36th International Conference on Machine Learning, 2019

2018
Heavy-Traffic Analysis of QoE Optimality for On-Demand Video Streams Over Fading Channels.
IEEE/ACM Trans. Netw., 2018

Streaming Network Embedding through Local Actions.
CoRR, 2018

Heteroscedastic Bandits with Reneging.
CoRR, 2018

PULS: Processor-Supported Ultra-Low Latency Scheduling.
Proceedings of the Nineteenth ACM International Symposium on Mobile Ad Hoc Networking and Computing, 2018

A Decentralized Medium Access Protocol for Real-Time Wireless Ad Hoc Networks With Unreliable Transmissions.
Proceedings of the 38th IEEE International Conference on Distributed Computing Systems, 2018

An Experimental Study on Coverage Enhancement of LTE Cat-M1 for Machine-Type Communication.
Proceedings of the 2018 IEEE International Conference on Communications, 2018

Safe Intersection Management for Mixed Transportation Systems with Human-Driven and Autonomous Vehicles.
Proceedings of the 56th Annual Allerton Conference on Communication, 2018

2017
The capacity of QoE for wireless networks with unreliable transmissions.
Queueing Syst. Theory Appl., 2017

Delay-Optimal Scheduling for Queueing Systems with Switching Overhead.
CoRR, 2017

Throughput-Optimal Scheduling for Multi-Hop Networked Transportation Systems With Switch-Over Delay.
Proceedings of the 18th ACM International Symposium on Mobile Ad Hoc Networking and Computing, 2017

VisualLink: Strengthening the Connection between Hearing-impaired Elderly and their Family.
Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 2017

2015
WiMAC: Rapid Implementation Platform for User Definable MAC Protocols Through Separation.
Proceedings of the 2015 ACM Conference on Special Interest Group on Data Communication, 2015

QoE-Optimal Scheduling for On-Demand Video Streams over Unreliable Wireless Networks.
Proceedings of the 16th ACM International Symposium on Mobile Ad Hoc Networking and Computing, 2015


  Loading...