Ping-Chun Hsieh

Kai Wang

CoRR, August, 2025

Plan2Align: Predictive Planning Based Test-Time Preference Alignment in Paragraph-Level Machine Translation.

CoRR, February, 2025

Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective.

CoRR, February, 2025

DDOT: A Derivative-Directed Dual-Decoder Ordinary Differential Equation Transformer for Dynamic System Modeling.

Proceedings of the Advances in Knowledge Discovery and Data Mining, 2025

Learning Human-Like RL Agents Through Trajectory Optimization With Action Quantization.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Action-Constrained Imitation Learning.

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Efficient Action-Constrained Reinforcement Learning via Acceptance-Rejection Method and Augmented MDPs.

Wei Hung

Shao-Hua Sun

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

BOFormer: Learning to Solve Multi-Objective Bayesian Optimization via Non-Markovian RL.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Extending Automatic Machine Translation Evaluation to Book-Length Documents.

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024

Diminishing Exploration: A Minimalist Approach to Piecewise Stationary Multi-Armed Bandits.

Kuan-Ta Li

Yu-Chih Huang

CoRR, 2024

Image Deraining via Self-supervised Reinforcement Learning.

CoRR, 2024

Offline Imitation of Badminton Player Behavior via Experiential Contexts and Brownian Motion.

Proceedings of the Machine Learning and Knowledge Discovery in Databases. Applied Data Science Track, 2024

Diffusion-Reward Adversarial Imitation Learning.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Enhancing Value Function Estimation through First-Order State-Action Dynamics in Offline Reinforcement Learning.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Accelerated Policy Gradient: On the Convergence Rates of the Nesterov Momentum for Reinforcement Learning.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

PPO-Clip Attains Global Optimality: Towards Deeper Understandings of Clipping.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Accelerated Policy Gradient: On the Nesterov Momentum for Reinforcement Learning.

Yen-Ju Chen

Nai-Chieh Huang

CoRR, 2023

Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement Learning in Discounted Linear MDPs.

CoRR, 2023

Revisiting Domain Randomization via Relaxed State-Adversarial Policy Optimization.

Yun-Hsuan Lien

Yu-Shuen Wang

Proceedings of the International Conference on Machine Learning, 2023

Q-Pensieve: Boosting Sample Efficiency of Multi-Objective RL Through Memory Sharing of Q-Snapshots.

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Coordinate Ascent for Off-Policy RL with Global Convergence Guarantees.

Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Towards Human-Like RL: Taming Non-Naturalistic Behavior in Deep RL via Adaptive Behavioral Costs in 3D Games.

Proceedings of the Asian Conference on Machine Learning, 2023

Reward-Biased Maximum Likelihood Estimation for Neural Contextual Bandits: A Distributional Learning Perspective.

Yu-Heng Hung

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Neural Contextual Bandits via Reward-Biased Maximum Likelihood Estimation.

Yu-Heng Hung

CoRR, 2022

Neural Frank-Wolfe Policy Optimization for Region-of-Interest Intra-Frame Coding with HEVC/H.265.

Proceedings of the IEEE International Conference on Visual Communications and Image Processing, 2022

Deterministic Bandwidth-Based Packet-Level Traffic Splitting for Datacenter Networks.

Proceedings of the 23rd Asia-Pacific Network Operations and Management Symposium, 2022

2021

Hinge Policy Optimization: Rethinking Policy Improvement and Reinterpreting PPO.

CoRR, 2021

Escaping from zero gradient: Revisiting action-constrained reinforcement learning via Frank-Wolfe policy optimization.

Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

NeurWIN: Neural Whittle Index Network For Restless Bandits Via Deep RL.

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Reinforced Few-Shot Acquisition Function Learning for Bayesian Optimization.

Bing-Jing Hsieh

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Optimal Wireless Scheduling for Remote Sensing through Brownian Approximation.

Daojing Guo

Proceedings of the 40th IEEE Conference on Computer Communications, 2021

Reward-Biased Maximum Likelihood Estimation for Linear Stochastic Bandits.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

Developing Multi-Task Recommendations with Long-Term Rewards via Policy Distilled Reinforcement Learning.

CoRR, 2020

Fresher content or smoother playback?: a Brownian-approximation framework for scheduling real-time wireless video streams.

Proceedings of the Mobihoc '20: The Twenty-first ACM International Symposium on Theory, 2020

Exploration Through Reward Biasing: Reward-Biased Maximum Likelihood Estimation for Stochastic Multi-Armed Bandits.

Proceedings of the 37th International Conference on Machine Learning, 2020

2019

Fresher Content or Smoother Playback? A Brownian-Approximation Framework for Real-Time Video Delivery in Wireless Networks.

CoRR, 2019

Bandit Learning Through Biased Maximum Likelihood Estimation.

CoRR, 2019

Real-Time Streaming Graph Embedding Through Local Actions.

Proceedings of the Companion of The 2019 World Wide Web Conference, 2019

Stay With Me: Lifetime Maximization Through Heteroscedastic Linear Bandits With Reneging.

Proceedings of the 36th International Conference on Machine Learning, 2019

2018

Streaming Network Embedding through Local Actions.

CoRR, 2018

Heteroscedastic Bandits with Reneging.

CoRR, 2018

PULS: Processor-Supported Ultra-Low Latency Scheduling.

Simon Yau

Rajarshi Bhattacharyya

Proceedings of the Nineteenth ACM International Symposium on Mobile Ad Hoc Networking and Computing, 2018

A Decentralized Medium Access Protocol for Real-Time Wireless Ad Hoc Networks With Unreliable Transmissions.

Proceedings of the 38th IEEE International Conference on Distributed Computing Systems, 2018

An Experimental Study on Coverage Enhancement of LTE Cat-M1 for Machine-Type Communication.

Proceedings of the 2018 IEEE International Conference on Communications, 2018

Safe Intersection Management for Mixed Transportation Systems with Human-Driven and Autonomous Vehicles.

P. R. Kumar

Proceedings of the 56th Annual Allerton Conference on Communication, 2018

2017

The capacity of QoE for wireless networks with unreliable transmissions.

Queueing Syst. Theory Appl., 2017

Delay-Optimal Scheduling for Queueing Systems with Switching Overhead.

CoRR, 2017

Throughput-Optimal Scheduling for Multi-Hop Networked Transportation Systems With Switch-Over Delay.

Proceedings of the 18th ACM International Symposium on Mobile Ad Hoc Networking and Computing, 2017

VisualLink: Strengthening the Connection between Hearing-impaired Elderly and their Family.

Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems, 2017

2016

Heavy-traffic analysis of QoE optimality for on-demand video streams over fading channels.

Proceedings of the 35th Annual IEEE International Conference on Computer Communications, 2016

2015

WiMAC: Rapid Implementation Platform for User Definable MAC Protocols Through Separation.

Proceedings of the 2015 ACM Conference on Special Interest Group on Data Communication, 2015

QoE-Optimal Scheduling for On-Demand Video Streams over Unreliable Wireless Networks.
