Honghao Wei

Orcid: 0000-0002-1131-326X

According to our database1, Honghao Wei authored at least 32 papers between 2014 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Scalable and Sample Efficient Distributed Policy Gradient Algorithms in Multi-Agent Networked Systems.
IEEE Trans. Netw., 2026

Towards Fast Safe Online Reinforcement Learning via Policy Finetuning.
Trans. Mach. Learn. Res., 2026

Safe Reinforcement Learning for Trustworthy AI: Theory, Algorithms, and Applications.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
An Optimistic Algorithm for online CMDPS with Anytime Adversarial Constraints.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

DATA: Domain-And-Time Alignment for High-Quality Feature Fusion in Collaborative Perception.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object Detection.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Constraint-Adaptive Policy Switching for Offline Safe Reinforcement Learning.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
Principle, Design, and Analysis of a Novel Discrete Pulse Control for Single-Phase Voltage Source Inverter.
IEEE Trans. Ind. Electron., June, 2024

A Reinforcement Learning and Prediction-Based Lookahead Policy for Vehicle Repositioning in Online Ride-Hailing Systems.
IEEE Trans. Intell. Transp. Syst., February, 2024

Marvel: Accelerating Safe Online Reinforcement Learning with Finetuned Offline Policy.
CoRR, 2024

Enhancing Safety in Reinforcement Learning with Human Feedback via Rectified Policy Optimization.
CoRR, 2024

Adversarially Trained Actor Critic for offline CMDPs.
CoRR, 2024

Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis.
RLJ, 2024

Safe and Efficient: A Primal-Dual Method for Offline Convex CMDPs under Partial Data Coverage.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Optimistic Joint Flow Control and Link Scheduling with Unknown Utility Functions.
Proceedings of the Twenty-fifth International Symposium on Theory, 2024

Safe Reinforcement Learning with Instantaneous Constraints: The Role of Aggressive Exploration.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
A Discrete Average Current Mode Control CCM Boost PFC Converter With Hybrid Pulse Train Modulation and Dual Edge Modulation.
IEEE Trans. Ind. Electron., October, 2023

Model-Free, Regret-Optimal Best Policy Identification in Online CMDPs.
CoRR, 2023

Sample Efficient Reinforcement Learning in Mixed Systems through Augmented Samples and Its Applications to Queueing Networks.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Provably Efficient Model-Free Algorithms for Non-stationary CMDPs.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Online Convex Optimization with Hard Constraints: Towards the Best of Two Worlds and Beyond.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

On low-complexity quickest intervention of mutated diffusion processes through local approximation.
Proceedings of the MobiHoc '22: The Twenty-third International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing, Seoul, Republic of Korea, October 17, 2022

Triple-Q: A Model-Free Algorithm for Constrained Reinforcement Learning with Sublinear Regret and Zero Constraint Violation.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2022

A Provably-Efficient Model-Free Algorithm for Infinite-Horizon Average-Reward Constrained Markov Decision Processes.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
A Provably-Efficient Model-Free Algorithm for Constrained Markov Decision Processes.
CoRR, 2021

FORK: A FORward-looKing Actor for Model-Free Reinforcement Learning.
Proceedings of the 2021 60th IEEE Conference on Decision and Control (CDC), 2021

2019
QuickStop: A Markov Optimal Stopping Approach for Quickest Misinformation Detection.
Proceedings of the Abstracts of the 2019 SIGMETRICS/Performance Joint International Conference on Measurement and Modeling of Computer Systems, 2019

2017
Beyond the Words: Predicting User Personality from Heterogeneous Information.
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017

2016
Immersive Recommendation: News and Event Recommendations Using Personal Digital Traces.
Proceedings of the 25th International Conference on World Wide Web, 2016

GroupLink: Group Event Recommendations Using Personal Digital Traces.
Proceedings of the 19th ACM Conference on Computer Supported Cooperative Work and Social Computing, 2016

2014
Predicting Health Care Risk with Big Data Drawn from Clinical Physiological Parameters.
Proceedings of the Social Media Processing - Third National Conference, 2014


  Loading...