Yuhui Wang

Orcid: 0000-0002-0502-7486

Affiliations:
  • King Abdullah University of Science and Technology, Saudi Arabia
  • Nanjing University of Aeronautics & Astronautics, College of Automation Engineering, China (former)


According to our database1, Yuhui Wang authored at least 23 papers between 2016 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Directly Forecasting Belief for Reinforcement Learning with Delays.
CoRR, May, 2025

Highly valued subgoal generation for efficient goal-conditioned reinforcement learning.
Neural Networks, 2025

Mindstorms in Natural Language-Based Societies of Mind.
Comput. Vis. Media, 2025

2024
Scaling Value Iteration Networks to 5000 Layers for Extreme Long-Term Planning.
CoRR, 2024

Highway Reinforcement Learning.
CoRR, 2024

Variational Delayed Policy Optimization.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Highway Value Iteration Networks.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multiagent Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst., 2023

Guiding Online Reinforcement Learning with Action-Free Offline Pretraining.
CoRR, 2023

Learning to Identify Critical States for Reinforcement Learning from Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
Alleviating the estimation bias of deep deterministic policy gradient via co-regularization.
Pattern Recognit., 2022

A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

2021
A Cooperative-Competitive Multi-Agent Framework for Auto-bidding in Online Advertising.
CoRR, 2021

Greedy Multi-step Off-Policy Reinforcement Learning.
CoRR, 2021

Deep Recurrent Belief Propagation Network for POMDPs.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
ACRM: Attention Cascade R-CNN with Mix-NMS for Metallic Surface Defect Detection.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Pornographic Image Recognition via Weighted Multiple Instance Learning.
IEEE Trans. Cybern., 2019

Robust Reinforcement Learning in POMDPs with Incomplete and Noisy Observations.
CoRR, 2019

Truly Proximal Policy Optimization.
Proceedings of the Thirty-Fifth Conference on Uncertainty in Artificial Intelligence, 2019

Trust Region-Guided Proximal Policy Optimization.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2016
Pornographic image recognition by strongly-supervised deep multiple instance learning.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016


  Loading...