We stand with Ukraine

We stand with Ukraine

Paul Weng

Orcid: 0000-0002-2008-4569

According to our database¹, Paul Weng authored at least 92 papers between 2005 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of three.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2025

OMP: One-step Meanflow Policy with Directional Alignment.

[DOI]

,

,

,

,

,

CoRR, December, 2025

Intervention Efficiency and Perturbation Validation Framework: Capacity-Aware and Robust Clinical Model Selection under the Rashomon Effect.

[DOI]

,

,

CoRR, November, 2025

Learning to Cut Generation in Branch-and-Cut Algorithms for Combinatorial Optimization.

[DOI]

Thi Quynh Trang Vo

,

,

Viet Hung Nguyen

,

ACM Trans. Evol. Learn. Optim., September, 2025

Generalizable Coarse-to-Fine Robot Manipulation via Language-Aligned 3D Keypoints.

[DOI]

,

,

,

,

,

,

CoRR, September, 2025

From fair solutions to compromise solutions in multi-objective deep reinforcement learning.

[DOI]

,

,

,

Neural Comput. Appl., August, 2025

State-novelty guided action persistence in deep reinforcement learning.

[DOI]

,

,

Mach. Learn., January, 2025

ASAP: Learning Generalizable Online Bin Packing via Adaptive Selection After Pruning.

[DOI]

,

,

CoRR, January, 2025

A Survey of Reinforcement Learning from Human Feedback.

[DOI]

,

,

,

Eyke Hüllermeier

Trans. Mach. Learn. Res., 2025

Understanding and Reducing the Class-Dependent Effects of Data Augmentation with A Two-Player Game Approach.

[DOI]

,

,

Trans. Mach. Learn. Res., 2025

Time Reversal Symmetry for Efficient Robotic Manipulations in Deep Reinforcement Learning.

[DOI]

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

Progressive Learning with Human Feedback for Personalized Adaptive Video Streaming.

[DOI]

,

,

,

,

,

Proceedings of the 33rd ACM International Conference on Multimedia, 2025

Comparing Comparisons: Informative and Easy Human Feedback with Distinguishability Queries.

[DOI]

,

,

,

Eyke Hüllermeier

,

,

Proceedings of the Forty-second International Conference on Machine Learning, 2025

Reinforcement Learning from Imperfect Corrective Actions and Proxy Rewards.

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

DUO: Diverse, Uncertain, On-Policy Query Generation and Selection for Reinforcement Learning from Human Feedback.

[DOI]

,

,

,

,

Eyke Hüllermeier

,

,

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

Enhancing Online Reinforcement Learning with Meta-Learned Objective from Offline Data.

[DOI]

,

,

,

,

Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024

A survey on interpretable reinforcement learning.

[DOI]

,

,

Matthieu Zimmer

,

,

,

,

Mach. Learn., July, 2024

Generalization in Deep RL for TSP Problems via Equivariance and Local Search.

[DOI]

,

,

,

SN Comput. Sci., April, 2024

Imitation Learning from Suboptimal Demonstrations via Meta-Learning An Action Ranker.

[DOI]

,

,

,

,

CoRR, 2024

Enhancing Class Fairness in Classification with A Two-Player Game Approach.

[DOI]

,

,

CoRR, 2024

INViT: A Generalizable Routing Problem Solver with Invariant Nested View Transformer.

[DOI]

,

,

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Revisiting Data Augmentation in Deep Reinforcement Learning.

[DOI]

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

Differentiable Logic Machines.

[DOI]

Matthieu Zimmer

,

,

,

,

,

,

,

,

Trans. Mach. Learn. Res., 2023

Analytics and machine learning in vehicle routing research.

[DOI]

,

,

,

,

,

,

,

,

,

,

,

,

,

,

,

Int. J. Prod. Res., 2023

Unsupervised Salient Patch Selection for Data-Efficient Reinforcement Learning.

[DOI]

,

Proceedings of the Machine Learning and Knowledge Discovery in Databases: Research Track, 2023

Improving Subtour Elimination Constraint Generation in Branch-and-Cut Algorithms for the TSP with Machine Learning.

[DOI]

Thi Quynh Trang Vo

,

,

Viet Hung Nguyen

,

Proceedings of the Learning and Intelligent Optimization - 17th International Conference, 2023

Fair Deep Reinforcement Learning with Preferential Treatment.

[DOI]

,

,

Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

Fair Deep Reinforcement Learning with Generalized Gini Welfare Functions.

[DOI]

,

,

Proceedings of the Autonomous Agents and Multiagent Systems. Best and Visionary Papers - AAMAS 2023 Workshops, London, UK, May 29, 2023

Learning Rewards to Optimize Global Performance Metrics in Deep Reinforcement Learning.

[DOI]

,

,

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

2022

A comparative study of linearization methods for Ordered Weighted Average.

[DOI]

Thi Quynh Trang Vo

,

,

Viet Hung Nguyen

,

Proceedings of the 12th International Workshop on Resilient Networks Design and Modeling, 2022

Neuro-Symbolic Hierarchical Rule Induction.

[DOI]

,

,

,

,

Matthieu Zimmer

,

,

,

Proceedings of the International Conference on Machine Learning, 2022

Planning with Q-Values in Sparse Reward Reinforcement Learning.

[DOI]

,

,

,

Proceedings of the Intelligent Robotics and Applications - 15th International Conference, 2022

Solving Complex Manipulation Tasks with Model-Assisted Model-Free Reinforcement Learning.

[DOI]

,

Proceedings of the Conference on Robot Learning, 2022

CVaR-Regret Bounds for Multi-armed Bandits.

[DOI]

,

Proceedings of the Asian Conference on Machine Learning, 2022

2021

Hyperparameter Auto-Tuning in Self-Supervised Robotic Learning.

[DOI]

,

,

Matthieu Zimmer

,

,

,

IEEE Robotics Autom. Lett., 2021

Neuro-Symbolic Hierarchical Rule Induction.

[DOI]

,

,

,

,

Matthieu Zimmer

,

,

CoRR, 2021

Learning Symbolic Rules for Interpretable Deep Reinforcement Learning.

[DOI]

,

,

,

Hankz Hankui Zhuo

,

,

,

CoRR, 2021

Improving Generalization of Deep Reinforcement Learning-based TSP Solvers.

[DOI]

,

,

,

,

Proceedings of the IEEE Symposium Series on Computational Intelligence, 2021

Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning.

[DOI]

Matthieu Zimmer

,

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

Safe Distributional Reinforcement Learning.

[DOI]

,

Proceedings of the Distributed Artificial Intelligence - Third International Conference, 2021

2020

Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement Learning.

[DOI]

,

,

Matthieu Zimmer

,

,

,

IEEE Robotics Autom. Lett., 2020

Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning.

[DOI]

Matthieu Zimmer

,

,

CoRR, 2020

Learning Fair Policies in Multiobjective (Deep) Reinforcement Learning with Average and Discounted Rewards.

[DOI]

,

,

Matthieu Zimmer

CoRR, 2020

Reinforcement Learning.

[DOI]

,

Olivier Pietquin

,

CoRR, 2020

Learning Fair Policies in Multi-Objective (Deep) Reinforcement Learning with Average and Discounted Rewards.

[DOI]

,

,

Matthieu Zimmer

Proceedings of the 37th International Conference on Machine Learning, 2020

Decomposed Deep Reinforcement Learning for Robotic Control.

[DOI]

,

,

,

,

,

Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Reinforcement Learning.

[DOI]

,

Olivier Pietquin

,

Proceedings of the A Guided Tour of Artificial Intelligence Research: Volume I: Knowledge Representation, 2020

2019

Towards More Sample Efficiency in Reinforcement Learning with Data Augmentation.

[DOI]

,

,

Matthieu Zimmer

,

,

CoRR, 2019

Invariant Transform Experience Replay.

[DOI]

,

,

Matthieu Zimmer

,

,

CoRR, 2019

Fairness in Reinforcement Learning.

[DOI]

CoRR, 2019

Dual Graph Attention Networks for Deep Latent Representation of Multifaceted Social Effects in Recommender Systems.

[DOI]

,

,

,

,

,

,

Proceedings of the World Wide Web Conference, 2019

Dual Sequential Prediction Models Linking Sequential Recommendation and Information Dissemination.

[DOI]

,

,

,

,

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Exploiting the Sign of the Advantage Function to Learn Deterministic Policies in Continuous Domains.

[DOI]

Matthieu Zimmer

,

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

An efficient reinforcement learning algorithm for learning deterministic policies in continuous domains.

[DOI]

Matthieu Zimmer

,

Proceedings of the First International Conference on Distributed Artificial Intelligence, 2019

2018

Hierarchical Electric Vehicle Charging Aggregator Strategy Using Dantzig-Wolfe Decomposition.

[DOI]

,

,

,

Orkun Karabasoglu

,

IEEE Des. Test, 2018

Representing Relative Visual Attributes with a Reference-Point-Based Decision Model.

[DOI]

,

Proceedings of the 24th International Conference on Pattern Recognition, 2018

Mediation of Debates with Dynamic Argumentative Behaviors.

[DOI]

Emmanuel Hadoux

,

Aurélie Beynier

,

,

Proceedings of the Computational Models of Argument, 2018

Adversarial Training Model Unifying Feature Driven and Point Process Perspectives for Event Popularity Prediction.

[DOI]

,

,

,

,

,

Proceedings of the 27th ACM International Conference on Information and Knowledge Management, 2018

2017

Functional Reward Markov Decision Processes: Theory and Applications.

[DOI]

,

Olivier Spanjaard

Int. J. Artif. Intell. Tools, 2017

Optimal Threshold Policies for Robust Data Center Control.

[DOI]

,

,

John A. W. B. Costanzo

,

,

CoRR, 2017

Multi-objective Bandits: Optimizing the Generalized Gini Index.

[DOI]

Róbert Busa-Fekete

,

Balázs Szörényi

,

,

Proceedings of the 34th International Conference on Machine Learning, 2017

An Efficient Primal-Dual Algorithm for Fair Combinatorial Optimization Problems.

[DOI]

Viet Hung Nguyen

,

Proceedings of the Combinatorial Optimization and Applications, 2017

Optimizing Quantiles in Preference-Based Markov Decision Processes.

[DOI]

,

,

Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016

Quantile Reinforcement Learning.

[DOI]

,

CoRR, 2016

Model-Free Reinforcement Learning with Skew-Symmetric Bilinear Utilities.

[DOI]

,

Bruno Zanuttini

,

,

Paolo Viappiani

,

Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016

From Preference-Based to Multiobjective Sequential Decision-Making.

[DOI]

Proceedings of the Multi-disciplinary Trends in Artificial Intelligence, 2016

Finding Risk-Averse Shortest Path with Time-Dependent Stochastic Costs.

[DOI]

,

,

Orkun Karabasoglu

Proceedings of the Multi-disciplinary Trends in Artificial Intelligence, 2016

2015

Reports of the AAAI 2014 Conference Workshops.

[DOI]

AI Mag., 2015

Optimization of Probabilistic Argumentation with Markov Decision Models.

[DOI]

Emmanuel Hadoux

,

Aurélie Beynier

,

,

,

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Solving MDPs with Skew Symmetric Bilinear Utility Functions.

[DOI]

,

Olivier Spanjaard

,

Paolo Viappiani

,

Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015

Qualitative Multi-Armed Bandits: A Quantile-Based Approach.

[DOI]

Balázs Szörényi

,

Róbert Busa-Fekete

,

,

Eyke Hüllermeier

Proceedings of the 32nd International Conference on Machine Learning, 2015

Reducing the Number of Queries in Interactive Value Iteration.

[DOI]

,

Olivier Spanjaard

,

Paolo Viappiani

,

Proceedings of the Algorithmic Decision Theory - 4th International Conference, 2015

2014

Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm.

[DOI]

Róbert Busa-Fekete

,

Balázs Szörényi

,

,

,

Eyke Hüllermeier

Mach. Learn., 2014

Solving Hidden-Semi-Markov-Mode Markov Decision Problems.

[DOI]

Emmanuel Hadoux

,

Aurélie Beynier

,

Proceedings of the Scalable Uncertainty Management - 8th International Conference, 2014

Preface.

[DOI]

Darius Braziunas

,

,

K. Brent Venable

,

,

Proceedings of the Multidisciplinary Workshop on Advances in Preference Handling, 2014

2013

A Compromise Programming Approach to multiobjective Markov Decision Processes.

[DOI]

Wlodzimierz Ogryczak

,

,

Int. J. Inf. Technol. Decis. Mak., 2013

Axiomatic Foundations of Generalized Qualitative Utility.

[DOI]

Proceedings of the Multi-disciplinary Trends in Artificial Intelligence, 2013

Markov Decision Processes with Functional Rewards.

[DOI]

Olivier Spanjaard

,

Proceedings of the Multi-disciplinary Trends in Artificial Intelligence, 2013

Interactive Value Iteration for Markov Decision Processes with Unknown Rewards.

[DOI]

,

Bruno Zanuttini

Proceedings of the IJCAI 2013, 2013

Top-k Selection based on Adaptive Sampling of Noisy Preferences.

[DOI]

Róbert Busa-Fekete

,

Balázs Szörényi

,

,

,

Eyke Hüllermeier

Proceedings of the 30th International Conference on Machine Learning, 2013

Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes.

[DOI]

,

,

,

Proceedings of the Late-Breaking Developments in the Field of Artificial Intelligence, 2013

2012

On WOWA Rank Reversal.

[DOI]

Wlodzimierz Ogryczak

,

,

Proceedings of the Modeling Decisions for Artificial Intelligence, 2012

Ordinal Decision Models for Markov Decision Processes.

[DOI]

Proceedings of the ECAI 2012, 2012

2011

On Minimizing Ordered Weighted Regrets in Multiobjective Markov Decision Processes.

[DOI]

Wlodzimierz Ogryczak

,

,

Proceedings of the Algorithmic Decision Theory - Second International Conference, 2011

Committee Selection with a Weight Constraint Based on a Pairwise Dominance Relation.

[DOI]

,

Olivier Spanjaard

,

Proceedings of the Algorithmic Decision Theory - Second International Conference, 2011

Markov Decision Processes with Ordinal Rewards: Reference Point-Based Preferences.

[DOI]

Proceedings of the 21st International Conference on Automated Planning and Scheduling, 2011

2010

On Finding Compromise Solutions in Multiobjective Markov Decision Processes.

[DOI]

,

Proceedings of the ECAI 2010, 2010

2007

Conditions générales pour l'admissibilité de la programmation dynamique dans la décision séquentielle possibiliste.

[DOI]

Rev. d'Intelligence Artif., 2007

2006

Processus de décision markoviens et préférences non classiques.

[DOI]

Rev. d'Intelligence Artif., 2006

Axiomatic Foundations for a Class of Generalized Expected Utility: Algebraic Expected Utility.

[DOI]

Proceedings of the UAI '06, 2006

An Axiomatic Approach in Qualitative Decision Theory with Binary Possibilistic Utility.

Proceedings of the ECAI 2006, 17th European Conference on Artificial Intelligence, August 29, 2006

2005

Qualitative Decision Making Under Possibilistic Uncertainty: Toward more Discriminating Criteria.

[DOI]

Proceedings of the UAI '05, 2005

Algebraic Markov Decision Processes.

[DOI]

,

Olivier Spanjaard

,

Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Loading...