Jianye Hao

ORCID: 0000-0002-0422-8235

According to our database, Jianye Hao authored at least 306 papers between 2007 and 2024.

Bibliography

2024
Learning from Hierarchical Structure of Knowledge Graph for Recommendation.
ACM Trans. Inf. Syst., January, 2024

Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning.
Artif. Intell., January, 2024

SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models.
CoRR, 2024

Reinforced In-Context Black-Box Optimization.
CoRR, 2024

Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models.
CoRR, 2024

MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint.
CoRR, 2024

Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback.
CoRR, 2024

DiffuserLite: Towards Real-time Diffusion Planning.
CoRR, 2024

LLM4EDA: Emerging Progress in Large Language Models for Electronic Design Automation.
CoRR, 2024

Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey.
CoRR, 2024

Machine Learning Insides OptVerse AI Solver: Design Principles and Applications.
CoRR, 2024

PreRoutGNN for Timing Prediction with Order Preserving Partition: Global Circuit Pre-training, Local Delay Learning and Attentional Cell Modeling.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

A Transfer Approach Using Graph Neural Networks in Deep Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Multiagent Gumbel MuZero: Efficient Planning in Combinatorial Action Spaces.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Debiased Recommendation with User Feature Balancing.
ACM Trans. Inf. Syst., October, 2023

Empirical Policy Optimization for n-Player Markov Games.
IEEE Trans. Cybern., October, 2023

ASN: action semantics network for multiagent reinforcement learning.
Auton. Agents Multi Agent Syst., October, 2023

Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning.
IEEE Trans. Neural Networks Learn. Syst., August, 2023

Accelerating deep reinforcement learning via knowledge-guided policy network.
Auton. Agents Multi Agent Syst., June, 2023

A Unified Framework for Layout Pattern Analysis With Deep Causal Estimation.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., April, 2023

A benchmark for automatic medical consultation system: frameworks, tasks and datasets.
Bioinform., January, 2023

Contrastive-ACE: Domain Generalization Through Alignment of Causal Mechanisms.
IEEE Trans. Image Process., 2023

Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning.
CoRR, 2023

Rethinking Decision Transformer via Hierarchical Reinforcement Learning.
CoRR, 2023

AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model.
CoRR, 2023

A Circuit Domain Generalization Framework for Efficient Logic Synthesis in Chip Design.
CoRR, 2023

Exploiting Counter-Examples for Active Learning with Partial Labels.
CoRR, 2023

VOLTA: Diverse and Controllable Question-Answer Pair Generation with Variational Mutual Information Maximizing Autoencoder.
CoRR, 2023

Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning.
CoRR, 2023

Hierarchical Task Network Planning for Facilitating Cooperative Multi-Agent Reinforcement Learning.
CoRR, 2023

MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL.
CoRR, 2023

Multi-agent Policy Reciprocity with Theoretical Guarantee.
CoRR, 2023

Traj-MAE: Masked Autoencoders for Trajectory Prediction.
CoRR, 2023

DR-Label: Improving GNN Models for Catalysis Systems by Label Deconstruction and Reconstruction.
CoRR, 2023

The Ladder in Chaos: A Simple and Effective Improvement to General DRL Algorithms by Policy Path Trimming and Boosting.
CoRR, 2023

Reweighted Interacting Langevin Diffusions: an Accelerated Sampling Method for Optimization.
CoRR, 2023

Breaking Filter Bubble: A Reinforcement Learning Framework of Controllable Recommender System.
Proceedings of the ACM Web Conference 2023, 2023

Uncertainty-aware Consistency Learning for Cold-Start Item Recommendation.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

RLMixer: A Reinforcement Learning Approach for Integrated Ranking with Contrastive User Preference Modeling.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2023

Transfer Reinforcement Learning Based Negotiating Agent Framework.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2023

Generalized Universal Domain Adaptation with Generative Flow Networks.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

T3S: Improving Multi-Task Reinforcement Learning with Task-Specific Feature Selector and Scheduler.
Proceedings of the International Joint Conference on Neural Networks, 2023

Generative Flow Networks for Precise Reward-Oriented Active Learning on Graphs.
Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, 2023

Achieving Last-Mile Functional Coverage in Testing Chip Design Software Implementations.
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: Software Engineering in Practice, 2023

MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL.
Proceedings of the International Conference on Machine Learning, 2023

RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolution.
Proceedings of the International Conference on Machine Learning, 2023

ChiPFormer: Transferable Chip Placement via Offline Decision Transformer.
Proceedings of the International Conference on Machine Learning, 2023

EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Out-of-distribution Detection with Implicit Outlier Transformation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Regularized Offline GFlowNets.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

CFlowNets: Continuous Control with Generative Flow Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

GFlowNets with Human Feedback.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

DAG Matters! GFlowNets Enhanced Explainer for Graph Neural Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Learnable Behavior Control: Breaking Atari Human World Records via Sample-Efficient Behavior Selection.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Traj-MAE: Masked Autoencoders for Trajectory Prediction.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

EasySO: Exploration-enhanced Reinforcement Learning for Logic Synthesis Sequence Optimization and a Comprehensive RL Environment.
Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

EasyMap: Improving Technology Mapping via Exploration-Enhanced Heuristics and Adaptive Sequencing.
Proceedings of the IEEE/ACM International Conference on Computer Aided Design, 2023

Limited Information Opponent Modeling.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2023, 2023

BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

TOFU: A Two-Step Floorplan Refinement Framework for Whitespace Reduction.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2023

RITA: Boost Driving Simulators with Realistic Interactive Traffic Flow.
Proceedings of the Fifth International Conference on Distributed Artificial Intelligence, 2023

Co-speech Gesture Synthesis by Reinforcement Learning with Contrastive Pretrained Rewards.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Dual-Process Graph Neural Network for Diversified Recommendation.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

A Hierarchical Imitation Learning-based Decision Framework for Autonomous Driving.
Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, 2023

Transfer Learning based Agent for Automated Negotiation.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Off-Beat Multi-Agent Reinforcement Learning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Spectral Augmentations for Graph Contrastive Learning.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Structure Aware Incremental Learning with Personalized Imitation Weights for Recommender Systems.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

Neighbor Auto-Grouping Graph Neural Networks for Handover Parameter Configuration in Cellular Network.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

SplitNet: A Reinforcement Learning Based Sequence Splitting Method for the MinMax Multiple Travelling Salesman Problem.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Learning to select cuts for efficient mixed-integer programming.
Pattern Recognit., 2022

SCC-rFMQ: a multiagent reinforcement learning method in cooperative Markov games with continuous actions.
Int. J. Mach. Learn. Cybern., 2022

Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents.
Frontiers Inf. Technol. Electron. Eng., 2022

HEBO: An Empirical Study of Assumptions in Bayesian Optimisation.
J. Artif. Intell. Res., 2022

Transformer in Transformer as Backbone for Deep Reinforcement Learning.
CoRR, 2022

Planning Immediate Landmarks of Targets for Model-Free Skill Transfer across Agents.
CoRR, 2022

State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning.
CoRR, 2022

Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning.
CoRR, 2022

RITA: Boost Autonomous Driving Simulators with Realistic Interactive Traffic Flow.
CoRR, 2022

ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation.
CoRR, 2022

PTDE: Personalized Training with Distillated Execution for Multi-Agent Reinforcement Learning.
CoRR, 2022

GFlowCausal: Generative Flow Networks for Causal Discovery.
CoRR, 2022

Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning.
CoRR, 2022

On the Convergence Theory of Meta Reinforcement Learning with Personalized Policies.
CoRR, 2022

Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes.
CoRR, 2022

PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration.
CoRR, 2022

API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks.
CoRR, 2022

Generalizable Information Theoretic Causal Representation.
CoRR, 2022

Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization.
CoRR, 2022

Introduction to The Dynamic Pickup and Delivery Problem Benchmark - ICAPS 2021 Competition.
CoRR, 2022

Debiased Recommendation with User Feature Balancing.
CoRR, 2022

A review and performance evaluation of clustering frameworks for single-cell Hi-C data.
Briefings Bioinform., 2022

Modeling Scale-free Graphs with Hyperbolic Geometry for Knowledge-aware Recommendation.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Cross-domain adaptive transfer reinforcement learning based on state-action correspondence.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-Based Policy Learning.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2022

Versatile Multi-stage Graph Neural Network for Circuit Representation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Plan To Predict: Learning an Uncertainty-Foreseeing Model For Model-Based Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Multiagent Q-learning with Sub-Team Coordination.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

The Policy-gradient Placement and Generative Routing Neural Networks for Chip Design.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

GALOIS: Boosting Deep Reinforcement Learning via Generalizable Logic Synthesis.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Flat-Aware Cross-Stage Distilled Framework for Imbalanced Medical Image Classification.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

Generalizable Floorplanner through Corner Block List Representation and Hypergraph Embedding.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

RCANet: Root Cause Analysis via Latent Variable Interaction Modeling for Yield Improvement.
Proceedings of the IEEE International Test Conference, 2022

PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Promoting Quality and Diversity in Population-based Reinforcement Learning via Hierarchical Trajectory Space Exploration.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Individual Reward Assisted Multi-Agent Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization.
Proceedings of the International Conference on Machine Learning, 2022

PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration.
Proceedings of the International Conference on Machine Learning, 2022

Learning Pseudometric-based Action Representations for Offline Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

Neuro-Symbolic Hierarchical Rule Induction.
Proceedings of the International Conference on Machine Learning, 2022

Learning State Representations via Retracing in Reinforcement Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Online Ad Hoc Teamwork under Partial Observability.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Invariant Factor Graph Neural Networks.
Proceedings of the IEEE International Conference on Data Mining, 2022

Heterogeneous Graph Neural Network-Based Imitation Learning for Gate Sizing Acceleration.
Proceedings of the 41st IEEE/ACM International Conference on Computer-Aided Design, 2022

Batch Sequential Black-Box Optimization with Embedding Alignment Cells for Logic Synthesis.
Proceedings of the 41st IEEE/ACM International Conference on Computer-Aided Design, 2022

Efficient Dual-Process Cognitive Recommender Balancing Accuracy and Diversity.
Proceedings of the Database Systems for Advanced Applications, 2022

Efficient Deep Reinforcement Learning via Policy-Extended Successor Feature Approximator.
Proceedings of the Distributed Artificial Intelligence - 4th International Conference, 2022

LHNN: lattice hypergraph neural network for VLSI congestion prediction.
Proceedings of the DAC '22: 59th ACM/IEEE Design Automation Conference, San Francisco, California, USA, July 10, 2022

Socially-Attentive Policy Optimization in Multi-Agent Self-Driving System.
Proceedings of the Conference on Robot Learning, 2022

Multiagent Q-learning with Sub-Team Coordination.
Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, 2022

What about Inputting Policy in Value Function: Policy Representation and Policy-Extended Value Function Approximator.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Falsification of Cyber-Physical Systems Using Deep Reinforcement Learning.
IEEE Trans. Software Eng., 2021

Generalized Centered 2-D Principal Component Analysis.
IEEE Trans. Cybern., 2021

SC2disease: a manually curated database of single-cell transcriptome for human diseases.
Nucleic Acids Res., 2021

A Survey on Interpretable Reinforcement Learning.
CoRR, 2021

ED2: An Environment Dynamics Decomposition Framework for World Model Construction.
CoRR, 2021

Lifelong Reinforcement Learning with Temporal Logic Formulas and Reward Machines.
CoRR, 2021

Ranking Cost: Building An Efficient and Scalable Circuit Routing Planner with Evolution-Based Optimization.
CoRR, 2021

Exploration in Deep Reinforcement Learning: A Comprehensive Survey.
CoRR, 2021

Contrastive ACE: Domain Generalization Through Alignment of Causal Mechanisms.
CoRR, 2021

Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment.
CoRR, 2021

Learning Symbolic Rules for Interpretable Deep Reinforcement Learning.
CoRR, 2021

Differentiable Logic Machines.
CoRR, 2021

Integrating multi-network topology for gene function prediction using deep neural networks.
Briefings Bioinform., 2021

An end-to-end heterogeneous graph representation learning-based framework for drug-target interaction prediction.
Briefings Bioinform., 2021

Efficient policy detecting and reusing for non-stationarity in Markov games.
Auton. Agents Multi Agent Syst., 2021

An Adversarial Imitation Click Model for Information Retrieval.
Proceedings of the WWW '21: The Web Conference 2021, 2021

A Graph-Enhanced Click Model for Web Search.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Detecting and Learning Against Unknown Opponents for Automated Negotiations.
Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021

Off-Policy Training for Truncated TD(λ) Boosted Soft Actor-Critic.
Proceedings of the PRICAI 2021: Trends in Artificial Intelligence, 2021

An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Adaptive Online Packing-guided Search for POMDPs.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Model-Based Reinforcement Learning via Imagination with Derived Memory.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dynamic Bottleneck for Robust Self-Supervised Exploration.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

A Multi-Graph Attributed Reinforcement Learning based Optimization Algorithm for Large-scale Hybrid Flow Shop Scheduling Problem.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

FIGCPS: Effective Failure-inducing Input Generation for Cyber-Physical Systems with Deep Reinforcement Learning.
Proceedings of the 36th IEEE/ACM International Conference on Automated Software Engineering, 2021

Ordering-Based Causal Discovery with Reinforcement Learning.
Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, 2021

A deep reinforcement learning-based agent for negotiation with multiple communication channels.
Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence, 2021

Automatic Web Testing Using Curiosity-Driven Reinforcement Learning.
Proceedings of the 43rd IEEE/ACM International Conference on Software Engineering, 2021

Relational Navigation Learning in Continuous Action Space among Crowds.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Principled Exploration via Optimistic Bootstrapping and Backward Induction.
Proceedings of the 38th International Conference on Machine Learning, 2021

Coalition-based Task Assignment in Spatial Crowdsourcing.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Uncertainty-Aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning.
Proceedings of the Distributed Artificial Intelligence - Third International Conference, 2021

SEIHAI: A Sample-Efficient Hierarchical AI for the MineRL Competition.
Proceedings of the Distributed Artificial Intelligence - Third International Conference, 2021

CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

CMML: Contextual Modulation Meta Learning for Cold-Start Recommendation.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

Addressing Action Oscillations through Learning Policy Inertia.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Efficient Multiagent Policy Optimization Based on Weighted Estimators in Stochastic Cooperative Environments.
J. Comput. Sci. Technol., 2020

HEBO: Heteroscedastic Evolutionary Bayesian Optimisation.
CoRR, 2020

Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning.
CoRR, 2020

SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving.
CoRR, 2020

What About Taking Policy as Input of Value Function: Policy-extended Value Function Approximator.
CoRR, 2020

Event-Triggered Multi-agent Reinforcement Learning with Communication under Limited-bandwidth Constraint.
CoRR, 2020

Dynamic Horizon Value Estimation for Model-based Reinforcement Learning.
CoRR, 2020

CausalVAE: Structured Causal Disentanglement in Variational Autoencoder.
CoRR, 2020

Learning When to Transfer among Agents: An Efficient Multiagent Transfer Learning Framework.
CoRR, 2020

Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning.
CoRR, 2020

A Method for Deploying Distributed Denial of Service Attack Defense Strategies on Edge Servers Using Reinforcement Learning.
IEEE Access, 2020

Cross-data Automatic Feature Engineering via Meta-learning and Reinforcement Learning.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2020

Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

A Multi-Task Reinforcement Learning Approach for Navigating Unsignalized Intersections.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Efficient Deep Reinforcement Learning via Adaptive Policy Transfer.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Generating Behavior-Diverse Game AIs with Evolutionary Multi-Objective Deep Reinforcement Learning.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Triple-GAIL: A Multi-Modal Imitation Learning Framework with Generative Adversarial Nets.
Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020

Q-value Path Decomposition for Deep Multiagent Reinforcement Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising.
Proceedings of the 37th International Conference on Machine Learning, 2020

Action Semantics Network: Considering the Effects of Actions in Multiagent Systems.
Proceedings of the 8th International Conference on Learning Representations, 2020

An Empirical Study on Correlation between Coverage and Robustness for Deep Neural Networks.
Proceedings of the 25th International Conference on Engineering of Complex Computer Systems, 2020

Faster Convention Emergence by Avoiding Local Conventions in Reinforcement Social Learning.
Proceedings of the Artificial Intelligence and Soft Computing, 2020

MGHRL: Meta Goal-Generation for Hierarchical Reinforcement Learning.
Proceedings of the Distributed Artificial Intelligence - Second International Conference, 2020

Large Scale Deep Reinforcement Learning in War-games.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2020

Efficient Deep Reinforcement Learning through Policy Transfer.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

Mastering Basketball With Deep Reinforcement Learning: An Integrated Curriculum Training Approach.
Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, 2020

From Few to More: Large-Scale Dynamic Multiagent Curriculum Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Continuous Multiagent Control Using Collective Behavior Entropy for Large-Scale Home Energy Management.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Neighborhood Cognition Consistent Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Multi-Agent Game Abstraction via Graph Attention Neural Network.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
LoopFix: an approach to automatic repair of buggy loops.
J. Syst. Softw., 2019

There is Limited Correlation between Coverage and Robustness for Deep Neural Networks.
CoRR, 2019

Efficient meta reinforcement learning via meta goal generation.
CoRR, 2019

Spectral-based Graph Convolutional Network for Directed Graphs.
CoRR, 2019

Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction.
CoRR, 2019

Attention-based recurrent neural network for influenza epidemic prediction.
BMC Bioinform., 2019

Using deep reinforcement learning to speed up collective cell migration.
BMC Bioinform., 2019

A learning-based framework for miRNA-disease association identification using neural networks.
Bioinform., 2019

SA-IGA: a multiagent reinforcement learning method towards socially optimal outcomes.
Auton. Agents Multi Agent Syst., 2019

An Efficient Handover Authentication Mechanism for 5G Wireless Network.
Proceedings of the 2019 IEEE Wireless Communications and Networking Conference, 2019

Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning.
Proceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering, 2019

Large-Scale Home Energy Management Using Entropy-Based Collective Multiagent Deep Reinforcement Learning Framework.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Towards Efficient Detection and Optimal Response against Sophisticated Opponents.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Building Personalized Simulator for Interactive Search.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Explicitly Coordinated Policy Iteration.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Deep Multi-Agent Reinforcement Learning with Discrete-Continuous Hybrid Action Spaces.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Achieving cooperation through deep multiagent reinforcement learning in sequential prisoner's dilemmas.
Proceedings of the First International Conference on Distributed Artificial Intelligence, 2019

Learning Adaptive Display Exposure for Real-Time Advertising.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

Automatic Feature Engineering by Deep Reinforcement Learning.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Reinforcement Learning for Cooperative Overtaking.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Large-Scale Home Energy Management Using Entropy-Based Collective Multiagent Reinforcement Learning Framework.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

ONECG: Online Negotiation Environment for Coalitional Games.
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019

An Optimal Rewiring Strategy for Cooperative Multiagent Social Learning.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
An Adaptive Markov Strategy for Defending Smart Grid False Data Injection From Malicious Attackers.
IEEE Trans. Smart Grid, 2018

Efficient and Robust Emergence of Norms through Heuristic Collective Learning.
ACM Trans. Auton. Adapt. Syst., 2018

An Adaptive Learning Based Network Selection Approach for 5G Dynamic Environments.
Entropy, 2018

Hierarchical Deep Multiagent Reinforcement Learning.
CoRR, 2018

SCC-rFMQ Learning in Cooperative Markov Games with Continuous Actions.
CoRR, 2018

Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents.
CoRR, 2018

Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning.
CoRR, 2018

An Optimal Rewiring Strategy for Reinforcement Social Learning in Cooperative Multiagent Systems.
CoRR, 2018

Hierarchical Heuristic Learning towards Efficient Norm Emergence.
CoRR, 2018

SA-IGA: A Multiagent Reinforcement Learning Method Towards Socially Optimal Outcomes.
CoRR, 2018

Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach.
CoRR, 2018

Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments.
CoRR, 2018

Effective norm emergence in cell systems under limited communication.
BMC Bioinform., 2018

ESRQ: An Efficient Secure Routing Method in Wireless Sensor Networks Based on Q-Learning.
Proceedings of the 17th IEEE International Conference On Trust, 2018

Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments.
Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

Achieving Multiagent Coordination Through CALA-rFMQ Learning in Continuous Action Space.
Proceedings of the PRICAI 2018: Trends in Artificial Intelligence, 2018

A Deep Bayesian Policy Reuse Approach Against Non-Stationary Agents.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Recurrent Deep Multiagent Q-Learning for Autonomous Brokers in Smart Grid.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018

Speeding up Collective Cell Migration Using Deep Reinforcement Learning.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

Attention-Based Recurrent Multi-Channel Neural Network for Influenza Epidemic Prediction.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2018

The Dynamics of Opinion Evolution in Gossiper-Media Model with WoLS-CALA Learning.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

SCC-rFMQ Learning in Cooperative Markov Games with Continuous Actions.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

Recurrent Deep Multiagent Q-Learning for Autonomous Agents in Future Smart Grid.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

Efficient Convention Emergence through Decoupled Reinforcement Social Learning with Teacher-Student Mechanism.
Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, 2018

2017
Blind Image Denoising via Dependent Dirichlet Process Tree.
IEEE Trans. Pattern Anal. Mach. Intell., 2017

Reciprocal Social Strategy in Social Repeated Games and Emergence of Social Norms.
Int. J. Artif. Intell. Tools, 2017

The dynamics of reinforcement social learning in networked cooperative multiagent systems.
Eng. Appl. Artif. Intell., 2017

Automated Software Security Requirements Recommendation Based on FT-SR Model.
Proceedings of the 29th International Conference on Software Engineering and Knowledge Engineering, 2017

FESR: A Framework for Eliciting Security Requirements Based on Integration of Common Criteria and Weakness Detection Formal Model.
Proceedings of the 2017 IEEE International Conference on Software Quality, 2017

An Adaptive Handover Trigger Strategy for 5G C/U Plane Split Heterogeneous Network.
Proceedings of the 14th IEEE International Conference on Mobile Ad Hoc and Sensor Systems, 2017

Defending Against Man-In-The-Middle Attack in Repeated Games.
Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

An Improved Android Collusion Attack Detection Method Based on Program Slicing.
Proceedings of the Formal Methods and Software Engineering, 2017

TLSsem: A TLS Security-Enhanced Mechanism against MITM Attacks in Public WiFis.
Proceedings of the 22nd International Conference on Engineering of Complex Computer Systems, 2017

Towards Solving Decision Making Problems Using Probabilistic Model Checking.
Proceedings of the 22nd International Conference on Engineering of Complex Computer Systems, 2017

A Prediction and Learning Based Approach to Network Selection in Dynamic Environments.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2017, 2017

A real-time ensemble classification algorithm for time series data.
Proceedings of the IEEE International Conference on Agents, 2017

Effective norm emergence in cell systems under limited communication.
Proceedings of the 2017 IEEE International Conference on Bioinformatics and Biomedicine, 2017

Optimal Personalized Defense Strategy Against Man-In-The-Middle Attack.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Improved EGT-Based Robustness Analysis of Negotiation Strategies in Multiagent Systems via Model Checking.
IEEE Trans. Hum. Mach. Syst., 2016

Fepchecker: An Automatic Model Checker for Verifying Fairness and Non-Repudiation of Security Protocols in Web Service.
Int. J. Softw. Eng. Knowl. Eng., 2016

Formal Modeling and Verification of Security Protocols on Cloud Computing Systems Based on UML 2.3.
Proceedings of the 2016 IEEE Trustcom/BigDataSE/ISPA, 2016

E-SSL: An SSL Security-Enhanced Method for Bypassing MITM Attacks in Mobile Internet.
Proceedings of the Structured Object-Oriented Formal Language and Method, 2016

Designing minimal effective normative systems with the help of lightweight formal methods.
Proceedings of the 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering, 2016

Accelerating Norm Emergence Through Hierarchical Heuristic Learning.
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016

Socially-Aware Multiagent Learning: Towards Socially Optimal Outcomes.
Proceedings of the ECAI 2016 - 22nd European Conference on Artificial Intelligence, 29 August-2 September 2016, The Hague, The Netherlands, 2016

Dynamic analysis of cell interactions in biological environments under multiagent social learning framework.
Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine, 2016

An Adaptive Learning Framework for Efficient Emergence of Social Norms: (Extended Abstract).
Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems, 2016

2015
Reinforcement social learning of social optimality with influencer agents.
Web Intell., 2015

Multiagent Reinforcement Social Learning toward Coordination in Cooperative Multiagent Systems.
ACM Trans. Auton. Adapt. Syst., 2015

Introducing decision entrustment mechanism into repeated bilateral agent interactions to achieve social optimality.
Auton. Agents Multi Agent Syst., 2015

An Adaptive Markov Strategy for Effective Network Intrusion Detection.
Proceedings of the 27th IEEE International Conference on Tools with Artificial Intelligence, 2015

Toward Efficient Agreements in Real-Time Multilateral Agent-Based Negotiations.
Proceedings of the 27th IEEE International Conference on Tools with Artificial Intelligence, 2015

Reciprocal Social Strategy in Social Repeated Games.
Proceedings of the 27th IEEE International Conference on Tools with Artificial Intelligence, 2015

Hierarchical Learning for Emergence of Social Norms in Networked Multiagent Systems.
Proceedings of the AI 2015: Advances in Artificial Intelligence, 2015

Heuristic Collective Learning for Efficient and Robust Emergence of Social Norms.
Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems, 2015

2014
CUHKAgent: An Adaptive Negotiation Strategy for Bilateral Negotiations over Multiple Items.
Proceedings of the Novel Insights in Agent-based Complex Automated Negotiation, 2014

An efficient and robust negotiating strategy in bilateral negotiations over multiple items.
Eng. Appl. Artif. Intell., 2014

Robustness Analysis of Negotiation Strategies through Multiagent Learning in Repeated Negotiation Games.
Proceedings of the Multiagent System Technologies - 12th German Conference, 2014

Evaluating Practical Automated Negotiation Based on Spatial Evolutionary Game Theory.
Proceedings of the KI 2014: Advances in Artificial Intelligence, 2014

Networked Reinforcement Social Learning towards Coordination in Cooperative Multiagent Systems.
Proceedings of the 26th IEEE International Conference on Tools with Artificial Intelligence, 2014

Spatial evolutionary game-theoretic perspective on agent-based complex negotiations.
Proceedings of the ECAI 2014 - 21st European Conference on Artificial Intelligence, 18-22 August 2014, Prague, Czech Republic, 2014

Adaptive Defending Strategy for Smart Grid Attacks.
Proceedings of the 2nd Workshop on Smart Energy Grid Security, 2014

2013
Fairness, social optimality and individual rationality in agent interactions.
PhD thesis, 2013

Achieving Socially Optimal Outcomes in Multiagent Systems with Reinforcement Social Learning.
ACM Trans. Auton. Adapt. Syst., 2013

The Dynamics of Reinforcement Social Learning in Cooperative Multiagent Systems.
Proceedings of the IJCAI 2013, 2013

Reinforcement social learning of coordination in cooperative multiagent systems.
Proceedings of the International Conference on Autonomous Agents and Multi-Agent Systems, 2013

2012
Maintaining cooperation in homogeneous multi-agent system.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2012

Probabilistic Model Checking Multi-agent Behaviors in Dispersion Games Using Counter Abstraction.
Proceedings of the PRIMA 2012: Principles and Practice of Multi-Agent Systems, 2012

An Efficient Negotiation Protocol to Achieve Socially Optimal Allocation.
Proceedings of the PRIMA 2012: Principles and Practice of Multi-Agent Systems, 2012

Incorporating Fairness into Agent Interactions Modeled as Two-Player Normal-Form Games.
Proceedings of the PRICAI 2012: Trends in Artificial Intelligence, 2012

Learning to Achieve Socially Optimal Solutions in General-Sum Games.
Proceedings of the PRICAI 2012: Trends in Artificial Intelligence, 2012

Incorporating Fairness into Infinitely Repeated Games with Conflicting Interests for Conflicts Elimination.
Proceedings of the IEEE 24th International Conference on Tools with Artificial Intelligence, 2012

Analyzing multi-agent systems with probabilistic model checking approach.
Proceedings of the 34th International Conference on Software Engineering, 2012

ABiNeS: An Adaptive Bilateral Negotiating Strategy over Multiple Items.
Proceedings of the 2012 IEEE/WIC/ACM International Conferences on Intelligent Agent Technology, 2012

Achieving Social Optimality with Influencer Agents.
Proceedings of the Complex Sciences - Second International Conference, 2012

2011
Learning to Achieve Social Rationality Using Tag Mechanism in Repeated Interactions.
Proceedings of the IEEE 23rd International Conference on Tools with Artificial Intelligence, 2011

2010
Strategy and Fairness in Repeated Two-agent Interaction.
Proceedings of the 22nd IEEE International Conference on Tools with Artificial Intelligence, 2010

2009
Bus-Based and NoC Infrastructure Performance Emulation and Comparison.
Proceedings of the Sixth International Conference on Information Technology: New Generations, 2009

2007
Theoretical Investigation on Post-Processed LDA for Face and Palmprint Recognition.
Proceedings of the Computational Intelligence and Security, International Conference, 2007

