Huimu Wang

Orcid: 0000-0001-7115-8831

According to our database1, Huimu Wang authored at least 22 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control.
CoRR, March, 2026

Towards Efficient and Generalizable Retrieval: Adaptive Semantic Quantization and Residual Knowledge Transfer.
CoRR, February, 2026

RAD-DPO: Robust Adaptive Denoising Direct Preference Optimization for Generative Retrieval in E-commerce.
CoRR, February, 2026

2025
A Simple and Effective Framework for Symmetric Consistent Indexing in Large-Scale Dense Retrieval.
CoRR, December, 2025

Diversity-Driven Offline-to-Online Multi-Player Policy Learning for Football Matches.
Proceedings of the International Joint Conference on Neural Networks, 2025

Stochastic Trajectory Prediction Under Unstructured Constraints.
Proceedings of the IEEE International Conference on Robotics and Automation, 2025

OMoE: Diversifying Mixture of Low-Rank Adaptation by Orthogonal Finetuning.
Proceedings of the ECAI 2025 - 28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, Italy, 2025

2024
Generative Retrieval with Preference Optimization for E-commerce Search.
CoRR, 2024

MODRL-TA:A Multi-Objective Deep Reinforcement Learning Framework for Traffic Allocation in E-Commerce Search.
CoRR, 2024

A Preference-oriented Diversity Model Based on Mutual-information in Re-ranking for E-commerce Search.
Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Breaking the Hourglass Phenomenon of Residual Quantization: Enhancing the Upper Bound of Generative Retrieval.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

MODRL-TA: A Multi-Objective Deep Reinforcement Learning Framework for Traffic Allocation in E-Commerce Search.
Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

2023
Learning to Play Football From Sports Domain Perspective: A Knowledge-Embedded Deep Reinforcement Learning Framework.
IEEE Trans. Games, December, 2023

Attention Enhanced Reinforcement Learning for Multi agent Cooperation.
IEEE Trans. Neural Networks Learn. Syst., November, 2023

Cognition-Driven Multiagent Policy Learning Framework for Promoting Cooperation.
IEEE Trans. Games, September, 2023

Adaptive Hyper-parameter Learning for Deep Semantic Retrieval.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

2021
Multiagent Hierarchical Cognition Difference Policy for Multiagent Cooperation.
Algorithms, 2021

Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation.
Proceedings of the International Joint Conference on Neural Networks, 2021

2020
Multi-agent Cooperation and Competition with Two-Level Attention Network.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

STGA-LSTM: A Spatial-Temporal Graph Attentional LSTM Scheme for Multi-agent Cooperation.
Proceedings of the Neural Information Processing - 27th International Conference, 2020

A Soft Graph Attention Reinforcement Learning for Multi-Agent Cooperation.
Proceedings of the 16th IEEE International Conference on Automation Science and Engineering, 2020

2019
Time-sequence Action-Decision and Navigation Through Stage Deep Reinforcement Learning in Complex Dynamic Environments.
Proceedings of the IEEE Symposium Series on Computational Intelligence, 2019


  Loading...