Huimu Wang

Orcid: 0000-0001-7115-8831

According to our database¹, Huimu Wang authored at least 22 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Links

On csauthors.net:

Bibliography

2026

Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control.

[BibT_eX]

[DOI]

CoRR, March, 2026

Towards Efficient and Generalizable Retrieval: Adaptive Semantic Quantization and Residual Knowledge Transfer.

[BibT_eX]

[DOI]

CoRR, February, 2026

RAD-DPO: Robust Adaptive Denoising Direct Preference Optimization for Generative Retrieval in E-commerce.

[BibT_eX]

[DOI]

CoRR, February, 2026

2025

A Simple and Effective Framework for Symmetric Consistent Indexing in Large-Scale Dense Retrieval.

[BibT_eX]

[DOI]

CoRR, December, 2025

Diversity-Driven Offline-to-Online Multi-Player Policy Learning for Football Matches.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2025

Stochastic Trajectory Prediction Under Unstructured Constraints.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2025

OMoE: Diversifying Mixture of Low-Rank Adaptation by Orthogonal Finetuning.

[BibT_eX]

[DOI]

Proceedings of the ECAI 2025 - 28th European Conference on Artificial Intelligence, 25-30 October 2025, Bologna, Italy, 2025

2024

Generative Retrieval with Preference Optimization for E-commerce Search.

[BibT_eX]

[DOI]

CoRR, 2024

MODRL-TA:A Multi-Objective Deep Reinforcement Learning Framework for Traffic Allocation in E-Commerce Search.

[BibT_eX]

[DOI]

CoRR, 2024

A Preference-oriented Diversity Model Based on Mutual-information in Re-ranking for E-commerce Search.

[BibT_eX]

[DOI]

Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024

Breaking the Hourglass Phenomenon of Residual Quantization: Enhancing the Upper Bound of Generative Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: EMNLP 2024, 2024

MODRL-TA: A Multi-Objective Deep Reinforcement Learning Framework for Traffic Allocation in E-Commerce Search.

[BibT_eX]

[DOI]

Proceedings of the 33rd ACM International Conference on Information and Knowledge Management, 2024

2023

Learning to Play Football From Sports Domain Perspective: A Knowledge-Embedded Deep Reinforcement Learning Framework.

[BibT_eX]

[DOI]

IEEE Trans. Games, December, 2023

Attention Enhanced Reinforcement Learning for Multi agent Cooperation.

[BibT_eX]

[DOI]

IEEE Trans. Neural Networks Learn. Syst., November, 2023

Cognition-Driven Multiagent Policy Learning Framework for Promoting Cooperation.

[BibT_eX]

[DOI]

IEEE Trans. Games, September, 2023

Adaptive Hyper-parameter Learning for Deep Semantic Retrieval.

[BibT_eX]

[DOI]

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

2021

Multiagent Hierarchical Cognition Difference Policy for Multiagent Cooperation.

[BibT_eX]

[DOI]

Algorithms, 2021

Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2021

2020

Multi-agent Cooperation and Competition with Two-Level Attention Network.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 27th International Conference, 2020

STGA-LSTM: A Spatial-Temporal Graph Attentional LSTM Scheme for Multi-agent Cooperation.

[BibT_eX]

[DOI]

Proceedings of the Neural Information Processing - 27th International Conference, 2020

A Soft Graph Attention Reinforcement Learning for Multi-Agent Cooperation.

[BibT_eX]

[DOI]

Proceedings of the 16th IEEE International Conference on Automation Science and Engineering, 2020

2019

Time-sequence Action-Decision and Navigation Through Stage Deep Reinforcement Learning in Complex Dynamic Environments.

[BibT_eX]

[DOI]

Proceedings of the IEEE Symposium Series on Computational Intelligence, 2019

Huimu Wang

Timeline

Legend:

Links

On csauthors.net:

Bibliography

Loading...