Zongqing Lu

Proceedings of the Forty-second International Conference on Machine Learning, 2025

MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Cross-Embodiment Dexterous Grasping with Reinforcement Learning.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Watch Less, Do More: Implicit Skill Discovery for Video-Conditioned Policy.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Cross-Domain Offline Policy Adaptation with Optimal Transport and Dataset Constraint.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Learning Video-Conditioned Policy on Unlabelled Data with Joint Embedding Predictive Transformer.

Hao Luo

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Discrete Latent Plans via Semantic Skill Abstractions.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Efficient Residual Learning with Mixture-of-Experts for Universal Dexterous Grasping.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

GAMEBoT: Transparent Assessment of LLM Reasoning in Games.

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

Bi-DexHands: Towards Human-Level Bimanual Dexterous Manipulation.

IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

A Fully Decentralized Surrogate for Multi-Agent Policy Optimization.

Trans. Mach. Learn. Res., 2024

Understanding What Affects the Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence.

J. Artif. Intell. Res., 2024

Off-policy RL algorithms can be sample-efficient for continuous control via sample multiple reuse.

Inf. Sci., 2024

Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games.

CoRR, 2024

VideoOrion: Tokenizing Object Dynamics in Videos.

CoRR, 2024

Quo Vadis, Motion Generation? From Large Language Models to Large Motion Models.

CoRR, 2024

SELU: Self-Learning Embodied MLLMs in Unknown Environments.

CoRR, 2024

Learning Diverse Bimanual Dexterous Manipulation Skills from Human Demonstrations.

CoRR, 2024

Egocentric Vision Language Planning.

CoRR, 2024

NOLO: Navigate Only Look Once.

Bohan Zhou

CoRR, 2024

MTLight: Efficient Multi-Task Reinforcement Learning for Traffic Signal Control.

CoRR, 2024

Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study.

CoRR, 2024

Fully Decentralized Cooperative Multi-Agent Reinforcement Learning: A Survey.

CoRR, 2024

Pre-Trained Multi-Goal Transformers with Prompt Optimization for Efficient Online Adaptation.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Opponent Modeling based on Subgoal Inference.

Xiaopeng Yu

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

ODRL: A Benchmark for Off-Dynamics Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

RL-GPT: Integrating Reinforcement Learning and Code-as-policy.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Multi-Agent Coordination via Multi-Level Communication.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback.

Wanpeng Zhang

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

LLaMA-Rider: Spurring Large Language Models to Explore the Open World.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Cross-Domain Policy Adaptation by Capturing Representation Mismatch.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

SEABO: A Simple Search-Based Method for Offline Imitation Learning.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

UniCode: Learning a Unified Codebook for Multimodal Large Language Models.

Proceedings of the Computer Vision - ECCV 2024, 2024

Pre-trained Visual Dynamics Representations for Efficient Policy Learning.

Hao Luo

Bohan Zhou

Proceedings of the Computer Vision - ECCV 2024, 2024

Reinforcement Learning Friendly Vision-Language Model for Minecraft.

Proceedings of the Computer Vision - ECCV 2024, 2024

Visual Grounding for Object-Level Generalization in Reinforcement Learning.

Proceedings of the Computer Vision - ECCV 2024, 2024

Multi-Agent Alternate Q-Learning.

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Towards Understanding How to Reduce Generalization Gap in Visual Reinforcement Learning.

Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Language Model Adaption for Reinforcement Learning with Natural Language Action Space.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing.

Ziluo Ding

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Learning Multi-Object Positional Relationships via Emergent Communication.

Yicheng Feng

Boshi An

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

MetaVIM: Meta Variationally Intrinsic Motivated Reinforcement Learning for Decentralized Traffic Signal Control.

IEEE Trans. Knowl. Data Eng., November, 2023

A Survey on Transformers in Reinforcement Learning.

Trans. Mach. Learn. Res., 2023

RLAdapter: Bridging Large Language Models to Reinforcement Learning in Open Worlds.

Wanpeng Zhang

CoRR, 2023

Plan4MC: Skill Reinforcement Learning and Planning for Open-World Minecraft Tasks.

CoRR, 2023

CLIP4MC: An RL-Friendly Vision-Language Model for Minecraft.

CoRR, 2023

Model-Based Decentralized Policy Optimization.

Hao Luo

CoRR, 2023

Learning from Visual Observation via Offline Pretrained State-to-Go Transformer.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Mutual-Information Regularized Multi-Agent Policy Iteration.

Deheng Ye

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning.

Proceedings of the International Conference on Machine Learning, 2023

More Centralized Training, Still Decentralized Execution: Multi-Agent Conditional Policy Factorization.

Deheng Ye

Proceedings of the Eleventh International Conference on Learning Representations, 2023

State Advantage Weighting for Offline RL.

Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

ReLeaPS: Reinforcement Learning-based Illumination Planning for Generalized Photometric Stereo.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Offline Decentralized Multi-Agent Reinforcement Learning.

Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

Multi-Agent Automated Machine Learning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Adaptive Learning Rates for Multi-Agent Reinforcement Learning.

Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Multi-Agent Language Learning: Symbolic Mapping.

Yicheng Feng

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Online Tuning for Offline Decentralized Multi-Agent Reinforcement Learning.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Asynchronous Reinforcement Learning Framework and Knowledge Transfer for Net-Order Exploration in Detailed Routing.

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

Decentralized Policy Optimization.

CoRR, 2022

Multi-Agent Sequential Decision-Making via Communication.

CoRR, 2022

MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning.

CoRR, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.

Stephen Marcus McAleer

Hao Dong

Song-Chun Zhu

CoRR, 2022

Model-Based Opponent Modeling.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning to Share in Networked Multi-Agent Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Mildly Conservative Q-Learning for Offline Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination.

Jiafei Lyu

Xiu Li

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

I2Q: A Fully Decentralized Q-Learning Algorithm.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning.

Haoqi Yuan

Proceedings of the International Conference on Machine Learning, 2022

Divergence-Regularized Multi-Agent Actor-Critic.

Proceedings of the International Conference on Machine Learning, 2022

Difference Advantage Estimation for Multi-Agent Policy Gradients.

Yueheng Li

Guangming Xie

Proceedings of the International Conference on Machine Learning, 2022

2021

Augur: Modeling the Resource Requirements of ConvNets on Mobile Devices.

IEEE Trans. Mob. Comput., 2021

PicSys: Energy-Efficient Fast Image Search on Distributed Mobile Networks.

IEEE Trans. Mob. Comput., 2021

Towards More Accurate Automatic Sleep Staging via Deep Transfer Learning.

IEEE Trans. Biomed. Eng., 2021

Learning to Share in Multi-Agent Reinforcement Learning.

CoRR, 2021

Model-Based Opponent Modeling.

CoRR, 2021

Informative Policy Representations in Multi-Agent Reinforcement Learning via Joint-Action Distributions.

Yifan Yu

CoRR, 2021

Revisiting Prioritized Experience Replay: A Value Perspective.

Ang A. Li

Chenglin Miao

CoRR, 2021

Variationally and Intrinsically motivated reinforcement learning for decentralized traffic signal control.

CoRR, 2021

Quality of Information in Gathering Information via Video Analytics for Military Networks.

Gregory H. Cirincione

Thomas F. La Porta

IEEE Commun. Mag., 2021

FOP: Factorizing Optimal Joint Policy of Maximum-Entropy Multi-Agent Reinforcement Learning.

Proceedings of the 38th International Conference on Machine Learning, 2021

The Emergence of Individuality.

Proceedings of the 38th International Conference on Machine Learning, 2021

Asynchronous Reinforcement Learning Framework for Net Order Exploration in Detailed Routing.

Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2021

Hierarchically and Cooperatively Learning Traffic Signal Control.

Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020

NetVision: On-Demand Video Processing in Wireless Networks.

IEEE/ACM Trans. Netw., 2020

The Emergence of Individuality in Multi-Agent Reinforcement Learning.

CoRR, 2020

Learning Individually Inferred Communication for Multi-Agent Cooperation.

Ziluo Ding

Tiejun Huang

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Graph Convolutional Reinforcement Learning.

Proceedings of the 8th International Conference on Learning Representations, 2020

Generative Exploration and Exploitation.

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

CrowdVision: A Computing Platform for Video Crowdprocessing Using Deep Learning.

IEEE Trans. Mob. Comput., 2019

Heterogeneous Transfer Learning for Thermal Comfort Modeling.

Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, 2019

Learning Fairness in Multi-Agent Systems.

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018

Graph Convolutional Reinforcement Learning for Multi-Agent Cooperation.

Chen Dun

CoRR, 2018

Learning Attentional Communication for Multi-Agent Cooperation.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

A Computing Platform for Video Crowdprocessing Using Deep Learning.

Kevin S. Chan

Proceedings of the 2018 IEEE Conference on Computer Communications, 2018

2017

Cluster-Aware Virtual Machine Collaborative Migration in Media Cloud.

IEEE Trans. Parallel Distributed Syst., 2017

Cooperative Data Offload in Opportunistic Networks: From Mobile Devices to Infrastructure.

Xiao Sun

IEEE/ACM Trans. Netw., 2017

TeamPhone: Networking SmartPhones for Disaster Recovery.

IEEE Trans. Mob. Comput., 2017

Modeling the Resource Requirements of Convolutional Neural Networks on Mobile Devices.

Proceedings of the 2017 ACM on Multimedia Conference, 2017

On-demand Information Retrieval from Videos Using Deep Learning in Wireless Networks: Demo Abstract.

Proceedings of the Second International Conference on Internet-of-Things Design and Implementation, 2017

2016

Towards Information Diffusion in Mobile Social Networks.

IEEE Trans. Mob. Comput., 2016

Infectious Disease Containment Based on a Wireless Sensor System.

IEEE Access, 2016

Networking smartphones for disaster recovery.

Proceedings of the 2016 IEEE International Conference on Pervasive Computing and Communications, 2016

Cooperative data offloading in opportunistic mobile networks.

Xiao Sun

Proceedings of the 35th Annual IEEE International Conference on Computer Communications, 2016

On-demand video processing in wireless networks.

Proceedings of the 24th IEEE International Conference on Network Protocols, 2016

Protection of location privacy in continuous LBSs against adversaries with background information.

Proceedings of the 2016 International Conference on Computing, 2016

Video processing of complex activity detection in resource-constrained networks.

Proceedings of the 2016 IEEE Global Conference on Signal and Information Processing, 2016

2015

Algorithms and Applications for Community Detection in Weighted Networks.

IEEE Trans. Parallel Distributed Syst., 2015

Targeted vaccination based on a wireless sensor system.

Proceedings of the 2015 IEEE International Conference on Pervasive Computing and Communications, 2015

A personalized two-tier cloaking scheme for privacy-aware location-based services.

Proceedings of the International Conference on Computing, Networking and Communications, 2015

Task Allocation for Mobile Cloud Computing in Heterogeneous Wireless Networks.

Proceedings of the 24th International Conference on Computer Communication and Networks, 2015

SymDetector: detecting sound-related respiratory symptoms using smartphones.

Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2015

2014

Exploring social structures for network protocol designs.

PhD thesis, 2014

Distributed Algorithm for Tree-Structured Data Aggregation Service Placement in Smart Grid.

IEEE Syst. J., 2014

Priority-Aware Private Matching Schemes for Proximity-Based Mobile Social Networks.

CoRR, 2014

Skeleton construction in mobile social networks: Algorithms and applications.

Proceedings of the Eleventh Annual IEEE International Conference on Sensing, 2014

Information diffusion in mobile social networks: The speed perspective.

Proceedings of the 2014 IEEE Conference on Computer Communications, 2014

2013

D2F: A Routing Protocol for Distributed Data Fusion in Wireless Sensor Networks.

Wirel. Pers. Commun., 2013

Fusion function placement for Active Networks paradigm in wireless sensor networks.

Wirel. Networks, 2013

Toward Efficient Distributed Algorithms for In-Network Binary Operator Tree Placement in Wireless Sensor Networks.

IEEE J. Sel. Areas Commun., 2013

Community detection in weighted networks: Algorithms and applications.

Proceedings of the 2013 IEEE International Conference on Pervasive Computing and Communications, 2013

2012

Distributed and Asynchronous Solution to Operator Placement in Large Wireless Sensor Networks.

Proceedings of the 8th International Conference on Mobile Ad-hoc and Sensor Networks, 2012

Credit routing for source-location privacy protection in wireless sensor networks.

Proceedings of the 9th IEEE International Conference on Mobile Ad-Hoc and Sensor Systems, 2012

2011

Evaluation of a TDMA-based energy efficient MAC protocol for multiple capsule networks.

EURASIP J. Wirel. Commun. Netw., 2011

Function Placement of Data Fusion for Active Networks Paradigm in Wireless Sensor Networks.

Proceedings of the IEEE 8th International Conference on Mobile Adhoc and Sensor Systems, 2011

A power-aware framework for distributed data fusion application in wireless sensor networks.

Proceedings of the IEEE 36th Conference on Local Computer Networks, 2011

Fusion Function Placement Algorithm for Distributed Data Fusion Application in Wireless Sensor Networks.
