Zongqing Lu

Orcid: 0000-0003-3967-2704

Affiliations:
  • Peking University, Beijing, China
  • Pennsylvania State Universityeking University, PA, USA
  • Nanyang Technological University, Singapore


According to our database1, Zongqing Lu authored at least 149 papers between 2011 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Multi-Agent Guided Policy Optimization.
CoRR, July, 2025

Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos.
CoRR, July, 2025

Unified Multimodal Understanding via Byte-Pair Visual Encoding.
CoRR, June, 2025

DualTHOR: A Dual-Arm Humanoid Simulation Platform for Contingency-Aware Planning.
CoRR, June, 2025

From Experts to a Generalist: Toward General Whole-Body Control for Humanoid Robots.
CoRR, June, 2025

RL from Physical Feedback: Aligning Large Motion Models with Humanoid Control.
CoRR, June, 2025

MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation.
CoRR, May, 2025

Guided Policy Optimization under Partial Observability.
CoRR, May, 2025

JAEGER: Dual-Level Humanoid Whole-Body Controller.
CoRR, May, 2025

Being-0: A Humanoid Robotic Agent with Vision-Language Models and Modular Skills.
CoRR, March, 2025

GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training.
CoRR, March, 2025

Taking Notes Brings Focus? Towards Multi-Turn Multimodal Dialogue Learning.
CoRR, March, 2025

CORD: Generalizable Cooperation via Role Diversity.
CoRR, January, 2025

<i>f</i>-Divergence Policy Optimization in Fully Decentralized Cooperative MARL.
Trans. Mach. Learn. Res., 2025

Best Possible Q-Learning.
Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2025

Creative Agents: Empowering Agents with Imagination for Creative Tasks.
Proceedings of the Conference on Uncertainty in Artificial Intelligence, 2025

LLM-Based Explicit Models of Opponents for Multi-Agent Games.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

MLLM as Retriever: Interactively Learning Multimodal Retrieval for Embodied Agents.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Cross-Embodiment Dexterous Grasping with Reinforcement Learning.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Watch Less, Do More: Implicit Skill Discovery for Video-Conditioned Policy.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Cross-Domain Offline Policy Adaptation with Optimal Transport and Dataset Constraint.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Learning Video-Conditioned Policy on Unlabelled Data with Joint Embedding Predictive Transformer.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Discrete Latent Plans via Semantic Skill Abstractions.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Efficient Residual Learning with Mixture-of-Experts for Universal Dexterous Grasping.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

From Pixels to Tokens: Byte-Pair Encoding on Quantized Visual Modalities.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

GAMEBoT: Transparent Assessment of LLM Reasoning in Games.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Bi-DexHands: Towards Human-Level Bimanual Dexterous Manipulation.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

A Fully Decentralized Surrogate for Multi-Agent Policy Optimization.
Trans. Mach. Learn. Res., 2024

Understanding What Affects the Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence.
J. Artif. Intell. Res., 2024

Off-policy RL algorithms can be sample-efficient for continuous control via sample multiple reuse.
Inf. Sci., 2024

Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games.
CoRR, 2024

VideoOrion: Tokenizing Object Dynamics in Videos.
CoRR, 2024

Quo Vadis, Motion Generation? From Large Language Models to Large Motion Models.
CoRR, 2024

SELU: Self-Learning Embodied MLLMs in Unknown Environments.
CoRR, 2024

Learning Diverse Bimanual Dexterous Manipulation Skills from Human Demonstrations.
CoRR, 2024

Egocentric Vision Language Planning.
CoRR, 2024

NOLO: Navigate Only Look Once.
CoRR, 2024

MTLight: Efficient Multi-Task Reinforcement Learning for Traffic Signal Control.
CoRR, 2024

Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study.
CoRR, 2024

Fully Decentralized Cooperative Multi-Agent Reinforcement Learning: A Survey.
CoRR, 2024

Pre-Trained Multi-Goal Transformers with Prompt Optimization for Efficient Online Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Opponent Modeling based on Subgoal Inference.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

ODRL: A Benchmark for Off-Dynamics Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

RL-GPT: Integrating Reinforcement Learning and Code-as-policy.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Multi-Agent Coordination via Multi-Level Communication.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

LLaMA-Rider: Spurring Large Language Models to Explore the Open World.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

Cross-Domain Policy Adaptation by Capturing Representation Mismatch.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

SEABO: A Simple Search-Based Method for Offline Imitation Learning.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

UniCode: Learning a Unified Codebook for Multimodal Large Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

Pre-trained Visual Dynamics Representations for Efficient Policy Learning.
Proceedings of the Computer Vision - ECCV 2024, 2024

Reinforcement Learning Friendly Vision-Language Model for Minecraft.
Proceedings of the Computer Vision - ECCV 2024, 2024

Visual Grounding for Object-Level Generalization in Reinforcement Learning.
Proceedings of the Computer Vision - ECCV 2024, 2024

Multi-Agent Alternate Q-Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Towards Understanding How to Reduce Generalization Gap in Visual Reinforcement Learning.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

Language Model Adaption for Reinforcement Learning with Natural Language Action Space.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

Learning Multi-Object Positional Relationships via Emergent Communication.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
MetaVIM: Meta Variationally Intrinsic Motivated Reinforcement Learning for Decentralized Traffic Signal Control.
IEEE Trans. Knowl. Data Eng., November, 2023

A Survey on Transformers in Reinforcement Learning.
Trans. Mach. Learn. Res., 2023

RLAdapter: Bridging Large Language Models to Reinforcement Learning in Open Worlds.
CoRR, 2023

Plan4MC: Skill Reinforcement Learning and Planning for Open-World Minecraft Tasks.
CoRR, 2023

CLIP4MC: An RL-Friendly Vision-Language Model for Minecraft.
CoRR, 2023

Model-Based Decentralized Policy Optimization.
CoRR, 2023

Learning from Visual Observation via Offline Pretrained State-to-Go Transformer.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Mutual-Information Regularized Multi-Agent Policy Iteration.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

More Centralized Training, Still Decentralized Execution: Multi-Agent Conditional Policy Factorization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

State Advantage Weighting for Offline RL.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

ReLeaPS : Reinforcement Learning-based Illumination Planning for Generalized Photometric Stereo.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Offline Decentralized Multi-Agent Reinforcement Learning.
Proceedings of the ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30 - October 4, 2023, Kraków, Poland, 2023

Multi-Agent Automated Machine Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Adaptive Learning Rates for Multi-Agent Reinforcement Learning.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023

Multi-Agent Language Learning: Symbolic Mapping.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Online Tuning for Offline Decentralized Multi-Agent Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Asynchronous Reinforcement Learning Framework and Knowledge Transfer for Net-Order Exploration in Detailed Routing.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

Decentralized Policy Optimization.
CoRR, 2022

Multi-Agent Sequential Decision-Making via Communication.
CoRR, 2022

MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning.
CoRR, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.
CoRR, 2022

Model-Based Opponent Modeling.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning to Share in Networked Multi-Agent Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Mildly Conservative Q-Learning for Offline Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

I2Q: A Fully Decentralized Q-Learning Algorithm.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Robust Task Representations for Offline Meta-Reinforcement Learning via Contrastive Learning.
Proceedings of the International Conference on Machine Learning, 2022

Divergence-Regularized Multi-Agent Actor-Critic.
Proceedings of the International Conference on Machine Learning, 2022

Difference Advantage Estimation for Multi-Agent Policy Gradients.
Proceedings of the International Conference on Machine Learning, 2022

2021
Augur: Modeling the Resource Requirements of ConvNets on Mobile Devices.
IEEE Trans. Mob. Comput., 2021

PicSys: Energy-Efficient Fast Image Search on Distributed Mobile Networks.
IEEE Trans. Mob. Comput., 2021

Towards More Accurate Automatic Sleep Staging via Deep Transfer Learning.
IEEE Trans. Biomed. Eng., 2021

Learning to Share in Multi-Agent Reinforcement Learning.
CoRR, 2021

Model-Based Opponent Modeling.
CoRR, 2021

Informative Policy Representations in Multi-Agent Reinforcement Learning via Joint-Action Distributions.
CoRR, 2021

Revisiting Prioritized Experience Replay: A Value Perspective.
CoRR, 2021

Variationally and Intrinsically motivated reinforcement learning for decentralized traffic signal control.
CoRR, 2021

Quality of Information in Gathering Information via Video Analytics for Military Networks.
IEEE Commun. Mag., 2021

FOP: Factorizing Optimal Joint Policy of Maximum-Entropy Multi-Agent Reinforcement Learning.
Proceedings of the 38th International Conference on Machine Learning, 2021

The Emergence of Individuality.
Proceedings of the 38th International Conference on Machine Learning, 2021

Asynchronous Reinforcement Learning Framework for Net Order Exploration in Detailed Routing.
Proceedings of the Design, Automation & Test in Europe Conference & Exhibition, 2021

Hierarchically and Cooperatively Learning Traffic Signal Control.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
NetVision: On-Demand Video Processing in Wireless Networks.
IEEE/ACM Trans. Netw., 2020

The Emergence of Individuality in Multi-Agent Reinforcement Learning.
CoRR, 2020

Learning Individually Inferred Communication for Multi-Agent Cooperation.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Graph Convolutional Reinforcement Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Generative Exploration and Exploitation.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
CrowdVision: A Computing Platform for Video Crowdprocessing Using Deep Learning.
IEEE Trans. Mob. Comput., 2019

Heterogeneous Transfer Learning for Thermal Comfort Modeling.
Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, 2019

Learning Fairness in Multi-Agent Systems.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

2018
Graph Convolutional Reinforcement Learning for Multi-Agent Cooperation.
CoRR, 2018

Learning Attentional Communication for Multi-Agent Cooperation.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

A Computing Platform for Video Crowdprocessing Using Deep Learning.
Proceedings of the 2018 IEEE Conference on Computer Communications, 2018

2017
Cluster-Aware Virtual Machine Collaborative Migration in Media Cloud.
IEEE Trans. Parallel Distributed Syst., 2017

Cooperative Data Offload in Opportunistic Networks: From Mobile Devices to Infrastructure.
IEEE/ACM Trans. Netw., 2017

TeamPhone: Networking SmartPhones for Disaster Recovery.
IEEE Trans. Mob. Comput., 2017

Modeling the Resource Requirements of Convolutional Neural Networks on Mobile Devices.
Proceedings of the 2017 ACM on Multimedia Conference, 2017

On-demand Information Retrieval from Videos Using Deep Learning in Wireless Networks: Demo Abstract.
Proceedings of the Second International Conference on Internet-of-Things Design and Implementation, 2017

2016
Towards Information Diffusion in Mobile Social Networks.
IEEE Trans. Mob. Comput., 2016

Infectious Disease Containment Based on a Wireless Sensor System.
IEEE Access, 2016

Networking smartphones for disaster recovery.
Proceedings of the 2016 IEEE International Conference on Pervasive Computing and Communications, 2016

Cooperative data offloading in opportunistic mobile networks.
Proceedings of the 35th Annual IEEE International Conference on Computer Communications, 2016

On-demand video processing in wireless networks.
Proceedings of the 24th IEEE International Conference on Network Protocols, 2016

Protection of location privacy in continuous LBSs against adversaries with background information.
Proceedings of the 2016 International Conference on Computing, 2016

Video processing of complex activity detection in resource-constrained networks.
Proceedings of the 2016 IEEE Global Conference on Signal and Information Processing, 2016

2015
Algorithms and Applications for Community Detection in Weighted Networks.
IEEE Trans. Parallel Distributed Syst., 2015

Targeted vaccination based on a wireless sensor system.
Proceedings of the 2015 IEEE International Conference on Pervasive Computing and Communications, 2015

A personalized two-tier cloaking scheme for privacy-aware location-based services.
Proceedings of the International Conference on Computing, Networking and Communications, 2015

Task Allocation for Mobile Cloud Computing in Heterogeneous Wireless Networks.
Proceedings of the 24th International Conference on Computer Communication and Networks, 2015

SymDetector: detecting sound-related respiratory symptoms using smartphones.
Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing, 2015

2014
Exploring social structures for network protocol designs
PhD thesis, 2014

Distributed Algorithm for Tree-Structured Data Aggregation Service Placement in Smart Grid.
IEEE Syst. J., 2014

Priority-Aware Private Matching Schemes for Proximity-Based Mobile Social Networks.
CoRR, 2014

Skeleton construction in mobile social networks: Algorithms and applications.
Proceedings of the Eleventh Annual IEEE International Conference on Sensing, 2014

Information diffusion in mobile social networks: The speed perspective.
Proceedings of the 2014 IEEE Conference on Computer Communications, 2014

2013
D2F: A Routing Protocol for Distributed Data Fusion in Wireless Sensor Networks.
Wirel. Pers. Commun., 2013

Fusion function placement for Active Networks paradigm in wireless sensor networks.
Wirel. Networks, 2013

Toward Efficient Distributed Algorithms for In-Network Binary Operator Tree Placement in Wireless Sensor Networks.
IEEE J. Sel. Areas Commun., 2013

Community detection in weighted networks: Algorithms and applications.
Proceedings of the 2013 IEEE International Conference on Pervasive Computing and Communications, 2013

2012
Distributed and Asynchronous Solution to Operator Placement in Large Wireless Sensor Networks.
Proceedings of the 8th International Conference on Mobile Ad-hoc and Sensor Networks, 2012

Credit routing for source-location privacy protection in wireless sensor networks.
Proceedings of the 9th IEEE International Conference on Mobile Ad-Hoc and Sensor Systems, 2012

2011
Evaluation of a TDMA-based energy efficient MAC protocol for multiple capsule networks.
EURASIP J. Wirel. Commun. Netw., 2011

Function Placement of Data Fusion for Active Networks Paradigm in Wireless Sensor Networks.
Proceedings of the IEEE 8th International Conference on Mobile Adhoc and Sensor Systems, 2011

A power-aware framework for distributed data fusion application in wireless sensor networks.
Proceedings of the IEEE 36th Conference on Local Computer Networks, 2011

Fusion Function Placement Algorithm for Distributed Data Fusion Application in Wireless Sensor Networks.
Proceedings of the 25th IEEE International Conference on Advanced Information Networking and Applications Workshops, 2011


  Loading...