Mengwei Xu

Orcid: 0000-0001-6271-6993

Affiliations:
  • Beijing University of Posts and Telecommunications, State Key Laboratory of Networking and Switching Technology, Beijing, China


According to our database1, Mengwei Xu authored at least 126 papers between 2016 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
Accelerating Mobile Language Model via Speculative Decoding and NPU-Coordinated Execution.
CoRR, October, 2025

From Earth to Orbit: Launch Sequence Optimization for LEO Mega-Constellations.
IEEE Trans. Mob. Comput., September, 2025

EdgeMoE: Empowering Sparse Large Language Models on Mobile Devices.
IEEE Trans. Mob. Comput., August, 2025

Dynamic Sparse Attention on Mobile SoCs.
CoRR, August, 2025

MobiEdit: Resource-efficient Knowledge Editing for Personalized On-device LLMs.
CoRR, June, 2025

MCPWorld: A Unified Benchmarking Testbed for API, GUI, and Hybrid Computer Use Agents.
CoRR, June, 2025

Resource-efficient Algorithms and Systems of Foundation Models: A Survey.
ACM Comput. Surv., May, 2025

LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades.
CoRR, May, 2025

UIShift: Enhancing VLM-based GUI Agents through Self-supervised Reinforcement Learning.
CoRR, May, 2025

EdgeLLM: Fast On-Device LLM Inference With Speculative Decoding.
IEEE Trans. Mob. Comput., April, 2025

A Collaborative Cloud-Edge Approach for Robust Edge Workload Forecasting.
IEEE Trans. Mob. Comput., April, 2025

Rethinking Cost-Efficient VM Scheduling on Public Edge Platforms: A Service Provider's Perspective.
IEEE Trans. Mob. Comput., March, 2025

Does Chain-of-Thought Reasoning Help Mobile GUI Agent? An Empirical Study.
CoRR, March, 2025

Every Software as an Agent: Blueprint and Case Study.
CoRR, February, 2025

FedCLR+: Tackling Onboard Label Constraints for Accurate Federated Satellite Computing.
IEEE Trans. Serv. Comput., 2025

ShortcutsBench: A Large-Scale Real-world Benchmark for API-based Agents.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

Scout: Tailored Collaborative Workload Forecasting for Multi-Tenant Edge Cloud Platforms.
Proceedings of the IEEE International Conference on Communications, 2025

GUI-Xplore: Empowering Generalizable GUI Agents with One Exploration.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Fast On-device LLM Inference with NPUs.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

Demystifying Small Language Models for Edge Deployment.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Efficient, Scalable, and Sustainable DNN Training on SoC-Clustered Edge Servers.
IEEE Trans. Mob. Comput., December, 2024

Accelerating Vertical Federated Learning.
IEEE Trans. Big Data, December, 2024

Communication-Efficient Satellite-Ground Federated Learning Through Progressive Weight Quantization.
IEEE Trans. Mob. Comput., September, 2024

Benchmarking Mobile Deep Learning Software.
GetMobile Mob. Comput. Commun., September, 2024

Seamless Cross-Edge Service Migration for Real-Time Rendering Applications.
IEEE Trans. Mob. Comput., June, 2024

A Comprehensive Deep Learning Library Benchmark and Optimal Library Selection.
IEEE Trans. Mob. Comput., May, 2024

FLASH: Heterogeneity-Aware Federated Learning at Scale.
IEEE Trans. Mob. Comput., January, 2024

Toward Efficient Satellite Computing Through Adaptive Compression.
IEEE Trans. Serv. Comput., 2024

Tango: Harmonious Optimization for Mixed Services in Kubernetes-Based Edge Clouds.
IEEE Trans. Serv. Comput., 2024

Large-Scale Measurements and Optimizations on Latency in Edge Clouds.
IEEE Trans. Cloud Comput., 2024

DroidCall: A Dataset for LLM-powered Android Intent Invocation.
CoRR, 2024

PhoneLM:an Efficient and Capable Small Language Model Family through Principled Pre-training.
CoRR, 2024

Small Language Models: Survey, Measurements, and Insights.
CoRR, 2024

Recall: Empowering Multimodal Embedding for Edge Devices.
CoRR, 2024

MobileViews: A Large-Scale Mobile GUI Dataset.
CoRR, 2024

ELMS: Elasticized Large Language Models On Mobile Devices.
CoRR, 2024

FedMoE: Personalized Federated Learning via Heterogeneous Mixture of Experts.
CoRR, 2024

LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Automation Task Evaluation.
CoRR, 2024

LLM as a System Service on Mobile Devices.
CoRR, 2024

A First Look at GPT Apps: Landscape and Vulnerability.
CoRR, 2024

Lightweight Protection for Privacy in Offloaded Speech Understanding.
CoRR, 2024

A Survey of Resource-efficient LLM and Multimodal Foundation Models.
CoRR, 2024

Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security.
CoRR, 2024

Towards Energy-efficient Federated Learning via INT8-based Training on Mobile DSPs.
Proceedings of the ACM on Web Conference 2024, 2024

High-density Mobile Cloud Gaming on Edge SoC Clusters.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

More is Different: Prototyping and Analyzing a New Form of Edge Server with Massive Mobile SoCs.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

FwdLLM: Efficient Federated Finetuning of Large Language Models with Perturbed Inferences.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

An Empirical Study of Rust-for-Linux: The Success, Dissatisfaction, and Compromise.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation.
Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, 2024

PieBridge: Fast and Parameter-Efficient On-Device Training via Proxy Networks.
Proceedings of the 22nd ACM Conference on Embedded Networked Sensor Systems, 2024

Exploring Real-Time Satellite Computing: From Energy and Thermal Perspectives.
Proceedings of the IEEE Real-Time Systems Symposium, 2024

SILENCE: Protecting privacy in offloaded speech understanding on resource-constrained devices.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Poster: Efficient and Accurate Mobile Task Automation through Learning from Code.
Proceedings of the 22nd Annual International Conference on Mobile Systems, 2024

Mobile Foundation Model as Firmware.
Proceedings of the 30th Annual International Conference on Mobile Computing and Networking, 2024

Deciphering the Enigma of Satellite Computing with COTS Devices: Measurement and Analysis.
Proceedings of the 30th Annual International Conference on Mobile Computing and Networking, 2024

Resource-efficient In-orbit Detection of Earth Objects.
Proceedings of the IEEE INFOCOM 2024, 2024

Flexible LAN-WAN Orchestration for Communication Efficient Federated Learning over Large-Scale Mobile Devices.
Proceedings of the 30th IEEE International Conference on Parallel and Distributed Systems, 2024

FedRDMA: Communication-Efficient Cross-Silo Federated LLM via Chunked RDMA Transmission.
Proceedings of the 4th Workshop on Machine Learning and Systems, 2024

WiP: Efficient LLM Prefilling with Mobile NPU.
Proceedings of the Workshop on Edge and Mobile Foundation Models, 2024

Large Language Models on Mobile Devices: Measurements, Analysis, and Insights.
Proceedings of the Workshop on Edge and Mobile Foundation Models, 2024

SoCFlow: Efficient and Scalable DNN Training on SoC-Clustered Edge Servers.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

5G Edge Computing - Technologies, Applications and Future Visions
Springer, ISBN: 978-981-97-0212-1, 2024

2023
Demystifying the QoS and QoE of Edge-hosted Video Streaming Applications in the Wild with SNESet.
Proc. ACM Manag. Data, December, 2023

A large-scale holistic measurement of crowdsourced edge cloud platform.
World Wide Web (WWW), September, 2023

The First Verification Test of Space-Ground Collaborative Intelligence via Cloud-Native Satellites.
CoRR, 2023

LLMCad: Fast and Scalable On-device Large Language Model Inference.
CoRR, 2023

Rethinking Mobile AI Ecosystem in the LLM Era.
CoRR, 2023

EdgeMoE: Fast On-Device Inference of MoE-based Large Language Models.
CoRR, 2023

Federated Fine-tuning of Billion-Sized Language Models across Mobile Devices.
CoRR, 2023

A Comprehensive Survey on Orbital Edge Computing: Systems, Applications, and Algorithms.
CoRR, 2023

ELASTIC: Edge Workload Forecasting based on Collaborative Cloud-Edge Deep Learning.
Proceedings of the ACM Web Conference 2023, 2023

Boosting DNN Cold Inference on Edge Devices.
Proceedings of the 21st Annual International Conference on Mobile Systems, 2023

Federated Few-Shot Learning for Mobile NLP.
Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, 2023

Efficient Federated Learning for Modern NLP.
Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, 2023

How Far Have Edge Clouds Gone? A Spatial-Temporal Analysis of Edge Network Latency In the Wild.
Proceedings of the 31st IEEE/ACM International Symposium on Quality of Service, 2023

A Holistic QoS View of Crowdsourced Edge Cloud Platform.
Proceedings of the 31st IEEE/ACM International Symposium on Quality of Service, 2023

Evaluating and Enhancing the Robustness of Federated Learning System against Realistic Data Corruption.
Proceedings of the 34th IEEE International Symposium on Software Reliability Engineering, 2023

Privacy as a Resource in Differentially Private Federated Learning.
Proceedings of the IEEE INFOCOM 2023, 2023

Niagara: Scheduling DNN Inference Services on Heterogeneous Edge Processors.
Proceedings of the Service-Oriented Computing - 21st International Conference, 2023

Tango: Harmonious Management and Scheduling for Mixed Services Co-located among Distributed Edge-Clouds.
Proceedings of the 52nd International Conference on Parallel Processing, 2023

Towards Practical Few-shot Federated NLP.
Proceedings of the 3rd Workshop on Machine Learning and Systems, 2023

FedAdapter: Efficient Federated Learning for Mobile NLP.
Proceedings of the ACM Turing Award Celebration Conference - China 2023, 2023

2022
SoC-Cluster as an Edge Server: an Application-driven Measurement Study.
CoRR, 2022

Federated NLP in Few-shot Scenarios.
CoRR, 2022

AUG-FedPrompt: Practical Few-shot Federated NLP with Data-augmented Prompts.
CoRR, 2022

Device-centric Federated Analytics At Ease.
CoRR, 2022

Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading.
CoRR, 2022

Understanding and Optimizing Deep Learning Cold-Start Latency on Edge Devices.
CoRR, 2022

AutoFedNLP: An efficient FedNLP framework.
CoRR, 2022

A Comprehensive Benchmark of Deep Learning Libraries on Mobile Devices.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Commutativity-guaranteed Docker Image Reconstruction towards Effective Layer Sharing.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

Towards Robust Intelligence in Space.
Proceedings of the IEEE Smartworld, 2022

Melon: breaking the memory wall for resource-efficient on-device machine learning.
Proceedings of the MobiSys '22: The 20th Annual International Conference on Mobile Systems, Applications and Services, Portland, Oregon, 27 June 2022, 2022

Mandheling: mixed-precision on-device DNN training with DSP offloading.
Proceedings of the ACM MobiCom '22: The 28th Annual International Conference on Mobile Computing and Networking, Sydney, NSW, Australia, October 17, 2022

Position Paper: Renovating Edge Servers with ARM SoCs.
Proceedings of the 7th IEEE/ACM Symposium on Edge Computing, 2022

2021
A Case for Camera-as-a-Service.
IEEE Pervasive Comput., 2021

Joint Placement of UPF and Edge Server for 6G Network.
IEEE Internet Things J., 2021

Autonomous Learning System Towards Mobile Intelligence.
Int. J. Softw. Informatics, 2021

Characterizing Impacts of Heterogeneity in Federated Learning upon Large-Scale Smartphone Data.
Proceedings of the WWW '21: The Web Conference 2021, 2021

Video Analytics with Zero-streaming Cameras.
Proceedings of the 2021 USENIX Annual Technical Conference, 2021

TaintStream: fine-grained taint tracking for big data platforms through dynamic code translation.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

Towards Ubiquitous Learning: A First Measurement of On-Device Training Performance.
Proceedings of the EMDL@MobiSys 2021: Proceedings of the 5th International Workshop on Embedded and Mobile Deep Learning, 2021

Boosting Mobile CNN Inference through Semantic Memory.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

From cloud to edge: a first look at public edge platforms.
Proceedings of the IMC '21: ACM Internet Measurement Conference, 2021

Tiansuan Constellation: An Open Research Platform.
Proceedings of the IEEE International Conference on Edge Computing, 2021

2020
DeepWear: Adaptive Local Offloading for On-Wearable Deep Learning.
IEEE Trans. Mob. Comput., 2020

Hierarchical Federated Learning through LAN-WAN Orchestration.
CoRR, 2020

Heterogeneity-Aware Federated Learning.
CoRR, 2020

Neural Architecture Search over Decentralized Data.
CoRR, 2020

Approximate query service on autonomous IoT cameras.
Proceedings of the MobiSys '20: The 18th Annual International Conference on Mobile Systems, 2020

A query engine for zero-streaming cameras.
Proceedings of the MobiCom '20: The 26th Annual International Conference on Mobile Computing and Networking, 2020

2019
MUIT: A Domain-Specific Language and its Middleware for Adaptive Mobile Web-Based User Interfaces in WS-BPEL.
IEEE Trans. Serv. Comput., 2019

Approximate Query Processing on Autonomous Cameras.
CoRR, 2019

Supporting Video Queries on Zero-Streaming Cameras.
CoRR, 2019

A First Look at Deep Learning Apps on Smartphones.
Proceedings of the World Wide Web Conference, 2019

2018
DeepType: On-Device Deep Learning for Input Personalization Service with Minimal Privacy Concern.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2018

PrivacyShield: A Mobile System for Supporting Subtle Just-in-time Privacy Provisioning through Off-Screen-based Touch Gestures.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2018

When Mobile Apps Going Deep: An Empirical Study of Mobile Deep Learning.
CoRR, 2018

DeepCache: Principled Cache for Mobile Deep Vision.
Proceedings of the 24th Annual International Conference on Mobile Computing and Networking, 2018

Using Touch-screen Gestures for Just-in-time Privacy Provisioning.
Proceedings of the 2018 ACM International Joint Conference and 2018 International Symposium on Pervasive and Ubiquitous Computing and Wearable Computers, 2018

Power sandbox: power awareness redefined.
Proceedings of the Thirteenth EuroSys Conference, 2018

2017
ShuffleDog: Characterizing and Adapting User-Perceived Latency of Android Apps.
IEEE Trans. Mob. Comput., 2017

Enabling Cooperative Inference of Deep Learning on Wearables and Smartphones.
CoRR, 2017

Accelerating Convolutional Neural Networks for Continuous Mobile Vision via Cache Reuse.
CoRR, 2017

AppHolmes: Detecting and Characterizing App Collusion among Third-Party Android Markets.
Proceedings of the 26th International Conference on World Wide Web, 2017

2016
MUIT: A Middleware for Adaptive Mobile Web-based User Interfaces in WS-BPEL.
CoRR, 2016


  Loading...