Daliang Xu

Orcid: 0000-0002-6775-0688

According to our database1, Daliang Xu authored at least 28 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Training With Integer-Only Arithmetic: Energy-Efficient Federated Learning With Mobile DSP Offloading.
IEEE Trans. Mob. Comput., May, 2026

NanoSpec: Accelerating Speculative Decoding using Minimalist In-Context Vocabularies.
CoRR, May, 2026

Quant.npu: Enabling Efficient Mobile NPU Inference for on-device LLMs via Fully Static Quantization.
CoRR, May, 2026

VLMCache: Efficient On-Device Vision-Language Model Inference.
Proceedings of the 24th Annual International Conference on Mobile Systems, 2026

ShadowNPU: System and Algorithm Co-design for NPU-Centric On-Device LLM Inference.
Proceedings of the 24th Annual International Conference on Mobile Systems, 2026

2025
Accelerating Mobile Language Model via Speculative Decoding and NPU-Coordinated Execution.
CoRR, October, 2025

Dynamic Sparse Attention on Mobile SoCs.
CoRR, August, 2025

MobiEdit: Resource-efficient Knowledge Editing for Personalized On-device LLMs.
CoRR, June, 2025

EdgeLLM: Fast On-Device LLM Inference With Speculative Decoding.
IEEE Trans. Mob. Comput., April, 2025

Niagara+: Scheduling Live ML Analytics Across Heterogeneous Device Processors and Edge Servers.
IEEE Trans. Serv. Comput., 2025

Elastic On-Device LLM Service.
Proceedings of the 31st Annual International Conference on Mobile Computing and Networking, 2025

Fast On-device LLM Inference with NPUs.
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

2024
Efficient, Scalable, and Sustainable DNN Training on SoC-Clustered Edge Servers.
IEEE Trans. Mob. Comput., December, 2024

ELMS: Elasticized Large Language Models On Mobile Devices.
CoRR, 2024

A Survey of Resource-efficient LLM and Multimodal Foundation Models.
CoRR, 2024

Towards Energy-efficient Federated Learning via INT8-based Training on Mobile DSPs.
Proceedings of the ACM on Web Conference 2024, 2024

PieBridge: Fast and Parameter-Efficient On-Device Training via Proxy Networks.
Proceedings of the 22nd ACM Conference on Embedded Networked Sensor Systems, 2024

WiP: Efficient LLM Prefilling with Mobile NPU.
Proceedings of the Workshop on Edge and Mobile Foundation Models, 2024

SoCFlow: Efficient and Scalable DNN Training on SoC-Clustered Edge Servers.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023
LLMCad: Fast and Scalable On-device Large Language Model Inference.
CoRR, 2023

Niagara: Scheduling DNN Inference Services on Heterogeneous Edge Processors.
Proceedings of the Service-Oriented Computing - 21st International Conference, 2023

Satellite Computing: From Space to Your Screen.
Proceedings of the Service-Oriented Computing - ICSOC 2023 Workshops - AI-PA, ASOCA, SAPD, SQS, SSCOPE, WESOACS and Satellite Events, Rome, Italy, November 28, 2023

2022
Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading.
CoRR, 2022

Mandheling: mixed-precision on-device DNN training with DSP offloading.
Proceedings of the ACM MobiCom '22: The 28th Annual International Conference on Mobile Computing and Networking, Sydney, NSW, Australia, October 17, 2022

2020
S3Library: Automatically Eliminating C/C++ Buffer Overflow using Compatible Safer Libraries.
CoRR, 2020

DangKiller: Eliminating Dangling Pointers Efficiently via Implicit Identifier.
CoRR, 2020

SMA: Eliminate Memory Spatial Errors via Saturation Memory Access.
CoRR, 2020

2019
An adaptive template matching-based single object tracking algorithm with parallel acceleration.
J. Vis. Commun. Image Represent., 2019


  Loading...