We stand with Ukraine

Daliang Xu

Orcid: 0000-0002-6775-0688

According to our database¹, Daliang Xu authored at least 28 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2026

Training With Integer-Only Arithmetic: Energy-Efficient Federated Learning With Mobile DSP Offloading.

Shangguang Wang

IEEE Trans. Mob. Comput., May, 2026

NanoSpec: Accelerating Speculative Decoding using Minimalist In-Context Vocabularies.

CoRR, May, 2026

Quant.npu: Enabling Efficient Mobile NPU Inference for on-device LLMs via Fully Static Quantization.

CoRR, May, 2026

VLMCache: Efficient On-Device Vision-Language Model Inference.

Proceedings of the 24th Annual International Conference on Mobile Systems, 2026

ShadowNPU: System and Algorithm Co-design for NPU-Centric On-Device LLM Inference.

Proceedings of the 24th Annual International Conference on Mobile Systems, 2026

2025

Accelerating Mobile Language Model via Speculative Decoding and NPU-Coordinated Execution.

Shangguang Wang

CoRR, October, 2025

Dynamic Sparse Attention on Mobile SoCs.

CoRR, August, 2025

MobiEdit: Resource-efficient Knowledge Editing for Personalized On-device LLMs.

Shangguang Wang

CoRR, June, 2025

EdgeLLM: Fast On-Device LLM Inference With Speculative Decoding.

IEEE Trans. Mob. Comput., April, 2025

Niagara+: Scheduling Live ML Analytics Across Heterogeneous Device Processors and Edge Servers.

Shangguang Wang

IEEE Trans. Serv. Comput., 2025

Elastic On-Device LLM Service.

Proceedings of the 31st Annual International Conference on Mobile Computing and Networking, 2025

Fast On-device LLM Inference with NPUs.

Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2025

2024

Efficient, Scalable, and Sustainable DNN Training on SoC-Clustered Edge Servers.

IEEE Trans. Mob. Comput., December, 2024

ELMS: Elasticized Large Language Models On Mobile Devices.

CoRR, 2024

A Survey of Resource-efficient LLM and Multimodal Foundation Models.

Shangguang Wang

CoRR, 2024

Towards Energy-efficient Federated Learning via INT8-based Training on Mobile DSPs.

Shangguang Wang

Proceedings of the ACM on Web Conference 2024, 2024

PieBridge: Fast and Parameter-Efficient On-Device Training via Proxy Networks.

Proceedings of the 22nd ACM Conference on Embedded Networked Sensor Systems, 2024

WiP: Efficient LLM Prefilling with Mobile NPU.

Proceedings of the Workshop on Edge and Mobile Foundation Models, 2024

SoCFlow: Efficient and Scalable DNN Training on SoC-Clustered Edge Servers.

Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

2023

LLMCad: Fast and Scalable On-device Large Language Model Inference.

CoRR, 2023

Niagara: Scheduling DNN Inference Services on Heterogeneous Edge Processors.

Shangguang Wang

Proceedings of the Service-Oriented Computing - 21st International Conference, 2023

Satellite Computing: From Space to Your Screen.

Proceedings of the Service-Oriented Computing - ICSOC 2023 Workshops - AI-PA, ASOCA, SAPD, SQS, SSCOPE, WESOACS and Satellite Events, Rome, Italy, November 28, 2023

2022

Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading.

Shangguang Wang

CoRR, 2022

Mandheling: mixed-precision on-device DNN training with DSP offloading.

Shangguang Wang

Proceedings of the ACM MobiCom '22: The 28th Annual International Conference on Mobile Computing and Networking, Sydney, NSW, Australia, October 17, 2022

2020

S3Library: Automatically Eliminating C/C++ Buffer Overflow using Compatible Safer Libraries.

CoRR, 2020

DangKiller: Eliminating Dangling Pointers Efficiently via Implicit Identifier.

CoRR, 2020

SMA: Eliminate Memory Spatial Errors via Saturation Memory Access.

CoRR, 2020

2019

An adaptive template matching-based single object tracking algorithm with parallel acceleration.

J. Vis. Commun. Image Represent., 2019
