Yunxin Liu

ORCID: 0000-0001-7352-8955

Affiliations:
  • Tsinghua University, Beijing, China
  • Microsoft Research Asia, China (former)


According to our database, Yunxin Liu authored at least 156 papers between 2009 and 2025.

Bibliography

2025
Serving MoE Models on Resource-Constrained Edge Devices via Dynamic Expert Swapping.
IEEE Trans. Computers, August, 2025

Fine-Grained Structured Sparse Computing for FPGA-Based AI Inference.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., July, 2025

Hijacking JARVIS: Benchmarking Mobile GUI Agents against Unprivileged Third Parties.
CoRR, July, 2025

Mobile-Bench-v2: A More Realistic and Comprehensive Benchmark for VLM-based Mobile Agents.
CoRR, May, 2025

LLM-Explorer: Towards Efficient and Affordable LLM-based Exploration for Mobile Apps.
CoRR, May, 2025

AdaEvo: Edge-Assisted Continuous and Timely DNN Model Evolution for Mobile Devices.
IEEE Trans. Mob. Comput., April, 2025

Scaling Up On-Device LLMs via Active-Weight Swapping Between DRAM and Flash.
CoRR, April, 2025

AdaWiFi, Collaborative WiFi Sensing for Cross-Environment Adaptation.
IEEE Trans. Mob. Comput., February, 2025

DSTC: Dual-Side Sparse Tensor Core for DNNs Acceleration on Modern GPU Architectures.
IEEE Trans. Computers, February, 2025

A goal-oriented document-grounded dialogue based on evidence generation.
Data Knowl. Eng., 2025

Region-based Content Enhancement for Efficient Video Analytics at the Edge.
Proceedings of the 22nd USENIX Symposium on Networked Systems Design and Implementation, 2025

Empower Vision Applications with LoRA LMM.
Proceedings of the Twentieth European Conference on Computer Systems, 2025

2024
Seamless Cross-Edge Service Migration for Real-Time Rendering Applications.
IEEE Trans. Mob. Comput., June, 2024

HiMoDepth: Efficient Training-Free High-Resolution On-Device Depth Perception.
IEEE Trans. Mob. Comput., May, 2024

FLASH: Heterogeneity-Aware Federated Learning at Scale.
IEEE Trans. Mob. Comput., January, 2024

TIM: Enabling Large-Scale White-Box Testing on In-App Deep Learning Models.
IEEE Trans. Inf. Forensics Secur., 2024

AutoDroid-V2: Boosting SLM-based GUI Agents via Code Generation.
CoRR, 2024

MobiFuse: A High-Precision On-device Depth Perception System with Multi-Data Fusion.
CoRR, 2024

V-LoRA: An Efficient and Flexible System Boosts Vision Applications with LoRA LMM.
CoRR, 2024

LoRA-Switch: Boosting the Efficiency of Dynamic LLM Adapters via System-Algorithm Co-design.
CoRR, 2024

A Survey of Resource-efficient LLM and Multimodal Foundation Models.
CoRR, 2024

Personal LLM Agents: Insights and Survey about the Capability, Efficiency and Security.
CoRR, 2024

Empowering In-Browser Deep Learning Inference on Edge Through Just-In-Time Kernel Optimization.
Proceedings of the 22nd Annual International Conference on Mobile Systems, 2024

Poster: Enabling Agent-centric Interaction on Smartphones with LLM-based UI Reassembling.
Proceedings of the 22nd Annual International Conference on Mobile Systems, 2024

FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices.
Proceedings of the 30th Annual International Conference on Mobile Computing and Networking, 2024

AutoDroid: LLM-powered Task Automation in Android.
Proceedings of the 30th Annual International Conference on Mobile Computing and Networking, 2024

A First Look At Efficient And Secure On-Device LLM Inference Against KV Leakage.
Proceedings of the 19th Workshop on Mobility in the Evolving Internet Architecture, 2024

BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge.
Proceedings of the IEEE INFOCOM 2024, 2024

WiP: An On-device LLM-based Approach to Query Privacy Protection.
Proceedings of the Workshop on Edge and Mobile Foundation Models, 2024

Amanda: Unified Instrumentation Framework for Deep Neural Networks.
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2024

SwapMoE: Serving Off-the-shelf MoE-based Large Language Models with Tunable Memory Budget.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
LEAP: TrustZone Based Developer-Friendly TEE for Intelligent Mobile Apps.
IEEE Trans. Mob. Comput., December, 2023

S-UbiTap: Leveraging Acoustic Dispersion for Ubiquitous and Scalable Touch Interface on Solid Surfaces.
IEEE Trans. Mob. Comput., November, 2023

MVPose: Realtime Multi-Person Pose Estimation Using Motion Vector on Mobile Devices.
IEEE Trans. Mob. Comput., June, 2023

DAPPER: Label-Free Performance Estimation after Personalization for Heterogeneous Mobile Sensing.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2023

Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations.
CoRR, 2023

Empowering LLM to use Smartphone for Intelligent Task Automation.
CoRR, 2023

Serving MoE Models on Resource-constrained Edge Devices via Dynamic Expert Swapping.
CoRR, 2023

Generative Model for Models: Rapid DNN Customization for Diverse Tasks and Resource Constraints.
CoRR, 2023

AIGC Empowering Telecom Sector White Paper_chinese.
CoRR, 2023

6G Network Business Support System.
CoRR, 2023

6G Network Operation Support System.
CoRR, 2023

AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge Environments.
CoRR, 2023

LUT-NN: Towards Unified Neural Network Inference by Table Lookup.
CoRR, 2023

NN-Stretch: Automatic Neural Network Branching for Parallel Inference on Heterogeneous Multi-Processors.
Proceedings of the 21st Annual International Conference on Mobile Systems, 2023

AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge Environments.
Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, 2023

LUT-NN: Empower Efficient Neural Network Inference with Centroid Learning and Table Lookup.
Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, 2023

A Proposal-Improved Few-Shot Embedding Model with Contrastive Learning.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023

PatchBackdoor: Backdoor Attack against Deep Neural Networks without Model Modification.
Proceedings of the 31st ACM International Conference on Multimedia, 2023

Retrieval-based Battery Degradation Prediction for Battery Energy Storage System Operations.
Proceedings of the 2023 IEEE International Conferences on Internet of Things (iThings) and IEEE Green Computing & Communications (GreenCom) and IEEE Cyber, 2023

OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization.
Proceedings of the 50th Annual International Symposium on Computer Architecture, 2023

The First Decade of Computing and Network Convergence.
Proceedings of the IEEE International Conference on Communications, 2023

FedTherapist: Mental Health Monitoring with User-Generated Linguistic Expressions on Smartphones via Federated Learning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
Characterizing Embedded Web Browsing in Mobile Apps.
IEEE Trans. Mob. Comput., 2022

Model Protection: Real-Time Privacy-Preserving Inference Service for Model Privacy at the Edge.
IEEE Trans. Dependable Secur. Comput., 2022

A Cloud-Edge Collaboration Framework for Cognitive Service.
IEEE Trans. Cloud Comput., 2022

WheelLoc: Practical and Accurate Localization for Wheeled Mobile Targets via Integrated Sensing and Communication.
IEEE J. Sel. Areas Commun., 2022

Automation Slicing and Testing for in-App Deep Learning Models.
CoRR, 2022

Sample Selection with Deadline Control for Efficient Federated Learning on Heterogeneous Clients.
CoRR, 2022

Brisk-Yolo: A Lightweight Object Detection Algorithm for Edge Devices.
Proceedings of the IEEE Smartworld, 2022

Hyperion: A Generic and Distributed Mobile Offloading Framework on OpenCL.
Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems, 2022

TailorFL: Dual-Personalized Federated Learning under System and Data Heterogeneity.
Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems, 2022

Melon: breaking the memory wall for resource-efficient on-device machine learning.
Proceedings of the MobiSys '22: The 20th Annual International Conference on Mobile Systems, Applications and Services, Portland, Oregon, 27 June 2022, 2022

FedBalancer: data and pace control for efficient federated learning on heterogeneous clients.
Proceedings of the MobiSys '22: The 20th Annual International Conference on Mobile Systems, Applications and Services, Portland, Oregon, 27 June 2022, 2022

CoDL: efficient CPU-GPU co-execution for deep learning inference on mobile devices.
Proceedings of the MobiSys '22: The 20th Annual International Conference on Mobile Systems, Applications and Services, Portland, Oregon, 27 June 2022, 2022

MobiDepth: real-time depth estimation using on-device dual cameras.
Proceedings of the ACM MobiCom '22: The 28th Annual International Conference on Mobile Computing and Networking, Sydney, NSW, Australia, October 17, 2022

Romou: rapidly generate high-performance tensor kernels for mobile GPUs.
Proceedings of the ACM MobiCom '22: The 28th Annual International Conference on Mobile Computing and Networking, Sydney, NSW, Australia, October 17, 2022

ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization.
Proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture, 2022

Representational Continuity for Unsupervised Continual Learning.
Proceedings of the Tenth International Conference on Learning Representations, 2022

SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training.
Proceedings of the IEEE 40th International Conference on Computer Design, 2022

2021
Efficient Data Loader for Fast Sampling-Based GNN Training on Large Graphs.
IEEE Trans. Parallel Distributed Syst., 2021

Operating Systems for Resource-adaptive Intelligent Software: Challenges and Opportunities.
ACM Trans. Internet Techn., 2021

EPASS360: QoE-Aware 360-Degree Video Streaming Over Mobile Devices.
IEEE Trans. Mob. Comput., 2021

S2Net: Preserving Privacy in Smart Home Routers.
IEEE Trans. Dependable Secur. Comput., 2021

nn-METER: Towards Accurate Latency Prediction of DNN Inference on Diverse Edge Devices.
GetMobile Mob. Comput. Commun., 2021

A Case for Camera-as-a-Service.
IEEE Pervasive Comput., 2021

DAPPER: Performance Estimation of Domain Adaptation in Mobile Sensing.
CoRR, 2021

Rethinking the Representational Continuity: Towards Unsupervised Continual Learning.
CoRR, 2021

App Developer Centric Trusted Execution Environment.
CoRR, 2021

Characterizing Impacts of Heterogeneity in Federated Learning upon Large-Scale Smartphone Data.
Proceedings of the WWW '21: The Web Conference 2021, 2021

TaintStream: fine-grained taint tracking for big data platforms through dynamic code translation.
Proceedings of the ESEC/FSE '21: 29th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, 2021

nn-Meter: towards accurate latency prediction of deep-learning model inference on diverse edge devices.
Proceedings of the MobiSys '21: The 19th Annual International Conference on Mobile Systems, Applications, and Services, Virtual Event, Wisconsin, USA, 24 June, 2021

ParallelFusion: Towards Maximum Utilization of Mobile GPU for DNN Inference.
Proceedings of the EMDL@MobiSys 2021: Proceedings of the 5th International Workshop on Embedded and Mobile Deep Learning, 2021

Towards Ubiquitous Learning: A First Measurement of On-Device Training Performance.
Proceedings of the EMDL@MobiSys 2021: Proceedings of the 5th International Workshop on Embedded and Mobile Deep Learning, 2021

Elf: accelerate high-resolution mobile deep vision with content-aware parallel offloading.
Proceedings of the ACM MobiCom '21: The 27th Annual International Conference on Mobile Computing and Networking, 2021

PECAM: privacy-enhanced video streaming and analytics via securely-reversible transformation.
Proceedings of the ACM MobiCom '21: The 27th Annual International Conference on Mobile Computing and Networking, 2021

AsyMo: scalable and efficient deep-learning inference on asymmetric mobile CPUs.
Proceedings of the ACM MobiCom '21: The 27th Annual International Conference on Mobile Computing and Networking, 2021

Flexible high-resolution object detection on edge devices with tunable latency.
Proceedings of the ACM MobiCom '21: The 27th Annual International Conference on Mobile Computing and Networking, 2021

Boosting Mobile CNN Inference through Semantic Memory.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

To Bridge Neural Network Design and Real-World Performance: A Behaviour Study for Neural Networks.
Proceedings of the Fourth Conference on Machine Learning and Systems, 2021

Dual-side Sparse Tensor Core.
Proceedings of the 48th ACM/IEEE Annual International Symposium on Computer Architecture, 2021

2020
Distributed fine-tuning of CNNs for image retrieval on multiple mobile devices.
Pervasive Mob. Comput., 2020

TapSnoop: Leveraging Tap Sounds to Infer Tapstrokes on Touchscreen Devices.
IEEE Access, 2020

MobiPose: real-time multi-person pose estimation on mobile devices.
Proceedings of the SenSys '20: The 18th ACM Conference on Embedded Networked Sensor Systems, 2020

Approximate query service on autonomous IoT cameras.
Proceedings of the MobiSys '20: The 18th Annual International Conference on Mobile Systems, 2020

EMO: real-time emotion recognition from single-eye images for resource-constrained eyewear devices.
Proceedings of the MobiSys '20: The 18th Annual International Conference on Mobile Systems, 2020

A query engine for zero-streaming cameras.
Proceedings of the MobiCom '20: The 26th Annual International Conference on Mobile Computing and Networking, 2020

SCYLLA: QoE-aware Continuous Mobile Vision with FPGA-based Dynamic Deep Neural Network Reconfiguration.
Proceedings of the 39th IEEE Conference on Computer Communications, 2020

Fast Hardware-Aware Neural Architecture Search.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

PaGraph: Scaling GNN training on large graphs via computation-aware caching.
Proceedings of the SoCC '20: ACM Symposium on Cloud Computing, 2020

Profiling and optimizing deep learning inference on mobile GPUs.
Proceedings of the APSys '20: 11th ACM SIGOPS Asia-Pacific Workshop on Systems, 2020

2019
Hardware-aware One-Shot Neural Architecture Search in Coordinate Ascent Framework.
CoRR, 2019

Approximate Query Processing on Autonomous Cameras.
CoRR, 2019

Supporting Video Queries on Zero-Streaming Cameras.
CoRR, 2019

A First Look at Deep Learning Apps on Smartphones.
Proceedings of the World Wide Web Conference, 2019

secGAN: A Cycle-Consistent GAN for Securely-Recoverable Video Transformation.
Proceedings of the 2019 Workshop on Hot Topics in Video Analytics and Intelligent Edges, 2019

Live Video Analytics with FPGA-based Smart Cameras.
Proceedings of the 2019 Workshop on Hot Topics in Video Analytics and Intelligent Edges, 2019

Occlumency: Privacy-preserving Remote Deep-learning Inference Using SGX.
Proceedings of the 25th Annual International Conference on Mobile Computing and Networking, 2019

HotEdgeVideo'19: Workshop on Hot Topics in Video Analytics and Intelligent Edges.
Proceedings of the 25th Annual International Conference on Mobile Computing and Networking, 2019

Characterizing and orchestrating NFV-ready servers for efficient edge data processing.
Proceedings of the International Symposium on Quality of Service, 2019

DRL360: 360-degree Video Streaming with Deep Reinforcement Learning.
Proceedings of the 2019 IEEE Conference on Computer Communications, 2019

Efficient and Effective Sparse LSTM on FPGA with Bank-Balanced Sparsity.
Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, 2019

Collaborative learning between cloud and end devices: an empirical study on location prediction.
Proceedings of the 4th ACM/IEEE Symposium on Edge Computing, 2019

SeerNet: Predicting Convolutional Neural Network Feature-Map Sparsity Through Low-Bit Quantization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
i-Jacob: An Internetware-Oriented Approach to Optimizing Computation-Intensive Mobile Web Browsing.
ACM Trans. Internet Techn., 2018

Characterizing Privacy Risks of Mobile Apps with Sensitivity Analysis.
IEEE Trans. Mob. Comput., 2018

A Tale of Two Fashions: An Empirical Study on the Performance of Native Apps and Web Apps on Android.
IEEE Trans. Mob. Comput., 2018

When Mobile Apps Going Deep: An Empirical Study of Mobile Deep Learning.
CoRR, 2018

Behavior Recognition Based on Wi-Fi CSI: Part 2.
IEEE Commun. Mag., 2018

Aladdin: Automating Release of Deep-Link APIs on Android.
Proceedings of the 2018 World Wide Web Conference on World Wide Web, 2018

UbiTap: Leveraging Acoustic Dispersion for Ubiquitous Touch Interface on Solid Surfaces.
Proceedings of the 16th ACM Conference on Embedded Networked Sensor Systems, SenSys 2018, 2018

Cutting the Cord: Designing a High-quality Untethered VR System with Low Latency Remote Rendering.
Proceedings of the 16th Annual International Conference on Mobile Systems, 2018

DeepCache: Principled Cache for Mobile Deep Vision.
Proceedings of the 24th Annual International Conference on Mobile Computing and Networking, 2018

2017
SWAROVsky: Optimizing Resource Loading for Mobile Web Browsing.
IEEE Trans. Mob. Comput., 2017

ReWAP: Reducing Redundant Transfers for Mobile Web Browsing via App-Specific Resource Packaging.
IEEE Trans. Mob. Comput., 2017

ShuffleDog: Characterizing and Adapting User-Perceived Latency of Android Apps.
IEEE Trans. Mob. Comput., 2017

Accelerating Convolutional Neural Networks for Continuous Mobile Vision via Cache Reuse.
CoRR, 2017

Behavior Recognition Based on Wi-Fi CSI: Part 1.
IEEE Commun. Mag., 2017

AppHolmes: Detecting and Characterizing App Collusion among Third-Party Android Markets.
Proceedings of the 26th International Conference on World Wide Web, 2017

Systematically testing background services of mobile apps.
Proceedings of the 32nd IEEE/ACM International Conference on Automated Software Engineering, 2017

Enabling accurate and efficient modeling-based CPU power estimation for smartphones.
Proceedings of the 25th IEEE/ACM International Symposium on Quality of Service, 2017

Latency-based WiFi congestion control in the air for dense WiFi networks.
Proceedings of the 25th IEEE/ACM International Symposium on Quality of Service, 2017

Aladdin: automating release of Android deep links to in-app content.
Proceedings of the 39th International Conference on Software Engineering, 2017

On Building a Programmable Wireless High-Quality Virtual Reality System Using Commodity Hardware.
Proceedings of the 8th Asia-Pacific Workshop on Systems, Mumbai, India, September 2, 2017, 2017

BikeLoc: a Real-time High-Precision Bicycle Localization System Using Synthetic Aperture Radar.
Proceedings of the First Asia-Pacific Workshop on Networking, 2017

2016
Demystifying the Imperfect Client-Side Cache Performance of Mobile Web Browsing.
IEEE Trans. Mob. Comput., 2016

Rethinking Energy-Performance Trade-Off in Mobile Web Page Loading.
GetMobile Mob. Comput. Commun., 2016

ReWAP: Reducing Redundant Transfers for Mobile Web Applications via App-Specific Resource Packaging.
CoRR, 2016

Poster: TapSnoop - Inferring Tapstrokes from Listening to Tap Sound on Mobile Devices.
Proceedings of the 14th Annual International Conference on Mobile Systems, 2016

AMIL: Localizing neighboring mobile devices through a simple gesture.
Proceedings of the IEEE Conference on Computer Communications Workshops, 2016

Smart and Secure: Preserving Privacy in Untrusted Home Routers.
Proceedings of the 7th ACM SIGOPS Asia-Pacific Workshop on Systems, 2016

2015
Data-Driven Composition for Service-Oriented Situational Web Applications.
IEEE Trans. Serv. Comput., 2015

Measurement and Analysis of Mobile Web Cache Performance.
Proceedings of the 24th International Conference on World Wide Web, 2015

Rethinking Energy-Performance Trade-Off in Mobile Web Page Loading.
Proceedings of the 21st Annual International Conference on Mobile Computing and Networking, 2015

Mash Droid: An Approach to Mobile-Oriented Dynamic Services Discovery and Composition by In-App Search.
Proceedings of the 2015 IEEE International Conference on Web Services, 2015

Characterizing RESTful Web Services Usage on Smartphones: A Tale of Native Apps and Web Apps.
Proceedings of the 2015 IEEE International Conference on Web Services, 2015

2014
Design, Realization, and Evaluation of DozyAP for Power-Efficient Wi-Fi Tethering.
IEEE/ACM Trans. Netw., 2014

2013
Towards better CPU power management on multicore smartphones.
Proceedings of the Workshop on Power-Aware Computing and Systems, 2013

V-edge: Fast Self-constructive Power Modeling of Smartphones Based on Battery Voltage Dynamics.
Proceedings of the 10th USENIX Symposium on Networked Systems Design and Implementation, 2013

Optimizing background email sync on smartphones.
Proceedings of the 11th Annual International Conference on Mobile Systems, 2013

MoodScope: building a mood sensor from smartphone usage patterns.
Proceedings of the 11th Annual International Conference on Mobile Systems, 2013

AppMobiCloud: improving mobile web applications by mobile-cloud convergence.
Proceedings of the 5th Asia-Pacific Symposium on Internetware, 2013

2012
DozyAP: power-efficient Wi-Fi tethering.
Proceedings of the 10th International Conference on Mobile Systems, 2012

2010
Design, Realization, and Evaluation of xShare for Impromptu Sharing of Mobile Phones.
IEEE Trans. Mob. Comput., 2010

2009
xShare: supporting impromptu sharing of mobile phones.
Proceedings of the 7th International Conference on Mobile Systems, 2009

