Hongchao Du

Orcid: 0000-0001-7995-1834

According to our database1, Hongchao Du authored at least 16 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
LAPS: A Length-Aware-Prefill LLM Serving System.
CoRR, January, 2026

Breaking the I/O Bottleneck: I/O Coordination Optimization for Efficient Large-Scale LLM Fine-Tuning.
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2026

ClawMobile: Rethinking Smartphone-Native Agentic Systems.
Proceedings of the Sixth European Workshop on Machine Learning and Systems, EuroMLSys 2026, 2026

2025
SolFS: An Operation-Log Versioning File System for Hash-free Efficient Mobile Cloud Backup.
Proceedings of the 2025 USENIX Annual Technical Conference, 2025

Memory-Optimized Offloading: Enabling LLM Fine-Tuning on Memory-Constrained Hardware.
Proceedings of the 14th IEEE Non-Volatile Memory Systems and Applications Symposium, 2025

FlexPB: Achieving Flexible and Efficient Persistent User Buffers for Mobile Systems.
Proceedings of the 14th IEEE Non-Volatile Memory Systems and Applications Symposium, 2025

EvoP: Robust LLM Inference via Evolutionary Pruning.
Proceedings of the Natural Language Processing and Chinese Computing, 2025

ArtMem: Adaptive Migration in Reinforcement Learning-Enabled Tiered Memory.
Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

FlexInfer: Breaking Memory Constraint via Flexible and Efficient Offloading for On-Device LLM Inference.
Proceedings of the 5th Workshop on Machine Learning and Systems, 2025

2024
On the Compressibility of Quantized Large Language Models.
CoRR, 2024

When Compression Meets Model Compression: Memory-Efficient Double Compression for Large Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Multi-Granularity Shadow Paging with NVM Write Optimization for Crash-Consistent Memory-Mapped I/O.
Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2023

2022
A Practical Highly Paralleled ReRAM-Based DNN Accelerator by Reusing Weight Pattern Repetitions.
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

2021
DAP-Sketch: An accurate and effective network measurement sketch with Deterministic Admission Policy.
Comput. Networks, 2021

2020
PattPIM: A Practical ReRAM-Based DNN Accelerator by Reusing Weight Pattern Repetitions.
Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

2019
Accurate Network Flow Measurement with Deterministic Admission Policy.
Proceedings of the Algorithms and Architectures for Parallel Processing, 2019


  Loading...