Hongchao Du

Orcid: 0000-0001-7995-1834

According to our database¹, Hongchao Du authored at least 16 papers between 2019 and 2026.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book In proceedings Article PhD thesis Dataset Other

Bibliography

2026

LAPS: A Length-Aware-Prefill LLM Serving System.

[BibT_eX]

[DOI]

CoRR, January, 2026

Breaking the I/O Bottleneck: I/O Coordination Optimization for Efficient Large-Scale LLM Fine-Tuning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software, 2026

ClawMobile: Rethinking Smartphone-Native Agentic Systems.

[BibT_eX]

[DOI]

Proceedings of the Sixth European Workshop on Machine Learning and Systems, EuroMLSys 2026, 2026

2025

SolFS: An Operation-Log Versioning File System for Hash-free Efficient Mobile Cloud Backup.

[BibT_eX]

[DOI]

Proceedings of the 2025 USENIX Annual Technical Conference, 2025

Memory-Optimized Offloading: Enabling LLM Fine-Tuning on Memory-Constrained Hardware.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE Non-Volatile Memory Systems and Applications Symposium, 2025

FlexPB: Achieving Flexible and Efficient Persistent User Buffers for Mobile Systems.

[BibT_eX]

[DOI]

Proceedings of the 14th IEEE Non-Volatile Memory Systems and Applications Symposium, 2025

EvoP: Robust LLM Inference via Evolutionary Pruning.

[BibT_eX]

[DOI]

Proceedings of the Natural Language Processing and Chinese Computing, 2025

ArtMem: Adaptive Migration in Reinforcement Learning-Enabled Tiered Memory.

[BibT_eX]

[DOI]

Proceedings of the 52nd Annual International Symposium on Computer Architecture, 2025

FlexInfer: Breaking Memory Constraint via Flexible and Efficient Offloading for On-Device LLM Inference.

[BibT_eX]

[DOI]

Proceedings of the 5th Workshop on Machine Learning and Systems, 2025

2024

On the Compressibility of Quantized Large Language Models.

[BibT_eX]

[DOI]

CoRR, 2024

When Compression Meets Model Compression: Memory-Efficient Double Compression for Large Language Models.

[BibT_eX]

[DOI]

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023

Multi-Granularity Shadow Paging with NVM Write Optimization for Crash-Consistent Memory-Mapped I/O.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Symposium on High-Performance Computer Architecture, 2023

2022

A Practical Highly Paralleled ReRAM-Based DNN Accelerator by Reusing Weight Pattern Repetitions.

[BibT_eX]

[DOI]

IEEE Trans. Comput. Aided Des. Integr. Circuits Syst., 2022

2021

DAP-Sketch: An accurate and effective network measurement sketch with Deterministic Admission Policy.

[BibT_eX]

[DOI]

Comput. Networks, 2021

2020

PattPIM: A Practical ReRAM-Based DNN Accelerator by Reusing Weight Pattern Repetitions.

[BibT_eX]

[DOI]

Proceedings of the 57th ACM/IEEE Design Automation Conference, 2020

2019

Accurate Network Flow Measurement with Deterministic Admission Policy.

[BibT_eX]

[DOI]

Proceedings of the Algorithms and Architectures for Parallel Processing, 2019

Hongchao Du

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...