Fahao Chen
Orcid: 0000-0002-4345-1296
According to our database1,
Fahao Chen
authored at least 21 papers
between 2021 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
Semi-Clairvoyant Scheduling of Speculative Decoding Requests to Minimize LLM Inference Latency.
CoRR, May, 2025
Peer Peer Netw. Appl., April, 2025
Corrections to "Giant Could Be Tiny: Efficient Inference of Giant Models on Resource-Constrained UAVs".
IEEE Internet Things J., March, 2025
Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management.
CoRR, January, 2025
Serving Transformer Models via Joint Requst Scheduling and Batching in the Network Edge.
IEEE Trans. Sustain. Comput., 2025
Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management.
Proceedings of the IEEE INFOCOM 2025, 2025
SPIN: Accelerating Large Language Model Inference with Heterogeneous Speculative Models.
Proceedings of the IEEE INFOCOM 2025, 2025
2024
Giant Could Be Tiny: Efficient Inference of Giant Models on Resource-Constrained UAVs.
IEEE Internet Things J., June, 2024
Non-Clairvoyant Scheduling of Distributed Machine Learning With Inter-Job and Intra-Job Parallelism on Heterogeneous GPUs.
IEEE Trans. Cloud Comput., 2024
Communication-Efficient Sparsely-Activated Model Training via Sequence Migration and Token Condensation.
CoRR, 2024
2023
DGC: Training Dynamic Graphs with Spatio-Temporal Non-Uniformity using Graph Partitioning by Chunks.
Proc. ACM Manag. Data, December, 2023
IEEE Trans. Cloud Comput., 2023
Proceedings of the 98th IEEE Vehicular Technology Conference, 2023
Proceedings of the IEEE Intl Conf on Dependable, 2023
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2023
2022
IEEE Trans. Parallel Distributed Syst., 2022
Proceedings of the 51st International Conference on Parallel Processing, 2022
Hare: Exploiting Inter-job and Intra-job Parallelism of Distributed Machine Learning on Heterogeneous GPUs.
Proceedings of the HPDC '22: The 31st International Symposium on High-Performance Parallel and Distributed Computing, Minneapolis, MN, USA, 27 June 2022, 2022
2021
Proceedings of the International Conference on Information and Communication Technologies for Disaster Management, 2021
Proceedings of the 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, 2021