Fahao Chen

Orcid: 0000-0002-4345-1296

According to our database1, Fahao Chen authored at least 21 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Semi-Clairvoyant Scheduling of Speculative Decoding Requests to Minimize LLM Inference Latency.
CoRR, May, 2025

Internet of Agents: Fundamentals, Applications, and Challenges.
CoRR, May, 2025

Efficient multi-job federated learning scheduling with fault tolerance.
Peer Peer Netw. Appl., April, 2025

Corrections to "Giant Could Be Tiny: Efficient Inference of Giant Models on Resource-Constrained UAVs".
IEEE Internet Things J., March, 2025

Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management.
CoRR, January, 2025

Serving Transformer Models via Joint Requst Scheduling and Batching in the Network Edge.
IEEE Trans. Sustain. Comput., 2025

Mell: Memory-Efficient Large Language Model Serving via Multi-GPU KV Cache Management.
Proceedings of the IEEE INFOCOM 2025, 2025

SPIN: Accelerating Large Language Model Inference with Heterogeneous Speculative Models.
Proceedings of the IEEE INFOCOM 2025, 2025

2024
Giant Could Be Tiny: Efficient Inference of Giant Models on Resource-Constrained UAVs.
IEEE Internet Things J., June, 2024

Non-Clairvoyant Scheduling of Distributed Machine Learning With Inter-Job and Intra-Job Parallelism on Heterogeneous GPUs.
IEEE Trans. Cloud Comput., 2024

Communication-Efficient Sparsely-Activated Model Training via Sequence Migration and Token Condensation.
CoRR, 2024

2023
DGC: Training Dynamic Graphs with Spatio-Temporal Non-Uniformity using Graph Partitioning by Chunks.
Proc. ACM Manag. Data, December, 2023

Edge-Assisted Short Video Sharing With Guaranteed Quality-of-Experience.
IEEE Trans. Cloud Comput., 2023

Low-Latency Perception Sharing Services for Connected Autonomous Vehicles.
Proceedings of the 98th IEEE Vehicular Technology Conference, 2023

Efficient Scheduling for Multi-Job Federated Learning Systems with Client Sharing.
Proceedings of the IEEE Intl Conf on Dependable, 2023

Efficient Giant Graph Unlearning via Push-Pull Tuning.
Proceedings of the IEEE Intl Conf on Parallel & Distributed Processing with Applications, 2023

2022
FedGraph: Federated Graph Learning With Intelligent Sampling.
IEEE Trans. Parallel Distributed Syst., 2022

TCB: Accelerating Transformer Inference Services with Request Concatenation.
Proceedings of the 51st International Conference on Parallel Processing, 2022

Hare: Exploiting Inter-job and Intra-job Parallelism of Distributed Machine Learning on Heterogeneous GPUs.
Proceedings of the HPDC '22: The 31st International Symposium on High-Performance Parallel and Distributed Computing, Minneapolis, MN, USA, 27 June 2022, 2022

2021
In-Network Aggregation for Privacy-Preserving Federated Learning.
Proceedings of the International Conference on Information and Communication Technologies for Disaster Management, 2021

On-Camera Content Filtering for Real-Time Video Analytics.
Proceedings of the 2021 IEEE 23rd Int Conf on High Performance Computing & Communications; 7th Int Conf on Data Science & Systems; 19th Int Conf on Smart City; 7th Int Conf on Dependability in Sensor, 2021


  Loading...