Yongchao He

According to our database1, Yongchao He authored at least 19 papers between 2019 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
HeteroSpec: Leveraging Contextual Heterogeneity for Efficient Speculative Decoding.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
L4: Low-Latency and Load-Balanced LLM Serving via Length-Aware Scheduling.
CoRR, December, 2025

A Unified Sparse Attention via Multi-Granularity Compression.
CoRR, December, 2025

SIMPLE: Disaggregating Sampling from GPU Inference into a Decision Plane for Faster Distributed LLM Serving.
CoRR, December, 2025

MegatronApp: Efficient and Comprehensive Management on Distributed LLM Training.
CoRR, July, 2025

SiPipe: Bridging the CPU-GPU Utilization Gap for Efficient Pipeline-Parallel LLM Inference.
CoRR, June, 2025

HeteroSpec: Leveraging Contextual Heterogeneity for Efficient Speculative Decoding.
CoRR, May, 2025

Secure Wireless Communication in Active RIS-Assisted DFRC Systems.
IEEE Trans. Veh. Technol., January, 2025

Reconfigurable intelligent surface-aided dual-function radar and communication systems with MU-MIMO communication.
Digit. Commun. Networks, 2025

2024
Joint Beamforming Design for Double Active RIS-Assisted Radar-Communication Coexistence Systems.
IEEE Trans. Cogn. Commun. Netw., October, 2024

2023
RateSheriff: Multipath Flow-aware and Resource Efficient Rate Limiter Placement for Data Center Networks.
Proceedings of the 31st IEEE/ACM International Symposium on Quality of Service, 2023

A Generic Service to Provide In-Network Aggregation for Key-Value Streams.
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2023

2022
Consistent and Fine-Grained Rule Update with In-Network Control for Distributed Rate Limiting.
Proceedings of the 30th IEEE/ACM International Symposium on Quality of Service, 2022

SFP: Service Function Chain Provision on Programmable Switches for Cloud Tenants.
Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium, 2022

2021
NFD: Using Behavior Models to Develop Cross-Platform Network Functions.
Proceedings of the 40th IEEE Conference on Computer Communications, 2021

Scalable On-Switch Rate Limiters for the Cloud.
Proceedings of the 40th IEEE Conference on Computer Communications, 2021

2019
Fully Functional Rate Limiter Design on Programmable Hardware Switches.
Proceedings of the ACM SIGCOMM 2019 Conference Posters and Demos, 2019

SpeedyBox: Low-Latency NFV Service Chains with Cross-NF Runtime Consolidation.
Proceedings of the 39th IEEE International Conference on Distributed Computing Systems, 2019


  Loading...