Hossein Entezari Zarch

According to our database, Hossein Entezari Zarch authored at least 9 papers between 2021 and 2025.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2025
DuetServe: Harmonizing Prefill and Decode for LLM Serving via Adaptive GPU Multiplexing.
CoRR, November, 2025

DELTA: Dynamic Layer-Aware Token Attention for Efficient Long-Context Reasoning.
CoRR, October, 2025

MARché: Fast Masked Autoregressive Image Generation with Cache-Aware Attention.
CoRR, June, 2025

DEL: Context-Aware Dynamic Exit Layer for Efficient Self-Speculative Decoding.
CoRR, April, 2025

YOLOv8 Outperforms Traditional CNN Models in Mammography Classification: Insights From a Multi-Institutional Dataset.
Int. J. Imaging Syst. Technol., January, 2025

KVPR: Efficient LLM Inference with I/O-Aware KV Cache Partial Recomputation.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Efficient LLM Inference with I/O-Aware Partial KV Cache Recomputation.
CoRR, 2024

CADC: Encoding User-Item Interactions for Compressing Recommendation Model Training Data.
CoRR, 2024

2021
Ensemble Neural Representation Networks.
CoRR, 2021
