Mark Zhao

Orcid: 0000-0002-0706-5208

According to our database1, Mark Zhao authored at least 25 papers between 2018 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Bayesian Active Learning with Gaussian Processes Guided by LLM Relevance Scoring for Dense Passage Retrieval.
CoRR, April, 2026

SYMI: Efficient Mixture-of-Experts Training via Model and Optimizer State Decoupling.
Proceedings of the 23rd USENIX Symposium on Networked Systems Design and Implementation, 2026

SHARD: A Compatibility Framework for Deploying Transformer Models on Edge NPUs.
Proceedings of the Sixth European Workshop on Machine Learning and Systems, EuroMLSys 2026, 2026

2025
Multimodal Item Scoring for Natural Language Recommendation via Gaussian Process Regression with LLM Relevance Judgments.
CoRR, October, 2025

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning.
CoRR, October, 2025

Strata: Hierarchical Context Caching for Long Context Language Model Serving.
CoRR, August, 2025

Accelerating Mixture-of-Experts Training with Adaptive Expert Replication.
CoRR, April, 2025

Remote Power Side- Channel Attacks on FPGAs.
IEEE Des. Test, February, 2025

Adaptive Semantic Prompt Caching with VectorQ.
CoRR, February, 2025

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

MA-DPR: Manifold-aware Distance Metrics for Dense Passage Retrieval.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

2024
cedar: Optimized and Unified Machine Learning Input Data Pipelines.
Proc. VLDB Endow., October, 2024

SlipStream: Adapting Pipelines for Distributed Training of Large DNNs Amid Failures.
CoRR, 2024

cedar: Composable and Optimized Machine Learning Input Data Pipelines.
CoRR, 2024

ReCycle: Resilient Training of Large DNNs using Pipeline Adaptation.
Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles, 2024

High-throughput and Flexible Host Networking for Accelerated Computing.
Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation, 2024

2023
Tectonic-Shift: A Composite Storage Fabric for Large-Scale ML Training.
Proceedings of the 2023 USENIX Annual Technical Conference, 2023

RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure.
Proceedings of the Sixth Conference on Machine Learning and Systems, 2023

2022
Understanding data storage and ingestion for large-scale deep recommendation model training: industrial product.
Proceedings of the ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18, 2022

ShEF: shielded enclaves for cloud FPGAs.
Proceedings of the ASPLOS '22: 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Lausanne, Switzerland, 28 February 2022, 2022

2021
Understanding and Co-designing the Data Ingestion Pipeline for Industry-Scale RecSys Training.
CoRR, 2021

Llama: A Heterogeneous & Serverless Framework for Auto-Tuning Video Analytics Pipelines.
Proceedings of the SoCC '21: ACM Symposium on Cloud Computing, 2021

2018
To Centralize or Not to Centralize: A Tale of Swarm Coordination.
CoRR, 2018

FPGA-Based Remote Power Side-Channel Attacks.
Proceedings of the 2018 IEEE Symposium on Security and Privacy, 2018

HyperFlow: A Processor Architecture for Nonmalleable, Timing-Safe Information Flow Security.
Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security, 2018


  Loading...