Michael Wyatt

Orcid: 0000-0003-2860-6859

According to our database1, Michael Wyatt authored at least 9 papers between 2023 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Training a Large Language Model for Medical Coding Using Privacy-Preserving Synthetic Clinical Data.
CoRR, March, 2026

Shift Parallelism: Low-Latency, High-Throughput LLM Inference for Dynamic Workloads.
Proceedings of the 31st ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2026

2025
Arctic Inference with Shift Parallelism: Fast and Efficient Open Source Inference System for Enterprise AI.
CoRR, July, 2025

Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences.
CoRR, June, 2025

2024
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design.
CoRR, 2024

DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference.
CoRR, 2024

Quant-LLM: Accelerating the Serving of Large Language Models via FP6-Centric Algorithm-System Co-Design on Modern GPUs.
Proceedings of the 2024 USENIX Annual Technical Conference, 2024

2023
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks.
CoRR, 2023

DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales.
CoRR, 2023


  Loading...