Connor Holmes

Orcid: 0000-0002-8314-3677

According to our database1, Connor Holmes authored at least 25 papers between 2019 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference.
CoRR, 2024

DeepSpeed Data Efficiency: Improving Deep Learning Model Quality and Training Efficiency via Efficient Data Sampling and Routing.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
An Efficient Global Optimality Certificate for Landmark-Based SLAM.
IEEE Robotics Autom. Lett., March, 2023

Safe and Smooth: Certified Continuous-Time Range-Only Localization.
IEEE Robotics Autom. Lett., 2023

DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies.
CoRR, 2023

STAR-loc: Dataset for STereo And Range-based localization.
CoRR, 2023

RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model.
CoRR, 2023

Certifiably Optimal Rotation and Pose Estimation Based on the Cayley Map.
CoRR, 2023

On Semidefinite Relaxations for Matrix-Weighted State-Estimation Problems in Robotics.
CoRR, 2023

Toward Globally Optimal State Estimation Using Automatically Tightened Semidefinite Relaxations.
CoRR, 2023

DeepSpeed-Chat: Easy, Fast and Affordable RLHF Training of ChatGPT-like Models at All Scales.
CoRR, 2023

ZeRO++: Extremely Efficient Collective Communication for Giant Model Training.
CoRR, 2023

Towards Open World NeRF-Based SLAM.
CoRR, 2023

Towards Open World NeRF-Based SLAM.
Proceedings of the 20th Conference on Robots and Vision, 2023

2022
Random-LTD: Random and Layerwise Token Dropping Brings Efficient Training for Large-scale Transformers.
CoRR, 2022

Compressing Pre-trained Transformers via Low-Bit NxM Sparsity for Natural Language Understanding.
CoRR, 2022

A Fine Line: Total Least-Squares Line Fitting as QCQP Optimization.
CoRR, 2022

DGSM: A GPU-Based Subgraph Isomorphism framework with DFS exploration.
Proceedings of the IEEE/ACM Redefining Scalability for Diversely Heterogeneous Architectures Workshop, 2022

2021
GraphZero: A High-Performance Subgraph Matching System.
ACM SIGOPS Oper. Syst. Rev., 2021

Practical Design Considerations for Performance and Robustness in the Face of Uncertain Flexible Dynamics in Space Manipulators.
Frontiers Robotics AI, 2021

ELIχR: Eliminating Computation Redundancy in CNN-Based Video Processing.
Proceedings of the IEEE/ACM Redefining Scalability for Diversely Heterogeneous Architectures Workshop, 2021

NxMTransformer: Semi-Structured Sparsification for Natural Language Understanding via ADMM.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Dryadic: Flexible and Fast Graph Pattern Matching at Scale.
Proceedings of the 30th International Conference on Parallel Architectures and Compilation Techniques, 2021

2019
GraphZero: Breaking Symmetry for Efficient Graph Mining.
CoRR, 2019

GRNN: Low-Latency and Scalable RNN Inference on GPUs.
Proceedings of the Fourteenth EuroSys Conference 2019, Dresden, Germany, March 25-28, 2019, 2019


  Loading...