Zhi Zhang

Orcid: 0000-0003-0249-1678

Affiliations:
  • ByteDance Inc.
  • Amazon Web Services (AWS), Santa Clara, CA, USA (2018 - 2021)
  • University of Missouri, Department of Electrical Engineering and Computer Science, Columbia, MO, USA (PhD 2018)


According to our database, Zhi Zhang authored at least 37 papers between 2015 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
veScale-FSDP: Flexible and High-Performance FSDP at Scale.
CoRR, February, 2026

Each Rank Could be an Expert: Single-Ranked Mixture of Experts LoRA for Multi-task Learning.
Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.1, 2026

OmniScale: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo.
Proceedings of the Fortieth AAAI Conference on Artificial Intelligence, 2026

2025
Cannikin: No Lagger of SLO in Concurrent Multiple LoRA LLM Serving.
IEEE Trans. Parallel Distributed Syst., September, 2025

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo.
CoRR, August, 2025

Let the Code LLM Edit Itself When You Edit the Code.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024
Improving Semantic Segmentation via Efficient Self-Training.
IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs.
Proceedings of the 21st USENIX Symposium on Networked Systems Design and Implementation, 2024

SDP4Bit: Toward 4-bit Communication Quantization in Sharded Data Parallelism for LLM Training.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Self-Infilling Code Generation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2022
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

ResNeSt: Split-Attention Networks.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
SelfNorm and CrossNorm for Out-of-Distribution Robustness.
CoRR, 2021

Progressive Coordinate Transforms for Monocular 3D Object Detection.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Video Contrastive Learning with Global Context.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

CrossNorm and SelfNorm for Generalization under Distribution Shifts.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.
J. Mach. Learn. Res., 2020

A Comprehensive Study of Deep Video Action Recognition.
CoRR, 2020

Improving Semantic Segmentation via Self-Training.
CoRR, 2020

2019
Supervised Deep Feature Embedding With Handcrafted Feature.
IEEE Trans. Image Process., 2019

GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.
CoRR, 2019

Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources.
CoRR, 2019

Just-in-Time Dynamic-Batching.
CoRR, 2019

Bag of Freebies for Training Object Detection Neural Networks.
CoRR, 2019

Bag of Tricks for Image Classification with Convolutional Neural Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation.
IEEE Trans. Multim., 2018

Progressive Neural Networks for Image Classification.
CoRR, 2018

Fast Deep Neural Networks With Knowledge Guided Training and Predicted Regions of Interests for Real-Time Video Object Detection.
IEEE Access, 2018

2017
Knowledge Projection for Deep Neural Networks.
CoRR, 2017

Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation.
CoRR, 2017

Spatially supervised recurrent convolutional neural networks for visual object tracking.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2017

Rate-coverage analysis and optimization for joint audio-video multimedia retrieval.
Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, 2017

2016
Animal Detection From Highly Cluttered Natural Scenes Using Spatiotemporal Object Region Proposals and Patch Verification.
IEEE Trans. Multim., 2016

Joint Audio-Video Fingerprint Media Retrieval Using Rate-Coverage Optimization.
CoRR, 2016

Spatially Supervised Recurrent Convolutional Neural Networks for Visual Object Tracking.
CoRR, 2016

2015
Coupled ensemble graph cuts and object verification for animal segmentation from highly cluttered videos.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

