Zangwei Zheng

According to our database, Zangwei Zheng authored at least 19 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers.
CoRR, 2024

Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization.
CoRR, 2024

OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models.
CoRR, 2024

2023
Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline.
CoRR, 2023

Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models.
CoRR, 2023

InfoBatch: Lossless Training Speed Up by Unbiased Dynamic Data Pruning.
CoRR, 2023

Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Study on Transformer Configuration and Training Objective.
Proceedings of the International Conference on Machine Learning, 2023

Preventing Zero-Shot Transfer Degradation in Continual Learning of Vision-Language Models.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

CAME: Confidence-guided Adaptive Memory Efficient Optimization.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

CowClip: Reducing CTR Prediction Model Training Time from 12 Hours to 10 Minutes on 1 GPU.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Prompt Vision Transformer for Domain Generalization.
CoRR, 2022

Deeper vs Wider: A Revisit of Transformer Configuration.
CoRR, 2022

CowClip: Reducing CTR Prediction Model Training Time from 12 hours to 10 minutes on 1 GPU.
CoRR, 2022

2021
Multi-source Few-shot Domain Adaptation.
CoRR, 2021

Sparse-MLP: A Fully-MLP Architecture with Conditional Computation.
CoRR, 2021

Scene-aware Learning Network for Radar Object Detection.
Proceedings of the ICMR '21: International Conference on Multimedia Retrieval, 2021

Prototypical Cross-Domain Self-Supervised Learning for Few-Shot Unsupervised Domain Adaptation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
