Sheng Zha

According to our database1, Sheng Zha authored at least 24 papers between 2018 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Pre-training Differentially Private Models with Limited Public Data.
CoRR, 2024

Extreme Miscalibration and the Illusion of Adversarial Robustness.
CoRR, 2024

2023
Zero redundancy distributed learning with differential privacy.
CoRR, 2023

On the accuracy and efficiency of group-wise clipping in differentially private optimization.
CoRR, 2023

Coupling public and private gradient provably helps optimization.
CoRR, 2023

Large Language Models of Code Fail at Completing Code with Potential Bugs.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

HyTrel: Hypergraph-enhanced Tabular Data Representation Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Automatic Clipping: Differentially Private Deep Learning Made Easier and Stronger.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Differentially Private Optimization on Large Model at Small Cost.
Proceedings of the International Conference on Machine Learning, 2023

Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Better Context Makes Better Code Language Models: A Case Study on Function Call Argument Completion.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Parameter and Data Efficient Continual Pre-training for Robustness to Dialectal Variance in Arabic.
CoRR, 2022

Differentially Private Bias-Term only Fine-tuning of Foundation Models.
CoRR, 2022

Exploring the Role of Task Transferability in Large-Scale Multi-Task Learning.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Meta-learning via Language Model In-context Tuning.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Distiller: A Systematic Study of Model Distillation Methods in Natural Language Processing.
Proceedings of the Second Workshop on Simple and Efficient Natural Language Processing, 2021

2020
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.
J. Mach. Learn. Res., 2020

Accelerated Large Batch Optimization of BERT Pretraining in 54 minutes.
CoRR, 2020

2019
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.
CoRR, 2019

Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources.
CoRR, 2019

Just-in-Time Dynamic-Batching.
CoRR, 2019

Dive into Deep Learning for Natural Language Processing.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Unlearn Dataset Bias in Natural Language Inference by Fitting the Residual.
Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP, 2019

2018
Question Type Guided Attention in Visual Question Answering.
Proceedings of the Computer Vision - ECCV 2018, 2018


  Loading...