Sheng Zha

According to our database, Sheng Zha authored at least 30 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number of four.
Erdős number of three.

Bibliography

2025

Sequence-level Large Language Model Training with Contrastive Preference Optimization.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

2024

Pre-training Differentially Private Models with Limited Public Data.

Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Differentially Private Bias-Term Fine-tuning of Foundation Models.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

Revisiting SMoE Language Models by Evaluating Inefficiencies with Task Specific Expert Pruning.

Soumajyoti Sarkar

Proceedings of the NeurIPS Efficient Natural Language and Speech Processing Workshop, 2024

DEM: Distribution Edited Model for Training with Mixed Data Distributions.

Momchil Hardalov, Nikolaos Pappas

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Fine-tuning Language Models for Joint Rewriting and Completion of Code with Potential Bugs.

Proceedings of the Findings of the Association for Computational Linguistics, 2024

Extreme Miscalibration and the Illusion of Adversarial Robustness.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Zero redundancy distributed learning with differential privacy.

CoRR, 2023

On the accuracy and efficiency of group-wise clipping in differentially private optimization.

CoRR, 2023

Coupling public and private gradient provably helps optimization.

CoRR, 2023

Python Array API Standard: Toward Array Interoperability in the Scientific Python Ecosystem.

Yao-Lung L. Fang, Andreas Müller, Saul Shanabrook, Stephannie Jiménez Gacha, Mario Lezcano Casado, Alexandre Passos, Travis E. Oliphant, Consortium for Python Data API Standards

Proceedings of the 22nd Python in Science Conference, 2023

Large Language Models of Code Fail at Completing Code with Potential Bugs.

Renato Negrinho

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

HyTrel: Hypergraph-enhanced Tabular Data Representation Learning.

Soumajyoti Sarkar, Balasubramaniam Srinivasan

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Automatic Clipping: Differentially Private Deep Learning Made Easier and Stronger.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Differentially Private Optimization on Large Model at Small Cost.

Proceedings of the International Conference on Machine Learning, 2023

Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer.

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Better Context Makes Better Code Language Models: A Case Study on Function Call Argument Completion.

Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022

Parameter and Data Efficient Continual Pre-training for Robustness to Dialectal Variance in Arabic.

Soumajyoti Sarkar, Sailik Sengupta

CoRR, 2022

Differentially Private Bias-Term only Fine-tuning of Foundation Models.

CoRR, 2022

Exploring the Role of Task Transferability in Large-Scale Multi-Task Learning.

Vishakh Padmakumar, Miguel Ballesteros

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Meta-learning via Language Model In-context Tuning.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021

Distiller: A Systematic Study of Model Distillation Methods in Natural Language Processing.

Proceedings of the Second Workshop on Simple and Efficient Natural Language Processing, 2021

2020

GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.

J. Mach. Learn. Res., 2020

Accelerated Large Batch Optimization of BERT Pretraining in 54 minutes.

CoRR, 2020

2019

GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing.

CoRR, 2019

Dynamic Mini-batch SGD for Elastic Distributed Training: Learning in the Limbo of Resources.

CoRR, 2019

Just-in-Time Dynamic-Batching.

CoRR, 2019

Dive into Deep Learning for Natural Language Processing.

Alexander J. Smola

Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Unlearn Dataset Bias in Natural Language Inference by Fitting the Residual.

Proceedings of the 2nd Workshop on Deep Learning Approaches for Low-Resource NLP, 2019

2018

Question Type Guided Attention in Visual Question Answering.

Tommaso Furlanello, Animashree Anandkumar

Proceedings of the Computer Vision - ECCV 2018, 2018
