Wei Zhang

Yuhai Tu

Nat. Mach. Intell., August, 2023

Soft Random Sampling: A Theoretical and Empirical Analysis.

CoRR, 2023

2022

Robustness to Unbounded Smoothness of Generalized SignSGD.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

Asynchronous Decentralized Distributed Training of Acoustic Models.

IEEE ACM Trans. Audio Speech Lang. Process., 2021

Loss Landscape Dependent Self-Adjusting Learning Rates in Decentralized Stochastic Gradient Descent.

CoRR, 2021

Project CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks.

CoRR, 2021

CodeNet: A Large-Scale AI for Code Dataset for Learning a Diversity of Coding Tasks.

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

4-Bit Quantization of LSTM-Based Speech Recognition Models.

Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020

Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition: A comparison of current training strategies.

IEEE Signal Process. Mag., 2020

Change Detection from Remote Sensing to Guide OpenStreetMap Labeling.

ISPRS Int. J. Geo Inf., 2020

Adam<sup>+</sup>: A Stochastic Method with Adaptive Variance Reduction.

CoRR, 2020

Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition.

CoRR, 2020

A Decentralized Parallel Algorithm for Training Generative Adversarial Nets.

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

ScaleCom: Scalable Sparsified Gradient Compression for Communication-Efficient Distributed Training.

Vijayalakshmi Srinivasan

Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Map Generation from Large Scale Incomplete and Inaccurate Data Labels.

Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

Towards Better Understanding of Adaptive Gradient Algorithms in Generative Adversarial Nets.

Proceedings of the 8th International Conference on Learning Representations, 2020

Improving Efficiency in Large-Scale Decentralized Distributed Training.

Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020

2019

Decentralized Parallel Algorithm for Training Generative Adversarial Nets.

CoRR, 2019

Hybrid 8-bit Floating Point (HFP8) Training and Inference for Deep Neural Networks.

Vijayalakshmi Srinivasan

Xiaodong Cui

Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

A Highly Efficient Distributed Deep Learning System for Automatic Speech Recognition.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Distributed Deep Learning Strategies for Automatic Speech Recognition.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2019

2018

Evolutionary Stochastic Gradient Descent for Optimization of Deep Neural Networks.

Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Asynchronous Decentralized Parallel Stochastic Gradient Descent.

Proceedings of the 35th International Conference on Machine Learning, 2018

Forecast of Solar Energy Production - A Deep Learning Approach.

Proceedings of the 2018 IEEE International Conference on Big Knowledge, 2018

2017

Can Decentralized Algorithms Outperform Centralized Algorithms? A Case Study for Decentralized Parallel Stochastic Gradient Descent.

Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Model Accuracy and Runtime Tradeoff in Distributed Deep Learning: A Systematic Study.

Suyog Gupta

Fei Wang

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017

Accelerator Design for Deep Learning Training: Extended Abstract: Invited.

Ankur Agrawal

Chia-Yu Chen

Jungwook Choi

Jinwook Oh

Sunil Shukla

Viji Srinivasan

Proceedings of the 54th Annual Design Automation Conference, 2017

2016

X10 and APGAS at Petascale.

ACM Trans. Parallel Comput., 2016

META: Middleware for Events, Transactions, and Analytics.

IBM J. Res. Dev., 2016

Staleness-Aware Async-SGD for Distributed Deep Learning.

Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016

AQuA: adaptive quality analytics.

Martin Hirzel

David Grove

Proceedings of the 10th ACM International Conference on Distributed and Event-based Systems, 2016

2015

Model Accuracy and Runtime Tradeoff in Distributed Deep Learning.

Suyog Gupta

Karthikeyan Sankaralingam

Josh Milthorpe

CoRR, 2015

Fixing, preventing, and recovering from concurrency bugs.

Sci. China Inf. Sci., 2015

2014

GLB: lifeline-based global load balancing library in X10.

Proceedings of the first workshop on Parallel programming for analytics applications, 2014

2013

ConMem: Detecting Crash-Triggering Concurrency Bugs through an Effect-Oriented Approach.

ACM Trans. Softw. Eng. Methodol., 2013

Efficient concurrency-bug detection across inputs.

Dongdong Deng

Karthikeyan Sankaralingam

Shan Lu

Proceedings of the 2013 ACM SIGPLAN International Conference on Object Oriented Programming Systems Languages & Applications, 2013

ConAir: featherweight concurrency bug recovery via single-threaded idempotent execution.

Proceedings of the Architectural Support for Programming Languages and Operating Systems, 2013

2012

Automated Concurrency-Bug Fixing.

Guoliang Jin

Dongdong Deng

Proceedings of the 10th USENIX Symposium on Operating Systems Design and Implementation, 2012

Understanding the Interleaving-Space Overlap across Inputs and Software Versions.

Proceedings of the 4th USENIX Workshop on Hot Topics in Parallelism, 2012

2011

Automated atomicity-violation fixing.

Proceedings of the 32nd ACM SIGPLAN Conference on Programming Language Design and Implementation, 2011

ConSeq: detecting concurrency bugs through sequential errors.

Proceedings of the 16th International Conference on Architectural Support for Programming Languages and Operating Systems, 2011

2010

Applying scalable phonetic context similarity in unit selection of concatenative text-to-speech.

Xiaodong Cui

Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

ConMem: detecting severe concurrency bugs through an effect-oriented approach.
