Oleksii Kuchaiev

According to our database, Oleksii Kuchaiev authored at least 26 papers between 2009 and 2024.

Bibliography

2024
Nemotron-4 15B Technical Report.
CoRR, 2024

2023
Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying.
CoRR, 2023

HelpSteer: Multi-attribute Helpfulness Dataset for SteerLM.
CoRR, 2023

Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Leveraging Synthetic Targets for Machine Translation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Finding the Right Recipe for Low Resource Domain Adaptation in Neural Machine Translation.
CoRR, 2022

NVIDIA NeMo Offline Speech Translation Systems for IWSLT 2022.
Proceedings of the 19th International Conference on Spoken Language Translation, 2022

2021
NVIDIA NeMo Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21.
CoRR, 2021

NVIDIA NeMo's Neural Machine Translation Systems for English-German and English-Russian News and Biomedical Tasks at WMT21.
Proceedings of the Sixth Conference on Machine Translation, 2021

SPGISpeech: 5,000 Hours of Transcribed Financial Audio for Fully Formatted End-to-End Speech Recognition.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Cross-Language Transfer Learning and Domain Adaptation for End-to-End Automatic Speech Recognition.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

2020
Quartznet: Deep Automatic Speech Recognition with 1D Time-Channel Separable Convolutions.
Proceedings of the 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, 2020

2019
NeMo: a toolkit for building AI applications using Neural Modules.
CoRR, 2019

Stochastic Gradient Methods with Layer-wise Adaptive Moments for Training of Deep Networks.
CoRR, 2019

Jasper: An End-to-End Convolutional Neural Acoustic Model.
Proceedings of the Interspeech 2019, 2019

2018
OpenSeq2Seq: extensible toolkit for distributed and mixed precision training of sequence-to-sequence models.
CoRR, 2018

Mixed Precision Training.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Training Deep AutoEncoders for Collaborative Filtering.
CoRR, 2017

Factorization tricks for LSTM networks.
Proceedings of the 5th International Conference on Learning Representations, 2017

2014
An introduction to computational networks and the computational network toolkit (invited talk).
Proceedings of the INTERSPEECH 2014, 2014

2011
GraphCrunch 2: Software tool for network modeling, alignment and clustering.
BMC Bioinform., 2011

Integrative network alignment reveals large regions of global network similarity in yeast and human.
Bioinform., 2011

2010
Geometric Evolutionary Dynamics of Protein Interaction Networks.
Proceedings of the Biocomputing 2010: Proceedings of the Pacific Symposium, 2010

2009
Geometric De-noising of Protein-Protein Interaction Networks.
PLoS Comput. Biol., 2009

Learning the Structure of Protein-Protein Interaction Networks.
Proceedings of the Biocomputing 2009: Proceedings of the Pacific Symposium, 2009

