Evelina Bakhturina

According to our database1, Evelina Bakhturina authored at least 25 papers between 2017 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, August, 2025

NeMo-Inspector: A Visualization Tool for LLM Generation Analysis.
CoRR, May, 2025

Nemotron-CrossThink: Scaling Self-Learning beyond Math Reasoning.
CoRR, April, 2025

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models.
CoRR, April, 2025

SCORE: Systematic COnsistency and Robustness Evaluation for Large Language Models.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

HiFiTTS-2: A Large-Scale High Bandwidth Speech Dataset.
Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

2024
Retrieval meets Long Context Large Language Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

A Chat about Boring Problems: Studying GPT-Based Text Normalization.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Automatic Heteronym Resolution Pipeline Using RAD-TTS Aligners.
CoRR, 2023

P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

NVIDIA NeMo Offline Speech Translation Systems for IWSLT 2023.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

SpellMapper: A non-autoregressive neural spellchecker for ASR customization with candidate retrieval based on n-gram mappings.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

LibriSpeech-PC: Benchmark for Evaluation of Punctuation and Capitalization Capabilities of End-to-End ASR Models.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Shallow Fusion of Weighted Finite-State Transducer and Language Model for Text Normalization.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Thutmose Tagger: Single-pass neural model for Inverse Text Normalization.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
A Unified Transformer-based Framework for Duplex Text Normalization.
CoRR, 2021

SGD-QA: Fast Schema-Guided Dialogue State Tracking for Unseen Services.
CoRR, 2021

NeMo Toolbox for Speech Dataset Construction.
CoRR, 2021

A Toolbox for Construction and Analysis of Speech Datasets.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

NeMo Inverse Text Normalization: From Development to Production.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

NeMo (Inverse) Text Normalization: From Development to Production.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Hi-Fi Multi-Speaker English TTS Dataset.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

2020
A Fast and Robust BERT-based Dialogue State Tracker for Schema Guided Dialogue Dataset.
Proceedings of the KDD 2020 Workshop on Conversational Systems Towards Mainstream Adoption co-located with the 26TH ACM SIGKDD Conference on Knowledge Discovery and Data Mining (SIGKDD 2020), 2020

BioMegatron: Larger Biomedical Domain Language Model.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

2017
Sentiment Classification using Images and Label Embeddings.
CoRR, 2017


  Loading...