Krishna C. Puvvada

According to our database1, Krishna C. Puvvada authored at least 19 papers between 2020 and 2025.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Word Level Timestamp Generation for Automatic Speech Recognition and Translation.
CoRR, May, 2025

SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling.
CoRR, April, 2025

Training and Inference Efficiency of Encoder-Decoder Speech Models.
CoRR, March, 2025

VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

2024
NeKo: Toward Post Recognition Generative Correction Large Language Models with Task-Oriented Experts.
CoRR, 2024

Sortformer: Seamless Integration of Speaker Diarization and ASR by Bridging Timestamps and Tokens.
CoRR, 2024

Resource-Efficient Adaptation of Speech Foundation Models for Multi-Speaker ASR.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Bestow: Efficient and Streamable Speech Language Model with The Best of Two Worlds in GPT and T5.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

Less is More: Accurate Speech Recognition & Translation without Web-Scale Data.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

SALM: Speech-Augmented Language Model with in-Context Learning for Speech Recognition and Translation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System.
CoRR, 2023

Conformer-Based Target-Speaker Automatic Speech Recognition For Single-Channel Audio.
Proceedings of the IEEE International Conference on Acoustics, 2023

Accidental Learners: Spoken Language Identification in Multilingual Self-Supervised Models.
Proceedings of the IEEE International Conference on Acoustics, 2023

Fast Conformer With Linearly Scalable Attention For Efficient Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2021
Unsupervised and Semi-Supervised Few-Shot Acoustic Event Classification.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Few-Shot Acoustic Event Detection Via Meta Learning.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020


  Loading...