Jagadeesh Balam

According to our database, Jagadeesh Balam authored at least 27 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models.
CoRR, 2024

Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations.
CoRR, 2024

BESTOW: Efficient and Streamable Speech Language Model with the Best of Two Worlds in GPT and T5.
CoRR, 2024

Less is More: Accurate Speech Recognition & Translation without Web-Scale Data.
CoRR, 2024

Instruction Data Generation and Unsupervised Adaptation for Speech Language Models.
CoRR, 2024

Discrete Audio Representation as an Alternative to Mel-Spectrograms for Speaker and Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Enhancing Speaker Diarization with Large Language Models: A Contextual Beam Search Approach.
Proceedings of the IEEE International Conference on Acoustics, 2024

Stateful Conformer with Cache-Based Inference for Streaming Automatic Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2024

Investigating End-to-End ASR Architectures for Long Form Audio Transcription.
Proceedings of the IEEE International Conference on Acoustics, 2024

SALM: Speech-Augmented Language Model with in-Context Learning for Speech Recognition and Translation.
Proceedings of the IEEE International Conference on Acoustics, 2024

Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System.
CoRR, 2023

Property-Aware Multi-Speaker Data Simulation: A Probabilistic Modelling Technique for Synthetic Data Generation.
CoRR, 2023

Flexible Multichannel Speech Enhancement for Noise-Robust Frontend.
Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2023

A Compact End-to-End Model with Local and Global Context for Spoken Language Identification.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Leveraging Pretrained ASR Encoders for Effective and Efficient End-to-End Speech Intent Classification and Slot Filling.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Fast Conformer With Linearly Scalable Attention For Efficient Speech Recognition.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
AmberNet: A Compact End-to-End Model for Spoken Language Identification.
CoRR, 2022

NeMo Open Source Speaker Diarization System.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Multi-scale Speaker Diarization with Dynamic Scale Weighting.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
CarneliNet: Neural Mixture Model for Automatic Speech Recognition.
CoRR, 2021

SPGISpeech: 5,000 Hours of Transcribed Financial Audio for Fully Formatted End-to-End Speech Recognition.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021

Cross-Language Transfer Learning and Domain Adaptation for End-to-End Automatic Speech Recognition.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

2008
A Transcoding-Free Multiple Description Coder for Voice over Mobile Ad-Hoc Networks.
Proceedings of the WCNC 2008, IEEE Wireless Communications & Networking Conference, March 31 2008, 2008

2007
Multiple Descriptions and Path Diversity for Voice Communications Over Wireless Mesh Networks.
IEEE Trans. Multim., 2007

Two-Hop Two-Path Voice Communications Over a Mobile Ad-Hoc Network.
Proceedings of the Global Communications Conference, 2007

2006
Multiple descriptions and path diversity using the AMR-WB speech codec for voice communication over MANETs.
Proceedings of the International Conference on Wireless Communications and Mobile Computing, 2006

