Leda Sari

Orcid: 0000-0002-3754-1156

According to our database1, Leda Sari authored at least 29 papers between 2014 and 2023.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model.
CoRR, 2023

Augmenting text for spoken language understanding with Large Language Models.
CoRR, 2023

Towards Selection of Text-to-speech Data to Augment ASR Training.
CoRR, 2023

Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition.
CoRR, 2023

Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Self-Supervised Representations for Singing Voice Conversion.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
Seamless equal accuracy ratio for inclusive CTC speech recognition.
Speech Commun., 2022

Biased Self-supervised learning for ASR.
CoRR, 2022

Towards Measuring Fairness in Speech Recognition: Casual Conversations Dataset Transcriptions.
Proceedings of the IEEE International Conference on Acoustics, 2022


2021
Learning speech embeddings for speaker adaptation and speech understanding
PhD thesis, 2021

Counterfactually Fair Automatic Speech Recognition.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Auxiliary Networks for Joint Speaker Adaptation and Speaker Change Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Ego4D: Around the World in 3, 000 Hours of Egocentric Video.
CoRR, 2021

Worldly Wise (WoW) - Cross-Lingual Knowledge Fusion for Fact-based Visual Spoken-Question Answering.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

A Multi-View Approach to Audio-Visual Speaker Verification.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Identify Speakers in Cocktail Parties with End-to-End Attention.
Proceedings of the Interspeech 2020, 2020

Deep F-Measure Maximization for End-to-End Speech Understanding.
Proceedings of the Interspeech 2020, 2020

Unsupervised Speaker Adaptation Using Attention-Based Speaker Memory for End-to-End ASR.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Training Spoken Language Understanding Systems with Non-Parallel Speech and Text.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

2019
Learning Speaker Aware Offsets for Speaker Adaptation of Neural Networks.
Proceedings of the Interspeech 2019, 2019

Pre-training of Speaker Embeddings for Low-latency Speaker Change Detection in Broadcast News.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Speaker Adaptive Audio-Visual Fusion for the Open-Vocabulary Section of AVICAR.
Proceedings of the Interspeech 2018, 2018

2016
Score normalization for keyword search.
Proceedings of the 24th Signal Processing and Communication Application Conference, 2016

Template-based Keyword Search with pseudo posteriorgrams.
Proceedings of the 24th Signal Processing and Communication Application Conference, 2016

2015
Discriminative training of the keyword search confusion model.
Proceedings of the 2015 23nd Signal Processing and Communications Applications Conference (SIU), 2015

Posteriorgram based approaches in keyword search.
Proceedings of the 2015 23nd Signal Processing and Communications Applications Conference (SIU), 2015

Fusion of LVCSR and posteriorgram based keyword search.
Proceedings of the INTERSPEECH 2015, 2015

2014
Texture Defect Detection Using Independent Vector Analysis in Wavelet Domain.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014


  Loading...