Muhammad Shakeel

Orcid: 0000-0003-3822-0917

Affiliations:
  • Tokyo Institute of Technology, School of Engineering, Japan
  • Sapienza University of Rome, Italy (former)


According to our database1, Muhammad Shakeel authored at least 20 papers between 2015 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2025
DYNAC: Dynamic Vocabulary based Non-Autoregressive Contextualization for Speech Recognition.
CoRR, June, 2025

OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning.
CoRR, June, 2025

2024
4D ASR: Joint Beam Search Integrating CTC, Attention, Transducer, and Mask Predict Decoders.
CoRR, 2024

Contextualized Automatic Speech Recognition With Dynamic Vocabulary.
Proceedings of the IEEE Spoken Language Technology Workshop, 2024

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Contextualized End-to-end Automatic Speech Recognition with Intermediate Biasing Loss.
Proceedings of the 25th Annual Conference of the International Speech Communication Association, 2024

Contextualized Automatic Speech Recognition With Attention-Based Bias Phrase Boosted Beam Search.
Proceedings of the IEEE International Conference on Acoustics, 2024

Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation.
Proceedings of the IEEE International Conference on Acoustics, 2024

OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Metric-Based Multimodal Meta-Learning for Human Movement Identification Via Footstep Recognition.
Proceedings of the IEEE/SICE International Symposium on System Integration, 2023

FPGA based Power-Efficient Edge Server to Accelerate Speech Interface for Socially Assistive Robotics.
Proceedings of the IEEE/SICE International Symposium on System Integration, 2023

4D ASR: Joint modeling of CTC, Attention, Transducer, and Mask-Predict decoders.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Time-synchronous one-pass Beam Search for Parallel Online and Offline Transducers with Dynamic Block Training.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2023

2022
Streaming Automatic Speech Recognition with Re-blocking Processing Based on Integrated Voice Activity Detection.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

2021
Detecting earthquakes: a novel deep learning-based approach for effective disaster response.
Appl. Intell., 2021

Assessment of a Beamforming Implementation Developed for Surface Sound Source Separation.
Proceedings of the IEEE/SICE International Symposium on System Integration, 2021

EMC: Earthquake Magnitudes Classification on Seismic Signals via Convolutional Recurrent Networks.
Proceedings of the IEEE/SICE International Symposium on System Integration, 2021

2015
Environmental sensing using millimeter wave sensor for extreme conditions.
Proceedings of the 2015 IEEE International Symposium on Safety, 2015


  Loading...