Nilesh Madhu

Orcid: 0000-0001-9131-3309

According to our database1, Nilesh Madhu authored at least 51 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Correction: Robustness of ad hoc microphone clustering using speaker embeddings: evaluation under realistic and challenging scenarios.
EURASIP J. Audio Speech Music. Process., December, 2024

Spatially Selective Speaker Separation Using a DNN With a Location Dependent Feature Extraction.
IEEE ACM Trans. Audio Speech Lang. Process., 2024

Bitrate-Informed Coded Speech Enhancement Model.
Proceedings of the International Conference on Electronics, Information, and Communication, 2024

On the Disentanglement and Robustness of Self-Supervised Speech Representations.
Proceedings of the International Conference on Electronics, Information, and Communication, 2024

2023
Robustness of ad hoc microphone clustering using speaker embeddings: evaluation under realistic and challenging scenarios.
EURASIP J. Audio Speech Music. Process., December, 2023

Investigations on the Optimal Estimation of Speech Envelopes for the Two-Stage Speech Enhancement.
Sensors, July, 2023

Margin-Mixup: A Method for Robust Speaker Verification In Multi-Speaker Audio.
Proceedings of the IEEE International Conference on Acoustics, 2023

Aiding Speech Harmonic Recovery in DNN-Based Single Channel Noise Reduction Using Cepstral Excitation Manipulation (CEM) Components.
Proceedings of the IEEE International Conference on Acoustics, 2023

Exploiting Speaker Embeddings for Improved Microphone Clustering and Speech Separation in ad-hoc Microphone Arrays.
Proceedings of the IEEE International Conference on Acoustics, 2023

Improved Deep Speaker Localization and Tracking: Revised Training Paradigm and Controlled Latency.
Proceedings of the IEEE International Conference on Acoustics, 2023

Detecting Translocation of DNA Nanostructures Through Nanopores: First Steps Towards Structural Barcode Readout.
Proceedings of the 31st European Signal Processing Conference, 2023

Enhancing Multichannel Laser-Doppler Vibrometry Signals with Application to (Carotid-Femoral) Pulse Transit Time Estimation.
Proceedings of the 45th Annual International Conference of the IEEE Engineering in Medicine & Biology Society, 2023

2022
Improved CEM for Speech Harmonic Enhancement in Single Channel Noise Suppression.
IEEE ACM Trans. Audio Speech Lang. Process., 2022

Portable and Non-Intrusive Fill-State Detection for Liquid-Freight Containers Based on Vibration Signals.
Sensors, 2022

DOA-guided source separation with direction-based initialization and time annotations using complex angular central Gaussian mixture models.
EURASIP J. Audio Speech Music. Process., 2022

Audio-Visual Active Speaker Identification: A comparison of dense image-based features and sparse facial landmark-based features.
Proceedings of the Sensor Data Fusion: Trends, Solutions, Applications, 2022

Drone Ego-Noise Cancellation for Improved Speech Capture using Deep Convolutional Autoencoder Assisted Multistage Beamforming.
Proceedings of the 25th International Conference on Information Fusion, 2022

SoftVAD in iVector-Based Acoustic Scene Classification for Robustness to Foreground Speech.
Proceedings of the 30th European Signal Processing Conference, 2022

Improved Separation of Closely-spaced Speakers by Exploiting Auxiliary Direction of Arrival Information within a U-Net Architecture.
Proceedings of the 18th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2022

2021
Exploiting Temporal Context in CNN Based Multisource DOA Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

NMF-weighted SRP for multi-speaker direction of arrival estimation: robustness to spatial aliasing while exploiting sparsity in the atom-time domain.
EURASIP J. Audio Speech Music. Process., 2021

Robust Acoustic Scene Classification in the Presence of Active Foreground Speech.
Proceedings of the 29th European Signal Processing Conference, 2021

Neural Networks Using Full-Band and Subband Spatial Features for Mask Based Source Separation.
Proceedings of the 29th European Signal Processing Conference, 2021

2D Acoustic Source Localisation Using Decentralised Deep Neural Networks on Distributed Microphone Arrays.
Proceedings of the 14th ITG Conference on Speech Communication, online, September 29, 2021

2020
Spectral refinement with adaptive window-size selection for voicing detection and fundamental frequency estimation.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2020

Least-Squares DOA Estimation with an Informed Phase Unwrapping and Full Bandwidth Robustness.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Robust Period Estimation of Automated Cutting Systems by Improved Autocorrelation & Linear Regression Techniques.
Proceedings of the 28th European Signal Processing Conference, 2020

2019
Generalized Refinement of Short-Term Fourier Spectra in Time- and Frequency- Domain and its Combination with Polyphase Filterbanks.
Proceedings of the IEEE International Symposium on Signal Processing and Information Technology, 2019

A New Weighted NMF Algorithm For Missing Data Interpolation And Its Application To Speech Enhancement.
Proceedings of the 27th European Signal Processing Conference, 2019

2018
DNN-Supported Speech Enhancement With Cepstral Estimation of Both Excitation and Envelope.
IEEE ACM Trans. Audio Speech Lang. Process., 2018

Source Separation by Feature-Based Clustering of Microphones in Ad Hoc Arrays.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

A Priori SNR Computation for Speech Enhancement Based on Cepstral Envelope Estimation.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

Steered Mixture-of-Experts Approximation of Spherical Image Data.
Proceedings of the 26th European Signal Processing Conference, 2018

Source separation by fuzzy-membership value aware beamforming and masking in ad hoc arrays.
Proceedings of the 13th ITG Symposium on Speech Communication, 2018

2017
Instantaneous A Priori SNR Estimation by Cepstral Excitation Manipulation.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Two-stage speech enhancement with manipulation of the cepstral excitation.
Proceedings of the Hands-free Speech Communications and Microphone Arrays, 2017

New insights into the role of the head radius in model-based binaural speaker localization.
Proceedings of the 25th European Signal Processing Conference, 2017

2015
Ideal Time-Frequency Masking Algorithms Lead to Different Speech Intelligibility and Quality in Normal-Hearing and Cochlear Implant Listeners.
IEEE Trans. Biomed. Eng., 2015

An iterative speech model-based a priori SNR estimator.
Proceedings of the INTERSPEECH 2015, 2015

2014
Speaker recognition performance under ideal-knowledge noise suppression: an investigation.
Proceedings of the 2nd International Workshop on Speech, Language and Audio in Multimedia, 2014

A hierarchical approach for the online, on-board detection and localisation of brake squeal using microphone arrays.
Proceedings of the 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2014

2013
The Potential for Speech Intelligibility Improvement Using the Ideal Binary Mask and the Ideal Wiener Filter in Single Channel Noise Reduction Systems: Application to Auditory Prostheses.
IEEE Trans. Speech Audio Process., 2013

Consistent iterative hard thresholding for signal declipping.
Proceedings of the IEEE International Conference on Acoustics, 2013

2012
Reference Estimation in EEG: Analysis of Equivalent Approaches.
IEEE Signal Process. Lett., 2012

A unified treatment of the reference estimation problem in depth EEG recordings.
Medical Biol. Eng. Comput., 2012

2011
A Versatile Framework for Speaker Separation Using a Model-Based Speaker Localization Approach.
IEEE Trans. Speech Audio Process., 2011

2010
A New Performance Criterion for Microphone Array Geometries and Filterand- Sum Beamforming.
Proceedings of the 9. ITG-Fachtagung Sprachkommunikation 2010, 2010

2008
AN EM-based probabilistic approach for Acoustic Echo Suppression.
Proceedings of the IEEE International Conference on Acoustics, 2008

Temporal smoothing of spectral masks in the cepstral domain for speech separation.
Proceedings of the IEEE International Conference on Acoustics, 2008

2005
Robust speaker localization through adaptive weighted pair TDOA (AWEPAT) estimation.
Proceedings of the INTERSPEECH 2005, 2005

2003
Independent component analysis (ICA) for blind equalization of frequency selective channels.
Proceedings of the NNSP 2003, 2003


  Loading...