Visar Berisha

Orcid: 0000-0001-8804-8874

According to our database1, Visar Berisha authored at least 101 papers between 2005 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Robust Vocal Quality Feature Embeddings for Dysphonic Voice Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Consonant-Vowel Transition Models Based on Deep Learning for Objective Evaluation of Articulation.
IEEE ACM Trans. Audio Speech Lang. Process., 2023

Active Sequential Two-Sample Testing.
CoRR, 2023

Learning Repeatable Speech Embeddings Using An Intra-class Correlation Regularizer.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Decorrelating Language Model Embeddings for Speech-Based Prediction of Cognitive Impairment.
Proceedings of the IEEE International Conference on Acoustics, 2023

Requirements For Mass Adoption Of Assistive Listening Technology By The General Public.
Proceedings of the IEEE International Conference on Acoustics, 2023

Does Human Speech Follow Benford's Law?
Proceedings of the IEEE International Conference on Acoustics, 2023

Smoothly Giving up: Robustness for Simple Models.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
TorchDIVA: An Extensible Computational Model of Speech Production built on an Open-Source Machine Learning Library.
CoRR, 2022

Unsupervised EEG channel selection based on nonnegative matrix factorization.
Biomed. Signal Process. Control., 2022

A label efficient two-sample test.
Proceedings of the Uncertainty in Artificial Intelligence, 2022

Investigating the Impact of Speech Compression on the Acoustics of Dysarthric Speech.
Proceedings of the Interspeech 2022, 2022

Are reported accuracies in the clinical speech machine learning literature overoptimistic?
Proceedings of the Interspeech 2022, 2022

2021
A Deep Learning Algorithm for Objective Assessment of Hypernasality in Children With Cleft Palate.
IEEE Trans. Biomed. Eng., 2021

Revisiting the accuracy problem in network analysis using a unique dataset.
Soc. Networks, 2021

Digital medicine and the curse of dimensionality.
npj Digit. Medicine, 2021

Computationally-efficient voice activity detection based on deep neural networks.
Proceedings of the IEEE Workshop on Signal Processing Systems, 2021

Restoring Degraded Speech via a Modified Diffusion Model.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

The Impact of Forced-Alignment Errors on Automatic Pronunciation Evaluation.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

An Attention Model for Hypernasality Prediction in Children with Cleft Palate.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Robust Estimation of Hypernasality in Dysarthria With Acoustic Model Likelihood Features.
IEEE ACM Trans. Audio Speech Lang. Process., 2020

Early detection and tracking of bulbar changes in ALS via frequent and remote speech analysis.
npj Digit. Medicine, 2020

Author Correction: Early detection and tracking of bulbar changes in ALS via frequent and remote speech analysis.
npj Digit. Medicine, 2020

A Review of Automated Speech and Language Features for Assessment of Cognitive and Thought Disorders.
IEEE J. Sel. Top. Signal Process., 2020

An 8.93 TOPS/W LSTM Recurrent Neural Network Accelerator Featuring Hierarchical Coarse-Grain Sparsity for On-Device Speech Recognition.
IEEE J. Solid State Circuits, 2020

Finding the Homology of Decision Boundaries with Active Learning.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

UncommonVoice: A Crowdsourced Dataset of Dysphonic Speech.
Proceedings of the Interspeech 2020, 2020

Compressing LSTM Networks with Hierarchical Coarse-Grain Sparsity.
Proceedings of the Interspeech 2020, 2020

Deep Learning Based Prediction of Hypernasality for Clinical Applications.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Regularization via Structural Label Smoothing.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

2019
Articulation constrained learning with application to speech emotion recognition.
EURASIP J. Audio Speech Music. Process., 2019

Guest Editorial: Algorithms and Architectures for Machine Learning Based Speech Processing.
Circuits Syst. Signal Process., 2019

Robust Estimation of Hypernasality in Dysarthria.
CoRR, 2019

A Review of Language and Speech Features for Cognitive-Linguistic Assessment.
CoRR, 2019

Residual + Capsule Networks (ResCap) for Simultaneous Single-Channel Overlapped Keyword Recognition.
Proceedings of the Interspeech 2019, 2019

Objective Assessment of Social Skills Using Automated Language Analysis for Identification of Schizophrenia and Bipolar Disorder.
Proceedings of the Interspeech 2019, 2019

Say What? A Dataset for Exploring the Error Patterns That Two ASR Engines Make.
Proceedings of the Interspeech 2019, 2019

Do Conversational Partners Entrain on Articulatory Precision?
Proceedings of the Interspeech 2019, 2019

Investigating the Effects of Word Substitution Errors on Sentence Embeddings.
Proceedings of the IEEE International Conference on Acoustics, 2019

Joint Optimization of Quantization and Structured Sparsity for Compressed Deep Neural Networks.
Proceedings of the IEEE International Conference on Acoustics, 2019

Objective Measures of Plosive Nasalization in Hypernasal Speech.
Proceedings of the IEEE International Conference on Acoustics, 2019

Objective Assessment of Vocal Tremor.
Proceedings of the IEEE International Conference on Acoustics, 2019

A 8.93-TOPS/W LSTM Recurrent Neural Network Accelerator Featuring Hierarchical Coarse-Grain Sparsity With All Parameters Stored On-Chip.
Proceedings of the 45th IEEE European Solid State Circuits Conference, 2019

2018
Direct Estimation of Density Functionals Using a Polynomial Basis.
IEEE Trans. Signal Process., 2018

A Discriminative Acoustic-Prosodic Approach for Measuring Local Entrainment.
Proceedings of the Interspeech 2018, 2018

Investigating the Role of L1 in Automatic Pronunciation Evaluation of L2 Speech.
Proceedings of the Interspeech 2018, 2018

Triplet Network with Attention for Speaker Diarization.
Proceedings of the Interspeech 2018, 2018

Direct Ensemble Estimation of Density Functionals.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Towards a Wearable Cough Detector Based on Neural Networks.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Simulating Dysarthric Speech for Training Data Augmentation in Clinical Speech Applications.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Online Machine Learning Experiments in HTML5.
Proceedings of the IEEE Frontiers in Education Conference, 2018

2017
Articulation Entropy: An Unsupervised Measure of Articulatory Precision.
IEEE Signal Process. Lett., 2017

Improving efficiency in sparse learning with the feedforward inhibitory motif.
Neurocomputing, 2017

A data-driven basis for direct estimation of functionals of distributions.
CoRR, 2017

Interpretable Objective Assessment of Dysarthric Speech Based on Deep Neural Networks.
Proceedings of the Interspeech 2017, 2017

Float Like a Butterfly Sting Like a Bee: Changes in Speech Preceded Parkinsonism Diagnosis for Muhammad Ali.
Proceedings of the Interspeech 2017, 2017

Objective assessment of pathological speech using distribution regression.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Interpretable phonological features for clinical applications.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Minimizing area and energy of deep learning hardware design using collective low precision and structured compression.
Proceedings of the 51st Asilomar Conference on Signals, Systems, and Computers, 2017

Improved finite-sample estimate of a nonparametric f-divergence.
Proceedings of the 51st Asilomar Conference on Signals, Systems, and Computers, 2017

2016
Empirically Estimable Classification Bounds Based on a Nonparametric Divergence Measure.
IEEE Trans. Signal Process., 2016

Reducing the Model Order of Deep Neural Networks Using Information Theory.
Proceedings of the IEEE Computer Society Annual Symposium on VLSI, 2016

A Convex Model for Linguistic Influence in Group Conversations.
Proceedings of the Interspeech 2016, 2016

Accent Identification by Combining Deep Neural Networks and Recurrent Neural Networks Trained on Long and Short Term Features.
Proceedings of the Interspeech 2016, 2016

Empirically-estimable multi-class classification bounds.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Ranking the parameters of deep neural networks using the fisher information.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Online speaking rate estimation using recurrent neural networks.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Noise robust dysarthric speech classification using domain adaptation.
Proceedings of the Digital Media Industry & Academic Forum, 2016

Models for objective evaluation of dysarthric speech from data annotated by multiple listeners.
Proceedings of the 50th Asilomar Conference on Signals, Systems and Computers, 2016

2015
Convex Weighting Criteria for Speaking Rate Estimation.
IEEE ACM Trans. Audio Speech Lang. Process., 2015

Empirical Non-Parametric Estimation of the Fisher Information.
IEEE Signal Process. Lett., 2015

Active data labeling for improved classifier generalizability.
Signal Process., 2015

Removing data with noisy responses in regression analysis.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Estimating speaking rate in spontaneous discourse.
Proceedings of the 49th Asilomar Conference on Signals, Systems and Computers, 2015

2014
Empirically Estimable Classification Bounds Based on a New Divergence Measure.
CoRR, 2014

Domain invariant speech features using a new divergence measure.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Modeling pathological speech perception from data with similarity labels.
Proceedings of the IEEE International Conference on Acoustics, 2014

2013
Bandwidth Extension of Speech Using Perceptual Criteria
Synthesis Lectures on Algorithms and Software in Engineering, Morgan & Claypool Publishers, ISBN: 978-3-031-01521-2, 2013

Towards a clinical tool for automatic intelligibility assessment.
Proceedings of the IEEE International Conference on Acoustics, 2013

Selecting disorder-specific features for speech pathology fingerprinting.
Proceedings of the IEEE International Conference on Acoustics, 2013

2011
Editorial.
Digit. Signal Process., 2011

Semi-supervised hierarchy learning using multiple-labeled data.
Proceedings of the 2011 IEEE International Workshop on Machine Learning for Signal Processing, 2011

2010
An auditory-domain based speech enhancement algorithm.
Proceedings of the IEEE International Conference on Acoustics, 2010

2009
Experiments With Sensor Motes and Java-DSP.
IEEE Trans. Educ., 2009

A Frequency/Detector Pruning Approach for Loudness Estimation.
IEEE Signal Process. Lett., 2009

A Sensor Network for Real-time Acoustic Scene Analysis.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2009), 2009

Energy-constrained discriminant analysis.
Proceedings of the IEEE International Conference on Acoustics, 2009

Low-complexity sinusoidal component selection using loudness patterns.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Gradient projection-based channel equalization under sustained fading.
Signal Process., 2008

Real-time sensing and acoustic scene characterization for security applications.
Proceedings of the Third International Symposium on Wireless Pervasive Computing, 2008

A low-complexity loudness estimation algorithm.
Proceedings of the IEEE International Conference on Acoustics, 2008

2007
Wideband Speech Recovery Using Psychoacoustic Criteria.
EURASIP J. Audio Speech Music. Process., 2007

Dual-Mode Wideband Speech Compression.
Proceedings of the IEEE 9th Workshop on Multimedia Signal Processing, 2007

Sparse Manifold Learning with Applications to SAR Image Classification.
Proceedings of the IEEE International Conference on Acoustics, 2007

A Scalable Bandwidth Extension Algorithm.
Proceedings of the IEEE International Conference on Acoustics, 2007

2006
Bandwidth Extension of Audio Based on Partial Loudness Criteria.
Proceedings of the IEEE 8th Workshop on Multimedia Signal Processing, 2006

Real-time acoustic monitoring using wireless sensor motes.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2006), 2006

Real-Time Collaborative Monitoring in Wireless Sensor Networks.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Enhancing the Quality of Coded Audio Using Perceptual Criteria.
Proceedings of the IEEE 7th Workshop on Multimedia Signal Processing, 2005

Enhancing vocoder performance for music signals.
Proceedings of the International Symposium on Circuits and Systems (ISCAS 2005), 2005

Interactive Java modules for the MPEG-1 psychoacoustic model [audio coding teaching applications].
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005


  Loading...