Gábor Gosztolya

Orcid: 0000-0002-2864-6466

According to our database1, Gábor Gosztolya authored at least 90 papers between 2003 and 2023.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Using Hybrid HMM/DNN Embedding Extractor Models in Computational Paralinguistic Tasks.
Sensors, 2023

Adaptation of Tongue Ultrasound-Based Silent Speech Interfaces Using Spatial Transformer Networks.
CoRR, 2023

Identifying Subjects Wearing a Mask from the Speech by Means of Encoded Speech Representations.
Proceedings of the Text, Speech, and Dialogue - 26th International Conference, 2023

Aggregation Strategies of Wav2vec 2.0 Embeddings for Computational Paralinguistic Tasks.
Proceedings of the Speech and Computer - 25th International Conference, 2023

Using Custom X-vectors for the Automatic Screening of COVID-19 Based on Coughing Audio Samples.
Proceedings of the 17th IEEE International Symposium on Applied Computational Intelligence and Informatics, 2023

2022
Optimizing the Ultrasound Tongue Image Representation for Residual Network-Based Articulatory-to-Acoustic Mapping.
Sensors, 2022

Estimating the degree of conflict in speech by employing Bag-of-Audio-Words and Fisher Vectors.
Expert Syst. Appl., 2022

Optimizing class priors to improve the detection of social signals in audio data.
Eng. Appl. Artif. Intell., 2022

Automatic screening of mild cognitive impairment and Alzheimer's disease by means of posterior-thresholding hesitation representation.
Comput. Speech Lang., 2022

Linguistic Parameters of Spontaneous Speech for Identifying Mild Cognitive Impairment and Alzheimer Disease.
Comput. Linguistics, 2022

On the Use of Ensemble X-Vector Embeddings for Improved Sleepiness Detection.
Proceedings of the Speech and Computer - 24th International Conference, 2022

Identification of Subjects Wearing a Surgical Mask from Their Speech by Means of X-vectors and Fisher Vectors.
Proceedings of the Modeling Decisions for Artificial Intelligence, 2022

Using Spectral Sequence-to-Sequence Autoencoders to Assess Mild Cognitive Impairment.
Proceedings of the IEEE International Conference on Acoustics, 2022

Automatic Assessment of the Degree of Clinical Depression from Speech Using X-Vectors.
Proceedings of the IEEE International Conference on Acoustics, 2022

Using Acoustic Deep Neural Network Embeddings to Detect Multiple Sclerosis From Speech.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Ensemble Bag-of-Audio-Words Representation Improves Paralinguistic Classification Accuracy.
IEEE ACM Trans. Audio Speech Lang. Process., 2021

Cross-lingual detection of mild cognitive impairment based on temporal parameters of spontaneous speech.
Comput. Speech Lang., 2021

Adaptation of Tacotron2-based Text-To-Speech for Articulatory-to-Acoustic Mapping using Ultrasound Tongue Imaging.
CoRR, 2021

Speech Synthesis from Text and Ultrasound Tongue Image-based Articulatory Input.
CoRR, 2021

Using the Fisher Vector Approach for Cold Identification.
Acta Cybern., 2021

Neural Speaker Embeddings for Ultrasound-Based Silent Speech Interfaces.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Identifying Conflict Escalation and Primates by Using Ensemble X-Vectors and Fisher Vector Features.
Proceedings of the Interspeech 2021, 22nd Annual Conference of the International Speech Communication Association, Brno, Czechia, 30 August, 2021

Deep Neural Network Embeddings for the Estimation of the Degree of Sleepiness.
Proceedings of the IEEE International Conference on Acoustics, 2021

Improving Neural Silent Speech Interface Models by Adversarial Training.
Proceedings of the International Conference on Artificial Intelligence and Computer Vision, 2021

2020
Social Signal Detection by Probabilistic Sampling DNN Training.
IEEE Trans. Affect. Comput., 2020

Applying Speech Tempo-Derived Features, BoAW and Fisher Vectors to Detect Elderly Emotion and Speech in Surgical Masks.
CoRR, 2020

Investigating the Corpus Independence of the Bag-of-Audio-Words Approach.
Proceedings of the Text, Speech, and Dialogue, 2020

Predicting a Cold from Speech Using Fisher Vectors; SVM and XGBoost as Classifiers.
Proceedings of the Speech and Computer - 22nd International Conference, 2020

Making a Distinction Between Schizophrenia and Bipolar Disorder Based on Temporal Parameters in Spontaneous Speech.
Proceedings of the Interspeech 2020, 2020

Very Short-Term Conflict Intensity Estimation Using Fisher Vectors.
Proceedings of the Interspeech 2020, 2020

Ultrasound-Based Articulatory-to-Acoustic Mapping with WaveGlow Speech Synthesis.
Proceedings of the Interspeech 2020, 2020

2019
Calibrating AdaBoost for phoneme classification.
Soft Comput., 2019

Posterior-thresholding feature extraction for paralinguistic speech classification.
Knowl. Based Syst., 2019

Identifying Mild Cognitive Impairment and mild Alzheimer's disease based on spontaneous speech using ASR and linguistic features.
Comput. Speech Lang., 2019

Reducing the Inter-speaker Variance of CNN Acoustic Models Using Unsupervised Adversarial Multi-task Training.
Proceedings of the Speech and Computer - 21st International Conference, 2019

Assessing Alzheimer's Disease from Speech Using the i-vector Approach.
Proceedings of the Speech and Computer - 21st International Conference, 2019

Differentiating Laughter Types via HMM/DNN and Probabilistic Sampling.
Proceedings of the Speech and Computer - 21st International Conference, 2019

Assessing Parkinson's Disease from Speech Using Fisher Vectors.
Proceedings of the Interspeech 2019, 2019

Calibrating DNN Posterior Probability Estimates of HMM/DNN Models to Improve Social Signal Detection from Audio Data.
Proceedings of the Interspeech 2019, 2019

Using the Bag-of-Audio-Word Feature Representation of ASR DNN Posteriors for Paralinguistic Classification.
Proceedings of the Interspeech 2019, 2019

Using Fisher Vector and Bag-of-Audio-Words Representations to Identify Styrian Dialects, Sleepiness, Baby & Orca Sounds.
Proceedings of the Interspeech 2019, 2019

Ultrasound-Based Silent Speech Interface Built on a Continuous Vocoder.
Proceedings of the Interspeech 2019, 2019

Autoencoder-Based Articulatory-to-Acoustic Mapping for Ultrasound Silent Speech Interfaces.
Proceedings of the International Joint Conference on Neural Networks, 2019

Automatic recognition of temporal speech features in type 2 diabetes mellitus with mild cognitive impairment.
Proceedings of the 10th IEEE International Conference on Cognitive Infocommunications, 2019

2018
A feature selection-based speaker clustering method for paralinguistic tasks.
Pattern Anal. Appl., 2018

Multi-Band Processing With Gabor Filters and Time Delay Neural Nets for Noise Robust Speech Recognition.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

Posterior Calibration for Multi-Class Paralinguistic Classification.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

User-centric Evaluation of Automatic Punctuation in ASR Closed Captioning.
Proceedings of the Interspeech 2018, 2018

Multi-Task Learning of Speech Recognition and Speech Synthesis Parameters for Ultrasound-based Silent Speech Interfaces.
Proceedings of the Interspeech 2018, 2018

General Utterance-Level Feature Extraction for Classifying Crying Sounds, Atypical & Self-Assessed Affect and Heart Beats.
Proceedings of the Interspeech 2018, 2018

Identifying Schizophrenia Based on Temporal Parameters in Spontaneous Speech.
Proceedings of the Interspeech 2018, 2018

F0 Estimation for DNN-Based Ultrasound Silent Speech Interfaces.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
DNN-Based Feature Extraction for Conflict Intensity Estimation From Speech.
IEEE Signal Process. Lett., 2017

A Comparative Evaluation of GMM-Free State Tying Methods for ASR.
Proceedings of the Interspeech 2017, 2017

Training Context-Dependent DNN Acoustic Models Using Probabilistic Sampling.
Proceedings of the Interspeech 2017, 2017

DNN-Based Feature Extraction and Classifier Combination for Child-Directed Speech, Cold and Snoring Identification.
Proceedings of the Interspeech 2017, 2017

Optimized Time Series Filters for Detecting Laughter and Filler Events.
Proceedings of the Interspeech 2017, 2017

DNN-Based Ultrasound-to-Speech Conversion for a Silent Speech Interface.
Proceedings of the Interspeech 2017, 2017

2016
Adaptation of DNN Acoustic Models Using KL-divergence Regularization and Multi-task Training.
Proceedings of the Speech and Computer - 18th International Conference, 2016

Detecting Laughter and Filler Events by Time Series Smoothing with Genetic Algorithms.
Proceedings of the Speech and Computer - 18th International Conference, 2016

Detecting Mild Cognitive Impairment from Spontaneous Speech by Correlation-Based Phonetic Feature Selection.
Proceedings of the Interspeech 2016, 2016

GMM-Free Flat Start Sequence-Discriminative DNN Training.
Proceedings of the Interspeech 2016, 2016

Estimating the Sincerity of Apologies in Speech by DNN Rank Learning and Prosodic Analysis.
Proceedings of the Interspeech 2016, 2016

Determining Native Language and Deception Using Phonetic Features and Classifier Combination.
Proceedings of the Interspeech 2016, 2016

Detecting Mild Cognitive Impairment by Exploiting Linguistic Information from Transcripts.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Automatic detection of mild cognitive impairment from spontaneous speech using ASR.
Proceedings of the INTERSPEECH 2015, 2015

Assessing the degree of nativeness and parkinson's condition using Gaussian processes and deep rectifier neural networks.
Proceedings of the INTERSPEECH 2015, 2015

On evaluation metrics for social signal detection.
Proceedings of the INTERSPEECH 2015, 2015

Conflict intensity estimation from speech using Greedy forward-backward feature selection.
Proceedings of the INTERSPEECH 2015, 2015

Building context-dependent DNN acoustic models using Kullback-Leibler divergence-based state tying.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2014
A Sequence Training Method for Deep Rectifier Neural Networks in Speech Recognition.
Proceedings of the Speech and Computer - 16th International Conference, 2014

Applying Representative Uninorms for Phonetic Classifier Combination.
Proceedings of the Modeling Decisions for Artificial Intelligence, 2014

Detecting the intensity of cognitive and physical load using AdaBoost and deep rectifier neural networks.
Proceedings of the INTERSPEECH 2014, 2014

On the Concept of Correct Hits in Spoken Term Detection.
Proceedings of the Second International Workshop on Artificial Intelligence and Cognition (AIC 2014), 2014

2013
Using the Logarithmic Generator Function in the Spoken Term Detection Task.
Proceedings of the Modeling Decisions for Artificial Intelligence, 2013

Detecting autism, emotions and social signals using adaboost.
Proceedings of the INTERSPEECH 2013, 2013

2011
Spoken term detection from noisy input.
Proceedings of the 6th IEEE International Symposium on Applied Computational Intelligence and Informatics, 2011

2010
Improving speed and accuracy in automatic speech recognition
PhD thesis, 2010

2009
Applying the Generalized Dombi Operator Family to the Speech Recognition Task.
J. Comput. Inf. Technol., 2009

Using One-Class Classification Techniques in the Anti-phoneme Problem.
Proceedings of the Pattern Recognition and Image Analysis, 4th Iberian Conference, 2009

2008
Cross-lingual portability of MLP-based tandem features - a case study for English and Hungarian.
Proceedings of the INTERSPEECH 2008, 2008

Detection of Phoneme Boundaries Using Spiking Neurons.
Proceedings of the Artificial Intelligence and Soft Computing, 2008

2006
The use of speed-up techniques for a speech recognizer system.
Int. J. Speech Technol., 2006

A Hierarchical Evaluation Methodology in Speech Recognition.
Acta Cybern., 2006

2005
Speeding Up Dynamic Search Methods in Speech Recognition.
Proceedings of the Innovations in Applied Artificial Intelligence, 2005

2004
Telephone Speech Recognition via the Combination of Knowledge Sources in a Segmental Speech Model.
Acta Cybern., 2004

Aggregation Operators and Hypothesis Space Reductions in Speech Recognition.
Proceedings of the Text, Speech and Dialogue, 7th International Conference, 2004

Replicator Neural Networks for Outlier Modeling in Segmental Speech Recognition.
Proceedings of the Advances in Neural Networks, 2004

2003
Various Robust Search Methods in a Hungarian Speech Recognition System.
Acta Cybern., 2003

Improving the Multi-stack Decoding Algorithm in a Segment-Based Speech Recognizer.
Proceedings of the Developments in Applied Artificial Intelligence, 2003


  Loading...