Simon Dobrisek

Orcid: 0000-0002-9130-0345

According to our database1, Simon Dobrisek authored at least 54 papers between 1992 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Dynamic speaker localization based on a novel lightweight R-CNN model.
Neural Comput. Appl., May, 2023

MaskFaceGAN: High-Resolution Face Editing With Masked GAN Latent Code Optimization.
IEEE Trans. Image Process., 2023

FICE: Text-Conditioned Fashion Image Editing With Guided GAN Inversion.
CoRR, 2023

ChildNet: Structural Kinship Face Synthesis Model With Appearance Control Mechanisms.
IEEE Access, 2023

2022
Making the Most of Single Sensor Information: A Novel Fusion Approach for 3D Face Recognition Using Region Covariance Descriptors and Gaussian Mixture Models.
Sensors, 2022

Validation of Speech Data for Training Automatic Speech Recognition Systems.
Proceedings of the 30th European Signal Processing Conference, 2022

2021
High Resolution Face Editing with Masked GAN Latent Code Optimization.
CoRR, 2021

Compact Finite-State Super Transducers for Grapheme-to-Phoneme Conversion in Highly Inflected Languages.
Proceedings of the Intelligent Computing Theories and Application, 2021

2020
Simultaneous multi-descent regression and feature learning for facial landmarking in depth images.
Neural Comput. Appl., 2020

2019
Simultaneous regression and feature learning for facial landmarking.
CoRR, 2019

Face Hallucination Revisited: An Exploratory Study on Dataset Bias.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
Face hallucination using cascaded super-resolution and identity priors.
CoRR, 2018

Localization of Facial Landmarks in Depth Images Using Gated Multiple Ridge Descent.
Proceedings of the IEEE International Work Conference on Bioinspired Intelligence, 2018

2016
Robust Depth Image Acquisition Using Modulated Pattern Projection and Probabilistic Graphical Models.
Sensors, 2016

Exploiting Spatio-Temporal Information for Light-Plane Labeling in Depth-Image Sensors Using Probabilistic Graphical Models.
Informatica, 2016

A Composition Algorithm of Compact Finite-State Super Transducers for Grapheme-to-Phoneme Conversion.
Proceedings of the Text, Speech, and Dialogue - 19th International Conference, 2016

Deep pair-wise similarity learning for face recognition.
Proceedings of the 4th International Conference on Biometrics and Forensics, 2016

Report on the BTAS 2016 Video Person Recognition Evaluation.
Proceedings of the 8th IEEE International Conference on Biometrics Theory, 2016

2015
Modest face recognition.
Proceedings of the 3rd International Workshop on Biometrics and Forensics, 2015

Speaker de-identification using diphone recognition and speech synthesis.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Face recognition in the wild with the Probabilistic Gabor-Fisher Classifier.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

2014
Intelligibility Assessment of the De-Identified Speech Obtained Using Phoneme Recognition and Speech Synthesis Systems.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Incorporating Duration Information into I-Vector-Based Speaker Recognition Systems.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014

SIFT vs. FREAK: Assessing the usefulness of two keypoint descriptors for 3D face verification.
Proceedings of the 37th International Convention on Information and Communication Technology, 2014

2013
Smart Surveillance Technologies in Border Control.
Eur. J. Law Technol., 2013

Speaker state recognition using an HMM-based feature extraction method.
Comput. Speech Lang., 2013


Combining 3D face representations using region covariance descriptors and statistical models.
Proceedings of the 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2013

2012
Analysis and Assessment of State Relevance in HMM-Based Feature Extraction Method.
Proceedings of the Text, Speech and Dialogue - 15th International Conference, 2012

2011
University of Ljubljana System for Interspeech 2011 Speaker State Challenge.
Proceedings of the INTERSPEECH 2011, 2011

Time- and Acoustic-Mediated Alignment Algorithms for Speech Recognition Evaluation.
Proceedings of the INTERSPEECH 2011, 2011

2010
Towards the Optimal Minimization of a Pronunciation Dictionary Model.
Proceedings of the Text, Speech and Dialogue, 13th International Conference, 2010

Confidence Weighted Subspace Projection Techniques for Robust Face Recognition in the Presence of Partial Occlusions.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

2009
An Edit-Distance Model for the Approximate Matching of Timed Strings.
IEEE Trans. Pattern Anal. Mach. Intell., 2009

Emotion recognition using linear transformations in combination with video.
Proceedings of the INTERSPEECH 2009, 2009

A sequential minimization algorithm for finite-state pronunciation lexicon models.
Proceedings of the INTERSPEECH 2009, 2009

Combining Audio and Video for Detection of Spontaneous Emotions.
Proceedings of the Biometric ID Management and Multimodal Communication, 2009

2005
Initial considerations in building a speech-to-speech translation system for the Slovenian-English language pair.
Proceedings of the 10th EAMT Conference: Practical applications of machine translation, 2005

2003
Spoken Language Resources at LUKS of the University of Ljubljana.
Int. J. Speech Technol., 2003

Evolution of the Information-Retrieval System for Blind and Visually-Impaired People.
Int. J. Speech Technol., 2003

Homer II - man-machine interface to internet for blind and visually impaired people.
Comput. Commun., 2003

A voice-driven web browser for blind people.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
A Voice-Driven Web Browser for Blind People.
Proceedings of the Text, Speech and Dialogue, 5th International Conference, 2002

2000
Rules for Automatic Grapheme-to-Allophone Transcription in Slovene.
Proceedings of the Text, Speech and Dialogue - Third International Workshop, 2000

Corpora of Slovene Spoken Language for Multi-lingual Applications.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

1999
Language Model Representations for the GOPOLIS Database.
Proceedings of the Text, Speech and Dialogue - Second International Workshop, 1999

Speech Segmentation Aspects of Phone Transition Acoustical Modelling.
Proceedings of the Text, Speech and Dialogue - Second International Workshop, 1999

A slovenian spoken dialog system for air flight inquiries.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Acoustical modelling of phone transitions: biphones and diphones - what are the differences?
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

1998
Recording and labelling of the GOPOLIS slovenian speech database.
Proceedings of the First International Conference on Language Resources and Evaluation, 1998

1997
A multiresolutionally oriented approach for determination of cepstral features in speech recognition.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

1996
Segmentation and Labelling of Slovenian Diphone Inventories.
Proceedings of the 16th International Conference on Computational Linguistics, 1996

1995
Multi-variate mixture probability density modelling of VQ codebook using gradient descent algorithm.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1992
Feature representations and classification procedures for Slovene phoneme recognition.
Pattern Recognit. Lett., 1992


  Loading...