Marek Hrúz

Orcid: 0000-0002-7851-9879

According to our database, Marek Hrúz authored at least 50 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024

2023
MuTr: Multi-Stage Transformer for Hand Pose Estimation from Full-Scene Depth Image.
Sensors, 2023

Learning from What is Already Out There: Few-shot Sign Language Recognition with Online Dictionaries.
Proceedings of the 17th IEEE International Conference on Automatic Face and Gesture Recognition, 2023


Improving Handwritten Cyrillic OCR by Font-Based Synthetic Text Generator.
Proceedings of the Dynamics of Information Systems - 6th International Conference, 2023

Exploring the Relationship between Dataset Size and Image Captioning Model Performance.
Proceedings of the 26th Computer Vision Winter Workshop (CVWW 2023), 2023

Overview of SnakeCLEF 2023: Snake Identification in Medically Important Scenarios.
Proceedings of the Working Notes of the Conference and Labs of the Evaluation Forum (CLEF 2023), 2023

Overview of LifeCLEF 2023: Evaluation of AI Models for the Identification and Prediction of Birds, Plants, Snakes and Fungi.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2023

Voice-Interactive Learning Dialogue on a Low-Cost Device.
Proceedings of the Pattern Recognition - 7th Asian Conference, 2023

Domain-centric ADAS Datasets.
Proceedings of the Workshop on Artificial Intelligence Safety 2023 (SafeAI 2023) co-located with the Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI 2023), 2023

2022
One Model is Not Enough: Ensembles for Isolated Sign Language Recognition.
Sensors, 2022

Combining Efficient and Precise Sign Language Recognition: Good pose estimation library is all you need.
CoRR, 2022

Sign Pose-based Transformer for Word-level Sign Language Recognition.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2022

Neural Criticality Metric for Object Detection Deep Neural Networks.
Proceedings of the Computer Safety, Reliability, and Security. SAFECOMP 2022 Workshops, 2022

Overview of SnakeCLEF 2022: Automated Snake Species Identification on a Global Scale.
Proceedings of the Working Notes of CLEF 2022 - Conference and Labs of the Evaluation Forum, Bologna, Italy, September 5th - to, 2022

Overview of LifeCLEF 2022: An Evaluation of Machine-Learning Based Species Identification and Species Distribution Prediction.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2022

2021
Hand Pose Estimation in the Task of Egocentric Actions.
IEEE Access, 2021

X-Bridge: Image-to-Image Translation with Reconstruction Capabilities.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

OCR Improvements for Images of Multi-page Historical Documents.
Proceedings of the Speech and Computer - 23rd International Conference, 2021

Mutual Support of Data Modalities in the Task of Sign Language Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Neural Criticality: Validation of Convolutional Neural Networks.
Proceedings of the Workshop on Artificial Intelligence Safety 2021 (SafeAI 2021) co-located with the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021), 2021

2020
An Automated Pipeline for Robust Image Processing and Optical Character Recognition of Historical Documents.
Proceedings of the Speech and Computer - 22nd International Conference, 2020

Evaluation of Image Synthesis for Automotive Purposes.
Proceedings of the Interactive Collaborative Robotics - 5th International Conference, 2020


2019
Detection of Overlapping Speech for the Purposes of Speaker Diarization.
Proceedings of the Speech and Computer - 21st International Conference, 2019

Combination of Positions and Angles for Hand Pose Estimation.
Proceedings of the Speech and Computer - 21st International Conference, 2019

Identity Extraction from Clusters of Multi-modal Observations.
Proceedings of the Speech and Computer - 21st International Conference, 2019

Semantic Segmentation of Historical Documents via Fully-Convolutional Neural Network.
Proceedings of the Speech and Computer - 21st International Conference, 2019

UWB-NTIS Speaker Diarization System for the DIHARD II 2019 Challenge.
Proceedings of the Interspeech 2019, 2019

2018
Recurrent Neural Network Based Speaker Change Detection from Text Transcription Applied in Telephone Speaker Diarization System.
Proceedings of the Text, Speech, and Dialogue - 21st International Conference, 2018

LSTM Neural Network for Speaker Change Detection in Telephone Conversations.
Proceedings of the Speech and Computer - 20th International Conference, 2018

Generation of Synthetic Images of Full-Text Documents.
Proceedings of the Speech and Computer - 20th International Conference, 2018

Towards Processing of the Oral History Interviews and Related Printed Documents.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

ZCU-NTIS Speaker Diarization System for the DIHARD 2018 Challenge.
Proceedings of the Interspeech 2018, 2018

Multimodal Name Recognition in Live TV Subtitling.
Proceedings of the Interspeech 2018, 2018

Sign Language Numeral Gestures Recognition Using Convolutional Neural Network.
Proceedings of the Interactive Collaborative Robotics - Third International Conference, 2018

2017
Phase Analysis and Labeling Strategies in a CNN-Based Speaker Change Detection System.
Proceedings of the Speech and Computer - 19th International Conference, 2017

Speaker Diarization Using Convolutional Neural Network for Statistics Accumulation Refinement.
Proceedings of the Interspeech 2017, 2017

Convolutional Neural Network for speaker change detection in telephone speaker diarization system.
Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing, 2017

2016
Convolutional Neural Network in the Task of Speaker Change Detection.
Proceedings of the Speech and Computer - 18th International Conference, 2016

An Analysis of Visual Faces Datasets.
Proceedings of the Interactive Collaborative Robotics - First International Conference, 2016

2011
Automatic fingersign-to-speech translation system.
J. Multimodal User Interfaces, 2011

Towards Automatic Annotation of Sign Language Dictionary Corpora.
Proceedings of the Text, Speech and Dialogue - 14th International Conference, 2011

Multi-modal dialogue system with sign language capabilities.
Proceedings of the 13th International ACM SIGACCESS Conference on Computers and Accessibility, 2011

Automatic sign categorization using visual data.
Proceedings of the 13th International ACM SIGACCESS Conference on Computers and Accessibility, 2011

2008
Speech and sliding text aided sign retrieval from hearing impaired sign news videos.
J. Multimodal User Interfaces, 2008

Design and Recording of Czech Audio-Visual Database with Impaired Conditions for Continuous Speech Recognition.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Collection and Preprocessing of Czech Sign Language Corpus for Sign Language Recognition.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Feature space transforms for Czech sign-language recognition.
Proceedings of the INTERSPEECH 2008, 2008

2007
Design and recording of Czech sign language corpus for automatic sign language recognition.
Proceedings of the INTERSPEECH 2007, 2007

