Egor Lakomkin

According to our database1, Egor Lakomkin authored at least 21 papers between 2011 and 2023.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2023
Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data.
CoRR, 2023

End-to-End Speech Recognition Contextualization with Large Language Models.
CoRR, 2023

Prompting Large Language Models with Speech Recognition Abilities.
CoRR, 2023

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision.
CoRR, 2023

Egocentric Audio-Visual Noise Suppression.
Proceedings of the IEEE International Conference on Acoustics, 2023

SynthVSR: Scaling Up Visual Speech RecognitionWith Synthetic Supervision.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Egocentric Audio-Visual Noise Suppression.
CoRR, 2022

Being Greedy Does Not Hurt: Sampling Strategies for End-To-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022

2020
Subword Regularization: An Analysis of Scalability and Generalization for End-to-End Automatic Speech Recognition.
Proceedings of the Interspeech 2020, 2020

2019
Predictive Auxiliary Variational Autoencoder for Representation Learning of Global Speech Characteristics.
Proceedings of the Interspeech 2019, 2019

Incorporating End-to-End Speech Recognition Models for Sentiment Analysis.
Proceedings of the International Conference on Robotics and Automation, 2019

2018
On the Robustness of Speech Emotion Recognition for Human-Robot Interaction with Deep Neural Networks.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

The OMG-Emotion Behavior Dataset.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

EmoRL: Continuous Acoustic Emotion Classification Using Deep Reinforcement Learning.
Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018

Combining Articulatory Features with End-to-End Learning in Speech Recognition.
Proceedings of the Artificial Neural Networks and Machine Learning - ICANN 2018, 2018

Image-to-Text Transduction with Spatial Self-Attention.
Proceedings of the 26th European Symposium on Artificial Neural Networks, 2018

KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018

2017
GradAscent at EmoInt-2017: Character and Word Level Recurrent Neural Network Models for Tweet Emotion Intensity Detection.
Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, 2017

Reusing Neural Speech Representations for Auditory Emotion Recognition.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Automatically augmenting an emotion dataset improves classification using audio.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

2011
MVC Web Framework Based on eXist Application Server and XRX Architecture.
Proceedings of the Seventh Spring Researchers Colloquium on Databases and Information Systems, 2011


  Loading...