Dmitry Ustalov

Orcid: 0000-0002-9979-2188

Affiliations:
  • Toloka, Belgrade, Serbia
  • Yandex, Saint Petersburg, Russia (former)
  • University of Mannheim, Germany (former)
  • Krasovskii Institute of Mathematics and Mechanics, Russia (former)
  • Ural Federal University, Russia (former)


According to our database1, Dmitry Ustalov authored at least 48 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Overview of PAN 2024: Multi-author Writing Style Analysis, Multilingual Text Detoxification, Oppositional Thinking Analysis, and Generative AI Authorship Verification - Extended Abstract.
Proceedings of the Advances in Information Retrieval, 2024

2023
Toloka Visual Question Answering Benchmark.
CoRR, 2023

4th Crowd Science Workshop - CANDLE: Collaboration of Humans and Learning Algorithms for Data Labeling.
Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, 2023

Best Prompts for Text-to-Image Models and How to Find Them.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Crowdsourcing for Information Retrieval.
Proceedings of the Advances in Information Retrieval, 2023

Clustering Without Knowing How To: Application and Evaluation.
Proceedings of the Advances in Information Retrieval, 2023

WSDM Cup 2023 Challenge on Visual Question Answering.
Proceedings of the 4th Crowd Science Workshop on Collaboration of Humans and Learning Algorithms for Data Labeling co-located with ACM International WSDM Conference (WSDM 2023), 2023

Data Labeling for Machine Learning Engineers: Project-Based Curriculum and Data-Centric Competitions.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Challenges in Data Production for AI with Human-in-the-Loop.
Proceedings of the WSDM '22: The Fifteenth ACM International Conference on Web Search and Data Mining, Virtual Event / Tempe, AZ, USA, February 21, 2022

Improving Recommender Systems with Human-in-the-Loop.
Proceedings of the RecSys '22: Sixteenth ACM Conference on Recommender Systems, Seattle, WA, USA, September 18, 2022

Web Engineering with Human-in-the-Loop.
Proceedings of the Web Engineering - 22nd International Conference, 2022

REGROW: Reimagining Global Crowdsourcing for Better Human-AI Collaboration.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

2021
IMDB-WIKI-SbS: An Evaluation Dataset for Crowdsourced Pairwise Comparisons.
CoRR, 2021

A General-Purpose Crowdsourcing Computational Quality Control Toolkit for Python.
CoRR, 2021

Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription.
CoRR, 2021

CrowdSpeech and Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

VLDB 2021 Crowd Science Challenge on Aggregating Crowdsourced Audio Transcriptions.
Proceedings of the 2nd Crowd Science Workshop: Trust, 2021

2020
Practice of Efficient Data Collection via Crowdsourcing: Aggregation, Incremental Relabelling, and Pricing.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

Crowdsourcing Practice for Efficient Data Labeling: Aggregation, Incremental Relabeling, and Pricing.
Proceedings of the 2020 International Conference on Management of Data, 2020

Word Sense Disambiguation for 158 Languages using Word Embeddings Only.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

2019
Watset.
Comput. Linguistics, 2019

TextGraphs 2019 Shared Task on Multi-Hop Inference for Explanation Regeneration.
Proceedings of the Thirteenth Workshop on Graph-Based Methods for Natural Language Processing, 2019

HHMM at SemEval-2019 Task 2: Unsupervised Frame Induction using Contextualized Word Embeddings.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

The Role of Student Projects in Teaching Machine Learning and High Performance Computing.
Proceedings of the Supercomputing, 2019

2018
Local-Global Graph Clustering with Applications in Sense and Frame Induction.
CoRR, 2018

RUSSE: The First Workshop on Russian Semantic Similarity.
CoRR, 2018

RUSSE'2018: A Shared Task on Word Sense Induction for the Russian Language.
CoRR, 2018

Equidistant Nodes Clustering: a Soft Clustering Algorithm Applied for Synset Induction.
Proceedings of the Selected Papers of the XX International Conference on Data Analytics and Management in Data Intensive Domains (DAMDID/RCDL 2018), 2018

An Unsupervised Word Sense Disambiguation System for Under-Resourced Languages.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Improving Hypernymy Extraction with Distributional Semantic Classes.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Unsupervised Sense-Aware Hypernymy Extraction.
Proceedings of the 14th Conference on Natural Language Processing, 2018

Unsupervised Semantic Frame Induction using Triclustering.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Fighting with the Sparsity of Synonymy Dictionaries.
CoRR, 2017

Watset: Automatic Induction of Synsets from a Graph of Synonyms.
CoRR, 2017

Unsupervised, Knowledge-Free, and Interpretable Word Sense Disambiguation.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Negative Sampling Improves Hypernymy Extraction Based on Projection Learning.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Fighting with the Sparsity of Synonymy Dictionaries for Automatic Synset Induction.
Proceedings of the Analysis of Images, Social Networks and Texts, 2017

Automatic Induction of Synsets from a Graph of Synonyms.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Eliminating Fuzzy Duplicates in Crowdsourced Lexical Resources.
Proceedings of the 8th Global WordNet Conference, 2016

YARN: Spinning-in-Progress.
Proceedings of the 8th Global WordNet Conference, 2016

Human and Machine Judgements for Russian Semantic Relatedness.
Proceedings of the Analysis of Images, Social Networks and Texts, 2016

2015
TagBag: Annotating a Foreign Language Lexical Resource with Pictures.
Proceedings of the Analysis of Images, Social Networks and Texts, 2015

2014
Towards Crowdsourcing and Cooperation in Linguistic Resources.
Proceedings of the Information Retrieval, 2014

NLPub: каталог и сообщество русских лингвистических ресурсов (NLPub: a Catalogue and a Community for Russian Linguistic Resources).
Proceedings of the Selected Papers of XVI All-Russian Scientific Conference "Digital libraries: Advanced Methods and Technologies, 2014

Words Worth Attention: Predicting Words of the Week on the Russian Wiktionary.
Proceedings of the Knowledge Engineering and the Semantic Web, 2014

A Spinning Wheel for YARN: User Interface for a Crowdsourced Thesaurus.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Enhancing Russian Wordnets Using the Force of the Crowd.
Proceedings of the Analysis of Images, Social Networks and Texts, 2014

2013
Orchestrating the Natural Language Processing Software in the Cloud Computing Environment.
J. Digit. Inf. Manag., 2013


  Loading...