Andrey Kutuzov

Orcid: 0000-0003-2540-5912

Affiliations:
  • University of Oslo, Norway
  • National Research University Higher School of Economics, Moscow, Russia
  • Tyumen State University, Russia


According to our database1, Andrey Kutuzov authored at least 58 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Enriching Word Usage Graphs with Cluster Definitions.
CoRR, 2024

A New Massive Multilingual Dataset for High-Performance Language Technologies.
CoRR, 2024

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

2023
NorBench - A Benchmark for Norwegian Language Models.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

Trained on 100 million words and still in shape: BERT meets British National Corpus.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Interpretable Word Sense Representations via Definition Generation: The Case of Semantic Change Analysis.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
RuDSI: graph-based word sense induction dataset for Russian.
CoRR, 2022

Contextualized language models for semantic change detection: lessons learned.
CoRR, 2022

SemEval 2022 Task 10: Structured Sentiment Analysis.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

NorDiaChange: Diachronic Semantic Change Dataset for Norwegian.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Do Not Fire the Linguist: Grammatical Profiles Help Language Models Detect Semantic Change.
Proceedings of the 3rd Workshop on Computational Approaches to Historical Language Change, 2022

2021
Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks.
CoRR, 2021

Multilingual ELMo and the Effects of Corpus Sampling.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

Large-Scale Contextualised Language Modelling for Norwegian.
Proceedings of the 23rd Nordic Conference on Computational Linguistics, 2021

Representing ELMo embeddings as two-dimensional text online.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 2021

Grammatical Profiling for Semantic Change Detection.
Proceedings of the 25th Conference on Computational Natural Language Learning, 2021

Three-part diachronic semantic change dataset for Russian.
Proceedings of The 2nd International Workshop on Computational Approaches to Historical Language Change 2021, 2021

2020
UiO-UvA at SemEval-2020 Task 1: Contextualised Embeddings for Lexical Semantic Change Detection.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Word Sense Disambiguation for 158 Languages using Word Embeddings Only.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

RuSemShift: a dataset of historical lexical semantic change in Russian.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Semantic Recommendation System for Bilingual Corpus of Academic Papers.
Proceedings of the Recent Trends in Analysis of Images, Social Networks and Texts, 2020

ELMo and BERT in Semantic Change Detection for Russian.
Proceedings of the Analysis of Images, Social Networks and Texts, 2020

2019
To lemmatize or not to lemmatize: how word normalisation affects ELMo performance in word sense disambiguation.
CoRR, 2019

Tracing cultural diachronic semantic shifts in Russian using word embeddings: test sets and baselines.
CoRR, 2019

Learning Graph Embeddings from WordNet-based Similarity Measures.
Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics, 2019

ÚFAL-Oslo at MRP 2019: Garage Sale Semantic Parsing.
Proceedings of the Shared Task on Cross-Framework Meaning Representation Parsing at the 2019 Conference on Natural Language Learning, 2019

Double-Blind Peer-Reviewing and Inclusiveness in Russian NLP Conferences.
Proceedings of the Analysis of Images, Social Networks and Texts, 2019

Vec2graph: A Python Library for Visualizing Word Embeddings as Graphs.
Proceedings of the Analysis of Images, Social Networks and Texts, 2019

Making Fast Graph-based Algorithms with Graph Metric Embeddings.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Measuring Diachronic Evolution of Evaluative Adjectives with Word Embeddings: the Case for English, Norwegian, and Russian.
Proceedings of the 1st International Workshop on Computational Approaches to Historical Language Change, 2019

One-to-X Analogical Reasoning on Word Embeddings: a Case for Diachronic Armed Conflict Prediction from News Texts.
Proceedings of the 1st International Workshop on Computational Approaches to Historical Language Change, 2019

2018
Learning Graph Embeddings from WordNet-based Similarity Measures.
CoRR, 2018

Russian word sense induction by clustering averaged word embeddings.
CoRR, 2018

Universal Dependencies-based syntactic features in detecting human translation varieties.
Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories, 2018

Diachronic word embeddings and semantic shifts: a survey.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

RusNLP: Semantic Search Engine for Russian NLP Conference Papers.
Proceedings of the Analysis of Images, Social Networks and Texts, 2018

Unsupervised Semantic Frame Induction using Triclustering.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
Redefining Context Windows for Word Embedding Models: An Experimental Study.
Proceedings of the 21st Nordic Conference on Computational Linguistics, 2017

Word vectors, reuse, and replicability: Towards a community repository of large-text resources.
Proceedings of the 21st Nordic Conference on Computational Linguistics, 2017

Temporal dynamics of semantic relations in word embeddings: an application to predicting armed conflict participants.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Building Web-Interfaces for Vector Semantic Models with the WebVectors Toolkit.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Size vs. Structure in Training Corpora for Word Embedding Models: Araneum Russicum Maximum and Russian National Corpus.
Proceedings of the Analysis of Images, Social Networks and Texts, 2017

Tracing armed conflicts with diachronic word embedding models.
Proceedings of the Events and Stories in the News Workshop@ACL 2017, 2017

Clustering of Russian Adjective-Noun Constructions using Word Embeddings.
Proceedings of the 6th Workshop on Balto-Slavic Natural Language Processing, 2017

2016
Clustering Comparable Corpora of Russian and Ukrainian Academic Texts: Word Embeddings and Semantic Fingerprints.
CoRR, 2016

Exploration of register-dependent lexical semantics using word embeddings.
Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities, 2016

Neural Embedding Language Models in Semantic Clustering of Web Search Results.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Cross-Lingual Trends Detection for Named Entities in News Texts with Dynamic Neural Embedding Models.
Proceedings of the First International Workshop on Recent Trends in News Information Retrieval co-located with 38th European Conference on Information Retrieval (ECIR 2016), 2016

Redefining part-of-speech classes with distributional semantic models.
Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, 2016

WebVectors: A Toolkit for Building Web Interfaces for Vector Semantic Models.
Proceedings of the Analysis of Images, Social Networks and Texts, 2016

2015
Texts in, meaning out: neural language models in semantic similarity task for Russian.
CoRR, 2015

Comparing Neural Lexical Models of a Classic National Corpus and a Web Corpus: The Case for Russian.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2015

2014
Russian Learner Translator Corpus - Design, Research Potential and Applications.
Proceedings of the Text, Speech and Dialogue - 17th International Conference, 2014

Semantic Clustering of Russian Web Search Results: Possibilities and Problems.
Proceedings of the Information Retrieval, 2014

Untangling the Semantic Web: Microdata Use in Russian Video Content Delivery Sites.
Proceedings of the Analysis of Images, Social Networks and Texts, 2014

2013
Improving English-Russian sentence alignment through POS tagging and Damerau-Levenshtein distance.
Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing, 2013

2010
Change of word types to word tokens ratio in the course of translation (based on Russian translations of K. Vonnegut novels)
CoRR, 2010

2008
Using descriptive mark-up to formalize translation quality assessment
CoRR, 2008


  Loading...