Irina Piontkovskaya

Orcid: 0009-0003-0299-5849

According to our database, Irina Piontkovskaya authored at least 33 papers between 2017 and 2026.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2026
Unveiling Intrinsic Dimension of Texts: from Academic Abstract to Creative Story.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders.
Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics, 2026

2025
Gamayun's Path to Multilingual Mastery: Cost-Efficient Training of a 1.5B-Parameter LLM.
CoRR, December, 2025

Commute Your Domains: Trajectory Optimality Criterion for Multi-Domain Learning.
CoRR, January, 2025

Fast Thinking with Structured Prompts: Enabling LLM Reasoning without Chain-of-Thought Generation.
Proceedings of the 15th International Conference on Recent Advances in Natural Language Processing, 2025

CCT-Code: Cross-Consistency Training for Multilingual Clone Detection and Code Search.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

YABLoCo: Yet Another Benchmark for Long Context Code Generation.
Proceedings of the IEEE/ACM International Workshop on Large Language Models for Code, 2025

Quantifying Logical Consistency in Transformers via Query-Key Alignment.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025

LAMeD: LLM-generated Annotations for Memory Leak Detection.
Proceedings of the 29th International Conference on Evaluation and Assessment in Software Engineering, 2025

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

2024
Listening to the Wise Few: Select-and-Copy Attention Heads for Multiple-Choice QA.
CoRR, 2024

Improving Interpretability and Robustness for the Detection of AI-Generated Images.
CoRR, 2024

Robust AI-Generated Text Detection by Restricted Embeddings.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

2023
Artificial Text Boundary Detection with Topological Data Analysis and Sliding Window Techniques.
CoRR, 2023

Can BERT eat RuCoLA? Topological Data Analysis to Explain.
CoRR, 2023

PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing.
CoRR, 2023

Sinkhorn Transformations for Single-Query Postprocessing in Text-Video Retrieval.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Topological Data Analysis for Speech Processing.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Efficient Grammatical Error Correction Via Multi-Task Training and Optimized Training Schedule.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

GEC-DePenD: Non-Autoregressive Grammatical Error Correction with Decoupled Permutation and Decoding.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Betti numbers of attention graphs is all you really need.
CoRR, 2022

Template-based Approach to Zero-shot Intent Recognition.
CoRR, 2022

Ask Me Anything in Your Native Language.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Acceptability Judgements via Examining the Topology of Attention Maps.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

2021
Multiple Teacher Distillation for Robust and Greener Models.
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), 2021

InFoBERT: Zero-Shot Approach to Natural Language Understanding Using Contextualized Word Embedding.
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), 2021

Single Example Can Improve Zero-Shot Data Generation.
Proceedings of the 14th International Conference on Natural Language Generation, 2021

Artificial Text Detection via Examining the Topology of Attention Maps.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Revisiting Mahalanobis Distance for Transformer-Based Out-of-Domain Detection.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
SumTitles: a Summarization Dataset with Low Extractiveness.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

2018
Distributed Fine-tuning of Language Models on Private Data.
Proceedings of the 6th International Conference on Learning Representations, 2018

2017
Differentially Private Distributed Learning for Language Modeling Tasks.
CoRR, 2017

