Aline Villavicencio

Iryna Gurevych

CoRR, 2022

SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding.

[BibT_eX]

[DOI]

Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

Sample Efficient Approaches for Idiomaticity Detection.

[BibT_eX]

[DOI]

Dylan Phelps

Xuan-Rui Fan

Edward Gow-Smith

Carolina Scarton

Proceedings of the 18th Workshop on Multiword Expressions, 2022

Improving Tokenisation by Alternative Treatment of Spaces.

[BibT_eX]

[DOI]

Edward Gow-Smith

Carolina Scarton

Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

2021

Unsupervised Word Segmentation from Discrete Speech Units in Low-Resource Settings.

[BibT_eX]

[DOI]

CoRR, 2021

What if the whole is greater than the sum of the parts? Modelling Complex (Multiword) Expressions (invited paper).

[BibT_eX]

[DOI]

Proceedings of the First Workshop on Current Trends in Text Simplification (CTTS 2021) co-located with the 37th Conference of the Spanish Society for Natural Language Processing (SEPLN2021), 2021

AStitchInLanguageModels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models.

[BibT_eX]

[DOI]

Edward Gow-Smith

Carolina Scarton

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Probing for idiomaticity in vector space models.

[BibT_eX]

[DOI]

Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Assessing the Representations of Idiomaticity in Vector Models with a Noun Compound Dataset Labeled at Type and Token Levels.

[BibT_eX]

[DOI]

Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

CogNLP-Sheffield at CMCL 2021 Shared Task: Blending Cognitively Inspired Features with Transformer-based Language Models for Predicting Eye Tracking Patterns.

[BibT_eX]

[DOI]

Peter Vickers

Rosa Wainwright

Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, 2021

2020

Investigating alignment interpretability for low-resource NMT.

[BibT_eX]

[DOI]

Mach. Transl., 2020

Investigating Language Impact in Bilingual Approaches for Computational Language Documentation.

[BibT_eX]

[DOI]

Proceedings of the 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages and Collaboration and Computing for Under-Resourced Languages, 2020

2019

Discovering multiword expressions.

[BibT_eX]

[DOI]

Nat. Lang. Eng., 2019

How the Brain Represents Language and Answers Questions? Using an AI System to Understand the Underlying Neurobiological Mechanisms.

[BibT_eX]

[DOI]

Frontiers Comput. Neurosci., 2019

How Does Language Influence Documentation Workflow? Unsupervised Word Discovery Using Translations in Multiple Languages.

[BibT_eX]

[DOI]

CoRR, 2019

Why So Down? The Role of Negative (and Positive) Pointwise Mutual Information in Distributional Semantics.

[BibT_eX]

[DOI]

CoRR, 2019

Unsupervised Compositionality Prediction of Nominal Compounds.

[BibT_eX]

[DOI]

Comput. Linguistics, 2019

When the whole is greater than the sum of its parts: Multiword expressions and idiomaticity.

[BibT_eX]

[DOI]

Proceedings of the Joint Workshop on Multiword Expressions and WordNet, 2019

Empirical Evaluation of Sequence-to-Sequence Models for Word Discovery in Low-Resource Settings.

[BibT_eX]

[DOI]

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

2018

Incorporating Subword Information into Matrix Factorization Word Embeddings.

[BibT_eX]

[DOI]

CoRR, 2018

A Small Griko-Italian Speech Translation Corpus.

[BibT_eX]

[DOI]

Antonios Anastasopoulos

Marika Lekakou

Proceedings of the 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages, 2018

A Corpus Study of Verbal Multiword Expressions in Brazilian Portuguese.

[BibT_eX]

[DOI]

Proceedings of the Computational Processing of the Portuguese Language, 2018

Similarity Measures for the Detection of Clinical Conditions with Verbal Fluency Tasks.

[BibT_eX]

[DOI]

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

The brWaC Corpus: A New Open Resource for Brazilian Portuguese.

[BibT_eX]

[DOI]

Jorge A. Wagner Filho

Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Unsupervised Word Segmentation from Speech with Attention.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Restricted Recurrent Neural Tensor Networks: Exploiting Word Frequency and Compositionality.

[BibT_eX]

[DOI]

Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017

Restricted Recurrent Neural Tensor Networks: Exploiting Word Frequency and Compositionality for Increased Model Capacity and Performance With No Computational Overhead.

[BibT_eX]

[DOI]

CoRR, 2017

LexSubNC: A Dataset of Lexical Substitution for Nominal Compounds.

[BibT_eX]

[DOI]

Proceedings of the IWCS 2017 - 12th International Conference on Computational Semantics - Short papers, Montpellier, France, September 19, 2017

Unwritten languages demand attention too! Word discovery with encoder-decoder models.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

2016

Enhancing the LexVec Distributed Word Representation Model Using Positional Contexts and External Memory.

[BibT_eX]

[DOI]

CoRR, 2016

UFRGS&LIF at SemEval-2016 Task 10: Rule-Based MWE Identification and Predominant-Supersense Tagging.

[BibT_eX]

[DOI]

Silvio Cordeiro

Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Joining Forces for Multiword Expression Identification.

[BibT_eX]

[DOI]

Proceedings of the Computational Processing of the Portuguese Language, 2016

The Portuguese B ^2 2 SG: A Semantic Test for Distributional Thesaurus.

[BibT_eX]

[DOI]

Proceedings of the Computational Processing of the Portuguese Language, 2016

Crawling by Readability Level.

[BibT_eX]

[DOI]

Jorge A. Wagner Filho

Proceedings of the Computational Processing of the Portuguese Language, 2016

Filtering and Measuring the Intrinsic Quality of Human Compositionality Judgments.

[BibT_eX]

[DOI]

Silvio Cordeiro

Proceedings of the 12th Workshop on Multiword Expressions, 2016

VerbLexPor: a lexical resource with semantic roles for Portuguese.

[BibT_eX]

[DOI]

Leonardo Zilio

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

B2SG: a TOEFL-like Task for Portuguese.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Multiword Expressions in Child Language.

[BibT_eX]

[DOI]

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

mwetoolkit+sem: Integrating Word Embeddings in the mwetoolkit for Semantic MWE Processing.

[BibT_eX]

[DOI]

Silvio Cordeiro

Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Matrix Factorization using Window Sampling and Negative Sampling for Improved Word Representations.

[BibT_eX]

[DOI]

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

How Naked is the Naked Truth? A Multilingual Lexicon of Nominal Compound Compositionality.

[BibT_eX]

[DOI]

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Predicting the Compositionality of Nominal Compounds: Giving Word Embeddings a Hard Time.

[BibT_eX]

[DOI]

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Automatic Construction of Large Readability Corpora.

[BibT_eX]

[DOI]

Jorge A. Wagner Filho

Proceedings of the Workshop on Computational Linguistics for Linguistic Complexity, 2016

2015

VerbLexPor: um recurso léxico com anotação de papéis semânticos para o português (VerbLexPor: a lexical resource annotated with semantic roles for Portuguese).

[BibT_eX]

[DOI]

Leonardo Zilio

Proceedings of the 10th Brazilian Symposium in Information and Human Language Technology, 2015

Distributional Thesauri for Portuguese: methodology evaluation.

[BibT_eX]

[DOI]

Proceedings of the 10th Brazilian Symposium in Information and Human Language Technology, 2015

2014

brWaC: A WaCky Corpus for Brazilian Portuguese.

[BibT_eX]

[DOI]

Proceedings of the Computational Processing of the Portuguese Language, 2014

Comparing Similarity Measures for Distributional Thesauri.

[BibT_eX]

[DOI]

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Comparing the Quality of Focused Crawlers and of the Translation Resources Obtained from them.

[BibT_eX]

[DOI]

Bruno Laranjeira

Viviane Pereira Moreira

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Identification of Multiword Expressions in the brWaC.

[BibT_eX]

[DOI]

Rodrigo Boos

Kassius Prestes

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Size Does Not Matter. Frequency Does. A Study of Features for Measuring Lexical Complexity.

[BibT_eX]

[DOI]

Alessandro Dalla Vecchia

Muntsa Padró

Proceedings of the Advances in Artificial Intelligence - IBERAMIA 2014, 2014

Nothing like Good Old Frequency: Studying Context Filters for Distributional Thesauri.

[BibT_eX]

[DOI]

Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

2013

Computational Modeling as a Methodology for Studying Human Language Learning.

[BibT_eX]

[DOI]

Proceedings of the Cognitive Aspects of Computational Language Acquisition, 2013

Introduction to the special issue on multiword expressions: From theory to practice and use.

[BibT_eX]

[DOI]

Valia Kordoni

ACM Trans. Speech Lang. Process., 2013

Language Acquisition and Probabilistic Models: keeping it simple.

[BibT_eX]

[DOI]

Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012

<i>Syntax-Based Collocation Extraction</i>, by Violeta Seretan. Berlin: Springer, 2011. ISBN-10 9400701330, ISBN-13 978-9400701335. $139.00/£90.00 (Hardcover) xi + 220 pages.

[BibT_eX]

[DOI]

Nat. Lang. Eng., 2012

A large scale annotated child language construction database.

[BibT_eX]

[DOI]

Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

2011

Improving Lexical Alignment Using Hybrid Discriminative and Post-Processing Techniques.

[BibT_eX]

[DOI]

Paulo Schreiner

Leonardo Zilio

Proceedings of the 8th Brazilian Symposium in Information and Human Language Technology, 2011

Extração e Validação de Ontologias a partir de Recursos Digitais.

[BibT_eX]

[DOI]

Proceedings of Joint IV Seminar on Ontology Research in Brazil and VI International Workshop on Metamodels, 2011

Sistema de Aquisição Semi-Automática de Ontologias.

[BibT_eX]

[DOI]

Gabriel Gonçalves

Proceedings of Joint IV Seminar on Ontology Research in Brazil and VI International Workshop on Metamodels, 2011

Identifying and Analyzing Brazilian Portuguese Complex Predicates.

[BibT_eX]

[DOI]

Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World, 2011

Fast and Flexible MWE Candidate Generation with the mwetoolkit.

[BibT_eX]

[DOI]

Vitor de Araújo

Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World, 2011

Identification and Treatment of Multiword Expressions Applied to Information Retrieval.

[BibT_eX]

[DOI]

Otávio Costa Acosta

Viviane Pereira Moreira

Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World, 2011

2010

Alignment-based extraction of multiword expressions.

[BibT_eX]

[DOI]

Maria das Graças Volpe Nunes

Lang. Resour. Evaluation, 2010

Identificação de Expressões Multipalavra em Domínios Específicos.

[BibT_eX]

[DOI]

André Machado

Linguamática, 2010

An Investigation on the Influence of Frequency on the Lexical Organization of Verbs.

[BibT_eX]

[DOI]

Daniel Cerato Germann

Maity Siqueira

Proceedings of TextGraphs@ACL 2010 Workshop on Graph-based Methods for Natural Language Processing, 2010

Question Answering for Portuguese: How Much Is Needed?

[BibT_eX]

[DOI]

Proceedings of the Advances in Artificial Intelligence - SBIA 2010, 2010

A Hybrid Approach for Multiword Expression Identification.

[BibT_eX]

[DOI]

André Machado

Proceedings of the Computational Processing of the Portuguese Language, 2010

mwetoolkit: a Framework for Multiword Expression Identification.

[BibT_eX]

[DOI]

Christian Boitet

Proceedings of the International Conference on Language Resources and Evaluation, 2010

COMUNICA - A Question Answering System for Brazilian Portuguese.

[BibT_eX]

[DOI]

Proceedings of the COLING 2010, 2010

Multiword Expressions in the wild? The mwetoolkit comes in handy.

[BibT_eX]

[DOI]

Christian Boitet

Proceedings of the COLING 2010, 2010

Web-based and combined language models: a case study on noun compound identification.

[BibT_eX]

[DOI]

Christian Boitet

Proceedings of the COLING 2010, 2010

2009

Prepositions in Applications: A Survey and Introduction to the Special Issue.

[BibT_eX]

[DOI]

Timothy Baldwin

Valia Kordoni

Comput. Linguistics, 2009

Statistically-Driven Alignment-Based Multiword Expression Identification for Technical Domains.

[BibT_eX]

[DOI]

André Machado

Proceedings of the Workshop on Multiword Expressions: Identification, 2009

2008

Picking them up and Figuring them out: Verb-Particle Constructions, Noise and Idiomaticity.

[BibT_eX]

[DOI]

Proceedings of the Twelfth Conference on Computational Natural Language Learning, 2008

UFRGS@CLEF2008: Indexing Multiword Expressions for Information Retrieval.

[BibT_eX]

[DOI]

Otávio Costa Acosta

André Pinto Geraldo

Viviane Moreira Orengo

Proceedings of the Working Notes for CLEF 2008 Workshop co-located with the 12th European Conference on Digital Libraries (ECDL 2008) , 2008

2007

Validation and Evaluation of Automatically Acquired Multiword Expressions for Grammar Engineering.

[BibT_eX]

[DOI]

Proceedings of the EMNLP-CoNLL 2007, 2007

2005

Introduction to the special issue on multiword expressions: Having a crack at a hard nut.

[BibT_eX]

[DOI]

Comput. Speech Lang., 2005

The availability of verb-particle constructions in lexical resources: How much is enough?

[BibT_eX]

[DOI]

Comput. Speech Lang., 2005

2004

A Multilingual Database of Idioms.

[BibT_eX]

[DOI]

Timothy Baldwin

Benjamin Waldron

Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

2002

The acquisition of a unification-based generalised categorial grammar.

[BibT_eX]

[DOI]

PhD thesis, 2002

Multiword expressions: linguistic precision and reusability.

[BibT_eX]

[DOI]

Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

Learning to Distinguish PP Arguments from Adjuncts.

[BibT_eX]

[DOI]

Proceedings of the 6th Conference on Natural Language Learning, 2002

Extracting the Unextractable: A Case Study on Verb-particles.

[BibT_eX]

[DOI]

Timothy Baldwin

Proceedings of the 6th Conference on Natural Language Learning, 2002

2000

The Acquisition of Word Order by a Computational Learning System.

[BibT_eX]

[DOI]

Proceedings of the Fourth Conference on Computational Natural Language Learning, 2000

1999

Representing a System of Lexical Types Using Default Unification.

[BibT_eX]

[DOI]

Proceedings of the EACL 1999, 1999

1995

Part-of-Speech Tagging for Portuguese Texts.

[BibT_eX]

[DOI]

José Gabriel Pereira Lopes

Nuno M. C. Marques

Fabio Villavicencio

Proceedings of the Advances in Artificial Intelligence, 1995

A Hierarchial Description of the Portuguese Verb.

[BibT_eX]

[DOI]

Paul McFetridge