Hinrich Schütze

Orcid: 0000-0001-9514-7934

Affiliations:
  • Ludwig Maximilian University of Munich, Center for Information and Language Processing, Germany
  • University of Stuttgart, Institute for Natural Language Processing, Germany (former)
  • Stanford University, CA, USA (former)


According to our database1, Hinrich Schütze authored at least 424 papers between 1989 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Geographic Adaptation of Pretrained Language Models.
Trans. Assoc. Comput. Linguistics, 2024

GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages.
CoRR, 2024

MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment.
CoRR, 2024

LangSAMP: Language-Script Aware Multilingual Pretraining.
CoRR, 2024

How Transliterations Improve Crosslingual Alignment.
CoRR, 2024

MURI: High-Quality Instruction Tuning Datasets for Low-Resource Languages via Reverse Instructions.
CoRR, 2024

CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation.
CoRR, 2024

SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists.
CoRR, 2024

ChatZero:Zero-shot Cross-Lingual Dialogue Generation via Pseudo-Target Language.
CoRR, 2024

Problem Solving Through Human-AI Preference-Based Cooperation.
CoRR, 2024

Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts.
CoRR, 2024

A Recipe of Parallel Corpora Exploitation for Multilingual Large Language Models.
CoRR, 2024

Learn it or Leave it: Module Composition and Pruning for Continual Learning.
CoRR, 2024

BMIKE-53: Investigating Cross-Lingual Knowledge Editing with In-Context Learning.
CoRR, 2024

Robustness Testing of Multi-Modal Models in Varied Home Environments for Assistive Robots.
CoRR, 2024

TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data.
CoRR, 2024

XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples.
CoRR, 2024

MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory.
CoRR, 2024

Hybrid Human-LLM Corpus Construction and LLM Evaluation for Rare Linguistic Phenomena.
CoRR, 2024

Decomposed Prompting: Unveiling Multilingual Linguistic Structure Knowledge in English-Centric Large Language Models.
CoRR, 2024

What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects.
CoRR, 2024

HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy.
CoRR, 2024

MaLA-500: Massive Language Adaptation of Large Language Models.
CoRR, 2024

MoSECroT: Model Stitching with Static Word Embeddings for Crosslingual Zero-shot Transfer.
CoRR, 2024

A Unified Data Augmentation Framework for Low-Resource Multi-domain Dialogue Generation.
Proceedings of the Machine Learning and Knowledge Discovery in Databases. Research Track, 2024

Rehearsal-Free Modular and Compositional Continual Learning for Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2024

OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024

SynthEval: Hybrid Behavioral Testing of NLP Models with Synthetic Evaluation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

TurkishMMLU: Measuring Massive Multitask Language Understanding in Turkish.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Breaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Better Call SAUL: Fluent and Consistent Language Model Editing with Generation Regularization.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Consistent Document-level Relation Extraction via Counterfactuals.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

LongForm: Effective Instruction Tuning with Reverse Instructions.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

ChatZero: Zero-Shot Cross-Lingual Dialogue Generation via Pseudo-Target Language.
Proceedings of the ECAI 2024 - 27th European Conference on Artificial Intelligence, 19-24 October 2024, Santiago de Compostela, Spain, 2024

Kardeş-NLU: Transfer to Low-Resource Languages with Big Brother's Help - A Benchmark and Evaluation for Turkic Languages.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks.
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024

Constructions Are So Difficult That Even Large Language Models Get Them Right for the Wrong Reasons.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

UCxn: Typologically-Informed Annotation of Constructions Atop Universal Dependencies.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

SilverAlign: MT-Based Silver Data Algorithm for Evaluating Word Alignment.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

GlotScript: A Resource and Tool for Low Resource Writing System Identification.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

MaiBaam: A Multi-Dialectal Bavarian Universal Dependency Treebank.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

GNNavi: Navigating the Information Flow in Large Language Models by Graph Neural Network.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

MaskLID: Code-Switching Language Identification through Iterative Masking.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

2023
Explaining pretrained language models' understanding of linguistic structures using construction grammar.
Frontiers Artif. Intell., February, 2023

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Trans. Mach. Learn. Res., 2023

Multilingual Word Embeddings for Low-Resource Languages using Anchors and a Chain of Related Languages.
CoRR, 2023

LoHoRavens: A Long-Horizon Language-Conditioned Benchmark for Robotic Tabletop Manipulation.
CoRR, 2023

Towards Language-Based Modulation of Assistive Robots through Multimodal Models.
CoRR, 2023

Politeness Stereotypes and Attack Vectors: Gender Stereotypes in Japanese and Korean Language Models.
CoRR, 2023

Evaluate What You Can't Evaluate: Unassessable Generated Responses Quality.
CoRR, 2023

RET-LLM: Towards a General Read-Write Memory for Large Language Models.
CoRR, 2023

mPLM-Sim: Unveiling Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models.
CoRR, 2023

A study of conceptual language similarity: comparison and evaluation.
CoRR, 2023

Language-Agnostic Bias Detection in Language Models.
CoRR, 2023

Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs.
CoRR, 2023

Taxi1500: A Multilingual Dataset for Text Classification in 1500 Languages.
CoRR, 2023

LongForm: Optimizing Instruction Tuning for Long Text Generation with Corpus Extraction.
CoRR, 2023

Sociocultural knowledge is needed for selection of shots in hate speech detection tasks.
CoRR, 2023

MenuCraft: Interactive Menu System Design with Large Language Models.
CoRR, 2023

Construction Grammar Provides Unique Insight into Neural Language Models.
CoRR, 2023

Does Manipulating Tokenization Aid Cross-Lingual Transfer? A Study on POS Tagging for Non-Standardized Languages.
Proceedings of the Tenth Workshop on NLP for Similar Languages, Varieties and Dialects, 2023

Semantic-Oriented Unlabeled Priming for Large-Scale Language Models.
Proceedings of The Fourth Workshop on Simple and Efficient Natural Language Processing, 2023

NLNDE at SemEval-2023 Task 12: Adaptive Pretraining and Source Language Selection for Low-Resource Multilingual Sentiment Analysis.
Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023

Cross-Lingual Constituency Parsing for Middle High German: A Delexicalized Approach.
Proceedings of the Ancient Language Processing Workshop, 2023

A Survey of Corpora for Germanic Low-Resource Languages and Dialects.
Proceedings of the 24th Nordic Conference on Computational Linguistics, 2023

GIRT-Data: Sampling GitHub Issue Report Templates.
Proceedings of the 20th IEEE/ACM International Conference on Mining Software Repositories, 2023

Is Prompt-Based Finetuning Always Better than Vanilla Finetuning? Insights from Cross-Lingual Language Understanding.
Proceedings of the 19th Conference on Natural Language Processing (KONVENS 2023), 2023

On the Copying Problem of Unsupervised NMT: A Training Schedule with a Language Discriminator Loss.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

GradSim: Gradient-Based Language Grouping for Effective Multilingual Training.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Unleashing the Multilingual Encoder Potential: Boosting Zero-Shot Performance via Probability Calibration.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Language-Agnostic Bias Detection in Language Models with Bias Probing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

MEAL: Stable and Active Learning for Few-Shot Prompting.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Language Models with Rationality.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

GlotLID: Language Identification for Low-Resource Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

Cross-Lingual Retrieval Augmented Prompt for Low-Resource Languages.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

A Crosslingual Investigation of Conceptualization in 1335 Languages.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

PVGRU: Generating Diverse and Relevant Dialogue Responses via Pseudo-Variational Mechanism.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

ECOLA: Enhancing Temporal Knowledge Embeddings with Contextualized Language Representations.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
The Reddit Politosphere: A Large-Scale Text and Network Resource of Online Political Discourse.
Dataset, January, 2022

True Few-Shot Learning With Prompts - A Real-World Perspective.
Trans. Assoc. Comput. Linguistics, 2022

Learning interpretable word embeddings via bidirectional alignment of dimensions with semantic concepts.
Inf. Process. Manag., 2022

Modeling Content-Emotion Duality via Disentanglement for Empathetic Conversation.
CoRR, 2022

Measuring Causal Effects of Data Statistics on Language Model's 'Factual' Predictions.
CoRR, 2022

Don't Forget Cheap Training Signals Before Building Unsupervised Bilingual Word Embeddings.
CoRR, 2022

Analyzing Hate Speech Data along Racial, Gender and Intersectional Axes.
CoRR, 2022

Domain Adaptation for Sparse-Data Settings: What Do We Gain by Not Using Bert?
CoRR, 2022

Enhanced Temporal Knowledge Embeddings with Contextualized Language Representations.
CoRR, 2022

Position Information in Transformers: An Overview.
Comput. Linguistics, 2022

This joke is [MASK]: Recognizing Humor and Offense with Prompting.
Proceedings of the Transfer Learning for Natural Language Processing Workshop, 2022

LMTurk: Few-Shot Learners as Crowdsourcing Workers in a Language-Model-as-a-Service Framework.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

An Information-Theoretic Approach and Dataset for Probing Gender Stereotypes in Multilingual Masked Language Models.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Modeling Ideological Salience and Framing in Polarized Online Groups with Graph Neural Networks and Structured Sparsity.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

Towards a Broad Coverage Named Entity Resource: A Data-Efficient Approach for Many Diverse Languages.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Hengam: An Adversarially Trained Transformer for Persian Temporal Tagging.
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing, 2022

The Reddit Politosphere: A Large-Scale Text and Network Resource of Online Political Discourse.
Proceedings of the Sixteenth International AAAI Conference on Web and Social Media, 2022

Unsupervised Detection of Contextualized Embedding Bias with Application to Ideology.
Proceedings of the International Conference on Machine Learning, 2022

The better your Syntax, the better your Semantics? Probing Pretrained Language Models for the English Comparative Correlative.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Graph-Based Multilingual Label Propagation for Low-Resource Part-of-Speech Tagging.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Federated Continual Learning for Text Classification via Selective Inter-client Transfer.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

CaMEL: Case Marker Extraction without Labels.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

CoDA21: Evaluating Language Understanding Capabilities of NLP Models With Context-Definition Alignment.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Listening to Affected Communities to Define Extreme Speech: Dataset and Experiments.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Flow-Adapter Architecture for Unsupervised Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Modular and Parameter-Efficient Multimodal Fusion with Prompting.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

Differentiable Multi-Agent Actor-Critic for Multi-Step Radiology Report Summarization.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Graph Neural Networks for Multiparallel Word Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

An Embarrassingly Simple Method to Mitigate Undesirable Properties of Pretrained Language Model Tokenizers.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Improving Scene Graph Classification by Exploiting Knowledge from Texts.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP.
Trans. Assoc. Comput. Linguistics, 2021

Erratum: Measuring and Improving Consistency in Pretrained Language Models.
Trans. Assoc. Comput. Linguistics, 2021

Measuring and Improving Consistency in Pretrained Language Models.
Trans. Assoc. Comput. Linguistics, 2021

LMTurk: Few-Shot Learners as Crowdsourcing Workers.
CoRR, 2021

Active Learning for Argument Mining: A Practical Approach.
CoRR, 2021

Scene Graph Generation for Better Image Captioning?
CoRR, 2021

BERT Cannot Align Characters.
CoRR, 2021

Locating Language-Specific Information in Contextualized Embeddings.
CoRR, 2021

Modeling Ideological Agenda Setting and Framing in Polarized Online Groups with Graph Neural Networks and Structured Sparsity.
CoRR, 2021

Enriching a Model's Notion of Belief using a Persistent Memory.
CoRR, 2021

Few-Shot Learning of an Interleaved Text Summarization Model by Pretraining with Synthetic Data.
CoRR, 2021

Improving Visual Reasoning by Exploiting The Knowledge in Texts.
CoRR, 2021

Does He Wink or Does He Nod? A Challenging Benchmark for Evaluating Word Understanding of Language Models.
CoRR, 2021

Superbizarre Is Not Superb: Improving BERT's Interpretations of Complex Words with Derivational Morphology.
CoRR, 2021

Semantic Text Segment Classification of Structured Technical Content.
Proceedings of the Natural Language Processing and Information Systems, 2021

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Multi-source Neural Topic Modeling in Multi-view Embedding Spaces.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Static Embeddings as Efficient Knowledge Bases?
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Data Centric Domain Adaptation for Historical Text with OCR Errors.
Proceedings of the 16th International Conference on Document Analysis and Recognition, 2021

Discrete and Soft Prompting for Multilingual Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Continuous Entailment Patterns for Lexical Inference in Context.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Generating Datasets with Pretrained Language Models.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Few-Shot Text Generation with Natural Language Instructions.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Wine is not v i n. On the Compatibility of Tokenizations across Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Graph Algorithms for Multiparallel Word Alignment.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Does She Wink or Does She Nod? A Challenging Benchmark for Evaluating Word Understanding of Language Models.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Language Models for Lexical Inference in Context.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Exploiting Cloze-Questions for Few-Shot Text Classification and Natural Language Inference.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Multilingual LAMA: Investigating Knowledge in Multilingual Pretrained Language Models.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

A Closer Look at Few-Shot Crosslingual Transfer: The Choice of Shots Matters.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

ParCourE: A Parallel Corpus Explorer for a Massively Multilingual Corpus.
Proceedings of the Joint Conference of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Dynamic Contextualized Word Embeddings.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Semi-Automated Labeling of Requirement Datasets for Relation Extraction.
Proceedings of the 14th Workshop on Building and Using Comparable Corpora, 2021

2020
Relational and Fine-Grained Argument Mining.
Datenbank-Spektrum, 2020

A Closer Look at Few-Shot Crosslingual Transfer: Variance, Benchmarks and Baselines.
CoRR, 2020

Few-Shot Text Generation with Pattern-Exploiting Training.
CoRR, 2020

Subword Sampling for Low Resource Word Alignment.
CoRR, 2020

Transformers Are Better Than Humans at Identifying Generated Text.
CoRR, 2020

Investigating Pretrained Language Models for Graph-to-Text Generation.
CoRR, 2020

Pre-trained Language Models as Symbolic Reasoners over Knowledge?
CoRR, 2020

Modeling Graph Structure via Relative Position for Better Text Generation from Knowledge Graphs.
CoRR, 2020

Unsupervised Embedding-based Detection of Lexical Semantic Changes.
CoRR, 2020

Generating Derivational Morphology with BERT.
CoRR, 2020

Identifying Necessary Elements for BERT's Multilinguality.
CoRR, 2020

Masking as an Efficient Alternative to Finetuning for Pretrained Language Models.
CoRR, 2020

SimAlign: High Quality Word Alignments without Parallel Training Data using Static and Contextualized Embeddings.
CoRR, 2020

Inexpensive Domain Adaptation of Pretrained Language Models: A Case Study on Biomedical Named Entity Recognition.
CoRR, 2020

Multipurpose Intelligent Process Automation via Conversational Assistant.
CoRR, 2020

EmbLexChange at SemEval-2020 Task 1: Unsupervised Embedding-based Detection of Lexical Semantic Changes.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

ThaiLMCut: Unsupervised Pretraining for Thai Word Segmentation.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Embedding Space Correlation as a Measure of Domain Similarity.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Neural Topic Modeling with Continual Lifelong Learning.
Proceedings of the 37th International Conference on Machine Learning, 2020

Explainable and Discourse Topic-aware Neural Language Understanding.
Proceedings of the 37th International Conference on Machine Learning, 2020

Masking as an Efficient Alternative to Finetuning for Pretrained Language Models.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Quantifying the Contextualization of Word Representations with Semantic Class Probing.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

An Unsupervised Joint System for Text Generation from Knowledge Graphs and Semantic Parsing.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

SimAlign: High Quality Word Alignments without Parallel Training Data using Static and Contextualized Embeddings.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Inexpensive Domain Adaptation of Pretrained Language Models: Case Studies on Biomedical NER and Covid-19 QA.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

E-BERT: Efficient-Yet-Effective Entity Embeddings for BERT.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

BERT-kNN: Adding a kNN Search Component to Pretrained Language Models for Better QA.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

DagoBERT: Generating Derivational Morphology with a Pretrained Language Model.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Identifying Elements Essential for BERT's Multilinguality.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

TopicBERT for Energy Efficient Document Classification.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Are Pretrained Language Models Symbolic Reasoners over Knowledge?
Proceedings of the 24th Conference on Computational Natural Language Learning, 2020

Combining Word Embeddings with Bilingual Orthography Embeddings for Bilingual Dictionary Induction.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Monolingual and Multilingual Reduction of Gender Bias in Contextualized Representations.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Increasing Learning Efficiency of Self-Attention Networks through Direct Position Interactions, Learnable Temperature, and Convoluted Attention.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Model Performance.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Sentence Meta-Embeddings for Unsupervised Semantic Textual Similarity.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Negated and Misprimed Probes for Pretrained Language Models: Birds Can Talk, But Cannot Fly.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

A Graph Auto-encoder Model of Derivational Morphology.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Predicting the Growth of Morphological Families from Social and Linguistic Factors.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

LMU Bilingual Dictionary Induction System with Word Surface Similarity Scores for BUCC 2020.
Proceedings of the 13th Workshop on Building and Using Comparable Corpora, 2020

Fine-Grained Argument Unit Recognition and Classification.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Rare Words: A Major Problem for Contextualized Embeddings and How to Fix it by Attentive Mimicking.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

TRENDNERT: A Benchmark for Trend and Downtrend Detection in a Scientific Domain.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
SMAPH: A Piggyback Approach for Entity-Linking in Web Queries.
ACM Trans. Inf. Syst., 2019

Neural architectures for open-type relation argument extraction.
Nat. Lang. Eng., 2019

Type-aware Convolutional Neural Networks for Slot Filling.
J. Artif. Intell. Res., 2019

Extending Machine Language Models toward Human-Level Language Understanding.
CoRR, 2019

BERT is Not a Knowledge Base (Yet): Factual Knowledge vs. Name-Based Reasoning in Unsupervised QA.
CoRR, 2019

Negated LAMA: Birds cannot fly.
CoRR, 2019

Multi-view and Multi-source Transfers in Neural Topic Modeling with Pretrained Topic and Word Embeddings.
CoRR, 2019

Neural Architectures for Fine-Grained Propaganda Detection in News.
CoRR, 2019

Generating Multi-Sentence Abstractive Summaries of Interleaved Texts.
CoRR, 2019

Robust Argument Unit Recognition and Classification.
CoRR, 2019

Unsupervised Text Generation from Structured Data.
CoRR, 2019

Attentive Mimicking: Better Word Embeddings by Attending to Informative Contexts.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Neural Semi-Markov Conditional Random Fields for Robust Character-Based Part-of-Speech Tagging.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

News Article Teaser Tweets and How to Generate Them.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Towards Summarization for Social Media - Results of the TL;DR Challenge.
Proceedings of the 12th International Conference on Natural Language Generation, 2019

Texttovec: Deep Contextualized Neural autoregressive Topic Models of Language with Distributed Compositional Prior.
Proceedings of the 7th International Conference on Learning Representations, 2019

Multi-View Domain Adapted Sentence Embeddings for Low-Resource Unsupervised Duplicate Question Detection.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Analytical Methods for Interpretable Ultradense Word Embeddings.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Linguistically Informed Relation Extraction and Neural Architectures for Nested Named Entity Recognition in BioNLP-OST 2019.
Proceedings of The 5th Workshop on BioNLP Open Shared Tasks, 2019

BioNLP-OST 2019 RDoC Tasks: Multi-grain Neural Relevance Ranking Using Topics and Attention Based Query-Document-Sentence Interactions.
Proceedings of The 5th Workshop on BioNLP Open Shared Tasks, 2019

A Multilingual BPE Embedding Space for Universal Sentiment Lexicon Induction.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Probing for Semantic Classes: Diagnosing the Meaning Content of Word Embeddings.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Automatic Domain Adaptation Outperforms Manual Domain Adaptation for Predicting Financial Outcomes.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

SherLIiC: A Typed Event-Focused Lexical Inference Benchmark for Evaluating Natural Language Inference.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Learning Semantic Representations for Novel Words: Leveraging Both Form and Context.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Neural Relation Extraction within and across Sentence Boundaries.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

Document Informed Neural Autoregressive Topic Models with Distributional Prior.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Attentive Convolution: Equipping CNNs with RNN-style Attention Mechanisms.
Trans. Assoc. Comput. Linguistics, 2018

Joint Semantic Synthesis and Morphological Analysis of the Derived Word.
Trans. Assoc. Comput. Linguistics, 2018

Corpus-Level Fine-Grained Entity Typing.
J. Artif. Intell. Res., 2018

A Stronger Baseline for Multilingual Word Embeddings.
CoRR, 2018

Aligning Very Small Parallel Corpora Using Cross-Lingual Word Embeddings and a Monogamy Objective.
CoRR, 2018

Multi-Multi-View Learning: Multilingual and Multi-Representation Entity Typing.
CoRR, 2018

Neural Relation Extraction Within and Across Sentence Boundaries.
CoRR, 2018

textTOvec: Deep Contextualized Neural Autoregressive Models of Language with Distributed Compositional Prior.
CoRR, 2018

Document Informed Neural Autoregressive Topic Models.
CoRR, 2018

End-Task Oriented Textual Entailment via Deep Exploring Inter-Sentence Interactions.
CoRR, 2018

A Universal Semantic Space.
CoRR, 2018

Evaluating neural network explanation methods using hybrid documents and morphological prediction.
CoRR, 2018

Replicated Siamese LSTM in Ticketing System for Similarity Learning and Retrieval in Asymmetric Texts.
Proceedings of the Third Workshop on Semantic Deep Learning, SemDeep@COLING 2018, 2018

Evaluating Word Embeddings in Multi-label Classification Using Fine-Grained Name Typing.
Proceedings of The Third Workshop on Representation Learning for NLP, 2018

Fortification of Neural Morphological Segmentation Models for Polysynthetic Minimal-Resource Languages.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Deep Temporal-Recurrent-Replicated-Softmax for Topical Trends over Time.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Joint Bootstrapping Machines for High Confidence Relation Extraction.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Task Proposal: The TL;DR Challenge.
Proceedings of the 11th International Conference on Natural Language Generation, 2018

Multi-View Learning: Multilingual and Multi-Representation Entity Typing.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Interpretable Textual Neuron Representations for NLP.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

Neural Transductive Learning and Beyond: Morphological Generation in the Minimal-Resource Setting.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

LISA: Explaining Recurrent Neural Network Judgments via Layer-wIse Semantic Accumulation and Example to Pattern Transformation.
Proceedings of the Workshop: Analyzing and Interpreting Neural Networks for NLP, 2018

Recurrent One-Hop Predictions for Reasoning over Knowledge Graphs.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

End-Task Oriented Textual Entailment via Deep Explorations of Inter-Sentence Interactions.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Evaluating neural network explanation methods using hybrid documents and morphosyntactic agreement.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Embedding Learning Through Multilingual Concept Induction.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Two Methods for Domain Adaptation of Bilingual Tasks: Delightfully Simple and Broadly Applicable.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

2017
From Characters to Understanding Natural Language (C2NLU): Robust End-to-End Deep Learning for NLP (Dagstuhl Seminar 17042).
Dagstuhl Reports, 2017

Impact of Coreference Resolution on Slot Filling.
CoRR, 2017

Attentive Convolution.
CoRR, 2017

Comparative Study of CNN and RNN for Natural Language Processing.
CoRR, 2017

Statistical Models for Unsupervised, Semi-Supervised, and Supervised Transliteration Mining.
Comput. Linguistics, 2017

AutoExtend: Combining Word Embeddings with Semantic Resources.
Comput. Linguistics, 2017

Unlabeled Data for Morphological Generation With Character-Based Sequence-to-Sequence Models.
Proceedings of the First Workshop on Subword and Character Level Models in NLP, 2017

Past, Present, Future: A Computational Investigation of the Typology of Tense in 1000 Languages.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Global Normalization of Convolutional Neural Networks for Joint Entity and Relation Classification.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Noise Mitigation for Neural Entity Typing and Relation Extraction.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Task-Specific Attentive Pooling of Phrase Alignments Contributes to Sentence Matching.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Multi-level Representations for Fine-Grained Typing of Knowledge Base Entities.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

End-to-End Trainable Attentive Decoder for Hierarchical Entity Classification.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Neural Multi-Source Morphological Reinflection.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Exploring Different Dimensions of Attention for Uncertainty Detection.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Nonsymbolic Text Representation.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

The LMU System for the CoNLL-SIGMORPHON 2017 Shared Task on Universal Morphological Reinflection.
Proceedings of the CoNLL SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection, 2017

Training Data Augmentation for Low-Resource Morphological Inflection.
Proceedings of the CoNLL SIGMORPHON 2017 Shared Task: Universal Morphological Reinflection, 2017

Overview of Character-Based Models for Natural Language Processing.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2017

One-Shot Neural Cross-Lingual Transfer for Paradigm Completion.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs.
Trans. Assoc. Comput. Linguistics, 2016

Why and How to Pay Different Attention to Phrase Alignments of Different Intensities.
CoRR, 2016

Attention-Based Convolutional Neural Network for Machine Comprehension.
CoRR, 2016

A Piggyback System for Joint Entity Mention Detection and Linking in Web Queries.
Proceedings of the 25th International Conference on World Wide Web, 2016

MED: The LMU System for the SIGMORPHON 2016 Shared Task on Morphological Reinflection.
Proceedings of the 14th SIGMORPHON Workshop on Computational Research in Phonetics, 2016

Combining Recurrent and Convolutional Neural Networks for Relation Classification.
Proceedings of the NAACL HLT 2016, 2016

Ultradense Word Embeddings by Orthogonal Transformation.
Proceedings of the NAACL HLT 2016, 2016

A Joint Model of Orthography and Morphological Segmentation.
Proceedings of the NAACL HLT 2016, 2016

Comparing Convolutional Neural Networks to Traditional Models for Slot Filling.
Proceedings of the NAACL HLT 2016, 2016

Bi-directional recurrent neural network with ranking loss for spoken language understanding.
Proceedings of the 2016 IEEE International Conference on Acoustics, 2016

Neural Morphological Analysis: Encoding-Decoding Canonical Segments.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

LAMB: A Good Shepherd of Morphologically Rich Languages.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Morphological Segmentation Inside-Out.
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, 2016

Simple Question Answering by Attentive Convolutional Neural Network.
Proceedings of the COLING 2016, 2016

Table Filling Multi-Task Recurrent Neural Network for Joint Entity and Relation Extraction.
Proceedings of the COLING 2016, 2016

Learning Word Meta-Embeddings.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Intrinsic Subspace Evaluation of Word Embedding Representations.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Word Embedding Calculus in Meaningful Ultradense Subspaces.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Single-Model Encoder-Decoder with Explicit Morphological Representation for Reinflection.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

Morphological Smoothing and Extrapolation of Word Embeddings.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
Exploring the relationship between intonation and the lexicon: Evidence for lexicalised storage of intonation.
Speech Commun., 2015

Learning Word Meta-Embeddings by Using Ensembles of Embedding Sets.
CoRR, 2015

The Operation Sequence Model - Combining N-Gram-Based and Phrase-Based Statistical Machine Translation.
Comput. Linguistics, 2015

A Linguistically Informed Convolutional Neural Network.
Proceedings of the 6th Workshop on Computational Approaches to Subjectivity, 2015

CIS at TAC Cold Start 2015: Neural Networks and Coreference Resolution for Slot Filling.
Proceedings of the 2015 Text Analysis Conference, 2015

CIS-positive: A Combination of Convolutional Neural Networks and Support Vector Machines for Sentiment Analysis in Twitter.
Proceedings of the 9th International Workshop on Semantic Evaluation, 2015

Discriminative Phrase Embedding for Paraphrase Identification.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Convolutional Neural Network for Paraphrase Identification.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Robust Morphological Tagging with Word Representations.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Morphological Word-Embeddings.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Online Updating of Word Representations for Part-of-Speech Tagging.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Corpus-level Fine-grained Entity Typing Using Contextual Information.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Learning Better Embeddings for Rare Words Using Distributional Representations.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Joint Lemmatization and Morphological Tagging with Lemming.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Multichannel Variable-Size Convolution for Sentence Classification.
Proceedings of the 19th Conference on Computational Natural Language Learning, 2015

Labeled Morphological Segmentation with Semi-Markov Models.
Proceedings of the 19th Conference on Computational Natural Language Learning, 2015

Evaluating Learning Language Representations.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2015

MultiGranCNN: An Architecture for General Matching of Text Chunks on Multiple Levels of Granularity.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

AutoExtend: Extending Word Embeddings to Embeddings for Synsets and Lexemes.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

2014
FLORS: Fast and Simple Domain Adaptation for Part-of-Speech Tagging.
Trans. Assoc. Comput. Linguistics, 2014

Deep Learning Embeddings for Discontinuous Linguistic Units.
Proceedings of the 2nd International Conference on Learning Representations, 2014

Distributional Models and Deep Learning Embeddings: Combining the Best of Both Worlds.
Proceedings of the 2nd International Conference on Learning Representations, 2014

The SMAPH system for query entity recognition and disambiguation.
Proceedings of the ERD'14, 2014

Dependency parsing with latent refinements of part-of-speech tags.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Fine-Grained Contextual Predictions for Hard Sentiment Words.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Using Mined Coreference Chains as a Resource for a Semantic Task.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Multi-Domain Sentiment Relevance Classification with Automatic Representation Learning.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Picking the Amateur's Mind - Predicting Chess Player Strength from Game Annotations.
Proceedings of the COLING 2014, 2014

Unsupervised Training Set Generation for Automatic Acquisition of Technical Terminology in Patents.
Proceedings of the COLING 2014, 2014

An Exploration of Embeddings for Generalized Phrases.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

CoSimRank: A Flexible & Efficient Graph-Theoretic Similarity Measure.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

Improving Citation Polarity Classification with Product Reviews.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Two SVDs produce more focal deep learning representations
Proceedings of the 1st International Conference on Learning Representations, 2013

Cutting Recursive Autoencoder Trees
Proceedings of the 1st International Conference on Learning Representations, 2013

Knowledge Sources for Constituent Parsing of German, a Morphologically Rich and Less-Configurational Language.
Comput. Linguistics, 2013

CodeX: Combining an SVM Classifier and Character N-gram Language Models for Sentiment Analysis on Twitter Text.
Proceedings of the 7th International Workshop on Semantic Evaluation, 2013

Bootstrapping Semantic Lexicons for Technical Domains.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Multilingual Lexicon Bootstrapping - Improving a Lexicon Induction System Using a Parallel Corpus.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Towards Robust Cross-Domain Domain Adaptation for Part-of-Speech Tagging.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

The Topology of Semantic Knowledge.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Efficient Higher-Order CRFs for Morphological Tagging.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Unsupervised Feature Adaptation for Cross-Domain NLP with an Application to Compositionality Grading.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2013

Sentiment Relevance.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Preliminary study of technical terminology for the retrieval of scientific book metadata records.
Proceedings of the 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

A Comparative Investigation of Morphological Language Modeling for the Languages of the European Union.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Active Learning for Coreference Resolution.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

Bootstrapping Sentiment Labels For Unannotated Documents With Polarity PageRank.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Unsupervised sentiment analysis with a simple and fast Bayesian model using Part-of-Speech feature selection.
Proceedings of the 11th Conference on Natural Language Processing, 2012

Automatic generation of short informative sentiment summaries.
Proceedings of the EACL 2012, 2012

Automatic Detection of Point of View Differences in Wikipedia.
Proceedings of the COLING 2012, 2012

Classification of Inconsistent Sentiment Words using Syntactic Constructions.
Proceedings of the COLING 2012, 2012

Towards a Generic and Flexible Citation Classifier Based on a Faceted Classification Scheme.
Proceedings of the COLING 2012, 2012

Crosslingual distant supervision for extracting relations of different complexity.
Proceedings of the 21st ACM International Conference on Information and Knowledge Management, 2012

2011
Half-Context Language Models.
Comput. Linguistics, 2011

Self Organizing Maps in NLP: Exploration of Coreference Feature Space.
Proceedings of the Advances in Self-Organizing Maps - 8th International Workshop, 2011

Sense discrimination for physics retrieval.
Proceedings of the Proceeding of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2011

Expanding Queries with Term and Phrase Translations in Patent Retrieval.
Proceedings of the Multidisciplinary Information Retrieval, 2011

Speech Events are Recoverable from Unlabeled Articulatory Data: Using an Unsupervised Clustering Approach on Data Obtained from Electromagnetic Midsaggital Articulography (EMA).
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

User Perspectives on Query Difficulty.
Proceedings of the Advances in Information Retrieval Theory, 2011

Prosodic Variability in Lexical Sequences: Intonation Entrenches Too.
Proceedings of the 17th International Congress of Phonetic Sciences, 2011

Context Sequence Model of Speech Production Enriched with Articulatory Features.
Proceedings of the 17th International Congress of Phonetic Sciences, 2011

A Cascaded Classification Approach to Semantic Head Recognition.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Active Learning with Amazon Mechanical Turk.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Supervised Coreference Resolution with SUCRE.
Proceedings of the Fifteenth Conference on Computational Natural Language Learning: Shared Task, 2011

Integrating history-length interpolation and classes in language modeling.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Piggyback: Using Search Engines for Robust Cross-Domain Named Entity Recognition.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Improved Modeling of Out-Of-Vocabulary Words Using Morphological Classes.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

Bootstrapping coreference resolution using word associations.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

Teaching IR: Curricular Considerations.
Proceedings of the Teaching and Learning in Information Retrieval, 2011

2010
Syllable frequency effects in a context-sensitive segment production model.
J. Phonetics, 2010

Visuelle Textanalyse - Interaktive Exploration von semantischen Inhalten.
Inform. Spektrum, 2010

Multilevel Exemplar Theory.
Cogn. Sci., 2010

SUCRE: A Modular System for Coreference Resolution.
Proceedings of the 5th International Workshop on Semantic Evaluation, 2010

Bitext-Based Resolution of German Subject-Object Ambiguities.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

Identification of Rare & Novel Senses Using Translations in a Parallel Corpus.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

BabyExp: Constructing a Huge Multimodal Resource to Acquire Commonsense Knowledge Like Children Do.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Building a Cross-lingual Relatedness Thesaurus using a Graph Similarity Measure.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Fine-Grained Geographical Relation Extraction from Wikipedia.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Frequency of occurrence effects on pitch accent realisation.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

A subjective logic formalisation of the principle of polyrepresentation for information needs.
Proceedings of the Information Interaction in Context Symposium, 2010

IR, NLP, and Visualization.
Proceedings of the Advances in Information Retrieval, 2010

Sentiment Translation through Multi-Edge Graphs.
Proceedings of the COLING 2010, 2010

A Linguistically Grounded Graph Model for Bilingual Lexicon Extraction.
Proceedings of the COLING 2010, 2010

Self-Annotation for fine-grained geospatial relation extraction.
Proceedings of the COLING 2010, 2010

Relational feature engineering of natural language processing.
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010

Preliminary study into query translation for patent retrieval.
Proceedings of the 3rd International Workshop on Patent Information Retrieval, 2010

2009
Information Retrieval: Concepts and Practical Considerations for Teaching a Rising Topic.
Datenbank-Spektrum, 2009

Visual Exploration of Classifiers for Hybrid Textual and Geospatial Matching.
Proceedings of the 14th International Workshop on Vision, Modeling, and Visualization, 2009

On a Generic Uncertainty Model for Position Information.
Proceedings of the Quality of Context, First International Workshop, 2009

Word Alignment by Thresholded Two-Dimensional Normalization.
Proceedings of Machine Translation Summit XII: Posters, 2009

RENS - Enabling a Robot to Identify a Person.
Proceedings of the Intelligent Robotics and Applications, Second International Conference, 2009

Frequency Matters: Pitch Accents and Information Status.
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009

Rich Bitext Projection Features for Parse Reranking.
Proceedings of the EACL 2009, 12th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Conference, Athens, Greece, March 30, 2009

2008
Disorder inequality: a combinatorial approach to nearest neighbor search.
Proceedings of the International Conference on Web Search and Web Data Mining, 2008

A Question Answering System for German. Experiments with Morphological Linguistic Resources.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

An Inverted Index for Storing and Retrieving Grammatical Dependencies.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Examining pitch-accent variability from an exemplar-theoretic perspective.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Automatic acquisition of vernacular places.
Proceedings of the iiWAS'2008, 2008

A Graph-theoretic Model of Lexical Syntactic Acquisition.
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 2008

Stopping Criteria for Active Learning of Named Entity Recognition.
Proceedings of the COLING 2008, 2008

Introduction to information retrieval.
Cambridge University Press, ISBN: 978-0-521-86571-5, 2008

2007
Prepositional Phrase Attachment without Oracles.
Comput. Linguistics, 2007

Improving active learning recall via disjunctive boolean constraints.
Proceedings of the SIGIR 2007: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2007

Towards a context model driven german geo-tagging system.
Proceedings of the 4th ACM Workshop On Geographic Information Retrieval, 2007

2006
Language-Derived Information and Context Models.
Proceedings of the 4th IEEE Conference on Pervasive Computing and Communications Workshops (PerCom 2006 Workshops), 2006

A Lattice-Based Framework for Enhancing Statistical Parsers with Information from Unlabeled Corpora.
Proceedings of the Tenth Conference on Computational Natural Language Learning, 2006

Performance thresholding in practical text classification.
Proceedings of the 2006 ACM CIKM International Conference on Information and Knowledge Management, 2006

The Effect of Corpus Size in Combining Supervised and Unsupervised Training for Disambiguation.
Proceedings of the ACL 2006, 2006

2004
GAPSCORE: finding gene and protein names one word at a time.
Bioinform., 2004

2003
Inclusion of Textual Documentation in the Analysis of Multidimensional Data Sets: Application to Gene Expression Data.
Mach. Learn., 2003

2002
Research Paper: Creating an Online Dictionary of Abbreviations from MEDLINE.
J. Am. Medical Informatics Assoc., 2002

Personalized search.
Commun. ACM, 2002

2001
Foundations of statistical natural language processing.
MIT Press, ISBN: 978-0-262-13360-9, 2001

1999
Multimodal browsing of images in Web documents.
Proceedings of the Document Recognition and Retrieval VI, 1999

1998
Automatic Word Sense Discrimination.
Comput. Linguistics, 1998

1997
A Cooccurrence-Based Thesaurus and Two Applications to Information Retrieval.
Inf. Process. Manag., 1997

Projections for Efficient Document Clustering.
Proceedings of the SIGIR '97: Proceedings of the 20th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1997

Automatic Detection of Text Genre.
Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and 8th Conference of the European Chapter of the Association for Computational Linguistics, 1997

Ambiguity resolution in language learning - computational and cognitive models.
CSLI lecture notes series 71, CSLI, ISBN: 978-1-57586-075-6, 1997

1996
Xerox TREC-5 Site Report: Routing, Filtering, NLP, and Spanish Tracks.
Proceedings of The Fifth Text REtrieval Conference, 1996

Method Combination For Document Filtering.
Proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1996

1995
Distributional Part-of-Speech Tagging
CoRR, 1995

Xerox Site Report: Four TREC-4 Tracks.
Proceedings of The Fourth Text REtrieval Conference, 1995

A Comparison of Classifiers and Document Representations for the Routing Problem.
Proceedings of the SIGIR'95, 1995

1994
Xerox TREC-3 Report: Combining Exact and Fuzzy Predictors.
Proceedings of The Third Text REtrieval Conference, 1994

A Cooccurence-Based. Thesaurus and two Applications on Information Retrieval.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 1994

Part-of-Speech Tagging using a Variable Memory Markov Model.
Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, 1994

1993
Distributed syntactic representations with an application to part-of-speech tagging.
Proceedings of International Conference on Neural Networks (ICNN'88), San Francisco, CA, USA, March 28, 1993

Part-of-Speech Induction from Scratch.
Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics, 1993

1992
Dimensions of Meaning.
Proceedings of the Proceedings Supercomputing '92, 1992

Word Space.
Proceedings of the Advances in Neural Information Processing Systems 5, [NIPS Conference, Denver, Colorado, USA, November 30, 1992

1991
The Treatment of Plurality in L-LILOG.
Proceedings of the Text Understanding in LILOG, 1991

Communication and Inference through Situations.
Proceedings of the 12th International Joint Conference on Artificial Intelligence. Sydney, 1991

1989
Pluralbehandlung in natürlichsprachlichen Wissensverabeitungssystemen
IWBS Report, 1989


  Loading...