Andrew McCallum

Affiliations:
  • University of Massachusetts Amherst, USA


According to our database1, Andrew McCallum authored at least 327 papers between 1990 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Incremental Extractive Opinion Summarization Using Cover Trees.
CoRR, 2024

2023
Fast, Scalable, Warm-Start Semidefinite Programming with Spectral Bundling and Sketching.
CoRR, 2023

Multistage Collaborative Knowledge Distillation from Large Language Models.
CoRR, 2023

PaRaDe: Passage Ranking using Demonstrations with Large Language Models.
CoRR, 2023

To Copy, or not to Copy; That is a Critical Issue of the Output Softmax Layer in Neural Sequential Recommenders.
CoRR, 2023

Encoding Multi-Domain Scientific Papers by Ensembling Multiple CLS Tokens.
CoRR, 2023

Answering Compositional Queries with Set-Theoretic Embeddings.
CoRR, 2023

Machine Reading Comprehension using Case-based Reasoning.
CoRR, 2023

Adaptive Selection of Anchor Items for CUR-based k-NN search with Cross-Encoders.
CoRR, 2023

Editable User Profiles for Controllable Text Recommendation.
CoRR, 2023

KwikBucks: Correlation Clustering with Cheap-Weak and Expensive-Strong Signals.
Proceedings of The Fourth Workshop on Simple and Efficient Natural Language Processing, 2023

Editable User Profiles for Controllable Text Recommendations.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

Large Language Model Augmented Narrative Driven Recommendations.
Proceedings of the 17th ACM Conference on Recommender Systems, 2023

Online Level-wise Hierarchical Clustering.
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

Efficient k-NN Search with Cross-Encoders using Adaptive Multi-Round CUR Decomposition.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Machine Reading Comprehension using Case-based Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

PaRaDe: Passage Ranking using Demonstrations with LLMs.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Longtonotes: OntoNotes with Longer Coreference Chains.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

Low-Resource Compositional Semantic Parsing with Concept Pretraining.
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

Improving Dual-Encoder Training through Dynamic Indexes for Negative Mining.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

Causal Matching with Text Embeddings: A Case Study in Estimating the Causal Effects of Peer Review Policies.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Revisiting the Architectures like Pointer Networks to Efficiently Improve the Next Word Distribution, Summarization Factuality, and Beyond.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Multi-CLS BERT: An Efficient Alternative to Traditional Ensembling.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Augmenting Scientific Creativity with Retrieval across Knowledge Domains.
CoRR, 2022

CBR-iKB: A Case-Based Reasoning Approach for Question Answering over Incomplete Knowledge Bases.
CoRR, 2022

Modeling Transitivity and Cyclicity in Directed Graphs via Binary Code Box Embeddings.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Structured Energy Network As a Loss.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

DISAPERE: A Dataset for Discourse Structure in Peer Review Discussions.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Inducing and Using Alignments for Transition-based AMR Parsing.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Entity Linking via Explicit Mention-Mention Coreference Modeling.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

A Distant Supervision Corpus for Extracting Biomedical Relationships Between Chemicals, Diseases and Genes.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Enhanced Distant Supervision with State-Change Information for Relation Extraction.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Knowledge Base Question Answering by Case-based Reasoning over Subgraphs.
Proceedings of the International Conference on Machine Learning, 2022

Interactive Correlation Clustering with Existential Cluster Constraints.
Proceedings of the International Conference on Machine Learning, 2022

Modeling Label Space Interactions in Multi-label Classification using Box Embeddings.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Efficient Nearest Neighbor Search for Cross-Encoder Models using Matrix Factorization.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

You can't pick your neighbors, or can you? When and How to Rely on Retrieval in the kNN-LM.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Unsupervised Partial Sentence Matching for Cited Text Identification.
Proceedings of the Third Workshop on Scholarly Document Processing, 2022

Meta-Adapters: Parameter Efficient Few-shot Fine-tuning through Meta-Learning.
Proceedings of the International Conference on Automated Machine Learning, 2022

Event-Event Relation Extraction using Probabilistic Box Embedding.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2022

Word2Box: Capturing Set-Theoretic Semantics of Words using Box Embeddings.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Softmax Bottleneck Makes Language Models Unable to Represent Multi-mode Word Distributions.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

Sublinear Time Approximation of Text Similarity Matrices.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

An Evaluative Measure of Clustering Methods Incorporating Hyperparameter Sensitivity.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
A Dataset for Discourse Structure in Peer Review Discussions.
CoRR, 2021

Entity Linking and Discovery via Arborescence-based Supervised Clustering.
CoRR, 2021

Word2Box: Learning Word Representation Using Box Embeddings.
CoRR, 2021

Exact and approximate hierarchical clustering using A.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Min/max stability and box distributions.
Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, 2021

Knowledge Informed Semantic Parsing for Conversational Question Answering.
Proceedings of the 6th Workshop on Representation Learning for NLP, 2021

Simultaneously Self-Attending to Text and Entities for Knowledge-Informed Text Representations.
Proceedings of the 6th Workshop on Representation Learning for NLP, 2021

Box-To-Box Transformations for Modeling Joint Hierarchies.
Proceedings of the 6th Workshop on Representation Learning for NLP, 2021

CSFCube - A Test Collection of Computer Science Research Articles for Faceted Query by Example.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Capacity and Bias of Learned Geometric Embeddings for Directed Graphs.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Probabilistic Box Embeddings for Uncertain Knowledge Graph Reasoning.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Clustering-based Inference for Biomedical Entity Linking.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Scalable Hierarchical Agglomerative Clustering.
Proceedings of the KDD '21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021

Improved Latent Tree Induction with Distant Supervision via Span Constraints.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

MS-Mentions: Consistently Annotating Entity Mentions in Materials Science Procedural Text.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Case-based Reasoning for Natural Language Queries over Knowledge Bases.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Box Embeddings: An open-source library for representation learning using geometric structures.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 2021

Diverse Distributions of Self-Supervised Tasks for Meta-Learning in NLP.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Multi-facet Universal Schema.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Changing the Mind of Transformers for Topically-Controllable Language Generation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Low resource recognition and linking of biomedical concepts from a large ontology.
Proceedings of the BCB '21: 12th ACM International Conference on Bioinformatics, 2021

DAG-Structured Clustering by Nearest Neighbors.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Cluster Trellis: Data Structures & Algorithms for Exact Inference in Hierarchical Clustering.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

Scaling Within Document Coreference to Long Texts.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

Modeling Fine-Grained Entity Types with Box Embeddings.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Benchmarking Scalable Methods for Streaming Cross Document Entity Coreference.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

MOLEMAN: Mention-Only Linking of Entities with a Mention Annotation Network.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Energy-Based Reranking: Improving Neural Machine Translation Using Energy-Based Models.
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, 2021

Long Document Summarization in a Low Resource Setting using Pretrained Language Models.
Proceedings of the ACL-IJCNLP 2021 Student Research Workshop, 2021

Extending Multi-Sense Word Embedding to Phrases and Sentences for Unsupervised Semantic Applications.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Using error decay prediction to overcome practical issues of deep active learning for named entity recognition.
Mach. Learn., 2020

Inorganic Materials Synthesis Planning with Literature-Trained Neural Networks.
J. Chem. Inf. Model., 2020

Scalable Bottom-Up Hierarchical Clustering.
CoRR, 2020

Clustering-based Inference for Zero-Shot Biomedical Entity Linking.
CoRR, 2020

Probabilistic Case-based Reasoning for Open-World Knowledge Graph Completion.
CoRR, 2020

Energy-Based Reranking: Improving Neural Machine Translation Using Energy-Based Models.
CoRR, 2020

Compact Representation of Uncertainty in Hierarchical Clustering.
CoRR, 2020

Improving Local Identifiability in Probabilistic Box Embeddings.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

AutoKnow: Self-Driving Knowledge Collection for Products of Thousands of Types.
Proceedings of the KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2020

An Instance Level Approach for Shallow Semantic Parsing in Scientific Procedural Text.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Unsupervised Parsing with S-DIORA: Single Tree Encoding for Deep Inside-Outside Recursive Autoencoders.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Probabilistic Case-based Reasoning in Knowledge Bases.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

ProtoQA: A Question Answering Dataset for Prototypical Common-Sense Reasoning.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Learning to Few-Shot Learn Across Diverse Natural Language Classification Tasks.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Unsupervised Pre-training for Biomedical Question Answering.
Proceedings of the Working Notes of CLEF 2020, 2020

Using BibTeX to Automatically Generate Labeled Data for Citation Field Extraction.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

Predicting Institution Hierarchies with Set-based Models.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

Representing Joint Hierarchies with Box Embeddings.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

A Simple Approach to Case-Based Reasoning in Knowledge Bases.
Proceedings of the Conference on Automated Knowledge Base Construction, 2020

Energy and Policy Considerations for Modern Deep Learning Research.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

Simultaneously Linking Entities and Extracting Relations from Biomedical Text without Mention-Level Supervision.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Overcoming Practical Issues of Deep Active Learning and its Applications on Named Entity Recognition.
CoRR, 2019

Unsupervised Latent Tree Induction with Deep Inside-Outside Recursive Autoencoders.
CoRR, 2019

Chains-of-Reasoning at TextGraphs 2019 Shared Task: Reasoning over Chains of Facts for Explainable Multi-hop Inference.
Proceedings of the Thirteenth Workshop on Graph-Based Methods for Natural Language Processing, 2019

Search-Guided, Lightly-Supervised Training of Structured Prediction Energy Networks.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

OpenKI: Integrating Open Information Extraction and Knowledge Bases with Relation Inference.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Unsupervised Latent Tree Induction with Deep Inside-Outside Recursive Auto-Encoders.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Gradient-based Hierarchical Clustering using Continuous Representations of Trees in Hyperbolic Space.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Scalable Hierarchical Clustering with Tree Grafting.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Paper Matching with Local Fairness Constraints.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

Supervised Hierarchical Clustering with Exponential Linkage.
Proceedings of the 36th International Conference on Machine Learning, 2019

Smoothing the Geometry of Probabilistic Box Embeddings.
Proceedings of the 7th International Conference on Learning Representations, 2019

Building Dynamic Knowledge Graphs from Text using Machine Reading Comprehension.
Proceedings of the 7th International Conference on Learning Representations, 2019

Multi-step Retriever-Reader Interaction for Scalable Open-domain Question Answering.
Proceedings of the 7th International Conference on Learning Representations, 2019

Unsupervised Labeled Parsing with Deep Inside-Outside Recursive Autoencoders.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, 2019

Roll Call Vote Prediction with Knowledge Augmented Models.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

Integrating User Feedback under Identity Uncertainty in Knowledge Base Construction.
Proceedings of the 1st Conference on Automated Knowledge Base Construction, 2019

The Materials Science Procedural Text Corpus: Annotating Materials Synthesis Procedures with Shallow Semantic Structures.
Proceedings of the 13th Linguistic Annotation Workshop, 2019

Optimal Transport-based Alignment of Learned Character Representations for String Similarity.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Energy and Policy Considerations for Deep Learning in NLP.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

A2N: Attending to Neighbors for Knowledge Graph Inference.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Multi-step Entity-centric Information Retrieval for Multi-Hop Question Answering.
Proceedings of the 2nd Workshop on Machine Reading for Question Answering, 2019

2018
Syntax Helps ELMo Understand Semantics: Is Syntax Still Relevant in a Deep Neural Architecture for SRL?
CoRR, 2018

Efficient Graph-based Word Sense Induction by Distributional Inclusion Vector Embeddings.
Proceedings of the Twelfth Workshop on Graph-Based Methods for Natural Language Processing, 2018

Compact Representation of Uncertainty in Clustering.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

Simultaneously Self-Attending to All Mentions for Full-Abstract Biological Relation Extraction.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Training Structured Prediction Energy Networks with Indirect Supervision.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Distributional Inclusion Vector Embedding for Unsupervised Hypernymy Detection.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Go for a Walk and Arrive at the Answer: Reasoning Over Paths in Knowledge Bases using Reinforcement Learning.
Proceedings of the 6th International Conference on Learning Representations, 2018

Linguistically-Informed Self-Attention for Semantic Role Labeling.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Marginal Likelihood Training of BiLSTM-CRF for Biomedical Named Entity Recognition from Disjoint Label Sets.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

An Interface for Annotating Science Questions.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 2018

Embedded-State Latent Conditional Random Fields for Sequence Labeling.
Proceedings of the 22nd Conference on Computational Natural Language Learning, 2018

Hierarchical Losses and New Resources for Fine-grained Entity Typing and Linking.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Probabilistic Embedding of Knowledge Graphs with Box Lattice Measures.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

A Systematic Classification of Knowledge, Reasoning, and Context within the ARC Dataset.
Proceedings of the Workshop on Machine Reading for Question Answering@ACL 2018, 2018

2017
Automatically Extracting Action Graphs from Materials Science Synthesis Procedures.
CoRR, 2017

Unsupervised Hypernym Detection by Distributional Inclusion Vector Embedding.
CoRR, 2017

Low-Rank Hidden State Embeddings for Viterbi Sequence Labeling.
CoRR, 2017

Improved Representation Learning for Predicting Commonsense Ontologies.
CoRR, 2017

Fast and Accurate Sequence Labeling with Iterated Dilated Convolutions.
CoRR, 2017

An Online Hierarchical Algorithm for Extreme Clustering.
CoRR, 2017

Active Bias: Training a More Accurate Neural Network by Emphasizing High Variance Samples.
CoRR, 2017

SemEval 2017 Task 10: ScienceIE - Extracting Keyphrases and Relations from Scientific Publications.
Proceedings of the 11th International Workshop on Semantic Evaluation, 2017

Active Bias: Training More Accurate Neural Networks by Emphasizing High Variance Samples.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

A Hierarchical Algorithm for Extreme Clustering.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

End-to-End Learning for Structured Prediction Energy Networks.
Proceedings of the 34th International Conference on Machine Learning, 2017

Learning a Natural Language Interface with Neural Programmer.
Proceedings of the 5th International Conference on Learning Representations, 2017

Fast and Accurate Entity Recognition with Iterated Dilated Convolutions.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Dependency Parsing with Dilated Iterated Graph CNNs.
Proceedings of the 2nd Workshop on Structured Prediction for Natural Language Processing, 2017

Generalizing to Unseen Entities and Entity Pairs with Row-less Universal Schema.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Chains of Reasoning over Entities, Relations, and Text using Recurrent Neural Networks.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Attending to All Mention Pairs for Full Abstract Biological Relation Extraction.
Proceedings of the 6th Workshop on Automated Knowledge Base Construction, 2017

Learning String Alignments for Entity Aliases.
Proceedings of the 6th Workshop on Automated Knowledge Base Construction, 2017

Finer Grained Entity Typing with TypeNet.
Proceedings of the 6th Workshop on Automated Knowledge Base Construction, 2017

Entity-centric Attribute Feedback for Interactive Knowledge Bases.
Proceedings of the 6th Workshop on Automated Knowledge Base Construction, 2017

Go for a Walk and Arrive at the Answer: Reasoning Over Knowledge Bases with Reinforcement Learning.
Proceedings of the 6th Workshop on Automated Knowledge Base Construction, 2017

RelNet: End-to-end Modeling of Entities & Relations.
Proceedings of the 6th Workshop on Automated Knowledge Base Construction, 2017

Question Answering on Knowledge Bases and Text using Universal Schema and Memory Networks.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

2016
Ask the GRU: Multi-Task Learning for Deep Text Recommendations.
CoRR, 2016

Extracting Multilingual Relations under Limited Resources: TAC 2016 Cold-Start KB construction and Slot-Filling using Compositional Universal Schema.
Proceedings of the 2016 Text Analysis Conference, 2016

<i>Ask the GRU</i>: Multi-task Learning for Deep Text Recommendations.
Proceedings of the 10th ACM Conference on Recommender Systems, 2016

Multilingual Relation Extraction using Compositional Universal Schema.
Proceedings of the NAACL HLT 2016, 2016

Structured Prediction Energy Networks.
Proceedings of the 33nd International Conference on Machine Learning, 2016

Row-less Universal Schema.
Proceedings of the 5th Workshop on Automated Knowledge Base Construction, 2016

Call for Discussion: Building a New Standard Dataset for Relation Extraction Tasks.
Proceedings of the 5th Workshop on Automated Knowledge Base Construction, 2016

Incorporating Selectional Preferences in Multi-hop Relation Extraction.
Proceedings of the 5th Workshop on Automated Knowledge Base Construction, 2016

2015
Word Representations via Gaussian Embedding.
Proceedings of the 3rd International Conference on Learning Representations, 2015

Reports on the 2015 AAAI Spring Symposium Series.
AI Mag., 2015

Bethe Projections for Non-Local Inference.
Proceedings of the Thirty-First Conference on Uncertainty in Artificial Intelligence, 2015

Building Knowledge Bases with Universal Schema: Cold Start and Slot-Filling Approaches.
Proceedings of the 2015 Text Analysis Conference, 2015

Embedded Representations of Lexical and Knowledge-Base Semantics.
Proceedings of the 2015 International Conference on The Theory of Information Retrieval, 2015

Learning Dynamic Feature Selection for Fast Sequential Prediction.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Compositional Vector Space Models for Knowledge Base Completion.
Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, 2015

Compositional Vector Space Models for Knowledge Base Inference.
Proceedings of the 2015 AAAI Spring Symposia, 2015

2014
Training for Fast Sequential Prediction Using Dynamic Feature Selection.
CoRR, 2014

Message Passing for Soft Constraint Dual Decomposition.
Proceedings of the Thirtieth Conference on Uncertainty in Artificial Intelligence, 2014

Efficient Non-parametric Estimation of Multiple Embeddings per Word in Vector Space.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Lexicon Infused Phrase Embeddings for Named Entity Resolution.
Proceedings of the Eighteenth Conference on Computational Natural Language Learning, 2014

Learning Soft Linear Constraints with Application to Citation Field Extraction.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

2013
Latent Relation Representations for Universal Schemas
Proceedings of the 1st International Conference on Learning Representations, 2013

Anytime Belief Propagation Using Sparse Domains.
CoRR, 2013

Universal Schema for Slot Filling and Cold Start: UMass IESL at TACKBP 2013.
Proceedings of the Sixth Text Analysis Conference, 2013

Relation Extraction with Matrix Factorization and Universal Schemas.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Dynamic Knowledge-Base Alignment for Coreference Resolution.
Proceedings of the Seventeenth Conference on Computational Natural Language Learning, 2013

Universal schema for entity type prediction.
Proceedings of the 2013 workshop on Automated knowledge base construction, 2013

A joint model for discovering and linking entities.
Proceedings of the 2013 workshop on Automated knowledge base construction, 2013

Assessing confidence of knowledge base content with an experimental study in entity resolution.
Proceedings of the 2013 workshop on Automated knowledge base construction, 2013

Joint inference of entities, relations, and coreference.
Proceedings of the 2013 workshop on Automated knowledge base construction, 2013

Transition-based Dependency Parsing with Selectional Branching.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
An Introduction to Conditional Random Fields.
Found. Trends Mach. Learn., 2012

An Integrated, Conditional Model of Information Extraction and Coreference with Applications to Citation Matching
CoRR, 2012

Combining joint models for biomedical event extraction.
BMC Bioinform., 2012

Selecting actions for resource-bounded information extraction using reinforcement learning.
Proceedings of the Fifth International Conference on Web Search and Web Data Mining, 2012

MAP Inference in Chains using Column Generation.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Probabilistic Databases of Universal Schema.
Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction, 2012

Human-Machine Cooperation: Supporting User Corrections to Automatically Constructed KBs.
Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction, 2012

Monte Carlo MCMC: Efficient Inference by Sampling Factors.
Proceedings of the Joint Workshop on Automatic Knowledge Base Construction and Web-scale Knowledge Extraction, 2012

Topic models for taxonomies.
Proceedings of the 12th ACM/IEEE-CS Joint Conference on Digital Libraries, 2012

Monte Carlo MCMC: Efficient Inference by Approximate Sampling.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Parse, Price and Cut--Delayed Column and Row Generation for Graph Based Parsers.
Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, 2012

Unsupervised Relation Discovery with Sense Disambiguation.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

A Discriminative Hierarchical Model for Fast Coreference at Large Scale.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Query-Aware MCMC.
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011

SampleRank: Training Factor Graphs with Atomic Gradients.
Proceedings of the 28th International Conference on Machine Learning, 2011

Structured Relation Discovery using Generative Models.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Fast and Robust Joint Models for Biomedical Event Extraction.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Optimizing Semantic Coherence in Topic Models.
Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, 2011

Toward interactive training and evaluation.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

Model Combination for Event Extraction in BioNLP 2011.
Proceedings of BioNLP Shared Task 2011 Workshop, Portland, Oregon, USA, June 24, 2011, 2011

Robust Biomedical Event Extraction with Dual Decomposition and Minimal Domain Adaptation.
Proceedings of BioNLP Shared Task 2011 Workshop, Portland, Oregon, USA, June 24, 2011, 2011

Large-Scale Cross-Document Coreference Using Distributed Inference and Hierarchical Models.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
Scalable Probabilistic Databases with Factor Graphs and MCMC.
Proc. VLDB Endow., 2010

Generalized Expectation Criteria for Semi-Supervised Learning with Weakly Labeled Data.
J. Mach. Learn. Res., 2010

Distantly Labeling Data for Large Scale Cross-Document Coreference
CoRR, 2010

Inference by Minimizing Size, Divergence, or their Sum.
Proceedings of the UAI 2010, 2010

Modeling Relations and Their Mentions without Labeled Text.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2010

Resource-Bounded Information Extraction: Acquiring Missing Feature Values on Demand.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2010

Constraint-Driven Rank-Based Learning for Information Extraction.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2010

High-Performance Semi-Supervised Learning using Discriminatively Constrained Generative Models.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

Collective Cross-Document Relation Extraction Without Labelled Data.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Machine Translation Using Overlapping Alignments and SampleRank.
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Research Papers, 2010

2009
Piecewise training for structured prediction.
Mach. Learn., 2009

Alternating Projections for Learning with Expectation Constraints.
Proceedings of the UAI 2009, 2009

An Entity Based Model for Coreference Resolution.
Proceedings of the SIAM International Conference on Data Mining, 2009

Bi-directional Joint Inference for Entity Resolution and Segmentation Using Imperatively-Defined Factor Graphs.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2009

Training Factor Graphs with Reinforcement Learning for Efficient MAP Inference.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Rethinking LDA: Why Priors Matter.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

FACTORIE: Probabilistic Programming via Imperatively Defined Factor Graphs.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Efficient methods for topic model inference on streaming document collections.
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28, 2009

Polylingual Topic Models.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Active Learning by Labeling Features.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Generalized Expectation Criteria for Bootstrapping Extractors using Record-Text Alignment.
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, 2009

Joint Inference for Natural Language Processing.
Proceedings of the Thirteenth Conference on Computational Natural Language Learning, 2009

Semi-supervised Learning of Dependency Parsers using Generalized Expectation Criteria.
Proceedings of the ACL 2009, 2009

2008
Topic Models Conditioned on Arbitrary Features with Dirichlet-multinomial Regression.
Proceedings of the UAI 2008, 2008

Learning from labeled features using generalized expectation criteria.
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2008

A Discriminative Approach to Ontology Mapping.
Proceedings of the International Workshop on New Trends in Information Integration, 2008

A unified approach for schema matching, coreference and canonicalization.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

Unsupervised deduplication using cross-field dependencies.
Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2008

InterNano: e-Science for the Nanomanufacturing Community.
Proceedings of the Fourth International Conference on e-Science, 2008

Generalized Expectation Criteria for Semi-Supervised Learning of Conditional Random Fields.
Proceedings of the ACL 2008, 2008

2007
WebKDD/SNAKDD 2007: web mining and social network analysis post-workshop report.
SIGKDD Explor., 2007

Dynamic Conditional Random Fields: Factorized Probabilistic Models for Labeling and Segmenting Sequence Data.
J. Mach. Learn. Res., 2007

Topic and Role Discovery in Social Networks with Experiments on Enron and Academic Email.
J. Artif. Intell. Res., 2007

Improved Dynamic Schedules for Belief Propagation.
Proceedings of the UAI 2007, 2007

Nonparametric Bayes Pachinko Allocation.
Proceedings of the UAI 2007, 2007

Efficient Computation of Entropy Gradient for Semi-Supervised Conditional Random Fields.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

First-Order Probabilistic Models for Coreference Resolution.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007


Generalized component analysis for text with heterogeneous attributes.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Expertise modeling for matching papers with reviewers.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Semi-supervised classification with hybrid generative/discriminative methods.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Canonicalization of database records using adaptive similarity measures.
Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007

Organizing the OCA: learning faceted subjects from a library of digital books.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2007

Mining a digital library for influential authors.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2007

Improving Author Coreference by Resource-Bounded Information Gathering from the Web.
Proceedings of the IJCAI 2007, 2007

Piecewise pseudolikelihood for efficient training of conditional random fields.
Proceedings of the Machine Learning, 2007

Mixtures of hierarchical topics with Pachinko allocation.
Proceedings of the Machine Learning, 2007

Simple, robust, scalable semi-supervised learning via expectation regularization.
Proceedings of the Machine Learning, 2007

Topical N-Grams: Phrase and Topic Discovery, with an Application to Information Retrieval.
Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), 2007

Cryptogram Decoding for OCR Using Numerization Strings.
Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), 2007

People-LDA: Anchoring Topics to People using Face Recognition.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

Resource-Bounded Information Gathering for Correlation Clustering.
Proceedings of the Learning Theory, 20th Annual Conference on Learning Theory, 2007

2006
Table extraction for answer retrieval.
Inf. Retr., 2006

Information extraction from research papers using conditional random fields.
Inf. Process. Manag., 2006

Corrective feedback and persistent learning for information extraction.
Artif. Intell., 2006

Reducing Weight Undertraining in Structured Discriminative Learning.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Integrating Probabilistic Extraction Models and Data Mining to Discover Relations and Patterns in Text.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

Topics over time: a non-Markov continuous-time model of topical trends.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Information extraction, data mining and joint inference.
Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2006

Bibliometric impact measures leveraging topic analysis.
Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, 2006

Combining Generative and Discriminative Methods for Pixel Classification with Multi-Conditional Learning.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Joint Group and Topic Discovery from Relations and Text.
Proceedings of the Statistical Network Analysis: Models, Issues, and New Directions, 2006

Pachinko allocation: DAG-structured mixture models of topic correlations.
Proceedings of the Machine Learning, 2006

Sparse Forward-Backward Using Minimum Divergence Beams for Fast Training Of Conditional Random Fields.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Learning Field Compatibilities to Extract Database Records from Unstructured Text.
Proceedings of the EMNLP 2006, 2006

Exploring the Use of Conditional Random Field Models and HMMs for Historical Handwritten Document Recognition.
Proceedings of the Second International Workshop on Document Image Analysis for Libraries (DIAL 2006), 2006

Multi-Conditional Learning: Generative/Discriminative Training for Clustering and Classification.
Proceedings of the Proceedings, 2006

Semi-Supervised Text Classification Using EM.
Proceedings of the Semi-Supervised Learning, 2006

2005
Information extraction: distilling structured data from unstructured text.
ACM Queue, 2005

Disambiguating Web appearances of people in a social network.
Proceedings of the 14th international conference on World Wide Web, 2005

Piecewise Training for Undirected Models.
Proceedings of the UAI '05, 2005

A Conditional Random Field for Discriminatively-trained Finite-state String Edit Distance.
Proceedings of the UAI '05, 2005

Group and Topic Discovery from Relations and Their Attributes.
Proceedings of the Advances in Neural Information Processing Systems 18 [Neural Information Processing Systems, 2005

Composition of Conditional Random Fields for Transfer Learning.
Proceedings of the HLT/EMNLP 2005, 2005

Group and topic discovery from relations and text.
Proceedings of the 3rd international workshop on Link discovery, 2005

Detecting Anomalies in Network Traffic Using Maximum Entropy Estimation.
Proceedings of the 5th Internet Measurement Conference, 2005

Topic and Role Discovery in Social Networks.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Multi-way distributional clustering via pairwise interactions.
Proceedings of the Machine Learning, 2005

Joint Parsing and Semantic Role Labeling.
Proceedings of the Ninth Conference on Computational Natural Language Learning, 2005

Collective multi-label classification.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

Joint deduplication of multiple record types in relational data.
Proceedings of the 2005 ACM CIKM International Conference on Information and Knowledge Management, Bremen, Germany, October 31, 2005

Semi-Supervised Sequence Modeling with Syntactic Topic Models.
Proceedings of the Proceedings, 2005

Reducing Labeling Effort for Structured Prediction Tasks.
Proceedings of the Proceedings, 2005

2004
An Integrated, Conditional Model of Information Extraction and Coreference with Appli.
Proceedings of the UAI '04, 2004

Conditional Models of Identity Uncertainty with Application to Noun Coreference.
Proceedings of the Advances in Neural Information Processing Systems 17 [Neural Information Processing Systems, 2004

Accurate Information Extraction from Research Papers using Conditional Random Fields.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004

Confidence Estimation for Information Extraction.
Proceedings of HLT-NAACL 2004: Short Papers, Boston, Massachusetts, USA, May 2-7, 2004, 2004

Chinese Segmentation and New Word Detection using Conditional Random Fields.
Proceedings of the COLING 2004, 2004

Extracting social networks and contact information from email and the Web.
Proceedings of the CEAS 2004, 2004

Interactive Information Extraction with Constrained Conditional Random Fields.
Proceedings of the Nineteenth National Conference on Artificial Intelligence, 2004

2003
Rapid development of Hindi named entity recognition using conditional random fields and feature induction.
ACM Trans. Asian Lang. Inf. Process., 2003

Challenges in information retrieval and language modeling: report of a workshop held at the center for intelligent information retrieval, University of Massachusetts Amherst, September 2002.
SIGIR Forum, 2003

Efficiently Inducing Features of Conditional Random Fields.
Proceedings of the UAI '03, 2003

Classification with Hybrid Generative/Discriminative Models.
Proceedings of the Advances in Neural Information Processing Systems 16 [Neural Information Processing Systems, 2003

Toward Conditional Models of Identity Uncertainty with Application to Proper Noun Coreference.
Proceedings of IJCAI-03 Workshop on Information Integration on the Web (IIWeb-03), 2003

Table Extraction Using Conditional Random Fields.
Proceedings of the 2003 Annual National Conference on Digital Government Research, 2003

Early results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons.
Proceedings of the Seventh Conference on Natural Language Learning, 2003

2002
Learning with Scope, with Application to Information Extraction and Classification.
Proceedings of the UAI '02, 2002

2001
Toward Optimal Active Learning through Sampling Estimation of Error Reduction.
Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001

Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data.
Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001), Williams College, Williamstown, MA, USA, June 28, 2001

2000
Text Classification from Labeled and Unlabeled Documents using EM.
Mach. Learn., 2000

Automating the Construction of Internet Portals with Machine Learning.
Inf. Retr., 2000

Learning to Understand the Web.
IEEE Data Eng. Bull., 2000

Learning to construct knowledge bases from the World Wide Web.
Artif. Intell., 2000

Efficient clustering of high-dimensional data sets with application to reference matching.
Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, 2000

Maximum Entropy Markov Models for Information Extraction and Segmentation.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

Learning to Create Customized Authority Lists.
Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29, 2000

Information Extraction with HMM Structures Learned by Stochastic Optimization.
Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on on Innovative Applications of Artificial Intelligence, July 30, 2000

1999
A Machine Learning Approach to Building Domain-Specific Search Engines.
Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence, 1999

Using Reinforcement Learning to Spider the Web Efficiently.
Proceedings of the Sixteenth International Conference on Machine Learning (ICML 1999), Bled, Slovenia, June 27, 1999

1998
Distributional Clustering of Words for Text Classification.
Proceedings of the SIGIR '98: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1998

Improving Text Classification by Shrinkage in a Hierarchy of Classes.
Proceedings of the Fifteenth International Conference on Machine Learning (ICML 1998), 1998

Employing EM and Pool-Based Active Learning for Text Classification.
Proceedings of the Fifteenth International Conference on Machine Learning (ICML 1998), 1998

Learning to Classify Text from Labeled and Unlabeled Documents.
Proceedings of the Fifteenth National Conference on Artificial Intelligence and Tenth Innovative Applications of Artificial Intelligence Conference, 1998

Learning to Extract Symbolic Knowledge from the World Wide Web.
Proceedings of the Fifteenth National Conference on Artificial Intelligence and Tenth Innovative Applications of Artificial Intelligence Conference, 1998

1995
Instance-Based Utile Distinctions for Reinforcement Learning with Hidden State.
Proceedings of the Machine Learning, 1995

1994
Instance-Based State Identification for Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 7, 1994

1993
Overcoming Incomplete Perception with Utile Distinction Memory.
Proceedings of the Machine Learning, 1993

1992
Using Transitional Proximity for Faster Reinforcement Learning.
Proceedings of the Ninth International Workshop on Machine Learning (ML 1992), 1992

1990
Using Genetic Algorithms to Learn Disjunctive Rules from Examples.
Proceedings of the Machine Learning, 1990


  Loading...