Robert J. Gaizauskas

Orcid: 0000-0002-3356-5126

Affiliations:
  • University of Sheffield, Department of Computer Science


According to our database, Robert J. Gaizauskas authored at least 174 papers between 1991 and 2023.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2023
Obituary: Yorick Wilks.
Comput. Linguistics, September, 2023

ScANT: A Small Corpus of Scene-Annotated Narrative Texts.
Proceedings of Text2Story, 2023

2022
A Pilot Study on the Collection and Computational Analysis of Linguistic Differences Amongst Men and Women in a Kuwaiti Arabic WhatsApp Dataset.
Proceedings of the Seventh Arabic Natural Language Processing Workshop, 2022

A Language Modelling Approach to Quality Assessment of OCR'ed Historical Text.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

SNuC: The Sheffield Numbers Spoken Language Corpus.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Predicting the Presence of Reasoning Markers in Argumentative Text.
Proceedings of the 9th Workshop on Argument Mining, 2022

2021
A Pilot Study on Annotating Scenes in Narrative Text using SceneML.
Proceedings of Text2Story, 2021

2019
Introduction.
Proceedings of the Using Comparable Corpora for Under-Resourced Areas of Machine Translation, 2019

Collecting Comparable Corpora.
Proceedings of the Using Comparable Corpora for Under-Resourced Areas of Machine Translation, 2019

Cross-Language Comparability and Its Applications for MT.
Proceedings of the Using Comparable Corpora for Under-Resourced Areas of Machine Translation, 2019


Mapping and Aligning Units from Comparable Corpora.
Proceedings of the Using Comparable Corpora for Under-Resourced Areas of Machine Translation, 2019

2018
Visual and Semantic Knowledge Transfer for Large Scale Semi-Supervised Object Detection.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

2017
Using Section Headings to Compute Cross-Lingual Similarity of Wikipedia Articles.
Proceedings of the Advances in Information Retrieval, 2017

The SENSEI Overview of Newspaper Readers' Comments.
Proceedings of the Advances in Information Retrieval, 2017

2016
The SENSEI Annotated Corpus: Human Summaries of Reader Comment Conversations in On-line News.
Proceedings of the SIGDIAL 2016 Conference, 2016

Cross-validating Image Description Datasets and Evaluation Metrics.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

A Document Repository for Social Media and Speech Conversations.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

What's the Issue Here?: Task-based Evaluation of Reader Comment Summarization Systems.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Don't Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation.
Proceedings of the INLG 2016, 2016

Automatic label generation for news comment clusters.
Proceedings of the INLG 2016, 2016

A Graph-Based Approach to Topic Clustering for Online Comments to News.
Proceedings of the Advances in Information Retrieval, 2016

Large Scale Semi-Supervised Object Detection Using Visual and Semantic Knowledge Transfer.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016


Overview of the ImageCLEF 2016 Scalable Concept Image Annotation Task.
Proceedings of the Working Notes of CLEF 2016, 2016

Summarizing Multi-Party Argumentative Conversations in Reader Comment on News.
Proceedings of the Third Workshop on Argument Mining, 2016

2015
Generating descriptive multi-document summaries of geo-located entities using entity type models.
J. Assoc. Inf. Sci. Technol., 2015

Exploring relation types for literature-based discovery.
J. Am. Medical Informatics Assoc., 2015

Comment-to-Article Linking in the Online News Domain.
Proceedings of the SIGDIAL 2015 Conference, 2015

Temporal Relation Classification using a Model of Tense and Aspect.
Proceedings of the Recent Advances in Natural Language Processing, 2015

The SENSEI Project: Making Sense of Human Conversations.
Proceedings of the Future and Emergent Trends in Language Technology, 2015

Generating Image Descriptions with Gold Standard Visual Inputs: Motivation, Evaluation and Baselines.
Proceedings of the ENLG 2015, 2015

Combining Geometric, Textual and Visual Features for Predicting Prepositions in Image Descriptions.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Overview of the ImageCLEF 2015 Scalable Image Annotation, Localization and Sentence Generation task.
Proceedings of the Working Notes of CLEF 2015, 2015

Defining Visually Descriptive Language.
Proceedings of the Fourth Workshop on Vision and Language, 2015

2014
Bilingual dictionaries for all EU languages.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

Bootstrapping Term Extractors for Multiple Languages.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

A Hybrid Approach to Multi-document Summarization of Opinions in Reviews.
Proceedings of the INLG 2014, 2014

Collective Named Entity Disambiguation using Graph Ranking and Clique Partitioning Approaches.
Proceedings of the COLING 2014, 2014

Graph Ranking for Collective Named Entity Disambiguation.
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014

A Poodle or a Dog? Evaluating Automatic Image Annotation Using Human Descriptions at Different Levels of Granularity.
Proceedings of the Third Workshop on Vision and Language, 2014

2013
Multi-Document Summarization Techniques for Generating Image Descriptions: A Comparative Analysis.
Proceedings of the Multi-source, Multilingual Information Extraction and Summarization, 2013

Do humans have conceptual models about geographic objects? A user study.
J. Assoc. Inf. Sci. Technol., 2013

Summarizing Online Reviews Using Aspect Rating Distributions and Language Modeling.
IEEE Intell. Syst., 2013

Empirical Validation of Reichenbach's Tense Framework.
Proceedings of the 10th International Conference on Computational Semantics, 2013

Information Retrieval for Temporal Bounding.
Proceedings of the International Conference on the Theory of Information Retrieval, 2013

Named Entity Disambiguation Using HMMs.
Proceedings of the 2013 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, 2013

Temporal Signals Help Label Temporal Relations.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Extracting bilingual terminologies from comparable corpora.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Methods for Collection and Evaluation of Comparable Documents.
Proceedings of the Building and Using Comparable Corpora, 2013

2012
A Data Driven Approach to Query Expansion in Question Answering
CoRR, 2012

A Corpus-based Study of Temporal Signals
CoRR, 2012

An Annotation Scheme for Reichenbach's Verbal Tense Structure
CoRR, 2012

Using Signals to Improve Automatic Classification of Temporal Relations
CoRR, 2012

Redundancy reduction for multi-document summaries using A* search and discriminative training.
Proceedings of the 2nd International Workshop on Exploiting Large Knowledge Repositories, 2012

Collecting and Using Comparable Corpora for Statistical Machine Translation.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

TIMEN: An Open Temporal Expression Normalisation Resource.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Assessing the Comparability of News Texts.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

A light way to collect comparable corpora from the Web.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Investigating Summarization Techniques for Geo-Tagged Image Indexing.
Proceedings of the Advances in Information Retrieval, 2012

Automatic Bilingual Phrase Extraction from Comparable Corpora.
Proceedings of the COLING 2012, 2012

Named Entity Based Document Similarity with SVM-Based Re-ranking for Entity Linking.
Proceedings of the Advanced Machine Learning Technologies and Applications, 2012

2011
USFD at KBP 2011: Entity Linking, Slot Filling and Temporal Bounding.
Proceedings of the Fourth Text Analysis Conference, 2011

STARLET: Multi-document Summarization of Service and Product Reviews with Balanced Rating Distributions.
Proceedings of the Data Mining Workshops (ICDMW), 2011

Time-Surfer: Time-Based Graphical Access to Document Content.
Proceedings of the Advances in Information Retrieval, 2011

Understanding the types of information humans associate with geographic objects.
Proceedings of the 20th ACM Conference on Information and Knowledge Management, 2011

2010
The University of Sheffield System at TAC KBP 2010.
Proceedings of the Third Text Analysis Conference, 2010

USFD2: Annotating Temporal Expresions and TLINKs for TempEval-2.
Proceedings of the 5th International Workshop on Semantic Evaluation, 2010

Automatic image captioning from the web for GPS photographs.
Proceedings of the 11th ACM SIGMM International Conference on Multimedia Information Retrieval, 2010

Analysing Temporally Annotated Corpora with CAVaT.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Using Dialogue Corpora to Extend Information Extraction Patterns for Natural Language Understanding of Dialogue.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

English-Hindi Transliteration using Multiple Similarity Metrics.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Developing Morphological Analysers for South Asian Languages: Experimenting with the Hindi and Gujarati Languages.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Model Summaries for Location-related Images.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

A Collection of Comparable Corpora for Under-resourced Languages.
Proceedings of the Human Language Technologies - The Baltic Perspective, 2010

Multi-Document Summarization Using A* Search and Discriminative Learning.
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 2010

Generating Image Descriptions Using Dependency Relational Patterns.
Proceedings of the ACL 2010, 2010

2009
The TempEval challenge: identifying temporal relations in text.
Lang. Resour. Evaluation, 2009

Building a semantically annotated corpus of clinical texts.
J. Biomed. Informatics, 2009

Summary Generation for Toponym-referenced Images using Object Type Language Models.
Proceedings of the Recent Advances in Natural Language Processing, 2009

Disambiguation of Biomedical Abbreviations.
Proceedings of the BioNLP Workshop, BioNLP@HLT-NAACL 2009, 2009

2008
Disambiguation of biomedical text using diverse sources of information.
BMC Bioinform., 2008

Mining clinical relationships from patient narratives.
BMC Bioinform., 2008

Combining Terminology Resources and Statistical Methods for Entity Recognition: an Evaluation.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

ANNALIST - ANNotation ALIgnment and Scoring Tool.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Acquiring Sense Tagged Examples using Relevance Feedback.
Proceedings of the COLING 2008, 2008

Knowledge Sources for Word Sense Disambiguation of Biomedical Text.
Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing, 2008

Extracting Clinical Relationships from Patient Narratives.
Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing, 2008

2007
SemEval-2007 Task 15: TempEval Temporal Relation Identification.
Proceedings of the 4th International Workshop on Semantic Evaluations, 2007

USFD: Preliminary Exploration of Features and Classifiers for the TempEval-2007 Task.
Proceedings of the 4th International Workshop on Semantic Evaluations, 2007

The CLEF Corpus: Semantic Annotation of Clinical Text.
Proceedings of the AMIA 2007, 2007

2006
Web Service Architectures for Text Mining: An Exploration of the Issues via an E-Science Demonstrator.
Int. J. Web Serv. Res., 2006

The University of Sheffield's TREC 2006 Q&A Experiments.
Proceedings of the Fifteenth Text REtrieval Conference, 2006

Task-Oriented Extraction of Temporal Information: The Case of Clinical Narratives.
Proceedings of the 13th International Symposium on Temporal Representation and Reasoning (TIME 2006), 2006

: Three Approaches to GO-Tagging Biomedical Abstracts.
Proceedings of the Second International Symposium on Semantic Mining in Biomedicine, 2006

Language Resources for Background Gathering.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Simulating Cub Reporter Dialogues: The collection of naturalistic human-human dialogues for information access to text archives.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

Experiments in Passage Selection and Answer Identification for Question Answering.
Proceedings of the Advances in Natural Language Processing, 2006

2005
The Role of Inference in the Temporal Annotation and Analysis of Text.
Lang. Resour. Evaluation, 2005

The University of Sheffield's TREC 2005 Q&A Experiments.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

SUPPLE: A Practical Parser for Natural Language Engineering Applications.
Proceedings of the Ninth International Workshop on Parsing Technology, 2005

Experiments on Statistical and Pattern-Based Biographical Summarization.
Proceedings of the Progress in Artificial Intelligence, 2005

Aligning Words in English-Hindi Parallel Corpora.
Proceedings of the Workshop on Building and Using Parallel Texts@ACL 2005, 2005

A Hybrid Approach to Align Sentences and Words in English-Hindi Parallel Corpora.
Proceedings of the Workshop on Building and Using Parallel Texts@ACL 2005, 2005

Using Semantic Inferences for Temporal Annotation Comparison.
Proceedings of the Language of Time - A Reader, 2005

The Specification Language TimeML.
Proceedings of the Language of Time - A Reader, 2005

2004
Information retrieval for question answering: a SIGIR 2004 workshop.
SIGIR Forum, 2004

Corpus Linguistics and South Asian Languages: Corpus Creation and Tool Development.
Lit. Linguistic Comput., 2004

Sheffield University and the TREC 2004 Genomics Track: Query Expansion Using Synonymous Terms.
Proceedings of the Thirteenth Text REtrieval Conference, 2004

The University of Sheffield's TREC 2004 QA Experiments.
Proceedings of the Thirteenth Text REtrieval Conference, 2004

Representing Temporal and Event Knowledge for QA Systems.
Proceedings of the New Directions in Question Answering, 2004

A Labelled Corpus for Prepositional Phrase Attachment.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

A Large-Scale Resource for Storing and Recognizing Technical Terminology.
Proceedings of the Fourth International Conference on Language Resources and Evaluation, 2004

Mining On-line Sources for Definition Knowledge.
Proceedings of the Seventeenth International Florida Artificial Intelligence Research Society Conference, 2004

Evaluating Passage Retrieval Approaches for Question Answering.
Proceedings of the Advances in Information Retrieval, 2004

Text Mining into Distributed Bioinformatics Workflows: A Web Services Implementation.
Proceedings of the 2004 IEEE International Conference on Services Computing (SCC 2004), 2004

2003
Recent Advances in Computational Terminology edited by Didier Bourigault, Christian Jacquemin, and Marie-Claude L'Homme.
Comput. Linguistics, 2003

Protein Structures and Information Extraction from Biological Texts: The PASTA System.
Bioinform., 2003

CM-Builder: A Natural Language-Based CASE Tool for Object-Oriented Analysis.
Autom. Softw. Eng., 2003

The University of Sheffield's TREC 2003 Q&A Experiments.
Proceedings of The Twelfth Text REtrieval Conference, 2003

TimeML: Robust Specification of Event and Temporal Expressions in Text.
Proceedings of the New Directions in Question Answering, 2003


2002
The University of Sheffield TREC 2002 Q&A System.
Proceedings of The Eleventh Text REtrieval Conference, 2002

A Comparison of Machine Learning Algorithms for Prepositional Phrase Attachment.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

Building and annotating a corpus for the study of journalistic text reuse.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

EMILLE, A 67-Million Word Corpus of Indic Languages: Data Collection, Mark-up and Harmonisation.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

Using Edit Distance Algorithms to Compare Alternative Approaches to ITS Authoring.
Proceedings of the Intelligent Tutoring Systems, 6th International Conference, 2002

Utilizing text mining results: The Pasta Web System.
Proceedings of the ACL 2002 Workshop on Natural Language Processing in the Biomedical Domain, 2002

METER: MEasuring TExt Reuse.
Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, 2002

2001
Visual Tools for Natural Language Processing.
J. Vis. Lang. Comput., 2001

Natural language question answering: the view from here.
Nat. Lang. Eng., 2001

A Method Based on the Chi-Square Test for Document Classification.
Proceedings of the SIGIR 2001: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 2001

Intelligent Access to Text: Integrating Information Extraction Technology into Text Browsers.
Proceedings of the First International Conference on Human Language Technology Research, 2001

QA-LaSIE: A Natural Language Question Answering System.
Proceedings of the Advances in Artificial Intelligence, 2001

Using HLT for Acquiring, Retrieving and Publishing Knowledge in AKT.
Proceedings of the Workshop on Human Language Technology and Knowledge Management@ACL 2001, 2001

2000
Bioinformatics applications of information extraction from scientific journal articles.
J. Inf. Sci., 2000

University of Sheffield TREC-9 Q&A System.
Proceedings of The Ninth Text REtrieval Conference, 2000

A combined IR/NLP approach to question answering against large text collections.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 2000

Annotating Events and Temporal Information in Newswire Texts.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Automatically Augmenting Terminological Lexicons from Untagged Text.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

CM-Builder: An Automated NL-Based CASE Tool.
Proceedings of the Fifteenth IEEE International Conference on Automated Software Engineering, 2000

Using Corpus-derived Name Lists for Named Entity Recognition.
Proceedings of the 6th Applied Natural Language Processing Conference, 2000

Experiments on Sentence Boundary Detection.
Proceedings of the 6th Applied Natural Language Processing Conference, 2000

1999
Evaluating two methods for Treebank grammar compaction.
Nat. Lang. Eng., 1999

Using a Language Independent Domain Model for Multilingual Information Extraction.
Appl. Artif. Intell., 1999

University of Sheffield TREC-8 Q&A System.
Proceedings of The Eighth Text REtrieval Conference, 1999

Using Coreference Chains for Text Summarization.
Proceedings of the Coreference and Its Applications@ACL 1999, 1999

1998
Karen Sparck Jones and Julia Galliers, Evaluating Natural Language Processing Systems: An Analysis and Review. Berlin: Springer-Verlag, 1996. ISBN 3 540 61309 9, Price DM54.00 (paperback), 228 pages.
Nat. Lang. Eng., 1998

Information Extraction: Beyond Document Retrieval.
Int. J. Comput. Linguistics Chin. Lang. Process., 1998

Evaluation in language and speech technology.
Comput. Speech Lang., 1998

University of Sheffield: Description of the LaSIE-II System as Used for MUC-7.
Proceedings of the Seventh Message Understanding Conference: Proceedings of a Conference Held in Fairfax, 1998

A scheme for comparative evaluation of diverse parsing systems.
Proceedings of the First International Conference on Language Resources and Evaluation, 1998

Compacting the Penn Treebank Grammar.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

Evaluating a Focus-Based Approach to Anaphora Resolution.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

1997
Using a semantic network for information extraction.
Nat. Lang. Eng., 1997

Visual Execution and Data Visualization in Natural Language Processing.
Proceedings of the 1997 IEEE Symposium on Visual Languages, 1997

Concepticons vs. Lexicons: An Architecture for Multilingual Information Extraction.
Proceedings of the Information Extraction: A Multidisciplinary Approach to an Emerging Information Technology, 1997

Coupling information retrieval and information extraction: A new text technology for gathering information from the web.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 1997

On the Marriage of Information Retrieval and Information Extraction.
Proceedings of the 19th Annual BCS-IRSG Colloquium on IR Aberdeen, UK. 8th-9th April 1997, 1997

GATE - a General Architecture for Text Engineering.
Proceedings of the 5th Applied Natural Language Processing Conference, 1997

Software Infrastructure for Natural Language Processing.
Proceedings of the 5th Applied Natural Language Processing Conference, 1997

1996
New Methods, Current Trends and Software Infrastructure for NLP
CoRR, 1996

A General Architecture for Language Engineering (GATE) - a new approach to Language Engineering R&D
CoRR, 1996

Report of the Study Group on Assessment and Evaluation
CoRR, 1996

NEC Corporation and University of Sheffield: "Description of NEC/Sheffield System Used For MET Japanese".
Proceedings of the TIPSTER TEXT PROGRAM PHASE II: Proceedings of a Workshop held at Vienna, 1996

TIPSTER-Compatible Projects at Sheffield.
Proceedings of the TIPSTER TEXT PROGRAM PHASE II: Proceedings of a Workshop held at Vienna, 1996

GATE: An Environment to Support Research and Development in Natural Language Engineering.
Proceedings of the Eighth International Conference on Tools with Artificial Intelligence, 1996

Evaluation of an Algorithm for the Recognition and Classification of Proper Names.
Proceedings of the 16th International Conference on Computational Linguistics, 1996

GATE-a General Architecture for Text Engineering.
Proceedings of the 16th International Conference on Computational Linguistics, 1996

1995
POETIC: A system for gathering and disseminating traffic information.
Nat. Lang. Eng., 1995

University of Sheffield: description of the LaSIE system as used for MUC-6.
Proceedings of the 6th Conference on Message Understanding, 1995

1993
Sussex University: description of the Sussex system used for MUC-5.
Proceedings of the 5th Conference on Message Understanding, 1993

1991
Deriving Answers to Logical Queries Via Answer Composition.
Proceedings of the 3rd UK Conference on Logic Programming, Edinburgh, 10-12 April 1991, 1991

