Anoop Sarkar

Affiliations:
  • Simon Fraser University, Burnaby, Canada


According to our database1, Anoop Sarkar authored at least 107 papers between 1993 and 2023.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2023
Learning Nearest Neighbour Informed Latent Word Embeddings to Improve Zero-Shot Machine Translation.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

Language Model Based Target Token Importance Rescaling for Simultaneous Neural Machine Translation.
Proceedings of the 20th International Conference on Spoken Language Translation, 2023

SpEL: Structured Prediction for Entity Linking.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Decipherment as Regression: Solving Historical Substitution Ciphers by Learning Symbol Recurrence Relations.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

2022
Sequence Models for Document Structure Identification in an Undeciphered Script.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Auxiliary Subword Segmentations as Related Languages for Low Resource Multilingual Translation.
Proceedings of the 23rd Annual Conference of the European Association for Machine Translation, 2022

CipherDAug: Ciphertext based Data Augmentation for Neural Machine Translation.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Translation-based Supervision for Policy Generation in Simultaneous Neural Machine Translation.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021

Better Neural Machine Translation by Extracting Linguistic Information from BERT.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Measuring and Improving Faithfulness of Attention in Neural Machine Translation.
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2021

Compositionality of Complex Graphemes in the Undeciphered Proto-Elamite Script using Image and Text Embedding Models.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Training with Adversaries to Improve Faithfulness of Attention in Neural Machine Translation.
Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing: Student Research Workshop, 2020

Effectively pretraining a speech translation decoder with Machine Translation data.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Compact Rule Extraction for Hierarchical Phrase-based Translation.
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers, 2020

2019
An analysis of clausal coordination using synchronous tree adjoining grammar.
J. Log. Comput., 2019

Pointer-based Fusion of Bilingual Lexicons into Neural Machine Translation.
CoRR, 2019

Sign Clustering and Topic Extraction in Proto-Elamite.
Proceedings of the 3rd Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, 2019

Interrogating the Explanatory Power of Attention in Neural Machine Translation.
Proceedings of the 3rd Workshop on Neural Generation and Translation@EMNLP-IJCNLP 2019, 2019

Deconstructing Supertagging into Multi-Task Sequence Prediction.
Proceedings of the 23rd Conference on Computational Natural Language Learning, 2019

2018
An Easily Extensible HMM Word Aligner.
Prague Bull. Math. Linguistics, 2018

GraphNER: Using Corpus Level Similarities and Graph Propagation for Named Entity Recognition.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

Decipherment of Substitution Ciphers with Neural Language Models.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Top-down Tree Structured Decoding with Syntactic Connections for Neural Machine Translation and Parsing.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Prediction Improves Simultaneous Neural Machine Translation.
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, October 31, 2018

Simultaneous Translation using Optimized Segmentation.
Proceedings of the 13th Conference of the Association for Machine Translation in the Americas, 2018

Prefix Lexicalization of Synchronous CFGs using Synchronous TAG.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

In-domain Context-aware Token Embeddings Improve Biomedical Named Entity Recognition.
Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis, 2018

Decipherment for Adversarial Offensive Language Detection.
Proceedings of the 2nd Workshop on Abusive Language Online, 2018

2017
Joint Prediction of Word Alignment with Alignment Types.
Trans. Assoc. Comput. Linguistics, 2017

Coordination in TAG without the Conjoin Operation.
Proceedings of the 13th International Workshop on Tree Adjoining Grammars and Related Formalisms, 2017

Lexicalized Reordering for Left-to-Right Hierarchical Phrase-based Translation.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Evaluating the Value of Lensing Wikipedia During the Information Seeking Process.
Proceedings of the 2017 Conference on Conference Human Information Interaction and Retrieval, 2017

2016
The Challenge of Simultaneous Speech Translation.
Proceedings of the 30th Pacific Asia Conference on Language, Information and Computation, 2016

Graph-based Semi-supervised Gene Mention Tagging.
Proceedings of the 15th Workshop on Biomedical Natural Language Processing, 2016

What's Hot in Human Language Technology: Highlights from NAACL HLT 2015.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
A Python-based Interface for Wide Coverage Lexicalized Tree-adjoining Grammars.
Prague Bull. Math. Linguistics, 2015

Learning segmentations that balance latency versus quality in spoken language translation.
Proceedings of the 12th International Workshop on Spoken Language Translation: Papers, 2015

Improving Statistical Machine Translation with a Multilingual Paraphrase Database.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

Non-Uniform Stochastic Average Gradient Method for Training Conditional Random Fields.
Proceedings of the Eighteenth International Conference on Artificial Intelligence and Statistics, 2015

2014
Incremental translation using hierarchichal phrase-based translation system.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014

Two Improvements to Left-to-Right Decoding for Hierarchical Phrase-based Machine Translation.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Expressive hierarchical rule extraction for left-to-right translation.
Proceedings of the 11th Conference of the Association for Machine Translation in the Americas: MT Researchers Track, 2014

Bayesian iterative-cascade framework for hierarchical phrase-based translation.
Proceedings of the 11th Conference of the Association for Machine Translation in the Americas: MT Researchers Track, 2014

Pivot-based triangulation for low-resource languages.
Proceedings of the 11th Conference of the Association for Machine Translation in the Americas: MT Researchers Track, 2014

2013
Multi-Metric Optimization Using Ensemble Tuning.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2013

Scalable Variational Inference for Extracting Hierarchical Phrase-based Translation Rules.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Ensemble Triangulation for Statistical Machine Translation.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

An Online Algorithm for Learning over Constrained Latent Representations using Multiple Views.
Proceedings of the Sixth International Joint Conference on Natural Language Processing, 2013

Efficient Left-to-Right Hierarchical Phrase-Based Translation with Improved Reordering.
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, 2013

Knowledge base population and visualization using an ontology based on semantic roles.
Proceedings of the 2013 workshop on Automated knowledge base construction, 2013

Graph Propagation for Paraphrasing Out-of-Vocabulary Words in Statistical Machine Translation.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

Stacking for Statistical Machine Translation.
Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, 2013

2012
Kriya - An end-to-end Hierarchical Phrase-based MT System.
Prague Bull. Math. Linguistics, 2012

Kriya - The SFU System for Translation Task at WMT-12.
Proceedings of the Seventh Workshop on Statistical Machine Translation, 2012

Improved Reordering for Shallow-n Grammar based Hierarchical Phrase-based Translation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, 2012

LensingWikipedia: Parsing text for the interactive visualization of human history.
Proceedings of the 7th IEEE Conference on Visual Analytics Science and Technology, 2012

Domain Adaptation Techniques for Machine Translation and Their Evaluation in a Real-World Setting.
Proceedings of the Advances in Artificial Intelligence, 2012

Bootstrapping via Graph Propagation.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

Mixing Multiple Translation Models in Statistical Machine Translation.
Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, July 8-14, 2012, Jeju Island, Korea, 2012

2011
Parsing Schemata for Practical Text Analysis Carlos Gómez Rodríguez (University of A Coruña) London: Imperial College Press (Mathematics, computing, language, and life series, edited by Carlos Martin-Vide, volume 1), 2010, xiv+275 pp; hardbound, ISBN 978-1-84816-560-1, $89.00.
Comput. Linguistics, 2011

Bayesian Extraction of Minimal SCFG Rules for Hierarchical Phrase-based Translation.
Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011

An Ensemble Model that Combines Syntactic and Semantic Clustering for Discriminative Dependency Parsing.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 19-24 June, 2011, Portland, Oregon, USA, 2011

Combining Morpheme-based Machine Translation with Post-processing Morpheme Prediction.
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, 2011

2010
Incremental Decoding for Phrase-Based Statistical Machine Translation.
Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR, 2010

2009
Active Learning for Statistical Phrase-based Machine Translation.
Proceedings of the Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, May 31, 2009

Training Global Linear Models for Chinese Word Segmentation.
Proceedings of the Advances in Artificial Intelligence, 2009

Active Learning for Multilingual Statistical Machine Translation.
Proceedings of the ACL 2009, 2009

2008
Training a Perceptron with Global and Local Features for Chinese Word Segmentation.
Proceedings of the Third International Joint Conference on Natural Language Processing, 2008

Homotopy-Based Semi-Supervised Hidden Markov Models for Sequence Labeling.
Proceedings of the COLING 2008, 2008

2007
Semi-supervised model adaptation for statistical machine translation.
Mach. Transl., 2007

Analysis of Semi-Supervised Learning with the Yarowsky Algorithm.
Proceedings of the UAI 2007, 2007

Simultaneous Identification of Biomedical Named-Entity and Functional Relation Using Statistical Parsing Techniques.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Exploiting Rich Syntactic Information for Relationship Extraction from Biomedical Articles.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2007

Recognition of Multi-sentence n-ary Subcellular Localization Mentions in Biomedical Abstracts.
Proceedings of the Short Paper Proceedings of the 2nd International Symposium on Languages in Biology and Medicine (LBM 2007), 2007

Experimental Evaluation of LTAG-Based Features for Semantic Role Labeling.
Proceedings of the EMNLP-CoNLL 2007, 2007

Question Answering Summarization of Multiple Biomedical Documents.
Proceedings of the Advances in Artificial Intelligence, 2007

Transductive learning for statistical machine translation.
Proceedings of the ACL 2007, 2007

2006
Using LTAG-Based Features for Semantic Role Labeling.
Proceedings of the Eighth International Workshop on Tree Adjoining Grammar and Related Formalisms, 2006

Tutorial on Inductive Semi-supervised Learning Methods: with Applicability to Natural Language Processing.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, 2006

A Clustering Approach for Nearly Unsupervised Recognition of Nonliteral Language.
Proceedings of the EACL 2006, 2006

Voting between Dictionary-Based and Subword Tagging Models for Chinese Word Segmentation.
Proceedings of the Fifth Workshop on Chinese Language Processing, 2006

2005
Synonym-Based Expansion and Boosting-Based Re-Ranking: A Two-phase Approach for Genomic Information Retrieval.
Proceedings of the Fourteenth Text REtrieval Conference, 2005

Intimate Learning: A Novel Approach for Combining Labelled and Unlabelled Data.
Proceedings of the IJCAI-05, Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, UK, July 30, 2005

Voting Between Multiple Data Representations for Text Chunking.
Proceedings of the Advances in Artificial Intelligence, 2005

2004
Discriminative Reranking for Machine Translation.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004

A Smorgasbord of Features for Statistical Machine Translation.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2004

2003
D-LTAG System: Discourse Parsing with a Lexicalized Tree-Adjoining Grammar.
J. Log. Lang. Inf., 2003

Example Selection for Bootstrapping Statistical Parsers.
Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, 2003

Using LTAG Based Features in Parse Reranking.
Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2003

Bootstrapping statistical parsers from small datasets.
Proceedings of the EACL 2003, 2003

2002
A Note on Typing Feature Structures.
Comput. Linguistics, 2002

Statistical Morphological Tagging and Parsing of Korean with an LTAG Grammar.
Proceedings of the Sixth International Workshop on Tree Adjoining Grammar and Related Frameworks, 2002

Learning Verb Argument Structure from Minimally Annotated Corpora.
Proceedings of the 19th International Conference on Computational Linguistics, 2002

2001
Applying Co-Training Methods to Statistical Parsing.
Proceedings of the Language Technologies 2001: The Second Meeting of the North American Chapter of the Association for Computational Linguistics, 2001

2000
Practical experiments in parsing using Tree Adjoining Grammars.
Proceedings of the Fifth International Workshop on Tree Adjoining Grammar and Related Frameworks, 2000

Learning Verb Subcategorization from Corpora: Counting Frame Subsets.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Automatic Extraction of Subcategorization Frames for Czech.
Proceedings of the COLING 2000, 18th International Conference on Computational Linguistics, Proceedings of the Conference, 2 Volumes, July 31, 2000

Some Experiments on Indicators of Parsing Complexity for Lexicalized Grammars.
Proceedings of the Workshop on Efficiency In Large-Scale Parsing Systems, 2000

1999
Typing as a means for validating feature structures.
Proceedings of the Computational Linguistics in the Netherlands 1999, 1999

1998
Separating Dependency from Constituency in a Tree Rewriting System
CoRR, 1998

Prefix probabilities for linear indexed grammars.
Proceedings of the Fourth International Workshop on Tree Adjoining Grammars and Related Frameworks, 1998

Conditions on Consistency of Probabilistic Tree Adjoining Grammars.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

Prefix Probabilities from Stochastic Tree Adjoining Grammars.
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, 1998

1996
Coordination in Tree Adjoining Grammars: Formalization and Implementation.
Proceedings of the 16th International Conference on Computational Linguistics, 1996

Incremental Parser Generation for Tree Adjoining Grammars.
Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics, 1996

1995
University of Pennsylvania: description of the University of Pennsylvania system used for MUC-6.
Proceedings of the 6th Conference on Message Understanding, 1995

1993
Extending Kimmo's Two-Level Model of Morphology.
Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics, 1993


  Loading...