Shervin Malmasi

Orcid: 0000-0001-6250-5571

According to our database1, Shervin Malmasi authored at least 88 papers between 2013 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Controllable Decontextualization of Yes/No Question and Answers into Factual Statements.
Proceedings of the Advances in Information Retrieval, 2024

Instant Answering in E-Commerce Buyer-Seller Messaging Using Message-to-Question Reformulation.
Proceedings of the Advances in Information Retrieval, 2024

2023
eCom'23: The SIGIR 2023 Workshop on eCommerce.
Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2023

SemEval-2023 Task 2: Fine-grained Multilingual Named Entity Recognition (MultiCoNER 2).
Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023

The 6th Workshop on e-eommerce and NLP (ECNLP 6).
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023

A Simple Loss Function for Convergent Algorithm Synthesis using RNNs.
Proceedings of the First Tiny Papers Track at ICLR 2023, 2023

Follow-on Question Suggestion via Voice Hints for Voice Assistants.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

InstructPTS: Instruction-Tuning LLMs for Product Title Summarization.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023, 2023

MultiCoNER v2: a Large Multilingual dataset for Fine-grained and Noisy Named Entity Recognition.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Faithful Low-Resource Data-to-Text Generation through Cycle Training.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Answering Unanswered Questions through Semantic Reformulations in Spoken QA.
Proceedings of the The 61st Annual Meeting of the Association for Computational Linguistics: Industry Track, 2023

Generate-then-Retrieve: Intent-Aware FAQ Retrieval in Product Search.
Proceedings of the The 61st Annual Meeting of the Association for Computational Linguistics: Industry Track, 2023

2022
CoSearcher: studying the effectiveness of conversational search refinement and clarification through user simulation.
Inf. Retr. J., 2022

CycleNER: An Unsupervised Training Approach for Named Entity Recognition.
Proceedings of the WWW '22: The ACM Web Conference 2022, Virtual Event, Lyon, France, April 25, 2022

eCom'22: The SIGIR 2022 Workshop on eCommerce.
Proceedings of the SIGIR '22: The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, Madrid, Spain, July 11, 2022

SemEval-2022 Task 11: Multilingual Complex Named Entity Recognition (MultiCoNER).
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

Dynamic Gazetteer Integration in Multilingual Models for Cross-Lingual and Cross-Domain Named Entity Recognition.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

Preventing Catastrophic Forgetting in Continual Learning of New Natural Language Tasks.
Proceedings of the KDD '22: The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, August 14, 2022

CycleKQR: Unsupervised Bidirectional Keyword-Question Rewriting.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Distilling Multilingual Transformers into CNNs for Scalable Intent Classification.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022 - Industry Track, Abu Dhabi, UAE, December 7, 2022

Reinforced Question Rewriting for Conversational Question Answering.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing: EMNLP 2022 - Industry Track, Abu Dhabi, UAE, December 7, 2022

MultiCoNER: A Large-scale Multilingual Dataset for Complex Named Entity Recognition.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

Wizard of Tasks: A Novel Conversational Dataset for Solving Real-World Tasks in Conversational Settings.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

2021
ECOM'21: The SIGIR 2021 Workshop on eCommerce.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

Gazetteer Enhanced Named Entity Recognition for Code-Mixed Web Queries.
Proceedings of the SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2021

GEMNET: Effective Gated Gazetteer Representations for Recognizing Complex Entities in Low-context Input.
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Studying the Effectiveness of Conversational Search Refinement Through User Simulation.
Proceedings of the Advances in Information Retrieval, 2021

2020
ConvERSe'20: The WSDM 2020 Workshop on Conversational Systems for E-Commerce Recommendations and Search.
Proceedings of the WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining, 2020

ECOM'20: The SIGIR 2020 Workshop on eCommerce.
Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, 2020

Evaluating Aggression Identification in Social Media.
Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, 2020

2019
Comparing information extraction techniques for low-prevalence concepts: The case of insulin rejection by patients.
J. Biomed. Informatics, 2019

Findings of the 2019 Conference on Machine Translation (WMT19).
Proceedings of the Fourth Conference on Machine Translation, 2019

SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval).
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

UTFPR at SemEval-2019 Task 5: Hate Speech Identification with Recurrent Neural Networks.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Predicting the Type and Target of Offensive Posts in Social Media.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

2018
Challenges in discriminating profanity from hate speech.
J. Exp. Theor. Artif. Intell., 2018

Classifier Ensembles for Dialect and Language Variety Identification.
CoRR, 2018

Native Language Identification With Classifier Stacking and Ensembles.
Comput. Linguistics, 2018

Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign.
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, 2018

Discriminating between Indo-Aryan Languages Using SVM Ensembles.
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, 2018

German Dialect Identification Using Classifier Ensembles.
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, 2018

Portuguese Native Language Identification.
Proceedings of the Computational Processing of the Portuguese Language, 2018

Benchmarking Aggression Identification in Social Media.
Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying, 2018

A Report on the Complex Word Identification Shared Task 2018.
Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications@NAACL-HLT 2018, 2018

A Portuguese Native Language Identification Dataset.
Proceedings of the Thirteenth Workshop on Innovative Use of NLP for Building Educational Applications@NAACL-HLT 2018, 2018

Canary: an Information Extraction Platform for Researchers and Clinicians.
Proceedings of the AMIA 2018, 2018

Classifying Patent Applications with Ensemble Methods.
Proceedings of the Australasian Language Technology Association Workshop 2018, 2018

2017
Multilingual native language identification.
Nat. Lang. Eng., 2017

Native Language Identification using Stacked Generalization.
CoRR, 2017

Open-Set Language Identification.
CoRR, 2017

Canary: An NLP Platform for Clinicians and Researchers.
Appl. Clin. Inform., 2017

Findings of the VarDial Evaluation Campaign 2017.
Proceedings of the Fourth Workshop on NLP for Similar Languages, 2017

Arabic Dialect Identification Using iVectors and ASR Transcripts.
Proceedings of the Fourth Workshop on NLP for Similar Languages, 2017

German Dialect Identification in Interview Transcriptions.
Proceedings of the Fourth Workshop on NLP for Similar Languages, 2017

Detecting Hate Speech in Social Media.
Proceedings of the International Conference Recent Advances in Natural Language Processing, 2017

Exploring the Use of Text Classification in the Legal Domain.
Proceedings of the Second Workshop on Automated Semantic Analysis of Information in Legal Texts co-located with the 16th International Conference on Artificial Intelligence and Law (ICAIL 2017), 2017

Including Dialects and Language Varieties in Author Profiling.
Proceedings of the Working Notes of CLEF 2017, 2017

A Report on the 2017 Native Language Identification Shared Task.
Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, 2017

Canary: An Information Extraction Platform for Clinical Researchers.
Proceedings of the AMIA 2017, 2017

Extracting Healthcare Quality Information from Unstructured Data.
Proceedings of the AMIA 2017, 2017

Unsupervised Text Segmentation Based on Native Language Characteristics.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Feature Hashing for Language and Dialect Identification.
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, 2017

Complex Word Identification: Challenges in Data Annotation and System Performance.
Proceedings of the 4th Workshop on Natural Language Processing Techniques for Educational Applications, 2017

2016
Discriminating between Similar Languages and Arabic Dialect Identification: A Report on the Third DSL Shared Task.
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, 2016

Arabic Dialect Identification in Speech Transcripts.
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, 2016

Subdialectal Differences in Sorani Kurdish.
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, 2016

MAZA at SemEval-2016 Task 11: Detecting Lexical Complexity Using a Decision Stump Meta-Classifier.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

LTG at SemEval-2016 Task 11: Complex Word Identification with Classifier Ensembles.
Proceedings of the 10th International Workshop on Semantic Evaluation, 2016

Predicting Post Severity in Mental Health Forums.
Proceedings of the 3rd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, 2016

Modeling Language Change in Historical Corpora: The Case of Portuguese.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

Discriminating Similar Languages: Evaluations and Explorations.
Proceedings of the Tenth International Conference on Language Resources and Evaluation LREC 2016, 2016

2015
Discriminating Similar Languages: Persian and Dari.
Tiny Trans. Comput. Sci., 2015

Norwegian Native Language Identification.
Proceedings of the Recent Advances in Natural Language Processing, 2015

Arabic Dialect Identification Using a Parallel Multidialectal Corpus.
Proceedings of the Computational Linguistics, 2015

Location Mention Detection in Tweets and Microblogs.
Proceedings of the Computational Linguistics, 2015

Large-Scale Native Language Identification with Cross-Corpus Evaluation.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

The Jinan Chinese Learner Corpus.
Proceedings of the Tenth Workshop on Innovative Use of NLP for Building Educational Applications, 2015

Oracle and Human Baselines for Native Language Identification.
Proceedings of the Tenth Workshop on Innovative Use of NLP for Building Educational Applications, 2015

Measuring Feature Diversity in Native Language Identification.
Proceedings of the Tenth Workshop on Innovative Use of NLP for Building Educational Applications, 2015

Clinical Information Extraction Using Word Representations.
Proceedings of the Australasian Language Technology Association Workshop, 2015

Cognate Identification using Machine Translation.
Proceedings of the Australasian Language Technology Association Workshop, 2015

2014
Arabic Native Language Identification.
Proceedings of the EMNLP 2014 Workshop on Arabic Natural Language Processing, 2014

From Visualisation to Hypothesis Construction for Second Language Acquisition.
Proceedings of TextGraphs@EMNLP 2014: the 9th Workshop on Graph-based Methods for Natural Language Processing, 2014

Language Transfer Hypotheses with Linear SVM Weights.
Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, 2014

Chinese Native Language Identification.
Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, 2014

Finnish Native Language Identification.
Proceedings of the Australasian Language Technology Association Workshop, 2014

A Data-driven Approach to Studying Given Names and their Gender and Ethnicity Associations.
Proceedings of the Australasian Language Technology Association Workshop, 2014

2013
NLI Shared Task 2013: MQ Submission.
Proceedings of the Eighth Workshop on Innovative Use of NLP for Building Educational Applications, 2013


  Loading...