Manish Shrivastava

Orcid: 0000-0001-8705-6637

Affiliations:
  • International Institute of Information Technology, Hyderabad, India


According to our database1, Manish Shrivastava authored at least 158 papers between 2006 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
SemEval Task 1: Semantic Textual Relatedness for African and Asian Languages.
CoRR, 2024

Zero-Shot Multi-task Hallucination Detection.
CoRR, 2024

Can LLMs Compute with Reasons?
CoRR, 2024

SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages.
CoRR, 2024

Fine-grained Contract NER using instruction based model.
CoRR, 2024

2023
A Computational Algebraic Analysis of Hindi Syntax.
J. Log. Lang. Inf., December, 2023

Unsupervised Approach to Evaluate Sentence-Level Fluency: Do We Really Need Reference?
CoRR, 2023

MEE4 and XLsim : IIIT HYD's Submissions' for WMT23 Metrics Shared Task.
Proceedings of the Eighth Conference on Machine Translation, 2023

IIIT HYD's Submission for WMT23 Test-suite Task.
Proceedings of the Eighth Conference on Machine Translation, 2023

PrecogIIITH@WASSA2023: Emotion Detection for Urdu-English Code-mixed Text.
Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, 2023

Attention at SemEval-2023 Task 10: Explainable Detection of Online Sexism (EDOS).
Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023

LTRC at SemEval-2023 Task 6: Experiments with Ensemble Embeddings.
Proceedings of the The 17th International Workshop on Semantic Evaluation, 2023

Event Annotation and Detection in Kannada-English Code-Mixed Social Media Data.
Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, 2023

Mukhyansh: A Headline Generation Dataset for Indic Languages.
Proceedings of the 37th Pacific Asia Conference on Language, 2023

The WEAVE 2.0 Corpus: Role Labelled Synthetic Chemical Procedures from Patents with Chemical Named Entities.
Proceedings of the 37th Pacific Asia Conference on Language, 2023

Fine-grained Contract NER using instruction based mode.
Proceedings of the 37th Pacific Asia Conference on Language, 2023

MultiFacet: A Multi-Tasking Framework for Speech-to-Sign Language Generation.
Proceedings of the International Conference on Multimodal Interaction, 2023

LTRC_IIITH's 2023 Submission for Prompting Large Language Models as Explainable Metrics Task.
Proceedings of the 4th Workshop on Evaluation and Comparison of NLP Systems, 2023

PMIndiaSum: Multilingual and Cross-lingual Headline Summarization for Languages in India.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

TourismNLG: A Multi-lingual Generative Benchmark for the Tourism Domain.
Proceedings of the Advances in Information Retrieval, 2023

BRR-QA: Boosting Ranking and Reading in Open-Domain Question Answering.
Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD), 2023

X-RiSAWOZ: High-Quality End-to-End Multilingual Dialogue Datasets and Few-shot Agents.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Is My Model Using The Right Evidence? Systematic Probes for Examining Evidence-Based Tabular Reasoning.
Trans. Assoc. Comput. Linguistics, 2022

PreCogIIITH at HinglishEval : Leveraging Code-Mixing Metrics & Language Model Embeddings To Estimate Code-Mix Quality.
CoRR, 2022

REUSE: REference-free UnSupervised Quality Estimation Metric.
Proceedings of the Seventh Conference on Machine Translation, 2022

Unsupervised Embedding-based Metric for MT Evaluation with Improved Human Correlation.
Proceedings of the Seventh Conference on Machine Translation, 2022

Tesla at SemEval-2022 Task 4: Patronizing and Condescending Language Detection using Transformer-based Models with Data Augmentation.
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

Bilingual Tabular Inference: A Case Study on Indic Languages.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

HashSet - A Dataset For Hashtag Segmentation.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

TeSum: Human-Generated Abstractive Summarization Corpus for Telugu.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Named Entity Recognition for Code-Mixed Kannada-English Social Media Data.
Proceedings of the 19th International Conference on Natural Language Processing, 2022

Generalised Spherical Text Embedding.
Proceedings of the 19th International Conference on Natural Language Processing, 2022

SConE: Contextual Relevance based Significant CompoNent Extraction from Contracts.
Proceedings of the 19th International Conference on Natural Language Processing, 2022

Indian Language Summarization using Pretrained Sequence-to-Sequence Models.
Proceedings of the Working Notes of FIRE 2022, 2022

DocInfer: Document-level Natural Language Inference using Optimal Evidence Selection.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Leveraging Data Recasting to Enhance Tabular Reasoning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

MARCUS: An Event-Centric NLP Pipeline that generates Character Arcs from Narratives.
Proceedings of Text2Story, 2022

LTRC @MuP 2022: Multi-Perspective Scientific Document Summarization Using Pre-trained Generation Models.
Proceedings of the Third Workshop on Scholarly Document Processing, 2022

"Kanglish alli names!" Named Entity Recognition for Kannada-English Code-Mixed Social Media Data.
Proceedings of the Eighth Workshop on Noisy User-generated Text, 2022

Diverse Multi-Answer Retrieval with Determinantal Point Processes.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

LTRC @ Causal News Corpus 2022: Extracting and Identifying Causal Elements using Adapters.
Proceedings of the 5th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text, 2022

Towards Fine-grained Classification of Climate Change related Social Media Text.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2022

SyMCoM - Syntactic Measure of Code Mixing A Study Of English-Hindi Code-Mixing.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
A3-108 Machine Translation System for Similar Language Translation Shared Task 2021.
Proceedings of the Sixth Conference on Machine Translation, 2021

"Subverting the Jewtocracy": Online Antisemitism Detection Using Multimodal Deep Learning.
Proceedings of the WebSci '21: 13th ACM Web Science Conference 2021, 2021

Topic Shift Detection for Mixed Initiative Response.
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, 2021

Volta at SemEval-2021 Task 9: Statement Verification and Evidence Finding with Tables using TAPAS and Transfer Learning.
Proceedings of the 15th International Workshop on Semantic Evaluation, 2021

A Dynamic Head Importance Computation Mechanism for Neural Machine Translation.
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), 2021

Battling Hateful Content in Indic Languages HASOC'21.
Proceedings of the Working Notes of FIRE 2021, 2021

Fusion of Intrinsic & Extrinsic Sentential Traits for Text Coherence Assessment.
Proceedings of the CODS-COMAD 2021: 8th ACM IKDD CODS and 26th COMAD, 2021

CoMeT: Towards Code-Mixed Translation Using Parallel Monolingual Sentences.
Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching, 2021

Translate and Classify: Improving Sequence Level Classification for English-Hindi Code-Mixed Data.
Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching, 2021

2020
Semantic Textual Similarity of Sentences with Emojis.
Proceedings of the Companion of The 2020 Web Conference 2020, 2020

A3-108 Machine Translation System for Similar Language Translation Shared Task 2020.
Proceedings of the Fifth Conference on Machine Translation, 2020

ConfNet2Seq - Full Length Answer Generation from Spoken Questions.
Proceedings of the Text, Speech, and Dialogue, 2020

Cross-Lingual Transfer for Hindi Discourse Relation Identification.
Proceedings of the Text, Speech, and Dialogue, 2020

SIS@IIITH at SemEval-2020 Task 8: An Overview of Simple Text Classification Methods for Meme Analysis.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Word Embeddings as Tuples of Feature Probabilities.
Proceedings of the 5th Workshop on Representation Learning for NLP, 2020

NoEl: An Annotated Corpus for Noun Ellipsis in English.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Tag2Risk: Harnessing Social Music Tags for Characterizing Depression Risk.
Proceedings of the 21th International Society for Music Information Retrieval Conference, 2020

Modeling ASR Ambiguity for Neural Dialogue State Tracking.
Proceedings of the Interspeech 2020, 2020

Creation of Corpus and Analysis in Code-Mixed Kannada-English Social Media Data for POS Tagging.
Proceedings of the 17th International Conference on Natural Language Processing, 2020

The WEAVE Corpus: Annotating Synthetic Chemical Procedures in Patents with Chemical Named Entities.
Proceedings of the 17th International Conference on Natural Language Processing, 2020

Improving Passage Re-Ranking with Word N-Gram Aware Coattention Encoder.
Proceedings of the 17th International Conference on Natural Language Processing, 2020

Principle-to-Program: Neural Methods for Similar Question Retrieval in Online Communities.
Proceedings of the Advances in Information Retrieval, 2020

MEE : An Automatic Metric for Evaluation Using Embeddings for Machine Translation.
Proceedings of the 7th IEEE International Conference on Data Science and Advanced Analytics, 2020

Finding The Right One and Resolving it.
Proceedings of the 24th Conference on Computational Natural Language Learning, 2020

AVADHAN: System for Open-Domain Telugu Question Answering.
Proceedings of the CoDS-COMAD 2020: 7th ACM IKDD CoDS and 25th COMAD, 2020

Creation of Corpus and analysis in Code-Mixed Kannada-English Twitter data for Emotion Prediction.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

AbuseAnalyzer: Abuse Detection, Severity and Target Prediction for Gab Posts.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

A Simple and Effective Dependency Parser for Telugu.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2020

SCAR: Sentence Compression using Autoencoders for Reconstruction.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, 2020

A Multi-Dimensional View of Aggression when voicing Opinion.
Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, 2020

2019
Curriculum Learning Strategies for Hindi-English Codemixed Sentiment Analysis.
CoRR, 2019

Fermi at SemEval-2019 Task 8: An elementary but effective approach to Question Discernment in Community QA Forums.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

FERMI at SemEval-2019 Task 5: Using Sentence embeddings to Identify Hate Speech Against Immigrants and Women in Twitter.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Fermi at SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media using Sentence Embeddings.
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Using Argumentative Semantic Feature for Summarization.
Proceedings of the 13th IEEE International Conference on Semantic Computing, 2019

Using Syntax to Resolve NPE in English.
Proceedings of the International Conference on Recent Advances in Natural Language Processing, 2019

A Pregroup Representation of Word Order Alternation Using Hindi Syntax.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Curriculum Learning Strategies for Hindi-English Code-Mixed Sentiment Analysis.
Proceedings of the Artificial Intelligence. IJCAI 2019 International Workshops, 2019

Inductive Transfer Learning for Detection of Well-Formed Natural Language Search Queries.
Proceedings of the Advances in Information Retrieval, 2019

Predicting Algorithm Classes for Programming Word Problems.
Proceedings of the 5th Workshop on Noisy User-generated Text, 2019

Corpus Creation and Analysis for Named Entity Recognition in Telugu-English Code-Mixed Social Media Data.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

De-Mixing Sentiment from Code-Mixed Text.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Gender Prediction in English-Hindi Code-Mixed Social Media Content: Corpus and Baseline System.
Computación y Sistemas, 2018

SWDE : A Sub-Word And Document Embedding Based Engine for Clickbait Detection.
CoRR, 2018

Cross-Lingual Task-Specific Representation Learning for Text Classification in Resource Poor Languages.
CoRR, 2018

A Corpus of English-Hindi Code-Mixed Tweets for Sarcasm Detection.
CoRR, 2018

An English-Hindi Code-Mixed Corpus: Stance Annotation and Baseline System.
CoRR, 2018

Neural Network Architecture for Credibility Assessment of Textual Claims.
CoRR, 2018

BoWLer: A neural approach to extractive text summarization.
Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation, 2018

Too Many Questions? What Can We Do? : Multiple Question Span Detection.
Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation, 2018

Corpus Creation and Emotion Prediction for Hindi-English Code-Mixed Social Media Text.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics, 2018

Universal Dependency Parsing for Hindi-English Code-Switching.
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2018

Humor Detection in English-Hindi Code-Mixed Social Media Content : Corpus and Baseline System.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Towards Word Embeddings for Improved Duplicate Bug Report Retrieval in Software Repositories.
Proceedings of the 2018 ACM SIGIR International Conference on Theory of Information Retrieval, 2018

LWE: LDA refined word embeddings for duplicate bug report detection.
Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings, 2018

DWEN: deep word embedding network for duplicate bug report detection in software repositories.
Proceedings of the 40th International Conference on Software Engineering: Companion Proceeedings, 2018

Deep Learning methods for Semantic Role Labeling in Indian Languages.
Proceedings of the 15th International Conference on Natural Language Processing, 2018

"Is This A Joke?": A Large Humor Classification Dataset.
Proceedings of the 15th International Conference on Natural Language Processing, 2018

Transzaar: Empowers Human Translators.
Proceedings of the 18th International Conference on Computational Science and Applications, 2018

A Dataset for Detecting Irony in Hindi-English Code-Mixed Social Media Text.
Proceedings of 4th Workshop on Sentic Computing, 2018

Degree based Classification of Harmful Speech using Twitter Data.
Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying, 2018

Twitter corpus of Resource-Scarce Languages for Sentiment Analysis and Multilingual Emoji Prediction.
Proceedings of the 27th International Conference on Computational Linguistics, 2018

Automatic Normalization of Word Variations in Code-Mixed Social Media Text.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2018

Neural Network Architecture for Credibility Assessment of Textual Claims (Best Paper Award, First Place).
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2018

Contrastive Learning of Emoji-Based Representations for Resource-Poor Languages.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2018

Emotions Are Universal: Learning Sentiment Based Representations of Resource-Poor Languages Using Siamese Networks.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2018

Sentiment Analysis of Code-Mixed Languages Leveraging Resource Rich Languages.
Proceedings of the Computational Linguistics and Intelligent Text Processing, 2018

Named Entity Recognition for Hindi-English Code-Mixed Social Media Text.
Proceedings of the Seventh Named Entities Workshop, 2018

Automatic Question Generation using Relative Pronouns and Adverbs.
Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, Student Research Workshop, 2018

Exploring Chunk Based Templates for Generating a subset of English Text.
Proceedings of ACL 2018, Melbourne, Australia, July 15-20, 2018, Student Research Workshop, 2018

A Dataset of Hindi-English Code-Mixed Social Media Text for Hate Speech Detection.
Proceedings of the Second Workshop on Computational Modeling of People's Opinions, 2018

Transliteration Better than Translation? Answering Code-mixed Questions over a Knowledge Base.
Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching@ACL 2018, 2018

Aggression Detection on Social Media Text Using Deep Neural Networks.
Proceedings of the 2nd Workshop on Abusive Language Online, 2018

2017
Relevance Scoring of Triples Using Ordinal Logistic Classification - The Celosia Triple Scorer at WSDM Cup 2017.
CoRR, 2017

An Unsupervised Approach for Mapping between Vector Spaces.
CoRR, 2017

Unsupervised Morphological Expansion of Small Datasets for Improving Word Embeddings.
CoRR, 2017

LTRC IIITH at IBEREVAL 2017: Stance and Gender Detection in Tweets on Catalan Independence.
Proceedings of the Second Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2017) co-located with 33th Conference of the Spanish Society for Natural Language Processing (SEPLN 2017), 2017

Classification Of Spanish Election Tweets (COSET) 2017 : Classifying Tweets Using Character and Word Level Features.
Proceedings of the Second Workshop on Evaluation of Human Language Technologies for Iberian Languages (IberEval 2017) co-located with 33th Conference of the Spanish Society for Natural Language Processing (SEPLN 2017), 2017

DNN-HMM Acoustic Modeling for Large Vocabulary Telugu Speech Recognition.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2017

Significance of DNN-AM for Multimodal Sentiment Analysis.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2017

Significance of neural phonotactic models for large-scale spoken language identification.
Proceedings of the 2017 International Joint Conference on Neural Networks, 2017

Injecting Word Embeddings with Another Language's Resource : An Application of Bilingual Embeddings.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Deep Neural Network based system for solving Arithmetic Word problems.
Proceedings of the IJCNLP 2017, Tapei, Taiwan, November 27, 2017

Beyond Word2Vec: Embedding Words and Phrases in Same Vector Space.
Proceedings of the 14th International Conference on Natural Language Processing, 2017

End to End Dialog System for Telugu.
Proceedings of the 14th International Conference on Natural Language Processing, 2017

Improve performance of machine translation service using memcached.
Proceedings of the Computational Science and Its Applications - ICCSA 2017, 2017

Sentiment analysis using relative prosody features.
Proceedings of the Tenth International Conference on Contemporary Computing, 2017

Exploiting Morphological Regularities in Distributional Word Representations.
Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2017

Transition-Based Deep Input Linearization.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

Joining Hands: Exploiting Monolingual Treebanks for Parsing of Code-mixing Data.
Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, 2017

WebShodh: A Code Mixed Factoid Question Answering System for Web.
Proceedings of the Experimental IR Meets Multilinguality, Multimodality, and Interaction, 2017

Word Similarity Datasets for Indian Languages: Annotation and Baseline Systems.
Proceedings of the 11th Linguistic Annotation Workshop, 2017

The Unusual Suspects: Deep Learning Based Mining of Interesting Entity Trivia from Knowledge Graphs.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
Deep Feature Fusion Network for Answer Quality Prediction in Community Question Answering.
CoRR, 2016

Articulatory Gesture Rich Representation Learning of Phonological Units in Low Resource Settings.
Proceedings of the Statistical Language and Speech Processing, 2016

Mirror on the Wall: Finding Similar Questions with Deep Structured Topic Modeling.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2016

Shallow Parsing Pipeline - Hindi-English Code-Mixed Social Media Text.
Proceedings of the NAACL HLT 2016, 2016

Transition-Based Syntactic Linearization with Lookahead Features.
Proceedings of the NAACL HLT 2016, 2016

Kathaa: A Visual Programming Framework for NLP Applications.
Proceedings of the Demonstrations Session, 2016

Multimodal Sentiment Analysis Using Deep Neural Networks.
Proceedings of the Mining Intelligence and Knowledge Exploration, 2016

Vaidya: A Spoken Dialog System for Health Domain.
Proceedings of the 13th International Conference on Natural Language Processing, 2016

Towards Deep Learning in Hindi NER: An approach to tackle the Labelled Data Sparsity.
Proceedings of the 13th International Conference on Natural Language Processing, 2016

Code Mixed Entity Extraction in Indian Languages using Neural Networks.
Proceedings of the Working notes of FIRE 2016, 2016

Hand in Glove: Deep Feature Fusion Network Architectures for Answer Quality Prediction in Community Question Answering.
Proceedings of the COLING 2016, 2016

Towards Sub-Word Level Compositions for Sentiment Analysis of Hindi-English Code Mixed Text.
Proceedings of the COLING 2016, 2016

Together we stand: Siamese Networks for Similar Question Retrieval.
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, 2016

2015
A language model based approach towards large scale and lightweight language identification systems.
CoRR, 2015

"Answer ka type kya he?": Learning to Classify Questions in Code-Mixed Language.
Proceedings of the 24th International Conference on World Wide Web Companion, 2015

IIITH at BioASQ Challange 2015 Task 3b: Bio-Medical Question Answering System.
Proceedings of the Working Notes of CLEF 2015, 2015

IIITH at BioASQ Challenge 2015 Task 3a: Extreme Classification of PubMed Articles using MeSH Labels.
Proceedings of the Working Notes of CLEF 2015, 2015

2014
Do not do processing, when you can look up: Towards a Discrimination Net for WSD.
Proceedings of the Seventh Global Wordnet Conference, 2014

PaCMan : Parallel Corpus Management Workbench.
Proceedings of the 11th International Conference on Natural Language Processing, 2014

IIIT-H System Submission for FIRE2014 Shared Task on Transliterated Search.
Proceedings of the Forum for Information Retrieval Evaluation, 2014

2013
Cluster formation through improved weighted clustering algorithm (IWCA) for mobile ad-hoc networks.
Proceedings of the Tenth International Conference on Wireless and Optical Communications Networks, 2013

2006
Morphological Richness Offsets Resource Demand - Experiences in Constructing a POS Tagger for Hindi.
Proceedings of the ACL 2006, 2006


  Loading...