Ritesh Kumar

Orcid: 0000-0002-5151-2546

Affiliations:

Council for Strategic and Defense Research, Division of Artificial Intelligence and Linguistics, Delhi, India
UnReaL-TecE LLP, Agra, India
Jawaharlal Nehru University, Centre for Linguistics, India
Dr. Bhimrao Ambedkar University, Department of Linguistics, K. M. Institute of Hindi and Linguistics, Agra, India

According to our database¹, Ritesh Kumar authored at least 35 papers between 2010 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Bibliography

2024

A multilingual, multimodal dataset of aggression and bias: the ComMA dataset.

[BibT_eX]

[DOI]

Laishram Niranjana Devi

Lang. Resour. Evaluation, June, 2024

Demo of LiFE: A web app for collection, management and annotation of linguistic data.

[BibT_eX]

[DOI]

Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), 2024

HarmPot: An Annotation Framework for Evaluating Offline Harm Potential of Social Media Text.

[BibT_eX]

[DOI]

Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2022

Annotated Speech Corpus for Low Resource Indian Languages: Awadhi, Bhojpuri, Braj and Magahi.

[BibT_eX]

[DOI]

CoRR, 2022

Developing Universal Dependency Treebanks for Magahi and Braj.

[BibT_eX]

[DOI]

CoRR, 2022

Language Resources and Technologies for Non-Scheduled and Endangered Indian Languages.

[BibT_eX]

[DOI]

Ritesh Kumar

Bornini Lahiri

CoRR, 2022

Aggression in Hindi and English Speech: Acoustic Correlates and Automatic Identification.

[BibT_eX]

[DOI]

CoRR, 2022

Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022).

[BibT_eX]

[DOI]

Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

The ComMA Dataset V0.2: Annotating Aggression and Bias in Multilingual Social Media Discourse.

[BibT_eX]

[DOI]

Laishram Niranjana Devi

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

UniMorph 4.0: Universal Morphology.

[BibT_eX]

[DOI]

Khuyagbaatar Batsuren

Jaime Rafael Montoya Samame

Delio Siticonatzi Camaiteri

Gema Celeste Silva Villegas

Lucas Torroba Hennigen

Adam Ek

David Guriel

Peter Dirix

Jean-Philippe Bernardy

Andrey Scherbakov

Aziyana Bayyr-ool

Antonios Anastasopoulos

Natalia Krizhanovskaya

Jonathan North Washington

Maria Nepomniashchaya

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2021

Aggressive and Offensive Language Identification in Hindi, Bangla, and English: A Comparative Study.

[BibT_eX]

[DOI]

Ritesh Kumar

Bornini Lahiri

Atul Kr. Ojha

SN Comput. Sci., 2021

Creating and Managing a large annotated parallel corpora of Indian languages.

[BibT_eX]

[DOI]

CoRR, 2021

Challenges in Developing LRs for Non-Scheduled Languages: A Case of Magahi.

[BibT_eX]

[DOI]

Ritesh Kumar

CoRR, 2021

Towards automatic identification of linguistic politeness in Hindi texts.

[BibT_eX]

[DOI]

Ritesh Kumar

CoRR, 2021

The ComMA Dataset V0.2: Annotating Aggression and Bias in Multilingual Social Media Discourse.

[BibT_eX]

[DOI]

Ritesh Kumar

Enakshi Nandi

Laishram Niranjana Devi

CoRR, 2021

SIGTYP 2021 Shared Task: Robust Spoken Language Identification.

[BibT_eX]

[DOI]

CoRR, 2021

Demo of the Linguistic Field Data Management and Analysis System - LiFE.

[BibT_eX]

[DOI]

Proceedings of the 18th International Conference on Natural Language Processing (ICON 2021), National Institute of Technology Silchar, Silchar, India, December 16, 2021

2020

NUIG-Panlingua-KMI Hindi-Marathi MT Systems for Similar Language Translation Task @ WMT 2020.

[BibT_eX]

[DOI]

Atul Kr. Ojha

Priya Rani

Akanksha Bansal

Bharathi Raja Chakravarthi

Ritesh Kumar

John P. McCrae

Proceedings of the Fifth Conference on Machine Translation, 2020

ComMA@FIRE 2020: Exploring Multilingual Joint Training across different Classification Tasks.

[BibT_eX]

[DOI]

Proceedings of the Working Notes of FIRE 2020, 2020

Evaluating Aggression Identification in Social Media.

[BibT_eX]

[DOI]

Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, 2020

Developing a Multilingual Annotated Corpus of Misogyny and Aggression.

[BibT_eX]

[DOI]

Shiladitya Bhattacharya

Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, 2020

2019

Panlingua-KMI MT System for Similar Language Translation Task at WMT 2019.

[BibT_eX]

[DOI]

Proceedings of the Fourth Conference on Machine Translation, 2019

SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval).

[BibT_eX]

[DOI]

Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Predicting the Type and Target of Offensive Posts in Social Media.

[BibT_eX]

[DOI]

Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

KMI-Panlingua at HASOC 2019: SVM vs BERT for Hate Speech and Offensive Content Detection.

[BibT_eX]

[DOI]

Ritesh Kumar

Atul Kr. Ojha

Proceedings of the Working Notes of FIRE 2019, 2019

2018

Automatic Identification of Closely-related Indian Languages: Resources and Experiments.

[BibT_eX]

[DOI]

CoRR, 2018

Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign.

[BibT_eX]

[DOI]

Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, 2018

Benchmarking Aggression Identification in Social Media.

[BibT_eX]

[DOI]

Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying, 2018

2014

Developing Politeness Annotated Corpus of Hindi Blogs.

[BibT_eX]

[DOI]

Ritesh Kumar

Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

2012

Challenges in the development of annotated corpora of computer-mediated communication in Indian Languages: A Case of Hindi.

[BibT_eX]

[DOI]

Ritesh Kumar

Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Developing a POS tagger for Magahi: A Comparative Study.

[BibT_eX]

[DOI]

Ritesh Kumar

Bornini Lahiri

Deepak Alok

Proceedings of the 10th Workshop on Asian Language Resources, 2012

2011

A politeness recognition tool for Hindi: with special emphasis on online texts.

[BibT_eX]

[DOI]

Ritesh Kumar

Proceedings of the 20th International Conference on World Wide Web, 2011

A register-based annotation scheme for CO3H.

[BibT_eX]

[DOI]

Ritesh Kumar

Proceedings of the International Conference on Web Intelligence, Mining and Semantics, 2011

Developing LRs for Non-scheduled Indian Languages - A Case of Magahi.

[BibT_eX]

[DOI]

Ritesh Kumar

Bornini Lahiri

Deepak Alok

Proceedings of the Human Language Technology Challenges for Computer Science and Linguistics, 2011

2010

Translating politeness across cultures: case of Hindi and English.

[BibT_eX]

[DOI]

Ritesh Kumar

Girish Nath Jha

Proceedings of the 3rd international conference on Intercultural collaboration, 2010

Ritesh Kumar

Timeline

Legend:

Links

Online presence:

On csauthors.net:

Bibliography

Loading...