Ritesh Kumar

Orcid: 0000-0002-5151-2546

Affiliations:
  • Council for Strategic and Defense Research, Division of Artificial Intelligence and Linguistics, Delhi, India
  • UnReaL-TecE LLP, Agra, India
  • Jawaharlal Nehru University, Centre for Linguistics, India
  • Dr. Bhimrao Ambedkar University, Department of Linguistics, K. M. Institute of Hindi and Linguistics, Agra, India


According to our database1, Ritesh Kumar authored at least 35 papers between 2010 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
A multilingual, multimodal dataset of aggression and bias: the ComMA dataset.
Lang. Resour. Evaluation, June, 2024

Demo of LiFE: A web app for collection, management and annotation of linguistic data.
Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), 2024

HarmPot: An Annotation Framework for Evaluating Offline Harm Potential of Social Media Text.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2022
Annotated Speech Corpus for Low Resource Indian Languages: Awadhi, Bhojpuri, Braj and Magahi.
CoRR, 2022

Developing Universal Dependency Treebanks for Magahi and Braj.
CoRR, 2022

Language Resources and Technologies for Non-Scheduled and Endangered Indian Languages.
CoRR, 2022

Aggression in Hindi and English Speech: Acoustic Correlates and Automatic Identification.
CoRR, 2022

Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022).
Proceedings of the 16th International Workshop on Semantic Evaluation, SemEval@NAACL 2022, 2022

The ComMA Dataset V0.2: Annotating Aggression and Bias in Multilingual Social Media Discourse.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

UniMorph 4.0: Universal Morphology.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

2021
Aggressive and Offensive Language Identification in Hindi, Bangla, and English: A Comparative Study.
SN Comput. Sci., 2021

Creating and Managing a large annotated parallel corpora of Indian languages.
CoRR, 2021

Challenges in Developing LRs for Non-Scheduled Languages: A Case of Magahi.
CoRR, 2021

Towards automatic identification of linguistic politeness in Hindi texts.
CoRR, 2021

The ComMA Dataset V0.2: Annotating Aggression and Bias in Multilingual Social Media Discourse.
CoRR, 2021

SIGTYP 2021 Shared Task: Robust Spoken Language Identification.
CoRR, 2021

Demo of the Linguistic Field Data Management and Analysis System - LiFE.
Proceedings of the 18th International Conference on Natural Language Processing (ICON 2021), National Institute of Technology Silchar, Silchar, India, December 16, 2021

2020
NUIG-Panlingua-KMI Hindi-Marathi MT Systems for Similar Language Translation Task @ WMT 2020.
Proceedings of the Fifth Conference on Machine Translation, 2020

ComMA@FIRE 2020: Exploring Multilingual Joint Training across different Classification Tasks.
Proceedings of the Working Notes of FIRE 2020, 2020

Evaluating Aggression Identification in Social Media.
Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, 2020

Developing a Multilingual Annotated Corpus of Misogyny and Aggression.
Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, 2020

2019
Panlingua-KMI MT System for Similar Language Translation Task at WMT 2019.
Proceedings of the Fourth Conference on Machine Translation, 2019

SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval).
Proceedings of the 13th International Workshop on Semantic Evaluation, 2019

Predicting the Type and Target of Offensive Posts in Social Media.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

KMI-Panlingua at HASOC 2019: SVM vs BERT for Hate Speech and Offensive Content Detection.
Proceedings of the Working Notes of FIRE 2019, 2019

2018
Automatic Identification of Closely-related Indian Languages: Resources and Experiments.
CoRR, 2018

Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign.
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, 2018

Benchmarking Aggression Identification in Social Media.
Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying, 2018

2014
Developing Politeness Annotated Corpus of Hindi Blogs.
Proceedings of the Ninth International Conference on Language Resources and Evaluation, 2014

2012
Challenges in the development of annotated corpora of computer-mediated communication in Indian Languages: A Case of Hindi.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

Developing a POS tagger for Magahi: A Comparative Study.
Proceedings of the 10th Workshop on Asian Language Resources, 2012

2011
A politeness recognition tool for Hindi: with special emphasis on online texts.
Proceedings of the 20th International Conference on World Wide Web, 2011

A register-based annotation scheme for CO3H.
Proceedings of the International Conference on Web Intelligence, Mining and Semantics, 2011

Developing LRs for Non-scheduled Indian Languages - A Case of Magahi.
Proceedings of the Human Language Technology Challenges for Computer Science and Linguistics, 2011

2010
Translating politeness across cultures: case of Hindi and English.
Proceedings of the 3rd international conference on Intercultural collaboration, 2010


  Loading...