Sumanth Doddapaneni

Orcid: 0000-0003-4248-646X

According to our database1, Sumanth Doddapaneni authored at least 17 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages.
CoRR, 2024

User Embedding Model for Personalized Language Prompting.
CoRR, 2024

A Comprehensive Analysis of Adapter Efficiency.
Proceedings of the 7th Joint International Conference on Data Science & Management of Data (11th ACM IKDD CODS and 29th COMAD), 2024

2023
A Survey of Adversarial Defenses and Robustness in NLP.
ACM Comput. Surv., 2023

IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages.
CoRR, 2023

Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages.
Proceedings of the IEEE International Conference on Acoustics, 2023

Naamapadam: A Large-Scale Named Entity Annotated Data for Indic Languages.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Varta: A Large-Scale Headline-Generation Dataset for Indic Languages.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages.
Trans. Assoc. Comput. Linguistics, 2022

Offence Detection in Dravidian Languages Using Code-Mixing Index-Based Focal Loss.
SN Comput. Sci., 2022

IndicXTREME: A Multi-Task Benchmark For Evaluating Indic Languages.
CoRR, 2022

A Survey in Adversarial Defences and Robustness in NLP.
CoRR, 2022

Towards Building ASR Systems for the Next Billion Users.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Offense Detection in Dravidian Languages using Code-Mixing Index based Focal Loss.
CoRR, 2021

A Primer on Pretrained Multilingual Language Models.
CoRR, 2021

Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages.
CoRR, 2021


  Loading...