Varun Gumma
Orcid: 0009-0002-5746-3017
  According to our database1,
  Varun Gumma
  authored at least 20 papers
  between 2021 and 2025.
  
  
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
  2025
    CoRR, September, 2025
    
  
The role of synthetic data in Multilingual, Multi-cultural AI systems: Lessons from Indic Languages.
    
  
    CoRR, September, 2025
    
  
Towards Inducing Long-Context Abilities in Multilingual Neural Machine Translation Models.
    
  
    Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025
    
  
  2024
HEALTH-PARIKSHA: Assessing RAG Models for Health Chatbots in Real-World Multilingual Settings.
    
  
    CoRR, 2024
    
  
On the Interchangeability of Positional Embeddings in Multilingual Neural Machine Translation Models.
    
  
    CoRR, 2024
    
  
Beyond Metrics: Evaluating LLMs' Effectiveness in Culturally Nuanced, Low-Resource Real-World Scenarios.
    
  
    CoRR, 2024
    
  
    Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2024, 2024
    
  
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks.
    
  
    Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024
    
  
    Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency, 2024
    
  
PARIKSHA: A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data.
    
  
    Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
    
  
    Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024
    
  
Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation?
    
  
    Proceedings of the Findings of the Association for Computational Linguistics: EACL 2024, 2024
    
  
  2023
IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages.
    
  
    Trans. Mach. Learn. Res., 2023
    
  
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks.
    
  
    CoRR, 2023
    
  
An Empirical Study of Leveraging Knowledge Distillation for Compressing Multilingual Neural Machine Translation Models.
    
  
    Proceedings of the 24th Annual Conference of the European Association for Machine Translation, 2023
    
  
  2022
    Proceedings of the 19th International Conference on Security and Cryptography, 2022
    
  
  2021