Karan Dua
According to our database, Karan Dua authored at least 5 papers between 2025 and 2026.
Bibliography
2026
CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data.
CoRR, January, 2026
2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
FlexDoc: Parameterized Sampling for Diverse Multilingual Synthetic Documents for Training Document Understanding Models.
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
SpeechWeave: Diverse Multilingual Synthetic Text & Audio Data Generation Pipeline for Training Text to Speech Models.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 6: Industry Track), 2025