Peerat Limkonchotiwat

According to our database1, Peerat Limkonchotiwat authored at least 40 papers between 2020 and 2025.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
WangchanThaiInstruct: An instruction-following Dataset for Culture-Aware, Multitask, and Multi-domain Evaluation in Thai.
CoRR, August, 2025

SEA-BED: Southeast Asia Embedding Benchmark.
CoRR, August, 2025

SEADialogues: A Multilingual Culturally Grounded Multi-turn Dialogue Dataset on Southeast Asian Languages.
CoRR, August, 2025

Mangosteen: An Open Thai Corpus for Language Model Pretraining.
CoRR, July, 2025

Language Surgery in Multilingual Large Language Models.
CoRR, June, 2025

Decom-Renorm-Merge: Model Merging on the Right Space Improves Multitasking.
CoRR, May, 2025

Assessing Thai Dialect Performance in LLMs with Automatic Benchmarks and Human Evaluation.
CoRR, April, 2025

SEA-LION: Southeast Asian Languages in One Network.
CoRR, April, 2025

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, March, 2025


SEA-HELM: Southeast Asian Holistic Evaluation of Language Models.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Towards Better Understanding of Program-of-Thought Reasoning in Cross-Lingual and Multilingual Environments.
Proceedings of the Findings of the Association for Computational Linguistics, 2025

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation.
CoRR, 2024

Can General-Purpose Large Language Models Generalize to English-Thai Machine Translation ?
CoRR, 2024

WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines.
CoRR, 2024

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.
CoRR, 2024

WangchanLion and WangchanX MRC Eval.
CoRR, 2024

Efficient Overshadowed Entity Disambiguation by Mitigating Shortcut Learning.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

On Creating an English-Thai Code-switched Machine Translation in Medical Domain.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

An Empirical Study of Multilingual Reasoning Distillation for Question Answering.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024


McCrolin: Multi-consistency Cross-lingual Training for Retrieval Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Space Decomposition for Sentence Embedding.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

Seed-Free Synthetic Data Generation Framework for Instruction-Tuning LLMs: A Case Study in Thai.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

Identifying and Mitigating Annotation Bias in Natural Language Understanding using Causal Mediation Analysis.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
An Efficient Self-Supervised Cross-View Training For Sentence Embedding.
Trans. Assoc. Comput. Linguistics, 2023

PyThaiNLP: Thai Natural Language Processing in Python.
CoRR, 2023

Two-stage Thai Misspelling Correction based on Pre-trained Language Models.
Proceedings of the 20th IEEE International Joint Conference on Computer Science and Software Engineering, 2023

mReFinED: An Efficient End-to-End Multilingual Entity Linking System.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Typo-Robust Representation Learning for Dense Retrieval.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2023

2022
Thai Wav2Vec2.0 with CommonVoice V8.
CoRR, 2022

CL-ReLKT: Cross-lingual Language Knowledge Transfer for Multilingual Retrieval Question Answering.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022

ConGen: Unsupervised Control and Generalization Distillation For Sentence Representation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

Thai Nested Named Entity Recognition Corpus.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

2021
AI Builders: Teaching Thai Students to Build End-to-End Machine Learning Projects Online.
Proceedings of the 2021 IEEE International Conference on Engineering, 2021

Robust Fragment-Based Framework for Cross-lingual Sentence Retrieval.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021

Handling Cross- and Out-of-Domain Samples in Thai Word Segmentation.
Proceedings of the Findings of the Association for Computational Linguistics: ACL/IJCNLP 2021, 2021

2020
Domain Adaptation of Thai Word Segmentation Models using Stacked Ensemble.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020


  Loading...