Zheng-Xin Yong

CoRR, October, 2025

Can We Predict Alignment Before Models Finish Thinking? Towards Monitoring Misaligned Reasoning Models.

Yik Siu Chan

Zheng-Xin Yong

CoRR, July, 2025

Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability.

CoRR, June, 2025

The State of Multilingual LLM Safety Research: From Measuring the Language Gap to Mitigating It.

CoRR, May, 2025

Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack.

CoRR, May, 2025

Crosslingual Reasoning through Test-Time Scaling.

Zheng-Xin Yong

CoRR, May, 2025

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs.

CoRR, February, 2025

Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks.

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Effects of Speaker Count, Duration, and Accent Diversity on Zero-Shot Accent Robustness in Low-Resource ASR.

Proceedings of the 26th Annual Conference of the International Speech Communication Association, 2025

2024

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.

Muhammad Ravi Shulthan Habibi

Rahmad Mahendra

Salsabil Maulana Akbar

Lester James V. Miranda

Joseph Marvin Imperial

Onno Pepijn Kampman

Joel Ruben Antony Moniz

Patrick Amadeus Irawan

Bin Wang

Muhammad Dehan Al Kautsar

Chenxi Whitehouse

Ivan Halim Parmonangan

Sonny Lazuardi Hermawan

Dan John Velasco

Willy Fitra Hendria

Yasmin Moslem

Noah Flynn

Peerat Limkonchotiwat

CoRR, 2024

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark.

David Romero

Chenyang Lyu

Haryo Akbarianto Wibowo

Henok Biadglign Ademtew

Hernán Maina

Israel Abebe Azime

Jesús-Germán Ortiz-Barajas

Jay P. Gala

Jiahui Geng

Jinheon Baek

Jocelyn Dunstan

Laura Alonso Alemany

Kumaranage Ravindu Yasas Nagasinghe

Luciana Benotti

Luis Fernando D'Haro

Marcelo Viridiano

Marcos Estecha-Garitagoitia

Maria Camila Buitrago Cabrera

Mario Rodríguez-Cantelar

Mélanie Jouitteau

Mihail Mihaylov

Mohamed Fazli Mohamed Imam

Munkhjargal Gochoo

Munkh-Erdene Otgonbold

Toqeer Ehsan

Vladimir Araujo

Yova Kementchedjhieva

CoRR, 2024

A Safe Harbor for AI Evaluation and Red Teaming.

CoRR, 2024

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark.

David Romero

Chenyang Lyu

Haryo Akbarianto Wibowo

Santiago Góngora

Aishik Mandal

Sukannya Purkayastha

Munkh-Erdene Otgonbold

Frederico Belcavello

Marcelo Viridiano

Christian Salamea Palacios

Vladimir Araujo

Yova Kementchedjhieva

Mihail Mihaylov

Israel Abebe Azime

Henok Biadglign Ademtew

Bontu Fufa Balcha

Naome A. Etori

Maria Camila Buitrago Cabrera

Rada Mihalcea

Atnafu Lambebo Tonja

Gisela Vallejo

Marcos Estecha-Garitagoitia

Ruochen Zhang

Mario Rodríguez-Cantelar

Toqeer Ehsan

Rendi Chevi

Mohamed Fazli Mohamed Imam

Kumaranage Ravindu Yasas Nagasinghe

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Position: A Safe Harbor for AI Evaluation and Red Teaming.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons.

Cristina Menghini

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.

Muhammad Ravi Shulthan Habibi

Rahmad Mahendra

Salsabil Maulana Akbar

Lester James V. Miranda

Joseph Marvin Imperial

Onno Kampman

Joel Ruben Antony Moniz

Patrick Amadeus Irawan

Bin Wang

Muhammad Dehan Al Kautsar

Chenxi Whitehouse

Ivan Halim Parmonangan

Sonny Lazuardi Hermawan

Dan John Velasco

Willy Fitra Hendria

Yasmin Moslem

Noah Flynn

Peerat Limkonchotiwat

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Preference Tuning For Toxicity Mitigation Generalizes Across Languages.

Xiaochen Li

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model.

Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023

Low-Resource Languages Jailbreak GPT-4.

Cristina Menghini

CoRR, 2023

Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages.

Long Phan

Yin Lin Tan

Alham Fikri Aji

CoRR, 2023

Representativeness as a Forgotten Lesson for Multilingual and Code-switched Data Collection and Preparation.

A. Seza Dogruöz

Sunayana Sitaram

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges.

Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Crosslingual Generalization through Multitask Finetuning.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.

CoRR, 2022

An Exploration of Neural Radiance Field Scene Reconstruction: Synthetic, Real-world and Dynamic Scenes.

CoRR, 2022

Adapting BigScience Multilingual Model to Unseen Languages.

Vassilina Nikoulina

CoRR, 2022

PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts.

CoRR, 2022

Frame Shift Prediction.

Patrick D. Watson

Oliver Czulo

Collin F. Baker

Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Multitask Prompted Training Enables Zero-Shot Task Generalization.

Proceedings of the Tenth International Conference on Learning Representations, 2022

What Language Model to Train if You Have One Million GPU Hours?

Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022

PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts.

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

2021

Multitask Prompted Training Enables Zero-Shot Task Generalization.

CoRR, 2021

2020

Semi-supervised Deep Embedded Clustering with Anomaly Detection for Semantic Frame Induction.
