Zheng Xin Yong

This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.

Bibliography

2025
Can We Predict Alignment Before Models Finish Thinking? Towards Monitoring Misaligned Reasoning Models.
CoRR, July, 2025

Effects of Speaker Count, Duration, and Accent Diversity on Zero-Shot Accent Robustness in Low-Resource ASR.
CoRR, June, 2025

Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability.
CoRR, June, 2025

The State of Multilingual LLM Safety Research: From Measuring the Language Gap to Mitigating It.
CoRR, May, 2025

Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack.
CoRR, May, 2025

Crosslingual Reasoning through Test-Time Scaling.
CoRR, May, 2025

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs.
CoRR, February, 2025

Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.
CoRR, 2024

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark.
CoRR, 2024

A Safe Harbor for AI Evaluation and Red Teaming.
CoRR, 2024

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024


LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024


Preference Tuning For Toxicity Mitigation Generalizes Across Languages.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
Low-Resource Languages Jailbreak GPT-4.
CoRR, 2023

Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages.
CoRR, 2023

Representativeness as a Forgotten Lesson for Multilingual and Code-switched Data Collection and Preparation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Crosslingual Generalization through Multitask Finetuning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting.
CoRR, 2022

An Exploration of Neural Radiance Field Scene Reconstruction: Synthetic, Real-world and Dynamic Scenes.
CoRR, 2022

Adapting BigScience Multilingual Model to Unseen Languages.
CoRR, 2022

PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts.
CoRR, 2022

Frame Shift Prediction.
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022


What Language Model to Train if You Have One Million GPU Hours?
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022


2021
Multitask Prompted Training Enables Zero-Shot Task Generalization.
CoRR, 2021

2020
Semi-supervised Deep Embedded Clustering with Anomaly Detection for Semantic Frame Induction.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020


  Loading...