Zheng Xin Yong
This page is a disambiguation page, it actually contains mutiple papers from persons of the same or a similar name.
Bibliography
2025
Can We Predict Alignment Before Models Finish Thinking? Towards Monitoring Misaligned Reasoning Models.
CoRR, July, 2025
Effects of Speaker Count, Duration, and Accent Diversity on Zero-Shot Accent Robustness in Low-Resource ASR.
CoRR, June, 2025
Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability.
CoRR, June, 2025
The State of Multilingual LLM Safety Research: From Measuring the Language Gap to Mitigating It.
CoRR, May, 2025
Improving LLM First-Token Predictions in Multiple-Choice Question Answering via Prefilling Attack.
CoRR, May, 2025
CoRR, February, 2025
Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025
2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.
CoRR, 2024
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024
2023
Prompting Multilingual Large Language Models to Generate Code-Mixed Texts: The Case of South East Asian Languages.
CoRR, 2023
Representativeness as a Forgotten Lesson for Multilingual and Code-switched Data Collection and Preparation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
2022
An Exploration of Neural Radiance Field Scene Reconstruction: Synthetic, Real-world and Dynamic Scenes.
CoRR, 2022
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts.
CoRR, 2022
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022
2021
2020
Semi-supervised Deep Embedded Clustering with Anomaly Detection for Semantic Frame Induction.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020