We stand with Ukraine

We stand with Ukraine

Lester James V. Miranda

Orcid: 0000-0002-7872-6464

According to our database¹, Lester James V. Miranda authored at least 23 papers between 2018 and 2025.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

Online presence:

on orcid.org

On csauthors.net:

Bibliography

2025

FilBench: Can LLMs Understand and Generate Filipino?

[BibT_eX]

[DOI]

Lester James V. Miranda

,

,

,

Jan Christian Blaise Cruz

,

Joseph Marvin Imperial

CoRR, August, 2025

R3: Robust Rubric-Agnostic Reward Models.

[BibT_eX]

[DOI]

,

,

Lester James V. Miranda

,

,

Mohammad Rifqi Farhansyah

,

,

,

Genta Indra Winata

CoRR, May, 2025

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia.

[BibT_eX]

[DOI]

Samuel Cahyawijaya

,

,

Joel Ruben Antony Moniz

,

,

Mohammad Rifqi Farhansyah

,

Thant Thiri Maung

,

Frederikus Hudi

,

,

Muhammad Ravi Shulthan Habibi

,

Muhammad Reza Qorib

,

,

Joseph Marvin Imperial

,

Hitesh Laxmichand Patel

,

,

Bahrul Ilmi Nasution

,

Manuel Antonio Rufino

,

Genta Indra Winata

,

Rian Adam Rajagede

,

Carlos Rafael Catalan

,

Mohamed Fazli Imam

,

Priyaranjan Pattnayak

,

Salsabila Zahirah Pranida

,

,

,

Adisai Na-Thalang

,

Patricia Nicole Monderin

,

,

Christian Simon

,

Lynnette Hui Xian Ng

,

Richardy Lobo' Sapan

,

Taki Hasan Rafi

,

,

,

Kanyakorn Veerakanjana

,

Piyalitt Ittichaiwong

,

Matthew Theodore Roque

,

Karissa Vincentio

,

Takdanai Kreangphet

,

Phakphum Artkaew

,

Kadek Hendrawan Palgunadi

,

,

Rochana Prih Hastuti

,

,

,

Adrian Xuan Wei Lim

,

Aye Hninn Khine

,

Hanif Muhammad Zhafran

,

,

Audra Aurora Izzani

,

,

,

Jauza Akbar Krito

,

Michael Anugraha

,

Fenal Ashokbhai Ilasariya

,

,

John Amadeo Daniswara

,

Filbert Aurelian Tjiaranata

,

Eryawan Presma Yulianrifat

,

Can Udomcharoenchaikit

,

Fadil Risdian Ansori

,

Mahardika Krisna Ihsani

,

,

Anab Maulana Barik

,

Dan John Velasco

,

Rifo Ahmad Genadi

,

,

,

,

Kenneth Ko Han Chen

,

Anjela Gail Santos

,

,

,

,

Meisyarah Dwiastuti

,

,

Jan Christian Blaise Cruz

,

,

Ikhlasul Akmal Hanif

,

M. Alif Al Hakim

,

Muhammad Rizky Sya'ban

,

Kun Kerdthaisong

,

Lester James V. Miranda

,

,

Tirana Noor Fatyanosa

,

Alham Fikri Aji

,

Jostin Jerico Rosal

,

,

,

Onno P. Kampman

,

,

Börje F. Karlsson

,

Peerat Limkonchotiwat

CoRR, March, 2025

MMTEB: Massive Multilingual Text Embedding Benchmark.

[BibT_eX]

[DOI]

Kenneth C. Enevoldsen

,

,

,

,

,

,

,

,

Dominik Krzeminski

,

Genta Indra Winata

,

,

,

Mathieu Ciancone

,

Marion Schaeffer

,

Gabriel Sequeira

,

,

,

Jonathan Rystrøm

,

Roman Solomatin

,

,

,

Martin Bernstorff

,

,

Akshita Sukhlecha

,

,

,

Kranthi Kiran GV

,

,

,

Björn Plüster

,

Jan Philipp Harries

,

,

,

Mariya Hendriksen

,

,

Hippolyte Gisserot-Boukhlef

,

,

,

Konrad Wojtasik

,

,

,

,

,

,

Andrianos Michail

,

,

,

Aleksei Vatolin

,

,

,

,

Pranjal A. Chitale

,

Simone Tedeschi

,

,

,

Michael Günther

,

,

,

,

,

Gayatri Krishnakumar

,

,

,

Maria Tikhonova

,

,

Aleksandr Abramov

,

Malte Ostendorff

,

,

Simon Clematide

,

Lester James V. Miranda

,

Alena Fenogenova

,

,

Ruqiya Bin Safi

,

,

Alessia Borghini

,

Federico Cassano

,

,

,

,

,

,

,

Vaibhav Adlakha

,

,

,

Niklas Muennighoff

CoRR, February, 2025

2 OLMo 2 Furious.

[BibT_eX]

[DOI]

CoRR, January, 2025

RewardBench: Evaluating Reward Models for Language Modeling.

[BibT_eX]

[DOI]

,

Valentina Pyatkin

,

,

Lester James V. Miranda

,

Bill Yuchen Lin

,

Khyathi Raghavi Chandu

,

,

,

,

,

,

Hannaneh Hajishirzi

Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback.

[BibT_eX]

[DOI]

Lester James Validad Miranda

,

,

,

,

Valentina Pyatkin

,

,

,

Hannaneh Hajishirzi

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

M-RewardBench: Evaluating Reward Models in Multilingual Settings.

[BibT_eX]

[DOI]

,

Lester James Validad Miranda

,

Shayekh Bin Islam

,

Rishabh Maheshwary

,

,

Gusti Triandi Winata

,

,

Sebastian Ruder

,

,

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia.

[BibT_eX]

[DOI]

Samuel Cahyawijaya

,

,

Joel Ruben Antony Moniz

,

,

Mohammad Rifqi Farhansyah

,

Thant Thiri Maung

,

Frederikus Hudi

,

,

Muhammad Ravi Shulthan Habibi

,

Muhammad Reza Qorib

,

,

Joseph Marvin Imperial

,

Hitesh Laxmichand Patel

,

,

Bahrul Ilmi Nasution

,

Manuel Antonio Rufino

,

Genta Indra Winata

,

Rian Adam Rajagede

,

Carlos Rafael Catalan

,

Mohamed Fazli Mohamed Imam

,

Priyaranjan Pattnayak

,

Salsabila Zahirah Pranida

,

,

,

Adisai Na-Thalang

,

Patricia Nicole Monderin

,

,

Christian Simon

,

Lynnette Hui Xian Ng

,

Richardy Lobo' Sapan

,

Taki Hasan Rafi

,

,

,

Kanyakorn Veerakanjana

,

Piyalitt Ittichaiwong

,

Matthew Theodore Roque

,

Karissa Vincentio

,

Takdanai Kreangphet

,

Phakphum Artkaew

,

Kadek Hendrawan Palgunadi

,

,

Rochana Prih Hastuti

,

,

,

Adrian Xuan Wei Lim

,

Aye Hninn Khine

,

Hanif Muhammad Zhafran

,

,

Audra Aurora Izzani

,

,

,

Jauza Akbar Krito

,

Michael Anugraha

,

Fenal Ashokbhai Ilasariya

,

,

John Amadeo Daniswara

,

Filbert Aurelian Tjiaranata

,

Eryawan Presma Yulianrifat

,

Can Udomcharoenchaikit

,

Fadil Risdian Ansori

,

Mahardika Krisna Ihsani

,

,

Anab Maulana Barik

,

Dan John Velasco

,

Rifo Ahmad Genadi

,

,

,

Isaiah Edri W. Flores

,

Kenneth Ko Han Chen

,

Anjela Gail Santos

,

,

,

,

Meisyarah Dwiastuti

,

,

Jan Christian Blaise Cruz

,

,

Ikhlasul Akmal Hanif

,

M. Alif Al Hakim

,

Muhammad Rizky Sya'ban

,

Kun Kerdthaisong

,

Lester James Validad Miranda

,

,

Tirana Noor Fatyanosa

,

Alham Fikri Aji

,

Jostin Jerico Rosal

,

,

,

Onno P. Kampman

,

,

Börje F. Karlsson

,

Peerat Limkonchotiwat

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

The UD-NewsCrawl Treebank: Reflections and Challenges from a Large-scale Tagalog Syntactic Annotation Project.

[BibT_eX]

[DOI]

Angelina Aspra Aquino

,

Lester James Validad Miranda

,

Elsie Marie T. Or

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training.

[BibT_eX]

[DOI]

CoRR, 2024

Consent in Crisis: The Rapid Decline of the AI Data Commons.

[BibT_eX]

[DOI]

CoRR, 2024

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.

[BibT_eX]

[DOI]

,

Rahmad Mahendra

,

Salsabil Maulana Akbar

,

Lester James V. Miranda

,

Jennifer Santoso

,

,

Akhdan Fadhilah

,

Jonibek Mansurov

,

Joseph Marvin Imperial

,

Onno Pepijn Kampman

,

Joel Ruben Antony Moniz

,

Muhammad Ravi Shulthan Habibi

,

Frederikus Hudi

,

Railey Montalan

,

,

Joanito Agili Lopo

,

,

Börje F. Karlsson

,

,

Ryandito Diandaru

,

,

Patrick Amadeus Irawan

,

,

Jan Christian Blaise Cruz

,

Chenxi Whitehouse

,

Ivan Halim Parmonangan

,

,

,

,

Reynard Adha Ryanda

,

Sonny Lazuardi Hermawan

,

Dan John Velasco

,

Muhammad Dehan Al Kautsar

,

Willy Fitra Hendria

,

,

,

Muhammad Farid Adilazuarda

,

,

,

,

,

Muhammad Reza Qorib

,

Amirbek Djanibekov

,

,

,

Niklas Muennighoff

,

Tanrada Pansuwan

,

Ilham Firdausi Putra

,

,

,

Ayu Purwarianti

,

Sebastian Ruder

,

William-Chandra Tjhi

,

Peerat Limkonchotiwat

,

Alham Fikri Aji

,

,

Genta Indra Winata

,

,

,

,

Samuel Cahyawijaya

CoRR, 2024

Consent in Crisis: The Rapid Decline of the AI Data Commons.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark.

[BibT_eX]

[DOI]

,

,

,

,

,

Joseph Marvin Imperial

,

Börje Karlsson

,

,

Nikola Ljubesic

,

Lester James V. Miranda

,

,

,

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.

[BibT_eX]

[DOI]

,

Rahmad Mahendra

,

Salsabil Maulana Akbar

,

Lester James V. Miranda

,

Jennifer Santoso

,

,

Akhdan Fadhilah

,

Jonibek Mansurov

,

Joseph Marvin Imperial

,

,

Joel Ruben Antony Moniz

,

Muhammad Ravi Shulthan Habibi

,

Frederikus Hudi

,

Jann Railey Montalan

,

Ryan Hadiwijaya

,

Joanito Agili Lopo

,

,

Börje Karlsson

,

,

Ryandito Diandaru

,

,

Patrick Amadeus Irawan

,

,

Jan Christian Blaise Cruz

,

Chenxi Whitehouse

,

Ivan Halim Parmonangan

,

,

,

,

Reynard Adha Ryanda

,

Sonny Lazuardi Hermawan

,

Dan John Velasco

,

Muhammad Dehan Al Kautsar

,

Willy Fitra Hendria

,

,

,

Muhammad Farid Adilazuarda

,

,

,

,

,

Muhammad Reza Qorib

,

Amirbek Djanibekov

,

,

,

Niklas Muennighoff

,

Tanrada Pansuwan

,

Ilham Firdausi Putra

,

,

,

Ayu Purwarianti

,

Sebastian Ruder

,

William-Chandra Tjhi

,

Peerat Limkonchotiwat

,

Alham Fikri Aji

,

,

Genta Indra Winata

,

,

,

,

Samuel Cahyawijaya

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

2023

calamanCy: A Tagalog Natural Language Processing Toolkit.

[BibT_eX]

[DOI]

Lester James V. Miranda

CoRR, 2023

Developing a Named Entity Recognition Dataset for Tagalog.

[BibT_eX]

[DOI]

Lester James V. Miranda

CoRR, 2023

2022

Multi hash embeddings in spaCy.

[BibT_eX]

[DOI]

Lester James V. Miranda

,

,

,

Sofie Van Landeghem

,

Anders Søgaard

,

Matthew Honnibal

CoRR, 2022

2019

Geomancer: An Open-Source Framework for Geospatial Feature Engineering.

[BibT_eX]

[DOI]

Lester James V. Miranda

,

Mark Steve Samson

,

Alfiero K. Orden II

,

Bianca S. Silmaro

,

Ram K. De Guzman III

,

Stephanie S. Sy

CoRR, 2019

2018

PySwarms: a research toolkit for Particle Swarm Optimization in Python.

[BibT_eX]

[DOI]

Lester James V. Miranda

J. Open Source Softw., 2018

Feature Extraction Using a Mutually-Competitive Autoencoder for Protein Function Prediction.

[BibT_eX]

[DOI]

Lester James V. Miranda

,

Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2018

A Deep Learning Approach Based on Stacked Denoising Autoencoders for Protein Function Prediction.

[BibT_eX]

[DOI]

Lester James V. Miranda

,

Proceedings of the 2018 IEEE 42nd Annual Computer Software and Applications Conference, 2018

Loading...