Joseph Marvin Imperial

Orcid: 0000-0003-1073-6129

Affiliations:
  • University of Bath, UK
  • National University College of Computer Studies, Manila, Philippines


According to our database, Joseph Marvin Imperial authored at least 51 papers between 2019 and 2025.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2025
FilBench: Can LLMs Understand and Generate Filipino?
CoRR, August, 2025

UniversalCEFR: Enabling Open Multilingual Research on Language Proficiency Assessment.
CoRR, June, 2025

Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation.
CoRR, April, 2025

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia.
CoRR, March, 2025

AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons.
CoRR, March, 2025

Standardizing Intelligence: Aligning Generative AI for Regulatory and Operational Compliance.
CoRR, March, 2025


Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

Natural Language Generation with Expert Standards.
Proceedings of the AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25, 2025

2024
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages.
CoRR, 2024

Risks and Opportunities of Open-Source Generative AI.
CoRR, 2024

Near to Mid-term Risks and Opportunities of Open Source Generative AI.
CoRR, 2024

Introducing v0.5 of the AI Safety Benchmark from MLCommons.
CoRR, 2024

Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024



SpeciaLex: A Benchmark for In-Context Specialized Lexicon Learning.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

Standardize: Aligning Language Models with Expert-Defined Standards for Content Generation.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024


2023
Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark.
CoRR, 2023

Flesch or Fumble? Evaluating Readability Standard Alignment of Instruction-Tuned Language Models.
CoRR, 2023

Predicting the Use Behavior of Higher Education Students on ChatGPT: Evidence from the Philippines.
Proceedings of the IEEE International Conference on Teaching, 2023

CebuaNER: A New Baseline Cebuano Named Entity Recognition Model.
Proceedings of the 37th Pacific Asia Conference on Language, 2023

Discovering Insights via Hybrid Thematic Analysis: A Case Study on Disaster Risk Reduction and Management for Legazpi City, Albay.
Proceedings of the Machine Learning and Artificial Intelligence, 2023

Uniform Complexity for Text Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

BasahaCorpus: An Expanded Linguistic Resource for Readability Assessment in Central Philippine Languages.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

Automatic Readability Assessment for Closely Related Languages.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Uniform Complexity for Text Generation.
CoRR, 2022

A Baseline Readability Model for Cebuano.
CoRR, 2022

NU HLT at CMCL 2022 Shared Task: Multilingual and Crosslingual Prediction of Human Reading Behavior in Universal Language Space.
CoRR, 2022

Changing Topics for Changing Times: Thematic and Temporal-Based Analysis of the Philippine Senatorial and Midterm Elections.
Proceedings of the 2022 6th International Conference on Natural Language Processing and Information Retrieval, 2022

WikAnalytics: A Web-based Application for Identifying Linguistic Features of a Text Group Supporting Filipino, English, and Taglish Languages.
Proceedings of the 5th International Conference on Machine Learning and Machine Intelligence, 2022

Is Twitter an Echo Chamber? Connecting Online Public Sentiments to Actual Results From the 2019 Philippine Midterm Elections.
Proceedings of the International Conference on Asian Language Processing, 2022

On Applicability of Neural Language Models for Readability Assessment in Filipino.
Proceedings of the Artificial Intelligence in Education. Posters and Late Breaking Results, Workshops and Tutorials, Industry and Innovation Tracks, Practitioners' and Doctoral Consortium, 2022

2021
How Do Pedophiles Tweet? Investigating the Writing Styles and Online Personas of Child Cybersex Traffickers in the Philippines.
CoRR, 2021

Knowledge-Rich BERT Embeddings for Readability Assessment.
CoRR, 2021

A Simple Post-Processing Technique for Improving Readability Assessment of Texts using Word Mover's Distance.
CoRR, 2021

Application of Lexical Features Towards Improvement of Filipino Readability Identification of Children's Literature.
CoRR, 2021

BERT Embeddings for Automatic Readability Assessment.
Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), 2021

Science Mapping of Publications in Natural Language Processing in the Philippines: 2006 to 2020.
Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation, 2021

Under the Microscope: Interpreting Readability Assessment Models for Filipino.
Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation, 2021

Cross-Textual Analysis of COVID-19 Tweets: On Themes and Trends Over Time.
Proceedings of Sixth International Congress on Information and Communication Technology, 2021

Diverse Linguistic Features for Assessing Reading Difficulty of Educational Filipino Texts.
Proceedings of the 29th International Conference on Computers in Education, 2021

Audio-Based Hate Speech Classification from Online Short-Form Videos.
Proceedings of the International Conference on Asian Language Processing, 2021

Deploying Kalahok 1.0: Profiling Disaster-Stricken Communities Towards Intervention Initiatives.
Proceedings of the IEEE Global Humanitarian Technology Conference, 2021

2020
Semi-automatic Construction of Sight Words Dictionary for Filipino Text Readability.
Proceedings of the Knowledge Management and Acquisition for Intelligent Systems, 2020

A Simple Disaster-Related Knowledge Base for Intelligent Agents.
Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation, 2020

Exploring Hybrid Linguistic Feature Sets to Measure Filipino Text Readability.
Proceedings of the International Conference on Asian Language Processing, 2020

2019
Sentiment Analysis of Typhoon Related Tweets using Standard and Bidirectional Recurrent Neural Networks.
CoRR, 2019

An experimental Tagalog Finite State Automata spellchecker with Levenshtein edit-distance feature.
Proceedings of the International Conference on Asian Language Processing, 2019

Developing a machine learning-based grade level classifier for Filipino children's literature.
Proceedings of the International Conference on Asian Language Processing, 2019

