Koki Maeda

Orcid: 0009-0008-0529-3152

According to our database1, Koki Maeda authored at least 15 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of five.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision-Language Models.
CoRR, April, 2026

JAMMEval: A Refined Collection of Japanese Benchmarks for Reliable VLM Evaluation.
CoRR, April, 2026

JaWildText: A Benchmark for Vision-Language Models on Japanese Scene Text Understanding.
CoRR, March, 2026

From Correspondence to Actions: Human-Like Multi-Image Spatial Reasoning in Multi-modal Large Language Models.
CoRR, February, 2026

Autonomous Decentralized TRP Group Selection Method to Maximize System Throughput in Downlink Distributed MIMO with Overlapped TRP Group Configuration.
IEICE Trans. Commun., 2026

2025
Building Instruction-Tuning Datasets from Human-Written Instructions with Open-Weight Large Language Models.
CoRR, March, 2025

Constructing Multimodal Datasets from Scratch for Rapid Development of a Japanese Visual Language Model.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

LegalViz: Legal Text Visualization by Text To Diagram Generation.
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, 2025

2024
Why We Build Local Large Language Models: An Observational Analysis from 35 Japanese and Multilingual LLMs.
CoRR, 2024

Vision Language Model-based Caption Evaluation Method Leveraging Visual Context Extraction.
CoRR, 2024

Autonomous Decentralized TRP Group Selection to Maximize System Throughput in Downlink Distributed MIMO System.
Proceedings of the 100th IEEE Vehicular Technology Conference, 2024

COM Kitchens: An Unedited Overhead-View Video Dataset as a Vision-Language Benchmark.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
Query-based Image Captioning from Multi-context 360cdegree Images.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

DueT: Image-Text Contrastive Transfer Learning with Dual-adapter Tuning.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

2022
IMPARA: Impact-Based Metric for GEC Using Parallel Data.
Proceedings of the 29th International Conference on Computational Linguistics, 2022


  Loading...