Jaeyoung Do

Orcid: 0000-0003-1275-1621

Affiliations:
  • Microsoft Research


According to our database1, Jaeyoung Do authored at least 48 papers between 2009 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

Online presence:

On csauthors.net:

Bibliography

2026
MI-CXR: A Benchmark for Longitudinal Reasoning over Multi-Interval Chest X-rays.
CoRR, May, 2026

Report of the 5th PVUW Challenge: Towards More Diverse Modalities in Pixel-Level Understanding.
CoRR, April, 2026

Dynin-Omni: Omnimodal Unified Large Diffusion Language Model.
CoRR, April, 2026

MEDIC-AD: Towards Medical Vision-Language Model's Clinical Intelligence.
CoRR, March, 2026

VIRST: Video-Instructed Reasoning Assistant for SpatioTemporal Segmentation.
CoRR, March, 2026

3rd Place of MeViS-Audio Track of the 5th PVUW: VIRST-Audio.
CoRR, March, 2026

Is Retraining-Free Enough? The Necessity of Router Calibration for Efficient MoE Compression.
CoRR, March, 2026

RFEval: Benchmarking Reasoning Faithfulness under Counterfactual Reasoning Intervention in Large Reasoning Models.
CoRR, February, 2026

MATA: Multi-Agent Framework for Reliable and Flexible Table Question Answering.
CoRR, February, 2026

VALUEFLOW: Toward Pluralistic and Steerable Value-based Alignment in Large Language Models.
CoRR, February, 2026

2025
MatKV: Trading Compute for Flash Storage in LLM Inference.
CoRR, December, 2025

Exploring and Leveraging Class Vectors for Classifier Editing.
CoRR, October, 2025

MMPB: It's Time for Multi-Modal Personalization.
CoRR, September, 2025

Turbocharging Vector Databases using Modern SSDs.
Proc. VLDB Endow., July, 2025

Don't Let It Fade: Preserving Edits in Diffusion Language Models via Token Timestep Allocation.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2025, 2025

SECOND: Mitigating Perceptual Hallucination in Vision-Language Models via Selective and Contrastive Decoding.
Proceedings of the Forty-second International Conference on Machine Learning, 2025

MathReader : Text-to-Speech for Mathematical Documents.
Proceedings of the 2025 IEEE International Conference on Acoustics, 2025

MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence, 2025

2024
MathBridge: A Large Corpus Dataset for Translating Spoken Mathematical Expressions into LaTeX Formulas for Improved Readability.
CoRR, 2024

AscleAI: A LLM-based Clinical Note Management System for Enhancing Clinician Productivity.
Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2024

Aligning Large Language Models via Fine-grained Supervision.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2024

2023
Extending and Programming the NVMe I/O Determinism Interface for Flash Arrays.
ACM Trans. Storage, February, 2023

Accelerating Large-Scale Graph-Based Nearest Neighbor Search on a Computational Storage Platform.
IEEE Trans. Computers, 2023

Data Augmentation for Improving Tail-traffic Robustness in Skill-routing for Dialogue Systems.
CoRR, 2023

Weakly Supervised Referring Image Segmentation with Intra-Chunk and Inter-Chunk Consistency.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Grounding Counterfactual Explanation of Image Classifiers to Textual Concept Space.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Large-scale Lifelong Learning of In-context Instructions and How to Tackle It.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

Scalable and Safe Remediation of Defective Actions in Self-Learning Conversational Systems.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 5: Industry Track), 2023

2022
Accelerating Large-Scale Graph-based Nearest Neighbor Search on a Computational Storage Platform.
CoRR, 2022

A Dual-Mode Similarity Search Accelerator based on Embedding Compression for Online Cross-Modal Image-Text Retrieval.
Proceedings of the 30th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2022

Debiasing Neighbor Aggregation for Graph Neural Network in Recommender Systems.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021
Better database cost/performance via batched I/O on programmable SSD.
VLDB J., 2021

Programming an SSD Controller to Support Batched Writes for Variable-Size Pages.
Proceedings of the 37th IEEE International Conference on Data Engineering, 2021

Accelerating Large-Scale Nearest Neighbor Search with Computational Storage Device.
Proceedings of the 29th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines, 2021

Computational Storage: Where Are We Today?
Proceedings of the 11th Conference on Innovative Data Systems Research, 2021

2020
Cost-effective, Energy-efficient, and Scalable Storage Computing for Large-scale AI Applications.
ACM Trans. Storage, 2020

ALEX: An Updatable Adaptive Learned Index.
Proceedings of the 2020 International Conference on Management of Data, 2020

Lessons learned from the early performance evaluation of Intel optane DC persistent memory in DBMS.
Proceedings of the 16th International Workshop on Data Management on New Hardware, 2020

2019
Programmable solid-state storage in future cloud datacenters.
Commun. ACM, 2019

Improving CPU I/O Performance via SSD Controller FTL Support for Batched Writes.
Proceedings of the 15th International Workshop on Data Management on New Hardware, 2019

2018
MPP 2018 Keynote.
Proceedings of the 2018 IEEE International Parallel and Distributed Processing Symposium Workshops, 2018

2016
Aggressive buffer pool warm-up after restart in SQL Server.
Proceedings of the 32nd IEEE International Conference on Data Engineering Workshops, 2016

2014
Query Processing on Smart SSDs.
IEEE Data Eng. Bull., 2014

2013
Query processing on smart SSDs: opportunities and challenges.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2013

Fast peak-to-peak behavior with SSD buffer pool.
Proceedings of the 29th IEEE International Conference on Data Engineering, 2013

2011
Turbocharging DBMS buffer pool using SSDs.
Proceedings of the ACM SIGMOD International Conference on Management of Data, 2011

2009
Fast Statistical Alignment.
PLoS Comput. Biol., 2009

Join processing for flash SSDs: remembering past lessons.
Proceedings of the Fifth International Workshop on Data Management on New Hardware, 2009


  Loading...