Michael Krumdick

According to our database1, Michael Krumdick authored at least 16 papers between 2017 and 2026.

Collaborative distances:

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
FrontierFinance: A Long-Horizon Computer-Use Benchmark of Real-World Financial Tasks.
CoRR, April, 2026

Cost-Efficient Estimation of General Abilities Across Benchmarks.
CoRR, April, 2026

2025
On Finding Inconsistencies in Documents.
CoRR, December, 2025

Complexity Scaling Laws for Neural Models using Combinatorial Optimization.
CoRR, June, 2025

BLEUBERI: BLEU is a surprisingly effective reward for instruction following.
CoRR, May, 2025

No Free Labels: Limitations of LLM-as-a-Judge Without Human Grounding.
CoRR, March, 2025

Language Model Probabilities are Not Calibrated in Numeric Contexts.
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2025

2024
Are Language Model Logits Calibrated?
CoRR, 2024

SEC-QA: A Systematic Evaluation Corpus for Financial QA.
CoRR, 2024

An Analysis of Multilingual FActScore.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

DocFinQA: A Long-Context Financial Reasoning Dataset.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), 2024

BizBench: A Quantitative Reasoning Benchmark for Business and Finance.
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2024

2023
A Graphical Approach to Document Layout Analysis.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

2022
Watermarking for Data Provenance in Object Detection.
Proceedings of the 51st IEEE Applied Imagery Pattern Recognition Workshop, 2022

2020
APRICOT: A Dataset of Physical Adversarial Attacks on Object Detection.
Proceedings of the Computer Vision - ECCV 2020, 2020

2017
Recognition of Image-Orientation-Based Iris Spoofing.
IEEE Trans. Inf. Forensics Secur., 2017


  Loading...