Mubashara Akhtar

Orcid: 0009-0003-6346-2392

According to our database1, Mubashara Akhtar authored at least 22 papers between 2022 and 2026.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book  In proceedings  Article  PhD thesis  Dataset  Other 

Links

On csauthors.net:

Bibliography

2026
Benchmarking and Enhancing Text-to-Image Models for Generating Visual Representations in Early Arithmetic Education.
CoRR, May, 2026

When AI Benchmarks Plateau: A Systematic Study of Benchmark Saturation.
CoRR, February, 2026

Ev2R: Evaluating Evidence Retrieval in Automated Fact-Checking.
Trans. Assoc. Comput. Linguistics, 2026

Efficient Test-Time Scaling of Multi-Step Reasoning by Probing Internal States of Large Language Models.
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2026

2025
Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads.
CoRR, November, 2025

Who Evaluates AI's Social Impacts? Mapping Coverage and Gaps in First and Third Party Evaluations.
CoRR, November, 2025

Compose and Fuse: Revisiting the Foundational Bottlenecks in Multimodal Reasoning.
CoRR, September, 2025

Chimera: Diagnosing Shortcut Learning in Visual-Language Understanding.
CoRR, September, 2025

LEXam: Benchmarking Legal Reasoning on 340 Law Exams.
CoRR, May, 2025

AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
CoRR, March, 2025

TANQ: An Open Domain Dataset of Table Answered Questions.
Trans. Assoc. Comput. Linguistics, 2025

2024
The Automated Verification of Textual Claims (AVeriTeC) Shared Task.
CoRR, 2024

A Standardized Machine-readable Dataset Documentation Format for Responsible AI.
CoRR, 2024

Croissant: A Metadata Format for ML-Ready Datasets.
CoRR, 2024



ChartCheck: Explainable Fact-Checking over Real-World Chart Images.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
ChartCheck: An Evidence-Based Fact-Checking Dataset over Real-World Chart Images.
CoRR, 2023

Multimodal Automated Fact-Checking: A Survey.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Exploring the Numerical Reasoning Capabilities of Language Models: A Comprehensive Analysis on Tabular Data.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2023, 2023

Reading and Reasoning over Chart Images for Evidence-based Automated Fact-Checking.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

2022
PubHealthTab: A Public Health Table-based Dataset for Evidence-based Fact Checking.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2022, 2022


  Loading...