Karthik Valmeekam

CoRR, April, 2025

A Systematic Evaluation of the Planning and Scheduling Abilities of the Reasoning Model o1.

Trans. Mach. Learn. Res., 2025

On the self-verification limitations of large language models on reasoning and planning tasks.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

Robust Planning with Compound LLM Architectures: An LLM-Modulo Approach.

CoRR, 2024

Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1.

CoRR, 2024

LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on PlanBench.

CoRR, 2024

Robust Planning with LLM-Modulo Framework: Case Study in Travel Planning.

CoRR, 2024

LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks.

CoRR, 2024

Chain of Thoughtlessness? An Analysis of CoT in Planning.

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Position: LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023

Can Large Language Models Really Improve by Self-critiquing Their Own Plans?

Matthew Marquez

CoRR, 2023

On the Planning Abilities of Large Language Models (A Critical Investigation with a Proposed Benchmark).

Sarath Sreedharan

Matthew Marquez

Alberto Olmo Hernandez

CoRR, 2023

On the Planning Abilities of Large Language Models - A Critical Investigation.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change.

Matthew Marquez

Alberto Olmo Hernandez

Sarath Sreedharan

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Relative Behavioral Attributes: Filling the Gap between Symbolic Goal Specification and Reward Learning from Human Preferences.

Lin Guan

Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022

Large Language Models Still Can't Plan (A Benchmark for LLMs on Planning and Reasoning about Change).

Alberto Olmo Hernandez

Sarath Sreedharan