Karthik Valmeekam
According to our database1,
Karthik Valmeekam
authored at least 21 papers
between 2021 and 2025.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2025
CoRR, May, 2025
CoRR, May, 2025
A Systematic Evaluation of the Planning and Scheduling Abilities of the Reasoning Model o1.
Trans. Mach. Learn. Res., 2025
On the self-verification limitations of large language models on reasoning and planning tasks.
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
2024
Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1.
CoRR, 2024
LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on PlanBench.
CoRR, 2024
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
2023
CoRR, 2023
On the Planning Abilities of Large Language Models (A Critical Investigation with a Proposed Benchmark).
CoRR, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Relative Behavioral Attributes: Filling the Gap between Symbolic Goal Specification and Reward Learning from Human Preferences.
Proceedings of the Eleventh International Conference on Learning Representations, 2023
2022
Large Language Models Still Can't Plan (A Benchmark for LLMs on Planning and Reasoning about Change).
CoRR, 2022
RADAR-X: An Interactive Mixed Initiative Planning Interface Pairing Contrastive Explanations and Revised Plan Suggestions.
Proceedings of the Thirty-Second International Conference on Automated Planning and Scheduling, 2022
2021
RADAR-X: An Interactive Interface Pairing Contrastive Explanations with Revised Plan Suggestions.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021