Karthik Valmeekam

According to our database, Karthik Valmeekam authored at least 16 papers between 2021 and 2024.

Collaborative distances:
  • Dijkstra number of five.
  • Erdős number of four.

Bibliography

2024
Planning in Strawberry Fields: Evaluating and Improving the Planning and Scheduling Capabilities of LRM o1.
CoRR, 2024

LLMs Still Can't Plan; Can LRMs? A Preliminary Evaluation of OpenAI's o1 on PlanBench.
CoRR, 2024

Robust Planning with LLM-Modulo Framework: Case Study in Travel Planning.
CoRR, 2024

Chain of Thoughtlessness: An Analysis of CoT in Planning.
CoRR, 2024

On the Self-Verification Limitations of Large Language Models on Reasoning and Planning Tasks.
CoRR, 2024

LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks.
CoRR, 2024

Position: LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

2023
Can Large Language Models Really Improve by Self-critiquing Their Own Plans?
CoRR, 2023

On the Planning Abilities of Large Language Models (A Critical Investigation with a Proposed Benchmark).
CoRR, 2023

On the Planning Abilities of Large Language Models - A Critical Investigation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Planning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Relative Behavioral Attributes: Filling the Gap between Symbolic Goal Specification and Reward Learning from Human Preferences.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
Large Language Models Still Can't Plan (A Benchmark for LLMs on Planning and Reasoning about Change).
CoRR, 2022

RADAR-X: An Interactive Mixed Initiative Planning Interface Pairing Contrastive Explanations and Revised Plan Suggestions.
Proceedings of the Thirty-Second International Conference on Automated Planning and Scheduling, 2022

2021
RADAR-X: An Interactive Interface Pairing Contrastive Explanations with Revised Plan Suggestions.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

