Copilot Arena: A Platform for Code LLM Evaluation in the Wild.
CoRR, February, 2025
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks.
CoRR, 2024
The Impact of Element Ordering on LM Agent Performance.
CoRR, 2024
Enabling Limited Resource-Bounded Disjunction in Scheduling.
J. Aerosp. Inf. Syst., June, 2021
Analyzing the effectiveness of rescheduling and Flexible Execution methods to address uncertainty in execution duration for a planetary rover.
Robotics Auton. Syst., 2021
Symbolic Music Generation with Transformer-GANs.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021
Automated Volcano Monitoring Using Multiple Space and Ground Sensors.
J. Aerosp. Inf. Syst., April, 2020
Generating Music with a Self-Correcting Non-Chronological Autoregressive Model.
Proceedings of the 21st International Society for Music Information Retrieval Conference, 2020
Scheduling with Complex Consumptive Resources for a Planetary Rover.
Proceedings of the Thirtieth International Conference on Automated Planning and Scheduling, 2020
Temporal Brittleness Analysis of Task Networks for Planetary Rovers.
Proceedings of the Twenty-Ninth International Conference on Automated Planning and Scheduling, 2019
Optimizing Parameters for Uncertain Execution and Rescheduling Robustness.
Proceedings of the Twenty-Ninth International Conference on Automated Planning and Scheduling, 2019
Embedding a Scheduler in Execution for a Planetary Rover.
Proceedings of the Twenty-Eighth International Conference on Automated Planning and Scheduling, 2018