Robust Function-Calling for On-Device Language Model via Function Masking.
,
,
,
,
,
,
,
,
,
,
,
Proceedings of the Thirteenth International Conference on Learning Representations, 2025
Stochastic <i>k</i>-Submodular Bandits with Full Bandit Feedback.
CoRR, 2024
Hammer: Robust Function-Calling for On-Device Language Models via Function Masking.
,
,
,
,
,
,
,
,
,
,
,
CoRR, 2024
Gradient Methods for Online DR-Submodular Maximization with Stochastic Long-Term Constraints.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024
Size-constrained k-submodular maximization in near-linear time.
Proceedings of the Uncertainty in Artificial Intelligence, 2023
A Framework for Adapting Offline Algorithms to Solve Combinatorial Multi-Armed Bandit Problems with Bandit Feedback.
Proceedings of the International Conference on Machine Learning, 2023
An explore-then-commit algorithm for submodular maximization under full-bandit feedback.
Proceedings of the Uncertainty in Artificial Intelligence, 2022