Kianté Brantley

Orcid: 0000-0002-8395-594X

According to our database1, Kianté Brantley authored at least 27 papers between 2015 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Diffusing States and Matching Scores: A New Framework for Imitation Learning.
CoRR, 2024

LLMs Are In-Context Reinforcement Learners.
CoRR, 2024

Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF.
CoRR, 2024

REBEL: Reinforcement Learning via Regressing Relative Rewards.
CoRR, 2024

Dataset Reset Policy Optimization for RLHF.
CoRR, 2024

RL for Consistency Models: Faster Reward Guided Text-to-Image Generation.
CoRR, 2024

A Surprising Failure? Multimodal LLMs and the NLVR Challenge.
CoRR, 2024

Reviewer2: Optimizing Review Generation Through Prompt Generation.
CoRR, 2024

Ranking with Long-Term Constraints.
Proceedings of the 17th ACM International Conference on Web Search and Data Mining, 2024

RL for Consistency Models: Reward Guided Text-to-Image Generation with Fast Inference.
RLJ, 2024

Coactive Learning for Large Language Models using Implicit User Feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

When is Transfer Learning Possible?
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Adversarial Imitation Learning via Boosting.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
Policy-Gradient Training of Language Models for Ranking.
CoRR, 2023

Learning to Generate Better Than Your LLM.
CoRR, 2023

Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Interactive Text Generation.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

lilGym: Natural Language Visual Reasoning with Reinforcement Learning.
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2021
Expert-in-the-Loop for Sequential Decisions and Predictions.
PhD thesis, 2021

Successor Feature Sets: Generalizing Successor Representations Across Policies.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Constrained episodic reinforcement learning in concave-convex and knapsack settings.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

Disagreement-Regularized Imitation Learning.
Proceedings of the 8th International Conference on Learning Representations, 2020

Active Imitation Learning with Noisy Guidance.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

2019
Reinforcement Learning with Convex Constraints.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Non-Monotonic Sequential Text Generation.
Proceedings of the 36th International Conference on Machine Learning, 2019

2017
The UMD Neural Machine Translation Systems at WMT17 Bandit Learning Task.
Proceedings of the Second Conference on Machine Translation, 2017

2015
LDAExplore: Visualizing Topic Models Generated Using Latent Dirichlet Allocation.
CoRR, 2015


  Loading...