Aditya Gopalan

Orcid: 0000-0002-7323-2975

According to our database1, Aditya Gopalan authored at least 80 papers between 2010 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Towards Reliable Alignment: Uncertainty-aware RLHF.
CoRR, 2024

Testing the Feasibility of Linear Programs with Bandit Feedback.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Model-Based Best Arm Identification for Decreasing Bandits.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

A Unified Framework for Discovering Discrete Symmetries.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2024

2023
Data flow dissemination in a network.
Queueing Syst. Theory Appl., December, 2023

When are Bandits Robust to Misspecification?
CoRR, 2023

On the Minimax Regret for Linear Bandits in a wide variety of Action Spaces.
CoRR, 2023

WiROS: A QoS Software Solution for ros2 in a WiFi Network.
Proceedings of the 15th International Conference on COMmunication Systems & NETworkS, 2023

Bregman Deviations of Generic Exponential Families.
Proceedings of the Thirty Sixth Annual Conference on Learning Theory, 2023

Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference.
Proceedings of the International Conference on Artificial Intelligence and Statistics, 2023

2022
Sequential Multi-Hypothesis Testing in Multi-Armed Bandit Problems: An Approach for Asymptotic Optimality.
IEEE Trans. Inf. Theory, 2022

Actor-Critic based Improper Reinforcement Learning.
CoRR, 2022

Approximate Q-learning and SARSA(0) under the ε-greedy Policy: a Differential Inclusion Analysis.
CoRR, 2022

Adaptive Estimation of Random Vectors with Bandit Feedback.
CoRR, 2022

Improved Pure Exploration in Linear Bandits with No-Regret Learning.
Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, 2022

Actor-Critic based Improper Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2022

2021
On Slowly-varying Non-stationary Bandits.
CoRR, 2021

Better than the Best: Gradient-based Improper Reinforcement Learning for Network Scheduling.
CoRR, 2021

Improper Learning with Gradient-based Policy Optimization.
CoRR, 2021

Bandit Quickest Changepoint Detection.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021


Reinforcement Learning in Parametric MDPs with Exponential Families.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

No-regret Algorithms for Multi-task Bayesian Optimization.
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 2021

2020
Reduced-State, Optimal Scheduling for Decentralized Medium Access Control of a Class of Wireless Networks.
IEEE/ACM Trans. Netw., 2020

Stability and Scalability of Blockchain Systems.
Proc. ACM Meas. Anal. Comput. Syst., 2020

Stochastic Linear Bandits with Protected Subspace.
CoRR, 2020

Explicit Best Arm Identification in Linear Bandits Using No-Regret Learners.
CoRR, 2020

How Reliable are Test Numbers for Revealing the COVID-19 Ground Truth and Applying Interventions?
CoRR, 2020

Throughput Optimal Decentralized Scheduling with Single-bit State Feedback for a Class of Queueing Systems.
CoRR, 2020

On the Latency in Vehicular Control using Video Streaming over Wi-Fi.
Proceedings of the 2020 National Conference on Communications, 2020

From PAC to Instance-Optimal Sample Complexity in the Plackett-Luce Model.
Proceedings of the 37th International Conference on Machine Learning, 2020

CORNET: A Co-Simulation Middleware for Robot Networks.
Proceedings of the 2020 International Conference on COMmunication Systems & NETworkS, 2020

Best-item Learning in Random Utility Models with Subset Choices.
Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2020

Sequential Mode Estimation with Oracle Queries.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

On Adaptivity in Information-Constrained Online Learning.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Towards Optimal and Efficient Best Arm Identification in Linear Bandits.
CoRR, 2019

On Batch Bayesian Optimization.
CoRR, 2019

Regret Minimisation in Multinomial Logit Bandits.
CoRR, 2019

Combinatorial Bandits with Relative Feedback.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Bayesian Optimization under Heavy-tailed Payoffs.
Proceedings of the Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Systems 2019, 2019

Learning to Detect an Anomalous Target with Observations from an Exponential Family.
Proceedings of the IEEE Data Science Workshop, 2019

PAC Battling Bandits in the Plackett-Luce Model.
Proceedings of the Algorithmic Learning Theory, 2019

Active Ranking with Subset-wise Preferences.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

Online Learning in Kernelized Markov Decision Processes.
Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics, 2019

2018
Collaborative Learning of Stochastic Bandits Over a Social Network.
IEEE/ACM Trans. Netw., 2018

On the Whittle Index for Restless Multiarmed Hidden Markov Bandits.
IEEE Trans. Autom. Control., 2018

PAC-Battling Bandits with Plackett-Luce: Tradeoff between Sample Complexity and Subset Size.
CoRR, 2018

Battle of Bandits.
Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence, 2018

On a Class of Restless Multi-armed Bandits with Deterministic Policies.
Proceedings of the 2018 International Conference on Signal Processing and Communications (SPCOM), 2018

Reduced-State, Optimal Medium Access Control for Wireless Data Collection Networks.
Proceedings of the 2018 IEEE Conference on Computer Communications, 2018

Online Learning for Structured Loss Spaces.
Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, 2018

2017
Optimal Odd Arm Identification with Fixed Confidence.
CoRR, 2017

When to arrive in a congested system: Achieving equilibrium via learning algorithm.
Proceedings of the 15th International Symposium on Modeling and Optimization in Mobile, 2017

On Kernelized Multi-armed Bandits.
Proceedings of the 34th International Conference on Machine Learning, 2017

A Hidden Markov Restless Multi-armed Bandit Model for Playout Recommendation Systems.
Proceedings of the Communication Systems and Networks - 9th International Conference, 2017

Restless bandits that hide their hand and recommendation systems.
Proceedings of the 9th International Conference on Communication Systems and Networks, 2017

Weighted Bandits or: How Bandits Learn Distorted Values That Are Not Expected.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

Misspecified Linear Bandits.
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017

2016
On the Whittle Index for Restless Multi-armed Hidden Markov Bandits.
CoRR, 2016

Stochastic bandits on a social network: Collaborative learning with local information sharing.
CoRR, 2016

Low-rank Bandits with Latent Mixtures.
CoRR, 2016

Optimizing distributed actor systems for dynamic interactive services.
Proceedings of the Eleventh European Conference on Computer Systems, 2016

Optimal recommendation to users that react: Online learning for a class of POMDPs.
Proceedings of the 55th IEEE Conference on Decision and Control, 2016

Randomised Procedures for Initialising and Switching Actions in Policy Iteration.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Wireless scheduling with partial channel state information: large deviations and optimality.
Queueing Syst. Theory Appl., 2015

Optimal WiFi sensing via dynamic programming.
Proceedings of the 13th International Symposium on Modeling and Optimization in Mobile, 2015

Thompson Sampling for Learning Parameterized Markov Decision Processes.
Proceedings of The 28th Conference on Learning Theory, 2015

A restless bandit with no observable states for recommendation systems and communication link scheduling.
Proceedings of the 54th IEEE Conference on Decision and Control, 2015

2014
Epidemic Spreading With External Agents.
IEEE Trans. Inf. Theory, 2014

Thompson Sampling for Learning Parameterized MDPs.
CoRR, 2014

Thompson Sampling for Complex Online Problems.
Proceedings of the 31th International Conference on Machine Learning, 2014

2013
On the Value of Coordination and Delayed Queue Information in Multicellular Scheduling.
IEEE Trans. Autom. Control., 2013

Thompson Sampling for Complex Bandit Problems.
CoRR, 2013

Thompson Sampling for Online Learning with Linear Experts.
CoRR, 2013

2012
On Wireless Scheduling With Partial Channel-State Information.
IEEE Trans. Inf. Theory, 2012

On distributed scheduling with heterogeneously delayed network-state information.
Queueing Syst. Theory Appl., 2012

Low-delay wireless scheduling with partial channel-state information.
Proceedings of the IEEE INFOCOM 2012, Orlando, FL, USA, March 25-30, 2012, 2012

2011
Random mobility and the spread of infection.
Proceedings of the INFOCOM 2011. 30th IEEE International Conference on Computer Communications, 2011

User rankings from comparisons: Learning permutations in high dimensions.
Proceedings of the 49th Annual Allerton Conference on Communication, 2011

2010
Wireless scheduling with heterogeneously delayed network-state information.
Proceedings of the 48th Annual Allerton Conference on Communication, 2010


  Loading...