Bruno C. da Silva
Orcid: 0000-0002-3708-5728Affiliations:
- University of Massachusetts, Amherst, MA, USA
- Federal University of Rio Grande do Sul (UFRGS), Institute of Informatics, Porto Alegre, Brazil (former)
According to our database1,
Bruno C. da Silva
authored at least 58 papers
between 2004 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
Online presence:
-
on orcid.org
On csauthors.net:
Bibliography
2024
Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation.
CoRR, 2024
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs.
CoRR, 2024
Mitigating the Curse of Horizon in Monte-Carlo Returns.
RLJ, 2024
Proceedings of the Forty-first International Conference on Machine Learning, 2024
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
A Toolkit for Reliable Benchmarking and Research in Multi-Objective Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
Proceedings of the 45th IEEE/ACM International Conference on Software Engineering: ICSE 2023 Companion Proceedings, 2023
Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement Prioritization.
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, 2023
2022
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Proceedings of the IEEE Symposium on Computers and Communications, 2022
Proceedings of the International Conference on Machine Learning, 2022
Optimistic Linear Support and Successor Features as a Basis for Optimal Policy Transfer.
Proceedings of the International Conference on Machine Learning, 2022
Proceedings of the Tenth International Conference on Learning Representations, 2022
RADAR: Reactive and Deliberative Adaptive Reasoning - Learning When to Think Fast and When to Think Slow.
Proceedings of the IEEE International Conference on Development and Learning, 2022
2021
Quantifying the impact of non-stationarity in reinforcement learning-based traffic signal control.
PeerJ Comput. Sci., 2021
Patterns of high-risk drinking among medical students: A web-based survey with machine learning.
Comput. Biol. Medicine, 2021
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021
Proceedings of the 38th International Conference on Machine Learning, 2021
Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection.
Proceedings of the AAMAS '21: 20th International Conference on Autonomous Agents and Multiagent Systems, 2021
2020
Knowl. Eng. Rev., 2020
CoRR, 2020
2019
Proceedings of the 19th International Conference on New Interfaces for Musical Expression, 2019
A Methodology for Neural Network Architectural Tuning Using Activation Occurrence Maps.
Proceedings of the International Joint Conference on Neural Networks, 2019
Proceedings of the Joint IEEE 9th International Conference on Development and Learning and Epigenetic Robotics, 2019
Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, 2019
2018
A task-and-technique centered survey on visual analytics for deep learning model engineering.
Comput. Graph., 2018
Proceedings of the VIII Brazilian Symposium on Computing Systems Engineering, 2018
Comparing Multi-Armed Bandit Algorithms and Q-learning for Multiagent Action Selection: a Case Study in Route Choice.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
Towards Designing Optimal Reward Functions in Multi-Agent Reinforcement Learning Problems.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018
2017
CoRR, 2017
Proceedings of the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2017
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems, 2017
2016
Using Topological Statistics to Bias and Accelerate Route Choice: Preliminary Findings in Synthetic and Real-World Road Networks.
Proceedings of the Ninth International Workshop on Agents in Traffic and Transportation (ATT 2016) co-located with the 25th International Joint Conference On Artificial Intelligence (IJCAI 2016), 2016
Proceedings of the 33nd International Conference on Machine Learning, 2016
2014
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014
Proceedings of the 31th International Conference on Machine Learning, 2014
2013
Proceedings of the International conference on Autonomous Agents and Multi-Agent Systems, 2013
2012
Proceedings of the 29th International Conference on Machine Learning, 2012
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012
2010
2007
Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2007), 2007
2006
Proceedings of the Machine Learning, 2006
Reinforcement Learning based Control of Traffic Lights in Non-stationary Environments: A Case Study in a Microscopic Simulator.
Proceedings of the 4th European Workshop on Multi-Agent Systems EUMAS'06, 2006
Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2006), 2006
Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS 2006), 2006
Proceedings of the Proceedings, 2006
2004
Proceedings of the Innovative Internet Community Systems, 4th InternationalWorkshop, 2004