Dhawal Gupta

Orcid: 0000-0002-2486-866X

According to our database1, Dhawal Gupta authored at least 18 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
A Safe Exploration Strategy for Model-free Task Adaptation in Safety-constrained Grid Environments.
CoRR, 2024

Adaptive Switching Based Data-Communication Model for Internet of Healthcare Things Networks.
IEEE Access, 2024

ICU-Sepsis: A Benchmark MDP Built from Real Medical Data.
RLJ, 2024

Mitigating the Curse of Horizon in Monte-Carlo Returns.
RLJ, 2024

From Past to Future: Rethinking Eligibility Traces.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Exploring the impact of low-rank adaptation on the performance, efficiency, and regularization of RLHF.
CoRR, 2023

Coagent Networks: Generalized and Scaled.
CoRR, 2023

Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management.
CoRR, 2023

Offline Reinforcement Learning for Mixture-of-Expert Dialogue Management.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Behavior Alignment via Reward Function Optimization.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

A Mixture-of-Expert Approach to RL-based Dialogue Management.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2021
A Unified Dialogue Management Strategy for Multi-intent Dialogue Conversations in Multiple Languages.
ACM Trans. Asian Low Resour. Lang. Inf. Process., 2021

A hierarchical approach for efficient multi-intent dialogue policy learning.
Multim. Tools Appl., 2021

Emotion Aided Dialogue Act Classification for Task-Independent Conversations in a Multi-modal Framework.
Cogn. Comput., 2021

Structural Credit Assignment in Neural Networks using Reinforcement Learning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Towards integrated dialogue policy learning for multiple domains and intents using Hierarchical Deep Reinforcement Learning.
Expert Syst. Appl., 2020

Gradient Temporal-Difference Learning with Regularized Corrections.
Proceedings of the 37th International Conference on Machine Learning, 2020

2018
Reinforcement Learning Based Dialogue Management Strategy.
Proceedings of the Neural Information Processing - 25th International Conference, 2018


  Loading...