Souradip Chakraborty

According to our database1, Souradip Chakraborty authored at least 34 papers between 2020 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
On the Sample Complexity of a Policy Gradient Algorithm with Occupancy Approximation for General Utility Reinforcement Learning.
CoRR, 2024

AIME: AI System Optimization via Multiple LLM Evaluators.
CoRR, 2024

Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?
CoRR, 2024

SAIL: Self-Improving Efficient Online Alignment of Large Language Models.
CoRR, 2024

Is poisoning a real threat to LLM alignment? Maybe more so than you think.
CoRR, 2024

DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning.
CoRR, 2024

Transfer Q Star: Principled Decoding for LLM Alignment.
CoRR, 2024

Provably Sample Efficient RLHF via Active Preference Optimization.
CoRR, 2024

On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities.
CoRR, 2024

MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences.
CoRR, 2024

Beyond Text: Improving LLM's Decision Making for Robot Navigation via Vocal Cues.
CoRR, 2024

MaxMin-RLHF: Alignment with Diverse Human Preferences.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Position: On the Possibilities of AI-Generated Text Detection.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

PARL: A Unified Framework for Policy Alignment in Reinforcement Learning from Human Feedback.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023
A Survey on the Possibilities & Impossibilities of AI-generated Text Detection.
Trans. Mach. Learn. Res., 2023

REBEL: A Regularization-Based Solution for Reward Overoptimization in Reinforcement Learning from Human Feedback.
CoRR, 2023

Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey.
CoRR, 2023

Aligning Agent Policy with Externalities: Reward Design via Bilevel RL.
CoRR, 2023

Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in Multi-Agent RL.
CoRR, 2023

On the Possibilities of AI-Generated Text Detection.
CoRR, 2023

RE-MOVE: An Adaptive Policy Design Approach for Dynamic Environments via Language-Based Feedback.
CoRR, 2023

Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policy Optimization.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

STEERING : Stein Information Directed Exploration for Model-Based Reinforcement Learning.
Proceedings of the International Conference on Machine Learning, 2023

Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies.
CoRR, 2022

On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces.
Proceedings of the International Conference on Machine Learning, 2022

HTRON: Efficient Outdoor Navigation with Sparse Rewards via Heavy Tailed Adaptive Reinforce Algorithm.
Proceedings of the Conference on Robot Learning, 2022

2020
Learning Representation for Mixed Data Types with a Nonlinear Deep Encoder-Decoder Framework.
CoRR, 2020

Transformers at SemEval-2020 Task 11: Propaganda Fragment Detection Using Diversified BERT Architectures Based Ensemble Learning.
Proceedings of the Fourteenth Workshop on Semantic Evaluation, 2020

Graph Spectral Feature Learning for Mixed Data of Categorical and Numerical Type.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

FairMixRep: Self-supervised Robust Representation Learning for Heterogeneous Data with Fairness constraints.
Proceedings of the 20th International Conference on Data Mining Workshops, 2020

G-SimCLR: Self-Supervised Contrastive Learning with Guided Projection via Pseudo Labelling.
Proceedings of the 20th International Conference on Data Mining Workshops, 2020

BioMedBERT: A Pre-trained Biomedical Language Model for QA and IR.
Proceedings of the 28th International Conference on Computational Linguistics, 2020


  Loading...