2025

GuardSet-X: Massive Multi-Domain Safety Policy-Grounded Guardrail Dataset.

Mintong Kang

Zhaorun Chen

CoRR, June, 2025

ShieldAgent: Shielding Agents via Verifiable Safety Policy Reasoning.

Zhaorun Chen

Mintong Kang

Bo Li

CoRR, March, 2025

MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models.

CoRR, March, 2025

FairGen: Controlling Sensitive Attributes for Fair Generations in Diffusion Models via Adaptive Latent Guidance.

Mintong Kang

Vinayshekhar Bannihatti Kumar

Shamik Roy

Abhishek Kumar

Sopan Khosla

Balakrishnan Narayanaswamy

Rashmi Gangadharaiah

CoRR, March, 2025

MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models.

et al.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage.

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

AdvWave: Stealthy Adversarial Jailbreak Attack against Large Audio-Language Models.

Mintong Kang

Chejian Xu

Bo Li

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

R²-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning.

Mintong Kang

Bo Li

Proceedings of the Thirteenth International Conference on Learning Representations, 2025

2024

MgSvF: Multi-Grained Slow versus Fast Framework for Few-Shot Class-Incremental Learning.

IEEE Trans. Pattern Anal. Mach. Intell., March, 2024

AdvWave: Stealthy Adversarial Jailbreak Attack against Large Audio-Language Models.

Mintong Kang

Chejian Xu

Bo Li

CoRR, 2024

AdvWeb: Controllable Black-box Attacks on VLM-powered Web Agents.

CoRR, 2024

R²-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning.

Mintong Kang

Bo Li

CoRR, 2024

Certifiably Byzantine-Robust Federated Conformal Prediction.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

C-RAG: Certified Generation Risks for Retrieval-Augmented Language Models.

Proceedings of the Forty-first International Conference on Machine Learning, 2024

COLEP: Certifiably Robust Learning-Reasoning Conformal Prediction via Probabilistic Circuits.

Proceedings of the Twelfth International Conference on Learning Representations, 2024

2023

FaShapley: Fast and Approximated Shapley Based Model Pruning Towards Certifiably Robust DNNs.

Mintong Kang

Linyi Li

Bo Li

Proceedings of the 2023 IEEE Conference on Secure and Trustworthy Machine Learning, 2023

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models.

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DiffAttack: Evasion Attacks Against Diffusion-Based Adversarial Purification.

Mintong Kang

Dawn Song

Bo Li

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Label-Assemble: Leveraging Multiple Datasets with Partial Labels.

Proceedings of the 20th IEEE International Symposium on Biomedical Imaging, 2023

2022

Certifying Some Distributional Fairness with Subpopulation Decomposition.

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Fairness in Federated Learning via Core-Stability.

Bhaskar Ray Chaudhury

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

2021

Data, Assemble: Leveraging Multiple Datasets with Heterogeneous and Partial Labels.

CoRR, 2021