2025
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning.
CoRR, March, 2025

Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control.
CoRR, March, 2025

Cosmos World Foundation Model Platform for Physical AI.
CoRR, January, 2025

Test-time Backdoor Mitigation for Black-Box Large Language Models with Defensive Demonstrations.
Proceedings of the Findings of the Association for Computational Linguistics: NAACL 2025, Albuquerque, New Mexico, USA, April 29, 2025

2024
FAN-Unet: Enhancing Unet with vision Fourier Analysis Block for Biomedical Image Segmentation.
CoRR, 2024

Edify 3D: Scalable High-Quality 3D Asset Generation.
CoRR, 2024

Med-TTT: Vision Test-Time Training model for Medical Image Segmentation.
CoRR, 2024

Securing Multi-turn Conversational Language Models Against Distributed Backdoor Triggers.
CoRR, 2024

BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation.
CoRR, 2024

HC-Mamba: Vision MAMBA with Hybrid Convolutional Techniques for Medical Image Segmentation.
CoRR, 2024

Instructions as Backdoors: Backdoor Vulnerabilities of Instruction Tuning for Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Instructional Fingerprinting of Large Language Models.
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), 2024

Comparison and Application of FPGA and ASIC in Digital System Design.
Proceedings of the International Conference on Decision Science & Management, 2024

Securing Multi-turn Conversational Language Models From Distributed Backdoor Attacks.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2024, 2024

BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Mitigating Backdoor Threats to Large Language Models: Advancement and Challenges.
Proceedings of the 60th Annual Allerton Conference on Communication, 2024

2023
DreamDistribution: Prompt Distribution Learning for Text-to-Image Diffusion Models.
CoRR, 2023

Test-time Backdoor Mitigation for Black-Box Large Language Models with Defensive Demonstrations.
CoRR, 2023

Beyond Generation: Harnessing Text to Image Models for Object Detection and Segmentation.
CoRR, 2023

Mixup Feature: A Pretext Task Self-Supervised Learning Method for Enhanced Visual Feature Learning.
IEEE Access, 2023

MATEKG: A Large-scale Multi-class Equipment Knowledge Graph for Military Auxiliary Tasks.
Proceedings of the 6th IEEE International Conference on Information Systems and Computer Aided Educatio, 2023

Can NLI Provide Proper Indirect Supervision for Low-resource Biomedical Relation Extraction?
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023

2022
Surgical gestures as a method to quantify surgical performance and predict patient outcomes.
npj Digit. Medicine, 2022

EM-Paste: EM-guided Cut-Paste with DALL-E Augmentation for Image-level Weakly Supervised Instance Segmentation.
CoRR, 2022

Self-supervised Model Based on Masked Autoencoders Advance CT Scans Classification.
CoRR, 2022

DALL-E for Detection: Language-driven Context Image Synthesis for Object Detection.
CoRR, 2022

Unified Semantic Typing with Meaningful Label Inference.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022

X-Norm: Exchanging Normalization Parameters for Bimodal Fusion.
Proceedings of the International Conference on Multimodal Interaction, 2022

Neural-Sim: Learning to Generate Training Data with NeRF.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
SalKG: Learning From Knowledge Graph Explanations for Commonsense Reasoning.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

2020
Deep Learning Segmentation and Classification for Urban Village Using a Worldview Satellite Image Based on U-Net.
Remote. Sens., 2020