Task Vectors are Cross-Modal.
CoRR, 2024
Secure Wearable Apps for Remote Healthcare Through Modern Cryptography.
CoRR, 2024
Shape-Guided Diffusion with Inside-Outside Attention.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024
Readout Guidance: Learning Control from Diffusion Features.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Diffusion Hyperfeatures: Searching Through Time and Space for Semantic Correspondence.
Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
G^3: Geolocation via Guidebook Grounding.
CoRR, 2022
Twitter-COMMs: Detecting Climate, COVID, and Military Multimodal Misinformation.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2022
Focus! Relevant and Sufficient Context Selection for News Image Captioning.
Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
G3: Geolocation via Guidebook Grounding.
Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media.
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021