2025
Seurat: From Moving Points to Depth.
CoRR, April, 2025

Exploring Temporally-Aware Features for Point Tracking.
CoRR, January, 2025

DiffFace: Diffusion-based Face Swapping with Facial Guidance.
Pattern Recognit., 2025

Multi-Granularity Video Object Segmentation.
Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence (AAAI-25), February 25, 2025

2024
Pose-Diversified Augmentation with Diffusion Model for Person Re-Identification.
CoRR, 2024

Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels.
Proceedings of the Advances in Neural Information Processing Systems 37: Annual Conference on Neural Information Processing Systems 2024, 2024

Unifying Feature and Cost Aggregation with Transformers for Semantic and Visual Correspondence.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

Local All-Pair Correspondence for Point Tracking.
Proceedings of the Computer Vision - ECCV 2024, 2024

CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

FlowTrack: Revisiting Optical Flow for Long-Range Dense Tracking.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
CATs++: Boosting Cost Aggregation With Convolutions and Transformers.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

DäRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth Adaptation.
CoRR, 2023

CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation.
CoRR, 2023

DäRF: Boosting Radiance Fields from Sparse Input Views with Monocular Depth Adaptation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

LANIT: Language-Driven Image-to-Image Translation for Unlabeled Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

MIDMs: Matching Interleaved Diffusion Models for Exemplar-Based Image Translation.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
DiffFace: Diffusion-based Face Swapping with Facial Guidance.
CoRR, 2022

Integrative Feature and Cost Aggregation with Transformers for Dense Correspondence.
CoRR, 2022

AggMatch: Aggregating Pseudo Labels for Semi-Supervised Learning.
CoRR, 2022

Neural Matching Fields: Implicit Representation of Matching Fields for Visual Correspondence.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Cost Aggregation with 4D Convolutional Swin Transformer for Few-Shot Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
Cost Aggregation Is All You Need for Few-Shot Segmentation.
CoRR, 2021

Semantic Correspondence with Transformers.
CoRR, 2021

CATs: Cost Aggregation Transformers for Visual Correspondence.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021