2025
Eye2Eye: A Simple Approach for Monocular-to-Stereo Video Synthesis.
CoRR, May, 2025
DynVFX: Augmenting Real Videos with Dynamic Content.
CoRR, February, 2025
TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space.
CoRR, January, 2025
2024
Still-Moving: Customized Video Generation without Customized Video Data.
ACM Trans. Graph., December, 2024
State of the Art on Diffusion Models for Visual Computing.
Comput. Graph. Forum, May, 2024
Disentangling Structure and Appearance in ViT Feature Space.
ACM Trans. Graph., February, 2024
What Makes for a Good Stereoscopic Image?
CoRR, 2024
What's in the Image? A Deep-Dive into the Vision of Vision Language Models.
CoRR, 2024
Generative Omnimatte: Learning to Decompose Video into Layers.
CoRR, 2024
VidPanos: Generative Panoramic Videos from Casual Panning Videos.
Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024
Lumiere: A Space-Time Diffusion Model for Video Generation.
Proceedings of the SIGGRAPH Asia 2024 Conference Papers, 2024
TokenFlow: Consistent Diffusion Features for Consistent Video Editing.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
DINO-Tracker: Taming DINO for Self-supervised Point Tracking in a Single Video.
Proceedings of the Computer Vision - ECCV 2024, 2024
Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
2023
DreamSim: Learning New Dimensions of Human Visual Similarity using Synthetic Data.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
SceneScape: Text-Driven Consistent Scene Generation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023
MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation.
Proceedings of the International Conference on Machine Learning, 2023
Teaching CLIP to Count to Ten.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023
Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Neural Congealing: Aligning Images to a Joint Semantic Atlas.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
Imagic: Text-Based Real Image Editing with Diffusion Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
2022
Diverse Video Generation from a Single Video.
CoRR, 2022
Self-Distilled StyleGAN: Towards Generation from Internet Photos.
Proceedings of the SIGGRAPH '22: Special Interest Group on Computer Graphics and Interactive Techniques Conference, Vancouver, BC, Canada, August 7, 2022
Associating Objects and Their Effects in Video through Coordination Games.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Diverse Generation from a Single Video Made Possible.
Proceedings of the Computer Vision - ECCV 2022, 2022
Text2LIVE: Text-Driven Layered Image and Video Editing.
Proceedings of the Computer Vision - ECCV 2022, 2022
On the Effectiveness of ViT Features as Local Semantic Descriptors.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022
Splicing ViT Features for Semantic Appearance Transfer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
2021
Consistent depth of moving objects in video.
ACM Trans. Graph., 2021
Layered neural atlases for consistent video editing.
ACM Trans. Graph., 2021
MannequinChallenge: Learning the Depths of Moving People by Watching Frozen People.
IEEE Trans. Pattern Anal. Mach. Intell., 2021
Deep ViT Features as Dense Visual Descriptors.
CoRR, 2021
Unveiling unexpected training data in internet video.
Commun. ACM, 2021
Omnimatte: Associating Objects and Their Effects in Video.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021
2020
Layered neural rendering for retiming people in video.
ACM Trans. Graph., 2020
Semantic Pyramid for Image Generation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
SpeedNet: Learning the Speediness in Videos.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020
2019
SinGAN: Learning a Generative Model From a Single Natural Image.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019
Speech2Face: Learning the Face Behind a Voice.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Learning the Depths of Moving People by Watching Frozen People.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
2018
Looking to listen at the cocktail party: a speaker-independent audio-visual model for speech separation.
ACM Trans. Graph., 2018
Best-Buddies Similarity - Robust Template Matching Using Mutual Nearest Neighbors.
IEEE Trans. Pattern Anal. Mach. Intell., 2018
MoSculp: Interactive Visualization of Shape and Time.
Proceedings of the 31st Annual ACM Symposium on User Interface Software and Technology, 2018
Modifying Non-Local Variations Across Multiple Views.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
Sparse, Smart Contours to Represent and Edit Images.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
2017
Detecting moving regions in CrowdCam images.
Comput. Vis. Image Underst., 2017
Smart, Sparse Contours to Represent and Edit Images.
CoRR, 2017
On the Effectiveness of Visible Watermarks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017
2015
Deviation magnification: revealing departures from ideal geometries.
ACM Trans. Graph., 2015
Revealing and modifying non-local variations in a single image.
ACM Trans. Graph., 2015
Best-Buddies Similarity for robust template matching.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015
2014
Int. J. Comput. Vis., 2014
2013
Stereo Seam Carving a Geometrically Consistent Approach.
IEEE Trans. Pattern Anal. Mach. Intell., 2013
Multi-view Scene Flow Estimation: A View Centered Variational Approach.
Int. J. Comput. Vis., 2013
Space-Time Tradeoffs in Photo Sequencing.
Proceedings of the IEEE International Conference on Computer Vision, 2013
2012
Structure and motion from scene registration.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012
Racing Bib Numbers Recognition.
Proceedings of the British Machine Vision Conference, 2012
2011
Geometrically consistent stereo seam carving.
Proceedings of the IEEE International Conference on Computer Vision, 2011