Armin Mustafa

Partha Pratim Chakraborty

CoRR, 2024

Boosting Camera Motion Control for Video Diffusion Transformers.

CoRR, 2024

RenDetNet: Weakly-supervised Shadow Detection with Shadow Caster Verification.

CoRR, 2024

Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification.

CoRR, 2024

Single-image coherent reconstruction of objects and humans.

Sarthak Batra

Simon Hadfield

CoRR, 2024

NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative.

CoRR, 2024

An Effective-Efficient Approach for Dense Multi-Label Action Detection.

CoRR, 2024

CAD - Contextual Multi-modal Alignment for Dynamic AVQA.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

ForecasterFlexOBM: A Multi-View Audio-Visual Dataset for Flexible Object-Based Media Production.

Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Max-AST: Combining Convolution, Local and Global Self-Attentions for Audio Event Classification.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing.

Proceedings of the Computer Vision - ECCV 2024, 2024

S3R-Net: A Single-Stage Approach to Self-Supervised Shadow Removal.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification.

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet.

CoRR, 2023

PAT: Position-Aware Transformer for Dense Multi-Label Action Detection.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SEM-POS: Grammatically and Semantically Correct Video Captioning.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

4D Temporally Coherent Multi-Person Semantic Reconstruction and Segmentation.

Chris Russell

Int. J. Comput. Vis., 2022

Pose Guided Multi-person Image Generation From Text.

CoRR, 2022

KPE: Keypoint Pose Encoding for Transformer-based Image Generation.

Proceedings of the 33rd British Machine Vision Conference 2022, 2022

There and Back Again: 3D Sign Language Generation from Text Using Back-Translation.

Stephanie Stoll

Jean-Yves Guillemaut

Proceedings of the International Conference on 3D Vision, 2022

2021

Temporally Coherent General Dynamic Scene Reconstruction.

Int. J. Comput. Vis., 2021

Temporal Consistency Loss for High Resolution Textured and Clothed 3D Human Reconstruction from Monocular Video.

Akin Caliskan

CoRR, 2021

Multi-Person Implicit Reconstruction From a Single Image.

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Temporal Consistency Loss for High Resolution Textured and Clothed 3D Human Reconstruction From Monocular Video.

Akin Caliskan

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

SILT: Self-supervised Lighting Transfer Using Implicit Image Decomposition.

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

Light Field Video for Immersive Content Production.

Real VR - Immersive Digital Reality, 2020

Semantically Coherent 4D Scene Flow of Dynamic Scenes.

Int. J. Comput. Vis., 2020

RealMonoDepth: Self-Supervised Monocular Depth Estimation for General Scenes.

Mertalp Ocal

CoRR, 2020

A*3D Dataset: Towards Autonomous Driving in Challenging Environments.

Quang-Hieu Pham

Pierre Sevestre

Ramanpreet Singh Pahwa

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Multi-view Consistency Loss for Improved Single-Image 3D Reconstruction of Clothed People.

Proceedings of the Computer Vision - ACCV 2020, 2020

2019

MSFD: Multi-Scale Segmentation-Based Feature Detection for Wide-Baseline Scene Reconstruction.

Hansung Kim

IEEE Trans. Image Process., 2019

Learning Dense Wide Baseline Stereo Matching for People.

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

U4D: Unsupervised 4D Dynamic Scene Understanding.

Chris Russell

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Light Field Compression using Eigen Textures.

Proceedings of the 2019 International Conference on 3D Vision, 2019

2017

General 4D dynamic scene reconstruction from multiple view video.

PhD thesis, 2017

Semantically Coherent Co-Segmentation and Reconstruction of Dynamic Scenes.

Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

4D Temporally Coherent Light-Field Video.

Proceedings of the 2017 International Conference on 3D Vision, 2017

2016

4D Match Trees for Non-rigid Surface Alignment.

Hansung Kim

Proceedings of the Computer Vision - ECCV 2016, 2016

Temporally Coherent 4D Reconstruction of Complex Dynamic Scenes.

Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015

General Dynamic Scene Reconstruction from Multiple View Video.

Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Segmentation Based Features for Wide-Baseline Multi-view Reconstruction.

Proceedings of the 2015 International Conference on 3D Vision, 2015

2011

Background Reflectance Modeling for Robust Finger Gesture Detection in Highly Dynamic Illumination.
