Armin Mustafa

Orcid: 0000-0002-1779-2775

According to our database1, Armin Mustafa authored at least 42 papers between 2011 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Boosting Camera Motion Control for Video Diffusion Transformers.
CoRR, 2024

RenDetNet: Weakly-supervised Shadow Detection with Shadow Caster Verification.
CoRR, 2024

Attend-Fusion: Efficient Audio-Visual Fusion for Video Classification.
CoRR, 2024

Single-image coherent reconstruction of objects and humans.
CoRR, 2024

NarrativeBridge: Enhancing Video Captioning with Causal-Temporal Narrative.
CoRR, 2024

An Effective-Efficient Approach for Dense Multi-Label Action Detection.
CoRR, 2024

CAD - Contextual Multi-modal Alignment for Dynamic AVQA.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

ForecasterFlexOBM: A Multi-View Audio-Visual Dataset for Flexible Object-Based Media Production.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Max-AST: Combining Convolution, Local and Global Self-Attentions for Audio Event Classification.
Proceedings of the IEEE International Conference on Acoustics, 2024

CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing.
Proceedings of the Computer Vision - ECCV 2024, 2024

S3R-Net: A Single-Stage Approach to Self-Supervised Shadow Removal.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet.
CoRR, 2023

PAT: Position-Aware Transformer for Dense Multi-Label Action Detection.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

SEM-POS: Grammatically and Semantically Correct Video Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
4D Temporally Coherent Multi-Person Semantic Reconstruction and Segmentation.
Int. J. Comput. Vis., 2022

Pose Guided Multi-person Image Generation From Text.
CoRR, 2022

KPE: Keypoint Pose Encoding for Transformer-based Image Generation.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

There and Back Again: 3D Sign Language Generation from Text Using Back-Translation.
Proceedings of the International Conference on 3D Vision, 2022

2021
Temporally Coherent General Dynamic Scene Reconstruction.
Int. J. Comput. Vis., 2021

Temporal Consistency Loss for High Resolution Textured and Clothed 3DHuman Reconstruction from Monocular Video.
CoRR, 2021

Multi-Person Implicit Reconstruction From a Single Image.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Temporal Consistency Loss for High Resolution Textured and Clothed 3D Human Reconstruction From Monocular Video.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

SILT: Self-supervised Lighting Transfer Using Implicit Image Decomposition.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Light Field Video for Immersive Content Production.
Proceedings of the Adversarial and Uncertain Reasoning for Adaptive Cyber Defense, 2020

Semantically Coherent 4D Scene Flow of Dynamic Scenes.
Int. J. Comput. Vis., 2020

RealMonoDepth: Self-Supervised Monocular Depth Estimation for General Scenes.
CoRR, 2020

A*3D Dataset: Towards Autonomous Driving in Challenging Environments.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Multi-view Consistency Loss for Improved Single-Image 3D Reconstruction of Clothed People.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
MSFD: Multi-Scale Segmentation-Based Feature Detection for Wide-Baseline Scene Reconstruction.
IEEE Trans. Image Process., 2019

Learning Dense Wide Baseline Stereo Matching for People.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

U4D: Unsupervised 4D Dynamic Scene Understanding.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Light Field Compression using Eigen Textures.
Proceedings of the 2019 International Conference on 3D Vision, 2019

2017
General 4D dynamic scene reconstruction from multiple view video.
PhD thesis, 2017

Semantically Coherent Co-Segmentation and Reconstruction of Dynamic Scenes.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

4D Temporally Coherent Light-Field Video.
Proceedings of the 2017 International Conference on 3D Vision, 2017

2016
4D Match Trees for Non-rigid Surface Alignment.
Proceedings of the Computer Vision - ECCV 2016, 2016

Temporally Coherent 4D Reconstruction of Complex Dynamic Scenes.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
General Dynamic Scene Reconstruction from Multiple View Video.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Segmentation Based Features for Wide-Baseline Multi-view Reconstruction.
Proceedings of the 2015 International Conference on 3D Vision, 2015

2011
Background Reflectance Modeling for Robust Finger Gesture Detection in Highly Dynamic Illumination.
Proceedings of the Convergence and Hybrid Information Technology, 2011


  Loading...