Dima Damen

ORCID: 0000-0001-8804-6238

According to our database, Dima Damen authored at least 131 papers between 2007 and 2024.

Bibliography

2024
An Outlook into the Future of Egocentric Vision.
Int. J. Comput. Vis., November, 2024

It's Just Another Day: Unique Video Captioning by Discriminative Prompting.
CoRR, 2024

HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision.
CoRR, 2024

Spatial Cognition from Egocentric Video: Out of Sight, Not Out of Mind.
CoRR, 2024

Every Shot Counts: Using Exemplars for Repetition Counting in Videos.
CoRR, 2024

Video Editing for Video Retrieval.
CoRR, 2024

Rank2Reward: Learning Shaped Reward Functions from Passive Video.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

AMEGO: Active Memory from Long EGOcentric Videos.
Proceedings of the Computer Vision - ECCV 2024, 2024

GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

TIM: A Time Interval Machine for Audio-Visual Action Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Learning from One Continuous Video Stream.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Editorial: Special Section on Egocentric Perception.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Get a Grip: Reconstructing Hand-Object Stable Grasps in Egocentric Videos.
CoRR, 2023

Perception Test 2023: A Summary of the First Challenge And Outcome.
CoRR, 2023

EPIC Fields: Marrying 3D Geometry and Video Understanding.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Perception Test: A Diagnostic Benchmark for Multimodal Video Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

What can a cook in Italy teach a mechanic in India? Action Recognition Generalisation Over Scenarios and Locations.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Play It Back: Iterative Attention For Audio Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2023

Epic-Sounds: A Large-Scale Dataset of Actions that Sound.
Proceedings of the IEEE International Conference on Acoustics, 2023

The Wisdom of Crowds: Temporal Progressive Attention for Early Action Prediction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Use Your Head: Improving Long-Tail Video Recognition.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Centre Stage: Centricity-based Audio-Visual Temporal Action Detection.
Proceedings of the 34th British Machine Vision Conference Workshop Proceedings, 2023

Learning Temporal Sentence Grounding From Narrated EgoVideos.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Rescaling Egocentric Vision: Collection, Pipeline and Challenges for EPIC-KITCHENS-100.
Int. J. Comput. Vis., 2022

Egocentric video summarisation via purpose-oriented frame scoring and selection.
Expert Syst. Appl., 2022

Inertial Hallucinations - When Wearable Inertial Devices Start Seeing Things.
CoRR, 2022

Egocentric Video-Language Pretraining @ Ego4D Challenge 2022.
CoRR, 2022

An Evaluation of OCR on Egocentric Data.
CoRR, 2022

Egocentric Video-Language Pretraining.
CoRR, 2022

Temporal Progressive Attention for Early Action Prediction.
CoRR, 2022

TVNet: Temporal Voting Network for Action Localization.
Proceedings of the 17th International Joint Conference on Computer Vision, 2022

Egocentric Video-Language Pretraining.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

UnweaveNet: Unweaving Activity Stories.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Dual-Domain Image Synthesis using Segmentation-Guided GAN.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Refining Action Boundaries for One-stage Detection.
Proceedings of the 18th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2022

Hand-Object Interaction Reasoning.
Proceedings of the 18th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2022

ConTra: (Con)text (Tra)nsformer for Cross-Modal Video Retrieval.
Proceedings of the Computer Vision - ACCV 2022, 2022

2021
The EPIC-KITCHENS Dataset: Collection, Challenges and Baselines.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Integration of Experts' and Beginners' Machine Operation Experiences to Obtain a Detailed Task Model.
IEICE Trans. Inf. Syst., 2021

Domain Adaptation in Multi-View Embedding for Cross-Modal Video Retrieval.
CoRR, 2021

Ego4D: Around the World in 3,000 Hours of Egocentric Video.
CoRR, 2021

No Need for a Lab: Towards Multi-sensory Fusion for Ambient Assisted Living in Real-world Living Homes.
Proceedings of the 16th International Joint Conference on Computer Vision, 2021

Slow-Fast Auditory Streams for Audio Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021

On Semantic Similarity in Video Retrieval.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Temporal-Relational CrossTransformers for Few-Shot Action Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Efficient and Robust Skeleton-Based Quality Assessment and Abnormality Detection in Human Action Performance.
IEEE J. Biomed. Health Informatics, 2020

Person Re-ID by Fusion of Video Silhouettes and Wearable Signals for Home Monitoring Applications.
Sensors, 2020

Supervision Levels Scale (SLS).
CoRR, 2020

Rescaling Egocentric Vision.
CoRR, 2020

Human-Centric Object Interactions - A Fine-Grained Perspective from Egocentric Videos.
Proceedings of HuMA'20: the 1st International Workshop on Human-centric Multimedia Analysis, 2020

Centroids Triplet Network and Temporally-Consistent Embeddings for In-Situ Object Recognition.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Action Modifiers: Learning From Adverbs in Instructional Videos.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Play Fair: Frame Attributions in Video Models.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Meta-Learning with Context-Agnostic Initialisations.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
DS-KCF: a real-time tracker for RGB-D data.
J. Real Time Image Process., 2019

An Evaluation of Action Recognition Models on EPIC-Kitchens.
CoRR, 2019

Hotspots Integrating of Expert and Beginner Experiences of Machine Operations through Egocentric Vision.
Proceedings of the 16th International Conference on Machine Vision Applications, 2019

Learning Discriminative Embeddings for Object Recognition on-the-fly.
Proceedings of the International Conference on Robotics and Automation, 2019

Sit-to-Stand Analysis in the Wild Using Silhouettes for Longitudinal Health Monitoring.
Proceedings of the Image Analysis and Recognition - 16th International Conference, 2019

Deep Compact Person Re-Identification with Distractor Synthesis via Guided DC-GANs.
Proceedings of the Image Analysis and Processing - ICIAP 2019, 2019

Retro-Actions: Learning 'Close' by Time-Reversing 'Open' Videos.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Multi-Modal Domain Adaptation for Fine-Grained Action Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Who Goes There? Exploiting Silhouettes and Wearable Signals for Subject Identification in Multi-Person Environments.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

EPIC-Tent: An Egocentric Video Dataset for Camping Tent Assembly.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Weakly-Supervised Completion Moment Detection using Temporal Attention.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

Fine-Grained Action Retrieval Through Multiple Parts-of-Speech Embeddings.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

A Fine-grained Perspective onto Object Interactions from First-person Views.
Proceedings of the 14th International Joint Conference on Computer Vision, 2019

DDLSTM: Dual-Domain LSTM for Cross-Dataset Action Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Action Recognition From Single Timestamp Supervision in Untrimmed Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

The Pros and Cons: Rank-Aware Temporal Attention for Skill Determination in Long Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Learning Visual Actions Using Multiple Verb-Only Labels.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

2018
Depth-Based Whole Body Photoplethysmography in Remote Pulmonary Function Testing.
IEEE Trans. Biomed. Eng., 2018

Energy expenditure estimation using visual and inertial sensors.
IET Comput. Vis., 2018

A Guide to the SPHERE 100 Homes Study Dataset.
CoRR, 2018

Scaling Egocentric Vision: The EPIC-KITCHENS Dataset.
CoRR, 2018

Instance-level Object Recognition Using Deep Temporal Coherence.
Proceedings of the Advances in Visual Computing - 13th International Symposium, 2018

Human Routine Change Detection using Bayesian Modelling.
Proceedings of the 24th International Conference on Pattern Recognition, 2018

Markerless Active Trunk Shape Modelling for Motion Tolerant Remote Respiratory Assessment.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Semantically Selective Augmentation for Deep Compact Person Re-Identification.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Scaling Egocentric Vision: The EPIC-KITCHENS Dataset.
Proceedings of the Computer Vision - ECCV 2018, 2018

Towards an Unequivocal Representation of Actions.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Who's Better? Who's Best? Pairwise Deep Ranking for Skill Determination.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

CaloriNet: From silhouettes to calorie estimation in private environments.
Proceedings of the British Machine Vision Conference 2018, 2018

Action Completion: A Temporal Model for Moment Detection.
Proceedings of the British Machine Vision Conference 2018, 2018

2017
Remote, Depth-Based Lung Function Assessment.
IEEE Trans. Biomed. Eng., 2017

Multiple human tracking in RGB-depth data: a survey.
IET Comput. Vis., 2017

Detecting the Moment of Completion: Temporal Models for Localising Action Completion.
CoRR, 2017

Improving Classification by Improving Labelling: Introducing Probabilistic Multi-Label Object Interaction Recognition.
CoRR, 2017

Who's Better, Who's Best: Skill Determination in Video using Deep Ranking.
CoRR, 2017

Hotspots detection for machine operation in egocentric vision.
Proceedings of the Fifteenth IAPR International Conference on Machine Vision Applications, 2017

Recurrent Assistance: Cross-Dataset Training of LSTMs on Kitchen Tasks.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Trespassing the Boundaries: Labeling Temporal Bounds for Object Interactions in Egocentric Video.
Proceedings of the IEEE International Conference on Computer Vision, 2017

A general descriptor for detecting abnormal action performance from skeletal data.
Proceedings of the 2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2017

Unsupervised Long-Term Routine Modelling Using Dynamic Bayesian Networks.
Proceedings of the 2017 International Conference on Digital Image Computing: Techniques and Applications, 2017

A Dataset for Persistent Multi-target Multi-camera Tracking in RGB-D.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Automated capture and delivery of assistive task guidance with an eyewear computer: the GlaciAR system.
Proceedings of the 8th Augmented Human International Conference, 2017

2016
A comparative study of pose representation and dynamics modelling for online motion quality assessment.
Comput. Vis. Image Underst., 2016

You-Do, I-Learn: Egocentric unsupervised discovery of objects and their modes of interaction towards video-based guidance.
Comput. Vis. Image Underst., 2016

Multiple Human Tracking in RGB-D Data: A Survey.
CoRR, 2016

SEMBED: Semantic Embedding of Egocentric Action Videos.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Beyond Action Recognition: Action Completion in RGB-D Data.
Proceedings of the British Machine Vision Conference 2016, 2016

Calorie Counter: RGB-Depth Visual Estimation of Energy Expenditure at Home.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

3D Data Acquisition and Registration Using Two Opposing Kinects.
Proceedings of the Fourth International Conference on 3D Vision, 2016

2015
Cognitive Robotics Systems - Concepts and Applications.
J. Intell. Robotic Syst., 2015

Correspondence, Matching and Recognition.
Int. J. Comput. Vis., 2015

You-Do, I-Learn: Unsupervised Multi-User egocentric Approach Towards Video-Based Guidance.
CoRR, 2015

Estimating visual attention from a head mounted IMU.
Proceedings of the 2015 ACM International Symposium on Wearable Computers, 2015

Efficient Texture-less Object Detection for Augmented Reality Guidance.
Proceedings of the 2015 IEEE International Symposium on Mixed and Augmented Reality Workshops, 2015

A multi-modal sensor infrastructure for healthcare in a residential environment.
Proceedings of the IEEE International Conference on Communication, 2015

A comparative home activity monitoring study using visual and inertial sensors.
Proceedings of the 17th International Conference on E-health Networking, 2015

Real-time RGB-D Tracking with Depth Scaling Kernelised Correlation Filters and Occlusion Handling.
Proceedings of the British Machine Vision Conference 2015, 2015

Remote pulmonary function testing using a depth sensor.
Proceedings of the IEEE Biomedical Circuits and Systems Conference, 2015

Unsupervised daily routine modelling from a depth sensor using top-down and bottom-up hierarchies.
Proceedings of the 3rd IAPR Asian Conference on Pattern Recognition, 2015

2014
Multi-User Egocentric Online System for Unsupervised Assistance on Object Usage.
Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014

Online quality assessment of human motion from skeleton data.
Proceedings of the British Machine Vision Conference, 2014

You-Do, I-Learn: Discovering Task Relevant Objects and their Modes of Interaction from Multi-User Egocentric Video.
Proceedings of the British Machine Vision Conference, 2014

2012
Detecting Carried Objects from Sequences of Walking Pedestrians.
IEEE Trans. Pattern Anal. Mach. Intell., 2012

Explaining Activities as Consistent Groups of Events - A Bayesian Framework Using Attribute Multiset Grammars.
Int. J. Comput. Vis., 2012

Integrating 3D object detection, modelling and tracking on a mobile phone.
Proceedings of the 11th IEEE International Symposium on Mixed and Augmented Reality, 2012

Egocentric Real-time Workspace Monitoring using an RGB-D camera.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Real-time Learning and Detection of 3D Texture-less Objects: A Scalable Approach.
Proceedings of the British Machine Vision Conference, 2012

2009
Activity analysis: finding explanations for sets of events.
PhD thesis, 2009

Recognizing linked events: Searching the space of feasible explanations.
Proceedings of the 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 2009

Attribute Multiset Grammars for Global Explanations of Activities.
Proceedings of the British Machine Vision Conference, 2009

2008
Detecting Carried Objects in Short Video Sequences.
Proceedings of the Computer Vision, 2008

2007
Associating People Dropping off and Picking up Objects.
Proceedings of the British Machine Vision Conference 2007, 2007

