Oswald Lanz

Orcid: 0000-0003-4793-4276

According to our database1, Oswald Lanz authored at least 95 papers between 2003 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Improving semantic video retrieval models by training with a relevance-aware online mining strategy.
Comput. Vis. Image Underst., 2024

Video Analytics for Volleyball: Preliminary Results and Future Prospects of the 5VREAL Project.
Proceedings of the Ital-IA Intelligenza Artificiale, 2024

Spatiotemporal Modeling Encounters 3D Medical Image Analysis: Slice-Shift UNet with Multi-View Fusion.
Proceedings of the 2024 7th International Conference on Machine Vision and Applications, 2024

Fractals as Pre-training Datasets for Anomaly Detection and Localization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Your Image Is My Video: Reshaping the Receptive Field via Image-to-Video Differentiable AutoAugmentation and Fusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GRASP-GCN: Graph-Shape Prioritization for Neural Architecture Search under Distribution Shifts.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Video question answering supported by a multi-task learning objective.
Multim. Tools Appl., October, 2023

Gate-Shift-Fuse for Video Action Recognition.
IEEE Trans. Pattern Anal. Mach. Intell., September, 2023

Learning to Recognize Actions on Objects in Egocentric Video With Attention Dictionaries.
IEEE Trans. Pattern Anal. Mach. Intell., June, 2023

Multi-View Video Synthesis Through Progressive Synthesis and Refinement.
Proceedings of the 18th International Joint Conference on Computer Vision, 2023

2022
Audio-Visual Tracking of Concurrent Speakers.
IEEE Trans. Multim., 2022

Inductive Attention for Video Action Anticipation.
CoRR, 2022

UniUD-FBK-UB-UniBZ Submission to the EPIC-Kitchens-100 Multi-Instance Retrieval Challenge 2022.
CoRR, 2022

NVIDIA-UNIBZ Submission for EPIC-KITCHENS-100 Action Anticipation Challenge 2022.
CoRR, 2022

Neural Turing Machines for the Remaining Useful Life estimation problem.
Comput. Ind., 2022

A Feature-space Multimodal Data Augmentation Technique for Text-video Retrieval.
Proceedings of the MM '22: The 30th ACM International Conference on Multimedia, Lisboa, Portugal, October 10, 2022

Relevance-based Margin for Contrastively-trained Video Retrieval Models.
Proceedings of the ICMR '22: International Conference on Multimedia Retrieval, Newark, NJ, USA, June 27, 2022

Unified Recurrence Modeling for Video Action Anticipation.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Higher-Order Recurrent Network with Space-Time Attention for Video Early Action Recognition.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Learning Video Retrieval Models with Relevance-Aware Online Mining.
Proceedings of the Image Analysis and Processing - ICIAP 2022, 2022

Implicit texture mapping for multi-view video synthesis.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

Estimating the Remaining Useful Life via Neural Sequence Models: a Comparative Study (short paper).
Proceedings of the 2nd Italian Workshop on Artificial Intelligence and Applications for Business and Industries (AIABI 2022) co-located with 21st International Conference of the Italian Association for Artificial Intelligence (AI*IA 2022), 2022

2021
SAIC_Cambridge-HuPBA-FBK Submission to the EPIC-Kitchens-100 Action Recognition Challenge 2021.
CoRR, 2021

Higher Order Recurrent Space-Time Transformer.
CoRR, 2021

2020
A Spatio-Temporal Multi-Scale Binary Descriptor.
IEEE Trans. Image Process., 2020

FBK-HUPBA Submission to the EPIC-Kitchens Action Recognition 2020 Challenge.
CoRR, 2020

Data Augmentation Techniques for the Video Question Answering Task.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Gate-Shift Networks for Video Action Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Novel-View Human Action Synthesis.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
Multi-Speaker Tracking From an Audio-Visual Sensing Device.
IEEE Trans. Multim., 2019

Top-down attention recurrent VLAD encoding for action recognition in videos.
Intelligenza Artificiale, 2019

An Analysis of Deep Neural Networks with Attention for Action Recognition from a Neurophysiological Perspective.
CoRR, 2019

FBK-HUPBA Submission to the EPIC-Kitchens 2019 Action Recognition Challenge.
CoRR, 2019

Hierarchical Feature Aggregation Networks for Video Action Recognition.
CoRR, 2019

Learnable Masks for Pose-Guided View Synthesis.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

View-LSTM: Novel-View Video Synthesis Through View Decomposition.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Accurate Target Annotation in 3D from Multimodal Streams.
Proceedings of the IEEE International Conference on Acoustics, 2019

LSTA: Long Short-Term Attention for Egocentric Action Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Joint Estimation of Human Pose and Conversational Groups from Social Scenes.
Int. J. Comput. Vis., 2018

MORB: A Multi-Scale Binary Descriptor.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

3D Mouth Tracking from a Compact Microphone Array Co-Located with a camera.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Multi-Camera Matching of Spatio-Temporal Binary Features.
Proceedings of the 21st International Conference on Information Fusion, 2018

Pose Guided Human Image Synthesis by View Disentanglement and Enhanced Weighting Loss.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Residual Stacked RNNs for Action Recognition.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Attention is All We Need: Nailing Down Object-centric Attention for Egocentric Activity Recognition.
Proceedings of the British Machine Vision Conference 2018, 2018

2017
An automatic image-to-DEM alignment approach for annotating mountains pictures on a smartphone.
Mach. Vis. Appl., 2017

Convolutional Long Short-Term Memory Networks for Recognizing First Person Interactions.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Learning to detect violent videos using convolutional long short-term memory.
Proceedings of the 14th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2017

SALSA: A Multimodal Dataset for the Automated Analysis of Free-Standing Social Interactions.
Proceedings of the Group and Crowd Behavior for Computer Vision, 1st Edition, 2017

Exploring Multitask and Transfer Learning Algorithms for Head Pose Estimation in Dynamic Multiview Scenarios.
Proceedings of the Group and Crowd Behavior for Computer Vision, 1st Edition, 2017

2016
A Multi-Task Learning Framework for Head Pose Estimation under Target Motion.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

SALSA: A Novel Dataset for Multimodal Group Behavior Analysis.
IEEE Trans. Pattern Anal. Mach. Intell., 2016

2015
Dynamic task decomposition for decentralized object tracking in complex scenes.
Comput. Vis. Image Underst., 2015

Jointly Estimating Interactions and Head, Body Pose of Interactors from Distant Social Scenes.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Analyzing Free-standing Conversational Groups: A Multimodal Approach.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Uncovering Interactions and Interactors: Joint Estimation of Head, Body Orientation and F-Formations from Surveillance Videos.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014
Exploring Transfer Learning Approaches for Head Pose Classification from Multi-view Surveillance Images.
Int. J. Comput. Vis., 2014

Evaluating Multi-task Learning for Multi-view Head-Pose Classification in Interactive Environments.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Dynamic Task Decomposition for Probabilistic Tracking in Complex Scenes.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Exploiting Color Constancy for Robust Tracking Under Non-uniform Illumination.
Proceedings of the Image Analysis and Recognition - 11th International Conference, 2014

Learning Contours for Automatic Annotations of Mountains Pictures on a Smartphone.
Proceedings of the International Conference on Distributed Smart Cameras, 2014

Wide-area Multi-camera Multi-object Tracking with Dynamic Task Decomposition.
Proceedings of the International Conference on Distributed Smart Cameras, 2014

Personalizing a smartwatch-based gesture interface with transfer learning.
Proceedings of the 22nd European Signal Processing Conference, 2014

2013
On the relationship between head pose, social attention and personality prediction for unstructured and dynamic group interactions.
Proceedings of the 2013 International Conference on Multimodal Interaction, 2013

Multi-scale f-formation discovery for group detection.
Proceedings of the IEEE International Conference on Image Processing, 2013

Learning the Scene Illumination for Color-Based People Tracking in Dynamic Environment.
Proceedings of the Image Analysis and Processing - ICIAP 2013, 2013

Multicamera People Tracking Using a Locus-based Probabilistic Occupancy Map.
Proceedings of the Image Analysis and Processing - ICIAP 2013, 2013

No Matter Where You Are: Flexible Graph-Guided Multi-task Learning for Multi-view Head Pose Classification under Target Motion.
Proceedings of the IEEE International Conference on Computer Vision, 2013

2012
Active transfer learning for multi-view head-pose classification.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Boosting-based transfer learning for multi-view head-pose classification from surveillance videos.
Proceedings of the 20th European Signal Processing Conference, 2012

An Adaptation Framework for Head-Pose Classification in Dynamic Multi-view Scenarios.
Proceedings of the Computer Vision, 2012

2011
Dynamic resource allocation for probabilistic tracking via attentive sensing and sampling.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

2010
Space speaks: towards socially and personality aware visual surveillance.
Proceedings of the 1st ACM international workshop on Multimodal pervasive video analysis, 2010

BabyExp: Constructing a Huge Multimodal Resource to Acquire Commonsense Knowledge Like Children Do.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

Tracking Multiple People with Illumination Maps.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

A joint particle filter to track the position and head orientation of people using audio visual cues.
Proceedings of the 18th European Signal Processing Conference, 2010


2009
Estimation of Head Pose.
Proceedings of the Computers in the Human Interaction Loop, 2009

Extracting Interaction Cues: Focus of Attention, Body Pose, and Gestures.
Proceedings of the Computers in the Human Interaction Loop, 2009

Person Tracking.
Proceedings of the Computers in the Human Interaction Loop, 2009

Multimodal Classification of Activities of Daily Living Inside Smart Homes.
Proceedings of the Distributed Computing, 2009

A HJS filter to track visually interacting targets.
Proceedings of the IEEE International Conference on Acoustics, 2009

A Sampling Algorithm for Occlusion Robust Multi Target Detection.
Proceedings of the Sixth IEEE International Conference on Advanced Video and Signal Based Surveillance, 2009

2008
Optimised Meeting Recording and Annotation Using Real-Time Video Analysis.
Proceedings of the Machine Learning for Multimodal Interaction, 5th International Workshop, 2008

2007
Tracking Visitors in a Museum.
Proceedings of the PEACH - Intelligent Interfaces for Museum Visits, 2007

An information theoretic rule for sample size adaptation in particle filtering.
Proceedings of the 14th International Conference on Image Analysis and Processing (ICIAP 2007), 2007

An Appearance-Based Particle Filter for Visual Tracking in Smart Rooms.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

Joint Bayesian Tracking of Head Location and Pose from Low-Resolution Video.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006
Approximate Bayesian Multibody Tracking.
IEEE Trans. Pattern Anal. Mach. Intell., 2006

Dynamic Head Location and Pose from Video.
Proceedings of the 2006 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, 2006

A Generative Approach to Audio-Visual Person Tracking.
Proceedings of the Multimodal Technologies for Perception of Humans, 2006

2005
Hybrid Joint-Separable Multibody Tracking.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005

2004
Occlusion robust Tracking Ofmultiple Objects.
Proceedings of the International Conference on Computer Vision and Graphics, 2004

Automatic Lens distortion estimation for an Active Camera.
Proceedings of the International Conference on Computer Vision and Graphics, 2004

2003
Olympus: an ambient intelligence architecture on the verge of reality.
Proceedings of the 12th International Conference on Image Analysis and Processing (ICIAP 2003), 2003


  Loading...