Rainer Stiefelhagen

Orcid: 0000-0001-8046-4945

Affiliations:
  • Karlsruhe Institute of Technology, Germany


According to our database1, Rainer Stiefelhagen authored at least 454 papers between 1997 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of two.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Behind Every Domain There is a Shift: Adapting Distortion-Aware Vision Transformers for Panoramic Semantic Segmentation.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

Deep Interactive Segmentation of Medical Images: A Systematic Review and Taxonomy.
IEEE Trans. Pattern Anal. Mach. Intell., December, 2024

EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving.
IEEE Trans. Intell. Transp. Syst., November, 2024

Learning human actions from complex manipulation tasks and their transfer to robots in the circular factory.
Autom., September, 2024

Managing uncertainty in product and process design for the circular factory.
Autom., September, 2024

CoBEV: Elevating Roadside 3D Object Detection With Depth and Height Complementarity.
IEEE Trans. Image Process., 2024

Designing a Tactile Document UI for 2D Refreshable Tactile Displays: Towards Accessible Document Layouts for Blind People.
Multimodal Technol. Interact., 2024

Recognizing affective states from the expressive behavior of tennis players using convolutional neural networks.
Knowl. Based Syst., 2024

Every Component Counts: Rethinking the Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks.
CoRR, 2024

Masked Differential Privacy.
CoRR, 2024

LIMIS: Towards Language-based Interactive Medical Image Segmentation.
CoRR, 2024

@Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology.
CoRR, 2024

OneBEV: Using One Panoramic Image for Bird's-Eye-View Semantic Mapping.
CoRR, 2024

Data Diet: Can Trimming PET/CT Datasets Enhance Lesion Segmentation?
CoRR, 2024

Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression.
CoRR, 2024

Occlusion-Aware Seamless Segmentation.
CoRR, 2024

Referring Atomic Video Action Recognition.
CoRR, 2024

SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading.
CoRR, 2024

Rethinking Annotator Simulation: Realistic Evaluation of Whole-Body PET Lesion Interactive Segmentation Methods.
CoRR, 2024

Skeleton-Based Human Action Recognition with Noisy Labels.
CoRR, 2024

360BEV: Panoramic Semantic Mapping for Indoor Bird's-Eye View.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Towards Video-based Activated Muscle Group Estimation in the Wild.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Anatomy-Guided Pathology Segmentation.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2024, 2024

Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2024

Chart4Blind: An Intelligent Interface for Chart Accessibility Conversion.
Proceedings of the 29th International Conference on Intelligent User Interfaces, 2024

Style Transfer and Pseudo-Label Filtering Improve Transferability in Cell Organelle Segmentation Scenarios.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

Sliding Window Fastedit: A Framework for Lesion Annotation in Whole-Body Pet Images.
Proceedings of the IEEE International Symposium on Biomedical Imaging, 2024

MateRobot: Material Recognition in Wearable Robotics for People with Visual Impairments.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

SynthAct: Towards Generalizable Human Action Recognition based on Synthetic Data.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

AltChart: Enhancing VLM-Based Chart Summarization Through Multi-pretext Tasks.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

ACCSAMS: Automatic Conversion of Exam Documents to Accessible Learning Material for Blind and Visually Impaired.
Proceedings of the Computers Helping People with Special Needs, 2024

Alt4Blind: A User Interface to Simplify Charts Alt-Text Creation.
Proceedings of the Computers Helping People with Special Needs, 2024

ChartFormer: A Large Vision Language Model for Converting Chart Images into Tactile Accessible SVGs.
Proceedings of the Computers Helping People with Special Needs, 2024

STS New Methods for Creating Accessible Material in Higher Education - Introduction to the Special Thematic Session.
Proceedings of the Computers Helping People with Special Needs, 2024

Elevating Skeleton-Based Action Recognition with Efficient Multi-Modality Self-Supervision.
Proceedings of the IEEE International Conference on Acoustics, 2024

Touch for Accessibility: Haptic SVG Diagrams for Visually Impaired and Blind Individuals.
Proceedings of the IEEE Haptics Symposium, 2024

SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading.
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024

Open Panoramic Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Statewide Visual Geolocalization in the Wild.
Proceedings of the Computer Vision - ECCV 2024, 2024

RoDLA: Benchmarking the Robustness of Document Layout Analysis Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Navigating Open Set Scenarios for Skeleton-Based Action Recognition.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation With Transformers.
IEEE Trans. Intell. Transp. Syst., December, 2023

Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for Mobile Agents via Unsupervised Contrastive Learning.
IEEE Trans. Intell. Transp. Syst., April, 2023

Delving Deep Into One-Shot Skeleton-Based Action Recognition With Diverse Occlusions.
IEEE Trans. Multim., 2023

C-BEV: Contrastive Bird's Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation.
CoRR, 2023

AutoPET Challenge 2023: Sliding Window-based Optimization of U-Net.
CoRR, 2023

Unveiling the Hidden Realm: Self-supervised Skeleton-based Action Recognition in Occluded Environments.
CoRR, 2023

Towards Privacy-Supporting Fall Detection via Deep Unsupervised RGB2Depth Adaptation.
CoRR, 2023

OAFuser: Towards Omni-Aperture Fusion for Light Field Semantic Segmentation of Road Scenes.
CoRR, 2023

Towards Unifying Anatomy Segmentation: Automated Generation of a Full-body CT Dataset via Knowledge Aggregation and Anatomical Guidelines.
CoRR, 2023

Accurate Fine-Grained Segmentation of Human Anatomy in Radiographs via Volumetric Pseudo-Labeling.
CoRR, 2023

FeatFSDA: Towards Few-shot Domain Adaptation for Video-based Activity Recognition.
CoRR, 2023

MuscleMap: Towards Video-based Activated Muscle Group Estimation.
CoRR, 2023

Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Trans4Map: Revisiting Holistic Bird's-Eye-View Mapping from Egocentric Images to Allocentric Semantics with Vision Transformers.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

AutoChemplete - Making Chemical Structural Formulas Accessible.
Proceedings of the 20th International Web for All Conference, 2023

Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2023

Enabling People with Blindness to Distinguish Lines of Mathematical Charts with Audio-Tactile Graphic Readers.
Proceedings of the 16th International Conference on PErvasive Technologies Related to Assistive Environments, 2023

Display and Use of Station Floor Plans on 2D Pin Matrix Displays for Blind and Visually Impaired People.
Proceedings of the 16th International Conference on PErvasive Technologies Related to Assistive Environments, 2023

Accessible Document Layout: An Interface for 2D Tactile Displays.
Proceedings of the 16th International Conference on PErvasive Technologies Related to Assistive Environments, 2023

Guiding the Guidance: A Comparative Analysis of User Guidance Signals for Interactive Segmentation of Volumetric Images.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2023, 2023

On Transferability of Driver Observation Models from Simulated to Real Environments in Autonomous Cars.
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2023

Multimodal Interactive Lung Lesion Segmentation: A Framework for Annotating PET/CT Images Based on Physiological and Anatomical Cues.
Proceedings of the 20th IEEE International Symposium on Biomedical Imaging, 2023

Quantized Distillation: Optimizing Driver Activity Recognition Models for Resource-Constrained Environments.
IROS, 2023

Line Graphics Digitization: A Step Towards Full Automation.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Mirror U-Net: Marrying Multimodal Fission with Multi-task Learning for Semantic Segmentation in Medical Imaging.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Open Scene Understanding: Grounded Situation Recognition Meets Segment Anything for Helping People with Visual Impairments.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

The First Visual Object Tracking Segmentation VOTS2023 Challenge Results.
, , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , ,
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Mouse-Based Hand Gesture Interaction in Virtual Reality.
Proceedings of the HCI International 2023 Posters, 2023

FishDreamer: Towards Fisheye Semantic Completion via Unified Image Outpainting and Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Decoupled Semantic Prototypes enable learning from diverse annotation types for semi-weakly segmentation in expert-driven domains.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Uncertainty-Aware Vision-Based Metric Cross-View Geolocalization.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Delivering Arbitrary-Modal Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

READMem: Robust Embedding Association for a Diverse Memory in Unconstrained Video Object Segmentation.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Exploring Event-Driven Dynamic Context for Accident Scene Segmentation.
IEEE Trans. Intell. Transp. Syst., 2022

Trans4Trans: Efficient Transformer for Transparent Object and Semantic Scene Segmentation in Real-World Navigation Assistance.
IEEE Trans. Intell. Transp. Syst., 2022

Omnisupervised Omnidirectional Semantic Segmentation.
IEEE Trans. Intell. Transp. Syst., 2022

Is My Driver Observation Model Overconfident? Input-Guided Calibration Networks for Reliable and Interpretable Confidence Estimates.
IEEE Trans. Intell. Transp. Syst., 2022

MASS: Multi-Attentional Semantic Segmentation of LiDAR Data for Dense Top-View Understanding.
IEEE Trans. Intell. Transp. Syst., 2022

Transfer Beyond the Field of View: Dense Panoramic Semantic Segmentation via Unsupervised Domain Adaptation.
IEEE Trans. Intell. Transp. Syst., 2022

Traveling More Independently: A Study on the Diverse Needs and Challenges of People with Visual or Mobility Impairments in Unfamiliar Indoor Environments.
ACM Trans. Access. Comput., 2022

AutoPET Challenge: Combining nn-Unet with Swin UNETR Augmented by Maximum Intensity Projection Classifier.
CoRR, 2022

Behind Every Domain There is a Shift: Adapting Distortion-aware Vision Transformers for Panoramic Semantic Segmentation.
CoRR, 2022

Trans4Map: Revisiting Holistic Top-down Mapping from Egocentric Images to Allocentric Semantics with Vision Transformers.
CoRR, 2022

CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers.
CoRR, 2022

Transformer-based Knowledge Distillation for Efficient Semantic Segmentation of Road-driving Scenes.
CoRR, 2022

ProFormer: Learning Data-efficient Representations of Body Movement with Prototype-based Feature Augmentation and Visual Transformers.
CoRR, 2022

Erfassung und Interpretation menschlicher Handlungen für die Programmierung von Robotern in der Produktion.
Autom., 2022

Breaking with Fixed Set Pathology Recognition Through Report-Guided Contrastive Training.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2022, 2022

A Comparative Analysis of Decision-Level Fusion for Multimodal Driver Behaviour Understanding.
Proceedings of the 2022 IEEE Intelligent Vehicles Symposium, 2022

TransDARC: Transformer-based Driver Activity Recognition with Latent Space Feature Calibration.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Multimodal Generation of Novel Action Appearances for Synthetic-to-Real Recognition of Activities of Daily Living.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Continuous Self-Localization on Aerial Images Using Visual and Lidar Sensors.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2022

Towards Automatic Parsing of Structured Visual Content through the Use of Synthetic Data.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Revisiting Click-Based Interactive Video Object Segmentation.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Interface for Automatic Tactile Display of Data Plots.
Proceedings of the Computers Helping People with Special Needs, 2022

Indoor Navigation Assistance for Visually Impaired People via Dynamic SLAM and Panoptic Segmentation with an RGB-D Sensor.
Proceedings of the Computers Helping People with Special Needs, 2022

The Accessible Tactile Indoor Maps (ATIM) Symbol Set: A Common Symbol Set for Different Printing Methods.
Proceedings of the Computers Helping People with Special Needs, 2022

An Audio-Tactile System for Visually Impaired People to Explore Indoor Maps.
Proceedings of the Computers Helping People with Special Needs, 2022

Digital Solutions for Inclusive Mobility: Solutions and Accessible Maps for Indoor and Outdoor Mobility - Introduction to the Special Thematic Session.
Proceedings of the Computers Helping People with Special Needs, 2022

Accessible Chemical Structural Formulas Through Interactive Document Labeling.
Proceedings of the Computers Helping People with Special Needs, 2022

Listening First: Egocentric Textual Descriptions of Indoor Spaces for People with Blindness.
Proceedings of the Computers Helping People with Special Needs, 2022

Traveling to Unknown Buildings: Accessibility Features for Indoor Maps.
Proceedings of the Computers Helping People with Special Needs, 2022

Split it Up: Allocentric Descriptions of Indoor Maps for People with Visual Impairments.
Proceedings of the Computers Helping People with Special Needs, 2022

Tabletop 3D Digital Map Interaction with Virtual Reality Handheld Controllers.
Proceedings of the Virtual, Augmented and Mixed Reality: Design and Development, 2022

Audio-Tactile Reader (ATR): Interaction Concepts for Students with Blindness to Explore Digital STEM Documents on a 2D Haptic Device.
Proceedings of the 2022 IEEE Haptics Symposium, 2022

Multi-modal Depression Estimation Based on Sub-attentional Fusion.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Graph-Constrained Contrastive Regularization for Semi-weakly Volumetric Segmentation.
Proceedings of the Computer Vision - ECCV 2022, 2022

ModSelect: Automatic Modality Selection for Synthetic-to-Real Domain Generalization.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

Pose-based Contrastive Learning for Domain Agnostic Activity Representations.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Should I take a walk? Estimating Energy Expenditure from Video Data.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Towards Robust Semantic Segmentation of Accident Scenes via Multi-Source Mixed Sampling and Meta-Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

Bending Reality: Distortion-aware Transformers for Adapting to Panoramic Semantic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Detailed Annotations of Chest X-Rays via CT Projection for Report Understanding.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

MatchFormer: Interleaving Attention in Transformers for Feature Matching.
Proceedings of the Computer Vision - ACCV 2022, 2022

Reference-Guided Pseudo-Label Generation for Medical Semantic Segmentation.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Is Context-Aware CNN Ready for the Surroundings? Panoramic Semantic Segmentation in the Wild.
IEEE Trans. Image Process., 2021

Certainty Volume Prediction for Unsupervised Domain Adaptation.
CoRR, 2021

UBR<sup>2</sup>S: Uncertainty-Based Resampling and Reweighting Strategy for Unsupervised Domain Adaptation.
CoRR, 2021

Adaptiope: A Modern Benchmark for Unsupervised Domain Adaptation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Unsupervised Meta-Domain Adaptation for Fashion Retrieval.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Flying Guide Dog: Walkable Path Discovery for the Visually Impaired Utilizing Drones and Transformer-based Semantic Segmentation.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2021

Perception Framework through Real-Time Semantic Segmentation and Scene Recognition on a Wearable System for the Visually Impaired.
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2021

Panoptic Lintention Network: Towards Efficient Navigational Perception for the Visually Impaired.
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2021

From Driver Talk To Future Action: Vehicle Maneuver Prediction by Learning from Driving Exam Dialogs.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2021

Panoramic Panoptic Segmentation: Towards Complete Surrounding Understanding via Unsupervised Contrastive Learning.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2021

DR-TANet: Dynamic Receptive Temporal Attention Network for Street Scene Change Detection.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2021

An Evaluation of Different Methods for 3D-Driver-Body-Pose Estimation.
Proceedings of the 24th IEEE International Intelligent Transportation Systems Conference, 2021

DensePASS: Dense Panoramic Semantic Segmentation via Unsupervised Domain Adaptation with Attention-Augmented Context Exchange.
Proceedings of the 24th IEEE International Intelligent Transportation Systems Conference, 2021

Prediction of Low-Kev Monochromatic Images From Polyenergetic CT Scans For Improved Automatic Detection of Pulmonary Embolism.
Proceedings of the 18th IEEE International Symposium on Biomedical Imaging, 2021

ISSAFE: Improving Semantic Segmentation in Accidents by Fusing Event-based Data.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Let's Play for Action: Recognizing Activities of Daily Living by Learning from Life Simulation Video Games.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Trans4Trans: Efficient Transformer for Transparent Object Segmentation to Help Visually Impaired People Navigate in the Real World.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

HIDA: Towards Holistic Indoor Understanding for the Visually Impaired via Semantic Instance Segmentation with a Wearable Solid-State LiDAR Sensor.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Vi<sup>2</sup>CLR: Video and Image for Visual Contrastive Learning of Representation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Affect-DML: Context-Aware One-Shot Recognition of Human Affect using Deep Metric Learning.
Proceedings of the 16th IEEE International Conference on Automatic Face and Gesture Recognition, 2021

Pose2Drone: A Skeleton-Pose-based Framework for Human-Drone Interaction.
Proceedings of the 29th European Signal Processing Conference, 2021

Temporally-Weighted Hierarchical Clustering for Unsupervised Action Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Every Annotation Counts: Multi-Label Deep Supervision for Medical Image Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Capturing Omni-Range Context for Omnidirectional Segmentation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

UBR²S: Uncertainty-Based Resampling and Reweighting Strategy for Unsupervised Domain Adaptation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Video Face Clustering With Self-Supervised Representation Learning.
IEEE Trans. Biom. Behav. Identity Sci., 2020

Helping the Blind to Get through COVID-19: Social Distancing Assistant Using Real-Time Semantic Segmentation on RGB-D Video.
Sensors, 2020

ShiSha: Enabling Shared Perspective With Face-to-Face Collaboration Using Redirected Avatars in Virtual Reality.
Proc. ACM Hum. Comput. Interact., 2020

Can we cover navigational perception needs of the visually impaired by panoptic segmentation?
CoRR, 2020

Deep Multimodal Feature Encoding for Video Ordering.
CoRR, 2020

Open Set Driver Activity Recognition.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

Deep Classification-driven Domain Adaptation for Cross-Modal Driver Behavior Recognition.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

In Defense of Multi-Source Omni-Supervised Efficient ConvNet for Robust Semantic Segmentation in Heterogeneous Unseen Domains.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

DS-PASS: Detail-Sensitive Panoramic Annular Semantic Segmentation through SwaftNet for Surrounding Sensing.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

CNN-based Driver Activity Understanding: Shedding Light on Deep Spatiotemporal Representations.
Proceedings of the 23rd IEEE International Conference on Intelligent Transportation Systems, 2020

Dynamic Interaction Graphs for Driver Activity Recognition.
Proceedings of the 23rd IEEE International Conference on Intelligent Transportation Systems, 2020

Multi-Task Learning for Calorie Prediction on a Novel Large-Scale Recipe Dataset Enriched with Nutritional Information.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Uncertainty-sensitive Activity Recognition: A Reliability Benchmark and the CARING Models.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Detective: An Attentive Recurrent Model for Sparse Object Detection.
Proceedings of the 25th International Conference on Pattern Recognition, 2020

Bring the Environment to Life: A Sonification Module for People with Visual Impairments to Improve Situation Awareness.
Proceedings of the ICMI '20: International Conference on Multimodal Interaction, 2020

Developing a Magnification Prototype Based on Head and Eye-Tracking for Persons with Low Vision.
Proceedings of the Computers Helping People with Special Needs, 2020

AccessibleMaps: Addressing Gaps in Maps for People with Visual and Mobility Impairments.
Proceedings of the Computers Helping People with Special Needs, 2020

Can We Unify Perception and Localization in Assisted Navigation? An Indoor Semantic Visual Positioning System for Visually Impaired People.
Proceedings of the Computers Helping People with Special Needs, 2020

Image-Based Recognition of Braille Using Neural Networks on Mobile Devices.
Proceedings of the Computers Helping People with Special Needs, 2020

Calibration of Diverse Tracking Systems to Enable Local Collaborative Mixed Reality Applications.
Proceedings of the Virtual, Augmented and Mixed Reality. Design and Interaction, 2020

Enabling Interaction with Arbitrary 2D Applications in Virtual Environments.
Proceedings of the HCI International 2020 - Posters - 22nd International Conference, 2020

Künstliche Intelligenz - Die dritte Welle.
Proceedings of the 50. Jahrestagung der Gesellschaft für Informatik, INFORMATIK 2020 - Back to the Future, Karlsruhe, Germany, 28. September, 2020

Clustering based Contrastive Learning for Improving Face Representations.
Proceedings of the 15th IEEE International Conference on Automatic Face and Gesture Recognition, 2020

Large Scale Holistic Video Understanding.
Proceedings of the Computer Vision - ECCV 2020, 2020

Activity-aware Attributes for Zero-Shot Driver Behavior Recognition.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Understanding what you feel: A Mobile Audio-Tactile System for Graphics Used at Schools with Students with Visual Impairment.
Proceedings of the CHI '20: CHI Conference on Human Factors in Computing Systems, 2020

Anchor-free Small-scale Multispectral Pedestrian Detection.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

Unsupervised Domain Adaptation by Uncertain Feature Alignment.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

CLEVR: A Customizable Interactive Learning Environment for Users with Low Vision in Virtual Reality.
Proceedings of the ASSETS '20: The 22nd International ACM SIGACCESS Conference on Computers and Accessibility, 2020

Travelling more independently: A Requirements Analysis for Accessible Journeys to Unknown Buildings for People with Visual Impairments.
Proceedings of the ASSETS '20: The 22nd International ACM SIGACCESS Conference on Computers and Accessibility, 2020

Self-guided Multiple Instance Learning for Weakly Supervised Disease Classification and Localization in Chest Radiographs.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
Holistic Large Scale Video Understanding.
CoRR, 2019

SPaSe - Multi-Label Page Segmentation for Presentation Slides.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Self-supervised Face-Grouping on Graphs.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

End-to-end Prediction of Driver Intention using 3D Convolutional Neural Networks.
Proceedings of the 2019 IEEE Intelligent Vehicles Symposium, 2019

3D Object Trajectory Reconstruction using Instance-Aware Multibody Structure from Motion and Stereo Sequence Constraints.
Proceedings of the 2019 IEEE Intelligent Vehicles Symposium, 2019

WiSe - Slide Segmentation in the Wild.
Proceedings of the 2019 International Conference on Document Analysis and Recognition, 2019

Learning Fine-Grained Image Representations for Mathematical Expression Recognition.
Proceedings of the 13th IAPR International Workshop on Graphics Recognition, 2019

Drive&Act: A Multi-Modal Dataset for Fine-Grained Driver Behavior Recognition in Autonomous Vehicles.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

DynamoNet: Dynamic Action and Motion Network.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Self-Supervised Learning of Face Representations for Video Face Clustering.
Proceedings of the 14th IEEE International Conference on Automatic Face & Gesture Recognition, 2019

A Vision-based System for Breathing Disorder Identification: A Deep Learning Perspective.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

Motion Signatures for the Analysis of Seizure Evolution in Epilepsy.
Proceedings of the 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2019

DynGraph: Visual Question Answering via Dynamic Scene Graphs.
Proceedings of the Pattern Recognition, 2019

Efficient Parameter-Free Clustering Using First Neighbor Relations.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Analysis of Deep Fusion Strategies for Multi-Modal Gesture Recognition.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

UAV-Net: A Fast Aerial Vehicle Detector for Mobile Platforms.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

It's Not About the Journey; It's About the Destination: Following Soft Paths Under Question-Guidance for Visual Reasoning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Weakly Supervised Object Discovery by Generative Adversarial & Ranking Networks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Content and Colour Distillation for Learning Image Translations with the Spatial Profile Loss.
Proceedings of the 30th British Machine Vision Conference 2019, 2019

Towards a Standardized Grammar for Navigation Systems for Persons with Visual Impairments.
Proceedings of the 21st International ACM SIGACCESS Conference on Computers and Accessibility, 2019

2018
Stereo 3D Object Trajectory Reconstruction.
CoRR, 2018

Driver observation and shared vehicle control: supporting the driver on the way back into the control loop.
Autom., 2018

Personal Perspective: Using Modified World Views to Overcome Real-Life Limitations in Virtual Reality.
Proceedings of the 2018 IEEE Conference on Virtual Reality and 3D User Interfaces, 2018

Body Pose and Context Information for Driver Secondary Task Detection.
Proceedings of the 2018 IEEE Intelligent Vehicles Symposium, 2018

Can Image Enhancement be Beneficial to Find Smoke Images in Laparoscopic Surgery?
Proceedings of the 26th Color and Imaging Conference, 2018

Estimating mental load in passive and active tasks from pupil and gaze changes using bayesian surprise.
Proceedings of the Workshop on Modeling Cognitive Processes from Multimodal Data, 2018

Accessible EPUB: Making EPUB 3 Documents Universal Accessible.
Proceedings of the Computers Helping People with Special Needs, 2018

Optical Braille Recognition.
Proceedings of the Computers Helping People with Special Needs, 2018

Prototype Development of a Low-Cost Vibro-Tactile Navigation Aid for the Visually Impaired.
Proceedings of the Computers Helping People with Special Needs, 2018

An Inclusive and Accessible LaTeX Editor.
Proceedings of the Computers Helping People with Special Needs, 2018

UML4ALL Syntax - A Textual Notation for UML Diagrams.
Proceedings of the Computers Helping People with Special Needs, 2018

Visual Shoreline Detection for Blind and Partially Sighted People.
Proceedings of the Computers Helping People with Special Needs, 2018

Multi-user Collaboration on Complex Data in Virtual and Augmented Reality.
Proceedings of the HCI International 2018 - Posters' Extended Abstracts, 2018

Interaction of Distant and Local Users in a Collaborative Virtual Environment.
Proceedings of the Virtual, Augmented and Mixed Reality: Interaction, Navigation, Visualization, Embodiment, and Simulation, 2018

Capability for Collision Avoidance of Different User Avatars in Virtual Reality.
Proceedings of the HCI International 2018 - Posters' Extended Abstracts, 2018

qVRty: Virtual Keyboard with a Haptic, Real-World Representation.
Proceedings of the HCI International 2018 - Posters' Extended Abstracts, 2018

Towards a Fair Evaluation of Zero-Shot Action Recognition Using External Data.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

MoQA - A Multi-modal Question Answering Architecture.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

3D Vehicle Trajectory Reconstruction in Monocular Video Data Using Environment Structure Constraints.
Proceedings of the Computer Vision - ECCV 2018, 2018

Taming the Cross Entropy Loss.
Proceedings of the Pattern Recognition - 40th German Conference, 2018

Classification-Driven Dynamic Image Enhancement.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

A Pose-Sensitive Embedding for Person Re-Identification With Expanded Cross Neighborhood Re-Ranking.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Informed Democracy: Voting-based Novelty Detection for Action Recognition.
Proceedings of the British Machine Vision Conference 2018, 2018

2017
Accessibility Beyond the Desktop.
Inform. Spektrum, 2017

A Multimodal Assistive System for Helping Visually Impaired in Social Interactions.
Inform. Spektrum, 2017

UML4ALL: Gemeinsam in Diversity Teams Software modellieren - Für Menschen mit und ohne Seheinschränkung.
Inform. Spektrum, 2017

Deep Perceptual Mapping for Cross-Modal Face Recognition.
Int. J. Comput. Vis., 2017

Object Discovery By Generative Adversarial & Ranking Networks.
CoRR, 2017

3D Trajectory Reconstruction of Dynamic Objects Using Planarity Constraints.
CoRR, 2017

Breathing Rate Monitoring during Sleep from a Depth Camera under Real-Life Conditions.
Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision, 2017

Real time driver body pose estimation for novel assistance systems.
Proceedings of the 20th IEEE International Conference on Intelligent Transportation Systems, 2017

CNN-based sensor fusion techniques for multimodal human activity recognition.
Proceedings of the 2017 ACM International Symposium on Wearable Computers, 2017

Using Technology Developed for Autonomous Cars to Help Navigate Blind People.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Mind the Gap: Virtual Shorelines for Blind and Partially Sighted People.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Fashion Forward: Forecasting Visual Style in Fashion.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Digital Map Table VR: Bringing an Interactive System to Virtual Reality.
Proceedings of the Virtual, Augmented and Mixed Reality, 2017

Interaction with Three Dimensional Objects on Diverse Input and Output Devices: A Survey.
Proceedings of the HCI International 2017 - Posters' Extended Abstracts, 2017

Marlin: A High Throughput Variable-to-Fixed Codec Using Plurally Parsable Dictionaries.
Proceedings of the 2017 Data Compression Conference, 2017

DriveAHead - A Large-Scale Driver Head Pose Dataset.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Person Re-identification by Deep Learning Attribute-Complementary Information.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

Automatic Discovery, Association Estimation and Learning of Semantic Attributes for a Thousand Categories.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Deep View-Sensitive Pedestrian Attribute Inference in an end-to-end Model.
Proceedings of the British Machine Vision Conference 2017, 2017

2016
Transfer metric learning for action similarity using high-level semantics.
Pattern Recognit. Lett., 2016

Exact Maximum Entropy Inverse Optimal Control for Modelling Human Attention Switching and Control.
CoRR, 2016

Relaxed Earth Mover's Distances for Chain- and Tree-connected Spaces and their use as a Loss Function in Deep Learning.
CoRR, 2016

HeHOP: Highly efficient head orientation and position estimation.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Naming TV characters by watching and analyzing dialogs.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

Exact Maximum Entropy Inverse Optimal Control for modeling human attention switching and control.
Proceedings of the 2016 IEEE International Conference on Systems, Man, and Cybernetics, 2016

Predicting lane keeping behavior of visually distracted drivers using inverse suboptimal control.
Proceedings of the 2016 IEEE Intelligent Vehicles Symposium, 2016

Sleep position classification from a depth camera using Bed Aligned Maps.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Mobile Interactive Image Sonification for the Blind.
Proceedings of the Computers Helping People with Special Needs, 2016

User Requirements Regarding Information Included in Audio-Tactile Maps for Individuals with Blindness.
Proceedings of the Computers Helping People with Special Needs, 2016

Zebra Crossing Detection from Aerial Imagery Across Countries.
Proceedings of the Computers Helping People with Special Needs, 2016

MovieQA: Understanding Stories in Movies through Question-Answering.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Recovering the Missing Link: Predicting Class-Attribute Associations for Unsupervised Zero-Shot Learning.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Automatic generation of scene-specific person trackers.
Proceedings of the 13th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2016

2015
Aligning plot synopses to videos for story-based retrieval.
Int. J. Multim. Inf. Retr., 2015

What's the point? Frame-wise Pointing Gesture Recognition with Latent-Dynamic Conditional Random Fields.
CoRR, 2015

On the Distribution of Salient Objects in Web Images and its Influence on Salient Object Detection.
CoRR, 2015

Deep Perceptual Mapping for Thermal to Visible Face Recognition.
CoRR, 2015

3D Pictorial Structures for Human Pose Estimation with Supervoxels.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

How to Transfer? Zero-Shot Object Recognition via Hierarchical Transfer of Semantic Attributes.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

Action recognition in bed using BAMs for assisted living and elderly care.
Proceedings of the 14th IAPR International Conference on Machine Vision Applications, 2015

Accio: A Data Set for Face Track Retrieval in Movies Across Age.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015

KIT at MediaEval 2015 - Evaluating Visual Cues for Affective Impact of Movies Task.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015

A Web-based Platform for Interactive Image Sonification.
Proceedings of the Mensch und Computer 2015, 2015

Pedestrian intention recognition using Latent-dynamic Conditional Random Fields.
Proceedings of the 2015 IEEE Intelligent Vehicles Symposium, 2015

A Controlled Interactive Multiple Model Filter for Combined Pedestrian Intention Recognition and Path Prediction.
Proceedings of the IEEE 18th International Conference on Intelligent Transportation Systems, 2015

Interactive Web-based Image Sonification for the Blind.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Multimodal Public Speaking Performance Assessment.
Proceedings of the 2015 ACM on International Conference on Multimodal Interaction, Seattle, WA, USA, November 09, 2015

Color decorrelation helps visual saliency detection.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

Combining view-based pose normalization and feature transform for cross-pose face recognition.
Proceedings of the International Conference on Biometrics, 2015

Learning to Juggle in an Interactive Virtual Reality Environment.
Proceedings of the HCI International 2015 - Posters' Extended Abstracts, 2015

Improved weak labels using contextual cues for person identification in videos.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

Action unit intensity estimation using hierarchical partial least squares.
Proceedings of the 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition, 2015

3D Facial Landmark Detection: How to Deal with Head Rotations?
Proceedings of the Pattern Recognition - 37th German Conference, 2015

Book2Movie: Aligning video scenes with book chapters.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

Deep Perceptual Mapping for Thermal to Visible Face Recogntion.
Proceedings of the British Machine Vision Conference 2015, 2015

Transferring attributes for person re-identification.
Proceedings of the 12th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2015

Message from general chairs.
Proceedings of the 12th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2015

2014
An evaluation of the compactness of superpixels.
Pattern Recognit. Lett., 2014

Automatic understanding of group behavior using fuzzy temporal logic.
J. Ambient Intell. Smart Environ., 2014

"Important stuff, everywhere!" Activity recognition with salient proto-objects as context.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Learning Semantic Attributes via a Common Latent Space.
Proceedings of the VISAPP 2014, 2014

Story-based Video Retrieval in TV series using Plot Synopses.
Proceedings of the International Conference on Multimedia Retrieval, 2014

Erfassung der Oberkörperpose im Kraftfahrzeug.
Proceedings of the Mensch & Computer 2014 - Workshopband, 14. Fachübergreifende Konferenz für Interaktive und Kooperative Medien - Interaktiv unterwegs - Freiräume gestalten, 31. August, 2014

"Look at this!" learning to guide visual saliency in human-robot interaction.
Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

Total Cluster: A person agnostic clustering method for broadcast videos.
Proceedings of the 2014 Indian Conference on Computer Vision, 2014

Manifold Alignment for Person Independent Appearance-Based Gaze Estimation.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

What to Transfer? High-Level Semantics in Transfer Metric Learning for Action Similarity.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Cleaning up after a face tracker: False positive removal.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Kinect unbiased.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Cognitive Evaluation of Haptic and Audio Feedback in Short Range Navigation Tasks.
Proceedings of the Computers Helping People with Special Needs, 2014

Way to Go! Detecting Open Areas Ahead of a Walking Person.
Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014

StoryGraphs: Visualizing Character Interactions as a Timeline.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

A time pooled track kernel for person identification.
Proceedings of the 11th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2014

Real Time Head Model Creation and Head Pose Estimation on Consumer Depth Cameras.
Proceedings of the 2nd International Conference on 3D Vision, 2014

2013
Visuelle Perzeption für die multimodale Mensch-Maschine-Interaktion in und mit aufmerksamen Räumen.
Autom., 2013

Multimodale Interaktion.
Autom., 2013

How to choose element sizes for novel interactive systems.
Proceedings of the ACM International Conference on Interactive Tabletops and Surfaces, 2013

Kinect Unleashed: Getting Control over High Resolution Depth Maps.
Proceedings of the 13. IAPR International Conference on Machine Vision Applications, 2013

Automated Multi-Camera System for Long Term Behavioral Monitoring in Intensive Care Units.
Proceedings of the 13. IAPR International Conference on Machine Vision Applications, 2013

Dynamic Gaussian Force Field Controlled Kalman Filtering For Pointing Interaction.
Proceedings of the Mensch & Computer 2013: Interaktive Vielfalt, 2013

GlueTK: a framework for multi-modal, multi-display human-machine-interaction.
Proceedings of the 18th International Conference on Intelligent User Interfaces, 2013


Accessible section detection for visual guidance.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2013

How the distribution of salient objects in images influences salient object detection.
Proceedings of the IEEE International Conference on Image Processing, 2013

"Wow!" Bayesian surprise for salient acoustic event detection.
Proceedings of the IEEE International Conference on Acoustics, 2013

How to Click in Mid-Air.
Proceedings of the Distributed, Ambient, and Pervasive Interactions, 2013

Semi-supervised Learning with Constraints for Person Identification in Multimedia Data.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

"BAM!" Depth-Based Body Analysis in Critical Care.
Proceedings of the Computer Analysis of Images and Patterns, 2013

RPM: Random Points Matching for Pair wise Face-Similarity.
Proceedings of the British Machine Vision Conference, 2013

Person tracking-by-detection with efficient selection of part-detectors.
Proceedings of the 10th IEEE International Conference on Advanced Video and Signal Based Surveillance, 2013

2012
Best of Automatic Face and Gesture Recognition 2011.
Image Vis. Comput., 2012

Predicting human gaze using quaternion DCT image signature saliency and face detection.
Proceedings of the IEEE Workshop on Applications of Computer Vision, 2012

On-line Action Recognition from Sparse Feature Flow.
Proceedings of the VISAPP 2012, 2012

Quaero at TRECVID 2012: Semantic Indexing.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012

Rule-Based High-Level Situation Recognition from Incomplete Tracking Data.
Proceedings of the Rules on the Web: Research and Applications, 2012

KIT at MediaEval 2012 - Content - based Genre Classification with Visual Cues.
Proceedings of the Working Notes Proceedings of the MediaEval 2012 Workshop, 2012

Video-based pedestrian head pose estimation for risk assessment.
Proceedings of the 15th International IEEE Conference on Intelligent Transportation Systems, 2012

Multimodal saliency-based attention: A lazy robot's approach.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Measuring and evaluating the compactness of superpixels.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Learning robust color name models from web images.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Robust multi-pose face tracking by multi-stage tracklet association.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Breath rate monitoring during sleep using near-ir imagery and PCA.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

A ranking model for face alignment with Pseudo Census Transform.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

Vision-based handwriting recognition for unrestricted text input in mid-air.
Proceedings of the International Conference on Multimodal Interaction, 2012

An Assistive Vision System for the Blind That Helps Find Lost Things.
Proceedings of the Computers Helping People with Special Needs, 2012

Quaternion-Based Spectral Saliency Detection for Eye Fixation Prediction.
Proceedings of the Computer Vision - ECCV 2012, 2012

"Knock! Knock! Who is it?" probabilistic person identification in TV-series.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Improving foreground segmentations with probabilistic superpixel Markov random fields.
Proceedings of the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2012

Analysis of partial least squares for pose-invariant face recognition.
Proceedings of the IEEE Fifth International Conference on Biometrics: Theory, 2012

Face Alignment Using a Ranking Model based on Regression Trees.
Proceedings of the British Machine Vision Conference, 2012

Contextual Constraints for Person Retrieval in Camera Networks.
Proceedings of the Ninth IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2012

Automatic Behavior Understanding in Crisis Response Control Rooms.
Proceedings of the Ambient Intelligence - Third International Joint Conference, 2012

2011
Person re-identification in TV series using robust face recognition and user feedback.
Multim. Tools Appl., 2011

Quaero at TRECVID 2011: Semantic Indexing and Multimedia Event Detection.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011

Identifying Important People in Broadcast News Videos.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2011), 2011

Multimodal saliency-based attention for object-based scene analysis.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Combined intention, activity, and motion recognition for a humanoid household robot.
Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2011

Tue-SeA Real-Time Speech Command Detector for a Smart Control Room.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Evaluating image segments by applying the description length to sets of superpixels.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

High-level situation recognition using Fuzzy Metric Temporal Logic, case studies in surveillance and smart environments.
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2011

The KIT Robo-kitchen data set for the evaluation of view-based activity recognition systems.
Proceedings of the 11th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2011), 2011

Combined Head Localization and Head Pose Estimation for Video-Based Advanced Driver Assistance Systems.
Proceedings of the Pattern Recognition - 33rd DAGM Symposium, Frankfurt/Main, Germany, August 31, 2011

Boosting Pseudo Census Transform Features for Face Alignment.
Proceedings of the British Machine Vision Conference, 2011

Part-based clothing segmentation for person retrieval.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

AVSS 2011 demo session: Interactive person-retrieval in a distributed camera network.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

Evaluation of local features for person re-identification in image sequences.
Proceedings of the 8th IEEE International Conference on Advanced Video and Signal-Based Surveillance, 2011

2010
A video-based door monitoring system using local appearance-based face models.
Comput. Vis. Image Underst., 2010

Quaero at TRECVID 2010: Semantic Indexing.
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010

Content-based video genre classification using multiple cues.
Proceedings of the 3rd International Workshop on Automated Information Extraction in Media Production, 2010

Efficient person identification using active cameras in a smartroom.
Proceedings of the 1st ACM international workshop on Multimodal pervasive video analysis, 2010

Interactive person-retrieval in TV series and distributed surveillance video.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Image and Video Analysis to Enable Human-Friendly Systems.
Proceedings of the Emerging Research Directions in Computer Science, Karlsruhe, Germany, July 26-27, 2010. Proceedings, 2010

Towards High-Level Human Activity Recognition through Computer Vision and Temporal Logic.
Proceedings of the KI 2010: Advances in Artificial Intelligence, 2010

Multi-view Based Estimation of Human Upper-Body Orientation.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Multi-resolution Local Appearance-Based Face Verification.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Automatic Frequency Band Selection for Illumination Robust Face Recognition.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

3D user-perspective, voxel-based estimation of visual focus of attention in dynamic meeting scenarios.
Proceedings of the 12th International Conference on Multimodal Interfaces / 7. International Workshop on Machine Learning for Multimodal Interaction, 2010

Adaptive color transformation for person re-identification in camera networks.
Proceedings of the 2010 Fourth ACM/IEEE International Conference on Distributed Smart Cameras, Atlanta, GA, USA - August 31, 2010

Robust Open-Set Face Recognition for Small-Scale Convenience Applications.
Proceedings of the Pattern Recognition, 2010

Interactive person re-identification in TV series.
Proceedings of the 2010 International Workshop on Content-Based Multimedia Indexing, 2010

Multi-pose Face Recognition for Person Retrieval in Camera Networks.
Proceedings of the Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance, 2010


2009
Computers in the Human Interaction Loop.
Proceedings of the Computers in the Human Interaction Loop, 2009

Estimation of Head Pose.
Proceedings of the Computers in the Human Interaction Loop, 2009

Perceptual Technologies: Analyzing the Who, What, Where of Human Interaction.
Proceedings of the Computers in the Human Interaction Loop, 2009

Activity Classification.
Proceedings of the Computers in the Human Interaction Loop, 2009

Perceptual Component Evaluation and Data Collection.
Proceedings of the Computers in the Human Interaction Loop, 2009

Extracting Interaction Cues: Focus of Attention, Body Pose, and Gestures.
Proceedings of the Computers in the Human Interaction Loop, 2009

Person Tracking.
Proceedings of the Computers in the Human Interaction Loop, 2009

Multimodal identity tracking in a smart room.
Pers. Ubiquitous Comput., 2009

Universität Karlsruhe (TH) at TRECVID 2009.
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009

Extending touch: towards interaction with large-scale surfaces.
Proceedings of the ACM International Conference on Interactive Tabletops and Surfaces, 2009

A System for Probabilistic Joint 3D Head Tracking and Pose Estimation in Low-Resolution, Multi-view Environments.
Proceedings of the Computer Vision Systems, 2009

Open-Set Face Recognition-Based Visitor Interface System.
Proceedings of the Computer Vision Systems, 2009

Person tracking in camera networks using graph-based bayesian inference.
Proceedings of the Third ACM/IEEE International Conference on Distributed Smart Cameras, 2009

Pose Normalization for Local Appearance-Based Face Recognition.
Proceedings of the Advances in Biometrics, Third International Conference, 2009

Generic versus Salient Region-Based Partitioning for Local Appearance Face Recognition.
Proceedings of the Advances in Biometrics, Third International Conference, 2009

Why Is Facial Occlusion a Challenging Problem?.
Proceedings of the Advances in Biometrics, Third International Conference, 2009

Real-Time GPU-Based Voxel Carving with Systematic Occlusion Handling.
Proceedings of the Pattern Recognition, 2009

2008
Introduction to the special issue on multimodal corpora for modeling human multimodal behavior.
Lang. Resour. Evaluation, 2008

Evaluating Multiple Object Tracking Performance: The CLEAR MOT Metrics.
EURASIP J. Image Video Process., 2008

Universität Karlsruhe (TH) at TRECVID 2008.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

A context-aware virtual secretary in a smart office environment.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Probabilistic integration of sparse audio-visual cues for identity tracking.
Proceedings of the 16th International Conference on Multimedia 2008, 2008

Visual Focus of Attention in Dynamic Meeting Scenarios.
Proceedings of the Machine Learning for Multimodal Interaction, 5th International Workshop, 2008

Data Collection for the CHIL CLEAR 2007 Evaluation Campaign.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Deducing the visual focus of attention from head pose estimation in dynamic multi-view meeting scenarios.
Proceedings of the 10th International Conference on Multimodal Interfaces, 2008

Tracking identities and attention in smart environments - contributions and progress in the CHIL project.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

Face recognition for smart interactions.
Proceedings of the 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), 2008

Dynamic Integration of Generalized Cues for Person Tracking.
Proceedings of the Computer Vision, 2008

2007
Visuelle Perzeption von Menschen für Mensch-Maschine Interaktion
, 2007

Enabling Multimodal Human-Robot Interaction for the Karlsruhe Humanoid Robot.
IEEE Trans. Robotics, 2007

3-D Face Recognition Using Local Appearance-Based Models.
IEEE Trans. Inf. Forensics Secur., 2007

The CHIL audiovisual corpus for lecture and meeting analysis inside smart rooms.
Lang. Resour. Evaluation, 2007

Visual recognition of pointing gestures for human-robot interaction.
Image Vis. Comput., 2007

Universität Karlsruhe (TH) at TRECVID 2007.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

Audio-visual multi-person tracking and identification for smart environments.
Proceedings of the 15th International Conference on Multimedia 2007, 2007

Face Recognition in Smart Rooms.
Proceedings of the Machine Learning for Multimodal Interaction , 2007

Face Recognition for Smart Interactions.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007

Video-based Face Recognition on Real-World Data.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007

State Synchronous Modeling on Phone Boundary for Audio Visual Speech Recognition and Application to Muti-View Face Images.
Proceedings of the IEEE International Conference on Acoustics, 2007

Fast audio-visual multi-person tracking for a humanoid stereo camera head.
Proceedings of the 2007 7th IEEE-RAS International Conference on Humanoid Robots, November 29th, 2007

Multi-modal Person Identification in a Smart Environment.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Automatic Person Detection and Tracking using Fuzzy Controlled Active Cameras.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

Head Pose Estimation in Single- and Multi-view Environments - Results on the CLEAR'07 Benchmarks.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

The CLEAR 2007 Evaluation.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

ISL Person Identification Systems in the CLEAR 2007 Evaluations.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

Multi-level Particle Filter Fusion of Features and Cues for Audio-Visual Person Tracking.
Proceedings of the Multimodal Technologies for Perception of Humans, 2007

2006
Audio-visual perception of a lecturer in a smart seminar room.
Signal Process., 2006

The Connector Service-Predicting Availability in Mobile Contexts.
Proceedings of the Machine Learning for Multimodal Interaction, 2006

Activity Recognition and Room-Level Tracking in an Office Environment.
Proceedings of the 2006 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, 2006

A Bayesian Approach for Multi-view Head Pose Estimation.
Proceedings of the 2006 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems, 2006

Mouth Region Localization Method Based on Gaussian Mixture Model.
Proceedings of the Advances in Machine Vision, 2006

Multimodal Identity Tracking in a Smartroom.
Proceedings of the Artificial Intelligence Applications and Innovations, 2006

Detection-Assisted Initialization, Adaptation and Fusion of Body Region Trackers for Robust Multiperson Tracking.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Tracking head pose and focus of attention with multiple far-field cameras.
Proceedings of the 8th International Conference on Multimodal Interfaces, 2006

MyConnector: analysis of context cues to predict human availability for communication.
Proceedings of the 8th International Conference on Multimodal Interfaces, 2006

Tracking of the Articulated Upper Body on Multi-View Stereo Image Sequences.
Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006), 2006

Block Selection in the Local Appearance-based Face Recognition Scheme.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2006

Analysis of Local Appearance-Based Face Recognition: Effects of Feature Selection and Feature Normalization.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2006

Neural Network-Based Head Pose Estimation and Multi-view Fusion.
Proceedings of the Multimodal Technologies for Perception of Humans, 2006

The CLEAR 2006 Evaluation.
Proceedings of the Multimodal Technologies for Perception of Humans, 2006

An Audio-Visual Particle Filter for Speaker Tracking on the CLEAR'06 Evaluation Dataset.
Proceedings of the Multimodal Technologies for Perception of Humans, 2006

Multi-and Single View Multiperson Tracking for Smart Room Environments.
Proceedings of the Multimodal Technologies for Perception of Humans, 2006

2005
Capturing Interactions in Meetings with Omnidirectional Cameras.
Int. J. Distance Educ. Technol., 2005

Estimating the Lecturer's Head Pose in Seminar Scenarios - A Multi-view Approach.
Proceedings of the Machine Learning for Multimodal Interaction, 2005

A joint particle filter for audio-visual speaker tracking.
Proceedings of the 7th International Conference on Multimodal Interfaces, 2005

The connector: facilitating context-aware communication.
Proceedings of the 7th International Conference on Multimodal Interfaces, 2005

A cognitive architecture for a humanoid robot: a first approach.
Proceedings of the 5th IEEE-RAS International Conference on Humanoid Robots, 2005

Local appearance based face recognition using discrete cosine transform.
Proceedings of the 13th European Signal Processing Conference, 2005

A GENERIC FACE REPRESENTATION APPROACH FOR LOCAL APPEARANCE BASED FACE VERIFICATION.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2005

Multi-View Head Pose Estimation using Neural Networks.
Proceedings of the Second Canadian Conference on Computer and Robot Vision (CRV 2005), 2005

2004
Natural human-robot interaction using speech, head pose and gestures.
Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004

Identifying the addressee in human-human-robot interactions based on head pose and speech.
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004

Implementation and evaluation of a constraint-based multimodal fusion system for speech and 3D pointing gestures.
Proceedings of the 6th International Conference on Multimodal Interfaces, 2004

Head Pose Estimation Using Stereo Vision For Human-Robot Interaction.
Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004

3D-Tracking of Head and Hands for Pointing Gesture Recognition in a Human-Robot Interaction Scenario.
Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition (FGR 2004), 2004

Real-Time Person Tracking and Pointing Gesture Recognition for Human-Robot Interaction.
Proceedings of the Computer Vision in Human-Computer Interaction, 2004

Large Vocabulary Audio-Visual Speech Recognition Using the Janus Speech Recognition Toolkit.
Proceedings of the Pattern Recognition, 26th DAGM Symposium, August 30, 2004

2003
Pointing gesture recognition based on 3D-tracking of face, hands and head orientation.
Proceedings of the 5th International Conference on Multimodal Interfaces, 2003

SMaRT: the Smart Meeting Room Task at ISL.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

Real-Time Recognition of 3D-Pointing Gestures for Human-Machine-Interaction.
Proceedings of the Pattern Recognition, 2003

2002
Tracking and modeling focus of attention in meetings.
PhD thesis, 2002

Modeling focus of attention for meeting indexing based on multiple cues.
IEEE Trans. Neural Networks, 2002

Tracking Focus of Attention in Meetings.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Towards Vision-Based 3-D People Tracking in a Smart Room.
Proceedings of the 4th IEEE International Conference on Multimodal Interfaces (ICMI 2002), 2002

Head orientation and gaze direction in meetings.
Proceedings of the Extended abstracts of the 2002 Conference on Human Factors in Computing Systems, 2002

2001
Estimating focus of attention based on gaze and sound.
Proceedings of the Auditory-Visual Speech Processing, 2001

2000
Towards Unrestricted Lip Reading.
Int. J. Pattern Recognit. Artif. Intell., 2000

Simultaneous Tracking of Head Poses in a Panoramic View.
Proceedings of the 15th International Conference on Pattern Recognition, 2000

1999
From Gaze to Focus of Attention.
Proceedings of the Visual Information and Information Systems, 1999

Modeling focus of attention for meeting indexing.
Proceedings of the 7th ACM International Conference on Multimedia '99, Orlando, FL, USA, October 30, 1999

1998
Visual Tracking for Multimodal Human Computer Interaction.
Proceedings of the Proceeding of the CHI '98 Conference on Human Factors in Computing Systems, 1998

Real-Time Face and Facial Feature Tracking and Applications.
Proceedings of the Auditory-Visual Speech Processing, 1998

1997
A Model-Based Gaze Tracking System.
Int. J. Artif. Intell. Tools, 1997

Real-time lip-tracking for lipreading.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

Gaze tracking for multimodal human-computer interaction.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Preprocessing of visual speech under real world conditions.
Proceedings of the ESCA Workshop on Audio-Visual Speech Processing, 1997


  Loading...