Esa Rahtu

Orcid: 0000-0001-8767-0864

According to our database, Esa Rahtu authored at least 163 papers between 2004 and 2024.

Bibliography

2024
Cascaded and Generalizable Neural Radiance Fields for Fast View Synthesis.
IEEE Trans. Pattern Anal. Mach. Intell., May, 2024

Temporally Aligned Audio for Video with Autoregression.
CoRR, 2024

UDGS-SLAM: UniDepth Assisted Gaussian Splatting for Monocular SLAM.
CoRR, 2024

DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing.
CoRR, 2024

GS-Pose: Cascaded Framework for Generalizable Segmentation-based 6D Object Pose Estimation.
CoRR, 2024

MuSHRoom: Multi-Sensor Hybrid Room Dataset for Joint 3D Reconstruction and Novel View Synthesis.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Detecting Anomalies in Textured Images Using Modified Transformer Masked Autoencoder.
Proceedings of the 19th International Joint Conference on Computer Vision, 2024

Anomaly Detection and Localization for Images of Running Paper Web in Paper Manufacturing.
Proceedings of the 19th International Joint Conference on Computer Vision, 2024

Synchformer: Efficient Synchronization From Sparse Cues.
Proceedings of the IEEE International Conference on Acoustics, 2024

Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera Motion.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
PanDepth: Joint Panoptic Segmentation and Depth Completion.
Proceedings of the 18th International Joint Conference on Computer Vision, 2023

IFMix: Utilizing Intermediate Filtered Images for Domain Adaptation in Classification.
Proceedings of the 18th International Joint Conference on Computer Vision, 2023

Visual Anomaly Detection and Localization with a Patch-Wise Transformer and Convolutional Model.
Proceedings of the 18th International Joint Conference on Computer Vision, 2023

LiDAR Place Recognition Evaluation with the Oxford Radar RobotCar Dataset Revised.
Proceedings of the Image Analysis - 22nd Scandinavian Conference, 2023

BS3D: Building-Scale 3D Reconstruction from RGB-D Images.
Proceedings of the Image Analysis - 22nd Scandinavian Conference, 2023

FinnWoodlands Dataset.
Proceedings of the Image Analysis - 22nd Scandinavian Conference, 2023

TAU-Indoors Dataset for Visual and LiDAR Place Recognition.
Proceedings of the Image Analysis - 22nd Scandinavian Conference, 2023

MSDA: Monocular Self-supervised Domain Adaptation for 6D Object Pose Estimation.
Proceedings of the Image Analysis - 22nd Scandinavian Conference, 2023

Region of Interest Enabled Learned Image Coding for Machines.
Proceedings of the 25th IEEE International Workshop on Multimedia Signal Processing, 2023

NN-VVC: Versatile Video Coding boosted by self-supervisedly learned image coding for machines.
Proceedings of the IEEE International Symposium on Multimedia, 2023

Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Momentum Adapt: Robust Unsupervised Adaptation for Improving Temporal Consistency in Video Semantic Segmentation During Test-Time.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
The Weighting Game: Evaluating Quality of Explainability Methods.
CoRR, 2022

HRF-Net: Holistic Radiance Fields from Sparse Inputs.
CoRR, 2022

Beyond Visual Field of View: Perceiving 3D Environment with Echoes and Vision.
CoRR, 2022

Fast Neural Architecture Search for Lightweight Dense Prediction Networks.
CoRR, 2022

FATALRead - Fooling visual speech recognition models.
Appl. Intell., 2022

Single Source One Shot Reenactment using Weighted Motion from Paired Feature Points.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

HybVIO: Pushing the Limits of Real-time Visual-inertial Odometry.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Lightweight Monocular Depth with a Novel Neural Architecture Search Method.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

V-SlowFast Network for Efficient Visual Sound Separation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Visually Guided Sound Source Separation and Localization using Self-Supervised Motion Representations.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

SemSegDepth: A Combined Model for Semantic Segmentation and Depth Completion.
Proceedings of the 17th International Joint Conference on Computer Vision, 2022

Evaluation of RGB and LiDAR Combination for Robust Place Recognition.
Proceedings of the 17th International Joint Conference on Computer Vision, 2022

Evaluation of Long-term Deep Visual Place Recognition.
Proceedings of the 17th International Joint Conference on Computer Vision, 2022

The Lottery Ticket Adaptation for Neural Video Coding.
Proceedings of the IEEE International Symposium on Multimedia, 2022

TPSAD: Learning to Detect and Localize Anomalies With Thin Plate Spline Transformation.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Long-term Visual Place Recognition.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

Bridging the Gap Between Image Coding for Machines and Humans.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

Online Panoptic 3D Reconstruction as a Linear Assignment Problem.
Proceedings of the Image Analysis and Processing - ICIAP 2022, 2022

Enhanced Data-Recalibration: Utilizing Validation Data to Mitigate Instance-Dependent Noise in Classification.
Proceedings of the Image Analysis and Processing - ICIAP 2022, 2022

AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Optimal Correction Cost for Object Detection Evaluation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

OVE6D: Object Viewpoint Encoding for Depth-based 6D Object Pose Estimation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

SingleDemoGrasp: Learning to Grasp From a Single Image Demonstration.
Proceedings of the 18th IEEE International Conference on Automation Science and Engineering, 2022

Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

A Practical Overview of Safety Concerns and Mitigation Methods for Visual Deep Learning Algorithms.
Proceedings of the Workshop on Artificial Intelligence Safety 2022 (SafeAI 2022) co-located with the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI2022), 2022

SC6D: Symmetry-agnostic and Correspondence-free 6D Object Pose Estimation.
Proceedings of the International Conference on 3D Vision, 2022

2021
Fully Automated DCNN-Based Thermal Images Annotation Using Neural Network Pretrained on RGB Data.
Sensors, 2021

FACEGAN: Facial Attribute Controllable rEenactment GAN.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

Effect of Label Noise on Robustness of Deep Neural Network Object Detectors.
Proceedings of the Computer Safety, Reliability, and Security. SAFECOMP 2021 Workshops, 2021

Supervised Fine-tuning Evaluation for Long-term Visual Place Recognition.
Proceedings of the 23rd International Workshop on Multimedia Signal Processing, 2021

Towards a Real-Time Facial Analysis System.
Proceedings of the 23rd International Workshop on Multimedia Signal Processing, 2021

Adaptation and Attention for Neural Video Coding.
Proceedings of the IEEE International Symposium on Multimedia, 2021

Enhancing Image Coding for Machines with Compressed Feature Residuals.
Proceedings of the IEEE International Symposium on Multimedia, 2021

Content-adaptive convolutional neural network post-processing filter.
Proceedings of the IEEE International Symposium on Multimedia, 2021

Learned Enhancement Filters for Image Coding for Machines.
Proceedings of the IEEE International Symposium on Multimedia, 2021

Evaluation of Long-term LiDAR Place Recognition.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Learned Image Coding for Machines: A Content-Adaptive Approach.
Proceedings of the 2021 IEEE International Conference on Multimedia and Expo, 2021

On the Importance of Encrypting Deep Features.
Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2021

Boosting Monocular Depth Estimation with Lightweight 3D Point Fusion.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Image Coding For Machines: an End-To-End Learned Approach.
Proceedings of the IEEE International Conference on Acoustics, 2021

Automatic Dataset Generation From CAD for Vision-Based Grasping.
Proceedings of the 20th International Conference on Advanced Robotics, 2021

FlipReID: Closing the Gap Between Training and Inference in Person Re-Identification.
Proceedings of the 9th European Workshop on Visual Information Processing, 2021

Selective Probabilistic Classifier Based on Hypothesis Testing.
Proceedings of the 9th European Workshop on Visual Information Processing, 2021

Sample Selection for Efficient Image Annotation.
Proceedings of the 9th European Workshop on Visual Information Processing, 2021

Leveraging Category Information for Single-Frame Visual Sound Source Separation.
Proceedings of the 9th European Workshop on Visual Information Processing, 2021

Learned Video Compression With Intra-Guided Enhancement and Implicit Motion Information.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Taming Visually Guided Sound Generation.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

RGBD-Net: Predicting Color and Depth Images for Novel Views Synthesis.
Proceedings of the International Conference on 3D Vision, 2021

Monocular Depth Estimation Primed by Salient Point Detection and Normalized Hessian Loss.
Proceedings of the International Conference on 3D Vision, 2021

2020
Automated Video Face Labelling for Films and TV Material.
IEEE Trans. Pattern Anal. Mach. Intell., 2020

Separating Sounds from a Single Image.
CoRR, 2020

Sequential Neural Rendering with Transformer.
CoRR, 2020

ICface: Interpretable and Controllable Face Reenactment Using GANs.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

L²C - Learning to Learn to Compress.
Proceedings of the 22nd IEEE International Workshop on Multimedia Signal Processing, 2020

Deep Learning Off-the-shelf Holistic Feature Descriptors for Visual Place Recognition in Challenging Conditions.
Proceedings of the 22nd IEEE International Workshop on Multimedia Signal Processing, 2020

Deep Audio-Visual Saliency: Baseline Model and Data.
Proceedings of the ETRA '20: 2020 Symposium on Eye Tracking Research and Applications, 2020

Guiding Monocular Depth Estimation Using Depth-Attention Volume.
Proceedings of the Computer Vision - ECCV 2020, 2020

End-to-End Learning for Video Frame Compression with Self-Attention.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Multi-modal Dense Video Captioning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Uncovering Hidden Challenges in Query-Based Video Moment Retrieval.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

A Better Use of Audio-Visual Cues: Dense Video Captioning with Bi-modal Transformer.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

Sequential View Synthesis with Transformer.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Visually Guided Sound Source Separation Using Cascaded Opponent Filter Network.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
DAVE: A Deep Audio-Visual Embedding for Dynamic Saliency Prediction.
CoRR, 2019

Multimodal Machine Learning-based Knee Osteoarthritis Progression Prediction from Plain Radiographs and Clinical Data.
CoRR, 2019

Digging Deeper Into Egocentric Gaze Prediction.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

DGC-Net: Dense Geometric Correspondence Network.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Predicting Novel Views Using Generative Adversarial Query Network.
Proceedings of the Image Analysis - 21st Scandinavian Conference, 2019

CIIDefence: Defeating Adversarial Attacks by Fusing Class-Specific Image Inpainting and Image Denoising.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

MLAttack: Fooling Semantic Segmentation Networks by Multi-layer Attacks.
Proceedings of the Pattern Recognition, 2019

Rethinking the Evaluation of Video Summaries.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
ADVIO: An Authentic Dataset for Visual-Inertial Odometry.
Dataset, July, 2018

Summarization of User-Generated Sports Video by Using Deep Action Recognition Features.
IEEE Trans. Multim., 2018

PIVO: Probabilistic Inertial-Visual Odometry for Occlusion-Robust Navigation.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

Real-time Human Pose Estimation with Convolutional Neural Networks.
Proceedings of the 13th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2018), 2018

Bottom-Up Attention Guidance for Recurrent Image Recognition.
Proceedings of the 2018 IEEE International Conference on Image Processing, 2018

Inertial Odometry on Handheld Smartphones.
Proceedings of the 21st International Conference on Information Fusion, 2018

ADVIO: An Authentic Dataset for Visual-Inertial Odometry.
Proceedings of the Computer Vision - ECCV 2018, 2018

Learning Image-to-Image Translation Using Paired and Unpaired Training Samples.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Exploiting inter-image similarity and ensemble of extreme learners for fixation prediction using deep features.
Neurocomputing, 2017

Automatic Knee Osteoarthritis Diagnosis from Plain Radiographs: A Deep Learning-Based Approach.
CoRR, 2017

Investigating Natural Image Pleasantness Recognition using Deep Features and Eye Tracking for Loosely Controlled Human-computer Interaction.
CoRR, 2017

Relative Camera Pose Estimation Using Convolutional Neural Networks.
CoRR, 2017

A Novel Method for Automatic Localization of Joint Area on Knee Plain Radiographs.
Proceedings of the Image Analysis - 20th Scandinavian Conference, 2017

Image-Based Localization Using Hourglass Networks.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Relative Camera Pose Estimation Using Convolutional Neural Networks.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2017

2016
Real-time Human Pose Estimation from Video with Convolutional Neural Networks.
CoRR, 2016

On the Contribution of Saliency in Visual Tracking.
Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2016), 2016

Siamese network features for image matching.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Robust loop closures for scene reconstruction by combining odometry and visual correspondences.
Proceedings of the 2016 IEEE International Conference on Image Processing, 2016

Learning Joint Representations of Videos and Sentences with Web Image Search.
Proceedings of the Computer Vision - ECCV 2016 Workshops, 2016

Video Summarization Using Deep Semantic Features.
Proceedings of the Computer Vision - ACCV 2016, 2016

Image Patch Matching Using Convolutional Descriptors with Euclidean Distance.
Proceedings of the Computer Vision - ACCV 2016 Workshops, 2016

2015
Adaptive Kalman filtering and smoothing for gravitation tracking in mobile systems.
Proceedings of the 2015 International Conference on Indoor Positioning and Indoor Navigation, 2015

Online Face Recognition System Based on Local Binary Patterns and Facial Landmark Tracking.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2015

2014
Total Cluster: A person agnostic clustering method for broadcast videos.
Proceedings of the 2014 Indian Conference on Computer Vision, 2014

Emotional Valence Recognition, Analysis of Salience and Eye Movements.
Proceedings of the 22nd International Conference on Pattern Recognition, 2014

Analysis of Sampling Techniques for Learning Binarized Statistical Image Features Using Fixations and Salience.
Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014

An O(n log n) Cutting Plane Algorithm for Structured Output Ranking.
Proceedings of the Pattern Recognition - 36th German Conference, 2014

Understanding Objects in Detail with Fine-Grained Attributes.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

Generating Object Segmentation Proposals Using Global and Local Search.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Local Phase Quantization for Blur Insensitive Texture Description.
Proceedings of the Local Binary Patterns: New Variants and Applications, 2013

Automatic Dynamic Texture Segmentation Using Local Descriptors and Optical Flow.
IEEE Trans. Image Process., 2013

Stochastic bottom-up fixation prediction and saccade generation.
Image Vis. Comput., 2013

Fine-Grained Visual Classification of Aircraft.
CoRR, 2013

Saliency Detection Using Joint Temporal and Spatial Decorrelation.
Proceedings of the Image Analysis, 18th Scandinavian Conference, 2013

Non Maximal Suppression in Cascaded Ranking Models.
Proceedings of the Image Analysis, 18th Scandinavian Conference, 2013

Spherical Center-Surround for Video Saliency Detection Using Sparse Sampling.
Proceedings of the Advanced Concepts for Intelligent Vision Systems, 2013

2012
Local phase quantization for blur-insensitive image analysis.
Image Vis. Comput., 2012

BSIF: Binarized statistical image features.
Proceedings of the 21st International Conference on Pattern Recognition, 2012

TriCoS: A Tri-level Class-Discriminative Co-segmentation Method for Image Classification.
Proceedings of the Computer Vision - ECCV 2012, 2012

Temporal Saliency for Fast Motion Detection.
Proceedings of the Computer Vision - ACCV 2012 Workshops, 2012

2011
Fast and Efficient Saliency Detection Using Sparse Sampling and Kernel Density Estimation.
Proceedings of the Image Analysis - 17th Scandinavian Conference, 2011

Watermark Recovery from a Dual Layer Hologram with a Digital Camera.
Proceedings of the Image Analysis - 17th Scandinavian Conference, 2011

Volume Local Phase Quantization for Blur-Insensitive Dynamic Texture Classification.
Proceedings of the Image Analysis - 17th Scandinavian Conference, 2011

Real-Time Detection of Landscape Scenes.
Proceedings of the Image Analysis - 17th Scandinavian Conference, 2011

Learning a category independent object detection cascade.
Proceedings of the IEEE International Conference on Computer Vision, 2011

2010
The Structural Form in Image Categorization.
Proceedings of VISAPP 2010 - Fifth International Conference on Computer Vision Theory and Applications, Angers, France, May 17-21, 2010

Compressing Sparse Feature Vectors Using Random Ortho-Projections.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Improved Blur Insensitivity for Decorrelated Local Phase Quantization.
Proceedings of the 20th International Conference on Pattern Recognition, 2010

Segmenting Salient Objects from Images and Videos.
Proceedings of the Computer Vision - ECCV 2010, 2010

2009
Dense and Deformable Motion Segmentation for Wide Baseline Images.
Proceedings of the Image Analysis, 16th Scandinavian Conference, 2009

Applying Visual Object Categorization and Memory Colors for Automatic Color Constancy.
Proceedings of the Image Analysis and Processing, 2009

A Simple and efficient saliency detector for background subtraction.
Proceedings of the 12th IEEE International Conference on Computer Vision Workshops, 2009

2008
Rotation invariant local phase quantization for blur insensitive texture analysis.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Recognition of blurred faces using Local Phase Quantization.
Proceedings of the 19th International Conference on Pattern Recognition (ICPR 2008), 2008

Object recognition and segmentation by non-rigid quasi-dense matching.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
Nonlinear Functionals in the Construction of Multiscale Affine Invariants.
Proceedings of the Image Analysis, 15th Scandinavian Conference, 2007

2006
A New Convexity Measure Based on a Probabilistic Interpretation of Images.
IEEE Trans. Pattern Anal. Mach. Intell., 2006

Generalized affine moment invariants for object recognition.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006

Properties of Patch Based Approaches for the Recognition of Visual Object Classes.
Proceedings of the Pattern Recognition, 2006

Multiscale Autoconvolution Histograms for Affine Invariant Pattern Recognition.
Proceedings of the British Machine Vision Conference 2006, 2006

A New Affine Invariant Image Transform Based on Ridgelets.
Proceedings of the British Machine Vision Conference 2006, 2006

2005
Affine Invariant Pattern Recognition Using Multiscale Autoconvolution.
IEEE Trans. Pattern Anal. Mach. Intell., 2005

A New Method for Affine Registration of Images and Point Sets.
Proceedings of the Image Analysis, 14th Scandinavian Conference, 2005

Affine registration with multi-scale autoconvolution.
Proceedings of the 2005 International Conference on Image Processing, 2005

A New Efficient Method for Producing Global Affine Invariants.
Proceedings of the Image Analysis and Processing, 2005

2004
Convexity Recognition Using Multi-Scale Autoconvolution.
Proceedings of the 17th International Conference on Pattern Recognition, 2004

Object Classification with Multi-Scale Autoconvolution.
Proceedings of the 17th International Conference on Pattern Recognition, 2004
