Zsolt Kira

Orcid: 0000-0002-2626-2004

According to our database1, Zsolt Kira authored at least 122 papers between 2004 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Continual Diffusion: Continual Customization of Text-to-Image Diffusion with C-LoRA.
Trans. Mach. Learn. Res., 2024

Rethinking Weight Decay for Robust Fine-Tuning of Foundation Models.
CoRR, 2024

Neural Fields in Robotics: A Survey.
CoRR, 2024

ReLIC: A Recipe for 64k Steps of In-Context Reinforcement Learning for Embodied AI.
CoRR, 2024

Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge.
CoRR, 2024

ICE-G: Image Conditional Editing of 3D Gaussian Splats.
CoRR, 2024

Grounding Multimodal Large Language Models in Actions.
CoRR, 2024

Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control.
CoRR, 2024

Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

LatentDR: Improving Model Generalization Through Sample-Aware Latent Degradation and Restoration.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

FSD: Fast Self-Supervised Single RGB-D to Categorical 3D Objects.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

N-QR: Natural Quick Response Codes for Multi-Robot Instance Correspondence.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024


NeRF-MAE: Masked AutoEncoders for Self-supervised 3D Representation Learning for Neural Radiance Fields.
Proceedings of the Computer Vision - ECCV 2024, 2024

Reinforcement Learning via Auxiliary Task Distillation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Adaptive Memory Replay for Continual Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Continual Diffusion with STAMINA: STack-And-Mask INcremental Adapters.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Seeing the Unseen: Visual Common Sense for Semantic Placement.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis.
CoRR, 2023

Memory in Plain Sight: A Survey of the Uncanny Resemblances between Diffusion Models and Associative Memories.
CoRR, 2023

HePCo: Data-Free Heterogeneous Prompt Consolidation for Continual Federated Learning.
CoRR, 2023

CLIP-GCD: Simple Language Guided Generalized Category Discovery.
CoRR, 2023

We Need to Talk: Identifying and Overcoming Communication-Critical Scenarios for Self-Driving.
CoRR, 2023

OVRL-V2: A simple state-of-art baseline for ImageNav and ObjectNav.
CoRR, 2023

Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Fast Trainable Projection for Robust Fine-tuning.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Training Energy-Based Normalizing Flow with Score-Matching Objectives.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ConstraintMatch for Semi-constrained Clustering.
Proceedings of the International Joint Conference on Neural Networks, 2023

Communication-Critical Planning via Multi-Agent Trajectory Exchange.
Proceedings of the IEEE International Conference on Robotics and Automation, 2023

Adaptive Coordination in Social Embodied Rearrangement.
Proceedings of the International Conference on Machine Learning, 2023

BC-IRL: Learning Generalizable Reward Functions from Demonstrations.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

NeO 360: Neural Fields for Sparse View Synthesis of Outdoor Scenes.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Trainable Projected Gradient Method for Robust Fine-Tuning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

A Closer Look at Rehearsal-Free Continual Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

CODA-Prompt: COntinual Decomposed Attention-Based Prompting for Rehearsal-Free Continual Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

ConStruct-VL: Data-Free Continual Structured VL Concepts Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

HAAV: Hierarchical Aggregation of Augmented Views for Image Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023


2022
Biological underpinnings for lifelong learning machines.
Nat. Mach. Intell., 2022

FedFOR: Stateless Heterogeneous Federated Learning with First-Order Regularization.
CoRR, 2022

On the Surprising Effectiveness of Transformers in Low-Labeled Video Recognition.
CoRR, 2022

Lifelong Wandering: A realistic few-shot online continual learning setting.
CoRR, 2022

A Closer Look at Rehearsal-Free Continual Learning.
CoRR, 2022

A Closer Look at Knowledge Distillation with Features, Logits, and Gradients.
CoRR, 2022

Polyhistor: Parameter-Efficient Multi-Task Adaptation for Dense Vision Tasks.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Striking the Right Balance: Recall Loss for Semantic Segmentation.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

CenterSnap: Single-Shot Multi-Object 3D Shape Reconstruction and Categorical 6D Pose and Size Estimation.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Open-Set Semi-Supervised Object Detection.
Proceedings of the Computer Vision - ECCV 2022, 2022

ShAPO: Implicit Representations for Multi-object Shape, Appearance, and Pose Optimization.
Proceedings of the Computer Vision - ECCV 2022, 2022

Unbiased Teacher v2: Semi-supervised Object Detection for Anchor-free and Anchor-based Detectors.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Beyond a Pre-Trained Object Detector: Cross-Modal Textual and Visual Context for Image Captioning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022


System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games.
Proceedings of the Second International Conference on AI-ML Systems, 2022

2021
LRGNet: Learnable Region Growing for Class-Agnostic Point Cloud Segmentation.
IEEE Robotics Autom. Lett., 2021

Exploring Covariate and Concept Shift for Detection and Calibration of Out-of-Distribution Data.
CoRR, 2021

Safe Model-Based Reinforcement Learning Using Robust Control Barrier Functions.
CoRR, 2021

Enhancing Multi-Robot Perception via Learned Data Association.
CoRR, 2021

A Geometric Perspective towards Neural Calibration via Sensitivity Decomposition.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Habitat 2.0: Training Home Assistants to Rearrange their Habitat.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Overcoming Obstructions via Bandwidth-Limited Multi-Agent Spatial Handshaking.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Memory-Efficient Semi-Supervised Continual Learning: The World is its Own Replay Buffer.
Proceedings of the International Joint Conference on Neural Networks, 2021

Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

Unbiased Teacher for Semi-Supervised Object Detection.
Proceedings of the 9th International Conference on Learning Representations, 2021

Always Be Dreaming: A New Approach for Data-Free Class-Incremental Learning.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
3D for Free: Crossmodal Transfer Learning using HD Maps.
CoRR, 2020

Frustratingly Simple Domain Generalization via Image Stylization.
CoRR, 2020

Posterior Re-calibration for Imbalanced Datasets.
Proceedings of the Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, 2020

UNO: Uncertainty-aware Noisy-Or Multimodal Fusion for Unanticipated Input Degradation.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Who2com: Collaborative Perception via Learnable Handshake Communication.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Learning to Generate Grounded Visual Captions Without Localization Supervision.
Proceedings of the Computer Vision - ECCV 2020, 2020

FeatMatch: Feature-Based Augmentation for Semi-supervised Learning.
Proceedings of the Computer Vision - ECCV 2020, 2020

When2com: Multi-Agent Perception via Communication Graph Grouping.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Generalized ODIN: Detecting Out-of-Distribution Image Without Learning From Out-of-Distribution Data.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Action Segmentation With Joint Self-Supervised Temporal Domain Adaptation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Path Ranking with Attention to Type Hierarchies.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
TS-LSTM and temporal-inception: Exploiting spatiotemporal dynamics for activity recognition.
Signal Process. Image Commun., 2019

Multi-View Incremental Segmentation of 3-D Point Clouds for Mobile Robots.
IEEE Robotics Autom. Lett., 2019

Deep Learning Approach to Point Cloud Scene Understanding for Automated Scan to 3D Reconstruction.
J. Comput. Civ. Eng., 2019

Manifold Graph with Learned Prototypes for Semi-Supervised Image Classification.
CoRR, 2019

Learning to Generate Grounded Image Captions without Localization Supervision.
CoRR, 2019

Leveraging Semantics for Incremental Learning in Multi-Relational Embeddings.
CoRR, 2019

Temporal Attentive Alignment for Video Domain Adaptation.
CoRR, 2019

Unsupervised Continual Learning and Self-Taught Associative Memory Hierarchies.
CoRR, 2019

Multi-view Incremental Segmentation of 3D Point Clouds for Mobile Robots.
CoRR, 2019

Data-Efficient Graph Embedding Learning for PCB Component Detection.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

RoboCSE: Robot Common Sense Embedding.
Proceedings of the International Conference on Robotics and Automation, 2019

Self-Monitoring Navigation Agent via Auxiliary Progress Estimation.
Proceedings of the 7th International Conference on Learning Representations, 2019

Multi-class classification without multi-class labels.
Proceedings of the 7th International Conference on Learning Representations, 2019

A Closer Look at Few-shot Classification.
Proceedings of the 7th International Conference on Learning Representations, 2019

Temporal Attentive Alignment for Large-Scale Video Domain Adaptation.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

The Regretful Agent: Heuristic-Aided Navigation Through Progress Estimation.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Re-evaluating Continual Learning Scenarios: A Categorization and Case for Strong Baselines.
CoRR, 2018

A probabilistic constrained clustering for transfer learning and image category discovery.
CoRR, 2018

Learning to Cluster for Proposal-Free Instance Segmentation.
Proceedings of the 2018 International Joint Conference on Neural Networks, 2018

Deep Reinforcement Learning Methods for Navigational Aids.
Proceedings of the Smart Multimedia - First International Conference, 2018

Learning to cluster in order to transfer across domains and tasks.
Proceedings of the 6th International Conference on Learning Representations, 2018

Attend and Interact: Higher-Order Object Interactions for Video Understanding.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Grounded Objects and Interactions for Video Captioning.
CoRR, 2017

How to Train Your DRAGAN.
CoRR, 2017

2016
Deep Image Category Discovery using a Transferred Similarity Function.
CoRR, 2016

Fusing LIDAR and images for pedestrian detection using convolutional neural networks.
Proceedings of the 2016 IEEE International Conference on Robotics and Automation, 2016

A Continuous Optimization Approach for Efficient and Accurate Scene Flow.
Proceedings of the Computer Vision - ECCV 2016, 2016

2015
Neural network-based clustering using pairwise constraints.
CoRR, 2015

An evaluation of features for classifier transfer during target handoff across aerial and ground robots.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015

2014
Transfer of sparse coding representations and object classifiers across heterogeneous robots.
Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2014

Eliminating conditionally independent sets in factor graphs: A unifying perspective based on smart factors.
Proceedings of the 2014 IEEE International Conference on Robotics and Automation, 2014

Mining Structure Fragments for Smart Bundle Adjustment.
Proceedings of the British Machine Vision Conference, 2014

2013
R-MASTIF: robotic mobile autonomous system for threat interrogation and object fetch.
Proceedings of the Intelligent Robots and Computer Vision XXX: Algorithms and Techniques, 2013

2012
Multi-modal pedestrian detection on the move.
Proceedings of the 2012 IEEE International Conference on Technologies for Practical Robot Applications, 2012

Long-Range Pedestrian Detection using stereo and a cascade of convolutional network classifiers.
Proceedings of the 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2012

Detecting leadership and cohesion in spoken interactions.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Unsupervised topic modeling for leader detection in spoken discourse.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

2010
Inter-robot transfer learning for perceptual classification.
Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2010), 2010

2009
Transferring embodied concepts between perceptually heterogeneous robots.
Proceedings of the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009

Exerting human control over decentralized robot swarms.
Proceedings of the 4th International Conference on Autonomous Robots and Agents, 2009

Mapping Grounded Object Properties across Perceptually Heterogeneous Embodiments.
Proceedings of the Twenty-Second International Florida Artificial Intelligence Research Society Conference, 2009

2007
Modeling cross-sensory and sensorimotor correlations to detect and localize faults in mobile robots.
Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

2006
Continuous and Embedded Learning for Multi-Agent Systems.
Proceedings of the 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006

2004
Forgetting bad behavior: memory for case-based navigation.
Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004


  Loading...