2025
Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation.
CoRR, April, 2025

FacaDiffy: Inpainting Unseen Facade Parts Using Diffusion Models.
CoRR, February, 2025

2024
Appearance-Based Refinement for Object-Centric Motion Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Shap-Editor: Instruction-guided Latent 3D Editing in Seconds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Moving Object Segmentation: All You Need is SAM (and Flow).
Proceedings of the Computer Vision - ACCV 2024, 2024

AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description.
Proceedings of the Computer Vision - ACCV 2024, 2024

2023
Robot Orientation Learning Based on Interaction Primitives for Human-Robot Collaboration.
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2023

2022
Segmenting Moving Objects via an Object-Centric Layered Representation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Combined On-line Variable Speed Limit and Ramp Metering Control for Highway Bottleneck.
Proceedings of the IEEE International Conference on Networking, Sensing and Control, 2022