Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation.
CoRR, April, 2025
FacaDiffy: Inpainting Unseen Facade Parts Using Diffusion Models.
CoRR, February, 2025
Appearance-Based Refinement for Object-Centric Motion Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024
Shap-Editor: Instruction-guided Latent 3D Editing in Seconds.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Moving Object Segmentation: All You Need is SAM (and Flow).
Proceedings of the Computer Vision - ACCV 2024, 2024
AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description.
Proceedings of the Computer Vision - ACCV 2024, 2024
Robot Orientation Learning Based on Interaction Primitives for Human-Robot Collaboration.
Proceedings of the IEEE International Conference on Real-time Computing and Robotics, 2023
Segmenting Moving Objects via an Object-Centric Layered Representation.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022
Combined On-line Variable Speed Limit and Ramp Metering Control for Highway Bottleneck.
Proceedings of the IEEE International Conference on Networking, Sensing and Control, 2022