Anelia Angelova

Proceedings of the Computer Vision - ECCV 2024, 2024

Mirasol3B: A Multimodal Autoregressive Model for Time-Aligned and Contextual Modalities.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

On Scaling Up a Multilingual Vision and Language Model.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023

MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks.

[BibT_eX]

[DOI]

Trans. Mach. Learn. Res., 2023

Detection-Oriented Image-Text Pretraining for Open-Vocabulary Detection.

[BibT_eX]

[DOI]

CoRR, 2023

Diversifying Joint Vision-Language Tokenization Learning.

[BibT_eX]

[DOI]

Vardaan Pahuja

CoRR, 2023

Joint Adaptive Representations for Image-Language Learning.

[BibT_eX]

[DOI]

CoRR, 2023

PaLI-X: On Scaling up a Multilingual Vision and Language Model.

[BibT_eX]

[DOI]

CoRR, 2023

Open-Vocabulary Object Detection upon Frozen Vision and Language Models.

[BibT_eX]

[DOI]

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Contrastive Feature Masking Open-Vocabulary Vision Transformer.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers.

[BibT_eX]

[DOI]

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Learning Open-World Object Proposals Without Learning to Classify.

[BibT_eX]

[DOI]

IEEE Robotics Autom. Lett., 2022

F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models.

[BibT_eX]

[DOI]

CoRR, 2022

PaLI: A Jointly-Scaled Multilingual Language-Image Model.

[BibT_eX]

[DOI]

CoRR, 2022

Pre-training image-language transformers for open-vocabulary tasks.

[BibT_eX]

[DOI]

CoRR, 2022

Answer-Me: Multi-Task Open-Vocabulary Visual Question Answering.

[BibT_eX]

[DOI]

CoRR, 2022

Mechanical Search on Shelves with Efficient Stacking and Destacking of Objects.

[BibT_eX]

[DOI]

Proceedings of the Robotics Research, 2022

Mechanical Search on Shelves using a Novel "Bluction" Tool.

[BibT_eX]

[DOI]

Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Video Question Answering with Iterative Video-Text Co-tokenization.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

FindIt: Generalized Localization with Natural Language Queries.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2022, 2022

2021

TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?

[BibT_eX]

[DOI]

CoRR, 2021

Unsupervised Action Segmentation for Instructional Videos.

[BibT_eX]

[DOI]

CoRR, 2021

TokenLearner: Adaptive Space-Time Tokenization for Videos.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Mechanical Search on Shelves using Lateral Access X-RAY.

[BibT_eX]

[DOI]

Huang Huang

Marcus Dominguez-Kuhne

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Visionary: Vision architecture discovery for robot learning.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Robotics and Automation, 2021

4D-Net for Learned Multi-Modal Alignment.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval from a Single Image.

[BibT_eX]

[DOI]

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

SMURF: Self-Teaching Multi-Frame Unsupervised RAFT With Full-Image Warping.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Taskology: Utilizing Task Relations at Scale.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Adaptive Intermediate Representations for Video Understanding.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Unsupervised Discovery of Actions in Instructional Videos.

[BibT_eX]

[DOI]

Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020

Special Issue on Deep Learning for Robotic Vision.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2020

Improving Semantic Segmentation through Spatio-Temporal Consistency Learned from Videos.

[BibT_eX]

[DOI]

CoRR, 2020

Probabilistic Object Detection: Definition and Evaluation.

[BibT_eX]

[DOI]

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

X-Ray: Mechanical Search for an Occluded Object by Minimizing Support of Learned Occupancy Distributions.

[BibT_eX]

[DOI]

Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Differentiable Mapping Networks: Learning Structured Map Representations for Sparse Visual Localization.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures.

[BibT_eX]

[DOI]

Proceedings of the 8th International Conference on Learning Representations, 2020

AttentionNAS: Spatiotemporal Attention Cell Search for Video Classification.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

AssembleNet++: Assembling Modality Representations via Attention Connections.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Adversarial Generative Grammars for Human Activity Prediction.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

What Matters in Unsupervised Optical Flow.

[BibT_eX]

[DOI]

Proceedings of the Computer Vision - ECCV 2020, 2020

Evolving Losses for Unsupervised Video Representation Learning.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

KeyPose: Multi-View 3D Labeling and Keypoint Estimation for Transparent Objects.

[BibT_eX]

[DOI]

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Unsupervised Monocular Depth Learning in Dynamic Scenes.

[BibT_eX]

[DOI]

Proceedings of the 4th Conference on Robot Learning, 2020

Differentiable Grammars for Videos.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019

SPIN: A High Speed, High Resolution Vision Dataset for Tracking and Action Recognition in Ping Pong.

[BibT_eX]

[DOI]

CoRR, 2019

Tiny Video Networks.

[BibT_eX]

[DOI]

CoRR, 2019

Evolving Losses for Unlabeled Video Representation Learning.

[BibT_eX]

[DOI]

CoRR, 2019

Learning Differentiable Grammars for Continuous Data.

[BibT_eX]

[DOI]

CoRR, 2019

Evolving Space-Time Neural Architectures for Videos.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Depth From Videos in the Wild: Unsupervised Monocular Depth Learning From Unknown Cameras.

[BibT_eX]

[DOI]

Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

OnboardDepth: Depth Prediction for Onboard Systems.

[BibT_eX]

[DOI]

Proceedings of the 2019 European Conference on Mobile Robots, 2019

Unsupervised Monocular Depth and Ego-Motion Learning With Structure and Semantics.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Depth Prediction without the Sensors: Leveraging Structure for Unsupervised Learning from Monocular Videos.

[BibT_eX]

[DOI]

Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018

Future Segmentation Using 3D Structure.

[BibT_eX]

[DOI]

CoRR, 2018

Object category learning and retrieval with weak supervision.

[BibT_eX]

[DOI]

CoRR, 2018

Unsupervised Learning of Depth and Ego-Motion From Monocular Video Using 3D Geometric Constraints.

[BibT_eX]

[DOI]

Reza Mahjourian

Martin Wicke

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017

Geometry-based next frame prediction from monocular video.

[BibT_eX]

[DOI]

Reza Mahjourian

Martin Wicke

Proceedings of the IEEE Intelligent Vehicles Symposium, 2017

Learning with proxy supervision for end-to-end visual learning.

[BibT_eX]

[DOI]

Jiri Cermak

Proceedings of the IEEE Intelligent Vehicles Symposium, 2017

Deep Value Networks Learn to Evaluate and Iteratively Refine Structured Outputs.

[BibT_eX]

[DOI]

Michael Gygli

Mohammad Norouzi

Proceedings of the 34th International Conference on Machine Learning, 2017

2016

Improved generator objectives for GANs.

[BibT_eX]

[DOI]

Ben Poole

Alexander A. Alemi

Jascha Sohl-Dickstein

CoRR, 2016

2015

Object Recognition from Short Videos for Robotic Perception.

[BibT_eX]

[DOI]

Ivan Bogun

Navdeep Jaitly

CoRR, 2015

Real-time grasp detection using convolutional neural networks.

[BibT_eX]

[DOI]

Joseph Redmon

Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Pedestrian detection with a Large-Field-Of-View deep network.

[BibT_eX]

[DOI]

Alex Krizhevsky

Vincent Vanhoucke

Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Real-Time Pedestrian Detection with Deep Network Cascades.

[BibT_eX]

[DOI]

Proceedings of the British Machine Vision Conference 2015, 2015

2014

Generalized feature learning and indexing for object localization and recognition.

[BibT_eX]

[DOI]

Ning Zhou

Jianping Fan

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Feature combination with Multi-Kernel Learning for fine-grained visual classification.

[BibT_eX]

[DOI]

Alexandru Niculescu-Mizil

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Benchmarking large-scale Fine-Grained Categorization.

[BibT_eX]

[DOI]

Philip M. Long

Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

2013

Image segmentation for large-scale subcategory flower recognition.

[BibT_eX]

[DOI]

Shenghuo Zhu

Yuanqing Lin

Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision, 2013

Efficient Object Detection and Segmentation for Fine-Grained Recognition.

[BibT_eX]

[DOI]

Shenghuo Zhu

Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2009

Terrain Adaptive Navigation for planetary rovers.

[BibT_eX]

[DOI]

Daniel M. Helmick

Larry H. Matthies

J. Field Robotics, 2009

2008

Visual Prediction of Rover Slip: Learning Algorithms and Field Experiments.

[BibT_eX]

[DOI]

PhD thesis, 2008

2007

Learning and prediction of slip from visual information.

[BibT_eX]

[DOI]

J. Field Robotics, 2007

Computer Vision on Mars.

[BibT_eX]

[DOI]

Int. J. Comput. Vis., 2007

Dimensionality Reduction Using Automatic Supervision for Vision-Based Terrain Learning.

[BibT_eX]

[DOI]

Proceedings of the Robotics: Science and Systems III, 2007

Learning slip behavior using automatic mechanical supervision.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

Fast Terrain Classification Using Variable-Length Representation for Autonomous Navigation.

[BibT_eX]

[DOI]

Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006

Towards learned traversability for robot navigation: From underfoot to the far field.

[BibT_eX]

[DOI]

J. Field Robotics, 2006

Slip Prediction Using Visual Information.

[BibT_eX]

[DOI]

Proceedings of the Robotics: Science and Systems II, 2006

Learning to Predict Slip for Ground Robots.

[BibT_eX]

[DOI]

Proceedings of the 2006 IEEE International Conference on Robotics and Automation, 2006

2005

Pruning Training Sets for Learning of Object Categories.

[BibT_eX]

[DOI]