Anelia Angelova

Orcid: 0000-0003-1822-7943

According to our database1, Anelia Angelova authored at least 84 papers between 2005 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation.
CoRR, 2024

Mirasol3B: A Multimodal Autoregressive Model for Time-Aligned and Contextual Modalities.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024


2023
MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks.
Trans. Mach. Learn. Res., 2023

Detection-Oriented Image-Text Pretraining for Open-Vocabulary Detection.
CoRR, 2023

Diversifying Joint Vision-Language Tokenization Learning.
CoRR, 2023

Joint Adaptive Representations for Image-Language Learning.
CoRR, 2023

PaLI-X: On Scaling up a Multilingual Vision and Language Model.
CoRR, 2023

Open-Vocabulary Object Detection upon Frozen Vision and Language Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Contrastive Feature Masking Open-Vocabulary Vision Transformer.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Learning Open-World Object Proposals Without Learning to Classify.
IEEE Robotics Autom. Lett., 2022

F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models.
CoRR, 2022

PaLI: A Jointly-Scaled Multilingual Language-Image Model.
CoRR, 2022

Pre-training image-language transformers for open-vocabulary tasks.
CoRR, 2022

Answer-Me: Multi-Task Open-Vocabulary Visual Question Answering.
CoRR, 2022

Mechanical Search on Shelves with Efficient Stacking and Destacking of Objects.
Proceedings of the Robotics Research, 2022

Mechanical Search on Shelves using a Novel "Bluction" Tool.
Proceedings of the 2022 International Conference on Robotics and Automation, 2022

Video Question Answering with Iterative Video-Text Co-tokenization.
Proceedings of the Computer Vision - ECCV 2022, 2022

FindIt: Generalized Localization with Natural Language Queries.
Proceedings of the Computer Vision - ECCV 2022, 2022

2021
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?
CoRR, 2021

Unsupervised Action Segmentation for Instructional Videos.
CoRR, 2021

TokenLearner: Adaptive Space-Time Tokenization for Videos.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Mechanical Search on Shelves using Lateral Access X-RAY.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021

Visionary: Vision architecture discovery for robot learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2021

4D-Net for Learned Multi-Modal Alignment.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval from a Single Image.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

SMURF: Self-Teaching Multi-Frame Unsupervised RAFT With Full-Image Warping.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Taskology: Utilizing Task Relations at Scale.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Adaptive Intermediate Representations for Video Understanding.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2021

Unsupervised Discovery of Actions in Instructional Videos.
Proceedings of the 32nd British Machine Vision Conference 2021, 2021

2020
Special Issue on Deep Learning for Robotic Vision.
Int. J. Comput. Vis., 2020

Improving Semantic Segmentation through Spatio-Temporal Consistency Learned from Videos.
CoRR, 2020

Probabilistic Object Detection: Definition and Evaluation.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

X-Ray: Mechanical Search for an Occluded Object by Minimizing Support of Learned Occupancy Distributions.
Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020

Differentiable Mapping Networks: Learning Structured Map Representations for Sparse Visual Localization.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

AssembleNet: Searching for Multi-Stream Neural Connectivity in Video Architectures.
Proceedings of the 8th International Conference on Learning Representations, 2020

AttentionNAS: Spatiotemporal Attention Cell Search for Video Classification.
Proceedings of the Computer Vision - ECCV 2020, 2020

AssembleNet++: Assembling Modality Representations via Attention Connections.
Proceedings of the Computer Vision - ECCV 2020, 2020

Adversarial Generative Grammars for Human Activity Prediction.
Proceedings of the Computer Vision - ECCV 2020, 2020

Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve.
Proceedings of the Computer Vision - ECCV 2020, 2020

What Matters in Unsupervised Optical Flow.
Proceedings of the Computer Vision - ECCV 2020, 2020

Evolving Losses for Unsupervised Video Representation Learning.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

KeyPose: Multi-View 3D Labeling and Keypoint Estimation for Transparent Objects.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Unsupervised Monocular Depth Learning in Dynamic Scenes.
Proceedings of the 4th Conference on Robot Learning, 2020

Differentiable Grammars for Videos.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
SPIN: A High Speed, High Resolution Vision Dataset for Tracking and Action Recognition in Ping Pong.
CoRR, 2019

Tiny Video Networks.
CoRR, 2019

Evolving Losses for Unlabeled Video Representation Learning.
CoRR, 2019

Learning Differentiable Grammars for Continuous Data.
CoRR, 2019

Evolving Space-Time Neural Architectures for Videos.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Depth From Videos in the Wild: Unsupervised Monocular Depth Learning From Unknown Cameras.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

OnboardDepth: Depth Prediction for Onboard Systems.
Proceedings of the 2019 European Conference on Mobile Robots, 2019

Unsupervised Monocular Depth and Ego-Motion Learning With Structure and Semantics.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

Depth Prediction without the Sensors: Leveraging Structure for Unsupervised Learning from Monocular Videos.
Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, 2019

2018
Future Segmentation Using 3D Structure.
CoRR, 2018

Object category learning and retrieval with weak supervision.
CoRR, 2018

Unsupervised Learning of Depth and Ego-Motion From Monocular Video Using 3D Geometric Constraints.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Geometry-based next frame prediction from monocular video.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2017

Learning with proxy supervision for end-to-end visual learning.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2017

Deep Value Networks Learn to Evaluate and Iteratively Refine Structured Outputs.
Proceedings of the 34th International Conference on Machine Learning, 2017

2016
Improved generator objectives for GANs.
CoRR, 2016

2015
Object Recognition from Short Videos for Robotic Perception.
CoRR, 2015

Real-time grasp detection using convolutional neural networks.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Pedestrian detection with a Large-Field-Of-View deep network.
Proceedings of the IEEE International Conference on Robotics and Automation, 2015

Real-Time Pedestrian Detection with Deep Network Cascades.
Proceedings of the British Machine Vision Conference 2015, 2015

2014
Generalized feature learning and indexing for object localization and recognition.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Feature combination with Multi-Kernel Learning for fine-grained visual classification.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Benchmarking large-scale Fine-Grained Categorization.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

2013
Image segmentation for large-scale subcategory flower recognition.
Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision, 2013

Efficient Object Detection and Segmentation for Fine-Grained Recognition.
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013

2009
Terrain Adaptive Navigation for planetary rovers.
J. Field Robotics, 2009

2008
Visual Prediction of Rover Slip: Learning Algorithms and Field Experiments.
PhD thesis, 2008

2007
Learning and prediction of slip from visual information.
J. Field Robotics, 2007

Computer Vision on Mars.
Int. J. Comput. Vis., 2007

Dimensionality Reduction Using Automatic Supervision for Vision-Based Terrain Learning.
Proceedings of the Robotics: Science and Systems III, 2007

Learning slip behavior using automatic mechanical supervision.
Proceedings of the 2007 IEEE International Conference on Robotics and Automation, 2007

Fast Terrain Classification Using Variable-Length Representation for Autonomous Navigation.
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007

2006
Towards learned traversability for robot navigation: From underfoot to the far field.
J. Field Robotics, 2006

Slip Prediction Using Visual Information.
Proceedings of the Robotics: Science and Systems II, 2006

Learning to Predict Slip for Ground Robots.
Proceedings of the 2006 IEEE International Conference on Robotics and Automation, 2006

2005
Pruning Training Sets for Learning of Object Categories.
Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2005), 2005


  Loading...