Yusuf Aytar

According to our database1, Yusuf Aytar authored at least 46 papers between 2007 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation.
Trans. Mach. Learn. Res., 2024

OVR: A Dataset for Open Vocabulary Temporal Repetition Counting in Videos.
CoRR, 2024

Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models.
CoRR, 2024

FlexCap: Generating Rich, Localized, and Flexible Captions in Images.
CoRR, 2024

RoboTAP: Tracking Arbitrary Points for Few-Shot Visual Imitation.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024


Learning from One Continuous Video Stream.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation.
CoRR, 2023

Perception Test: A Diagnostic Benchmark for Multimodal Video Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Lossless Adaptation of Pretrained Vision Models For Robotic Manipulation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

2022
TAP-Vid: A Benchmark for Tracking Any Point in a Video.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

Learning transferable motor skills with hierarchical latent mixture policies.
Proceedings of the Tenth International Conference on Learning Representations, 2022

Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation.
Proceedings of the Tenth International Conference on Learning Representations, 2022

2021
Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

Manipulator-Independent Representations for Visual Imitation.
Proceedings of the Robotics: Science and Systems XVII, Virtual Event, July 12-16, 2021., 2021

With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Semi-supervised reward learning for offline reinforcement learning.
CoRR, 2020

Offline Learning from Demonstrations and Unlabeled Experience.
CoRR, 2020

Large-scale multilingual audio visual dubbing.
CoRR, 2020

Scaling data-driven robotics with reward sketching and batch reinforcement learning.
Proceedings of the Robotics: Science and Systems XVI, 2020

Self-Supervised Sim-to-Real Adaptation for Visual Robotic Manipulation.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020

Counting Out Time: Class Agnostic Video Repetition Counting in the Wild.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Learning rich touch representations through cross-modal self-supervision.
Proceedings of the 4th Conference on Robot Learning, 2020

2019
A Framework for Data-Driven Robotics.
CoRR, 2019

Temporal Cycle-Consistency Learning.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Cross-Modal Scene Networks.
IEEE Trans. Pattern Anal. Mach. Intell., 2018

One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL.
CoRR, 2018

Playing hard exploration games by watching YouTube.
Proceedings of the Advances in Neural Information Processing Systems 31: Annual Conference on Neural Information Processing Systems 2018, 2018

2017
See, Hear, and Read: Deep Aligned Representations.
CoRR, 2017

Is Saki #delicious?: The Food Perception Gap on Instagram and Its Relation to Health.
Proceedings of the 26th International Conference on World Wide Web, 2017

Face-to-BMI: Using Computer Vision to Infer Body Mass Index on Social Media.
Proceedings of the Eleventh International Conference on Web and Social Media, 2017

Exploiting Convolution Filter Patterns for Transfer Learning.
Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops, 2017

Learning Cross-Modal Embeddings for Cooking Recipes and Food Images.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
SoundNet: Learning Sound Representations from Unlabeled Video.
Proceedings of the Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016, 2016

Learning Aligned Cross-Modal Representations from Weakly Aligned Data.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

How Transferable Are CNN-Based Features for Age and Gender Classification?
Proceedings of the 2016 International Conference of the Biometrics Special Interest Group, 2016

2015
Part level transfer regularization for enhancing exemplar SVMs.
Comput. Vis. Image Underst., 2015

2014
Transfer learning for object category detection.
PhD thesis, 2014

Multi-Task Multi-Sample Learning.
Proceedings of the Computer Vision - ECCV 2014 Workshops, 2014

Immediate, Scalable Object Category Detection.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2012
Enhancing Exemplar SVMs using Part Level Transfer Regularization.
Proceedings of the British Machine Vision Conference, 2012

2011
Tabula rasa: Model transfer for object category detection.
Proceedings of the IEEE International Conference on Computer Vision, 2011

2008
Utilizing semantic word similarity measures for video retrieval.
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008

2007
University of Central Florida at TRECVID 2007 Semantic Video Classification and Automatic Search.
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007

Improving Semantic Concept Detection and Retrieval using Contextual Estimates.
Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, 2007


  Loading...