Robinson Piramuthu

Orcid: 0000-0002-1767-8382

According to our database1, Robinson Piramuthu authored at least 51 papers between 1998 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
"Don't Forget to Put the Milk Back!" Dataset for Enabling Embodied Agents to Detect Anomalous Situations.
IEEE Robotics Autom. Lett., October, 2024

T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design.
CoRR, 2024

FlexEControl: Flexible and Efficient Multimodal Control for Text-to-Image Generation.
CoRR, 2024

S-EQA: Tackling Situational Queries in Embodied Question Answering.
CoRR, 2024

ε-ViLM : Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, 2024

Zero-Shot Controllable Image-to-Video Animation via Motion Decomposition.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

Decision Making for Human-in-the-loop Robotic Agents via Uncertainty-Aware Reinforcement Learning.
Proceedings of the IEEE International Conference on Robotics and Automation, 2024

2023
E-ViLM: Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer.
CoRR, 2023

Characterizing Video Question Answering with Sparsified Inputs.
CoRR, 2023

RREx-BoT: Remote Referring Expressions with a Bag of Tricks.
IROS, 2023

A Simple Approach for Visual Room Rearrangement: 3D Mapping and Semantic Search.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

2022
CLIP-Nav: Using CLIP for Zero-Shot Vision-and-Language Navigation.
CoRR, 2022

A Simple Approach for Visual Rearrangement: 3D Mapping and Semantic Search.
CoRR, 2022

SSL Enables Learning from Sparse Rewards in Image-Goal Navigation.
Proceedings of the International Conference on Machine Learning, 2022

Video in 10 Bits: Few-Bit VideoQA for Efficiency and Privacy.
Proceedings of the Computer Vision - ECCV 2022 Workshops, 2022

VISITRON: Visual Semantics-Aligned Interactively Trained Object-Navigator.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2022, 2022

TEACh: Task-Driven Embodied Agents That Chat.
Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021
Self-attentive 3D human pose and shape estimation from videos.
Comput. Vis. Image Underst., 2021

2020
Mobile Head Tracking for eCommerce and Beyond.
Proceedings of the Mobile Devices and Multimedia: Enabling Technologies, 2020

Weakly-Supervised Semantic Segmentation via Sub-Category Exploration.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Mixup-CAM: Weakly-supervised Semantic Segmentation via Uncertainty Regularization.
Proceedings of the 31st British Machine Vision Conference 2020, 2020

2019
Give Me a Hint! Navigating Image Databases Using Human-in-the-Loop Feedback.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Understanding Image Quality and Trust in Peer-to-Peer Marketplaces.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2019

Adversarial Learning for Fine-Grained Image Search.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2019

2018
Towards the Success Rate of One: Real-Time Unconstrained Salient Object Detection.
Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision, 2018

ModaNet: A Large-scale Street Fashion Dataset with Polygon Annotations.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Conditional Image-Text Embedding Networks.
Proceedings of the Computer Vision - ECCV 2018, 2018

Brand > Logo: Visual Analysis of Fashion Brands.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

2017
Visual Search at eBay.
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, August 13, 2017

2016
Fashion apparel detection: The role of deep convolutional neural network and pose-dependent priors.
Proceedings of the 2016 IEEE Winter Conference on Applications of Computer Vision, 2016

GraB: Visual Saliency via Novel Graph Model and Background Priors.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Im2Fit: Fast 3D Model Fitting and Anthropometrics using Single Consumer Depth Camera and Synthetic Data.
Proceedings of the 3D Image Processing, 2016

2015
Efficient Media Retrieval from Non-Cooperative Queries.
Proceedings of the Computer Vision Systems - 10th International Conference, 2015

Mine the fine: Fine-grained fragment discovery.
Proceedings of the 2015 IEEE International Conference on Image Processing, 2015

HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

ConceptLearner: Discovering visual concepts from weakly labeled image collections.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2015

2014
HD-CNN: Hierarchical Deep Convolutional Neural Network for Image Classification.
CoRR, 2014

Geometric VLAD for Large Scale Image Search.
CoRR, 2014

When relevance is not Enough: Promoting Visual Attractiveness for Fashion E-commerce.
CoRR, 2014

Enhancing Visual Fashion Recommendations with Users in the Loop.
CoRR, 2014

Is a picture really worth a thousand words?: - on the role of images in e-commerce.
Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, 2014

Furniture-geek: Understanding fine-grained furniture attributes from freely associated text and tags.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Im2depth: Scalable exemplar based depth transfer.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

Large scale visual recommendations from street fashion images.
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2014

Cascaded sparse color-localized matching for logo retrieval.
Proceedings of the 2014 IEEE International Conference on Image Processing, 2014

Region-Based Discriminative Feature Pooling for Scene Text Recognition.
Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Palette power: enabling visual search through colors.
Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013

Style Finder: Fine-Grained Clothing Style Detection and Retrieval.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013

1999
Minimax Emission Computed Tomography using High-Resolution Anatomical Side Information and B-Spline Models.
IEEE Trans. Inf. Theory, 1999

1998
Side Information Averaging Method for PML Emission Tomography.
Proceedings of the 1998 IEEE International Conference on Image Processing, 1998

Penalized maximum likelihood image reconstruction with min-max incorporation of noisy side information.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998


  Loading...