Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models.
CoRR, May, 2025
HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models.
,
,
,
,
,
,
,
,
,
,
,
,
CoRR, March, 2025
OmniPose6D: Towards Short-Term Object Pose Tracking in Dynamic Scenes from Monocular RGB.
CoRR, 2024
Unlocking Exocentric Video-Language Data for Egocentric Video Representation Learning.
CoRR, 2024
HyperMix: Out-of-Distribution Detection and Classification in Few-Shot Settings.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024
Propose, Assess, Search: Harnessing LLMs for Goal-Oriented Planning in Instructional Videos.
Proceedings of the Computer Vision - ECCV 2024, 2024
EgoSG: Learning 3D Scene Graphs from Egocentric RGB-D Sequences.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024
Relational Space-Time Query in Long-Form Videos.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023
GKNet: Grasp keypoint network for grasp candidates detection.
Int. J. Robotics Res., 2022
Primitive Shape Recognition for Object Grasping.
CoRR, 2022
An Affordance Keypoint Detection Network for Robot Manipulation.
IEEE Robotics Autom. Lett., 2021
Improving vision-based robotic manipulation with affordance understanding.
PhD thesis, 2020
Using Synthetic Data and Deep Networks to Recognize Primitive Shapes for Object Grasping.
Proceedings of the 2020 IEEE International Conference on Robotics and Automation, 2020
Learning Affordance Segmentation for Real-World Robotic Manipulation via Synthetic Images.
IEEE Robotics Autom. Lett., 2019
Toward Affordance Detection and Ranking on Novel Objects for Real-World Robotic Manipulation.
IEEE Robotics Autom. Lett., 2019
Detecting Robotic Affordances on Novel Objects with Regional Attention and Attributes.
CoRR, 2019
Real-World Multiobject, Multigrasp Detection.
IEEE Robotics Autom. Lett., 2018
Deep Grasp: Detection and Localization of Grasps with Deep Neural Networks.
CoRR, 2018
The Helping Hand: An Assistive Manipulation Framework Using Augmented Reality and a Tongue-Drive.
CoRR, 2018
Learning to Navigate: Exploiting Deep Networks to Inform Sample-Based Planning During Vision-Based Navigation.
CoRR, 2018
Hands-Free Assistive Manipulator Using Augmented Reality and Tongue Drive System.
Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018
The Helping Hand: An Assistive Manipulation Framework Using Augmented Reality and Tongue-Drive Interfaces.
Proceedings of the 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2018
When crowdsourcing meets mobile sensing: a social network perspective.
IEEE Commun. Mag., 2015
Supervised Collective Classification for Crowdsourcing.
Proceedings of the 2015 IEEE Globecom Workshops, San Diego, CA, USA, December 6-10, 2015, 2015