IRI - GanHand: Predicting human grasp affordances in multi-object scenes

Publication

GanHand: Predicting human grasp affordances in multi-object scenes

Conference Article

Conference

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Edition

2020

Pages

5030-5040

Doc link

http://dx.doi.org/10.1109/CVPR42600.2020.00508

File

Download the digital copy of the doc pdf document

Authors

Projects associated

Abstract

The rise of deep learning has brought remarkable progress in estimating hand geometry from images where the hands are part of the scene. This paper focuses on a new problem not explored so far, consisting in predicting how a human would grasp one or several objects, given a single RGB image of these objects. This is a problem with enormous potential in eg augmented reality, robotics or prosthetic design. In order to predict feasible grasps, we need to understand the semantic content of the image, its geometric structure and all potential interactions with a hand physical model. To this end, we introduce a generative model that jointly reasons in all these levels and 1) regresses the 3D shape and pose of the objects in the scene; 2) estimates the grasp types; and 3) refines the 51-DoF of a 3D hand model that minimize a graspability loss. To train this model we build the YCB-Affordance dataset, that contains more than 133k images of 21 objects in the YCB-Video dataset. We have annotated these images with more than 28M plausible 3D human grasps according to a 33-class taxonomy. A thorough evaluation in synthetic and real images shows that our model can robustly predict realistic grasps, even in cluttered scenes with multiple objects in close contact.

Scientific reference

E. Corona, A. Pumarola, G. Alenyà, F. Moreno-Noguer and G. Rogez. GanHand: Predicting human grasp affordances in multi-object scenes, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, Seattle, WA, USA (Virtual), pp. 5030-5040.

Publication

GanHand: Predicting human grasp affordances in multi-object scenes

Conference Article

Conference

Edition

Pages

Doc link

File

Authors

Corona Puyané, Enric

Pumarola Peris, Albert

Alenyà Ribas, Guillem

Moreno Noguer, Francesc

Rogez, Grègory

Projects associated

MdM: Unit of Excellence María de Maeztu

HuMoUR: Markerless 3D human motion understanding for adaptive robot behavior

IPALM: Interactive Perception-Action-Learning for Modelling Objects

Abstract

Categories

Scientific reference