IRI - Tracking and approaching persons using Deep Learning techniques

Master Thesis

Tracking and approaching persons using Deep Learning techniques

Student/s

Javier Laplaza Galindo

Supervisor/s

Alberto Sanfeliu Cortés

Information

Started: 08/11/2017
Finished: 17/09/2018

Description

This project proposes a solution in order to enable a social robot to approach and follow one speciﬁc person using a vision based system. The idea is to introduce this features to the robot so that it will be able, int he futur, to autonomously interact with people. To do so, the algorithms developed in the project use Convolutional Neural Networks to identify where is the person given an image provided by the robot camera. In order to accomplish these tasks, two algorithms are created: an object detector using YOLO algorithm and an object tracker using a Siamese network. In order to fully understand how these algorithms work, the methods and architectures on which they are based are explained. Also, one of the algorithms is attempted to train with Google Colaboratory, even though the training results aren’t used in the ﬁnal implementation. Diﬀerent recordings are ﬁlmed teleoperating the robot simulating real approaching and following operations. Each recording is then labelled frame by frame to use this data in the training. Since the results of the training aren’t used in the ﬁnal implementation, the recordings are used to test how well the ﬁnal implementation is able to track the target in a frame. Therefore, diﬀerent metrics are evaluated in the recordings, separating the ”approaching operation” videos from the ”following operation” videos, thus obtaining separated results for the two operations. Also, since the recordings where ﬁlmed considering diﬀerent light conditions, it is possible to analyze how light variations aﬀect the results when performing both approaching and following operations. These algorithms are run and tested in a Jetson TX2, using the GPU of the embedded device to enhance the performance of the algorithms. The implementation is build using PyTorch.

https://upcommons.upc.edu/handle/2117/127124