PhD Thesis

Geometric Computer Vision meets Deep Learning for Autonomous Driving Applications

Work default illustration


  • Started: 01/09/2016


Autonomous driving is currently at the forefront of many topics in robotics research and one of the industrial flagships in the automotive industry. Among the different sensors involved in the perception of the environment, color cameras play one of the most important roles, due to their affordability and potential to provide rich interpretations of the scene.

In this thesis we will investigate future trends in the use of cameras in autonomous vehicles. We will consider several RGB cameras deployed all around the structure of the car, primarily on the back and on the wind mirrors. Our goal will be to integrate and process all data of the cameras and provide real-time driver’s assistance and information about the surrounding area, including wide-angle view of the rear or the car, and alerts and identifications of the potential dangers.

In order to make this possible we will combine the best of computer vision and machine learning research. In particular we plan to integrate well established practices in geometric computer vision (multiple-view geometry, structure from motion and optical flow), with the powerful deep neural networks. We foresee that the combination of these tools will provide both accurate geometric reconstructions and rich semantic interpretations of the environment.

The work is under the scope of the following projects:

  • RobInstruct: Instructing robots using natural communication skills (web)