PhD Thesis

Fashion Discovery: A Computer Vision Approach

Work default illustration



  • Started: 01/10/2015


Fashion has a large impact in society and is a multi-billion dollar industry. Online sales in the year 2017 are expected to reach 191 billion Euros in Europe and 370 billion dollars in the USA. Developing tools to improve the interaction between sellers and buyers will be of paramount importance.
In this project we will exploit current advances in computer vision and machine learning to explore large amounts of publicly available fashion data, for automatically reasoning and discovering patterns and trends. This data, besides images and videos, may include varying types of meta-information such as categorical attributes or textual descriptions. Processing this diverse “big data” will not only require advanced machine learning algorithms, but also developing very efficient approaches.
The project will consists of two phases:
Initially, the student will participate in the creation of a large fashion dataset and develop efficient algorithms for the extraction of low and mid level features, such as 2D human pose estimation, foreground/background segmentation, and garment segmentation. In a second stage, she/he will work on improving its implementation to be able to scale and distribute well to large datasets, as well as integrating it with the Wide Eyes infrastructure.
The project will be carried in cooperation with the following three institutions:
- Institut de Robòtica i Informàtica Industrial in the Universitat Politècnica de Catalunya.
- Wide Eyes Technologies, in Barcelona.
- Waseda University (Tokyo, Japan), institution to which one of the academic co-advisors belongs.

The work is under the scope of the following projects:

  • ViSen: Visual Sense, Tagging visual data with semantic descriptions (web)
  • RobInstruct: Instructing robots using natural communication skills (web)