Enriching vision representation by deep neural networks and self-supervised learning