Quick start imagenet in pytorch view markdown
ImageNet has become a staple dataset in computer vision, but is still pretty difficult to download/install. These are some simple instructions to get up and running in pytorch.
step 1: download/preprocessing
- begin by following the instructions for downloading the ImageNet dataset here
- the dataset contains ~1.2 million training images and 50,000 validation images
- note that the dataset is quite large: the .tar files are 138G for train and 6.3G for val
- once extracted the train data is 177G and the val data is 8.3G
- the folder hierarchy looks liks this (val looks similar, there are 1000 folders, one for each class):
train/ n04550184/ n04550184_9946.JPEG n04550184_9945.JPEG ... n04550180/ ... n04550180/
step 2: get the names for each class
- to get the names for each of the classes, look at the class_names_imagenet.py file, which contains a dictionary containing the class labels
step 3: set up a dataloader
- in pytorch, the dataloader should be set up using an ImageFolder
- there is an example here