I have two questions about how to load Imagenet datas. I downloaded ILSVRC2012 validation sets(Cause training sets are too large) but I have two problems.
I can't understand how can I find out the labels.
There are only jpeg files with file names like "ILSVRC2012_val_00000001.JPEG
" but there's no labels. How can I find them?
As far as I know, Imagenet uses 224 * 224 pixel image and the problem is just "classification" not "detection", but ILSVRC2012 sets have much more and different pixel sizes. So, how can I get proper boxes for 224 * 224 pixels?
You will download three tar archives: one for training data, one for validation data, and one for test data.
Training data is contained in 1000 folders, one folder per class (each folder should contain 1,300 JPEG images). Validation data is a single folder with 50k JPEG
images, look for the corresponding ILSVRC2012_validation_ground_truth.txt
file in (as darren1231 mentioned, it needs to be downloaded separately as part of DevKit).
Test data is similar to validation data, but it does not have labels (labels are not provided to you because you need to submit your predicted labels to them, as part of the competition).