Input preprocessing for VGG #5

borisdayma · 2021-09-24T18:51:56Z

Hi,

In the README, it is mentioned that input should be between 0 and 1.

In the training code, they seem to be between -1 and 1.

In the torchvision doc, they seem to be loaded between 0 and 1 and then normalized with

normalize = transforms.Normalize(mean=[0.485, 0.456, 0.406],
                                 std=[0.229, 0.224, 0.225])

Should they be preprocessed as per the torchvision docs?

The text was updated successfully, but these errors were encountered:

borisdayma · 2021-09-24T18:55:20Z

Interestingly, in lpips module, they are just normalized to [-1, 1].

matthias-wright · 2021-09-24T19:15:27Z

Hi @borisdayma!
When you are using the pretrained weights (pretrained='imagenet'), the input should be between 0 and 1.
The torchvision normalization is applied to the input in the __call__ method so you don't have to do that, see here.
I am considering to add an additional argument normalization so that people can use the imagenet weights without using the torchvision normalization, what do you think about that?

For the training code I used the range [-1, 1] because that range has worked better for me in the past.

Yeah, for lpips the input is [-1, 1] and then the input is normalized using mean=[-.030,-.088,-.188] and std=[.458,.448,.450].

borisdayma · 2021-09-24T19:38:57Z

Thanks, this is much clearer.
I think it's a good idea to add the argument normalize as a parameter.

matthias-wright · 2021-09-24T20:41:38Z

Thanks for the feedback! I added the argument:

import flaxmodels as fm

vgg16 = fm.VGG16(output='logits', pretrained='imagenet', normalize=False)

This way the imagenet weights are used but the images are not normalized internally.

matthias-wright added a commit that referenced this issue Sep 24, 2021

#5 add option to disable normalization for vgg & resnet

14a8f30

matthias-wright closed this as completed Sep 24, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Input preprocessing for VGG #5

Input preprocessing for VGG #5

borisdayma commented Sep 24, 2021

borisdayma commented Sep 24, 2021

matthias-wright commented Sep 24, 2021

borisdayma commented Sep 24, 2021

matthias-wright commented Sep 24, 2021

Input preprocessing for VGG #5

Input preprocessing for VGG #5

Comments

borisdayma commented Sep 24, 2021

borisdayma commented Sep 24, 2021

matthias-wright commented Sep 24, 2021

borisdayma commented Sep 24, 2021

matthias-wright commented Sep 24, 2021