Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training on CIFAR10 #8

Open
akhauriyash opened this issue Mar 10, 2021 · 1 comment
Open

Training on CIFAR10 #8

akhauriyash opened this issue Mar 10, 2021 · 1 comment

Comments

@akhauriyash
Copy link

Hello,

Thank you for this excellent repository!

Do you have any suggestions of changes to make to train BYOL on the CIFAR10 dataset?

The way I am doing this (in main.py) (I am also training my own custom models, but I do not think that is too relevant)

DATASET='CIFAR10' # Can change to STL10

if DATASET=='STL10':
    train_dataset = datasets.STL10('/workspace/STLDataset', split='train+unlabeled', download=True,
                                    transform=MultiViewDataInjector([data_transform, data_transform]))
elif DATASET=='CIFAR10':
    train_dataset = datasets.CIFAR10('/workspace/CIFAR10Dataset', train=True, download=True,
                                    transform=MultiViewDataInjector([data_transform, data_transform]))
else:
    print("Error, dataset not supported, choose CIFAR10 or STL10")
    exit(0)

I also change the config to have: input_shape: (32,32,3).

Further, I may not have taken a very deep look into this code-base, but how do we produce the 'STL10 Top 1' accuracies(75.2%) after training the model on the self-supervised task? Do we take the trained model and fine-tune on the STL10 supervised dataset? I assume that code is not included in this library?

Thank you!

@akhauriyash akhauriyash changed the title Tips on training on CIFAR10 Training on CIFAR10 Mar 10, 2021
@khangt1k25
Copy link

Hi Akhauriyash, you can just modify the input shape and name of the dataset.
I am testing with the model but it doesn't work well with CIFAR10, ~ 54% top1 accuracy and I wonder the config is the same or different on learning rate?
Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants