Query regarding Fine-Tuning/Feature Extraction

Chokerino · November 29, 2020, 4:13pm

Hello,

I tried to use the DGL-LifeSci library for deep learning on protein structures but the performance was poor. I was advised to train the model on a bigger dataset and keep the convolution filters of the model to fine-tune the model on my classification task. Are there any examples I can follow to do this?

mufeili · November 30, 2020, 3:43am

Basically, you need to train a model from scratch on a large dataset and then save the learned weights with

torch.save(model.state_dict(), PATH)

Then you can use the saved weights to initialize the model to train on the smaller dataset.

# Initialize a model instance
model = ...
# Load trained weights
model.load_state_dict(torch.load(PATH))

Chokerino · December 4, 2020, 7:51am

Wouldn’t i need to remove some layers because the larger dataset will have foreign samples which when trained would result in weights being changed in ways which i do not want?

mufeili · December 5, 2020, 6:09pm

What do you mean by foreign samples?

Chokerino · December 5, 2020, 6:29pm

I mean these graphs would not belong to either of the classes I want to train my model on.

mufeili · December 5, 2020, 7:00pm

In that case, you can remove the last several layers and re-initialize them.

Chokerino · December 5, 2020, 10:57pm

Is there any guide on how I can do so?

mufeili · December 6, 2020, 7:31am

You may find this PyTorch discussion thread helpful: https://discuss.pytorch.org/t/how-to-load-part-of-pre-trained-model/1113/3