Get Features from a Fine-Tuned BERT Model

Hi,

This is my first time using DGL. I've noticed that many people use BERT embeddings as the node features for a GNN. My question is: can I first fine-tune BERT on the same dataset, and then use the fine-tuned BERT to produce the node features for the GNN? I haven't seen people do this before, so I'm a little confused.

For example, for a classification task, I would first train a model made of BERT plus a feed-forward network (FNN). I would then take the fine-tuned BERT checkpoint with the best dev-set performance and use it to generate the features for my GCN model, which I then train on the same task. Does this make sense, or is it some kind of cheating?

I think it makes sense as long as you do not train BERT on the validation and test sets, i.e. the fine-tuning loss only ever sees training labels. You might find this paper relevant.
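In case it helps, here is a rough sketch of that two-stage setup in plain PyTorch. It is only an illustration under several assumptions: a tiny `nn.Sequential` encoder stands in for BERT, the GCN is a hand-written dense-adjacency layer rather than DGL's own modules, and the data, the ring graph, the split indices, and all names (`encoder`, `head`, `TinyGCN`, `node_feats`) are made up for the example. The point is just which labels each stage is allowed to touch.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

# Hypothetical toy data: 12 nodes, 3 classes, random "text" inputs.
num_nodes, in_dim, feat_dim, num_classes = 12, 16, 8, 3
raw_inputs = torch.randn(num_nodes, in_dim)   # stand-in for tokenized text
labels = torch.randint(0, num_classes, (num_nodes,))
train_idx = torch.arange(0, 6)
dev_idx = torch.arange(6, 9)
test_idx = torch.arange(9, 12)

# Stage 1: fine-tune encoder + classifier head on TRAINING nodes only.
encoder = nn.Sequential(nn.Linear(in_dim, feat_dim), nn.Tanh())  # stand-in for BERT
head = nn.Linear(feat_dim, num_classes)                          # the "FNN" part
opt = torch.optim.Adam(list(encoder.parameters()) + list(head.parameters()), lr=1e-2)

best_dev_acc, best_state = -1.0, None
for epoch in range(20):
    encoder.train()
    opt.zero_grad()
    loss = F.cross_entropy(head(encoder(raw_inputs[train_idx])), labels[train_idx])
    loss.backward()
    opt.step()

    # Dev labels are used only to pick a checkpoint, never in the loss.
    encoder.eval()
    with torch.no_grad():
        dev_pred = head(encoder(raw_inputs[dev_idx])).argmax(dim=1)
        dev_acc = (dev_pred == labels[dev_idx]).float().mean().item()
    if dev_acc > best_dev_acc:
        best_dev_acc = dev_acc
        best_state = {k: v.clone() for k, v in encoder.state_dict().items()}

# Stage 2: freeze the best encoder and compute features for ALL nodes.
encoder.load_state_dict(best_state)
encoder.eval()
with torch.no_grad():
    node_feats = encoder(raw_inputs)          # fixed GNN input features

# Toy ring graph with self-loops, symmetrically normalized: D^-1/2 A D^-1/2.
adj = torch.eye(num_nodes)
src = torch.arange(num_nodes)
dst = (src + 1) % num_nodes
adj[src, dst] = 1.0
adj[dst, src] = 1.0
deg_inv_sqrt = adj.sum(dim=1).pow(-0.5)
norm_adj = deg_inv_sqrt[:, None] * adj * deg_inv_sqrt[None, :]


class TinyGCN(nn.Module):
    """Two-layer GCN on a dense normalized adjacency matrix."""

    def __init__(self, in_dim, hid_dim, out_dim):
        super().__init__()
        self.lin1 = nn.Linear(in_dim, hid_dim)
        self.lin2 = nn.Linear(hid_dim, out_dim)

    def forward(self, a, x):
        h = F.relu(a @ self.lin1(x))
        return a @ self.lin2(h)


gcn = TinyGCN(feat_dim, 8, num_classes)
gcn_opt = torch.optim.Adam(gcn.parameters(), lr=1e-2)
for epoch in range(20):
    gcn.train()
    gcn_opt.zero_grad()
    logits = gcn(norm_adj, node_feats)
    # All nodes take part in message passing, but only training labels
    # enter the loss.
    loss = F.cross_entropy(logits[train_idx], labels[train_idx])
    loss.backward()
    gcn_opt.step()

# Test labels are touched exactly once, for the final evaluation.
gcn.eval()
with torch.no_grad():
    test_pred = gcn(norm_adj, node_feats)[test_idx].argmax(dim=1)
    test_acc = (test_pred == labels[test_idx]).float().mean().item()
```

With a real setup you would swap the stand-in encoder for your fine-tuned BERT and the dense layer for a DGL graph convolution; the split discipline stays the same.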

Thanks for your reply. It really helps a lot!
