I have experimented with different training strategies using BERT as the base architecture, which I fine-tuned for text classification (in this case, fake news classification). I chose BERT as the ...