What does model.train() do in PyTorch?

Aerin picture Aerin · Jul 20, 2018 · Viewed 52k times · Source

Does it call forward() in nn.Module? I thought when we call the model, forward method is being used. Why do we need to specify train()?

Answer

Umang Gupta picture Umang Gupta · Jul 20, 2018

model.train() tells your model that you are training the model. So effectively layers like dropout, batchnorm etc. which behave different on the train and test procedures know what is going on and hence can behave accordingly.

More details: It sets the mode to train (see source code). You can call either model.eval() or model.train(mode=False) to tell that you are testing. It is somewhat intuitive to expect train function to train model but it does not do that. It just sets the mode.