Skip to content

Commit 2de0b5e

Browse files
authored
Update 2020-08-08-pytorch-1.6-now-includes-stochastic-weight-averaging.md
1 parent 38a9c1b commit 2de0b5e

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

_posts/2020-08-08-pytorch-1.6-now-includes-stochastic-weight-averaging.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ title: 'PyTorch 1.6 now includes Stochastic Weight Averaging'
44
author: Pavel Izmailov and Andrew Gordon Wilson
55
---
66

7-
Do you use stochastic gradient descent (SGD) or Adam? Regardless of the procedure you use to train your neural network, you can likely achieve significantly better generalization at virtually no additional cost with a simple new technique now natively supported in PyTorch 1.6, Stochastic Weight Averaging (SWA) [1]. Even if you have already trained your model, it’s easy to realize the benefits of SWA by running SWA for a small number of epochs starting with a pre-trained model. [More](https://twitter.com/MilesCranmer/status/1282140440892932096) and [more](https://twitter.com/leopd/status/1285969855062192129), researchers are discovering that SWA improves the performance of well-tuned models in a wide array of practical applications with little cost or effort!
7+
Do you use stochastic gradient descent (SGD) or Adam? Regardless of the procedure you use to train your neural network, you can likely achieve significantly better generalization at virtually no additional cost with a simple new technique now natively supported in PyTorch 1.6, Stochastic Weight Averaging (SWA) [1]. Even if you have already trained your model, it’s easy to realize the benefits of SWA by running SWA for a small number of epochs starting with a pre-trained model. [Again](https://twitter.com/MilesCranmer/status/1282140440892932096) and [again](https://twitter.com/leopd/status/1285969855062192129), researchers are discovering that SWA improves the performance of well-tuned models in a wide array of practical applications with little cost or effort!
88

99

1010
SWA has a wide range of applications and features:

0 commit comments

Comments
 (0)