File tree Expand file tree Collapse file tree 1 file changed +2
-2
lines changed Expand file tree Collapse file tree 1 file changed +2
-2
lines changed Original file line number Diff line number Diff line change @@ -573,7 +573,7 @@ chain to get the free energy of the negative phase. Note that the
573
573
``chain_end`` is a symbolical Theano variable expressed in terms of the model
574
574
parameters, and if we would apply ``T.grad`` naively, the function will
575
575
try to go through the Gibbs chain to get the gradients. This is not what we
576
- want (it will mess up our gradients) and therefire we need to indicate to
576
+ want (it will mess up our gradients) and therefore we need to indicate to
577
577
``T.grad`` that ``chain_end`` is a constant. We do this by using the argument
578
578
``consider_constant`` of ``T.grad``.
579
579
@@ -651,7 +651,7 @@ all bits are independent. Therefore,
651
651
652
652
Here :math:`x_{-i}` denotes the set of all bits of :math:`x` except bit
653
653
:math:`i`. The log-PL is therefore the sum of the log-probabilities of each
654
- bit :math:`x_i`, conditionned on the state of all other bits. For MNIST, this
654
+ bit :math:`x_i`, conditioned on the state of all other bits. For MNIST, this
655
655
would involve summing over the 784 input dimensions, which remains rather
656
656
expensive. For this reason, we use the following stochastic approximation to
657
657
log-PL:
You can’t perform that action at this time.
0 commit comments