Comparison of optimization and sampling from a distribution defined by an energy function. I use a continuous version of the Ising model spin lattice energy.

First, optimization from a random initial state using gradient descent with momentum, using the SGD optimizer in PyTorch.

Comments