Comparison of optimization and sampling from a distribution defined by an energy function. I use a continuous version of the Ising model spin lattice energy. First, optimization from a random initial state using gradient descent with momentum, using the SGD optimizer in PyTorch. - ThreadSky

About ThreadSky

paulfharrison.bsky.social • 5 days ago

Comparison of optimization and sampling from a distribution defined by an energy function. I use a continuous version of the Ising model spin lattice energy.

First, optimization from a random initial state using gradient descent with momentum, using the SGD optimizer in PyTorch.

Comments

paulfharrison.bsky.social•5 days ago

Second, sampling from the distribution with a Langevin Dynamics simulation. The algorithm is almost identical to gradient descent with momentum, but we add just the right amount of noise to the momentum at each step.

paulfharrison.bsky.social•5 days ago

The optimizer tries to find the lowest energy. This closely resembles "maximum likelihood" or "maximum a posteriori" estimation in statistics. We might hope this finds the most representative estimate. Clearly, here it does not! The estimate is smoother than most samples from the distribution.

paulfharrison.bsky.social•5 days ago

Also the optimizer has not found the lowest energy state, which would be all-dark or all-light. It might take a very long time to reach one of these optima!

paulfharrison.bsky.social•5 days ago

... BlueSky seems to have thrown out most of the detail in my second video, so here's the original:

https://logarithmic.net/pfh-files/random/sampler.mov

Posting Rules

Be respectful to others
No spam or self-promotion
Stay on topic
Follow Bluesky's terms of service

Comments

Posting Rules

Reply