A few simple logical steps yield our main theoretical result 👇
This magical equation connects two seemingly distinct concepts:
1️⃣ P_obs: the subjective probability your brain assigns to its observations
2️⃣ KL divergence between the true world and your internal beliefs
🧵[4/n]
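The equation itself isn't reproduced in this post, so here is the standard identity I believe it reduces to (my notation and my reading, not a quote from the paper):

$$
\mathbb{E}_{x \sim P_{\text{world}}}\left[-\log P_{\text{obs}}(x)\right] = H(P_{\text{world}}) + D_{\mathrm{KL}}\left(P_{\text{world}} \,\|\, P_{\text{obs}}\right)
$$

The world's entropy $H(P_{\text{world}})$ is out of your control, so making your observations less surprising on average is exactly the same as shrinking $D_{\mathrm{KL}}(P_{\text{world}} \,\|\, P_{\text{obs}})$.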
you adapt to the world
⬇️
you accurately predict the world
⬇️
fewer nasty surprises that could get you killed
⬇️
KL(world || your beliefs) drops
🧵[5/n]
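A toy numerical check of that chain (my own sketch, not code from the paper; `p` stands in for the world, `q` for your beliefs):

```python
import numpy as np

p = np.array([0.7, 0.2, 0.1])  # "the world": true event probabilities

def kl(p, q):
    """KL(p || q) for discrete distributions, in nats."""
    return float(np.sum(p * np.log(p / q)))

def avg_surprisal(p, q):
    """Expected surprise -log q(x), averaged over the true distribution p."""
    return float(-np.sum(p * np.log(q)))

beliefs = [np.array([1/3, 1/3, 1/3]),  # naive
           np.array([0.5, 0.3, 0.2]),  # partially adapted
           np.array([0.7, 0.2, 0.1])]  # fully adapted (q = p)

for q in beliefs:
    print(f"KL = {kl(p, q):.3f} nats, avg surprisal = {avg_surprisal(p, q):.3f} nats")
# As q approaches p, KL drops to 0 and surprisal drops to the world's entropy.
```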
I didn't put it in by hand or make crazy assumptions.
✅ There is something truly fundamental, unique, and privileged about KL divergence.
🧵[6/n]
✅ In section 4, I apply these foundational principles to "derive" (almost) all of machine learning from KL minimization.
(first pointed out by @alexalemi.bsky.social: https://blog.alexalemi.com/kl-is-all-you-need.html)
🧵[7/n]
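To make the "(almost) all of ML" point concrete, a minimal sketch (mine, under the usual assumption that training minimizes average negative log-likelihood on samples from the world):

```python
import numpy as np

rng = np.random.default_rng(0)
p_world = np.array([0.7, 0.2, 0.1])
data = rng.choice(3, size=10_000, p=p_world)  # observations drawn from the world

# Maximum-likelihood fit of a categorical model = empirical frequencies.
q_model = np.bincount(data, minlength=3) / len(data)

nll = -np.mean(np.log(q_model[data]))             # the usual cross-entropy loss
entropy = -np.sum(p_world * np.log(p_world))      # fixed property of the world
kl = np.sum(p_world * np.log(p_world / q_model))  # KL(world || model)

# The two numbers agree up to sampling noise: minimizing the loss
# is minimizing KL(world || model), since the entropy term is constant.
print(f"NLL = {nll:.4f}, H(world) + KL(world || model) = {entropy + kl:.4f}")
```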
I then connect this to the philosophy of science, framing experiments as "truth-revealing actions."
(another relevant @alexalemi.bsky.social piece: https://blog.alexalemi.com/kl.html)
🧵[8/n]
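One way to make "truth-revealing actions" concrete (my toy example, not from the paper): flip a coin of unknown bias, update a Beta posterior, and measure the information that flowed in as KL(posterior || prior).

```python
from scipy.special import betaln, digamma

def kl_beta(a1, b1, a0, b0):
    """KL( Beta(a1, b1) || Beta(a0, b0) ), in nats."""
    return (betaln(a0, b0) - betaln(a1, b1)
            + (a1 - a0) * digamma(a1)
            + (b1 - b0) * digamma(b1)
            + (a0 - a1 + b0 - b1) * digamma(a1 + b1))

a, b = 1.0, 1.0                # uniform prior over the coin's bias
for flip in [1, 1, 0, 1, 1]:   # outcomes of the "experiment"
    a_new, b_new = a + flip, b + (1 - flip)
    print(f"flip={flip}: gained {kl_beta(a_new, b_new, a, b):.4f} nats")
    a, b = a_new, b_new
```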
✅ KL divergence must be asymmetric, because it captures the flow of information from the world to the brain (when you read a book or perform an experiment).
🧵[9/n]
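Quick check that the asymmetry is real (mine, not from the paper):

```python
import numpy as np

world = np.array([0.8, 0.1, 0.1])
beliefs = np.array([0.4, 0.3, 0.3])

def kl(p, q):
    """KL(p || q) for discrete distributions, in nats."""
    return float(np.sum(p * np.log(p / q)))

print(kl(world, beliefs))  # ≈ 0.335 nats: cost of holding these beliefs in this world
print(kl(beliefs, world))  # ≈ 0.382 nats: a different number; KL is not symmetric
```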
✅ Brains: adapt to survive, survive to adapt
✅ adaptation = KL minimization
✅ most of ML = KL minimization
✅ KL asymmetry captures the flow of information
✅ KL minimization = flow of information from the world into your brain, updating your beliefs
🧵[10/n]