Neat visualization that came up in the ARBOR project: this shows DeepSeek "thinking" about a question, and color is the probability that, if it exited thinking, it would give the right answer. (Here yellow means correct.)
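A minimal sketch (not the ARBOR code) of how one could estimate that quantity: at each prefix of the chain of thought, force the model to stop reasoning and answer, then read off the probability it puts on the correct multiple-choice option. The model name, the `</think>` marker, and the answer-forcing prompt below are assumptions for illustration only.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed model; any chat model that emits <think>...</think> reasoning would do.
model_name = "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B"
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)
model.eval()

def p_correct_if_exit(question: str, thinking_so_far: str, correct_letter: str) -> float:
    """Probability of the correct option if the model exited thinking right here."""
    # Truncate the reasoning, close the thinking block, and force a final answer.
    prompt = (
        f"{question}\n<think>\n{thinking_so_far}\n</think>\n"
        "The answer is ("
    )
    inputs = tok(prompt, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits[0, -1]  # next-token distribution
    probs = torch.softmax(logits.float(), dim=-1)
    # Compare probability mass on the four option letters, renormalized.
    option_ids = [tok.encode(c, add_special_tokens=False)[0] for c in "ABCD"]
    option_probs = probs[option_ids]
    option_probs = option_probs / option_probs.sum()
    return option_probs["ABCD".index(correct_letter)].item()

# Coloring each token of the chain of thought by this value gives a plot like the
# one linked below (yellow = high probability of the correct answer).
```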
Comments
Why is it so "verbose" in its thinking? So many superfluous words in there. Just wondering how it works. Wouldn't it be faster if it took a technical approach, broke the question into chunks, generated a list of values/data, and then drew a conclusion from that?
It's based on a data set of multiple-choice questions that have a known right answer, so this visualization only works when you have labeled ground truth. Definitely wouldn't shock me if those answers were labeled by grad students, though!
We also see cases where it starts out with the right answer, but eventually "convinces itself" of the wrong answer! I would love to understand the dynamics better.
https://github.com/ARBORproject/arborproject.github.io/discussions/11#discussioncomment-12309423 (vis by @yidachen.bsky.social in conversation with @diatkinson.bsky.social )