I literally don't know what the hell you're on about at this point. You already know these programs are effectively black-box algorithms that nobody has access to. We don't know exactly what they're trained on, or how predictions are weighted.
The alternative is "misinfo outputs might just be the result of randomly selecting the best, but wrong, tokens"
When that happens, rerunning a completion usually produces a different output. In that way, you can determine whether misinfo comes from the attention mechanism or from bad source data.
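Roughly what I mean, as a sketch. The generate argument is a stand-in for whatever completion call you actually have; nothing here is specific to any one model, and the thresholds for "same answer" vs. "scattered answers" are up to you:

from collections import Counter

def rerun_check(generate, prompt, n=10, temperature=0.8):
    # generate(prompt, temperature=...) -> str is any completion call you have access to.
    answers = [generate(prompt, temperature=temperature) for _ in range(n)]
    # If the same wrong answer comes back on every run, suspect the source data;
    # if the answers scatter across runs, suspect sampling noise in decoding.
    return Counter(answers).most_common()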
What we do know is that these programs already give erroneous answers with the datasets they're currently using. If they're generating synthetic data from that dataset, which they claim is the entirety of the Internet, the obvious conclusion is that the synthetic data is erroneous as well.
Ergo, if you found a false claim in the outputs, and traced it back to training data marked as synthetic, you'd be able to prove this relationship.
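As a sketch of what that traceback could look like. It assumes each training record carries a synthetic flag, which is exactly the part nobody outside the labs can actually check:

def trace_claim(claim, corpus):
    # corpus: iterable of dicts like {"text": "...", "synthetic": True/False}.
    hits = [rec for rec in corpus if claim.lower() in rec["text"].lower()]
    synthetic_hits = [rec for rec in hits if rec.get("synthetic")]
    # If the false claim only shows up in synthetic-flagged records,
    # that's the relationship you'd be demonstrating.
    return len(hits), len(synthetic_hits)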
The process you're describing is a manual, human process that cannot be automated by these programs. These programs cannot tell truth from fiction.
https://arxiv.org/pdf/2205.11482
Disclaimer: I haven't read this, just showing that this is a studiable problem. We don't have to guess.
Authority laundering via LLM is def a problem and we will need to develop social conventions to combat it.