I am impressed that an LLM is able to do the parroting in the first question at all: it had to recognize what to parrot and then pull off all the steps correctly.
(The chain of thought does suggest it called out to Wolfram Alpha for a calculation en route. But that's the right thing to do!)
Comments
It also concretely demolishes the naysaying that LLMs merely “parrot” their answers. It’s borderline impossible that the training data contained that exact question; the model had to perform actual reasoning to get there.