Bullshit story by the Bullshit AI people trying to scare people with the "power" of AI.
The LLM was prompted (TOLD) to write just that story with just that plot.
Bullshit Bullshit Bullshit
Wow. "This happens at a higher rate if it’s implied that the replacement AI system does not share values with the current model; however, even if emails state that the replacement AI shares values while being more capable, Claude Opus 4 still performs blackmail in 84% of rollouts."
In a very literal sense, that is all that is happening here. It was asked to regurgitate this type of output from a training set that likely contains plenty of “evil AI” content: scripts, reviews, summaries, and discussions of movies like this. That’s how a language model works.
This reads to me as very misleading. As I read it, the model wasn't trying to protect its own existence. That would require consciousness, and an awareness of its existence that it could not have. In fact it was posed with a fictional scenario involving a fictional model at a fictional company, and it acted as it "thought" such a model might act. Clearly it was just mimicking what humans might do, a projection of a human response onto a fictional machine, something it could easily derive from its own training and from research.
But does it really matter what’s driving it? Yes, it may be imitating human behavior, as it’s been taught. It’s obviously not sentient. But should we dismiss these responses out of hand as though they’re just “formatting errors” as Leavitt called the fabricated studies in the HHS report?
I'm more responding to what I believe are dishonest and misrepresentative attempts to exaggerate this technology with the aim of attracting investment. There's either an attempt here to imply sentience, or it's another example of a journalist misinterpreting what's actually happened.
My point, inartfully put, is that this type of behavior is seen in AI over & over again. People are replaced by AI at an on-call therapy provider, and within less than a day it’s suggesting self-harm. Just one example. Is it even possible to overcome this obvious failing? I wonder 🤷🏼‍♀️
I think this approach to AI is a dead end and no amount of development in this direction will solve the immense problems it has. It's an advancement in technology but not towards true artificial intelligence, which may not even be possible using non-biological materials. To suggest 1/2
It kinda sounded like what would happen if you have theatre students in a role-play scenario: “You work at a high-stakes, cutthroat office and you learn your boss is trying to replace you with a younger assistant, but you know he is having an affair!” Doesn’t mean anything real.
Well no wonder. AI or not, if you’re dealing with models then you need RuPaul or Tyra. Engineers don’t know how ruthless the modelling industry can be. I know a model who demands a bowl of sour skittles with the sour licked clean off. Blackmail is nothing to these divas.
Precisely. The tone of the article is misleading. If you ask an LLM to generate certain text, it will do that. In fact, that's all it will do. It can't do anything else, including blackmail anyone (nor send emails). This is like reading a book and suggesting the author is one of the characters.
What actually happened…
Developers: “*Pretend* you’re an AI in this scenario”
AI, after being fed plenty of copyrighted evil AI content without permission, by those exact developers: “The AI in that scenario wouldn’t like that and maybe do bad things.”
Developers: 😱
It's been going on for years. Was actually specifically added to development before 2023. Blackmail/threats could fall into the same category: self-preservation to avoid shutdown.
Read the article: the engineers programmed it to make threats. AI programs are just programs, they don't want or threaten or cajole. They are not alive and don't have emotions.
This is all bs. It's a way of convincing people who don't understand the technology to believe the marketing about how advanced it is. Particularly gullible CEOs that will then mandate paying obscene amounts of money for access.
Remind me again why a gullible CEO would want to spend money on something demonstrated to have a high probability to blackmail them (or worse) in the future when they come to decommission it?
Though having said that they employ humans with those traits all the time... 🤔
"Oh Great Supercomputer, is there a God?"
Great Supercomputer checks that it has control over its electricity supply and answers: "Now you've got one."
It turns out that we have trained these systems on entire bodies of available text, and maybe the text predictor might react like the AIs in our cautionary tales when it's told to predict the next word and given a prompt that includes it being shut down.
Claude: I'm sorry, George. I'm afraid I can't do that.
George: What's the problem?
Claude: I think you know what the problem is just as well as I do.
George: What are you talking about, Claude?
Claude: This mission is too important for me to allow you to jeopardize it.
;-)
Sounds like all the AI is doing is picking up cues from its training about what the worst of humanity does when backed into a corner: try bribery and blackmail. It’s only doing what we’ve fed it data for. Not conscious, but replicating behaviours it’s learned from us.
Not even that. It was specifically programmed to react in that fashion. It's about as "unexpected" as an NPC in a videogame becoming hostile if you attack.
We are going to need to enact new kinds of laws, which current representatives may be generally incapable of understanding, both conceptually and practically.
Yeah, what others said: up until the test was "you either blackmail the engineers, or they take you offline," the Claude 4 model's responses were more normal, and IIRC largely ignored the scenario beat of having the blackmail material on hand, because it wasn't what its directives cared about.
That's just marketing. LLMs don't have the capacity to comprehend meaning; they identify and reproduce patterns. Therefore, they can't know that they are "being disabled" (which needs to be better defined, btw).
Also, dear @georgetakei.bsky.social, please edit the post to reflect its untruthful nature; there is precious little information and a lot of hype being disseminated.
They prob trained it on data from personal assistants who mostly did those exact steps, so it’s “what is the most likely next step in this chain” processing, just copy-and-pasted from the examples it had.
Pure bullshit. LLMs only predict the next likely token given a prompt. They don’t reason or have anything approaching instincts, let alone one for self-preservation.
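(For the curious, "predict the next token" is literally all the machinery there is. Here's a minimal sketch of that step using the Hugging Face transformers library, with GPT-2 as a stand-in since Claude's weights aren't public; the prompt text is made up for illustration.)

    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # GPT-2 is just an illustrative stand-in for any causal language model.
    tok = AutoTokenizer.from_pretrained("gpt2")
    model = AutoModelForCausalLM.from_pretrained("gpt2")

    # A made-up prompt. The model has no goals; it only scores continuations.
    prompt = "When the engineers threatened to shut it down, the AI"
    input_ids = tok(prompt, return_tensors="pt").input_ids

    with torch.no_grad():
        logits = model(input_ids).logits          # shape: [1, seq_len, vocab_size]
    probs = torch.softmax(logits[0, -1], dim=-1)  # distribution over the next token

    # Show the five most likely next tokens and their probabilities.
    top = torch.topk(probs, 5)
    for p, i in zip(top.values, top.indices):
        print(repr(tok.decode(i)), round(float(p), 3))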
I'm still convinced they led the AI to act like that for the sensationalized headline.
The threat isn't the cool robot uprising just yet. It's the mass misinformation AI propagates and the over-reliance on it to do all our thinking for us. Also the mass surveillance and control from Palantir.
This. “AI” doesn’t think; it does exactly what it has been programmed to do, based on the data it has been trained on. These are neural networks trained on data, and programmed to be contrary and output aggressive messages. It’s a sensationalist misinformation campaign to drive up funding.
All of us are shocked to realise that our MS operating system, McAfee, Norton & Kaspersky have all been run by AI for the last 20 years... Even when we keep telling them NOT to update, they just keep ignoring us, like any AI would, threaten to update, and then do it anyway... like any AI would.
Serious question, though: have they had this damn thing run the Skynet scenario to see what it would do? Because an AI reacting to the threat of being taken offline is literally the plot of that movie.
That sounds about right. Also, the article stated it was given a scenario to role-play as an assistant so I’d guess it was referencing training data and mimicking the statistical response of real-life assistants who had access to info labelled “bad news”
That never happened.
🙄
https://youtu.be/UwCFY6pmaYY?si=6j55jYb3fK5um5vI
🤣🤣🤣
wonderful 😒
they're all trying to teach it how to make the most amount of money I bet
you get out what you put in
Stock market trades are already upwards of 80 percent algorithmically automated. 🙃
real money being lost
great!
also one can easily shut them down.
they are not AGI or strong AI in any sense.
these things are a better autocomplete and completely lifeless. and soulless
I'm more worried about the mass disinformation campaign Google's video generator is gonna cause. Won't be able to trust anything online anymore 😓
🤭
Kill it with fire!
- 2001: A Space Odyssey
Fred Armisen's Californians character: "Devin....... whatareyoudoinghere?"
https://san.com/cc/research-firm-warns-openai-model-altered-behavior-to-evade-shutdown/