Have been expecting to see these kinds of LLM poisoning attacks for a while. Would hope the big foundation models will have efforts in place to block them, though I guess we'll find out.
Reposted from
Nina Jankowicz
A pro-Russia content aggregation network is churning out at least 3 MILLION pieces of propaganda per year, all on sites that are virtually unusable by humans.
So what's the goal? We explore the idea that it might be to flood LLMs with pro-Russia content:
static1.squarespace.com/static/6612c... 1/
Comments
https://en.wikipedia.org/wiki/Gray_goo
'No one would ever do that to us!'
'They did what to us?!'
'Why didn't anyone warn us!'
'Not our fault!'
Write it on a card, put it in an envelope, mark it to be opened after an LLM extols the virtues of Putin.
Tick them off as they come to pass.
Remember prophecy is a curse.
Like a delayed auditory feedback device
https://www.youtube.com/watch?v=cwoTXE0PQiw