DeepSeek is as if we banned exporting RAM sticks of more than 2 GB to China and they built a version of Chrome that runs in 256 MB on a Raspberry Pi. Simply wild innovation.
Comments
That's my understanding as well. Currently downloading one of the smaller models to run on LM Studio locally.
LLM models are reasonably easy to "jailbreak" if you have enough compute, is my understanding. So it wouldn't be terribly difficult to uncensor with enough compute power. No idea how much.
Sometimes you don't really need much compute to "jailbreak" an LLM. I guess you'd need to explain what you mean by that. You want to fully uncensor the LLM? That's not quite the same thing in my book.
"Uncensor" is probably a more accurate term. What I mean is updating the weights of the model to permanently change it.
In my mind it's all circumventing guardrails that were put in place, but yes, you're correct that prompt injection and other techniques can get around guardrails without modifying the model.
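To make the distinction in this thread concrete: a prompt-based jailbreak leaves the weights untouched, while fine-tuning ("uncensoring" in the weight-update sense) permanently changes them via gradient descent. Here is a toy sketch of what a weight update actually is — a single scalar parameter with squared-error loss, pure Python, no ML libraries. It is purely illustrative, not an actual LLM fine-tune.

```python
# Toy illustration: fine-tuning permanently changes weights,
# whereas a "jailbreak" prompt leaves them untouched.
# This is a one-parameter model y = weight * x with squared-error
# loss, not a real LLM -- purely to show what a weight update is.

def fine_tune(weight, inputs, targets, lr=0.1, epochs=50):
    """Train the one-parameter model y = weight * x by gradient descent."""
    for _ in range(epochs):
        for x, y in zip(inputs, targets):
            pred = weight * x
            grad = 2 * (pred - y) * x   # d/dw of (w*x - y)^2
            weight -= lr * grad         # the permanent change to the weight
    return weight

# "Original" behaviour: weight of 0.0 means the model outputs 0 for everything.
w = 0.0
# Fine-tune toward a new behaviour (y = 2*x): the weight itself moves.
w = fine_tune(w, inputs=[1.0, 2.0, 3.0], targets=[2.0, 4.0, 6.0])
print(round(w, 3))  # converges to 2.0
```

Real uncensoring fine-tunes billions of such parameters at once, which is why the compute question in the thread matters; the mechanism per weight is the same.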
But it will be killed by Nationalism! #MAGA
What good is the open-source nature of it if the data is still headed overseas to be ingested? Can that be changed by developers?
So the model is OSS, but the DeepSeek service people use is not, as I understand it.
It's not bad at all. Certainly as good as ChatGPT.
Now the real test is whether it can write 10 sentences ending in the word Apple 🤣. ChatGPT 4o struggles with it.
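If anyone wants to grade that test automatically, here's a quick hypothetical checker (`count_apple_endings` is my own name, and the sentence splitting is deliberately naive — it just splits on `.`, `!`, and `?`):

```python
import re

def count_apple_endings(text):
    """Count sentences whose final word is exactly 'Apple' (naive split)."""
    sentences = [s.strip() for s in re.split(r"[.!?]", text) if s.strip()]
    return sum(1 for s in sentences if s.split()[-1] == "Apple")

sample = "I ate an Apple. The pie was made with Apple. Bananas are fine too."
print(count_apple_endings(sample))  # → 2
```

Paste in the model's 10 sentences and see if the count comes back as 10.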
Maybe this is again a kind of Linux moment.
One lesson here is that if it takes more than two pizzas to feed your team, the project will be bloated and inefficient. :)