rishav84ia.bsky.social • 67 days ago

The unlearning goal says: U(D, S, A(D)) ≈ A(D◦S)
Meaning: for a request S to delete/add data in set D, your “unlearning algorithm” U should produce a model U(D, S, A(D)) that looks like a model A(D◦S) re-trained from scratch on dataset D◦S. But does it actually "delete" information requested in S? 👀
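
To make the notation concrete, here is a minimal sketch in Python, assuming a toy learner A whose entire "model" is a running sum and count. None of these names (`train`, `apply_request`, `unlearn`, `Model`) come from the post; they are hypothetical stand-ins for A, D◦S, and U. For a learner this simple the match is exact, whereas for a real network it would at best be approximate.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    """Toy model: just the sufficient statistics of the training data."""
    total: int
    count: int


def train(dataset):
    """A(D): train from scratch on the dataset."""
    return Model(total=sum(dataset), count=len(dataset))


def apply_request(dataset, remove, add):
    """D∘S: the dataset after removing and adding the requested items."""
    remaining = list(dataset)
    for item in remove:
        remaining.remove(item)  # raises ValueError if the item is not in D
    return remaining + list(add)


def unlearn(dataset, remove, add, model):
    """U(D, S, A(D)): patch the trained model instead of retraining.

    `dataset` is unused by this toy learner; it is kept only so the
    signature mirrors U(D, S, A(D)).
    """
    return Model(
        total=model.total - sum(remove) + sum(add),
        count=model.count - len(remove) + len(add),
    )


D = [1, 2, 3, 10]
remove, add = [10], [4]

patched = unlearn(D, remove, add, train(D))
retrained = train(apply_request(D, remove, add))
print(patched == retrained)
```

The check only says the patched model is indistinguishable from the retrained one. It says nothing about where the items in `add` came from, which is the gap the rest of the thread is about.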

Comments

rishav84ia.bsky.social•67 days ago

🚨Here's a real-world example of unlearning failing🚨
@samdoesarts.bsky.social was among the first big-name illustrators whose style was illegally copied by AI models trained solely on his copyrighted art. These illegal models are still publicly available!
https://huggingface.co/models?search=samdoesart

rishav84ia.bsky.social•67 days ago

In 2023, frustrated artists filed a class-action lawsuit against corporations illegally profiting off of their copyrighted works. There's a strong sentiment in the community that creators are not being protected against an entire industry built on laundering copyrighted sources.

rishav84ia.bsky.social•67 days ago

🎉 Artists are seeing small wins against big AI firms (Stability AI, Midjourney, DeviantArt) in federal court. If they triumph, can Machine Unlearning truly deliver on its promise to scrub these multi-million-dollar (or even billion-dollar) models of all copyrighted data?!

The answer is NO!

rishav84ia.bsky.social•67 days ago

‼️ To dodge infringement claims, “AI bros” are creating second-gen datasets of synthetic art, derived from models trained on copyrighted originals. They omit the literal works but preserve their essence—and spark the same legal concerns.

rishav84ia.bsky.social•67 days ago

Why does “U(D, S, A(D)) ≈ A(D◦S)” fail in reality? Because if D has copyrighted data, the model A(D) can produce first-gen synthetic copies that influence the add/remove request S. In Sam’s case, the second-gen dataset D◦S was created using first-gen images from A(D), so S isn’t independent of A(D).
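
A toy illustration of that dependence, again assuming a deliberately trivial learner whose "model" is just the mean of its training data (a stand-in for a learned style). The numbers and the helper names `train`, `generate`, and `apply_request` are hypothetical, not from the paper. The request S removes every original and adds synthetic samples, and retraining from scratch on D◦S is taken as the best any unlearning algorithm could hope to match.

```python
def train(dataset):
    """A: toy learner; the 'model' is the dataset mean."""
    return sum(dataset) / len(dataset)


def generate(model, n):
    """Toy generator: emit n synthetic samples at the model's learned mean."""
    return [model] * n


def apply_request(dataset, remove, add):
    """D∘S: the dataset after removing and adding the requested items."""
    remaining = list(dataset)
    for item in remove:
        remaining.remove(item)
    return remaining + list(add)


originals = [90.0, 100.0, 110.0]          # stand-in for the copyrighted works, D
first_gen_model = train(originals)        # A(D)

# S built from first-gen outputs of A(D): remove all originals, add synthetic copies.
synthetic = generate(first_gen_model, 3)
retrained_dependent = train(apply_request(originals, remove=originals, add=synthetic))

# S built from unrelated data: same removal, but the additions never saw A(D).
unrelated = [0.0, 5.0, 10.0]
retrained_independent = train(apply_request(originals, remove=originals, add=unrelated))

print(first_gen_model, retrained_dependent, retrained_independent)
```

In this sketch the model retrained on the synthetic set lands exactly where A(D) was, even though no original remains in D◦S, while the model retrained on unrelated data does not. A perfect U would reproduce the first outcome faithfully.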

rishav84ia.bsky.social•67 days ago

No matter how perfectly a machine unlearning algorithm U(D, S, A(D)) simulates retraining from scratch on D◦S, the “essence” of the original copyrighted data lingers.

Check out my (slightly technical) paper on why machine unlearning, as it stands today, does not really work!
