Really cool work introducing a gradient-free method for unlearning organically memorized sensitive information from LMs! (we also curate two datasets of organically memorized sensitive information) Check out the 🧵 below and come talk to us at @aclmeeting.bsky.social in Vienna 🍻 - ThreadSky

Really cool work introducing a gradient-free method for unlearning organically memorized sensitive information from LMs!

(we also curate two datasets of organically memorized sensitive information)

Check out the 🧵 below and come talk to us at @aclmeeting.bsky.social in Vienna 🍻

Reposted from

🚨New paper at #ACL2025 Findings!
REVS: Unlearning Sensitive Information in LMs via Rank Editing in the Vocabulary Space.
LMs memorize and leak sensitive data—emails, SSNs, URLs from their training.
We propose a surgical method to unlearn it.
🧵👇w/ @boknilev.bsky.social @mtutek.bsky.social
1/8

Comments

Posting Rules

Comments

Posting Rules

Reply