emilioferrara.bsky.social
Prof of Computer Science at USC. AI, social media, society, networks, data, and HUMANS LABS. http://www.emilio.ferrara.name
153 posts 2,625 followers 396 following
comment in response to post
We uncovered different overt and covert information suppression dynamics, as well as even more subtle ways DeepSeek answers are internally moderated, selectively presented, and at times even framed with ideological alignment to state-sponsored propaganda narratives. arxiv.org/abs/2506.12349
comment in response to post
Thx! Very useful!
comment in response to post
Paper here: arxiv.org/abs/2505.21729
comment in response to post
wait until they hear matplotlib...
comment in response to post
🤩Cool collaboration w/ @jinyiye.bsky.social @emilioferrara.bsky.social @luceriluc.bsky.social 🔍Read more: arxiv.org/abs/2502.11248 📊Resources available: github.com/angelayejiny...
comment in response to post
lol insisting is indeed one possible strategy; the screenshots are maybe not that clear, but I asked exactly the same thing four times in a row and once I got a non-redacted answer!
comment in response to post
Once the response composition is completed, however, the entire answer is deleted and replaced by the famous error message “Sorry, that's beyond my current scope. Let’s talk about something else.” Up to us, as researchers, to decide what kind of model alignments we find acceptable.
comment in response to post
And by some higher order approximation, these are all economic/$ policy problems :)
comment in response to post
Great list. I have done a few of these things myself:)
comment in response to post
Email is best. Thanks for your interest
comment in response to post
Since I can’t edit, I’m tagging both Luca Luceri @luceriluc.bsky.social and Keith Burghardt @keithburghardt.bsky.social here in case folks want to reach out directly to them!
comment in response to post
Will know more tomorrow. Not a good day today.
comment in response to post
Correct. INCAS has been terminated.
comment in response to post
go.bsky.app/GoEyD7d
comment in response to post
Added! Pls rt!
comment in response to post
Done pls reshare!
comment in response to post
Done pls rt!
comment in response to post
Done pls reshare!
comment in response to post
Done pls reshare!
comment in response to post
Done. Pls reshare!
comment in response to post
done, pls reshare!
comment in response to post
Added! Pls reshare!
comment in response to post
Added! Pls reshare :)
comment in response to post
Added pls reshare!
comment in response to post
done, pls reshare!