manueltonneau.bsky.social - Profile | ThreadSky | a Reddit-style client for Bluesky

Why did Grok suddenly start talking about “white genocide in South Africa” even if asked about baseball or cute dogs? Because someone at Musk’s xAI deliberately did this, and we only found out because they were clumsy. My piece on the real dangers of AI. Gift link: www.nytimes.com/2025/05/17/o...

submitted 39 days ago • 8 comments

New preprint with @jbakcoleman.bsky.social @lewan.bsky.social @randomwalker.bsky.social @orbenamy.bsky.social @lfoswaldo.bsky.social where we argue for a complex-system perspective to understand the causal effects of social media on society and for a triangulation of methods arxiv.org/abs/2505.09254

submitted 41 days ago • 2 comments

What do experts think about the potential negative impacts of social media on adolescent mental health? We have a new consensus statement with 120 experts on this topic. Check it out to see where experts agree and where they think more evidence is needed!

submitted 41 days ago • 0 comments

Excited to share that two of our papers got into ACL 2025! 🎉 📌 Main: HateDay: A Global Hate Speech Dataset Representative of Twitter (arxiv.org/abs/2411.15462) 📌 Findings: When Claims Evolve – Robustness to Misinformation Edits (arxiv.org/abs/2503.03417) See you all in Vienna! 🇦🇹 #ACL2025 #NLProc

submitted 40 days ago • 0 comments

Just published at CHI ’25: How Commercial Content Moderation APIs over- and under-moderate hate speech dl.acm.org/doi/10.1145/... w/ @dawiet.bsky.social, Amin Oueslati, @hheuer.bsky.social, @dimitristaufer.bsky.social, Lena Pohlmann. 🧵

submitted 44 days ago • 1 comment

🎓For today's lab seminar, it was a pleasure to have @manueltonneau.bsky.social presenting "HateDay: Insights from a Global Hate Speech Dataset Representative of a Day on Twitter" ✨ #NLProc #hatespeech

submitted 47 days ago • 0 comments

⏰ Early‑bird registration for #IC2S2’25 in Norrköping ends May 9—lock in your spot (and the discount) today: www.ic2s2-2025.org/register/

submitted 50 days ago • 0 comments

Had a blast presenting my research on hate speech moderation on social media and the potential of human-AI collaboration to improve it, thanks a lot for the invite @hertiedatascience.bsky.social ! Check out this blog post for details on our preliminary results: www.hertie-school.org/en/datascien...

submitted 48 days ago • 0 comments

🚨 New preprint on AI persuasion and public health 🚨 A 3-min conversation with GPT-4o nudged HPV-vax-hesitant parents (who obv knew it was AI & consented!)—BUT reading standard public-health material still outperformed chatbots in impact and longevity. Details below 👇

submitted 56 days ago • 2 comments

Thrilled to be at the CHI conference this week where PhD student @schafer.bsky.social will present our paper — co-authored w/ Rachel Moran (1st author) & @mertcanbayar.bsky.social — titled, "The End of Trust and Safety?" dl.acm.org/doi/10.1145/...

submitted 59 days ago • 3 comments

🚨 New CHI'25 EA paper! 🚨 How can we design culturally sensitive mental health chatbots for Indian adolescents? 🇮🇳📱 Our mixed-methods study reveals key design insights—from stigma to personalization. Read it here: arxiv.org/abs/2503.08562 #CHI2025 #HCI #MentalHealth #AIforGood #India

submitted 66 days ago • 0 comments

🚨 New paper out! 🚨 @trfetzer.com and I ask a simple question with big stakes: Do LLMs deliver reliable health advice around the globe? We benchmark 6 leading LLMs, ask advice in 21 languages & 9K real-world claims from. Short thread for results, data & code included. 1/6

submitted 58 days ago • 2 comments

New! In his latest opinion piece, Associate Professor @mmosleh.bsky.social (@oii.ox.ac.uk) asks how we can encourage engagement with online fact-checking. Read the full article: www.oii.ox.ac.uk/news-events/...

submitted 64 days ago • 0 comments

Many feel that politics globally has become toxic and hostile over recent years. But is this actually the case? And if so, who's to blame? What issues are driving toxicity? We analyze 18M tweets from politicians in 17 countries to find out! w/ @julianachueri.bsky.social arxiv.org/pdf/2503.22411

submitted 86 days ago • 4 comments

📆 Reminder! 📆 🚀 The #WOAH2025 submission deadline is just over two weeks away! (April 18, 2025, Anywhere on Earth!) 🔗 CfP: workshopononlineabuse.com/cfp.html We're excited to see your submissions! 🤩 #ACL2025 #NLProc

submitted 86 days ago • 0 comments

It’s insane to think that the Trump administration was more welcoming this week to the Tate brothers than to the President of Ukraine.

submitted 116 days ago • 657 comments

🚨 Deadline Extended! 🚨 We've extended the submission deadline to Friday, April 18, 2025 (AoE)! Please share widely! www.workshopononlineabuse.com/cfp.html

submitted 116 days ago • 0 comments

Remarkable assessment by an incoming German chancellor. “for me it is an absolute priority to strengthen Europe as quickly as possible, so that we achieve independence from the US, step by step.” www.dw.com/en/german-el...

submitted 122 days ago • 65 comments

"During the meeting, Ukraine was told it faced imminent shutoff of the Starlink service if it did not reach a deal on critical minerals, said the source, who requested anonymity to discuss closed negotiations." Astonishing betrayal of an ally. www.reuters.com/business/us-...

submitted 124 days ago • 145 comments

The US Oligarch and Nazi Elon Musk has been defeated by a German court. Germany has ruled that Musk must immediately provide researchers with access to X's data on politically related content ahead of the country’s election. www.politico.eu/article/berl...

submitted 137 days ago • 383 comments

🚨OpEd+data: Meta is out of step with public opinion🚨 Zuck cut moderation b/c he said people no longer want it. But he's wrong! We polled 1k Americans and most people, including majority of Reps: i) want content moderation ii) don't want Community Notes w/o fact-checkers thehill.com/opinion/tech...

submitted 148 days ago • 6 comments

Today, we are releasing MSTS, a new Multimodal Safety Test Suite for vision-language models! MSTS is exciting because it tests for safety risks created by multimodality. Each prompt consists of a text + image that only in combination reveal their full unsafe meaning. 🧵

submitted 155 days ago • 2 comments

New blog! @oiioxford.bsky.social doctoral researchers @deeliu97.bsky.social, @manueltonneau.bsky.social and Juliette Zaccour propose a series of recommendations for effective data access and data governance in light of the EU’s Digital Service Act. Read the full article: bit.ly/49DJyhn

submitted 190 days ago • 0 comments

💙 𝗗𝗮𝘁𝗮 𝗔𝗻𝗻𝗼𝘁𝗮𝘁𝗶𝗼𝗻 𝗕𝗼𝘁𝘁𝗹𝗲𝗻𝗲𝗰𝗸 𝗮𝗻𝗱 𝗔𝗰𝘁𝗶𝘃𝗲 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴 𝗳𝗼𝗿 𝗡𝗟𝗣 𝗶𝗻 𝘁𝗵𝗲 𝗘𝗿𝗮 𝗼𝗳 𝗟𝗟𝗠𝘀 💡 Have you ever had to overcome a lack of labeled data to deal with an NLP task? We are conducting a survey to explore the strategies used to overcome this bottleneck. #NLP #ML

submitted 192 days ago • 2 comments

📣Online Talk Series (14): Diyi Liu - “The Legitimacy of Platformized Speech Governance: A Mixed-Methods Case Study Approach” 🗓️December 17, 2024 🕛3-4 pm (CET) 📍Online Event @deeliu97.bsky.social will present in the 14th session of "Behind The Scenes". 👉Info & Registration: lnkd.in/gXttfycT

submitted 195 days ago • 0 comments

Content moderation is a power platforms exercise with consequence at moments of collective vulnerability, such as during elections. Assuming that platforms act for the betterment of all is, at this point, one assumption too many, write Sandra González-Bailón and David Lazer.

submitted 196 days ago • 1 comment

🚨New WP🚨 We examine news sharing on 7 platforms: 1)Right-leaning platforms=lower quality news 2)Echo-platforms: Right-leaning news gets more engagement on right-leaning platforms, vice-versa for left-leaning 3)But low-quality news gets more engagement EVERYWHERE, even BlueSky! osf.io/preprints/ps...

submitted 198 days ago • 7 comments

Trump has vowed to crack down on universities involved in misinformation research, or what he dubs the "censorship cartel" - e.g. by curbing funds to those that have "flagged content for removal", or through legal threats. "I'm pretty fucking scared," one professor tells me. www.ft.com/content/bfb4...

submitted 213 days ago • 80 comments

📣 New article from me, Thomas Struett, @paufder.bsky.social & @rwg.aoir.social.ap.brid.gy, available free & #openaccess in the International Journal of Communication: Can This Platform Survive? Governance Challenges for the Fediverse ijoc.org/index.php/ij... #academia

submitted 207 days ago • 3 comments

New @acm-cscw.bsky.social paper, new content moderation paradigm. Post Guidance lets moderators prevent rule-breaking by triggering interventions as users write posts! We implemented PG on Reddit and tested it in a massive field experiment (n=97k). It became a feature! arxiv.org/abs/2411.16814

submitted 210 days ago • 5 comments

🥳We're excited to share that WOAH 2025, our 9th edition, will take place at #ACL2025 in Vienna! @aclmeeting.bsky.social Our special theme this year will be "Harms Beyond Hate Speech". CfP and more details soon 🚀

submitted 210 days ago • 0 comments

We are hosting the 11th International Conference on Computational Social Science in Sweden 🚀The IC2S2'25 website is LIVE, and submissions are OPEN! 📍Norrköping | July 21-24, 2025 Call for Abstracts (until Feb 24) Call for Tutorials (until Jan 17) 🔗Explore details & submit: ic2s2-2025.org

submitted 243 days ago • 0 comments

Can we detect #hatespeech at scale on social media? To answer this, we introduce 🤬HateDay🗓️, a global hate speech dataset representative of a day on Twitter. The answer: not really! Detection perf is low and overestimated by traditional eval methods arxiv.org/abs/2411.15462 🧵

submitted 211 days ago • 1 comment

Manuel Tonneau, Diyi Liu, Niyati Malhotra, Scott A. Hale, Samuel P. Fraiberger, Victor Orozco-Olvera, Paul Röttger. HateDay: Insights from a Global Hate Speech Dataset Representative of a Day on Twitter. https://arxiv.org/abs/2411.15462

submitted 211 days ago • 1 comment

New paper: Do social media algorithms shape affective polarization? We ran a field experiment on X/Twitter (N=1,256) using LLMs to rerank content in real-time, adjusting exposure to polarizing posts. Result: Algorithmic ranking impacts feelings toward the political outgroup! 🧵⬇️

submitted 212 days ago • 33 comments

Want to see a real-time map of the carbon intensity of electricity around the world? Electricity Maps has the goods! 🔌💡

submitted 217 days ago • 3 comments

Ready for another Computational Social Science Starter Pack? Here is number 2! More amazing folks to follow! Many students and the next gen represented! go.bsky.app/GoEyD7d

submitted 223 days ago • 34 comments