Profile avatar
manueltonneau.bsky.social
PhD candidate @oiioxford.bsky.social NLP, Computational Social Science @WorldBank manueltonneau.com
35 posts 671 followers 537 following
Prolific Poster
Conversation Starter

Why did Grok suddenly start talking about “white genocide in South Africa” even if asked about baseball or cute dogs? Because someone at Musk’s xAi deliberately did this, and we only found out because they were clumsy. My piece on the real dangers of AI. Gift link: www.nytimes.com/2025/05/17/o...

New preprint with @jbakcoleman.bsky.social @lewan.bsky.social @randomwalker.bsky.social @orbenamy.bsky.social @lfoswaldo.bsky.social where we argue for a complex-system perspective to understand the causal effects of social media on society and for a triangulation of methods arxiv.org/abs/2505.09254

What do experts think about the potential negative impacts of social media on adolescent mental health? We have a new consensus statement with 120 experts on this topic. Check it out to see where experts agree and where they think more evidence is needed!

Excited to share that two of our papers got into ACL 2025! 🎉 📌 Main: HateDay: A Global Hate Speech Dataset Representative of Twitter (arxiv.org/abs/2411.15462) 📌 Findings: When Claims Evolve – Robustness to Misinformation Edits (arxiv.org/abs/2503.03417) See you all in Vienna! 🇦🇹 #ACL2025 #NLProc

Just published at CHI ’25: How Commercial Content Moderation APIs over- and under-moderate hate speech dl.acm.org/doi/10.1145/... w/ @dawiet.bsky.social ky.social, Amin Oueslati @hheuer.bsky.social cial @dimitristaufer.bsky.social al Lena Pohlmann. 🧵

🎓For today's lab seminar, it was a pleasure to have @manueltonneau.bsky.social presenting "HateDay: Insights from a Global Hate Speech Dataset Representative of a Day on Twitter" ✨ #NLProc #hatespeech

⏰ Early‑bird registration for #IC2S2’25 in Norrköping ends May 9—lock in your spot (and the discount) today: www.ic2s2-2025.org/register/

Had a blast presenting my research on hate speech moderation on social media and the potential of human-AI collaboration to improve it, thanks a lot for the invite @hertiedatascience.bsky.social ! Check out this blog post for details on our preliminary results: www.hertie-school.org/en/datascien...

🚨 New preprint on AI persuasion and public health 🚨 A 3-min conversation with GPT-4o nudged HPV-vax-hesitant parents (who obv knew it was AI & consented!)—BUT reading standard public-health material still outperformed chatbots in impact and longevity. Details below 👇

Thrilled to be at the CHI conference this week where PhD student @schafer.bsky.social will present our paper — co-authored w/ Rachel Moran (1st author) & @mertcanbayar.bsky.social — titled, "The End of Trust and Safety?" dl.acm.org/doi/10.1145/...

🚨 New CHI'25 EA paper! 🚨 How can we design culturally sensitive mental health chatbots for Indian adolescents? 🇮🇳📱 Our mixed-methods study reveals key design insights—from stigma to personalization. Read it here: arxiv.org/abs/2503.08562 #CHI2025 #HCI #MentalHealth #AIforGood #India

🚨 New paper out! 🚨 @trfetzer.com and I ask a simple question with big stakes: Do LLMs deliver reliable health advice around the globe? We benchmark 6 leading LLMs, ask advice in 21 languages & 9K real-world claims from. Short thread for results, data & code included. 1/6

New! In his latest opinion piece, Associate Professor, @mmosleh.bsky.social @oii.ox.ac.uk asks how we can encourage engagement with online fact-checking? Read the full article: www.oii.ox.ac.uk/news-events/...

Many feel that politics globally has become toxic and hostile over recent years. But is this actually the case? And if so, who's to blame? What issues are driving toxicity? We analyze 18M tweets from politicians in 17 countries to find out! w/ @julianachueri.bsky.social arxiv.org/pdf/2503.22411

📆 Reminder! 📆 🚀 The #WOAH2025 submission deadline is just over two weeks away! (April 18, 2025, Anywhere on Earth!) 🔗 CfP: workshopononlineabuse.com/cfp.html We're excited to see your submissions! 🤩 #ACL2025 #NLProc

It’s insane to think that the Trump administration was more welcoming this week to the Tate brothers than to the President of Ukraine.

🚨 Deadline Extended! 🚨 We've extended the submission deadline to Friday, April 18, 2025 (AoE)! Please share widely! www.workshopononlineabuse.com/cfp.html

Remarkable assessment by an incoming German chancellor. “for me it is an absolute priority to strengthen Europe as quickly as possible, so that we achieve independence from the US, step by step.” www.dw.com/en/german-el...

"During the meeting, Ukraine was told it faced imminent shutoff of the Starlink service if it did not reach a deal on critical minerals, said the source, who requested anonymity to discuss closed negotiations." Astonishing betrayal of an ally. www.reuters.com/business/us-...

The US Oligarch and Nazi Elon Musk has been defeated by a German court. Germany has ruled that Musk must immediately provide researchers with access to X's data on politically related content ahead of the country’s election. www.politico.eu/article/berl...

🚨OpEd+data: Meta is out of step with public opinion🚨 Zuck cut moderation b/c he said people no longer want it. But he's wrong! We polled 1k Americans and most people, including majority of Reps: i) want content moderation ii) don't want Community Notes w/o fact-checkers thehill.com/opinion/tech...

Today, we are releasing MSTS, a new Multimodal Safety Test Suite for vision-language models! MSTS is exciting because it tests for safety risks *created by multimodality*. Each prompt consists of a text + image that *only in combination* reveal their full unsafe meaning. 🧵

New blog! @oiioxford.bsky.social doctoral researchers @deeliu97.bsky.social, @manueltonneau.bsky.social and Juliette Zaccour propose a series of recommendations for effective data access and data governance in light of the EU’s Digital Service Act. Read the full article: bit.ly/49DJyhn

💙 𝗗𝗮𝘁𝗮 𝗔𝗻𝗻𝗼𝘁𝗮𝘁𝗶𝗼𝗻 𝗕𝗼𝘁𝘁𝗹𝗲𝗻𝗲𝗰𝗸 𝗮𝗻𝗱 𝗔𝗰𝘁𝗶𝘃𝗲 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴 𝗳𝗼𝗿 𝗡𝗟𝗣 𝗶𝗻 𝘁𝗵𝗲 𝗘𝗿𝗮 𝗼𝗳 𝗟𝗟𝗠𝘀 💡 Have you ever had to overcome a lack of labeled data to deal with an NLP task? We are conducting a survey to explore the strategies used to overcome this bottleneck. #NLP #ML

📣Online Talk Series (14): Diyi Liu - “The Legitimacy of Platformized Speech Governance: A Mixed-Methods Case Study Approach” 🗓️December 17, 2024 🕛3-4 pm (CET) 📍Online Event @deeliu97.bsky.social will present in the 14th session of "Behind The Scenes". 👉Info & Registration: lnkd.in/gXttfycT

Content moderation is a power platforms exercise with consequence at moments of collective vulnerability, such as during elections. Assuming that platforms act for the betterment of all is, at this point, one assumption too many, write Sandra González-Bailón and David Lazer.

🚨New WP🚨 We examine news sharing on 7 platforms: 1)Right-leaning platforms=lower quality news 2)Echo-platforms: Right-leaning news gets more engagement on right-leaning platforms, vice-versa for left-leaning 3)But low-quality news gets more engagement EVERYWHERE, even BlueSky! osf.io/preprints/ps...

Trump has vowed to crack down on universities involved in misinformation research, or what he dubs the "censorship cartel" - e.g. by curbing funds to those that have "flagged content for removal”, or legal threats “I'm pretty fucking scared," one professor tells me. www.ft.com/content/bfb4...

📣 New article from me, Thomas Struett, @paufder.bsky.social & @rwg.aoir.social.ap.brid.gy, available free & #openaccess in the International Journal of Communication: Can This Platform Survive? Governance Challenges for the Fediverse ijoc.org/index.php/ij... #academia

New @acm-cscw.bsky.social paper, new content moderation paradigm. Post Guidance lets moderators prevent rule-breaking by triggering interventions as users write posts! We implemented PG on Reddit and tested it in a massive field experiment (n=97k). It became a feature! arxiv.org/abs/2411.16814

🥳We're excited to share that WOAH 2025, our 9th edition, will take place at #ACL2025 in Vienna! @aclmeeting.bsky.social Our special theme this year will be "Harms Beyond Hate Speech". CfP and more details soon 🚀

We are hosting the 11th International Conference on Computational Social Science in Sweden 🚀The IC2S2'25 website is LIVE, and submissions are OPEN! 📍Norrköping | July 21-24, 2025 Call for Abstracts (until Feb 24) Call for Tutorials (until Jan 17) 🔗Explore details & submit: ic2s2-2025.org

Can we detect #hatespeech at scale on social media? To answer this, we introduce 🤬HateDay🗓️, a global hate speech dataset representative of a day on Twitter. The answer: not really! Detection perf is low and overestimated by traditional eval methods arxiv.org/abs/2411.15462 🧵

Manuel Tonneau, Diyi Liu, Niyati Malhotra, Scott A. Hale, Samuel P. Fraiberger, Victor Orozco-Olvera, Paul R\"ottger HateDay: Insights from a Global Hate Speech Dataset Representative of a Day on Twitter https://arxiv.org/abs/2411.15462

New paper: Do social media algorithms shape affective polarization? We ran a field experiment on X/Twitter (N=1,256) using LLMs to rerank content in real-time, adjusting exposure to polarizing posts. Result: Algorithmic ranking impacts feelings toward the political outgroup! 🧵⬇️

Want to see a real-time map of the carbon intensity of electricity around the world? Electricity Maps has the goods! 🔌💡

Ready for another Computational Social Science Starter Pack? Here is number 2! More amazing folks to follow! Many students and the next gen represented! go.bsky.app/GoEyD7d