Profile avatar
williamheld.com
Modeling Linguistic Variation to expand ownership of NLP tools Views my own, but affiliations that might influence them: ML PhD Student under Prof. Diyi Yang 2x RS Intern🦙 Pretraining Alum NYU Abu Dhabi Burqueño he/him
87 posts 2,137 followers 450 following
Regular Contributor
Active Commenter

I've only seen Veo 3 (or any other video generation model) used to produce viral videos. The fake videos seem to successfully trick the majority of commenters and have no visible watermark or disclosure of AI use.

What would you say if you saw it in another country? A senator from a coequal branch of government dragged away by security from asking a question of a Cabinet official

🚨 70 million US workers are about to face their biggest workplace transmission due to AI agents. But nobody’s asking them what they want. While AI R&D races to automate everything, we took a different approach: auditing what workers want vs. what AI can deliver across the US workforce.🧵

Really cool to see theory connect to practice! We observed this phenomenon when trying to do deeper WSD cooldowns of our 8B model in the marin.community project! We Z-Lossed our way through the pain, but cool to see some stronger theory: marin.readthedocs.io/en/latest/re...

What foreign power could do as much damage to the United States as Trump is doing to it right now? www.whitehouse.gov/presidential...

Based on current administration policies, China is about to have an influx of returning talent and a accelerated advantage in research investments. You need to be both sinophobic and irrational to expect the US to continue as the global scientific powerhouse with these policy own-goals.

"“From time-to-time instances will arise in which the society, or segments of it, threaten the very mission of the university & its values... In such a crisis, it becomes the obligation of the university as an institution to oppose such measures & actively to defend its interests and its values.”

Super excited Marin is finally out! Come see what we've been building! Code/platform for training fully reproducible models end-to-end, from data to evals. Plus a new high quality 8B base model. Percy did a good job explaining it on the other place. marin.community x.com/percyliang/s...

How much faster would the science of large-scale AI advance if we could open-source the *process* of building a frontier model? Not just the final models/code/data, but also negative results, toy experiments, and even spontaneous discussions. That's what we're trying @ marin.community

It feels worth conference organizers running a study to see if this significantly impacts reviewer scores. I hope things like this are placebos, but if not we need to seriously consider whether existing peer-review processes for big ML conferences are providing value.

Introducing CAVA: The Comprehensive Assessment for Voice Assistants A new benchmark for evaluating the capabilities required for speech-in-speech-out voice assistants! - Latency - Instruction following - Function calling - Tone awareness - Turn taking - Audio Safety TalkArena.org/cava

How does the public conceptualize AI? Rather than self-reported measures, we use metaphors to understand the nuance and complexity of people’s mental models. In our #FAccT2025 paper, we analyzed 12,000 metaphors collected over 12 months to track shifts in public perceptions.

I wrote something up for AI people who want to get into bluesky and either couldn't assemble an exciting feed or gave up doomscrolling when their Following feed switched to talking politics 24/7.

Worth noting that a number of universities have now sued over withheld and canceled grants, but no university has yet sued over the arrest, detention, and threatened deportation of its foreign students. www.nytimes.com/2025/04/19/o...

Mahmoud Khalil writes movingly about what his detention by ICE means for America: www.washingtonpost.com/opinions/202...

aside: a stunning comment from David Baker, UW professor who won the Nobel Prize in 2024. Now 15 lab members are looking for positions overseas. “There’s so many amazing people who want to come in, & we can’t take them. The Nobel Prize was just a little blip. But things have gotten quite bleak.”

Financial Times: "Since 1990, America has lost over 5 million manufacturing jobs. In that time, it has gained 11.8 million roles in professional and business services, and 3.3 million in transportation and logistical activities, linked to multinational supply chains." #EconSky

The Model Context Protocol is cool because it gives external developers a way to add meaningful functionality on top of LLM platforms. To limit test this, I made a "Realtime Voice" MCP using free STT, VAD, and TTS systems. The result is a janky, but makes me me excited about the ecosystem to come!

people being in the streets means something. never let your cynicism convince you otherwise.

The Trump administration’s roundup of students who protested Israel’s bombardment of Gaza marks an astonishing, radical break with what one might justifiably think of as the central American idea. I wrote about it for @theguardian.com. www.theguardian.com/commentisfre...

Working with an interdisciplinary team, we have developed a website to communicate how the White House's proposed cuts to health research would cause losses of $16B and 68,500 jobs. Find out how your community may be impacted. Explore more at SCIMaP: scienceimpacts.org a đź§µ

We have had nearly two decades of panic about Free Speech on Campus and not a single case, not even the ones they made up, were as bad as what's happening now

Step 1) Install the #chi2025 module to your Claude/ChatGPT: knollapp.com/add/ZlRKvCmB... Step 2) Ask the LLM, "Given my interests, what are some CHI 2025 papers I should check out?" (If the model doesn't already know your interests, you might need to state them.)

Exclusive: Navajo Code Talkers disappear from military websites after Trump DEI order

Arresting and threatening to deport students because of their participation in political protest is the kind of action one ordinarily associates with the world’s most repressive regimes. It’s genuinely shocking that this appears to be what’s going on right here. 1/

We are getting closer to have agents operating in the real physical world. However, can we trust frontier models to make embodied decisions 🎮 aligned with human norms 👩‍⚖️ ? With EgoNormia, a 1.8k ego-centric video 🥽 QA benchmark, we show that this is surprisingly challenging!

Great to see the International AI Safety Report highlight research on dialect prejudice, including our work on covert racism in LLMs! www.nature.com/articles/s41...