Profile avatar
yoavartzi.com
LM/NLP/ML researcher ¯\_(ツ)_/¯ yoavartzi.com / associate professor @ Cornell CS + Cornell Tech campus @ NYC / nlp.cornell.edu / associate faculty director @ arXiv.org / researcher @ ASAPP / starting @colmweb.org / building RecNet.io
391 posts 5,257 followers 311 following
Regular Contributor
Active Commenter

A friend eloquently summarized generative AI by just referring me to Marx youtu.be/cHxGUe1cjzM?...

The submission email for the workshops call is now available on the call for workshops: colmweb.org/cfw.html

Come work with us at MSR AI Frontiers and help us figure out reasoning! We're hiring at the Senior Researcher level (eg post phd). Please drop me a DM if you do! jobs.careers.microsoft.com/us/en/job/17...

No surveys here, so I ran this on X. So this brings about the next question: why are we still doing two columns in some venues?! Or tiny fonts in others? We need templates that fit how people read: fonts, columns, margins, etc

“When a measure becomes a target, it ceases to be a good measure.” -- Goodhart’s Law just saying, without explicitly pointing to lmarena Thank you @beenwrekt.bsky.social .... I need to print it on my office door

The catalyst for this pretty extensive update were brutal reviews at ICLR. Mostly, they just missed the whole point (like: why is this not supervised learning? 🙈). But, whatever, I really like the new version, so ¯\_(ツ)_/¯

We recently pushed an update to our in-context RL paper. Usually, updates don't justify a post, but this one is exceptionally contentful -> 🧵 tl;dr: all the findings are stronger, and the behaviors are super cool! arxiv.org/abs/2410.05362

We are expecting🫄 A 3rd BabyLM👶, as a workshop @emnlpmeeting.bsky.social Kept: all New: Interaction (education, agentic) track Workshop papers More in 🧵 Even more: arxiv.org/abs/2502.10645 babylm.github.io #AI #LLMs #MachineLearning #language #cognition #NLP #data 🤖📈

Are there stats about how many people read academic papers on screens vs. paper? Screens being computers, tablets, e-ink, your smartwatch (😅), your roomba touch panel, whatever ....

We now have a form for postdoc applications: forms.gle/tiydAChgV1wL... I am looking at candidates on a rolling basis, so while there's no deadline, there's an advantage of throwing your name in the ring earlier than later

Archive Request xkcd.com/3052

yoavartzi.com/log/ai-term....

Increasingly frustrated with the misunderstanding of basic experimental design in the community. People confuse experimental testbeds designed to show effect and answer specific research questions vs. the applicability of the approach. Leading to experimental demands that stifle research