manfreddiaz.bsky.social
Ph.D. Candidate @Mila and University of Montreal interested in AI/ML connections with economics, game theory and social choice theory. https://manfreddiaz.github.io
28 posts 2,181 followers 690 following

Elo drives most LLM evaluations, but we often overlook its assumptions, benefits, and limitations. While working on SCO, we wanted to understand the SCO-Elo distinction, so I dug in, uncovered some intriguing findings, and documented them in these notes. I hope you find them valuable!

Looking for a principled evaluation method for ranking *general* agents or models, i.e., ones that are evaluated across a myriad of different tasks? I’m delighted to tell you about our new paper, Soft Condorcet Optimization (SCO) for Ranking of General Agents, to be presented at AAMAS 2025! 🧵 1/N

Last week, Michael I. Jordan's insightful talk at the AI Action Summit (www.youtube.com/live/W0QLq4q...) reminded us of the meaningful connections between AI, ML, economics, game theory, and mechanism design. But I'd argue the relationship goes deeper—it's profound, historical, and foundational. ⬇️

Very happy to announce the publication of our latest paper: "A theory of appropriateness with applications to generative artificial intelligence" (arxiv.org/abs/2412.19010). And happy new year, everyone!

Concordia is a library for generative agent-based modeling that works like a table-top role-playing game. It's open source and model agnostic. Try it today! github.com/google-deepm...

🚨 Petition to get NeurIPS to join Bluesky 🚨 I just wrote the NeurIPS board requesting them to consider joining Bluesky. It took about 2 minutes. I invite you to do the same. neurips.cc/Help/Contact If they changed the name of the conference for the greater good, there's a chance! Please repost!

Let's get the multi-agent learning community started up here: go.bsky.app/9gsefkW