manfreddiaz.bsky.social - Profile | ThreadSky | a Reddit-style client for Bluesky

manfreddiaz.bsky.social

Ph.D. Candidate @Mila and University of Montreal interested in AI/ML connections with economics, game theory and social choice theory. https://manfreddiaz.github.io

28 posts 2,181 followers 690 following

Posts 7 Comments 26

Elo drives most LLM evaluations, but we often overlook its assumptions, benefits, and limitations. While working on SCO, we wanted to understand the SCO-Elo distinction, so I looked and uncovered some intriguing findings and documented them in these notes. I hope you find them valuable!

submitted 3 days ago • 0 comments

Looking for a principled evaluation method for ranking of general agents or models, i.e. that get evaluated across a myriad of different tasks? I’m delighted to tell you about our new paper, Soft Condorcet Optimization (SCO) for Ranking of General Agents, to be presented at AAMAS 2025! 🧵 1/N

submitted 4 days ago • 1 comment

Last week, Michael I. Jordan's insightful talk at the AI Action Summit (www.youtube.com/live/W0QLq4q...) reminded us of the meaningful connections between AI, ML, economics, game theory, and mechanism design. But I'd argue the relationship goes deeper—it's profound, historical, and foundational. ⬇️

submitted 16 days ago • 1 comment

Very happy to announce the publication of our latest paper: A theory of appropriateness with applications to generative artificial intelligence arxiv.org/abs/2412.19010 And happy new year everyone!

submitted 59 days ago • 2 comments

Concordia is a library for generative agent-based modeling that works like a table-top role-playing game. It's open source and model agnostic. Try it today! github.com/google-deepm...

submitted 103 days ago • 2 comments

🚨 Petition to get NeurIPS to join Bluesky 🚨 I just wrote the NeurIPS board requesting them to consider joining Bluesky. It took about 2 minutes. I invite you to do the same. neurips.cc/Help/Contact If they changed the name of the conference for the greater good, there's a chance! Please repost!

submitted 105 days ago • 3 comments

Lets get the multi-agent learning community started up here: go.bsky.app/9gsefkW

submitted 106 days ago • 5 comments