saxon.me - Profile | ThreadSky | a Reddit-style client for Bluesky

And here is the remote pre-recording of my talk for the wultiwodal meta-evaluation tutorial at CVPR youtu.be/ymwPz1sioJI

submitted 1 day ago • 0 comments

I love this film so much, because when you think about it the government's response to discovering Kaiju are real being a bunch of bureaucrats immediately trying to establish that "dealing with Godzilla does not fall within the purview of my department" is actually the most realistic take yet.

submitted 1 day ago • 7 comments

I have at least two funny meme LM evaluation project ideas for any interested collaborators

submitted 2 days ago • 1 comment

Breaking News, I am just getting word that they are in fact called "Multi-", not "Wulti-modal models" Thank you @shaily99.bsky.social for informing me

submitted 3 days ago • 3 comments

Check out our CVPR tutorial "Evaluating Large Wulti-modal Models: Challenges and Methods" on Wednesday from 1-5pm in room 109! Unfortunately, I won't be there to present but my labmate Xiao will share some of my slides during her section :) lmm-understand.github.io

submitted 3 days ago • 0 comments

broooo what if some of those that work forces are the same that burn crosses

submitted 5 days ago • 7 comments

This piece rhetorically asks: "Should the climate movement start demanding that everyone stop listening to Spotify? Would that be a good use of our time?" Unfortunately, I think many would say 'yes'. andymasley.substack.com/p/individual...

submitted 6 days ago • 4 comments

"The cost of living has increased, but the cost of owning has increased more" says the rent hike letter from a landlord who had funded 0 repairs since I've lived here, in CA with frozen property tax

submitted 8 days ago • 2 comments

A cheeseburger uses a lot more water than a ChatGPT request 🍔 Actual farms, not the data center variety, are sucking up groundwater more quickly than surface water, explains @markgongloff.bsky.social 🎥

submitted 9 days ago • 1052 comments

Proposed to cut number of people involved in NSF activities by 70%. We are literally on the chopping board. Call your reps.

submitted 13 days ago • 0 comments

To be honest, I kinda love grok? (when it isn't being Elonbotomized to be a racism machine) So many rightoid maniacs query it expecting to see their conspiracist beliefs echoed back at them only to repeatedly get gently corrected with factual information lmao

submitted 14 days ago • 0 comments

I cannot stop thinking about Andor. Masterpiece, must watch for pretty much everyone imo

submitted 15 days ago • 1 comment

Sent my thesis in to my committee this week, will defend June 2 at 1pm PT! If you're interested in catching it on zoom, here's a calendar link! calendar.google.com/calendar/u/0...

submitted 21 days ago • 1 comment

Despite the clickbaity title, this is a great level-headed piece from a real scientist who tried working in AI for science. The key point that AI is a tool, not an all-encompassing revolution, is common sense, but the details are interesting and illuminating open.substack.com/pub/understa...

submitted 21 days ago • 1 comment

If we just add a few more annoying tasks for authors and a few more for reviewers we can fix peer review in AI!

submitted 25 days ago • 0 comments

According to a 2021 report, the University of California system: • generated $82B in economic activity in California • supported 529K jobs in the state • generated $21 in economic output for every $1 received Public divestment from higher ed makes no sense, even in the narrowest economic terms.

submitted 26 days ago • 8 comments

Michael News! I will be joining the Tech Policy Lab at the University of Washington @ischool.uw.edu and UW NLP working with @aylincaliskan.bsky.social as a postdoc in the fall, to work on situated evaluation, multimodal/lingual/cultural genAI, and new directions in safety, fairness, and alignment!

submitted 28 days ago • 3 comments

"Women are PIs on 58% of the canceled grants, although they are PIs on only 34% of all active NSF grants. Similarly, Blacks are PIs on 17% of the terminated grants, although they make only 4% of the total pool. Hispanic PIs and those with disabilities were twice as likely to lose a grant."

submitted 30 days ago • 32 comments

There's no escape! Even in my sister's bar admission ceremony the bar president starts talking about AI 🤣

submitted 30 days ago • 0 comments

We were interviewed for IEEE Spectrum about reasoning models! spectrum.ieee.org/chain-of-tho...

submitted 35 days ago • 0 comments

What is it about the City of Berkeley and Country of England that makes interest in AI safety and weirder fringe stuff like AI consciousness so prevalent? Like why are these topics so big there and not in like Seattle or Pittsburgh??

submitted 36 days ago • 1 comment

Finally a study mix I can get behind www.youtube.com/watch?v=0tR5...

submitted 37 days ago • 0 comments

"LLM on way to replace doctors" gets published in Nature. meanwhile "LLM judgement not as good as human MDs" gets a spot in "Physical Therapy and Rehabilitation Journal".

submitted 39 days ago • 2 comments

Very interesting oral history -- interviews with some top NLP folks on the effects of GenAI on their field: www.quantamagazine.org/when-chatgpt...

submitted 43 days ago • 0 comments

We won an outstanding paper award!! 2025.naacl.org/blog/best-pa...

submitted 48 days ago • 1 comment

PSA for NAACL peeps from a southwest boi (sadly I won't be there): be sure to find a place to eat New Mexico style stacked enchiladas. You can get it "Christmas style" where it's served with both red and green Hatch chile. The Hatch chile is integral, do not skip. Not photogenic, but very delicious

submitted 45 days ago • 1 comment