Profile avatar
kazzorr.bsky.social
Physical Oceanographer 🌊, ML/AI at UCLA.
49 posts 1,236 followers 1,716 following
Regular Contributor
Active Commenter

If you're training models with more than one loss term, I can again strongly recommend our ConFIG optimizer: tum-pbs.github.io/ConFIG/ , simply swap out Adam&Co. for ConFIG, and you can potentially see substantial reductions in your training loss 😁 We'd also be curious to hear how it works for you

The densest water in the world ocean forms around Antarctica, and then spreads out to flood the global abyss. We’ve seen the production of this water dwindle over the past few decades, a change that is now impacting the North Atlantic. 🌊🧪❄️ www.nature.com/articles/s41...

We have a new paper out in @science.org today, led by Helge Goessling from #AWI: The recent global temperature surge in 2023 was intensified by a record-low planetary #albedo 👉 doi.org/10.1126/scie... @thomasjung.bsky.social @ecmwf.bsky.social This is what we found (🧵1/8)

Okay, I have to do a thread on this pretty amazing article because it is just seriously chockers with eye-opening stats about 'direct air capture' - ie, sucking up air and trying to remove carbon dioxide from it news.mit.edu/2024/reality...

Graph Transformers (GTs) can handle long-range dependencies and resolve information bottlenecks, but they’re computationally expensive. Our new model, Spexphormer, helps scale them to much larger graphs – check it out at NeurIPS next week, or the preview here! [1/13] #NeurIPS2024

A common question nowadays: Which is better, diffusion or flow matching? 🤔 Our answer: They’re two sides of the same coin. We wrote a blog post to show how diffusion models and Gaussian flow matching are equivalent. That’s great: It means you can use them interchangeably.

🧵 Today with @polymathicai.bsky.social and others we're releasing two massive datasets that span dozens of fields - from bacterial growth to supernova! We want this to enable multi-disciplinary foundation model research.

'Global emergence of regional heatwave hotspots outpaces climate model simulations' our new paper in @pnas.org with S. Bartusek, R. Seager. J. Schellnhuber and M. Ting investigating the tail behaviour of extreme heatwave trends. @iiasa.ac.at @columbiaclimate.bsky.social @lamontearth.bsky.social

I'm excited to share our APEBench paper arxiv.org/abs/2411.00180 and code github.com/tum-pbs/apeb..., to be presented at #NeurIPS. Congratulations Felix and Simon 😀 👍 At its core, APEBench features a lightning-fast ⚡️ fully differentiable spectral solver with a huge range of different PDEs.

If you think Direct Air Capture (DAC) is going to save us, please read this. This is isn’t some left-wing fringe environmental group saying this. It’s MIT. news.mit.edu/2024/reality...

I am happy to share here our paper: "Spontaneous symmetry breaking in generative diffusion models", published at Neurips 2023. We found that the generative capabilities of diffusion models are the result of a phase transition! Preprint: arxiv.org/abs/2305.19693 Code: github.com/gabrielraya/...

Since I'm moving over, I would like to reshare my most recent blog post on why you should learn Optimal Transport. Part 2 is coming, most likely during the winter break :) mufan-li.github.io/OT1/

Even as an interpretable ML researcher, I wasn't sure what to make of Mechanistic Interpretability, which seemed to come out of nowhere not too long ago. But then I found the paper "Mechanistic?" by @nsaphra.bsky.social and @sarah-nlp.bsky.social, which clarified things.

Thrilled to announce Boltz-1, the first open-source and commercially available model to achieve AlphaFold3-level accuracy on biomolecular structure prediction! An exciting collaboration with Jeremy, Saro, and an amazing team at MIT and Genesis Therapeutics. A thread!

Important new study shows that current climate models underestimate the human-caused slowing of the #AMOC (Atlantic Meridional Overturning Circulation), because they neglect freshwater influx from Greenland melt and other sources. /1 🌊 www.nature.com/articles/s41...

🍏 New preprint alert! 🍏 PoM: Efficient Image and Video Generation with the Polynomial Mixer arxiv.org/abs/2411.12663 This is my latest "summer project" and it was so big I had to call in reinforcements (Thanks @nicolasdufour.bsky.social) TL;DR Transformers are for boomers, welcome to the future 🧵👇

I heard bluesky likes links. So here is a link to a book I’m writing. github.com/NannyML/The-...

Interested in machine learning in science? Timo and I recently published a book, and even if you are not a scientist, you'll find useful overviews of topics like causality and robustness. The best part is that you can read it for free: ml-science-book.com