Profile avatar
allenanie.bsky.social
Stanford CS PhD working on RL and LLMs with Emma Brunskill and Chris Piech. Co-creator of Trace. Prev @GoogleDeepMind @MicrosoftResearch Specifically - Offline RL - In-context RL - Causality https://anie.me/about Unverified hot takes go to this account
33 posts 2,295 followers 445 following
Regular Contributor
Active Commenter

Check out Tianwei’s latest work on using unlikelihood objective to distill search traces back to base model to boost reasoning capabilities of LLMs!

For all the RL PhDs and people interested in Planning and MDPs, there's a summer internship opportunity at AWS Science that specializes in LLM post-training, RLHF, LLM agents, and benchmarks like WebArena. Interested students can send their CV to [email protected]

For education and psychometrics people, this dataset is very useful!

People say Ching-an and I are indistinguishable…is that true 🤣

Come check us out near the Tesla Booth in West Exhibition Hall A 3-5pm! Come and claim your mug 🤣 we have an identity crisis — people keep thinking we are from IBM for some reason…

Unveiling Trace v0.1.3 at NeurIPS 2024, a library for building an RL-style AI Agent that learns from the environment and human feedback. Today's LLM Agent libraries are not RL agents. They specify a workflow, and it remains unchanged regardless of user feedback. #NotRL vimeo.com/1036224270

arxiv.org/abs/2411.17668 Our postdoc zihan slays another COLT open problem! proceedings.mlr.press/v247/kornows...

For people who like RL theory, this is a must follow!

Hello...world? Trying to reconstruct my academic networks over here :) Follow me if we know each other or if you're interested in machine learning for healthcare/social equity! Please retweet, or resky, or whatever they call it over here.

Here is a list of ML OSS & Open Source / Science enthusiasts I found on Bluesky 🦋 go.bsky.app/8MFcfXd Let me know if you find such people here! I'm still new here and probably the list misses many must-add people, so let's built it together💪

How to save/bookmark posts on 🦋?

I wanted to contribute to "Starter Pack Season" with one for Stanford NLP+HCI: go.bsky.app/VZBhuJ5 Here are some other great starter packs: - CSS: go.bsky.app/GoEyD7d + go.bsky.app/CYmRvcK - NLP: go.bsky.app/SngwGeS + go.bsky.app/JgneRQk - HCI: go.bsky.app/p3TLwt - Women in AI: go.bsky.app/LaGDpqg

Wow, I guess 🦋 is taking off 😆 If you don't know me or my work, here are some highlights: In-Context RL / LLM Agents EVOLvE arxiv.org/pdf/2410.06238 Accelerate Distributed System arxiv.org/pdf/2410.15625 RL / Causal Inference + Human arxiv.org/pdf/2304.04933 arxiv.org/abs/2407.09975

The RL (and some non-RL folks) starter pack is almost full. Pretty clear that the academic move here has succeeded go.bsky.app/3WPHcHg

This talk is just fascinating — “o1 has an effective way to scale compute at inference time” — but you just can’t tell us what it exactly is 🤣

Noam Brown giving a talk on o1 at Stanford right now 🔥