allenanie.bsky.social - Profile | ThreadSky | a Reddit-style client for Bluesky

allenanie.bsky.social

Stanford CS PhD working on RL and LLMs with Emma Brunskill and Chris Piech. Co-creator of Trace. Prev @GoogleDeepMind @MicrosoftResearch
Specifically
- Offline RL
- In-context RL
- Causality
https://anie.me/about
Unverified hot takes go to this account

33 posts 2,295 followers 445 following

Posts 16 Comments 22

Check out Tianwei’s latest work on using an unlikelihood objective to distill search traces back into the base model to boost the reasoning capabilities of LLMs!

submitted 59 days ago • 0 comments

For all the RL PhDs and people interested in Planning and MDPs, there's a summer internship opportunity at AWS Science that specializes in LLM post-training, RLHF, LLM agents, and benchmarks like WebArena. Interested students can send their CV to [email protected]

submitted 135 days ago • 0 comments

For education and psychometrics people, this dataset is very useful!

submitted 193 days ago • 0 comments

People say Ching-an and I are indistinguishable…is that true 🤣

submitted 193 days ago • 0 comments

Come check us out near the Tesla Booth in West Exhibition Hall A 3-5pm! Come and claim your mug 🤣 we have an identity crisis — people keep thinking we are from IBM for some reason…

submitted 193 days ago • 0 comments

Unveiling Trace v0.1.3 at NeurIPS 2024, a library for building an RL-style AI Agent that learns from the environment and human feedback. Today's LLM Agent libraries are not RL agents. They specify a workflow, and it remains unchanged regardless of user feedback. #NotRL vimeo.com/1036224270

submitted 194 days ago • 2 comments

arxiv.org/abs/2411.17668 Our postdoc zihan slays another COLT open problem! proceedings.mlr.press/v247/kornows...

submitted 207 days ago • 1 comment

For people who like RL theory, this is a must follow!

submitted 208 days ago • 0 comments

Hello...world? Trying to reconstruct my academic networks over here :) Follow me if we know each other or if you're interested in machine learning for healthcare/social equity! Please retweet, or resky, or whatever they call it over here.

submitted 211 days ago • 3 comments

Here is a list of ML OSS & Open Source / Science enthusiasts I found on Bluesky 🦋 go.bsky.app/8MFcfXd Let me know if you find such people here! I'm still new here and the list probably misses many must-add people, so let's build it together 💪

submitted 213 days ago • 41 comments

How to save/bookmark posts on 🦋?

submitted 211 days ago • 4 comments

I wanted to contribute to "Starter Pack Season" with one for Stanford NLP+HCI: go.bsky.app/VZBhuJ5
Here are some other great starter packs:
- CSS: go.bsky.app/GoEyD7d + go.bsky.app/CYmRvcK
- NLP: go.bsky.app/SngwGeS + go.bsky.app/JgneRQk
- HCI: go.bsky.app/p3TLwt
- Women in AI: go.bsky.app/LaGDpqg

submitted 219 days ago • 2 comments

Wow, I guess 🦋 is taking off 😆 If you don't know me or my work, here are some highlights:
In-Context RL / LLM Agents
EVOLvE arxiv.org/pdf/2410.06238
Accelerate Distributed System arxiv.org/pdf/2410.15625
RL / Causal Inference + Human
arxiv.org/pdf/2304.04933
arxiv.org/abs/2407.09975

submitted 215 days ago • 0 comments

The RL (and some non-RL folks) starter pack is almost full. Pretty clear that the academic move here has succeeded go.bsky.app/3WPHcHg

submitted 216 days ago • 12 comments

This talk is just fascinating — “o1 has an effective way to scale compute at inference time” — but you just can’t tell us what it exactly is 🤣

submitted 215 days ago • 0 comments

Noam Brown giving a talk on o1 at Stanford right now 🔥

submitted 215 days ago • 1 comment