schaul.bsky.social - Profile | ThreadSky | a Reddit-style client for Bluesky

schaul.bsky.social

RL researcher at DeepMind https://schaul.site44.com/ 🇱🇺

24 posts 3,109 followers 278 following

Posts 12 Comments 15

Ever thought of joining DeepMind's RL team? We're recruiting for a research engineering role in London: job-boards.greenhouse.io/deepmind/job... Please spread the word!

submitted 1 day ago • 0 comments

When faced with a challenge (like debugging) it helps to think back to examples of how you've overcome challenges in the past. Same for LLMs! The method we introduce in this paper is efficient because examples are chosen for their complementarity, leading to much steeper inference-time scaling! 🧪

submitted 64 days ago • 0 comments

Some extra motivation for those of you in RLC deadline mode: our line-up of keynote speakers -- as all accepted papers get a talk, they may attend yours! @rl-conference.bsky.social

submitted 88 days ago • 0 comments

200 great visualisations: 200 facets and nuances of 1 planetary story.

submitted 112 days ago • 0 comments

Reposting David Silver's talk about how RL is the way to intelligence. No particular reason www.youtube.com/watch?v=pkpJ...

submitted 116 days ago • 0 comments

Excited to announce the first RLC 2025 keynote speaker, a researcher who needs little introduction, whose textbook we've all read, and who keeps pushing the frontier on RL with human-level sample efficiency

submitted 135 days ago • 0 comments

Could language games (and playing many of them) be the renewable energy that Ilya was hinting at yesterday? They do address two core challenges of self-improvement -- let's discuss! My talk is today at 11:40am, West Meeting Room 220-222, #NeurIPS2024 language-gamification.github.io/schedule/

submitted 160 days ago • 0 comments

Don't get to talk enough about RL during #neurips2024? Then join us for more, tomorrow night at The Pearl!

submitted 164 days ago • 0 comments

This year's (first-ever) RL conference was a breath of fresh air! And now that it's established, the next edition is likely to be even better: Consider sending your best and most original RL work there, and then join us in Edmonton next summer!

submitted 172 days ago • 0 comments

Are there limits to what you can learn in a closed system? Do we need human feedback in training? Is scale all we need? Should we play language games? What even is "recursive self-improvement"? Thoughts about this and more here: arxiv.org/abs/2411.16905

submitted 176 days ago • 7 comments

RLC will be held at the Univ. of Alberta, Edmonton, in 2025. I'm happy to say that we now have the conference's website out: rl-conference.cc/index.html Looking forward to seeing you all there! @rl-conference.bsky.social #reinforcementlearning

submitted 182 days ago • 2 comments

Twitter-optional NeurIPS? Sounds like an appealing prospect!

submitted 189 days ago • 1 comment