🚨 New reinforcement learning algorithms 🚨
Excited to announce MaxInfoRL, a class of model-free RL algorithms that solves complex continuous control tasks (including vision-based!) by steering exploration towards informative transitions.
Details in the thread.
JAX (built on jaxrl): https://github.com/sukhijab/maxinforl_jax
PyTorch (based on @araffin.bsky.social's SB3): https://github.com/sukhijab/maxinforl_torch
Paper: https://arxiv.org/abs/2412.12098
Website: https://sukhijab.github.io/projects/maxinforl/