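When using JAX (which you can easily leverage via Keras), you can include your entire RL environment in the compiled JAX program, so environment steps and the policy run together on the accelerator with no round trip through Python in between. That can deliver very large speedups.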

https://github.com/instadeepai/jumanji
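
To make the idea concrete, here is a minimal sketch in plain JAX. It does not use Jumanji's actual API: the toy environment and the names `env_reset`, `env_step`, `policy`, and `NUM_STEPS` are all made up for illustration. The point is only that when the environment is a pure function, the whole rollout loop can be compiled with `jax.jit` and batched with `jax.vmap`.

```python
import jax
import jax.numpy as jnp

NUM_STEPS = 50  # hypothetical episode length


def env_reset(key):
    # Hypothetical toy environment: a point on a line, started in [-1, 1).
    return jax.random.uniform(key, (), minval=-1.0, maxval=1.0)


def env_step(pos, action):
    # Pure function: the next state and reward depend only on the inputs.
    new_pos = pos + 0.1 * action
    reward = -jnp.abs(new_pos)  # closer to the origin is better
    return new_pos, reward


def policy(pos):
    # Stand-in for a learned policy: always move toward the origin.
    return -jnp.sign(pos)


@jax.jit
def rollout(key):
    # The whole episode (reset, policy, environment steps) is one compiled program.
    pos0 = env_reset(key)

    def body(pos, _):
        new_pos, reward = env_step(pos, policy(pos))
        return new_pos, reward

    final_pos, rewards = jax.lax.scan(body, pos0, None, length=NUM_STEPS)
    return final_pos, rewards.sum()


# Run 8 independent episodes in parallel by mapping over PRNG keys.
keys = jax.random.split(jax.random.PRNGKey(0), 8)
final_positions, total_rewards = jax.vmap(rollout)(keys)
```

Because nothing inside `rollout` leaves JAX, the Python interpreter is only involved once per batch of episodes rather than once per environment step.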
