When using JAX (which you can easily leverage via Keras), you can include your entire RL environment in your compiled JAX program, delivering extreme speedups:
https://github.com/instadeepai/jumanji
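
To make the idea concrete, here is a minimal sketch of what "environment inside the compiled program" can look like. It does not use Jumanji's API; the toy 1-D environment, `env_reset`, `env_step`, `policy`, and the constants are all hypothetical names made up for illustration. The point is only that when the environment step is a pure JAX function, the whole rollout can go through `jax.lax.scan`, `jax.jit`, and `jax.vmap` as one program.

```python
import jax
import jax.numpy as jnp

# Hypothetical toy environment (not Jumanji): an agent on a 1-D line
# starts at 0 and is rewarded whenever it stands on GOAL.
GOAL = 5
NUM_STEPS = 100
NUM_ENVS = 1024


def env_reset():
    # The state is just the agent's integer position.
    return jnp.int32(0)


def env_step(pos, action):
    # action 0 moves left, action 1 moves right; positions are clipped.
    new_pos = jnp.clip(pos + 2 * action - 1, -GOAL, GOAL)
    reward = jnp.where(new_pos == GOAL, 1.0, 0.0)
    return new_pos, reward


def policy(key, pos):
    # Placeholder random policy (moves right 70% of the time).
    # A real agent would evaluate a network on `pos` here.
    return jax.random.bernoulli(key, 0.7).astype(jnp.int32)


@jax.jit
def rollout(key):
    # One full episode: policy and environment are traced together,
    # so the loop compiles into a single XLA program.
    step_keys = jax.random.split(key, NUM_STEPS)

    def body(pos, step_key):
        action = policy(step_key, pos)
        new_pos, reward = env_step(pos, action)
        return new_pos, reward

    final_pos, rewards = jax.lax.scan(body, env_reset(), step_keys)
    return final_pos, rewards.sum()


# Run many independent environments in parallel by vmapping over seeds.
batched_rollout = jax.jit(jax.vmap(rollout))
env_keys = jax.random.split(jax.random.PRNGKey(0), NUM_ENVS)
final_positions, returns = batched_rollout(env_keys)
```

Because nothing in the loop calls back into Python, there is no per-step host round trip, which is where much of the speedup tends to come from. Libraries such as Jumanji provide ready-made environments written in this style.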