finally got a predictive coding RL agent kinda working on Cart Pole. no target network, though I only got to 500 reward with one random seed, and out of 5 seeds I tried, only one other seed got above 200 💀. next up: proper tuning experiments
Comments
Log in with your Bluesky account to leave a comment
Comments