Pretty large MDP – it has 2 action dimensions and 9 state dimensions.
I’m using the parallelized version, running 500 trajectories of planning per step. More information up later.
Pretty large MDP – it has 2 action dimensions and 9 state dimensions.
I’m using the parallelized version, running 500 trajectories of planning per step. More information up later.