Cross Entropy optimization applied to humanoid locomotion in a noisy domain with a deterministic model.


In this experiment, the true domain has a large amount of noise; between 10-50 newtons of force is randomly applied to one randomly selected joint at each time step.  During the rollouts, there is no such noise, so the planner must deal with unpredictable pertubations of the domain.

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: