Predictive Projections. Sprague. IJCAI 2009.

  1. Deals with developing policies in very high dimensional state spaces
  2. Proposes a linear dimensionality reduction (projection?) algorithm that discovers predictive projections
  3. A predictive projection is a prediction of future states by nearest-neighbor learning
  4. Consider robotics where state information is camera input – this raw state is too high D to work in natively
    1. Want something that can reduce the dimension planning has to take place it
    2. In the projected space, the idea is that the same action in two similar projected states should produce a similar outcome
  5. Work here is based on Gradient-based distance metric learning
    1. More specifically, based on something called Neighborhood Components Analysis (NCA) which minimizes error for nearest-neighbor classification
  6. Projection is made to maintain accuracy on estimates of future states.
    1. Problem is that because it is a least-squares (I think) metric, noise and outliers cause problems, so another trick has to be used
  7. “The predictive projections algorithm as described above may not perform well in cases where the effects of different actions are restricted to specific state dimensions.”
  8. They use LSPI to do learn the policy, test on Lagoudakis’ pendulum
  9. Other stuff, skipping.  I think I found the wrong paper…

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

%d bloggers like this: