...gradient.m).1
https://igi-web.tugraz.at/lehre/MLB/WS10/task1.zip
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
... cart-pole.2
https://igi-web.tugraz.at/lehre/MLB/WS10/task2.zip
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
... Learning3
https://igi-web.tugraz.at/lehre/intern/MDP_Theory.pdf
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
... files4
https://igi-web.tugraz.at/lehre/intern/MountainCarDemo.zip
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
... files5
https://igi-web.tugraz.at/lehre/intern/CartPoleDemo.zip
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
... Toolbox6
https://igi-web.tugraz.at/lehre/MLB/WS10/intern/MountainCarDemo.zip
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
...swimmer.zip7
https://igi-web.tugraz.at/lehre/MLB/WS10/intern/swimmer.zip
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
... 8
This function points to the file evaluate.m
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
...\space 9
Note that in the lecture slides the gradient $ \log \pi(\dot{\mathbf{y}}_t\vert \mathbf{z}_t, x_t; \mathbf{b})$ is given by $ \propto \epsilon \Phi(x)^T$ , which is typically a matrix not a vector. The gradient vector can be obtained by transforming the matrix to a vector $ [\epsilon_t(1) \Phi(x_t); \epsilon_t(2) \Phi(x_t)]$
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
...cannon.zip10
https://igi-web.tugraz.at/lehre/MLB/WS10/intern/cannon.zip
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
...Gibbs.zip11
https://igi-web.tugraz.at/lehre/MLB/WS10/intern/Gibbs.zip
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
... endeffector-position12
The endeffector position is given by $ [x,y]^T = [l_1 sin(q_1) + l_2 sin(q_1 + q_2), l_1 cos(q_1) + l_2 cos(q_1 + q_2)]$ , where $ q_i$ denotes the joint angle of the $ i$ th joint.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.