Skip to content

Kinematics

Bone

A bone is a UI widget for visualizing and controlling a 3D rigid transformation.

Canonical Bone

A canonical bone of length ll lies across the x-axis is two points: the tail (0,0,0)(0, 0, 0) and tip (l,0,0)(l, 0, 0). it can be twisted around xx axis, bending around zz axis, and then twist along yy axis. Note that twist around xx axis has no impact on the position of tail or tip, but it impacts the skinning bind to the bone.

Rest Bone

To need to map the canonical bones to its position and orientation in the underformed model. For each bone, define a rigid transformation

T^3×4=[[R^]3×3[t^]3×1]\hat T_{3\times 4} = \big[[\hat R]_{3\times 3}[\hat t]_{3\times 1}\big]

where R^\hat R is the rotation matrix and t^\hat t is the translation matrix. Note that the rigid transformation is homogeneous, so that the rest tail's position is

s^=T^[0001]=R^0+t^=t^\hat s = \hat T \begin{bmatrix}0\\0\\0\\1\end{bmatrix} = \hat R\cdot 0 + \hat t = \hat t

and the rest tip d^\hat d is at

d^=T^[l001]=R^[l00]+t^\hat d = \hat T \begin{bmatrix}l\\0\\0\\1\end{bmatrix} = \hat R \begin{bmatrix}l\\0\\0\end{bmatrix} + \hat t

Typically, a rest tail is coincident with the rest tip of its parent.

Pose Bone

Each rest bone undergoes a rigid transformation TT, composed of rotation RR and translation tt, mapping each of its rest point x^\hat x to posed position xx, i.e. x=Tx^x = T\hat x. In particular, we would like each bone to rotate about its parent's tip, but this position is determined by the parent's pose transformation, which in turn should rotate about the grandparent's tip and so on.

Forward Kinematics

For each bone ii, the rigid pose transformation TiT_iis determined by the aggregate relative rotations Rˉi3×3\bar{R}_{i_{3\times3}} between ii and its parent pip_i.

Ti=Tpi[[T^i]0001][0[Rˉi]000001][[T^i]0001]1T_i = T_{p_i}\begin{bmatrix}&[\hat T_i]&&\\0&0&0&1\end{bmatrix} \begin{bmatrix}&&&0\\&[\bar R_i]&&0\\&&&0\\0&0&0&1\end{bmatrix} \begin{bmatrix}&[\hat T_i]&&\\0&0&0&1\end{bmatrix}^{-1}

From right to left, we first transforms the bone back to rest bone, then apply the aggregate rotations, transform backs to its position, and then transforms around its parent TpiT_{p_i}.
Note that this algorithm is defined recursively, for a bone at the root, its parent transformation is II.
Consider some tail s^i\hat s_i, note that rotation won't change s^i\hat s_i's position, so that

Tis^i=Tpis^i=Tpid^piT_i\hat s_i = T_{p_i}\hat s_i = T_{p_i}\hat{d}_{p_i}

which is the position of the tip of its parent bone.

The aggregate rotation matrix Rˉi\bar{R}_i can be represented as (θi1,θi2,θi3)(\theta_{i1}, \theta_{i2}, \theta_{i3}),

[0[Rˉi]000001]=[0[Rˉx(θi3)]000001][0[Rˉx(θi2]000001][0[Rˉx(θi1]000001]\begin{bmatrix}&&&0\\&[\bar R_i]&&0\\&&&0\\0&0&0&1\end{bmatrix} = \begin{bmatrix}&&&0\\&[\bar R_x(\theta_{i3})]&&0\\&&&0\\0&0&0&1\end{bmatrix} \begin{bmatrix}&&&0\\&[\bar R_x(\theta_{i2}]&&0\\&&&0\\0&0&0&1\end{bmatrix} \begin{bmatrix}&&&0\\&[\bar R_x(\theta_{i1}]&&0\\&&&0\\0&0&0&1\end{bmatrix}

which is the twist-bend-twist.

Keyframe Animation

Note that is too difficult to define the Euler angles for every bone every frame manually. Instead, we define some keyframes (tj,(θij1,θij2,θij3))\big(t_{j}, (\theta_{ij1}, \theta_{ij2}, \theta_{ij3})\big) for some time point t1,...,tmt_1,...,t_m, and determine the relative bone transformations for each frame by interpolate values. A linear interpolation on the Euler angles is easiest, while it will give a choppy robotic animation.

Catmull Rom Interpolation


png

Inverse Kinematics

Instead of manually tuning the angles, we want to minimize the distance between the tip of the bone (usually fingers, or the endpoint) to the wanted position (x,y,z)(x, y, z). We can consider this as an optimization problem.

Define
aR3m:=[θ11,θ12,θ13,...,θm1,θm2,θm3]Ta \in \mathbb R^{3m} := [\theta_{11}, \theta_{12}, \theta_{13},...,\theta_{m1}, \theta_{m2}, \theta_{m3}]^T be the collections of Euler angles of all bones 1,...,m1,...,m
xb(a)=Tbd^bx_b(a) = T_b\hat d_b be the position of the tip of bone bb, note that TbT_b depends on θb1,θb2,θb3,Tpi\theta_{b1}, \theta_{b2}, \theta_{b3}, T_{p_i}
x^b\hat x_{b} be the wanted position of some bone bb.

Therefore, we are to optimize

minai=1k[E(x(a))=xbi(a)x^bi2]\min_a \sum_{i=1}^k \bigg[E(x(a)) = \|x_{b_i}(a) - \hat{x}_{b_i}\|^2\bigg]

In addition, we can specify the min/max values for the Euler angles, for example,

0θ10,0θ2170,0θ300 \leq \theta_1\leq 0, 0\leq \theta_2\leq 170, 0\leq \theta_3\leq 0

will allow only the bending in one direction, then we can stack all the constraint as amin,amaxa_{\min}, a_{\max} and optimize under such constraint.

For some dimensionality

E(x)=E(xb1,...,xbk):RkRE(x) = E(x_{b_1},..., x_{b_k}):\mathbb R^k\rightarrow \mathbb R
x(a)=x(θ11,θ12,θ13,...,θm1,θm2,θm3):R3mRkx(a) = x(\theta_{11}, \theta_{12}, \theta_{13}, ..., \theta_{m1}, \theta_{m2}, \theta_{m3}):\mathbb R^{3m} \rightarrow \mathbb R^k

Projected Gradient Descent

As a non-linear optimization problem, we use gradient descent

aaσ(ddaE(x(a)))T=aσ(ddax(a))T(ddxE(x))a\leftarrow a - \sigma\big(\frac{d}{da}E(x(a))\big)^T = a - \sigma \big(\frac{d}{da}x(a)\big)^T\big(\frac{d}{dx}E(x)\big)

Define J:=ddax(a)Rk×3mJ:= \frac{d}{da}x(a)\in \mathbb R^{k\times 3m} so that

aaσJTddxE(x)a\leftarrow a - \sigma J^T\frac{d}{dx}E(x)

and to make sure the constraint amin,amaxa_{\min}, a_{\max} is preserved, after each step, we clip aia_i by [amin,amax][a_{\min}, a_{\max}]

Then, to calculate this

Ji,j=xiaj=limh01h(xi(a+hej) xi(a))limit def.1h(xi(a+hej) xi(a))finite differentiation\begin{align*} J_{i,j} &= \frac{\partial x_i}{\partial a_j}\\ &= \lim_{h\rightarrow0}\frac1h\big(x_i(a + he_j)\ - x_i(a)\big) &\text{limit def.}\\ &\approx \frac1h\big(x_i(a + he_j)\ - x_i(a)\big) &\text{finite differentiation} \end{align*}

For each xi(a+hej)x_i(a+h e_j), we need 3m3m calls to forward kinematics function and we have kk bones to consider, i.e. O(mk)O(mk) operations.

xiE(x)=2(xi(a)x^i)\frac{\partial}{\partial x_i}E(x) = 2(x_{i}(a) - \hat x_i)

which requires O(k)O(k) calls.

Also, note that this whole system is a bunch of matrix calculates, and we can use automatic differentiation.

To determine a proper σ\sigma, we simply start with a very large σ\sigma, check whether E(aσJTdxE)<E(a)E(a - \sigma J^T d_xE) < E(a), if not, then reduce σ\sigma by a factor.

Linear Blend Skinning

For each vertex of the object model, let v^iR3\hat v_i\in\mathbb R^3 be its rest position we bind it with several bones, then the pose position

vi=j=1mwi,jTj[v^i1]v_i = \sum_{j=1}^m w_{i,j} T_j \begin{bmatrix}\hat v_i\\1\end{bmatrix}

where wi,jw_{i,j} is the weight of vertex ii on bone jj.

Source code