A bone is a UI widget for visualizing and controlling a 3D rigid transformation.
Canonical Bone
A canonical bone of length l lies across the x-axis is two points: the tail (0,0,0) and tip (l,0,0). it can be twisted around x axis, bending around z axis, and then twist along y axis. Note that twist around x axis has no impact on the position of tail or tip, but it impacts the skinning bind to the bone.
Rest Bone
To need to map the canonical bones to its position and orientation in the underformed model. For each bone, define a rigid transformation
T^3×4=[[R^]3×3[t^]3×1]
where R^ is the rotation matrix and t^ is the translation matrix. Note that the rigid transformation is homogeneous, so that the rest tail's position is
s^=T^0001=R^⋅0+t^=t^
and the rest tip d^ is at
d^=T^l001=R^l00+t^
Typically, a rest tail is coincident with the rest tip of its parent.
Pose Bone
Each rest bone undergoes a rigid transformation T, composed of rotation R and translation t, mapping each of its rest point x^ to posed position x, i.e. x=Tx^. In particular, we would like each bone to rotate about its parent's tip, but this position is determined by the parent's pose transformation, which in turn should rotate about the grandparent's tip and so on.
Forward Kinematics
For each bone i, the rigid pose transformation Tiis determined by the aggregate relative rotations Rˉi3×3 between i and its parent pi.
From right to left, we first transforms the bone back to rest bone, then apply the aggregate rotations, transform backs to its position, and then transforms around its parent Tpi. Note that this algorithm is defined recursively, for a bone at the root, its parent transformation is I. Consider some tail s^i, note that rotation won't change s^i's position, so that
Tis^i=Tpis^i=Tpid^pi
which is the position of the tip of its parent bone.
The aggregate rotation matrix Rˉi can be represented as (θi1,θi2,θi3),
Note that is too difficult to define the Euler angles for every bone every frame manually. Instead, we define some keyframes (tj,(θij1,θij2,θij3)) for some time point t1,...,tm, and determine the relative bone transformations for each frame by interpolate values. A linear interpolation on the Euler angles is easiest, while it will give a choppy robotic animation.
Catmull Rom Interpolation
Inverse Kinematics
Instead of manually tuning the angles, we want to minimize the distance between the tip of the bone (usually fingers, or the endpoint) to the wanted position (x,y,z). We can consider this as an optimization problem.
Define a∈R3m:=[θ11,θ12,θ13,...,θm1,θm2,θm3]T be the collections of Euler angles of all bones 1,...,m xb(a)=Tbd^b be the position of the tip of bone b, note that Tb depends on θb1,θb2,θb3,Tpi x^b be the wanted position of some bone b.
Therefore, we are to optimize
amini=1∑k[E(x(a))=∥xbi(a)−x^bi∥2]
In addition, we can specify the min/max values for the Euler angles, for example,
0≤θ1≤0,0≤θ2≤170,0≤θ3≤0
will allow only the bending in one direction, then we can stack all the constraint as amin,amax and optimize under such constraint.
For some dimensionality
E(x)=E(xb1,...,xbk):Rk→R
x(a)=x(θ11,θ12,θ13,...,θm1,θm2,θm3):R3m→Rk
Projected Gradient Descent
As a non-linear optimization problem, we use gradient descent
a←a−σ(dadE(x(a)))T=a−σ(dadx(a))T(dxdE(x))
Define J:=dadx(a)∈Rk×3m so that
a←a−σJTdxdE(x)
and to make sure the constraint amin,amax is preserved, after each step, we clip ai by [amin,amax]