W. Smith
Computational Science and Engineering Department,
C.C.L.R.C. Daresbury Laboratory,
Daresbury, Warrington WA4 4AD, U.K.
| = | Ipxx |
||
| = | Ipyy |
(1) | |
| = | Ipzz |
| = | |||
| = | cos |
(2) | |
| = | - |
Perhaps the first conceptual difficulty with Euler's equations (1) is that they are presented in the principal frame. This is, of course, the frame of reference in which the moment of inertia tensor I is diagonal. (The moment of inertia tensor is a 3x3 symmetric matrix, which can always be diagonalised, and so we may assume a suitable body fixed frame of reference always exists.) The body fixed frame of reference rotates with the rigid body and so the tensor I remains diagonal in that frame. This is in marked contrast to the behaviour of I in the laboratory frame, where the six independent components vary as the body rotates.
So where's the conceptual difficulty? This, I'm sure, can be expressed in various ways by different people, but for me the difficulty lies in the fact that the motion of the body is being described in a frame of reference which moves with the body. This seems absurd, surely if the frame of reference is fixed in the body, how can the body move with respect to it? If one tries to think along such lines, one can, at a pinch, imagine some situations where this does make sense, such as a body that grows or shrinks in the rotating frame, a strange form of motion perhaps, but one that doesn't interfere with one's grasp of what the frame of reference means. However, in general there is little profit in thinking in this way. It is much better to think in terms of vectors. (In so doing it is best always to think of them as real physical objects, which retain their identity in any frame of reference. This is particularly helpful when dealing with angular velocity, where it is easy to slip into thinking that the angular velocity vector of a rotating body vanishes in the body frame, because the body is no longer `rotating' in that frame.)
How does a vector representation clarify matters? Imagine the rotating
rigid body `frozen' at an instant in time. At this instant a simple
relationship exists between the components of a vector
in
the body (principal) frame and the laboratory frame:
|
|
(3) |
So much for a pictorial description, let's see what's going on mathematically.
If we go back to basic texts on vectors [1] to find out how they behave in a rotating system, then we soon encounter the famous equation:
|
|
(4) |
One of the easiest ways to visualise equation (3) is to imagine a constant (i.e. fixed) vector in the rotating frame, set at some angle to the axis of rotation, and then think how this might appear to a laboratory observer. (The rotating vector traces out a `coolie' hat!) The tip of the vector traces a circle (the rim of the coolie hat) in the laboratory frame, and the velocity of the tip along this circle, at any instant, is the time derivative of the vector in inertial space. If, in addition, the vector in the rotating frame is changing with time, the `velocity' (i.e. rate-of-change) vector of that must be added vectorially to the circular velocity.
All this make good physical and intuitive sense, but there is a
potential source of confusion. This arises when one asks what are the
components of the vectors in these equations. Clearly, for this
equation to make sense, all the vectors must be defined with respect
to the same frame of reference, otherwise corresponding components
would not have the same meaning. But on the left of (4) the
time derivative must be in an inertial frame, while on the right
it must be in the rotating frame. (And speaking frankly, at this point,
I don't know what frame the cross product is meant to
be in, except that
and
should together be in
the same frame to provide a meaningful vector product.)
There is a smart trick to deal with this. If we construct an inertial frame which at one instant in time just so happens to coincide with the body fixed frame, then at that one instant, the components of the time derivative on both sides of (3) are in the same frame of reference. If we also evaluate the cross product in that same frame, we can sum up all the components on the right in a way that is physically meaningful. Note that this instantaneous frame is otherwise unrelated to the rotating frame, and it has no rotational motion of its own. It follows that when constructing this frame, we can use a simple rotational transformation of our original laboratory frame, without adding extra centrifugal terms. The disadvantage of this `trick' is that it makes one think that Euler's equation is expressed in a frame of reference that is co-rotating with the body. This is simply not true, such a frame would be non-inertial. It is expressed in a frame of reference which is only instantaneously coincident with the body frame, and which is fundamentally interial in character.
So, after all this discussion, we can write (4) in the form
|
|
(5) |
Now, going back to Euler's equations, let us derive them with what we know.
Any standard textbook [1] that introduces the reader to rigid body motion will show the derivation of the rotational equation of motion of a rigid body about its own centre of mass in an inertial frame:
|
|
(6) |
|
|
(7) |
|
|
(8) |
|
|
(9) |
|
|
(10) |
|
|
(11) |
As is apparent from the above that equation (4) is the key to deriving Euler's equation. However it is equally apparent that one has to deal with some slippery concepts on the way. Can one take a more pedestrian route, one that doesn't require so much physical intuition? The answer is yes, but the derivation is much longer, which probably accounts for its absence from the text books.
We begin by noting that equation (8) is not restricted to the principal frame, and can be generally written as
|
|
(12) |
|
|
(13) |
We can now consider an instant in time and ask what must be done to
diagonalise
. The answer is that we must find a matrix
(and its transpose
) such that
|
|
(14) |
|
|
(15) |
|
|
(16) |
|
|
(17) |
The reader now has three choices. The first is to accept that equations (1) and (17), are one and the same (which is where the physics stops) and skip to the next section. The second is to follow the rest of the demonstration and appreciate the brevity of Euler's approach. The third, strictly for young Turks, is to go away and complete it independently.
To proceed further, we need to look closely at the moment of inertia
tensor. We shall consider the case where it is derived for a rigid
molecule consisting of point masses mi at displacements
from the molecular centre of mass. Thus
|
|
(18) |
|
I |
(19) |
|
|
(20) |
|
R |
(21) |
|
|
(22) |
|
|
(23) |
|
|
(24) |
|
|
(25) |
|
|
(26) |
| = | |||
| = | (27) | ||
| = |
| ( |
= | ||
| ( |
= | (28) | |
| ( |
= |
So far we have said little about the auxiliary equations for the
motion of the Euler angles (2). These are obtained in a straighforward
manner by defining angular velocity vectors
,
,
,
associated with rotation through the respective Euler angles, and
transforming these vectors into the principal frame.
These equations can, in principle, be solved directly by some
numerical algorithm, but nobody does this any more. This is because
the equations (2) generate infinities when the angle
becomes
zero, as it is perfectly free to do! For this reason Denis Evans and
Sohail Murad famously introduced quaternions into molecular dynamics
[2], which did away with this difficulty forever. I don't
propose to elaborate on the topic, given the fact that the method is
now universal and is described in detail in the standard texts
[3]. I would however like to suggest an alternative approach.
First we should note that equation (25) can also be written in the laboratory frame:
|
|
(29) |
|
|
(30) |
|
|
(31) |
There is an important omission however. Nowhere do we calculate the
rotation matrix
, and without it we cannot find the positions
of the atoms of the rigid molecule in space. Thus the calculation of
the torque becomes problematical! But this is not an insurmountable
problem as we can diagonalise the intertia tensor using Jacobi's
method and so obtain
as the matrix of eigenvectors. This
need not be as expensive an operation as it first seems, since the
rotation matrix from the proceeding time step will be an excellent
approximation to it, and can be used to partially diagonalise
. The number of iterations in the Jacobi method will therefore be
be much reduced. I offer this algorithm as an alternative for those
who find quaternions too esoteric!