-
PDF
- Split View
-
Views
-
Cite
Cite
Prasenjit Saha, Interpreting the Kustaanheimo–Stiefel transform in gravitational dynamics, Monthly Notices of the Royal Astronomical Society, Volume 400, Issue 1, November 2009, Pages 228–231, https://doi.org/10.1111/j.1365-2966.2009.15437.x
- Share Icon Share
Abstract
The Kustaanheimo–Stiefel (KS) transform turns a gravitational two-body problem into a harmonic oscillator, by going to four dimensions. In addition to the mathematical-physics interest, the KS transform has proved very useful in N-body simulations, where it helps to handle close encounters. Yet the formalism remains somewhat arcane, with the role of the extra dimension being especially mysterious. This paper shows how the basic transformation can be interpreted as a rotation in three dimensions. For example, if we slew a telescope from zenith to a chosen star in one rotation, we can think of the rotation axis and angle as the KS transform of the star. The non-uniqueness of the rotation axis encodes the extra dimension. This geometrical interpretation becomes evident on writing KS transforms in quaternion form, which also helps to derive concise expressions for regularized equations of motion.
1 INTRODUCTION
The Kustaanheimo–Stiefel (KS) transform is a remarkable relation between the two most important elementary problems in dynamics: under a transformation of coordinates and time, a Kepler problem changes into a harmonic oscillator. Especially noteworthy is that the collision singularity in the Kepler problem is transformed into a regular point. The name comes from the works by Kustaanheimo (1964) and Kustaanheimo & Stiefel (1965), while the book by Stiefel & Scheifele (1971), which is largely devoted to the KS transform and its consequences, is perhaps the best-known source. For a very short summary, see ‘regularization’ in Binney & Tremaine (2008). An important application of the KS transformation is in numerical orbit integration, where the singularity removal is used to great advantage for simulating dense stellar systems with near collisions (Aarseth & Zare 1974a,b; Jernigan & Porter 1989; Mikkola & Aarseth 1990, 1993). Some recent papers also re-examine the formalism itself (Bartsch 2003; Waldvogel 2006).
In two dimensions there is a much simpler version of the KS transform going back to Levi-Civita (1920). In the Levi-Civita transform, the coordinate plane is read as the complex plane, and the complex square root of the coordinate becomes the transformed coordinate. The geometrical interpretation is clear: the complex phase gets halved. The KS transform is also a kind of square root, but in four dimensions. One wonders how the geometrical interpretation generalizes.
It turns out that slewing a telescope is a convenient geometrical analogy. Suppose the telescope is at zenith and we want to slew it to a particular star in one rotation. Normally we would simply move along a great circle from the zenith to the star. However, we might prefer a different rotation (to avoid crossing the moon, say). For example, we could choose the mid-point on the above great circle and rotate about it by 180°. In any case, the chosen rotation axis and the rotation angle are effectively the KS transform of the star. This idea of rotation in three dimensions about a non-unique axis generalizes the idea of halving the phase in a complex square root.
This paper attempts to provide some new insight into the KS transform by providing some reformulations and new derivations of known results, and especially to make the geometrical interpretation evident.
2 QUATERNIONS AND ROTATION
Before considering KS theory, it is useful to review a concise algebraic way of specifying rotations in three dimensions, not often used in astrophysics but standard in computer graphics: quaternions.







For an arbitrary (i.e. non-unit) quaternion R, the expression (5) amounts to a rotation combined with scalar multiplication.


3 THE KUSTAANHEIMO–STIEFEL TRANSFORM





Everything so far is already in the literature. The new result in this paper is that we can readily visualize Q, including its non-uniqueness.




An interesting special case is φ= 0, which gives q=r(cos θ k+ sin θ i) and . Then QI is effectively the complex square root of q (we need to read k as the real axis and i as the imaginary axis). In other words, the planar case can be reduced to the Levi-Civita transform by a suitable gauge.
Quaternion formulations of the KS transform have been discussed by several authors: Stiefel & Scheifele (1971) mention quaternions but appear to dislike them, while later authors (for example Vivarelli 1994; Waldvogel 2006) are more favourable. The precise definition adopted for the transform varies, but is equivalent to equation (11). That Q represents a rotation and shrinking/stretching of q is also known. Bartsch (2003) specifically notes that the rotation axis is unique in two dimensions but not in three. However, the explicit description of the implied rotations, as above, appears to be new.
4 THE CANONICAL MOMENTUM







That P (as defined in equation 22, or equivalently) completes a canonical transformation is a standard part of KS theory, but the derivation of the canonical condition using quaternion identities appears to be new.
5 THE TWO-BODY PROBLEM AND THE HARMONIC OSCILLATOR





The Hamiltonian Γ is remarkable indeed. For E < 0 (bound orbits) it is a harmonic oscillator. Since Q has four components, Γ is like a mass on an isotropic spring in four Euclidean dimensions. Thus the well-known fact that the bound Kepler problem has a dynamical O(4) symmetry. For the unbound case, the symmetry group is different: formally the Lorentz group, but with a physical meaning completely different from special relativity. And – perhaps most importantly – Hamilton's equations for Γ are well behaved even at Q= 0 (a collision). This is known as regularization and was the original motivation for KS theory.
The effect of an external force F is simple. From (22) it follows immediately that F will add an extra contribution of −2kQF to dP/dt, which amounts to a contribution of −2Q2kQF to dP/ds. Provided the external force is non-singular, the equations of motion in s remain regular.
6 REGULARIZING THE THREE-BODY PROBLEM
Application of KS regularization to N-body simulations involves expressing the gravitating system either as a tree-like hierarchy of coupled two-body systems (Jernigan & Porter 1989) or as a chain (Mikkola & Aarseth 1990, 1993). The basic idea can be described using the three-body problem with all masses unity. Here again, quaternions enable a concise formulation.








7 DISCUSSION
In dynamical astronomy the KS transformation is profound, but may appear mysterious. This paper attempts to make it less mysterious, and hopefully therefore more useful, by explaining it in three-dimensional geometric terms. There are several possible directions in which the KS transformation may turn out to be useful.
First, one can imagine new orbit integrators specialized to nearly Keplerian problems. Work on dense stellar systems with near collisions has already been mentioned (for reviews see the books Aarseth 2003; Heggie & Hut 2003). In the planetary regime, which differs from the dense-stellar case in having few bodies but many more orbital times, time transformations reminiscent of (28) used for KS regularization have proved useful for highly eccentric orbits (Mikkola 1997; Emel'yanenko 2002), while some integration algorithms (Mikkola & Tanikawa 1999; Preto & Tremaine 1999) apply the time transformation (28) implicitly. Could the KS transformation itself be exploited here?Fukushima (2005) has some further ideas.
Secondly, it is conceivable that KS variables could simplify perturbation theory. Perturbation theory in classical celestial mechanics (see for example Murray & Dermott 2000) is algebraically frighteningly complicated, basically because the natural variables for the unperturbed and perturbed parts (being the Keplerian action-angles and real-space coordinate) are related through an implicit equation. On the other hand, the action-angles of the KS-transformed Kepler problem are explicitly related to space coordinates – the implicit equation is transferred to the time variable. Could some major simplification be achieved through KS variables? Some progress has been made by Vrbik (2006).
Thirdly, the KS transformation might provide new insight into analogous quantum problem. Bander & Itzykson (1966a,b) derive the symmetry groups of the bound and unbound Coulomb problems. These turn out to be the same four-dimensional symmetries as in KS theory. Is the KS transformation implicit in that work?
I am grateful to Seppo Mikkola for introducing me to KS theory, and to Marcel Zemp, Scott Tremaine and the referee, Jörg Waldvogel, for suggesting improvements in the manuscript.
REFERENCES