Abstract

In isotropic ray tracing, the ray approximation to the wavefield undergoes a phase shift when the ray crosses a caustic. The cumulative number of such phase shifts along a ray is usually called the KMAH index. The sign of these phase shifts is prescribed by the sign of the angular frequency in combination with the sign convention used for the Fourier transformation. In isotropic media the KMAH index always increases by one or by two, depending on the type of caustic crossed. For (quasi-)shear waves in anisotropic media the KMAH index may decrease. This is the case if the associated slowness sheet is locally concave in one or two of its principal directions of curvature.

1 Introduction

In isotropic ray tracing, it is well known that a phase shift occurs in the ray approximation to the wavefield when a ray propagates through a caustic. At a caustic the cross-sectional area of an infinitesimally thin ray tube around a central ray vanishes. Such a cross-sectional area is modelled by the determinant det [Q(s) ] of a 2 × 2 matrix which occurs in the dynamic ray-tracing formalism of Červený (1985), where s is the curve length along the ray. In isotropic media, assuming a time-harmonic phase factor of exp (−iωt), the phase is increased by −sgn (ω)π/2 for each zero encountered in det [Q(s) ], where double zeros are counted twice. For positive frequencies all phase shifts are in the same direction.

For (quasi-)shear waves in anisotropic media ‘anomalous’ phase shifts of opposite sign may occur. This phenomenon has been discussed previously by several authors, either implicitly [e.g. a very general treatment by Lewis (1965) ] or explicitly [e.g. Garmany (1988a,b), dealing with WKBJ approximations in stratified media, or Kravtsov & Orlov (1993), in a context of stationary-phase approximations]. Only recently, Klimeš (1997, 1998) described the phase shifts at caustics in anisotropic media in terms of traces and determinants of the fundamental 2 × 2 matrix solutions, computed in Červený’s formalism of dynamic ray tracing. The analysis was given for a bundle of rays which starts from a point-source. Klimeš’s analysis rests on several specific symplectic properties of the solution of the dynamic ray-tracing system. This paper gives an alternative, more geometrical description of the phase shifts at caustics in anisotropic media, fully in terms of the curvature of the wave front in the vicinity of the caustic. The sign of the phase shift turns out to be related to the local convexity or concavity of the associated slowness sheet. The criteria for computing the sign of the phase shift, as presented here, are shown to be consistent with Klimeš’s results, and these are valid for rather general ray bundles, rather than only for ray bundles which start at a point-source.

2 Fundamentals Of Anisotropic Dynamic Ray Tracing

Consider a bundle of rays, parametrized by two ray parameters γ1 and γ2 (e.g. shooting angles). We adopt the notation and concepts of isotropic dynamic ray tracing as formulated by Červený (1985). Let x(s) and p(s) denote the position of a ray in this bundle and the associated slowness vector (i.e. the gradient of traveltime τ), respectively, as a function of curve length s along the ray. In isotropic dynamic ray tracing, it is convenient to use a ray-centred system of orthonormal basis vectors {e1 (s),e2 (s),t(s) } Popov & Pšenčík 1978), where e1 (s) and e2 (s) span the tangent plane of the wave front at x(s), and t(s) is a unit vector tangent to the ray. Here we prefer to interpret t(s) as the normalized slowness vector, which allows a generalization of such a ray-centred formulation to anisotropic dynamic ray tracing (Červený 1997; Bakker 1996). Generally, t(s) is not tangent to the ray in the anisotropic case, but it is perpendicular to the wave front. Along a ray we denote the group (or ray) velocity by vgr , and the phase velocity (i.e. the reciprocal of the length of p) by vph . In isotropic media one has v = vgr = vph .

The dynamic ray-tracing system can be formulated in terms of components along these ray-centred basis vectors. Let Q(s) and P(s) be the 2 × 2 matrices with entries QIJ (s) = 〈eI (s),∂x/∂γJ (s) 〉 and PIJ (s) = 〈eI (s),∂p/∂γJ (s) 〉, respectively. Then the dynamic ray-tracing system reads

The specific form of the matrices SIJ [which can be computed on the basis of results in Gajewski & Pšenčík (1990) ] is not of importance here, except for the symmetry of S12 and S21 , and the equality S11 = −ST22 . These properties imply the usual symplecticity of the dynamic ray-tracing system. Since the trace of the 4 × 4 matrix in eq.(1) vanishes, it follows from Liouville’s theorem that Q(s) and P(s) cannot have a common null vector for any s if they do not have a common null vector at the start of the ray. It can be shown that this remains true in cases where interfaces are crossed (Červený 1997). For purposes of interpretation, it is worthwhile to mention that S11 , S21 and S22 vanish in a homogeneous medium. Furthermore, in an isotropic medium one has S12 = v2 I.

For both isotropic and anisotropic media, one defines the matrix M = PQ−1 of the second-order derivatives of the traveltime in the directions tangent to the wavefront, and vph (s)M(s) is the curvature matrix of the wavefront at x(s).

Assuming a phase factor exp (−iωt) for Fourier transformation from the frequency domain to the time domain, the ray approximation to the wavefield along a ray in a smoothly varying medium is expressed by

where ψ is a constant vector, calibrated at the start, and ρ is the density. In an isotropic medium u should be interpreted as the coordinates of the particle displacement along the (varying) ray-centred basis vectors (e.g. Červený 1985), and in an anisotropic medium u should be interpreted as the coordinates along the (varying) polarization vectors associated with each of the three wave modes (e.g. Pšenčík & Teles 1996; Červený 1997). In isotropic media it is well known that 1/{det [Q(s) ] }1/2 is to be interpreted as (1/ |det [Q(s) ] |1/2 )exp [−isgn (ω) (π/2)k(s,s0 ) ], where the KMAH index k(s,s0 ) counts the number of zeros in det [Q(s) ] in the interval (s0 ,s) (double zeros counted twice), assuming that the ray starts at s0 .

Hence, for ω > 0, in isotropic media, the phase is decreased by π/2 for each zero encountered in det [Q(s) ]. In Section 3 it is shown, however, that this is not always the case for (quasi-)shear waves in anisotropic media.

3 Phase Shift At Caustics

Assuming that P(s0 ) and Q(s0 ) do not have a common null vector at the initial point s0 , we can have the following three situations at a point sc which is on a caustic:

(1)case 1: Q(sc ) = 0 (double zero, i.e. point caustic) and P(sc ) is of full rank;

(2)case 2: Q(sc ) has rank 1 (single zero, i.e. line caustic) and P(sc ) is of full rank;

(3)case 3: Q(sc ) has rank 1 (single zero, i.e. line caustic) and P(sc ) has rank 1.

The appropriate treatment of the phase shift at a caustic can be based on the paraxial approximation of the wavefield at x(s,q) = x(s) + q1e1 (s) + q2e2 (s) in a paraxial vicinity of a central ray. With qT = (q1 ,q2 ), the wavefield is approximated by u(ω,s,q) = U(ω,s,q)exp [iωτ(s) ], where

In the vicinity of a caustic this paraxial approximation has to be interpreted as a limiting case for analytic continuation of the traveltime τ(s), where Im [M(s) ] > 0 is used for its second-order derivative, and ω > 0 is assumed. This requirement is consistent with the Gaussian-beam approach as presented by Červený & Pšenčík (1983) and Červený (1985).

By rewriting det (Q) as det (P)det (M−1 ) in eq.(3), one finds that the singularities in the vicinity of the point sc at a caustic are contained in the factor {det [M(s) ] }1/2exp [iω(1/2)qT M(s)q], provided that det [P(sc ) ] ≠ 0. Expressions of this kind typically occur as stationary-phase approximations in Maslov theory. For a 2-D situation, it was stated by Kravtsov & Orlov (1993) that the phase factor of the square root depends on the sign of the quadratic form in the exponent. A similar conclusion can be drawn from Appendix F of Lewis (1965). Note that in anisotropic media the polarization vector depends smoothly on s, unless the ray accidentally hits the caustic with a slowness vector pointing into a singular direction of the associated slowness sheet. Such exceptions are not considered in this paper.

In the following, we shall analyse the singularity in the phase factor in more detail. Let μ1 (s) and μ2 (s) be the eigenvalues of M(s), with m1 (s) and m2 (s) being the associated normalized eigenvectors. We consider cases 1–3 listed above.

Case 1

As Q(sc ) = 0, one has Q(s) ≈ [ (s − sc )/vgr (sc ) ]S12(sc )P(sc ) in the vicinity of the caustic, which implies M−1 (s) ≈  [ (s − sc )/vgr (sc ) ]S12 (sc ). This shows that the eigensystem of M−1 (s) is smoothly varying if S12 (sc ) is not rank-deficient, with

where the σI are the eigenvalues of S12 (sc ). Consequently, one obtains

Now suppose that M(s), and hence its eigenvalues, are perturbed into the complex z domain, with positive imaginary parts. Then Fig. 1 shows the path travelled by [μI (s) ]−1 when s passes the point sc at the caustic in the forward direction, depending on the sign of the σI . One reads off the phase changes from the plotted paths of the principal branches of the square roots [μI (s) ]−1/2, which should not cross the branch line formed by the negative real axis. In the limit for vanishing imaginary parts of the eigenvalues, the square roots [μI (s) ]−1/2 should be multiplied by a phase factor exp [i(π/2)sgn (σI ) ] if s passes the point sc in the forward direction.

Paths of the complex extensions of 1/μI (s) and 1/ [μI (s) ]1/2 for positive (left) and negative (right) eigenvalues σI .
Figure 1.

Paths of the complex extensions of 1/μI (s) and 1/ [μI (s) ]1/2 for positive (left) and negative (right) eigenvalues σI .

Hence, the amplitude should be multiplied by a factor

Rays for which the matrix S12 is rank-deficient at the caustic (i.e. the signs of its eigenvalues are questionable) are not considered in this paper.

Case 2

Now M−1 (sc ) has two distinct eigenvalues, and hence the eigensystem of M−1 (s) depends smoothly on s in the vicinity of sc . Again eq.(5) is valid, but now only one of the eigenvalues μI (s), say μ1 , has a singularity. Starting with the equalities dM/ds = (dP/ds)Q−1+P(dQ−1/ds) and dQ−1/ds = −Q−1 (dQ/ds)Q−1 , we investigate the singular behaviour of μ1 . By substitution of the dynamic ray-tracing system (eq.1) into these relations, one obtains

and hence

where S11 = −ST22 , and where S12 and S21 are symmetric. As the eigenvalues of M−1 (sc ) are well separated, one obtains vgr [d(1/μ1 )/ds]=vgrmT1 (dM−1 /ds)m1 → (mT1S12m1 ) (sc ) for s → sc . Consequently, as an analogue to eq.(4), one obtains

which implies, similarly to case (1), that the amplitude has to be multiplied by

when s passes the point sc in the forward direction. The exceptional case where (mT1S12m1 ) (sc ) = 0 is not treated in this paper.

Case 3

The situation in which both Q(sc ) and P(sc ) have rank 1 may occur as an exception in 3-D, but if 2-D modelling is embedded in 3-D modelling (when P has always rank 1, i.e. no geometrical spreading out of the plane of the cross-section) this is just a normal case of a caustic.

In order to analyse this situation, let a be a null vector of P(sc ), and let b be a null vector of Q(sc ). Since M(s)Q(s)a = P(s)a0 for ssc , and Q(sc )a ≠ 0, one of the eigenvalues of M(s), say μ2 , takes the value 0 at sc . Similarly, one obtains M(s)Q(s)bP(sc )b ≠ 0, while Q(s)b0 for ssc , which implies 1/μ1 (s)→0. Furthermore, the eigenvectors of M(s) vary smoothly with s, and at sc we can make the identification m2 (sc ) = Q(sc )a and m1 (sc ) = P(sc )b, where appropriate scaling of a and b is assumed such that the mI are orthonormal.

In this case, rewriting det (Q) as det (P)det (M−1 ) in eq.(3) is not convenient. Rather, we investigate Q in terms of the orthonormal eigenvectors mI . Since μ1mT1Q = mT1MQ = mT1P, we obtain

The determinant of the latter matrix is non-zero at sc . Otherwise, this matrix would have a null vector m for which mT1Pm = 0 and mT2Qm = 0, which in turn would imply that m is a common null vector of P(sc ) and Q(sc ), since m1 (sc ) spans the range of P(sc ), and m2 (sc ) spans the range of Q(sc ) at sc .

In this case the factor exp [iω(1/2)μ2 (qTm2 )2 ] behaves smoothly in the vicinity of sc . Therefore, the phase jump at the caustic is determined by the phase jump of μ1/21exp [iω(1/2)μ1 (qTm1 )2 ], and eqs (9) and (10) are again valid as a consequence of eq.(7), provided (mT1S12m1 ) (sc ) ≠ 0. Thus, the present case 3 leads to the same phase jump as case 2, i.e. the vanishing of det (P) at the caustic does not influence the phase jump.

4 KMAH Index For Anisotropic Media

The results of the analysis in Section 3 can be summarized in terms of the KMAH index for anisotropic media. For ω > 0, and with the phase factor exp (−iωt) in the Fourier transformation, the factor 1/[det (Q) ]1/2 is rewritten as {1/ |det [Q(s) ] |1/2 }exp [−i(π/2)k(s,s0 ) ], where the KMAH index k(s,s0 ) accumulates the phase shifts in the following way.

(1)If S12 (sc ) is positive definite, then k(s,s0 ) is increased by 2 in the case of a point caustic (case 1), and by 1 in the case of a line caustic (cases 2 and 3).

(2)If S12 (sc ) is negative definite, then k(s,s0 ) is decreased by 2 in the case of a point caustic (case 1), and by 1 in the case of a line caustic (cases 2 and 3).

(3)If S12 (sc ) is neither positive definite nor negative definite, then k(s,s0 ) remains unchanged for a point caustic (case 1), and in the case of a line caustic (cases 2 and 3) the value of sgn [ (mT1S12m1 ) (sc ) ] is added. Some ambiguous situations are not considered here: the situation for a point caustic in which S12 (sc ) has an eigenvalue equal to 0, and the situation for a line caustic in which (mT1S12m1 ) (sc ) = 0.

Some interpretation of this treatment of the KMAH index is appropriate here. It can be shown (Bakker 1996; Červený 1997) that the slowness sheet of the wave mode of interest associated with the stiffness tensor at x(sc ) is convex or concave at p(sc ) if and only if S12 (sc ) is positive or negative definite, respectively. Thus, the number of positive or negative eigenvalues of S12 (sc ) equals the number of positive or negative principal curvatures of the relevant slowness sheet at p(sc ). Moreover, the analysis of Section 3 shows that the values (mTIS12mI ) (sc ) determine whether the singular principal curvatures of the wave front change from negative to positive at the caustic, or vice versa.

Note that in an isotropic medium S12 is a scalar multiple of the unit matrix, and therefore positive definite. This implies that the slowness sheets are always convex, and that the singular principal curvatures of a wave front always change from negative to positive values when a caustic is crossed in the forward direction. In anisotropic media the inner slowness sheet (for the P wave) is always convex if it is detached from the other two (for the S waves) (e.g. Duff 1960). Hence anisotropic P waves will have the usual phase-shift behaviour, but anisotropic S waves may have anomalous phase changes, as their slowness sheets can be locally concave in one or two principal directions.

A similar link between the sign of the phase shift and the convexity/concavity of the slowness sheets was made earlier by Garmany (1988a,b) for stratified media. Note also that the results shown here for point caustics are consistent with the so-called index of a point-source, described by Burridge (1967).

Acknowledgements

The author thanks Shell International Exploration and Production BV for permission to publish this paper. Two reviewers are acknowledged for some detailed comments.

References

Bakker
P.M.
,
1996
.
Theory of anisotropic dynamic ray tracing in ray-centred coordinates
.
Pure appl. Geophys.
148
583
589
.

Burridge
R.
,
1967
.
The singularity of plane lids of the wave surface of elastic media with cubic symmetry
.
Q. J. Mech. applied Math.
20
41
56
.

Červený
V.
,
1985
.
The application of ray tracing to the propagation of shear waves in complex media, in
.
1
Section I Geophysical Press, London.

Červený
V.
,
1997
.
Seismic ray theory, in
.
Seismic Waves in Complex 3-D Structures
Report 5, Charles University, Prague.

Červený
V.
Pšenčík
I.
,
1983
.
Gaussian beams and paraxial ray approximation in three-dimensional elastic inhomogeneous media
.
J. Geophys.
53
1
15
.

Duff
G.F.D.
,
1960
.
The Cauchy problem for elastic waves in an anisotropic medium
.
Phil. Trans. R. Soc. Lond.
A,
252
249
273
.

Gajewski
D.
Pšenčík
I.
,
1990
.
Vertical seismic profile synthetics by dynamic ray tracing in laterally varying layered anisotropic structures
.
J. geophys. Res.
95
11
301
11
315.

Garmany
J.
,
1988
.
Seismograms in stratified anisotropic media-I. WKBJ theory
.
Geophys. J. R. astr. Soc.
92
365
377
.

Garmany
J.
,
1988
.
Seismograms in stratified anisotropic media-II. Uniformly asymptotic approximations
.
Geophys. J. R. astr. Soc.
92
379
389
.

Klimeš
L.
,
1997
.
Phase shift of the Green function due to caustics in anisotropic media
.
Expanded Abstracts of 1997 SEG meeting, Dallas
1834
1837
.

Klimeš
L.
,
1998
.
Phase shift of the Green function due to caustics in anisotropic media
.
Wave Motion
submitted.

Kravtsov
Yu.A.
Orlov
Yu.I.
,
1993
.
Caustics, Catastrophes and Wave Fields
Springer, Berlin.

Lewis
R.M.
,
1965
.
Asymptotic theory of wave-propagation
.
Arch. Rat. Mech. & Anal.
20
191
250
.

Popov
M.M.
Pšenčík
I.
,
1978
.
Ray amplitudes in inhomogeneous media with curved interfaces
.
Geofys. Sbornik
(Academia Praha),
24
111
129
.

Pšenčík
I.
Teles
T.N.
,
1996
.
Point source radiation in inhomogeneous anisotropic structures
.
Pure appl. Geophys.
148
591
623
.

Appendix

Appendix A:Relation To Klimeš’s Results

Klimeš (1997) has analysed the phase shift at caustics for a bundle of rays generated by a point source.

For a point caustic (case 1) Klimeš’s results are readily seen to be equivalent to the results presented in this paper. For the point caustic Klimeš uses the matrix [ (dQ/ds)P−1 ] (sc ) instead of S12 (sc ). This is evidently the same because Q(sc ) = 0 in such a case, and hence dQ/ds = (1/vgr ) (S12P) at sc .

For a line caustic (cases 2 and 3) the comparison is somewhat more tricky. In these cases Klimeš concludes that sgn [K1 (sc )K2 (sc ) ] has to be added to the KMAH index, where

with K = Q−1det (Q) (which varies smoothly when s crosses sc ). Hence, the question is whether sgn (K1 K2 ) equals sgn (mT1S12m1 ) at sc .

For case 2, where P(sc ) has full rank, m2 (sc ) spans the range of M−1 (sc ), which equals (QP−1 ) (sc ). Since Q(sc ) has rank 1, m2 (sc ) also spans the range of Q(sc ). This latter property was also shown for case 3, where P(sc ) has rank 1. Moreover, the range of Q(sc ) equals the null space of K(sc ), and vice versa. Consequently, m2 (sc ) is in the null space of K(sc ).

Since tr (Q−1P) = tr (QQ−1PQ−1 ), we have

Furthermore, K2 = tr (KS11Q + KS12P). Here, the first term vanishes at sc , because tr (KS11Q) = tr (KS11K−1 )det (Q) = tr (S11) det(Q). For the second term we have tr(KS12P) = tr (K−1KS12PK) = tr (S12PK), which equals mT1S12PKm1 at sc . Hence,

Since m1 is an eigenvector of PQ−1 , it is also an eigenvector of PK. Now, combination of eqs (A2) and (A3) gives the desired equality