-
PDF
- Split View
-
Views
-
Cite
Cite
Joel Bergé, Richard Massey, Quentin Baghi, Pierre Touboul, Exponential shapelets: basis functions for data analysis of isolated features, Monthly Notices of the Royal Astronomical Society, Volume 486, Issue 1, June 2019, Pages 544–559, https://doi.org/10.1093/mnras/stz787
- Share Icon Share
Abstract
We introduce one- and two-dimensional ‘exponential shapelets’: orthonormal basis functions that efficiently model isolated features in data. They are built from eigenfunctions of the quantum mechanical hydrogen atom, and inherit mathematics with elegant properties under Fourier transform, and hence (de)convolution. For a wide variety of data, exponential shapelets compress information better than Gauss–Hermite/Gauss–Laguerre (‘shapelet’) decomposition, and generalize previous attempts that were limited to 1D or circularly symmetric basis functions. We discuss example applications in astronomy, fundamental physics, and space geodesy.
1 INTRODUCTION
A frequent task in data analysis is to categorize and quantify the shapes of localized objects – such as transient events in a (one-dimensional) time-series, or regions of interest in a (two-dimensional) image. It is such a universal challenge that methods developed for one field frequently turn out to be useful in others. For example, astrophysicists measure the shapes of distant galaxies by decomposing them into orthogonal basis functions, such as CHEFs (Jiménez-Teja & Benítez 2012) or (Gaussian) shapelets (Bernstein & Jarvis 2002; Refregier 2003; Refregier & Bacon 2003; Massey & Refregier 2005). Shapelets have been used to analyse data in other branches of astrophysics, modelling extrasolar planets (Hoekstra et al. 2005; Amara & Quanz 2012), the distribution of dark matter (Birrer, Amara & Refregier 2015; Tagore & Jackson 2016), or flashes of pulsars (Ellis & Cornish 2015; Lentati et al. 2015; Desvignes et al. 2016). They have also been used in medical imaging (Weissman, Hancewicz & Kaplan 2004; Apostolopoulos et al. 2017), pattern recognition in human vision (Sharpee & Victor 2009) or artificial vision (Sabzmeydani & Mori 2007), data compression (Holbrey 2006), and the manufacture of nanoscale thin films (Suderman & Lizotte 2015; Akdeniz, Lizotte & Mohieddin Abukhdeir 2018).
Gaussian shapelets are based on eigenfunctions of the quantum mechanics harmonic oscillator. In one-dimensional form, they are Gauss–Hermite functions, which seem to be well adapted in several time-series analyses, especially when transients (whose shape is close to damped oscillations) must be detected, characterized, and/or corrected for. This is the case for instance in fundamental physics for MICROSCOPE ‘crackles’ (e.g. Baghi et al. 2015; Bergé et al. 2015), space geodesy for GRACE ‘twangs’ (Flury, Bettadpur & Tapley 2008; Peterseim, Jakob & Schlicht 2010; Peterseim et al. 2014), or ‘glitches’ in gravitational wave searches with LIGO (Cornish & Littenberg 2015; Powell et al. 2015, 2017; Principe & Pinto 2017 and references therein). In two-dimensional form, they can be expressed equivalently as either Gauss–Hermite (Cartesian; Refregier 2003) or Gauss–Laguerre (polar; Bernstein & Jarvis 2002; Massey & Refregier 2005) functions. Owing to their quantum mechanical origin, they have elegant properties (they make a complete set) under Fourier transform, and are hence efficient for operations involving convolution or deconvolution of two images.
The main limitation of shapelets is that they are perturbations around a Gaussian, which is flat near its peak. They inefficiently parametrize peaky features, including the distant galaxies for which they were originally suggested (Melchior et al. 2010). Galaxies have a two-dimensional surface density that decreases approximately exponentially with distance from the centre. Capturing the steep gradient near the centre requires a weighted sum of many Gaussian shapelet basis functions, which then overfit noise in the extended wings. Attempting to overcome this limitation, Ngan et al. (2009) developed ‘Sersiclets’, although they were forced to be circularly symmetric, and have not seen wider applications.
In this paper, we extend the quantum mechanical framework underpinning Gaussian shapelets, and define 1D and 2D exponential shapelets based on wavefunctions of the hydrogen atom. These functions are perturbations around a decreasing exponential, and should efficiently model any arbitrarily shaped but centrally peaked regions of interest. The perturbations themselves are Laguerre polynomials.
We introduce 1D exponential shapelets in Section 2, and 2D exponential shapelets in Section 3. In both cases, we describe their main properties, compare them to Gaussian shapelets, and provide example uses. We conclude in Section 4.
2 1D EXPONENTIAL SHAPELETS
2.1 Definition
Note that, contrary to normal procedure in quantum physics, we restrict 1D exponential shapelets on positive x, and refrain from defining the negative-x part |$\Psi _n^-(x; \beta) = -\Psi _n^+(-x; \beta)$| (for x < 0). Hence, we do not follow Palma & Raff (2006) and do not define the even and odd wavefunctions |$\Psi _n^+(x; \beta) \pm \Psi _n^-(x; \beta)$|. Events in time-series can often be adequately described by their behaviour after a certain moment (in this case, x = 0). Henceforth, we shall therefore drop the + superscript, to write more simply |$\Psi _n(x; \beta) \equiv \Psi _n^+(x; \beta)$|. These functions are continuously differentiable, smoothly departing from zero at x = 0 and tending back to zero as x → ∞.
Because the basis functions (1) are eigenfunctions of the 1D hydrogen atom’s Hamiltonian, they form an orthogonal basis of the square integrable functions L2([0, ∞[, 〈 ·, ·〉) Hilbert space equipped with the inner product |$\langle f(x),g(x) \rangle =\int _0^\infty f(x)g^*(x) {\rm d}x$| and an asterisk denotes complex conjugation. They are orthonormal, in the sense that |$\int _0^\infty \Psi _n(x; \beta)\Psi _m(x; \beta)\, {\rm d}x = \delta _{nm}$|, where δnm is the Kronecker symbol, and complete, in the sense that |$\sum _{n=1}^{\infty } \Psi _n(x; \beta) \Psi _n(x^{\prime }; \beta) = \delta (x - x^{\prime })$|.
2.2 Properties
2.2.1 Maximum and minimum effective scales
The first few 1D exponential shapelet basis functions Ψn(x; β) are shown in Fig. 1. As the order n increases (with constant β), the largest-scale oscillation dominates, and rapidly acquires a larger extent. However, the smallest oscillations always remain roughly the same size.

The first few 1D exponential shapelets Ψn(x), for β = 1. The inset shows a zoom on the smallest x.
As nmax → ∞, the range of scales modelled by a 1D exponential shapelet decomposition, θmax/θmin ∝ nmax = ncoeffs. However, the resolution of an exponential shapelet model is greatest near the origin, and decreases away from it. This behaviour is very different from Gaussian shapelets, where the resolution is more spatially uniform: Increasing n increases the scale of the model slowly, while simultaneously adding smaller-scale oscillations (see fig. 1 of Refregier 2003). The wide effective range of scales for exponential shapelets is what will allow them to efficiently describe spiky but long-duration events.
2.2.2 Fourier and Laplace transforms
Although equations (8) and (10) are not as simple as for Gaussian shapelets (the Fourier transform of a Gaussian shapelet is itself a Gaussian shapelet), closed forms exist for exponential shapelets that are easy to implement. The Fourier and Laplace transforms of 1D exponential shapelets are also well localized in frequency space (Fig. 2).

Exponential shapelets are well localized in frequency space. This shows the modulus of the Fourier transform of the first few 1D exponential shapelets Ψn(x; β), for β = 1.
2.2.3 Convolution
Convolution is an inevitable operation during signal acquisition, whereby an input (‘real’) signal undergoes the measurement apparatus’ transfer function. What is ultimately measured is the convolution of the input signal with the transfer function.
2.2.4 Rescaling
2.3 Shape characterization
Note that the series (19) converges only if the amplitudes of the 1D exponential shapelet coefficients decrease faster than n−6, which may not always be the case. Care must therefore be taken to check for convergence when characterizing the shape of a feature using this technique.
2.4 Exponential shapelet modelling in practice
As shown above (equation 2), a 1D feature is straightforward to model for a given couple (nmax, β), where nmax is the maximum order of the truncated sum (2). For example, a linear least-square method can be efficiently used for this purpose. Then the model depends non-linearly on the two parameters nmax and β that can be optimized by iteratively minimizing the residuals between the observed feature and its model. This procedure was described at length, in the 2D case, by Massey & Refregier (2005).
2.5 Example applications
This section demonstrates three possible applications of 1D exponential shapelets. We start by modelling exponentially suppressed oscillations, which are measurements throughout experimental physics, including the response of accelerometers1 onboard the space missions MICROSCOPE (Touboul, Métris & Rodrigues 2017), GRACE (Tapley et al. 2004), and GOCE (Rummel, Yi & Stummer 2011). We then discuss a potential application to unmodelled bursts in the analysis of gravitational waves. Exponential shapelets may also be convenient to model charge transfer inefficiency trailing due to radiation damage in spacebourne imaging detectors (Massey et al. 2010), although we do not explore that further here.
2.5.1 Cleaning accelerometer time series data, and modelling an experiment’s response function
MICROSCOPE tested the weak equivalence principle (WEP) by precisely measuring the differential acceleration experienced by two concentric cylindrical test masses onboard a drag-free satellite in Low Earth Orbit.2 In theory, any non-zero difference at a well-defined frequency fEP (which depends on the orbit and attitude characteristics of the satellite) is a signature of violation of the WEP.
In practice, many transients are apparent in MICROSCOPE data (the upper panel of Fig. 3 shows a typical high-signal-to-noise example). These transients are generally caused by crackles of the satellite’s coating (because of temperature variations), crackles of the satellite’s gas tanks (as their pressure decreases as the gas is consumed), or impacts with micro-meteorites. Such transients occupy frequencies higher than fEP, so they do not directly impact a possible WEP violation signal. However, it is necessary to detect and mask them in measured time-series, and then either reconstruct the corresponding ‘missing data’ (Bergé et al. 2015; Pires et al. 2016) to allow for a least-square fit of the expected WEP violation signal in the frequency domain (Touboul et al. 2017), or adapt the maximum likelihood technique to take missing data into account (Baghi et al. 2015, 2016). Existing techniques are suboptimal, as they may affect the noise characteristics. Moreover, transients could be considered as conveying useful information. They are created by an external impulse; if this is assumed to be instantaneous (Dirac), the shape and relaxation time of the observed signal are a measurement of MICROSCOPE’s transfer function.

Test data for 1D exponential shapelets. Upper panel: a transient in MICROSCOPE accelerometer time series, as observed (black) and reconstructed from a 1D exponential model (red), with model residuals (green). Lower panel: convergence of the model as we increase the order of the model nmax.
1D exponential shapelets are well-matched to these transient signals. The upper panel of Fig. 3 shows a transient in the time domain: the observed data are shown in black, a model (fitted between t = 10 s and t = 25 s) and its 68 per cent confidence interval are shown in red, and residuals consistent with noise are shown in green. This model uses nmax = 15, meaning that it has compressed 60 data points in the time domain description (15 s sampled at 4 Hz) into 15 shapelet coefficients. None the less, the model has rich, empirical freedom to capture multiple response modes of the instrument’s complex structure. The lower panel of Fig. 3 shows the convergence of the model as we increase nmax, quantified as the square residuals between the observed transient and the model. Most of the information is contained in coefficients 3 ≤ n ≤ 13.
An extension to this process will be presented in a future paper. By fitting 1D exponential shapelet coefficients to many transients, it is possible to model temporal variations in relaxation time of MICROSCOPE’s instrument via the parameter β or the exponential envelope scale rrms. Interpolating these models then yields a model of the transfer function at any time, to either gain insight into the instrument’s performance or correct (deconvolve) signals with an ‘inverse transfer function’ transform. Note that this procedure is similar to techniques developed for 2D astronomical image processing, where a point spread function (PSF) is determined from the shape of stars then interpolated across the data (see e.g. Bergé et al. 2012; Gentile, Courbin & Meylan 2013; Kilbinger 2015).
2.5.2 Characterization of perturbing signals in space-borne geodesy missions
The GRACE mission (Tapley et al. 2004) revolutionized geodesy by measuring the Earth’s gravitational field with unprecedented precision. Two satellites followed each other on the same orbit, monitoring the distance between them via microwave ranging. In theory, any variation in their relative speed or distance can be ascribed to variations in the Earth’s geopotential.
In practice, the satellites were also subject to non-gravitational forces. These were measured by ultrasensitive accelerometers, for removal during post-processing (Flury et al. 2008; Peterseim et al. 2010). Peterseim (2014) and Peterseim et al. (2014) modelled transients (known as ‘spikes’) in accelerometer data using a piecewise function made from the derivative of a Gaussian, a third-order polynomial, and a damped oscillation. Some were successfully classified as either due to the satellite’s heaters or due to activation of its magnetic torquers – but no physical origin could be assigned to others, known as ‘twangs’.
Tentative correlations of twangs with the position of the satellite along its orbit hint at a possible geophysical origin. Both categories of twangs are compactly represented as 1D exponential shapelets, so we will investigate in a future paper whether these provide a cleaner set of shape parameters to categorize and understand their origin.
2.5.3 Unmodelled bursts and glitches in gravitational wave data analysis
A wealth of methods have been developed to search for, characterize, and classify unmodelled bursts and instrumental glitches in searches for gravitational waves (see e.g. Cornish & Littenberg 2015; Powell et al. 2015, 2017; Principe & Pinto 2017, and references therein). Glitches are often modelled using ‘sine-Gaussian waveforms’ (Principe & Pinto 2017). These have similar properties to Gaussian shapelets, although shapelets can encode more details. One-dimensional exponential shapelets could further optimize the data compression of complex glitch shapes that often exhibit damped oscillations.
Shapelets might therefore improve the detection, characterization, and classification of instrumental glitches in gravitational wave detectors. Indeed, bursts and glitches are usually detected in the time–frequency domain, which is easily reproduced in Gaussian shapelet-time space. If 1D exponential shapelets more efficiently model the information in a glitch, exponential shapelet-time space would be even better. We will investigate in a future paper whether glitches can be detected by scanning a matched filter and correlating the measured signal with a 1D exponential shapelet model while leaving β (and possibly nmax) free.
2.6 Comparison with Gaussian shapelets
The MICROSCOPE transient modelled as exponential shapelets (nmax = 15) in Fig. 3 is modelled as Gaussian shapelets (nmax = 18) in Fig. 4. Achieving a similar precision of fit requires more coefficients, and creates more small-scale artefacts: The data are overfitted near the centre of the model, and underfitted at the extremes. We also find that its coefficients are highly covariant, with the good fit produced by large positive and negative basis functions almost precisely cancelling each other. This is much less apparent for the 1D exponential shapelet coefficients. Indeed, we find that estimating the model again on points different from those observed data fails to reproduce the overall shape of the transient, whereas the 1D exponential shapelet model is itself predictive.

Gaussian shapelet model of the MICROSCOPE transient whose 1D exponential shapelet model is shown in Fig. 3.
In this example, it appears that 1D exponential shapelets outperform Gaussian shapelets. This is because their perturbations around an exponential decay are better matched to the underlying signal, and because their wider extent spreads information density more evenly. In general, the type of shapelet to choose should depend on the decay rate of the target function.
3 2D EXPONENTIAL SHAPELETS
3.1 Definition
These functions form an orthonormal set of basis functions in the L2([0, ∞[ × [0, 2π[, 〈 ·, ·〉) space equipped with inner product |$\langle \Psi _{n,m}(r,\phi), \Psi _{n^{\prime },m^{\prime }}(r,\phi)\rangle = \int _0^{2\pi } \int _0^\infty \Psi _{n,m}(r,\phi)\, \Psi ^*_{n^{\prime },m^{\prime }}(r,\phi) \, r\, {\rm d}r\, {\rm d}\phi = \delta _{nn^{\prime }}\delta _{mm^{\prime }}$|.
It may also be possible to define elliptical 2D exponential shapelets by applying a shear transformation to the coordinate system, as Nakajima & Bernstein (2007) suggested for Gaussian shapelets. This preserves their orthonormality, and can increase their rate of data compression, at the cost of two additional non-linear parameters to specify the shear.
3.2 Properties
3.2.1 Maximum and minimum effective scales
The first few 2D exponential shapelet basis functions are illustrated in Fig. 5, and their radial component is shown in Fig. 6, defined via Ψn,m(r, ϕ; β) ≡ Rn,m(r; β)exp (−imϕ). Their resemblance with Gaussian polar shapelets is striking (see fig. 2 of Massey & Refregier 2005). However, due to their exponential kernel, exponential shapelets are both more peaky and further spread out than Gaussian shapelets. The size difference between the lowest-order basis function Ψ00 and even a low-order one like Ψ40 is striking. It is this behaviour that will help to describe spatially extended features.

The first few 2D exponential shapelet basis functions: real part (left) and imaginary part (right). Red is positive; blue is negative. The normalization of both the colour scale and the spatial scale is arbitrary, but is the same in every box.

Radial component of the first few 2D exponential shapelet basis functions, Rn,m(r). Functions with the same n are of the same colour (blue for n = 0, green for n = 1, red for n = 2, black for n = 3). Functions with m ≠ 0 are shown as dashed lines.
3.2.2 Fourier transformation and convolution
3.2.3 Coordinate transforms
It can be useful to know the response of the basis functions to 2D linear coordinate transforms: either to know how to mix the coefficients of a shapelet decomposition in order to perform that transform, or to form combinations of coefficients that are invariant under some transforms. This was used to construct estimators of the shear distortion applied to images of galaxies by the effect of weak gravitational lensing (Kuijken 2006; Massey et al. 2007).
A convenient shortcut to calculating those transforms for Gaussian shapelets was provided by the ladder operators associated with the quantum mechanical harmonic oscillator (Refregier & Bacon 2003). Unfortunately, we have not yet found a useful form of the ladder operators for the 2D hydrogen atom. If it becomes necessary to perform linear transformations on 2D exponential shapelets, it will be necessary to apply the transforms manually to the basis functions (a long and arduous task, but one that is guaranteed to yield a closed form, because the basis is complete).
3.3 Object shape measurement
Once a feature has been decomposed into 2D exponential shapelets, its coefficients can be used to construct characteristic measurements of its shape. In this section, we derive expressions for an object’s (azimuthally averaged) radial profile, flux, centroid, unweighted quadrupoles, size, and ellipticity, in terms of its shapelet coefficients. We have not attempted it here, but it should also be possible to develop an exponential-shapelet classification of galaxy morphologies via e.g. their concentration, asymmetry, and symmetry (Conselice, Bershady & Jangren 2000).
3.3.1 Radial profile
3.3.2 Flux
3.3.3 Centroid
3.3.4 Quadrupole moments
3.3.5 Size and ellipticity
3.3.6 Convergence
Expressions for the shape estimators resemble those obtained for polar Gaussian shapelets in section 6 of Massey & Refregier (2005). In particular, a feature’s radial profile, flux, and size are encoded within the n = 0 coefficients, while the centroid is described by the n = 1 coefficients and ellipticity by the n = 2 coefficients. The expressions also contain the same power of β as those for Gaussian shapelets, because that encodes information about the units in which the data is expressed.
However, shape measures for 2D exponential shapelets contain higher powers of n than those for 2D Gaussian shapelets, and will converge more slowly. Some of this is due to the normalization of the basis functions against an inner product. In the Gaussian shapelet case, this happens to yield an integral of basis functions (equation 31) that is independent of n, while in exponential shapelets it is ∝n3/2. That power of n could have been included in the basis functions and removed from the coefficients – although doing so would merely make them look more stable rather than actually altering convergence. In the limit that R → ∞, convergence of the flux estimator requires |fn,0| to decrease faster than n−5/2; convergence of centroid requires |fn,1| to decrease faster than n−11/2; and convergence of size and ellipticity requires |fn,0| and |fn,2| to decrease faster than n−13/2. This may not be a problem, since we expect 2D exponential shapelets to converge faster than shapelets (i.e. to have a lower nmax for a given galaxy). However, if this does not hold true, e.g. due to measurement noise, care should be taken to restrict the decomposition and measurement of photometry inside a finite aperture, or to truncate the 2D exponential shapelet model at finite nmax.
3.4 Example application
This section demonstrates the use of 2D exponential shapelets to model the shapes of galaxies seen outside our own Milky Way (they may also be convenient to model the shapes of gamma-ray events; S. Pires, private communication). We examine the convergence speed of 2D exponential shapelets, and compare their performance to that of 2D Gaussian shapelets.
3.4.1 Distant galaxies in astronomical imaging
Galaxies are collections of (hundreds of billions of) stars, with a combined surface brightness that peaks sharply near the centre, and decreases to large radii in a way that is often exponential. Four galaxies in the COSMOS survey, (Scoville et al. 2007a,b) observed by the Hubble Space Telescope (Koekemoer et al. 2007), are shown in Fig. 7. As mentioned in Section 2.4 for the 1D case, we follow the algorithm described by Massey & Refregier (2005) to choose the best non-linear parameters of the models –nmax, β, and centroid). Note that galaxies in this figure were modelled into shapelets with the ‘diamond’ truncation scheme introduced in Massey et al. (2007), which is intended to reduce small-scale oscillations by ignoring the higher m terms of the decomposition. In this case, the number of coefficients is divided by 2, allowing also a better compression of the information.

Test images of four galaxies from the Hubble Space Telescope COSMOS survey (Scoville et al. 2007a,b). For each galaxy, we show the observed data (upper left); the 2D exponential shapelet model (upper middle) and its residuals (upper right); the Gaussian shapelet model (lower middle) and its residuals (lower right). Also shown for each galaxy are the maximum order of decomposition nmax and the total number of coefficients ncoeffs in each model. For a given galaxy, the grey-scales are the same in all panels.
In all four cases, 2D exponential shapelets require fewer coefficients (lower nmax; here ncoeffs = 85) to provide a model whose residuals are consistent with the noise than Gaussian shapelets (here ncoeffs = 121). Even more interestingly, exponential shapelets tend to provide a (visually) cleaner model of the central region of the galaxies. The top left galaxy has an irregular morphology, and the 2D exponential shapelet model shows fewer artefacts than the polar shapelet model. For an elliptical (top right) or edge-on spiral galaxy (bottom left), the ringing common in Gaussian shapelets disappears with 2D exponential shapelets: They do not possess high-frequency ripples at large radii that need large coefficients alternating in sign for them to cancel. Even for highly eccentric galaxies (bottom right), 2D exponential shapelets provide cleaner models than polar shapelets, especially in the outskirts of the galaxies, as the exponential wings are fundamentally a better match to the underlying signal.
3.4.2 Convergence speed
Fig. 8 compares the radial profiles fitted with 2D exponential shapelets (upper row) or Gaussian shapelets (lower row), for different maximum orders of decomposition nmax, and coarsely optimised β (this could be further optimized at low nmax in the presence of noise). The normalization of the ordinate depends on bn and pixelization, and so is arbitrary; only its relative scaling is informative. For all three galaxy types, 2D exponential shapelets greatly outperform Gaussian shapelets. A decomposition to nmax = 2 or 4 is sufficient to model an exponential galaxy. Depending on the signal to noise of real data, a decomposition to nmax = 6 may be sufficient for an intermediate ns = 2 galaxy. It is more challenging to model a de Vaucouleurs galaxy, and even exponential shapelets require nmax = 12 to capture its extended tails three orders of magnitude fainter than the peak. However, they do so without the oscillatory ‘ringing’ introduced by the cancellation of positive and negative basis functions in Gaussian shapelets.

Exponential (upper panels) and Gaussian (lower panels) shapelet models’ speed to converge on the profile of different types of (noiseless) elliptical galaxies: exponential (ns = 1, left), intermediate ns = 2 (centre), and de Vaucouleurs galaxy (ns = 4, right). In each panel, squares represent the profile of the galaxy estimated in pixel space.
To illustrate the convergence speed, Fig. 9 shows the mean square residual of the models of the same galaxies as in Fig. 8, as a function of the maximum shapelet order nmax (upper row), or the total number of coefficients ncoeffs. The normalization of the ordinate is arbitrary, and only its relative scaling is informative. Once again, it is clear that 2D exponential shapelets (solid lines) outperform Gaussian shapelets (dashed lines). For exponential and intermediate galaxies (left-hand and middle columns), the residual of the 2D exponential shapelet model decreases consistently, and ends up several orders of magnitude better than that of Gaussian shapelets. The enhanced performance of exponential shapelets is even more prominent for a de Vaucouleurs galaxy. The limitation for exponential shapelets in this case is numerical precision of our algorithm to pixelate the basis functions. At high n, this breaks down, causing spurious residuals at the very centre of the model, shown as an artificial upturn in the bottom right-hand panel of Fig. 9. This could be circumvented by more accurate pixelization, or by imposing limits on θmin.

Convergence speed as a function of shapelets’ nmax (upper panels) and number of coefficients (lower panels), for exponential (solid lines) and Gaussian (dashed lines) shapelets, for the same Sersic galaxies as in Fig. 8 (ns = 1, 2, 4 from left to right).
4 CONCLUSION
In this paper, we introduced exponential shapelets, a family of orthonormal basis functions that can be efficiently used to model 1D and 2D objects. They borrow many concepts from the original Gaussian shapelets, but are perturbations around a decaying exponential rather than a Gaussian, and more efficiently compress the information in damped 1D oscillations or centrally concentrated 2D features. In particular, they can simultaneously capture the central peak and the extended wings of galaxies in astronomical imaging, thereby solving the main criticism levelled at Gaussian shapelets in the field for which they were originally intended.
Modelling a feature using exponential shapelets first requires a choice of characteristic scale size β and expansion order nmax; these can be selected using an non-linear optimization method developed for Gaussian shapelets (Massey & Refregier 2005). The function is then decomposed into a weighted sum of exponential shapelet basis functions using linear regression. A least-squares fit is often sufficient, although in some cases modelling a transient may be stabilized by regularization techniques such as sparsity constraints.
Once a feature is described as a weighted sum of exponential shapelets, simple combinations of the weights yield expressions for its total flux, centroid, size, and ellipticity. These are similar to those for Gaussian shapelets. However, convolution and deconvolution in exponential shapelets are significantly more difficult than in the Gaussian case, with the convolution tensor being time-consuming to calculate via numerical integration.
We described possible applications of exponential shapelets in several fields of experimental and observational science. One-dimensional exponential shapelets can be used to model (and subtract or understand) spurious transients in time-series, such as measurements by the MICROSCOPE or GRACE satellites, or the LIGO search for gravitational waves. Thanks to their exponential decay, 2D exponential shapelets overcome the main criticism aimed at Gaussian shapelets (though at the cost of losing some simplicity) and can be used to measure the brightness, shape, and size of distant galaxies in astronomical imaging. The characteristics of exponential shapelets should make them well suited to data compression and analysis in a wide range of fields; their convenient mathematical properties should see them frequently adopted.
ACKNOWLEDGEMENTS
The concepts in this paper matured over many years, after a preliminary study to generalize Gaussian polar shapelets for Hubble Space Telescope data, for which we acknowledge support from NASA through grant HST-AR-11747 – and which was carried out at the Jet Propulsion Laboratory, California Institute of Technology, under a contract with NASA. We are indebted to Barnaby Rowe for researching Kummer functions and the literature of the 2D hydrogen atom. We also acknowledge helpful and motivating discussions with Jason Rhodes, Richard Ellis, Alexandre Refregier, Sandrine Pires, James Nightingale, Bernard Foulon, Bruno Christophe, Gilles Métris, Manuel Rodrigues, Alain Robert, and Nicolas Touquoy. This work makes use of technical data from the CNES-ESA-ONERA-CNRS-OCA Microscope mission, and has received financial support from ONERA and CNES. JB acknowledges the financial support of the UnivEarthS Labex programme at Sorbonne Paris Cité (ANR-10-LABX-0023 and ANR-11-IDEX-0005-02) and of CNES through the APR programme (LISA project). RM is supported by a Royal Society University Research Fellowship and through STFC grant ST/N001494/1.
Footnotes
The mission ended on 2018 October 16; data analysis is still underway.
REFERENCES
APPENDIX A: DERIVATION OF CONVOLUTION KERNEL TENSOR Cnml FOR 1D EXPONENTIAL SHAPELETS
A1 Proof of equation (13)
It can be noted that Cnml is a complex number, unless u + v + w is even, such that iu + v + w = −1. If u + v + w is odd, then the integral is 0 (see equation A2). Hence, we do not need to impose any restriction on u + v + w for Cnml to be real. We can also note that since u ≤ 2m, v ≤ 2l, and w ≤ 2n, the integral in equation (13) converges.
A2 Properties of |$\mathcal {I}_{u,v,w}^{m,l,n}$|’s integrand
A2.1 Alternative definition
A2.2 Parity
It is obvious that f(− x) = (− 1)u + v + wf(x), so that f is even (resp. odd) when u + v + w is even (resp. odd). Hence, |$\mathcal {I}_{u,v,w}^{m,l,n}$| vanishes when u + v + w is odd, in which case Cnml = 0. As mentioned in the main text, Cnml is real if u + v + w is even (then, Cnml is real for all combinations of u, v, w, n, m, l). In the remainder of this appendix, we restrict ourselves to x ≥ 0.
A2.3 Value in 0
Evidently, f(0) = 0 if u + v + w ≠ 0. If u + v + w = 0, then f(0) = 1.
A2.4 Limit at x → ∞
Since 0 ≤ u ≤ 2m, 0 ≤ v ≤ 2l, and 0 ≤ w ≤ 2n, it is easy to see that limx → ∞ = 0.
A2.5 Dependence on n, m, l
For a given x, f quickly decreases as m2(m + 1) (and similarly for n and l), showing that only the contributions of low n, m, and l are significant in Cnml. Then, the series hn (equation A6) converges quickly.
The left-hand panel of Fig. A1 shows f(x) for some realistic combinations (α, β, γ, n, ml, u, v, w). It can be seen that f(x) is well behaved, and tends quickly towards 0 for large x. Its extent depends on α, β, and γ. As mentioned above, for a given x, it quickly decreases when n, m, l increase. The right-hand panel of Fig. A1 shows f(x) when γ = α = β = 0.1, n = 2, m = 1 l = 1, w = 2, u = 1 and v = 1, together with the values x = min ((mα)−1, (lβ)−1, (nγ)−1) and x = max ((mα)−1, (lβ)−1, (nγ)−1) (borders of the grey area), which are key values in computing |$\mathcal {I}_{u,v,w}^{m,l,n}$| (see below).
![f(x) function. Left: f(x) for different values of α, β, γ, n, m, l, u, v, w. Right: f(x) for γ = α = β) = 1, n = 2, m = 1 l = 1, w = 2, u = 1, v = 1; the grey region shows the domain [min ((mα)−1, (lβ)−1, (nγ)−1), max ((mα)−1, (lβ)−1, (nγ)−1)].](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/mnras/486/1/10.1093_mnras_stz787/1/m_stz787figa1.jpeg?Expires=1749449856&Signature=1RAcIHtzo3g02QTLgu2PBPXzCOjxrUXzd9~HsdqihH4Sk4s2Z3l6zQ0lEfnukksc6dUvRuNGf6S~bVWYd5Kov425lQTDO-YgKrs6Qbbb98cEPz3WxecPLb7TdT0iV3H7wEtNiNknCv2-tkH19GaoIBWSHn5U6NSyzNOS4AMbJqLGrGbZsC0k3FVIzftQPY5ppo7Ar8fkZF42xWim1k2iwNPj~t~JTQmdqD5G014d~qGPwHSOvvI3ZXHU8fSV4qOXDY~najX2j24gmuij~rfYZPJMm1PCG9e~6eIbjsEVWPbxIX4sW~t-aWzlGg1vskf13RPskEOl0XX45ueBI7Ic7Q__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
f(x) function. Left: f(x) for different values of α, β, γ, n, m, l, u, v, w. Right: f(x) for γ = α = β) = 1, n = 2, m = 1 l = 1, w = 2, u = 1, v = 1; the grey region shows the domain [min ((mα)−1, (lβ)−1, (nγ)−1), max ((mα)−1, (lβ)−1, (nγ)−1)].