-
PDF
- Split View
-
Views
-
Cite
Cite
Rodrigo S De Negri, Robin S Matoza, Patrick Hupe, Alexis Le Pichon, Kaelynn M Rose, Sandrine Cevuard, John J Niroa, Evaluating the temporal capability of empirical climatologies for rapid long-range volcanic infrasound propagation estimates using a multidecadal data set of persistent Vanuatu volcanic eruptions, Geophysical Journal International, Volume 241, Issue 1, April 2025, Pages 268–290, https://doi.org/10.1093/gji/ggaf027
- Share Icon Share
SUMMARY
Powerful infrasound (acoustic waves
1 INTRODUCTION
Explosive volcanic eruptions are formidable sources of infrasound (acoustic waves
The International Monitoring System (IMS) of the Comprehensive Nuclear-Test-Ban Treaty Organization (CTBTO) includes a global infrasound network that currently has 53 certified of 60 planned infrasound arrays (Christie & Campus 2010; Marty 2019). The IMS, designed to detect explosions with a yield greater than 1 kT TNT equivalent anywhere on Earth (Le Pichon et al. 2009a; Green & Bowers 2010; Marty 2019), has recorded numerous eruptions (Campus & Christie 2009; Dabrowa et al. 2011; Matoza et al. 2019). For example, 68 of 110 volcanic events worldwide (
Previous studies have shown that we can use several infrasound arrays to detect, locate and track the chronology of distant volcanic eruptions, and provide eruption source parameters for ash transport and dispersal models (e.g. Matoza et al. 2007, 2011a, b, 2017, 2018; Garcés et al. 2008; Fee et al. 2010; Dabrowa et al. 2011; Green et al. 2013; Marchetti et al. 2013; Caudron et al. 2015; Taisne et al. 2019; Perttu et al. 2020). Although most practical algorithms require detection by at least two or three infrasound arrays (stations) to locate the source (e.g. Evers & Haak 2005; Arrowsmith et al. 2008; Modrak et al. 2010; Morton & Arrowsmith 2014; Matoza et al. 2011a, 2017), single-station detection (i.e. detection by one array) of volcanic infrasound signals can still provide valuable information on eruption occurrence (e.g. Fee et al. 2010; De Angelis et al. 2012; Marchetti et al. 2019; Matoza et al. 2019; Ortiz et al. 2020; De Negri et al. 2022; Morelli et al. 2022; Gheri et al. 2023). Long-range infrasound also provides observational constraints on the activity of very remote volcanoes, such as those in the South Sandwich Islands, that can be challenging to observe with satellite remote sensing and for which local instrumentation is difficult to maintain (De Negri et al. 2022).
The dynamic nature of the atmosphere presents a major challenge for infrasound monitoring. Spatiotemporal variability at temporal scales including hourly, daily and seasonal, and at spatial scales from the microscale to synoptic scale, is partially captured by various different atmospheric specification products (Garcés et al. 1998; Drob et al. 2010; Smets et al. 2016; Schwaiger et al. 2019). One of the most important atmospheric phenomena influencing infrasound propagation are the vector wind fields, which can create infrasound waveguides (along-path winds), but also impose cross-path winds (cross-winds) leading to deviations from the great-circle path connecting the true source and receiver locations (back-azimuth deviations), and upwinds resulting in shadow zones (e.g. Evers & Haak 2005; Matoza et al. 2011a; Green et al. 2012; De Negri & Matoza 2023). The back-azimuth deviations can be up to
Morton & Arrowsmith (2014) showed that by combining 3-D ray tracing techniques with realistic atmospheric profiles, lookup tables for celerity (ratio of range over total traveltime) and back-azimuth priors can be produced to describe global seasonal effects that could improve infrasound detection, location and association. A similar approach was used by Drob et al. (2010), who proposed combining the
Building on the previous ideas, Matoza et al. (2018) and De Negri & Matoza (2023) showed that the mislocation of known volcanic sources could be reduced by an order of magnitude by applying corrections to observed back-azimuths before locating the source with a brute-force, grid-search and cross-bearings method (Matoza et al. 2017). De Negri & Matoza (2023) compared the results of using different atmospheric models in this approach, and found that statistical models, also known as empirical climatologies [i.e. the Horizontal Wind Model HWM14 (Drob et al. 2015) and the MSIS2.0 model (Emmert et al. 2020)], can be used to generate back-azimuth corrections that perform comparably well to those calculated using the more advanced and accurate hybrid models [e.g. the European Centre for Medium-Range Weather Forecasts (ECMWF) reanalysis v5 (ERA 5) for the first
It is clear that, in terms of temporal and spatial accuracy, the best available descriptors of the lower atmosphere (
The methodology introduced by De Negri & Matoza (2023) uses readily accessible numerical tools and offer a reproducible workflow for calculation of lookup back-azimuth tables for arbitrary source–receiver locations, considering infrasound arrivals up to thermospheric altitudes. The rapid, first-order approach also reduces the computation burden that more accurate models require. Here, we evaluate the accuracy of their methodology with observations from the IMS infrasound array IS22, New Caledonia, which has recorded persistent infrasound from volcanoes in the Vanuatu volcanic archipelago (
In Section 2, we introduce the area of study, highlight the most infrasonically active volcanoes in the region, and present the data sets we use to compare with our models. In Section 3, we introduce our methodology to estimate year-long infrasound back-azimuth deviations, and the different atmospheric and ray tracing propagation schemes we consider. In Section 4, we summarize the annual back-azimuth prediction results, compare them with the multiyear data set, and measure the accuracy of each model on a year-long time window. In Section 5, we discuss the results and the limitations of our approach, and pinpoint some interesting results. Finally, in Section 6 we review our findings and suggest the next steps forward.
2 THE VANUATU ARCHIPELAGO VOLCANIC ARC
Along the South Vanuatu Trench in the South Pacific Ocean, more than 80 islands stretch northeastward to form the Vanuatu archipelago volcanic arc (Fig. 1; Eissen et al. 1991; Simkin & Siebert 1994). The active volcanoes of this archipelago have a rare diversity of eruptive styles which range from effusive to highly explosive, hence becoming a valuable laboratory for volcanologists to better understand eruption dynamics (Beaumais et al. 2016; Coppola et al. 2016; Meier et al. 2016; Vergniolle & Métrich 2016).

Study region (see box in upper right inset), comprising the Vanuatu archipelago and the main island of New Caledonia (Grande Terre). Black square: location of IS22, New Caledonia. Red triangles: location of the most active volcanoes; names on right. Grey dashed lines: great-circle distances and azimuths from IS22 to each volcano (annotated). Lower-right: IS22 array geometry in Cartesian coordinates (km).
As shown in Coppola et al. (2016), from 2000 to 2015 Yasur (Tanna island), Lopevi (Lopevi island), Ambrym (Ambrym island), Lombenben (Ambae island, herein referred as Ambae) and Mont Garet (Gaua island, herein referred as Gaua; see also Park et al. 2021) volcanoes were the most active volcanoes of Vanuatu archipelago. Although most local multiparameter studies in the region target Yasur due to its essentially continuous explosive activity (Marchetti et al. 2013; Meier et al. 2016; Spina et al. 2016; Maher et al. 2022; ; Matoza et al. 2022a).
2.1 IS22: multidecadal long-range volcanic infrasound records
At
Coherent infrasound signals propagating across IS22 can be extracted from non-coherent infrasound with the progressive multichannel correlation (PMCC) time-domain method (Cansi 1995). The PMCC method can obtain the coherent signal’s mean frequency, trace velocity, amplitude and azimuth remarkably well under low signal-to-noise ratios (
Among many natural and human-made sources detected by IS22 (e.g. microbaroms, surf, mining explosions, etc.; see Fig. 2), the persistent eruptive activity of Yasur volcano (19.532

Infrasound detections at IS22 from mid-2003 through 2022 (Hupe et al. 2022). (a) 2D colour-shaded histogram of detections by year (x-axis) versus azimuth (y-axis), coloured by the number of detections in each bin. Each 2-D bin is 277.4 hr (
2.2 Open access infrasound data products
In this study, we use open access infrasound data products developed by Hupe et al. (2022), which represent summaries of array processing detection lists rather than the raw lists. These data products started as the result of synthesizing 2003 to 2020 IMS infrasound detections produced from PMCC array processing with a 26-band logarithmic-spaced parametrization, and since then have been updated yearly. The open access data products (see Hupe et al. 2021) use a quality value to gather the most robust PMCC detections and classify them into four specific frequency ranges, which target the most common natural and human-made phenomena: 0.02–0.07 Hz “very low frequency” (e.g. mountain associated wave infrasound), 0.15–0.35 Hz “low-frequency microbarom”, 0.45–0.65 Hz “high-frequency microbarom”, and 1–3 Hz “high-frequency” (e.g. volcanic eruptions and surf).
Here, we will use the “high-frequency” (1–3 Hz) open access products (herein, “PMCC products”) for IS22 covering from years 2003 to 2023 (Fig. 2; see also Fig. S1 in the Supporting Information), as most of the infrasound at IS22 from Lopevi/Ambrym and Yasur volcanoes centres around 1 and 2 Hz, respectively (Le Pichon et al. 2005a; Antier et al. 2007; Assink et al. 2014a; Morelli et al. 2022).
For comparison and validation of these data summary products, we compare with full array detection lists using 15-band (2003–2017) (Matoza et al. 2013; Ceranna et al. 2019) and 26-band (2003–2023) (Hupe et al. 2022) PMCC parametrizations in the Supporting Information (see Figs S4–S8 and S11–S13 in the Supporting Information). Unlike the summaries by Hupe et al. (2022), these detection lists are the direct output of PMCC array processing of the waveforms recorded at the station IS22.
2.3 Chronological and “folded” observations
Volcanic, surf and possibly mining infrasound (“anthropogenic”), are the three main types of “high-frequency” (1–3 Hz) detections at IS22 from 2003 to 2022 (see labels in Fig. 2a). Non-volcanic infrasound falls at much higher back-azimuths than Yasur (
The periods of volcanic activity catalogued by the Vanuatu Meteorology and Geo-hazards Department (Figs 2b and c in grey vertical lines and shaded areas; also by relative RSAM from a couple of available stations near Yasur volcano in Fig. 2b) serve as valuable complementary observations (e.g. McKee et al. 2021a, b; Matoza et al. 2022a). Their observations indicate that overlapping of infrasound sources among the Ambrym group seems feasible. In particular, during the second half of 2010 (Fig. 2c), a large group of detections appears around the azimuth of Gaua volcano, coincident with a long eruptive period that occurred from September 2009 to July 2010 (Bani et al. 2016).
Sorting the detections by day of year (i.e. “folding” into a year; Figs 2d–f), shows that most of the 2003–2022 IS22 infrasound detections come from the directions of Yasur and the Ambrym group volcanoes. The abundant detections that chronologically resemble “bottom half sinusoid” curves from the back-azimuth of Yasur (Figs 2a and b) are sorted into a “capped arc” now (Fig. 2e). This feature shows up between October and March (Fig. 2e; days
3 ANNUAL LONG-RANGE PROPAGATION MODELS
3.1 Methodology
We attempt to model the bulk seasonal to daily effects to first-order (e.g. the “capped arc” and “main arc” features; Fig. 2e), which could provide valuable insights and potentially seasonal predictions of the azimuth deviation from each source. To do so, we use the methodology introduced by De Negri & Matoza (2023) to simulate the observed detections at IS22 from the locations of Yasur and Ambrym volcanoes (see the “ARCADE” open-source repository link in Data Availability) in year-long time windows.
The ARCADE procedure (De Negri & Matoza 2023) is designed to automatically look for ground intercepts (arrivals) at pre-defined receiver locations (stations) coming from an arbitrary number of source locations (volcanoes), at specified dates and times. By default, it couples empirical climatologies (HWM14/MSIS2.0) (Drob et al. 2015; Emmert et al. 2020) and models infrasound propagation with 3-D ray tracing (infraGA) (Blom & Waxler 2012, 2017; Blom 2020). The atmospheric models can also be discretized as range-independent (layered spherical) or range-dependent (3-D grid for the area), allowing different levels of detail in the specifications. The modularity of this approach permits using different atmospheric specifications that we will test in terms of accuracy.
3.1.1 Hybrid model
The empirical climatologies are stochastic models based on interpolations of historical data collections, which are improved over time as new global data sets are gathered. This means that they are robust atmospheric descriptions, but smooth out smaller spatiotemporal changes that occur in the lower parts of the atmosphere (troposphere/lower stratosphere), as the quasi-biennal oscillation (QBO; Anstey & Shepherd 2014; Kidston et al. 2015) and sudden stratospheric warming (SSW) events (Drob et al. 2010; Assink et al. 2014a; Smets et al. 2016; Drob 2019). To try to account for this issue, ARCADE (De Negri & Matoza 2023) does include the option of using a hybrid model that merges the ECMWF ERA 5, or ERA 5 (for simplicity) descriptions (horizontal winds, pressure and temperature) below
The hybrid models will be the only type able to distinguish each year, as the empirical climatologies are interpolations of multiyear data into one characteristic year. These models will therefore be more accurate in representing transient non-seasonal changes, in detriment of representing repetitive (seasonal) phenomena in a multiyear scale.
3.1.2 Perturbed climatologies
One possible approach to account for phenomena not represented in the empirical climatologies is to generate multiple fine-scale variations of the average descriptions (multiple statistical realizations) simulating fine-scale structure or gravity wave effects (e.g. Assink et al. 2014a; Smets et al. 2016). Here, however, we take a simpler parametric approach, incorporating the addition of a Gaussian wind jet (Jones 1986) centred at a specific altitude, which creates perturbations of the range-independent climatological (HWM14/MSIS2.0) descriptions. This simplified type of perturbation is intended to adequately compensate for missing fine-scale structure that is not included in either hybrid atmospheric specifications nor empirical climatologies, without having to resort to a full gravity-wave model parametrization and multiple ensemble realizations. This approach, which was further motivated by De Negri & Matoza (2023), aims to sufficiently “nudge” the profiles to form a duct when the observations imply that the duct existed, yet the unperturbed profiles do not predict arrivals.
3.1.2.1 Static perturbation
Originally, De Negri & Matoza (2023) defined the Gaussian perturbation parameters after manually analysing effective wind profiles for the cases studied. The parameters of altitude, width and amplitude that would increase the horizontal winds were set to avoid unrealistic arrivals. In this type of perturbation, the Gaussian enhancement is fixed at 40 km altitude by default, increasing the along-path winds with an amplitude,
where
3.1.2.2 Adaptive perturbation
Attempting to approximate the higher amplitude and smaller oscillation lengths of typical gravity wave perturbations (e.g. Green et al. 2011; Drob et al. 2013; Smets et al. 2014), we tested several reasonable parameter modifications of eq. (1), to examine how the altitudes, widths and amplitudes of perturbations could increase stratospheric arrivals at IS22. After these preliminary experiments, we incorporated a second type of perturbation based on eq. (1). We decided to (1) increase the amplitude perturbation (

Example of mean horizontal winds versus altitude for 2019 January 15 at 00:00 UTC from Yasur to IS22 (x-axis) in steps of 0.5 km altitude (y-axis), calculated with climatologies (black line), static perturbed climatologies (blue dashed line), adaptive perturbed climatologies (green dashed line) and the hybrid model (red line) descriptions. (a) Zonal winds (positive towards the east). (b) Meridional winds (positive towards the north). (c) Crosswinds (positive towards north–east). (d) Effective sound speed considering the along-path winds and temperature in altitude.
To summarize, in this study we will use “raw climatologies”, “hybrid models”, “static perturbed climatologies” (default perturbation) and “adaptive perturbed climatologies” (last introduced) to obtain the atmospheric descriptions for each modelled case (Fig. 3).
3.2 Propagation models
After inspecting the multiyear (2003–2022) set of detections (Figs 2a and b; also Figs S4 and S5 in the Supporting Information), we decided to calculate year-long infrasound propagation models corresponding to 2004 for Ambrym volcano (herein, “2004 Ambrym”) and 2019 for Yasur volcano (herein, “2019 Yasur”). These two years have numerous candidate detections for each case, presenting complete data sets to compare with the models.
We cover all the possible combinations with this methodology, calculating six different models per volcano and year: range-independent with raw climatologies, range-independent with static perturbed climatologies, range-independent with adaptive perturbed climatologies, range-independent with hybrid model, range-dependent with raw climatologies and range-dependent with hybrid model.
With each model, an iterative ray tracing search is automatically performed for every day of the year, every six hours [00:00, 06:00, 12:00 and 18:00 Coordinated Universal Time (UTC)]. The source–year–day–time runs try to estimate the back-azimuth deviation using arrivals near enough IS22 (
Using the year-long sets of back-azimuth deviations for each model, we build year-long 1-D spline interpolations based on the daily means (Fig. 4, see green lines for each model). The interpolated curves will serve to compare models, and more importantly, to calculate the accuracy of each model when compared with the observations.

Modelled arrival relative back-azimuths (centred around each volcano) coloured by turning height threshold (see bottom legend) for all times (00:00, 06:00, 12:00 and 18:00 UTC). Red dots: arrivals with turning height less than 60 km (usually near 30–50 km). Black dots: arrivals with turning height more than 60 km (usually near 110–120 km). Green line: 1-D spline interpolations of the daily mean deviations, with a smoothing factor of 10 (less smooth) for the climatology-based models, and 50 (more smooth) for the hybrid-based models. Both curves were calculated using interpolations tools included in the module Scipy of the Python 3.9 libraries (splrep and splev), which are based on the FITPACK Fortran libraries (Dierckx 1995). Left subplots: Ambrym cases. Right subplots: Yasur cases. (a) and (b): range-independent, raw climatologies. (c) and (d): range-independent, static perturbed climatologies. (e) and (f): range-independent, adaptive perturbed climatologies. (g) and (h): range-independent, hybrid. (i) and (j): range-dependent, hybrid. (k) and (l): range-dependent, raw-climatologies (note that (k) has no results).
All models managed to produce a mostly complete set of arrivals to calculate the back-azimuth associated with each source–year–date–time combination. However, the range-dependent, raw climatology model for Ambrym volcano failed to produce ground intercepts (arrivals) near enough IS22 (Fig. 4k). We interpret this issue as the indication of an unstable propagation model, where the ray propagation becomes too sensitive to small changes in the launching parameters. However, further clarifications will be part of future research as we gathered enough results to complete the comparison with the data sets.
4 RESULTS
4.1 Regional winds and detectability
Yasur and Ambrym groups of detections have a marked yearly oscillating back-azimuth behaviour. This phenomenon correlates well with the yearly regional zonal wind patterns, which are clearly captured by climatology-based and hybrid atmospheric models (e.g. Fig. 5 for winds during 2019).

Seasonal atmospheric conditions at 00:00 UTC from 2019 January 1 to 2019 December 31. (a, b) Captured by a hybrid model combining ECMWF ERA 5 atmospheric specifications below
Infrasound propagating from Ambrym and Yasur to IS22 (Fig. 1) will encounter eastward stratospheric (
4.2 Observations: interpretations and shortcomings
4.2.1 Seasonal arcs and transient phenomena
The shared arc-like (sinusoidal) behaviour of the modelled detections (Figs 4 and 5) suggests that the candidate detections coming from Yasur and Ambrym back-azimuths (Fig. 2) are mainly due to the effect of the seasonal stratospheric zonal winds.
The near-continuous volcanic explosive signals from Yasur to IS22 shows as “bottom half sinusoids” or “capped arc” features due to low detectability during upwind conditions in winter (eastward winds; e.g. Fig. 5 for 2019). Transient stratospheric westwards ducts during dominant eastwards winds, linked with sudden stratospheric warming events in the region (Assink et al. 2014a) (e.g. Fig. 5b near July and September for 2019), allow detections to be seen for days to weeks in a row (e.g. Figs 4g and i for Ambrym and Figs 4h and j for Yasur near day 170).
Most of the detections of the “Ambrym group” occur during transition zonal wind regimes (March to April and August to October), which means that the bulk of the Ambrym group happens during a weakened or non-existent stratospheric duct. Indeed, the atmospheric effective sound speeds profiles for the region suggest that these candidate detections are enabled by the thermospheric duct (
It is likely that the “second arc” present in the folded observations (Figs 2d and f) is an overlap of candidate detections from Gaua and Ambae volcanoes. The models indicate that the daily variations (Fig. 4) are not higher than the back-azimuth differences from Ambrym/Lopevi and Gaua/Ambae (
4.2.2 Artifacts and frequency bands
At the start of 2018, we observe two groups of detections spanning from
4.3 Comparison with year-long and multiyear data sets
We aim to compare the back-azimuth interpolated model curves from year-long infrasound back-azimuth deviation estimations at IS22 (see Fig. 4), with the high-frequency (1–3 Hz) infrasound products (Hupe et al. 2021) for the years 2004, 2019, and the whole stacked (folded) multiyear time window (2003–2022). However, the ever-present non-volcanic overlapping sources (clutter), and propagation scatter effects make the comparison of a single interpolated curve difficult. Therefore, before calculating the error estimation of each model, we need to apply reduction techniques to the observations to represent the average behavior of the candidate (volcanic) detections.
4.3.1 Data reduction
To reduce clutter, we first select the high-frequency products (Hupe et al. 2021) that have a quality parameter
4.3.2 2004 Ambrym and 2019 Yasur comparison
The interpolated curves, centred at each back-azimuth, generally follow the most numerous candidate detection groups for both 2004 Ambrym and 2019 Yasur cases (Figs 6a and c). The 7-d 90 per cent confidence reduced observations are a much more clear indication of the fitting of each model to the observations (Figs 6b and d).

Modelled predicted back-azimuth deviations compared to IS22 observations (high-frequency 1–3 Hz products of Hupe et al. 2021). (a, c) 2-D binned observations coloured by number of detections (see Fig. 2) versus the interpolated curves of the models coloured by type (see Fig. 4) for 2004 Ambrym and 2019 Yasur, respectively. (b, d) 7-d means of reduced observations within 90 per cent confidence interval from the mean as coloured circles by mean frequency, with circle size by number of “families” (a measure of time-frequency span of the signals) versus the same interpolations.
For 2004 Ambrym case, the reduced observations have a large negative deviation (
For 2019 Yasur case, the reduced observations show a good agreement with all the interpolated curves when the westward stratospheric duct is present (days 1 to
4.3.3 Ambrym and Yasur multiyear (2003–2022) comparison
The models show general agreement with the multiyear observations again from days 1 to
The hybrid models, despite their specific time dependency, better represent the slope of the transitions from negative to positive deviations (e.g. day 100–120).
Between day
More outlier detections build up in the multiyear view for Yasur’s comparison (Fig. 7d; days
4.3.4 Accuracy of the models
Up to this point, we have shown that the models are qualitatively comparable with the reduced data sets (Figs 6 and 7). Now, we formulate a simple monthly and yearly quantitative evaluation of the model-data discrepancy.
Following the 7-d time window of each reduced data point (see Section 4.3.1), we first calculate the mean of the modelled back-azimuth deviations in 7-d time intervals. Then, we calculate the monthly discrepancy using the Euclidean distances between the two sets of 7-d values (model versus observations), building means by month (Fig. 8). These values preserve the sign that corresponds to the difference between the observations and modelled back-azimuth deviations, reflecting the expected azimuthal direction of the discrepancy. Additionally, we calculate the year-long differences by averaging the magnitudes of their monthly discrepancies. These will be the final values representing the overall accuracy of each model with respect to the reduced observations (Table 1; also Figs S9 to S13 in the Supporting Information).

Euclidean distance between reduced high-frequency infrasound observations (Hupe et al. 2021) (Figs 6 and 7) and interpolated modelled back-azimuths per month (non-leap year). (a) 2004 Ambrym. (b) 2003–2022 Ambrym. (c) 2019 Yasur. (d) 2003–2022 Yasur. The grey labels on top separated by vertical grey lines mark each month. (see Figs S9 and S10 in the Supporting Information for more details).
Summary of mean differences with standard deviations (
Volcano . | Year(s) . | Model type . | Mean diff. |
---|---|---|---|
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Ambrym | 2004 | Range-ind., hybrid | |
Range-dep., raw clim. | No data | ||
Range-dep., hybrid | |||
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Yasur | 2019 | Range-ind., hybrid | |
Range-dep., raw clim. | |||
Range-dep., hybrid | |||
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Ambrym | 2003–2022 | Range-ind., hybrid | |
Range-dep., raw clim. | No data | ||
Range-dep., hybrid | |||
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Yasur | 2003–2022 | Range-ind., hybrid | |
Range-dep., raw clim. | |||
Range-dep., hybrid |
Volcano . | Year(s) . | Model type . | Mean diff. |
---|---|---|---|
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Ambrym | 2004 | Range-ind., hybrid | |
Range-dep., raw clim. | No data | ||
Range-dep., hybrid | |||
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Yasur | 2019 | Range-ind., hybrid | |
Range-dep., raw clim. | |||
Range-dep., hybrid | |||
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Ambrym | 2003–2022 | Range-ind., hybrid | |
Range-dep., raw clim. | No data | ||
Range-dep., hybrid | |||
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Yasur | 2003–2022 | Range-ind., hybrid | |
Range-dep., raw clim. | |||
Range-dep., hybrid |
Summary of mean differences with standard deviations (
Volcano . | Year(s) . | Model type . | Mean diff. |
---|---|---|---|
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Ambrym | 2004 | Range-ind., hybrid | |
Range-dep., raw clim. | No data | ||
Range-dep., hybrid | |||
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Yasur | 2019 | Range-ind., hybrid | |
Range-dep., raw clim. | |||
Range-dep., hybrid | |||
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Ambrym | 2003–2022 | Range-ind., hybrid | |
Range-dep., raw clim. | No data | ||
Range-dep., hybrid | |||
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Yasur | 2003–2022 | Range-ind., hybrid | |
Range-dep., raw clim. | |||
Range-dep., hybrid |
Volcano . | Year(s) . | Model type . | Mean diff. |
---|---|---|---|
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Ambrym | 2004 | Range-ind., hybrid | |
Range-dep., raw clim. | No data | ||
Range-dep., hybrid | |||
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Yasur | 2019 | Range-ind., hybrid | |
Range-dep., raw clim. | |||
Range-dep., hybrid | |||
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Ambrym | 2003–2022 | Range-ind., hybrid | |
Range-dep., raw clim. | No data | ||
Range-dep., hybrid | |||
Range-ind., raw clim. | |||
Range-ind., stat. pert. clim. | |||
Range-ind., adapt. pert. clim. | |||
Yasur | 2003–2022 | Range-ind., hybrid | |
Range-dep., raw clim. | |||
Range-dep., hybrid |
Ambrym monthly differences for 2004 (Fig. 8a) oscillate between a negative maximum of
The range-dependent, hybrid model shows largest difference, with a yearly mean of 1.59
The Yasur monthly differences for 2019 (Fig. 8c) are mostly below 1
The model with the smallest yearly mean difference in this case is the range-independent, adaptive perturbed climatologies (0.43
Overall, the considerable spread of the mean euclidean differences for both cases shows that the models, rather than better or worse in absolute terms, are comparable. This indicates that the empirical climatologies are valid to understand the specific year-long, first-order behaviour of the infrasound observations for 2004 and 2019.
The comparison with the 2003–2022 stacked data set shows an increased yearly mean difference for all models (Table 1; Figs 8b and d; also Fig. S10 in the Supporting Information). As the multiyear comparison is suited for the empirical climatologies, the hybrid descriptions will be an upper limit reference of the monthly and year-long differences.
The monthly differences show that, when stacking the data, the overall discrepancies are negative and peak around August and May for Ambrym and Yasur cases, respectively (see Figs 8b and d). As previously shown, these negative peaks occur during periods of low numbers of candidate detections from either Ambrym or Yasur, caused by unfavourable ducting conditions, and non-seasonal detections that can be associated with transient favourable ducts (QBO or SSW events) and clutter infrasound. February to April, and October to November are overall the months with smallest differences for Ambrym, while September to February (cyclically) are the ones for Yasur. These months align with the arc-like features pointed out in Figs 2(e) and (f), during times of favourable ducting conditions.
Both multiyear mean differences show the same relative results (Table 1; also Fig. S10 in the Supporting Information). The range-independent, adaptive perturbed climatologies provide the minimum differences: 0.90
5 DISCUSSION
The accuracy measurements we present in Table 1 are an attempt to compare our rather simple models with a large amount of observations which have a much more complex behaviour. The mean differences represent a measure of agreement between the year-long back-azimuth deviation interpolated curves and the reduced data, for a specific year (2004 or 2019) or as a multiyear stack (2003–2022). We remark that these values serve to compare between the models we tested for this specific area and the times considered, but do not provide a general interpretation of the performance of each model scheme for the globe unless future experiments show similar results. Keeping this in mind, we see that all the models perform similarly well despite the considerable differences among the used atmospheric descriptions (i.e. climatologies versus hybrid; raw climatologies versus perturbed climatologies) and ray tracing model types (range-independent versus range-dependent). The mean differences remain in the range
There is significant spread (
The range-independent models provide comparable or better results than the range-dependent models (Table 1). This seems counterintuitive, but it tells us that the atmospheric descriptions in the area of study only change significantly in altitude at each modelled time. Indeed, the smallest horizontal resolution is 0.25 deg (ERA 5), or about 27 km, meaning that atmospheric changes in the order of
The hybrid models are the only ones able to map shorter term wind reversals that happen in the middle of the year (see Figs 6 and 7). However, the monthly means (Figs 8a and b) indicate that the hybrid models predict back-azimuths that deviate significantly from those observed. A rule of thumb to calculate year-long predictions for a specific area would be to favour climatology-based models during periods with stable seasonal ducting, while switching to NWP-based atmospheric models when events as SSW or QBO-driven wind reversals have been observed in the past. Such analysis could also benefit from the characterization of seasonal trends of stratospheric waveguides and detectability on pre-defined areas or station locations (e.g. Blom et al. 2023).
We note that here we use the persistent Yasur and Ambrym volcanic sources to evaluate our method on continuous data. However, this method is generally intended for sudden, unexpected, large explosive eruptions without recent priors, such as the 2011 Puyehue-Cordón Caulle eruption, Chile; or the 2015 Calbuco eruption, Chile (De Negri & Matoza 2023). For well documented, persistent, infrasound sources such as Yasur or Ambrym volcanoes, other approaches based on data regression or machine learning could better perform in terms of their predictive accuracy. However, these approaches would not be suitable for volcanic observatories without long records of prior detections and new back-azimuths, real time atmospheric data access, and/or computational resources. That is, climatologies with ray tracing (predictions based on atmospheric structure only) are still needed to obtain critical information to characterize new transient infrasound during a volcanic emergency (De Negri & Matoza 2023).
5.1 Seasonal and daily changes of the models
Two interesting characteristics stand out at seasonal and diurnal time scales (Figs 9 and 10, for 2004 Ambrym and 2019 Yasur).

Comparison of back-azimuth deviation results for infrasound propagation from Yasur to IS22 during year 2019. For each 6-hr time snapshot, we plot the mean azimuth deviation of predicted arrivals falling within a distance of 5.5 km from IS22 relative to Yasur’s azimuth (

Similar to Fig. 9, but with back-azimuth deviation results for infrasound propagation from Ambrym volcano to IS22 during 2004. In this case, the range-dependent raw-climatologies do not predict arrivals within our criteria (within 5.5 km from IS22).
First, the asymmetry of the negative versus positive peak seasonal back-azimuth deviations is much more marked in the empirical climatologies than the hybrid models. This peak-to-peak asymmetry seems to be driven mainly by the 12:00 UTC arrivals, especially in the climatology-based results. For Yasur volcano (southwest propagation), the empirical climatologies show deviations that have maximum values of
Second, all models present a similar semidiurnal behaviour, showing the biggest deviations at 00:00 and 12:00 UTC, and the smallest deviations at 06:00 and 18:00 UTC (e.g. days 1 to

Example 2-D histograms of detections by time of day for the 2003–2022 PMCC high-frequency (1–3 Hz) products (see also Fig. S15 in the Supporting Information). (a, c, e, g) Detections around the back-azimuth of Ambrym volcano. (b, d, f, h) Detections around the back-azimuth of Yasur volcano. (a, b) Time versus azimuth (deg.). (c, d) Time versus frequency (Hz). (e, f) Time versus apparent velocity (m s−1). (g, h) Time versus amplitude (Pa).
The diurnal standard deviations can be above 1
5.2 Feasibility of thermospheric arrivals
Our models use ground intercepts with turning heights at stratospheric (
Interestingly, when arranging the multiyear observations from Ambrym direction by time of day (Figs 11b, d, f and h; also Fig. S15 in the Supporting Information), we see two 6-hr time windows with larger numbers of detections centred at
The diurnal variability is likely the effect of migrating solar tides in the mesosphere and thermosphere with dominant periods of 24, 12 and 8 hr (Le Pichon et al. 2005b). Throughout the year, the thermospheric duct turning height, mainly driven by changes in temperature, is consistently lower at 06:00 and 18:00 UTC (
The thermospheric duct dependence of Ambrym detections can also explain why they appear during zonal wind transition periods (equinoxes) in the multiyear stacked data set (March–June; August–October; Fig. 2f). From the ray tracing modelling point of view, when the stratospheric duct weakens, the cross-winds reduce for North–South infrasound, making long-range infrasound propagation less sensitive to initial launching parameters. Then, the modelled rays rely on the thermospheric duct, which does not impose strong cross-winds, allowing the iterative search to find arrivals near IS22 at the cost of increased transmission losses (see Figs 2f and 5). On the other hand, the detections from Yasur are produced by East–West infrasound propagation, and will be mostly dependent on the westward stratospheric duct (January–March; November–December; Fig. 7). This means that the diurnal variations will be less relevant as most of the detections are not influenced by thermospheric ducting changes.
We further investigate the transmission loss versus time of day relation with our methodology. Using range-independent hybrid atmospheric descriptions, we calculate transmission loss (TL) estimations for 2004 Ambrym and 2019 Yasur every 30 d at 00:00, 06:00, 12:00 and 18:00 UTC from the medians of the arrivals that fall at less than 50 km from IS22. Our results show that, as hypothesized, smaller attenuations concentrate at 06:00 and 18:00 UTC for both volcanoes (Fig. 12; Table S2, Supporting Information). Based on the wealth of observations at IS22 and ray tracing simulations, there is good evidence to indicate that thermospheric arrivals from Ambrym have indeed been detected at IS22, at 06:00 and 18:00 UTC, aided by lower thermospheric duct altitudes.
![Transmission loss (TL) [dB] values (medians and median absolute deviations) per year at 00:00, 06:00, 12:00 and 18:00 UTC for the cases of 2004 Ambrym and 2019 Yasur calculated with infraGA ray tracing (1 Hz). The values in this plot are also displayed in Table S2 in the Supporting Information. Note that all TL values are shown as magnitudes, avoiding the negative representation for easier comparison [i.e. greater value means higher TL (more negative)].](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/gji/241/1/10.1093_gji_ggaf027/1/m_ggaf027fig12.jpeg?Expires=1749150617&Signature=1Q5P01qOdBMO7BwonMFsu~hLA~ub9Twz1fyT3CUsGF~JUE-6bTkrkqukufl7VtHqr05INN7YkNX6Nk6wFMdz6~Wxg7Rqw5NInlsu1o7NNKFA1JE8dra4nYyaltWm2mafhV8EtiQenfradcTiSrctTJOdi0BJ9ov2TUkYew6sDmgZDBTYPn42vJV3W30Y5XgIYmqYohPKTkrsU~4jK-dvQbPAlS62M03ioFhvyyiDGFVUSV3268fOanymN6BQQPX8vqU5oMugXhWHu4WdxXzOTO5wZl26ECkUiH9XTZDD91o7oyDVOr3b4OR7IjaVxbNE8LKabT6X497fP~rtPpmsSA__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
Transmission loss (TL) [dB] values (medians and median absolute deviations) per year at 00:00, 06:00, 12:00 and 18:00 UTC for the cases of 2004 Ambrym and 2019 Yasur calculated with infraGA ray tracing (1 Hz). The values in this plot are also displayed in Table S2 in the Supporting Information. Note that all TL values are shown as magnitudes, avoiding the negative representation for easier comparison [i.e. greater value means higher TL (more negative)].
6 CONCLUSIONS
We tested a methodology that aims to calculate robust, first-order estimates of long-range infrasound arrival characteristics, particularly the back-azimuth deviation, using empirical climatologies. We produced year-long models of infrasound propagation for arrivals at IMS station IS22 (Grande Terre, New Caledonia) from the Vanuatu archipelago region, including from Yasur (Tanna island, at
Our goal was to investigate if the empirical climatologies were sufficient, and to what extent, to model the infrasound propagation path effects of the atmospheric changes in altitude of the horizontal winds, temperatures, densities and pressures during a year. We chose a case where the source locations can be assumed as known and nearly-continuous, hence leaving the source mislocation and detectability effects as induced by the atmospheric variability in time. We modelled such atmospheric parameters for infrasound propagation from the ground to 150 km with empirical climatologies (HWM14/MSIS2.0), and also considered two approaches for automatically perturbing the descriptions to enhance along-path stratospheric winds. Our simplified perturbation schemes are intended to mimic the effect of gravity-wave perturbations, without having to resort to a full statistical ensemble treatment and thus saving computation time and simplifying the implementation.
To evaluate the accuracy and limitations of the above climatology-based approaches, we also repeated the calculations using more accurate hybrid models that merge the ERA 5 descriptions (winds and temperatures) from the ground to
We found that the range-dependent propagation models provide comparable results to the range-independent cases, supporting the use of a simplified, layered model of the atmosphere for regional and global calculations. From July to April, all of the models depict seasonal back-azimuth changes that are generally aligned with the year-long and stacked multiyear observations. From May to June, the reversal of the zonal stratospheric winds from prevalent westwards to eastwards winds shows a reduction in observations, larger differences with the models, and only transient groups of detections likely produced by regional scale wind perturbations and transient favourable ducts (e.g. Sudden Stratospheric Warming events). The mean difference per year between the models and the high-frequency PMCC products shows that the empirical climatologies can provide comparable results with the hybrid models. The match is generally better for Yasur volcano than for Ambrym, which is likely the combined effect of increased crosswinds and propagation path lengths, plus other possible eruptive sources from similar locations such as Gaua and Ambae volcanoes complicating the attribution of detections uniquely to Ambrym. For specific years, only the hybrid models were capable of reproducing detections from transient ducts during the worst matching period (April–June). The hybrid models also better characterize the back-azimuth changes during the equinox transition periods towards the stratospheric seasonal wind reversal (around April and October).
The results from our analysis of the IS22 persistent multiyear long-range volcanic-infrasound detections indicate that empirical climatologies are useful and adequate for a rapid infrasound propagation calculation scenario (such as for a volcano monitoring system). This represents a step forwards from assuming a single value of celerity and neglecting cross-winds, which has previously been assumed for rapid calculations (e.g. Matoza et al. 2017). In particular, we found that the range-independent, raw climatology models are well-suited to obtain robust first-order estimates of arrival back-azimuths for large-scale, year-long infrasound propagation in this region. However, the climatologies do not perform as well as the hybrid models when the stratospheric duct weakens (equinoxes) or when short-term (days) wind reversals allow favourable ducts to appear during upwind conditions (summer). In such cases, a more accurate model with hybrid descriptions should be performed at a later time, given more time to download the specifications and run the simulations. We also emphasize that propagation models based on climatologies can be pre-computed, since climatologies do not use any current atmospheric data. Pre-computation of infrasound propagation paths using climatologies could be useful for near-real-time infrasound monitoring applications such as in volcano-monitoring systems (De Negri & Matoza 2023).
Our proposed adaptive perturbation of the empirical climatologies, which is intended as a simplified alternative to a full gravity-wave realization approach (ensemble averaging), was successful at enhancing stratospheric predictions. This approach may also be convenient for a rapid scenario where only an approximation of expected backazimuth deviations is needed. As stratospheric gravity waves are known to considerably reduce infrasound transmission losses (i.e. enhance propagation and ducting conditions; Listowski et al. 2024), future enhancements to the adaptive approach could be explored.
Finally, we remark that none of the atmospheric models we considered, including the hybrid specifications, fully explain all of the observations (e.g. more negative deviations on the first
This work focuses on volcanic infrasound detections at one station in the International Monitoring System. Generalizing these results will be important, both for other volcano-to-array paths globally and for other monitoring applications (e.g. global nuclear test monitoring), as mislocations due to unmodelled backazimuth deviations are a global issue. We suggest that additional long-range studies using other persistent volcanic source locations with large available detection records (e.g. Etna volcano) could be a step forward in this direction, further evaluating the applicability of this method beyond volcanic infrasound source location in near-real time. Additionally, considering other persistent anthropogenic or natural infrasound sources with large historical sets of detections (e.g. microbaroms, dams, etc.) could serve to understand to what extent this methodology could be applied to other infrasound generating phenomena.
ACKNOWLEDGMENTS
RSD thanks C. Scanlon for reviewing and improving the manuscript. All authors thank the CTBTO and the IMS station operators for guaranteeing the high quality of the infrasound data. The views expressed herein are those of the authors and do not necessarily reflect the views of the CTBTO Preparatory Commission. We also wish to thank Dr. David Green and Dr. Stephen Arrowsmith for their reviews, which helped us to improve the manuscript. This work was funded by National Science Foundation (NSF) Grant EAR-1847736.
DATA AVAILABILITY
All International Monitoring System data are available for scientific studies through the CTBTO vDEC platform (https://www.ctbto.org/specials/vdec/). The PMCC products used in this study can be accessed through the Geoportal of the Federal Institute for Geosciences and Natural Resources (BGR, Germany; https://geoportal.bgr.de) (see Hupe et al. 2022). For the hybrid models, we used the CDS API tools freely provided by European Centre for Medium-Range Weather Forecasts (ECMWF) to obtain the necessary ERA 5 reanalysis profiles, publicly available for academic research (https://confluence.ecmwf.int/display/CKB/How+to+download+ERA5). The climatology (HWM/MSISE) model codes are available from Drob et al. (2015) and Emmert et al. (2020), respectively. The 3-D ray tracing infrasound propagation algorithms are available for research purposes from the Los Alamos National Laboratory (LANL, USA) GitHub repository (https://github.com/LANL-Seismoacoustics/infraGA). The main code used in this project (ARCADE), is available for research purposes in the GitHub repository (https://github.com/rodrum/arcade) along with instructions and examples. The last stable code release can be accessed in Zenodo in 10.5281/zenodo.13383432 (De Negri 2024).
REFERENCES
Supplementary data
Figure S1. All detections from 2003 to 2022 for the four BGR products considered in this study (Hupe et al., 2021). (a) “High-frequency” products (1–3 Hz), which are used in this study. (b) “Microbarom high-frequency” (0.45–0.65 Hz) products, that also have detections (less numerous than the high-frequency) at Ambrym/Lopevi back-azimuths. (c) “Microbarom low-frequency” (0.15–0.35 Hz) products. (d) “Mountain-associated waves” (0.02–0.07 Hz) products.
Figure S2. (a) 2-D histogram of PMCC products centred around Yasur volcano back-azimuth (
Figure S3. Unlabelled version of Fig. 2 in main paper.
Figure S4. Similar to Fig. 2 in main paper, but using 26-band PMCC detections (2003 to 2022) instead.
Figure S5. Similar to Fig. 2 in main paper, but using 15-band PMCC detections from 2003 to 2017 instead.
Figure S6. Similar to Fig. 10 in main paper, but using 26-band PMCC detections from 2003 to 2022 (see Fig. S4).
Figure S7. Similar to Fig. 9 in main paper, but using 26-band PMCC detections for Ambrym (2004) and Yasur (2019) instead (see Figure S4).
Figure S8. Similar to Fig. 10 in main paper, but using 15-band PMCC detections from 2003 to 2017 (see Fig. S5).
Figure S9. Comparison of 7-d mean of detections with 90 per cent confidence interval and model results. (a) Scatter plot of the high-frequency (1–3 Hz) BGR detection products (Hupe et al. 2021) for 2004 (black diamonds) and all the models calculated for Ambrym (coloured dots, see legend on bottom). (b) Euclidean distance between the detections and the models for monthly means coloured by model type. (c) Mean yearly difference based on the monthly means coloured by model type with standard deviations. (d–f) Analogous but for Yasur in 2019.
Figure S10. Similar to Fig. S9, but from 2003 to 2022 (Hupe et al. 2021).
Figure S11. Similar to Fig. S9, but using 26-band PMCC detections.
Figure S12. Similar to Fig. S9, but using 26-band PMCC detection products from 2003 to 2022.
Figure S13. Similar to Fig. S9, but using 15-band PMCC detection products from 2003 to 2017.
Figure S14. Tested model parametrizations for perturbed profiles for 2004 Yasur. On the left: ground intercept (arrival) day versus turning height (see colourbar on the bottom). On the right: histogram of turning height of all ground intercepts per year. For simplicity, the parameters and summary of results are displayed in Table S1 following the labels of each subfigure.
Figure S15. Example 2-D histograms of detections by time of day for to the 2003–2017 15-band PMCC 1 to 3 Hz bandpass filtered data set (analogous to the PMCC products used in the main paper). (a, c, e, g) Detections around the back-azimuth of Ambrym volcano. (b, d, f, h) Detections around the back-azimuth of Yasur volcano. (a, b) Time versus azimuth (deg.). (c, d) Time versus frequency (Hz). (e, f) Time versus apparent velocity (m s−1). (g, h) Time versus amplitude (Pa).
Figure S16. Example of mean effective sound speed [
Table S1. Summary of parameters and results for perturbed models for 2004 Yasur. The three changing parameters are the maximum amplitude (
Table S2. Summary of TL values calculated with 3-D ray tracing (infraGA). The TL values were calculated for every day of the year, at four different UTC times of the day (00:00, 06:00, 12:00 and 18:00) for 2004 in the case of Ambrym, and 2019 for Yasur. The values in the table correspond to the UTC-time yearly medians with their median absolute deviations. The last column is the mean of each UTC-time yearly median with their standard deviation.