Pseudo-prospective earthquake forecasting experiment in Italy based on temporal variation of the b-value of the Gutenberg–Richter law

Summary of tested forecasting models.

Model	Main features	References
BVAL	Deterministic space/time-dependent model based on the observation that the b-value of the Gutenberg–Richter law inversely correlates with the accumulation of differential stress in crustal rocks. An alarm is triggered whenever the b-value drops below a certain threshold value.	(Scholz 1968, 2015; Amitrano 2003)
FORE	Deterministic space/time-dependent model based on the occurrence of potential foreshocks as precursor signals. An alarm is issued every time an earthquake within the potential foreshock magnitude range occurs.	(Gasperini et al. 2021)
ETAS	Epidemic-type aftershock sequence model based on the hypothesis that each earthquake can perturb the rate of subsequent earthquakes and generate its own Omori-like decay sequence. In this work, the SVP model (Console & Murru, 2001) is used to assess the background seismicity. The ETAS model is applied using an alarm-based deterministic approach, where an alarm is issued whenever the expected daily rate of target events exceeds a pre-defined threshold.	(Ogata 1988, 1989; Ogata & Zhuang 2006)

Model	Main features	References
BVAL	Deterministic space/time-dependent model based on the observation that the b-value of the Gutenberg–Richter law inversely correlates with the accumulation of differential stress in crustal rocks. An alarm is triggered whenever the b-value drops below a certain threshold value.	(Scholz 1968, 2015; Amitrano 2003)
FORE	Deterministic space/time-dependent model based on the occurrence of potential foreshocks as precursor signals. An alarm is issued every time an earthquake within the potential foreshock magnitude range occurs.	(Gasperini et al. 2021)
ETAS	Epidemic-type aftershock sequence model based on the hypothesis that each earthquake can perturb the rate of subsequent earthquakes and generate its own Omori-like decay sequence. In this work, the SVP model (Console & Murru, 2001) is used to assess the background seismicity. The ETAS model is applied using an alarm-based deterministic approach, where an alarm is issued whenever the expected daily rate of target events exceeds a pre-defined threshold.	(Ogata 1988, 1989; Ogata & Zhuang 2006)

Table 1.

Summary of tested forecasting models.

Model	Main features	References
BVAL	Deterministic space/time-dependent model based on the observation that the b-value of the Gutenberg–Richter law inversely correlates with the accumulation of differential stress in crustal rocks. An alarm is triggered whenever the b-value drops below a certain threshold value.	(Scholz 1968, 2015; Amitrano 2003)
FORE	Deterministic space/time-dependent model based on the occurrence of potential foreshocks as precursor signals. An alarm is issued every time an earthquake within the potential foreshock magnitude range occurs.	(Gasperini et al. 2021)
ETAS	Epidemic-type aftershock sequence model based on the hypothesis that each earthquake can perturb the rate of subsequent earthquakes and generate its own Omori-like decay sequence. In this work, the SVP model (Console & Murru, 2001) is used to assess the background seismicity. The ETAS model is applied using an alarm-based deterministic approach, where an alarm is issued whenever the expected daily rate of target events exceeds a pre-defined threshold.	(Ogata 1988, 1989; Ogata & Zhuang 2006)

Model	Main features	References
BVAL	Deterministic space/time-dependent model based on the observation that the b-value of the Gutenberg–Richter law inversely correlates with the accumulation of differential stress in crustal rocks. An alarm is triggered whenever the b-value drops below a certain threshold value.	(Scholz 1968, 2015; Amitrano 2003)
FORE	Deterministic space/time-dependent model based on the occurrence of potential foreshocks as precursor signals. An alarm is issued every time an earthquake within the potential foreshock magnitude range occurs.	(Gasperini et al. 2021)
ETAS	Epidemic-type aftershock sequence model based on the hypothesis that each earthquake can perturb the rate of subsequent earthquakes and generate its own Omori-like decay sequence. In this work, the SVP model (Console & Murru, 2001) is used to assess the background seismicity. The ETAS model is applied using an alarm-based deterministic approach, where an alarm is issued whenever the expected daily rate of target events exceeds a pre-defined threshold.	(Ogata 1988, 1989; Ogata & Zhuang 2006)

ANALYSIS REGION AND EARTHQUAKE CATALOGUE

As application region, we consider the overlap of two tessellations (⁠|${R}_1$| and |${R}_2$|⁠) of the Italian territory spanning from 7° E to 19° E in longitude and from 36° N to 47° N in latitude. Both tessellations consist of square cells with side length of |$L\ = \ 30\sqrt 2 $| km. The tessellation |${R}_2$| is shifted by L/2 both in latitude and in longitude with respect to |${R}_1$| (Fig. 1). The purpose of this dual tessellation is to enhance the spatial alignment between the forecasting features detected by different algorithms with the actual earthquakes. This approach almost doubles the portion of space covered by the alarms, thus reducing the efficiency of the various algorithms, but avoids failing a correct prediction because the earthquake to be predicted, although close, is located slightly outside the alarmed cell as in the sketch of Fig. 2, in which the precursor seismicity (blue circles) is able to predict the target earthquake (red star) using the red grid but not the black grid.

$Analysis region used for the models’ calibration, application and comparison. The red and black partially overlapped squares of side $L = 30\sqrt 2 \ $ km represent the tessellation ${R}_1$ and ${R}_2$, respectively. Each cell contain at least one earthquake with $\mathrm{ Mw} \ge 4.0$ occurred inland from 1600 to 1959 according to the CPTI15 catalogue (Rovida et al. 2022).$

Figure 1.

Analysis region used for the models’ calibration, application and comparison. The red and black partially overlapped squares of side |$L = 30\sqrt 2 \ $| km represent the tessellation |${R}_1$| and |${R}_2$|⁠, respectively. Each cell contain at least one earthquake with |$\mathrm{ Mw} \ge 4.0$| occurred inland from 1600 to 1959 according to the CPTI15 catalogue (Rovida et al. 2022).

Figure 2.

Sketch explaining the role of the double grid. The precursor seismicity (blue circles) is able to predict the target earthquake (red star) only using grid 2 (red lines) but not using grid 1 (black lines) because in the latter case it occurs close but outside the cell where the target occurs.

The tessellation |${R}_1$| was developed starting from the initial point |${x}_1$| = (7° E, 47° N) and moving the centre from |${x}_1$| in steps of L km for each cell, until reaching |${x}_2$| = (19° E, 36° N). The centres of the cells in tessellation |${R}_2$| are shifted relative to |${R}_1$| by −L/2 km in longitude and +L/2 km in latitude. To ensure uniform cell dimensions, we converted the coordinates of points |${x}_1$| and |${x}_2$| from the WGS84 geographic reference system to kilometric coordinates in the RDN2008 Italy zone (E-N) CRS EPSG:7794 using the QGIS software (see the Data Availability section).

Following Gasperini et al. (2021), Biondini et al. (2023) and Biondini & Gasperini (2023) we only consider cells in which at least one earthquake of magnitude |${M}_\mathrm{ w} \ge 4.0$| |$( {z \le 50\ \mathrm{ km}} )$| occurred on the Italian mainland during the period 1600–1959 according to the seismic catalogue CPTI15 (Rovida et al. 2022). Isolated cells not contiguous with the two main tessellations are excluded. The resulting grids are composed of 167 and 170 cells for |${R}_1$| and |${R}_2$|⁠, respectively (Fig. 1). This approach is consistent with the methodology defined by Gasperini et al. (2021) to prevent the overestimation of the forecasting performance by excluding predominantly aseismic areas, such as the island of Sardinia.

To calibrate and pseudo-prospectively test the relative efficiency of various forecasting models, we use the HORUS catalogue (Lolli et al. 2020; see the Data Availability section), which collect earthquakes from 1960 to the present and is homogeneous in terms of magnitudes. To balance statistical resolution and the inherent uncertainties in magnitude measurements, we preliminarily bin the magnitudes to a single decimal digit. Hence, all magnitude thresholds mentioned below must be understood as reduced by half of the bin size (0.05).

During the learning period ranging from 1990 to 2004, the catalogue reports 18 target shocks with |${M}_\mathrm{w} \ge 5.0$| and 8 with |${M}_\mathrm{w} \ge 5.5$| within the double tessellation. In the testing period ranging from 2005 to 2022, it reports 34 target shocks with |${M}_\mathrm{w} \ge 5.0$| and 12 with |${M}_\mathrm{w} \ge 5.5$|⁠.

According to Lolli et al. (2020), the HORUS catalogue can be considered complete for the Italian mainland for |${M}_\mathrm{w} \ge 4.0\ $| since 1960, for |${M}_\mathrm{w} \ge 3.0$| since 1981, for |${M}_\mathrm{w} \ge \ 2.5$| since 1990, for |${M}_\mathrm{w} \ge 2.1$| since 2003 and for |${M}_\mathrm{w} \ge 1.8$| since 2005.

However, for the purpose of determining the b-value, using the method of magnitude differences (van der Elst 2021; Tinti & Gasperini 2024), which is notably resilient to incompleteness, we re-estimated the completeness magnitude (⁠|${M}_\mathrm{ c}$|⁠) using the maximum curvature method (Wiemer & Wyss 2000; Woessner & Wiemer 2005), which is less accurate but preserves more data. The seismicity distribution used for these calculations is illustrated in Fig. 3, which shows the epicentres of earthquakes within the analysis region (R) and the CPTI15 polygon, highlighting the spatial coverage and density of seismic events in the study area. The re-estimated |${M}_\mathrm{ c}$| are |${M}_\mathrm{w} \ge 2.7\ $| since 1960, |${M}_\mathrm{w} \ge 2.0\ $| since 1981, |${M}_\mathrm{w} \ge \ 2.1$| since 1997, |${M}_\mathrm{w} \ge 1.9$| since 2003 and |${M}_\mathrm{w} \ge 1.2$| since 2005 (Fig. 4).

Figure 3.

Seismicity map of Italy for the period 1990–2023 from HORUS catalogue (Lolli et al. 2020). Black dots indicate the epicentres of earthquakes within the analysis region (R) represented by the magenta shape, while grey dots indicate the epicentres of earthquakes within the CPTI15 polygon (Rovida et al. 2020) represented by the blue shape.

$Time versus magnitude distribution of earthquakes (black dots) occurred within the analysis regions from 1960 to 2023 ($z \le 50$ km) according to the HORUS catalogue (Lolli et al. 2020) and used for the models’ calibration, testing and optimization. The red line represents the magnitude of completeness estimated using the maximum curvature method (Wiemer & Wyss 2000).$

Figure 4.

Time versus magnitude distribution of earthquakes (black dots) occurred within the analysis regions from 1960 to 2023 (⁠|$z \le 50$| km) according to the HORUS catalogue (Lolli et al. 2020) and used for the models’ calibration, testing and optimization. The red line represents the magnitude of completeness estimated using the maximum curvature method (Wiemer & Wyss 2000).

COMPUTATION OF b-VALUE

Up to a few years ago, the determination of the b-value was usually done through a formula, derived many decades ago by Utsu (1965) and Aki (1965), which assumes a continuous exponential distribution of magnitudes. If the magnitudes are binned to only one decimal digit, Utsu (1966) suggested an approximate correction to the formula consisting of subtracting one-half of the binning size (0.05) from the minimum magnitude of completeness of the data set. As the distribution of binned magnitudes is not continuous but discrete, Guttorp & Hopkins (1986) derived the exact formula, using the geometric distribution theory. Such formula, however, was almost totally neglected by the following literature (see Tinti & Gasperini 2024 for a thorough overview).

More recently, van der Elst (2021) showed that the b-value can be determined using the distribution of positive differences between magnitudes of successive shocks and, in this case, the estimation is more resilient to the incompleteness of the data set, particularly if the differences lower than a given trimming threshold |${\rm{\Delta }}{M}_\mathrm{ c}^{\prime}$| are discarded. Tinti & Gasperini (2024) confirmed that the effect of incompleteness is mitigated by using trimmed magnitude differences but not that positive differences are significantly better than negative ones. However, as the so-called b-positive (b⁺) estimator proposed by van der Elst (2021) is largely used in the most recent literature, we adopt it even in this work. For computing the b-value, we use the expression proposed by Tinti & Gasperini (2024):

$$\begin{eqnarray} b = \frac{1}{{2\delta {\rm{ln}}\left( {10} \right)}}{\rm{ln}}\left[ {\frac{{\overline {{\rm{\Delta }}M} - {\rm{\Delta }}M_\mathrm{ c}^{\rm{^{\prime}}} + 2\delta }}{{\overline {{\rm{\Delta }}M} - {\rm{\Delta }}M_\mathrm{ c}^{\rm{^{\prime}}}}}} \right], \end{eqnarray}$$

(1)

where |$\delta $| is equal to one half of the magnitude bin size (typically 0.05) and |$\overline {{\rm{\Delta }}M} $| is the average of the positive magnitude differences. Such expression is algebraically equivalent to that proposed by van der Elst (2021) in its eq. (A3) but can be preferred because it does not use the hyperbolic arc-cotangent function, which is not provided in some software environments. Although the b⁺ estimator is more resilient than methods based on the distribution of magnitudes, it is not completely insensitive to incompleteness. Hence, a completeness analysis (Woessner & Wiemer 2005) is needed anyhow before using it.

van der Elst (2021), in analysing real sequences, considered two options: in the first one, he used all catalogued earthquakes and then applied the b⁺ estimator above a minimum positive magnitude difference |${\rm{\Delta }}{M}_\mathrm{ c}^{\prime}$| ranging from 0.3 to 0.4 units, while in the second option, he used only the earthquakes with magnitude above the completeness magnitude |${M}_\mathrm{ c}$| computed as that of maximum curvature of the FMD + 0.2 according to Woessner & Wiemer (2005) and applied the b⁺ estimator to all positive magnitude differences. He obtained similar results in the two cases. However, as the true b-value of real sequences is not known, it is not possible to decide which strategy is the best. Both selection strategies imply a reduction of the number of remaining data available for b-value determination but, according to the computation made by van der Elst (2021) for the 2019 Ridgecrest Sequence, the first strategy (⁠|${\rm{\Delta }}{M}_\mathrm{ c}^{\prime}$| cut and not |${M}_\mathrm{ c}$| cut) would seem to preserve more data.

Tinti & Gasperini (2024), by statistical analyses of simulated data sets, showed that the theoretical b-value is correctly determined when the trimming threshold is equal to the bin size (only |${\rm{\Delta }}M = 0$| discarded), if the cutting magnitude |${M}_\mathrm{ c}$| is assumed to be that of maximum curvature of the FMD (without + 0.2 correction), and when the data set is strongly incomplete data sets (without |${M}_\mathrm{ c}$| cut), if using trimming thresholds four to five times the binning size. Such analyses also indicate that the number of remaining data is 20–30 per cent larger in the former case. Hence in the following, we will adopt the strategy of cutting magnitudes below the maximum curvature of the FMD and considering all positive differences (⁠|${\rm{\Delta }}M > 0$|⁠).

SETTING UP THE BVAL FORECASTING METHOD

We first remove all magnitudes below the point of maximum curvature of the FMD determined as reported in section analysis region and earthquake catalogue. The b-value is then calculated in each tessellation cell starting from the beginning of the catalogue, using a sampling window with a fixed number of events, |${N}_m = 150$| according to Gulia et al. (2024). For each sampling window, we compute the difference vector D obtained by subtracting the magnitude of each earthquake from the magnitude of the preceding one, that is, |${D}_i\ = \ ( {{m}_i\ - \ {m}_{i - 1}} )$| for |$i\ = \ 2,\ 3,\ \ldots,\ {N}_m$|⁠. We then filter out any negative or zero difference and compute the average of the remaining ones denoted as |$\overline {{\rm{\Delta }}M} $|⁠. The latter is used in eq. (1) to calculate the b-value. We create a temporal series of the b-values by moving forward the sampling windows one event at time and recalculating the parameter for each new window until the end of the data set is reached. According to this forecasting model, alarms are triggered each time the b-value falls below a critical threshold value |${b}_{\mathrm{ th}}$|⁠, coming from a larger value. The alarm lasts for a duration |$\Delta t$|⁠, starting from the time of the last earthquake in the sample of |${N}_m$| events. If the b-value in the previous window was already below the threshold, no alarm is issued to avoid repeated warnings for the same low b-value condition. The choice of the optimal threshold |${b}_{\mathrm{ th}}$| is determined in a specific calibration step where various thresholds are tested using the learning data set to find the value that best balances the sensitivity (detecting potential seismic events) and specificity (minimizing false alarms). Using the calibrated threshold, we evaluate pseudo-prospectively the efficiency of the forecasting method by applying it to the testing data set.

TESTING ALARM-BASED FORECASTING METHODS

A target earthquake is considered successfully predicted if it occurs within a cell where one or more alarm windows of duration |${\rm{\Delta }}t$| are active. Conversely, it is considered a failure to predict if it occurs outside any active alarm window. Thus, the prediction of a target earthquake is evaluated according to a dichotomy: predicted or not predicted.

According to Molchan (1990, 1991) and Molchan & Kagan (1992), the miss rate |$\nu $|⁠, that is the fraction of unpredicted earthquakes, is computed as:

$$\begin{eqnarray} \nu = \frac{{N - h}}{N}, \end{eqnarray}$$

(2)

where N is the total number of target events and h is the number of target events that were successfully predicted. Additionally, the overall fraction of space-time occupied by alarms is computed as:

$$\begin{eqnarray} \tau = \frac{1}{K}\mathop \sum \limits_{i = 1}^K \left( {\frac{{d{c}_i}}{T}} \right), \end{eqnarray}$$

(3)

where K is the total number of cells, T is the total duration of the forecasting experiment and |$d{c}_i$| is the total time coverage of alarms within each i-th spatial cell. |$d{c}_i$| can be calculated by multiplying the alarm time |${\rm{\Delta }}t$| by the number |$n\ $| of defined alarms and then subtracting the sum of the intersections between the different alarm windows |$\cap {t}_\mathrm{ s}$| as follows:

$$\begin{eqnarray} d{c}_i = \left( {{\rm{\Delta }}t\ \times n} \right) - \sum \cap {t}_\mathrm{ s}. \end{eqnarray}$$

(4)

Following Shebalin et al. (2011), we also calculate a weighted fraction of space-time occupied by alarms, which accounts for the long-term earthquake rate within each cell. Such rates are computed using the Italian historical earthquake data in the period 1620–1959, according to the CPTI15 V4.0 catalogue (Rovida et al. 2020, 2022; see the Data Availability section), following the procedure described in Gasperini et al. (2021). The weighted fraction of space-time occupied by alarms is then computed as:

$$\begin{eqnarray} {\tau }_\mathrm{ w} = \frac{{\mathop \sum \nolimits_{i = 1}^K {{\bar{\lambda }}}_i\left( {\frac{{d{c}_i}}{T}} \right)}}{{\mathop \sum \nolimits_{i = 1}^K {{\bar{\lambda }}}_i}}, \end{eqnarray}$$

(5)

where |${\bar{\lambda }}_i$| is the long-term average rate of earthquakes |${M}_\mathrm{w} \ge 4.0$| for each i-th cell. Details of weight computations are reported in Table S1 (Supporting Information). This weighting procedure penalizes forecasts in highly seismic areas, where predictions are inherently easier, by increasing the space-time fraction occupied by alarms proportionally to the historical earthquake rate.

The miss rate |$\nu $| and the fractions of space-time occupied by alarms |${\tau }_\mathrm{ u}$| (unweighted) or |${\tau }_\mathrm{ w}$| (weighted) are used to construct the so-called Molchan diagram (Molchan 1990, 1991, 1997; Molchan & Kagan 1992; Molchan & Keilis-Borok 2008). The latter is a graphical tool commonly used to assess the forecasting performance of alarm-based models (see for example the one reported in Fig. 5).

$Molchan diagram and AS scores of the BVAL model for target earthquake with ${{\it{M}}}_{\rm{w}} \ge 5.0$ occurred from 2005 to 2022. Red and blue lines indicate the Molchan trajectories for ${{\it{b}}}_{{\rm{th}}} = 0.9$, using unweighted (red) and weighted (blue) space-time occupied by alarms (${{\rm{\tau }}}_{\rm{u}}$ and ${{\rm{\tau }}}_{\rm{w}}$), respectively (see the main text). The solid black diagonal line represents the Molchan trajectory of a purely random method, which forecasts earthquakes in proportion to the space-time occupied by alarms. This line divides the diagram into two parts: the area above the diagonal, indicating unskilled performance, and the area below the diagonal, indicating skilled performance. The light blue, violet and green lines represent the confidence limits for ${\rm{\alpha }} = 50,{\rm{\ }}5$ and $1$ per cent, respectively. The grey dotted lines indicate the probability gains G = 2, 5, 10, 20 and 40.$

Figure 5.

Molchan diagram and AS scores of the BVAL model for target earthquake with |${{\it{M}}}_{\rm{w}} \ge 5.0$| occurred from 2005 to 2022. Red and blue lines indicate the Molchan trajectories for |${{\it{b}}}_{{\rm{th}}} = 0.9$|⁠, using unweighted (red) and weighted (blue) space-time occupied by alarms (⁠|${{\rm{\tau }}}_{\rm{u}}$| and |${{\rm{\tau }}}_{\rm{w}}$|⁠), respectively (see the main text). The solid black diagonal line represents the Molchan trajectory of a purely random method, which forecasts earthquakes in proportion to the space-time occupied by alarms. This line divides the diagram into two parts: the area above the diagonal, indicating unskilled performance, and the area below the diagonal, indicating skilled performance. The light blue, violet and green lines represent the confidence limits for |${\rm{\alpha }} = 50,{\rm{\ }}5$| and |$1$| per cent, respectively. The grey dotted lines indicate the probability gains G = 2, 5, 10, 20 and 40.

In a paradoxical forecasting method that does not issue any alarm, we have |$\tau = 0$| and it is impossible to forecast any target event. This corresponds to the point |$( {\tau ,\ \nu } ) = ( {0\ \mathrm{ per}\ \textrm{cent},\ 100\ \mathrm{ per}\ \textrm{cent}} )$| in the upper left corner of the diagram of Fig. 5. On the other hand, if the method issues a set of alarms that cover the entire space-time volume, all target earthquakes are certainly predicted. This corresponds to the point |$( {\tau ,\ \nu } ) = ( {100\ \mathrm{ per}\ \textrm{cent},\ \ 0\ \mathrm{ per}\ \textrm{cent}} )$|⁠. The diagonal line connecting these two points of equation |$\nu = 1 - \tau $| represents the prediction performance of a purely random method that simply forecast earthquakes proportionally to the space-time fraction occupied by alarms.

By analysing different alarm durations |$\Delta t$|⁠, we calculated the corresponding values of |$\tau $| and |$\nu $| for each duration, allowing us to plot a set of points on the Molchan diagram forming a path known as the `Molchan trajectory`. Models with trajectories that lie below the diagonal line and are close to the lower left-hand corner of coordinates |$( {\tau ,\nu } ) = ( {0\ \mathrm{ per}\ \textrm{cent},0\ \mathrm{ per}\ \textrm{cent}} )$| show higher predictive capabilities than the random prediction model. In contrast, models with trajectories extending above the diagonal line show lower predictive performance than the random model (see Fig. 5). The ratio between the success rate and the space-time fraction occupied by alarms,

$$\begin{eqnarray} G = \frac{{1 - \nu }}{\tau } \end{eqnarray}$$

(6)

indicates the probability gain compared to the random chance (Kagan 2009). A value of |$G = 1$| aligns with the random chance diagonal, while |$G > 1$| characterises skilled forecasting methods|$.$|

The Molchan diagram provides insights into how the miss rate fluctuates with changes in the fraction of space-time covered by alarms, highlighting the essential trade-off between alarm coverage and minimizing failure rates. However, it remains primarily a visual tool. To quantitatively assess the forecasting performance of models, Zechar & Jordan (2008, 2010) introduced the area skill (AS) score |${a}_\mathrm{f}( \tau )$|⁠. The latter quantifies how much better or worse a model performs compared to a random prediction baseline and is defined as the integral of the success rate |$1 - \nu ( \tau )$| normalized to the alarm space-time coverage |$\tau $|⁠:

$$\begin{eqnarray} {a}_\mathrm{f}\left( \tau \right) = \frac{1}{\tau }\mathop \smallint \limits_0^\tau \left[ {1 - \nu \left( t \right)} \right]\mathrm{ d}t. \end{eqnarray}$$

(7)

The overall AS score, calculated across the entire Molchan trajectory ranging from |$\tau = 0$| per cent to |$\tau = 100$| per cent, quantifies the area above the Molchan trajectory. The AS score is normalized such that its value ranges from 0 to 1, with values closer to 1 indicating better forecasting performance than the reference random model, represented by the diagonal line (Fig. 5), which is characterized by an overall AS score of 0.5.

Zechar & Jordan (2008, 2010) also derived a Gaussian asymptotical estimate of the variance of |${a}_\mathrm{f}( \tau )$| as |${\sigma }^2 = \frac{1}{{12}}N$|⁠, where N indicates the number of target shocks. While the AS score can be computed for any |$\tau $|⁠, Zechar & Jordan (2008, 2010) suggested that the statistical power of the test generally improves with larger values of |$\tau $|⁠. Therefore, they recommended to use |${a}_\mathrm{f}( {\tau = 100\ \mathrm{ per}\ \textrm{cent}} )$| for hypothesis testing to maximize the sensitivity of the analysis.

CALIBRATION AND OPTIMIZATION OF FORECASTING MODELS

BVAL, FORE and ETAS models are applied to forecast target earthquakes with |$M_{ \mathrm{ w}} \ge 5.0$|⁠, which occurred inland in Italy during the learning period 1990–2004. As possible precursory seismicity, also the data in the preceding 15 yr interval from 1975 to 1989 is considered.

Following the approach of Biondini & Gasperini (2023), we used the maximum AS score as the criterion for selecting the optimal forecasting thresholds for each analysed model. Additionally, we also provide information on the fraction of space-time (⁠|${\tau }_{1\mathrm{ yr}}$|⁠) occupied by alarms of one year duration (⁠|${\rm{\Delta }}t = \ $|1 yr) for each analysed threshold. This metric indicates the proportion of time and geographic area where alarms were issued. Even if not used for the optimal threshold selection, this additional information provides a comprehensive view of the operational impact of the chosen thresholds. For example, a high fraction of space-time occupied by alarms may indicate a high number of false alarms, potentially reducing the model practical utility. Conversely, a too-low fraction may suggest that the chosen precursor is overly prudential and may miss target events.

For the BVAL model, in Fig. 6 (with numerical values reported in Table S2 of Supporting Information) we show the behaviour of the AS scores (lines) and of |${\tau }_{1\mathrm{ yr}}$| (grey bars) with the b-value threshold |${b}_{\mathrm{ th}}$| varying from 0.50 to 1.30. The red and blue lines indicate the AS scores as a function of the unweighted and weighted fraction of space-time occupied of alarms, respectively. For both curves the highest overall scores (⁠|${a}_\mathrm{f}( {{\tau }_\mathrm{u} = 100{\rm{\ per\ cent}}} )$|=0.81 and |${a}_\mathrm{f}( {{\tau }_\mathrm{w} = 100{\rm{\ per\ cent}}} )=$|0.79, respectively) are obtained for |${b}_{\mathrm{ th}} = 0.90$|⁠. For such threshold the fraction of space-time covered by alarms of duration |${\rm{\Delta }}t = \ $|1 yr is |${\tau }_{1\mathrm{ yr}} = 0.017$| that is 1.7 per cent of the total duration of the learning time interval.

$AS score trend for the BVAL model computed for targets earthquakes with magnitude ${{\it{M}}}_{\rm{w}} \ge 5.0$, using the unweighted (red line) and weighted (blue line) fractions of space-time occupied by alarms. The grey bars represent the fractions of space-time occupied by alarms of duration equal to 1 yr $( {{{\rm{\tau }}}_{{1}_{{\rm{yr}}}}} )$. The chosen b-value threshold (${{\it{b}}}_{{\rm{th}}} = 0.9$) is characterized by the highest AS score and is indicated by the black arrowhead. The AS is represented on the left vertical axis, and the fractions of space-time occupied by alarms are reported on the right vertical axis.$

Figure 6.

AS score trend for the BVAL model computed for targets earthquakes with magnitude |${{\it{M}}}_{\rm{w}} \ge 5.0$|⁠, using the unweighted (red line) and weighted (blue line) fractions of space-time occupied by alarms. The grey bars represent the fractions of space-time occupied by alarms of duration equal to 1 yr |$( {{{\rm{\tau }}}_{{1}_{{\rm{yr}}}}} )$|⁠. The chosen b-value threshold (⁠|${{\it{b}}}_{{\rm{th}}} = 0.9$|⁠) is characterized by the highest AS score and is indicated by the black arrowhead. The AS is represented on the left vertical axis, and the fractions of space-time occupied by alarms are reported on the right vertical axis.

For the FORE model, an alarm is issued in a spatial cell every time a potential foreshock with |$M_{ \mathrm{ w}} = M \pm {\rm{\Delta }}M$| occurs within that cell (Gasperini et al. 2021). The optimal range of potential foreshocks was determined by comparing the AS scores obtained in the learning interval by using magnitude ranges with central values |$4.1 \le M \le 4.8$|⁠, and deviations from the central value |$0.1 \le {\rm{\Delta }}M \le 0.4$| but ensuring that the upper bound is not larger than 4.9 (⁠|$M + {\rm{\Delta }}M \le 4.9$|⁠), so that the magnitude of the foreshocks is lower than that of any target earthquakes (Fig. 7; with values in Table S3 of Supporting Information). The maximum overall AS scores (⁠|${a}_\mathrm{f}( {{\tau }_\mathrm{u} = 100{\rm{\ per\ cent}}} )=$|0.90 and |${a}_\mathrm{f}( {{\tau }_\mathrm{w} = 100{\rm{\ per\ cent}}} )$|0.88) are obtained for |$M_{ \mathrm{ w}} = 4.5 \pm 0.3$|⁠. For such ranges, the fraction of space-time covered by alarms of duration |${\rm{\Delta }}t = \ $|1 yr is |${\tau }_{1\mathrm{ yr}} = 0.045$| that is 4.5 per cent of the total duration of the learning time interval.

$Same as Fig. 6 for the FORE model. The AS score and the fraction of space-time occupied by alarms are calculated based on the potential foreshock magnitude range. The arrowhead indicates the selected optimal magnitude range (Mw = 4.5 ± 0.3).$

Figure 7.

Same as Fig. 6 for the FORE model. The AS score and the fraction of space-time occupied by alarms are calculated based on the potential foreshock magnitude range. The arrowhead indicates the selected optimal magnitude range (M_w = 4.5 ± 0.3).

For the ETAS model, an alarm is issued, in a spatial cell, whenever the expected daily rate (⁠|${\lambda }_{\textrm{daily}}$|⁠) exceeds a given threshold. ETAS parameters were fitted using the maximum likelihood approach (see details in Appendix A) for the learning period (1990–2004), separately for the two tessellations, R₁ and R₂ (see Table A1). Following Biondini & Gasperini (2023), the expected rate of earthquakes with |${M}_\mathrm{w} \ge 5.0$| is recalculated for each cell whenever an earthquake with |${M}_\mathrm{w} \ge 2.5$| occurs within the entire observation region R = (⁠|${R}_1 + {R}_2)$|⁠. Each cell is thus characterised by its own time-series of expected |${\lambda }_{\textrm{daily}}$|⁠.

The optimal alarm threshold |${\lambda }_{\mathrm{ th}}$| for the expected daily earthquake rate is determined by varying it logarithmically from |$5 \times {10}^{ - 7}$| to |$9 \times {10}^{ - 3}$| (Fig. 8; with values in Table S4 in the Supporting Information). The maximum overall AS scores (⁠|${a}_\mathrm{f}( {{\tau }_\mathrm{u} = 100{\rm{\ per\ cent}}} )=$| 0.88 and |${a}_\mathrm{f}( {{\tau }_\mathrm{w} = 100{\rm{\ per\ cent}}} )=$| 0.83) are obtained for |${\lambda }_{\mathrm{ th}} = 1 \times {10}^{ - 5}$|⁠. For such threshold the fraction of space-time covered by alarms of duration |${\rm{\Delta }}t = \ $| 1 yr is |${\tau }_{1\mathrm{ yr}} = 0.337$| that is 33.7 per cent of the total duration of the learning time interval. Such |${\tau }_{1\mathrm{ yr}}$|⁠, definitely larger than the one obtained by optimizing other two methods, highlights, right from this optimization stage, a different behaviour of the ETAS model compared to the other two forecasting models, which might condition its practical applicability.

$Same as Fig. 6 for the ETAS model. The AS score and the fraction of space-time occupied by alarms are calculated based on the daily expected rates thresholds. The arrowhead indicates the selected optimal alarm rate threshold ${{\rm{\lambda }}}_{{\rm{th}}}$ = 1.00E − 5.$

Figure 8.

Same as Fig. 6 for the ETAS model. The AS score and the fraction of space-time occupied by alarms are calculated based on the daily expected rates thresholds. The arrowhead indicates the selected optimal alarm rate threshold |${{\rm{\lambda }}}_{{\rm{th}}}$| = 1.00E − 5.

RESULTS OF THE PSEUDO-PROSPECTIVE TESTING

We used the optimal thresholds and ranges determined in the previous step for applying the simple model BVAL, FORE and ETAS to pseudo-prospectively predict 34 earthquakes with magnitudes |${M}_\mathrm{w} \ge 5.0$| and 12 earthquakes with magnitudes |${M}_\mathrm{w} \ge $| 5.5 occurred during the 18 yr of the pseudo-prospective test period 2005–2022. As possible precursor seismicity of targets events in the testing period 2005–2022, we used also the data in the preceding 18 yr interval from 1986 to 2004.

We use the same optimal thresholds even for applying the two ensemble models EADD and EMUL, obtained by summing and intersecting, respectively, the alarm windows of the two simple BVAL and FORE models.

In Fig 5, the Molchan trajectories of the BVAL model applied to the prediction of target earthquakes |$M \ge 5.0$| are shown. They are obtained by varying the alarm time |$( {{\rm{\Delta }}t} )$| from fractions of seconds to the total duration of the pseudo-prospective prediction experiment (T = 18 yr). The red and blue lines correspond to unweighted (⁠|${\tau }_\mathrm{u}$|⁠) and weighted (⁠|${\tau }_\mathrm{w}$|⁠) fractions of space-time occupied by alarms, respectively (see in Table S5 of the Supporting Information, the numerical values of the plotted trajectories). Both the red and blue Molchan trajectories lie well below the diagonal line representing the predictive performance of a purely random method and even below the 1 per cent confidence level threshold, highlighting their higher predictive ability. The BVAL model fails to predict the total number of target earthquakes with |${M}_\mathrm{w} \ge 5.0$| even with |${\rm{\Delta }}t$|=18 yr, for which only 30/34 (88.2 per cent) are predicted with time-space coverages of |${\tau }_\mathrm{u} \approx $| 33 per cent and |${\tau }_\mathrm{w} \approx $| 51 per cent. With |${\rm{\Delta }}t$| = 1 yr, approximately 68 per cent of target earthquakes are successfully forecasted, achieving space-time coverages of |${\tau }_\mathrm{u}$|= 8.6 per cent and |${\tau }_\mathrm{w}$|= 12.7 per cent. For a |${\rm{\Delta }}t$| = 3 months (0.25 yr), 64.7 per cent of targets earthquakes are successfully forecasted with |${\tau }_\mathrm{u}$| = 2.96 per cent and |${\tau }_\mathrm{w}$| = 4.45 per cent. Using |${\rm{\Delta }}t$| = 1 d, 50 per cent of target earthquakes are successfully forecasted, with |${\tau }_\mathrm{u}$| = 0.05 per cent and |${\tau }_\mathrm{w}$| = 0.09 per cent. The overall AS scores |${a}_\mathrm{f}( {{\tau }_{\mathrm{ u}} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.88 and |${a}_\mathrm{f}( {{\tau }_\mathrm{w} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.85 are significantly larger of the expected score of the random model (0.5), based on the Student's t-test. The Molchan diagram for the BVAL model applied to predict target earthquakes with |${M}_\mathrm{w} \ge 5.5$| is shown in Fig. S1 of Supporting Information. It shows high prediction skills, with overall AS scores of |${a}_\mathrm{f}( {{\tau }_\mathrm{u} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.92 and |${a}_\mathrm{f}( {{\tau }_\mathrm{w} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.90 for the unweighted and weighted fraction of space-time occupied by alarms, respectively. Even with |${\rm{\Delta }}t$| =18 yr, the model forecasts only 11 of the 12 target events (92 per cent), with |${\tau }_\mathrm{u}$| and |${\tau }_\mathrm{w}$| remaining unchanged from the previous case due to consistent alarm issuance. For |${\rm{\Delta }}t$| = 1 yr and |${\rm{\Delta }}t$| = 1 d, 83 per cent (10/12) and 67 per cent (8/12) of target shocks are predicted, respectively. The numerical values of the plotted miss rates |$\nu $| and of the AS scores as a function |${\tau }_\mathrm{u}$| and |${\tau }_\mathrm{w}$| for targets with |${M}_\mathrm{w} \ge 5.0$| And |${M}_\mathrm{w} \ge 5.5$| are reported in Tables S5 and S6 (Supporting Information), respectively.

For the FORE model, Fig. 9 shows the Molchan trajectories for target earthquakes |${M}_\mathrm{w} \ge 5.0$|⁠. As for the BVAL model, also the FORE model overperform the prediction performance of the random method since the Molchan trajectories for both unweighted and weighted fraction of space-time occupied by alarms lies well below the diagonal line and the 1 per cent confidence level threshold. Unlike the BVAL model, the FORE model predicts all the target earthquakes with |${M}_\mathrm{w} \ge 5.0$| for |${\rm{\Delta }}t$|=18 yr, for which |${\tau }_\mathrm{u}$| = 41.5 per cent and |${\tau }_\mathrm{w}$| = 58.2 per cent. For |${\rm{\Delta }}t$|= 1 yr, approximately 79 per cent (27/34) of targets are predicted with |${\tau }_\mathrm{u}$| = 3.4 per cent and |${\tau }_\mathrm{w}$| = 5.5 per cent. With |${\rm{\Delta }}t$| = 3 months, 74 per cent (25/34) of target earthquakes are predicted with |${\tau }_\mathrm{u}$| = 0.95 per cent and |${\tau }_\mathrm{w}$|=1.58 per cent. For |${\rm{\Delta }}t$| = 1 d, 47 per cent (16/34) of target earthquakes are predicted with |${\tau }_{\mathrm{ u}}$| = 0.01 per cent and |${\tau }_\mathrm{w}$| = 0.03 per cent. The overall AS scores |${a}_\mathrm{f}( {{\tau }_\mathrm{u} = 100\ \mathrm{ per}\ \textrm{cent}} )$|=0.95 and |${a}_\mathrm{f}( {{\tau }_\mathrm{w} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.92 are significantly higher than the random model's expected AS score (0.5). The Molchan diagram of the FORE model applied to |${M}_\mathrm{w} \ge $| 5.5 earthquakes (Fig. S2, Supporting Information) show very high prediction skills, with |${a}_\mathrm{f}( {{\tau }_\mathrm{u} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.98 and |${a}_\mathrm{f}( {{\tau }_\mathrm{w} = 100{\rm{\ per\ cent}}} )$| = 0.97. The total number of target events (12) are predicted starting from |${\rm{\Delta }}t$| = 9 yr with |${\tau }_\mathrm{u}$| = 24.8 per cent and |${\tau }_\mathrm{w}$| = 36.3 per cent. For |${\rm{\Delta }}t$| = 1 yr and |${\rm{\Delta }}t$| = 1 d the 92 per cent (11/12) and the 58 per cent (7/12) of target shocks are predicted. Detailed numerical values and AS scores are provided in Tables S7 and S8 (Supporting Information).

$Same as Fig. 4 but for the FORE model applied to predict target shocks with ${{\it{M}}}_{\rm{w}} \ge 5.0$.$

Figure 9.

Same as Fig. 4 but for the FORE model applied to predict target shocks with |${{\it{M}}}_{\rm{w}} \ge 5.0$|⁠.

For the ETAS model, Fig. 10 displays the Molchan trajectories for target earthquakes with |${M}_\mathrm{w} \ge 5.0$|⁠. Like the two previous models, ETAS outperforms the random prediction method, as both the unweighted and weighted Molchan trajectories lie well below the diagonal line and even below the 1 per cent confidence level threshold. It predicts all target events with |${\rm{\Delta }}t$| = 3 d or larger, with |${\tau }_\mathrm{u}$| = 30.1 per cent and |${\tau }_\mathrm{w}$| = 48.1 per cent. With |${\rm{\Delta }}t$| = 1 yr, the fraction of space-time occupied by alarm increase to |${\tau }_{\mathrm{ u}}$| = 33.4 per cent and |${\tau }_\mathrm{w}$| = 53.2 per cent. For |${\rm{\Delta }}t$| = 1 d, 97 per cent (33/34) of target earthquakes are predicted with |${\tau }_\mathrm{u}$| = 20.8 per cent and |${\tau }_\mathrm{w}$| = 33.9 per cent. The overall AS scores are |${a}_\mathrm{f}( {{\tau }_\mathrm{u} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.97 and |${a}_\mathrm{f}( {{\tau }_\mathrm{w} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.95. The Molchan diagram for the ETAS model applied to |${M}_\mathrm{w} \ge $| 5.5 earthquakes (Fig. S3, Supporting Information) show very high prediction skills, with |${a}_\mathrm{f}( {{\tau }_\mathrm{u} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.99 and |${a}_\mathrm{f}( {{\tau }_\mathrm{w} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.98. All 12 target events are predicted starting from |${\rm{\Delta }}t$| = 0.125 d (3 hr), with corresponding |${\tau }_\mathrm{u}$| = 5.0 per cent and |${\tau }_\mathrm{w}$| = 8.0 per cent. Detailed numerical values and AS scores are provided in Tables S9 and S10 (Supporting Information).

$Same as Fig. 4 but for the ETAS model applied to predict target shocks with ${{\it{M}}}_{\rm{w}} \ge 5.0$.$

Figure 10.

Same as Fig. 4 but for the ETAS model applied to predict target shocks with |${{\it{M}}}_{\rm{w}} \ge 5.0$|⁠.

For the ensemble EADD model, Fig. 11 shows the Molchan trajectories for target earthquakes |${M}_\mathrm{w} \ge 5.0$|⁠. As for other models, it overperform the prediction performance of the random method. It predicts all the 34 target earthquakes with |${M}_\mathrm{w} \ge 5.0$| from |${\rm{\Delta }}t$| = 16 yr for which |${\tau }_\mathrm{u}$| |$= 48.4$| per cent and |${\tau }_\mathrm{w} = 67.3$| per cent. For |${\rm{\Delta }}t$| = 1 yr, approximately 82 per cent (28/34) are predicted with |${\tau }_\mathrm{u}$| = 10.2 per cent and |${\tau }_\mathrm{w}$| = 15.1 per cent. For |${\rm{\Delta }}t$| = 3 months, it predicts 76 per cent (26/34) of target earthquakes with |${\tau }_\mathrm{u}$| = 3.5 per cent and |${\tau }_\mathrm{w}$| = 5.3 per cent. For |${\rm{\Delta }}t$| = 1 d, 68 per cent (23/34) of target earthquakes are predicted with |${\tau }_\mathrm{u}$| = 0.06 per cent and |${\tau }_\mathrm{w}$| = 0.11 per cent. The overall AS scores |${a}_\mathrm{f}( {{\tau }_{\mathrm{ u}} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.96 and |${a}_\mathrm{f}( {{\tau }_\mathrm{w} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.94 are significantly higher than the random model expected AS score (0.5). The Molchan diagram of the EADD model applied to |${M}_\mathrm{w} \ge $| 5.5 earthquakes (Fig. S4, Supporting Information) show very high prediction skills, with |${a}_\mathrm{f}( {{\tau }_\mathrm{u} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.97 and |${a}_\mathrm{f}( {{\tau }_\mathrm{w} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.96. The total number of target events (12) are predicted starting from |${\rm{\Delta }}t$| = 6 yr with |${\tau }_\mathrm{u}$| = 31.0 per cent and |${\tau }_\mathrm{w}$| = 45.0 per cent. For |${\rm{\Delta }}t$| = 1 yr and |${\rm{\Delta }}t$| = 1 d, the 92 per cent (11/12) and the 83 per cent (10/12) of target shocks are predicted, respectively. Detailed numerical values and AS scores are provided in Tables S11 and S12 (Supporting Information).

$Same as the Fig. 5 but for the EADD ensemble model applied to predict target shocks with ${{\it{M}}}_{\rm{w}} \ge 5.0$.$

Figure 11.

Same as the Fig. 5 but for the EADD ensemble model applied to predict target shocks with |${{\it{M}}}_{\rm{w}} \ge 5.0$|⁠.

For the ensemble EMUL model, Fig. 12 shows the Molchan trajectories for target earthquakes |${M}_\mathrm{w} \ge 5.0$|⁠. As for other models, it overperforms the prediction performance of the random method since both the Molchan trajectories for unweighted and weighted fraction of space-time occupied by alarms lies well below the diagonal line. Even with |${\rm{\Delta }}t$| = 18 yr, it predicts only 30 of 34 target earthquakes (88 per cent) with |${M}_\mathrm{w} \ge 5.0$|⁠. For |${\rm{\Delta }}t$| = 1 yr, approximately 65 per cent (22/34) are predicted with |${\tau }_\mathrm{u}$| = 1.7 per cent and |${\tau }_\mathrm{w}$| = 3.0 per cent. For |${\rm{\Delta }}t$| = 3 months predicts 62 per cent (21/34) of target earthquakes with |${\tau }_\mathrm{u}$|=0.4 per cent and |${\tau }_\mathrm{w}$|=0.8 per cent. For |${\rm{\Delta }}t$| = 1 d, 29 per cent (10/34) of target earthquakes are predicted with |${\tau }_\mathrm{u}$|=0.003 per cent and |${\tau }_\mathrm{w}$| = 0.01 per cent. The overall AS scores |${a}_\mathrm{f}( {{\tau }_{\mathrm{ u}} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.89 and |${a}_\mathrm{f}( {{\tau }_\mathrm{w} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.86 are significantly higher than the random model's expected AS score (0.5), The Molchan diagram of the EMUL model applied to |${M}_\mathrm{w} \ge $| 5.5 earthquakes (Fig. S5, Supporting Information) show high prediction skills, with |${a}_\mathrm{f}( {{\tau }_\mathrm{u} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.94 and |${a}_\mathrm{f}( {{\tau }_\mathrm{w} = 100\ \mathrm{ per}\ \textrm{cent}} )$| = 0.92. Even with |${\rm{\Delta }}t$| = 18 yr, it predicts only 11 of 12 target earthquakes (92 per cent) with |${M}_\mathrm{w} \ge 5.5$|⁠. For |${\rm{\Delta }}t$| = 1 yr and |${\rm{\Delta }}t$| = 1 d the 83 per cent (10/12) and the 42 per cent (5/12) of target shocks are predicted, respectively. Detailed numerical values and AS scores are provided in Tables S13 and S14 (Supporting Information).

$Same as Fig. 5 but for the EMUL ensemble model applied to predict target shocks with ${{\it{M}}}_{\rm{w}} \ge 5.0$.$

Figure 12.

Same as Fig. 5 but for the EMUL ensemble model applied to predict target shocks with |${{\it{M}}}_{\rm{w}} \ge 5.0$|⁠.

ANALYSIS OF PERFORMANCE OF MODELS BASED ON ALARM DISTRIBUTION

Based on the overall AS score (computed for |$\tau = 100\ \mathrm{ per}\ \textrm{cent}$|⁠), the ETAS model appears to be the best performing model for predicting target earthquake with both |${M}_\mathrm{w} \ge 5.0$| and |${M}_\mathrm{w} \ge 5.5$|⁠. However, a closer examination reveals significant issues associated with its implementation using the alarm-based approach. During the 18-yr testing period, the ETAS model casts more than 1600 000 alarms. These alarms were concentrated in only 134 out of 337 cells, leading to an average of approximately 1.8 alarms per day in each cell. While this high frequency of alarms explain why ETAS can predict all target earthquakes even with very short alarm times (a few days for |${M}_\mathrm{w} \ge $| 5.0 and a few hours for |${M}_\mathrm{w} \ge $| 5.5) it also results in an overwhelming number of alerts. This could be impractical for a real application, where the large number of alarms may complicate the decision-making process and strain the public resources.

Conversely, the BVAL and FORE models issues significantly fewer alarms during the same period, with the BVAL model generating 1796 alarms (less than one each year on average in each cell) and the FORE 782 alarms (about one every three years on average per cell).

The differences between these models are illustrated by their behaviour in Fig. 13 for cell 179, which includes three targets (blue vertical bars): the 2016 August 24 Amatrice earthquake (⁠|${M}_\mathrm{w} = 6.2$|⁠) and two significant aftershocks occurred on 2017 January 18 (⁠|${M}_\mathrm{w} = 5.3\ {\rm{and}}\ 5.7)$|⁠. All the three methods are able to forecast the three target shocks with a minimum |${\rm{\Delta }}t$| of 6 hr for BVAL and FORE, and of about 30 s for ETAS. ETAS alarms (pins) are almost continuous throughout the 18-yr testing period, even when considering short alarm times |${\rm{\Delta }}t$| of just a few hours or minutes. This nearly constant state of alert reflects ETAS's high sensitivity but also highlights its potential for generating ‘alarm fatigue’, where the frequency of alarms diminishes their credibility (Bliss et al. 1995; Graham & Cvach 2010).

Figure 13.

Time distribution of alarms for BVAL, FORE and ETAS forecasting methods in the cell nr. 179 where the 2016 August 24, Amatrice (M_w 6.2) main shock and other two strong aftershocks of 2017 January 18 (indicated with blue bars) occurred.

By comparison, the alarms issued by the BVAL and FORE models in the same cell are much sparser. While these models may not capture every possible precursor signal, they offer a more balanced approach, issuing alarms selectively and potentially providing more reliable warnings. This contrast highlights the fundamental trade-off in earthquake forecasting between the comprehensive coverage of ETAS and the more targeted and practical utility of BVAL and FORE. This comparison offers useful insights into the practical effects of each forecasting method, helping to determine which model is the most suitable, based on the desired balance between sensitivity and reliability.

As noted by Zechar & Jordan (2008, 2010), the overall AS Score (computed for |$\tau = 100\ \mathrm{ per}\ \textrm{cent}$|⁠) is a valuable criterion to compare forecasting models because its power generally improves with increasing |$\tau $|⁠. Although the ETAS model achieves high overall AS scores, the frequency and duration of the alarms it generates may limit its overall effectiveness. A forecasting method that can achieve a good predictive efficiency with alarms concentrated in relatively short-time intervals (like BVAL and FORE) could be probably more effective and may be better received by the users.

In Fig. 14, we compare the performance of various methods by reporting the AS scores, for targets with |${M}_\mathrm{w} \ge 5.0$|⁠, for |${\tau }_\mathrm{w}$| varying from 0.01 per cent to 100 per cent. For |${\tau }_\mathrm{w}$| < 20 per cent, the higher scores are obtained by the FORE and EADD methods. Even though the ETAS model has the highest score for |${\tau }_\mathrm{w} \ge 20$| per cent, for |${\tau }_\mathrm{w}$|<4 per cent it results even worse than BVAL and EMUL. Note that |${\tau }_\mathrm{w} = 0.01$| per cent means that in each cell there is on average less than one hour of alarm each year, |${\tau }_\mathrm{w} = 0.10$| per cent, less than 9 hr each year, |${\tau }_\mathrm{w} = 1$| per cent, less than 4 d each year, and |${\tau }_\mathrm{w} = 10$| per cent about 36 d.

$AS scores of various forecasting models as a function of the weighted fraction of space occupied by alarms ${{\rm{\tau }}}_{\rm{w}}$ for ${\it{M}}$ $\ge 5.0$ target shocks in the testing period 2005–2022.$

Figure 14.

AS scores of various forecasting models as a function of the weighted fraction of space occupied by alarms |${{\rm{\tau }}}_{\rm{w}}$| for |${\it{M}}$| |$\ge 5.0$| target shocks in the testing period 2005–2022.

Similar inferences can be drawn from Fig. 15 where the rate of successful predictions (hit rate, |$h = 1 - \nu $|⁠) is plotted versus |${\tau }_\mathrm{w}$|⁠. In this case, ETAS is the worst model for |${\tau }_\mathrm{w}$| < 2 per cent and becomes the best one for |${\tau }_\mathrm{w}$| > 5 per cent, which mean about 8 and 20 d of alarm each year, respectively, on average in each cell. While ETAS excels in scenarios with extended alarm durations, methods like FORE and EADD may be more suitable for situations where the goal is to maintain high predictive accuracy with minimal disruption to daily life, especially where frequent alarms could lead to public desensitization.

$Hit rates of various forecasting models as a function of the weighted fraction of space occupied by alarms ${{\rm{\tau }}}_{\rm{w}}$ for ${\it{M}}$ $\ge 5.0$ target shocks in the testing period 2005–2022.$

Figure 15.

Hit rates of various forecasting models as a function of the weighted fraction of space occupied by alarms |${{\rm{\tau }}}_{\rm{w}}$| for |${\it{M}}$| |$\ge 5.0$| target shocks in the testing period 2005–2022.

MOLCHAN-SHEBALIN DIAGRAM

Following the methodology introduced by Shebalin et al. (2011), we can compare the miss rates (⁠|$\nu $|⁠) of various analysed models to a skilled reference model (for this study, the ETAS model) playing the role of the purely random prediction on the Molchan diagonal. The skilled reference model is characterized by its space-time fractions occupied by alarms (⁠|${\tau }_\mathrm{ref}$|⁠) and miss rates (⁠|${\nu }_\mathrm{ref}$|⁠). The miss rates (⁠|${\nu }_\mathrm{ref}$|⁠) are plotted along the diagonal line of the Molchan diagram, where |${\tau }_\mathrm{ref} = 1 - {\nu }_\mathrm{ref}$|⁠. For the other models being compared, their expected miss rates (⁠|$\nu $|⁠) must also be plotted against |$\tau = 1 - {\nu }_\mathrm{ref}$|⁠. Since these models may have been computed at different |$\tau $| values compared to the |${\tau }_\mathrm{ref}$| values of the ETAS model, linear interpolation is required to align the |$\tau $| values appropriately. Specifically, for each |${\tau }_\mathrm{ref}$| of the ETAS reference model, we compute the miss rates (⁠|${\nu }_{\mathrm{ int}}$|⁠) for the other models (BVAL, FORE, EADD and EMUL) using the following linear interpolation formula (Biondini & Gasperini 2023):

$$\begin{eqnarray} {\nu }_{\mathrm{ int}} = \frac{{{\nu }_\mathrm{ a} - {\nu }_\mathrm{ b}}}{{{\tau }_\mathrm{ a} - {\tau }_\mathrm{ b}}}\left( {{\tau }_\mathrm{ref} - {\tau }_\mathrm{ b}} \right) + {\nu }_\mathrm{ b}, \end{eqnarray}$$

(8)

where |${\tau }_\mathrm{ a}$| and |${\tau }_\mathrm{ b}$| are the fractions of space-time occupied by alarms for the compared models, immediately larger and smaller than |${\tau }_\mathrm{ref}$|⁠, respectively, and |${\nu }_\mathrm{ a}$| and |${\nu }_\mathrm{ b}$| the corresponding miss rates, respectively. We make such interpolation using as |${\tau }_\mathrm{ref}\ $| both |${\tau }_\mathrm{u}$| and |${\tau }_\mathrm{u}$| of the reference ETAS model. The rationale for this interpolation is to provide a consistent basis for comparing the miss rates of different models at the same values of |$\tau $|⁠. Similarly to the Molchan diagrams showed above (Figs 5, and 9 –12), if the miss rates of a compared model |$({\nu }_{\mathrm{ int}})$| for the corresponding |$\tau = 1 - {\nu }_\mathrm{ref}$|⁠, are lower than |${\nu }_\mathrm{ref}$|⁠, such model has a better predictive ability than the reference model. This method provides a rigorous framework for evaluating the predictive capabilities of seismic forecasting models, allowing for a fair comparison based on equivalent conditions. For each model, we also compute a sort of a Molchan–Shebalin area skill (now on MSAS) score, according to eq. (7) where |$\nu $| is substituted by |${\nu }_{\mathrm{ int}}$| and |$\tau $| by |${\tau }_\mathrm{ref}$|⁠. For the reference ETAS model |${\nu }_{\mathrm{ int}} = {\nu }_\mathrm{ref} = 1 - {\tau }_\mathrm{ref}$|⁠, then its AS score is exactly 0.5.

Using the weighted |${\tau }_\mathrm{w}$| of the ETAS model for |${M}_\mathrm{w} \ge 5.0$| as |${\tau }_\mathrm{ref}$|⁠, in Fig. 16 (with values in Table 2) we show that all other models lie below the diagonal representing the ETAS model for |${\tau }_\mathrm{ref}$| < 60 per cent and the FORE and EADD models for |${\tau }_\mathrm{ref}$| < 80 per cent. All models have an overall MSAS score higher than the reference model ETAS. For FORE and EADD models, the score is significantly higher than for the ETAS according to the Student's t-test. The same plot for the |${M}_\mathrm{w} \ge 5.5$| is reported in Fig. S6 of the Supporting Information. The results are like those of Fig. 16 with a slightly better performance of BVAL, FORE, EADD and EMUL with respect to ETAS.

$Molchan–Shebalin diagram and MSAS scores (see text) of all compared models for target earthquakes with ${{\it{M}}}_{\rm{w}} \ge 5.0$, considering the ETAS model as reference and weighted fraction of space-time occupied by alarms $({{\rm{\tau }}}_{\rm{w}})$. The continuous diagonal line (orange) represents the miss rates of the ETAS model, distinguishing models with higher performance (below the line) from those with lower performance (above the line) compared to the reference.$

Figure 16.

Molchan–Shebalin diagram and MSAS scores (see text) of all compared models for target earthquakes with |${{\it{M}}}_{\rm{w}} \ge 5.0$|⁠, considering the ETAS model as reference and weighted fraction of space-time occupied by alarms |$({{\rm{\tau }}}_{\rm{w}})$|⁠. The continuous diagonal line (orange) represents the miss rates of the ETAS model, distinguishing models with higher performance (below the line) from those with lower performance (above the line) compared to the reference.

Table 2.

Values of variables in Molchan–Shebalin plot of Fig. 12 for target shocks with |$M \ge 5.0$| and unweighted fractions of space-time occupied by alarms |$({\tau }_\mathrm{u}$|⁠).

\|${{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ ref}}}}$\|	\|${{\boldsymbol{a}}}_{\boldsymbol{\mathrm{ f}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ BVAL}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ BVAL}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ FORE}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ FORE}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ EADD}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ EADD}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ EMUL}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ EMUL}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|
0.000	1.000	0.000	1.000	0.000	1.000	0.000	1.000	0.000	1.000	0.000
0.000	1.000	0.000	0.912	0.000	0.893	0.000	0.853	0.000	0.853	0.000
0.059	0.941	0.029	0.907	0.091	0.805	0.151	0.853	0.147	0.850	0.149
0.088	0.912	0.044	0.865	0.098	0.708	0.182	0.853	0.147	0.819	0.154
0.088	0.912	0.044	0.786	0.098	0.643	0.182	0.690	0.147	0.695	0.154
0.088	0.912	0.044	0.759	0.098	0.600	0.182	0.608	0.147	0.618	0.154
0.118	0.882	0.059	0.725	0.138	0.543	0.243	0.555	0.215	0.556	0.219
0.147	0.853	0.074	0.638	0.174	0.475	0.293	0.460	0.270	0.515	0.268
0.206	0.794	0.103	0.519	0.245	0.399	0.370	0.366	0.361	0.484	0.335
0.324	0.676	0.162	0.474	0.339	0.350	0.463	0.315	0.469	0.471	0.403
0.382	0.618	0.191	0.441	0.370	0.324	0.494	0.294	0.504	0.441	0.425
0.471	0.529	0.235	0.441	0.406	0.317	0.529	0.294	0.542	0.381	0.455
0.471	0.529	0.235	0.441	0.406	0.299	0.529	0.294	0.542	0.368	0.455
0.529	0.471	0.265	0.441	0.423	0.254	0.550	0.294	0.560	0.353	0.476
0.588	0.412	0.294	0.400	0.438	0.206	0.572	0.277	0.576	0.353	0.493
0.853	0.147	0.426	0.325	0.500	0.206	0.641	0.201	0.633	0.353	0.541
0.912	0.088	0.456	0.324	0.511	0.206	0.651	0.176	0.645	0.279	0.550
0.971	0.029	0.485	0.206	0.525	0.154	0.661	0.105	0.658	0.235	0.562
0.971	0.029	0.485	0.176	0.525	0.118	0.661	0.059	0.658	0.176	0.562
1.000	0.000	0.500	0.147	0.534	0.059	0.669	0.039	0.666	0.108	0.570
1.000	0.000	0.500	0.147	0.534	0.059	0.669	0.029	0.666	0.105	0.570
1.000	0.000	0.500	0.133	0.534	0.059	0.669	0.029	0.666	0.105	0.570
1.000	0.000	0.500	0.120	0.534	0.059	0.669	0.029	0.666	0.105	0.570
1.000	0.000	0.500	0.118	0.534	0.059	0.669	0.029	0.666	0.104	0.570
1.000	0.000	0.500	0.117	0.534	0.059	0.669	0.029	0.666	0.104	0.570
1.000	0.000	0.500	0.116	0.534	0.059	0.669	0.029	0.666	0.103	0.570
1.000	0.000	0.500	0.115	0.534	0.059	0.669	0.029	0.666	0.102	0.570
1.000	0.000	0.500	0.114	0.534	0.059	0.669	0.029	0.666	0.101	0.570
1.000	0.000	0.500	0.114	0.534	0.059	0.669	0.029	0.666	0.101	0.570
1.000	0.000	0.500	0.113	0.534	0.059	0.669	0.029	0.666	0.100	0.570
1.000	0.000	0.500	0.112	0.534	0.059	0.669	0.029	0.666	0.100	0.570
1.000	0.000	0.500	0.112	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.112	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.000	0.534	0.000	0.669	0.000	0.666	0.000	0.570
1.000	0.000	0.500	0.000	0.534	0.000	0.669	0.000	0.666	0.000	0.570

\|${{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ ref}}}}$\|	\|${{\boldsymbol{a}}}_{\boldsymbol{\mathrm{ f}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ BVAL}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ BVAL}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ FORE}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ FORE}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ EADD}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ EADD}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ EMUL}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ EMUL}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|
0.000	1.000	0.000	1.000	0.000	1.000	0.000	1.000	0.000	1.000	0.000
0.000	1.000	0.000	0.912	0.000	0.893	0.000	0.853	0.000	0.853	0.000
0.059	0.941	0.029	0.907	0.091	0.805	0.151	0.853	0.147	0.850	0.149
0.088	0.912	0.044	0.865	0.098	0.708	0.182	0.853	0.147	0.819	0.154
0.088	0.912	0.044	0.786	0.098	0.643	0.182	0.690	0.147	0.695	0.154
0.088	0.912	0.044	0.759	0.098	0.600	0.182	0.608	0.147	0.618	0.154
0.118	0.882	0.059	0.725	0.138	0.543	0.243	0.555	0.215	0.556	0.219
0.147	0.853	0.074	0.638	0.174	0.475	0.293	0.460	0.270	0.515	0.268
0.206	0.794	0.103	0.519	0.245	0.399	0.370	0.366	0.361	0.484	0.335
0.324	0.676	0.162	0.474	0.339	0.350	0.463	0.315	0.469	0.471	0.403
0.382	0.618	0.191	0.441	0.370	0.324	0.494	0.294	0.504	0.441	0.425
0.471	0.529	0.235	0.441	0.406	0.317	0.529	0.294	0.542	0.381	0.455
0.471	0.529	0.235	0.441	0.406	0.299	0.529	0.294	0.542	0.368	0.455
0.529	0.471	0.265	0.441	0.423	0.254	0.550	0.294	0.560	0.353	0.476
0.588	0.412	0.294	0.400	0.438	0.206	0.572	0.277	0.576	0.353	0.493
0.853	0.147	0.426	0.325	0.500	0.206	0.641	0.201	0.633	0.353	0.541
0.912	0.088	0.456	0.324	0.511	0.206	0.651	0.176	0.645	0.279	0.550
0.971	0.029	0.485	0.206	0.525	0.154	0.661	0.105	0.658	0.235	0.562
0.971	0.029	0.485	0.176	0.525	0.118	0.661	0.059	0.658	0.176	0.562
1.000	0.000	0.500	0.147	0.534	0.059	0.669	0.039	0.666	0.108	0.570
1.000	0.000	0.500	0.147	0.534	0.059	0.669	0.029	0.666	0.105	0.570
1.000	0.000	0.500	0.133	0.534	0.059	0.669	0.029	0.666	0.105	0.570
1.000	0.000	0.500	0.120	0.534	0.059	0.669	0.029	0.666	0.105	0.570
1.000	0.000	0.500	0.118	0.534	0.059	0.669	0.029	0.666	0.104	0.570
1.000	0.000	0.500	0.117	0.534	0.059	0.669	0.029	0.666	0.104	0.570
1.000	0.000	0.500	0.116	0.534	0.059	0.669	0.029	0.666	0.103	0.570
1.000	0.000	0.500	0.115	0.534	0.059	0.669	0.029	0.666	0.102	0.570
1.000	0.000	0.500	0.114	0.534	0.059	0.669	0.029	0.666	0.101	0.570
1.000	0.000	0.500	0.114	0.534	0.059	0.669	0.029	0.666	0.101	0.570
1.000	0.000	0.500	0.113	0.534	0.059	0.669	0.029	0.666	0.100	0.570
1.000	0.000	0.500	0.112	0.534	0.059	0.669	0.029	0.666	0.100	0.570
1.000	0.000	0.500	0.112	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.112	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.000	0.534	0.000	0.669	0.000	0.666	0.000	0.570
1.000	0.000	0.500	0.000	0.534	0.000	0.669	0.000	0.666	0.000	0.570

|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ ref}}}}{\rm{\ }}$| is the miss rate of the ETAS model (reference), |${\tau }_{\mathrm{ re}}$| are the fractions of space time occupied by alarms used to plot the Molchan diagram and given by |$1 - {\nu }_\mathrm{ref}$|⁠, |${\nu }_\mathrm{ref}$| is the reference miss rate, |${\nu }_\mathrm{ref}$| is the interpolated miss rate, |${a}_\mathrm{f}( {{\tau }_{\mathrm{ re}}} ),{\rm{\ }}$| |${a}_{\textrm{BVAL}}( {{\tau }_{\mathrm{ re}}} )$| and |${a}_{\textrm{FORE}}( {{\tau }_{\mathrm{ re}}} )$|⁠, |${a}_{\textrm{EADD}}( {{\tau }_{\mathrm{ re}}} )$| and |${a}_{\textrm{EMUL}}( {{\tau }_{\mathrm{ re}}} )$| are the area skill scores computed for the reference ETAS, BVAL, FORE, EADD and EMUL models, respectively.

Table 2.

Values of variables in Molchan–Shebalin plot of Fig. 12 for target shocks with |$M \ge 5.0$| and unweighted fractions of space-time occupied by alarms |$({\tau }_\mathrm{u}$|⁠).

\|${{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ ref}}}}$\|	\|${{\boldsymbol{a}}}_{\boldsymbol{\mathrm{ f}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ BVAL}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ BVAL}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ FORE}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ FORE}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ EADD}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ EADD}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ EMUL}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ EMUL}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|
0.000	1.000	0.000	1.000	0.000	1.000	0.000	1.000	0.000	1.000	0.000
0.000	1.000	0.000	0.912	0.000	0.893	0.000	0.853	0.000	0.853	0.000
0.059	0.941	0.029	0.907	0.091	0.805	0.151	0.853	0.147	0.850	0.149
0.088	0.912	0.044	0.865	0.098	0.708	0.182	0.853	0.147	0.819	0.154
0.088	0.912	0.044	0.786	0.098	0.643	0.182	0.690	0.147	0.695	0.154
0.088	0.912	0.044	0.759	0.098	0.600	0.182	0.608	0.147	0.618	0.154
0.118	0.882	0.059	0.725	0.138	0.543	0.243	0.555	0.215	0.556	0.219
0.147	0.853	0.074	0.638	0.174	0.475	0.293	0.460	0.270	0.515	0.268
0.206	0.794	0.103	0.519	0.245	0.399	0.370	0.366	0.361	0.484	0.335
0.324	0.676	0.162	0.474	0.339	0.350	0.463	0.315	0.469	0.471	0.403
0.382	0.618	0.191	0.441	0.370	0.324	0.494	0.294	0.504	0.441	0.425
0.471	0.529	0.235	0.441	0.406	0.317	0.529	0.294	0.542	0.381	0.455
0.471	0.529	0.235	0.441	0.406	0.299	0.529	0.294	0.542	0.368	0.455
0.529	0.471	0.265	0.441	0.423	0.254	0.550	0.294	0.560	0.353	0.476
0.588	0.412	0.294	0.400	0.438	0.206	0.572	0.277	0.576	0.353	0.493
0.853	0.147	0.426	0.325	0.500	0.206	0.641	0.201	0.633	0.353	0.541
0.912	0.088	0.456	0.324	0.511	0.206	0.651	0.176	0.645	0.279	0.550
0.971	0.029	0.485	0.206	0.525	0.154	0.661	0.105	0.658	0.235	0.562
0.971	0.029	0.485	0.176	0.525	0.118	0.661	0.059	0.658	0.176	0.562
1.000	0.000	0.500	0.147	0.534	0.059	0.669	0.039	0.666	0.108	0.570
1.000	0.000	0.500	0.147	0.534	0.059	0.669	0.029	0.666	0.105	0.570
1.000	0.000	0.500	0.133	0.534	0.059	0.669	0.029	0.666	0.105	0.570
1.000	0.000	0.500	0.120	0.534	0.059	0.669	0.029	0.666	0.105	0.570
1.000	0.000	0.500	0.118	0.534	0.059	0.669	0.029	0.666	0.104	0.570
1.000	0.000	0.500	0.117	0.534	0.059	0.669	0.029	0.666	0.104	0.570
1.000	0.000	0.500	0.116	0.534	0.059	0.669	0.029	0.666	0.103	0.570
1.000	0.000	0.500	0.115	0.534	0.059	0.669	0.029	0.666	0.102	0.570
1.000	0.000	0.500	0.114	0.534	0.059	0.669	0.029	0.666	0.101	0.570
1.000	0.000	0.500	0.114	0.534	0.059	0.669	0.029	0.666	0.101	0.570
1.000	0.000	0.500	0.113	0.534	0.059	0.669	0.029	0.666	0.100	0.570
1.000	0.000	0.500	0.112	0.534	0.059	0.669	0.029	0.666	0.100	0.570
1.000	0.000	0.500	0.112	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.112	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.000	0.534	0.000	0.669	0.000	0.666	0.000	0.570
1.000	0.000	0.500	0.000	0.534	0.000	0.669	0.000	0.666	0.000	0.570

\|${{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ ref}}}}$\|	\|${{\boldsymbol{a}}}_{\boldsymbol{\mathrm{ f}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ BVAL}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ BVAL}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ FORE}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ FORE}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ EADD}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ EADD}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|	\|${{\boldsymbol{\nu }}}_{{\boldsymbol{\mathrm{ EMUL}}}}$\|	\|${{\boldsymbol{a}}}_{{\boldsymbol{\mathrm{ EMUL}}}}( {{{\boldsymbol{\tau }}}_{{\boldsymbol{\mathrm{ re}}}}} )$\|
0.000	1.000	0.000	1.000	0.000	1.000	0.000	1.000	0.000	1.000	0.000
0.000	1.000	0.000	0.912	0.000	0.893	0.000	0.853	0.000	0.853	0.000
0.059	0.941	0.029	0.907	0.091	0.805	0.151	0.853	0.147	0.850	0.149
0.088	0.912	0.044	0.865	0.098	0.708	0.182	0.853	0.147	0.819	0.154
0.088	0.912	0.044	0.786	0.098	0.643	0.182	0.690	0.147	0.695	0.154
0.088	0.912	0.044	0.759	0.098	0.600	0.182	0.608	0.147	0.618	0.154
0.118	0.882	0.059	0.725	0.138	0.543	0.243	0.555	0.215	0.556	0.219
0.147	0.853	0.074	0.638	0.174	0.475	0.293	0.460	0.270	0.515	0.268
0.206	0.794	0.103	0.519	0.245	0.399	0.370	0.366	0.361	0.484	0.335
0.324	0.676	0.162	0.474	0.339	0.350	0.463	0.315	0.469	0.471	0.403
0.382	0.618	0.191	0.441	0.370	0.324	0.494	0.294	0.504	0.441	0.425
0.471	0.529	0.235	0.441	0.406	0.317	0.529	0.294	0.542	0.381	0.455
0.471	0.529	0.235	0.441	0.406	0.299	0.529	0.294	0.542	0.368	0.455
0.529	0.471	0.265	0.441	0.423	0.254	0.550	0.294	0.560	0.353	0.476
0.588	0.412	0.294	0.400	0.438	0.206	0.572	0.277	0.576	0.353	0.493
0.853	0.147	0.426	0.325	0.500	0.206	0.641	0.201	0.633	0.353	0.541
0.912	0.088	0.456	0.324	0.511	0.206	0.651	0.176	0.645	0.279	0.550
0.971	0.029	0.485	0.206	0.525	0.154	0.661	0.105	0.658	0.235	0.562
0.971	0.029	0.485	0.176	0.525	0.118	0.661	0.059	0.658	0.176	0.562
1.000	0.000	0.500	0.147	0.534	0.059	0.669	0.039	0.666	0.108	0.570
1.000	0.000	0.500	0.147	0.534	0.059	0.669	0.029	0.666	0.105	0.570
1.000	0.000	0.500	0.133	0.534	0.059	0.669	0.029	0.666	0.105	0.570
1.000	0.000	0.500	0.120	0.534	0.059	0.669	0.029	0.666	0.105	0.570
1.000	0.000	0.500	0.118	0.534	0.059	0.669	0.029	0.666	0.104	0.570
1.000	0.000	0.500	0.117	0.534	0.059	0.669	0.029	0.666	0.104	0.570
1.000	0.000	0.500	0.116	0.534	0.059	0.669	0.029	0.666	0.103	0.570
1.000	0.000	0.500	0.115	0.534	0.059	0.669	0.029	0.666	0.102	0.570
1.000	0.000	0.500	0.114	0.534	0.059	0.669	0.029	0.666	0.101	0.570
1.000	0.000	0.500	0.114	0.534	0.059	0.669	0.029	0.666	0.101	0.570
1.000	0.000	0.500	0.113	0.534	0.059	0.669	0.029	0.666	0.100	0.570
1.000	0.000	0.500	0.112	0.534	0.059	0.669	0.029	0.666	0.100	0.570
1.000	0.000	0.500	0.112	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.112	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.099	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.111	0.534	0.059	0.669	0.029	0.666	0.098	0.570
1.000	0.000	0.500	0.000	0.534	0.000	0.669	0.000	0.666	0.000	0.570
1.000	0.000	0.500	0.000	0.534	0.000	0.669	0.000	0.666	0.000	0.570

DISCUSSION AND CONCLUSIONS

We developed the BVAL method to forecast potentially damaging target earthquakes (⁠|${M}_\mathrm{w} \ge 5.0$|⁠) in Italy, based on the temporal variations of the slope (b-value) of the frequency–magnitude distribution (Gutenberg & Richter 1944), and we compared it with other forecasting methods based on the statistical properties of seismicity: the FORE method using the occurrence of strong foreshocks (Gasperini et al. 2021), the ETAS method using the epidemic principle that any shock occurred acts as a trigger of the following ones (Ogata 1988), and two ensemble methods obtained by the union (EADD) and the intersection (EMUL) of space time alarm windows of BVAL and FORE.

We were inspired by the hypothesis formulated by Scholz (1968, 2015) and Amitrano (2003) that b-value decreases with increasing differential stress and with the increasing probability of occurrence of strong earthquakes. The same hypothesis brought to the formulation of the Foreshock Traffic Light System (Gulia & Wiemer 2010, 2019, 2021; Gulia et al. 2018) predicting whenever a shock with |${M}_\mathrm{w} \ge 6.0$| is the strongest in a seismic sequence (green light, high b-value) or an even stronger shock is likely to occur close in space in the near future (red light, low b-value).

Our algorithm declares an alarm of duration |${\rm{\Delta }}t$|⁠, which varies from a fraction of second to the entire duration of the experiment, in a square cell with sides of about 42.4 km, every time the b-value in that cell decreases below the threshold |${b}_{\mathrm{ th}} = \ $| 0.9 from a higher value. Such threshold was determined by optimizing the ability of the algorithm to forecast earthquakes with |${M}_\mathrm{w} \ge 5.$|0 of the HORUS catalogue (Lolli et al. 2020) from 1990 to 2004.

If a target earthquake occurs in a cell where an alarm window is active, we count a hit, if a target earthquake occurs outside any alarm window we count a miss. To not miss the prediction of target earthquakes located only slightly outside an alarmed cell, we used a double tessellation with two grids staggered by half the grid side.

We pseudo-prospectively evaluated the predictive efficiency of the forecasting method using the AS score (Zechar & Jordan 2008, 2010) measuring the area above the Molchan trajectory (Molchan 1990, 1991), based on the data of the HORUS catalogue from 2005 to 2022. We obtained the Molchan trajectory by plotting the miss rate |$\nu $| as a function of the fraction of space-time occupied by alarms |$\tau $|⁠, varying the alarm duration |${\rm{\Delta }}t$| from a fraction of second to the total duration of the testing period (18 yr). The lower the miss rates of the Molchan trajectory the larger the AS score and then the more accurate the forecasting method.

We also penalized the forecasting performance of models in highly seismic areas, where random forecasts are easier, by weighting the fraction of space-time occupied by alarms proportionally to the long-term earthquake occurrence rate deduced from the catalogue CPTI15 V4.0 (Rovida et al. 2020, 2022) from 1600 to 1959.

Based on the overall AS score (computed for |$\tau = 100$| per cent) the best performing model would be the ETAS, which achieves a near-perfect AS score (97–99 per cent). While this might indicate ETAS as the most effective model, it might also evidence a critical limitation of the Molchan diagram for testing models, like ETAS, issuing extremely frequent and dense alarms which results in a rapid reduction of the miss rate to zero even for notably short alarms of a few seconds or minutes. This creates an illusion of perfect predictive capability, as all target earthquakes are technically predicted, but at the cost of an overwhelming and nearly constant state of alarm in some cells.

This characteristic of ETAS, while beneficial in a purely statistical sense, undermines its broader applicability. The constant stream of alarms not only risks desensitising the population and creating ‘alarm fatigue’, effective management of emergency responses to natural disasters. In highly alarmed cells, the ETAS model alarms are so continuous that it becomes difficult to distinguish between periods of genuine increased risk and the model default state of alert. This diminishes its overall effectiveness for the operational utility where the goal is to balance predictive accuracy and minimizing unnecessary alerts.

More in general, we can note that high fractions (e.g. >10 per cent) of space-time occupied by alarms may lead to a diminished trust in the model, as the constant presence of alarms could make it harder to discern meaningful signals from the background noise. Hence, we also computed the AS scores of BVAL and other forecasting models even at smaller |$\tau $| down to 0.01 per cent.

If compared with other methods, the BVAL is worse than FORE at any |$\tau $| but better than ETAS for |$\tau < 2 - 4$| per cent. The ensemble model EADD obtained by the sum of alarms issued by both BVAL and FORE is the best performing for space time occupations between 0.04 per cent to 2 per cent, which is the range of time-space occupation for which the forecasting algorithms might be more useful for civil protection purposes.

This preference for other methods with respect to ETAS is also confirmed by the Molchan–Shebalin diagram (Shebalin et al. 2011) in which the role of the random forecast along the Molchan diagonal is played by a reference skilled method (in our case the ETAS method itself). In fact, all other methods describe Molchan trajectories below the reference ETAS diagonal for |$\tau $| < 60 per cent and have the area above the trajectory, larger than 0.5 that corresponds to the ETAS reference model score. In summary, the BVAL forecasting method works rather well so confirming the validity of the basic hypothesis that low b-values might indicate the future occurrence of potentially damaging earthquakes. It is anyhow slightly worse than the FORE method based on the occurrence of strong foreshocks (Gasperini et al. 2021). However, BVAL in additive combination with the FORE improves the efficiency of the latter, particularly at relatively low fractions (<5 per cent) of space-time occupied by alarm that are those that are most likely to be useful for seismic hazard mitigations.

ACKNOWLEDGEMENTS

We thank Rodolfo Console and Aron Mirwald for thoughtful comments and suggestions that helped to improve the paper. This paper benefitted from funding provided by the European Union within the ambit of the H2020 project RISE (no. 821115) and the ETHZ, which in particular fully financed the Postdoc grant of one of the authors (BE). We thank the Istituto Nazionale di Geofisica e Vulcanologia, Italy, grant "Progetto INGV Pianeta Dinamico (NEMESIS)’—code CUP D53J19000170001–funded by Italian Ministry MIUR.

CONFLICT OF INTEREST

The authors declare that there is no conflict of interest regarding the publication of this manuscript.

DATA AVAILABILITY

The data of the HORUS and CPTI15 earthquake catalogues are available from public providers: http://horus.bo.ingv.it and https://emidius.mi.ingv.it/CPTI15-DBMI15/, respectively.

The QGIS geographical software was downloaded from www.qgis.org.

REFERENCES

Aki

1965

Maximum likelihood estimate of b in the formula log N = a − bM and its confidence limits

Bull. Earthq. Res. Inst.

237

–

239

Amitrano

2003

Brittle-ductile transition and associated seismicity: experimental and numerical studies and relationship with the b value

J. geophys. Res.

108

(

doi:10.1029/2001JB000680

10.1029/2001JB000680

10.1080/00140139508925269

Bayona

J.A.

Savran

W.H.

Rhoades

D.A.

Werner

M.J.

2022

Prospective evaluation of multiplicative hybrid earthquake forecasting models in California

Geophys. J. Int.

229

1736

–

1753

Biondini

Gasperini

2023

Comparison between alarm-based and probability-based earthquake forecasting methods

Geophys. J. Int.

235

(

1541

–

1551

Biondini

Rhoades

D.A.

Gasperini

2023

Application of the EEPAS earthquake forecasting model to Italy

Geophys. J. Int.

234

(

1681

–

1700

Bliss

J.P.

Gilson

R.D.

Deaton

J.E.

1995

Human probability matching behaviour in response to alarms of varying reliability

Ergonomics

2300

–

2312

Chan

Tseng

Lin

Chen

2012

Spatial and temporal evolution of b-values before large earthquakes in Taiwan

Tectonophysics

532–535

215

–

222

10.1016/j.tecto.2012.02.004

Chen

S.K.

Chen

P.-Y.

Y.-M.

Chen

C.-C.

Chan

C.-H.

(

2023

Temporal variations of earthquake magnitude-frequency relation in the source area of M ≥ 6.0 earthquakes: a systematic survey in Taiwan

Earth Space Sci.

e2023EA002927

doi:10.1029/2023EA002927

Console

Murru

2001

A simple and testable model for earthquake clustering

J. geophys. Res.

106

8699

–

8711

Console

Murru

Lombardi

A.M.

2003

Refining earthquake clustering models

J. geophys. Res.

108

(

B10

doi:10.1029/2002JB002130

10.1029/2002JB002130

Console

Rhoades

D.A.

Murru

Evison

F.F.

Papadimitriou

E.E.

Karakostas

V.G.

2006

Comparative performance of time-invariant, long-range and short-range forecasting models on the earthquake catalogue of Greece

J. geophys. Res.

111

(

B09304

doi:10.1029/2005JB004113

DeSalvio

N.D.

Rudolph

M.L.

2021

A retrospective analysis of b-value changes preceding strong earthquakes

Seismol. Res. Lett.

364

–

375

10.1785/022021014

10.1016/j.tecto.2013.12.001

El-Isa

Z.H.

Eaton

D.W.

2014

Spatiotemporal variations in the b-value of earthquake magnitude–frequency distributions: classification and causes

Tectonophysics

615–616

–

Falcone

Spassiani

Ashkenazy

Shapira

Hofstetter

Havlin

Marzocchi

2021

An operational earthquake forecasting experiment for Israel: preliminary results

Front. Earth Sci.

–

10.3389/feart.2021.729282

Frankel

1995

Mapping seismic hazard in the central and eastern United States

Seismol. Res. Lett.

(

–

Garcia-Hernández

D'Auria

Barrancos

Padilla

G.D.

Pérez

N.M.

2021

Multiscale temporal and spatial estimation of the b-value

Seismol. Res. Lett.

3712

–

3724

Gasperini

Biondini

Lolli

Petruccelli

Vannucci

2021

Retrospective short-term forecasting experiment in Italy based on the occurrence of strong (fore) shocks

Geophys. J. Int.

225

1192

–

1206

Graham

K.C.

Cvach

2010

Monitor alarm fatigue: standardizing use of physiological monitors and decreasing nuisance alarms

Am. J. Crit. Care

–

Gulia

Wiemer

2010

The influence of tectonic regimes on the earthquake size distribution: a case study for Italy

Geophys. Res. Lett.

(

–

Gulia

Wiemer

2019

Real-time discrimination of earthquake foreshocks and aftershocks

Nature

574

193

–

199

10.1038/s41586-019-1606-4

Gulia

Wiemer

2021

Comment on “Two Foreshock Sequences Post Gulia and Wiemer (2019)” by Kelian Dascher-Cousineau, Thorne Lay, and Emily E. Brodsky

Seismol. Res. Lett.

3251

–

3258

Gulia

Rinaldi

A.P.

Tormann

Vannucci

Enescu

Wiemer

2018

The effect of a mainshock on the size distribution of the aftershocks

Geophys. Res. Lett.

13277

–

Gulia

Wiemer

Biondini

Enescu

Vannucci

(

2024

Improving the foreshock traffic light systems for real-time discrimination between foreshocks and aftershocks

Seismol. Res. Lett.

(

3579

–

3592

Guo

Ogata

1997

Statistical relations between the parameters of aftershocks in time, space, and magnitude

J. Geophys. Res. Solid Earth

102

2857

–

2873

Gutenberg

Richter

C.F.

1944

Frequency of earthquakes in California

Bull. seism. Soc. Am.

185

–

118

10.1785/BSSA0340040185

Guttorp

Hopkins

1986

On estimating varying b values

Bull. seism. Soc. Am.

889

–

895

10.1785/BSSA0760030889

Kagan

Y.Y.

2009

Testing long-term earthquake forecasts: likelihood methods and error diagrams

Geophys. J. Int.

177

532

–

542

10.1111/j.1365-246X.2008.04064.x

Lolli

Boschi

Gasperini

2009

A comparative analysis of different models of aftershock rate decay by maximum likelihood estimation of simulated sequences

J. Geophys. Res. Solid Earth

114

(

B01305

–

10.1007/s00024-017-1482-5

Lolli

Randazzo

Vannucci

Gasperini

2020

The homogenized instrumental seismic catalog (HORUS) of Italy from 1960 to present

Seismol. Res. Lett.

3208

–

3222

Lombardi

A.M.

Marzocchi

2010

A double-branching model applied to long-term forecasting of Italian seismicity (ML ≥ 5.0) within the CSEP project

Ann. Geophys.

(

–

2017

Long-term b value variations of shallow earthquakes in New Zealand: a HMM-based analysis

Pure appl. Geophys.

174

1629

–

1641

Mizrahi

Nandan

Savran

Wiemer

Ben-Zion

2023

Question-driven ensembles of flexible ETAS models

Seismol. Res. Lett.

829

–

843

Molchan

Keilis-Borok

2008

Earthquake prediction: probabilistic aspect

Geophys. J. Int.

173

1012

–

1017

10.1111/j.1365-246X.2008.03785.x

Molchan

G.M.

Kagan

Y.Y.

1992

Earthquake prediction and its optimization

J. geophys. Res.

4823

doi:10.1029/91JB03095

Molchan

G.M.

1990

Strategies in strong earthquake prediction

Phys. Earth planet. Inter.

–

10.1016/0031-9201(90)90097-H

Molchan

G.M.

1991

Structure of optimal strategies in earthquake prediction

Tectonophysics

193

267

–

276

10.1016/0040-1951(91)90336-Q

Molchan

G.M.

1997

Earthquake prediction as a decision-making problem

Pure appl. Geophys.

149

233

–

247

Montuori

Murru

Falcone

2016

Spatial variation of the b-value observed for the periods preceding and following the 24 August 2016, Amatrice earthquake (ML6.0) (Central Italy)

Ann. Geophys.

doi:10.4401/ag-7273

10.4401/ag-7273

10.1016/j.tecto.2008.09.010

Murru

Console

Falcone

2009

Real time earthquake forecasting in Italy

Tectonophysics

470

214

–

223

Murru

Console

Lisi

2004

Seismicity and mean magnitude variations correlated to the strongest earthquakes of the 1997 Umbria–Marche sequence (central Italy)

J. geophys. Res.

109

B01304

doi:10.1029/2002JB002276

Nuannin

Kulhanek

Persson

2005

2004 Spatial and temporal b value anomalies preceding the devastating off coast of NW Sumatra earthquake of December 26

Geophys. Res. Lett.

L11307

doi:10.1029/2005GL022679

Ogata

Zhuang

2006

Space–time ETAS models and an improved extension

Tectonophysics

413

–

10.1016/j.tecto.2005.10.016

Ogata

1988

Statistical models for earthquake occurrences and residual analysis for point processes

J. Am. Stat. Assoc.

–

10.1080/01621459.1988.10478560

Ogata

1989

Statistical model for standard seismicity and detection of anomalies by residual analysis

Tectonophysics

169

159

–

174

10.1016/0040-1951(89)90191-1

Ogata

Jones

L.M.

Toda

2003

When and where the aftershock activity was depressed: contrasting decay patterns of the proximate large earthquakes in southern California

J. Geophys. Res. Solid Earth

108

(

–

Öztürk

2020

A study on the variations of recent seismicity in and around the Central Anatolian region of Turkey

Phys. Earth Planet. Int.

301

–

10.1016/j.pepi.2020.106453

Parsons

2007

Forecast experiment: do temporal and spatial b value variations along the Calaveras fault portend M ≥ 4.0 earthquakes?

J. geophys. Res.

112

–

Pastoressa

A.E.

Murru

Taroni

Console

Montuori

Falcone

Di Stefano

2023

Temporal variations of seismicity rates and Gutenberg–Richter b-values for a stochastic declustered catalog: an example in Central Italy

Seismol. Res. Lett.

1566

–

1578

Rovida

Locati

Camassi

Lolli

Gasperini

2020

The Italian earthquake catalogue CPTI15

Bull. Earthq. Eng.

2953

–

2984

10.1007/s10518-020-00818-y

Rovida

Locati

Camassi

Lolli

Gasperini

Antonucci

2022

Parametric Catalogue of Italian Earthquakes (CPTI15), version 4.0

Istituto Nazionale di Geofisica e Vulcanologia (INGV)

doi:10.13127/cpti/cpti15.4

10.13127/cpti/cpti15.4

Scholz

C.H.

1968

The frequency-magnitude relation of microfracturing in rock and its relation to earthquakes

Bull. seism. Soc. Am.

399

–

415

10.1785/BSSA0580010399

Scholz

C.H.

2015

On the stress dependence of the earthquake b value

Geophys. Res. Lett.

1399

–

1402

Sharma

Hainzl

Zöller

2023

Seismicity parameters dependence on main shock-induced co-seismic stress

Geophys. J. Int.

235

509

–

517

Shebalin

Narteau

Holschneider

Schorlemmer

2011

Short-term earthquake forecasting using early aftershock statistics

Bull. seism. Soc. Am.

101

297

–

312

Tinti

Gasperini

2024

The estimation of b-value of the frequency-magnitude distribution and of its one-sigma intervals from binned magnitude data

Geophys. J. Int.

238

433

–

458

Tsukakoshi

Shimazaki

2008

Decreased b-value prior to the M 6.2 Northern Miyagi, Japan, earthquake of 26 July 2003

Earth Planets Space

915

–

924

Utsu

1965

A method for determining the value of b in a formula log n = a − bM showing the magnitude-frequency relation for earthquakes

Geophys. Bull. Hokkaido Univ.

–

103

10.5194/nhess-12-3473-2012

Utsu

1966

A statistical significance test of the difference in b-value between two earthquake groups

J. Phys. Earth

–

10.4294/jpe1952.14.37

van der Elst

2021

B-positive: a robust estimator of aftershock magnitude distribution in transiently incomplete catalogs

J. Geophys. Res. Solid Earth

126

–

Varotsos

P.A.

Sarlis

N.V.

Skordas

E.S.

2012

Order parameter fluctuations in natural time and b-value variation before large earthquakes

Nat. Hazards Earth Syst. Sci.

3473

–

3481

Wiemer

Wyss

2000

Minimum magnitude of completeness in earthquake catalogs: examples from Alaska, the western United States, and Japan

Bull. seism. Soc. Am.

859

–

869

Woessner

Wiemer

2005

Assessing the quality of earthquake catalogues: estimating the magnitude of completeness and its uncertainty

Bull. seism. Soc. Am.

684

–

698

Zechar

J.D.

Jordan

T.H.

2008

Testing alarm-based earthquake predictions

Geophys. J. Int.

172

715

–

724

10.1111/j.1365-246X.2007.03676.x

Zechar

J.D.

Jordan

T.H.

2010

The area skill score statistic for evaluating earthquake predictability experiments

Pure appl. Geophys.

167

893

–

906

10.1007/s00024-010-0086-0

Zhuang

Ogata

Vere-Jones

2004

Analyzing earthquake clustering features by using stochastic reconstruction

J. geophys. Res.

109

–

Zhuang

Werner

M.J.

Zhou

Harte

Hainzl

2011

Basic models of seismicity: spatiotemporal models

Community Online Resource for Statistical Seismicity Analysis. doi:10.5078/corssa-07487583

10.5078/corssa-07487583

APPENDIX A: ETAS MODEL FORMULATION AND PARAMETERS FITTING

The ETAS formulation and the background model SVP

The ETAS model, introduced by Ogata (1988), is a stochastic point process used to model aftershock sequences and provide probabilistic estimates of earthquake occurrences. The model addresses the limitation of using a single Omori–Utsu law to describe the temporal decay of aftershock sequences. Guo & Ogata (1997) and Ogata et al. (2003) demonstrated that significant aftershocks often generate their own secondary sequences of aftershocks. As a result, the ETAS model treats each aftershock as a potential trigger for additional aftershocks. The occurrence rate at a given time t is derived from the superposition of time-shifted Omori–Utsu decay functions, each weighted by the magnitude of the parent event (Ogata & Zhuang 2006).

The ETAS model is widely used for earthquake forecasting, including various applications in Italy (e.g. Console et al. 2006; Lombardi & Marzocchi 2010; Falcone et al. 2021; Biondini et al. 2023). The temporal dependence in these models is typically represented by a sum of Omori's law decays, initiated at the times of each earthquake occurrence:

$$\begin{eqnarray} f\left( t \right) = \mathop \sum \limits_{i = 1}^n \frac{{H\left( {t - {t}_i} \right)K{e}^{\alpha \left( {{m}_i - {m}_\mathrm{ c}} \right)}}}{{{{\left( {t - {t}_i + c} \right)}}^p}}, \end{eqnarray}$$

(A1)

where K, c, p and |$\alpha $| are free parameters, and |$H( {t - {t}_i} )$| is the Heaviside step function, which equals 1 if |$t - {t}_i > 0$| and 0 otherwise. n in the sum represents the total number of earthquakes considered, with each |$i - $|th term in the sum corresponding to an individual earthquake in the sequence. The earthquake productivity is assumed to be an exponential function of the magnitude of the |$i - $|th parent earthquake |${m}_i$| and of the completeness magnitude |${m}_\mathrm{ c}$|⁠.

The spatial decay of the productivity can be described using various probability density function that represent the nucleation kernel (Ogata 1988; Console et al. 2003; Zhuang et al. 2004, 2011; Lombardi & Marzocchi 2010). In this work, we used the spatial probability density function proposed by Ogata & Zhuang (2006), where the smoothing term is an exponential function of the magnitude:

$$\begin{eqnarray} g\left( {x,y} \right) = \frac{{\left( {q - 1} \right){{\left[ {{D}^2{e}^{\gamma \left( {{m}_i - {m}_\mathrm{ c}} \right)}} \right]}}^{q - 1}\ }}{{\pi {{\left[ {{{\left( {x - {x}_i} \right)}}^2 + {{\left( {y - {y}_i} \right)}}^2 + {D}^2{e}^{\gamma \left( {{m}_i - {m}_\mathrm{ c}} \right)}} \right]}}^q}}, \end{eqnarray}$$

(A2)

where q, D and |$\gamma $| are free parameters.

The frequency magnitude distribution of earthquakes adheres to the Gutenberg–Richter law (1944):

$$\begin{eqnarray} h\left( m \right) = \beta {e}^{ - \beta \left( {m - {m}_\mathrm{ c}} \right)}, \end{eqnarray}$$

(A3)

where |$\beta = b\ {\rm{ln}}10$| and b is a free parameter.

By incorporating a time-invariant background seismicity term |${\lambda }_0( {x,y,m} )$|⁠, the rate density function for the ETAS model is given by:

$$\begin{eqnarray} \lambda \left( {t,x,y,m} \right) = \mu {\lambda }_0\left( {x,y,m} \right) + \left[ {f\left( t \right)g\left( {x,y} \right)h\left( m \right)} \right], \end{eqnarray}$$

(A4)

which explicitly becomes:

$$\begin{eqnarray} \lambda \left( {t,x,y,m} \right) = \mu {\lambda }_0\left( {x,y,m} \right) + \left\{ {\mathop \sum \limits_{i = 1}^n \frac{{H\left( {t - {t}_i} \right)K}}{{{{\left( {t - {t}_i + c} \right)}}^p}}{e}^{\alpha \left( {{m}_i - {m}_\mathrm{ c}} \right)}\frac{{\left( {q - 1} \right){{\left[ {{D}^2{e}^{\gamma \left( {{m}_i - {m}_\mathrm{ c}} \right)}} \right]}}^{q - 1}{\rm{\ }}}}{{\pi {{\left[ {{{\left( {x - {x}_i} \right)}}^2 + {{\left( {y - {y}_i} \right)}}^2 + {D}^2{e}^{\gamma \left( {{m}_i - {m}_\mathrm{ c}} \right)}} \right]}}^q}}\beta {e}^{ - \beta \left( {m - {m}_\mathrm{ c}} \right)}} \right\}. \end{eqnarray}$$

(A5)

The parameter μ represents the ratio between the expected number of independent events of the background seismicity |${\lambda }_0( {x,y,m} )$| and the total number of events.

To describe the background seismicity, we employ the spatially variable poisson (SVP) model. The SVP rate density is given by:

$$\begin{eqnarray} {\lambda }_0\left( {x,y,m} \right) = {\mu }_0\left( {x,y} \right)\beta {e}^{\left[ { - \beta \left( {m - {m}_\mathrm{ c}} \right)} \right]}, \end{eqnarray}$$

(A6)

where |${\mu }_0( {x,y} )$| is the spatial rate density of earthquake with |$m \ge {m}_\mathrm{ c}$|⁠, which is a continuous smoothed function of the geographic location. To assess the spatial rate density, we divide each cell of the tessellation into 16 subcells of side |$l = \frac{{30\sqrt 2 }}{4}$| km. The number of earthquakes |${N}_k$| with magnitude |$m \ge {m}_\mathrm{ c}$| is calculated for each subcell. Each |${N}_k$| value, representative of a single subcell, is then smoothed by a Gaussian filter with correlation distance |${{\it{d}}}_\mathrm{ c}$| and normalized to preserve the total number of events as described in Frankel (1995). For each cell, the smoothed |${N}_k$| is given by

$$\begin{eqnarray} {\tilde{N}}_k = \frac{{{\Sigma }_l{N}_k\exp \left( { - \Delta _{kl}^2/{d}_\mathrm{ c}^2} \right)}}{{{\Sigma }_l\exp \left( { - \Delta _{kl}^2/{d}_\mathrm{ c}^2} \right)}}, \end{eqnarray}$$

(A7)

where |${\Delta }_{kl}$| is the distance between the centre of the |${k}_{{\rm{th}}}$| and the |${l}_{{\rm{th}}}$| cells. To obtain |${\tilde{N}}_k$| in terms of number of events per unit of time and area, it must be divided by the total duration of the earthquake catalogue and by the area of the cell. The spatial density |${\mu }_0( {x,y} )$| at any given point is calculated taking the weighted average of the four nearest surrounding subcells, where the weights are inversely proportional to the distance from each subcell to the point. To determine the optimal |${d}_\mathrm{ c}$| of the SVP background model, we used the method proposed by Console & Murru (2001). This involves dividing the earthquake catalogue into two subcatalogues of approximately equal temporal length. The value of |${d}_\mathrm{ c}$| is then chosen by maximizing the log-likelihood of one subcatalogue using the smoothed seismicity derived from the other subcatalogue. This analysis is performed separately for both subcatalogues, yielding |${d}_{\mathrm{ c}1}$| = 16.0 and |${d}_{\mathrm{ c}2}$| = 13.0. The optimal correlation distance |${d}_\mathrm{ c}$| is given by the average of these two values, resulting in |${d}_\mathrm{ c}$| = 14.5. Given the parameter |${d}_\mathrm{ c}$|⁠, the spatial density of earthquakes |${\mu }_0( {x,y} )$| for the SVP background model can be evaluated for each cell and point in space.

ETAS parameters fitting

The best fit for the free parameters of the ETAS model is achieved by maximizing the log-likelihood of an inhomogeneous Poisson point process, using the data set of earthquakes occurred in the learning period (1990–2004). The log-likelihood function is given by:

$$\begin{eqnarray} \ln L = \mathop \sum \limits_{{t}_i \in \left( {{t}_\mathrm{ a},{t}_\mathrm{ b}} \right);{m}_i \ge {m}_T;\left( {{x}_i,{y}_i} \right) \in R} \ln \lambda \left( {{t}_i,{m}_i,{x}_i,{y}_i} \right) - \mathop \int \limits_{{t}_\mathrm{ a}}^{{t}_\mathrm{ b}} \mathop \int \limits_{{m}_\mathrm{ T}}^{{m}_\mathrm{ u}} \mathop \int\!\!\!\int \limits_R \lambda \left( {t,m,x,y} \right)\ \mathrm{ d}y\ \mathrm{ d}x\ \mathrm{ d}m\ \mathrm{ d}t, \end{eqnarray}$$

(A8)

where |$\lambda ( {{t}_i,{m}_i,{x}_i,{y}_i} )$| is the rate density function of the ETAS model (eq. A5); |$( {{t}_\mathrm{ a},{t}_\mathrm{ b}} )$| represent the fitting period; |${m}_\mathrm{ T}$| and |${m}_\mathrm{ u}$| are the lower and upper bounds of the magnitude range, set to 2.5 and 7.5, respectively; and R is the spatial region used for parameter fitting.

The analysis region R consists of two different tessellations, which form a grid of partially overlapping cells. Consequently, the fitting of the free parameters was performed separately for each tessellation, |${R}_1$| and |${R}_2$|⁠, (see Fig. A1). The b-value is estimated independently for each tessellation using the b⁺ estimator introduced by van der Elst (2021) and recently explored by Tinti & Gasperini (2024). To assess the b-value, we use the seismicity data from the learning period (1990–2004) within each tessellation, resulting in b-values of 1.035 for |${R}_1$| and 1.033 for |${R}_2$|⁠. The parameter q is set to 1.5 according to Lombardi & Marzocchi (2010) showing that the static stress change decreases with epicentral distance as |${r}^{ - 3}$|⁠. Similarly, the parameters k, p, c, |$\alpha $|⁠, d and |$\nu $| have been fitted by maximizing the log-likelihood of earthquakes with magnitude |$m \ge 2.5$| occurred within each individual tessellation, |${R}_1$| and |${R}_2,\ $| respectively, during the period 1990–2004 using the interior-point method (Wright 2004). The standard error (SE) for each parameter is computed as the square root of the corresponding diagonal element of the variance–covariance matrix computed as the inverse of the Hessian matrix for log-likelihood function at the maximum (Ogata 1988; Lolli et al. 2009). The optimal fitted parameters with their standard deviations are reported in Table A1. Additionally, the expected number of events (E) and the log-likelihood (⁠|$\mathrm{ ln}L$|⁠), shown in Table A1 are calculated based on the fit to earthquakes with magnitudes |$m \ge 5.0$|⁠. This approach ensures a more robust fit by initially using a broader range of magnitudes (⁠|$m \ge 2.5$|⁠), as described above, to obtain a more reliable parameter estimation with a larger data set. Subsequently, the log-likelihood and the expected number of events for earthquakes with |$m \ge 5.0$| are recalculated with the previously fitted parameters,. This two step-process leverages the extensive data set for parameter estimation and then focuses on higher-magnitude events for assessing model performance.

$Tessellations R1 (left) and R2 (right) used to fit the optimal parameters of the ETAS model. Each cell has a side of $30\sqrt 2 {\rm{\ km}}$. See the main text for details.$

Figure A1.

Tessellations R₁ (left) and R₂ (right) used to fit the optimal parameters of the ETAS model. Each cell has a side of |$30\sqrt 2 {\rm{\ km}}$|⁠. See the main text for details.

Table A1.

List of estimated best fit parameters for the ETAS model.

	Tessellation
Parameter	\|${R}_1$\|	\|${R}_2$\|
b	1.035	1.033
k	0.0216 \|$\pm\, 0.0006$\|	0.0228 \|$\pm $\| 0.0006
c	0.0034 \|$\pm $\| 0.0001	0.0034 \|$\pm $\| 0.0001
p	1.0914 \|$\pm $\| 0.0042	1.0884 \|$\pm $\| 0.0039
d	1.0927 \|$\pm $\| 0.0340	1.0735 \|$\pm $\| 0.0320
\|$\gamma $\|	0.4499 \|$\pm $\| 0.0565	0.4925 \|$\pm $\| 0.0525
\|$\alpha $\|	1.1544 \|$\pm $\| 0.0314	1.1714 \|$\pm $\| 0.0299
q	1.5*	1.5*
\|${f}_\mathrm{ r}$\|	0.4560 \|$\pm $\| 0.0076	0.4410 \|$\pm $\| 0.0074
E	19.9814	21.2766
\|$\mathrm{ ln}L$\|	326.2935	327.3842

	Tessellation
Parameter	\|${R}_1$\|	\|${R}_2$\|
b	1.035	1.033
k	0.0216 \|$\pm\, 0.0006$\|	0.0228 \|$\pm $\| 0.0006
c	0.0034 \|$\pm $\| 0.0001	0.0034 \|$\pm $\| 0.0001
p	1.0914 \|$\pm $\| 0.0042	1.0884 \|$\pm $\| 0.0039
d	1.0927 \|$\pm $\| 0.0340	1.0735 \|$\pm $\| 0.0320
\|$\gamma $\|	0.4499 \|$\pm $\| 0.0565	0.4925 \|$\pm $\| 0.0525
\|$\alpha $\|	1.1544 \|$\pm $\| 0.0314	1.1714 \|$\pm $\| 0.0299
q	1.5*	1.5*
\|${f}_\mathrm{ r}$\|	0.4560 \|$\pm $\| 0.0076	0.4410 \|$\pm $\| 0.0074
E	19.9814	21.2766
\|$\mathrm{ ln}L$\|	326.2935	327.3842

Parameter fitted separately (see text of Appendix A for more details).

Table A1.