The Saturated Pairwise Interaction Gibbs Point Process as a Joint Species Distribution Model

Short-range (first three rows, in blue) and medium-range (last two rows, in red) potential functions [Colour figure can be viewed at https://dbpia.nl.go.kr]

Potential function	Definition	Shape
Exponential	$\exp [- r \ln (2) / R^{S}]$
Square bump	$1 - \exp [- {(R^{S})}^{2} \ln (2) / r^{2}]$
Step	$1_{[0, R^{S}]} (r)$
Normal	$\exp [- \frac{4 {(r - (R^{M} + R^{L}) / 2)}^{2} \ln (2)}{{(R^{L} - R^{M})}^{2}}]$
Geyer	$1_{[R^{M}, R^{L}]} (r)$

Potential function	Definition	Shape
Exponential	$\exp [- r \ln (2) / R^{S}]$
Square bump	$1 - \exp [- {(R^{S})}^{2} \ln (2) / r^{2}]$
Step	$1_{[0, R^{S}]} (r)$
Normal	$\exp [- \frac{4 {(r - (R^{M} + R^{L}) / 2)}^{2} \ln (2)}{{(R^{L} - R^{M})}^{2}}]$
Geyer	$1_{[R^{M}, R^{L}]} (r)$

Notes: In the table below, $R^{S}$ ⁠, $R^{M}$ and $R^{L}$ are, respectively, the short-range, medium-range and long-range interaction radii.

TABLE 1

Short-range (first three rows, in blue) and medium-range (last two rows, in red) potential functions [Colour figure can be viewed at https://dbpia.nl.go.kr]

Potential function	Definition	Shape
Exponential	$\exp [- r \ln (2) / R^{S}]$
Square bump	$1 - \exp [- {(R^{S})}^{2} \ln (2) / r^{2}]$
Step	$1_{[0, R^{S}]} (r)$
Normal	$\exp [- \frac{4 {(r - (R^{M} + R^{L}) / 2)}^{2} \ln (2)}{{(R^{L} - R^{M})}^{2}}]$
Geyer	$1_{[R^{M}, R^{L}]} (r)$

Potential function	Definition	Shape
Exponential	$\exp [- r \ln (2) / R^{S}]$
Square bump	$1 - \exp [- {(R^{S})}^{2} \ln (2) / r^{2}]$
Step	$1_{[0, R^{S}]} (r)$
Normal	$\exp [- \frac{4 {(r - (R^{M} + R^{L}) / 2)}^{2} \ln (2)}{{(R^{L} - R^{M})}^{2}}]$
Geyer	$1_{[R^{M}, R^{L}]} (r)$

Notes: In the table below, $R^{S}$ ⁠, $R^{M}$ and $R^{L}$ are, respectively, the short-range, medium-range and long-range interaction radii.

2.3 Model

Our model is parametrised by the following quantities.

1.
An intercept vector ${(β_{1, 0}, \dots, β_{p, 0})}^{T} \in ℝ^{p}$ which is interpreted as the log-intensities of the different species, if there were no interactions.
2.
Environmental covariates $X_{1}, \dots, X_{K}$ which are assumed to be bounded.
3.
For $1 \leq i \leq p$ and $1 \leq k \leq K$ ⁠, a coefficient $β_{i, k}$ that represents the response of species $i$ to environmental covariate $k$ ⁠.
4.
A function $u (z, {(ω ∖ {z})}_{i_{2}})$ representing the short-range interactions between species $i_{2}$ in $ω$ and an individual $z = (x, i_{1}, m)$ of species $i_{1}$ with mark $m$ at location $x$ ⁠.
5.
A function $v (z, {(ω ∖ {z})}_{i_{2}})$ that models the medium-range interactions between species $i_{2}$ in $ω$ and an individual $z$ as in (4).
6.
For $1 \leq i_{1}, i_{2} \leq p$ ⁠, a coefficient $α_{i_{1}, i_{2}}$ which represents the magnitude of short-range interactions between species $i_{1}$ and species $i_{2}$ ⁠. Positive values of $α_{i_{1}, i_{2}}$ correspond to attraction between species $i_{1}$ and species $i_{2}$ while negative values are associated with repulsion. Note that it is assumed that $α$ is symmetric, in the sense that $α_{i_{1}, i_{2}} = α_{i_{2}, i_{1}}$ ⁠.
7.
For $1 \leq i_{1}, i_{2} \leq p$ ⁠, a symmetric coefficient $γ_{i_{1}, i_{2}}$ which is the magnitude of medium-range interactions between each pair of species $i_{1}$ and species $i_{2}$ ⁠. As in (6), we interpret the sign of $γ_{i_{1}, i_{2}}$ as indicating either attraction or repulsion.

The model is specified by its density, defined by

\begin{align} j (ω) & = C \exp [\sum_{(x, i, m) \in ω} (β_{i, 0} + \sum_{k = 1}^{K} β_{i, k} X_{k} (x)) \\ + \sum_{i_{2} = 1}^{p} \sum_{z = (x_{1}, i_{1}, m_{1}) \in ω} α_{i_{1}, i_{2}} u (z, {(ω ∖ {z})}_{i_{2}}) + \sum_{i_{2} = 1}^{p} \sum_{z = (x_{1}, i_{1}, m_{1}) \in ω} γ_{i_{1}, i_{2}} v (z, {(ω ∖ {z})}_{i_{2}})], \end{align}

(1)

for $ω \in 𝒩$ ⁠, and where $C > 0$ is a normalising constant. The Papangelou conditional intensity $π$ directly follows from (1) by the formula $π ((x, i, m), ω) : = j (ω \cup {(x, i, m)}) / j (ω)$ for $(x, i, m) \notin ω$ ⁠. We compute $π$ explicitly in Appendix S1.

As mentioned above, the function $u (z, {(ω ∖ {z})}_{i_{2}})$ is interpreted as the saturated sum of short-range interactions between species $i_{2}$ in $ω$ and an individual $z = (x_{1}, i_{1}, m_{1})$ of species $i_{1}$ at $x_{1}$ and with mark $m_{1}$ ⁠. Letting $R_{i_{1}, i_{2}}^{S}$ denote the short range interaction distance between species $i_{1}$ and $i_{2}$ ⁠, we propose to define $u$ as either

u_{unmarked} ((x_{1}, i_{1}, m_{1}), ω_{i_{2}}) : = \max_{η \in S (ω_{i_{2}}, N)} \sum_{(x_{2}, i_{2}, m_{2}) \in η} φ_{R_{i_{1}, i_{2}}^{S}} (‖ x_{1} - x_{2} ‖),

(2)

or, taking into account marks,

u_{marked} ((x_{1}, i_{1}, m_{1}), ω_{i_{2}}) : = \max_{η \in S (ω_{i_{2}}, N)} \sum_{(x_{2}, i_{2}, m_{2}) \in η} φ_{R_{i_{1}, i_{2}}^{S}} (\frac{2 ‖ x_{1} - x_{2} ‖}{m_{1} + m_{2}}),

(3)

where $N$ is called the saturation parameter, and the set of saturated configurations is defined as $S (ω, N) = {η \subset ω : | η | \leq N}$ ⁠. The quantity $u_{unmarked} ((x_{1}, i_{1}, m_{1}), ω_{i_{2}})$ consists in the sum of the $N$ largest pairwise interactions between the individual at $x_{1}$ and individuals of species $i_{2}$ in $ω$ ⁠. Heuristically, the larger this quantity, the more short-range interactions there are between the individual at $x_{1}$ and species $i_{2}$ ⁠. Our interpretation of the saturation parameter $N$ is similar to that of Rajala et al. (2018) who write that $N$ ‘reproduces the feature that the neighbourhood must eventually saturate with individuals as resources are finite’.

In the first of our two definitions (2), the distances $R_{i_{1}, i_{2}}^{S}$ are interpreted as typical short-range interaction distances between individuals of species $i_{1}$ and $i_{2}$ ⁠. This contrasts with the second definition (3), in which the distances $R_{i_{1}, i_{2}}^{S}$ are measured as a proportion of the average marks of interacting individuals. One could consider other choices involving marks instead of (3), for example interactions proportional to the absolute difference of marks, thereby modelling fiercer competition between dissimilar individuals.

Similarly, letting $R_{i_{1}, i_{2}}^{M}$ (respectively $R_{i_{1}, i_{2}}^{L}$ ⁠) be the medium- (respectively long-) range interaction distances between species $i_{1}$ and $i_{2}$ ⁠, we define

v_{unmarked} ((x_{1}, i_{1}, m_{1}), ω_{i_{2}}) : = \max_{η \in S (ω_{i_{2}}, N)} \sum_{(x_{2}, i_{2}, m_{2}) \in η} ψ_{R_{i_{1}, i_{2}}^{M} \leftrightarrow R_{i_{1}, i_{2}}^{L}} (‖ x_{1} - x_{2} ‖),

(4)

as well as

v_{marked} ((x_{1}, i_{1}, m_{1}), ω_{i_{2}}) : = \max_{η \in S (ω_{i_{2}}, N)} \sum_{(x_{2}, i_{2}, m_{2}) \in η} ψ_{R_{i_{1}, i_{2}}^{M} \leftrightarrow R_{i_{1}, i_{2}}^{L}} (\frac{2 ‖ x_{1} - x_{2} ‖}{m_{1} + m_{2}}),

(5)

where the set of saturated configurations $S (ω_{i_{2}}, N)$ was defined above. The parameters $R_{i_{1}, i_{2}}^{M}$ and $R_{i_{1}, i_{2}}^{L}$ have the same interpretation as $R_{i_{1}, i_{2}}^{S}$ ⁠, but relate to what we call medium- and long-range interactions instead of short-range ones.

2.4 Saturated pairwise interaction Gibbs point process

We call our model defined by (1) a ‘saturated pairwise interaction Gibbs point process’, and the aim of this section is to make explicit why we have settled on this name. As an aside, although to the best of our knowledge saturated pairwise interaction Gibbs point processes have never been described in the scientific literature, spatstat has implemented internally what they call pairsat.family and describe as a ‘Saturated Pairwise Interaction Point Process Family’.

Rewriting the model's density (1), for example in the marked case (3) and (5), we have

\begin{align} j (ω) & = C \prod_{(x, i, m) \in ω} \exp [β_{i, 0} + \sum_{k = 1}^{K} β_{i, k} X_{k} (x)] \\ \times \prod_{i_{2} = 1}^{p} \prod_{z = (x_{1}, i_{1}, m_{1}) \in ω} \max_{η \in S ({(ω ∖ {z})}_{i_{2}}, N)} \prod_{(x_{2}, i_{2}, m_{2}) \in η} \exp [α_{i_{1}, i_{2}} φ_{R_{i_{1}, i_{2}}^{S}} (\frac{2 ‖ x_{1} - x_{2} ‖}{m_{1} + m_{2}})] \\ \times \prod_{i_{2} = 1}^{p} \prod_{z = (x_{1}, i_{1}, m_{1}) \in ω} \max_{η \in S ({(ω ∖ {z})}_{i_{2}}, N)} \prod_{(x_{2}, i_{2}, m_{2}) \in η} \exp [γ_{i_{1}, i_{2}} ψ_{R_{i_{1}, i_{2}}^{M} \leftrightarrow R_{i_{1}, i_{2}}^{L}} (\frac{2 ‖ x_{1} - x_{2} ‖}{m_{1} + m_{2}})] . \end{align}

When $N = \infty$ ⁠, this is precisely a pairwise interaction Gibbs point process (see e.g., Møller & Waagepetersen, 2004, section 6.2) with inhomogeneous intensity for species $i$ given by

\exp [β_{i, 0} + \sum_{k = 1}^{K} β_{i, k} X_{k} (x)], x \in W,

and pairwise interaction functions

\exp [2 α_{i_{1}, i_{2}} φ_{R_{i_{1}, i_{2}}^{S}} (\frac{2 ‖ x_{1} - x_{2} ‖}{m_{1} + m_{2}}) + 2 γ_{i_{1}, i_{2}} ψ_{R_{i_{1}, i_{2}}^{M} \leftrightarrow R_{i_{1}, i_{2}}^{L}} (\frac{2 ‖ x_{1} - x_{2} ‖}{m_{1} + m_{2}})],

(6)

(the factor 2 in front of $α_{i_{1}, i_{2}}$ and $γ_{i_{1}, i_{2}}$ respectively, arises because for any pair $x_{1}$ ⁠, $x_{2}$ of locations in $ω$ ⁠, our model double-counts the pairwise interaction between $x_{1}$ and $x_{2}$ ⁠). Equation (6) above makes clear the joint effect of the short and medium range potentials, as well as the effect of the magnitude and sign of the coefficients $α_{i_{1}, i_{2}}$ and $γ_{i_{1}, i_{2}}$ ⁠. A plot to illustrate this effect is provided in Figure 1.

Two potential functions summed together, for α=1, γ=−1/2, exponential short-range potential, and normal medium-range potential (see Table 1), when N=∞.We plot the short-range potential in densely dashed red (- - - - -), the medium-range potential in loosely dashed blue (− − −) and the sum of the two in solid purple (______). [Colour figure can be viewed at https://dbpia.nl.go.kr]

FIGURE 1

Two potential functions summed together, for $α = 1$ ⁠, $γ = - 1 / 2$ ⁠, exponential short-range potential, and normal medium-range potential (see Table 1), when $N = \infty$ ⁠.We plot the short-range potential in densely dashed red (- - - - -), the medium-range potential in loosely dashed blue (⁠ $- - -$ ⁠) and the sum of the two in solid purple (______). [Colour figure can be viewed at https://dbpia.nl.go.kr]

When $N$ is finite, the model only accounts for interactions between each individual and its $N$ closest neighbours. This explains our use of the adjective ‘saturated’ to qualify our model.

2.5 Some cases of interest

2.5.1 Non-interacting model

Assume that $α_{i_{1}, i_{2}} = γ_{i_{1}, i_{2}} = 0$ ⁠, so that there is neither attraction nor repulsion. Our general model (1) simplifies to

j (ω) = C \prod_{(x, i, m) \in ω} \exp [β_{i, 0} + \sum_{k = 1}^{K} β_{i, k} X_{k} (x)],

which can be seen (see e.g., Daley & Vere,-Jones 2003) to be a multi-type inhomogeneous Poisson point process with intensity for the ith type given by

\exp [β_{i, 0} + \sum_{k = 1}^{K} β_{i, k} X_{k} (x)], x \in W .

In other words, each of the species is modelled independently by inhomogeneous Poisson point processes with log-intensities driven linearly by the environmental covariates.

2.5.2 Multivariate Geyer model

We assume now that $β_{i, k} = 0$ and $γ_{i_{1}, i_{2}} = 0$ ⁠. We further assume that the short range interaction potential is the step potential from Table 1. The density in the unmarked case (2) is equal to

\begin{align} j (ω) & = C \exp [\sum_{i = 1}^{p} | ω_{i} | β_{i, 0} + \sum_{i_{2} = 1}^{p} \sum_{z = (x_{1}, i_{1}, m_{1}) \in ω} α_{i_{1}, i_{2}} \\ \times \min (N, | {(x_{2}, i_{2}, m_{2}) \in {(ω ∖ {z})}_{i_{2}} : ‖ x_{1} - x_{2} ‖ \leq R_{i_{1}, i_{2}}^{S}} |)], \end{align}

which is an instance of the class of models used in Rajala et al. (2018).

3 INFERENCE

3.1 Logistic regression of Baddeley et al

In this subsection, we prove that the assumptions of Baddeley et al. (2014) hold, which ensures that their logistic regression can be used to do inference for our model. This method enables us to estimate the parameters $β$ ⁠, $α$ and $γ$ ⁠.

The density of the model defined in (1) can be written as

j (ω) = C \exp (θ^{⊺} t (ω)) .

(7)

In the equation above, we have defined the parameter vector $θ : = {(θ_{0}^{T}, θ_{1}^{T}, θ_{2}^{T}, θ_{3}^{T})}^{T}$ ⁠, where $θ_{0} : = {(β_{1, 0}, \dots, β_{p, 0})}^{T}$ ⁠, $θ_{1} : = {(β_{1, 1}, \dots, β_{1, n}, \dots, β_{p, 1}, \dots, β_{p, n})}^{T}$ ⁠, $θ_{2} : = {(α_{1, 1}, \dots, α_{p, p})}^{T}$ and $θ_{3} : = {(γ_{1, 1}, \dots, γ_{p, p})}^{T}$ ⁠.

In addition, we have set $t (ω) : = {(t_{0} {(ω)}^{T}, t_{1} {(ω)}^{T}, t_{2} {(ω)}^{T}, t_{3} {(ω)}^{T})}^{T}$ ⁠, where

t_{0} (ω) : = {(| ω_{1} |, \dots, | ω_{p} |)}^{T}, t_{1} (ω) : = {(s_{1} {(ω)}^{T}, \dots, s_{p} {(ω)}^{T})}^{T},

t_{2} (ω) : = {(s_{1, 1} {(ω)}^{T}, \dots, s_{p, p} {(ω)}^{T})}^{T}, t_{3} (ω) : = {({\tilde{s}}_{1, 1} {(ω)}^{T}, \dots, {\tilde{s}}_{p, p} {(ω)}^{T})}^{T},

for

\begin{align} s_{i} (ω) : = {(\sum_{(x, i, m) \in ω} X_{1} (x), \dots, \sum_{(x, i, m) \in ω} X_{K} (x))}^{T}, \\ s_{i, j} (ω) : = \sum_{z = (x, i, m) \in ω} u (z, {(ω ∖ {z})}_{j}), \end{align}

and

{\tilde{s}}_{i, j} (ω) : = \sum_{z = (x, i, m) \in ω} v (z, {(ω ∖ {z})}_{j}) .

Under this new compact notation (7), the Papangelou conditional intensity at $ω \in 𝒩$ and for an individual of species $i \in {1, \dots, p}$ with mark $m$ located at $x \in W$ is readily computed as

π ((x, i, m), ω) = \exp (θ^{⊺} t ((x, i, m), ω)),

(8)

where $t ((x, i, m), ω) : = t (ω \cup {(x, i, m)}) - t (ω)$ ⁠.

The fact that we can write the density and the Papangelou conditional intensities respectively as (7) and (8) guarantees that the assumptions of Baddeley et al. (2014) hold. Given an observed configuration $ω$ ⁠, the logistic regression technique of Baddeley et al. (2014) can be summarised as:

1.
sample a set of dummy points $D$ with known (fixed) intensity, denoted by $ρ$ ⁠;
2.
compute $t (z, ω ∖ {z})$ defined in (8) as $z$ ranges over $ω \cup D$ ⁠;
3.
obtain $θ$ defined above by a logistic regression with response variable 1 when $z = (x, i, m) \in ω$ and 0 otherwise, input variables $t (z, ω ∖ {z})$ and offset term $- \log (ρ (x))$ ⁠.

3.2 Variance-covariance matrix

Our model belongs to the class of Gibbs point processes and as such, SEs and confidence intervals are not straightforward to produce. Indeed, it has been shown in Baddeley et al. (2014) that, although the SEs corresponding to the logistic regression of the previous section are a good approximation, they are in general not accurate. Instead, asymptotic confidence intervals can be estimated by the technique introduced in Coeurjolly and Rubak (1998) (see also section 4 and the appendices of Baddeley et al., 2014). We will not repeat here the details of the construction of the asymptotic variance-covariance matrix, but we draw the reader's attention to the fact that there appears to be multiple typographic errors in equation (A4) of Baddeley et al. (2014). We refer to our package described in Appendix S1 for the details of the implementation.

3.3 Estimation of the other parameters

Section 3.1 dealt with the estimation of $β$ ⁠, $α$ and $γ$ ⁠. It remains to explain how to choose the saturation parameter $N$ ⁠, the shape of the potential functions, as well as the interaction radii between and within species on the short, medium and long ranges.

We shall often fix the potential shapes in order to simplify the analysis. Regarding the saturation parameter $N$ ⁠, in some cases, we shall keep it fixed to 2. This assumption implies that the probability of a new individual being at a given location depends only on its two neighbours with which it interacts most, disregarding other individuals. Another option would be to follow the last paragraph of section 2.2 in Rajala et al. (2018) and set $N$ automatically depending on the observed abundances.

In Rajala et al. (2018), the interaction radii are fixed a priori, and they write as their justification ‘in data analysis one usually has a priori information on relevant ranges (e.g., Uriarte et al., 2004)’. Although a priori fixing these parameters has been done in some of our analyses, we also wanted a straightforward statistical procedure to estimate the interaction radii. This has allowed us to fit the model to different data sets without prior knowledge of the characteristics of the species involved.

Our basic idea is to calibrate the model for various values of the interaction radii, saturation parameters, and potential shapes, and choose the set of values which performs best according to some measure of goodness of fit. Since one of our goals is to apply the model to large-scale datasets, an important requirement for the measure of goodness-of-fit is that it be relatively fast to compute. Consequently, we have refrained from using computationally heavy techniques like that of Møller and Berthelsen (2012) or an explicit computation of the likelihood as in section 8.3.2 of Møller and Waagepetersen (2004). Instead, we propose as a measure of the goodness of fit the pseudo-likelihood corresponding to the logistic regression in Section 3.1. More explicitly, we choose values of the saturation parameter and interaction radii which maximise the logistic pseudo-likelihood.

4 SIMULATION

4.1 Coupling from the past

In some cases, it is possible to use the ‘coupling from the past’ algorithm (sometimes called ‘perfect simulation’ algorithm) to sample from our point process, see section 11 of Møller and Waagepetersen (2004). Contrary to other simulation algorithms, the ‘coupling from the past’ algorithm is not approximate, and produces samples from the actual point process. In order to apply such an algorithm in practice, one needs to prove that its Papangelou conditional intensity is locally stable, that is, that there exists a function $h$ such that $π ((x, i, m), ω) \leq h (x)$ almost everywhere. The following Proposition 1 ensures that our model is locally stable under some additional hypotheses. We define $x^{+} : = \max (x, 0)$ for any real number $x$ ⁠.

Proposition 1
Assuming that for any $i_{1}, i_{2}$ ⁠, $γ_{i_{1}, i_{2}} \leq 0$ ⁠, we have
$π ((x, i, m), ω) \leq h_{1} (x, i),$
for almost any $x \in W$ ⁠, $1 \leq i \leq p$ ⁠, $m \in ℝ$ and $ω \in 𝒩$ ⁠, and where
$h_{1} (x, i) : = \exp [β_{i, 0} + \sum_{k = 1}^{K} β_{i, k} X_{k} (x) + 6 N \sum_{j = 1}^{p} α_{i, j}^{+}] .$

Proof
The proof is a straightforward consequence of Lemma 1 in Appendix S1.

Given Proposition 1 above, we shall often work under the assumption

(H) γ_{i_{1}, i_{2}} \leq 0, 1 \leq i_{1}, i_{2} \leq p,

which is to say that none of the medium-range interactions are attractive. Under (H), Proposition 1 ensures that the ‘coupling from the past’ algorithm can be applied. The details of how the algorithm applies to our setting are provided in Section 3 of Appendix S1.

4.2 Metropolis–Hastings algorithm

Although the algorithm introduced in the previous subsection is extremely powerful, it has two disadvantages. First, it is sometimes slow, and for some values of the parameters, it does not converge in a reasonable time. Second, it requires the additional hypothesis (H) which we would like to relax in some instances. As such, in some cases, we will fall back on the unconditional Metropolis–Hastings algorithm, see algorithm 7.4 of Møller and Waagepetersen (2004). There are a series of possible variations of the algorithm, see for example remark 7.6 of Møller and Waagepetersen (2004) for a specialisation to the locally stable setting.

Since we aim for a version of the algorithm which can be applied to simulate from our model in all settings, we shall choose, in the notation of Møller and Waagepetersen (2004), a probability of birth equal to $1 / 2$ ⁠, uniformly distributed births $q_{b} (\cdot) = 1_{W} (\cdot) / | W |$ ⁠, and a probability $1 / 2$ of a uniformly distributed death distributed according to $q_{d} (\cdot, ω) = 1_{ω} (\cdot) / | ω |$ ⁠, where $ω \in 𝒩$ ⁠.

5 NUMERICAL SIMULATIONS

5.1 Simulation study

We start with a simulation study involving two species. This ensures that the number of parameters is tractable, while still demonstrating that the ‘coupling from the past’ algorithm and the fitting procedure are working as expected. We ran simulation studies involving significantly more species, and we have not observed any decrease in performance. We report the results of a seven species study in Appendix S1. In this first numerical experiment, we consider a ‘saturated pairwise interaction Gibbs point process’ on the square region $W = {[- 1, 1]}^{2}$ ⁠, consisting of $p = 2$ species, with no marks, and whose distribution is driven by two geospatial covariates, $X_{1} (x, y) = x$ and $X_{2} (x, y) = y$ ⁠. We consider uniform short-range interaction radii of $R^{S} = 0.05$ ⁠, medium-range interaction radii of $R^{M} = 0.07$ and long-range interaction radii of $R^{L} = 0.12$ ⁠. The rest of the parameters are given by $β_{0}^{T} = (2.5, 2)$ ⁠, $β_{1}^{T} = (2, 2.5)$ (corresponding to $X_{1}$ ⁠), $β_{2}^{T} = (1, 1.5)$ (corresponding to $X_{2}$ ⁠), and

α = (\begin{matrix} - 0.2 & 0.1 \\ 0.1 & - 0.6 \end{matrix}), γ = (\begin{matrix} - 0.6 & - 0.3 \\ - 0.3 & 0 \end{matrix}) .

We set the saturation parameter $N$ to 2, take as the short-range potential the square bump function, and choose the normal medium-range potential, see Table 1. In order to illustrate our experiment, we plot on the left of Figure 2 a typical sample from this point process.

FIGURE 2

Typical samples considered in our numerical experiments. On the left, a sample from the point process considered in Section 5.1, in the middle, a sample from the point process considered in Section 5.2 and finally on the right, a sample from the point process considered in Section 5.3. [Colour figure can be viewed at https://dbpia.nl.go.kr]

We sampled 1000 independent draws of this point process. Since the assumption (H) from Section 4.1 is satisfied and the simulation procedure is reasonably fast for these parameters, these draws are sampled by the ‘coupling from the past’ algorithm. The saturation parameter, interaction distances and interaction shapes were set to their true values. We then fit each of the samples by the logistic regression technique from Section 3.1, and produced asymptotic confidence intervals according to Section 3.2. The results are presented in Table 2. Our results are satisfying, showing good mean estimates over only 1000 samples, along with coverage probabilities with a mean and median of 95%.

TABLE 2

Parameter estimates & coverage probabilities

Parameter	True value	Mean	Median	RMSE	Coverage prob.
$β_{1, 0}$	2.50	2.50	2.51	0.334	0.94
$β_{2, 0}$	2.00	1.97	1.98	0.341	0.94
$β_{1, 1}$	2.00	2.14	2.10	0.575	0.95
$β_{2, 1}$	2.50	2.67	2.63	0.567	0.97
$β_{1, 2}$	1.00	1.08	1.04	0.418	0.96
$β_{2, 2}$	1.50	1.62	1.58	0.448	0.95
$α_{1, 1}$	$- 0.20$	$- 0.53$	$- 0.34$	1.12	0.95
$α_{1, 2}$	0.10	0.092	0.11	0.273	0.95
$α_{2, 2}$	$- 0.60$	$- 0.82$	$- 0.75$	0.562	0.95
$γ_{1, 1}$	$- 0.60$	$- 0.69$	$- 0.66$	0.397	0.96
$γ_{1, 2}$	$- 0.30$	$- 0.31$	$- 0.30$	0.179	0.96
$γ_{2, 2}$	0.00	$- 0.018$	$- 0.023$	0.0239	0.94

Parameter	True value	Mean	Median	RMSE	Coverage prob.
$β_{1, 0}$	2.50	2.50	2.51	0.334	0.94
$β_{2, 0}$	2.00	1.97	1.98	0.341	0.94
$β_{1, 1}$	2.00	2.14	2.10	0.575	0.95
$β_{2, 1}$	2.50	2.67	2.63	0.567	0.97
$β_{1, 2}$	1.00	1.08	1.04	0.418	0.96
$β_{2, 2}$	1.50	1.62	1.58	0.448	0.95
$α_{1, 1}$	$- 0.20$	$- 0.53$	$- 0.34$	1.12	0.95
$α_{1, 2}$	0.10	0.092	0.11	0.273	0.95
$α_{2, 2}$	$- 0.60$	$- 0.82$	$- 0.75$	0.562	0.95
$γ_{1, 1}$	$- 0.60$	$- 0.69$	$- 0.66$	0.397	0.96
$γ_{1, 2}$	$- 0.30$	$- 0.31$	$- 0.30$	0.179	0.96
$γ_{2, 2}$	0.00	$- 0.018$	$- 0.023$	0.0239	0.94

TABLE 2

Parameter estimates & coverage probabilities

Parameter	True value	Mean	Median	RMSE	Coverage prob.
$β_{1, 0}$	2.50	2.50	2.51	0.334	0.94
$β_{2, 0}$	2.00	1.97	1.98	0.341	0.94
$β_{1, 1}$	2.00	2.14	2.10	0.575	0.95
$β_{2, 1}$	2.50	2.67	2.63	0.567	0.97
$β_{1, 2}$	1.00	1.08	1.04	0.418	0.96
$β_{2, 2}$	1.50	1.62	1.58	0.448	0.95
$α_{1, 1}$	$- 0.20$	$- 0.53$	$- 0.34$	1.12	0.95
$α_{1, 2}$	0.10	0.092	0.11	0.273	0.95
$α_{2, 2}$	$- 0.60$	$- 0.82$	$- 0.75$	0.562	0.95
$γ_{1, 1}$	$- 0.60$	$- 0.69$	$- 0.66$	0.397	0.96
$γ_{1, 2}$	$- 0.30$	$- 0.31$	$- 0.30$	0.179	0.96
$γ_{2, 2}$	0.00	$- 0.018$	$- 0.023$	0.0239	0.94

Parameter	True value	Mean	Median	RMSE	Coverage prob.
$β_{1, 0}$	2.50	2.50	2.51	0.334	0.94
$β_{2, 0}$	2.00	1.97	1.98	0.341	0.94
$β_{1, 1}$	2.00	2.14	2.10	0.575	0.95
$β_{2, 1}$	2.50	2.67	2.63	0.567	0.97
$β_{1, 2}$	1.00	1.08	1.04	0.418	0.96
$β_{2, 2}$	1.50	1.62	1.58	0.448	0.95
$α_{1, 1}$	$- 0.20$	$- 0.53$	$- 0.34$	1.12	0.95
$α_{1, 2}$	0.10	0.092	0.11	0.273	0.95
$α_{2, 2}$	$- 0.60$	$- 0.82$	$- 0.75$	0.562	0.95
$γ_{1, 1}$	$- 0.60$	$- 0.69$	$- 0.66$	0.397	0.96
$γ_{1, 2}$	$- 0.30$	$- 0.31$	$- 0.30$	0.179	0.96
$γ_{2, 2}$	0.00	$- 0.018$	$- 0.023$	0.0239	0.94

5.2 Sensitivity analysis

In this experiment, we study how sensitive our calibration is to mis-specified values of the interaction radii and the saturation parameter $N$ ⁠. We consider a ‘saturated pairwise interaction Gibbs point process’ on $W = {[0, 1]}^{2}$ ⁠, consisting in $p = 2$ species, with no marks, and whose distribution is driven by a single environmental covariate $X_{1} (x, y) = x$ ⁠. We assume that the two species interact over different ranges, and that their distribution is characterised by $β_{0}^{T} = (4, 3.5)$ ⁠, $β_{1}^{T} = (1.5, 2)$ ⁠, and

R^{S} = (\begin{matrix} 0.04 & 0.06 \\ 0.06 & 0.03 \end{matrix}), α = (\begin{matrix} 0.4 & - 0.3 \\ - 0.3 & 0.4 \end{matrix}), γ = (\begin{matrix} 0 & 0 \\ 0 & 0 \end{matrix}) .

We take as the short-range potential the square bump function from Table 1, and choose a saturation parameter $N = 2$ ⁠.

Although the assumption (H) from Section 4.1 is satisfied, it is faster to sample 100 independent draws of this point process by the Metropolis–Hastings algorithm of Section 4.2, with 100,000 steps. In order to give a sense of the type of point process we are working with, we show in the middle of Figure 2 a typical sample.

In our experiment, we first fit each of the samples by mis-specifying the short-range interaction radii $R^{S}$ ⁠, then assumed a mis-specification of the saturation parameter $N$ ⁠. More specifically, we consider two mis-specifications of the interaction radii, namely

R_{-}^{S} = R^{S} - (\begin{matrix} 0.02 & 0.02 \\ 0.02 & 0.02 \end{matrix}) and R_{+}^{S} = R^{S} + (\begin{matrix} 0.02 & 0.02 \\ 0.02 & 0.02 \end{matrix}) .

We also consider an under-specified saturation parameter $N_{-} = 1$ and an over-specified $N_{+} = 4$ ⁠.

The results for the interaction radii mis-specification are presented in Table 3. The main insight gained from this part of the experiment is that the estimates of the parameters are fairly accurate even when the interaction radii have been mis-specified by around 50%. This is largely due to the shape of our short-range potential function which is flat around the origin, and the high intensity of points in each sample relative to the saturation parameter $N$ ⁠. In addition, we remark that the estimates are notably better when the radius is mis-specified as $R_{+}^{S}$ ⁠. Our interpretation of this fact is that when the user chooses an interaction radius which is larger than the true one, the same broad pairwise interactions are accounted for. When the radius is under-specified instead, some pairwise interaction are strongly discounted, which biases the estimates of some of the parameters.

TABLE 3

Mis-specification of the interaction radii as either an under-specification $R_{-}^{S}$ or an over-specification $R_{+}^{S}$

		Under-specification		Over-specification
Parameter	True value	Mean	Coverage prob.	Mean	Coverage prob.
$β_{1, 0}$	4.0	4.16	0.80	3.77	0.92
$β_{2, 0}$	3.5	3.41	0.90	3.51	0.97
$β_{1, 1}$	1.5	1.64	0.93	1.64	0.93
$β_{2, 1}$	2.0	2.35	0.87	2.06	0.93
$α_{1, 1}$	0.4	0.38	0.92	0.44	0.97
$α_{1, 2}$	$- 0.3$	$- 0.38$	0.87	$- 0.31$	0.97
$α_{2, 2}$	0.4	0.52	0.90	0.30	0.93

		Under-specification		Over-specification
Parameter	True value	Mean	Coverage prob.	Mean	Coverage prob.
$β_{1, 0}$	4.0	4.16	0.80	3.77	0.92
$β_{2, 0}$	3.5	3.41	0.90	3.51	0.97
$β_{1, 1}$	1.5	1.64	0.93	1.64	0.93
$β_{2, 1}$	2.0	2.35	0.87	2.06	0.93
$α_{1, 1}$	0.4	0.38	0.92	0.44	0.97
$α_{1, 2}$	$- 0.3$	$- 0.38$	0.87	$- 0.31$	0.97
$α_{2, 2}$	0.4	0.52	0.90	0.30	0.93

TABLE 3

Mis-specification of the interaction radii as either an under-specification $R_{-}^{S}$ or an over-specification $R_{+}^{S}$

		Under-specification		Over-specification
Parameter	True value	Mean	Coverage prob.	Mean	Coverage prob.
$β_{1, 0}$	4.0	4.16	0.80	3.77	0.92
$β_{2, 0}$	3.5	3.41	0.90	3.51	0.97
$β_{1, 1}$	1.5	1.64	0.93	1.64	0.93
$β_{2, 1}$	2.0	2.35	0.87	2.06	0.93
$α_{1, 1}$	0.4	0.38	0.92	0.44	0.97
$α_{1, 2}$	$- 0.3$	$- 0.38$	0.87	$- 0.31$	0.97
$α_{2, 2}$	0.4	0.52	0.90	0.30	0.93

		Under-specification		Over-specification
Parameter	True value	Mean	Coverage prob.	Mean	Coverage prob.
$β_{1, 0}$	4.0	4.16	0.80	3.77	0.92
$β_{2, 0}$	3.5	3.41	0.90	3.51	0.97
$β_{1, 1}$	1.5	1.64	0.93	1.64	0.93
$β_{2, 1}$	2.0	2.35	0.87	2.06	0.93
$α_{1, 1}$	0.4	0.38	0.92	0.44	0.97
$α_{1, 2}$	$- 0.3$	$- 0.38$	0.87	$- 0.31$	0.97
$α_{2, 2}$	0.4	0.52	0.90	0.30	0.93

The results related to the mis-specification of the saturation parameter $N$ are in Table 4. A few things stand out in this analysis. First, the $β$ parameters (which relate to the abundance) are well estimated even when the saturation parameter is mis-specified. Indeed, the mean estimated values of $β_{1, 0}$ ⁠, $β_{2, 0}$ ⁠, $β_{1, 1}$ ⁠, $β_{2, 1}$ ⁠, $β_{1, 2}$ and $β_{2, 2}$ are very close to the true values, and the associated coverage probabilities are of the right magnitude. Second, some interaction coefficients have very bad coverage probabilities, but broadly speaking their signs and magnitude are properly recovered by the estimation procedure. Third, when the saturation parameter is under-specified, the corresponding interaction coefficients are larger in magnitude, while when it is over-specified the interaction coefficients are smaller. Heuristically, this is due to the fact that when the saturation parameter is under-specified, there are less interactions accounted for in the sum of short-range interactions (2), and consequently the corresponding interaction coefficient that multiplies the sum ought to be larger.

TABLE 4

Mis-specification of the saturation parameter as either $N_{-}$ or $N_{+}$

		Under-specification		Over-specification
Parameter	True value	Mean	Coverage prob.	Mean	Coverage prob.
$β_{1, 0}$	4.0	4.06	0.88	4.03	0.91
$β_{2, 0}$	3.5	3.51	0.94	3.41	0.91
$β_{1, 1}$	1.5	1.69	0.89	1.53	0.96
$β_{2, 1}$	2.0	2.19	0.90	2.10	0.94
$α_{1, 1}$	0.4	0.52	0.91	0.22	0.48
$α_{1, 2}$	$- 0.3$	$- 0.54$	0.62	$- 0.19$	0.53
$α_{2, 2}$	0.4	0.35	0.96	0.28	0.90

		Under-specification		Over-specification
Parameter	True value	Mean	Coverage prob.	Mean	Coverage prob.
$β_{1, 0}$	4.0	4.06	0.88	4.03	0.91
$β_{2, 0}$	3.5	3.51	0.94	3.41	0.91
$β_{1, 1}$	1.5	1.69	0.89	1.53	0.96
$β_{2, 1}$	2.0	2.19	0.90	2.10	0.94
$α_{1, 1}$	0.4	0.52	0.91	0.22	0.48
$α_{1, 2}$	$- 0.3$	$- 0.54$	0.62	$- 0.19$	0.53
$α_{2, 2}$	0.4	0.35	0.96	0.28	0.90

TABLE 4

Mis-specification of the saturation parameter as either $N_{-}$ or $N_{+}$

		Under-specification		Over-specification
Parameter	True value	Mean	Coverage prob.	Mean	Coverage prob.
$β_{1, 0}$	4.0	4.06	0.88	4.03	0.91
$β_{2, 0}$	3.5	3.51	0.94	3.41	0.91
$β_{1, 1}$	1.5	1.69	0.89	1.53	0.96
$β_{2, 1}$	2.0	2.19	0.90	2.10	0.94
$α_{1, 1}$	0.4	0.52	0.91	0.22	0.48
$α_{1, 2}$	$- 0.3$	$- 0.54$	0.62	$- 0.19$	0.53
$α_{2, 2}$	0.4	0.35	0.96	0.28	0.90

		Under-specification		Over-specification
Parameter	True value	Mean	Coverage prob.	Mean	Coverage prob.
$β_{1, 0}$	4.0	4.06	0.88	4.03	0.91
$β_{2, 0}$	3.5	3.51	0.94	3.41	0.91
$β_{1, 1}$	1.5	1.69	0.89	1.53	0.96
$β_{2, 1}$	2.0	2.19	0.90	2.10	0.94
$α_{1, 1}$	0.4	0.52	0.91	0.22	0.48
$α_{1, 2}$	$- 0.3$	$- 0.54$	0.62	$- 0.19$	0.53
$α_{2, 2}$	0.4	0.35	0.96	0.28	0.90

5.3 Inference of the interaction radii

In this paragraph, we assume that the true interaction radii are unknown, and we study how well the model is able to recover them using our proposed method from Section 3.3. We do not choose the same parameters as in the previous Section 5.2 since, as observed there, the model is not very sensitive to the actual value of the interaction radius. Instead, we purposely choose strong interaction coefficient values to allow our fitting procedure to recover the true values of the interaction radii.

We choose an observation window $W = {[0, 1]}^{2}$ ⁠, with $p = 2$ species, no marks, and whose distribution is driven by a single geospatial covariate $X_{1} (x, y) = x - 0.5$ ⁠. We assume that all interactions occur at a distance of 0.05 and in addition we assume that the interactions at those ranges are quite strong, so that the calibration procedure is able to pick them up. To be explicit, the rest of the parameters are given by $β_{0}^{T} = (6.5, 2.6)$ ⁠, $β_{1}^{T} = (- 1, 1)$ ⁠, and

α = (\begin{matrix} - 1 & - 0.5 \\ - 0.5 & 2 \end{matrix}), γ = (\begin{matrix} 0 & 0 \\ 0 & 0 \end{matrix}) .

We choose $N = 2$ for the saturation parameter and take as the short-range potential the exponential function from Table 1. A typical sample is shown on the right of Figure 2.

We sampled 1000 independent draws of this point process. Although the assumption (H) from Section 4.1 is satisfied, these draws are sampled with 1,000,000 steps of the Metropolis–Hastings algorithm which is quicker for such extreme values of the interaction coefficients. For each draw of the point process, we find the optimal short-range interaction coefficient by maximising the pseudo-likelihood. We find in Figure 3 that for around 4% of samples, the pseudo-likelihood is actually maximised by choosing the largest possible interaction radius. When removing these outliers, the mean estimated short-range interaction radius is found to be 0.06. If instead we keep these samples, then the mean estimate significantly overestimates the true interaction radius, and the median actually works best.

FIGURE 3

Optimal short-range interaction radius for each draw, obtained by pseudo-likelihood maximization. The maximization was done on a discrete grid between 0.0025 and 0.5. The true value of the interaction radius is shown in red, the median estimate is in blue, and the average estimate (including the values hitting the hard limit at 0.5) is drawn in yellow. [Colour figure can be viewed at https://dbpia.nl.go.kr]

In order to explore how well our method is actually performing, we also searched for the interaction radius which maximises the average pseudo-likelihood over all draws. Although not practical, since this method requires the observation of multiple replications of the point process, we show in Figure 4 that this technique properly recovers the true value of the short-range interaction radius.

FIGURE 4

Pseudo-log-likelihood averaged over all samples, for a given value of the short-range interaction coefficient. The value that maximizes the average log-likelihood is found to be the true value of the interaction radius, $R^{S}$ =0.05, shown here in red. [Colour figure can be viewed at https://dbpia.nl.go.kr]

Although we have reported here the results of a study with quite extreme values of the interaction coefficients, our reported findings are representative of a range of other tested values. In running the simulation with other interaction coefficients, we find that the main change is in the proportion of samples for which the method does not properly converge. We found this proportion to vary between $4 %$ and $30 %$ ⁠. We gather from this experiment that the method introduced in Section 3.3 works reasonably well to estimate unknown interaction radii, except in certain cases where the pseudo-likelihood maximising radius appears to be infinite. In conclusion, we caution the reader to not put much confidence in estimated values of the interaction radii hitting the hard upper-bound, especially when the corresponding interaction coefficient is not statistically significant.

6 REAL APPLICATIONS

In this section, we consider three different case studies from plant ecology. In each case we give examples of ecological insights derived from our model. All three data sets consist of the locations of trees, however, differing in their biome, plot size, density of individuals and number of species.

6.1 Norway spruces

In this subsection we consider the locations of 134 Norway spruce trees in a natural forest stand in Saxonia, Germany. The original source of the data is unknown, but it has been widely studied in the point process literature, see for example section 4 of Fiksel (1988) and example 2 in Goulard et al. (1996). The diameter at breast height in metres has been recorded for each individual tree in the dataset, and will serve as our marks. There are no associated environmental covariates, and instead the dataset is often used as an example of a regular marked point process, with interaction distances thought to be proportional to marks. What we call interaction radii are sometimes described in the literature on this dataset as ‘influence zones’ (Goulard et al., 1996), ‘hard-core’ and “interaction” radii (Penttinen et al., 1992). Various estimates of these values have been derived in previous analyses and one of our aims shall be to compare our results to the literature. In Figure 5, we show the locations of the spruces along with discs proportional to their diameters.

FIGURE 5

Norway spruces with marks representing their diameter at breast height. The background colour gradient is the fitted log-Papangelou conditional intensity. [Colour figure can be viewed at https://dbpia.nl.go.kr]

6.1.1 Results

Following Goulard et al. (1996), we assume that interactions take place at distances proportional to the marks, and so we choose (3) and (5), which in words assumes that individual to individual interactions are proportional to the average marks of the two individuals. In order to estimate the interaction radii, potential function shapes and saturation parameter, we deployed the multi-dimensional maximisation outlined in Section 3.3, using the pseudo-likelihood of the logistic regression as the objective function. Our only constraint is restricting the saturation parameter to the range of values ${1, 2, 4, 6}$ ⁠; however, we found that the fit was not significantly influenced by these values. The results of our model are summarised in Table 5.

TABLE 5

Norway spruce dataset results

	Parameter	Estimate	95% CI
Intercept	$β_{0}$	$- 1.88$	$(- 2.57, - 1.19)$
Short-range coefficient	$α$	$- 5.18$	$(- 6.92, - 3.43)$
Medium-range coefficient	$γ$	0.14	$(0.05, 0.23)$
Short-range radius	$R^{S}$	2.41	$∖$
Medium-range radius	$R^{M}$	16.40	$∖$
Long-range radius	$R^{L}$	24.43	$∖$
Short-range shape	$φ_{R^{S}}$	Exponential	$∖$
Medium-range shape	$ψ_{R^{M} \leftrightarrow R^{L}}$	Geyer	$∖$
Saturation	$N$	6	$∖$

	Parameter	Estimate	95% CI
Intercept	$β_{0}$	$- 1.88$	$(- 2.57, - 1.19)$
Short-range coefficient	$α$	$- 5.18$	$(- 6.92, - 3.43)$
Medium-range coefficient	$γ$	0.14	$(0.05, 0.23)$
Short-range radius	$R^{S}$	2.41	$∖$
Medium-range radius	$R^{M}$	16.40	$∖$
Long-range radius	$R^{L}$	24.43	$∖$
Short-range shape	$φ_{R^{S}}$	Exponential	$∖$
Medium-range shape	$ψ_{R^{M} \leftrightarrow R^{L}}$	Geyer	$∖$
Saturation	$N$	6	$∖$

Notes: We do not give the 95% confidence intervals for the parameters fitted by the ad hoc pseudo-likelihood maximisation. The other confidence intervals are produced by the method outlined in Section 3.2.

TABLE 5

Norway spruce dataset results

	Parameter	Estimate	95% CI
Intercept	$β_{0}$	$- 1.88$	$(- 2.57, - 1.19)$
Short-range coefficient	$α$	$- 5.18$	$(- 6.92, - 3.43)$
Medium-range coefficient	$γ$	0.14	$(0.05, 0.23)$
Short-range radius	$R^{S}$	2.41	$∖$
Medium-range radius	$R^{M}$	16.40	$∖$
Long-range radius	$R^{L}$	24.43	$∖$
Short-range shape	$φ_{R^{S}}$	Exponential	$∖$
Medium-range shape	$ψ_{R^{M} \leftrightarrow R^{L}}$	Geyer	$∖$
Saturation	$N$	6	$∖$

	Parameter	Estimate	95% CI
Intercept	$β_{0}$	$- 1.88$	$(- 2.57, - 1.19)$
Short-range coefficient	$α$	$- 5.18$	$(- 6.92, - 3.43)$
Medium-range coefficient	$γ$	0.14	$(0.05, 0.23)$
Short-range radius	$R^{S}$	2.41	$∖$
Medium-range radius	$R^{M}$	16.40	$∖$
Long-range radius	$R^{L}$	24.43	$∖$
Short-range shape	$φ_{R^{S}}$	Exponential	$∖$
Medium-range shape	$ψ_{R^{M} \leftrightarrow R^{L}}$	Geyer	$∖$
Saturation	$N$	6	$∖$

Recall that the radii in Table 5 are given as a proportion of the marks, so that for example two individuals of size $0.2 m$ interact on the short-range at a distance of $0.2 R^{S} = 0.482 m$ ⁠. Our fitted estimates are broadly in line with what other researchers have estimated or a prior fixed in the relevant literature, see Fiksel (1988), Penttinen et al. (1992) and Goulard et al. (1996). Indeed, as others have observed, there are strong negative short-range interactions between the locations of the spruces. In addition, the authors of Penttinen et al. (1992) choose a ‘hard-core radius’ of $1 m$ ⁠, where our short-range interaction radius amounts to $0.6 m$ on average (calculated as $R^{S}$ times the average tree diameter of $25 cm$ ⁠). We find medium-range interactions that occur at an average distance of $5.1 m$ (calculated as the mean of $R^{M}$ and $R^{L}$ times the average tree diameter), which is analogous to the quantity Penttinen et al. (1992) call an ‘interaction radius’ and set to $3.5 m$ ⁠. The authors in Goulard et al. (1996) choose an influence zone of five times the diameter, which again is comparable to our fitted short-range interaction radius. The best short-range potential function is found to be the exponential, which is notably the shape chosen for interactions in the pairwise Gibbs point process used in Penttinen et al. (1992).

We have also gone further than some of the existing models. To the best of our knowledge, other models do not capture the statistically significant medium-range positive interactions in the dataset, occurring between 16 and 24 times the diameter at breast height. This property of the point pattern might be caused by a mixture of pollination and seed dispersal. These ecological mechanisms would tend to increase the likelihood of finding individuals surrounded by others at these medium ranges.

6.2 South Carolina Savannah river site

In this subsection, we study the locations of 734 individual trees in a $200 m \times 50 m$ plot in the Savannah river site, South Carolina, USA. Seven different plots were originally set up by Bill Good, and a first analysis of their spatial patterns was conducted in Good and Whipple (1982), see also the subsequent analyses in Jones et al. (1994) and Dixon (2002). We focus on one of the plots from the original experiment shown in Figure 6. The data set can be obtained using the R language (R Core Team, 2019) as ecespa::swamp from the ecespa package available on CRAN.

FIGURE 6

South Carolina Savannah river site [Colour figure can be viewed at https://dbpia.nl.go.kr]

There are no known environmental covariates related to this data set; however, the (unmeasured) water level is thought to be an important driver of the spatial distribution. Contrary to Section 6.1 and to simplify the analysis, we assume that the saturation parameter $N$ is equal to 2, that the short-range interaction potential is the square exponential from Table 1, and finally we assume that there are no medium-range interactions. We also let the interaction radii be on a discrete grid, with grid size $1 m$ ⁠, and constrain them to be less than $20 m$ ⁠.

6.2.1 Fitting of the interaction radii

In order to estimate the different interaction radii, we follow the procedure outlined in Section 3.3 and implemented in Section 6.1. We find that the fitted short range interaction distances $R^{S}$ in metres are given by

\begin{array}{r} Carolina ash: \\ Swamp tupelo: \\ Water tupelo: \\ Other species: \\ Bald cypress: \end{array} (\begin{matrix} 1 & 20 & 1 & 5 & 20 \\ 20 & 3 & 1 & 10 & 9 \\ 1 & 1 & 5 & 20 & 6 \\ 5 & 10 & 20 & 1 & 20 \\ 20 & 9 & 6 & 20 & 1 \end{matrix}),

where entry $i, j$ of the matrix above corresponds to $R_{i, j}^{S}$ ⁠, the short-range interaction distance between species $i$ and $j$ ⁠. We have put in bold values of the interaction distances which are later found to be associated with significant interactions, and greyed out values which are found not to be. Since their corresponding interactions are weak, greyed out values carry weak statistical weight. In addition, values of the interaction radius attaining our hard upper-bound of $20 m$ should not be taken at face value given our findings in Section 5.3.

We observe that the short-range interaction radii $R_{i, i}^{S}$ within each of the species has a mean of around $2 m$ while the interaction radii $R_{i, j}^{S}$ between species are on average five times larger. Thus, the intra-species and inter-species short-range interaction radii appear to relate to different underlying ecological processes. The intra-species interaction radii $R_{i, i}^{S}$ might be related to the seed dispersal distance and the range within which individuals (of the same species) compete for resources. The inter-species interaction radii $R_{i, j}^{S}$ could be due to unmeasured environmental variation and/or be the range within which individuals (of different species) compete for resources.

6.2.2 Results

The fitted values for the matrix of short-range interaction coefficients $α$ are presented in Figure 7. The results support the hypothesis of strong clustering within each species, with the exception of the bald cypress in which we observe mild repulsion, although the parameter estimate is not statistically significant. Similar results were already obtained in Dixon (2002), where it was written that the particular status of the cypress “may be due to logging … or it may represent some other difference between cypress and the other tree species”.

FIGURE 7

On the left-hand side, short range interaction coefficients within each of the species $α_{i, i}$ ⁠. On the right-hand side, short range interaction coefficients between each of the species $α_{i, j}$ ⁠. We provide estimates along with the corresponding $95 %$ confidence intervals. [Colour figure can be viewed at https://dbpia.nl.go.kr]

The estimates of the pairwise short-range interaction radii are all negative and all but two of the 95% confidence intervals do not overlap with zero. However, we recall that we have used a two-step procedure in which the interaction radii were specifically chosen to maximise the pseudo-likelihood, and in addition we have not made any correction for the multiple testing problem. Hence, we should be cautious in interpreting the confidence intervals. Broadly speaking however, there is evidence of competition rather than facilitation between species. We note in particular that many of the strongest repulsive associations involve the swamp tupelo. These results also corroborate what was observed in the existing literature on this dataset, see in particular Dixon (2002). However, the technique introduced in Dixon (2002) did not find most of the inter-species interactions to be statistically significant, perhaps due to the fact that heterogeneity in the interaction radii could not be accounted for.

6.3 Barro Colorado Island

Fully mapped out forest plots are a rare occurrence in ecology. These are, however, crucial in understanding the relative importance of dispersal limitation, biotic interactions and habitat filtering in explaining species' distributions. Many seminal studies of spatial distributions within forest plots have been unable to account for inter-species associations (Condit et al., 2000; John et al., 2007; Shen et al., 2013; Wiegand et al., 2007) and when they have it is via an analysis of pair correlation functions (Deyi et al., 2020; Uriarte et al., 2004). By contrast, our model allows us to conduct the analysis within a fully integrated model-based framework.

In this section, we study the $1000 m \times 500 m$ tropical moist forest plot at Barro Colorado Island, Panama. All woody trees and shrubs whose stems have a diameter of at least $1 cm$ have been censused in multiple years (see Condit, 1998; Condit et al., 1999; Hubbell et al., 2012 for more details). Regarding the analysis of the Barro Colorado Island data set specifically, attempts at analysing ecological drivers of multi-species distributions within a unified framework have been scarce, and we shall mostly compare our results to Rajala et al. (2018, section 5) and Waagepetersen et al. (2016, section 6.2) which are the most extensive studies to date.

A wide range of environmental covariates are available for the Barro Colorado Island dataset, for example information about the soil type, elevation, etc. We settled upon six ecologically relevant covariates, namely slope and elevation, solar irradiance, soil pH and phosphorus content, and finally the soil moisture in the mid dry season in a non-drought year from Kupers et al. (2019). Rajala et al. (2018) chose instead six covariates from principal component analysis, which can be difficult to interpret, while Waagepetersen et al. (2016) settled on eleven different covariates including the first five of ours. We remark that our method scales well with the number of environmental covariates, and the reason for restricting our attention to only six of them is simply ease of presentation.

There are around 300 different species and hundreds of thousands of individual trees in the Barro Colorado Island data set, and consequently various techniques have been used to reduce the numerical complexity. The authors in Waagepetersen et al. (2016) restrict their attention to nine seemingly arbitrarily chosen species with intermediate abundance. In Rajala et al. (2018) instead, the authors exclude species for which they do not have an estimate of ‘reproducible size’, which is used as a proxy for the size at which individuals reach reproductive maturity. Then for each species, the authors remove individual trees below the reproducible size threshold, and finally exclude species with less than 50 remaining individuals.

In order to restrict our analysis to that of adult trees which are thought to have a more regular distribution, following Rajala et al. (2018) we remove immature individuals from the data set. Immature individuals were removed based on their size, with estimates of size at reproductive maturity available as a supplement to Flügge et al. (2014). While Rajala et al. (2018) exclude from their analysis the species for which the size at reproductive maturity is not available, we do not since excluding entire species from the data set might lead to missed ecological interactions. Instead, we find that reproductive maturity is well explained by a regression $Y \sim a S^{b}$ ⁠, where $S$ is the maximum diameter of the species and $Y$ is the size at reproductive maturity. This leads us to exclude individuals that are below the reproductive size for their species, or if that trait is not available, below the extrapolated size at reproductive maturity inferred from their maximum diameter. Compared to Rajala et al. (2018), this retains more species. Finally, we group species with less than 70 individuals into a separate category which shall still play a role in the interactions accounted for by the model. After this procedure, we end up with 82 different species comprising around 45,000 individual trees. This constitutes a few thousand more individuals and nine times more species than Waagepetersen et al. (2016); 50% more individuals and roughly the same number of species as Rajala et al. (2018).

We fix the saturation parameter $N$ to 2 and let the shape of the potential functions be the square bump and normal, respectively. We choose $10 m$ as the short-range interaction radius and search for residual medium-range interactions between 20 and $40 m$ ⁠. These values are in line with the results of neighbourhood-dependent growth models, see table 4 in Uriarte et al. (2004). We implemented a Lasso regularisation of the logistic regression of Section 3.1 in order to facilitate the analysis of the many potential resulting interactions. The theoretical justification for using regularisation on the composite likelihood is provided in Daniel et al. (2018), see also Ba and Coeurjolly (2020) for the asymptotic properties of the regularised estimator in our setting. We chose as the regularisation parameter the one that minimises AIC.

6.3.1 Results

We start by presenting in Figure 8 the intra-species interactions coefficients. We broadly observe that most species are clustered, with a few exhibiting very significant clumping. Notably, our three most clustered species are Anaxagorea panamensis, Bactris major and Rinorea sylvatica which were highlighted in Seri et al. (2015) as ‘exceptional species’ in terms of their clustering. In addition, in part due to the removal of immature trees, we find some species which have negative or null intra-species interactions, leading to regular distributions. In Figure 12 we show in more detail the spatial distribution of four such species. Protium panamense is an instructive example that exhibits strong intra-species short-range negative interactions and almost no medium-range interactions. This species was analysed in Waagepetersen et al. (2016) without removing immature trees. Analysing the configuration of mature trees in their framework would be more challenging since the Cox process in their model is restricted to positive associations between individuals and therefore cannot properly account for these negative intra-species interactions.

FIGURE 8

On the left-hand side, short range interaction coefficients within each of the species $α_{i, i}$ ⁠. On the right-hand side, medium-range interaction coefficients between each of the species $γ_{i, i}$ ⁠. The estimates were obtained by averaging out the results of 10 logistic regressions, each with a different binomial draw of the dummy points $D$ ⁠. The error bars represent the variation among these draws. [Colour figure can be viewed at https://dbpia.nl.go.kr]

In Figure 9 we show the inter-species interaction coefficients. We find that our model has properly disentangled two different kinds of associations. First, on the short range, species are generally negatively associated with one another, which is a strong marker of competition for resources. Second, on the medium range, we see substantially more positive associations, possibly indicating some dependency on unmeasured environmental covariates. Others in the literature (Rajala et al., 2018; Waagepetersen et al., 2016) have not been able to disentangle these numerous short-range negative interactions from associations at broader scales. We find that some of the species pairs studied in Waagepetersen et al. (2016) are negatively associated, for example Swartzia simplex with most other species, or Hirtella triandra with Garcinia intermedia. These negative associations were not picked up by Waagepetersen et al. (2016) while they were corroborated by our analysis of Ripley's cross $K$ -function (not shown here). Indeed, all significant interactions in Waagepetersen et al. (2016) were found to be positive. We were unable to compare our results with those of Rajala et al. (2018) more closely since they did not report the species' label in their figures.

FIGURE 9

On the left-hand side, the 40 largest short-range interaction coefficients between the species $α_{i, j}$ ⁠. On the right-hand side, the 40 largest medium range interaction coefficients between each of the species $γ_{i, j}$ ⁠. The coefficients shown in blue are negative, so that the corresponding interactions are repulsive, while those in red are positive, meaning the interactions are attractive. In both panels, the thickness of the cord is proportional to the strength of the interaction. [Colour figure can be viewed at https://dbpia.nl.go.kr]

Ecological processes such as dispersal and competition are expected to display distinct spatial signatures (Seabloom et al., 2005). We hypothesise that the outputs of the model presented here partly result from these ecological processes. Our model has disentangled associations on different scales, providing a basis for dissecting the underlying ecological processes.

In terms of ecological insights, in Figure 10 we show that species with a smaller maximum diameter at breast height tend to be more clustered, with the relationship being statistically significant (⁠ $p = 0.000214$ significance according to a Wald test). This is a well-known feature of the Barro Colorado Island data set that our model has successfully picked up, see for example Condit et al. (2000). We also found that larger species on average have more negative associations with other species, reflecting size-dependent competitive pressure (⁠ $p < 2 \cdot 1 0^{- 16}$ ⁠, Wald test, plot not shown here).

FIGURE 10

Mean of the intra-species interaction coefficient for each species (obtained as the average of $α_{i, i}$ and $γ_{i, i}$ ⁠) as a function of the species' maximum diameter at breast height. The fit shown on the figure is a GAM fit with basis dimension 3, along with its $95 %$ confidence bands. [Colour figure can be viewed at https://dbpia.nl.go.kr]

6.3.2 Model assessment

We shall show next that our model satisfies the following compelling criteria:

(i)
for a given species, conditioning on other species and accounting for the corresponding interactions yields a conditional occurrence probability estimate which captures the inhomogeneity in the point pattern well;
(ii)
the intra-species interaction coefficients indicate clustering or regularity in each of the species' spatial distribution;
(iii)
the inter-species interaction coefficients shown in Figure 9 capture actual associations between species in the data set.

(i) Species-specific intensity

We begin by showing that our model correctly captures the underlying spatial inhomogeneity. Consider a species $i$ ⁠, and the configuration $ω_{- i}$ in which we remove all individuals of species $i$ ⁠. Recall that the Papangelou conditional intensity $π ((x, i, m), ω_{- i})$ is interpreted as the probability of finding an individual of species $i$ around $x$ and with mark around $m$ ⁠, conditional on individuals of other species. We expect individuals of species $i$ to be found at locations where this Papangelou conditional intensity takes large values. We would like to assess how well the Papangelou conditional intensity $π ((x, i, m), ω_{- i})$ is able to separate the region into high and low density of individuals of species $i$ ⁠. For that purpose, we compute the Area Under the ROC Curve (AUC), compare with Nam and D'Agostino (2002). In the point process framework, the AUC is computed by discretising the study area, and choosing as events the presence or absence of an individual in a cell (see, e.g., Lombardo et al., 2018). More precisely, in our context, for a (conditional) intensity $λ$ the AUC is defined (see section 6.7.3 in Baddeley et al., 2015) as

ℙ (λ (U) < λ (X)),

where $X$ is a uniformly chosen point of the point process (in our case, of species $i$ ⁠) and $U$ is a continuous random variable uniformly distributed over the study region. The AUC measures the ability of the intensity to properly separate the region into areas of high and low density of individuals, with a value of 0.5 indicating a lack of discriminatory power. In our analysis, we have discretised the study region into $1 m \times 1 m$ cells, computed $λ$ at each cell and at the location of each individual of species $i$ to produce an estimate of the AUC. We have in practice used the auc.ppp function in spatstat Baddeley et al. (2015).

More precisely, we proceed as follows. First, we fit each species separately according to a Poisson point process driven by the same six environmental covariates used in our case study, and produce a maximum likelihood intensity estimate. Second, for each species, we compute the Papangelou conditional intensity of our fitted Gibbs point process, conditional on other species (as described in the previous paragraph), over the whole area. We then compute the AUC in both cases. We show in Figure 11 the resulting performance gain in terms of AUC species by species. The saturated pairwise interactions Gibbs point process attains an average AUC of 0.76 by conditioning on other species, compared to an average of 0.65 for the standard Poisson point process. We find that the AUC of most species is improved. This shows that inter-species interactions are important in shaping the species' conditional distributions. We acknowledge that part of this improvement is due to our model having more parameters; our main point here is that the model is indeed capturing associations between species and capitalising on these to improve the conditional intensity estimates.

FIGURE 11

Conditional Area Under the ROC Curve (AUC) improvement species by species, when going from an inhomogeneous Poisson point process to the saturated pairwise interaction Gibbs point process. Each blue point corresponds to one species. Points in the top-left quadrant indicate species for which our model produces a better AUC than that of an inhomogeneous Poisson point process model. The average AUC improvement is 0.11 and our model has improved the conditional AUC for 83% of species. [Colour figure can be viewed at https://dbpia.nl.go.kr]

In order to illustrate how well the Papangelou conditional intensity resembles the actual spatial distribution, let us take a closer look at the four species which were found to exhibit most intra-species short-range repulsion, namely Protium panamense (‘protpa’), Prioria copaifera (‘pri2co’), Apeiba membranacea (‘apeime’) and Hura crepitans (‘huracr’). We show in Figure 12 the Papangelou conditional intensity computed at each of the species, conditional on other species. We see clearly that for these species, our model has properly separated the region into locations where the species occurs and others where it does not. The rather large corresponding AUC values for these species ranging from 0.76 to 0.90 corroborate this result.

FIGURE 12

Log-Papangelou conditional intensities of the four most repulsive species in our model, conditional on all other species, see the text for details on how this quantity is defined. Our model has captured most of the spatial inhomogeneity and its conditional intensity has properly separated the area into areas of high and low density of individuals. This is well quantified by the area under the ROC curve (AUC) metric which is quite high for these species (⁠ ${AUC}_{protpa} = 0.76$ ⁠, ${AUC}_{pri2co} = 0.90$ ⁠, ${AUC}_{apeime} = 0.81$ ⁠, ${AUC}_{huracr} = 0.90$ ⁠). [Colour figure can be viewed at https://dbpia.nl.go.kr]

(ii) Intra-species clustering

We now show that the intra-species clustering or regularity is partly captured by the intra-species interaction coefficients. We characterise intra-species clustering in terms of the inhomogeneous $L$ -function defined, for example, on p. 32 of Møller and Waagepetersen (2004). In general, for any two species $i$ and $j$ and a distance $R > 0$ ⁠, we define

\overline{L_{i, j}} : = [\frac{1}{R} \int_{0}^{R} (L_{i, j} (r) - r) d r] / [\frac{1}{R} \int_{0}^{R} r d r] = \frac{2}{R^{2}} \int_{0}^{R} L_{i, j} (r) d r - 1,

(9)

as a measure of the association between individuals of species $i$ and $j$ within a distance $R$ of each other. In the equation above, $L_{i, j} (r)$ is the cross inhomogeneous $L$ -function defined on p. 49 of Møller and Waagepetersen (2004), and which generalises the usual inhomogeneous $L$ -function.

In order to evaluate the degree of clustering in each species, we shall perform a hypothesis test (see chapter 10 in Baddeley et al., 2015). Let our null hypothesis be that a given species $i$ is an inhomogeneous Poisson point process, conditionally on all other species. By Proposition 1 in Appendix S1, this is for example the case if $α_{i, i} = γ_{i, i} = 0$ and the saturation parameter $N$ is sufficiently large. Under these hypotheses, the conditional point process is second-order intensity reweighted stationary (SOIRS), see Møller and Waagepetersen (2004, definition 4.5), and so the standard definition of $L_{i, i}$ makes sense. In particular, in this case $\overline{L_{i, i}}$ is expected to be zero. Again by Proposition 1 in Appendix S1, the intensity of the conditional point process is proportional to $π ((x, i, m), ξ_{- i})$ ⁠, where $ξ_{- i}$ is the point process consisting in individuals of species other than $i$ ⁠. The statistic $L_{i, i}$ could be estimated by normalising the standard estimator by the fitted Papangelou conditional intensity, but we choose instead to rely on the leave-one-out kernel smoother derived in section 2.2 of Baddeley et al. (2000). If the corresponding empirical $L$ -function is outside the simulation envelopes obtained by draws of an inhomogeneous Poisson point process with intensity the standard leave-one-out kernel estimate of the species, then we have grounds to reject the null hypothesis. When the null hypothesis does not hold, strictly speaking, the previous definition of $L_{i, i}$ and of its estimator do not make sense because first, $π ((x, i, m), ξ_{- i})$ can not be viewed as proportional to the intensity and second, even if this were the case, the SOIRS assumption is not met. However, we can expect that under the alternative the estimator of $L_{i, i}$ diverges from the expected value under the null with the same interpretation as under the SOIRS assumption. More precisely, values above $r \mapsto π r^{2}$ of $L_{i, i}$ ⁠, and thus positive values of $\overline{L_{i, i}}$ ⁠, indicate more species-specific clustering than if the species were conditionally an inhomogeneous Poisson point process. Negative values of $\overline{L_{i, i}}$ instead indicate more regularity.

We find in Figure 13 that $86 %$ of species which were above the envelopes (i.e., indicating that the species is significantly more clustered than would be expected if it were conditionally Poisson distributed) were also found to have positive short-range interaction coefficients. Both species which were below the envelopes were also found to have negative short-range interaction coefficients. In addition, we find that the intra-species short-range interaction coefficients $α_{i, i}$ are positively correlated with $\overline{L_{i, i}}$ ⁠, with Pearson coefficient 0.71, and show in Figure 13 a scatter plot of all 82 species. Overall, species which are more clustered than would be expected if they were conditionally Poisson distributed tend to have positive short-range intra-species interaction coefficients, and conversely species which are more regular tend to have negative coefficients. This can also be seen visually in Figure 12, where we show that the four most repulsive species–with their estimated intensity shown in the background–tend to have a more regular distribution than that of a (conditional) inhomogeneous Poisson point process.

Scatter plot of the intra-species short-range interaction coefficients αi,i in terms of Li,i‾, for r ranging from 0 to 20 m. We have superimposed the results of a linear regression along with its 95% confidence bands (slope 1.04). The envelopes are computed with 400 draws of an inhomogeneous Poisson point process. [Colour figure can be viewed at https://dbpia.nl.go.kr]

FIGURE 13

Scatter plot of the intra-species short-range interaction coefficients $α_{i, i}$ in terms of $\overline{L_{i, i}}$ ⁠, for $r$ ranging from 0 to $20 m$ ⁠. We have superimposed the results of a linear regression along with its $95 %$ confidence bands (slope 1.04). The envelopes are computed with 400 draws of an inhomogeneous Poisson point process. [Colour figure can be viewed at https://dbpia.nl.go.kr]

(iii) Inter-species clustering

We characterise inter-species associations in terms of the inhomogeneous cross $L$ -function $L_{i, j} (r)$ described above. We still use definition (9) to analyse inter-species interactions. Assume as the null hypothesis that for two species $i$ and $j$ we have $α_{i, j} = γ_{i, j} = 0$ ⁠. By Proposition 2 in Appendix S1, the two species are independent conditionally on other species. By proposition 4.4 in Møller and Waagepetersen (2004), under these hypotheses, the conditional point process formed of the two species is cross SOIRS (see definition 4.8 in Møller & Waagepetersen, 2004). In this case, the definition of $L_{i, j}$ makes sense and $\overline{L_{i, j}}$ is equal to zero. As in (ii) above, strictly speaking, under the alternative hypothesis the definition of $L_{i, j}$ and its estimator do not make sense. However, we can again expect that values of $L_{i, j} (r)$ above their expectation under the null point to species positively associated, and conversely values below their expectation under the null indicate negatively associated species. Therefore, negative values of $\overline{L_{i, j}}$ correspond to repulsion and positive values correspond to positive associations (at least for small values of $r$ ⁠, see p. 49 of Møller & Waagepetersen, 2004), and so this quantity serves as a good indicator of spatial associations between species.

Heuristically, then, $\overline{L_{i, j}}$ represents the average relative distance to the theoretical cross $L$ -function if the two species were independent conditionally on other species. So, for example $\overline{L_{i, j}} = - 0.5$ indicates that the cross $L$ -function is on average $50 %$ less than if the two species were independent. Envelopes are not as straightforward to produce as in the intra-species setting (ii) above, though. Indeed, the null hypothesis in this case is that species $i$ and $j$ are independent point processes, but they need not be Poissonian. And indeed, in general they are not even saturated pairwise interaction Gibbs point process, and their simulation (conditional on other species containing tens of thousands of individuals) is very computationally demanding. Therefore, in Figure 14 we restrict ourselves to the 28 species which were not found in (ii) to depart from the conditional inhomogeneous Poisson hypothesis.

Scatter plot of the inter-species short-range interaction coefficients αi,j in terms of Li,j‾, for r ranging from 0 to 20 m. We have superimposed the results of a linear regression along with its 95% confidence bands (slope 0.34). The envelopes were computed with 400 draws of two independent inhomogeneous Poisson point processes. [Colour figure can be viewed at https://dbpia.nl.go.kr]

FIGURE 14

Scatter plot of the inter-species short-range interaction coefficients $α_{i, j}$ in terms of $\overline{L_{i, j}}$ ⁠, for $r$ ranging from 0 to $20 m$ ⁠. We have superimposed the results of a linear regression along with its $95 %$ confidence bands (slope 0.34). The envelopes were computed with 400 draws of two independent inhomogeneous Poisson point processes. [Colour figure can be viewed at https://dbpia.nl.go.kr]

We find in Figure 14 that $67 %$ of species which were above the envelopes (indicating that the two species are found closer than would be expected if they were independent) were also found to have positive short-range inter-species interaction coefficients. In addition, $93 %$ of species which were below the envelopes were found to have negative short-range inter-species interaction coefficients. We also show in Figure 14 a scatter plot of all species pairs and also observe that $\overline{L_{i, j}}$ and $α_{i, j}$ are positively correlated with Pearson coefficient 0.48. Our findings lend credence to the fact that the short-range interaction coefficients $α_{i, j}$ capture associations between individuals of different species. Overall, we have shown that the short-range interaction coefficients capture associations between individuals, both within and between species, and the way the model accounts for these associations convincingly models the species' conditional spatial distribution.

7 DISCUSSION

Two main classes of models had previously been proposed to analyse the spatial arrangement of individuals in large multi-species ecological datasets. First, the log-Gaussian Cox process proposed in Waagepetersen et al. (2016) is an elegant model that fits within a Bayesian framework well, but cannot model competition causing repulsion within a species, nor does it scale well with the number of species. In addition, the latent correlated Gaussian fields have no straightforward interpretation in ecological applications. Furthermore, as pointed out when analysing Protium panamense in Section 6.3, the multivariate log-Gaussian Cox process cannot serve as a model for a species with null or negative intra-species interactions that interacts with other species. Second, the saturated Gibbs point process introduced in Rajala et al. (2018) captures pairwise interactions over different ranges, and scales well with the number of species. We find the second class to be more compelling. Inspired by the work of Rajala et al. (2018), in this manuscript we have introduced the ‘saturated pairwise interaction Gibbs point process’ to start working towards a unified framework to untangle the three main drivers underlying community assembly, namely species' dispersal abilities, environmental tolerance and biotic interactions.

In contrast to the model in Rajala et al. (2018), in modelling pairwise interactions, we allow the use of more realistic smooth potential functions instead of linear combinations of step functions. Moreover, our model has a role for marks such as the individuals' size, and these are thought to be influential in affecting species' distribution. These two features have allowed us to handle applications that are out of reach of existing models. For example, the locations of Norway spruces studied in Section 6.1 exhibit exponential pairwise interactions at a distance that is proportional to individuals' diameters. We have also studied other spatial patterns from plant ecology in which competing ecological factors are at play, and have shown how these mechanisms materialise within the framework of the model. We have found that our model has performed well in the Barro Colorado Island analysis in Section 6.3, a dataset containing almost a hundred species and many thousands of individual trees. This has helped us gain additional insights into three very different ecosystems, namely a spruce forest from northern Europe, a subtropical swamp forest, and a neotropical rainforest.

Additionally, we have addressed the problem of simulating this point process, and in particular, we proved in Proposition 1 a crucial result that allows us to apply the ‘coupling from the past’ algorithm to draw samples from the point process. In our manuscript, simulating from the model has helped us carefully validate the model's performance and allowed us to do a sensitivity analysis, see Section 5. We also believe that simulating from the model will be important in future work, since it is necessary to do Monte Carlo simulations as well as compute simulation envelopes and run goodness of fit tests.

Our model can be applied in a wide range of settings, and may also be useful outside of ecology. Indeed, the notion of a physical pairwise interaction making it more or less likely that two individuals occur close by is a compelling assumption that surely also makes sense in physics, epidemiology and economics among others. We have consequently made our fitting and simulation procedures available as an open-source R package, see Appendix S1 for more details.

DATA AVAILABILITY STATEMENT

The model described in this manuscript is openly available on Github at https://www.github.com/iflint1. The analyses of real data sets can be reproduced by executing the corresponding scripts made public as Github gists at https://gist.github.com/iflint1/.

¹Hereon, the term ‘decreasing’ is to be understood in the weak sense, that is, $φ$ is said to be decreasing if for all $x \leq y$ ⁠, $φ (x) \geq φ (y)$ ⁠.

ACKNOWLEDGEMENTS

We thank the anonymous referees whose comments helped improve a previous version of this manuscript. One of the referees in particular has helped us make substantial improvements to an earlier version. This work was supported by Australian Research Council Grant number DP190100613. Open access publishing facilitated by The University of Melbourne, as part of the Wiley - The University of Melbourne agreement via the Council of Australian University Librarians.

REFERENCES

Andrews

J.G.

Ganti

R.K.

Haenggi

Jindal

Weber

(

2010

)

A primer on spatial modeling and analysis in wireless networks

IEEE Commun Mag

156

–

163

Coeurjolly

J.F.

(

2020

)

High-dimensional inference for inhomogeneous Gibbs point processes

Babu

G.J.

Feigelson

E.D.

(

1996

)

Spatial point processes in astronomy

Journal of Statistical Planning and Inference

311

–

326

Baccelli

Błlaszczyszyn

(

2009

)

Stochastic geometry and wireless networks: volume I theory

Hanover, MA

Now Publishers Inc

Available from: https://doi.org/10.1561/1300000006

Baddeley

A.J.

Møller

Waagepetersen

(

2000

)

Non-and semi-parametric estimation of interaction in inhomogeneous point patterns

Statistica Neerlandica

329

–

350

. Available from: https://doi.org/10.1111/1467-9574.00144

Baddeley

Coeurjolly

J.-F.

Rubak

Waagepetersen

(

2014

)

Logistic regression for spatial Gibbs point processes

Biometrika

101

377

–

392

Baddeley

Rubak

Turner

(

2015

)

Spatial point patterns: methodology and applications

London

Chapman & Hall/CRC Press

Coeurjolly

J.-F.

Rubak

(

2013

)

Fast covariance estimation for innovations computed from a spatial Gibbs point process

Scandinavian Journal of Statistics

669

–

684

Condit

(

1998

)

Tropical forest census plots

Berlin, Germany, and Georgetown, Texas

Springer-Verlag and R. G. Landes Company

Condit

Ashton

P.S.

Manokaran

LaFrankie

J.V.

Hubbell

S.P.

Foster

R.B.

(

1999

)

Dynamics of the forest communities at Pasoh and Barro Colorado: comparing two 50-ha plots

Philosophical Transactions of the Royal Society of London Series B, Biological Sciences

354

1739

–

1748

Condit

Ashton

P.S.

Baker

Bunyavejchewin

Gunatilleke

et al. (

2000

)

Spatial patterns in the distribution of tropical tree species

Science

288

1414

–

1418

. Available at: https://science.sciencemag.org/content/288/5470/1414

Connor

C.B.

Hill

B.E.

(

1995

)

Three nonhomogeneous Poisson models for the probability of basaltic volcanism: application to the Yucca mountain region, Nevada

Journal of Geophysical Research: Solid Earth

1978-2012

(

100

10107

–

10125

Daley

D.J.

Vere-Jones

(

2003

)

An introduction to the theory of point processes

Probability and Its Applications

, Vol.

New York

Springer-Verlag

Daley

D.J.

Vere-Jones

(

2008

)

An introduction to the theory of point processes

Probability and Its Applications

, Vol.

New York

Springer-Verlag

Daniel

Horrocks

Umphrey

G.J.

(

2018

)

Penalized composite likelihoods for inhomogeneous Gibbs point process models

Computational Statistics & Data Analysis

124

104

–

116

Deyi

Liu

Cadotte

(

2020

November 13)

Trait dissimilarity and hierarchy predict spatial co-occurrence patterns of tree species in a subtropical forest

Authorea

. Available from: https://doi.org/10.22541/au.160525624.40303010/v1

Dixon

P.M.

(

2002

)

Nearest-neighbor contingency table analysis of spatial segregation for several species

Ecoscience

142

–

151

Fiksel

(

1988

)

Estimation of interaction potentials of Gibbsian point processes

Statistics

–

Flügge

A.J.

Olhede

S.C.

Murrell

D.J.

(

2014

)

A method to detect subcommunities from multivariate spatial associations

Methods in Ecology and Evolution

1214

–

1224

Geyer

C.J.

(

1999

) Likelihood inference for spatial point processes: likelihood and computation. In:

Kendall

Barndroff-Nielsen

van

Lieshout

M.N.

(Eds.)

Stochastic geometry: likelihood and computation

London

Chapman & Hall/CRC Press

, pp.

141

–

172

Good

B.J.

Whipple

S.A.

(

1982

)

Tree spatial patterns: South Carolina bottomland and swamp forests

Bulletin of the Torrey Botanical Club

109

529

–

536

Goulard

Särkkä

Grabarnik

(

1996

)

Parameter estimation for marked Gibbs point processes through the maximum pseudolikelihood method

Scandinavian Journal of Statistics

365

–

379

Hubbell

Condit

Foster

(

2012

)

Barro Colorado forest census plot data

. Available at: http://ctfs.si.edu/webatlas/datasets/bci.

John

Dalling

J.W.

Harms

K.E.

Yavitt

J.B.

Stallard

R.F.

Mirabello

et al. (

2007

)

Soil nutrients influence spatial distributions of tropical tree species

Proceedings of the National Academy of Sciences

104

864

–

869

. Available at: https://www.pnas.org/content/104/3/864

Jones

R.H.

Sharitz

R.R.

James

S.M.

Dixon

P.M.

(

1994

)

Tree population dynamics in seven South Carolina mixed-species forests

Bulletin of the Torrey Botanical Club

121

360

–

368

Kallenberg

(

1983

)

Random measures

, 3rd edition. Berlin, Germany:

Akademie-Verlag

Kupers

S.J.

Wirth

Engelbrecht

B.M.J.

Rüger

(

2019

)

Dry season soil water potential maps of a 50 hectare tropical forest plot on Barro Colorado Island

Panama. Scientific Data

Lombardo

Opitz

Huser

(

2018

)

Point process-based modeling of multiple debris flow landslides using INLA: an application to the 2009 Messina disaster

Stochastic Environmental Research and Risk Assessment

2179

–

2198

Mohler

G.O.

Short

M.B.

Brantingham

P.J.

Schoenberg

F.P.

Tita

G.E.

(

2011

)

Self-exciting point process modeling of crime

Journal of the American Statistical Association

106

100

–

108

Møller

Berthelsen

K.K.

(

2012

)

Transforming spatial point processes into Poisson processes using random superposition

Advances in Applied Probability

–

Møller

Waagepetersen

R.P.

(

2004

)

Statistical inference and simulation for spatial point processes

London

Chapman & Hall/CRC Press

. https://doi.org/10.1007/978-1-4612-0103-8_20

Nam

B.-H.

D'Agostino

R.B.

(

2002

)

Discrimination index, the area under the ROC curve

Boston, MA

Birkhäuser

. Available from:, pp.

267

–

279

Ovaskainen

Tikhonov

Norberg

Guillaume Blanchet

Duan

Dunson

et al. (

2017

)

How to make more out of community data? A conceptual framework and its implementation as models and software

Ecology Letters

561

–

576

Penttinen

Stoyanell

Henttonen

H.M.

(

1992

)

Marked point processes in forest statistics

Forest Science

638

806

–

824

Punchi-Manage

Getzin

Wiegand

Kanagaraj

Gunatilleke

C.V.S.

Gunatilleke

I.A.U.N.

et al. (

2013

)

Effects of topography on structuring local species assemblages in a Sri Lankan mixed dipterocarp forest

Journal of Ecology

101

149

–

160

Rajala

Murrell

D.J.

Olhede

S.C.

(

2018

)

Detecting multivariate interactions in spatial point patterns with Gibbs models and variable selection

Journal of the Royal Statistical Society: Series C

1237

–

1273

R Core Team (

2019

)

R: a language and environment for statistical computing

. Vienna, Austria:

R Foundation for Statistical Computing

. Available from: https://www.R-project.org/

Seabloom

E.W.

Bjørnstad

O.N.

Bolker

B.M.

Reichman

O.J.

(

2005

)

Spatial signature of environmental heterogeneity, dispersal, and competition in successional grasslands

Ecological Monographs

199

–

214

. Available from: https://doi.org/10.1890/03-0841

Seri

Shtilerman

Shnerb

N.M.

(

2015

)

The GLOCAL forest

PLoS One

–

. Available from: https://doi.org/10.1371/journal.pone.0126117

Shen

Waagepetersen

Sun

I.-F.

Hao

Chen

Z.-S.

et al. (

2013

)

Quantifying effects of habitat heterogeneity and other clustering processes on spatial distributions of tree species

Ecology

2436

–

2443

Thompson

(

1955

)

Spatial point processes, with applications to ecology

Biometrika

102

–

115

Uriarte

Condit

Canham

C.D.

Hubbell

S.P.

(

2004

)

A spatially explicit model of sapling growth in a tropical forest: does the identity of neighbours matter?

Journal of Ecology

348

–

360

Waagepetersen

Guan

Jalilian

Mateu

(

2016

)

Analysis of multispecies point patterns by using multivariate log-Gaussian Cox processes

Journal of the Royal Statistical Society: Series C

–

Waller

L.A.

Gotway

C.A.

(

2004

)

Applied spatial statistics for public health data

Boca Raton

John Wiley & Sons

Weiher

Freund

Bunton

Stefanski

Lee

Bentivenga

(

2011

)

Advances, challenges and a developing synthesis of ecological community assembly theory

Philosophical Transactions of the Royal Society B: Biological Sciences

366

2403

–

2413