COMET : Clustering observables modelled by emulated perturbation theory Free

Bias contributions to the power spectrum at linear and one-loop order, which scale as P_L and |$P_L^2$|⁠, respectively.

\|${\cal B}$\|	\|$b_1^2$\|	b₁	1	\|$b_1^2$\|	\|$b_1\, b_2$\|	\|$b_1\, \gamma _2$\|	\|$b_1\, \gamma _{21}$\|	\|$b_2^2$\|	\|$b_2\, \gamma _2$\|	\|$\gamma _2^2$\|	b₂	γ₂	γ₂₁	c₀	c₂	c₄	\|$b_1^2\, c_{\rm nlo}$\|	\|$b_1\, c_{\rm nlo}$\|	c_nlo
linear	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|											\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|
one-loop		\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|

\|${\cal B}$\|	\|$b_1^2$\|	b₁	1	\|$b_1^2$\|	\|$b_1\, b_2$\|	\|$b_1\, \gamma _2$\|	\|$b_1\, \gamma _{21}$\|	\|$b_2^2$\|	\|$b_2\, \gamma _2$\|	\|$\gamma _2^2$\|	b₂	γ₂	γ₂₁	c₀	c₂	c₄	\|$b_1^2\, c_{\rm nlo}$\|	\|$b_1\, c_{\rm nlo}$\|	c_nlo
linear	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|											\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|
one-loop		\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|

Table 1.

Bias contributions to the power spectrum at linear and one-loop order, which scale as P_L and |$P_L^2$|⁠, respectively.

\|${\cal B}$\|	\|$b_1^2$\|	b₁	1	\|$b_1^2$\|	\|$b_1\, b_2$\|	\|$b_1\, \gamma _2$\|	\|$b_1\, \gamma _{21}$\|	\|$b_2^2$\|	\|$b_2\, \gamma _2$\|	\|$\gamma _2^2$\|	b₂	γ₂	γ₂₁	c₀	c₂	c₄	\|$b_1^2\, c_{\rm nlo}$\|	\|$b_1\, c_{\rm nlo}$\|	c_nlo
linear	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|											\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|
one-loop		\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|

\|${\cal B}$\|	\|$b_1^2$\|	b₁	1	\|$b_1^2$\|	\|$b_1\, b_2$\|	\|$b_1\, \gamma _2$\|	\|$b_1\, \gamma _{21}$\|	\|$b_2^2$\|	\|$b_2\, \gamma _2$\|	\|$\gamma _2^2$\|	b₂	γ₂	γ₂₁	c₀	c₂	c₄	\|$b_1^2\, c_{\rm nlo}$\|	\|$b_1\, c_{\rm nlo}$\|	c_nlo
linear	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|											\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|
one-loop		\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|	\|$\checkmark$\|

3.3.1 Projection into multipoles and reconstruction of anisotropic power spectrum

In order to avoid having to emulate the full two-dimensional dependence of |$P_{{\cal B}}$| on k and μ, we project the angular dependence into multipoles,

$$\begin{eqnarray} P_{{\cal B},\ell }(k) = \frac{2\ell +1}{2} \int _{-1}^1 \mathrm{d}\mu \, {\cal L}_{\ell }(\mu)\, P_{{\cal B}}(k,\mu)\, , \end{eqnarray}$$

(45)

and emulate only the monopole, quadrupole and hexadecapole, from which we reconstruct the 2D power spectra using the Legendre expansion. This procedure would be exact if all of the contributions contain powers of, at most, μ⁴. However, the redshift-space galaxy power spectrum contains terms up to μ⁸ and, moreover, the dependence of the IR damping factor on μ² leads to non-zero multipoles for all even ℓ. Our reconstruction using only information up to the hexadecapole therefore introduces an error that becomes more relevant for the higher order multipoles computed through equation (42). In order to approximately correct for that we include the ℓ = 6 multipole |$P_{{\cal B},6}$|⁠, evaluated at fixed redshift z = 1 and for a fixed set of ΛCDM cosmological parameters taken from the Planck TT, TE, EE + low E + lensing constraints (Planck Collaboration VI 2020) in the Legendre expansion:

$$\begin{eqnarray} &&P_{{\cal B}}(\boldsymbol{k}|z,\boldsymbol{\Theta }_{\rm s},\boldsymbol{\Theta }_{\rm e}) \approx \ \sum _{\ell = 0}^2 P_{{\cal B},2\ell }(k|z,\boldsymbol{\Theta }_{\rm s},\boldsymbol{\Theta }_{\rm e})\, {\cal L}_{2\ell }(\mu) \nonumber \\ &&+ P_{{\cal B},6}\left\lbrace k|f(z,\boldsymbol{\Theta }_{\rm s},\boldsymbol{\Theta }_{\rm e}), P_L\big (k|\boldsymbol{\Theta }^{\rm Planck}_{\rm s},\sigma _{12}(z,\boldsymbol{\Theta }_{\rm s},\boldsymbol{\Theta }_{\rm e})\big)\right\rbrace \, {\cal L}_6(\mu)\, . \nonumber \\ \end{eqnarray}$$

(46)

Our notation highlights that the fixed Planck values only enter through the shape parameters affecting the linear power spectrum, while the dependence of the growth rate and σ₁₂ on the shape and evolution parameters is correctly accounted for. This is achieved by splitting up each |$P_{{\cal B},6}$| into contributions with different powers of f and scaling their amplitude as follows:

$$\begin{eqnarray} \left.P_{{\cal B},6}\right|_{\rm Planck} \,\,\rightarrow \,\, \left(\frac{\sigma _{12}(z,\boldsymbol{\Theta }_{\rm s},\boldsymbol{\Theta }_{\rm e})}{\sigma _{12}\left(z=1,\boldsymbol{\Theta }^{\rm Planck}_{\rm s},\boldsymbol{\Theta }^{\rm Planck}_{\rm e}\right)}\right)^{2L}\, \left.P_{{\cal B},6}\right|_{\rm Planck}\, , \nonumber \\ \end{eqnarray}$$

(47)

where L = 1 for all terms indicated as linear in Table 1 and L = 2 for the one-loop terms.⁶ In Section 4.4, we quantify the inaccuracies introduced by equation (46) and demonstrate that they are negligible.

3.3.2 Constructing ratios with the linear power spectrum

The amplitude of |$P_{{\cal B},\ell }$| can be subject to large variations over the full range of values that the emulated parameters can assume. As that makes the emulation more difficult, we instead emulate the ratios, |$\beta _{{\cal B},\ell }$|⁠, of |$P_{{\cal B},\ell }$| and the linear power spectrum (excluding IR resummation), which significantly reduces the dynamical range of the relevant quantities. After emulating the ratios we then need to multiply again by the linear power spectrum, for which we build a separate emulator. However, this emulator can be constructed over the shape parameters alone, as shown by the following (exact) computation:

$$\begin{eqnarray} P_{{\cal B},\ell }(k|z,\boldsymbol{\Theta }_{\rm s},\boldsymbol{\Theta }_{\rm e}) &=& \beta _{{\cal B},\ell }(k|\boldsymbol{\Theta }_{\rm s},\sigma _{12},f)\, P_L(k|\boldsymbol{\Theta }_{\rm s},\sigma _{12}) \nonumber \\ &=& \beta _{{\cal B},\ell }(k|\boldsymbol{\Theta }_{\rm s},\sigma _{12},f)\, P_L \left(k|z=1,\boldsymbol{\Theta }_{\rm s},\boldsymbol{\Theta }_{\rm e}^{\rm fixed}\right) \nonumber \\ &&\times \, \left(\frac{\sigma _{12}}{\sigma _{12}\left(z=1,\boldsymbol{\Theta }_{\rm s},\boldsymbol{\Theta }_{\rm e}^{\rm fixed}\right)}\right)^2\, , \end{eqnarray}$$

(48)

where we set |$\sigma _{12} = \sigma _{12}(z,\boldsymbol{\Theta }_{\rm s},\boldsymbol{\Theta }_{\rm e})$| and used that the dependence on σ₁₂ can be factored out of the linear power spectrum by evaluating P_L at fixed redshift and evolution parameters and rescale the amplitude accordingly.⁷ The value of |$\sigma _{12}\left(z=1,\boldsymbol{\Theta }_{\rm s},\boldsymbol{\Theta }_{\rm e}^{\rm fixed}\right)$| can be obtained as an integral over |$P_L(k|z=1,\boldsymbol{\Theta }_{\rm s},\boldsymbol{\Theta }_{\rm e}^{\rm fixed})$|⁠, but instead we find it is more efficient to include an emulator for the former as a function of only |$\boldsymbol{\Theta }_{\rm s}$|⁠. As noted in Section 3.2, the finger-of-god damping factor in the VDG model depends on the velocity dispersion and a computation similar to that in equation (48) shows that it is sufficient to predict the value of |$\sigma _v(z=1,\boldsymbol{\Theta }_{\rm s},\boldsymbol{\Theta }_{\rm e}^{\rm fixed})$|⁠, which we also emulate as a function of the shape parameters.

3.3.3 Summary

In summary, we thus require an emulation of the following quantities:

|$\beta _{{\cal B},\ell }(k|\boldsymbol{\Theta }_{\rm s},\sigma _{12},f)$| for ℓ = 0, 2, 4, and each |${\cal B}$| from Table 1 ,
|$P_L\left(k|z=1,\boldsymbol{\Theta }_{\rm s},\boldsymbol{\Theta }_{\rm e}^{\rm fixed}\right)$| ,
|$\sigma _{12}\left(z=1,\boldsymbol{\Theta }_{\rm s},\boldsymbol{\Theta }_{\rm e}^{\rm fixed}\right)$| ,
|$\sigma _{v}\left(z=1,\boldsymbol{\Theta }_{\rm s},\boldsymbol{\Theta }_{\rm e}^{\rm fixed}\right)$| .

We evaluate the ratios and P_L on a range of scales extending from |$7 \times 10^{-4}\, \mathrm{Mpc}^{-1}$| to |$0.35\, \mathrm{Mpc}^{-1}$|⁠, using a total number of 106 points, chosen such that they provide a dense sampling on scales relevant for the BAO wiggles.

3.4 Parameter space and training process

3.4.1 Parameter ranges

Our emulator is constructed for a total of five parameters. In addition to f and σ₁₂, we consider the three shape parameters ω_b, ω_c, and n_s. Each of them is allowed to vary within the ranges given in Table 2, which for the latter three were chosen to span roughly a 12, 30, and 11σ interval around the Planck 2018 best-fitting values, respectively. The growth rate and σ₁₂ capture the dependence on redshift and an arbitrary set of evolution parameters,⁸ and therefore require a more generous support. None the less, the limits on f and σ₁₂ impose restrictions on the range of redshifts for which our emulator can be used. In case of the growth rate the ranges can accommodate any redshift for ω_c ≳ 0.107, while for the most extreme values of the allowed shape parameters the lower boundary imposes the limitation z ≳ 0.1. Tighter restrictions on the supported redshifts come from the range of σ₁₂ and to demonstrate that we compare them in Fig. 1 against the Planck prediction of σ₁₂ as a function of redshift using the best-fitting cosmological parameter values from Planck Collaboration VI (2020). When exploring cosmological parameters using a large-scale structure likelihood function, they will give rise to values of σ₁₂ that are typically close (within |$\sim 10\, {{\ \rm per\ cent}}$|⁠) to the Planck prediction, while the 1σ uncertainty on σ₁₂ is of the order |$5\, {{\ \rm per\ cent}}$| for constraints from the BOSS galaxy survey (Semenaite et al. 2022). To account for these uncertainties, we plot a 20 per cent error band around the Planck prediction in Fig. 1 and expect that any sampled cosmological parameters will correspond to σ₁₂ values falling roughly into that range. We see that this band leaves the σ₁₂ range of our emulator at a redshift z ∼ 3, which means that COMET is no longer guaranteed to provide accurate predictions beyond that redshift.

Prediction of σ12 as a function of redshift using the Planck 2018 TT, TE, EE + lowE + lensing best-fitting cosmological parameters. The blue-shaded band represents a 20 per cent variation around that prediction, while for comparison the supported range of values for σ12 in our emulator COMET is shown as the grey-shaded area.

Figure 1.

Prediction of σ₁₂ as a function of redshift using the Planck 2018 TT, TE, EE + lowE + lensing best-fitting cosmological parameters. The blue-shaded band represents a 20 per cent variation around that prediction, while for comparison the supported range of values for σ₁₂ in our emulator COMET is shown as the grey-shaded area.

Table 2.

Emulator parameter space and the supported range of values.

Parameter	Min. emulator range	Max. emulator range
ω_b	0.0205	0.02415
ω_c	0.085	0.155
n_s	0.92	1.01
σ₁₂	0.2	1.0
f	0.5	1.05

Table 2.

Emulator parameter space and the supported range of values.

Parameter	Min. emulator range	Max. emulator range
ω_b	0.0205	0.02415
ω_c	0.085	0.155
n_s	0.92	1.01
σ₁₂	0.2	1.0
f	0.5	1.05

3.4.2 Generation of training data sets

We generate two separate training sets, one spanning the full parameter space intended for the ratios |$\beta _{{\cal B},\ell }$|⁠, and another covering only the shape parameters for the remaining quantities. Both training sets are built by drawing a number of samples from a Latin Hypercube, using 1500 and 750 samples, respectively. In order to obtain an optimal coverage of the parameter spaces we repeat the sampling step 10 000 times and pick the set which maximizes the minimum (Euclidean) distance between any two of its points. We then evaluate all of the model ingredients using a numerical integrator and starting from camb-generated linear input power spectra. Before training the emulator, we perform one additional pre-processing step, in which we further reduce the dynamical range of the training data by taking the logarithm and in which we normalize each component, such that it has zero mean and unit variance over the full set of samples.

3.4.3 Gaussian process emulation

For the actual emulation step, we employ Gaussian Processes (GP), whose properties are discussed in detail in Rasmussen & Williams (2006). The crucial ingredient in a GP model is the kernel function |$K(\boldsymbol{x},\boldsymbol{x}^{\prime })$|⁠, which describes the covariance between two points |$\boldsymbol{x}$| and |$\boldsymbol{x}^{\prime }$| of the training set. Due to a lack of knowledge about the precise functional form of this covariance, we generated emulators with different combinations of commonly used kernel functions in the literature. After comparing their performance, we settled on the following kernel function:

$$\begin{eqnarray} K(\boldsymbol{x},\boldsymbol{x}^{\prime }) = K_{\rm exp}(\boldsymbol{x},\boldsymbol{x}^{\prime }) + K_{3/2}(\boldsymbol{x},\boldsymbol{x}^{\prime })\, , \end{eqnarray}$$

(49)

which is a combination of a squared exponential kernel,

$$\begin{eqnarray} K_{\rm exp}(\boldsymbol{x},\boldsymbol{x}^{\prime }) = \exp {\left(-\frac{r^2}{2}\right)}\, , \end{eqnarray}$$

(50)

and a Matérn kernel of degree ν = 3/2,

$$\begin{eqnarray} K_{3/2}(\boldsymbol{x},\boldsymbol{x}^{\prime }) = \left(1 + \sqrt{3} r\right)\, \exp {\left(-\sqrt{3} r\right)}\, , \end{eqnarray}$$

(51)

where |$r^2 = \sum _{i=1}^d \left(x_i - x^{\prime }_i\right)^2/l_i^2$| and d is the dimension of the parameter space. The quantities |$\boldsymbol{l}$| represent so-called hyperparameters, which characterize the length scales of typical features in the training data and they can differ between the two kernel functions. The values of these hyperparameters are optimized by maximizing the log-likelihood of our GP models with respect to the training data. We implement this procedure using the publically available package gpy and repeat the optimization step five times with different random initializations, selecting for each emulator the parameter set that provides the largest log-likelihood.

3.5 Computational performance

In this section, we measure the execution times of COMET for the two different RSD models, each for a different number of multipoles and number of scales. The computational performance will of course depend on the given platform, so the reader should keep in mind that all measurements reported here are based on a laptop equipped with an Apple M1 Pro processor, using one CPU (up to 3.22 Ghz) and a single thread.

We measure the execution times using the function perf_counter from python’s time module and in order to reduce uncertainties, we repeat the prediction of the power spectrum multipoles N_exec times for values of N_exec ranging from 1 to 35. We then fit a straight line to the resulting measurements as a function of N_exec, such that the slope of that line provides a robust estimate (and uncertainty) of the execution time per call. In Fig. 2, we plot these estimates for a varying number of scales spaced logarithmically between |$k_{\rm min} = 0.001\, \mathrm{Mpc}^{-1}$| and |$k_{\rm max} = 0.35\, \mathrm{Mpc}^{-1}$| for the EFT model in blue and the VDG model in orange. Filled symbols in each case correspond to the prediction of only the monopole, while open symbols also include the computation of the quadrupole and hexadecapole. We see that the execution times range from |$\sim 9.5\, \mathrm{ms}$| to |$\sim 11.5\, \mathrm{ms}$| for 50–300 bins in case of the EFT model. The VDG model takes on average about |$1\, \mathrm{ms}$| longer, due to the integration over μ also involving the evaluation of the effective damping function W_∞. On the other hand, we see little difference between a prediction of just the monopole, or all three multipoles, since for the reconstruction of the anisotropic power spectrum the emulator of all three multipoles need to be called in any case.

Figure 2.

Computation time as a function of the provided number of bins for the EFT and VDG models, either for the monopole alone, or all three multipoles.

Considering that the execution time for other public codes, such as CLASS-PT (Chudaykin et al. 2020) or PyBird (D’Amico, Senatore & Zhang 2021) is of the order |$\sim 1\, \mathrm{s}$| (based on a timing estimate given by the authors of the former code, but using more than one CPU), COMET achieves a speed-up of at least two orders of magnitude.

4 VALIDATION OF THE EMULATOR

In this section, we perform a number of mock analyses based on synthetic data sets at multiple redshifts using statistical uncertainties corresponding to a volume ten times larger than that expected for the Euclid galaxy survey. These analyses not only allow us to determine the relative uncertainties introduced by the emulator, they also – and this is of much greater relevance – let us test how these uncertainties propagate to the posteriors of cosmological and galaxy bias parameters.

4.1 Generation of synthetic validation data

4.1.1 ΛCDM validation set

Our main validation sample consists of a set of flat ΛCDM cosmologies covering the parameters ω_b, ω_c, n_s, h, and A_s for the EFT model, while for the VDG model we also include a_vir. Each validation set is generated by drawing 1500 random points within these five-, or six-dimensional parameter spaces, satisfying only the minimum and maximum values given in Table 3. For each point in the validation set, we compute the power spectrum multipoles using the exact model at the four redshifts z = 0.9, 1.2, 1.5, and 1.8, and making the following assumptions for the galaxy bias parameters: we fix the value of the linear bias using |$b_1(z) = \sqrt{1+z}$|⁠, which provides a reasonable estimate of the bias of H α galaxies to be selected by Euclid (Rassat et al. 2008; di Porto, Amendola & Branchini 2012), whereas for γ₂ we impose the excursion-set relation of Sheth, Chan & Scoccimarro (2013) and Eggemeier et al. (2020). We then determine the values of b₂ and γ₂₁ according to the peak-background-split and coevolution relations from Lazeyras et al. (2016) and Eggemeier et al. (2019), respectively, as functions of b₁ and γ₂, and set all other counterterm and stochastic model parameters to zero. Each of the synthetic multipoles accounts for Alcock-Paczynski distortions in an exact manner (i.e. not via the approximation discussed in Section 3.3.1), based on a fiducial flat ΛCDM cosmology at the same redshifts as the predictions and with fixed parameters h = 0.67 and ω_m = ω_c + ω_b = 0.1432.

Table 3.

Parameters included in the ΛCDM validation set and their minimum and maximum values. The parameter a_vir is only included in the validation set for the VDG model and its minimum and maximum values are given in units of |$h^{-1}\, \mathrm{Mpc}$|⁠.

Parameter	Minimum	Maximum
ω_b	0.02100	0.02365
ω_c	0.095	0.145
n_s	0.93	1.00
h	0.55	0.85
A_s	0.8	3.0
a_vir	0.0	8.0

Table 3.

Parameters included in the ΛCDM validation set and their minimum and maximum values. The parameter a_vir is only included in the validation set for the VDG model and its minimum and maximum values are given in units of |$h^{-1}\, \mathrm{Mpc}$|⁠.

Parameter	Minimum	Maximum
ω_b	0.02100	0.02365
ω_c	0.095	0.145
n_s	0.93	1.00
h	0.55	0.85
A_s	0.8	3.0
a_vir	0.0	8.0

In a second step, we construct statistical uncertainties for these synthetic measurements by adopting Gaussian covariances matrices, which are computed from the synthetic multipoles at each point in the validation set using the expressions given, for instance, in Grieb et al. (2016). In order to relate these uncertainties to the characteristics of the Euclid survey, we further assume tracer densities that match the number densities of H α galaxies in the Euclid Flagship I mock catalogues (see Table C1), as well as Euclid-like volumes. The latter are derived from redshift shells of width Δz = 0.2, centred on the respective redshifts for which the power spectrum multipoles have been evaluated, and covering a sky area of |$15\,000\, \mathrm{deg}^2$|⁠. To increase the stringency of our validation tests, we then multiply these volumes by a factor of 10, which is equivalent with a reduction of the statistical uncertainties by a factor of ∼3.

We stress that the various choices in generating this synthetic data set are not meant to provide power spectrum measurements that closely mimic those of any real galaxy samples. Nonetheless, we expect the relative uncertainties, σ_ℓ/P_ℓ, to be well representative (apart from the stringency volume factor) for Euclid, such that we can make a meaningful assessment of the performance of our emulator.

4.1.2 Synthetic measurements for fixed set of parameters

For a further validation test we generate one more set of synthetic power spectrum multipoles at fixed cosmological and bias parameters. We compute the power spectrum multipoles from the exact model (in this case only for the EFT) at the same four redshifts as before, but using the bias parameter values for b₁, b₂, γ₂₁, c₀, c₂, c₄, c_nlo, and |$N^P_0$| given in Table C1. The value for γ₂ is again fixed in terms of the excursion set relation, while the cosmological parameters at all four redshifts are set to h = 0.67, ω_c = 0.1212, ω_b = 0.021996, n_s = 0.96, and A_s = 2.11065. As described in Section 4.1.1, we then derive statistical uncertainties for these synthetic measurements in the Gaussian approximation, using the number densities specified in Table C1 and volumes that correspond to the same redshift shells as above, including the stringency factor of 10.

4.2 Results across ΛCDM validation set

Our goal is to quantify the impact of the emulation inaccuracies on mean parameter values and their uncertainties, when performing a full likelihood exploration using all three galaxy power spectrum multipoles. To that end we use the synthetic data vectors and covariance matrices described in Section 4.1.1 and for each combination of cosmological parameters in the validation set we run two MCMC,⁹ one with the exact theory model, the other using the emulator predictions. In those chains, we keep the cosmological parameters fixed, while varying a set of seven bias parameters: b₁, b₂, γ₂₁, the constant shot noise N₀, as well as the three counterterm parameters c₀, c₂, and c₄; γ₂ is fixed in terms of the aforementioned excursion set relation as a function of b₁. We pick a maximum wavemode of |$0.3\, h\, \mathrm{Mpc}^{-1}$| for all three multipoles in this analysis, which means that any significant deviations between the true and emulated models up to that scale can lead to shifts between the posterior mean values recovered from the two chains, as well as to differences in the credible regions.

4.2.1 Relative inaccuracies

Before identifying the effects on the parameter constraints, let us first consider the relative differences between the exact model and the emulator as a function of scale. These are shown exemplarily for the EFT model at z = 0.9 in the upper panels of Fig. 3, where each line has been evaluated for a different point in the validation set and a randomly selected point from the chain over bias parameters that was run for the exact model. We see that the relative differences for the monopole (blue) and quadrupole (orange) grow between |$k = 0.1\, h\, \mathrm{Mpc}^{-1}$| and |$0.2\, h\, \mathrm{Mpc}^{-1}$|⁠, after which they saturate and generally stay below the |$0.2\, {{\ \rm per\ cent}}$| threshold. This can be seen more clearly in the cumulative histogram in the fourth panel, which plots the maximum of the absolute relative difference over all scales and shows that there is only a vanishingly small fraction of cosmologies in the validation data sets that gives rise to discrepancies larger than |$0.2\, {{\ \rm per\ cent}}$|⁠. In fact, we find that |$68\, {{\ \rm per\ cent}}$| of all validation samples have maximum uncertainties smaller than |$0.08\, {{\ \rm per\ cent}}$| and |$0.1\, {{\ \rm per\ cent}}$| for the monopole and quadrupole, respectively. The situation is slightly worse for the hexadecapole, where for the same fraction of validation samples the maximum differences are only below |$0.3\, {{\ \rm per\ cent}}$|⁠, but this is mostly due to the hexadecapole crossing zero for many of the tested cosmologies. It is more meaningful to plot these differences in units of some estimate of the measurement uncertainties, and for the results in the lower panels of Fig. 3 we have picked the standard deviations taken from the covariance matrices discussed in Section 4.1.1. We now obtain the reversed picture: due to the monopole having the smallest uncertainties, its differences appear larger than for both, the quadrupole and hexadecapole, so it will dominate the impact on any parameter posteriors. However, |$68\, {{\ \rm per\ cent}}$| of the validation samples still have a maximum difference smaller than |$0.24\, \sigma$| at 10 times the Euclid volume for this redshift shell. The analogous results for the VDG model and the other redshifts are qualitatively very similar – we only note that the differences in units of the measurement uncertainties decrease with increasing redshift because of the decreasing tracer number densities and thus a larger contribution from shot noise (see Appendix C).

$Inaccuracies of the emulated multipoles as a function of scale for the EFT model at z = 0.9. Differences are shown in per cent (upper panels) and in units of the standard deviation of our synthetic data set (covering 10 times the volume of a Euclid redshift shell, see Section 4.1.1; lower panels) for all cosmologies of the validation set and using a combination of bias parameters from a random point in each chain. The fourth panel shows the cumulative histogram of the maximum absolute differences over the full range of scales with the two vertical dashed lines indicating $68\, {{\ \rm per\ cent}}$ and $95\, {{\ \rm per\ cent}}$ of the validation samples.$

Figure 3.

Inaccuracies of the emulated multipoles as a function of scale for the EFT model at z = 0.9. Differences are shown in per cent (upper panels) and in units of the standard deviation of our synthetic data set (covering 10 times the volume of a Euclid redshift shell, see Section 4.1.1; lower panels) for all cosmologies of the validation set and using a combination of bias parameters from a random point in each chain. The fourth panel shows the cumulative histogram of the maximum absolute differences over the full range of scales with the two vertical dashed lines indicating |$68\, {{\ \rm per\ cent}}$| and |$95\, {{\ \rm per\ cent}}$| of the validation samples.

4.2.2 Shifts in posterior means

In order to study, the impact of these inaccuracies on the parameter posteriors, we extract the 1D-marginalized posterior means for each of the seven bias parameters from the two chains and compute their difference in units of the 1σ parameter uncertainty obtained from the chain using the exact model, σ_{X, true} (with X denoting any of the seven bias parameters). Fig. 4 shows the resulting histograms over the full validation set for all four redshifts and both of the RSD models. We clearly see that none of these cases produces any significant shifts in the varied parameters. More precisely, the means of the distributions stay consistently below |$0.1\, \sigma _{X,\mathrm{true}}$|⁠, while the standard deviations reach a maximum of |$0.2\, \sigma _{X,\mathrm{true}}$|⁠, showing that for the majority of the validation cosmologies we recover the true posterior means with high accuracy, while even the largest shifts remain negligible for the nominal Euclid volume. The parameters b₁ and N₀ generally display the smallest shifts, since they already get well constrained by the large-scale power spectrum, where the emulation inaccuracies are smallest. The higher order bias parameters b₂ and γ₂₁, as well as the counterterm parameters, on the other hand, are mostly constrained from the non-linear regime, such that they are more susceptible to the slightly larger emulation errors for k-modes beyond |$0.2\, h\, \mathrm{Mpc}^{-1}$|⁠. The largest shift (with a mean value of |$\sim 0.25\, \sigma _{X,\mathrm{true}}$|⁠) occurs for c₄ in case of the VDG model at z = 0.9. As we will show in Section 4.4, this is because the VDG model is more heavily affected by inaccuracies from the reconstruction of the anisotropic power spectrum and these inaccuracies are most notable for the small-scale hexadecapole. Since c₄ is mainly constrained by the hexadecapole, it absorbs these mismatches, resulting in the larger shifts. Finally, we note that the standard deviations for the distributions of the shifts are a little smaller than the value of the maximum difference (in units of σ) for |$68\, {{\ \rm per\ cent}}$| of the samples found in Section 4.2.1, but they are generally consistent, suggesting that the latter can serve as a good indicator for the performance of the emulator in explorations of the likelihood.

$Distributions of the differences between the posterior mean values obtained from running MCMC with the exact and emulated model predictions. Chains were run for synthetic data including the monopole, quadrupole and hexadecapole up to $k_{\rm max} = 0.3\, h\, \mathrm{Mpc}^{-1}$ with uncertainties corresponding to expected errors for the Euclid survey (see Section 4.1.1), but with ten times larger volumes. Each column shows the distribution for a different parameter varied in the chain in units of the respective standard deviation extracted from the chain using the true model, while the two different rows correspond to the different RSD models. The transition from dark to bright colours indicates increasing redshifts.$

Figure 4.

Distributions of the differences between the posterior mean values obtained from running MCMC with the exact and emulated model predictions. Chains were run for synthetic data including the monopole, quadrupole and hexadecapole up to |$k_{\rm max} = 0.3\, h\, \mathrm{Mpc}^{-1}$| with uncertainties corresponding to expected errors for the Euclid survey (see Section 4.1.1), but with ten times larger volumes. Each column shows the distribution for a different parameter varied in the chain in units of the respective standard deviation extracted from the chain using the true model, while the two different rows correspond to the different RSD models. The transition from dark to bright colours indicates increasing redshifts.

It is instructive to check whether the largest inaccuracies from the emulator occur dominantly in certain parts of the cosmological parameter space. For that reason in Figs 5 and 6, we plot the shifts in the linear bias parameter for the EFT and VDG models at z = 0.9, respectively, using all two-dimensional projections of the validation parameter set. We see indeed that for certain parameter combinations larger shifts (lighter colours) do not appear randomly, but in well separated regions: for instance, for the EFT model the most obvious separation occurs in the ω_b–ω_c parameter plane, showing that there are larger inaccuracies for large values of ω_b and simultaneously small values of ω_c. A similar trend can also be observed for the VDG model, in which case there is an additional tendency for larger shifts in the ω_b–A_s parameter plane. We obtain similar results for the other redshifts, emphasizing that in central regions of the parameter space, in particular for values of ω_b close to the Planck or Big Bang Nucleosynthesis priors, where one would preferentially sample the cosmological likelihood, the emulator performs best.

Shifts in the posterior means of b1 between the true and emulated EFT model at z = 0.9. Each panel shows a scatter plot of all validation samples, projected into different parameter planes.

Figure 5.

Shifts in the posterior means of b₁ between the true and emulated EFT model at z = 0.9. Each panel shows a scatter plot of all validation samples, projected into different parameter planes.

Figure 6.

Same as Fig. 5, but for the VDG model at z = 0.9.

4.2.3 Impact on credible regions

Finally, let us consider how well we can recover the 1σ credible regions for the parameters varied in the chains. This is more difficult to quantify precisely because, unlike the posterior means, the credible regions are more heavily affected by sampling noise, i.e. they carry a stronger dependence on the initial seed used for the MCMC at fixed convergence criterion. Ideally, we would therefore first quantify the sampling noise for each point in the validation set by running multiple chains with different initial seeds and constructing probability distributions of the 1σ credible region for each parameter, which could be compared between the exact and emulated models. However, as that procedure is very costly due to the need of running many individual chains, we settle on an approximate comparison only.

First we assume that the sampling noise is independent of cosmology and that the values of the 1σ confidence limits for each bias parameter are drawn from Gaussian distributions with means |$\overline{\sigma }_{X,\mathrm{true/emu}}(\boldsymbol{\Theta })$| and variance |$\sigma _{X, \mathrm{sampling}}^2$|⁠, where |$\boldsymbol{\Theta }$| denotes the dependence on cosmological parameters contained in the validation set. The difference in the estimated 1σ confidence limits between the true and emulated models is then also Gaussian distributed with mean |$\overline{\sigma }_{X,\mathrm{emu}}(\boldsymbol{\Theta }) - \overline{\sigma }_{X,\mathrm{true}}(\boldsymbol{\Theta }) = \Delta \overline{\sigma }_{X}(\boldsymbol{\Theta })$| and variance |$2\sigma ^2_{X,\mathrm{sampling}}$|⁠. In the case that |$\Delta \overline{\sigma }_X(\boldsymbol{\Theta })$| is small or has negligible dependence on cosmology, we can regard each value Δσ_X obtained at a different validation cosmology as being independently drawn from the same sampling noise distribution and we can interpret the offset of that distribution from zero mean as the accuracy with which we can recover the 1σ credible regions. On the other hand, if |$\Delta \overline{\sigma }_X(\boldsymbol{\Theta })$| strongly depends on cosmology, the distribution of Δσ_X values over the validation set will be broader and/or have a different shape, and so we cannot immediately assess the significance of the differences in σ_X from the data we have generated.

In Fig. 7, we plot the distributions of Δσ_X, normalized by the average over the entire validation set, <σ_{X, true} >, for the two RSD models and the four different redshifts. In order to quantify the sampling noise, we pick a ΛCDM cosmology with parameter values corresponding to the Planck 2018 TT, TE, EE + low E + lensing constraints, for which we run 1000 chains with the exact predictions for the EFT and VDG models at each redshift, and varying the same seven bias parameters as before, but with different initial seeds. We then determine |$\sigma ^2_{X,\mathrm{sampling}}$| in each case by fitting a Gaussian to the resulting distributions of σ_X/ < σ_X > −1, where <σ_X > is the average of σ_X over the 1000 different chains. This allows us to plot the reference sampling noise distributions (grey dashed lines in Fig. 7) as Gaussians with variance |$2\sigma _{X,\mathrm{sampling}}^2$| and mean given by the median of the EFT distributions (blue histograms).

$Histograms of the differences in the $68\, {{\ \rm per\ cent}}$ credible regions between the true and emulated models, normalized by the average 1σ constraints obtained from the true model of all validation samples, <σX, true >. Each column depicts a different parameter varied in the chains, while each row shows a different redshift of the synthetic data set; the two different colours correspond to the two RSD models. The grey dashed lines are Gaussian distributions, centred on the median value of the EFT histograms, and represent an estimate of the spread due to sampling noise alone (see the text for details).$

Figure 7.

Histograms of the differences in the |$68\, {{\ \rm per\ cent}}$| credible regions between the true and emulated models, normalized by the average 1σ constraints obtained from the true model of all validation samples, <σ_{X, true} >. Each column depicts a different parameter varied in the chains, while each row shows a different redshift of the synthetic data set; the two different colours correspond to the two RSD models. The grey dashed lines are Gaussian distributions, centred on the median value of the EFT histograms, and represent an estimate of the spread due to sampling noise alone (see the text for details).

From these plots, we observe that many of the histograms over the validation set are indeed consistent with sampling noise and moreover, in those cases the differences between σ_{X, true} and σ_{X, emu} are at the per cent level, which is insignificant compared to the spread due to sampling noise. Some parameters, in particular b₁, N₀, and c₀, display broader or skewed distributions, suggesting that in those cases the cosmology dependence of the differences of σ_X is stronger. In those cases, we can only deduce that on average the differences are still at the per cent level (see median values), but it is not possible to judge their significance. The deviations from the sampling noise distributions are largest at z = 0.9, where the synthethic measurements uncertainties are smallest and hence where the emulation inaccuracies carry the strongest weight, while going to z = 1.8 gives again very good consistency with sampling noise for all parameters. We do not find any significant differences between the two RSD models.

4.3 Results for analysis with varying cosmological parameters

Finally, we analyse the synthetic power spectrum multipoles described in Section 4.1.2. As before, we run two chains, one with the exact model, the other using COMET, but instead of keeping the cosmological parameters fixed as in the previous section, we now also vary h, ω_c, and A_s, setting only n_s and ω_b to their fiducial values. Out of the full set of bias parameters we include b₁, b₂, γ₂₁, c₀, c₂, c₄, and N₀ in the chains, fixing γ₂ and c_nlo to the values used in the generation of the synthetic data. Since explorations of the likelihood with varying cosmological parameters is much more computationally expensive for our exact model code, we limits ourselves here to only a single case per redshift.

The chains are run using MultiNest (Feroz & Hobson 2008; Feroz, Hobson & Bridges 2009; Feroz et al. 2019) with 1800 live points in case of COMET and a standard Metropolis–Hastings sampler with a total number of 42 000 accepted steps in case of the exact model. After processing these chains with getdist (Lewis 2019) we obtain the 2d marginalized posteriors for the full set of parameters at redshift z = 0.9 shown in Fig. 8, where the results based on the exact model correspond to the blue contours, the ones based on COMET to the orange contours. We see that the agreement between the posteriors of the two models is close to perfect: the mean posterior values of all parameters are almost identical and any occurring shifts are well below the 1σ level, while the 1σ and 2σ credible regions are equally well recovered. Some slight differences can be observed in the tails of the posterior distributions, but these are most sensitive to the sampling routines and therefore most likely caused by differences in the two samplers used here. Although not shown, we find qualitatively very similar results at the three remaining redshifts, so that these findings confirm our results from Section 4.2 at fixed cosmology.

$Comparison of the 2d and 1d marginalized posteriors obtained from running MCMC with the exact model and COMET for the EFT. The results shown stem from the synthetic data set at redshift z = 0.9, as described in Section 4.1.2 and using a volume corresponding to 10 times the volume of a Euclid redshift shell. The value of γ2 was constrained to the excursion set relation of Eggemeier et al. (2020), while cnlo was fixed to the fiducial value (see Table C1). The three counterterm parameters c0, c2, and c4 are given in units of $(h^{-1}\, \mathrm{Mpc})^2$. Vertical and horizontal dashed lines indicate the fiducial parameter values.$

Figure 8.

Comparison of the 2d and 1d marginalized posteriors obtained from running MCMC with the exact model and COMET for the EFT. The results shown stem from the synthetic data set at redshift z = 0.9, as described in Section 4.1.2 and using a volume corresponding to 10 times the volume of a Euclid redshift shell. The value of γ₂ was constrained to the excursion set relation of Eggemeier et al. (2020), while c_nlo was fixed to the fiducial value (see Table C1). The three counterterm parameters c₀, c₂, and c₄ are given in units of |$(h^{-1}\, \mathrm{Mpc})^2$|⁠. Vertical and horizontal dashed lines indicate the fiducial parameter values.

4.4 Reconstruction of anisotropic power spectrum

In this section, we report the impact of inaccuracies caused by reconstructing the full anisotropic power spectrum from the monopole, quadrupole, and hexadecapole only, as well as the approximate inclusion of the ℓ = 6 multipole discussed in Section 3.3.1. To that end we make again use of the validation set described in Section 4.1.1 and compute the first three multipoles for each validation cosmology using the exact model (i.e. without any input from the emulator), but without inclusion of Alcock–Paczynski distortions and, in case of the VDG model, without the FoG damping term. Like for our emulator, we then reconstruct the anisotropic power spectrum from those multipole moments, apply Alcock–Pacyznski distortions and FoG damping, and as a final step evaluate the observed multipoles. We can compare these predictions with those that do not make use of the multipole reconstruction in order to determine the differences as a function of the cosmological parameters.

This is shown in the top row of Fig. 9, where we plot the maximum difference (taken over all scales up to |$k_{\mathrm{max}} = 0.3\, h\, \mathrm{Mpc}^{-1}$|⁠) in units of the synthetic standard deviations for each validation point at z = 0.9, projected into the h–A_s parameter plane. The first three panels depict these differences for the monopole to hexadecapole for EFT model, the next three for the VDG model. We note that for the EFT model there is virtually no impact on the monopole and quadrupole, even for measurement uncertainties corresponding to 10 times the volume contained in a Euclid redshift shell. The situation is different for the hexadecapole, where we obtain maximum differences of up to ∼0.5σ, in particular for large and small values of h. On the other hand, for the VDG model the effect is noticeably larger with maximum differences going well beyond 0.5σ for the hexadecapole (up to ∼2.5σ) and also more significant inaccuracies for the monopole and quadrupole, preferentially for large values of h and A_s. This happens because the FoG damping factor carries a significant additional line-of-sight dependence, which amplifies the contributions from higher multipole moments not included in the reconstruction.

$Maximum differences up to $k_{\mathrm{max}} = 0.3\, h\, \mathrm{Mpc}^{-1}$ (in units of the standard deviation of our synthetic data set, see Section 4.1) for each point of the validation set between the exact computation of the power spectrum multipoles and when the anisotropic power spectrum before application of Alcock-Paczynski distortions and FoG damping is obtained from a Legendre decomposition truncated at finite order. In the top row the latter is based on the first three non-zero multipoles, while in the bottom row we include the ℓ = 6 multipole moment evaluated at fixed shape parameters (see Section 3.3.1 for details). Both sets of predictions are generated without any input from the emulator.$

Figure 9.

Maximum differences up to |$k_{\mathrm{max}} = 0.3\, h\, \mathrm{Mpc}^{-1}$| (in units of the standard deviation of our synthetic data set, see Section 4.1) for each point of the validation set between the exact computation of the power spectrum multipoles and when the anisotropic power spectrum before application of Alcock-Paczynski distortions and FoG damping is obtained from a Legendre decomposition truncated at finite order. In the top row the latter is based on the first three non-zero multipoles, while in the bottom row we include the ℓ = 6 multipole moment evaluated at fixed shape parameters (see Section 3.3.1 for details). Both sets of predictions are generated without any input from the emulator.

The lower panels of Fig. 9 shows how the maximum difference improve when the ℓ = 6 multipole, evaluated at fixed shape parameters corresponding to the Planck 2018 TT,TE,EE+lowE + lensing values, is included in the Legendre expansion. We see that even when using this approximation the inaccuracies are significantly reduced for both RSD models. Specifically, they now stay below ∼0.2σ for the hexadecapole of the EFT model, while only about |$5\, {{\ \rm per\ cent}}$| of the validation samples reach maximum difference larger than ∼0.2σ and 0.35σ for the quadrupole and hexadecapole of the VDG model, respectively. Returning to the larger shifts in the counterterm parameter c₄ that we noticed in Section 4.2.2 for the VDG model, we find that they depend on cosmology in a very similar way as the differences in the bottom right panel of Fig. 9, implying that these shifts are caused by the remaining reconstruction inaccuracies in the hexadecapole. However, we stress that these are not only negligible, they also occur in a region of the cosmology parameter space that is not relevant for likelihood explorations.

5 CONCLUSIONS

5.1 Summary

We have presented COMET, an emulator of the galaxy power spectrum multipoles in redshift-space based on two different perturbation theory models: the EFT model as employed in the analyses of Ivanov et al. (2020), d’Amico et al. (2020), which fully expands the real- to redshift-space mapping, and the VDG model, which models the impact of small-scale velocity differences via a non-perturbative damping function (Scoccimarro 2004; Taruya et al. 2010; Sánchez et al. 2017). The leading idea that was driving the design of our emulator was to minimise the emulation parameter space, in order to reach an optimal compromise between computation time and accuracy. For that reason we have adopted the evolution mapping approach of Sánchez et al. (2022) and trained COMET internally in units of Mpc over the range |$0.0007\, \mathrm{Mpc}^{-1}$| to |$0.35\, \mathrm{Mpc}^{-1}$| using only the shape parameters ω_b, ω_c, n_s, in addition to σ₁₂ and the growth rate f. In this way we are able to support a broad set of evolution parameters, specifically h, A_s, Ω_K, w₀, and w_a, by mapping them to the corresponding values of σ₁₂ and f at any given redshift (up to an upper limit of z ∼ 3, imposed by our chosen range for σ₁₂). Furthermore, we emulate all independent contributions that arise from the galaxy bias expansion separately, which precludes the associated parameters from the emulator parameter space, and apply AP distortions and the effective damping function in case of the VDG model in a separate step. This gives COMET the flexibility to support any fiducial background cosmologies and arbitrary functional forms of the damping term. A single evaluation of the monopole, quadrupole and hexadecapole at |${\cal O}(100)$| scales takes about |$10\, \mathrm{ms}$| when executed on a single CPU.

Using a series of validation tests, we verified that COMET does not introduce any relevant loss in accuracy in comparison to the exact perturbation theory models. We constructed a large validation set consisting of 1500 synthetic power spectrum measurements at four different redshifts between z = 0.9 and z = 1.8, covering the five-dimensional cosmological parameter space: ω_b, ω_c, n_s, h and A_s. Adopting a fixed set of galaxy bias parameters at each redshift and in case of the EFT model, we find that the relative inaccuracies stay below |$0.08(0.14)\, {{\ \rm per\ cent}}$|⁠, |$0.11(0.17)\, {{\ \rm per\ cent}}$| and |$0.30(1.33)\, {{\ \rm per\ cent}}$| for |$68(95)\, {{\ \rm per\ cent}}$| of the validation samples and the monopole, quadrupole, and hexadecapole, respectively, up to |$k_{\rm max} = 0.3\, h\, \mathrm{Mpc}^{-1}$|⁠. We further generated statistical uncertainties for our synthetic measurements using Gaussian covariances and specifics tied to the Euclid survey, but with a tenfold increase in volume. In units of these uncertainties the emulation errors are below |$0.24(0.42)\, \sigma$|⁠, |$0.10(0.17)\, \sigma$|⁠, and |$0.05(0.10)\, \sigma$|⁠. We then ran MCMC, varying a set of seven bias parameters both for COMET and the exact model and analysing all three synthetic multipoles for each validation sample, finding that the shifts in the mean posterior values closely match the emulation inaccuracies in units of σ. Additionally, we find that the |$68\, {{\ \rm per\ cent}}$| credible regions are very well recovered and any occurring differences are typically smaller than the MCMC sampling noise. For the VDG model the performance is very similar, apart from a slightly less accurate hexadecapole (see discussion in Section 4.4), but without negative effects on the recovery of the posteriors. Finally, we constructed one more synthetic collection of measurements (again at the same four redshifts) for a fixed set of cosmological and bias parameters, which we used to demonstrate that even when running chains over the full parameter space (including cosmological parameters) there is no appreciable difference in the resulting posterior distributions between the exact model and COMET.

While we have explicitly demonstrated that the emulator is accurate up to scales of at least |$k_{\rm max} = 0.3\, h\, \mathrm{Mpc}^{-1}$|⁠, we caution that this does not have to apply to the underlying theoretical model itself. The validity of the one-loop perturbation theory depends on the relative size of the neglected non-linear corrections, as well as the amplitude of the galaxy bias parameters, and is thus a function of redshift and the particular galaxy sample under consideration. Any application to real data must therefore be preceded by a thorough study of the model’s robustness under changes of the maximum scale cuts — a task that is ideally suited for COMET due to its superior computational efficiency.

We leave two important extensions of COMET to forthcoming publications: Firstly, the power spectrum models for both the EFT and VDG can be easily transformed to configuration space and thus, using the same emulation technniques as presented here, we would also be able to make fast predictions of the two-point correlation function multipoles. Secondly, the new generation of galaxy surveys is expected to improve our current constraints on the masses of neutrinos, which is why it is particularly important to be able to predict galaxy clustering statistics as a function of non-zero neutrino masses, unlike we have assumed here.

5.2 Comparison to related emulators in the literature

While most galaxy clustering emulators that have been presented in the literature so far have been built from simulations and focus on the non-linear regime, there are two sets of works, which are closely related to what we have presented here and which we want to briefly compare against. In particular, Donald-McCann et al. (2022), DeRose et al. (2022) have presented two perturbation theory emulators of the galaxy power spectrum multipoles. The former, EFTEMU, is based on the PyBird code (d’Amico et al. 2020), which implements perturbation theory expressions identical to those described in Section 2 for the EFT model apart from slight differences in the infrared resummation procedure¹⁰ and the definition of the galaxy bias parameters (see Section 2.2 for a conversion), while the latter, EmulateLSS, is based on the Lagrangian perturbation theory model of Chen et al. (2021). Although sharing the same goal, there are a number of differences between these two emulators and COMET. On the one hand, this concerns the parameter space and the range of scales for which they provide predictions: EFTEMU supports the five cosmological parameters ω_b, ω_c, h, A_s, and n_s (i.e. it does not allow for deviations from the equation of state of a cosmological constant or for non-flat cosmologies), and with the exception of n_s all of these have smaller ranges than in COMET. EmulateLSS is even more restrictive as it also fixes the spectral index and does not cover the full galaxy bias and counterterm parameter space, ignoring for instance the second- and third-order tidal bias parameters. Moreover, both of these emulators make predictions for a fixed fiducial background cosmology and at fixed redshifts (each new fiducial background cosmology or redshift would require generating a new set of training data) and while EmulateLSS gives predictions in the range of scales from |$0.001\, h\, \mathrm{Mpc}^{-1}$| to |$0.5\, h\, \mathrm{Mpc}^{-1}$|⁠, EFTEMU has a more limiting range with a maximum wavemode of |$0.19\, h\, \mathrm{Mpc}^{-1}$|⁠. On the other hand, the prediction accuracies for the three multipoles quoted by DeRose et al. (2022) are only slightly worse than what we have determined for COMET, and also Donald-McCann et al. (2022) find a sub-per cent accuracy for fixed BOSS-like galaxy bias parameters, making it comparable with COMETin this regard (albeit on a more limited range of scales). Finally, both EFTEMU and EmulateLSS use neural networks instead of Gaussian processes for the emulation procedure and their computation time therefore does not depend on the size of the training set. This leads to a better computational performance than COMET by about one order of magnitude based on the fact that both works quote a computation time of |$\sim 1\, \mathrm{ms}$| (we stress, however, that eventually the computational bottleneck will lie elsewhere, e.g. in the computation of the χ² or, for joint chains with the bispectrum, the computation of the bispectrum model).

The methodology discussed in Zennaro et al. (2021), Aricò et al. (2021), and Kokron et al. (2021) follows the same perturbative galaxy bias expansion as presented in Section 2.2 (although the third order term related to the parameter γ₂₁ is neglected) and they have presented emulators for the various contributions to the galaxy power spectrum based on either perturbation theory (Aricò et al. 2021) like here, or on simulations in order to extend the predictions into the non-linear regime (Zennaro et al. 2021; Kokron et al. 2021). All three works consider scales up to |$1\, h\, \mathrm{Mpc}^{-1}$|⁠, but cover somewhat more restrictive redshift ranges than COMET, z_max = 1.5 (Aricò et al. 2021; Zennaro et al. 2021) and z_max = 2 (Kokron et al. 2021). They also emulate over somewhat different cosmological parameter spaces: the former two also cover an eight-dimensional cosmology parameter space, but instead of Ω_K account for the neutrino mass, while in the latter they use a seven-dimensional parameter space that includes the effective number of relativistic species, but does not allow for either Ω_K or a dynamical dark energy equation of state. Moreover, they generally quote an emulation accuracy of |$\sim 1\, {{\ \rm per\ cent}}$| – one order of magnitude larger than our findings here, but for this comparison one should bear in mind that these results carry a dependency on the nature of the validation tests. While these three works only make predictions for the galaxy power spectrum in real-space, Pellejero Ibañez et al. (2022) extends their formalism in order to account for redshift-space distortions. They show that using a halo displacement field extracted from simulations and a phenomenological finger-of-god model accounting for the velocity dispersion of satellite galaxies and satellite fraction, it is possible to recover power spectrum multipoles up to |$k \sim 0.6\, h\, \mathrm{Mpc}^{-1}$| that are accurate within measurement uncertainties corresponding to a volume of |$3 (h^{-1}\, \mathrm{Gpc})^3$|⁠.

5.3 Functionality of the COMET package

COMET is a freely available python package (https://gitlab.com/aegge/comet-emu), which can be installed via pip and all required tables and emulators are downloaded automatically (re-training the emulators is not necessary). We currently provide emulators for the real-space power spectrum and the redshift-space power spectrum multipoles for the EFT model.¹¹ Predictions can be made using either the Mpc or |$h^{-1}\, \mathrm{Mpc}$| unit systems and for either the native parameter space or for a given dark energy model. The former consists of the three shape parameters ω_b, ω_c, and n_s, in combination with σ₁₂ and f, while in case of the latter one needs to specify the evolution parameters h, A_s, Ω_K, w₀, and w_a at some redshift z instead of σ₁₂ and f. COMET accepts an arbitrary range of scales, but for those scales outside of the range for which we trained the emulators (⁠|$k \in [0.0007,\, 0.35]\, \mathrm{Mpc}^{-1}$|⁠), we apply a power law extrapolation. Further features of the package include the following:

The prediction of the linear matter power spectrum, with or without application of infrared resummation (BAO damping)
The prediction of the real-space tree-level galaxy bispectrum and the tree-level galaxy bispectrum multipoles in redshift-space
The computation of the Gaussian covariance matrices of the power spectrum and bispectrum multipoles (both, real- and redshift-space)
The computation of the χ² for a given set of measurements and their covariance matrix for arbitrary k_max scale cuts; we also provide a functionality that drastically speeds up the χ² computation in case of fixed cosmological parameters (e.g. when running a chain over only galaxy bias parameters)
The possibility to choose between different galaxy bias bases (see Section 2.2)
Exact treatment of discreteness and finite bin width effects for the power spectrum multipoles

More information and some tutorials can be found on the documentation pages: https://comet-emu.readthedocs.io/en/latest/index.html.

ACKNOWLEDGEMENTS

We thank the anonymous referee for useful comments, in particular the suggestion that led to the inclusion of Appendix B. AE is supported at the Argelander Institut für Astronomie by an Argelander Fellowship. BCQ and MC acknowledge support from the Spanish Ministerio de Ciencia e Innovación under grant PGC2018-102021-B-I00, and BCQ additionally acknowledges support from a PhD scholarship from the Secretaria d’Universitats i Recerca de la Generalitat de Catalunya i del Fons Social Europeu. AGS acknowledges the support of the Excellence Cluster ORIGINS, which is funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy – EXC-2094 – 390783311. This research made use of matplotlib, a Python library for publication quality graphics (Hunter 2007).

DATA AVAILABILITY

The data underlying this article will be shared on reasonable request to the corresponding author.

Footnotes

1

This means that all pairs of galaxies have the same line-of-sight, which we take to be the |$\hat{z}$|-direction.

2

Using the Legendre polynomial in the definition of |$N^P_{2,2}$| is an arbitrary choice, which simply guarantees that this term can only contribute to the quadrupole of the power spectrum in the absence of Alcock–Paczynski distortions.

3

Note that the value for k_s is defined in units of Mpc⁻¹ opposed to |$h\, \mathrm{Mpc}^{-1}$| since COMET internally works in Mpc units (see Section 3).

4

The growth rate cannot be fully factorized because it also affects the damping term in the IR resummation procedure (see Section 2.4), in particular equation (33).

5

We note that depending on the chosen unit system (Mpc vs. |$h^{-1}\, \mathrm{Mpc}$|⁠), H and D_M need to be given either in units of |$\mathrm{km}\, \mathrm{s}^{-1}\mathrm{Mpc}^{-1}$| and Mpc, or |$\mathrm{km}\, \mathrm{s}^{-1}(h^{-1}\, \mathrm{Mpc})^{-1}$| and |$h^{-1}\, \mathrm{Mpc}$|⁠, respectively.

6

In this way, only the dependence of the IR damping term on f and σ₁₂ is not correctly taken into account, but instead computed for z = 1 and at the fixed Planck cosmological parameters.

7

The fixed values for z and the evolution parameters are not essential, but we used z = 1, h = 0.695, A_s = 2.2078559 and all other potential evolution parameters set to zero.

8

In our current release of COMET, we allow to specify values for h, A_s, ω_K, w₀, and w_a.

9

These chains are run with emcee (Foreman-Mackey et al. 2013), using 32 walkers and for a number of steps at least ten times the maximum autocorrelation length for all varied parameters. The chains are then post-processed with getdist (Lewis 2019) in order to extract posteriors and related statistics.

10

A comparison between CLASS-PT (Chudaykin et al. 2020), which uses the same infrared resummation technique as discussed in Section 2.4, and PyBird has been presented in Nishimichi et al. (2020), finding very similar results.

11

The VDG model will be released as part of an upcoming publication.

REFERENCES

Alcock

C.

,

Paczynski

B.

,

1979

,

Nature

,

281

,

358

10.1038/281358a0

Angulo

R. E.

,

Zennaro

M.

,

Contreras

S.

,

Aricò

G.

,

Pellejero-Ibañez

M.

,

Stücker

J.

,

2021

,

MNRAS

,

507

,

5869

10.1093/mnras/stab2018

Aricò

G.

,

Angulo

R. E.

,

Zennaro

M.

,

2022

,

Open Research Europe

,

1

,

152

Assassi

V.

,

Baumann

D.

,

Green

D.

,

Zaldarriaga

M.

,

2014

,

J. Cosmol. Astropart. Phys.

,

2014

,

056

Baldauf

T.

,

Mirbabayi

M.

,

Simonović

M.

,

Zaldarriaga

M.

,

2015

,

Phys. Rev. D

,

92

,

043514

Ballinger

W. E.

,

Peacock

J. A.

,

Heavens

A. F.

,

1996a

,

MNRAS

,

282

,

877

10.1093/mnras/282.3.877

Ballinger

W. E.

,

Peacock

J. A.

,

Heavens

A. F.

,

1996b

,

MNRAS

,

282

,

877

10.1093/mnras/282.3.877

Baumann

D.

,

Nicolis

A.

,

Senatore

L.

,

Zaldarriaga

M.

,

2012

,

J. Cosmol. Astropart. Phys.

,

2012

,

051

Beutler

F.

et al. ,

2017

,

MNRAS

,

466

,

2242

10.1093/mnras/stw3298

Blas

D.

,

Garny

M.

,

Ivanov

M. M.

,

Sibiryakov

S.

,

2016

,

J. Cosmol. Astropart. Phys.

,

2016

,

028

Carrasco

J. J. M.

,

Hertzberg

M. P.

,

Senatore

L.

,

2012

,

J. High Energy Phys.

,

2012

,

82

10.1007/JHEP12(2020)082

Chan

K. C.

,

Scoccimarro

R.

,

Sheth

R. K.

,

2012

,

Phys. Rev. D

,

85

,

083509

Chen

S.-F.

,

Vlah

Z.

,

Castorina

E.

,

White

M.

,

2021

,

J. Cosmol. Astropart. Phys.

,

2021

,

100

Chudaykin

A.

,

Ivanov

M. M.

,

Philcox

O. H. E.

,

Simonović

M.

,

2020

,

Phys. Rev. D

,

102

,

063533

Cuesta-Lazaro

C.

,

Li

B.

,

Eggemeier

A.

,

Zarrouk

P.

,

Baugh

C. M.

,

Nishimichi

T.

,

Takada

M.

,

2020

,

MNRAS

,

498

,

1175

10.1093/mnras/staa2249

d’Amico

G.

,

Gleyzes

J.

,

Kokron

N.

,

Markovic

K.

,

Senatore

L.

,

Zhang

P.

,

Beutler

F.

,

Gil-Marín

H.

,

2020

,

J. Cosmol. Astropart. Phys.

,

2020

,

005

D’Amico

G.

,

Senatore

L.

,

Zhang

P.

,

2021

,

J. Cosmol. Astropart. Phys.

,

2021

,

006

Dekel

A.

,

Lahav

O.

,

1999

,

ApJ

,

520

,

24

10.1086/307428

DeRose

J.

,

Chen

S.-F.

,

White

M.

,

Kokron

N.

,

2022

,

J. Cosmol. Astropart. Phys.

,

2022

,

056

Desjacques

V.

,

2008

,

Phys. Rev. D

,

78

,

103503

Desjacques

V.

,

Crocce

M.

,

Scoccimarro

R.

,

Sheth

R. K.

,

2010

,

Phys. Rev. D

,

82

,

103529

Desjacques

V.

,

Jeong

D.

,

Schmidt

F.

,

2018a

,

Phys. Rep.

,

733

,

1

Desjacques

V.

,

Jeong

D.

,

Schmidt

F.

,

2018b

,

J. Cosmol. Astropart. Phys.

,

2018

,

035

10.1111/j.1365-2966.2011.19755.x

di Porto

C.

,

Amendola

L.

,

Branchini

E.

,

2012

,

MNRAS

,

419

,

985

Donald-McCann

J.

,

Koyama

K.

,

Beutler

F.

,

2023

,

MNRAS

,

518

,

3106

Eggemeier

A.

,

Scoccimarro

R.

,

Smith

R. E.

,

2019

,

Phys. Rev. D

,

99

,

123514

Eggemeier

A.

,

Scoccimarro

R.

,

Crocce

M.

,

Pezzotta

A.

,

Sánchez

A. G.

,

2020

,

Phys. Rev. D

,

102

,

103530

Eggemeier

A.

,

Scoccimarro

R.

,

Smith

R. E.

,

Crocce

M.

,

Pezzotta

A.

,

Sánchez

A. G.

,

2021

,

Phys. Rev. D

,

103

,

123550

Eisenstein

D. J.

,

Hu

W.

,

1998

,

ApJ

,

496

,

605

10.1086/305424

Euclid Collaboration

,

2019

,

MNRAS

,

484

,

5509

10.1093/mnras/stz197

Euclid Collaboration

,

2021

,

MNRAS

,

505

,

2840

10.1093/mnras/stab1366

10.1111/j.1365-2966.2007.12353.x

Feroz

F.

,

Hobson

M. P.

,

2008

,

MNRAS

,

384

,

449

10.1111/j.1365-2966.2009.14548.x

Feroz

F.

,

Hobson

M. P.

,

Bridges

M.

,

2009

,

MNRAS

,

398

,

1601

Feroz

F.

,

Hobson

M. P.

,

Cameron

E.

,

Pettitt

A. N.

,

2019

,

Open J. Astrophys.

,

2

,

10

10.21105/astro.1306.2144

Foreman-Mackey

D.

,

Hogg

D. W.

,

Lang

D.

,

Goodman

J.

,

2013

,

PASP

,

125

,

306

10.1086/670067

Giblin

B.

,

Cataneo

M.

,

Moews

B.

,

Heymans

C.

,

2019

,

MNRAS

,

490

,

4826

10.1093/mnras/stz2659

Grieb

J. N.

,

Sánchez

A. G.

,

Salazar-Albornoz

S.

,

Dalla Vecchia

C.

,

2016

,

MNRAS

,

457

,

1577

10.1093/mnras/stw065

10.1088/0004-637X/705/1/156

Heitmann

K.

,

Higdon

D.

,

White

M.

,

Habib

S.

,

Williams

B. J.

,

Lawrence

E.

,

Wagner

C.

,

2009

,

ApJ

,

705

,

156

Hunter

J. D.

,

2007

,

Comput. Sci. Eng.

,

9

,

90

10.1109/MCSE.2007.55

Ivanov

M. M.

,

Sibiryakov

S.

,

2018

,

J. Cosmol. Astropart. Phys.

,

2018

,

053

Ivanov

M. M.

,

Simonović

M.

,

Zaldarriaga

M.

,

2020

,

J. Cosmol. Astropart. Phys.

,

2020

,

042

Juszkiewicz

R.

,

Fisher

K. B.

,

Szapudi

I.

,

1998

,

ApJ

,

504

,

L1

10.1086/311558

Kobayashi

Y.

,

Nishimichi

T.

,

Takada

M.

,

Takahashi

R.

,

Osato

K.

,

2020

,

Phys. Rev. D

,

102

,

063504

Kokron

N.

,

DeRose

J.

,

Chen

S.-F.

,

White

M.

,

Wechsler

R. H.

,

2021

,

MNRAS

,

505

,

1422

10.1093/mnras/stab1358

10.1088/0004-637X/810/1/35

Kwan

J.

,

Heitmann

K.

,

Habib

S.

,

Padmanabhan

N.

,

Lawrence

E.

,

Finkel

H.

,

Frontiere

N.

,

Pope

A.

,

2015

,

ApJ

,

810

,

35

Laureijs

R.

et al. ,

2011

,

preprint

(

)

Lazeyras

T.

,

Wagner

C.

,

Baldauf

T.

,

Schmidt

F.

,

2016

,

J. Cosmol. Astropart. Phys.

,

2016

,

018

Levi

M.

et al. ,

2013

,

preprint

(

)

Lewis

A.

,

2019

,

preprint

(

)

Matsubara

T.

,

1999

,

ApJ

,

525

,

543

10.1086/307931

McDonald

P.

,

Roy

A.

,

2009

,

J. Cosmol. Astropart. Phys.

,

2009

,

020

Mirbabayi

M.

,

Schmidt

F.

,

Zaldarriaga

M.

,

2015

,

J. Cosmol. Astropart. Phys.

,

2015

,

030

10.1103/PhysRevD.99.063530

Nishimichi

T.

,

D’Amico

G.

,

Ivanov

M. M.

,

Senatore

L.

,

Simonović

M.

,

Takada

M.

,

Zaldarriaga

M.

,

Zhang

P.

,

2020

,

Phys. Rev. D

,

102

,

123541

Osato

K.

,

Nishimichi

T.

,

Bernardeau

F.

,

Taruya

A.

,

2019

,

Phys. Rev. D

,

99

,

063530

Pellejero Ibañez

M.

,

Stücker

J.

,

Angulo

R. E.

,

Zennaro

M.

,

Contreras

S.

,

Aricò

G.

,

2022

,

MNRAS

,

514

,

3993

10.1093/mnras/stac1602

Perko

A.

,

Senatore

L.

,

Jennings

E.

,

Wechsler

R. H.

,

2016

,

preprint

(

10.1051/0004-6361/201833910

)

Planck Collaboration VI

,

2020

,

A&A

,

641

,

A6

Pueblas

S.

,

Scoccimarro

R.

,

2009

,

Phys. Rev. D

,

80

,

043504

Rasmussen

C. E.

,

Williams

C. K. I.

,

2006

,

Gaussian Processes for Machine Learning

.

MIT Press

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

Rassat

A.

et al. ,

2008

,

preprint

(

)

Sánchez

A. G.

,

2020

,

Phys. Rev. D

,

102

,

123511

Sánchez

A. G.

et al. ,

2017

,

MNRAS

,

464

,

1640

10.1093/mnras/stw2443

Sánchez

A. G.

,

Ruiz

A. N.

,

Jara

J. G.

,

Padilla

N. D.

,

2022

,

MNRAS

,

514

,

5673

10.1093/mnras/stac1656

Scoccimarro

R.

,

2004

,

Phys. Rev. D

,

70

,

083007

Scoccimarro

R.

,

2015

,

Phys. Rev. D

,

92

,

083532

Scoccimarro

R.

,

Couchman

H. M. P.

,

Frieman

J. A.

,

1999

,

ApJ

,

517

,

531

10.1086/307220

Semenaite

A.

et al. ,

2022

,

MNRAS

,

512

,

5657

10.1093/mnras/stac829

Senatore

L.

,

2015

,

J. Cosmol. Astropart. Phys.

,

2015

,

007

Senatore

L.

,

Zaldarriaga

M.

,

2014

,

preprint

(

)

Sheth

R. K.

,

1996

,

MNRAS

,

279

,

1310

10.1093/mnras/279.4.1310

Sheth

R. K.

,

Chan

K. C.

,

Scoccimarro

R.

,

2013

,

Phys. Rev. D

,

87

,

083002

Taruya

A.

,

Soda

J.

,

1999

,

ApJ

,

522

,

46

10.1086/307612

Taruya

A.

,

Nishimichi

T.

,

Saito

S.

,

2010

,

Phys. Rev. D

,

82

,

063522

Taruya

A.

,

Nishimichi

T.

,

Bernardeau

F.

,

2013

,

Phys. Rev. D

,

87

,

083509

Vlah

Z.

,

Seljak

U.

,

Yat Chu

M.

,

Feng

Y.

,

2016

,

J. Cosmol. Astropart. Phys.

,

2016

,

057

Yuan

S.

,

Garrison

L. H.

,

Eisenstein

D. J.

,

Wechsler

R. H.

,

2022

,

MNRAS

,

515

,

871

Zennaro

M.

,

Angulo

R. E.

,

Pellejero-Ibáñez

M.

,

Stücker

J.

,

Contreras

S.

,

Aricò

G.

,

2021

,

preprint

(

)

Zhai

Z.

et al. ,

2019

,

ApJ

,

874

,

95

10.3847/1538-4357/ab0d7b

APPENDIX A: PERTURBATION THEORY KERNELS

In the formulation used by COMET, the perturbative term of both redshift-space models, EFT and VDG, can be written in the following way,

$$\begin{eqnarray} P_{gg}(k,\mu) &=& \,\, P^{\rm tree}_{gg,\rm SPT}(k,\mu) + P^{\rm 1-loop}_{gg,\rm SPT}(k,\mu) \nonumber \\ & & + P_{gg}^{\rm {stoch}}(k,\mu) + P_{gg}^{\rm {ctr}}(k,\mu)\, , \end{eqnarray}$$

(A1)

where the individual contribution are given by

$$\begin{eqnarray} P_{gg,\rm SPT}^{{\rm {tree}}}(k,\mu) = Z_1^{\, 2}(\boldsymbol{k}) \, P_{\rm {L}}(k), \end{eqnarray}$$

(A2)

$$\begin{eqnarray} P_{gg,\rm SPT}^{{\rm {1-loop}}}(k,\mu) &=& \,\, P_{gg,22}(k,\mu) + P_{gg,13}(k,\mu) = \nonumber \\ &=& \,\, 2\int _{\boldsymbol{q}} Z_2^{\, 2}(\boldsymbol{q},\boldsymbol{k}-\boldsymbol{q})\, P_{\rm {L}}(|\boldsymbol{k}-\boldsymbol{q}|)\, P_{\rm {L}}(q) \, + \nonumber \\ && + 6\, Z_1(\boldsymbol{k})\, P_{\rm {L}}(k)\int _{\boldsymbol{q}}Z_3(\boldsymbol{q},-\boldsymbol{q},\boldsymbol{k})\, P_{\rm {L}}(q)\, , \end{eqnarray}$$

(A3)

$$\begin{eqnarray} P_{\rm {gg}}^{{\rm {stoch}}}(k,\mu) = \frac{1}{\bar{n}}\left[N_0^P+k^2\left[N_{20}^P+N_{22}^P\mathcal {L}_2(\mu)\right]\right]\, , \end{eqnarray}$$

(A4)

$$\begin{eqnarray} P_{gg}^{{\rm {ctr}}}(k,\mu) &=& \,\, P_{gg}^{{\rm {ctr,LO}}}(k,\mu) + P_{gg}^{{\rm {ctr,NLO}}}(k,\mu) = \nonumber \\ & =& \,\, -2\left[c_0+c_2\mathcal {L}_2(\mu)+c_4\mathcal {L}_4(\mu)\right]k^2P_{\rm {L}}(k) \, +\nonumber \\ && + c_\mathrm{nlo}f^4\mu ^4\, \, k^4\, Z_1(\boldsymbol{k})^2\, P_{\rm {L}}(k)\, . \end{eqnarray}$$

(A5)

Here, the redshift-space kernels Z_n are defined as

$$\begin{eqnarray} Z_1(\boldsymbol{k}) = b_1+f\mu ^2, \end{eqnarray}$$

(A6)

$$\begin{eqnarray} Z_2(\boldsymbol{k}_1,\boldsymbol{k}_2) &=& \,\, {\cal K}_2(\boldsymbol{k}_1,\boldsymbol{k}_2) + f\mu ^2 G_2(\boldsymbol{k}_1,\boldsymbol{k}_2) \, + \nonumber \\ & & + \frac{1}{2}fk\mu \left[\frac{\mu _1}{k_1}\left(b_1+f\mu _2^{\, 2}\right)+\frac{\mu _2}{k_2}\left(b_1+f\mu _1^{\, 2}\right)\right]\, , \end{eqnarray}$$

(A7)

$$\begin{eqnarray} Z_3(\boldsymbol{k}_1,\boldsymbol{k}_2,\boldsymbol{k}_3) &=& \,\, {\cal K}_3(\boldsymbol{k}_1,\boldsymbol{k}_2,\boldsymbol{k}_3) + f\mu ^2 G_3(\boldsymbol{k}_1,\boldsymbol{k}_2,\boldsymbol{k}_3) \, + \nonumber \\ && + \frac{1}{2}f^2k^2\mu ^2\frac{\mu _2\, \mu _3}{k_2\, k_3}\left(b_1+f\mu _1^{\, 2}\right) \, +\nonumber \\ && + fk\mu \frac{\mu _3}{k_3}\left[b_1F_2(\boldsymbol{k}_1,\boldsymbol{k}_2)+f\mu _{12}^{\, 2}G_2(\boldsymbol{k}_1,\boldsymbol{k}_2)\right] \, + \nonumber \\ && + fk\mu \frac{\mu _{23}}{k_{23}}\left(b_1+f\mu _1^{\, 2}\right)G_2(\boldsymbol{k}_2,\boldsymbol{k}_3) \, + \nonumber \\ && +fk\mu \frac{\mu _1}{k_1}\left[\frac{b_2}{2}+\gamma _2K(\boldsymbol{k}_2,\boldsymbol{k}_3)\right]\, , \end{eqnarray}$$

(A8)

where the real-space galaxy kernels |${\cal K}_n$| read

$$\begin{eqnarray} {\cal K}_2(\boldsymbol{k}_1,\boldsymbol{k}_2)=b_1F_2(\boldsymbol{k}_1,\boldsymbol{k}_2)+\frac{b_2}{2}+\gamma _2\, K(\boldsymbol{k}_1,\boldsymbol{k}_2)\, , \end{eqnarray}$$

(A9)

$$\begin{eqnarray} {\cal K}_3(\boldsymbol{k}_1,\boldsymbol{k}_2,\boldsymbol{k}_3)&=& \,\, b_1F_3(\boldsymbol{k}_1,\boldsymbol{k}_2,\boldsymbol{k}_3) + b_2F_2(\boldsymbol{k}_1,\boldsymbol{k}_2)\, +\nonumber \\ &+&2\gamma _2K(\boldsymbol{k}_1,\boldsymbol{k}_2+\boldsymbol{k}_3)F_2(\boldsymbol{k}_2,\boldsymbol{k}_3)\, +\nonumber \\ &+&2\gamma _2K(\boldsymbol{k}_1,\boldsymbol{k}_2+\boldsymbol{k}_3)\left[F_2(\boldsymbol{k}_2,\boldsymbol{k}_3)-G_2(\boldsymbol{k}_2,\boldsymbol{k}_3)\right]\, , \nonumber \\ \end{eqnarray}$$

(A10)

with

$$\begin{eqnarray} K(\boldsymbol{k}_1,\boldsymbol{k}_2)=\frac{\left(\boldsymbol{k}_1\cdot \boldsymbol{k}_2\right)^2}{k_1^{\, 2}k_2^{\, 2}}-1\, . \end{eqnarray}$$

(A11)

APPENDIX B: BINNING AND DISCRETENESS EFFECTS IN ESTIMATES OF THE POWER SPECTRUM

The power spectrum multipoles are estimated from density grids in Fourier space by taking the square of the Fourier coefficients at each wavevector |$\boldsymbol{k}$| and multiplying by the respective Legendre polynomial (for more detail, see Scoccimarro 2015). These estimates are then summarized into measurements at multiple wavemode bins, k_i, by averaging over all wavevectors that fall into a spherical shell centred on k_i with some specified bin width Δk. Both, the finite bin width as well as the discreteness of the Fourier grid, introduce differences compared to the theoretical power spectrum multipoles as defined in Section 3, which are evaluated at fixed k and by integrating over continuous values of μ, the orientation of |$\boldsymbol{k}$| with respect to the line-of-sight (see e.g. equation 45). This can be accounted for by averaging the theoretical power spectrum P(k, μ) in the same way as done for the measurements (e.g. Taruya et al. 2013). The ‘observed’ theoretical multipoles are thus given by

$$\begin{eqnarray} P_{\ell }^{\rm obs}(k_i) = \frac{2\ell + 1}{N_k} \sum _{|\boldsymbol{k}| \in [k_i - \Delta k/2, k_i + \Delta k/2]} P(k,\mu)\, {\cal L}_{\ell }(\mu)\, , \end{eqnarray}$$

(B1)

where N_k are the total number of wavevectors per spherical shell.

Instead of averaging the full power spectrum, a common approximation is to evaluate the power spectrum multipoles at the effective wavemodes, defined as

$$\begin{eqnarray} k_{i,\rm eff} \equiv \frac{1}{N_k} \sum _{|\boldsymbol{k}| \in [k_i - \Delta k/2, k_i + \Delta k/2]} |\boldsymbol{k}|\, , \end{eqnarray}$$

(B2)

which can partially account for the finite bin width. In Fig. B1, we compare equation (B1) (circles) against the power spectrum multipoles evaluated at the effective wavemodes (lines) for a Fourier grid with fundamental frequency, k_f = 2|$\pi$|/L, corresponding to a box size of |$L = 1500\, h^{-1}\, \mathrm{Mpc}$| and bin width Δk = k_f. While the discreteness effect is barely noticeable for the monopole, it already gives rise to per cent level differences in the quadrupole, and even more significant effects in the hexadecapole. This is particularly the case for bins which are close to the fundamental frequency, where there are consequently only a small number of wavevectors per spherical shell, but for the hexadecapole differences up to |$\sim 10\, {{\ \rm per\ cent}}$| persist up to |$k \sim 100\, k_f$|⁠.

$Comparison of the power spectrum multipole predictions evaluated at the effective wavemodes (lines) and averaged over discrete set of k and μ values equation (B1) (circles). The bin width was assumed to be kf, with $k_f \approx 0.0042 h\, \mathrm{Mpc}^{-1}$.$

Figure B1.

Comparison of the power spectrum multipole predictions evaluated at the effective wavemodes (lines) and averaged over discrete set of k and μ values equation (B1) (circles). The bin width was assumed to be k_f, with |$k_f \approx 0.0042 h\, \mathrm{Mpc}^{-1}$|⁠.

We have implemented the exact bin average for arbitrary bin widths (with linear spacing) in COMET by computing equation (B1) over the anisotropic power spectrum reconstructed from the emulated quantities (see Section 3.3.1). We have verified that the reconstruction of the anisotropic power spectrum from a finite number of multipoles does not introduce any appreciable inaccuracies when performing the bin average, as was also the case without bin average (see Section 4.4). In order to speed up the computation of the binned predictions, we first find all discrete values of k and μ for each bin and then perform the following rounding operations

$$\begin{eqnarray} k &\approx \left\lfloor 10\frac{k}{\Delta k}\right\rceil \, \frac{\Delta k}{10}\, , \end{eqnarray}$$

(B3)

$$\begin{eqnarray} \mu &\approx 10^{-3}\, \left\lfloor 10^3\mu \right\rceil \, , \end{eqnarray}$$

(B4)

where ⌊x⌉ denotes the nearest integer. Finally, we determine all unique combinations of k and μ, as well as the number of times they appear, so that we can evaluate equation (B1) only by averaging over those unique combinations, using their respective number of occurrence as weights. While this approximation only leads to negligible inaccuracies, it reduces the computation time and means that in the above example we can evaluate the three discretely averaged multipoles up to |$k \sim 60\, k_f$| without any additional computational cost compared to the standard evaluation. That means the discrete average can be straightforwardly included also in any likelihood analysis.

APPENDIX C: STATISTICS OF EMULATION INACCURACIES AT VARIOUS REDSHIFTS

For the sake of completeness, in Fig. C1 we show the cumulative histograms of the maximum absolute differences between the emulator and exact model for all four redshifts of our synthetic data sets described in Section 4.1.1. As in Section 4.2.1, the maximum differences for each multipole are obtained over a range of scales between |$0.001\, h\, \mathrm{Mpc}^{-1}$| and |$0.3\, h\, \mathrm{Mpc}^{-1}$| and overall the plot demonstrates that we get qualitatively a very similar behaviour for the validation samples at higher redshifts as for the one at z = 0.9 that was presented in the main text. When expressing the maximum differences in terms of the standard deviation we have adopted for our synthetic measurements, we still find the most significant discrepancies in the monopole. The fact that these decrease, however, with increasing redshift is due to the larger relative errors, σ_ℓ(k)/P_ℓ(k), at higher redshifts, which are caused by our assumption of decreasing number densities and the resulting bigger contributions from shot noise to σ_ℓ. For the EFT model we moreover see that the smallest discrepancies always occur in the hexadecapole, while, as explained in Section 4.2.1, they can be significantly larger in case of the VDG model. At z = 0.9, they exceed those of the quadrupole and with increasing redshift they become comparable. These findings are fully compatible with our results on the shifts of mean parameter values in Section 4.2.2, where we have seen that the spread in the shifts become smaller for the higher redshift validation samples. This is, in particular, true for the counterterm parameter c₄, which was most affected by the inaccuracies in the hexadecapole at redshift z = 0.9.

$Cumulative histograms (over the full validation set) of the maximum absolute difference between the power spectrum multipoles computed in the exact model and with our emulator over a range of scales from $0.001\, h\, \mathrm{Mpc}^{-1}$ and $0.3\, h\, \mathrm{Mpc}^{-1}$. Differences are shown in units of the standard deviation of the synthetic data set described in Section 4.1.1. Each column corresponds to a different redshift, while the upper panels show results for the EFT model, lower panels for the VDG model. The dashed horizontal lines indicate $68\, {{\ \rm per\ cent}}$ and $95\, {{\ \rm per\ cent}}$ of the validation samples.$

Figure C1.

Cumulative histograms (over the full validation set) of the maximum absolute difference between the power spectrum multipoles computed in the exact model and with our emulator over a range of scales from |$0.001\, h\, \mathrm{Mpc}^{-1}$| and |$0.3\, h\, \mathrm{Mpc}^{-1}$|⁠. Differences are shown in units of the standard deviation of the synthetic data set described in Section 4.1.1. Each column corresponds to a different redshift, while the upper panels show results for the EFT model, lower panels for the VDG model. The dashed horizontal lines indicate |$68\, {{\ \rm per\ cent}}$| and |$95\, {{\ \rm per\ cent}}$| of the validation samples.

Table C1.

Fiducial values of number densities, galaxy bias and counterterm parameters, used in the generation of the synthetic data sets described in Section 4.1.2.

z	\|$10^3 \times \bar{n}$\|\|$(h^{-1}\, \mathrm{Mpc})^{-3}$\|	b₁	b₂	γ₂₁	c₀\|$(h^{-1}\, \mathrm{Mpc})^2$\|	c₂\|$(h^{-1}\, \mathrm{Mpc})^2$\|	c₄\|$(h^{-1}\, \mathrm{Mpc})^2$\|	c_nlo\|$(h^{-1}\, \mathrm{Mpc})^4$\|	\|$N^P_0$\|
0.9	2.043	1.370	−0.514	0.197	9.542	11.390	2.469	12.972	0.315
1.2	1.029	1.734	−0.193	0.354	15.521	17.888	4.057	78.367	0.046
1.5	0.585	2.024	0.443	0.239	11.937	18.168	3.745	33.842	−0.042
1.8	0.313	2.476	0.563	−0.112	15.377	13.322	2.958	−64.369	0.296

z	\|$10^3 \times \bar{n}$\|\|$(h^{-1}\, \mathrm{Mpc})^{-3}$\|	b₁	b₂	γ₂₁	c₀\|$(h^{-1}\, \mathrm{Mpc})^2$\|	c₂\|$(h^{-1}\, \mathrm{Mpc})^2$\|	c₄\|$(h^{-1}\, \mathrm{Mpc})^2$\|	c_nlo\|$(h^{-1}\, \mathrm{Mpc})^4$\|	\|$N^P_0$\|
0.9	2.043	1.370	−0.514	0.197	9.542	11.390	2.469	12.972	0.315
1.2	1.029	1.734	−0.193	0.354	15.521	17.888	4.057	78.367	0.046
1.5	0.585	2.024	0.443	0.239	11.937	18.168	3.745	33.842	−0.042
1.8	0.313	2.476	0.563	−0.112	15.377	13.322	2.958	−64.369	0.296

Table C1.