ABSTRACT

Photometric data can be analysed using the 3D correlation function ξp to extract cosmological information via e.g. measurement of the Baryon Acoustic Oscillations (BAO). Previous studies modeled ξp assuming a Gaussian photo-z approximation. In this work we improve the modeling by incorporating realistic photo-z distribution. We show that the position of the BAO scale in ξp is determined by the photo-z distribution and the Jacobian of the transformation. The latter diverges at the transverse scale of the separation s, and it explains why ξp traces the underlying correlation function at s, rather than s, when the photo-z uncertainty σz/(1+ z) ≳ 0.02. We also obtain the Gaussian covariance for ξp. Due to photo-z mixing, the covariance of ξp shows strong off-diagonal elements. The high correlation of the data causes some issues to the data fitting. None the less, we find that either it can be solved by suppressing the largest eigenvalues of the covariance or it is not directly related to the BAO. We test our BAO fitting pipeline using a set of mock catalogs. The data set is dedicated for Dark Energy Survey Year 3 (DES Y3) BAO analyses and includes realistic photo-z distributions. The theory template is in good agreement with mock measurement. Based on the DES Y3 mocks, ξp statistic is forecast to constrain the BAO shift parameter α to be 1.001 ± 0.023, which is well consistent with the corresponding constraint derived from the angular correlation function measurements. Thus, ξp offers a competitive alternative for the photometric data analyses.

1 INTRODUCTION

Imaging surveys infer the redshift of galaxy samples, photo-z, using broadband filters. Ongoing and future large-scale structure surveys that collect enormous amount of photo-z data include Kilo-Degree Survey (KiDS),1 Dark Energy Survey (DES),2 Hyper Suprime-Cam (HSC),3 Rubin Observatory’s Legacy Survey of Space and Time (LSST),4 Euclid,5 and the Chinese Survey Space Telescope (CSST).6 These surveys typically use several bandpasses, e.g. DES has grizY and the accuracy of the resultant photo-z is about σ ∼ 0.03(1 + z) [see e.g. Crocce et al. (2019), Carnero Rosell et al. (2021), but it can be improved if more stringent criteria are applied (Rozo et al. 2016)]. Although photometric surveys yield less precise redshift than spectroscopic ones (σ ∼ 10−3), they are more efficient to collect a large volume of survey data with deep magnitude.

Baryonic Acoustic Oscillations (BAO; Peebles & Yu 1970; Sunyaev & Zeldovich 1970) are the primordial acoustic features imprinted in the distribution of the large-scale structure. In the early universe, photons and baryons form a tightly coupled plasma, and acoustic oscillations are excited in it. After the recombination, hydrogen atoms form, the acoustic waves get stalled because there is no medium for propagation. The acoustic patterns are preserved in the distribution of galaxies and they correspond to the sound horizon at the drag epoch, which is about 150 Mpc. Formation of the BAO is governed by well-understood linear physics, see Bond & Efstathiou (1984), Bond & Efstathiou (1987), Hu & Sugiyama (1996), Hu, Sugiyama & Silk (1997), and Dodelson (2003) for the details of the cosmic microwave background physics. Given its robustness, BAO has been regarded as a standard ruler in cosmology, see e.g. Weinberg et al. (2013) and Aubourg et al. (2015). BAO has been measured in numerous spectroscopic data analyses. It was first clearly detected by Eisenstein et al. (2005) and Cole et al. (2005), and followed up in many subsequent studies, e.g. Gaztanaga, Cabre & Hui (2009), Percival et al. (2010), Beutler et al. (2011), Blake et al. (2012), Anderson et al. (2012), Kazin et al. (2014), Ross et al. (2015), and Alam et al. (2017, 2021).

The sample size in terms of volume and depth from photometric surveys may compensate its imprecision in redshift information and give competitive measurements (Seo & Eisenstein 2003; Blake & Bridle 2005; Amendola, Quercellini & Giallongo 2005; Benitez et al. 2009; Zhan, Knox & Tyson 2009; Chaves-Montero, Angulo & Hernández-Monteagudo 2018). Indeed, there are several detections of the BAO feature in the galaxy distribution using photometric data from SDSS (Padmanabhan et al. 2007; Estrada, Sefusatti & Frieman 2009; Hütsi 2010; Carnero et al. 2012; Seo et al. 2012; de Simoni et al. 2013), DES [Y1 (Abbott et al. 2019) and Y3 (Abbott et al. 2022), hereafter DES Y3], and DECaLS (Sridhar et al. 2020).

Because of the photo-z uncertainty, it is natural to perform the analyses of the photometric data using angular statistics such as the angular correlation function (Coupon et al. 2012; Crocce et al. 2016; Abbott et al. 2019; Elvin-Poole et al. 2018; Vakili et al. 2020) or the angular power spectrum (Padmanabhan et al. 2007; Seo et al. 2012; Camacho et al. 2019; Nicola et al. 2020). In particular, these two statistics were adopted as the fiducial statistical tools to measure BAO in DES Y3 analyses. However, given enormous amount photo-z data available, it is worthwhile exploring new methods to enhance information extraction, especially from the BAO. Ross et al. (2017) proposed to measure the BAO using the 3D correlation function characterized by the transverse scale of the separation. This statistic has been applied to photo-z data to get interesting BAO measurements (Abbott et al. 2019; Sridhar et al. 2020). The photo-z uncertainty of the sample must be folded in the theory template. To simplify the modelling, the photo-z distribution was assumed to be Gaussian in Ross et al. (2017). The actual photo-z distribution has non-negligible deviation from Gaussian distribution, e.g. Crocce et al. (2019), Zhou et al. (2021), and Carnero Rosell et al. (2021). This assumption may risk introducing bias, and so this statistic was not adopted in the fiducial DES Y1 and Y3 analyses. In this work, we improve the modelling of the 3D clustering by incorporating general photo-z distribution and present a Gaussian covariance for it.

This paper is organized as follows. In Section 2, we first review the computation of the angular correlation and then show that the general angular correlation can be turned into the 3D correlation function ξp. We consider the Gaussian photo-z limit and use it to understand the effect of photo-z uncertainties on ξp in Section 3. In Section 4, we derive the Gaussian covariance for ξp using the Gaussian covariance for the angular correlation function. In Section 5.1, we present the numerical results on the template and covariance, and compare them with mock results whenever possible. We discuss the issue in data fitting caused by highly correlated data in Section 5.2.1, and then present the BAO fit results in Section 5.2.2. We conclude in Section 6. In Appendix  A, we test our pipeline for the case of spectroscopic data. The default cosmology adopted in this work is the MICE cosmology (Crocce et al. 2015; Fosalba et al. 2015), which is a flat ΛCDM with Ωm = 0.25, ΩΛ = 0.75, h = 0.7, and σ8 = 0.8.

2 ANGULAR AND THREE-DIMENSIONAL TWO-POINT CORRELATION

In this section, we first review the computation of the angular correlation function, w, in the presence of general photo-z uncertainty. We then use the general angular correlation function to derive the 3D two-point correlation ξp.

2.1 Angular correlation function wp

Because in imaging surveys, angles are well determined but there are typically significant redshift uncertainties, it is natural to use angular quantities. It is useful to first clarify the relation between volume number density and tomographic bin angular number density.

Suppose that there are dN objects in a volume element parametrized by the redshift interval dz and the solid angle dΩ, the volume number density n(3) is given by
(1)
The tomographic angular number density n(2) is defined as
(2)
We can go on to define the volume density contrast δ(3) and angular density contrast δ(2)
(3)
(4)
where |$\bar{n}_{(3)}$| and |$\bar{n}_{(2)}$| are their respective mean values. From equation (2), it is clear that δ(3) = δ(2).
We start with the effect of the photo-z on the galaxy number density:
(5)
where f(z|zp)dz is the probability that the true redshift of the galaxy falls within [z, z + dz] given that its photo-z is zp, and np denotes the number density in photo-z space. For photometric data, f(z|zp) is of central importance and it can be estimated by the photo-z codes using template fitting or machine learning methods [see e.g. Benitez (2000); Ilbert et al. (2006); De Vicente, Sánchez & Sevilla-Noarbe (2016); Sadeh, Abdalla & Lahav (2016)].
At the background level, we have
(6)
where |$\bar{n}_{\rm p}$| and |$\bar{n}_{(3)}$| are the angular mean number density in the photo-z and true redshift space, respectively. The perturbation is given by
(7)
and we get
(8)
Therefore, the appropriate true redshift distribution for the density contrast is
(9)
It is easy to see that ϕ(z|zp) indeed furnishes a probability density. Here ,we assume that there are sufficiently many particles in the coarse-grained scale of interest so that the photo-z effect can be modelled by the distribution ϕ. In this regime, the effect of the photo-z is to average δ(3) over z.
Using equation (9), the photo-z angular correlation function wp can be written as
(10)
with |$\theta \equiv \cos ^{-1} (\hat{\boldsymbol {r}} \cdot \hat{\boldsymbol {r}}^{\prime })$|⁠. By expressing δ(3) in terms of its Fourier modes, we have
(11)
Utilizing the variable |$\boldsymbol {s} = \boldsymbol {r} - \boldsymbol {r}^{\prime }$| and applying the Rayleigh expansion of the plane wave
(12)
we have
(13)
Note that we have only one integral involving the spherical Bessel function because we expand the plane wave for |$\boldsymbol {s}$|⁠.

Equation (13) accounts for the photo-z uncertainty exactly and it is valid for the curved sky. We only need to plug in the appropriate power spectrum |$P(\boldsymbol {k} , z,z^{\prime })$|⁠. However, the redshift-space power spectrum in the distant observer limit is often used, and so this effectively limits the results to the flat sky case. As long as the local line of sight (LOS) is used, the effect is mild, e.g. Yoo & Seljak (2015).

In general, because of the azimuthal symmetry about the LOS direction |$\hat{\boldsymbol {e}}$|⁠, the power spectrum can be expanded about the LOS direction as
(14)
where |$\mathcal {L}_\ell$| is the Legendre polynomial and P is the multipole coefficient. Parity invariance further restricts the sum to even ℓ only. Then the angular correlation can be written as
(15)
where jl is the spherical Bessel function. To speed up the computation, the j integrals are evaluated using FFTLog (Hamilton 2000).
For the linear redshift-space power spectrum with linear bias (Kaiser 1987)
(16)
where b is the linear bias, f is dln D/dln a with D being the linear growth factor, and Pm(k, 0, 0) is the matter power spectrum evaluated at z = 0, we have (Hamilton 1992; Cole, Fisher & Weinberg 1994; Crocce, Cabré & Gaztañaga 2011)
(17)
with
(18)
In DES Y3, an anisotropic BAO damping factor is applied to the BAO feature, i.e.
(19)
where Plin and Pnw denotes the linear power spectrum with and without BAO wiggles, respectively. Because Σ is a function of μ, we can no longer express the first three multipoles analytically, instead we compute them numerically. As our primary goal here is to model the BAO feature, we shall use equation (19). But we will use equations (17) and (18) to measure the bias parameters.

2.2 From w to ξp

Because of the statistical rotational invariance, the angular correlation function w(θ, zeff) for the sample at the effective redshift zeff can be parametrized by the angle θ subtended by two points on the sky at the observer. If in addition, the redshifts (or photo-z) of these two points are also specified, e.g. in the form of tomographic bins, w(θ, z, z′) furnishes a representation of the 3D correlation function. On the other hand, the 3D correlation function ξ in real space depends only on the distance between the two points, |$s = |\boldsymbol {r}_1 - \boldsymbol {r}_2|$|⁠, thanks to the statistical translational and rotational invariance. The redshift space distortion, along the LOS direction, breaks the rotational invariance in redshift space and ξ is usually parametrized as a function of s and μ, the latter is the dot product between direction of the separation, |$\hat{\boldsymbol {s}}$| and the LOS |$\hat{ \boldsymbol {e}}$|⁠, or in terms of s and s, the component of |$\boldsymbol {s}$| perpendicular and parallel to the LOS. In practice, both w and ξ can be computed by assigning density to a grid (2D pixelized grid cells for w and 3D grid cells for ξ) or by comparing pair counts between the data catalog and the random one.

Because δ(2) = δ(3), it is clear that the general cross angular correlation function and the 3D correlation are equal provided a cosmology is assumed to convert angle and redshift to distance and the redshift bin is fine enough so that the projection effect ,along the LOS direction is negligible. Depending on how it is parametrized, the correlation function goes by different names. We shall consider the 3D correlation function |$\xi _{\rm p} (s, \mu) = \langle \delta _{\rm p} (\boldsymbol {r}_1) \delta _{\rm p} (\boldsymbol {r}_2) \rangle$| and its projective version characterized by the transverse scale, |$\xi _{\rm p} (s_\perp , \mu) \equiv \langle \delta _{\rm p} (\boldsymbol {r}_1) \delta _{\rm p} (\boldsymbol {r}_2) \rangle _\parallel$|⁠, where the notation 〈…〉 means that we average over ssμ by stacking together the pairs with the same |$s_\perp \equiv s \sqrt{ 1 - \mu ^2 }$|⁠.

We obtain ξp(s, μ) by mapping from w. We first evaluate the general cross correlation between redshift bin i and j separated by angle θk bin, wijk). From its redshift bins and angular separation, s and μ for this pair can be obtained. This allows us to allocate this angular correlation into the appropriate s and μ bins. The relation between w and ξp is schematically shown on the left-hand panel of Fig. 1. Because ξp is measured by averaging over all possible grid cell pairs (or data pairs in the language of pair counting) in the survey data region, we loop over all the wijk) pairs that fit into the data region and give rise to the same s and μ bins. Explicitly, we have
(20)
where fijk denotes the weight for all the cross bin pairs wijk) falling into the s and μ bins. To approximately mimick the pair counting in the computation of ξp, we shall use the weight fijk = NiNj with Ni (Nj) being the number of particles in bin i (j). More accurate weighting can be achieved by counting pairs as in the case of random catalog, but it would be much more computationally expensive.
Left-hand panel: the relation between the angular correlation function specified by two true redshift distributions conditional on photo-z zp and $z_{\rm p}^{\prime }$, subtending an angle θ at the observer. The corresponding 3D correlation function is described by the separation vector $\boldsymbol {s}$ relative to the LOS direction $\hat{\boldsymbol {e}}$. Right-hand panel: The integral over the two distributions can be approximated by a 1D integral over Δzr or the separation s′.
Figure 1.

Left-hand panel: the relation between the angular correlation function specified by two true redshift distributions conditional on photo-z zp and |$z_{\rm p}^{\prime }$|⁠, subtending an angle θ at the observer. The corresponding 3D correlation function is described by the separation vector |$\boldsymbol {s}$| relative to the LOS direction |$\hat{\boldsymbol {e}}$|⁠. Right-hand panel: The integral over the two distributions can be approximated by a 1D integral over Δzr or the separation s′.

Here, we describe binning into s and μ bin as an example. For ξp(s, μ), the procedures are the same except that we compute s from the redshifts of the bin pairs and their angular separation, and then bin the correlation based on s and μ.

The measurement of the ξp statistics requires a fiducial cosmology input, while the tomographic bin correlation in term of angle and redshift can evade the cosmology dependence (Montanari & Durrer 2012). However, the advantage of the ξp statistic is that it can effectively summarize the information in multiple tomographic bins in a single correlation measurement, and it is easier to borrow the techniques developed for the spectroscopic data analysis.

We turn to comment on the definition of the LOS direction associated with a pair of galaxies on the sky. For the angular correlation function, a natural definition of the LOS direction is the angle bisector of the angle subtended by the pairs at the observer. In this ‘angle-centric’ definition, the dot product between LOS direction |$\hat{\boldsymbol {e}}_\theta$| and the unit separation vector |$\hat{\boldsymbol {s}}$| can be expressed as
(21)
where r1 and r2 are the distances to the galaxy pair, respectively, and s is the length of the separation vector. On the other hand, it is natural to align the LOS with the mid-point of the separation for the 3D correlation function. For this ‘separation-centric’ viewpoint, the dot product between LOS direction |$\hat{\boldsymbol {e}}_s$| and |$\hat{\boldsymbol {s}}$| can be written as
(22)
We have checked that these two definitions are essentially identical for our purpose. For example, at zeff = 0.8 and |$s=100 \, \mathrm{Mpc} \, h^{-1}$|⁠, the agreement between these two definitions are better than 10−3. Thus, we can use them interchangeably.

3 UNDERSTANDING THE IMPACT OF PHOTO-z BY A GAUSSIAN CASE STUDY

Ross et al. (2017) used a simple Gaussian setup to study the impact of photo-z uncertainty on the BAO information. The apparatus can be constructed by the following arguments. Suppose that the position vectors of two points are |$\boldsymbol {r}_1$| and |$\boldsymbol {r}_2$| and due to photo-z error, they have additional displacement |$\Delta \boldsymbol {r}_1$| and |$\Delta \boldsymbol {r}_2$|⁠. The total separation vector is
(23)
where |$\boldsymbol {s}_{\rm t \perp }$| and |$\boldsymbol {s}_{\rm t \parallel }$| denote the perpendicular and parallel components of |$\boldsymbol {r}_2 - \boldsymbol {r}_1$| w.r.t. the LOS direction without photo-z error. The photo-z error only affects the direction parallel to the LOS in the plane-parallel limit. If we assume that |$\Delta \boldsymbol {r}_1$| and |$\Delta \boldsymbol {r}_2$| are independent Gaussian random variables with zero mean and constant variance σ2, then |$\Delta \boldsymbol {r}_2 - \Delta \boldsymbol {r}_1$| is Gaussian distributed with zero mean and variance 2σ2. Thus, we can model the effects of the Gaussian photo-z error on the 3D correlation function by a 1D Gaussian distribution.

It is instructive to check under what conditions our general expression would reduce to this result. We will see that this exercise gives valuable insight into our results.

We now assume that the conditional true redshift distributions in equation (10) are Gaussian, namely
(24)
By adopting the variables
(25)
wp can be written as
(26)
After further changing to the ‘Centre-of-Mass’ variables
(27)
we have
(28)
To proceed, we neglect the Δzc dependence in w as it gives an overall shift of the pair in redshift7 and we arrive at
(29)
By mapping to s and μ, we can get the corresponding equation for ξp, i.e.
(30)
This equation invites the interpretation on the right-hand panel of Fig. 1.
Alternatively, we can use the separation distance s′ (shown on the right-hand panel of Fig. 1) as the integration variable
(31)
We can think of s′ as a function of Δzr. When Δzr increases from the negative end, s′ first decreases and it reaches the minimum value s′ = s before starting to increase. The scale s′ = s divides the integral into two parts, and they should be evaluated separately.
The inverse of the Jacobian is given by
(32)
where r(z) and H(z) respectively represent the comoving distance and Hubble rate evaluated at redshift z, c is the light speed, and z1 (z2) denotes |$z_{\rm p} - \frac{ \Delta z_{\rm r} }{ 2 }$| (⁠|$z_{\rm p}^{\prime } + \frac{ \Delta z_{\rm r} }{ 2 }$|⁠). It is important to note that this derivative vanishes when z1 = z2, which corresponds to s′ = s.

Equation (30) reveals that the photo-z correlation function is a projection of the underlying true correlation with a Gaussian weight of zero mean and variance 2σ2, in agreement with equation (6) of Ross et al. (2017). In Fig. 2, we show the wedge correlation function obtained with the Gaussian photo-z distribution. The effective redshift of the two conditional true redshift distributions centred at zp and |$z_{\rm p}^{\prime }$|⁠, respectively, is taken to be zeff = 0.8. The standard deviation of the distribution is assumed to be σ = (1 + zeffz and a suite of σz values are considered: 0.001, 0.01, 0.02, and 0.04. In this plot, for simplicity the linear real-space correlation is used. We have compared the results computed with the approximation equation (31) and by directly integrating over the 2D integral. We find that the 1D approximation is in good agreement with the exact result, but deviations become apparent for high σz.

The wedge correlation function computed with Gaussian photo-z distributions. The upper and low panels show the results for using the separation s and the transverse scale s⊥ as the independent variables, respectively. The 1D approximation equation (31; solid) and the exact 2D integral results (dashed) agree with each other well, but small deviation occurs for large value of σz. Four values of σz are displayed (left-hand to right-hand panels). Different colours indicate wedge correlation of different values of μ.
Figure 2.

The wedge correlation function computed with Gaussian photo-z distributions. The upper and low panels show the results for using the separation s and the transverse scale s as the independent variables, respectively. The 1D approximation equation (31; solid) and the exact 2D integral results (dashed) agree with each other well, but small deviation occurs for large value of σz. Four values of σz are displayed (left-hand to right-hand panels). Different colours indicate wedge correlation of different values of μ.

Interestingly, while for small σz such as 0.001, the BAO bump peaks at the true sound horizon size, ssh, it shifts to larger values as σz increases. Also for large σz, it is clear that the BAO bump moves to larger s as μ increases. In the lower panels of Fig. 2, we show ξp as a function of the transverse scale |$s_\perp = s \sqrt{ 1- \mu ^2 }$|⁠. As observed in Ross et al. (2017), when ξp is plotted against s, the BAO position appears to line up at s = ssh for σz ≳ 0.02. Thus this plot suggests that the BAO peak in the upper panel (the usual way to plot ξ) is directly related to the scale s for σz ≳ 0.02.

To shed light on these results, we show in Fig. 3 the three factors of the integrand of equation (31): the absolute value of the Jacobian |dΔzr/ds′|, the Gaussian weight PG and the underlying correlation function ξ. We shall shortly explain these terms in details. In this illustration, we take |$s(z_{\rm p},z_{\rm p}^{\prime }) = 100 \, \mathrm{Mpc} \, h^{-1}$|⁠.

The factors in the photo-z projection integral (equation 31) as a function of the separation s′. As shown on the right-hand panel of Fig. 1, as Δzr increases, s′ first decreases until it reaches the minimum value s′ = s⊥ and then it increases. Both the Gaussian weight PG (solid, colour) and absolute value of the Jacobian |dΔzr/ds′| (multiplied by a factor of 10, dotted, colour) correspond to the paths traversed as Δzr increases. The point where all the PG curves with different μ meet corresponds to $s^{\prime } = s(z_{\rm p}, z_{\rm p}^{\prime })$ (chosen to be $100 \, \mathrm{Mpc} \, h^{-1}$ here). The Jacobian diverges at s′ = s⊥ and this indicates as the vertical line in the plot. The vertical part for the Jacobian touches the PG curve as the latter turns at s′ = s⊥. The underlying correlation function (multiplied by a factor 103, solid black) is also overplotted. See the text for further explanations.
Figure 3.

The factors in the photo-z projection integral (equation 31) as a function of the separation s′. As shown on the right-hand panel of Fig. 1, as Δzr increases, s′ first decreases until it reaches the minimum value s′ = s and then it increases. Both the Gaussian weight PG (solid, colour) and absolute value of the Jacobian |dΔzr/ds′| (multiplied by a factor of 10, dotted, colour) correspond to the paths traversed as Δzr increases. The point where all the PG curves with different μ meet corresponds to |$s^{\prime } = s(z_{\rm p}, z_{\rm p}^{\prime })$| (chosen to be |$100 \, \mathrm{Mpc} \, h^{-1}$| here). The Jacobian diverges at s′ = s and this indicates as the vertical line in the plot. The vertical part for the Jacobian touches the PG curve as the latter turns at s′ = s. The underlying correlation function (multiplied by a factor 103, solid black) is also overplotted. See the text for further explanations.

For small σz such as 0.001, which is close to the spectroscopic redshift case, the Gaussian weight is narrowly distributed about the scale |$s(z_{\rm p},z_{\rm p}^{\prime })$|⁠. The distribution PG as a function of s′ is in fact mildly tilted for small μ and it becomes more symmetric when μ increases. As we mentioned, equation (32) vanishes at s′ = s, and hence the Jacobian diverges. Except for small μ, such as μ ≲ 0.1, the divergence occurs at the scale, where PG is essentially zero and hence it is irrelevant. Therefore, in this case the BAO scale in ξp is controlled by PG, and it corresponds to the true sound horizon well.

As σz increases, the PG distribution becomes more tilted. Among the cases shown, for σz ≥ 0.01, s′ is clearly not a monotonic function of Δzr, as evident from the multivalued PG curves. The scale where all the μ-curves pass through corresponds to s′ = s. The smallest s′ that each μ-curve attains corresponds to the transverse scale |$s_\perp = s\sqrt{ 1 - \mu ^2}$|⁠. The Jacobian formally diverges at s′ = s. For small μ, as s coincides with s well, the Jacobian and the Gaussian weight both peak at s′ = s, the value of the integral is essentially equal to ξ(s).

For larger μ, it depends on the value of σz. While the Jacobian is sharply peaked at s′ = s and ξ is also rapidly decreasing with s′. For σz ≲ 0.01, the Gaussian weight at s′ = s is suppressed relative to its maximum value at s. Thus, the integral is not well peaked at s′ = s. The distribution is more tilted as σz further increases and becomes almost horizontally laid down for σz ≳ 0.04. Because the Gaussian weight suppression is much more mild for σz ≳ 0.02, the integral is essentially given by ξ(s). This explains why the scale s but not s, is a good indicator of the underlying correlation function when σz ≳ 0.02.

Although we have carried out the calculations explicitly using Gaussian conditional true redshift distribution, the lessons learned are more general. The approximation that the zc dependence can be integrated out and the final results can be approximated by a 1D projection integral should be valid for well-behaved unimodal distributions (modulo small random fluctuations). In particular, the fact that the underlying correlation is reflected by s primarily stems from the divergence of the Jacobian, which is not dependent on the precise form of the distribution.

We mention that the divergent Jacobian plays a similar role as the Limber approximation (Limber 1953; LoVerde & Afshordi 2008), which is phrased as a linear relation between the spherical wavenumber ℓ and the Fourier mode k and is effective in reducing the dimension of the projection integrals. In this approximation, although only the Fourier modes transverse to the LOS are included, it works very well for reasonably large ℓ because those are the only significant contributions to the integral.

As ξp corresponds to the underlying correlation function at the transverse scale s for σz ≳ 0.02, we can stack all the modes of ξp with the same s together to increase the signal to noise. Thus in this sense, the 3D ξp characterized by s is effectively a projected correlation function. Consistent with Seo & Eisenstein (2003), Blake & Bridle (2005), ξp(s) for σz ≳ 0.02 only measures the transverse BAO and cannot be used to probe the radial BAO and hence the Hubble parameter directly.

4 GAUSSIAN COVARIANCE

In this section, after briefly reviewing the Gaussian covariance for the angular correlation function, we then employ it to derive the Gaussian covariance for ξp.

In the case of complete sky coverage, δp in equation (9) can be expressed in terms of the spherical harmonics as
(33)
We can define an estimator for angular correlation function:
(34)
Its expectation value gives (Peebles 1980)
(35)
where |$C_l^{ij}$| is the cross angular power spectrum defined as
(36)
with |$\delta _{\rm K}^{ab}$| being the Kronecker delta. Because galaxies are discrete tracers, we have included the shot noise contribution with |$\bar{ n }_i$| being the tomographic angular number density given by
(37)
Similarly, we can derive its Gaussian covariance (Crocce et al. 2011)
(38)
Poisson shot noise is assumed in this equation. To obtain the covariance for partial sky coverage, we can apply the so-called fsky approximation, i.e. the covariance scaling inversely with the fraction of sky coverage fsky (Scott, Srednicki & White 1994).

A few comments are in order. We evaluate the cross harmonic power spectra |$C_\ell ^{ij}$| using camb sources (Lewis & Challinor 2007). To account for the fact that the data vector is binned into angular bins, we need to use the bin-averaged Legendre polynomial (Salazar-Albornoz et al. 2017). In addition, as the terms are computed numerically, we need to evaluate some of the shot noise terms analytically to ensure numerical convergence (Chan et al. 2018).

Using equation (20), we can then write the covariance of ξp in terms of that of w:
(39)
This enables us to map the Gaussian covariance for w to that for ξp. Similarly, we can get the covariance for ξp(s, μ) using the corresponding weight.
It is worth discussing more about the shot noise term. The shot noise arises from the self-correlation of the discrete particles. In the Poisson model, the shot noise contribution to the correlation is given by
(40)
in which the two-point distribution is converted to a product of a one-point distribution and the Dirac delta distribution. Thus, the shot noise contribution to the 3D correlation function depends on the 3D number density |$\bar{n}_{\rm p}$|⁠. Although the true redshift distribution of the neighbouring bins may overlap with each other for photometric galaxies, there is no cross photo-z (or cross-bin in tomographic analysis) shot noise contribution because shot noise arises from self-correlation and each galaxy is assigned one and only one photo-z.

Our method approximates the 3D correlation and the shot noise by the tomographic version. For the signal, when the tomographic bin size is small compared to photo-z uncertainty, the results are convergent. For the shot noise, we approximate the Dirac delta in redshift in equation (40) by a Kronecker delta and the volume density by the tomographic one.

5 MOCK TESTS

In this section, we first test the theory template and the Gaussian covariance for ξp against mock catalogs in Section 5.1 and then test the BAO fit pipeline, as well as present the fit results in details in Section 5.2. To do so, we use the ICE-COLA mocks (Ferrero et al. 2021), which is a dedicated mock catalog for the DES Y3 BAO analyses. We briefly describe it here and refer readers to Ferrero et al. (2021) for more details.

The ICE-COLA mocks are derived from the COLA simulations which is based on the COLA method (Tassev, Zaldarriaga & Eisenstein 2013) and implemented by the ICE-COLA code (Izard, Crocce & Fosalba 2016). The COLA method combines the second-order Lagrangian perturbation theory with the particle-mesh simulation technique to ensure that the large-scale modes remain accurate when coarse simulation time steps are used. The simulation consists of 20483 particles in a cube of side length 1536 |$\, \mathrm{Mpc} \, h^{-1}$|⁠, matching to the mass resolution of MICE Grand challenge N-body simulations (Crocce et al. 2015; Fosalba et al. 2015). To generate the whole sky light-cone simulation at z ∼ 1.4, the simulation box is replicated three times in each Cartesian direction (altogether 64 copies). The replication has important consequences for the covariance measurements, and we will further comment on it later on. Mock galaxies are assigned to the ICE-COLA haloes using a hybrid Halo Occupation Distribution and Halo Abundance Matching recipe, following a similar approach in Avila et al. (2018). The mocks are calibrated with the redshift distribution and the angular clustering amplitude of the true galaxy samples. From the full-sky light-cone mocks, four maps with the DES Y3 footprint are extracted. Thus, there are 1952 mock catalogs in total, although we will only use a subset of them.

A key challenge is to assign realistic photo-z’s to the mock galaxies. To do so, the photo-z distribution is estimated using 8362 galaxies with known spectroscopic redshift from VIPERS survey (Guzzo et al. 2014). Using these galaxies, which have both the spectroscopic redshift zs and the photo-z from DES, the 2D distribution P(zp, zs) can be estimated. By sampling P(zp, zs) conditioned on the true redshift zs, the candidate mock galaxies are put to the appropriate photometric bins. A further requirement is that the mock galaxies must match the abundance of the observed photometric galaxies. Suppose there are N(zpi) galaxies in the photometric bin zpi, the most luminous N(zpi)P(zpi, zsj) galaxies from the bin (zpi, zsj) is taken as the mock sample. By construction, the mock galaxies will match the observed galaxy abundance and the conditional true redshift distribution.

As the cosmology adopted by the mock catalog is the MICE cosmology, the theory templates, i.e. the theoretical model described in Section 2.2, and the Gaussian covariance are computed in this cosmology, although we will also consider the template computed in Planck cosmology (Planck Collaboration 2020).

5.1 Template and covariance

In this section, we use the ICE-COLA mocks to test the theories developed in Sections 2 and 4. The theory calculations require the conditional true redshift distribution and the bias parameters measured from mocks. Thus we shall first present the measurement of these quantities before confronting the theories with measurements. In Appendix  A, we check the template calculation using the spectroscopic correlation function.

The spread of the conditional true redshift distribution generally increases with redshift. As ϕ depends on the number density in true redshift and the spectroscopic data is not available outside (0.6,1.1), we cannot reliably calibrate ϕ for all bins. Here, we approximate ϕ by f. The preliminary estimate suggests that the effect is small, but when the spectroscopic density is rapidly changing such as for the high redshift bins, it should be taken into account. We shall present more detailed analysis in future work. In the standard tomographic analysis, the samples are divided into a number of redshift bins and their auto (and cross) correlation functions are considered. For example, in the DES Y3 BAO analyses, the samples in the photo-z range (0.6, 1.1) are divided into five tomographic bins with bin width Δz = 0.1. For 3D correlation function, because ϕ is continuously sampled by the galaxy pair counts, we have to use fine photo-z bins in order to sample the ϕ accurately and smoothly.

To investigate the number of photo-z bin width required, we consider a number of ϕ’s measured from the ICE-COLA mocks in the photo-z range (0.6, 1.1). We divide this photo-z range into redshift bins of equal spacing. In Fig. 4, we show a sample of ϕ obtained with 5 and 50 photo-z bins. We also overplot the corresponding Gaussian approximation with the same mean and variance as the true ϕ. The Gaussian approximation is more accurate for the five-bin case as it is averaged over a larger redshift range. At high redshift, ϕ is more irregular and the Gaussian approximation is less accurate.

The conditional true redshift distributions measured from COLA mocks (filled circles) with 5 (upper panel) and 50 (lower panel) photo-z bins. The Gaussian approximation is also overplotted (solid curves). For clarity, in the case of 50 bins, only those whose bin number is divisible by 5 are shown.
Figure 4.

The conditional true redshift distributions measured from COLA mocks (filled circles) with 5 (upper panel) and 50 (lower panel) photo-z bins. The Gaussian approximation is also overplotted (solid curves). For clarity, in the case of 50 bins, only those whose bin number is divisible by 5 are shown.

The photo-z spread is often characterized by
(41)
where σp is the standard deviation of ϕ. We measure σz from ϕ for different number of photo-z bins and the results are displayed in Fig. 5. While σz is overestimated in the 5-bin case, the 20-bin results are consistent with the ones from the 50-bin case except that the 50-bin results are more noisy. This suggests that when the bin width Δz = 0.025 ≲ σz, the photo-z bin is fine enough to probe the intrinsic photo-z uncertainty. We emphasize that σz is used for sanity checks only, we shall use the full distribution in the test of the analysis.
σz (equation 41) of the conditional true redshift distribution for 5 (blue), 20 (orange), and 50 (green) photo-z bins.
Figure 5.

σz (equation 41) of the conditional true redshift distribution for 5 (blue), 20 (orange), and 50 (green) photo-z bins.

Another ingredient required is the bias parameters. We measure the linear bias parameters by fitting the auto-angular correlation (equations 17 and 18) to the mock measurements as in Chan et al. (2018) and Abbott et al. (2019). The fit is performed by fitting to the mean of mocks in the angular range of (0.5,1.5) degree. We show the bias parameters obtained with different number of tomographic bins in Fig. 6. Similar to the case of σz, the 20-bin and 50-bin cases show consistent result with the 50-bin ones showing more random fluctuations, while the five-bin results are slightly higher.

The bias parameters measured from 5 (blue), 20 (orange), and 50 (green) photo-z bins.
Figure 6.

The bias parameters measured from 5 (blue), 20 (orange), and 50 (green) photo-z bins.

We compare the theory template against the mock measurements in Fig. 7. The 3D correlation ξp is measured using the Landy-Szalay estimator (Landy & Szalay 1993) via the public code CUTE (Alonso 2012). Limited by the computing power, we only measure the correlation function from 384 ICE-COLA mocks. Judging from the size of the error bars, which are estimated by the standard error of the mean, the fidelity of the results are well sufficient for our purpose here. We estimate the correlation by considering the pairs with maximum radial separation |$120 \, \mathrm{Mpc} \, h^{-1}$| and transverse separation |$175 \, \mathrm{Mpc} \, h^{-1}$|⁠. To ensure good sampling of the projected correlation within the maximum radial separation, we only consider ξp up to μ = 0.8. We further divide the μ range (0, 0.8) into three equal spacing ‘wedges’. Both ξp versus s and s are shown.

The wedge correlation function ξp as a function of the separation s (upper panel) and the transverse separation s⊥ (lower panel). The μ range in (0, 0.8) is divided into three equal spacing bin, (0,0.27; blue), (0.27,0.53; red), and (0.53,0.8; green). The measurements (filled circles) are the mock mean and the error bars estimated by the standard error are too small to be seen. The theory templates are the solid curves.
Figure 7.

The wedge correlation function ξp as a function of the separation s (upper panel) and the transverse separation s (lower panel). The μ range in (0, 0.8) is divided into three equal spacing bin, (0,0.27; blue), (0.27,0.53; red), and (0.53,0.8; green). The measurements (filled circles) are the mock mean and the error bars estimated by the standard error are too small to be seen. The theory templates are the solid curves.

For the theory template, we first compute wij(θ) in 50 photo-z bins with angular bin width of 0.02 degree. Because the bin width in the radial direction is wide (⁠|$\Delta r \sim 20 \, \mathrm{Mpc} \, h^{-1}$|⁠), we linearly interpolate the resultant angular correlation to a larger redshift grid of 500 bins (Δz = 0.005) before binning all the pairs wijk) into the appropriate ξ(s, μ) bin. It is important, especially for the projected correlation function, to ensure that the maximum ranges in the radial and transverse directions used in template construction match those in the measurements. We find that the template obtained is of slightly lower amplitude than the mock measurement (by about 3 per cent), the amplitude deficit is probably due to sparse sampling in the radial direction. The template shown in Fig. 7 has been multiplied by an overall amplitude factor. Besides this amplitude adjustment, the theory templates are in good agreement with mock results.

In the conventional angular tomographic analysis, the BAO features manifest in different angular scales, which is less direct to see the photo-z effect on the BAO. To take advantage of the fact that the BAO information is condensed to a single bump in ξp, we plot in Fig. 8 the photometric correlation ξp against s and the spectroscopic ξ against s. We have shown the results for both MICE and Planck cosmology. The spectroscopic correlation ξ is the dark matter real space correlation with isotropic BAO damping, obtained by an inverse Fourier transform of an analog of equation (19) at zeff = 0.83. We also show the sound horizon rd in these two cases for comparison. Relative to the spectroscopic correlation, the BAO feature in ξp is less strong. Note that the BAO peak does not precisely correspond to rd, and it is on different sides relative to the peak for the spectroscopic and the photometric cases. A small bias could arise because of the subtle shape differences between these two cosmologies. For example, the broadband terms introduced to take care of the shape difference, if their contribution is rapidly changing, it may slightly shift the BAO position. This is more likely to occur in the photometric case given the features are less sharp than in the spectroscopic one.

The photometric correlation function ξp versus s⊥ (dotted line with filled circles) and the dark matter real space spectroscopic correlation ξ against s (solid) in MICE (blue) and Planck (red) cosmology, respectively. The sound horizons at the drag epoch in these cosmologies are also overplotted as dashed vertical lines (in the same colour code). To enhance the BAO feature, they have been multiplied by the appropriate distance squared.
Figure 8.

The photometric correlation function ξp versus s (dotted line with filled circles) and the dark matter real space spectroscopic correlation ξ against s (solid) in MICE (blue) and Planck (red) cosmology, respectively. The sound horizons at the drag epoch in these cosmologies are also overplotted as dashed vertical lines (in the same colour code). To enhance the BAO feature, they have been multiplied by the appropriate distance squared.

We now turn to the covariance. In Fig. 9, we show the correlation coefficient matrix for ξp
(42)
where Cij is an element of the covariance matrix. Both the results for s and s are shown. In this plot, we have used the Gaussian covariance. We have considered a single μ bin in the range of (0,0.8). These plots demonstrate that the covariance of ξp has strong off-diagonal elements, especially for s. Real space results are known to be more correlated than the Fourier space ones. We argue that the situation for ξp is exacerbated by the photo-z mixing.
The correlation coefficient for ξp. The upper panel shows the results for s and the lower one for s⊥. The results are for one μ bin in the range of (0, 0.8).
Figure 9.

The correlation coefficient for ξp. The upper panel shows the results for s and the lower one for s. The results are for one μ bin in the range of (0, 0.8).

It is instructive to contrast the results with the Gaussian covariance for the angular correlation function, which is plotted in Fig. 10. The setup of the calculations are the same as those in the fiducial DES Y3 analyses. There are five photo-z bins, and in each bin, θ ranges from 0.5 to 5 degree with angular bin width of 0.2 degree. The photo-z mixes modes in the radial direction. Because the width of the redshift bin (Δz = 0.1) is wide compared to the photo-z uncertainty σ [∼0.03(1 + z)], the photo-z mixing does not affect the covariance in the same bin much. The cross-correlation between different redshift bins is mainly caused by photo-z mixing, and the only significant correlation is between neighbouring redshift bins. Again, because the redshift bin is wide compared to σ, the mixing between the neighbouring bins is mild, and so even the neighbouring bin covariance is weak. On the other hand, as photo-z mixing scale can be as high as |$\sim 100 \, \mathrm{Mpc} \, h^{-1}$|⁠, for ξp there can be substantial mixing between the underlying correlation from different scales. Thus, the covariance for ξp can be significantly higher than that for w.

The correlation coefficient for w. The data in the redshift range (0.6,1,1) is split into five tomographic bins with bin size 0.1.
Figure 10.

The correlation coefficient for w. The data in the redshift range (0.6,1,1) is split into five tomographic bins with bin size 0.1.

As we mentioned, construction of the ICE-COLA mock catalogs involves replicating the simulation box to form a full-sky light-cone mock. The repeated structures cause the covariance to be anomalously higher than the correct one, and this prevents us from using the mock covariance as the benchmark to test our Gaussian theory covariance. See Ferrero et al. (2021) for more discussions. Thus in DES Y3, the analytic Gaussian covariance is the default choice. Although we cannot directly test the ξp covariance against mock results, the code used to compute the Gaussian covariance for w, which is the backbone of the ξp results, has been tested in Chan et al. (2018), Avila et al. (2018), and Abbott et al. (2022). Moreover, we have verified that the amplitude ratio difference between the Gaussian theory covariance and the mock covariance are at a similar level for w and ξp, albeit quantitative comparison is not possible.

5.2 BAO fit

In this section, we apply our theory template and covariance to extract the BAO scale from the mock catalogs. With the variable s, the BAO scale is almost equal to the true one, we adopt ξp as a function of s in the BAO fit. To measure the BAO scale, as in standard BAO analyses [e.g. Xu et al. (2012), Anderson et al. (2014), Ross et al. (2017)], we apply the following general model
(43)
where T denotes the theory template and B gives the freedom to adjust the amplitude. The parameter α in the argument of T is the key parameter we are after. It allows us to shift the BAO scale computed in the fiducial cosmology to match that in the data. The polynomial in 1/s is designed to absorb residual broadband effects due to imperfectness in modelling, changes in correlation function shape as a result of the differences between the fiducial cosmology and the actual one, and residual broadband systematic effects.

The default configuration for the BAO fit is as follows: the broadband terms consisting of |$\sum _i A_i / s_\perp ^i$| with i = 0, 1, and 2; the fit range |$[40,140] \, \mathrm{Mpc} \, h^{-1}$|⁠; the bin width |$\Delta s = 3 \, \mathrm{Mpc} \, h^{-1}$|⁠, and the Gaussian theory covariance. We use 384 mocks for the BAO fit test. Different configurations for BAO fit are checked.

We look for the best-fitting α by the maximum likelihood estimator following the procedures in Chan et al. (2018). We first analytically fit the linear parameter Ai, and then minimize the resultant χ2 w.r.t. B subject to the constraint B > 0. At the end of the operations, we are left with the residual χ2 as a function of α. The best-fitting α corresponds to the point where the residual χ2 attains its minimum and the symmetric 1σ error bar is derived from the Δχ2 = 1 criterion. Besides, we also cross check our results with the Markov Chain Monte Carlo (MCMC) method, in which all the parameters are fitted simultaneously.

5.2.1 Highly correlated data fit

The photo-z mixing causes ξp to be strongly correlated, as evident in Fig. 9. Here, we first describe some fitting problems caused by the high correlation, namely the apparently bad fit results and the alarmingly large χ2 per degree of freedom (χ2/dof). The former issue is caused by the largest eigenvalues of the covariance and the latter is due to the small ones. We discuss a few approaches to modify the covariance, and find that the first issue can be resolved by suppressing the largest eigenvalues. For the second issue, while it is not practical to solve it by adjusting the eigenvalues, we demonstrate that the BAO feature is well-fitted by our model and the problem is not directly related to the BAO.

To begin with, we present some peculiar fit results. One of the odd examples appears in the BAO fit to the mean of the mocks, which are shown in Fig. 11. When the MICE cosmology template is used, the resultant fit is very good and it is not surprising that the χ2/dof is very small (0.009). However, when the template computed in Planck cosmology (Planck Collaboration 2020) is used, we find that the resultant χ2/dof is still small (0.11), but the fit is manifestly poor by eye because the best-fitting model is below all the data points and most of the data points are more than 1σ away from the best fit.

The BAO fit results to the mean of the mock measurements (grey data points with error bars). The fit obtained with the MICE template without broadband terms (solid, red) is compared with the Planck template without broadband terms result (solid, green). The latter case is below all the data points. With the inclusion of the broadband terms, the Planck template (dashed, yellow) also offers good fit. The results for the modified covariance are also shown [No broadband terms (solid, green) and with broadband terms (dashed, violet)].
Figure 11.

The BAO fit results to the mean of the mock measurements (grey data points with error bars). The fit obtained with the MICE template without broadband terms (solid, red) is compared with the Planck template without broadband terms result (solid, green). The latter case is below all the data points. With the inclusion of the broadband terms, the Planck template (dashed, yellow) also offers good fit. The results for the modified covariance are also shown [No broadband terms (solid, green) and with broadband terms (dashed, violet)].

Although this issue only appears when the model equation (43) does not include the broadband terms at all and disappears otherwise, it does indicate that something abnormal happens to the fit.8 In the other extreme, when we fit to the individual mocks, we find that the fit results are often fine by eye as the best-fitting model fall within all the 1σ errors, but χ2/dof is alarmingly large, some of them reach as high as 2. Note that the second issue we describe here appears for generic fit configurations, in particular for both Gaussian and mock covariance fit. We point out that the second problem is already implicit in Abbott et al. (2019) and Sridhar et al. (2020), as their ξp fit results look decent by eye but appreciably large χ2/dof were reported. We attribute these problems to the high correlation of the data induced by the photo-z mixing.

Although the covariance of ξp has strong off-diagonal elements, its inverse, the precision matrix Ψ could be approximated by a band matrix with a few non-vanishing elements near the diagonal. The upper panel of Fig. 12 shows the elements of Ψ, and it is approximately a pentadiagonal band matrix, i.e. a band matrix has non-zero elements only for the main diagonal, and the first two upper and two lower diagonals. We can use this to intuitively understand why the best fit may not be driven to the data points as close as possible. By assuming Ψ to be pentadiagonal, the χ2 can be written as
(44)
where Δ denotes the difference between the model and the data. This expression can accommodate the results near the ‘boundaries’ as well if Ψi, j ≡ 0 whenever |$i, \, j \lt 1$| or >N. Now suppose Δj > 0, because Ψj, j > 0, to minimize χ2, Δj is attracted to zero. For diagonal covariance, this is the sole contribution. However, because Ψj − 2, Ψj − 1, Ψj + 1, and Ψj + 2 are negative, it is favourable for Δj − 2, Δj − 1, Δj + 1, and Δj + 2 to be repelled to as negative as possible. Of course these expressions are coupled, more careful analysis is required, but it is likely that the competition between the diagonal terms and the off-diagonal ones lead to a best fit that does not precisely fall on the data.
Upper panel: the elements of the precision matrix Ψij plotted against j for fixed i. The elements for the original Ψ (triangles) and the one obtained by modifying the covariance matrix (circles, see the text for details) are compared. Lower panel: the differences between the modified Ψij and the original one are displayed. The effect of the modified covariance is mainly to shift Ψ to the positive side. As the large eigenvalue modes correspond to eigenvectors of long ‘wavelength’, it also weakly affects the zones far from the diagonal.
Figure 12.

Upper panel: the elements of the precision matrix Ψij plotted against j for fixed i. The elements for the original Ψ (triangles) and the one obtained by modifying the covariance matrix (circles, see the text for details) are compared. Lower panel: the differences between the modified Ψij and the original one are displayed. The effect of the modified covariance is mainly to shift Ψ to the positive side. As the large eigenvalue modes correspond to eigenvectors of long ‘wavelength’, it also weakly affects the zones far from the diagonal.

To look into the covariance, we plot its eigenvalues in Fig. 13. We find that the spectrum spans three orders of magnitude, and most of them are of relatively small values. As a comparison we also show the diagonal elements of C, its range is much more limited. For the fit to the mean of mocks using the Planck template, we find that the fit is dominated by the largest eigenvalue mode. Because of its (excessively) large eigenvalue, its contribution to the χ2 is very small, so that even the fit is visually bad, χ2/dof is still small. On the other hand, for the fit to individual mocks, the large χ2/dof values are caused by significant contributions from modes with small eigenvalues.

The eigenvalue spectrum of the covariance matrix (orange) and the diagonal elements of covariance matrix (blue). While Cii is in its natural order, the eigenvalues have been arranged in descending order.
Figure 13.

The eigenvalue spectrum of the covariance matrix (orange) and the diagonal elements of covariance matrix (blue). While Cii is in its natural order, the eigenvalues have been arranged in descending order.

The problem of fitting highly correlated data is also encountered in the lattice field theory because lattice simulation data are highly correlated. Yoon et al. (2013) surveyed a few methods proposed by the lattice field theory community to tackle this problem. Michael (1994) advocated ignoring the off-diagonal elements in the covariance, but this is clearly a too violent modification. A more mild method is to remove the eigenmodes deemed problematic. Both removing the largest eigenvalue modes (Thacker & Lepage 1991) and the smallest ones (Drummond & Horgan 1993; Kilcup 1994) have been proposed. For the largest eigenvalue modes removal, it was argued that these modes have small contribution to χ2. The smallest eigenvalue modes removal is often favoured in the spirit of the singular value decomposition (Press et al. 2007). A physical argument in support of it is that those modes are generally highly oscillatory and so they are not conducive to getting a smooth fit. In a similar vein, Bernard et al. (2002) proposed to modify instead the correlation coefficient equation (42). They replace the eigenvalues of ρ smaller than certain threshold λth by λth. A new covariance matrix can be constructed using the original eigenvectors and the modified eigenvalues. Modifying the eigenmodes is more attractive as it keeps all the directions in the covariance space constrained.

We test the results obtained with different covariance recipes. While for most of the fit, different approaches give similar results, in a small fraction (order of 10 per cent) the differences are quite pronounced. In Fig. 14, we show the fit results from one of the mocks, in which the differences are clearly visible. The one obtained with the original covariance (blue) is manifestly not optimal. The best fit is 1.006 ± 0.017 in this case. In the other extreme, when the diagonal approximation (red) is used, it is not surprising that the best fit is driven close to data points, but the best fit we get is 1.002 ± 0.032, with the error bar substantially larger than the original one. By suppressing the largest eigenvalues of the covariance (green) or ρ (yellow), we can also get an apparently better fit. We replace the eigenvalues larger than certain thereshold λth by λth. We take λth to be 3 × 10−8 and 1 for the covariance and correlation coefficient modification respectively. The thresholds are chosen so that as few eigenvalues are affected as possible and the affected ones account for about 10 per cent of them. The best fits we get are 1.003 ± 0.016 and 1.005 ± 0.017, respectively.

The best fit to the ξp measurement from one of the mocks (circles, the error bars are 1σ error from the diagonal elements of the covariance). This mock is chosen because the differences among different covariance recipes are pronounced. The results obtained with the original covariance matrix (blue), diagonal approximation (red), suppressing the largest eigenvalues in the covariance (green), and suppressing the largest eigenvalues in the correlation coefficient (yellow) are compared. The raw template (black, dashed) is also plotted.
Figure 14.

The best fit to the ξp measurement from one of the mocks (circles, the error bars are 1σ error from the diagonal elements of the covariance). This mock is chosen because the differences among different covariance recipes are pronounced. The results obtained with the original covariance matrix (blue), diagonal approximation (red), suppressing the largest eigenvalues in the covariance (green), and suppressing the largest eigenvalues in the correlation coefficient (yellow) are compared. The raw template (black, dashed) is also plotted.

In Fig. 12, we also show the elements of Ψ obtained by suppressing the largest eigenvalues of ρ. Overall, the size of the modification is small, of the order 10−3. The main effect of the covariance modification is to shift Ψ to the positive side so that the attraction to zero is enhanced and the repulsion to negative infinity is weakened. Note that the large eigenvalue modes correspond to eigenvectors that are oscillatory in nature with long ‘wavelength’, while the small eigenvalue ones tend to be well localized. Thus the modification also weakly affects the the zone far from the diagonal. Furthermore, we find that the ρ modification recipe often results in better agreement with the original one compared to the covariance modification, hence we will adopt ρ modification in the following analysis. In Fig. 11, we also show the results obtained with ρ modification. Without the broadband terms, the best fit now passes through the data points, although the BAO position appears to be biased. With the default broadband terms (Ai with i = 0, 1, 2, and 3), the bias is removed.

On the other hand, the large χ2/dof problem is caused by the small eigenvalues. The nominal χ2/dof for the example shown in Fig. 14 are similar for different recipes, around 2.7, except for the diagonal approximation, which gives χ2/dof ∼ 0.06. Since only the largest eigenvalues are modified, the χ2 is only increased slightly compared to the original one. As it is clear from the spectrum in Fig. 13, most of the eigenvalues are small and of similar values, we need to remove or modify a large fraction of eigenmodes before the effect becomes noticeable. Removing 40 per cent of the eigenmodes with the smallest eigenvalues, the naive χ2/dof can be reduced to roughly 1. This represents a very substantial modification of the covariance matrix. Moreover, after eigenmodes removal or modification, there is no proper theoretical basis for χ2/dof anymore (Bernard et al. 2002). Thus, we shall not pursue to reduce χ2/dof by adjusting the covariance matrix.

Using the example in Fig. 14 again, we check directly if the BAO scale is well fitted by the model in Fig. 15. To decorrelate the data, we rotate the data vector by the matrix U, which is the orthogonal matrix that diagonalizes the covariance, i.e. C = UTΛU. To highlight the BAO feature in the rotated coordinates, the no-BAO template has been subtracted from both the data vector and the best-fitting model. Note that as the data vector is mixed by U, they no longer strictly correspond to the original distance r. Here, we only use it as a convenient label for the x-axis. None the less, from the difference with the no-BAO template, the location of the BAO feature can be inferred. In this basis, the BAO feature shows up as a series of wiggles in |$r \gtrsim 110 \, \mathrm{Mpc} \, h^{-1}$| instead of a single peak in the usual configuration representation. The error bars on the data are given by the eigenvalues in Λ and they are not correlated in this basis. We have also shown error bars from the modified covariance according to the ρ modification recipe. We see that the modification reduces the error bars around the BAO scale. To highlight the source of large χ2/dof, we show the ratio with respect to the error bar in the lower panel of Fig. 15. Both the results obtained with the original error bars and the modified ones are shown, which coincide with each other except for |$r \gt 130 \, \mathrm{Mpc} \, h^{-1}$|⁠. Significant deviations of the best-fitting model from the data points occur for |$r\lesssim 100 \, \mathrm{Mpc} \, h^{-1}$|⁠, while in the BAO zone, the model and the data agree with each other well for both types of error bars. Thus, we conclude that the large χ2/dof is caused by the part of the fit that is not directly related to the BAO.

Upper panel: the data (filled circles) and the best-fit model (solid blue) shown in the rotated basis. The orthogonal matrix U decorrelates the correlation in the error. To highlight the BAO feature, the no-BAO template has been subtracted. In this basis, the BAO feature manifests as a series of wiggles for $r \gtrsim 110 \, \mathrm{Mpc} \, h^{-1}$. The uncorrelated error bars from the original covariance (black) and the modified covariance obtained from the ρ modification scheme (red) are displayed. The modified error bars are reduced compared to original ones only in the large r range. The data used in this plot is based on the same mock data used in Fig. 14. Lower panel: the results are further divided by the error bars to show the significance of the deviations between the model and the data points. The results obtained with the original error bars (black) and the modified error bars (red) overlap with each other and deviations show up only for $r\gt 130 \, \mathrm{Mpc} \, h^{-1}$. In the BAO zone, the model and the data points agree with each other well. The large χ2/dof is driven by the data points lying outside the BAO feature.
Figure 15.

Upper panel: the data (filled circles) and the best-fit model (solid blue) shown in the rotated basis. The orthogonal matrix U decorrelates the correlation in the error. To highlight the BAO feature, the no-BAO template has been subtracted. In this basis, the BAO feature manifests as a series of wiggles for |$r \gtrsim 110 \, \mathrm{Mpc} \, h^{-1}$|⁠. The uncorrelated error bars from the original covariance (black) and the modified covariance obtained from the ρ modification scheme (red) are displayed. The modified error bars are reduced compared to original ones only in the large r range. The data used in this plot is based on the same mock data used in Fig. 14. Lower panel: the results are further divided by the error bars to show the significance of the deviations between the model and the data points. The results obtained with the original error bars (black) and the modified error bars (red) overlap with each other and deviations show up only for |$r\gt 130 \, \mathrm{Mpc} \, h^{-1}$|⁠. In the BAO zone, the model and the data points agree with each other well. The large χ2/dof is driven by the data points lying outside the BAO feature.

5.2.2 Fit results

In this section, we present the BAO fit results in details. We will consider various metrics to quantify the accuracy of the fit and different variation of the fit configurations. Although these are important, they can be tedious to read. The key default result is given in the first line in Table 1. In particular, the ξp constraint on α is 〈α〉 ± σstd = 1.001 ± 0.023. We have checked that the results are robust against variation of the fit configurations. The default setup uses the Gaussian covariance modified with the ρ modification prescription. We shall also contrast them with the corresponding results for the angular correlation. They are also consistent with the w results even though ξp statistic has considerably larger 〈χ2〉/dof.

Table 1.

BAO fit results on the COLA mocks with different configurations in the analysis. The default setup is as follows: the broadband terms consisting of |$\sum _i A_i / s_\perp ^i$| with i = 0, 1, and 2, the fit range |$[40,140] \, \mathrm{Mpc} \, h^{-1}$|⁠, the bin width |$\Delta s = 3 \, \mathrm{Mpc} \, h^{-1}$|⁠, and Gaussian theory covariance with eigenvalues of ρ suppressed. See the texts for details.

Case〈α〉σstdσ68〈σα|${\rm frac.\, encl.} \langle \alpha \rangle$|dnorm|$\sigma _{d_{\rm norm}}$||${\rm mean\, of\, mocks}$|〈χ2〉/dof
Default1.0010.0230.0220.02061%−0.0051.181.002 ± 0.01958.7/29 (2.03)
w fit1.0040.0240.0230.02162%−0.0241.111.004 ± 0.02193.1/89 (1.05)
|$A_i, \, i=0$|1.0010.0230.0220.01959%0.0321.191.002 ± 0.01962.9/31 (2.03)
|$A_i, \, i=0,1$|1.0010.0230.0210.02164%0.0031.121.003 ± 0.02058.6/30 (1.95)
|$A_i, \, i=0,1,2,-1$|1.0020.0230.0210.02060%−0.0051.171.002 ± 0.01956.8/28 (2.03)
Fit range |$[70,130] \, \mathrm{Mpc} \, h^{-1}$|1.0000.0270.0240.02259%0.0841.201.003 ± 0.02136.4/16 (2.28)
|$\Delta s = 10 \, \mathrm{Mpc} \, h^{-1}$|1.0010.0250.0240.02057%−0.0071.251.001 ± 0.02010.2/5 (2.05)
|$\Delta s = 5 \, \mathrm{Mpc} \, h^{-1}$|1.0030.0250.0230.02059%−0.0221.171.002 ± 0.02032.1/15 (2.14)
|$\Delta s = 2 \, \mathrm{Mpc} \, h^{-1}$|1.0020.0240.0220.02060%0.0081.121.002 ± 0.01981.0/45 (1.80)
|$\Delta s = 1 \, \mathrm{Mpc} \, h^{-1}$|1.0010.0250.0230.01957%0.0191.371.002 ± 0.018143.1/95 (1.51)
Planck template0.9580.0220.0200.01965%−0.0031.140.957 ± 0.01853.7/29 (1.85)
Original Gaussian cov.1.0020.0230.0220.02164%−0.0051.131.003 ± 0.02057.7/29 (2.00)
COLA cov1.0010.0230.0220.02165%−0.0091.161.002 ± 0.02147.9/29 (1.65)
MCMC fit1.0010.0240.0230.02059%−0.0041.181.003 ± 0.020
Nμ = 30.9990.0280.0250.01747%−0.0241.600.999 ± 0.017139.0/89 (1.56)
Nz = 21.0000.0230.0220.01859%−0.0141.241.001 ± 0.01898.1 /59 (1.66)
Case〈α〉σstdσ68〈σα|${\rm frac.\, encl.} \langle \alpha \rangle$|dnorm|$\sigma _{d_{\rm norm}}$||${\rm mean\, of\, mocks}$|〈χ2〉/dof
Default1.0010.0230.0220.02061%−0.0051.181.002 ± 0.01958.7/29 (2.03)
w fit1.0040.0240.0230.02162%−0.0241.111.004 ± 0.02193.1/89 (1.05)
|$A_i, \, i=0$|1.0010.0230.0220.01959%0.0321.191.002 ± 0.01962.9/31 (2.03)
|$A_i, \, i=0,1$|1.0010.0230.0210.02164%0.0031.121.003 ± 0.02058.6/30 (1.95)
|$A_i, \, i=0,1,2,-1$|1.0020.0230.0210.02060%−0.0051.171.002 ± 0.01956.8/28 (2.03)
Fit range |$[70,130] \, \mathrm{Mpc} \, h^{-1}$|1.0000.0270.0240.02259%0.0841.201.003 ± 0.02136.4/16 (2.28)
|$\Delta s = 10 \, \mathrm{Mpc} \, h^{-1}$|1.0010.0250.0240.02057%−0.0071.251.001 ± 0.02010.2/5 (2.05)
|$\Delta s = 5 \, \mathrm{Mpc} \, h^{-1}$|1.0030.0250.0230.02059%−0.0221.171.002 ± 0.02032.1/15 (2.14)
|$\Delta s = 2 \, \mathrm{Mpc} \, h^{-1}$|1.0020.0240.0220.02060%0.0081.121.002 ± 0.01981.0/45 (1.80)
|$\Delta s = 1 \, \mathrm{Mpc} \, h^{-1}$|1.0010.0250.0230.01957%0.0191.371.002 ± 0.018143.1/95 (1.51)
Planck template0.9580.0220.0200.01965%−0.0031.140.957 ± 0.01853.7/29 (1.85)
Original Gaussian cov.1.0020.0230.0220.02164%−0.0051.131.003 ± 0.02057.7/29 (2.00)
COLA cov1.0010.0230.0220.02165%−0.0091.161.002 ± 0.02147.9/29 (1.65)
MCMC fit1.0010.0240.0230.02059%−0.0041.181.003 ± 0.020
Nμ = 30.9990.0280.0250.01747%−0.0241.600.999 ± 0.017139.0/89 (1.56)
Nz = 21.0000.0230.0220.01859%−0.0141.241.001 ± 0.01898.1 /59 (1.66)
Table 1.

BAO fit results on the COLA mocks with different configurations in the analysis. The default setup is as follows: the broadband terms consisting of |$\sum _i A_i / s_\perp ^i$| with i = 0, 1, and 2, the fit range |$[40,140] \, \mathrm{Mpc} \, h^{-1}$|⁠, the bin width |$\Delta s = 3 \, \mathrm{Mpc} \, h^{-1}$|⁠, and Gaussian theory covariance with eigenvalues of ρ suppressed. See the texts for details.

Case〈α〉σstdσ68〈σα|${\rm frac.\, encl.} \langle \alpha \rangle$|dnorm|$\sigma _{d_{\rm norm}}$||${\rm mean\, of\, mocks}$|〈χ2〉/dof
Default1.0010.0230.0220.02061%−0.0051.181.002 ± 0.01958.7/29 (2.03)
w fit1.0040.0240.0230.02162%−0.0241.111.004 ± 0.02193.1/89 (1.05)
|$A_i, \, i=0$|1.0010.0230.0220.01959%0.0321.191.002 ± 0.01962.9/31 (2.03)
|$A_i, \, i=0,1$|1.0010.0230.0210.02164%0.0031.121.003 ± 0.02058.6/30 (1.95)
|$A_i, \, i=0,1,2,-1$|1.0020.0230.0210.02060%−0.0051.171.002 ± 0.01956.8/28 (2.03)
Fit range |$[70,130] \, \mathrm{Mpc} \, h^{-1}$|1.0000.0270.0240.02259%0.0841.201.003 ± 0.02136.4/16 (2.28)
|$\Delta s = 10 \, \mathrm{Mpc} \, h^{-1}$|1.0010.0250.0240.02057%−0.0071.251.001 ± 0.02010.2/5 (2.05)
|$\Delta s = 5 \, \mathrm{Mpc} \, h^{-1}$|1.0030.0250.0230.02059%−0.0221.171.002 ± 0.02032.1/15 (2.14)
|$\Delta s = 2 \, \mathrm{Mpc} \, h^{-1}$|1.0020.0240.0220.02060%0.0081.121.002 ± 0.01981.0/45 (1.80)
|$\Delta s = 1 \, \mathrm{Mpc} \, h^{-1}$|1.0010.0250.0230.01957%0.0191.371.002 ± 0.018143.1/95 (1.51)
Planck template0.9580.0220.0200.01965%−0.0031.140.957 ± 0.01853.7/29 (1.85)
Original Gaussian cov.1.0020.0230.0220.02164%−0.0051.131.003 ± 0.02057.7/29 (2.00)
COLA cov1.0010.0230.0220.02165%−0.0091.161.002 ± 0.02147.9/29 (1.65)
MCMC fit1.0010.0240.0230.02059%−0.0041.181.003 ± 0.020
Nμ = 30.9990.0280.0250.01747%−0.0241.600.999 ± 0.017139.0/89 (1.56)
Nz = 21.0000.0230.0220.01859%−0.0141.241.001 ± 0.01898.1 /59 (1.66)
Case〈α〉σstdσ68〈σα|${\rm frac.\, encl.} \langle \alpha \rangle$|dnorm|$\sigma _{d_{\rm norm}}$||${\rm mean\, of\, mocks}$|〈χ2〉/dof
Default1.0010.0230.0220.02061%−0.0051.181.002 ± 0.01958.7/29 (2.03)
w fit1.0040.0240.0230.02162%−0.0241.111.004 ± 0.02193.1/89 (1.05)
|$A_i, \, i=0$|1.0010.0230.0220.01959%0.0321.191.002 ± 0.01962.9/31 (2.03)
|$A_i, \, i=0,1$|1.0010.0230.0210.02164%0.0031.121.003 ± 0.02058.6/30 (1.95)
|$A_i, \, i=0,1,2,-1$|1.0020.0230.0210.02060%−0.0051.171.002 ± 0.01956.8/28 (2.03)
Fit range |$[70,130] \, \mathrm{Mpc} \, h^{-1}$|1.0000.0270.0240.02259%0.0841.201.003 ± 0.02136.4/16 (2.28)
|$\Delta s = 10 \, \mathrm{Mpc} \, h^{-1}$|1.0010.0250.0240.02057%−0.0071.251.001 ± 0.02010.2/5 (2.05)
|$\Delta s = 5 \, \mathrm{Mpc} \, h^{-1}$|1.0030.0250.0230.02059%−0.0221.171.002 ± 0.02032.1/15 (2.14)
|$\Delta s = 2 \, \mathrm{Mpc} \, h^{-1}$|1.0020.0240.0220.02060%0.0081.121.002 ± 0.01981.0/45 (1.80)
|$\Delta s = 1 \, \mathrm{Mpc} \, h^{-1}$|1.0010.0250.0230.01957%0.0191.371.002 ± 0.018143.1/95 (1.51)
Planck template0.9580.0220.0200.01965%−0.0031.140.957 ± 0.01853.7/29 (1.85)
Original Gaussian cov.1.0020.0230.0220.02164%−0.0051.131.003 ± 0.02057.7/29 (2.00)
COLA cov1.0010.0230.0220.02165%−0.0091.161.002 ± 0.02147.9/29 (1.65)
MCMC fit1.0010.0240.0230.02059%−0.0041.181.003 ± 0.020
Nμ = 30.9990.0280.0250.01747%−0.0241.600.999 ± 0.017139.0/89 (1.56)
Nz = 21.0000.0230.0220.01859%−0.0141.241.001 ± 0.01898.1 /59 (1.66)

The default case results for ξp fit are shown in the first row in Table 1. As a reference, we also show the corresponding results for the angular correlation function, which are obtained from a joint five-tomographic fit. See table II in Abbott et al. (2022) for more details on the w fit. The mean of the best-fitting 〈α〉 is 1.001, with small bias which can be attributed to nonlinear evolution (Crocce & Scoccimarro 2008; Padmanabhan & White 2009). Two measures of the spread of the distribution of the best-fitting α are shown: the standard deviation σstd and the half of the width between 16 and 84 percentile of the distribution, σ68, which is less sensitive to the tails of the distribution. We find that σstd is larger than σ68 by a small amount (0.001), suggesting the influence by the tails. As mentioned, the 1σ error bar is derived from the Δχ2 = 1 condition, and for the error bars to be meaningful, 〈σα〉 should agree with the measures of the spread of the distribution. In the case of Gaussian likelihood, they are equal to each other. We find that the mean of the error bar is slightly smaller than the spread of the distribution. These numbers are marginally better than those for w. The bias in the best-fitting 〈α〉 is smaller (1.004 for w) and the measures of spread and error are smaller by ∼0.001. These (mild) improvements could arise from the fact that the ξp statistic makes use of the cross-correlation in the data as well, not only the tomographic bin auto-correlation for w as in the study of DES Y3.

Another indicator for the accuracy of 1σ error bar is the fraction of times that it encloses 〈α〉, which is |$68 {{\ \rm per\ cent}}$| in the case of Gaussian likelihood. In line with the error bar being slightly smaller than the spread, the fraction enclosing 〈α〉 is slightly lower than the Gaussian expectation. The normalized deviation, dnorm ≡ (α − 〈α〉)/σα is designed to study the correlation between the deviation of the best-fitting α from the ensemble mean and the error bar derived. The mean 〈dnorm〉 is close to zero, but the standard deviation |$\sigma _{d_{\text{norm }}}$| exceeds unity by |$18 {{\ \rm per\ cent}}$|⁠. This trend is consistent with the previous observation that the spread of the distribution is slightly larger than the error estimate. In contrast, for w, the fraction of time enclosing 〈α〉 is 62 per cent, and 〈dnorm〉 = 0.02 and |$\sigma _{d_{\rm norm} } = 1.12$|⁠. Both suggest that ξp results are less Gaussian than the w ones.

As a reference, we also show the fit to the mean measurement of the mocks. To show the results for a ‘typical’ fit, we have used the covariance corresponding to a single survey volume. The results are similar to 〈α〉 ± 〈σα〉. However, we note that for w these different measurement are almost always equal to each other, suggesting that the averaging and fitting operations are more commutative for w.

Finally, the |$\langle \chi ^2 \rangle / {\rm dof} \, (\sim 2)$| is also shown for reference, but we should bear in mind the complications mentioned in the previous section. On the other hand, for w, 〈χ2〉/dof is 1.05. We attribute this to the fact that the covariance for w is much less correlated.

A number of tests on the variation of the fitting configurations have been performed. We have tested the number of the broadband terms |$\sum _i A_i / s_\perp ^i$| with i = 0, i = 0, 1, and i = 0, 1, 2, −1. Besides minor fluctuations, the fit results are not sensitive to the number of broadband terms used. This is because the broadband terms are not degenerate with α, but they are degenerate among themselves. When a narrower fitting range |$[70,130] \, \, \mathrm{Mpc} \, h^{-1}$| is used, the constraining power is reduced and so the spread of the distribution is enlarged, especially for σstd. As the error bar σα increases only mildly, the difference between the error bar and the spread of the distribution increases.

The bin width size Δs = 10, 5, 2, and 1 |$\, \mathrm{Mpc} \, h^{-1}$| are compared with the fiducial setting |$\Delta s = 3 \, \, \mathrm{Mpc} \, h^{-1}$|⁠. Among the cases shown, σstd for |$\Delta s = 3 \, \, \mathrm{Mpc} \, h^{-1}$| is the lowest and it is also more consistent with σ68 and 〈σα〉. Note that χ2/dof are smaller for Δs = 2 and 1 |$\, \mathrm{Mpc} \, h^{-1}$|⁠, but this is not vital here as we already mentioned that the BAO region is well-fitted by our model, the reduction in χ2/dof is primarily due to the region lying outside the BAO.

To test if the expected shift in α can be recovered when the cosmology of the template is different from that of the mocks, we generate a template computed with the Planck cosmology (Planck Collaboration 2020). We can estimate the expected α by
(45)
where rd is the sound horizon at the drag epoch and DM is the comoving angular diameter distance evaluated at the effective redshift  = 0.835 following DES Y3. For MICE cosmology, |$r_{\mathrm{d}}=153.4 \, \mathrm{Mpc}$| and |$D_{\mathrm{M}}\left(z_{\text{eff }}\right) = 2959.7 \, \mathrm{Mpc}$|⁠, while for Planck cosmology |$r_{\mathrm{d}}=147.6 \, \mathrm{Mpc}$| and |$D_{\mathrm{M}}\left(z_{\text{eff }}\right)=2967.0 \, \mathrm{Mpc}$|⁠.

Unlike other tests, when an alternative fiducial cosmology is assumed, the measurement of ξp should be performed again as the distance has to be computed in the new cosmology. Furthermore, we mentioned that the Hubble parameter h is contained in the unit |$\, \mathrm{Mpc} \, h^{-1}$| and the distance does not explicitly depend on it.

In Table 2, we compare the values of the α parameter obtained by the simple estimate using equation (45) and the mock measurement results. In contrast, we also show similar results for w from Abbott et al. (2022). Interestingly, we find that the ξp results are closer to the simple estimate than the ones from w. The deviation from the simple estimate could stem from the nonlinear correction and other systematic effects, and they affect the results from both templates in a similar manner.

Table 2.

Comparison of the α values obtained with the Planck and the MICE templates in fitting to the MICE cosmology data. The simple estimates using equation (45) are shown in the first row, and the mock measurement in the second. As a reference, the mock measurements for w from Abbott et al. (2022) are also shown in the last row.

PlanckMICE
Simple estimate0.9591
ξp mock results0.9581.001
w mock results0.9661.004
PlanckMICE
Simple estimate0.9591
ξp mock results0.9581.001
w mock results0.9661.004
Table 2.

Comparison of the α values obtained with the Planck and the MICE templates in fitting to the MICE cosmology data. The simple estimates using equation (45) are shown in the first row, and the mock measurement in the second. As a reference, the mock measurements for w from Abbott et al. (2022) are also shown in the last row.

PlanckMICE
Simple estimate0.9591
ξp mock results0.9581.001
w mock results0.9661.004
PlanckMICE
Simple estimate0.9591
ξp mock results0.9581.001
w mock results0.9661.004

We have shown the results obtained with the original Gaussian covariance. The ρ modification prescription only affects a small fraction of the mocks and the results are statistically similar to the original Gaussian case. Moreover, although the COLA covariance suffers from the overlap issue, we also show its results for reference. Despite the overlap issue, the results, especially the measures of the spread and the error bar, are similar to the default ones. Because we are fitting the mocks with the same covariance derived from it, the χ2/dof = 1.65 is smaller, but still it is much larger than 1. Overall, these suggest the consistency of our results across the covariance used.

The sequential maximum likelihood fitting is applied mostly for its efficiency and convenience. We have checked our results using MCMC, implemented by emcee (Foreman-Mackey et al. 2013), in which all the parameters are fit simultaneously. Similar to the conclusion for w in Chan et al. (2018), we find that the MCMC fit gives consistently similar results for ξp.

Furthermore, we have considered splitting the data in the μ range (0,0.8) into three equal-μ bins and dividing the redshift range (0.6,1.1) into two sub-redshift bins: (0.6, 0.85) and (0.85, 1.1). We have also included cross-μ bin or cross sub-redshift bin covariances. In both cases, we find that the discrepancy between σstd and 〈σα〉 increases. The issue is more serious for Nμ = 3 case, and these invite the interpretation that the discrepancy increases as the number of bins increases. We also see similar trend in the smallest Δs bin case. These could be attributed to the high covariance of the data as the correlated data are more susceptible to random fluctuations. By dividing the data into more bins, there are larger random fluctuations. On one hand, there are larger fluctuations in the best fit, resulting in an increase in the spread of the distribution. On the other hand, the error bar derived from the local curvature of the likelihood function could be underestimated as the likelihood is less smooth and the curvature around the minimum is increased.

In Table 1, we already compared the ξp results in the photo-z range (0.6,1.1) against the w results obtained using the joint fit with five tomographic bins, each of bin width Δzp = 0.1. It is instructive to compare the performance of ξp and w for individual tomographic bins, and the results are shown in Table 3. In this test, the intrinsic photo-z distribution is comparable to the bin width. Unlike the joint-bin fit, there is no cross-bin information. The difference between these statistics is that the data are further projected on to a sphere for w. Because of the limitation in computing power, only 100 mocks are used in this comparison. Overall, the results for w are more stable across redshift bins. Except for the first bin, the measures of the spread of the distribution are larger for w, and this can be interpreted as information loss due to projection. However, we note that for ξp, 〈σα〉 tends to be smaller than the measures of the spread by a larger amount. Besides, |$\sigma _{d_{\rm norm} }$| is also more significantly larger than 1. These are caused by the increase in noise as the data size reduces and are consistent with the trends for Nmu = 3 and Nz = 2 in Table 1. The χ2/dof for ξp is still substantially larger than 1 although it is smaller than the joint fit case, while for w, it remains close to 1. In summary, ξp makes use of more information and hence potentially more constraining, but it is more susceptible to noise in the data.

Table 3.

Comparison of the ξp and w BAO fit results, for which only the data in a single tomographic bin are used.

Case〈α〉σstdσ68〈σα|${\rm frac.\, encl.} \langle \alpha \rangle$|dnorm|$\sigma _{d_{\rm norm}}$||${\rm mean\, of\, mocks}$|〈χ2〉/dof
1, ξp0.9960.0500.0490.03645%0.0511.481.002 ± 0.03944.3/29 (1.53)
1, w0.9960.0490.0450.04566%−0.0261.180.995 ± 0.04917.4/17 (1.02)
2, ξp0.9970.0440.0400.03352%−0.0251.330.999 ± 0.03744.2/29 (1.53)
2, w1.0000.0490.0470.04567%−0.0531.190.999 ± 0.04916.3/17 (0.96)
3, ξp1.0010.0480.0390.03556%−0.0291.381.003 ± 0.03544.8/29 (1.54)
3, w0.9980.0500.0390.04167%−0.0421.200.997 ± 0.04317.6/17 (1.03)
4, ξp0.9920.0430.0350.03260%−0.0061.450.993 ± 0.03439.1/29 (1.35)
4, w1.0050.0500.0500.04359%−0.0671.121.003 ± 0.04317.2/17 (1.01)
5, ξp1.0090.0390.0330.03869%−0.0561.031.007 ± 0.03943.7/29 (1.51)
5, w1.0050.0450.0420.04266%−0.0271.061.006 ± 0.04718.9/17 (1.11)
Case〈α〉σstdσ68〈σα|${\rm frac.\, encl.} \langle \alpha \rangle$|dnorm|$\sigma _{d_{\rm norm}}$||${\rm mean\, of\, mocks}$|〈χ2〉/dof
1, ξp0.9960.0500.0490.03645%0.0511.481.002 ± 0.03944.3/29 (1.53)
1, w0.9960.0490.0450.04566%−0.0261.180.995 ± 0.04917.4/17 (1.02)
2, ξp0.9970.0440.0400.03352%−0.0251.330.999 ± 0.03744.2/29 (1.53)
2, w1.0000.0490.0470.04567%−0.0531.190.999 ± 0.04916.3/17 (0.96)
3, ξp1.0010.0480.0390.03556%−0.0291.381.003 ± 0.03544.8/29 (1.54)
3, w0.9980.0500.0390.04167%−0.0421.200.997 ± 0.04317.6/17 (1.03)
4, ξp0.9920.0430.0350.03260%−0.0061.450.993 ± 0.03439.1/29 (1.35)
4, w1.0050.0500.0500.04359%−0.0671.121.003 ± 0.04317.2/17 (1.01)
5, ξp1.0090.0390.0330.03869%−0.0561.031.007 ± 0.03943.7/29 (1.51)
5, w1.0050.0450.0420.04266%−0.0271.061.006 ± 0.04718.9/17 (1.11)
Table 3.

Comparison of the ξp and w BAO fit results, for which only the data in a single tomographic bin are used.

Case〈α〉σstdσ68〈σα|${\rm frac.\, encl.} \langle \alpha \rangle$|dnorm|$\sigma _{d_{\rm norm}}$||${\rm mean\, of\, mocks}$|〈χ2〉/dof
1, ξp0.9960.0500.0490.03645%0.0511.481.002 ± 0.03944.3/29 (1.53)
1, w0.9960.0490.0450.04566%−0.0261.180.995 ± 0.04917.4/17 (1.02)
2, ξp0.9970.0440.0400.03352%−0.0251.330.999 ± 0.03744.2/29 (1.53)
2, w1.0000.0490.0470.04567%−0.0531.190.999 ± 0.04916.3/17 (0.96)
3, ξp1.0010.0480.0390.03556%−0.0291.381.003 ± 0.03544.8/29 (1.54)
3, w0.9980.0500.0390.04167%−0.0421.200.997 ± 0.04317.6/17 (1.03)
4, ξp0.9920.0430.0350.03260%−0.0061.450.993 ± 0.03439.1/29 (1.35)
4, w1.0050.0500.0500.04359%−0.0671.121.003 ± 0.04317.2/17 (1.01)
5, ξp1.0090.0390.0330.03869%−0.0561.031.007 ± 0.03943.7/29 (1.51)
5, w1.0050.0450.0420.04266%−0.0271.061.006 ± 0.04718.9/17 (1.11)
Case〈α〉σstdσ68〈σα|${\rm frac.\, encl.} \langle \alpha \rangle$|dnorm|$\sigma _{d_{\rm norm}}$||${\rm mean\, of\, mocks}$|〈χ2〉/dof
1, ξp0.9960.0500.0490.03645%0.0511.481.002 ± 0.03944.3/29 (1.53)
1, w0.9960.0490.0450.04566%−0.0261.180.995 ± 0.04917.4/17 (1.02)
2, ξp0.9970.0440.0400.03352%−0.0251.330.999 ± 0.03744.2/29 (1.53)
2, w1.0000.0490.0470.04567%−0.0531.190.999 ± 0.04916.3/17 (0.96)
3, ξp1.0010.0480.0390.03556%−0.0291.381.003 ± 0.03544.8/29 (1.54)
3, w0.9980.0500.0390.04167%−0.0421.200.997 ± 0.04317.6/17 (1.03)
4, ξp0.9920.0430.0350.03260%−0.0061.450.993 ± 0.03439.1/29 (1.35)
4, w1.0050.0500.0500.04359%−0.0671.121.003 ± 0.04317.2/17 (1.01)
5, ξp1.0090.0390.0330.03869%−0.0561.031.007 ± 0.03943.7/29 (1.51)
5, w1.0050.0450.0420.04266%−0.0271.061.006 ± 0.04718.9/17 (1.11)

Before closing this section, we would like to comment further on the dual roles of χ2: the characterization of the goodness of fit and the model differentiation. The χ2/dof is often used to indicate the goodness of the fit.9 We discussed previously that the χ2/dof is substantially larger than 1 although the model offers a good fit to the BAO. A separate question is if we can use the χ2 value for model differentiation, especially for claiming the significance of the BAO detection. Although we may not be able to use the χ2 distribution results, in principle we can use the mocks to answer the question that if there are no BAO signals in the data, how much the chance is that the BAO template yields a χ2 value smaller than certain threshold.

6 CONCLUSIONS

There are numerous large-scale structure photometric surveys ongoing or upcoming. One of the important probes to extract cosmological information from these surveys is via the measurement of the BAO. A common way to measure BAO in photometric data is to divide the sample into tomographic bins and measure their auto (and cross) correlation.

It has been suggested to analyse the photometric data as the spectroscopic ones through the 3D correlation function ξp. Previous modelling was limited to a Gaussian photo-z approximation. In this work, we generalize the modelling to accommodate general photo-z distribution. This eliminates the possibility of biasing the results by assuming Gaussian photo-z approximation. To do so, we bin the general cross angular correlation between different redshift bins wijk) using the variables s and μ (or s and μ) like in usual 3D correlation function analysis. Our approach highlights the connection between the angular correlation function and 3D correlation function.

To help understanding the impact of the photo-z uncertainties on the correlation function, especially the BAO, we specialize the calculations to the Gaussian photo-z case. When σ ≳ 0.02, the BAO peak in the correlation shows up at the true BAO scale only if the transverse scale of the separation, s, is used as the independent variable. We show that this results from the interplay between the photo-z distribution and the Jacobian of the transformation. The latter formally diverges at s, and for σ ≳ 0.02, it dominates the integral, causing ξp to trace the underlying correlation function at the scale s. Large-scale photometric surveys equipped with broadband filters, such as DES, typically have σ ≳ 0.02. In this case, it is favourable to adopt the parametrization ξp(s, μ), which effectively projects the 3D correlation function to the transverse direction. Unfortunately, this also means that ξp only probes the transverse BAO and cannot be used to directly constrain the Hubble parameter.

Our approach also enables us to derive the Gaussian covariance for ξp from the Gaussian covariance for the angular correlation function w. Due to photo-z mixing, the covariance of the ξp statistics has strong off-diagonal elements. This high correlation causes problems to the data fitting. One of the problems is that some of the best-fitting results are manifestly poor, but we find that they can be resolved by suppressing the large eigenvalue modes in the covariance. Another issue is that the mean χ2/dof for the fit is large, even though the fit is apparently good. While this second issue cannot be easily resolved by adjusting the covariance, we find that the issue is primarily caused by the small eigenvalue modes and they are not directly related to the BAO scale. We conclude that the BAO scale is well-fitted by our model (Fig. 15).

We have used a set of dedicated DES Y3 mocks to test our pipeline. This set of mocks include realistic photo-z distribution among other things. We find that the theory template is in good agreement with the mock measurement. After verifying our methodology, we apply our pipeline to the BAO fit in the mock catalog. Overall, we find that the results are consistent with the ones derived from w even with small improvements because ξp makes use of the cross redshift bin correlation as well. However, the angular correlation function measurements are less correlated with χ2/dof ∼ 1. As ξp offers an alternative way to extract the BAO information, we shall apply it to the DES Y3 samples. This can serve as an important cross check as different statistics have different sensitivities to the potential systematics. Some preliminary results on ξp have been presented in Abbott et al. (2022) on the BAO samples, but a detailed quantitative analysis is still lacking. This is particularly interesting, as the DM measurement in Abbott et al. (2022) shows stimulating deviation from the Planck results. Measuring DM using alternative statistics on various samples helps establish the results.

ACKNOWLEDGEMENTS

We thank Fengji Hou for useful discussions. KCC acknowledges the support from the National Science Foundation of China under the grant 11873102, the science research grants from the China Manned Space Project with NO. CMS-CSST-2021-B01, and the Science and Technology Program of Guangzhou, China (no. 202002030360). SA is supported by the MICUES project, funded by the EU H2020 Marie Sklodowska-Curie Actions grant no. 713366 (InterTalentum UAM) and ‘EU-HORIZON-2020-776247 Enabling Weak Lensing Cosmology (EWC)’.

Funding for the DES Projects has been provided by the U.S. Department of Energy, the U.S. National Science Foundation, the Ministry of Science and Education of Spain, the Science and Technology Facilities Council of the United Kingdom, the Higher Education Funding Council for England, the National Center for Supercomputing Applications at the University of Illinois at Urbana-Champaign, the Kavli Institute of Cosmological Physics at the University of Chicago, the Center for Cosmology and Astro-Particle Physics at the Ohio State University, the Mitchell Institute for Fundamental Physics and Astronomy at Texas A&M University, Financiadora de Estudos e Projetos, Fundação Carlos Chagas Filho de Amparo à Pesquisa do Estado do Rio de Janeiro, Conselho Nacional de Desenvolvimento Científico e Tecnológico and the Ministério da Ciência, Tecnologia e Inovação, the Deutsche Forschungsgemeinschaft and the Collaborating Institutions in the Dark Energy Survey.

The Collaborating Institutions are Argonne National Laboratory, the University of California at Santa Cruz, the University of Cambridge, Centro de Investigaciones Energéticas, Medioambientales y Tecnológicas-Madrid, the University of Chicago, University College London, the DES-Brazil Consortium, the University of Edinburgh, the Eidgenössische Technische Hochschule (ETH) Zürich, Fermi National Accelerator Laboratory, the University of Illinois at Urbana-Champaign, the Institut de Ciències de l’Espai (IEEC/CSIC), the Institut de Física d’Altes Energies, Lawrence Berkeley National Laboratory, the Ludwig-Maximilians Universität München and the associated Excellence Cluster Universe, the University of Michigan, NSF’s NOIRLab, the University of Nottingham, The Ohio State University, the University of Pennsylvania, the University of Portsmouth, SLAC National Accelerator Laboratory, Stanford University, the University of Sussex, Texas A&M University, and the OzDES Membership Consortium.

Based in part on observations at Cerro Tololo Inter-American Observatory at NSF’s NOIRLab (NOIRLab Prop. ID 2012B-0001; PI: J. Frieman), which is managed by the Association of Universities for Research in Astronomy (AURA) under a cooperative agreement with the National Science Foundation.

The DES data management system is supported by the National Science Foundation under grant numbers AST-1138766 and AST-1536171. The DES participants from Spanish institutions are partially supported by MICINN under grants ESP2017-89838, PGC2018-094773, PGC2018-102021, SEV-2016-0588, SEV-2016-0597, and MDM-2015-0509, some of which include ERDF funds from the European Union. IFAE is partially funded by the CERCA program of the Generalitat de Catalunya. Research leading to these results has received funding from the European Research Council under the European Union’s Seventh Framework Program (FP7/2007-2013), including ERC grant agreements 240672, 291329, and 306478. We acknowledge support from the Brazilian Instituto Nacional de Ciência e Tecnologia (INCT) do e-Universo (CNPq grant 465376/2014-2).

This manuscript has been authored by Fermi Research Alliance, LLC under Contract No. DE-AC02-07CH11359 with the U.S. Department of Energy, Office of Science, Office of High Energy Physics.

DATA AVAILABILITY

The data underlying this article will be shared on reasonable request to the corresponding author.

Footnotes

7

The standard 3D correlation analysis neglects the redshift evolution within the redshift bin.

8

The issue is not caused by the fitting method. Usage of the MCMC method also gives the same results.

9

Strictly speaking, the degrees of freedom is only defined for linear models as stressed in Andrae, Schulze-Hartung & Melchior (2010), while the parameter α (and B) appears nonlinearly. Here, we followed the usual practice to assume that the degrees of freedom can be approximated defined for our model.

REFERENCES

Abbott
T.
et al. ,
2019
,
MNRAS
,
483
,
4866

Abbott
T.
et al. ,
2022
,
Phys.Rev. D
,
105
,
043512

Alam
S.
et al. ,
2017
,
MNRAS
,
470
,
2617

Alam
S.
et al. ,
2021
,
Phys.Rev.D
,
103
,
083533

Alonso
D.
,
2012
,
preprint (arXiv:1210.1833)

Amendola
L.
,
Quercellini
C.
,
Giallongo
E.
,
2005
,
MNRAS
,
357
,
429

Anderson
L.
et al. ,
2012
,
MNRAS
,
427
,
3435

Anderson
L.
et al. ,
2014
,
MNRAS
,
441
,
24

Andrae
R.
,
Schulze-Hartung
T.
,
Melchior
P.
,
2010
,
preprint (arXiv:1012.3754)

Aubourg
É.
et al. ,
2015
,
Phys. Rev. D
,
92
,
123516

Avila
S.
et al. ,
2018
,
MNRAS
,
479
,
94

Benitez
N.
,
2000
,
ApJ
,
536
,
571

Benitez
N.
et al. ,
2009
,
ApJ
,
691
,
241

Bernard
C.
et al. ,
2002
,
Phys. Rev. D
,
66
,
094501

Beutler
F.
et al. ,
2011
,
MNRAS
,
416
,
3017

Blake
C.
,
Bridle
S.
,
2005
,
MNRAS
,
363
,
1329

Blake
C.
et al. ,
2012
,
MNRAS
,
425
,
405

Bond
J. R.
,
Efstathiou
G.
,
1984
,
ApJ
,
285
,
L45

Bond
J. R.
,
Efstathiou
G.
,
1987
,
MNRAS
,
226
,
655

Camacho
H.
et al. ,
2019
,
MNRAS
,
487
,
3870

Carnero Rosell
A.
et al. ,
2021
,
MNRAS
,
509
,
778

Carnero
A.
,
Sánchez
E.
,
Crocce
M.
,
Cabré
A.
,
Gaztañaga
E.
,
2012
,
MNRAS
,
419
,
1689

Chan
K. C.
et al. ,
2018
,
MNRAS
,
480
,
3031

Chaves-Montero
J.
,
Angulo
R. E.
,
Hernández-Monteagudo
C.
,
2018
,
MNRAS
,
477
,
3892

Cole
S.
,
Fisher
K. B.
,
Weinberg
D. H.
,
1994
,
MNRAS
,
267
,
785

Cole
S.
et al. ,
2005
,
MNRAS
,
362
,
505

Coupon
J.
et al. ,
2012
,
A&A
,
542
,
A5

Crocce
M.
,
Scoccimarro
R.
,
2008
,
Phys. Rev. D
,
77
,
023533

Crocce
M.
,
Cabré
A.
,
Gaztañaga
E.
,
2011
,
MNRAS
,
414
,
329

Crocce
M.
,
Castander
F. J.
,
Gaztañaga
E.
,
Fosalba
P.
,
Carretero
J.
,
2015
,
MNRAS
,
453
,
1513

Crocce
M.
et al. ,
2016
,
MNRAS
,
455
,
4301

Crocce
M.
et al. ,
2019
,
MNRAS
,
482
,
2807

de Simoni
F.
et al. ,
2013
,
MNRAS
,
435
,
3017

De Vicente
J.
,
Sánchez
E.
,
Sevilla-Noarbe
I.
,
2016
,
MNRAS
,
459
,
3078

Dodelson
S.
,
2003
,
Modern Cosmology
.
Academic Press
,
Amsterdam

Drummond
I. T.
,
Horgan
R. R.
,
1993
,
Phys. Lett. B
,
302
,
271

Eisenstein
D. J.
et al. ,
2005
,
ApJ
,
633
,
560

Elvin-Poole
J.
et al. ,
2018
,
Phys. Rev. D
,
98
,
042006

Estrada
J.
,
Sefusatti
E.
,
Frieman
J. A.
,
2009
,
ApJ
,
692
,
265

Ferrero
I.
et al. ,
2021
,
Astron. Astrophys.
,
656
,
A106

Foreman-Mackey
D.
,
Hogg
D. W.
,
Lang
D.
,
Goodman
J.
,
2013
,
PASP
,
125
,
306

Fosalba
P.
,
Crocce
M.
,
Gaztañaga
E.
,
Castander
F. J.
,
2015
,
MNRAS
,
448
,
2987

Gaztanaga
E.
,
Cabre
A.
,
Hui
L.
,
2009
,
MNRAS
,
399
,
1663

Guzzo
L.
et al. ,
2014
,
A&A
,
566
,
A108

Hamilton
A. J. S.
,
1992
,
ApJ
,
385
,
L5

Hamilton
A. J. S.
,
2000
,
MNRAS
,
312
,
257

Hu
W.
,
Sugiyama
N.
,
1996
,
ApJ
,
471
,
542

Hu
W.
,
Sugiyama
N.
,
Silk
J.
,
1997
,
Nature
,
386
,
37

Hütsi
G.
,
2010
,
MNRAS
,
401
,
2477

Ilbert
O.
et al. ,
2006
,
A&A
,
457
,
841

Izard
A.
,
Crocce
M.
,
Fosalba
P.
,
2016
,
MNRAS
,
459
,
2327

Kaiser
N.
,
1987
,
MNRAS
,
227
,
1

Kazin
E. A.
et al. ,
2014
,
MNRAS
,
441
,
3524

Kilcup
G.
,
1994
,
Nucl. Phys. B Proc. Suppl.
,
34
,
350

Landy
S. D.
,
Szalay
A. S.
,
1993
,
ApJ
,
412
,
64

Lewis
A.
,
Challinor
A.
,
2007
,
Phys. Rev. D
,
76
,
083005

Limber
D. N.
,
1953
,
ApJ
,
117
,
134

LoVerde
M.
,
Afshordi
N.
,
2008
,
Phys. Rev. D
,
78
,
123506

Michael
C.
,
1994
,
Phys. Rev. D
,
49
,
2616

Montanari
F.
,
Durrer
R.
,
2012
,
Phys. Rev. D
,
86
,
063503

Nicola
A.
et al. ,
2020
,
J. Cosmol. Astropart. Phys.
,
03
,
044

Padmanabhan
N.
,
White
M.
,
2009
,
Phys. Rev. D
,
80
,
063508

Padmanabhan
N.
et al. ,
2007
,
MNRAS
,
378
,
852

Peebles
P. J. E.
,
1980
,
The Large-Scale Structure of the Universe
.
Princeton Univ. Press
,
New Jersey

Peebles
P. J. E.
,
Yu
J. T.
,
1970
,
ApJ
,
162
,
815

Percival
W. J.
et al. ,
2010
,
MNRAS
,
401
,
2148

Planck Collaboration VI
,
2020
,
A&A
,
641
,
A6

Press
W. H.
,
Teukolsky
S. A.
,
Vetterling
W. T.
,
Flannery
B. P.
,
2007
,
Numerical Recipes: The Art of Scientific Computing
, 3 edn.
Cambridge Univ. Press
,
New York, NY, USA

Ross
A. J.
,
Samushia
L.
,
Howlett
C.
,
Percival
W. J.
,
Burden
A.
,
Manera
M.
,
2015
,
MNRAS
,
449
,
835

Ross
A. J.
et al. ,
2017
,
MNRAS
,
472
,
4456

Rozo
E.
et al. ,
2016
,
MNRAS
,
461
,
1431

Sadeh
I.
,
Abdalla
F. B.
,
Lahav
O.
,
2016
,
PASP
,
128
,
104502

Salazar-Albornoz
S.
et al. ,
2017
,
MNRAS
,
468
,
2938

Scott
D.
,
Srednicki
M.
,
White
M.
,
1994
,
ApJ
,
421
,
L5

Seo
H.-J.
,
Eisenstein
D. J.
,
2003
,
ApJ
,
598
,
720

Seo
H.-J.
et al. ,
2012
,
ApJ
,
761
,
13

Sridhar
S.
et al. ,
2020
,
ApJ
,
904
,
69

Sunyaev
R. A.
,
Zeldovich
Y. B.
,
1970
,
Ap&SS
,
7
,
3

Tassev
S.
,
Zaldarriaga
M.
,
Eisenstein
D. J.
,
2013
,
J. Cosmol. Astropart. Phys.
,
2013
,
036

Thacker
B. A.
,
Lepage
G. P.
,
1991
,
Phys. Rev. D
,
43
,
196

Vakili
M.
et al. ,
2020
,
preprint (arXiv:2008.13154)

Weinberg
D. H.
,
Mortonson
M. J.
,
Eisenstein
D. J.
,
Hirata
C.
,
Riess
A. G.
,
Rozo
E.
,
2013
,
Phys. Rep.
,
530
,
87

Xu
X.
,
Padmanabhan
N.
,
Eisenstein
D. J.
,
Mehta
K. T.
,
Cuesta
A. J.
,
2012
,
MNRAS
,
427
,
2146

Yoo
J.
,
Seljak
U.
,
2015
,
MNRAS
,
447
,
1789

Yoon
B.
,
Jang
Y.-C.
,
Jung
C.
,
Lee
W.
,
2013
,
J. Korean Phys. Soc.
,
63
,
145

Zhan
H.
,
Knox
L.
,
Tyson
J. A.
,
2009
,
ApJ
,
690
,
923

Zhou
R.
et al. ,
2021
,
MNRAS
,
501
,
3309

APPENDIX A: CHECKS ON THE SPECTROSCOPIC SAMPLE

Although our goal is to consistently include photo-z in our modelling, it is helpful to check our methodology on the redshift-space correlation function.

To test our pipeline, we use the real-space and redshift-space correlation function measured from the ICE-COLA mocks. In order to verify our template method, we also compute the standard linear redshift-space 3D correlation function template directly, which can be obtained by replacing ϕ in equation (17) by a Dirac delta distribution. The real-space template can be retrieved by further setting the f-terms in equation (18) to zero. In Fig. A1, we show the real-space correlation function measurement and the corresponding best-fitting template with the bias as the fitting parameter. The real-space template is in good agreement with the measurements for |$r \gtrsim 20 \, \mathrm{Mpc} \, h^{-1}$|⁠. Note that the BAO feature is stronger in the template as the linear power spectrum is used. The best-fitting effective bias is 1.63, which roughly corresponds to the bias value at the effective redshift 0.83 shown in Fig. 6 (or Fig. A2 below). Moreover, we compare the redshift-space correlation measurement with the corresponding template from the standard direct computation. The wedge correlation function is averaged over μ in the same way as ξp in the main text. We find that the agreement is mildly worse than the real-space result, and the BAO feature damping appears to be weaker than in the real-space case. In the computation, we have used the best-fitting effective bias parameter from the real-space result so as to compare with the template obtained by mapping w to ξ below. We could have used the bias parameter obtained by fitting to the redshift-space correlation and the agreement on small scales would be slightly improved.

The real-space (blue, triangles) and redshift-space (red, circles) correlation measured from the ICE-COLA mocks. The standard linear templates obtained by direct computation (solid) are compared with the corresponding measurements. We also show the redshift-space template obtained by mapping w to ξ (brown, dashed).
Figure A1.

The real-space (blue, triangles) and redshift-space (red, circles) correlation measured from the ICE-COLA mocks. The standard linear templates obtained by direct computation (solid) are compared with the corresponding measurements. We also show the redshift-space template obtained by mapping w to ξ (brown, dashed).

The best-fitting bias obtained from the photometric sample (blue, circles) and spectroscopic sample (orange, triangles).
Figure A2.

The best-fitting bias obtained from the photometric sample (blue, circles) and spectroscopic sample (orange, triangles).

We also compute the spectroscopic template following the method of mapping w to ξ discussed in the main text. Similar to the analysis for the photo-z sample, we first measure the bias parameters of the redshift-space sample using the angular correlation function. In Fig. A2, we compare the bias parameters measured from the photo-z angular correlation function and the spectroscopic angular correlation. In both cases, we divide the data in the redshift range (0.6,1.1) into 50 equal redshift bins. For the spectroscopic sample, the window function is a top-hat of width Δz = 0.01. We find that the recovered tomographic bin bias parameters are consistent with each other, and this demonstrates that fine photo-z bin measurements can indeed recover the underlying intrinsic bias.

We go on to use the bias parameters to compute the 3D correlation. For the case of spectroscopic redshift, we compute the cross angular correlation function wij(θ) with the the sampling width Δz = 0.00333 and Δθ = 0.02°. We find that finer sampling in z does not improve the results. Note that in this case, we have used the linear power spectrum with BAO damping. The template is in decent agreement with the measurement, but in the intermediate range 30–80 |$\, \mathrm{Mpc} \, h^{-1}$| it is slightly worse than the direct computation result. The difference could be caused by the extra binning and averaging in the w to ξ mapping and these may result in smoothing of the correlation function. However, this is less of an issue in the photo-z case as the intrinsic photo-z mixing window is much stronger.

In Table A1, we show the BAO fit results for the spectroscopic samples. We have shown four scenarios with different template and covariance combinations. The template is either the theory one or the mock mean. The theory template is obtained by mapping w to ξ. As there is small difference between the mock measurement and the theory, we also consider directly using the mean of the mock measurement as the template. In passing, in the main text, this possibility is not presented because we found that using the photo-z ξp mock mean as the template leads to a non-smooth likelihood. Two covariances are considered, the mock covariance and the Gaussian one. The Gaussian covariance is computed in a similar way as the photo-z case with 50 bins in redshift.

Table A1.

The BAO fit results for the spectroscopic sample. For each of the case shown, the first entry denotes the type of template, mock mean, or theory and the second one represents the type of covariance, mock, or Gaussian covariance.

Case〈α〉σstdσ68〈σα|${\rm frac.\, encl.} \langle \alpha \rangle$|dnorm|$\sigma _{d_{\rm norm}}$||${\rm mean\, of\, mocks}$|〈χ2〉/dof
Mock, Mock1.0000.0100.0100.00770%−0.0291.601.000 ± 0.00924.0/29 (0.83)
Mock, Gaussian1.0000.0100.0110.00552%−0.0221.431.001 ± 0.01130.9/29 (1.07)
Theory, Mock1.0010.0130.0140.01174%−0.0051.251.001 ± 0.01236.6/29 (1.26)
Theory, Gaussian1.0020.0140.0140.00744%−0.0231.381.002 ± 0.00743.1/29 (1.49)
Case〈α〉σstdσ68〈σα|${\rm frac.\, encl.} \langle \alpha \rangle$|dnorm|$\sigma _{d_{\rm norm}}$||${\rm mean\, of\, mocks}$|〈χ2〉/dof
Mock, Mock1.0000.0100.0100.00770%−0.0291.601.000 ± 0.00924.0/29 (0.83)
Mock, Gaussian1.0000.0100.0110.00552%−0.0221.431.001 ± 0.01130.9/29 (1.07)
Theory, Mock1.0010.0130.0140.01174%−0.0051.251.001 ± 0.01236.6/29 (1.26)
Theory, Gaussian1.0020.0140.0140.00744%−0.0231.381.002 ± 0.00743.1/29 (1.49)
Table A1.

The BAO fit results for the spectroscopic sample. For each of the case shown, the first entry denotes the type of template, mock mean, or theory and the second one represents the type of covariance, mock, or Gaussian covariance.

Case〈α〉σstdσ68〈σα|${\rm frac.\, encl.} \langle \alpha \rangle$|dnorm|$\sigma _{d_{\rm norm}}$||${\rm mean\, of\, mocks}$|〈χ2〉/dof
Mock, Mock1.0000.0100.0100.00770%−0.0291.601.000 ± 0.00924.0/29 (0.83)
Mock, Gaussian1.0000.0100.0110.00552%−0.0221.431.001 ± 0.01130.9/29 (1.07)
Theory, Mock1.0010.0130.0140.01174%−0.0051.251.001 ± 0.01236.6/29 (1.26)
Theory, Gaussian1.0020.0140.0140.00744%−0.0231.381.002 ± 0.00743.1/29 (1.49)
Case〈α〉σstdσ68〈σα|${\rm frac.\, encl.} \langle \alpha \rangle$|dnorm|$\sigma _{d_{\rm norm}}$||${\rm mean\, of\, mocks}$|〈χ2〉/dof
Mock, Mock1.0000.0100.0100.00770%−0.0291.601.000 ± 0.00924.0/29 (0.83)
Mock, Gaussian1.0000.0100.0110.00552%−0.0221.431.001 ± 0.01130.9/29 (1.07)
Theory, Mock1.0010.0130.0140.01174%−0.0051.251.001 ± 0.01236.6/29 (1.26)
Theory, Gaussian1.0020.0140.0140.00744%−0.0231.381.002 ± 0.00743.1/29 (1.49)

When both the template and the covariance are from mocks, it could be the best case scenario. But we still find that the error estimate is smaller than the spread of the distribution and dnorm deviates from Gaussianity significantly. This suggests that the likelihood is non-Gaussian. Moreover, the 〈χ2〉/dof is small relative to 1. This could be caused by issues of the mocks such as the overlapping issue discussed in the main text. When the mock covariance is replaced by the Gaussian one, 〈σα〉 is further reduced and the fraction enclosing 〈α〉 is lowered. Interestingly, 〈χ2〉/dof is closest to unity in this case. When the theory template is used with the mock covariance, the spread of the best-fitting α and 〈χ2〉/dof increase. These can be attributed to the difference between the template and the mock mean. Finally, when both the theory template and the Gaussian covariance are used, the 〈χ2〉/dof is the highest. We note that it is still smaller than the photo-z case by 0.5.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://dbpia.nl.go.kr/journals/pages/open_access/funder_policies/chorus/standard_publication_model)