-
PDF
- Split View
-
Views
-
Cite
Cite
Kwan Chuen Chan, Ismael Ferrero, Santiago Avila, Ashley J Ross, Martin Crocce, Enrique Gaztañaga, Clustering with general photo-z uncertainties: application to Baryon Acoustic Oscillations, Monthly Notices of the Royal Astronomical Society, Volume 511, Issue 3, April 2022, Pages 3965–3982, https://doi.org/10.1093/mnras/stac340
- Share Icon Share
ABSTRACT
Photometric data can be analysed using the 3D correlation function ξp to extract cosmological information via e.g. measurement of the Baryon Acoustic Oscillations (BAO). Previous studies modeled ξp assuming a Gaussian photo-z approximation. In this work we improve the modeling by incorporating realistic photo-z distribution. We show that the position of the BAO scale in ξp is determined by the photo-z distribution and the Jacobian of the transformation. The latter diverges at the transverse scale of the separation s⊥, and it explains why ξp traces the underlying correlation function at s⊥, rather than s, when the photo-z uncertainty σz/(1+ z) ≳ 0.02. We also obtain the Gaussian covariance for ξp. Due to photo-z mixing, the covariance of ξp shows strong off-diagonal elements. The high correlation of the data causes some issues to the data fitting. None the less, we find that either it can be solved by suppressing the largest eigenvalues of the covariance or it is not directly related to the BAO. We test our BAO fitting pipeline using a set of mock catalogs. The data set is dedicated for Dark Energy Survey Year 3 (DES Y3) BAO analyses and includes realistic photo-z distributions. The theory template is in good agreement with mock measurement. Based on the DES Y3 mocks, ξp statistic is forecast to constrain the BAO shift parameter α to be 1.001 ± 0.023, which is well consistent with the corresponding constraint derived from the angular correlation function measurements. Thus, ξp offers a competitive alternative for the photometric data analyses.
1 INTRODUCTION
Imaging surveys infer the redshift of galaxy samples, photo-z, using broadband filters. Ongoing and future large-scale structure surveys that collect enormous amount of photo-z data include Kilo-Degree Survey (KiDS),1 Dark Energy Survey (DES),2 Hyper Suprime-Cam (HSC),3 Rubin Observatory’s Legacy Survey of Space and Time (LSST),4 Euclid,5 and the Chinese Survey Space Telescope (CSST).6 These surveys typically use several bandpasses, e.g. DES has grizY and the accuracy of the resultant photo-z is about σ ∼ 0.03(1 + z) [see e.g. Crocce et al. (2019), Carnero Rosell et al. (2021), but it can be improved if more stringent criteria are applied (Rozo et al. 2016)]. Although photometric surveys yield less precise redshift than spectroscopic ones (σ ∼ 10−3), they are more efficient to collect a large volume of survey data with deep magnitude.
Baryonic Acoustic Oscillations (BAO; Peebles & Yu 1970; Sunyaev & Zeldovich 1970) are the primordial acoustic features imprinted in the distribution of the large-scale structure. In the early universe, photons and baryons form a tightly coupled plasma, and acoustic oscillations are excited in it. After the recombination, hydrogen atoms form, the acoustic waves get stalled because there is no medium for propagation. The acoustic patterns are preserved in the distribution of galaxies and they correspond to the sound horizon at the drag epoch, which is about 150 Mpc. Formation of the BAO is governed by well-understood linear physics, see Bond & Efstathiou (1984), Bond & Efstathiou (1987), Hu & Sugiyama (1996), Hu, Sugiyama & Silk (1997), and Dodelson (2003) for the details of the cosmic microwave background physics. Given its robustness, BAO has been regarded as a standard ruler in cosmology, see e.g. Weinberg et al. (2013) and Aubourg et al. (2015). BAO has been measured in numerous spectroscopic data analyses. It was first clearly detected by Eisenstein et al. (2005) and Cole et al. (2005), and followed up in many subsequent studies, e.g. Gaztanaga, Cabre & Hui (2009), Percival et al. (2010), Beutler et al. (2011), Blake et al. (2012), Anderson et al. (2012), Kazin et al. (2014), Ross et al. (2015), and Alam et al. (2017, 2021).
The sample size in terms of volume and depth from photometric surveys may compensate its imprecision in redshift information and give competitive measurements (Seo & Eisenstein 2003; Blake & Bridle 2005; Amendola, Quercellini & Giallongo 2005; Benitez et al. 2009; Zhan, Knox & Tyson 2009; Chaves-Montero, Angulo & Hernández-Monteagudo 2018). Indeed, there are several detections of the BAO feature in the galaxy distribution using photometric data from SDSS (Padmanabhan et al. 2007; Estrada, Sefusatti & Frieman 2009; Hütsi 2010; Carnero et al. 2012; Seo et al. 2012; de Simoni et al. 2013), DES [Y1 (Abbott et al. 2019) and Y3 (Abbott et al. 2022), hereafter DES Y3], and DECaLS (Sridhar et al. 2020).
Because of the photo-z uncertainty, it is natural to perform the analyses of the photometric data using angular statistics such as the angular correlation function (Coupon et al. 2012; Crocce et al. 2016; Abbott et al. 2019; Elvin-Poole et al. 2018; Vakili et al. 2020) or the angular power spectrum (Padmanabhan et al. 2007; Seo et al. 2012; Camacho et al. 2019; Nicola et al. 2020). In particular, these two statistics were adopted as the fiducial statistical tools to measure BAO in DES Y3 analyses. However, given enormous amount photo-z data available, it is worthwhile exploring new methods to enhance information extraction, especially from the BAO. Ross et al. (2017) proposed to measure the BAO using the 3D correlation function characterized by the transverse scale of the separation. This statistic has been applied to photo-z data to get interesting BAO measurements (Abbott et al. 2019; Sridhar et al. 2020). The photo-z uncertainty of the sample must be folded in the theory template. To simplify the modelling, the photo-z distribution was assumed to be Gaussian in Ross et al. (2017). The actual photo-z distribution has non-negligible deviation from Gaussian distribution, e.g. Crocce et al. (2019), Zhou et al. (2021), and Carnero Rosell et al. (2021). This assumption may risk introducing bias, and so this statistic was not adopted in the fiducial DES Y1 and Y3 analyses. In this work, we improve the modelling of the 3D clustering by incorporating general photo-z distribution and present a Gaussian covariance for it.
This paper is organized as follows. In Section 2, we first review the computation of the angular correlation and then show that the general angular correlation can be turned into the 3D correlation function ξp. We consider the Gaussian photo-z limit and use it to understand the effect of photo-z uncertainties on ξp in Section 3. In Section 4, we derive the Gaussian covariance for ξp using the Gaussian covariance for the angular correlation function. In Section 5.1, we present the numerical results on the template and covariance, and compare them with mock results whenever possible. We discuss the issue in data fitting caused by highly correlated data in Section 5.2.1, and then present the BAO fit results in Section 5.2.2. We conclude in Section 6. In Appendix A, we test our pipeline for the case of spectroscopic data. The default cosmology adopted in this work is the MICE cosmology (Crocce et al. 2015; Fosalba et al. 2015), which is a flat ΛCDM with Ωm = 0.25, ΩΛ = 0.75, h = 0.7, and σ8 = 0.8.
2 ANGULAR AND THREE-DIMENSIONAL TWO-POINT CORRELATION
In this section, we first review the computation of the angular correlation function, w, in the presence of general photo-z uncertainty. We then use the general angular correlation function to derive the 3D two-point correlation ξp.
2.1 Angular correlation function wp
Because in imaging surveys, angles are well determined but there are typically significant redshift uncertainties, it is natural to use angular quantities. It is useful to first clarify the relation between volume number density and tomographic bin angular number density.
Equation (13) accounts for the photo-z uncertainty exactly and it is valid for the curved sky. We only need to plug in the appropriate power spectrum |$P(\boldsymbol {k} , z,z^{\prime })$|. However, the redshift-space power spectrum in the distant observer limit is often used, and so this effectively limits the results to the flat sky case. As long as the local line of sight (LOS) is used, the effect is mild, e.g. Yoo & Seljak (2015).
2.2 From w to ξp
Because of the statistical rotational invariance, the angular correlation function w(θ, zeff) for the sample at the effective redshift zeff can be parametrized by the angle θ subtended by two points on the sky at the observer. If in addition, the redshifts (or photo-z) of these two points are also specified, e.g. in the form of tomographic bins, w(θ, z, z′) furnishes a representation of the 3D correlation function. On the other hand, the 3D correlation function ξ in real space depends only on the distance between the two points, |$s = |\boldsymbol {r}_1 - \boldsymbol {r}_2|$|, thanks to the statistical translational and rotational invariance. The redshift space distortion, along the LOS direction, breaks the rotational invariance in redshift space and ξ is usually parametrized as a function of s and μ, the latter is the dot product between direction of the separation, |$\hat{\boldsymbol {s}}$| and the LOS |$\hat{ \boldsymbol {e}}$|, or in terms of s⊥ and s∥, the component of |$\boldsymbol {s}$| perpendicular and parallel to the LOS. In practice, both w and ξ can be computed by assigning density to a grid (2D pixelized grid cells for w and 3D grid cells for ξ) or by comparing pair counts between the data catalog and the random one.
Because δ(2) = δ(3), it is clear that the general cross angular correlation function and the 3D correlation are equal provided a cosmology is assumed to convert angle and redshift to distance and the redshift bin is fine enough so that the projection effect ,along the LOS direction is negligible. Depending on how it is parametrized, the correlation function goes by different names. We shall consider the 3D correlation function |$\xi _{\rm p} (s, \mu) = \langle \delta _{\rm p} (\boldsymbol {r}_1) \delta _{\rm p} (\boldsymbol {r}_2) \rangle$| and its projective version characterized by the transverse scale, |$\xi _{\rm p} (s_\perp , \mu) \equiv \langle \delta _{\rm p} (\boldsymbol {r}_1) \delta _{\rm p} (\boldsymbol {r}_2) \rangle _\parallel$|, where the notation 〈…〉∥ means that we average over s∥ ≡ sμ by stacking together the pairs with the same |$s_\perp \equiv s \sqrt{ 1 - \mu ^2 }$|.

Left-hand panel: the relation between the angular correlation function specified by two true redshift distributions conditional on photo-z zp and |$z_{\rm p}^{\prime }$|, subtending an angle θ at the observer. The corresponding 3D correlation function is described by the separation vector |$\boldsymbol {s}$| relative to the LOS direction |$\hat{\boldsymbol {e}}$|. Right-hand panel: The integral over the two distributions can be approximated by a 1D integral over Δzr or the separation s′.
Here, we describe binning into s and μ bin as an example. For ξp(s⊥, μ), the procedures are the same except that we compute s⊥ from the redshifts of the bin pairs and their angular separation, and then bin the correlation based on s⊥ and μ.
The measurement of the ξp statistics requires a fiducial cosmology input, while the tomographic bin correlation in term of angle and redshift can evade the cosmology dependence (Montanari & Durrer 2012). However, the advantage of the ξp statistic is that it can effectively summarize the information in multiple tomographic bins in a single correlation measurement, and it is easier to borrow the techniques developed for the spectroscopic data analysis.
3 UNDERSTANDING THE IMPACT OF PHOTO-z BY A GAUSSIAN CASE STUDY
It is instructive to check under what conditions our general expression would reduce to this result. We will see that this exercise gives valuable insight into our results.
Equation (30) reveals that the photo-z correlation function is a projection of the underlying true correlation with a Gaussian weight of zero mean and variance 2σ2, in agreement with equation (6) of Ross et al. (2017). In Fig. 2, we show the wedge correlation function obtained with the Gaussian photo-z distribution. The effective redshift of the two conditional true redshift distributions centred at zp and |$z_{\rm p}^{\prime }$|, respectively, is taken to be zeff = 0.8. The standard deviation of the distribution is assumed to be σ = (1 + zeff)σz and a suite of σz values are considered: 0.001, 0.01, 0.02, and 0.04. In this plot, for simplicity the linear real-space correlation is used. We have compared the results computed with the approximation equation (31) and by directly integrating over the 2D integral. We find that the 1D approximation is in good agreement with the exact result, but deviations become apparent for high σz.

The wedge correlation function computed with Gaussian photo-z distributions. The upper and low panels show the results for using the separation s and the transverse scale s⊥ as the independent variables, respectively. The 1D approximation equation (31; solid) and the exact 2D integral results (dashed) agree with each other well, but small deviation occurs for large value of σz. Four values of σz are displayed (left-hand to right-hand panels). Different colours indicate wedge correlation of different values of μ.
Interestingly, while for small σz such as 0.001, the BAO bump peaks at the true sound horizon size, ssh, it shifts to larger values as σz increases. Also for large σz, it is clear that the BAO bump moves to larger s as μ increases. In the lower panels of Fig. 2, we show ξp as a function of the transverse scale |$s_\perp = s \sqrt{ 1- \mu ^2 }$|. As observed in Ross et al. (2017), when ξp is plotted against s⊥, the BAO position appears to line up at s⊥ = ssh for σz ≳ 0.02. Thus this plot suggests that the BAO peak in the upper panel (the usual way to plot ξ) is directly related to the scale s⊥ for σz ≳ 0.02.
To shed light on these results, we show in Fig. 3 the three factors of the integrand of equation (31): the absolute value of the Jacobian |dΔzr/ds′|, the Gaussian weight PG and the underlying correlation function ξ. We shall shortly explain these terms in details. In this illustration, we take |$s(z_{\rm p},z_{\rm p}^{\prime }) = 100 \, \mathrm{Mpc} \, h^{-1}$|.

The factors in the photo-z projection integral (equation 31) as a function of the separation s′. As shown on the right-hand panel of Fig. 1, as Δzr increases, s′ first decreases until it reaches the minimum value s′ = s⊥ and then it increases. Both the Gaussian weight PG (solid, colour) and absolute value of the Jacobian |dΔzr/ds′| (multiplied by a factor of 10, dotted, colour) correspond to the paths traversed as Δzr increases. The point where all the PG curves with different μ meet corresponds to |$s^{\prime } = s(z_{\rm p}, z_{\rm p}^{\prime })$| (chosen to be |$100 \, \mathrm{Mpc} \, h^{-1}$| here). The Jacobian diverges at s′ = s⊥ and this indicates as the vertical line in the plot. The vertical part for the Jacobian touches the PG curve as the latter turns at s′ = s⊥. The underlying correlation function (multiplied by a factor 103, solid black) is also overplotted. See the text for further explanations.
For small σz such as 0.001, which is close to the spectroscopic redshift case, the Gaussian weight is narrowly distributed about the scale |$s(z_{\rm p},z_{\rm p}^{\prime })$|. The distribution PG as a function of s′ is in fact mildly tilted for small μ and it becomes more symmetric when μ increases. As we mentioned, equation (32) vanishes at s′ = s⊥, and hence the Jacobian diverges. Except for small μ, such as μ ≲ 0.1, the divergence occurs at the scale, where PG is essentially zero and hence it is irrelevant. Therefore, in this case the BAO scale in ξp is controlled by PG, and it corresponds to the true sound horizon well.
As σz increases, the PG distribution becomes more tilted. Among the cases shown, for σz ≥ 0.01, s′ is clearly not a monotonic function of Δzr, as evident from the multivalued PG curves. The scale where all the μ-curves pass through corresponds to s′ = s. The smallest s′ that each μ-curve attains corresponds to the transverse scale |$s_\perp = s\sqrt{ 1 - \mu ^2}$|. The Jacobian formally diverges at s′ = s⊥. For small μ, as s⊥ coincides with s well, the Jacobian and the Gaussian weight both peak at s′ = s, the value of the integral is essentially equal to ξ(s).
For larger μ, it depends on the value of σz. While the Jacobian is sharply peaked at s′ = s⊥ and ξ is also rapidly decreasing with s′. For σz ≲ 0.01, the Gaussian weight at s′ = s⊥ is suppressed relative to its maximum value at s. Thus, the integral is not well peaked at s′ = s⊥. The distribution is more tilted as σz further increases and becomes almost horizontally laid down for σz ≳ 0.04. Because the Gaussian weight suppression is much more mild for σz ≳ 0.02, the integral is essentially given by ξ(s⊥). This explains why the scale s⊥ but not s, is a good indicator of the underlying correlation function when σz ≳ 0.02.
Although we have carried out the calculations explicitly using Gaussian conditional true redshift distribution, the lessons learned are more general. The approximation that the zc dependence can be integrated out and the final results can be approximated by a 1D projection integral should be valid for well-behaved unimodal distributions (modulo small random fluctuations). In particular, the fact that the underlying correlation is reflected by s⊥ primarily stems from the divergence of the Jacobian, which is not dependent on the precise form of the distribution.
We mention that the divergent Jacobian plays a similar role as the Limber approximation (Limber 1953; LoVerde & Afshordi 2008), which is phrased as a linear relation between the spherical wavenumber ℓ and the Fourier mode k and is effective in reducing the dimension of the projection integrals. In this approximation, although only the Fourier modes transverse to the LOS are included, it works very well for reasonably large ℓ because those are the only significant contributions to the integral.
As ξp corresponds to the underlying correlation function at the transverse scale s⊥ for σz ≳ 0.02, we can stack all the modes of ξp with the same s⊥ together to increase the signal to noise. Thus in this sense, the 3D ξp characterized by s⊥ is effectively a projected correlation function. Consistent with Seo & Eisenstein (2003), Blake & Bridle (2005), ξp(s⊥) for σz ≳ 0.02 only measures the transverse BAO and cannot be used to probe the radial BAO and hence the Hubble parameter directly.
4 GAUSSIAN COVARIANCE
In this section, after briefly reviewing the Gaussian covariance for the angular correlation function, we then employ it to derive the Gaussian covariance for ξp.
A few comments are in order. We evaluate the cross harmonic power spectra |$C_\ell ^{ij}$| using camb sources (Lewis & Challinor 2007). To account for the fact that the data vector is binned into angular bins, we need to use the bin-averaged Legendre polynomial (Salazar-Albornoz et al. 2017). In addition, as the terms are computed numerically, we need to evaluate some of the shot noise terms analytically to ensure numerical convergence (Chan et al. 2018).
Our method approximates the 3D correlation and the shot noise by the tomographic version. For the signal, when the tomographic bin size is small compared to photo-z uncertainty, the results are convergent. For the shot noise, we approximate the Dirac delta in redshift in equation (40) by a Kronecker delta and the volume density by the tomographic one.
5 MOCK TESTS
In this section, we first test the theory template and the Gaussian covariance for ξp against mock catalogs in Section 5.1 and then test the BAO fit pipeline, as well as present the fit results in details in Section 5.2. To do so, we use the ICE-COLA mocks (Ferrero et al. 2021), which is a dedicated mock catalog for the DES Y3 BAO analyses. We briefly describe it here and refer readers to Ferrero et al. (2021) for more details.
The ICE-COLA mocks are derived from the COLA simulations which is based on the COLA method (Tassev, Zaldarriaga & Eisenstein 2013) and implemented by the ICE-COLA code (Izard, Crocce & Fosalba 2016). The COLA method combines the second-order Lagrangian perturbation theory with the particle-mesh simulation technique to ensure that the large-scale modes remain accurate when coarse simulation time steps are used. The simulation consists of 20483 particles in a cube of side length 1536 |$\, \mathrm{Mpc} \, h^{-1}$|, matching to the mass resolution of MICE Grand challenge N-body simulations (Crocce et al. 2015; Fosalba et al. 2015). To generate the whole sky light-cone simulation at z ∼ 1.4, the simulation box is replicated three times in each Cartesian direction (altogether 64 copies). The replication has important consequences for the covariance measurements, and we will further comment on it later on. Mock galaxies are assigned to the ICE-COLA haloes using a hybrid Halo Occupation Distribution and Halo Abundance Matching recipe, following a similar approach in Avila et al. (2018). The mocks are calibrated with the redshift distribution and the angular clustering amplitude of the true galaxy samples. From the full-sky light-cone mocks, four maps with the DES Y3 footprint are extracted. Thus, there are 1952 mock catalogs in total, although we will only use a subset of them.
A key challenge is to assign realistic photo-z’s to the mock galaxies. To do so, the photo-z distribution is estimated using 8362 galaxies with known spectroscopic redshift from VIPERS survey (Guzzo et al. 2014). Using these galaxies, which have both the spectroscopic redshift zs and the photo-z from DES, the 2D distribution P(zp, zs) can be estimated. By sampling P(zp, zs) conditioned on the true redshift zs, the candidate mock galaxies are put to the appropriate photometric bins. A further requirement is that the mock galaxies must match the abundance of the observed photometric galaxies. Suppose there are N(zpi) galaxies in the photometric bin zpi, the most luminous N(zpi)P(zpi, zsj) galaxies from the bin (zpi, zsj) is taken as the mock sample. By construction, the mock galaxies will match the observed galaxy abundance and the conditional true redshift distribution.
As the cosmology adopted by the mock catalog is the MICE cosmology, the theory templates, i.e. the theoretical model described in Section 2.2, and the Gaussian covariance are computed in this cosmology, although we will also consider the template computed in Planck cosmology (Planck Collaboration 2020).
5.1 Template and covariance
In this section, we use the ICE-COLA mocks to test the theories developed in Sections 2 and 4. The theory calculations require the conditional true redshift distribution and the bias parameters measured from mocks. Thus we shall first present the measurement of these quantities before confronting the theories with measurements. In Appendix A, we check the template calculation using the spectroscopic correlation function.
The spread of the conditional true redshift distribution generally increases with redshift. As ϕ depends on the number density in true redshift and the spectroscopic data is not available outside (0.6,1.1), we cannot reliably calibrate ϕ for all bins. Here, we approximate ϕ by f. The preliminary estimate suggests that the effect is small, but when the spectroscopic density is rapidly changing such as for the high redshift bins, it should be taken into account. We shall present more detailed analysis in future work. In the standard tomographic analysis, the samples are divided into a number of redshift bins and their auto (and cross) correlation functions are considered. For example, in the DES Y3 BAO analyses, the samples in the photo-z range (0.6, 1.1) are divided into five tomographic bins with bin width Δz = 0.1. For 3D correlation function, because ϕ is continuously sampled by the galaxy pair counts, we have to use fine photo-z bins in order to sample the ϕ accurately and smoothly.
To investigate the number of photo-z bin width required, we consider a number of ϕ’s measured from the ICE-COLA mocks in the photo-z range (0.6, 1.1). We divide this photo-z range into redshift bins of equal spacing. In Fig. 4, we show a sample of ϕ obtained with 5 and 50 photo-z bins. We also overplot the corresponding Gaussian approximation with the same mean and variance as the true ϕ. The Gaussian approximation is more accurate for the five-bin case as it is averaged over a larger redshift range. At high redshift, ϕ is more irregular and the Gaussian approximation is less accurate.

The conditional true redshift distributions measured from COLA mocks (filled circles) with 5 (upper panel) and 50 (lower panel) photo-z bins. The Gaussian approximation is also overplotted (solid curves). For clarity, in the case of 50 bins, only those whose bin number is divisible by 5 are shown.

σz (equation 41) of the conditional true redshift distribution for 5 (blue), 20 (orange), and 50 (green) photo-z bins.
Another ingredient required is the bias parameters. We measure the linear bias parameters by fitting the auto-angular correlation (equations 17 and 18) to the mock measurements as in Chan et al. (2018) and Abbott et al. (2019). The fit is performed by fitting to the mean of mocks in the angular range of (0.5,1.5) degree. We show the bias parameters obtained with different number of tomographic bins in Fig. 6. Similar to the case of σz, the 20-bin and 50-bin cases show consistent result with the 50-bin ones showing more random fluctuations, while the five-bin results are slightly higher.

The bias parameters measured from 5 (blue), 20 (orange), and 50 (green) photo-z bins.
We compare the theory template against the mock measurements in Fig. 7. The 3D correlation ξp is measured using the Landy-Szalay estimator (Landy & Szalay 1993) via the public code CUTE (Alonso 2012). Limited by the computing power, we only measure the correlation function from 384 ICE-COLA mocks. Judging from the size of the error bars, which are estimated by the standard error of the mean, the fidelity of the results are well sufficient for our purpose here. We estimate the correlation by considering the pairs with maximum radial separation |$120 \, \mathrm{Mpc} \, h^{-1}$| and transverse separation |$175 \, \mathrm{Mpc} \, h^{-1}$|. To ensure good sampling of the projected correlation within the maximum radial separation, we only consider ξp up to μ = 0.8. We further divide the μ range (0, 0.8) into three equal spacing ‘wedges’. Both ξp versus s and s⊥ are shown.

The wedge correlation function ξp as a function of the separation s (upper panel) and the transverse separation s⊥ (lower panel). The μ range in (0, 0.8) is divided into three equal spacing bin, (0,0.27; blue), (0.27,0.53; red), and (0.53,0.8; green). The measurements (filled circles) are the mock mean and the error bars estimated by the standard error are too small to be seen. The theory templates are the solid curves.
For the theory template, we first compute wij(θ) in 50 photo-z bins with angular bin width of 0.02 degree. Because the bin width in the radial direction is wide (|$\Delta r \sim 20 \, \mathrm{Mpc} \, h^{-1}$|), we linearly interpolate the resultant angular correlation to a larger redshift grid of 500 bins (Δz = 0.005) before binning all the pairs wij(θk) into the appropriate ξ(s, μ) bin. It is important, especially for the projected correlation function, to ensure that the maximum ranges in the radial and transverse directions used in template construction match those in the measurements. We find that the template obtained is of slightly lower amplitude than the mock measurement (by about 3 per cent), the amplitude deficit is probably due to sparse sampling in the radial direction. The template shown in Fig. 7 has been multiplied by an overall amplitude factor. Besides this amplitude adjustment, the theory templates are in good agreement with mock results.
In the conventional angular tomographic analysis, the BAO features manifest in different angular scales, which is less direct to see the photo-z effect on the BAO. To take advantage of the fact that the BAO information is condensed to a single bump in ξp, we plot in Fig. 8 the photometric correlation ξp against s⊥ and the spectroscopic ξ against s. We have shown the results for both MICE and Planck cosmology. The spectroscopic correlation ξ is the dark matter real space correlation with isotropic BAO damping, obtained by an inverse Fourier transform of an analog of equation (19) at zeff = 0.83. We also show the sound horizon rd in these two cases for comparison. Relative to the spectroscopic correlation, the BAO feature in ξp is less strong. Note that the BAO peak does not precisely correspond to rd, and it is on different sides relative to the peak for the spectroscopic and the photometric cases. A small bias could arise because of the subtle shape differences between these two cosmologies. For example, the broadband terms introduced to take care of the shape difference, if their contribution is rapidly changing, it may slightly shift the BAO position. This is more likely to occur in the photometric case given the features are less sharp than in the spectroscopic one.

The photometric correlation function ξp versus s⊥ (dotted line with filled circles) and the dark matter real space spectroscopic correlation ξ against s (solid) in MICE (blue) and Planck (red) cosmology, respectively. The sound horizons at the drag epoch in these cosmologies are also overplotted as dashed vertical lines (in the same colour code). To enhance the BAO feature, they have been multiplied by the appropriate distance squared.

The correlation coefficient for ξp. The upper panel shows the results for s and the lower one for s⊥. The results are for one μ bin in the range of (0, 0.8).
It is instructive to contrast the results with the Gaussian covariance for the angular correlation function, which is plotted in Fig. 10. The setup of the calculations are the same as those in the fiducial DES Y3 analyses. There are five photo-z bins, and in each bin, θ ranges from 0.5 to 5 degree with angular bin width of 0.2 degree. The photo-z mixes modes in the radial direction. Because the width of the redshift bin (Δz = 0.1) is wide compared to the photo-z uncertainty σ [∼0.03(1 + z)], the photo-z mixing does not affect the covariance in the same bin much. The cross-correlation between different redshift bins is mainly caused by photo-z mixing, and the only significant correlation is between neighbouring redshift bins. Again, because the redshift bin is wide compared to σ, the mixing between the neighbouring bins is mild, and so even the neighbouring bin covariance is weak. On the other hand, as photo-z mixing scale can be as high as |$\sim 100 \, \mathrm{Mpc} \, h^{-1}$|, for ξp there can be substantial mixing between the underlying correlation from different scales. Thus, the covariance for ξp can be significantly higher than that for w.

The correlation coefficient for w. The data in the redshift range (0.6,1,1) is split into five tomographic bins with bin size 0.1.
As we mentioned, construction of the ICE-COLA mock catalogs involves replicating the simulation box to form a full-sky light-cone mock. The repeated structures cause the covariance to be anomalously higher than the correct one, and this prevents us from using the mock covariance as the benchmark to test our Gaussian theory covariance. See Ferrero et al. (2021) for more discussions. Thus in DES Y3, the analytic Gaussian covariance is the default choice. Although we cannot directly test the ξp covariance against mock results, the code used to compute the Gaussian covariance for w, which is the backbone of the ξp results, has been tested in Chan et al. (2018), Avila et al. (2018), and Abbott et al. (2022). Moreover, we have verified that the amplitude ratio difference between the Gaussian theory covariance and the mock covariance are at a similar level for w and ξp, albeit quantitative comparison is not possible.
5.2 BAO fit
The default configuration for the BAO fit is as follows: the broadband terms consisting of |$\sum _i A_i / s_\perp ^i$| with i = 0, 1, and 2; the fit range |$[40,140] \, \mathrm{Mpc} \, h^{-1}$|; the bin width |$\Delta s = 3 \, \mathrm{Mpc} \, h^{-1}$|, and the Gaussian theory covariance. We use 384 mocks for the BAO fit test. Different configurations for BAO fit are checked.
We look for the best-fitting α by the maximum likelihood estimator following the procedures in Chan et al. (2018). We first analytically fit the linear parameter Ai, and then minimize the resultant χ2 w.r.t. B subject to the constraint B > 0. At the end of the operations, we are left with the residual χ2 as a function of α. The best-fitting α corresponds to the point where the residual χ2 attains its minimum and the symmetric 1σ error bar is derived from the Δχ2 = 1 criterion. Besides, we also cross check our results with the Markov Chain Monte Carlo (MCMC) method, in which all the parameters are fitted simultaneously.
5.2.1 Highly correlated data fit
The photo-z mixing causes ξp to be strongly correlated, as evident in Fig. 9. Here, we first describe some fitting problems caused by the high correlation, namely the apparently bad fit results and the alarmingly large χ2 per degree of freedom (χ2/dof). The former issue is caused by the largest eigenvalues of the covariance and the latter is due to the small ones. We discuss a few approaches to modify the covariance, and find that the first issue can be resolved by suppressing the largest eigenvalues. For the second issue, while it is not practical to solve it by adjusting the eigenvalues, we demonstrate that the BAO feature is well-fitted by our model and the problem is not directly related to the BAO.
To begin with, we present some peculiar fit results. One of the odd examples appears in the BAO fit to the mean of the mocks, which are shown in Fig. 11. When the MICE cosmology template is used, the resultant fit is very good and it is not surprising that the χ2/dof is very small (0.009). However, when the template computed in Planck cosmology (Planck Collaboration 2020) is used, we find that the resultant χ2/dof is still small (0.11), but the fit is manifestly poor by eye because the best-fitting model is below all the data points and most of the data points are more than 1σ away from the best fit.
![The BAO fit results to the mean of the mock measurements (grey data points with error bars). The fit obtained with the MICE template without broadband terms (solid, red) is compared with the Planck template without broadband terms result (solid, green). The latter case is below all the data points. With the inclusion of the broadband terms, the Planck template (dashed, yellow) also offers good fit. The results for the modified covariance are also shown [No broadband terms (solid, green) and with broadband terms (dashed, violet)].](https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/mnras/511/3/10.1093_mnras_stac340/1/m_stac340fig11.jpeg?Expires=1750342576&Signature=iRJ61oVxe~nq6ifcmRC1bSJfJcX6f94U5M7qDbzpHP5uUAY3cQKq-gwE-IAMJsYhc5PBcznuUoQwrJHevNo2OoXNxgufWkfo9EpVT9n4qGFkVXX9d4zuQ4wgzcCLV5Zh9nmqS-MmyBAgoMdpXvrPQzzrvAB3YrBydar9aOOIrGm7TeuXw0Ku20b~wH6IGw2D9hkrieYkrIQoFeyJc50etqY0JwBiyLflhRzASxsg5GfdK3l-LirZtshEKtVM5q99n7reQndx15cEKiAs3~wtCikx9bBtYyPABdCUHIW03-c2nHjDXRYV6ta7x8xHK--N5uNBsLERPJK2J~KV~SDVnQ__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA)
The BAO fit results to the mean of the mock measurements (grey data points with error bars). The fit obtained with the MICE template without broadband terms (solid, red) is compared with the Planck template without broadband terms result (solid, green). The latter case is below all the data points. With the inclusion of the broadband terms, the Planck template (dashed, yellow) also offers good fit. The results for the modified covariance are also shown [No broadband terms (solid, green) and with broadband terms (dashed, violet)].
Although this issue only appears when the model equation (43) does not include the broadband terms at all and disappears otherwise, it does indicate that something abnormal happens to the fit.8 In the other extreme, when we fit to the individual mocks, we find that the fit results are often fine by eye as the best-fitting model fall within all the 1σ errors, but χ2/dof is alarmingly large, some of them reach as high as 2. Note that the second issue we describe here appears for generic fit configurations, in particular for both Gaussian and mock covariance fit. We point out that the second problem is already implicit in Abbott et al. (2019) and Sridhar et al. (2020), as their ξp fit results look decent by eye but appreciably large χ2/dof were reported. We attribute these problems to the high correlation of the data induced by the photo-z mixing.

Upper panel: the elements of the precision matrix Ψij plotted against j for fixed i. The elements for the original Ψ (triangles) and the one obtained by modifying the covariance matrix (circles, see the text for details) are compared. Lower panel: the differences between the modified Ψij and the original one are displayed. The effect of the modified covariance is mainly to shift Ψ to the positive side. As the large eigenvalue modes correspond to eigenvectors of long ‘wavelength’, it also weakly affects the zones far from the diagonal.
To look into the covariance, we plot its eigenvalues in Fig. 13. We find that the spectrum spans three orders of magnitude, and most of them are of relatively small values. As a comparison we also show the diagonal elements of C, its range is much more limited. For the fit to the mean of mocks using the Planck template, we find that the fit is dominated by the largest eigenvalue mode. Because of its (excessively) large eigenvalue, its contribution to the χ2 is very small, so that even the fit is visually bad, χ2/dof is still small. On the other hand, for the fit to individual mocks, the large χ2/dof values are caused by significant contributions from modes with small eigenvalues.

The eigenvalue spectrum of the covariance matrix (orange) and the diagonal elements of covariance matrix (blue). While Cii is in its natural order, the eigenvalues have been arranged in descending order.
The problem of fitting highly correlated data is also encountered in the lattice field theory because lattice simulation data are highly correlated. Yoon et al. (2013) surveyed a few methods proposed by the lattice field theory community to tackle this problem. Michael (1994) advocated ignoring the off-diagonal elements in the covariance, but this is clearly a too violent modification. A more mild method is to remove the eigenmodes deemed problematic. Both removing the largest eigenvalue modes (Thacker & Lepage 1991) and the smallest ones (Drummond & Horgan 1993; Kilcup 1994) have been proposed. For the largest eigenvalue modes removal, it was argued that these modes have small contribution to χ2. The smallest eigenvalue modes removal is often favoured in the spirit of the singular value decomposition (Press et al. 2007). A physical argument in support of it is that those modes are generally highly oscillatory and so they are not conducive to getting a smooth fit. In a similar vein, Bernard et al. (2002) proposed to modify instead the correlation coefficient equation (42). They replace the eigenvalues of ρ smaller than certain threshold λth by λth. A new covariance matrix can be constructed using the original eigenvectors and the modified eigenvalues. Modifying the eigenmodes is more attractive as it keeps all the directions in the covariance space constrained.
We test the results obtained with different covariance recipes. While for most of the fit, different approaches give similar results, in a small fraction (order of 10 per cent) the differences are quite pronounced. In Fig. 14, we show the fit results from one of the mocks, in which the differences are clearly visible. The one obtained with the original covariance (blue) is manifestly not optimal. The best fit is 1.006 ± 0.017 in this case. In the other extreme, when the diagonal approximation (red) is used, it is not surprising that the best fit is driven close to data points, but the best fit we get is 1.002 ± 0.032, with the error bar substantially larger than the original one. By suppressing the largest eigenvalues of the covariance (green) or ρ (yellow), we can also get an apparently better fit. We replace the eigenvalues larger than certain thereshold λth by λth. We take λth to be 3 × 10−8 and 1 for the covariance and correlation coefficient modification respectively. The thresholds are chosen so that as few eigenvalues are affected as possible and the affected ones account for about 10 per cent of them. The best fits we get are 1.003 ± 0.016 and 1.005 ± 0.017, respectively.

The best fit to the ξp measurement from one of the mocks (circles, the error bars are 1σ error from the diagonal elements of the covariance). This mock is chosen because the differences among different covariance recipes are pronounced. The results obtained with the original covariance matrix (blue), diagonal approximation (red), suppressing the largest eigenvalues in the covariance (green), and suppressing the largest eigenvalues in the correlation coefficient (yellow) are compared. The raw template (black, dashed) is also plotted.
In Fig. 12, we also show the elements of Ψ obtained by suppressing the largest eigenvalues of ρ. Overall, the size of the modification is small, of the order 10−3. The main effect of the covariance modification is to shift Ψ to the positive side so that the attraction to zero is enhanced and the repulsion to negative infinity is weakened. Note that the large eigenvalue modes correspond to eigenvectors that are oscillatory in nature with long ‘wavelength’, while the small eigenvalue ones tend to be well localized. Thus the modification also weakly affects the the zone far from the diagonal. Furthermore, we find that the ρ modification recipe often results in better agreement with the original one compared to the covariance modification, hence we will adopt ρ modification in the following analysis. In Fig. 11, we also show the results obtained with ρ modification. Without the broadband terms, the best fit now passes through the data points, although the BAO position appears to be biased. With the default broadband terms (Ai with i = 0, 1, 2, and 3), the bias is removed.
On the other hand, the large χ2/dof problem is caused by the small eigenvalues. The nominal χ2/dof for the example shown in Fig. 14 are similar for different recipes, around 2.7, except for the diagonal approximation, which gives χ2/dof ∼ 0.06. Since only the largest eigenvalues are modified, the χ2 is only increased slightly compared to the original one. As it is clear from the spectrum in Fig. 13, most of the eigenvalues are small and of similar values, we need to remove or modify a large fraction of eigenmodes before the effect becomes noticeable. Removing 40 per cent of the eigenmodes with the smallest eigenvalues, the naive χ2/dof can be reduced to roughly 1. This represents a very substantial modification of the covariance matrix. Moreover, after eigenmodes removal or modification, there is no proper theoretical basis for χ2/dof anymore (Bernard et al. 2002). Thus, we shall not pursue to reduce χ2/dof by adjusting the covariance matrix.
Using the example in Fig. 14 again, we check directly if the BAO scale is well fitted by the model in Fig. 15. To decorrelate the data, we rotate the data vector by the matrix U, which is the orthogonal matrix that diagonalizes the covariance, i.e. C = UTΛU. To highlight the BAO feature in the rotated coordinates, the no-BAO template has been subtracted from both the data vector and the best-fitting model. Note that as the data vector is mixed by U, they no longer strictly correspond to the original distance r. Here, we only use it as a convenient label for the x-axis. None the less, from the difference with the no-BAO template, the location of the BAO feature can be inferred. In this basis, the BAO feature shows up as a series of wiggles in |$r \gtrsim 110 \, \mathrm{Mpc} \, h^{-1}$| instead of a single peak in the usual configuration representation. The error bars on the data are given by the eigenvalues in Λ and they are not correlated in this basis. We have also shown error bars from the modified covariance according to the ρ modification recipe. We see that the modification reduces the error bars around the BAO scale. To highlight the source of large χ2/dof, we show the ratio with respect to the error bar in the lower panel of Fig. 15. Both the results obtained with the original error bars and the modified ones are shown, which coincide with each other except for |$r \gt 130 \, \mathrm{Mpc} \, h^{-1}$|. Significant deviations of the best-fitting model from the data points occur for |$r\lesssim 100 \, \mathrm{Mpc} \, h^{-1}$|, while in the BAO zone, the model and the data agree with each other well for both types of error bars. Thus, we conclude that the large χ2/dof is caused by the part of the fit that is not directly related to the BAO.

Upper panel: the data (filled circles) and the best-fit model (solid blue) shown in the rotated basis. The orthogonal matrix U decorrelates the correlation in the error. To highlight the BAO feature, the no-BAO template has been subtracted. In this basis, the BAO feature manifests as a series of wiggles for |$r \gtrsim 110 \, \mathrm{Mpc} \, h^{-1}$|. The uncorrelated error bars from the original covariance (black) and the modified covariance obtained from the ρ modification scheme (red) are displayed. The modified error bars are reduced compared to original ones only in the large r range. The data used in this plot is based on the same mock data used in Fig. 14. Lower panel: the results are further divided by the error bars to show the significance of the deviations between the model and the data points. The results obtained with the original error bars (black) and the modified error bars (red) overlap with each other and deviations show up only for |$r\gt 130 \, \mathrm{Mpc} \, h^{-1}$|. In the BAO zone, the model and the data points agree with each other well. The large χ2/dof is driven by the data points lying outside the BAO feature.
5.2.2 Fit results
In this section, we present the BAO fit results in details. We will consider various metrics to quantify the accuracy of the fit and different variation of the fit configurations. Although these are important, they can be tedious to read. The key default result is given in the first line in Table 1. In particular, the ξp constraint on α is 〈α〉 ± σstd = 1.001 ± 0.023. We have checked that the results are robust against variation of the fit configurations. The default setup uses the Gaussian covariance modified with the ρ modification prescription. We shall also contrast them with the corresponding results for the angular correlation. They are also consistent with the w results even though ξp statistic has considerably larger 〈χ2〉/dof.
BAO fit results on the COLA mocks with different configurations in the analysis. The default setup is as follows: the broadband terms consisting of |$\sum _i A_i / s_\perp ^i$| with i = 0, 1, and 2, the fit range |$[40,140] \, \mathrm{Mpc} \, h^{-1}$|, the bin width |$\Delta s = 3 \, \mathrm{Mpc} \, h^{-1}$|, and Gaussian theory covariance with eigenvalues of ρ suppressed. See the texts for details.
Case . | 〈α〉 . | σstd . | σ68 . | 〈σα〉 . | |${\rm frac.\, encl.} \langle \alpha \rangle$| . | 〈dnorm〉 . | |$\sigma _{d_{\rm norm}}$| . | |${\rm mean\, of\, mocks}$| . | 〈χ2〉/dof . |
---|---|---|---|---|---|---|---|---|---|
Default | 1.001 | 0.023 | 0.022 | 0.020 | 61% | −0.005 | 1.18 | 1.002 ± 0.019 | 58.7/29 (2.03) |
w fit | 1.004 | 0.024 | 0.023 | 0.021 | 62% | −0.024 | 1.11 | 1.004 ± 0.021 | 93.1/89 (1.05) |
|$A_i, \, i=0$| | 1.001 | 0.023 | 0.022 | 0.019 | 59% | 0.032 | 1.19 | 1.002 ± 0.019 | 62.9/31 (2.03) |
|$A_i, \, i=0,1$| | 1.001 | 0.023 | 0.021 | 0.021 | 64% | 0.003 | 1.12 | 1.003 ± 0.020 | 58.6/30 (1.95) |
|$A_i, \, i=0,1,2,-1$| | 1.002 | 0.023 | 0.021 | 0.020 | 60% | −0.005 | 1.17 | 1.002 ± 0.019 | 56.8/28 (2.03) |
Fit range |$[70,130] \, \mathrm{Mpc} \, h^{-1}$| | 1.000 | 0.027 | 0.024 | 0.022 | 59% | 0.084 | 1.20 | 1.003 ± 0.021 | 36.4/16 (2.28) |
|$\Delta s = 10 \, \mathrm{Mpc} \, h^{-1}$| | 1.001 | 0.025 | 0.024 | 0.020 | 57% | −0.007 | 1.25 | 1.001 ± 0.020 | 10.2/5 (2.05) |
|$\Delta s = 5 \, \mathrm{Mpc} \, h^{-1}$| | 1.003 | 0.025 | 0.023 | 0.020 | 59% | −0.022 | 1.17 | 1.002 ± 0.020 | 32.1/15 (2.14) |
|$\Delta s = 2 \, \mathrm{Mpc} \, h^{-1}$| | 1.002 | 0.024 | 0.022 | 0.020 | 60% | 0.008 | 1.12 | 1.002 ± 0.019 | 81.0/45 (1.80) |
|$\Delta s = 1 \, \mathrm{Mpc} \, h^{-1}$| | 1.001 | 0.025 | 0.023 | 0.019 | 57% | 0.019 | 1.37 | 1.002 ± 0.018 | 143.1/95 (1.51) |
Planck template | 0.958 | 0.022 | 0.020 | 0.019 | 65% | −0.003 | 1.14 | 0.957 ± 0.018 | 53.7/29 (1.85) |
Original Gaussian cov. | 1.002 | 0.023 | 0.022 | 0.021 | 64% | −0.005 | 1.13 | 1.003 ± 0.020 | 57.7/29 (2.00) |
COLA cov | 1.001 | 0.023 | 0.022 | 0.021 | 65% | −0.009 | 1.16 | 1.002 ± 0.021 | 47.9/29 (1.65) |
MCMC fit | 1.001 | 0.024 | 0.023 | 0.020 | 59% | −0.004 | 1.18 | 1.003 ± 0.020 | |
Nμ = 3 | 0.999 | 0.028 | 0.025 | 0.017 | 47% | −0.024 | 1.60 | 0.999 ± 0.017 | 139.0/89 (1.56) |
Nz = 2 | 1.000 | 0.023 | 0.022 | 0.018 | 59% | −0.014 | 1.24 | 1.001 ± 0.018 | 98.1 /59 (1.66) |
Case . | 〈α〉 . | σstd . | σ68 . | 〈σα〉 . | |${\rm frac.\, encl.} \langle \alpha \rangle$| . | 〈dnorm〉 . | |$\sigma _{d_{\rm norm}}$| . | |${\rm mean\, of\, mocks}$| . | 〈χ2〉/dof . |
---|---|---|---|---|---|---|---|---|---|
Default | 1.001 | 0.023 | 0.022 | 0.020 | 61% | −0.005 | 1.18 | 1.002 ± 0.019 | 58.7/29 (2.03) |
w fit | 1.004 | 0.024 | 0.023 | 0.021 | 62% | −0.024 | 1.11 | 1.004 ± 0.021 | 93.1/89 (1.05) |
|$A_i, \, i=0$| | 1.001 | 0.023 | 0.022 | 0.019 | 59% | 0.032 | 1.19 | 1.002 ± 0.019 | 62.9/31 (2.03) |
|$A_i, \, i=0,1$| | 1.001 | 0.023 | 0.021 | 0.021 | 64% | 0.003 | 1.12 | 1.003 ± 0.020 | 58.6/30 (1.95) |
|$A_i, \, i=0,1,2,-1$| | 1.002 | 0.023 | 0.021 | 0.020 | 60% | −0.005 | 1.17 | 1.002 ± 0.019 | 56.8/28 (2.03) |
Fit range |$[70,130] \, \mathrm{Mpc} \, h^{-1}$| | 1.000 | 0.027 | 0.024 | 0.022 | 59% | 0.084 | 1.20 | 1.003 ± 0.021 | 36.4/16 (2.28) |
|$\Delta s = 10 \, \mathrm{Mpc} \, h^{-1}$| | 1.001 | 0.025 | 0.024 | 0.020 | 57% | −0.007 | 1.25 | 1.001 ± 0.020 | 10.2/5 (2.05) |
|$\Delta s = 5 \, \mathrm{Mpc} \, h^{-1}$| | 1.003 | 0.025 | 0.023 | 0.020 | 59% | −0.022 | 1.17 | 1.002 ± 0.020 | 32.1/15 (2.14) |
|$\Delta s = 2 \, \mathrm{Mpc} \, h^{-1}$| | 1.002 | 0.024 | 0.022 | 0.020 | 60% | 0.008 | 1.12 | 1.002 ± 0.019 | 81.0/45 (1.80) |
|$\Delta s = 1 \, \mathrm{Mpc} \, h^{-1}$| | 1.001 | 0.025 | 0.023 | 0.019 | 57% | 0.019 | 1.37 | 1.002 ± 0.018 | 143.1/95 (1.51) |
Planck template | 0.958 | 0.022 | 0.020 | 0.019 | 65% | −0.003 | 1.14 | 0.957 ± 0.018 | 53.7/29 (1.85) |
Original Gaussian cov. | 1.002 | 0.023 | 0.022 | 0.021 | 64% | −0.005 | 1.13 | 1.003 ± 0.020 | 57.7/29 (2.00) |
COLA cov | 1.001 | 0.023 | 0.022 | 0.021 | 65% | −0.009 | 1.16 | 1.002 ± 0.021 | 47.9/29 (1.65) |
MCMC fit | 1.001 | 0.024 | 0.023 | 0.020 | 59% | −0.004 | 1.18 | 1.003 ± 0.020 | |
Nμ = 3 | 0.999 | 0.028 | 0.025 | 0.017 | 47% | −0.024 | 1.60 | 0.999 ± 0.017 | 139.0/89 (1.56) |
Nz = 2 | 1.000 | 0.023 | 0.022 | 0.018 | 59% | −0.014 | 1.24 | 1.001 ± 0.018 | 98.1 /59 (1.66) |
BAO fit results on the COLA mocks with different configurations in the analysis. The default setup is as follows: the broadband terms consisting of |$\sum _i A_i / s_\perp ^i$| with i = 0, 1, and 2, the fit range |$[40,140] \, \mathrm{Mpc} \, h^{-1}$|, the bin width |$\Delta s = 3 \, \mathrm{Mpc} \, h^{-1}$|, and Gaussian theory covariance with eigenvalues of ρ suppressed. See the texts for details.
Case . | 〈α〉 . | σstd . | σ68 . | 〈σα〉 . | |${\rm frac.\, encl.} \langle \alpha \rangle$| . | 〈dnorm〉 . | |$\sigma _{d_{\rm norm}}$| . | |${\rm mean\, of\, mocks}$| . | 〈χ2〉/dof . |
---|---|---|---|---|---|---|---|---|---|
Default | 1.001 | 0.023 | 0.022 | 0.020 | 61% | −0.005 | 1.18 | 1.002 ± 0.019 | 58.7/29 (2.03) |
w fit | 1.004 | 0.024 | 0.023 | 0.021 | 62% | −0.024 | 1.11 | 1.004 ± 0.021 | 93.1/89 (1.05) |
|$A_i, \, i=0$| | 1.001 | 0.023 | 0.022 | 0.019 | 59% | 0.032 | 1.19 | 1.002 ± 0.019 | 62.9/31 (2.03) |
|$A_i, \, i=0,1$| | 1.001 | 0.023 | 0.021 | 0.021 | 64% | 0.003 | 1.12 | 1.003 ± 0.020 | 58.6/30 (1.95) |
|$A_i, \, i=0,1,2,-1$| | 1.002 | 0.023 | 0.021 | 0.020 | 60% | −0.005 | 1.17 | 1.002 ± 0.019 | 56.8/28 (2.03) |
Fit range |$[70,130] \, \mathrm{Mpc} \, h^{-1}$| | 1.000 | 0.027 | 0.024 | 0.022 | 59% | 0.084 | 1.20 | 1.003 ± 0.021 | 36.4/16 (2.28) |
|$\Delta s = 10 \, \mathrm{Mpc} \, h^{-1}$| | 1.001 | 0.025 | 0.024 | 0.020 | 57% | −0.007 | 1.25 | 1.001 ± 0.020 | 10.2/5 (2.05) |
|$\Delta s = 5 \, \mathrm{Mpc} \, h^{-1}$| | 1.003 | 0.025 | 0.023 | 0.020 | 59% | −0.022 | 1.17 | 1.002 ± 0.020 | 32.1/15 (2.14) |
|$\Delta s = 2 \, \mathrm{Mpc} \, h^{-1}$| | 1.002 | 0.024 | 0.022 | 0.020 | 60% | 0.008 | 1.12 | 1.002 ± 0.019 | 81.0/45 (1.80) |
|$\Delta s = 1 \, \mathrm{Mpc} \, h^{-1}$| | 1.001 | 0.025 | 0.023 | 0.019 | 57% | 0.019 | 1.37 | 1.002 ± 0.018 | 143.1/95 (1.51) |
Planck template | 0.958 | 0.022 | 0.020 | 0.019 | 65% | −0.003 | 1.14 | 0.957 ± 0.018 | 53.7/29 (1.85) |
Original Gaussian cov. | 1.002 | 0.023 | 0.022 | 0.021 | 64% | −0.005 | 1.13 | 1.003 ± 0.020 | 57.7/29 (2.00) |
COLA cov | 1.001 | 0.023 | 0.022 | 0.021 | 65% | −0.009 | 1.16 | 1.002 ± 0.021 | 47.9/29 (1.65) |
MCMC fit | 1.001 | 0.024 | 0.023 | 0.020 | 59% | −0.004 | 1.18 | 1.003 ± 0.020 | |
Nμ = 3 | 0.999 | 0.028 | 0.025 | 0.017 | 47% | −0.024 | 1.60 | 0.999 ± 0.017 | 139.0/89 (1.56) |
Nz = 2 | 1.000 | 0.023 | 0.022 | 0.018 | 59% | −0.014 | 1.24 | 1.001 ± 0.018 | 98.1 /59 (1.66) |
Case . | 〈α〉 . | σstd . | σ68 . | 〈σα〉 . | |${\rm frac.\, encl.} \langle \alpha \rangle$| . | 〈dnorm〉 . | |$\sigma _{d_{\rm norm}}$| . | |${\rm mean\, of\, mocks}$| . | 〈χ2〉/dof . |
---|---|---|---|---|---|---|---|---|---|
Default | 1.001 | 0.023 | 0.022 | 0.020 | 61% | −0.005 | 1.18 | 1.002 ± 0.019 | 58.7/29 (2.03) |
w fit | 1.004 | 0.024 | 0.023 | 0.021 | 62% | −0.024 | 1.11 | 1.004 ± 0.021 | 93.1/89 (1.05) |
|$A_i, \, i=0$| | 1.001 | 0.023 | 0.022 | 0.019 | 59% | 0.032 | 1.19 | 1.002 ± 0.019 | 62.9/31 (2.03) |
|$A_i, \, i=0,1$| | 1.001 | 0.023 | 0.021 | 0.021 | 64% | 0.003 | 1.12 | 1.003 ± 0.020 | 58.6/30 (1.95) |
|$A_i, \, i=0,1,2,-1$| | 1.002 | 0.023 | 0.021 | 0.020 | 60% | −0.005 | 1.17 | 1.002 ± 0.019 | 56.8/28 (2.03) |
Fit range |$[70,130] \, \mathrm{Mpc} \, h^{-1}$| | 1.000 | 0.027 | 0.024 | 0.022 | 59% | 0.084 | 1.20 | 1.003 ± 0.021 | 36.4/16 (2.28) |
|$\Delta s = 10 \, \mathrm{Mpc} \, h^{-1}$| | 1.001 | 0.025 | 0.024 | 0.020 | 57% | −0.007 | 1.25 | 1.001 ± 0.020 | 10.2/5 (2.05) |
|$\Delta s = 5 \, \mathrm{Mpc} \, h^{-1}$| | 1.003 | 0.025 | 0.023 | 0.020 | 59% | −0.022 | 1.17 | 1.002 ± 0.020 | 32.1/15 (2.14) |
|$\Delta s = 2 \, \mathrm{Mpc} \, h^{-1}$| | 1.002 | 0.024 | 0.022 | 0.020 | 60% | 0.008 | 1.12 | 1.002 ± 0.019 | 81.0/45 (1.80) |
|$\Delta s = 1 \, \mathrm{Mpc} \, h^{-1}$| | 1.001 | 0.025 | 0.023 | 0.019 | 57% | 0.019 | 1.37 | 1.002 ± 0.018 | 143.1/95 (1.51) |
Planck template | 0.958 | 0.022 | 0.020 | 0.019 | 65% | −0.003 | 1.14 | 0.957 ± 0.018 | 53.7/29 (1.85) |
Original Gaussian cov. | 1.002 | 0.023 | 0.022 | 0.021 | 64% | −0.005 | 1.13 | 1.003 ± 0.020 | 57.7/29 (2.00) |
COLA cov | 1.001 | 0.023 | 0.022 | 0.021 | 65% | −0.009 | 1.16 | 1.002 ± 0.021 | 47.9/29 (1.65) |
MCMC fit | 1.001 | 0.024 | 0.023 | 0.020 | 59% | −0.004 | 1.18 | 1.003 ± 0.020 | |
Nμ = 3 | 0.999 | 0.028 | 0.025 | 0.017 | 47% | −0.024 | 1.60 | 0.999 ± 0.017 | 139.0/89 (1.56) |
Nz = 2 | 1.000 | 0.023 | 0.022 | 0.018 | 59% | −0.014 | 1.24 | 1.001 ± 0.018 | 98.1 /59 (1.66) |
The default case results for ξp fit are shown in the first row in Table 1. As a reference, we also show the corresponding results for the angular correlation function, which are obtained from a joint five-tomographic fit. See table II in Abbott et al. (2022) for more details on the w fit. The mean of the best-fitting 〈α〉 is 1.001, with small bias which can be attributed to nonlinear evolution (Crocce & Scoccimarro 2008; Padmanabhan & White 2009). Two measures of the spread of the distribution of the best-fitting α are shown: the standard deviation σstd and the half of the width between 16 and 84 percentile of the distribution, σ68, which is less sensitive to the tails of the distribution. We find that σstd is larger than σ68 by a small amount (0.001), suggesting the influence by the tails. As mentioned, the 1σ error bar is derived from the Δχ2 = 1 condition, and for the error bars to be meaningful, 〈σα〉 should agree with the measures of the spread of the distribution. In the case of Gaussian likelihood, they are equal to each other. We find that the mean of the error bar is slightly smaller than the spread of the distribution. These numbers are marginally better than those for w. The bias in the best-fitting 〈α〉 is smaller (1.004 for w) and the measures of spread and error are smaller by ∼0.001. These (mild) improvements could arise from the fact that the ξp statistic makes use of the cross-correlation in the data as well, not only the tomographic bin auto-correlation for w as in the study of DES Y3.
Another indicator for the accuracy of 1σ error bar is the fraction of times that it encloses 〈α〉, which is |$68 {{\ \rm per\ cent}}$| in the case of Gaussian likelihood. In line with the error bar being slightly smaller than the spread, the fraction enclosing 〈α〉 is slightly lower than the Gaussian expectation. The normalized deviation, dnorm ≡ (α − 〈α〉)/σα is designed to study the correlation between the deviation of the best-fitting α from the ensemble mean and the error bar derived. The mean 〈dnorm〉 is close to zero, but the standard deviation |$\sigma _{d_{\text{norm }}}$| exceeds unity by |$18 {{\ \rm per\ cent}}$|. This trend is consistent with the previous observation that the spread of the distribution is slightly larger than the error estimate. In contrast, for w, the fraction of time enclosing 〈α〉 is 62 per cent, and 〈dnorm〉 = 0.02 and |$\sigma _{d_{\rm norm} } = 1.12$|. Both suggest that ξp results are less Gaussian than the w ones.
As a reference, we also show the fit to the mean measurement of the mocks. To show the results for a ‘typical’ fit, we have used the covariance corresponding to a single survey volume. The results are similar to 〈α〉 ± 〈σα〉. However, we note that for w these different measurement are almost always equal to each other, suggesting that the averaging and fitting operations are more commutative for w.
Finally, the |$\langle \chi ^2 \rangle / {\rm dof} \, (\sim 2)$| is also shown for reference, but we should bear in mind the complications mentioned in the previous section. On the other hand, for w, 〈χ2〉/dof is 1.05. We attribute this to the fact that the covariance for w is much less correlated.
A number of tests on the variation of the fitting configurations have been performed. We have tested the number of the broadband terms |$\sum _i A_i / s_\perp ^i$| with i = 0, i = 0, 1, and i = 0, 1, 2, −1. Besides minor fluctuations, the fit results are not sensitive to the number of broadband terms used. This is because the broadband terms are not degenerate with α, but they are degenerate among themselves. When a narrower fitting range |$[70,130] \, \, \mathrm{Mpc} \, h^{-1}$| is used, the constraining power is reduced and so the spread of the distribution is enlarged, especially for σstd. As the error bar σα increases only mildly, the difference between the error bar and the spread of the distribution increases.
The bin width size Δs = 10, 5, 2, and 1 |$\, \mathrm{Mpc} \, h^{-1}$| are compared with the fiducial setting |$\Delta s = 3 \, \, \mathrm{Mpc} \, h^{-1}$|. Among the cases shown, σstd for |$\Delta s = 3 \, \, \mathrm{Mpc} \, h^{-1}$| is the lowest and it is also more consistent with σ68 and 〈σα〉. Note that χ2/dof are smaller for Δs = 2 and 1 |$\, \mathrm{Mpc} \, h^{-1}$|, but this is not vital here as we already mentioned that the BAO region is well-fitted by our model, the reduction in χ2/dof is primarily due to the region lying outside the BAO.
Unlike other tests, when an alternative fiducial cosmology is assumed, the measurement of ξp should be performed again as the distance has to be computed in the new cosmology. Furthermore, we mentioned that the Hubble parameter h is contained in the unit |$\, \mathrm{Mpc} \, h^{-1}$| and the distance does not explicitly depend on it.
In Table 2, we compare the values of the α parameter obtained by the simple estimate using equation (45) and the mock measurement results. In contrast, we also show similar results for w from Abbott et al. (2022). Interestingly, we find that the ξp results are closer to the simple estimate than the ones from w. The deviation from the simple estimate could stem from the nonlinear correction and other systematic effects, and they affect the results from both templates in a similar manner.
Comparison of the α values obtained with the Planck and the MICE templates in fitting to the MICE cosmology data. The simple estimates using equation (45) are shown in the first row, and the mock measurement in the second. As a reference, the mock measurements for w from Abbott et al. (2022) are also shown in the last row.
. | Planck . | MICE . |
---|---|---|
Simple estimate | 0.959 | 1 |
ξp mock results | 0.958 | 1.001 |
w mock results | 0.966 | 1.004 |
. | Planck . | MICE . |
---|---|---|
Simple estimate | 0.959 | 1 |
ξp mock results | 0.958 | 1.001 |
w mock results | 0.966 | 1.004 |
Comparison of the α values obtained with the Planck and the MICE templates in fitting to the MICE cosmology data. The simple estimates using equation (45) are shown in the first row, and the mock measurement in the second. As a reference, the mock measurements for w from Abbott et al. (2022) are also shown in the last row.
. | Planck . | MICE . |
---|---|---|
Simple estimate | 0.959 | 1 |
ξp mock results | 0.958 | 1.001 |
w mock results | 0.966 | 1.004 |
. | Planck . | MICE . |
---|---|---|
Simple estimate | 0.959 | 1 |
ξp mock results | 0.958 | 1.001 |
w mock results | 0.966 | 1.004 |
We have shown the results obtained with the original Gaussian covariance. The ρ modification prescription only affects a small fraction of the mocks and the results are statistically similar to the original Gaussian case. Moreover, although the COLA covariance suffers from the overlap issue, we also show its results for reference. Despite the overlap issue, the results, especially the measures of the spread and the error bar, are similar to the default ones. Because we are fitting the mocks with the same covariance derived from it, the χ2/dof = 1.65 is smaller, but still it is much larger than 1. Overall, these suggest the consistency of our results across the covariance used.
The sequential maximum likelihood fitting is applied mostly for its efficiency and convenience. We have checked our results using MCMC, implemented by emcee (Foreman-Mackey et al. 2013), in which all the parameters are fit simultaneously. Similar to the conclusion for w in Chan et al. (2018), we find that the MCMC fit gives consistently similar results for ξp.
Furthermore, we have considered splitting the data in the μ range (0,0.8) into three equal-μ bins and dividing the redshift range (0.6,1.1) into two sub-redshift bins: (0.6, 0.85) and (0.85, 1.1). We have also included cross-μ bin or cross sub-redshift bin covariances. In both cases, we find that the discrepancy between σstd and 〈σα〉 increases. The issue is more serious for Nμ = 3 case, and these invite the interpretation that the discrepancy increases as the number of bins increases. We also see similar trend in the smallest Δs bin case. These could be attributed to the high covariance of the data as the correlated data are more susceptible to random fluctuations. By dividing the data into more bins, there are larger random fluctuations. On one hand, there are larger fluctuations in the best fit, resulting in an increase in the spread of the distribution. On the other hand, the error bar derived from the local curvature of the likelihood function could be underestimated as the likelihood is less smooth and the curvature around the minimum is increased.
In Table 1, we already compared the ξp results in the photo-z range (0.6,1.1) against the w results obtained using the joint fit with five tomographic bins, each of bin width Δzp = 0.1. It is instructive to compare the performance of ξp and w for individual tomographic bins, and the results are shown in Table 3. In this test, the intrinsic photo-z distribution is comparable to the bin width. Unlike the joint-bin fit, there is no cross-bin information. The difference between these statistics is that the data are further projected on to a sphere for w. Because of the limitation in computing power, only 100 mocks are used in this comparison. Overall, the results for w are more stable across redshift bins. Except for the first bin, the measures of the spread of the distribution are larger for w, and this can be interpreted as information loss due to projection. However, we note that for ξp, 〈σα〉 tends to be smaller than the measures of the spread by a larger amount. Besides, |$\sigma _{d_{\rm norm} }$| is also more significantly larger than 1. These are caused by the increase in noise as the data size reduces and are consistent with the trends for Nmu = 3 and Nz = 2 in Table 1. The χ2/dof for ξp is still substantially larger than 1 although it is smaller than the joint fit case, while for w, it remains close to 1. In summary, ξp makes use of more information and hence potentially more constraining, but it is more susceptible to noise in the data.
Comparison of the ξp and w BAO fit results, for which only the data in a single tomographic bin are used.
Case . | 〈α〉 . | σstd . | σ68 . | 〈σα〉 . | |${\rm frac.\, encl.} \langle \alpha \rangle$| . | 〈dnorm〉 . | |$\sigma _{d_{\rm norm}}$| . | |${\rm mean\, of\, mocks}$| . | 〈χ2〉/dof . |
---|---|---|---|---|---|---|---|---|---|
1, ξp | 0.996 | 0.050 | 0.049 | 0.036 | 45% | 0.051 | 1.48 | 1.002 ± 0.039 | 44.3/29 (1.53) |
1, w | 0.996 | 0.049 | 0.045 | 0.045 | 66% | −0.026 | 1.18 | 0.995 ± 0.049 | 17.4/17 (1.02) |
2, ξp | 0.997 | 0.044 | 0.040 | 0.033 | 52% | −0.025 | 1.33 | 0.999 ± 0.037 | 44.2/29 (1.53) |
2, w | 1.000 | 0.049 | 0.047 | 0.045 | 67% | −0.053 | 1.19 | 0.999 ± 0.049 | 16.3/17 (0.96) |
3, ξp | 1.001 | 0.048 | 0.039 | 0.035 | 56% | −0.029 | 1.38 | 1.003 ± 0.035 | 44.8/29 (1.54) |
3, w | 0.998 | 0.050 | 0.039 | 0.041 | 67% | −0.042 | 1.20 | 0.997 ± 0.043 | 17.6/17 (1.03) |
4, ξp | 0.992 | 0.043 | 0.035 | 0.032 | 60% | −0.006 | 1.45 | 0.993 ± 0.034 | 39.1/29 (1.35) |
4, w | 1.005 | 0.050 | 0.050 | 0.043 | 59% | −0.067 | 1.12 | 1.003 ± 0.043 | 17.2/17 (1.01) |
5, ξp | 1.009 | 0.039 | 0.033 | 0.038 | 69% | −0.056 | 1.03 | 1.007 ± 0.039 | 43.7/29 (1.51) |
5, w | 1.005 | 0.045 | 0.042 | 0.042 | 66% | −0.027 | 1.06 | 1.006 ± 0.047 | 18.9/17 (1.11) |
Case . | 〈α〉 . | σstd . | σ68 . | 〈σα〉 . | |${\rm frac.\, encl.} \langle \alpha \rangle$| . | 〈dnorm〉 . | |$\sigma _{d_{\rm norm}}$| . | |${\rm mean\, of\, mocks}$| . | 〈χ2〉/dof . |
---|---|---|---|---|---|---|---|---|---|
1, ξp | 0.996 | 0.050 | 0.049 | 0.036 | 45% | 0.051 | 1.48 | 1.002 ± 0.039 | 44.3/29 (1.53) |
1, w | 0.996 | 0.049 | 0.045 | 0.045 | 66% | −0.026 | 1.18 | 0.995 ± 0.049 | 17.4/17 (1.02) |
2, ξp | 0.997 | 0.044 | 0.040 | 0.033 | 52% | −0.025 | 1.33 | 0.999 ± 0.037 | 44.2/29 (1.53) |
2, w | 1.000 | 0.049 | 0.047 | 0.045 | 67% | −0.053 | 1.19 | 0.999 ± 0.049 | 16.3/17 (0.96) |
3, ξp | 1.001 | 0.048 | 0.039 | 0.035 | 56% | −0.029 | 1.38 | 1.003 ± 0.035 | 44.8/29 (1.54) |
3, w | 0.998 | 0.050 | 0.039 | 0.041 | 67% | −0.042 | 1.20 | 0.997 ± 0.043 | 17.6/17 (1.03) |
4, ξp | 0.992 | 0.043 | 0.035 | 0.032 | 60% | −0.006 | 1.45 | 0.993 ± 0.034 | 39.1/29 (1.35) |
4, w | 1.005 | 0.050 | 0.050 | 0.043 | 59% | −0.067 | 1.12 | 1.003 ± 0.043 | 17.2/17 (1.01) |
5, ξp | 1.009 | 0.039 | 0.033 | 0.038 | 69% | −0.056 | 1.03 | 1.007 ± 0.039 | 43.7/29 (1.51) |
5, w | 1.005 | 0.045 | 0.042 | 0.042 | 66% | −0.027 | 1.06 | 1.006 ± 0.047 | 18.9/17 (1.11) |
Comparison of the ξp and w BAO fit results, for which only the data in a single tomographic bin are used.
Case . | 〈α〉 . | σstd . | σ68 . | 〈σα〉 . | |${\rm frac.\, encl.} \langle \alpha \rangle$| . | 〈dnorm〉 . | |$\sigma _{d_{\rm norm}}$| . | |${\rm mean\, of\, mocks}$| . | 〈χ2〉/dof . |
---|---|---|---|---|---|---|---|---|---|
1, ξp | 0.996 | 0.050 | 0.049 | 0.036 | 45% | 0.051 | 1.48 | 1.002 ± 0.039 | 44.3/29 (1.53) |
1, w | 0.996 | 0.049 | 0.045 | 0.045 | 66% | −0.026 | 1.18 | 0.995 ± 0.049 | 17.4/17 (1.02) |
2, ξp | 0.997 | 0.044 | 0.040 | 0.033 | 52% | −0.025 | 1.33 | 0.999 ± 0.037 | 44.2/29 (1.53) |
2, w | 1.000 | 0.049 | 0.047 | 0.045 | 67% | −0.053 | 1.19 | 0.999 ± 0.049 | 16.3/17 (0.96) |
3, ξp | 1.001 | 0.048 | 0.039 | 0.035 | 56% | −0.029 | 1.38 | 1.003 ± 0.035 | 44.8/29 (1.54) |
3, w | 0.998 | 0.050 | 0.039 | 0.041 | 67% | −0.042 | 1.20 | 0.997 ± 0.043 | 17.6/17 (1.03) |
4, ξp | 0.992 | 0.043 | 0.035 | 0.032 | 60% | −0.006 | 1.45 | 0.993 ± 0.034 | 39.1/29 (1.35) |
4, w | 1.005 | 0.050 | 0.050 | 0.043 | 59% | −0.067 | 1.12 | 1.003 ± 0.043 | 17.2/17 (1.01) |
5, ξp | 1.009 | 0.039 | 0.033 | 0.038 | 69% | −0.056 | 1.03 | 1.007 ± 0.039 | 43.7/29 (1.51) |
5, w | 1.005 | 0.045 | 0.042 | 0.042 | 66% | −0.027 | 1.06 | 1.006 ± 0.047 | 18.9/17 (1.11) |
Case . | 〈α〉 . | σstd . | σ68 . | 〈σα〉 . | |${\rm frac.\, encl.} \langle \alpha \rangle$| . | 〈dnorm〉 . | |$\sigma _{d_{\rm norm}}$| . | |${\rm mean\, of\, mocks}$| . | 〈χ2〉/dof . |
---|---|---|---|---|---|---|---|---|---|
1, ξp | 0.996 | 0.050 | 0.049 | 0.036 | 45% | 0.051 | 1.48 | 1.002 ± 0.039 | 44.3/29 (1.53) |
1, w | 0.996 | 0.049 | 0.045 | 0.045 | 66% | −0.026 | 1.18 | 0.995 ± 0.049 | 17.4/17 (1.02) |
2, ξp | 0.997 | 0.044 | 0.040 | 0.033 | 52% | −0.025 | 1.33 | 0.999 ± 0.037 | 44.2/29 (1.53) |
2, w | 1.000 | 0.049 | 0.047 | 0.045 | 67% | −0.053 | 1.19 | 0.999 ± 0.049 | 16.3/17 (0.96) |
3, ξp | 1.001 | 0.048 | 0.039 | 0.035 | 56% | −0.029 | 1.38 | 1.003 ± 0.035 | 44.8/29 (1.54) |
3, w | 0.998 | 0.050 | 0.039 | 0.041 | 67% | −0.042 | 1.20 | 0.997 ± 0.043 | 17.6/17 (1.03) |
4, ξp | 0.992 | 0.043 | 0.035 | 0.032 | 60% | −0.006 | 1.45 | 0.993 ± 0.034 | 39.1/29 (1.35) |
4, w | 1.005 | 0.050 | 0.050 | 0.043 | 59% | −0.067 | 1.12 | 1.003 ± 0.043 | 17.2/17 (1.01) |
5, ξp | 1.009 | 0.039 | 0.033 | 0.038 | 69% | −0.056 | 1.03 | 1.007 ± 0.039 | 43.7/29 (1.51) |
5, w | 1.005 | 0.045 | 0.042 | 0.042 | 66% | −0.027 | 1.06 | 1.006 ± 0.047 | 18.9/17 (1.11) |
Before closing this section, we would like to comment further on the dual roles of χ2: the characterization of the goodness of fit and the model differentiation. The χ2/dof is often used to indicate the goodness of the fit.9 We discussed previously that the χ2/dof is substantially larger than 1 although the model offers a good fit to the BAO. A separate question is if we can use the χ2 value for model differentiation, especially for claiming the significance of the BAO detection. Although we may not be able to use the χ2 distribution results, in principle we can use the mocks to answer the question that if there are no BAO signals in the data, how much the chance is that the BAO template yields a χ2 value smaller than certain threshold.
6 CONCLUSIONS
There are numerous large-scale structure photometric surveys ongoing or upcoming. One of the important probes to extract cosmological information from these surveys is via the measurement of the BAO. A common way to measure BAO in photometric data is to divide the sample into tomographic bins and measure their auto (and cross) correlation.
It has been suggested to analyse the photometric data as the spectroscopic ones through the 3D correlation function ξp. Previous modelling was limited to a Gaussian photo-z approximation. In this work, we generalize the modelling to accommodate general photo-z distribution. This eliminates the possibility of biasing the results by assuming Gaussian photo-z approximation. To do so, we bin the general cross angular correlation between different redshift bins wij(θk) using the variables s and μ (or s⊥ and μ) like in usual 3D correlation function analysis. Our approach highlights the connection between the angular correlation function and 3D correlation function.
To help understanding the impact of the photo-z uncertainties on the correlation function, especially the BAO, we specialize the calculations to the Gaussian photo-z case. When σ ≳ 0.02, the BAO peak in the correlation shows up at the true BAO scale only if the transverse scale of the separation, s⊥, is used as the independent variable. We show that this results from the interplay between the photo-z distribution and the Jacobian of the transformation. The latter formally diverges at s⊥, and for σ ≳ 0.02, it dominates the integral, causing ξp to trace the underlying correlation function at the scale s⊥. Large-scale photometric surveys equipped with broadband filters, such as DES, typically have σ ≳ 0.02. In this case, it is favourable to adopt the parametrization ξp(s⊥, μ), which effectively projects the 3D correlation function to the transverse direction. Unfortunately, this also means that ξp only probes the transverse BAO and cannot be used to directly constrain the Hubble parameter.
Our approach also enables us to derive the Gaussian covariance for ξp from the Gaussian covariance for the angular correlation function w. Due to photo-z mixing, the covariance of the ξp statistics has strong off-diagonal elements. This high correlation causes problems to the data fitting. One of the problems is that some of the best-fitting results are manifestly poor, but we find that they can be resolved by suppressing the large eigenvalue modes in the covariance. Another issue is that the mean χ2/dof for the fit is large, even though the fit is apparently good. While this second issue cannot be easily resolved by adjusting the covariance, we find that the issue is primarily caused by the small eigenvalue modes and they are not directly related to the BAO scale. We conclude that the BAO scale is well-fitted by our model (Fig. 15).
We have used a set of dedicated DES Y3 mocks to test our pipeline. This set of mocks include realistic photo-z distribution among other things. We find that the theory template is in good agreement with the mock measurement. After verifying our methodology, we apply our pipeline to the BAO fit in the mock catalog. Overall, we find that the results are consistent with the ones derived from w even with small improvements because ξp makes use of the cross redshift bin correlation as well. However, the angular correlation function measurements are less correlated with χ2/dof ∼ 1. As ξp offers an alternative way to extract the BAO information, we shall apply it to the DES Y3 samples. This can serve as an important cross check as different statistics have different sensitivities to the potential systematics. Some preliminary results on ξp have been presented in Abbott et al. (2022) on the BAO samples, but a detailed quantitative analysis is still lacking. This is particularly interesting, as the DM measurement in Abbott et al. (2022) shows stimulating deviation from the Planck results. Measuring DM using alternative statistics on various samples helps establish the results.
ACKNOWLEDGEMENTS
We thank Fengji Hou for useful discussions. KCC acknowledges the support from the National Science Foundation of China under the grant 11873102, the science research grants from the China Manned Space Project with NO. CMS-CSST-2021-B01, and the Science and Technology Program of Guangzhou, China (no. 202002030360). SA is supported by the MICUES project, funded by the EU H2020 Marie Sklodowska-Curie Actions grant no. 713366 (InterTalentum UAM) and ‘EU-HORIZON-2020-776247 Enabling Weak Lensing Cosmology (EWC)’.
Funding for the DES Projects has been provided by the U.S. Department of Energy, the U.S. National Science Foundation, the Ministry of Science and Education of Spain, the Science and Technology Facilities Council of the United Kingdom, the Higher Education Funding Council for England, the National Center for Supercomputing Applications at the University of Illinois at Urbana-Champaign, the Kavli Institute of Cosmological Physics at the University of Chicago, the Center for Cosmology and Astro-Particle Physics at the Ohio State University, the Mitchell Institute for Fundamental Physics and Astronomy at Texas A&M University, Financiadora de Estudos e Projetos, Fundação Carlos Chagas Filho de Amparo à Pesquisa do Estado do Rio de Janeiro, Conselho Nacional de Desenvolvimento Científico e Tecnológico and the Ministério da Ciência, Tecnologia e Inovação, the Deutsche Forschungsgemeinschaft and the Collaborating Institutions in the Dark Energy Survey.
The Collaborating Institutions are Argonne National Laboratory, the University of California at Santa Cruz, the University of Cambridge, Centro de Investigaciones Energéticas, Medioambientales y Tecnológicas-Madrid, the University of Chicago, University College London, the DES-Brazil Consortium, the University of Edinburgh, the Eidgenössische Technische Hochschule (ETH) Zürich, Fermi National Accelerator Laboratory, the University of Illinois at Urbana-Champaign, the Institut de Ciències de l’Espai (IEEC/CSIC), the Institut de Física d’Altes Energies, Lawrence Berkeley National Laboratory, the Ludwig-Maximilians Universität München and the associated Excellence Cluster Universe, the University of Michigan, NSF’s NOIRLab, the University of Nottingham, The Ohio State University, the University of Pennsylvania, the University of Portsmouth, SLAC National Accelerator Laboratory, Stanford University, the University of Sussex, Texas A&M University, and the OzDES Membership Consortium.
Based in part on observations at Cerro Tololo Inter-American Observatory at NSF’s NOIRLab (NOIRLab Prop. ID 2012B-0001; PI: J. Frieman), which is managed by the Association of Universities for Research in Astronomy (AURA) under a cooperative agreement with the National Science Foundation.
The DES data management system is supported by the National Science Foundation under grant numbers AST-1138766 and AST-1536171. The DES participants from Spanish institutions are partially supported by MICINN under grants ESP2017-89838, PGC2018-094773, PGC2018-102021, SEV-2016-0588, SEV-2016-0597, and MDM-2015-0509, some of which include ERDF funds from the European Union. IFAE is partially funded by the CERCA program of the Generalitat de Catalunya. Research leading to these results has received funding from the European Research Council under the European Union’s Seventh Framework Program (FP7/2007-2013), including ERC grant agreements 240672, 291329, and 306478. We acknowledge support from the Brazilian Instituto Nacional de Ciência e Tecnologia (INCT) do e-Universo (CNPq grant 465376/2014-2).
This manuscript has been authored by Fermi Research Alliance, LLC under Contract No. DE-AC02-07CH11359 with the U.S. Department of Energy, Office of Science, Office of High Energy Physics.
DATA AVAILABILITY
The data underlying this article will be shared on reasonable request to the corresponding author.
Footnotes
The standard 3D correlation analysis neglects the redshift evolution within the redshift bin.
The issue is not caused by the fitting method. Usage of the MCMC method also gives the same results.
Strictly speaking, the degrees of freedom is only defined for linear models as stressed in Andrae, Schulze-Hartung & Melchior (2010), while the parameter α (and B) appears nonlinearly. Here, we followed the usual practice to assume that the degrees of freedom can be approximated defined for our model.
REFERENCES
APPENDIX A: CHECKS ON THE SPECTROSCOPIC SAMPLE
Although our goal is to consistently include photo-z in our modelling, it is helpful to check our methodology on the redshift-space correlation function.
To test our pipeline, we use the real-space and redshift-space correlation function measured from the ICE-COLA mocks. In order to verify our template method, we also compute the standard linear redshift-space 3D correlation function template directly, which can be obtained by replacing ϕ in equation (17) by a Dirac delta distribution. The real-space template can be retrieved by further setting the f-terms in equation (18) to zero. In Fig. A1, we show the real-space correlation function measurement and the corresponding best-fitting template with the bias as the fitting parameter. The real-space template is in good agreement with the measurements for |$r \gtrsim 20 \, \mathrm{Mpc} \, h^{-1}$|. Note that the BAO feature is stronger in the template as the linear power spectrum is used. The best-fitting effective bias is 1.63, which roughly corresponds to the bias value at the effective redshift 0.83 shown in Fig. 6 (or Fig. A2 below). Moreover, we compare the redshift-space correlation measurement with the corresponding template from the standard direct computation. The wedge correlation function is averaged over μ in the same way as ξp in the main text. We find that the agreement is mildly worse than the real-space result, and the BAO feature damping appears to be weaker than in the real-space case. In the computation, we have used the best-fitting effective bias parameter from the real-space result so as to compare with the template obtained by mapping w to ξ below. We could have used the bias parameter obtained by fitting to the redshift-space correlation and the agreement on small scales would be slightly improved.

The real-space (blue, triangles) and redshift-space (red, circles) correlation measured from the ICE-COLA mocks. The standard linear templates obtained by direct computation (solid) are compared with the corresponding measurements. We also show the redshift-space template obtained by mapping w to ξ (brown, dashed).

The best-fitting bias obtained from the photometric sample (blue, circles) and spectroscopic sample (orange, triangles).
We also compute the spectroscopic template following the method of mapping w to ξ discussed in the main text. Similar to the analysis for the photo-z sample, we first measure the bias parameters of the redshift-space sample using the angular correlation function. In Fig. A2, we compare the bias parameters measured from the photo-z angular correlation function and the spectroscopic angular correlation. In both cases, we divide the data in the redshift range (0.6,1.1) into 50 equal redshift bins. For the spectroscopic sample, the window function is a top-hat of width Δz = 0.01. We find that the recovered tomographic bin bias parameters are consistent with each other, and this demonstrates that fine photo-z bin measurements can indeed recover the underlying intrinsic bias.
We go on to use the bias parameters to compute the 3D correlation. For the case of spectroscopic redshift, we compute the cross angular correlation function wij(θ) with the the sampling width Δz = 0.00333 and Δθ = 0.02°. We find that finer sampling in z does not improve the results. Note that in this case, we have used the linear power spectrum with BAO damping. The template is in decent agreement with the measurement, but in the intermediate range 30–80 |$\, \mathrm{Mpc} \, h^{-1}$| it is slightly worse than the direct computation result. The difference could be caused by the extra binning and averaging in the w to ξ mapping and these may result in smoothing of the correlation function. However, this is less of an issue in the photo-z case as the intrinsic photo-z mixing window is much stronger.
In Table A1, we show the BAO fit results for the spectroscopic samples. We have shown four scenarios with different template and covariance combinations. The template is either the theory one or the mock mean. The theory template is obtained by mapping w to ξ. As there is small difference between the mock measurement and the theory, we also consider directly using the mean of the mock measurement as the template. In passing, in the main text, this possibility is not presented because we found that using the photo-z ξp mock mean as the template leads to a non-smooth likelihood. Two covariances are considered, the mock covariance and the Gaussian one. The Gaussian covariance is computed in a similar way as the photo-z case with 50 bins in redshift.
The BAO fit results for the spectroscopic sample. For each of the case shown, the first entry denotes the type of template, mock mean, or theory and the second one represents the type of covariance, mock, or Gaussian covariance.
Case . | 〈α〉 . | σstd . | σ68 . | 〈σα〉 . | |${\rm frac.\, encl.} \langle \alpha \rangle$| . | 〈dnorm〉 . | |$\sigma _{d_{\rm norm}}$| . | |${\rm mean\, of\, mocks}$| . | 〈χ2〉/dof . |
---|---|---|---|---|---|---|---|---|---|
Mock, Mock | 1.000 | 0.010 | 0.010 | 0.007 | 70% | −0.029 | 1.60 | 1.000 ± 0.009 | 24.0/29 (0.83) |
Mock, Gaussian | 1.000 | 0.010 | 0.011 | 0.005 | 52% | −0.022 | 1.43 | 1.001 ± 0.011 | 30.9/29 (1.07) |
Theory, Mock | 1.001 | 0.013 | 0.014 | 0.011 | 74% | −0.005 | 1.25 | 1.001 ± 0.012 | 36.6/29 (1.26) |
Theory, Gaussian | 1.002 | 0.014 | 0.014 | 0.007 | 44% | −0.023 | 1.38 | 1.002 ± 0.007 | 43.1/29 (1.49) |
Case . | 〈α〉 . | σstd . | σ68 . | 〈σα〉 . | |${\rm frac.\, encl.} \langle \alpha \rangle$| . | 〈dnorm〉 . | |$\sigma _{d_{\rm norm}}$| . | |${\rm mean\, of\, mocks}$| . | 〈χ2〉/dof . |
---|---|---|---|---|---|---|---|---|---|
Mock, Mock | 1.000 | 0.010 | 0.010 | 0.007 | 70% | −0.029 | 1.60 | 1.000 ± 0.009 | 24.0/29 (0.83) |
Mock, Gaussian | 1.000 | 0.010 | 0.011 | 0.005 | 52% | −0.022 | 1.43 | 1.001 ± 0.011 | 30.9/29 (1.07) |
Theory, Mock | 1.001 | 0.013 | 0.014 | 0.011 | 74% | −0.005 | 1.25 | 1.001 ± 0.012 | 36.6/29 (1.26) |
Theory, Gaussian | 1.002 | 0.014 | 0.014 | 0.007 | 44% | −0.023 | 1.38 | 1.002 ± 0.007 | 43.1/29 (1.49) |
The BAO fit results for the spectroscopic sample. For each of the case shown, the first entry denotes the type of template, mock mean, or theory and the second one represents the type of covariance, mock, or Gaussian covariance.
Case . | 〈α〉 . | σstd . | σ68 . | 〈σα〉 . | |${\rm frac.\, encl.} \langle \alpha \rangle$| . | 〈dnorm〉 . | |$\sigma _{d_{\rm norm}}$| . | |${\rm mean\, of\, mocks}$| . | 〈χ2〉/dof . |
---|---|---|---|---|---|---|---|---|---|
Mock, Mock | 1.000 | 0.010 | 0.010 | 0.007 | 70% | −0.029 | 1.60 | 1.000 ± 0.009 | 24.0/29 (0.83) |
Mock, Gaussian | 1.000 | 0.010 | 0.011 | 0.005 | 52% | −0.022 | 1.43 | 1.001 ± 0.011 | 30.9/29 (1.07) |
Theory, Mock | 1.001 | 0.013 | 0.014 | 0.011 | 74% | −0.005 | 1.25 | 1.001 ± 0.012 | 36.6/29 (1.26) |
Theory, Gaussian | 1.002 | 0.014 | 0.014 | 0.007 | 44% | −0.023 | 1.38 | 1.002 ± 0.007 | 43.1/29 (1.49) |
Case . | 〈α〉 . | σstd . | σ68 . | 〈σα〉 . | |${\rm frac.\, encl.} \langle \alpha \rangle$| . | 〈dnorm〉 . | |$\sigma _{d_{\rm norm}}$| . | |${\rm mean\, of\, mocks}$| . | 〈χ2〉/dof . |
---|---|---|---|---|---|---|---|---|---|
Mock, Mock | 1.000 | 0.010 | 0.010 | 0.007 | 70% | −0.029 | 1.60 | 1.000 ± 0.009 | 24.0/29 (0.83) |
Mock, Gaussian | 1.000 | 0.010 | 0.011 | 0.005 | 52% | −0.022 | 1.43 | 1.001 ± 0.011 | 30.9/29 (1.07) |
Theory, Mock | 1.001 | 0.013 | 0.014 | 0.011 | 74% | −0.005 | 1.25 | 1.001 ± 0.012 | 36.6/29 (1.26) |
Theory, Gaussian | 1.002 | 0.014 | 0.014 | 0.007 | 44% | −0.023 | 1.38 | 1.002 ± 0.007 | 43.1/29 (1.49) |
When both the template and the covariance are from mocks, it could be the best case scenario. But we still find that the error estimate is smaller than the spread of the distribution and dnorm deviates from Gaussianity significantly. This suggests that the likelihood is non-Gaussian. Moreover, the 〈χ2〉/dof is small relative to 1. This could be caused by issues of the mocks such as the overlapping issue discussed in the main text. When the mock covariance is replaced by the Gaussian one, 〈σα〉 is further reduced and the fraction enclosing 〈α〉 is lowered. Interestingly, 〈χ2〉/dof is closest to unity in this case. When the theory template is used with the mock covariance, the spread of the best-fitting α and 〈χ2〉/dof increase. These can be attributed to the difference between the template and the mock mean. Finally, when both the theory template and the Gaussian covariance are used, the 〈χ2〉/dof is the highest. We note that it is still smaller than the photo-z case by 0.5.