-
PDF
- Split View
-
Views
-
Cite
Cite
Qianjun Hang, Benjamin Joachimi, Eric Charles, John Franklin Crenshaw, Patricia Larsen, Alex I Malz, Sam Schmidt, Ziang Yan, Tianqing Zhang, the LSST Dark Energy Science Collaboration , Impact of survey spatial variability on galaxy redshift distributions and the cosmological 3 × 2-point statistics for the Rubin Legacy Survey of Space and Time (LSST), Monthly Notices of the Royal Astronomical Society, Volume 535, Issue 4, December 2024, Pages 2970–2997, https://doi.org/10.1093/mnras/stae2519
- Share Icon Share
ABSTRACT
We investigate the impact of spatial survey non-uniformity on the galaxy redshift distributions for forthcoming data releases of the Rubin Observatory Legacy Survey of Space and Time (LSST). Specifically, we construct a mock photometry data set degraded by the Rubin OpSim observing conditions, and estimate photometric redshifts of the sample using a template-fitting photo-z estimator, BPZ, and a machine learning method, FlexZBoost. We select the Gold sample, defined as |$i\lt 25.3$| for 10 yr LSST data, with an adjusted magnitude cut for each year and divide it into five tomographic redshift bins for the weak lensing lens and source samples. We quantify the change in the number of objects, mean redshift, and width of each tomographic bin as a function of the coadd i-band depth for 1-yr (Y1), 3-yr (Y3), and 5-yr (Y5) data. In particular, Y3 and Y5 have large non-uniformity due to the rolling cadence of LSST, hence provide a worst-case scenario of the impact from non-uniformity. We find that these quantities typically increase with depth, and the variation can be |$10\!-\!40~{{\rm per\ cent}}$| at extreme depth values. Using Y3 as an example, we propagate the variable depth effect to the weak lensing |$3\times 2$| pt analysis, and assess the impact on cosmological parameters via a Fisher forecast. We find that galaxy clustering is most susceptible to variable depth, and non-uniformity needs to be mitigated below 3 per cent to recover unbiased cosmological constraints. There is little impact on galaxy–shear and shear–shear power spectra, given the expected LSST Y3 noise.
1 INTRODUCTION
Observational cosmology enters the era of high-precision measurements. For example, weak gravitational lensing, which probes the small distortion of distant galaxy shapes due to the gravity of foreground large-scale structures, is particularly sensitive to the clustering parameter |$S_8=\sigma _8\sqrt{\Omega _{\rm m}/0.3}$|. Current weak lensing surveys have measured this parameter to be |$S_8=0.759^{+0.024}_{-0.021}$| by the Kilo-Degree Survey (KiDS-1000; Asgari et al. 2021), |$S_8=0.759^{+0.025}_{-0.023}$| by the Dark Energy Survey (DES-Y3; Amon et al. 2022), and |$S_8=0.760^{+0.031}_{-0.034}$| (|$S_8=0.776^{+0.032}_{-0.033}$|) using the shear power spectra (two-point correlation function) by the Hyper Suprime-Cam (HSC-Y3; Dalal et al. 2023; Li et al. 2023). The constraints are comparable to that measured by Planck Collaboration VI (2020) from the primary cosmic microwave background (CMB), |$S_8=0.830\pm 0.013$|, and the recent result from CMB lensing (Madhavacheril et al. 2024), |$S_8=0.840\pm 0.028$|, but are interestingly lower by |$2-3\sigma$|. The uncertainties of these measurements are already dominated by systematic errors – without a careful treatment of various systematic effects, the cosmological results can be biased up to a few sigma (e.g. Rodríguez-Monroy et al. 2022). The forthcoming Stage IV surveys such as the Rubin Observatory Legacy Survey of Space and Time (LSST) will achieve a combined figure of merit ten times as much as the Stage III experiments as mentioned above (The LSST Dark Energy Science Collaboration 2021). While the high statistical power enables pinning down the nature of such tensions, systematic error needs to be controlled down to sub- per cent level to ensure that our results are not biased.
One major systematic uncertainties come from survey non-uniformity. Galaxy samples detected at different survey depth, for example, will have different flux errors and number of faint objects near the detection limit. This could propagate down to systematic errors in redshift distribution and number density fluctuation. The majority of the LSST footprint will follow the wide-fast-deep (WFD) observing strategy, which means that a large survey region will be covered before building up the survey depth. At early stages of the survey, fluctuations in observing conditions, such as sky brightness, seeing, and airmass, are expected to be significant across the footprint. These can change the per-visit |$5\sigma$| limiting magnitude, |$m_5$|, leading to depth non-uniformity in the early LSST data (Ivezić et al. 2019). The survey strategy later on could also affect uniformity. LSST will adopt a ‘rolling cadence’, which means that during a fixed period, more frequent revisits will be assigned to a particular area of the sky, whereas the rest of the regions are deprioritized by up to 25 per cent of the baseline observing time. The high- and low-priority regions continue to swap, such that the full footprint is covered with the same exposure time after 10 yr. This can result in different limiting magnitudes across the sky at intermediate stages of rolling. This strategy greatly advances LSST’s potential for time domain science for example, denser sampling in light curves. However, it also poses challenges to the analysis of large-scale structure (LSS) probes, which normally prefers a uniform coverage.
Changes in |$m_5$| can change the detected sample of galaxies and its photometric redshifts in two ways. First, a larger |$m_5$| means that fainter, higher redshift galaxies will pass the detection limit. This increases the sample size, and could shift the ensemble mean redshift higher. These faint galaxies also contain large photometric noise, resulting in larger scatter with respect to the true redshift, hence broadening the redshift distribution. Secondly, at fixed magnitude, the signal-to-noise is larger given a larger |$m_5$|. This means that, contrary to the previous effect, the scatter in spec-z versus photo-z will be reduced due to the reduced noise. These effect has been studied previously in a similar context. The density fluctuation is quantified in Awan et al. (2016) via |$1+\delta _{\rm o}=(1+\delta _{\rm t})(1+\delta _{\rm OS})$|, where |$\delta _{\rm o}$| is the observed density contrast, |$\delta _{\rm t}$| is the true density, and |$\delta _{\rm OS}$| is the fluctuation in the observing condition. The effects on photo-z have been investigated in Graham et al. (2018) in the context of LSST. They showed that the photo-z quality can change significantly with respect to different observing conditions, although they did not consider tomographic binning. Heydenreich et al. (2020) and Joachimi et al. (2021) also quantified the effects for KiDS-1000 data, where the depth varies significantly between different pointings. They showed that by varying the r-band limiting magnitude, a significant amount of high redshift objects can be included in the sample, such that the mean number density can double between the deepest and shallowest pointings, and the average redshift for a tomographic bin can shift by as much as |$\Delta \langle z\rangle \sim 0.2$|. Understanding these effects are important, because weak lensing is particularly sensitive to the mean redshift of the lens and source galaxies. Heydenreich et al. (2020) demonstrated that this effect is similar to a spatially varying multiplicative bias, and for cosmic shear analysis in configuration space, constraints in the |$\Omega _{\rm m}-\sigma _8$| plane can shift up to |$\sim 1\sigma$| for a KiDS-like survey with the same area as LSST. Baleato Lizancos & White (2023) also derived an analytic expression for anisotropic redshift distributions for galaxy and lensing two-point statistics in Fourier space. They showed that, assuming a spatial variation of scale |$\ell _{z}$|, the effects are at per cent and sub-per cent level for the current and forthcoming galaxy surveys, and converge to the uniform case at |$\ell \gg \ell _{z}$|.
In this paper, we investigate how survey non-uniformity can affect the redshift distribution of tomographic bins for LSST 1, 3, and 5-yr observation (hereafter Y1, Y3, and Y5, respectively). The LSST Dark Energy Science Collaboration (DESC) Science Requirements Document (The LSST Dark Energy Science Collaboration 2021, hereafter DESC SRD) states that the photometric redshifts needs to achieve a precision of |$\langle \Delta z \rangle = 0.002(1+z)$| (|$0.001(1+z)$|) for Y1 (Y10) weak lensing analysis, and |$\langle \Delta z \rangle = 0.005(1+z)$| (|$0.003(1+z)$|) for Y1 (Y10) large-scale structure analysis. Here, using these numbers as a bench mark, we quantify changes in the mean redshift (|$\langle z \rangle$|) and width (|$\sigma _z$|) of tomographic bins, as depth varies.1 We use the up-to-date LSST observing strategy and the simulated 10-yr observing conditions for Rubin Observatory (OpSim, Delgado & Reuter 2016; Reuter et al. 2016) to quantify the survey non-uniformity, and generate a mock catalogue of true galaxy magnitude in |$ugrizy$|, redshift, and ellipticity based on the Roman-Rubin (DiffSky) simulations (Troxel et al. 2023). The degradation of photometry and photo-z estimation relies on the public software, Redshift Assessment Infrastructure Layers2 (RAIL; LSST-DESC PZ WG, in preparation), which will also be used in the LSST analysis pipeline. Finally, we propagate these effects to the clustering and weak lensing two-point statistics.
This paper is organized as follows. We describe our simulation data sets in Section 2 and introduce our methods in Section 3. The results are presented in Section 4. We show the variation of the angular power spectra with varying depth effects in Section 5. Finally, we conclude in Section 6.
2 SIMULATIONS
This section provides an overview of the simulations used in this work, namely, the Rubin Operation Simulator (OpSim; Section 2.1), which simulates the observing strategy and related properties for Rubin LSST, and the Roman–Rubin simulation (DiffSky; Section 2.2), which provides a truth catalogue complete up to |$z=3$| with realistic galaxy colours.
2.1 Rubin operations simulator (OpSim)
The Operations Simulator3 (OpSim) of the Rubin Observatory is an application that simulates the telescope movements and a complete set of observing conditions across the LSST survey footprint over the 10-yr observation period, providing predictions for the LSST performance with respect to various survey strategies. OpSim uses a historical weather log from Cerro-Tololo Inter-American Observatory (CTIO), Chile from the 10-yr period 1996 to 2005, to simulate weather conditions. An observation is conducted when the weather log is no more than 42 per cent cloudy. This gives about the same amount of total weather downtime as Gemini South and Southern Astrophysical Research (SOAR) telescope. Realistic seeing values for each observation are generated using historical seeing logs from Cerror Pachón, Chile. We utilize OpSim baseline v3.3, the most recent observing strategy. This strategy involves a rolling cadence that starts after the first year of observation. In subsequent years, parts of the sky will receive more visits than others, enabling higher resolution sampling for time domain science. At the end of the fiducial survey, uniformity will be recovered at the expected 10-yr LSST depth. The output of OpSim is evaluated by the metrics analysis framework (MAF), a software tool that computes summary statistics (e.g. mean and median of a particular observing condition over a given period) and derived metrics (e.g. coadd |$5\sigma$| depth) that can be used to assess the performance of the observing strategy, in terms of survey efficiency and various science drivers. The fieldRA and fieldDec positions used in the MAF include the dithering that has been applied. The sky is first tessellated by the telescope field of view (a few degrees in diameter), and the orientation is then randomized at the start of each night. Visits are done in pairs to allow detection of moving solar system objects, so that within a night there is no dithering. The MAF loops over the healpix pixel centres, and for each one finds the observations that overlap with that point, including rejecting observations where the point falls on a chip gap.
For the purpose of this study, we obtain survey condition maps in healpix (Górski et al. 2005) format using the MAF healpix slicer with |$N_{\rm side}=128$| (corresponding to a pixel size of |$755\, {\rm arcmin}^2$|), using the (RA, Dec) coordinates. We do not choose a higher resolution for the map because we expect that survey conditions vary smoothly on large scales, and this choice of |$N_{\rm side}$| is enough to capture the variation with the rolling pattern. For our purposes, we mainly consider the following quantities in each of the |$ugrizy$| filters: extinction-corrected coadd |$5\sigma$| point source depth (ExgalM5, hereafter |$m_5^{\rm ex}$|) and the effective full-width half-maximum seeing (seeingFwhmEff, hereafter |$\theta _{\rm FWHM}^{\rm eff}$|) in unit of arcsecond. The |$m_5^{\rm ex}$| is different from the coadd depth, |$m_5$|, by the fact that it includes the lost of depth near galactic plane. The effective seeing, |$\theta _{\rm FWHM}^{\rm eff}$|, has a wavelength dependence, with a poorer seeing at bluer filters from Kolmogorov turbulence. The MAF also takes into account for increase in point spread function (PSF) size with airmass, X, due to seeing, i.e. |$\theta _{\rm FWHM}^{\rm eff} \propto X^{0.6}$|. However, the MAF does not include the increase in PSF size along the zenith direction with zenith angle, due to differential chromatic refraction. This quantity is used here to convert point-source depth to that for extended objects. We obtain maps of these quantities over the LSST footprint at the end of each full year of observation (e.g. Y3 for |${\rm nights}\lt 1095$|). The coadded depth in each band is computed by assessing the |$5\sigma$|-depth (in magnitudes) of each visit within each healpix pixel, then computing the ‘stacked’ depth. The coadded depth calculation includes the airmass, seeing, and sky brightness of each visit. It is approximated that the whole field of view has values similar to the centre, so that vignetting or sky brightness gradients are not included. For the most part these gradients should be small and average out over many visits. Maps of |$\theta _{\rm FWHM}^{\rm eff}$| contain the median over all visits in a particular healpix pixel.
Throughout the paper, we will use Y1, Y3, and Y5 as examples to showcase the impact of spatial variability on photometric redshifts. Notice that the choice of Y3 and Y5 are a pessimistic one, because the survey strategy is close to uniformity in Y4 and Y7 where cosmological analysis are expected to be conducted. Hence, this paper provides a worst-case scenario of the severity of the impact from spatial variability. Also, the Rubin observing strategy is still being decided, and the rolling cadence may move to different times during the survey. There are ongoing efforts on recommendations about the observing strategy, and hence the results shown here should be interpreted in light of this particular strategy and years chosen. We will focus on the WFD survey programme footprint, and exclude areas with high galactic extinction |$E(B-V)\gt 0.2$| for cosmological studies. Notice that, in practice, additional sky cuts could also be applied (e.g. a depth cut that removes very shallow regions). Specifically, we will focus on the variation with respect to i-band, the detection band of LSST. Fig. 1 shows the spatial variation of the extinction-corrected coadd i-band depth for OpSim baseline v3.3 in Y1, Y3, and Y5. The stripes visible across the footprint in Y3 and Y5 are the characteristics of the rolling cadence. The distribution of all OpSim variables are shown in Fig. 2 for each of the six filters and for selected years of observation. One can see that the coadd depths build up in each band over the years, whereas the distributions of the median effective seeing per visit are relatively unchanged. One can also see a strong skewness in these distributions.

The simulated i-band coadd |$5\sigma$| depth accounting for Galactic extinction, |$m_5^{\rm ex}$|, from the Rubin observatory OpSim baseline v3.3 over the LSST wide-fast-deep (WFD) footprint, for 1-yr (upper), 3-yr (middle), and 5-yr (lower) observations. Notice the stripy patterns visible from the 3 and 5-yr observations are the result of rolling cadence. i-band is shown here because it is the detection band for LSST.

Distribution of the extinction-corrected coadd depth (|$m_5^{\rm ex}$|) and the median effective seeing (|$\theta _{\rm FWHM}^{\rm eff}$|) for the six LSST bands from the OpSim baseline v3.3. The different colours and line styles indicate 1, 3, and 5-yr observations, as shown by the legend.
2.2 Roman–Rubin simulation (DiffSky)
In order to investigate the impact of varying survey conditions on photo-z for LSST, we need a simulated truth catalogue that is complete to beyond the LSST 10-yr depth and realistic in colour-redshift space. For this purpose, we use the joint Roman–Rubin simulation v1.1.3. This simulation is an extension of the effort in Troxel et al. (2023), but with many improvements, including self-consistent, flexible galaxy modelling. The simulation is based on its precursor, CosmoDC2 (Korytov et al. 2019), a synthetic sky catalogue out to |$z=3$| built from the ‘Outer Rim’ N-body cosmological simulation (Heitmann et al. 2019). The N-body simulation contains a trillion particles with a box size of |$(4.225 {\rm \, Gpc})^2$|. The galaxies are simulated with Diffsky,4 based on two differentiable galaxy models: Diffstar (Alarcon et al. 2023) and differentiable stellar population synthesis (DSPS; Hearin et al. 2023). Using Diffstar, one can build a parametric model that links galaxy star formation history with physical parameters in halo mass assembly. Then, with DSPS, one can calculate the SED and photometry of a galaxy as a function of its star formation history, metallicity, dust, and other properties. The advantage of this galaxy model is that the distribution in colour-redshift is smooth and more realistic compared to that in CosmoDC2. This is thanks to the separate modelling for different galaxy components, i.e. bulge, disc, and star-forming regions. The spectral energy distributions (SEDs) built from these different components with different stellar populations makes the colours more realistic for photo-z estimation. The calibration of the Roman–Rubin simulation galaxy colours as a function of redshift matches that of the COSMOS2020 sample (Weaver et al. 2022), although some evidence of a low amount of variance in the near-infrared (NIR) colours at |$z\gt 1$| is obvious. For more details of the Roman–Rubin DiffSky simulation, see the DESC Note by Troxel et al. (in preparation).
We randomly subsample the full simulated catalogue to |$N=10^6$| objects complete to |$i\lt 26.5$| as our truth sample. For each object, we obtain its magnitude in the six LSST bands, true redshift, bulge size |$s_b$|, disc sizes |$s_d$|, bulge-to-total ratio |$f_b$|, and ellipticity e. We obtain the galaxy semimajor and semiminor axes, |$a,b$| via |$a=s/\sqrt{q}$| and |$b= s\sqrt{q}$|, where s is the weighted size of the galaxy, |$s= s_bf_b + s_d(1-f_b)$|, and q is the ratio between the major and minor axes, related to ellipticity via |$q=(1-e)/(1+e)$|.
One caveat of the current sample is that, at |$z\gt 1.5$|, there is an exaggerated bimodal distribution in the |$g-r$| colour and redshifts, which is not found in real galaxy data. As a result, the bluest objects in the sample are almost always found at high redshifts. This could be due to the high-redshift SPS models being less well constrained. One direct consequence of this is that, when training a machine learning algorithm to estimate the photo-z, the high-redshift performance may be too optimistic due to this colour-space clustering.
3 METHODS
This section describes our methodology for generating a mock LSST photometry catalogue for Y1, Y3, and Y5, applying photometric redshift estimation algorithms, and defining metrics to assess the impact of variable depth. Specifically, we describe the degradation process using the LSST error model in Section 3.1, the two photo-z estimators, BPZ and FlexZBoost, in Section 3.2, the tomographic binning strategy in Section 3.3, and the relevant metrics Section 3.4.
3.1 Degradation of the truth sample
Given a galaxy with true magnitudes |$m_{\rm t}=\lbrace ugrizy\rbrace$| falling in a healpix pixel within the footprint, we ‘degrade’ its magnitude with observing conditions associated with that pixel, and assign a set of ‘observed’ magnitudes |$m_{\rm o}$| and the associated magnitude error |$\sigma _{m,{\rm o}}$|, using the following procedure: (1). Apply galactic extinction. (2). Compute the point-source magnitude error for each object in each filter, using the LSST error model detailed in Ivezić et al. (2019). (3). Compute the correction to obtain the extended-source magnitude errors. (4). Sample from the error and add it to the true magnitudes. Steps (2)–(4) are carried out using the python package photerr5 (Crenshaw et al. 2024). We detail each step below.
First, we apply the galactic extinction to each band with the |$E(B-V)$| dust map (Green 2018) via:
where for each of the six LSST filters we adopt |$[A_{\lambda }/E(B-V)]=\lbrace 4.81,3.64,2.70,2.06,1.58,1.31\rbrace$|.
Then, we utilize the LSST error model (Ivezić et al. 2019) to compute the expected magnitude error, |$\sigma _m$|, per band. The magnitude error is related to the noise-to-signal, nsr, via:
The total nsr consists of two components:
where |${\rm nsr}_{\rm sys}$| is the systematic error from the instrument read-out and |${\rm nsr}_{\rm rand}$| is the random error arising from observing conditions on the sky, for extended objects. Notice that in the high signal-to-noise limit where |${\rm nsr}\ll 1$|, |$\sigma _m\sim {\rm nsr}$|, and equation (3) recovers the form in Ivezić et al. (2019). Throughout the paper, we set |${\rm nsr_{\rm sys}}\approx \sigma _{\rm sys}=0.005$|, which corresponds to the maximum value allowed from the LSST requirement. For point sources, the random component of nsr is given by
where |$\gamma$| is a parameter that depends on the system throughput. We adopt the default values from Ivezić et al. (2019), |$\gamma =\lbrace 0.038, 0.039, 0.039, 0.039, 0.039, 0.039\rbrace$| for |$ugrizy$|. x is a parameter that depends on the magnitudes of the object, m, and the corresponding coadd |$5\sigma$| depth, |$m_5$|, in that band:
For extended sources, we adopt the expression in Kuijken et al. (2019); van den Busch et al. (2020), where the nsr receives an additional factor related to the ratio between the angular size of the object and that of the PSF:
Here,
where |$\theta _{\rm FWHM}^{\rm eff}$| is the effective FWHM seeing (it is linked to the seeing by |$\theta _{\rm FWHM}^{\rm eff} = \theta _{\rm FWHM} X^{0.6}$|, where X is the airmass) for a given LSST band. The AP angular size of the object is given by
where |$a, b$| are the galaxy semimajor and minor axis. We make one modification to equation (6), where we replace the denominator by the mean PSF area, |$\sqrt{\langle A_{\rm psf}\rangle }$|, averaged over pixels in the i-band quantiles which we will elaborate shortly. In the approximation that |${\rm nsr}_{\rm rand,pt}\propto x$|, the point-source noise is then proportional to |$\theta _{\rm FWHM}^{\rm eff}$| [see equation (A1)], and so for the extended-source noise, |$\theta _{\rm FWHM}^{\rm eff}$| cancels and equation (6) effectively changes the dependence of |$m_5$| on PSF size to that on the extended aperture size. However, in this work, we utilize the median seeing, for which the cancellation may not be exact. Naively taking equation (6) could lead to unrealistic cases, where, at fixed depth, |${\rm nsr}_{\rm rand,ext}$| increases with a better seeing. We have tested both scenarios, i.e. using individual |$A_{\rm psf}$| or the mean |$\langle A_{\rm psf} \rangle$| in equation (6), and find negligible difference for our main conclusion in the i-band quantiles. However, it does make a significant difference if one were to bin the samples by quantiles of seeing, as investigated in Appendix D.
To obtain the observed magnitudes |$m_{\rm o}$|, we degrade in flux space, |$f_{\rm o}$|, by adding a random noise component |$\Delta f$| drawn from a normal error distribution, |$\Delta f\sim \mathcal {N}(0,{\rm nsr})$|, to the reddened flux |$f_{\rm dust}$| of the object. Here, nsr is computed by setting |$m=m_{\rm dust}$| in equation (5). The flux and magnitude are converted back and forth via
Negative fluxes are set as ‘non-detection’ in that band. The corresponding magnitude error |$\sigma _{m,{\rm o}}$| is computed using equation (2) and setting |$m=m_{\rm o}$| in equation (5), such that the error de-correlates with the observed magnitude.
To focus on the trend in the depth variation in the detection band, we subdivide pixels in the survey footprint into 10 quantiles in i-band |$m_5^{\rm ex}$|, where the first quantile (|${\rm qtl}=0$|) contains the shallowest pixels, and the last quantile (|${\rm qtl}=9$|) contains the deepest. Table 1 shows the mean and standard deviation of each i-band depth quantile. We also show in Table D1 the mean and standard deviation of all other survey condition maps used in the analysis in each of the i-band depth quantiles. Within each quantile, we randomly assign each galaxy to a healpix pixel in that quantile, with its associated observing conditions |$\lbrace E(B-V), m_5^{\rm ex}, \theta _{\rm FWHM}^{\rm eff}\rbrace$| on that pixel for each LSST band, from the OpSim MAF maps. Then, we carry out the above degradation process to our truth sample. On average, each pixel within each quantile is assigned 121 galaxies. Notice that there are many other parameters that could affect the photometric errors, e.g. sky background, exposure time, and atmospheric extinction. Following Ivezić et al. (2019), because these quantities only contribute towards |$m_5$|, we do not include them otherwise in the degradation, and assume that |$m_5^{\rm ex}$| completely captures their variation. Additionally, we explore the relation between |$m_5$| and these extended quantities using OpSim in Appendix A, and we explore the galaxy redshift distribution dependence with other survey properties in Appendix D.
The mean and standard deviation of the i-band extinction-corrected coadd depth, |$m_5^{\rm ex}$|, split in 10 quantiles, from the Rubin OpSim baseline v3.3 map with |$N_{\rm side}=128$|, for year 1, 3, and 5, respectively.
qtl (i-band |$m_5^{\rm ex}$|) . | Y1 . | Y3 . | Y5 . |
---|---|---|---|
0 | |$24.95\pm 0.10$| | |$25.46\pm 0.12$| | |$25.75\pm 0.10$| |
1 | |$25.10\pm 0.03$| | |$25.64\pm 0.03$| | |$25.89\pm 0.02$| |
2 | |$25.17\pm 0.02$| | |$25.72\pm 0.02$| | |$25.96\pm 0.02$| |
3 | |$25.22\pm 0.01$| | |$25.78\pm 0.02$| | |$26.01\pm 0.01$| |
4 | |$25.27\pm 0.01$| | |$25.83\pm 0.02$| | |$26.06\pm 0.01$| |
5 | |$25.31\pm 0.01$| | |$25.88\pm 0.01$| | |$26.10\pm 0.01$| |
6 | |$25.35\pm 0.01$| | |$25.93\pm 0.01$| | |$26.14\pm 0.01$| |
7 | |$25.39\pm 0.01$| | |$25.99\pm 0.02$| | |$26.18\pm 0.01$| |
8 | |$25.44\pm 0.02$| | |$26.06\pm 0.03$| | |$26.23\pm 0.02$| |
9 | |$25.53\pm 0.05$| | |$26.18\pm 0.05$| | |$26.33\pm 0.04$| |
qtl (i-band |$m_5^{\rm ex}$|) . | Y1 . | Y3 . | Y5 . |
---|---|---|---|
0 | |$24.95\pm 0.10$| | |$25.46\pm 0.12$| | |$25.75\pm 0.10$| |
1 | |$25.10\pm 0.03$| | |$25.64\pm 0.03$| | |$25.89\pm 0.02$| |
2 | |$25.17\pm 0.02$| | |$25.72\pm 0.02$| | |$25.96\pm 0.02$| |
3 | |$25.22\pm 0.01$| | |$25.78\pm 0.02$| | |$26.01\pm 0.01$| |
4 | |$25.27\pm 0.01$| | |$25.83\pm 0.02$| | |$26.06\pm 0.01$| |
5 | |$25.31\pm 0.01$| | |$25.88\pm 0.01$| | |$26.10\pm 0.01$| |
6 | |$25.35\pm 0.01$| | |$25.93\pm 0.01$| | |$26.14\pm 0.01$| |
7 | |$25.39\pm 0.01$| | |$25.99\pm 0.02$| | |$26.18\pm 0.01$| |
8 | |$25.44\pm 0.02$| | |$26.06\pm 0.03$| | |$26.23\pm 0.02$| |
9 | |$25.53\pm 0.05$| | |$26.18\pm 0.05$| | |$26.33\pm 0.04$| |
The mean and standard deviation of the i-band extinction-corrected coadd depth, |$m_5^{\rm ex}$|, split in 10 quantiles, from the Rubin OpSim baseline v3.3 map with |$N_{\rm side}=128$|, for year 1, 3, and 5, respectively.
qtl (i-band |$m_5^{\rm ex}$|) . | Y1 . | Y3 . | Y5 . |
---|---|---|---|
0 | |$24.95\pm 0.10$| | |$25.46\pm 0.12$| | |$25.75\pm 0.10$| |
1 | |$25.10\pm 0.03$| | |$25.64\pm 0.03$| | |$25.89\pm 0.02$| |
2 | |$25.17\pm 0.02$| | |$25.72\pm 0.02$| | |$25.96\pm 0.02$| |
3 | |$25.22\pm 0.01$| | |$25.78\pm 0.02$| | |$26.01\pm 0.01$| |
4 | |$25.27\pm 0.01$| | |$25.83\pm 0.02$| | |$26.06\pm 0.01$| |
5 | |$25.31\pm 0.01$| | |$25.88\pm 0.01$| | |$26.10\pm 0.01$| |
6 | |$25.35\pm 0.01$| | |$25.93\pm 0.01$| | |$26.14\pm 0.01$| |
7 | |$25.39\pm 0.01$| | |$25.99\pm 0.02$| | |$26.18\pm 0.01$| |
8 | |$25.44\pm 0.02$| | |$26.06\pm 0.03$| | |$26.23\pm 0.02$| |
9 | |$25.53\pm 0.05$| | |$26.18\pm 0.05$| | |$26.33\pm 0.04$| |
qtl (i-band |$m_5^{\rm ex}$|) . | Y1 . | Y3 . | Y5 . |
---|---|---|---|
0 | |$24.95\pm 0.10$| | |$25.46\pm 0.12$| | |$25.75\pm 0.10$| |
1 | |$25.10\pm 0.03$| | |$25.64\pm 0.03$| | |$25.89\pm 0.02$| |
2 | |$25.17\pm 0.02$| | |$25.72\pm 0.02$| | |$25.96\pm 0.02$| |
3 | |$25.22\pm 0.01$| | |$25.78\pm 0.02$| | |$26.01\pm 0.01$| |
4 | |$25.27\pm 0.01$| | |$25.83\pm 0.02$| | |$26.06\pm 0.01$| |
5 | |$25.31\pm 0.01$| | |$25.88\pm 0.01$| | |$26.10\pm 0.01$| |
6 | |$25.35\pm 0.01$| | |$25.93\pm 0.01$| | |$26.14\pm 0.01$| |
7 | |$25.39\pm 0.01$| | |$25.99\pm 0.02$| | |$26.18\pm 0.01$| |
8 | |$25.44\pm 0.02$| | |$26.06\pm 0.03$| | |$26.23\pm 0.02$| |
9 | |$25.53\pm 0.05$| | |$26.18\pm 0.05$| | |$26.33\pm 0.04$| |
Finally, we apply an i-band magnitude cut corresponding to the LSST Gold sample selection on the degraded catalogue. For the full 10-yr sample this is defined as |$i\lt 25.3$|. For data with an observation period of |$N_{\rm yr}$| yr, we adjust the gold cut to |$i_{\rm lim}=25.3 + 2.5\log _{10}(\sqrt{N_{\rm yr}/10})$|. Thus for Y1, Y3, and Y5, we adopt the following gold cuts respectively: |$i_{\rm lim}=24.0, 24.6, 24.9$|. Notice that this is slightly shallower than the definition in the DESC SRD, where the Gold cut is defined as one magnitude shallower than the median coadd |$m_5$|. This is due to the fact that OpSim baseline v3.3 has a slightly deeper i-band depth in early years compared to previous expectations. For Y1, the median i-band |$m_5^{\rm ex}$| is |$\sim 25.2$|, giving a DESC SRD Gold cut to be 0.2 mag deeper than what we adopt here. Additionally, for our fiducial sample, we also apply a signal-to-noise cut in i-band: |${\rm SNR}=1/{\rm nsr}\ge 10$|, although we also look at the case with the full sample. This cut is motivated by the selection of the source sample, where shape measurements typically require a high SNR detection in i-band. In this work, we apply this cut to both the weak lensing and clustering samples.
3.2 Photo-z estimators
Methods for photometric redshift estimation can be broadly divided into two main categories: template-fitting and machine learning. Template fitting methods assume a set of SED templates for various types of galaxies, and use these to fit the observed magnitudes of the targets. Machine learning methods, on the other hand, use machine learning algorithms trained on a reference sample, to infer the unknown target redshifts. See Schmidt et al. (2020) for a review and comparison of the performance of various photo-z estimators in the context of Rubin LSST. In this work, we adopt two algorithms with reasonable performance, a template-fitting method, BPZ (Bayesian photometric redshifts), and a machine learning method, FlexZBoost. In this work, before applying these redshift estimators, all observed magnitudes are de-reddened, by applying the inverse of equation (1).
3.2.1 BPZ (Bayesian photometric redshifts)
BPZ (Benítez 2000; Coe et al. 2006) is a template-based photometric estimation code. Given a set of input templates |$\mathbf {t}$|, BPZ computes the joint likelihood |$P(z,\mathbf {t})$| for each galaxy with redshift z. A prior |$P(z,\mathbf {t}|m)$| is included based on the observed magnitude of the galaxy m. For example, the prior restricts bright, elliptical galaxies to lower redshifts. For each galaxy, a likelihood |$P(z,\mathbf {t}|c, m)$| given the galaxy’s colour c and magnitude is computed, and by marginalizing over the templates, one obtains the per-object redshift probability |$P(z)$|.
We use the RAIL interface of the BPZ algorithm, with the list of SED templates adopted in Coe et al. (2006): the CWW+SB4 set introduced by Benítez (2000), the El, Sbc, Scd & Im from Coleman, Wu & Weedman (1980), the SB2 & SB3 from Kinney et al. (1996), and the 25 & 15 Myr ‘SSP’ from Bruzual & Charlot (2003). We set the primary observing band set to i-band, and adopt the prior from the original BPZ paper (Benítez 2000), which was used to fit data from the Hubble Deep Field North (HDF-N; Williams et al. 1996). Notice that these set of SEDs may be different from that in the Roman–Rubin simulation, and the prior distributions may not match exactly. The prior mismatch would only affect samples with low signal-to-noise ratio and hence those posteriors are prior-dominated. For the gold sample considered in this paper, the impact of the prior on the mean difference and scatter of the true and photometric redshifts is expected to be small, although galaxies with broad or bimodal posteriors may end up having a different point estimate (e.g. mode), hence the outlier rate could be slightly higher. We do not include extra SED templates here. The SED template colours are able to cover the range of colours in the Roman–Rubin simulation, as shown in Appendix B.
Additionally we compute the odds parameter, defined as
where |$z_{\rm mode}$| is the mode of |$P(z)$|, and |$\Delta z =\epsilon (1+z_{\rm mode})$| defines an interval around the mode to integrate |$P(z)$|. The maximum value of odds is 1, which means that the probably density is entirely enclosed within the integration range around the mode, whereas a small odds means that the probability density is diffuse given the range. Hence, odds denotes the confidence of the BPZ redshift estimation, and the choice of |$\epsilon$| essentially sets the criteria. The |$(1+z_{\rm mode})$| factor accounts for the fact that larger redshift errors are expected at higher redshifts. We choose |$\epsilon =0.06$| as a nominal photo-z scatter, and we use odds as a BPZ ‘quality control’, where a subsample is selected with |${\rm odds}\ge 0.9$|, as comparison to the baseline sample.
3.2.2 FlexZBoost
FlexZBoost (Dalmasso et al. 2020; hereafter FZBoost) is a machine-learning photo-z estimator based on FlexCode (Izbicki & Lee 2017), a conditional density estimator (CDE) that estimates the conditional probability density |$p(y|\mathbf {x})$| for the response or parameters, y, given the features |$\mathbf {x}$|. The algorithm uses basis expansion of univariate y to turn CDE to a series of univariate regression problems. Given a set of orthonormal basis functions |$\lbrace \phi _i(y) \rbrace _i$|, the unknown probability density can be written as an expansion:
The coefficients |$\beta _j(\mathbf {x})$| can be estimated by a training set |$(\mathbf {x},y)$| using regression. The advantage of FlexCode is the flexibility to apply any regression method towards the CDE. The main hyperparameters involved in training is the number of expansion coefficients and those associated with the regression. Schmidt et al. (2020) found that FZBoost was among the strongest performing photo-z estimators according to the established performance metrics.
In this paper, we utilize the RAIL interface of the FZBoost algorithm with its default training parameters. We construct the training sample by randomly drawing 10 per cent of the degraded objects from each of the deciles, and train each year separately. Notice that this training sample is fully representative of the test data, which is not true in practice. Spectroscopic calibration samples typically have a magnitude distribution that is skewed towards the brighter end, and the selection in colour space can be non-trivial depending on the specific data set used. Although there are methods to mitigate impacts from this incompleteness, such as re-weighting in redshift or colours (Lima et al. 2008), and, more recently, using training data augmentation from simulations (Moskowitz et al. 2024), the photo-z performance is not comparable to having a fully representative sample, and one would expect some level of bias and increased scatter depending on the mitigation method adopted. Here, we are interested in whether our results on the non-uniformity impact changes significantly with an alternative photo-z algorithm. We thus leave the more realistic and sophisticated case with training sample imperfection to future work.
3.2.3 Performance
For both photo-z estimators, we use the mode of the per-object redshift probability, |$P(z)$|, as the point estimate, |$z_{\rm phot}$|. Fig. 3 shows the scatter in spec-z and photo-z for Y1, Y3, and Y5 with BPZ and FZBoost redshifts, for the shallowest (|${\rm qtl}=0$|) and the deepest (|${\rm qtl}=9$|) quantiles in the i-band |$m_5^{\rm ex}$| respectively. The scatter is always larger for the shallower sample in the full sample case (faint dots). This is expected following equations (2) and (4), given that the coadd depths in each band are strongly correlated. At fixed magnitude, the larger the |$m_5$|, the smaller the photometric error, hence also the smaller the scatter in photo-z. The signal-to-noise cut at |${\rm SNR}\ge 10$| removes some extreme scatter as well as objects from the highest redshifts. This is more obvious for the shallowest sample compared to the deepest, due to the better signal-to-noise ratio for the deepest sample at high redshifts.

Photo-z versus true redshifts for the sample degraded with Rubin OpSim baseline v3.3 observing conditions using BPZ (left two columns) and FZBoost (right two columns) mode as the photo-z point estimator. For each photo-z method, we show sample degraded with pixels containing the shallowest 10 per cent i-band Coadded depth with galactic extinction (|${\rm qtl}=0$|), and that from the deepest 10 per cent (|${\rm qtl}=9$|). This is repeated for the cases of Y1, Y3, and Y5 observing conditions with respective gold cut in i-band applied. The faint dots show all the samples included within the gold cut, whereas the solid contour shows the samples (90 per cent contour) with |${\rm SNR}\ge 10$| (fiducial). In the BPZ case, the dashed lines show the 90 per cent contour for the sample with an additional selection of |${\rm odds}\ge 0.9$|. In the FZBoost case, the model is trained on a perfectly representative sample for each observation year.
There is a significant group of outliers in the BPZ case that are at low redshifts but are estimated to be at |$z\gt 2$|, highlighted by the blue contours. By examining individual BPZ posteriors for this group, we find that these objects tend to have very broad or bimodal redshift distributions. This could be a result of confusion between the Lyman break and the 4000 Å Balmer break, and notice that the fraction of this population as well as its location can be influenced by the choice of the BPZ priors. Another possible cause is the spurious bimodal distribution in the colour-redshift space in the Roman–Rubin simulation, as mentioned in Section 2.2. We see that after applying a strict cut with |${\rm odds}\ge 0.9$|, shown by the purple dashed lines enclosing 90 per cent of the sample, the outlier populations are significantly reduced, as expected. This cut retains 20.4 per cent (27.7 per cent), 25.7 per cent (44.4 per cent), 29.5 per cent (44.0 per cent) of the |${\rm SNR}\ge 10$| sample in |${\rm qtl}=0$| (9) for Y1, Y3, and Y5, respectively. We see that this cut further reduces the scatter at |$z_{\rm phot}\sim 1.5$|. FZBoost in general shows a much better performance, given that the training data is fully representative of the test data. Table C1 summarizes these findings for each sample via a few statistics of the distribution of the difference between photo-z and true redshifts: |$\Delta z=(z_{\rm phot}-z_{\rm true})/(1+z_{\rm true})$|. Namely, the median bias |${\rm Median}(\Delta z)$|, the standard deviation, the normalized median absolute deviation (NMAD) |$\sigma _{\rm NMAD}=1.48\, {\rm Median}(|\Delta z|)$|, and the outlier fraction with outliers defined as |$|\Delta z|\gt 0.15$|.
Notice that the odds cut could introduce bias to the galaxy distribution. Given that the relation between photometry and the redshift PDF shape that influences odds is highly complex and non-linear, the odds can be correlated with both galaxy type and redshift. For cosmological analysis, the imposed selection in galaxy type is not a great concern as long as the |$n(z)$| is accurately determined, and the galaxy sample is uniformly distributed spatially. A potential worry is that a spatial variation in the galaxy bias is introduced, or that the bias evolution is changed, due to the odds cut. This would have to be tested out in a large cosmological simulation that includes both realistic photometry and clustering information, which we leave to future work.
3.3 Tomographic bins
In weak lensing analysis, the full galaxy catalogue is sub-divided into a ‘lens’ sample and a ‘source’ sample. The lens sample is often limited at lower redshifts, acting as a tracer of the foreground dark matter field which ‘lenses’ the background galaxies. The source sample contains the background galaxies extending to much higher redshifts, whose shapes are measured precisely to construct the shear catalogue. The two samples together allow measurement of the so-called ‘3|$\times$|2 pt statistics’, including galaxy clustering from the lens sample, galaxy–galaxy lensing from the lens galaxies and source shapes, and cosmic shear from the source shapes alone. Additionally, both the lens and source samples are divided into several tomographic bins, i.e. subsamples separated with sufficient distinction in redshifts. This further includes evolution information that improves cosmological constraints.
We adopt the Y1 tomographic bin definitions in the DESC SRD for all of our samples. The lens sample has five bins equally spaced in |$0.2\lt z\lt 1.2$|, with bin width |$\Delta z =0.2$|, and bin edges defined using |$z_{\rm phot}$|. For source samples, the DESC SRD requires five bins with equal number of galaxies. To do so, we first combine the 10 depth quantiles, and then split the sample into five |$z_{\rm phot}$| quantiles.
Notice that in practice, tomographic binning can be determined in different ways, often with the aim of maximizing the signal-to-noise of the two-point measurements. In some cases, clustering algorithm, e.g. random forest, rather than a photo-z estimator, is used to separate samples into broad redshift bins. We refer the interested readers to Zuntz et al. (2021) for explorations of optimal tomographic binning strategies for LSST. Notice also that, following the DESC SRD, we do not apply additional magnitude cuts for the lens sample. This is done, for example, for the DES Y3 MagLim lens sample, where a selection of |$i\gt 17.5$| and |$i\lt 4z_{\rm phot}+18$| is applied (Porredon et al. 2022). These cuts are applied to reduce faint, low-redshift galaxies in the lens sample, such that the photometric redshift calibration is more robust. Notice that if the lens samples are selected with a brighter cut, one would expect a different and likely reduced depth variation. We explore this particular case in Appendix E.
Fig. 4 shows the normalized true redshift distribution, |$p(z)$|, of the lens and source tomographic bins for Y3 as an example, split by the BPZ redshifts (with or without odds selection) and the FZBoost redshifts. The dashed lines show the |$p(z)$| measured from the shallowest samples, whereas the solid lines show that from the deepest samples. The BPZ case shows more extended tails in each tomographic bin compared to the FZBoost case, and for the source galaxies, a noticeable outlier population at low redshifts in the highest tomographic bin. We see that in most cases, there is a clear difference in |$p(z)$| between the shallow and the deep samples: the deep samples seem to shrink the tails, making |$p(z)$| more peaky towards the mean redshift (although this is not the case for the |${\rm odds}\ge 0.9$| sample), and their |$p(z)$| seems to shift towards higher redshift at the same time. To quantify these changes, we define metrics for the impact of variable depth below.

True redshift distribution for tomographic bins as defined in the DESC SRD for lens (left column) and source galaxies (right column) in Y3. The tomographic bins are determined using the mode of BPZ redshifts (first two rows) and FZBoost (last row). In all cases, the sample has been applied a gold cut |$i\lt 24.6$| and |${\rm SNR}\ge 10$|. The middle row shows the sample selected with an additional cut with |${\rm odds}\ge 0.9$|. The dashed lines show samples degraded using the shallowest 10 per cent pixels in i-band coadd depth (|${\rm qtl}=0$|), and the solid lines show those from the deepest 10 per cent (|${\rm qtl}=9$|).
3.4 Metrics for impact of variable depth
The first metric is the variation in the number of objects in each sample, |$N_{\rm gal}$|, as a function of the coadd i-band depth. This is the most direct impact of varying depth: deeper depth leads to more detection of objects within the selection cut. The result is that the galaxy density contrast, |$\delta _g(\boldsymbol{\theta })=[N(\boldsymbol{\theta })-\bar{N}]/\bar{N}$|, where |$N(\boldsymbol{\theta })$| is the per-pixel number count at pixel |$\boldsymbol{\theta }$|, and |$\bar{N}$| is the mean count over the whole footprint, fluctuates according to the depth variation, leading to a spurious clustering signal in the two-point statistics. To quantify the relative changes, we measure the average number of objects per tomographic bin across all 10 depth quantiles, |$\bar{N}_{\rm gal}=\sum _i N_{{\rm gal},i}w_i$|, where |$i=1,..,10$| denotes the depth bin, and |$w_i\sim 0.1$| is the weight proportional to the number of pixels in that quantile. We quote the change of object number in terms of |$N_{\rm gal}/\bar{N}_{\rm gal}$|.
The second metric quantifies the mean redshift of the tomographic bin as a function of depth:
where |$p(z)$| is the true redshift distribution of the galaxy sample in the tomographic bin with normalization |$\int p(z)\, {\rm d}z=1$|. Weak lensing is particularly sensitive to the mean distance to the source sample: the lensing kernel thus differs on patches with different depth. Here, we look at the difference between the mean redshift |$\langle z \rangle _i$| of depth quantile i and that of the full sample, |$\langle z \rangle _{\rm tot}$|, i.e. |$\Delta \langle z \rangle \equiv \langle z \rangle _i-\langle z \rangle _{\rm tot}$|. More specifically, we look at the quantity |$\Delta \langle z \rangle /(1+\langle z \rangle _{\rm tot})$|, where the weighting accounts for the increase in photo-z error towards higher redshifts. This format also allows us to compare with the DESC SRD requirements.
The third metric quantifies the width of the tomographic bin. This is not a well-defined quantity because the |$p(z)$| in many cases deviate strongly from a Gaussian distribution. One could use the variance, or the second moment of the redshift distribution:
However, this quantity is very sensitive to the tails of the distribution: larger tails of |$p(z)$| increases |$\sigma _z$|, even if the bulk of the distribution does not change much. In our case, the width of the tomographic bin is most relevant for galaxy clustering measurements: the smaller the bin width, the larger the clustering signal. Specifically, in the Limber approximation, the galaxy autocorrelation angular power spectrum is given by
where |$\ell$| is the degree of the spherical harmonics, |$\chi$| is the comoving distance, |$H(z)$| is the expansion rate at redshift z, c is the speed of light, k is the 3D wave vector, and |$P_{gg}$| is the 3D galaxy power spectrum. Assuming that within the tomographic bin, the redshift evolution of galaxy bias is small, and all other functions can be approximated at the mean value at the centre of the bin, the clustering signal is proportional to the integral of the square of the galaxy redshift distribution, |$p(z)$|. This assumption breaks down if the tomographic bin width is broad, for instances, the combination of all five lens bins. Hence, we define the following quantity:
as the LSS diagnostic metric, which corresponds to changes of the two-point angular power spectrum kernel with respect to changes in |$p(z)$|. This is a useful complement to the second moment, |$\sigma _z$|, because |$\sigma _z$| can be sensitive to the tails of the |$p(z)$| distribution caused by a small population of outliers in photo-z; however, the impact of this population could be small for galaxy clustering, which is characterized by |$W_z$|. For both of these quantities, we look at the ratio with the overall sample combining all depth quantiles. We show all the mean metric quantities in each tomographic bin and each quantile for Y1, Y3, and Y5 in Table C2 for BPZ and Table C3 for FZBoost.
Notice that for the |$p(z)$|-related quantities, we have used the true redshifts, but in practice, these are not accessible. Rather, unless one uses a Bayesian hierarchical model such as CHIPPR (Malz & Hogg 2022), one only has access to the calibrated redshift distribution |$p_c(z)$| against some calibration samples via, e.g. a self-organizing map (SOM), which is itself associated with bias and uncertainties that can be impacted by varying depth. The case we present here thus is idealized, where the calibration produces the perfect true |$p(z)$|. This allows us to propagate the actual impact of varying depth on |$p(z)$| to the |$3\times 2$| pt data vector, but does not allow us to assess the bias at the level of modelling due to using an ‘incorrect’ |$p_c(z)$| that is affected also by the varying depth. We leave this more sophisticated case to future work.
4 RESULTS
This section presents our results on the impact of variable depth via three metrics: the number of objects (Section 4.1), mean redshift of the tomographic bin (Section 4.2), and the width of the tomographic bin (Section 4.3).
4.1 Number of objects
Fig. 5 shows the change in the number of objects, |$N_{\rm gal}$|, as a function of the i-band extinction-corrected coadd depth, |$m_5^{\rm ex}$|, compared to the overall mean, for lens and source tomographic bins in Y1, Y3, and Y5. In general, we find an approximately linear increase of number of objects as the i-band depth increases, with the higher two redshift bins showing the most extreme variation. For the lower redshift bins, the variation can be |$\sim 10~{{\rm per\ cent}}$| compared to the mean value, whereas for bin 5, the variation can be as large as |$\sim 40~{{\rm per\ cent}}$|. The trend does not seem to change much at different observing years. This is the result of the i-band gold cut and the high SNR selection. The scatter in magnitudes is larger for the shallower sample, hence given a magnitude cut, the shallower sample will have fewer objects. At fixed magnitude, the deeper objects have larger SNR, resulting in more faint galaxies surviving the SNR cut. Given that the gold cut and SNR at given magnitude evolve with depth in the observation year, we expect the trend to be similar across Y1 to Y5. It is interesting to see also that per tomographic bin, the trends for baseline BPZ and FZBoost are similar, despite having quite different features in the photo-z versus spec-z plane. The variation between bins 1–4 is slightly larger in the BPZ case. For the BPZ redshifts, the inclusion of the odds selection increases the variation in object number, especially in the highest redshift bin. The steeper slope might be due to the fact that, objects with larger photometric error from the shallower regions are likely to result in a poorer fit, leading to a smaller |${\rm odds}$| value. Hence, the |${\rm odds}\ge 0.9$| selection removes more objects from the shallower compared to the baseline case.

The number of galaxies in tomographic bins as a function of the i-band extinction-corrected coadd depth, |$m_5^{\rm ex}$|, for Y1, Y3, and Y5. The number is normalized by the average number of objects combing all quantiles for each tomographic bin, |$\bar{N}_{\rm gal}$|. The tomographic bins are determined using the mode of BPZ redshifts (left two columns) and FZBoost (right two columns). For each redshift estimator, both lens and source galaxy samples are shown, with the gold cut and |${\rm SNR}\ge 10$|. In the BPZ case, we also show the sample with |${\rm odds}\ge 0.9$| in squares with dashed lines. The vertical solid and dashed lines marks the |$1\sigma$| and |$2\sigma$| regions of the depth distribution.
4.2 Mean redshift
Fig. 6 shows the variation in the mean redshift of the tomographic bin, |$\langle z \rangle$|, as a function of the i-band extinction-corrected coadd depth, |$m_5^{\rm ex}$|, for lens and source samples in Y1, Y3, and Y5. In general, |$\langle z \rangle$| increases with the i-band coadd depth. This is expected as more faint, high redshift galaxies that are scattered within the magnitude cut are included in the deeper sample, resulting in an increased high redshift population. In general, the slope of this relation is similar across tomographic bins for both lens and source samples, with a variation of |$|\Delta z /(1+\langle z \rangle)|\sim 0.005-0.01$|. This is not true for bin 5 in the source sample, where the variation with depth is noticeably larger. This could be explained by this bin containing objects with the highest |$z_{\rm phot}$|, which are also most susceptible to scatter in the faint end and outliers in the photo-z estimators. This trend becomes more extreme from Y1 to Y5. By reducing outliers with the BPZ odds cut, the variation in source bin 5 is slightly reduced, although still higher than the nominal level. There are some difference between the BPZ and FZBoost cases: the slope slightly grows from Y1 to Y5 in the BPZ case, whereas it stays consistent in the FZBoost case, but the two cases converge in Y5. On the same figure, we mark the DESC SRD requirements for photo-z as a dark grey band at |$\Delta z /(1+\langle z \rangle)=\pm 0.002$| and a light grey band at |$\Delta z /(1+\langle z \rangle)=\pm 0.005$|. The shifts in mean redshift reach the limit of the requirements for Y1, and exceeds the requirement for Y10.

The change in mean redshift in each tomographic bin as a function of the i-band extinction-corrected coadd depth, |$m_5^{\rm ex}$|, for Y1, Y3, and Y5. The difference in mean redshift, |$\Delta z$|, between a given quantile and the combined sample |$\langle z \rangle$|, is normalized by |$1/(1+\langle z \rangle)$| to account for expected larger uncertainties at higher redshifts. The fainter and darker grey bands marks |$\pm 0.005$| and |$\pm 0.002$|, corresponding to the DESC SRD requirements for Y1 large-scale structure and weak lensing science. The tomographic bins are determined using the mode of BPZ redshifts (left two columns) and FZBoost (right two columns). For each redshift estimator, both lens and source galaxy samples are shown, with the gold cut and |${\rm SNR}\ge 10$|. In the BPZ case, we also show the sample with |${\rm odds}\ge 0.9$| in squares with dashed lines. The vertical solid and dashed lines marks the |$1\sigma$| and |$2\sigma$| regions of the depth distribution.
4.3 Width of the tomographic bin
Fig. 7 shows the change in the tomographic bin width parameters, |$\sigma _z$| and |$W_z$|, as defined in Section 3.4 for the lens galaxies as a function of the i-band extinction-corrected coadd depth, |$m_5^{\rm ex}$|, in Y1, Y3, and Y5. The width of the tomographic bin can change with depth due to the scatter in the photo-z versus spec-z plane. For example, a deeper sample may have a smaller scatter for the bulk of the sample, but include fainter objects that could result as outliers, resulting a more peaked distribution at the centre with pronounced long tails.

The relative change in the width of the lens tomographic bin as a function of the i-band extinction-corrected coadd depth, |$m_5^{\rm ex}$|, for Y1, Y3, and Y5. The left two columns show the second moment of the normalized redshift distribution, |$\sigma _z$|, in each quantile normalized by that of all quantiles combined, |$\sigma _z^{\rm tot}$|, for each tomographic bin. The right two columns show the LSS diagnostic parameter, |$W_z$|, as defined in equation (15), for each quantile normalized by all quantiles combined |$W_z^{\rm tot}$|. The left and right panels for each width parameter show results with BPZ and FZBoost, respectively. In the case of BPZ, the subsample with selection |${\rm odds}\ge 0.9$| is shown in squares with dashed lines. The vertical solid and dashed lines marks the |$1\sigma$| and |$2\sigma$| regions of the depth distribution.
The left two columns of Fig. 7 show the changes in the second moment, |$\sigma _z$|, for both the BPZ (first column) and FZBoost case (second column). For BPZ, there is little change in this parameter for Y1 at different depth, but for Y3 and Y5, |$\sigma _z$| increases with depth. Including odds selection reduces the trend, and in some cases reverses it. For FZBoost, the trend is similar to BPZ, but bin 1 shows a particularly large variation by as much as |$\sim 30~{{\rm per\ cent}}$|. This is because |$\sigma _z$| is sensitive to the entire distribution, not just the peak, and outliers at high redshift can significantly impact this parameter. Fig. C1 shows same |$p(z)$| distributions for Y3 in logarithmic scale, where the high redshift outliers are visible. Indeed, one can see an enhanced high-redshift population for bin 1 in the FZBoost case. The odds cut removes most of the outliers, so that |$\sigma _z$| is reflecting the change of the peak width with depth, hence giving the reversed trend.
The right two columns of Fig. 7 show the changes in |$W_z$|. Given a tomographic bin, a larger |$W_z$| means a more peaked redshift distribution, hence a larger clustering signal. One can see that |$W_z$| is more sensitive to the bulk of the |$p(z)$| distribution, as it increases with depth in most bins. We see that the variation in |$W_z$| is within 10 per cent from the mean, with the largest variation coming from bins 2, 3, and 4. The highest and lowest tomographic bins, on the other hand, does not change much, despite their |$\sigma _z$| varying significantly with depth. For the BPZ case, adding the additional cut in the odds parameter reduces such trends in general, and the trend in the highest tomographic bin is reversed.
5 IMPACT ON THE WEAK LENSING |$3\times 2$| PT MEASUREMENTS
We use the Y3 FZBoost photo-z as an example to showcase the varying depth effects, by propagating the number density and |$p(z)$| variation from the previous section into the weak lensing |$3\times 2$| pt data vector. In Section 5.1, we describe how the mock large-scale structure and weak lensing shear maps are constructed with the inclusion of non-uniformity. In Section 5.2, we show case the measured |$3\times 2$| pt data vector in both uniform and variable depth case.
5.1 Mock maps with varying depth
To construct the mock LSST catalogue, we use one of the publicly available Gower street simulations (Jeffrey et al. 2024). This is a suite of 800 N-body cosmological simulations created using PKDGRAV3 (Potter, Stadel & Teyssier 2017) with various wCDM cosmological parameters. The simulation outputs are saved as 101 light cones in healpix format with |$N_{\rm side}=2048$| between |$0\lt z\lt 49$|. To fill the full sky, the boxes are repeated 8000 times in a |$20\times 20\times 20$| array. For shells |$z\lt 1.5$|, though, only three replications are required. We use the particular simulation with |$\Lambda$|CDM cosmology: |$w=-1$|, |$h=0.70$|, |$\Omega _m=0.279$|, |$\Omega _b=0.046$|, |$\sigma _8=0.82$|, and |$n_s=0.97$|. The dark matter density contrast map, |$\delta _m$|, is computed using particle counts at |$N_{\rm side}=512$| (corresponding to a pixel size of |$47.2 \, {\rm arcmin}^2$|), and the corresponding lensing convergence map, |$\kappa$|, is produced with Born approximation using BornRayTrace6 (Jeffrey, Alsing & Lanusse 2020). Finally, the shear map, |$(\gamma _1,\gamma _2)$| in spherical harmonic space is produced via
and we transform |$\gamma _{E,\ell m}$| as a spin-2 field, |$\gamma _{\ell m}=\gamma _{E,\ell m} + i \gamma _{B,\ell m}$|, assuming zero B-mode. For more details see Jeffrey et al. (2024).
We construct the lens and source shear maps as follows. In the noise-less case, given a lens (source) redshift distribution, |$p_i(z)$|, for a tomographic bin i, we construct the lens density (source shear) map by |$M_i= \sum _j M_j p_i(z_j)\Delta z_j$|, where j denotes the light-cone shells in the Gower street simulation, |$M_j$| denotes the map in this particular shell, and |$\Delta z_j$| denotes the shell width. The noisy maps are generated in the following way. Lens galaxy counts in tomographic bin i on each pixel |$\boldsymbol{\theta }$| are drawn from a Poisson distribution. For a shell j, the Poisson mean is |$\mu _j(\boldsymbol{\theta })=n_{{\rm gal},j}[1+b\delta _{m,j}(\boldsymbol{\theta })]$|, where b is the linear galaxy bias and |$n_{{\rm gal},j}=n_{\rm gal}p_i(z_j)\Delta z_j$|, with |$n_{\rm gal}$| being the average count per pixel in this tomographic bin. Here, we set |$b=1$| to avoid negative counts in extremely underdens pixels. However, notice that in a magnitude-limited survey, the galaxy bias is typically |$b\gt 1$| and evolves with redshift, not to mention the scale-dependence of bias on non-linear scales. One approach to sample |$b\gt 1$| is to simply set negative counts to zero. However, this may introduce spurious behaviour in the two-point function of the field. Given the main purpose here is to propagate the systematic effects due to depth only, we justify our choice by prioritizing the precision of the measured two-point statistics compared to theory inputs. We assume the ensemble-averaged per-component shape dispersion to be |$\sigma _e=\left\langle \sqrt{(e_1^2+e_2^2)/2} \right\rangle =0.35$|, chosen to roughly match that measured in the Stage III lensing surveys (e.g. Gatti et al. 2021; Joachimi et al. 2021; Li et al. 2022). For a tomographic bin i, we first assign source counts in the same way as above, resulting in |$\hat{n}_{\rm source} (\boldsymbol{\theta })$| galaxies in pixel |$\boldsymbol{\theta }$|. We then randomly assign shapes drawn from a Gaussian distribution, |$\mathcal {N}\sim (0,\sigma _e)$|, for each component |$\hat{n}_{\rm source} (\boldsymbol{\theta })$| times, and we compute the mean shape noise in each pixel. We end up with a shape noise map, which we then add to the true shear map for each tomographic bin.
To imprint the varying depth effects, we divide the footprint into 10 sub-regions containing the pixels in each of the i-band |$m_5^{\rm ex}$| deciles, and repeat the above procedure with distinct number density and |$p(z)$| for both the lens and source galaxies, according to the findings in previous sections. We do not assign depth-varying shape noise, following the finding in Joachimi et al. (2021) that the shape noise is only a weak function of depth. We also produce the noise-less cases for varying depth. For density contrast, we produce two versions: one with varying |$p(z)$| only, and one with additional amplitude modulation |$\delta _m + \Delta \delta$|, where |$\Delta \delta +1=N_{\rm gal}/\bar{N}_{\rm gal}$|, as shown in Fig. 5. The former is to used isolate the effect of varying |$p(z)$| only.
We adopt the cumulative number density of the photometric sample as a function of the i-band limiting magnitude given by the DESC SRD:
where |$f_{\rm mask}$| accounts for the reduction factor for masks due to image defects and bright stars, and |$f_{\rm mask}=0.12$| corresponds to a similar level of reduction in HSC Y1 (The LSST Dark Energy Science Collaboration 2021). Hence, substituting |$i_{\rm lim}=24.6$| for LSST Y3, the expected total number density is |$N(\lt 24.6)=27.1\, {\rm arcmin}^{-2}$|. This is slightly larger but comparable to the HSC Y3 raw number density of |$N=22.9 \, {\rm arcmin}^{-2}$| (Li et al. 2022) at a similar magnitude cut of |$i_{\rm lim}\lt 24.5$| in the cModel magnitude. We estimate the total lens galaxy number density for our sample by |$N_{\rm lens}=N(\lt 24.5)f_{\rm LS}$|, where |$f_{\rm LS}=0.90$| is the ratio between the total number of lens and source samples (averaged over depth bins) from our degraded Roman–Rubin simulation catalogue, hence |$N_{\rm lens}=24.4 \, {\rm arcmin}^{-2}$|. For each lens tomographic bin, we obtain the following mean number density: |$3.93, 6.08, 5.66, 5.71, 3.03 \, {\rm arcmin}^{-2}$|. We also explore the case using a MagLim-like lens sample with a much sparser density in Appendix E. For source sample, it is the effective number density |$n_{\rm eff}$|, rather than the raw number density, that determines the shear signal-to-noise. |$n_{\rm eff}$| accounts for the down-weighting of low signal-to-noise shape measurements, as defined in e.g. Heymans et al. (2012) and Chang et al. (2013). For LSST, |$n_{\rm eff}$| is estimated for Y1 and Y10 with different scenarios in table F1 in the DESC SRD. In the case adopted for forecasting, where the shapes are measured in |$i+r$| and accounting for blending effect, |$n_{\rm eff}$| is |$\sim 60~{{\rm per\ cent}}$| of the raw number density for both Y1 and Y10. We follow this estimation for Y3, hence adopting |$n_{\rm eff}=16.3 \, {\rm arcmin}^{-2}$| for the full source sample, and |$3.26 \, {\rm arcmin}^{-2}$| for each tomographic bin. This is comparable, but slightly more sparse compared to HSC Y3, where |$n_{\rm eff}=19.9 \, {\rm arcmin}^{-2}$| (Li et al. 2022).
Meanwhile, we also generate a uniform sample for comparison, in which the number density and |$p(z)$| are given by the mean of the depth quantiles. We assign uniform weights to lens and source galaxies.
5.2 Weak lensing |$3\times 2$| pt data vector
We use NaMaster (Alonso, Sanchez & Slosar 2019) to measure the |$3\times 2$| pt data vector in Fourier space: |$C_{\ell }^{\rm gg}$|, |$C_{\ell }^{{\rm g}\gamma }$|, and |$C_{\ell }^{\gamma \gamma }$| for the lens and source tomographic bins. NaMaster computes the mixing matrix to account for the masking effects, and produces decoupled band powers. The healpix pixel window function correction is also applied when comparing the data with input theory. We adopt 14 |$\ell$|-bins in range [20,1000] with log spacing. Notice that the maximum |$\ell$| is a conservative choice for |$C_{\ell }^{\gamma \gamma }$| compared to the DESC SRD, where |$\ell _{\rm max}=3000$| is adopted, based on the assumption of improved modelling of non-linearity and baryonic feedback when the LSST data becomes available. Nevertheless, this is sufficient for our purpose to demonstrate the impact of variable depth on relatively large scales. For galaxy clustering and galaxy–galaxy lensing, we apply an additional scale cut at |$\ell _{\rm max}=k_{\rm max}\chi (\langle z \rangle)-0.5$| following the DESC SRD, where |$k_{\rm max}=0.3\, h{\rm Mpc}^{-1}$|, and |$\chi (\langle z \rangle)$| is the comoving distance at the mean redshift |$\langle z \rangle$| of the lens tomographic bin. We generate theory angular power spectra assuming spatial uniformity with the core cosmology library7 (CCL; Chisari et al. 2019). CCL uses halofit (Smith et al. 2003; Takahashi et al. 2012) non-linear power spectrum and Limber approximation when computing the angular power spectra. We compute the Gaussian covariance matrix using NaMaster with theoretical data vectors. The covariance includes mask effects, shot-noise, and shape noise power spectra. It should be noted that this is done assuming uniformity. In the varying depth case, the true covariance contains extra variance, due to spatial correlation in the noise with the number count. Also, the assumption of a purely Gaussian covariance is not completely true. On very large scales, non-Gaussian mode coupling at scales larger than the survey footprint results in a term called supersample covariance (Li, Hu & Takada 2014). Here we expect it to be relatively small because of the large sky coverage of LSST. On small scales, non-linear structure formation also introduces non-Gaussian terms (e.g. Cooray & Hu 2001). With the scale cuts adopted in |$C_{\ell }^{gg}$| and |$C_{\ell }^{g\gamma }$| we expect that such non-Gaussian contribution to be small.
The galaxy clustering angular power spectra measurements, |$C_{\ell }^{\rm gg}$|, are shown in Fig. 8. The tomographic bin number is indicated in the upper right corner as |$(i,i)$| for bin i. The measurements for the uniform case are shown as red dots, and that for the varying depth case are shown in purple. The data points are shot-noise-subtracted. We see a clear difference between the uniform and the varying depth cases at |$\ell \lt 100$|, and it becomes more significant at higher redshifts. The impact at large scales is expected, as the i-band coadd depth varies relatively smoothly and the rolling pattern is imposed at relatively large scales. The trend with redshifts is also expected, due to two main reasons. First, the slope |${\rm d}(N_{\rm gal}/\bar{N}_{\rm gal})/{\rm d}m_5$| increases slightly with redshift, and is significantly larger for bin 5, as shown in the right middle panel of Fig. 5. This means that non-uniformity is most severe in these bins. Secondly, the clustering amplitude increases towards lower redshifts due to structure growth, hence the non-uniformity imprinted in |$\delta _g$| is less obvious in lower redshift bins. In practice, the number density fluctuations are mitigated via the inclusion of the selection weights, |$w(\boldsymbol{\theta })$|, such that the corrected density field is defined as |$\tilde{\delta }_g(\boldsymbol{\theta }) = N(\boldsymbol{\theta })/w(\boldsymbol{\theta }) \bar{N}_w$|, where |$\bar{N}_w=\sum N(\boldsymbol{\theta })/\sum w(\boldsymbol{\theta })$| (see e.g. Nicola et al. 2020). In addition, these weights will be used to compute the mode coupling matrix and shot noise, such that the varying number density is taken into account in the likelihood analysis. A more subtle effect is the difference in redshift distribution at different depth. To isolate its impact, we compare the clustering power spectra from the noise-less sample varying |$p(z)$| only with that from the noise-less uniform case. The ratio of the measurements are shown as dashed lines in the first panel of Fig. 9. We find that once the non-uniformity in number density is removed, the variation in |$p(z)$| does not significantly bias the power spectra, and we recover the uniform case at better than 0.5 per cent.

The lens galaxy density angular power spectrum, |$C^{\rm gg}_{\ell }$|, measured from the mock LSST Y3 data with uniform (red points) and varying depth (purple points). Each panel shows the autocorrelation, |$(i,i)$|, in each tomographic bin i. The lower panels show the ratio between the measurements and the theory (black solid lines), |$C^{\rm th}_{\ell }$|. The grey area indicates excluded data points from the scale cut corresponding to |$k=0.3\,h {\rm Mpc}^{-1}$|. The |$\chi ^2$| per degree of freedom, |$\chi ^2_{\rm dof}$|, is shown for the uniform and variable depth cases in the lower left corner, computed using a Gaussian covariance assuming spatial uniformity. The varying depth case deviates from the theory significantly on large scales.

The ratio between noise-less angular power spectra for the varying depth case and the uniform case. The left, middle, and right panels show the ratio for |$C_{\ell }^{\rm gg}$|, |$C_{\ell }^{{\rm g}\gamma }$|, |$C_{\ell }^{\gamma \gamma }$|, respectively. The solid lines indicate the case where both density non-uniformity and varying |$p(z)$| are applied to the overdensity map, whereas the dashed lines refer to the case where only varying |$p(z)$| is implemented. For |$C_{\ell }^{{\rm g}\gamma }$| and |$C_{\ell }^{\gamma \gamma }$|, we only show the diagonal terms, i.e. the combination |$(i,i)$| for tomographic bin i for the tracers, for visual clarity. The off-diagonal terms vary within a similar range. In case of |$C_{\ell }^{g\gamma }$|, the grey region marks |$\ell \lt 50$| where measurements are unstable.
The galaxy–shear and shear–shear power spectra, |$C_{\ell }^{{\rm g}\gamma }$| and |$C_{\ell }^{\gamma \gamma }$|, are shown in Figs 10 and 11, respectively. The source–lens and source–source combinations are indicated on the upper right as |$(i, j)$|. In both cases, we only show the non-zero E-modes, and we check that the B-modes are consistent with zero. For the galaxy–shear case, measurements from combinations |$i\lt j$| are not shown, because we do not include effects such as magnification or intrinsic alignment, hence these measurements are low signal-to-noise or consistent with zero. We see that, overall, the impact of variable depth is much smaller compared to galaxy clustering. In the galaxy–galaxy shear measurements, only combination (5,5) shows a significant |$\chi ^2$| in the variable depth case, and the main deviations is at |$\ell \lt 100$|. This could be a joint effect where non-uniformity is largest in the highest redshift bin for both lens and source. There is negligible difference in the shear–shear measurements for all other combinations given the measurement error. To look at this further, we take the noise-less case and compute the ratio between measurements from the varying depth sample and the uniform sample. We show some examples along the diagonal, i.e. the |$(i,i)$| combinations, in the middle and right panels of Fig. 9. The off-diagonal measurements lie mostly within the variation range of the ones shown here. In case of |$C_{\ell }^{{\rm g}\gamma }$|, we see that deviations are large at low |$\ell$| when both density and |$p(z)$| is non-uniform (shown as solid line); when the density non-uniformity is removed (shown in dashed line), the results are more consistent within 5 per cent. For |$C_{\ell }^{\gamma \gamma }$|, we see that the largest impact is from the highest tomographic bin reaching up to 0.5 per cent.

The E-mode of the galaxy–shear angular power spectrum, |$C^{{\rm g}\gamma }_{\ell }$|, measured from the mock LSST Y3 data with uniform (red points) and varying depth (purple points). Each panel shows the combination, |$(i,j)$|, for source bin i and lens bin j. The lower panels show the ratio between the measurements and the theory (black solid lines), |$C^{\rm th}_{\ell }$|. The grey area indicates excluded data points from the scale cut corresponding to |$k=0.3\,h {\rm Mpc}^{-1}$| in the lens bin. The |$\chi ^2$| per degree of freedom, |$\chi ^2_{\rm dof}$|, is shown for the uniform and variable depth cases in the lower left corner, computed using a Gaussian covariance assuming spatial uniformity. The uniform and varying depth case do not differ much except for the first few data points in (5,5), where the varying depth case deviates significantly from the theory line.

The EE mode of the shear–shear angular power spectrum, |$C^{\gamma \gamma }_{\ell }$|, measured from the mock LSST Y3 data uniform (red points) and varying depth (purple points). Each panel shows the source–source combination, |$(i,j)$|, tomographic bins i and j. The lower panels show the ratio between the measurements and the theory (black solid lines), |$C^{\rm th}_{\ell }$|. The |$\chi ^2$| per degree of freedom, |$\chi ^2_{\rm dof}$|, is shown for the uniform and variable depth cases in the lower left corner, computed using a Gaussian covariance assuming spatial uniformity.
These results are consistent with the analytical approach in Baleato Lizancos & White (2023), where, in general, the varying depth effect in the redshift distributions is sub-per cent and the weak lensing probes are less susceptible to these variations. Our results are quite different from Heydenreich et al. (2020) (hereafter H20) for KiDS cosmic shear analysis in several aspects. H20 found that the largest impact comes from the sub-pointing, small scales, and for a KiDS-like set-up, the difference between the uniform and variable depth cases is 3 per cent–5 per cent at an angular scale of |$\theta =10\, {\rm arcmin}$|. Furthermore, the variable depth effect is stronger in lower redshift bins than higher redshift bins. Several differences in the analysis may contribute to these different results. First, the non-uniformity in KiDS is rather different from that considered here: the KiDS footprint consists of many |$1\, {\rm deg}^2$| pointings, each having distinctive observing conditions due to that each field only received a single visit. This means that survey properties such as depth are weakly correlated at different pointings. One can write down a scale-dependent function, |$E(\theta)$|, to specify the probability of a pair of galaxies falling in the same pointing at each |$\theta$|, and this essentially gives rise to the scale dependence of the variable depth effect in H20. For LSST, the above assumptions are not true, and |$E(\theta)$| (if one can write it down) would take a very different form compared with that in KiDS. Secondly, due to the single visit, there is a much larger variation in depth, number density, and |$\Delta z$| in KiDS compared to this work (tomographic bin centre can shift up to |$\Delta z\sim 0.2$| in redshift, as shown in fig. 2 of H20). This means that the variable depth effects in KiDS as explored by H20 is significantly larger compared to this work. This also explains their redshift dependence, because for KiDS, the average redshift between pointings varies the most in the lowest redshift bins. Lastly, although our |$\ell _{\rm max}$| here corresponds to |$\theta \sim 10 \, {\rm arcmin}$|. the results are not directly comparable, as H20 conducted the analysis in real space, i.e. |$\xi _{\rm \pm }(\theta)$|.
To sum up, the largest impact of varying depth comes from galaxy clustering, whereas the impact on weak lensing probes is much smaller. Higher redshift bins are more susceptible due to a higher sensitivity in number density and redshifts with depth. Given the mock LSST Y3 uncertainty, one can clearly detect bias in the power spectrum in galaxy clustering and the galaxy–galaxy shear bin (4,4), while all other combinations do not seem to have detectable impacts. Furthermore, once the density non-uniformity is removed, the impact of varying depth is further reduced. There are several ways to mitigate number density variation, such as mode projection (e.g. Rybicki & Press 1992; Elsner, Leistedt & Peiris 2016), template subtraction (e.g. Ross et al. 2011; Ho et al. 2012), iterative regression (e.g. Elvin-Poole et al. 2018; Weaverdyck & Huterer 2021), and machine learning methods using neural networks (Rezaie et al. 2020) and a SOM (Johnston et al. 2021). See Weaverdyck & Huterer (2021) for a thorough review. Notice that, despite these methods, it is difficult to guarantee a complete removal non-uniformity, and in some cases, clustering signal can also be reduced as a result. Additional sky cuts to exclude problematic regions can also effectively reduce density variation, at the cost of losing sky coverage. Finally, for the lens sample, a brighter magnitude cuts can also greatly reduce the variable depth effect (see Appendix E for a MagLim-like lens selection), at the cost of sample sparsity. Nevertheless, non-uniformity in |$p(z)$| only seems to be safely averaged out in the 2-point statics measurements.
5.2.1 Impact on spectroscopic calibration
Here, we consider another potential source of systematics arising from small spectroscopic calibration fields. Redshift calibration for photometric surveys such as LSST are usually done using small but deep spectroscopic surveys, e.g. C3R2 survey (Masters et al. 2019). Each field in these surveys has a coverage of a few |${\rm deg}^2$|. Suppose that a calibration field overlaps with a particularly shallow or deep region, the calibration (e.g. a trained SOM) could cause bias to the overall redshift distribution when it is generalized to the whole field. For example, a SOM trained in a shallow region will contain larger noise, which may increase the scatter for the overall sample. The lack of high redshift, fainter objects in the shallow region could also cause bias when the SOM is applied to objects in deeper regions.
The specific impact will depend on the calibration method and details of the calibration, which is beyond the scope of this paper. Here, we qualitatively assess the impact via the difference in the |$3\times 2$| pt theory vectors computed using the |$p(z)$| from a particular quantile and those computed using the mean |$p(z)$|, as shown in Fig. 12. The solid lines show cases from the shallowest quantile, |${\rm qtl}=0$|, and the dashed lines show cases from the deepest quantile, where |${\rm qtl}=9$|, highlighting the worst case scenarios. For |$C_{\ell }^{g\gamma }$| and |$C_{\ell }^{\gamma \gamma }$|, only cases where the tracers are in the same bin are shown, but the other lens–source combinations have a comparable variation. We see that naively taking the |$p(z)$| from a quantile and assume it as the |$p(z)$| for the full sample can give rise to as much as 10 per cent bias compared to the uniform case.

The ratio between the |$3\times 2$| pt theory vectors computed using the |$p(z)$| from depth quantiles 0 (shallowest, shown as solid lines) and 9 (deepest, shown as dashed lines), and those computed using the mean |$p(z)$|. Different colours show different tracer tomographic bin combinations, as indicated in the legend. For |$C_{\ell }^{g\gamma }$| and |$C_{\ell }^{\gamma \gamma }$|, only cases where the tracers are in the same bin are shown for visual clarity, but the other tracer combinations have a comparable variation.
This effect is reduced by having multiple calibration fields across the LSST footprint. Currently, many of the calibration fields overlaps with the LSST Deept Drilling Field (DDF), which will be much deeper compared to the WFD. Impact of variable depth can then be mitigated via a two-tiered SOM calibration, mapping from the deep to the wide field (Myles et al. 2021), and synthetic source injection (Everett et al. 2022), mimicking the degradation of the deep field objects across the LSST footprint, as done in the DES Y3 analysis.
5.3 Impact on cosmological parameters
We further predict the impact of survey non-uniformity on the cosmological analysis by conducting Fisher forecasting. The Fisher forecast estimates the constraints on cosmological parameters by assuming a Gaussian-likelihood function, a fiducial cosmology, and a covariance matrix on the data vector (Wasserman 2004; Coe 2009; Bhandari et al. 2021). In the Bayesian statistics framework, we can write the Fisher Information matrix as
where |$\boldsymbol {d}$| is the data vector, |$\boldsymbol {\alpha }$| is the model parameter vector, and |$\bf{V}$| is the inverse of the covariance matrix. |$\sigma _{\alpha _i}$| is the standard deviation of the Gaussian prior on parameter |$\alpha _i$|, and |$\delta _{ij}$| is the Kronecker delta. We use the Fisher forecast code developed in Zhang et al. (in preparation). The covariance matrix is computed by NaMaster using the theoretical angular power spectra generated by CCL, assuming Gaussianity.
We use ccl to compute the fiducial data vector of the LSST Y3 |$3\times 2$| pt. We use the non-linear intrinsic alignment (NLA) model as in Krause & Eifler (2017), adopted in the DESC SRD, to describe the contribution of intrinsic alignments to the data vectors. There are four NLA parameters, namely, the overall intrinsic alignment amplitude, |$A_0$|, the power-law luminosity scaling, |$\beta$|, the redshift scaling, |$\eta _l$|, and the additional high-redshift scaling |$\eta _h$|. The fiducial value and prior of the cosmological and astrophysical parameters are taken from the DESC SRD, as shown in Table 2. The fiducial galaxy bias, |$b_i$|, of the lens catalogue in each tomographic bin i, is set to 1.0, with a Gaussian standard deviation of 0.9 and a cut at |$b_i\lt 0$|. The contours shown in this section include the statistical uncertainty of the data vector and the marginalized uncertainty over other cosmological and astrophysical parameters described above. The contour can be overconfident since it does not marginalize over observational systematic uncertainties, which can include photometric redshift uncertainty, PSF uncertainty, and multiplicative shear uncertainty. Additionally, the non-Gaussian contributions to the covariance matrix is not taken into account. Non-linear galaxy bias is also not modelled.
The fiducial value and the Gaussian standard deviation of the prior assumed in the Fisher information matrix for the cosmological and intrinsic alignment parameters as defined in Krause & Eifler (2017).
Parameter . | Fiducial value . | Prior |$\sigma$| . |
---|---|---|
Cosmological | ||
|$\Omega _m$| | 0.279 | 0.15 |
|$\sigma _8$| | 0.82 | 0.2 |
|$w_0$| | |$-$|1 | 0.8 |
|$w_a$| | 0 | 1.3 |
h | 0.7 | 0.125 |
|$n_s$| | 0.97 | 0.2 |
|$\Omega _b$| | 0.046 | 0.003 |
Intrinsic alignment | ||
|$A_0$| | 5.92 | 2.5 |
|$\eta _l$| | |$-$|0.47 | 1.5 |
|$\eta _h$| | 0.0 | 0.5 |
|$\beta$| | 1.1 | 1.0 |
Galaxy bias | ||
|$b_i$| | 1.0 | 0.9 |
Parameter . | Fiducial value . | Prior |$\sigma$| . |
---|---|---|
Cosmological | ||
|$\Omega _m$| | 0.279 | 0.15 |
|$\sigma _8$| | 0.82 | 0.2 |
|$w_0$| | |$-$|1 | 0.8 |
|$w_a$| | 0 | 1.3 |
h | 0.7 | 0.125 |
|$n_s$| | 0.97 | 0.2 |
|$\Omega _b$| | 0.046 | 0.003 |
Intrinsic alignment | ||
|$A_0$| | 5.92 | 2.5 |
|$\eta _l$| | |$-$|0.47 | 1.5 |
|$\eta _h$| | 0.0 | 0.5 |
|$\beta$| | 1.1 | 1.0 |
Galaxy bias | ||
|$b_i$| | 1.0 | 0.9 |
The fiducial value and the Gaussian standard deviation of the prior assumed in the Fisher information matrix for the cosmological and intrinsic alignment parameters as defined in Krause & Eifler (2017).
Parameter . | Fiducial value . | Prior |$\sigma$| . |
---|---|---|
Cosmological | ||
|$\Omega _m$| | 0.279 | 0.15 |
|$\sigma _8$| | 0.82 | 0.2 |
|$w_0$| | |$-$|1 | 0.8 |
|$w_a$| | 0 | 1.3 |
h | 0.7 | 0.125 |
|$n_s$| | 0.97 | 0.2 |
|$\Omega _b$| | 0.046 | 0.003 |
Intrinsic alignment | ||
|$A_0$| | 5.92 | 2.5 |
|$\eta _l$| | |$-$|0.47 | 1.5 |
|$\eta _h$| | 0.0 | 0.5 |
|$\beta$| | 1.1 | 1.0 |
Galaxy bias | ||
|$b_i$| | 1.0 | 0.9 |
Parameter . | Fiducial value . | Prior |$\sigma$| . |
---|---|---|
Cosmological | ||
|$\Omega _m$| | 0.279 | 0.15 |
|$\sigma _8$| | 0.82 | 0.2 |
|$w_0$| | |$-$|1 | 0.8 |
|$w_a$| | 0 | 1.3 |
h | 0.7 | 0.125 |
|$n_s$| | 0.97 | 0.2 |
|$\Omega _b$| | 0.046 | 0.003 |
Intrinsic alignment | ||
|$A_0$| | 5.92 | 2.5 |
|$\eta _l$| | |$-$|0.47 | 1.5 |
|$\eta _h$| | 0.0 | 0.5 |
|$\beta$| | 1.1 | 1.0 |
Galaxy bias | ||
|$b_i$| | 1.0 | 0.9 |
Fisher forecasts can be used to predict bias in the parameters given a shift in the data vector. We take the difference between the biased and fiducial |$3\times 2$| pt power spectra, |$\boldsymbol {d}^{\rm {biased}}$| and |$\boldsymbol {d}$|, respectively, from Section 5.2, and use it to calculate the bias in cosmological parameters that the survey non-uniformity induces, under the assumption of small, linear changes in |$\boldsymbol {d}$| (Huterer et al. 2006; Rau et al. 2017):
where |$\boldsymbol {d}$| is the fiducial |$3\times 2$|pt data vector. The Fisher information matrix used in equation (19) is the full |$16\times 16$| matrix which includes 11 cosmological and intrinsic alignment parameters, as well as five galaxy bias parameters, as shown in Table 2.
The forecasted impact of non-uniformity on LSST Y3 |$3\times 2$| pt cosmological analysis is shown in Fig. 13. When neither non-uniform |$N_{\rm gal}$| nor |$n(z)$| are modelled in the data vector, the forecasted bias on |$\Omega _{\rm m}-\sigma _8$| and |$w_0-w_a$| are both on the order of |$\sim 20 \sigma$|, making the analysis completely unfeasible. Notice that in this case, strictly speaking, the small difference assumption in equation (19) breaks down, and so one should take these numbers with caution. Assuming the non-uniformity residual can be reduced to a level of 10 per cent (orange) and 5 per cent (green), the bias on the cosmological parameters reduces to about |$3\sigma$| and |$1.5\sigma$|, respectively. We observe that the main contributor to the cosmological bias in this case is the galaxy clustering, |$C_\ell ^{\rm gg}$|. When the bias in clustering is set to zero, the overall bias in cosmology is contained within |$1\sigma$|, shown in brick red. The cosmological bias when only non-uniformity of |$n(z)$| is mis-modelled is negligible, as shown in the purple vector.

The black ellipse shows the Fisher forecasted |$1\sigma$| and |$2\sigma$| contour of |$\Omega _{\rm m}-\sigma _8$| and |$w_0-w_a$|, marginalized over 16 parameters as described in Section 5.3. The parameter biases induced by survey non-uniformity are given by the vectors in the plot. The blue, orange, and green vectors show the biases corresponding to 100 per cent, 10 per cent, and 5 per cent of both |$N_{\rm gal}$| and |$n(z)$| non-uniformity. The brick red vector shows the bias corresponding to the 100 per cent case but without the clustering bias. The purple vector shows the bias corresponding to only |$n(z)$| non-uniformity.
As a result of the Fisher forecast, we recommend the |$N_{\rm gal}$| non-uniformity of the LSST |$3\times 2$| pt lens sample should be modelled with less than 3 per cent residual, to ensure an accurate cosmological analysis with bias within |$1\sigma$|. Otherwise, large-scale modes or high-redshift bins of the galaxy clustering signal must be removed from the data vector to avoid the parts where non-uniformity makes the most significant impact, as also shown in Fig. 8.
6 CONCLUSIONS
In this paper, we investigated and quantified the impact of spatial non-uniformity due to survey conditions on redshift distributions in the context of early LSST data. We used the Roman–Rubin simulation as the truth catalogue, and degraded the photometry using the LSST error model implemented in the rail package. The degradation utilizes the survey condition maps from the OpSim baseline v3.3 for the 1, 3, and 5-yr LSST data. We run BPZ and FZBoost photometric redshift estimators on the degraded sample and use the photo-z mode to separate the samples into five lens and five source tomographic bins. Finally, we apply the LSST gold selection and a signal-to-noise cut. Taking the extinction-corrected |$5\sigma$| coadd depth of the detection band, i-band, as the primary source of non-uniformity, we quantify the impact in terms of three measures: the number of objects, the mean redshift of the tomographic bin, and the tomographic bin width. We find that:
The number of objects increases with the i-band depth in general, and at extreme depth values, the number of objects can vary by a factor of two. The trend is relatively consistent between cases using BPZ and FZBoost, although selecting |${\rm odds}\ge 0.9$| for BPZ amplifies the trend. The largest correlation comes from the highest tomographic bin.
The mean redshift in each bin increases with the i-band depth, with a variation of |$|\Delta z /(1+\langle z \rangle)|\sim 0.005-0.01$|. The lens samples show a relatively consistent trend across different tomographic bins, whereas for the source sample, the highest tomographic bin shows the largest variation. This reaches the limit of the requirements of 0.005 for Y1 as listed in the DESC SRD, and exceeds the requirement of 0.003 for Y10. At extreme depth variations, however, deviation in |$\langle z \rangle$| could exceed Y1 requirements.
The width of the lens tomographic bin is measured in terms of |$\sigma _z$|, which is sensitive to the entire redshift distribution, |$p(z)$|, and |$W_z$|, which is sensitive to the peak of |$p(z)$|, both varying at the level of 10 per cent and slightly increases with year. We find that in general, |$\sigma _z$| increases with the i-band depth due to fainter objects included in the deeper sample. |$W_z$| also increases with the i-band depth, due to a more peaked bulk |$p(z)$| as a result of higher SNR in deeper samples, although the trend can be reversed in some cases.
As emphasized before, results derived for Y3 and Y5 are with particularly large rolling non-uniformity. Hence, the variations shown should be interpreted as an upper limit for the early Rubin LSST static science. As shown in Appendix E, if the final LSST lens selection is similar to the DES Y3 MagLim sample with a bright magnitude cut, then the expected variable depth impact will be milder than shown in our baseline cases.
We took the Y3 FZBoost photo-z as an example to propagate the impact of varying depth to the weak lensing |$3\times 2$|pt measurements. To do this, we used one realization of the Gower Street N-body simulation, and generated lens galaxy maps and source shear maps with spatially varying number density and |$p(z)$|. We measure the data vector in harmonic space using NaMaster, and also compare them with the theory expectation generated from the CCL. We find that the largest impact is on |$C_{\ell }^{\rm gg}$| with the higher redshift bin measurements significantly biased. |$C_{\ell }^{{\rm g}\gamma }$| is less sensitive to varying depth effects, although in the source–lens combination (4,4), there is a visible difference at low |$\ell$|. |$C_{\ell }^{\gamma \gamma }$| shows no significant impact in all source–source combinations from varying depth, given the uncertainties in LSST Y3. Finally, we also investigate cases where we do not include noise in the lens and source maps. The difference between uniform and varying depth cases can be up to a few per cent for |$C_{\ell }^{{\rm g}\gamma }$|, and less than 0.5 per cent for |$C_{\ell }^{\gamma \gamma }$|. Furthermore, by removing the density non-uniformity, and varying |$p(z)$| only with depth, one can reduce the bias in |$C_{\ell }^{\rm gg}$| and |$C_{\ell }^{{\rm g}\gamma }$| to sub-per cent level. We use a Fisher forecast to assess the impact of non-uniformity on cosmological parameter inference for the |$3\times 2$| pt data vector. We conclude that the mitigation in number density variation is crucial, and for our baseline setup for LSST Y3, this should be controlled below 3 per cent. Therefore, for early LSST analysis, it is crucial to account for the galaxy density variation, but the impact of varying |$p(z)$| seems to be negligible. We leave the investigation of an accurate mitigation strategy of the number density variation to future work.
Our current approach has some caveats. First, the fidelity of the colour-redshift relation in the Roman–Rubin simulation at |$z\gt 1.5$| is questionable. As already mentioned, the strong bifurcation of the blue objects at this high redshift may lead to worse (in the case of BPZ) or overly optimisic (in the case of FZBoost) performance when estimating the photo-z. Secondly, we have adopted an analytic model to obtain the observed magnitudes in each band based on survey conditions. However, in reality, the observed magnitudes and colours also depend on the way they are measured. For example, for extended objects, cModel (Strauss et al. 2002) and GAaP (Kuijken et al. 2015) methods are often applied. Although the photometry will be calibrated, the magnitude error may not be the same for different methods. This could introduce extra scatter in photo-z. Thirdly, we have only tested on two major photo-z estimators, observing some level of differences in the results. For example, compared to BPZ, the FZBoost samples show more consistency between different tomographic bins regarding to the trend with i-band depth. Therefore, one should take the result as an order of magnitude estimate of the impact, but the specific trends are likely to differ for different photo-z methods. Moreover, when propagating the effects to the data vector, we have made some simplifications. We considered a galaxy bias of |$b=1$|, and did not include systematics such as magnification bias or intrinsic alignments. This choice is to isolate the effect of varying depth on the pure lensing and clustering contribution, but it would be more realistic to include these effects. Finally, we have not folded in the effects of blending, i.e. spatially nearby galaxies are detected as one object. This occurs when the surface density is high and the image is crowded, and could be significant for deep photometric surveys such as LSST. The level of blending depends on both seeing and depth of the survey, hence, it could correlate with the variable depth effects discussed here. The impact of blending on photo-z is the inclusion of a small fraction of ill-defined redshifts in the sample, increasing the photo-z scatter. Clustering redshift calibration, which measures galaxy clustering on small scales, can also be affected as these scales are most susceptible to blending. Moreover, blending can affect shear measurements via e.g. lensing weights, hence introduce impact on galaxy–galaxy lensing and cosmic shear. As such, Nourbakhsh et al. (2022) showed that approximately 12 per cent of the galaxy sample in LSST is unrecognized blends, and can bias |$S_8$| measurement from cosmic shear by |$2\sigma$|.
Furthermore, so far our results are based on the |$p(z)$| of the true redshifts of the sample. In reality, we do not have access to this, and our theory curve will be based on the calibrated redshift distribution |$p_c(z)$|, which itself can be impacted by non-uniformity based the calibration method. For example, in many weak lensing surveys, a SOM is used to calibrate redshifts by training on a photometric subsample with spectroscopic counterparts (Wright et al. 2020; Myles et al. 2021). By taking subsamples from a small calibration field (typically of a few square degrees) located in a particularly shallow region could result in a trained SOM that captures different magnitudes, redshifts, and SNR than that from a deep region, as quanlitatively shown in Section 5.2.1. One remedy may come from calibration using clustering redshifts, which takes advantage of galaxy clustering of the target sample with a spectroscopic sample, spliced in thin redshift bins (den Busch et al. 2020; Gatti et al. 2022; Rau et al. 2023). The non-physical variation with depth will drop out in this method, giving unbiased estimate of |$p(z)$|.
We have only explored the impact of variable depth on two-point statistics here, but there could be potential impact on statistics beyond two-point. For example, for weak lensing shear, a similar effect in manifestation is source clustering, where the number density of source galaxies |$n(\hat{\theta }, z)$| is correlated with the measured shear |$\gamma (\hat{\theta })$| for a given direction |$\hat{\theta }$| on the sky, because source galaxies are themselves clustered. Impact of source clustering is negligible in two-point statistics for Stage III surveys, but is detected significantly in several higher order statistics in the DES Y3 data (Gatti et al. 2024). Given that the variable depth effect also modulates |$n(\hat{\theta }, z)$| (hence imprinting a fake ‘source clustering’), there may be non-negligible impact on higher order statistics with LSST. We leave these explorations to future work.
ACKNOWLEDGEMENTS
QH and BJ are supported by STFC grant ST/W001721/1 and the UCL Cosmoparticle Initiative. This paper has undergone internal review in the LSST Dark Energy Science Collaboration. The authors thank the internal reviewers, Boris Leistedt and Markus Rau, for their thorough and insightful comments. This work also benefited from helpful comments by Federica Bianco, Pat Burchat, Andrew Hearin, Eve Kovacs, Ofer Lahav, Rachel Mandelbaum, Andrina Nicola, and Peter Yoachim. The DESC acknowledges ongoing support from the Institut National de Physique Nucléaire et de Physique des Particules in France; the Science & Technology Facilities Council in the United Kingdom; and the Department of Energy, the National Science Foundation, and the LSST Corporation in the United States. DESC uses resources of the IN2P3 Computing Center (CC-IN2P3–Lyon/Villeurbanne – France) funded by the Centre National de la Recherche Scientifique; the National Energy Research Scientific Computing Center, a DOE Office of Science User Facility supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231; STFC DiRAC HPC Facilities, funded by UK BEIS National E-infrastructure capital grants; and the UK particle physics grid, supported by the GridPP Collaboration. This work was performed in part under DOE Contract DE-AC02-76SF00515. JFC acknowledges support from the U.S. Department of Energy, Office of Science, Office of High Energy Physics Cosmic Frontier Research program under Award Number DE-SC0011665. AIM acknowledges the support of Schmidt Sciences.
We acknowledge the use of arXiv and ADS for references, the use of Python libraries and software mentioned in the main text for data analysis and plotting, and Overleaf for the writing of this paper.
Author contribution statements
QH: Contributed to the conceptualization, data curation, formal analysis, and writing of the draft. Contributed to the development of rail.
BJ: Contributed to the conceptualization, funding acquisition, project administration, and revisions of the text.
EC: Contributed to rail software core functionalities used in the analysis. Minor contributions to revisions of text.
JFC: Contributed to the development of rail and the photerr model.
PL: Contributed to the development of the Roman–Rubin Diffsky simulation.
AIM: Contributed to rail software core functionalities used in the analysis. Minor contributions to revisions of text.
SS: Contributed to rail software used in the analysis, namely the BPZ and FlexZBoost algorithms used in estimation, along with general software development. Minor contributions to revisions of text.
ZY: Contributed to the development of rail and the photerr model; provided reviewing comments for the manuscript.
TZ: Conducted the cosmological forecast for the paper. Contributed to the development of rail, including the LSST error model, and rail’s core API; provided reviewing comments for the manuscript.
DATA AVAILABILITY
The methodology of generating mock LSST photometry with observing conditions is included in the rail pipeline.8 The mock galaxy catalogues with varying depth effect are available upon reasonable request.
Footnotes
Notice that the DESC SRD also provides requirements on the photometric redshift scatter of the full, unbinned sample, |$\sigma _{\Delta z}$|. For weak lensing, this is |$\sigma _{\Delta z}=0.006(1+z)$| (|$0.003(1+z)$|) for Y1 (Y10); for large-scale structure analysis, this is |$\sigma _{\Delta z}=0.1(1+z)$| (|$0.03(1+z)$|) for Y1 (Y10). Because we do not try to optimize the photometric redshift estimation in this paper, we do not compare our results with the DESC SRD |$\sigma _{\Delta z}$| values.
REFERENCES
APPENDIX A: COMPARISON OF LSST ERROR MODEL ON DC2
The |$5\sigma$| depth per visit, |$m_5$|, depends on a set of observing conditions in the following way (Ivezić et al. 2019):
where |$C_m$| is a constant that depend on the overall throughput of the system, |$m_{\rm sky}$| is the sky brightness in AB mag arcsec|$^{-2}$|, |$\theta _{\rm eff}$| is the seeing in arcsec, |$t_{\rm vis}$| is the exposure time in seconds, k is the atmospheric extinction coefficient, and X is the airmass. The default values of the parameters in the above equation per band are given in table 2 in Ivezić et al. (2019). The magnitude error for N-years observation is computed by |$\sigma /Nn_{\rm vis}$|, where the mean number of visits per year |$n_{\rm vis}$| can be derived from table 2 in Ivezić et al. (2019).
In this Appendix, we compare the LSST error model with the Rubin OpSim output as well as the Data Challenge 2 [DC2; LSST Dark Energy Science Collaboration (LSST DESC) et al. 2021] dr6 magnitude error. We perform our tests on the specific OpSim version minion_1016, and we use the 5-yr observing conditions including: coadd |$5\sigma$| point source depth (CoaddM5), single-visit |$5\sigma$| point source depth (fiveSigmaDepth), sky brightness (filtSkyBrightness), and number of visits (Nvisits).
We begin by checking equation (A1) using OpSim MAF maps over the DC2 footprint. The various survey conditions |$m_{\rm sky}$|, |$\theta _{\rm eff}$|, and X are taken as the median values over the 5-yr period, and other parameters |$C_m$|, |$t_{\rm vis}$|, and |$k_m$| are taken as the default values from Ivezić et al. (2019). The results from equation (A1) are the |$5\sigma$| PSF magnitude limit in each band per visit, and we compare it with two quantities: the median |$5\sigma$| depth map, and the equivalent per-visit depth from the coadded map: |$m_5=m_5^{\rm coadd}-2.5\log (\sqrt{N_{\rm vis}})$|, where |$N_{\rm vis}$| is the number of visits at each pixel. The results are shown in Fig. A1. We see that in general, |$m_5$| predicted by equation (A1) tends to be brighter than that from OpSim, and the difference is larger considering the coadd depth than the median depth. It seems that except for i-band which has a slightly different slope from unity, the difference in all other bands can be fixed by introducing a correction to |$C_m$|. For example, for the median |$m_5$| case, the shifts needed are |$\delta C_m=\lbrace -0.053, 0.032, -0.063, 0.070, 0.057, 0.027 \rbrace$| for |$ugrizy$|, respectively.

Comparison of the |$5\sigma$| PSF limiting magnitude computed from equation (A1) with the OpSim output: the median |$m_5$| over 5 yr (bright pink) and the coadded |$5\sigma$| depth converted to the equivalent of per visit (red). The |$m_5$| computed from equation (A1) utlizes the median sky brightness (|$m_{\rm sky}$|), median airmass (X), and median seeing (|$\theta _{\rm eff}$|), and other parameters are set to the default value in Ivezić et al. (2019).
We also explicitly check whether the dependence of the airmass, seeing, and sky brightness are as expected in equation (A1) with the default parameters. This is shown in Fig. A2. In all these exercises, we test whether the dependencies of the particular survey condition with |$m_5$| on the ensemble pixels, fixing all other dependence to a constant C which we fit to the ensemble. We see that the airmass and seeing are well captured by equation (A1), although the dependence of |$m_5$| on airmass is weak. The sky brightness relation is less well captured by equation (A1) especially for u and g. In general, however, we conclude that in absence of a depth map, one can estimate the unbiased |$m_5$| for Rubin observation conditions using equation (A1) with a modification of the |$C_m$| parameters for each band.

The relation between |$m_5$| and other survey conditions using the LSST error model. We show the comparison between the data points from OpSim for each of the six LSST bands, and the relation from the LSST error model using the default parameters as black dashed lines, with a fitted constant C. We see that the LSST error model captures the correlation between |$m_5$| and the underlying survey conditions well. The different colours correspond to different LSST filters, as indicated in the texts next to the data points in the same colour.
We then check equations (3) and (4) with the DC2 DM catalogue, where the magnitude errors are obtained through the detection pipeline, thus supposed to be more realistic. In this case, we directly adopt the coadded depth as |$m_5$|. We also compute in the low SNR limit (equation 2) which allows us to check the fainter magnitudes. For the extended magnitude errors, we compare with the CModel magnitudes in DC2. This is shown in Fig. A3. We see that there is reasonable agreement for the PSF magnitude errors in most bands, except for the u-band, where the LSST error model predicts larger error compared to that measured in DC2. However, it is also noticeable that the DC2 error seems to be underestimated when comparing the observed magnitude to the truth. It is also noticeable that the LSST error model also predicts consistently larger error at the bright end. When we add the extended error from the size of the galaxy (equation 6), we find that the scatter of the magnitude error at fixed magnitude is quite a bit larger than that measured by the cModel in DC2. Both the PSF magnitude error and the scatter for the extended error in DC2 can be matched by the LSST error model by simple scaling of the PSF magnitude error by a constant for each band, as well as scaling the galaxy size |$a_{\rm gal}, b_{\rm gal}$|. We emphasize that due to the known issues in the DC2 catalogue, we do not calibrate the LSST error model to DC2 in our analysis. However, it is worth bearing in mind what the differences are, and that one needs to calibrate the model with the real data.

Comparison of the magnitude error as a function of magnitude in each of the six LSST bands between the DC2 dr6 catalogue and the LSST error model. The red and pink points show the PSF magnitude errors, whereas the dark and light blue points show that of the extended errors compared with the DC2 cModel magnitudes. The coadd |$5\sigma$| depth from OpSim is used to compute the magnitude errors.
APPENDIX B: COMPARISON OF ROMAN–RUBIN GALAXY COLOUR WITH BPZ TEMPLATES
We show the coverage of BPZ templates adopted in this paper for the Roman–Rubin (DiffSky) simulation galaxy colours. We obtain template magnitudes in the LSST six-band filters by integrating each SED templates with the corresponding filter curves, with the template shifted in redshift range |$0\lt z\lt 3$|. We then compare the five colour distributions of the resultant templates with that of the Roman–Rubin galaxies (|$i \lt 24.9$|, corresponding to the Y5 Gold cut). The results are shown in Fig. B1. We see that the colour ranges of the simulated galaxies are captured by the BPZ templates used.

Colours in the LSST filters, for the Roman–Rubin (DiffSky) galaxies with |$i \lt 24.9$| (left), and that derived from the SED templates used in BPZ in the redshift range |$0 \lt z \lt 3$| (right).
APPENDIX C: LENS AND SOURCE TOMOGRAPHIC BIN DETAILS
This section includes some supplementary information for the lens and source tomographic bins for the mock photometry sample, as discussed in Section 3.3.
Fig. C1 shows a similar plot as Fig. 4, but with the y-axis in logarithmic scale, and extended to |$z=3$|. Only tomographic bins 1, 3, and 5 are shown for visual clarity. This scaling enhances the small, high-redshift population for both lens and source galaxies.

Tomographic redshift distribution of the Y3 sample for |${\rm qtl=0}$| (solid lines) and |${\rm qtl=9}$| (dashed lines) in log scale. Only tomographic bins 1, 3, and 5 are shown for visual clarity. The log scale highlights the tails towards high redshifts in each bin, which significantly impact the second moment of the distribution, |$\sigma _z$| for each case.
Table C1 shows the summary statistics on photo-z performance for BPZ and FZBoost at the 10 per cent shallowest i-band coadd depth (|${\rm qtl}=0$|) and deepest depth (|${\rm qtl}=9$|) for the 1, 3, and 5-yr mock LSST data. The summary statistics are: median bias, standard deviation (STD), normalized Median Absolute Deviation (NMAD), and outlier fraction. Tables C2 and C3 show the mean values of the various metrics over the depth quantiles, given the gold cut adjusted for each year. The metrics include mean galaxy number |$\bar{N}_{\rm gal}$| and mean redshift of the tomographic bin |$\langle z \rangle$| for both lens and source samples, and additionally the width metrics |$\sigma _z$| and |$W_z$| for lens samples. In the BPZ case, we include an additional case where we select objects with |${\rm odds}\ge 0.9$|.
The summary statistics on photo-z performance for BPZ and FZBoost at the 10 per cent shallowest i-band coadd depth (|${\rm qtl}=0$|) and deepest depth (|${\rm qtl}=9$|) for the 1, 3, and 5-yr mock LSST data, as shown in Fig. 3. Defining |$\Delta z = (z_{\rm phot}-z_{\rm true})/(1+z_{\rm true})$|, the summary statistics are: median bias, defined as the median of |$\Delta z$|, STD, defined as the standard deviation of |$\Delta z$|, the normalized MAD, defined as |$\sigma _{\rm NMAD}=1.48 {\rm Median}(|\Delta z|)$|, and outlier fraction, defined as the fraction of sample with |$|\Delta z|\gt 0.15$|. Both cases for full sample without cuts and for the high signal-to-noise sample with |${\rm SNR}\ge 10$| are shown. For BPZ, we also show the selection with |${\rm odds}\ge 0.9$|.
Sample . | . | |${\rm qtl}=0$| . | |${\rm qtl}=9$| . | ||||||
---|---|---|---|---|---|---|---|---|---|
. | . | Median bias . | STD . | |$\sigma _{\rm NMAD}$| . | Outlier fraction . | Median bias . | STD . | |$\sigma _{\rm NMAD}$| . | Outlier fraction . |
Y1 BPZ | Full | |$-0.011$| | 0.411 | 0.0772 | 20.1 per cent | |$-0.011$| | 0.404 | 0.0634 | 15.5 per cent |
|${\rm SNR}\ge 10$| | |$-0.005$| | 0.444 | 0.0632 | 14.2 per cent | |$-0.009$| | 0.409 | 0.0585 | 12.6 per cent | |
|${\rm odds}\ge 0.9$| | |$-0.001$| | 0.446 | 0.0431 | 5.8 per cent | |$-0.006$| | 0.388 | 0.0401 | 4.5 per cent | |
Y1 FZBoost | Full | 0.008 | 0.122 | 0.0479 | 7.2 per cent | |$-0.006$| | 0.082 | 0.0371 | 4.2 per cent |
|${\rm SNR}\ge 10$| | 0.008 | 0.072 | 0.0410 | 3.1 per cent | |$-0.004$| | 0.065 | 0.0351 | 2.4 per cent | |
Y3 BPZ | Full | |$-0.013$| | 0.380 | 0.0770 | 20.9 per cent | |$-0.011$| | 0.369 | 0.0613 | 14.3 per cent |
|${\rm SNR}\ge 10$| | |$-0.009$| | 0.399 | 0.0612 | 13.9 per cent | |$-0.010$| | 0.368 | 0.0586 | 12.1 per cent | |
|${\rm odds}\ge 0.9$| | |$-0.005$| | 0.388 | 0.0407 | 4.7 per cent | |$-0.008$| | 0.337 | 0.0421 | 4.2 per cent | |
Y3 FZBoost | Full | 0.005 | 0.145 | 0.0408 | 8.3 per cent | |$-0.004$| | 0.079 | 0.0257 | 4.2 per cent |
|${\rm SNR}\ge 10$| | 0.005 | 0.089 | 0.0326 | 3.1 per cent | |$-0.003$| | 0.065 | 0.0247 | 2.7 per cent | |
Y5 BPZ | Full | |$-0.013$| | 0.371 | 0.0774 | 21.3 per cent | |$-0.011$| | 0.353 | 0.0666 | 15.9 per cent |
|${\rm SNR}\ge 10$| | |$-0.009$| | 0.384 | 0.0620 | 14.2 per cent | |$-0.009$| | 0.350 | 0.0633 | 13.5 per cent | |
|${\rm odds}\ge 0.9$| | |$-0.006$| | 0.372 | 0.0403 | 4.8 per cent | |$-0.007$| | 0.330 | 0.0442 | 4.4 per cent | |
Y5 FZBoost | Full | 0.004 | 0.137 | 0.038 | 8.3 per cent | |$-0.003$| | 0.08 | 0.0256 | 4.7 per cent |
|${\rm SNR}\ge 10$| | 0.004 | 0.086 | 0.0305 | 3.4 per cent | |$-0.003$| | 0.068 | 0.0244 | 3.2 per cent |
Sample . | . | |${\rm qtl}=0$| . | |${\rm qtl}=9$| . | ||||||
---|---|---|---|---|---|---|---|---|---|
. | . | Median bias . | STD . | |$\sigma _{\rm NMAD}$| . | Outlier fraction . | Median bias . | STD . | |$\sigma _{\rm NMAD}$| . | Outlier fraction . |
Y1 BPZ | Full | |$-0.011$| | 0.411 | 0.0772 | 20.1 per cent | |$-0.011$| | 0.404 | 0.0634 | 15.5 per cent |
|${\rm SNR}\ge 10$| | |$-0.005$| | 0.444 | 0.0632 | 14.2 per cent | |$-0.009$| | 0.409 | 0.0585 | 12.6 per cent | |
|${\rm odds}\ge 0.9$| | |$-0.001$| | 0.446 | 0.0431 | 5.8 per cent | |$-0.006$| | 0.388 | 0.0401 | 4.5 per cent | |
Y1 FZBoost | Full | 0.008 | 0.122 | 0.0479 | 7.2 per cent | |$-0.006$| | 0.082 | 0.0371 | 4.2 per cent |
|${\rm SNR}\ge 10$| | 0.008 | 0.072 | 0.0410 | 3.1 per cent | |$-0.004$| | 0.065 | 0.0351 | 2.4 per cent | |
Y3 BPZ | Full | |$-0.013$| | 0.380 | 0.0770 | 20.9 per cent | |$-0.011$| | 0.369 | 0.0613 | 14.3 per cent |
|${\rm SNR}\ge 10$| | |$-0.009$| | 0.399 | 0.0612 | 13.9 per cent | |$-0.010$| | 0.368 | 0.0586 | 12.1 per cent | |
|${\rm odds}\ge 0.9$| | |$-0.005$| | 0.388 | 0.0407 | 4.7 per cent | |$-0.008$| | 0.337 | 0.0421 | 4.2 per cent | |
Y3 FZBoost | Full | 0.005 | 0.145 | 0.0408 | 8.3 per cent | |$-0.004$| | 0.079 | 0.0257 | 4.2 per cent |
|${\rm SNR}\ge 10$| | 0.005 | 0.089 | 0.0326 | 3.1 per cent | |$-0.003$| | 0.065 | 0.0247 | 2.7 per cent | |
Y5 BPZ | Full | |$-0.013$| | 0.371 | 0.0774 | 21.3 per cent | |$-0.011$| | 0.353 | 0.0666 | 15.9 per cent |
|${\rm SNR}\ge 10$| | |$-0.009$| | 0.384 | 0.0620 | 14.2 per cent | |$-0.009$| | 0.350 | 0.0633 | 13.5 per cent | |
|${\rm odds}\ge 0.9$| | |$-0.006$| | 0.372 | 0.0403 | 4.8 per cent | |$-0.007$| | 0.330 | 0.0442 | 4.4 per cent | |
Y5 FZBoost | Full | 0.004 | 0.137 | 0.038 | 8.3 per cent | |$-0.003$| | 0.08 | 0.0256 | 4.7 per cent |
|${\rm SNR}\ge 10$| | 0.004 | 0.086 | 0.0305 | 3.4 per cent | |$-0.003$| | 0.068 | 0.0244 | 3.2 per cent |
The summary statistics on photo-z performance for BPZ and FZBoost at the 10 per cent shallowest i-band coadd depth (|${\rm qtl}=0$|) and deepest depth (|${\rm qtl}=9$|) for the 1, 3, and 5-yr mock LSST data, as shown in Fig. 3. Defining |$\Delta z = (z_{\rm phot}-z_{\rm true})/(1+z_{\rm true})$|, the summary statistics are: median bias, defined as the median of |$\Delta z$|, STD, defined as the standard deviation of |$\Delta z$|, the normalized MAD, defined as |$\sigma _{\rm NMAD}=1.48 {\rm Median}(|\Delta z|)$|, and outlier fraction, defined as the fraction of sample with |$|\Delta z|\gt 0.15$|. Both cases for full sample without cuts and for the high signal-to-noise sample with |${\rm SNR}\ge 10$| are shown. For BPZ, we also show the selection with |${\rm odds}\ge 0.9$|.
Sample . | . | |${\rm qtl}=0$| . | |${\rm qtl}=9$| . | ||||||
---|---|---|---|---|---|---|---|---|---|
. | . | Median bias . | STD . | |$\sigma _{\rm NMAD}$| . | Outlier fraction . | Median bias . | STD . | |$\sigma _{\rm NMAD}$| . | Outlier fraction . |
Y1 BPZ | Full | |$-0.011$| | 0.411 | 0.0772 | 20.1 per cent | |$-0.011$| | 0.404 | 0.0634 | 15.5 per cent |
|${\rm SNR}\ge 10$| | |$-0.005$| | 0.444 | 0.0632 | 14.2 per cent | |$-0.009$| | 0.409 | 0.0585 | 12.6 per cent | |
|${\rm odds}\ge 0.9$| | |$-0.001$| | 0.446 | 0.0431 | 5.8 per cent | |$-0.006$| | 0.388 | 0.0401 | 4.5 per cent | |
Y1 FZBoost | Full | 0.008 | 0.122 | 0.0479 | 7.2 per cent | |$-0.006$| | 0.082 | 0.0371 | 4.2 per cent |
|${\rm SNR}\ge 10$| | 0.008 | 0.072 | 0.0410 | 3.1 per cent | |$-0.004$| | 0.065 | 0.0351 | 2.4 per cent | |
Y3 BPZ | Full | |$-0.013$| | 0.380 | 0.0770 | 20.9 per cent | |$-0.011$| | 0.369 | 0.0613 | 14.3 per cent |
|${\rm SNR}\ge 10$| | |$-0.009$| | 0.399 | 0.0612 | 13.9 per cent | |$-0.010$| | 0.368 | 0.0586 | 12.1 per cent | |
|${\rm odds}\ge 0.9$| | |$-0.005$| | 0.388 | 0.0407 | 4.7 per cent | |$-0.008$| | 0.337 | 0.0421 | 4.2 per cent | |
Y3 FZBoost | Full | 0.005 | 0.145 | 0.0408 | 8.3 per cent | |$-0.004$| | 0.079 | 0.0257 | 4.2 per cent |
|${\rm SNR}\ge 10$| | 0.005 | 0.089 | 0.0326 | 3.1 per cent | |$-0.003$| | 0.065 | 0.0247 | 2.7 per cent | |
Y5 BPZ | Full | |$-0.013$| | 0.371 | 0.0774 | 21.3 per cent | |$-0.011$| | 0.353 | 0.0666 | 15.9 per cent |
|${\rm SNR}\ge 10$| | |$-0.009$| | 0.384 | 0.0620 | 14.2 per cent | |$-0.009$| | 0.350 | 0.0633 | 13.5 per cent | |
|${\rm odds}\ge 0.9$| | |$-0.006$| | 0.372 | 0.0403 | 4.8 per cent | |$-0.007$| | 0.330 | 0.0442 | 4.4 per cent | |
Y5 FZBoost | Full | 0.004 | 0.137 | 0.038 | 8.3 per cent | |$-0.003$| | 0.08 | 0.0256 | 4.7 per cent |
|${\rm SNR}\ge 10$| | 0.004 | 0.086 | 0.0305 | 3.4 per cent | |$-0.003$| | 0.068 | 0.0244 | 3.2 per cent |
Sample . | . | |${\rm qtl}=0$| . | |${\rm qtl}=9$| . | ||||||
---|---|---|---|---|---|---|---|---|---|
. | . | Median bias . | STD . | |$\sigma _{\rm NMAD}$| . | Outlier fraction . | Median bias . | STD . | |$\sigma _{\rm NMAD}$| . | Outlier fraction . |
Y1 BPZ | Full | |$-0.011$| | 0.411 | 0.0772 | 20.1 per cent | |$-0.011$| | 0.404 | 0.0634 | 15.5 per cent |
|${\rm SNR}\ge 10$| | |$-0.005$| | 0.444 | 0.0632 | 14.2 per cent | |$-0.009$| | 0.409 | 0.0585 | 12.6 per cent | |
|${\rm odds}\ge 0.9$| | |$-0.001$| | 0.446 | 0.0431 | 5.8 per cent | |$-0.006$| | 0.388 | 0.0401 | 4.5 per cent | |
Y1 FZBoost | Full | 0.008 | 0.122 | 0.0479 | 7.2 per cent | |$-0.006$| | 0.082 | 0.0371 | 4.2 per cent |
|${\rm SNR}\ge 10$| | 0.008 | 0.072 | 0.0410 | 3.1 per cent | |$-0.004$| | 0.065 | 0.0351 | 2.4 per cent | |
Y3 BPZ | Full | |$-0.013$| | 0.380 | 0.0770 | 20.9 per cent | |$-0.011$| | 0.369 | 0.0613 | 14.3 per cent |
|${\rm SNR}\ge 10$| | |$-0.009$| | 0.399 | 0.0612 | 13.9 per cent | |$-0.010$| | 0.368 | 0.0586 | 12.1 per cent | |
|${\rm odds}\ge 0.9$| | |$-0.005$| | 0.388 | 0.0407 | 4.7 per cent | |$-0.008$| | 0.337 | 0.0421 | 4.2 per cent | |
Y3 FZBoost | Full | 0.005 | 0.145 | 0.0408 | 8.3 per cent | |$-0.004$| | 0.079 | 0.0257 | 4.2 per cent |
|${\rm SNR}\ge 10$| | 0.005 | 0.089 | 0.0326 | 3.1 per cent | |$-0.003$| | 0.065 | 0.0247 | 2.7 per cent | |
Y5 BPZ | Full | |$-0.013$| | 0.371 | 0.0774 | 21.3 per cent | |$-0.011$| | 0.353 | 0.0666 | 15.9 per cent |
|${\rm SNR}\ge 10$| | |$-0.009$| | 0.384 | 0.0620 | 14.2 per cent | |$-0.009$| | 0.350 | 0.0633 | 13.5 per cent | |
|${\rm odds}\ge 0.9$| | |$-0.006$| | 0.372 | 0.0403 | 4.8 per cent | |$-0.007$| | 0.330 | 0.0442 | 4.4 per cent | |
Y5 FZBoost | Full | 0.004 | 0.137 | 0.038 | 8.3 per cent | |$-0.003$| | 0.08 | 0.0256 | 4.7 per cent |
|${\rm SNR}\ge 10$| | 0.004 | 0.086 | 0.0305 | 3.4 per cent | |$-0.003$| | 0.068 | 0.0244 | 3.2 per cent |
The mean values of the metrics across all depth quantiles for samples with BPZ redshifts. The metrics per tomographic bin include number of galaxies |$\bar{N}_{\rm gal}$| and mean redshift |$\langle z \rangle$|. For lens galaxies, we compute two additional metrics regarding to the width of the tomographic bin: the second moment |$\sigma _z$| and the LSS diagnostic |$W_z$| defined in equation (15). Gold cut in the respective year and |${\rm SNR}\ge 10$| are applied to all samples, and a case with |${\rm odds}\ge 0.9$| is also included for comparison.
Sample . | Metric . | |${\rm SNR}\ge 10$| . | |${\rm SNR}\ge 10, {\rm odds}\ge 0.9$| . | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
. | . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . | Bin 5 . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . | Bin 5 . |
Y1 lens | |$\bar{N}_{\rm gal}$| | 28796.5 | 32021.9 | 35387.0 | 24288.8 | 8463.0 | 7635.1 | 6158.4 | 10002.0 | 6221.3 | 1875.5 |
|$\langle z \rangle$| | 0.362 | 0.537 | 0.714 | 0.882 | 1.050 | 0.337 | 0.549 | 0.716 | 0.890 | 1.056 | |
|$\sigma _z$| | 0.098 | 0.123 | 0.110 | 0.145 | 0.172 | 0.068 | 0.098 | 0.087 | 0.089 | 0.083 | |
|$W_z$| | 3.216 | 2.797 | 2.785 | 2.535 | 1.945 | 3.453 | 3.453 | 3.311 | 2.919 | 3.062 | |
Y1 source | |$\bar{N}_{\rm gal}$| | 30534.5 | 30534.5 | 30534.5 | 30534.6 | 30534.7 | 7042.0 | 7042.0 | 7042.0 | 7042.0 | 7042.0 |
|$\langle z \rangle$| | 0.350 | 0.482 | 0.661 | 0.813 | 0.883 | 0.310 | 0.508 | 0.678 | 0.817 | 0.845 | |
Y3 lens | |$\bar{N}_{\rm gal}$| | 38988.1 | 39330.9 | 50523.5 | 44645.0 | 21978.0 | 13904.9 | 12239.9 | 22208.5 | 14705.3 | 5822.9 |
|$\langle z \rangle$| | 0.373 | 0.541 | 0.728 | 0.912 | 1.067 | 0.345 | 0.550 | 0.721 | 0.895 | 1.064 | |
|$\sigma _z$| | 0.123 | 0.135 | 0.127 | 0.166 | 0.185 | 0.069 | 0.093 | 0.084 | 0.098 | 0.103 | |
|$W_z$| | 3.208 | 3.123 | 2.872 | 2.514 | 2.147 | 3.602 | 3.517 | 3.361 | 2.977 | 2.952 | |
Y3 source | |$\bar{N}_{\rm gal}$| | 48205.8 | 48205.8 | 48205.8 | 48205.5 | 48203.5 | 15578.9 | 15579.0 | 15579.0 | 15579.0 | 15579.0 |
|$\langle z \rangle$| | 0.384 | 0.557 | 0.755 | 0.954 | 1.033 | 0.332 | 0.577 | 0.726 | 0.866 | 0.973 | |
Y5 lens | |$\bar{N}_{\rm gal}$| | 44162.9 | 43177.3 | 58004.0 | 54826.6 | 31379.0 | 17002.9 | 15398.3 | 28564.3 | 19690.5 | 8850.2 |
|$\langle z \rangle$| | 0.383 | 0.545 | 0.736 | 0.931 | 1.085 | 0.348 | 0.549 | 0.723 | 0.898 | 1.064 | |
|$\sigma _z$| | 0.156 | 0.151 | 0.160 | 0.198 | 0.208 | 0.071 | 0.090 | 0.083 | 0.102 | 0.111 | |
|$W_z$| | 3.136 | 3.267 | 2.894 | 2.394 | 2.087 | 3.920 | 3.683 | 3.423 | 2.945 | 2.982 | |
Y5 source | |$\bar{N}_{\rm gal}$| | 59745.5 | 59747.1 | 59746.1 | 59745.0 | 59725.6 | 20746.7 | 20746.8 | 20746.7 | 20746.8 | 20744.1 |
|$\langle z \rangle$| | 0.417 | 0.594 | 0.804 | 1.028 | 1.150 | 0.345 | 0.599 | 0.751 | 0.904 | 1.019 |
Sample . | Metric . | |${\rm SNR}\ge 10$| . | |${\rm SNR}\ge 10, {\rm odds}\ge 0.9$| . | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
. | . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . | Bin 5 . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . | Bin 5 . |
Y1 lens | |$\bar{N}_{\rm gal}$| | 28796.5 | 32021.9 | 35387.0 | 24288.8 | 8463.0 | 7635.1 | 6158.4 | 10002.0 | 6221.3 | 1875.5 |
|$\langle z \rangle$| | 0.362 | 0.537 | 0.714 | 0.882 | 1.050 | 0.337 | 0.549 | 0.716 | 0.890 | 1.056 | |
|$\sigma _z$| | 0.098 | 0.123 | 0.110 | 0.145 | 0.172 | 0.068 | 0.098 | 0.087 | 0.089 | 0.083 | |
|$W_z$| | 3.216 | 2.797 | 2.785 | 2.535 | 1.945 | 3.453 | 3.453 | 3.311 | 2.919 | 3.062 | |
Y1 source | |$\bar{N}_{\rm gal}$| | 30534.5 | 30534.5 | 30534.5 | 30534.6 | 30534.7 | 7042.0 | 7042.0 | 7042.0 | 7042.0 | 7042.0 |
|$\langle z \rangle$| | 0.350 | 0.482 | 0.661 | 0.813 | 0.883 | 0.310 | 0.508 | 0.678 | 0.817 | 0.845 | |
Y3 lens | |$\bar{N}_{\rm gal}$| | 38988.1 | 39330.9 | 50523.5 | 44645.0 | 21978.0 | 13904.9 | 12239.9 | 22208.5 | 14705.3 | 5822.9 |
|$\langle z \rangle$| | 0.373 | 0.541 | 0.728 | 0.912 | 1.067 | 0.345 | 0.550 | 0.721 | 0.895 | 1.064 | |
|$\sigma _z$| | 0.123 | 0.135 | 0.127 | 0.166 | 0.185 | 0.069 | 0.093 | 0.084 | 0.098 | 0.103 | |
|$W_z$| | 3.208 | 3.123 | 2.872 | 2.514 | 2.147 | 3.602 | 3.517 | 3.361 | 2.977 | 2.952 | |
Y3 source | |$\bar{N}_{\rm gal}$| | 48205.8 | 48205.8 | 48205.8 | 48205.5 | 48203.5 | 15578.9 | 15579.0 | 15579.0 | 15579.0 | 15579.0 |
|$\langle z \rangle$| | 0.384 | 0.557 | 0.755 | 0.954 | 1.033 | 0.332 | 0.577 | 0.726 | 0.866 | 0.973 | |
Y5 lens | |$\bar{N}_{\rm gal}$| | 44162.9 | 43177.3 | 58004.0 | 54826.6 | 31379.0 | 17002.9 | 15398.3 | 28564.3 | 19690.5 | 8850.2 |
|$\langle z \rangle$| | 0.383 | 0.545 | 0.736 | 0.931 | 1.085 | 0.348 | 0.549 | 0.723 | 0.898 | 1.064 | |
|$\sigma _z$| | 0.156 | 0.151 | 0.160 | 0.198 | 0.208 | 0.071 | 0.090 | 0.083 | 0.102 | 0.111 | |
|$W_z$| | 3.136 | 3.267 | 2.894 | 2.394 | 2.087 | 3.920 | 3.683 | 3.423 | 2.945 | 2.982 | |
Y5 source | |$\bar{N}_{\rm gal}$| | 59745.5 | 59747.1 | 59746.1 | 59745.0 | 59725.6 | 20746.7 | 20746.8 | 20746.7 | 20746.8 | 20744.1 |
|$\langle z \rangle$| | 0.417 | 0.594 | 0.804 | 1.028 | 1.150 | 0.345 | 0.599 | 0.751 | 0.904 | 1.019 |
The mean values of the metrics across all depth quantiles for samples with BPZ redshifts. The metrics per tomographic bin include number of galaxies |$\bar{N}_{\rm gal}$| and mean redshift |$\langle z \rangle$|. For lens galaxies, we compute two additional metrics regarding to the width of the tomographic bin: the second moment |$\sigma _z$| and the LSS diagnostic |$W_z$| defined in equation (15). Gold cut in the respective year and |${\rm SNR}\ge 10$| are applied to all samples, and a case with |${\rm odds}\ge 0.9$| is also included for comparison.
Sample . | Metric . | |${\rm SNR}\ge 10$| . | |${\rm SNR}\ge 10, {\rm odds}\ge 0.9$| . | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
. | . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . | Bin 5 . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . | Bin 5 . |
Y1 lens | |$\bar{N}_{\rm gal}$| | 28796.5 | 32021.9 | 35387.0 | 24288.8 | 8463.0 | 7635.1 | 6158.4 | 10002.0 | 6221.3 | 1875.5 |
|$\langle z \rangle$| | 0.362 | 0.537 | 0.714 | 0.882 | 1.050 | 0.337 | 0.549 | 0.716 | 0.890 | 1.056 | |
|$\sigma _z$| | 0.098 | 0.123 | 0.110 | 0.145 | 0.172 | 0.068 | 0.098 | 0.087 | 0.089 | 0.083 | |
|$W_z$| | 3.216 | 2.797 | 2.785 | 2.535 | 1.945 | 3.453 | 3.453 | 3.311 | 2.919 | 3.062 | |
Y1 source | |$\bar{N}_{\rm gal}$| | 30534.5 | 30534.5 | 30534.5 | 30534.6 | 30534.7 | 7042.0 | 7042.0 | 7042.0 | 7042.0 | 7042.0 |
|$\langle z \rangle$| | 0.350 | 0.482 | 0.661 | 0.813 | 0.883 | 0.310 | 0.508 | 0.678 | 0.817 | 0.845 | |
Y3 lens | |$\bar{N}_{\rm gal}$| | 38988.1 | 39330.9 | 50523.5 | 44645.0 | 21978.0 | 13904.9 | 12239.9 | 22208.5 | 14705.3 | 5822.9 |
|$\langle z \rangle$| | 0.373 | 0.541 | 0.728 | 0.912 | 1.067 | 0.345 | 0.550 | 0.721 | 0.895 | 1.064 | |
|$\sigma _z$| | 0.123 | 0.135 | 0.127 | 0.166 | 0.185 | 0.069 | 0.093 | 0.084 | 0.098 | 0.103 | |
|$W_z$| | 3.208 | 3.123 | 2.872 | 2.514 | 2.147 | 3.602 | 3.517 | 3.361 | 2.977 | 2.952 | |
Y3 source | |$\bar{N}_{\rm gal}$| | 48205.8 | 48205.8 | 48205.8 | 48205.5 | 48203.5 | 15578.9 | 15579.0 | 15579.0 | 15579.0 | 15579.0 |
|$\langle z \rangle$| | 0.384 | 0.557 | 0.755 | 0.954 | 1.033 | 0.332 | 0.577 | 0.726 | 0.866 | 0.973 | |
Y5 lens | |$\bar{N}_{\rm gal}$| | 44162.9 | 43177.3 | 58004.0 | 54826.6 | 31379.0 | 17002.9 | 15398.3 | 28564.3 | 19690.5 | 8850.2 |
|$\langle z \rangle$| | 0.383 | 0.545 | 0.736 | 0.931 | 1.085 | 0.348 | 0.549 | 0.723 | 0.898 | 1.064 | |
|$\sigma _z$| | 0.156 | 0.151 | 0.160 | 0.198 | 0.208 | 0.071 | 0.090 | 0.083 | 0.102 | 0.111 | |
|$W_z$| | 3.136 | 3.267 | 2.894 | 2.394 | 2.087 | 3.920 | 3.683 | 3.423 | 2.945 | 2.982 | |
Y5 source | |$\bar{N}_{\rm gal}$| | 59745.5 | 59747.1 | 59746.1 | 59745.0 | 59725.6 | 20746.7 | 20746.8 | 20746.7 | 20746.8 | 20744.1 |
|$\langle z \rangle$| | 0.417 | 0.594 | 0.804 | 1.028 | 1.150 | 0.345 | 0.599 | 0.751 | 0.904 | 1.019 |
Sample . | Metric . | |${\rm SNR}\ge 10$| . | |${\rm SNR}\ge 10, {\rm odds}\ge 0.9$| . | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
. | . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . | Bin 5 . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . | Bin 5 . |
Y1 lens | |$\bar{N}_{\rm gal}$| | 28796.5 | 32021.9 | 35387.0 | 24288.8 | 8463.0 | 7635.1 | 6158.4 | 10002.0 | 6221.3 | 1875.5 |
|$\langle z \rangle$| | 0.362 | 0.537 | 0.714 | 0.882 | 1.050 | 0.337 | 0.549 | 0.716 | 0.890 | 1.056 | |
|$\sigma _z$| | 0.098 | 0.123 | 0.110 | 0.145 | 0.172 | 0.068 | 0.098 | 0.087 | 0.089 | 0.083 | |
|$W_z$| | 3.216 | 2.797 | 2.785 | 2.535 | 1.945 | 3.453 | 3.453 | 3.311 | 2.919 | 3.062 | |
Y1 source | |$\bar{N}_{\rm gal}$| | 30534.5 | 30534.5 | 30534.5 | 30534.6 | 30534.7 | 7042.0 | 7042.0 | 7042.0 | 7042.0 | 7042.0 |
|$\langle z \rangle$| | 0.350 | 0.482 | 0.661 | 0.813 | 0.883 | 0.310 | 0.508 | 0.678 | 0.817 | 0.845 | |
Y3 lens | |$\bar{N}_{\rm gal}$| | 38988.1 | 39330.9 | 50523.5 | 44645.0 | 21978.0 | 13904.9 | 12239.9 | 22208.5 | 14705.3 | 5822.9 |
|$\langle z \rangle$| | 0.373 | 0.541 | 0.728 | 0.912 | 1.067 | 0.345 | 0.550 | 0.721 | 0.895 | 1.064 | |
|$\sigma _z$| | 0.123 | 0.135 | 0.127 | 0.166 | 0.185 | 0.069 | 0.093 | 0.084 | 0.098 | 0.103 | |
|$W_z$| | 3.208 | 3.123 | 2.872 | 2.514 | 2.147 | 3.602 | 3.517 | 3.361 | 2.977 | 2.952 | |
Y3 source | |$\bar{N}_{\rm gal}$| | 48205.8 | 48205.8 | 48205.8 | 48205.5 | 48203.5 | 15578.9 | 15579.0 | 15579.0 | 15579.0 | 15579.0 |
|$\langle z \rangle$| | 0.384 | 0.557 | 0.755 | 0.954 | 1.033 | 0.332 | 0.577 | 0.726 | 0.866 | 0.973 | |
Y5 lens | |$\bar{N}_{\rm gal}$| | 44162.9 | 43177.3 | 58004.0 | 54826.6 | 31379.0 | 17002.9 | 15398.3 | 28564.3 | 19690.5 | 8850.2 |
|$\langle z \rangle$| | 0.383 | 0.545 | 0.736 | 0.931 | 1.085 | 0.348 | 0.549 | 0.723 | 0.898 | 1.064 | |
|$\sigma _z$| | 0.156 | 0.151 | 0.160 | 0.198 | 0.208 | 0.071 | 0.090 | 0.083 | 0.102 | 0.111 | |
|$W_z$| | 3.136 | 3.267 | 2.894 | 2.394 | 2.087 | 3.920 | 3.683 | 3.423 | 2.945 | 2.982 | |
Y5 source | |$\bar{N}_{\rm gal}$| | 59745.5 | 59747.1 | 59746.1 | 59745.0 | 59725.6 | 20746.7 | 20746.8 | 20746.7 | 20746.8 | 20744.1 |
|$\langle z \rangle$| | 0.417 | 0.594 | 0.804 | 1.028 | 1.150 | 0.345 | 0.599 | 0.751 | 0.904 | 1.019 |
Same as Table C2, but for FZBoost redshifts. All samples have |${\rm SNR}\ge 10$|.
Sample . | Metric . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . | Bin 5 . |
---|---|---|---|---|---|---|
Y1 lens | |$\bar{N}_{\rm gal}$| | 31054.7 | 38964.9 | 33717.0 | 29871.8 | 9104.6 |
|$\langle z \rangle$| | 0.327 | 0.515 | 0.706 | 0.865 | 1.083 | |
|$\sigma _z$| | 0.119 | 0.103 | 0.114 | 0.118 | 0.108 | |
|$W_z$| | 4.236 | 3.098 | 3.202 | 3.242 | 3.719 | |
Y1 source | |$\bar{N}_{\rm gal}$| | 30534.6 | 30534.5 | 30534.5 | 30534.5 | 30534.7 |
|$\langle z \rangle$| | 0.291 | 0.466 | 0.623 | 0.779 | 1.030 | |
Y3 lens | |$\bar{N}_{\rm gal}$| | 34498.5 | 52662.6 | 51564.7 | 52858.0 | 27561.2 |
|$\langle z \rangle$| | 0.320 | 0.500 | 0.702 | 0.894 | 1.093 | |
|$\sigma _z$| | 0.183 | 0.116 | 0.118 | 0.135 | 0.144 | |
|$W_z$| | 4.220 | 3.263 | 3.385 | 3.061 | 3.053 | |
Y3 source | |$\bar{N}_{\rm gal}$| | 48205.9 | 48205.8 | 48205.8 | 48205.9 | 48203.1 |
|$\langle z \rangle$| | 0.328 | 0.531 | 0.718 | 0.893 | 1.213 | |
Y5 lens | |$\bar{N}_{\rm gal}$| | 37286.1 | 57641.9 | 59009.5 | 63214.3 | 39344.7 |
|$\langle z \rangle$| | 0.339 | 0.510 | 0.709 | 0.905 | 1.104 | |
|$\sigma _z$| | 0.194 | 0.136 | 0.122 | 0.146 | 0.142 | |
|$W_z$| | 4.150 | 3.332 | 3.428 | 3.037 | 2.908 | |
Y5 source | |$\bar{N}_{\rm gal}$| | 59747.0 | 59747.1 | 59747.0 | 59747.2 | 59721.0 |
|$\langle z \rangle$| | 0.350 | 0.567 | 0.764 | 0.971 | 1.342 |
Sample . | Metric . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . | Bin 5 . |
---|---|---|---|---|---|---|
Y1 lens | |$\bar{N}_{\rm gal}$| | 31054.7 | 38964.9 | 33717.0 | 29871.8 | 9104.6 |
|$\langle z \rangle$| | 0.327 | 0.515 | 0.706 | 0.865 | 1.083 | |
|$\sigma _z$| | 0.119 | 0.103 | 0.114 | 0.118 | 0.108 | |
|$W_z$| | 4.236 | 3.098 | 3.202 | 3.242 | 3.719 | |
Y1 source | |$\bar{N}_{\rm gal}$| | 30534.6 | 30534.5 | 30534.5 | 30534.5 | 30534.7 |
|$\langle z \rangle$| | 0.291 | 0.466 | 0.623 | 0.779 | 1.030 | |
Y3 lens | |$\bar{N}_{\rm gal}$| | 34498.5 | 52662.6 | 51564.7 | 52858.0 | 27561.2 |
|$\langle z \rangle$| | 0.320 | 0.500 | 0.702 | 0.894 | 1.093 | |
|$\sigma _z$| | 0.183 | 0.116 | 0.118 | 0.135 | 0.144 | |
|$W_z$| | 4.220 | 3.263 | 3.385 | 3.061 | 3.053 | |
Y3 source | |$\bar{N}_{\rm gal}$| | 48205.9 | 48205.8 | 48205.8 | 48205.9 | 48203.1 |
|$\langle z \rangle$| | 0.328 | 0.531 | 0.718 | 0.893 | 1.213 | |
Y5 lens | |$\bar{N}_{\rm gal}$| | 37286.1 | 57641.9 | 59009.5 | 63214.3 | 39344.7 |
|$\langle z \rangle$| | 0.339 | 0.510 | 0.709 | 0.905 | 1.104 | |
|$\sigma _z$| | 0.194 | 0.136 | 0.122 | 0.146 | 0.142 | |
|$W_z$| | 4.150 | 3.332 | 3.428 | 3.037 | 2.908 | |
Y5 source | |$\bar{N}_{\rm gal}$| | 59747.0 | 59747.1 | 59747.0 | 59747.2 | 59721.0 |
|$\langle z \rangle$| | 0.350 | 0.567 | 0.764 | 0.971 | 1.342 |
Same as Table C2, but for FZBoost redshifts. All samples have |${\rm SNR}\ge 10$|.
Sample . | Metric . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . | Bin 5 . |
---|---|---|---|---|---|---|
Y1 lens | |$\bar{N}_{\rm gal}$| | 31054.7 | 38964.9 | 33717.0 | 29871.8 | 9104.6 |
|$\langle z \rangle$| | 0.327 | 0.515 | 0.706 | 0.865 | 1.083 | |
|$\sigma _z$| | 0.119 | 0.103 | 0.114 | 0.118 | 0.108 | |
|$W_z$| | 4.236 | 3.098 | 3.202 | 3.242 | 3.719 | |
Y1 source | |$\bar{N}_{\rm gal}$| | 30534.6 | 30534.5 | 30534.5 | 30534.5 | 30534.7 |
|$\langle z \rangle$| | 0.291 | 0.466 | 0.623 | 0.779 | 1.030 | |
Y3 lens | |$\bar{N}_{\rm gal}$| | 34498.5 | 52662.6 | 51564.7 | 52858.0 | 27561.2 |
|$\langle z \rangle$| | 0.320 | 0.500 | 0.702 | 0.894 | 1.093 | |
|$\sigma _z$| | 0.183 | 0.116 | 0.118 | 0.135 | 0.144 | |
|$W_z$| | 4.220 | 3.263 | 3.385 | 3.061 | 3.053 | |
Y3 source | |$\bar{N}_{\rm gal}$| | 48205.9 | 48205.8 | 48205.8 | 48205.9 | 48203.1 |
|$\langle z \rangle$| | 0.328 | 0.531 | 0.718 | 0.893 | 1.213 | |
Y5 lens | |$\bar{N}_{\rm gal}$| | 37286.1 | 57641.9 | 59009.5 | 63214.3 | 39344.7 |
|$\langle z \rangle$| | 0.339 | 0.510 | 0.709 | 0.905 | 1.104 | |
|$\sigma _z$| | 0.194 | 0.136 | 0.122 | 0.146 | 0.142 | |
|$W_z$| | 4.150 | 3.332 | 3.428 | 3.037 | 2.908 | |
Y5 source | |$\bar{N}_{\rm gal}$| | 59747.0 | 59747.1 | 59747.0 | 59747.2 | 59721.0 |
|$\langle z \rangle$| | 0.350 | 0.567 | 0.764 | 0.971 | 1.342 |
Sample . | Metric . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . | Bin 5 . |
---|---|---|---|---|---|---|
Y1 lens | |$\bar{N}_{\rm gal}$| | 31054.7 | 38964.9 | 33717.0 | 29871.8 | 9104.6 |
|$\langle z \rangle$| | 0.327 | 0.515 | 0.706 | 0.865 | 1.083 | |
|$\sigma _z$| | 0.119 | 0.103 | 0.114 | 0.118 | 0.108 | |
|$W_z$| | 4.236 | 3.098 | 3.202 | 3.242 | 3.719 | |
Y1 source | |$\bar{N}_{\rm gal}$| | 30534.6 | 30534.5 | 30534.5 | 30534.5 | 30534.7 |
|$\langle z \rangle$| | 0.291 | 0.466 | 0.623 | 0.779 | 1.030 | |
Y3 lens | |$\bar{N}_{\rm gal}$| | 34498.5 | 52662.6 | 51564.7 | 52858.0 | 27561.2 |
|$\langle z \rangle$| | 0.320 | 0.500 | 0.702 | 0.894 | 1.093 | |
|$\sigma _z$| | 0.183 | 0.116 | 0.118 | 0.135 | 0.144 | |
|$W_z$| | 4.220 | 3.263 | 3.385 | 3.061 | 3.053 | |
Y3 source | |$\bar{N}_{\rm gal}$| | 48205.9 | 48205.8 | 48205.8 | 48205.9 | 48203.1 |
|$\langle z \rangle$| | 0.328 | 0.531 | 0.718 | 0.893 | 1.213 | |
Y5 lens | |$\bar{N}_{\rm gal}$| | 37286.1 | 57641.9 | 59009.5 | 63214.3 | 39344.7 |
|$\langle z \rangle$| | 0.339 | 0.510 | 0.709 | 0.905 | 1.104 | |
|$\sigma _z$| | 0.194 | 0.136 | 0.122 | 0.146 | 0.142 | |
|$W_z$| | 4.150 | 3.332 | 3.428 | 3.037 | 2.908 | |
Y5 source | |$\bar{N}_{\rm gal}$| | 59747.0 | 59747.1 | 59747.0 | 59747.2 | 59721.0 |
|$\langle z \rangle$| | 0.350 | 0.567 | 0.764 | 0.971 | 1.342 |
APPENDIX D: VARIATION WITH OTHER SURVEY PROPERTIES
In the main analysis, we investigated the trend of galaxy number and redshift distribution as a function of the i-band coadd depth. We considered the i-band depth to be most impactful because it is the detection band, and fluxes in all other bands are measured with forced photometry based on the i-band detection. However, other survey properties can also be important. For example, u-band is important for the quality of photo-z estimation, so extreme variation in the u-band depth could cause additional scatter. The effective seeing could be another important factor, which directly impact the noise-to-signal for extended objects. We investigate the variation of galaxy number density and photo-z properties with these other survey properties in this section.
Table D1 summarizes the mean and standard deviation of the coadd depth in the other five LSST bands and the median effective seeing for Y3 survey properties from Rubin OpSim baseline v3.3. The other years show a similar trend, although Y1 has a larger scatter. We see that there is a strong correlation between the i-band depth and these other survey properties. On average, a deeper i-band quantile also contains deeper coadd depth in all other five bands, as well as a smaller median effective seeing, with more scatter in the latter.
The mean and standard deviation of all other survey condition maps used for degradation in this work in 10 quantiles of i-band depth, as shown in Table 1. This particular example shows the case for year 3, but year 1 and year 5 follow a similar trend. There is a strong correlation between these survey conditions and the i-band depth.
Prop. . | band . | i-band |$m_5^{\rm ex}$| quantiles . | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
. | . | 0 . | 1 . | 2 . | 3 . | 4 . | 5 . | 6 . | 7 . | 8 . | 9 . |
|$m_5^{\rm ex}$| | u | |$24.00\pm 0.24$| | |$24.19\pm 0.20$| | |$24.30\pm 0.19$| | |$24.40\pm 0.20$| | |$24.49\pm 0.19$| | |$24.57\pm 0.18$| | |$24.63\pm 0.16$| | |$24.68\pm 0.15$| | |$24.76\pm 0.15$| | |$24.91\pm 0.14$| |
g | |$25.66\pm 0.18$| | |$25.82\pm 0.16$| | |$25.91\pm 0.16$| | |$25.99\pm 0.15$| | |$26.06\pm 0.15$| | |$26.13\pm 0.13$| | |$26.18\pm 0.12$| | |$26.22\pm 0.12$| | |$26.31\pm 0.14$| | |$26.49\pm 0.13$| | |
r | |$25.88\pm 0.13$| | |$26.05\pm 0.09$| | |$26.13\pm 0.09$| | |$26.20\pm 0.08$| | |$26.26\pm 0.08$| | |$26.31\pm 0.07$| | |$26.36\pm 0.07$| | |$26.41\pm 0.07$| | |$26.49\pm 0.08$| | |$26.62\pm 0.07$| | |
z | |$24.80\pm 0.13$| | |$24.95\pm 0.09$| | |$25.02\pm 0.09$| | |$25.08\pm 0.08$| | |$25.13\pm 0.08$| | |$25.18\pm 0.07$| | |$25.22\pm 0.07$| | |$25.27\pm 0.07$| | |$25.35\pm 0.07$| | |$25.47\pm 0.07$| | |
y | |$23.89\pm 0.15$| | |$24.03\pm 0.11$| | |$24.10\pm 0.11$| | |$24.16\pm 0.11$| | |$24.21\pm 0.10$| | |$24.25\pm 0.09$| | |$24.28\pm 0.09$| | |$24.32\pm 0.10$| | |$24.40\pm 0.09$| | |$24.50\pm 0.08$| | |
|$\theta _{\rm FWHM}^{\rm eff}$| | u | |$1.25\pm 0.11$| | |$1.20\pm 0.12$| | |$1.17\pm 0.12$| | |$1.15\pm 0.12$| | |$1.13\pm 0.12$| | |$1.12\pm 0.13$| | |$1.10\pm 0.12$| | |$1.10\pm 0.12$| | |$1.12\pm 0.13$| | |$1.05\pm 0.10$| |
g | |$1.07\pm 0.12$| | |$1.02\pm 0.13$| | |$1.00\pm 0.14$| | |$1.00\pm 0.14$| | |$0.99\pm 0.13$| | |$0.97\pm 0.13$| | |$0.96\pm 0.12$| | |$0.96\pm 0.12$| | |$0.96\pm 0.11$| | |$0.89\pm 0.09$| | |
r | |$1.04\pm 0.09$| | |$0.98\pm 0.09$| | |$0.96\pm 0.09$| | |$0.94\pm 0.09$| | |$0.92\pm 0.08$| | |$0.91\pm 0.07$| | |$0.90\pm 0.07$| | |$0.90\pm 0.06$| | |$0.89\pm 0.06$| | |$0.85\pm 0.05$| | |
i | |$1.01\pm 0.08$| | |$0.95\pm 0.07$| | |$0.94\pm 0.08$| | |$0.93\pm 0.08$| | |$0.91\pm 0.07$| | |$0.90\pm 0.07$| | |$0.88\pm 0.06$| | |$0.87\pm 0.06$| | |$0.87\pm 0.06$| | |$0.83\pm 0.04$| | |
z | |$1.00\pm 0.07$| | |$0.97\pm 0.07$| | |$0.96\pm 0.07$| | |$0.95\pm 0.07$| | |$0.94\pm 0.07$| | |$0.93\pm 0.07$| | |$0.92\pm 0.07$| | |$0.91\pm 0.07$| | |$0.91\pm 0.07$| | |$0.86\pm 0.05$| | |
y | |$1.02\pm 0.07$| | |$0.97\pm 0.06$| | |$0.96\pm 0.06$| | |$0.95\pm 0.06$| | |$0.94\pm 0.06$| | |$0.93\pm 0.06$| | |$0.93\pm 0.06$| | |$0.92\pm 0.06$| | |$0.91\pm 0.05$| | |$0.88\pm 0.04$| |
Prop. . | band . | i-band |$m_5^{\rm ex}$| quantiles . | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
. | . | 0 . | 1 . | 2 . | 3 . | 4 . | 5 . | 6 . | 7 . | 8 . | 9 . |
|$m_5^{\rm ex}$| | u | |$24.00\pm 0.24$| | |$24.19\pm 0.20$| | |$24.30\pm 0.19$| | |$24.40\pm 0.20$| | |$24.49\pm 0.19$| | |$24.57\pm 0.18$| | |$24.63\pm 0.16$| | |$24.68\pm 0.15$| | |$24.76\pm 0.15$| | |$24.91\pm 0.14$| |
g | |$25.66\pm 0.18$| | |$25.82\pm 0.16$| | |$25.91\pm 0.16$| | |$25.99\pm 0.15$| | |$26.06\pm 0.15$| | |$26.13\pm 0.13$| | |$26.18\pm 0.12$| | |$26.22\pm 0.12$| | |$26.31\pm 0.14$| | |$26.49\pm 0.13$| | |
r | |$25.88\pm 0.13$| | |$26.05\pm 0.09$| | |$26.13\pm 0.09$| | |$26.20\pm 0.08$| | |$26.26\pm 0.08$| | |$26.31\pm 0.07$| | |$26.36\pm 0.07$| | |$26.41\pm 0.07$| | |$26.49\pm 0.08$| | |$26.62\pm 0.07$| | |
z | |$24.80\pm 0.13$| | |$24.95\pm 0.09$| | |$25.02\pm 0.09$| | |$25.08\pm 0.08$| | |$25.13\pm 0.08$| | |$25.18\pm 0.07$| | |$25.22\pm 0.07$| | |$25.27\pm 0.07$| | |$25.35\pm 0.07$| | |$25.47\pm 0.07$| | |
y | |$23.89\pm 0.15$| | |$24.03\pm 0.11$| | |$24.10\pm 0.11$| | |$24.16\pm 0.11$| | |$24.21\pm 0.10$| | |$24.25\pm 0.09$| | |$24.28\pm 0.09$| | |$24.32\pm 0.10$| | |$24.40\pm 0.09$| | |$24.50\pm 0.08$| | |
|$\theta _{\rm FWHM}^{\rm eff}$| | u | |$1.25\pm 0.11$| | |$1.20\pm 0.12$| | |$1.17\pm 0.12$| | |$1.15\pm 0.12$| | |$1.13\pm 0.12$| | |$1.12\pm 0.13$| | |$1.10\pm 0.12$| | |$1.10\pm 0.12$| | |$1.12\pm 0.13$| | |$1.05\pm 0.10$| |
g | |$1.07\pm 0.12$| | |$1.02\pm 0.13$| | |$1.00\pm 0.14$| | |$1.00\pm 0.14$| | |$0.99\pm 0.13$| | |$0.97\pm 0.13$| | |$0.96\pm 0.12$| | |$0.96\pm 0.12$| | |$0.96\pm 0.11$| | |$0.89\pm 0.09$| | |
r | |$1.04\pm 0.09$| | |$0.98\pm 0.09$| | |$0.96\pm 0.09$| | |$0.94\pm 0.09$| | |$0.92\pm 0.08$| | |$0.91\pm 0.07$| | |$0.90\pm 0.07$| | |$0.90\pm 0.06$| | |$0.89\pm 0.06$| | |$0.85\pm 0.05$| | |
i | |$1.01\pm 0.08$| | |$0.95\pm 0.07$| | |$0.94\pm 0.08$| | |$0.93\pm 0.08$| | |$0.91\pm 0.07$| | |$0.90\pm 0.07$| | |$0.88\pm 0.06$| | |$0.87\pm 0.06$| | |$0.87\pm 0.06$| | |$0.83\pm 0.04$| | |
z | |$1.00\pm 0.07$| | |$0.97\pm 0.07$| | |$0.96\pm 0.07$| | |$0.95\pm 0.07$| | |$0.94\pm 0.07$| | |$0.93\pm 0.07$| | |$0.92\pm 0.07$| | |$0.91\pm 0.07$| | |$0.91\pm 0.07$| | |$0.86\pm 0.05$| | |
y | |$1.02\pm 0.07$| | |$0.97\pm 0.06$| | |$0.96\pm 0.06$| | |$0.95\pm 0.06$| | |$0.94\pm 0.06$| | |$0.93\pm 0.06$| | |$0.93\pm 0.06$| | |$0.92\pm 0.06$| | |$0.91\pm 0.05$| | |$0.88\pm 0.04$| |
The mean and standard deviation of all other survey condition maps used for degradation in this work in 10 quantiles of i-band depth, as shown in Table 1. This particular example shows the case for year 3, but year 1 and year 5 follow a similar trend. There is a strong correlation between these survey conditions and the i-band depth.
Prop. . | band . | i-band |$m_5^{\rm ex}$| quantiles . | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
. | . | 0 . | 1 . | 2 . | 3 . | 4 . | 5 . | 6 . | 7 . | 8 . | 9 . |
|$m_5^{\rm ex}$| | u | |$24.00\pm 0.24$| | |$24.19\pm 0.20$| | |$24.30\pm 0.19$| | |$24.40\pm 0.20$| | |$24.49\pm 0.19$| | |$24.57\pm 0.18$| | |$24.63\pm 0.16$| | |$24.68\pm 0.15$| | |$24.76\pm 0.15$| | |$24.91\pm 0.14$| |
g | |$25.66\pm 0.18$| | |$25.82\pm 0.16$| | |$25.91\pm 0.16$| | |$25.99\pm 0.15$| | |$26.06\pm 0.15$| | |$26.13\pm 0.13$| | |$26.18\pm 0.12$| | |$26.22\pm 0.12$| | |$26.31\pm 0.14$| | |$26.49\pm 0.13$| | |
r | |$25.88\pm 0.13$| | |$26.05\pm 0.09$| | |$26.13\pm 0.09$| | |$26.20\pm 0.08$| | |$26.26\pm 0.08$| | |$26.31\pm 0.07$| | |$26.36\pm 0.07$| | |$26.41\pm 0.07$| | |$26.49\pm 0.08$| | |$26.62\pm 0.07$| | |
z | |$24.80\pm 0.13$| | |$24.95\pm 0.09$| | |$25.02\pm 0.09$| | |$25.08\pm 0.08$| | |$25.13\pm 0.08$| | |$25.18\pm 0.07$| | |$25.22\pm 0.07$| | |$25.27\pm 0.07$| | |$25.35\pm 0.07$| | |$25.47\pm 0.07$| | |
y | |$23.89\pm 0.15$| | |$24.03\pm 0.11$| | |$24.10\pm 0.11$| | |$24.16\pm 0.11$| | |$24.21\pm 0.10$| | |$24.25\pm 0.09$| | |$24.28\pm 0.09$| | |$24.32\pm 0.10$| | |$24.40\pm 0.09$| | |$24.50\pm 0.08$| | |
|$\theta _{\rm FWHM}^{\rm eff}$| | u | |$1.25\pm 0.11$| | |$1.20\pm 0.12$| | |$1.17\pm 0.12$| | |$1.15\pm 0.12$| | |$1.13\pm 0.12$| | |$1.12\pm 0.13$| | |$1.10\pm 0.12$| | |$1.10\pm 0.12$| | |$1.12\pm 0.13$| | |$1.05\pm 0.10$| |
g | |$1.07\pm 0.12$| | |$1.02\pm 0.13$| | |$1.00\pm 0.14$| | |$1.00\pm 0.14$| | |$0.99\pm 0.13$| | |$0.97\pm 0.13$| | |$0.96\pm 0.12$| | |$0.96\pm 0.12$| | |$0.96\pm 0.11$| | |$0.89\pm 0.09$| | |
r | |$1.04\pm 0.09$| | |$0.98\pm 0.09$| | |$0.96\pm 0.09$| | |$0.94\pm 0.09$| | |$0.92\pm 0.08$| | |$0.91\pm 0.07$| | |$0.90\pm 0.07$| | |$0.90\pm 0.06$| | |$0.89\pm 0.06$| | |$0.85\pm 0.05$| | |
i | |$1.01\pm 0.08$| | |$0.95\pm 0.07$| | |$0.94\pm 0.08$| | |$0.93\pm 0.08$| | |$0.91\pm 0.07$| | |$0.90\pm 0.07$| | |$0.88\pm 0.06$| | |$0.87\pm 0.06$| | |$0.87\pm 0.06$| | |$0.83\pm 0.04$| | |
z | |$1.00\pm 0.07$| | |$0.97\pm 0.07$| | |$0.96\pm 0.07$| | |$0.95\pm 0.07$| | |$0.94\pm 0.07$| | |$0.93\pm 0.07$| | |$0.92\pm 0.07$| | |$0.91\pm 0.07$| | |$0.91\pm 0.07$| | |$0.86\pm 0.05$| | |
y | |$1.02\pm 0.07$| | |$0.97\pm 0.06$| | |$0.96\pm 0.06$| | |$0.95\pm 0.06$| | |$0.94\pm 0.06$| | |$0.93\pm 0.06$| | |$0.93\pm 0.06$| | |$0.92\pm 0.06$| | |$0.91\pm 0.05$| | |$0.88\pm 0.04$| |
Prop. . | band . | i-band |$m_5^{\rm ex}$| quantiles . | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
. | . | 0 . | 1 . | 2 . | 3 . | 4 . | 5 . | 6 . | 7 . | 8 . | 9 . |
|$m_5^{\rm ex}$| | u | |$24.00\pm 0.24$| | |$24.19\pm 0.20$| | |$24.30\pm 0.19$| | |$24.40\pm 0.20$| | |$24.49\pm 0.19$| | |$24.57\pm 0.18$| | |$24.63\pm 0.16$| | |$24.68\pm 0.15$| | |$24.76\pm 0.15$| | |$24.91\pm 0.14$| |
g | |$25.66\pm 0.18$| | |$25.82\pm 0.16$| | |$25.91\pm 0.16$| | |$25.99\pm 0.15$| | |$26.06\pm 0.15$| | |$26.13\pm 0.13$| | |$26.18\pm 0.12$| | |$26.22\pm 0.12$| | |$26.31\pm 0.14$| | |$26.49\pm 0.13$| | |
r | |$25.88\pm 0.13$| | |$26.05\pm 0.09$| | |$26.13\pm 0.09$| | |$26.20\pm 0.08$| | |$26.26\pm 0.08$| | |$26.31\pm 0.07$| | |$26.36\pm 0.07$| | |$26.41\pm 0.07$| | |$26.49\pm 0.08$| | |$26.62\pm 0.07$| | |
z | |$24.80\pm 0.13$| | |$24.95\pm 0.09$| | |$25.02\pm 0.09$| | |$25.08\pm 0.08$| | |$25.13\pm 0.08$| | |$25.18\pm 0.07$| | |$25.22\pm 0.07$| | |$25.27\pm 0.07$| | |$25.35\pm 0.07$| | |$25.47\pm 0.07$| | |
y | |$23.89\pm 0.15$| | |$24.03\pm 0.11$| | |$24.10\pm 0.11$| | |$24.16\pm 0.11$| | |$24.21\pm 0.10$| | |$24.25\pm 0.09$| | |$24.28\pm 0.09$| | |$24.32\pm 0.10$| | |$24.40\pm 0.09$| | |$24.50\pm 0.08$| | |
|$\theta _{\rm FWHM}^{\rm eff}$| | u | |$1.25\pm 0.11$| | |$1.20\pm 0.12$| | |$1.17\pm 0.12$| | |$1.15\pm 0.12$| | |$1.13\pm 0.12$| | |$1.12\pm 0.13$| | |$1.10\pm 0.12$| | |$1.10\pm 0.12$| | |$1.12\pm 0.13$| | |$1.05\pm 0.10$| |
g | |$1.07\pm 0.12$| | |$1.02\pm 0.13$| | |$1.00\pm 0.14$| | |$1.00\pm 0.14$| | |$0.99\pm 0.13$| | |$0.97\pm 0.13$| | |$0.96\pm 0.12$| | |$0.96\pm 0.12$| | |$0.96\pm 0.11$| | |$0.89\pm 0.09$| | |
r | |$1.04\pm 0.09$| | |$0.98\pm 0.09$| | |$0.96\pm 0.09$| | |$0.94\pm 0.09$| | |$0.92\pm 0.08$| | |$0.91\pm 0.07$| | |$0.90\pm 0.07$| | |$0.90\pm 0.06$| | |$0.89\pm 0.06$| | |$0.85\pm 0.05$| | |
i | |$1.01\pm 0.08$| | |$0.95\pm 0.07$| | |$0.94\pm 0.08$| | |$0.93\pm 0.08$| | |$0.91\pm 0.07$| | |$0.90\pm 0.07$| | |$0.88\pm 0.06$| | |$0.87\pm 0.06$| | |$0.87\pm 0.06$| | |$0.83\pm 0.04$| | |
z | |$1.00\pm 0.07$| | |$0.97\pm 0.07$| | |$0.96\pm 0.07$| | |$0.95\pm 0.07$| | |$0.94\pm 0.07$| | |$0.93\pm 0.07$| | |$0.92\pm 0.07$| | |$0.91\pm 0.07$| | |$0.91\pm 0.07$| | |$0.86\pm 0.05$| | |
y | |$1.02\pm 0.07$| | |$0.97\pm 0.06$| | |$0.96\pm 0.06$| | |$0.95\pm 0.06$| | |$0.94\pm 0.06$| | |$0.93\pm 0.06$| | |$0.93\pm 0.06$| | |$0.92\pm 0.06$| | |$0.91\pm 0.05$| | |$0.88\pm 0.04$| |
To check the dependences of other survey properties, we subdivide each of the i-band deciles into five subquantiles of another survey property (such as depth in another band), and check the variation of the metrics, i.e. number of objects |$N_{\rm gal}$|, mean redshift |$\langle z \rangle$|, and width of the redshift bin |$\sigma _z$|, with these properties. As a reference, we also compute and compare the variation with subquantiles of the i-band depth itself. In this section, we show two representative examples for source tomographic bins determined by FZboost photo-z: the subquantiles in coadd u-band depth and the i-band seeing, for the fainest, median, and deepest i-band deciles: |${\rm qtl}=0, 5, 9$|. In the results presented here, we overplot the variation from the i-band depth subquantiles (as faint, dashed lines) on top of that from the other survey properties (as solid lines), for visual comparison. That is, one can read off the level of fluctuation from the deepest and shallowest u-band depth sub-bin, for example, and compare it with that from the deepest and shallowest i-band depth sub-bin. It should be noted, however, that these reference i-band split cases have a different actual x-axis values from those shown in the plots.
The results are shown in Figs D1 and D2, respectively. We see that in general, these trends are consistent with the i-band depth fluctuation for all three metrics: the deeper (smaller) the depth (seeing), the more objects included in the sample, the higher the mean redshift of the tomographic bin, and the larger |$\sigma _z$|. Also, |${\rm qtl}=0$| has a significantly larger variation compared to |${\rm qtl}=9$| in most cases. Compared to the trends in the i-band depth sub-bins, we see that the |$N_{\rm gal}$| variations are always less strong for other properties. This is understood as selections are primarily taken in i-band. The |$\langle z \rangle$| variations for the u-band tightly follows the i-band, although the first bin can have slightly larger fluctuations. For seeing, on the other hand, the trend is quite different for |${\rm qtl}=0$|, where the smallest seeing does not always correspond to a higher mean redshift. This could happen because the seeing is not as well correlated with depth – there are more scatter in the coadd depth and seeing at the faint end. Finally, the variation in |$\sigma _z$| seems to be relatively minor in most cases.

The variation of the number of objects, |$N_{\rm gal}$|, the mean redshift, |$\langle z \rangle$|, and the standard deviation, |$\sigma _z$|, of the Y3 source tomographic bins, as a function of the extinction-corrected u-band coadd depth, |$m_5^{\rm ex}$|. The u-band depth bins are determined by 5 quantiles subdividing each of the i-band quantiles used in the main analysis. Examples shown here are for the i-band quantiles 0 (dark blue), 5 (purple), and 9 (pink). The tomographic bins are split by FZBoost photo-z. The horizontal lines indicate the combined values as shown in Figs 5–7. The faint, dashed lines indicate a reference case where the subdivision is done for 5 quantiles in the i-band depth. Notice that the i-band split case is only overplotted here to provide a visual comparison of the level of fluctuations, but its actual x-axis values do not align with those on the figure, which are for the u-band depth.

Same as Fig. D1, but for subdivision in the i-band median effective seeing, |$\theta _{\rm FWHM}^{\rm eff}$|. Notice here that the reference i-band depth subquantiles, as indicated by the faint dashed lines, are flipped, i.e. the first quantile in i-band depth is overplotted on top of the last quantile in the i-band seeing. This is because, on average, a deeper coadd depth corresponds to a smaller seeing angle.
From these exercises, we see that within each i-band decile, the number of objects and |$p(z)$| properties can still change significantly with other survey properties such as u-band depth and i-band seeing. Meanwhile, given that these quantities are also quite tightly correlated, we expect that a lot of these variations are also due to the covariation of the i-band depth. Hence, our main analysis, by splitting into the i-band quantiles, should capture the level of variations of the metrics. However, if one wishes to apply this method in e.g. forward modelling, then covariation of all bands need to be taken into account.
APPENDIX E: MAGLIM-LIKE LENS SAMPLE
In this section, we explore the impact of variable depth on a lens sample selected with the DES Y3 MagLim cuts (Porredon et al. 2022). Because this sample has a brighter cut, we relax the i-band signal-to-noise limit to |${\rm SNR\ge 5}$|. The sample is selected with
where we use the FZBoost mode redshift as |$z_{\rm phot}$|. This cut reduces the number of lens sample significantly compared to our fiducial case, resulting in a total sample size of 3.67 per cent of the baseline (Gold cut) lens sample. The true redshift distribution of each tomographic bin is shown in Fig. E1, where the dashed lines show those from the shallowest quantile, and the solid lines show those from the deepest. Notice that the distribution is less smooth due to the sparsity of the sample. Overall, thanks to the bright cut, the redshift distribution for each bin has a smaller tail compared to the baseline case, especially for the highest redshift bin.

True redshift distribution of the LSST Y3 MagLim lens sample, split in tomographic bins as defined in the DESC SRD. The MagLim cuts and the tomographic edhes are determined using the mode of FZBoost redshifts. The sample has also been applied a cut with |${\rm SNR}\ge 5$|. The dashed lines show samples degraded using the shallowest 10 per cent pixels in i-band coadd depth (|${\rm qtl}=0$|), and the solid lines show those from the deepest 10 per cent (|${\rm qtl}=9$|).
Fig. E2 shows the metrics for the variable depth, namely, the galaxy number, mean redshift, and width of the tomographic bin, as a function of the i-band depth. The panels (a)–(d) has the same style as, and should be compared to Figs 5–7. Again, we see a significantly milder, but visible, trend of these metrics with depth, owing to the bright magnitude cut. This shows that the variable depth effect can be greatly reduced, but not completed removed, by introducing a bright cut at the cost of sample size.

Metrics for impact of variable depth for the LSST Y3 MagLim lens sample, split in five tomographic bins. (a). The fractional change in number of galaxies, |$N_{\rm gal}/\bar{N}_{\rm gal}$| in tomographic bins as a function of the i-band extinction-corrected coadd depth, |$m_5^{\rm ex}$|; (b). The scaled shifts in mean redshift, |$\Delta \langle z \rangle / (1+\langle z \rangle)$| as a function of |$m_5^{\rm ex}$|; (c). The fractional change in second moment of the redshift distribution, |$\sigma _z/\sigma _z^{\rm tot}$|, as a function of |$m_5^{\rm ex}$|; (d). The fractional change in the LSS-related kernel, |$W_z/W_z^{\rm tot}$|, as a function of |$m_5^{\rm ex}$|. The MagLim cuts and the tomographic bins edges are determined using the mode of FZBoost redshifts, and the sample has an i-band |${\rm SNR}\ge 5$|. The vertical solid and dashed lines marks the |$1\sigma$| and |$2\sigma$| regions of the depth distribution.
Fig. E3 shows the effect propagated to the galaxy clustering two-point data vector, |$C_{\ell }^{gg}$|. We followed the same procedure as in Section 5.2, and set the number density in each bin to be |$0.135, 0.117, 0.156, 0.219, 0.267$| arcmin|$^{-2}$| to account for the reduction in the overall number density compared to the fiducial case. The impact of variable depth on |$C_{\ell }^{gg}$| is also significantly reduced, especially for (4,4) and (5,5). However, the impact is not negligible still at |$\ell \lt 100$|.
