-
PDF
- Split View
-
Views
-
Cite
Cite
B Hoyle, D Gruen, G M Bernstein, M M Rau, J De Vicente, W G Hartley, E Gaztanaga, J DeRose, M A Troxel, C Davis, A Alarcon, N MacCrann, J Prat, C Sánchez, E Sheldon, R H Wechsler, J Asorey, M R Becker, C Bonnett, A Carnero Rosell, D Carollo, M Carrasco Kind, F J Castander, R Cawthon, C Chang, M Childress, T M Davis, A Drlica-Wagner, M Gatti, K Glazebrook, J Gschwend, S R Hinton, J K Hoormann, A G Kim, A King, K Kuehn, G Lewis, C Lidman, H Lin, E Macaulay, M A G Maia, P Martini, D Mudd, A Möller, R C Nichol, R L C Ogando, R P Rollins, A Roodman, A J Ross, E Rozo, E S Rykoff, S Samuroff, I Sevilla-Noarbe, R Sharp, N E Sommer, B E Tucker, S A Uddin, T N Varga, P Vielzeuf, F Yuan, B Zhang, T M C Abbott, F B Abdalla, S Allam, J Annis, K Bechtol, A Benoit-Lévy, E Bertin, D Brooks, E Buckley-Geer, D L Burke, M T Busha, D Capozzi, J Carretero, M Crocce, C B D’Andrea, L N da Costa, D L DePoy, S Desai, H T Diehl, P Doel, T F Eifler, J Estrada, A E Evrard, E Fernandez, B Flaugher, P Fosalba, J Frieman, J García-Bellido, D W Gerdes, T Giannantonio, D A Goldstein, R A Gruendl, G Gutierrez, K Honscheid, D J James, M Jarvis, T Jeltema, M W G Johnson, M D Johnson, D Kirk, E Krause, S Kuhlmann, N Kuropatkin, O Lahav, T S Li, M Lima, M March, J L Marshall, P Melchior, F Menanteau, R Miquel, B Nord, C R O’Neill, A A Plazas, A K Romer, M Sako, E Sanchez, B Santiago, V Scarpine, R Schindler, M Schubnell, M Smith, R C Smith, M Soares-Santos, F Sobreira, E Suchyta, M E C Swanson, G Tarle, D Thomas, D L Tucker, V Vikram, A R Walker, J Weller, W Wester, R C Wolf, B Yanny, J Zuntz, DES Collaboration, Dark Energy Survey Year 1 Results: redshift distributions of the weak-lensing source galaxies, Monthly Notices of the Royal Astronomical Society, Volume 478, Issue 1, July 2018, Pages 592–610, https://doi.org/10.1093/mnras/sty957
- Share Icon Share
ABSTRACT
We describe the derivation and validation of redshift distribution estimates and their uncertainties for the populations of galaxies used as weak-lensing sources in the Dark Energy Survey (DES) Year 1 cosmological analyses. The Bayesian Photometric Redshift (bpz) code is used to assign galaxies to four redshift bins between z ≈ 0.2 and ≈1.3, and to produce initial estimates of the lensing-weighted redshift distributions |$n^i_{\rm PZ}(z)\propto \mathrm{d}n^i/\mathrm{d}z$| for members of bin i. Accurate determination of cosmological parameters depends critically on knowledge of ni, but is insensitive to bin assignments or redshift errors for individual galaxies. The cosmological analyses allow for shifts |$n^i(z)=n^i_{\rm PZ}(z-\Delta z^i)$| to correct the mean redshift of ni(z) for biases in |$n^i_{\rm PZ}$|. The Δzi are constrained by comparison of independently estimated 30-band photometric redshifts of galaxies in the Cosmic Evolution Survey (COSMOS) field to bpz estimates made from the DES griz fluxes, for a sample matched in fluxes, pre-seeing size, and lensing weight to the DES weak-lensing sources. In companion papers, the Δzi of the three lowest redshift bins are further constrained by the angular clustering of the source galaxies around red galaxies with secure photometric redshifts at 0.15 < z < 0.9. This paper details the bpz and COSMOS procedures, and demonstrates that the cosmological inference is insensitive to details of the ni(z) beyond the choice of Δzi. The clustering and COSMOS validation methods produce consistent estimates of Δzi in the bins where both can be applied, with combined uncertainties of |$\sigma_{\Delta z^i}=0.015, 0.013, 0.011,$| and 0.022 in the four bins. Repeating the photo-z procedure instead using the Directional Neighbourhood Fitting algorithm, or using the ni(z) estimated from the matched sample in COSMOS, yields no discernible difference in cosmological inferences.
1 INTRODUCTION
The Dark Energy Survey (DES) Year 1 (Y1) data places strong constraints on cosmological parameters (DES Collaboration et al. 2017) by comparing theoretical models to measurements of (1) the autocorrelation of the positions of luminous red galaxies at 0.15 < z < 0.9 (Elvin-Poole et al. 2017) selected by the redMaGiC algorithm (Rozo et al. 2016); (2) the cross-correlations among weak-lensing (WL) shear fields (Troxel et al. 2017) inferred from the measured shapes of ‘source’ galaxies divided into four redshift bins (Zuntz et al. 2017); and (3) the cross-correlations of source galaxy shapes around the redMaGiC (‘lens’) galaxy positions (Prat et al. 2017). There are 650 000 galaxies in the redMaGiC catalogue covering the 1321 deg2 DES Y1 analysis area, and 26 million sources in the primary WL catalogue. For both the lens and the source populations, we rely on DES photometry in the griz bands1 to assign galaxies to a redshift bin i. Then, we must determine the normalized distribution ni(z) of galaxies in each bin. This paper describes how the binning and ni(z) determination are done for the source galaxies. These redshift distributions are fundamental to the theoretical predictions of the observable lensing signals. Uncertainties in the ni(z) must be propagated into the cosmological inferences, and should be small enough that induced uncertainties are subdominant to other experimental uncertainties. The bin assignments of the source galaxies can induce selection biases on the shear measurement, so we further discuss in this paper how this selection bias is estimated for our primary shear measurement pipeline. The assignment of redshifts to the lens galaxies, and validation of the resultant lens ni(z)’s, are described elsewhere (Rozo et al. 2016; Elvin-Poole et al. 2017; Cawthon et al. in preparation).
A multitude of techniques have been developed for estimation of redshifts from broad-band fluxes (e.g. Arnouts et al. 1999; Benítez 2000; Bender et al. 2001; Collister & Lahav 2004; Feldmann et al. 2006; Ilbert et al. 2006; Hildebrandt et al. 2010; Carrasco Kind & Brunner 2013; Sánchez et al. 2014; Rau et al. 2015; Hoyle 2016; Sadeh, Abdalla & Lahav 2016; De Vicente, Sanchez & Sevilla-Noarbe 2016). These vary in their statistical methodologies and in their relative reliance on physically motivated assumptions versus empirical ‘training’ data. The DES Y1 analyses begin with a photometric redshift algorithm that produces both a point estimate – used for bin assignment – and an estimate pPZ(z) of the posterior probability of the redshift of a galaxy given its fluxes – used for construction of the bins’ ni(z).
The key challenge to use of photo-z’s in cosmological inference is the validation of the ni(z), i.e. the assignment of meaningful error distributions to them. The most straightforward method, ‘direct’ spectroscopic validation, is to obtain reliable spectroscopic redshifts for a representative subsample of the sources in each bin. Most previous efforts at constraining redshift distributions for cosmic shear analyses used spectroscopic redshifts either as the primary validation method, or to derive the redshift distribution itself (Benjamin et al. 2013; Jee et al. 2013; Schmidt & Thorman 2013; Bonnett et al. 2016; Hildebrandt et al. 2017). Direct spectroscopic validation cannot, however, currently reach the desired accuracy for deep and wide surveys like the Y1 DES, because the completeness of existing spectroscopic surveys is low at the faint end of the DES source-galaxy distribution (Bonnett et al. 2016; Gruen & Brimioulle 2017), and strongly dependent on quantities not observed by DES (Hartley et al., in preparation). In detail, the larger area of the DES Y1 analysis compared to other WL surveys, including the DES Science Verification (SV) analysis (Bonnett et al. 2016), reduces the statistical uncertainties such that the systematic uncertainties from performing a direct calibration using spectra become dominant.
The validation for DES Y1 source galaxies therefore uses high-precision redshift estimates from 30-band photometry of the COSMOS survey field (Laigle et al. 2016), which are essentially complete over the colour–magnitude space of the Y1 source catalogue, in a more sophisticated version of the approach used in Bonnett et al. (2016). This direct approach is then combined with constraints on ni(z) derived from cross-correlation of the source galaxy positions with the redMaGiC galaxy positions as an independent method of photometric redshift validation (see e.g. Newman 2008 for an introduction to the method and Cawthon et al. in preparation; Davis et al. 2017,; Gatti et al. 2017 for the application to DES Y1). The cross-correlation redshift technique will be referred to as ‘WZ,’ and the validation based on the 30-band COSMOS photometric redshifts will be referred to as ‘COSMOS,’ and the estimates returned from photo-z algorithms run on the DES griz photometry will be marked as ‘PZ.’ Indeed, we suggest reading this paper in conjunction with those of Gatti et al. (2017) and Davis et al. (2017), which are dedicated to documenting the WZ procedure in greater detail. We also summarize the salient parts of these papers throughout this manuscript and discuss the issue of the failure of the redMaGiC sample to span the full redshift range of the Y1 lensing sources, which leaves gaps in our knowledge of ni derived from WZ.
This paper begins in Section 2 with a description of the input catalogues, real and simulated, for the source redshift inferences and validation. Section 3 describes the photometric redshift algorithms applied to the DES broad-band fluxes. We describe the direct COSMOS validation method in Section 4. The derivation of WZ constraints from angular clustering is the subject of Cawthon et al. (in preparation), Davis et al. (2017), and Gatti et al. (2017). In Section 5, we combine these WZ constraints on Δzi with those from COSMOS to yield the final constraints. We describe the use of these redshift constraints as priors for the DES Y1 cosmological inference, including an examination of the impact of the assumption in equation (1) and other known shortcomings in our process, in Section 6 and conclude in Section 7.
Aspects of the ni(z) estimation and validation procedure not immediately required for Y1 lensing analyses will be described in Hoyle et al. (in preparation) and Rau et al. (in preparation).
2 INPUT CATALOGUES
Estimation and validation of the binning and ni(z) functions for the Y1 source galaxies require input photometry for these galaxies of course, but also Dark Energy Camera (DECam, Flaugher et al. 2015) data (Sánchez et al. 2014) and external data on the COSMOS field used for validation. Finally, our validation uses simulations of the COSMOS catalogue to estimate sample-variance uncertainties induced by the small sky area of this field. Fluxes and photo-z’s must be estimated for these simulated galaxies.
2.1 Lensing sources
The set of galaxies for which bin assignments and ni(z) estimates are desired are the WL sources defined in the Y1 shear catalogues documented in Zuntz et al. (2017). The primary shear catalogue for DES Y1 is produced by the metacalibration algorithm (Huff & Mandelbaum 2017; Sheldon & Huff 2017), and a secondary catalogue using im3shape (Zuntz et al. 2013) is used as a cross-check. For both shear catalogues, we use a common photo-z catalogue based on our best measurements of fluxes (the ‘MOF’ catalogue described below) to estimate the ni(z) of each bin (see Section 3.3 for details). These ni(z) differ, however, because metacalibration and im3shape implement distinct selection criteria and bin assignments.
The starting point for either shear catalogue is the Y1 Gold catalogue of sources reliably detected on the sum of the r-, i-, and z-band DES images (Drlica-Wagner et al. 2017). Detection and initial photometry are conducted by the SExtractor software (Bertin & Arnouts 1996). Photometric zero-points are assigned to each DES exposure using nightly solutions for zero-points and extinction coefficients derived from standard-star exposures. Exposures from non-photometric nights are adjusted to match those taken in photometric conditions.
As detailed in Drlica-Wagner et al. (2017), the photometric calibration is brought to greater colour uniformity and adjusted for Galactic extinction by stellar locus regression (SLR, Ivezić et al. 2004; MacDonald et al. 2004; High et al. 2009): the i-band fluxes are adjusted according to the Galactic extinction implied by the Schlegel, Finkbeiner & Davis (1998) dust map with the O’Donnell (1994) extinction law. Then, the zero-points of other bands are adjusted to force the stellar colour–colour loci to a common template.
Fluxes used as input to the photo-z programs for both shear catalogues are derived using ngmix2 (Sheldon 2014; Jarvis et al. 2016), which fits a model to the pixel values of each galaxy in the Gold catalogue. The ngmix code fits a highly constrained exponential+deVaucouleurs model to each galaxy: the model is convolved with each exposure’s point spread function (PSF) and compared to pixels from all individual exposures covering the source. The fitting is multi-epoch and multiband: pixels of all exposures in all bands are fit simultaneously, assuming common galaxy shape for all bands and a single free flux per band. The fitting is also multi-object: groups of overlapping galaxy images are fit in iterative fashion, with each fit to a given galaxy subtracting the current estimate of flux from neighbours. These ‘multi-object fitting’ (MOF) fluxes are used as input to photo-z estimators for im3shape and metacalibration catalogue member galaxies (although we use a different flux measurement for bin assignment in the case of metacalibration, see below).
The photo-z assigned to a galaxy depends on its measured multiband fluxes, which will vary if there is shear applied to the galaxy. So the photo-z bin to which a galaxy is assigned might depend on how much it is sheared, leading to a potential selection bias. For im3shape, we have confirmed, using realistic image simulations, that these selection biases are small (at or below the one per cent level), and have added a term in the systematic uncertainty of the shear calibration to account for them (cf. section 7.6.2 of Zuntz et al. 2017, called variation of morphology there). metacalibration, on the contrary, can estimate and correct selection biases on the WL shear inference by producing and remeasuring four artificially sheared renditions of each target galaxy (by γ1 = ±0.01 and γ2 = ±0.01, where γ1, 2 are the two components of the shear). The selection bias correction in metacalibration requires knowing whether each source would have been selected and placed in the same bin if it had been sheared. It is thus necessary for us to run the photo-z estimation software not only on the original fluxes, but also on fluxes measured for each of the four artificially sheared renditions of each galaxy. The latter are not available from the MOF pipeline.
For the metacalibration catalogue, we therefore produce an additional set of photo-z estimates based on a different flux measurement made with the metacalibration pipeline. This measurement makes use of a simplified version of the ngmix procedure described above: the model fit to the galaxies is a PSF-convolved Gaussian, rather than a sum of exponential and deVaucouleurs components. These ‘Metacal’ fluxes do not subtract neighbours’ flux. In addition to fluxes, metacalibration also measures pre-seeing galaxy sizes and galaxy shapes (Zuntz et al. 2017).
There are thus six distinct photo-z’s for the WL source galaxies: one produced using the MOF fluxes for galaxies in either of the im3shape or metacalibration shape catalogues; one produced using Metacal fluxes of the as-observed sources in the metacalibration shape catalogue; and four produced using Metacal fluxes of the four artificially sheared renditions of the sources in the metacalibration catalogue.
2.2 COSMOS catalogue and DES griz
2.2.1 DES fluxes
Our COSMOS validation procedure depends on having griz photometry and external redshift estimates for objects in the COSMOS field. This field was observed by DES and community programs using DECam. These observations were combined, catalogued, and measured using the same DES pipelines as the survey data; we use the Y1A1 D04 catalogue produced as part of the Gold catalogues (Drlica-Wagner et al. 2017). MOF magnitudes and Metacal sizes are also measured for all entries in this catalogue. The COSMOS-field observations used herein are ≈1 mag deeper than the typical Y1 DES data. This mismatch must be kept in mind when using this field for validation (Rau et al. in preparation).
Zero-points for the COSMOS images are determined using the same SLR methods used for the Y1 catalogue. The SLR process is subject to errors that perturb the calibration. We note that the SLR adjustment to the zero-points is below 0.03 mag for most of our data (see fig. A.7 of Drlica-Wagner et al. 2017). The adjustment for Galactic extinction, which is applied as part of the SLR procedure, is of order 0.05 mag in most regions of the survey. In the Y1 data, because it covers a large area with uncorrelated SLR calibrations, there are many independent realizations of these errors and they will average away in the mean ni(z). We must keep in mind, however, that the COSMOS data are based on a single realization of SLR errors, and must therefore allow for the consequent offset of COSMOS photometry from the Y1 mean (Section 4.3).
2.2.2 Redshift data and cross-matching
The COSMOS2015 catalogue from Laigle et al. (2016) provides photometry in 30 different ultraviolet/visible/infraed bands, and probability distribution functions (PDFs) pC30(z) for the redshift of each galaxy based on this photometry using the lephare template-fitting code (Arnouts et al. 1999; Ilbert et al. 2006). Typical, pC30(z) widths for DES source galaxies are ≈0.01(1 + z), far better than the uncertainties in bpz estimates based on DES griz photometry. In Section 6.1, we discuss the influence of errors in pC30(z)’s on our Δzi inferences.
The validation procedure requires assignment of a pC30(z) to each DES-detectable source in the COSMOS field. After limiting the catalogues to their region of overlap, we associate COSMOS2015 objects with DES Gold objects with |$1.5\,\mathrm{ arcmin}$| matching radius. Only 0.3 per cent of DES-detected sources fail to match a COSMOS2015 source, and most of these are very near-mask boundaries around bright stars or other peculiar locations. We conclude that ignoring these unmatched sources causes an insignificant bias in the inferred redshift distribution. Of the matched galaxies, 0.4 per cent have no pC30(z) provided in COSMOS2015, without explanation. For these, we synthesize a pC30(z) by averaging those of ≈10 nearest neighbours in the space of COSMOS2015 ZMINCHI2 and i-band magnitude, where ZMINCHI2 is the 30-band photometric redshift point prediction corresponding to the minimum χ2 fit between fluxes and templates.
We remove from the sample galaxies whose fluxes or pre-seeing sizes could not be measured by the DES pipelines. We note that such objects would be flagged in the lensing source catalogue and removed. A total of 128 563 galaxies with good DES Gold MOF photometry remain in our final COSMOS sample.
We also use spectroscopic subsamples of this complete sample of galaxies with COSMOS2015 results later to validate our calibration (cf. Section 6.1).
2.2.3 PDF rescaling
Following a technique similar to Bordoloi, Lilly & Amara (2010), we rescale the estimated pC30(z)’s to make them more accurately represent true distribution functions of redshift.
The method relies on using the subset of COSMOS2015 galaxies with spectroscopic redshifts from the literature (Lilly et al. 2007, 2009a). While this subset is not representative of the full photometric sample (Bonnett et al. 2016; Gruen & Brimioulle 2017), an excess of outliers in true, spectroscopic redshift relative to pC30(z) is still an indication that the rate of ‘catastrophic failures’ in COSMOS2015 photo-z determinations is higher than that estimated by Laigle et al. (2016). The procedure described here is not a panacea but will lessen such discrepancies.
For each galaxy in COSMOS2015 having a spectroscopic redshift and matching a DES detection, pC30(z) is integrated to a cumulative distribution function (CDF) 0 < c(z) < 1. The value c(zspec) for a distribution of objects is the Probability Integral Transform (PIT, Dawid 1984; Angus 1994). If pC30(z) is a true, statistically rigorous PDF of the spectroscopic redshifts, the PIT values should be uniformly distributed between 0 and 1. In Fig. 1, we show in blue the distribution of PIT values for the original pC30(z)’s. The peaks at 0 and at 1 indicate that the widths of the pC30(z) are underestimated and need to be broadened, and the asymmetry means that a small global offset should be applied to them.

The effect of rescaling the COSMOS2015 photometric redshift PDFs using the PIT distribution. The PIT is the redshift CDF values of the full sample of DES-detected sources evaluated at the spectroscopic redshift, for those sources with known zspec. The original PDFs (blue) depart significantly from the expected uniform distribution (red dashed line). The pC30(z) rescaling procedure yields the orange histogram, much improved, as confirmed by the value of the KL divergence between the histogram and a uniform distribution.
Going into further detail, we determine the best-fitting remapping parameters independently for six subsets in bins of i-band MAG_AUTO (Bertin & Arnouts 1996) magnitude bounded by [16.1, 20.72, 21.48, 21.98, 22.40, 23.03, 99]. The bins are chosen so that they are each populated by approximately 4000 spectra. Remapping in bins of magnitude is seen to yield lower KL values than remapping in redshift bins, or with no binning. We find that the KL values of the training data and the validation data are very similar, indicating that we are not overfitting. The KL divergences in the first and the last bins decrease from 0.88 → 0.14 and 0.52 → 0.13, respectively, with even greater improvement for the full sample as noted in Fig. 1. The only parameter relevant for the mean redshift calibration performed in Section 4 is the shift in the mean of the p(z), μ. The sizes of these in each magnitude bin are all |μ| ≤ 0.001, much smaller than the uncertainty of our ensemble mean redshifts.
2.3 Simulated sky catalogues
We also draw upon simulated data sets generated specifically for the DES collaboration. Specifically, we make use of the Buzzard-v1.1 simulation, a mock DES Y1 survey created from a set of dark-matter-only simulations. This simulation and the galaxy catalogue construction are described in detail elsewhere (DeRose et al. in preparation; Wechsler et al. in preparation; MacCrann et al. in preparation), so here we provide only a brief overview. Buzzard-v1.1 is constructed from a set of three N-body simulations run using l-gadget2, a version of gadget2 modified for memory efficiency, with box lengths ranging from 1 to 4 h−1Gpc from which light-cones were constructed on the fly.
Galaxies are added to the simulations using the Adding Density Dependent GAlaxies to Light-cone Simulations algorithm (Wechsler et al. in preparation). Spectral energy distributions (SEDs) are assigned to the galaxies from a training set of spectroscopic data from Sloan Digital Sky Survey Data Release 7 (SDSS DR7) (Cooper et al. 2011) based on local environmental density. These SEDs are integrated in the DES pass bands to generate griz magnitudes. Galaxy sizes and ellipticities are drawn from distributions fit to SuprimeCam |$i^{^{\prime}}$|-band data (Miyazaki et al. 2002). The galaxy positions, shapes, and magnitudes are then lensed using the multiple-plane ray-tracing code, Curved-sky grAvitational Lensing for Cosmological Light conE simulatioNS (Becker 2013). The simulation is cut to the DES Y1 footprint, and photometric errors are applied to the lensed magnitudes by copying the noise map of the FLUX_AUTO measurements in the real catalogue. More explicitly, the error on the observed flux is determined only by the limiting magnitude at the position of the galaxy, the exposure time, and the noise-free apparent magnitude of the galaxy itself.
2.3.1 Science sample selection in simulations
The source–galaxy samples in simulations are selected so as to roughly mimic the selections and the redshift distributions of the metacalibration shear catalogue described in Zuntz et al. (2017). This is done by first applying flux and size cuts to the simulated galaxies so as to mimic the thresholds used in the Y1 data by using the Y1 depth and PSF maps. The WL effective number density neff in the simulation is matched to a preliminary version of the shape catalogues, and is about 7 per cent higher than for the final, unblinded metacalibration catalogue. Truth values for redshift, flux, and shear are of course available as well as the simulated measurements.
COSMOS-like catalogues are also generated from the Buzzard simulated galaxy catalogues by cutting out 367 non-overlapping COSMOS-shaped footprints from the simulation.
3 PHOTOMETRIC REDSHIFT ESTIMATION
In this section, we describe the process of obtaining photometric redshifts for DES galaxies. We note that we only use the g, r, i, and z DES bands in this process. We have found that the Y band adds little to no predictive power.
3.1 Bayesian photometric redshifts
Posterior probabilities pPZ(z) were calculated for each source galaxy using bpz*, which is a variant of the Bayesian algorithm described by Benítez (2000), and has been modified to provide the photometric redshift point predictions and PDFs required by the DES collaboration directly from fits-format input fluxes, without intermediate steps. The bpz* code is a distilled version of the distributed bpz code, and in particular assumes the synthetic template files for each filter have already been generated. Henceforth, we will refer to these simply as ‘bpz’ results.
3.1.1 Per-galaxy posterior estimation
The redshift posterior is calculated by marginalizing over a set of interpolated model spectral templates, where the likelihood of a galaxy’s photometry belonging to a given template at a given redshift is computed via the χ2 between the observed photometry and those of the filter passbands integrated over the model template. The model templates are grouped into three classes, nominally to represent elliptical, spiral, and starburst galaxies. These classes, it is assumed, follow distinct redshift-evolving luminosity functions which can be used to create a magnitude-dependent prior on the redshift posterior of each object, a.k.a. the ‘luminosity prior’. The prior comprises two components, a spectral class prior which is dependent only on observed magnitude, and the redshift prior of each class – which is itself also magnitude dependent (see Benítez 2000 for more detail).
Six base template spectra for bpz are generated based on original models by Coleman, Wu & Weedman (1980) and Kinney et al. (1996). The SLR used for the DES Y1 data ensures uniformity of colour across the footprint, but there may be small differences in calibration with respect to the empirical templates we wish to use. Moreover, these original templates are derived from galaxies at redshift zero, while our source galaxies cover a wide range in redshift, with an appreciable tail as high as z ∼ 1.5. The colours of galaxies evolve significantly over this redshift range, even at fixed spectral type. Failure to account for this evolution can easily introduce biases in the redshift posteriors that subsequently require large model bias corrections (see Bonnett et al. 2016, for instance). To address these two issues, we compute evolution/calibration corrections to the template fluxes.
We match low-resolution spectroscopic redshifts from the PRIMUS DR1 data set (Coil et al. 2011; Cool et al. 2013) to high signal-to-noise (S/N) DES photometry and obtain the best fit of the six basic templates to each of the highest quality PRIsm MUlti-Object Survey (PRIMUS) objects (quality = 4) at their spectroscopic redshift. The flux of each template in each filter is then corrected as a function of redshift by the median offset between the DES photometry and the template prediction, in a sliding redshift window of width δz = 0.06. The calibration sample numbers 72 176 galaxies and reaches the full depth of our science sample (iDES < 23.5), while maintaining a low rate of mis-assigned redshifts.3 Although the incompleteness in PRIMUS is broadly independent of galaxy colour (Cool et al. 2013) and each template is calibrated separately, we nevertheless expect small residual inaccuracies in our calibration to remain. Our COSMOS and WZ validation strategies serve to calibrate such errors in bpz assignments.
A complete galaxy sample is required for deriving the luminosity prior we use with bpz. No spectroscopic samples are complete to the limit of our source galaxy sample, and so we turn to the accurate photometric redshift sample in the COSMOS field from Laigle et al. (2016), which is complete to the depth of our main survey area despite being selected in the K band. The prior takes the form of smooth exponential functions (see Benítez 2000), which we fit to the COSMOS galaxy population by determining galaxy types at their photometric redshift. Because bpz uses smooth functions rather than the population directly, the luminosity prior used for obtaining posterior redshift probabilities does not replicate the high-frequency line-of-sight structure in the COSMOS field.
bpz is run on the MOF fluxes (see Section 2) to determine pPZ(z) for metacalibration and im3shape, while for the five metacalibration catalogues – the real one and the four artificially sheared versions – bpz is run on the metacalibration fluxes to determine bin assignments (cf. Section 3.3 for details). The luminosity prior is constructed from MOF i-band fluxes for both catalogues. For the Buzzard simulated galaxy catalogues, bpz is run on the single mock flux measurement produced in the simulation.
We also explored a further post-processing step as in Section 2.2.3, but applied to the DES bpz photo-z PDFs. We used the spectroscopic training data, which is not used in bpz, to recalibrate the PDFs in bins of i-band magnitude. We find that this rescaling did not noticeably change the mean or widths of the PDFs on average, and that the statistical properties of the redshift distributions in each tomographic bin also remain unchanged.
3.1.2 Known errors
During bpz processing of the Y1 data, three configuration and software errors were made.
First, the metacalibration catalogues were processed using MOF i-band magnitudes for evaluating the bpz prior rather than Metacal fluxes. This is internally consistent for bpz, but the use of flux measurements that do not exist for artificially sheared galaxies means that the metacalibration shear estimates are not properly corrected for selection biases resulting from redshift bin assignment. We note that small perturbations to the flux used for assigning the luminosity prior have very little impact on the resulting mean redshift and the colours used by bpz in this run are correctly measured by metacalibration on unsheared and sheared galaxy images. Rerunning bpz with the correct, Metacal inputs for i-band magnitude on a subset of galaxies indicates that the induced multiplicative shear bias is below 0.002 in all redshift bins, well below both the level of statistical errors in DES Y1 and our uncertainty in shear bias calibration. We therefore decide to tolerate the resulting systematic uncertainty.
Second, the SLR adjustments to photometric zero-points were not applied to the observed Metacal fluxes in the Y1 catalogues before input to bpz. The principal result of this error is that the observed magnitudes are no longer corrected for Galactic extinction. This results in a shift in the average ni(z) of the source population of each bin, and a spatially coherent modulation of the bin occupations and redshift distributions across the survey footprint. In Section 4, we describe a process whereby the mean ni(z) can be accurately estimated by mimicking the SLR errors on the COSMOS field. In Appendix B, we show that the spurious spatial variation of the redshift distributions causes negligible errors in our estimation of the shear two-point functions used for cosmological inference, and zero error in the galaxy–galaxy lensingestimates.
Finally, when rewriting bpz for a faster version, bpz*, two bugs were introduced in the prior implementation, one causing a bias for bright galaxies (i-band magnitude <18.5) and another which forced uniform prior abundance for the three galaxy templates. These bugs were discovered too late in the DES Y1 analysis to fix. They cause differences in Δz that are subdominant to our calibration uncertainties (below 0.006 among all individual bins). In addition, they are fully calibrated by both COSMOS (which uses the same implementation) and WZ, and hence do not affect our cosmological analysis. We have since implemented all of the above bug fixes, and applied the SLR adjustments correctly, and find negligible changes in the shape and mean of the bpz PDFs, which are fully within the combined systematic uncertainties.
3.1.3 Per-galaxy photo-z precision
While ni(z) are the critical inputs to cosmological inference, it is sometimes of use to know the typical size of the redshift uncertainty for individual galaxies. We define σ68 for each pPZ(z) as the half-width of the 68 percentile region around the median. We select 200 000 galaxies from the metacalibration catalogue at random, and determine the average σ68 in bins of redshift according to the median of pPZ(z). We find that this mean σ68(z) is well fit by a quadratic polynomial in mean bpz redshift and present the best-fitting parameters in Fig. 2.

The average width of the posterior distributions of bpz photometric redshifts for data selected in bins of mean bpz redshift. The posterior width is defined as the 68 per cent spread of the PDF pPZ(z) about its median. The error bars correspond to the standard deviation of the individual source’s σ68 around the average.
Further metrics of the performance of individual galaxies’ photo-z’s but with respect to truth redshifts are provided in Section 4.7.
3.2 Directional Neighbourhood Fitting
Directional Neighbourhood Fitting (DNF, De Vicente et al. 2016) is a machine-learning algorithm for galaxy photometric redshift estimation. We have applied it to reconstruct the redshift distributions for the metacalibration catalogues. DNF takes as reference a training sample whose spectroscopic redshifts are known. Based on the training sample, DNF constructs the prediction hyperplane that best fits the neighbourhood of each target galaxy in multiband flux space. It then uses this hyperplane to predict the redshift of the target galaxy. The key feature of DNF is the definition of a new neighbourhood, the Directional Neighbourhood. Under this definition – and leaving apart degeneracies corresponding to different galaxy types – two galaxies are neighbours not only when they are close in the Euclidean multiband flux space, but also when they have similar relative flux in different bands, i.e. colours. In this way, the neighbourhood does not extend in multiband flux hyperspheres but in elongated hypervolumes that better represent similar colour, and presumably similar redshift. As described in Section 3.3, these DNF photo-z predictions are used to classify the galaxies in tomographic redshift bins.
A random sample from the pPZ(z) of an object is approximated in the DNF method by the redshift of the nearest neighbour within the training sample. It is used as the sample for ni(z) reconstruction and interpreted in Section 3.3 as a random draw from the underlying per-galaxy posterior.
The training sample used for Y1 DNF prediction was collected by the DES Science Portal team (Gschwend et al. 2017) from different spectroscopic surveys and includes the VIMOS Public Extragalactic Redshift Survey (VIPERS) second data release (Scodeggio et al. 2016). The validation of the predictions was based on COSMOS2015 photo-z’s. Objects near the COSMOS data were removed from the training sample. Since the machine learning algorithm can correct for imperfections in the input photometry giving a representative training set, both training and photo-z predictions are based on Metacal photometry without SLR adjustments, for all runs on DNF.
The fiducial DES Y1 cosmological parameter estimation uses the bpz photo-z’s, and DES Collaboration et al. (2017) demonstrate that these estimates are robust to substitution of DNF for bpz.
The ni(z) distributions of bpz and DNF are not expected to be identical, because the algorithms may make different bin assignments for the same source. We therefore do not offer a direct comparison. We do, however, repeat for DNF all of the validation processes described herein for the bpzni(z) estimates. The results for DNF are given in Appendix C.
3.3 Binning and initial ni(z) estimation
Both photo-z codes yield six different posterior distributions pPZ(zj) for each galaxy j in the Y1 shape catalogues, conditional on either the MOF, the unsheared Metacal, or the four sheared Metacal flux measurements. In this section, we describe how these are used to define source redshift bins i and provide an initial estimate of the lensing-weighted ni(z) of each of these bins. Table 1 gives an overview of these steps.
Binning, ni(z) estimation, and mean z calibration for the variants of the shear and photo-z catalogues.
Shear catalogue . | Step . | bpz . | DNF . |
---|---|---|---|
Binned by: | Metacal griz ⟨zj⟩ | Metacal griz ⟨zj⟩ | |
metacalibration | ni(z) by stacking: | MOF griz|$z_\mathrm{ j}^{\rm PZ}$| | Metacal griz|$z_\mathrm{ j}^{\rm PZ}$| |
Calibration by: | COSMOS + WZ | COSMOS + WZ | |
Binned by: | MOF griz ⟨zj⟩ | – | |
im3shape | ni(z) by stacking: | MOF griz|$z_\mathrm{ j}^{\rm PZ}$| | – |
Calibration by: | COSMOS + WZ | – |
Shear catalogue . | Step . | bpz . | DNF . |
---|---|---|---|
Binned by: | Metacal griz ⟨zj⟩ | Metacal griz ⟨zj⟩ | |
metacalibration | ni(z) by stacking: | MOF griz|$z_\mathrm{ j}^{\rm PZ}$| | Metacal griz|$z_\mathrm{ j}^{\rm PZ}$| |
Calibration by: | COSMOS + WZ | COSMOS + WZ | |
Binned by: | MOF griz ⟨zj⟩ | – | |
im3shape | ni(z) by stacking: | MOF griz|$z_\mathrm{ j}^{\rm PZ}$| | – |
Calibration by: | COSMOS + WZ | – |
Binning, ni(z) estimation, and mean z calibration for the variants of the shear and photo-z catalogues.
Shear catalogue . | Step . | bpz . | DNF . |
---|---|---|---|
Binned by: | Metacal griz ⟨zj⟩ | Metacal griz ⟨zj⟩ | |
metacalibration | ni(z) by stacking: | MOF griz|$z_\mathrm{ j}^{\rm PZ}$| | Metacal griz|$z_\mathrm{ j}^{\rm PZ}$| |
Calibration by: | COSMOS + WZ | COSMOS + WZ | |
Binned by: | MOF griz ⟨zj⟩ | – | |
im3shape | ni(z) by stacking: | MOF griz|$z_\mathrm{ j}^{\rm PZ}$| | – |
Calibration by: | COSMOS + WZ | – |
Shear catalogue . | Step . | bpz . | DNF . |
---|---|---|---|
Binned by: | Metacal griz ⟨zj⟩ | Metacal griz ⟨zj⟩ | |
metacalibration | ni(z) by stacking: | MOF griz|$z_\mathrm{ j}^{\rm PZ}$| | Metacal griz|$z_\mathrm{ j}^{\rm PZ}$| |
Calibration by: | COSMOS + WZ | COSMOS + WZ | |
Binned by: | MOF griz ⟨zj⟩ | – | |
im3shape | ni(z) by stacking: | MOF griz|$z_\mathrm{ j}^{\rm PZ}$| | – |
Calibration by: | COSMOS + WZ | – |
Galaxies are assigned to bins based on the expectation value of their posterior, ⟨zj⟩ = ∫zj pPZ(zj) dzj. We use four bins between the limits [0.20,0.43,0.63,0.90,1.30]. These tomographic boundaries exclude ⟨zj⟩ < 0.2 and ⟨zj⟩ > 1.3 that have large photo-z biases. We place three tomographic bins at ⟨zj⟩ < 0.9 with approximately equal effective source density neff, a proxy for the statistical uncertainty of shear signals in the metacalibration catalogue, since z = 0.9 is the upper limit of the WZ constraints. The fourth bin, 0.9 < ⟨zj⟩ < 1.3, is thus validated only by the COSMOS method.
For metacalibration sources, this bin assignment is made based on the ⟨zj⟩ of the photo-z run on Metacal photometry, instead of MOF photometry. The reason for this is that flux measurements, and therefore photo-z bin assignments, can depend on the shear a galaxy is subject to. This can cause selection biases in shear due to photo-z binning, which can be corrected in metacalibration. The latter requires that the bin assignment can be repeated using a photo-z estimate made from measurements made on artificially sheared images of the respective galaxy (cf. Huff & Mandelbaum 2017; Sheldon & Huff 2017; Zuntz et al. 2017), and only the Metacal measurement provides that.
For im3shape sources, the bin assignment is made based on the ⟨zj⟩ of the photo-z run on MOF photometry, which has higher S/N and lower susceptibility to blending effects than Metacal photometry. This provides more precise (and possibly more accurate) photo-z estimates.
We note that this means that for each combination of shear and photo-z pipeline, bin assignments and effective weights of galaxies are different. The redshift distributions and calibrations derived below can therefore not be directly compared between the different variants.
The stacked redshift distribution ni(z) of each of the tomographic bins is estimated by the lensing-weighted stack of random samples |$z_\mathrm{ j}^{\rm PZ}$| from the pPZ(zj) of each of all galaxies j in bin i. Given the millions of galaxies in each bin, the noise due to using only one random sample from each galaxy is negligible. For both the metacalibration and im3shape catalogues, we use random samples from the pPZ(z) estimated by bpz run on MOF photometry to construct the ni(z), this being the lower noise and more reliable flux estimate. In the case of DNF, we use the Metacal photometry run for both the binning and initial ni(z) estimation.
By the term lensing weighted above, we mean the effective weight |$w^{\rm eff}_\mathrm{ j}$| a source j has in the lensing signals we measure in Troxel et al. (2017) and Prat et al. (2017). In the case of metacalibration, sources are not explicitly weighted in these papers. Since the ellipticities of galaxies in metacalibration have different responses to shear (Huff & Mandelbaum 2017; Sheldon & Huff 2017), and since we measure correlation functions of metacalibration ellipticities that we then correct for the mean response of the ensemble, however, sources do have an effective weight that is proportional to their response. As can be derived by considering a mixture of subsamples at different redshifts and with different mean response, the correct redshift distribution to use is therefore one weighted by |$w_\mathrm{ j}^{\rm eff}\propto (R_{\gamma_{1,1},\mathrm{ j}}+R_{\gamma_{2,2},\mathrm{ j}}),$| where the R’s are shear responses defined in Zuntz et al. (2017). In the case of im3shape, explicit weights |$w$|j are used in the measurements, and sources have a response to shear (1 + mj) with the calibrated multiplicative shear bias mj (Zuntz et al. 2017). The correct effective weights for im3shape are therefore |$w_\mathrm{ j}^{\rm eff}\propto (1+m_\mathrm{ j})\times w_\mathrm{ j}$|.
We note that for other uses of the shape catalogues, such as with the optimal ΔΣ estimator (Sheldon et al. 2004), the effective weights of sources could be different, which has to be accounted for in the photo-z calibration.
4 VALIDATING THE REDSHIFT DISTRIBUTION USING COSMOS MULTIBAND PHOTOMETRY
In Bonnett et al. (2016), we made use of COSMOS photometric redshifts as an independent estimate and validation of the redshift distribution of the WL source galaxies. We made cuts in magnitude, full width at half-maximum, and surface brightness to the source catalogue from DECam images in the COSMOS field that were depth-matched to the main survey area. These cuts approximated the selection function of the shape catalogues used for the cosmic shear analysis. Similar techniques that find COSMOS samples of galaxies matched to a lensing source catalogue by a combination of magnitude, colour, and morphological properties have been applied by numerous studies (Applegate et al. 2014; Hoekstra et al. 2015; Okabe & Smith 2016; Cibirka et al. 2017; Amon et al. 2018). In this work, we modify the approach to reduce statistical and systematic uncertainty on its estimate of mean redshift and carefully estimate the most significant sources of systematic error.
We wish to validate the ni(z) derived for a target sample A of galaxies using a sample B with known redshifts. Ideally, for every galaxy in A, we would find a galaxy in B that looks exactly like it when observed in the same conditions. The match would need to be made in all properties we use to select and weight the galaxy in the WL sample that also correlate with redshift.
Then, the mean redshift distribution of the matched B galaxies, weighted the same way as the A galaxies are for WL measures, will yield the desired ni(z). This goal is unattainable without major observational, image processing, and simulation efforts, but we can approximate it with a method related to the one of Lima et al. (2008) and estimate the remaining uncertainties. We also need to quantify uncertainties resulting from the finite size of sample B, and from possible errors in the ‘known’ redshifts of B. Here, our sample A are the galaxies in either the im3shape or metacalibration Y1 WL catalogues, spread over the footprint of DES Y1, and sample B is the COSMOS2015 catalogue of Laigle et al. (2016).
4.1 Methodology
We begin by selecting a random subsample of 200 000 galaxies from each WL source catalogue, spread over the whole Y1 footprint, and assigning to each a match in the COSMOS2015 catalogue. The match is made by griz MOF flux and pre-seeing size (not by position), and the matching algorithm proceeds as follows, for each galaxy in the WL source sample:
Gaussian noise is added to the DES griz MOF fluxes and sizes of the COSMOS galaxies until their noise level is equal to that of the target galaxy. COSMOS galaxies whose flux noise is above that of the target galaxy are considered ineligible for matching. While this removes 13 per cent of potential DES–COSMOS pairs, this is unlikely to induce redshift biases, because the noise level of the COSMOS griz catalogue is well below that of the Y1 survey in most regions of either, so it should be rare for the true COSMOS ‘twin’ of a Y1 source to have higher errors. The discarded pairs predominantly are cases of large COSMOS and small Y1 galaxies (since large size raises flux errors), and the size mismatch means these galaxies would never be good matches. Other discarded pairs come from COSMOS galaxies lying in a shallow region of the DECam COSMOS footprint, such as near a mask or a shallow part of the dither pattern, and this geometric effect will not induce a redshift bias. Note that the MOF fluxes used here make use of the SLR zero-points, for both COSMOS and Y1 catalogues. The size metric is the one produced by metacalibration.
- The matched COSMOS2015 galaxy is selected as the one that minimizes the flux-and-size χ2,where fb and s are the fluxes in band b and the size, respectively, and σb and σs are the measurement errors in these for the chosen source. We also find the galaxy that minimizes the |$\chi_{\rm flux}^2$| from flux differences only:(3)\begin{eqnarray*}\chi^2 \equiv \sum_{b\in griz} \left(\frac{f_\mathrm{ b}^{\rm Y1} - f_\mathrm{ b}^{\rm COSMOS}}{\sigma_\mathrm{ b}}\right)^2 + \left(\frac{s^{\rm Y1}-s^{\rm COSMOS}}{\sigma_\mathrm{ s}}\right)^2, \end{eqnarray*}If the least |$\chi_{\rm flux}^2$| is smaller than (χ2 − 4) of the galaxy with the least flux-and-size χ2, we use this former galaxy instead. Without this criterion, we could be using poor matches in flux (which is more predictive of redshift than size) by requiring a good size match (that does not affect redshift distributions much). It applies to about 15 per cent of cases.(4)\begin{eqnarray*}\chi_{\rm flux}^2 \equiv \sum_{b\in griz} \left(\frac{f_\mathrm{ b}^{\rm Y1} - f_\mathrm{ b}^{\rm COSMOS}}{\sigma_\mathrm{ b}}\right)^2\,. \end{eqnarray*}
A redshift ztrue is assigned by drawing from the pC30(z) of the matched COSMOS2015 galaxy, using the rescaling of Section 2.2.3.
- A bin assignment is made by running the bpz procedures of Section 3.1 on the noise-matched griz fluxes of the COSMOS match, using the mean value of each galaxy’s posterior pPZ(z), as before. For the im3shape catalogue, the MOF photometry of the COSMOS galaxy is used, just as is done for the Y1 main survey galaxies. The metacalibration treatment is more complex: we generate simulated Metacal fluxes |$f_\mathrm{ b}^{\rm meta, COSMOS}$| for the COSMOS galaxy viaThis has the effect of imposing on COSMOS magnitudes the same difference between Metacal and MOF as is present in Y1, thus imprinting on to COSMOS simulations any errors in the Y1 catalogue metacalibration magnitudes due to neglect of the SLR or other photometric errors. For the flux uncertainty of these matched fluxes, for both MOF and Metacal, we assign the flux errors of the respective Y1 galaxy.(5)\begin{eqnarray*}f_\mathrm{ b}^{\rm meta,COSMOS} = f_\mathrm{ b}^{\rm MOF,COSMOS} \frac{f_\mathrm{ b}^{\rm meta,Y1}}{f_\mathrm{ b}^{\rm MOF,Y1}}\,. \end{eqnarray*}
The effective WL weight |$w$| of the original source galaxy is assigned to its COSMOS match (cf. Section 3.3).
As a check on the matching process, we examined the distribution of the |$\chi_{\rm flux}^2$| between matched galaxies. The distribution is skewed towards significantly lower χ2 values than expected from a true χ2 distribution with 4 degrees of freedom. This indicates that the COSMOS-Y1 matches are good: COSMOS galaxies are photometrically even more similar to the Y1 target galaxies than they would be to re-observed versions of themselves.
A second check on the matching algorithm is to ask whether the individual COSMOS galaxies are being resampled at the expected rates. As expected, most sufficiently bright galaxies in COSMOS are used more than once, while the faintest galaxies are used more rarely or never. Fig. 3 shows the number of times each of the COSMOS galaxies is matched to metacalibration (if it is bright enough to be matched at all) in our fiducial matched catalogue. We see that there is no unwanted tendency for a small fraction of the COSMOS galaxies to bear most of the resampling weight. All COSMOS galaxies with more than 50 repetitions are brighter than i = 18.5 and have a typical redshift of z ≈ 0.15.

Repetitions of COSMOS galaxies in the fiducial matched metacalibration sample of 200 000 objects. The overall weight of galaxies with more than 20 matches, which are typically bright, is below one per cent of the total weight. The high-usage outliers are a few of the very bright COSMOS galaxies.
This construction properly averages Δz over the observing conditions (including photometric zero-point errors) and weights of the Y1 WL sources. These estimated Δz values using bpz are tabulated in Table 2 for both WL source catalogues.
Value . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . |
---|---|---|---|---|
zPZ range | 0.20–0.43 | 0.43–0.63 | 0.63–0.90 | 0.90–1.30 |
COSMOS footprint sampling | ±0.0073 | ±0.0077 | ±0.0039 | ±0.0070 |
COSMOS limited sample size | ±0.0009 | ±0.0017 | ±0.0018 | ±0.0030 |
COSMOS photometric calibration errors | ±0.0030 | ±0.0040 | ±0.0039 | ±0.0059 |
COSMOS hidden variables | ±0.0066 | ±0.0066 | ±0.0066 | ±0.0066 |
COSMOS errors in matching | ±0.0073 | ±0.0073 | ±0.0073 | ±0.0073 |
COSMOS single-bin Δzi uncertainty | ±0.013 | ±0.013 | ±0.011 | ±0.014 |
metacalibration | ||||
COSMOS final Δzi, tomographic uncertainty | −0.006 ± 0.020 | −0.014 ± 0.021 | +0.018 ± 0.018 | −0.018 ± 0.022 |
WZ final Δzi | +0.007 ± 0.026 | −0.023 ± 0.017 | +0.003 ± 0.014 | – |
Combined final Δzi | −0.001 ± 0.016 | −0.019 ± 0.013 | +0.009 ± 0.011 | −0.018 ± 0.022 |
im3shape | ||||
COSMOS final Δzi, tomographic uncertainty | +0.001 ± 0.020 | −0.014 ± 0.021 | +0.008 ± 0.018 | −0.057 ± 0.022 |
WZ final Δzi | +0.008 ± 0.026 | −0.031 ± 0.017 | −0.010 ± 0.014 | – |
Combined final Δzi | +0.004 ± 0.016 | −0.024 ± 0.013 | −0.003 ± 0.011 | −0.057 ± 0.022 |
Value . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . |
---|---|---|---|---|
zPZ range | 0.20–0.43 | 0.43–0.63 | 0.63–0.90 | 0.90–1.30 |
COSMOS footprint sampling | ±0.0073 | ±0.0077 | ±0.0039 | ±0.0070 |
COSMOS limited sample size | ±0.0009 | ±0.0017 | ±0.0018 | ±0.0030 |
COSMOS photometric calibration errors | ±0.0030 | ±0.0040 | ±0.0039 | ±0.0059 |
COSMOS hidden variables | ±0.0066 | ±0.0066 | ±0.0066 | ±0.0066 |
COSMOS errors in matching | ±0.0073 | ±0.0073 | ±0.0073 | ±0.0073 |
COSMOS single-bin Δzi uncertainty | ±0.013 | ±0.013 | ±0.011 | ±0.014 |
metacalibration | ||||
COSMOS final Δzi, tomographic uncertainty | −0.006 ± 0.020 | −0.014 ± 0.021 | +0.018 ± 0.018 | −0.018 ± 0.022 |
WZ final Δzi | +0.007 ± 0.026 | −0.023 ± 0.017 | +0.003 ± 0.014 | – |
Combined final Δzi | −0.001 ± 0.016 | −0.019 ± 0.013 | +0.009 ± 0.011 | −0.018 ± 0.022 |
im3shape | ||||
COSMOS final Δzi, tomographic uncertainty | +0.001 ± 0.020 | −0.014 ± 0.021 | +0.008 ± 0.018 | −0.057 ± 0.022 |
WZ final Δzi | +0.008 ± 0.026 | −0.031 ± 0.017 | −0.010 ± 0.014 | – |
Combined final Δzi | +0.004 ± 0.016 | −0.024 ± 0.013 | −0.003 ± 0.011 | −0.057 ± 0.022 |
Value . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . |
---|---|---|---|---|
zPZ range | 0.20–0.43 | 0.43–0.63 | 0.63–0.90 | 0.90–1.30 |
COSMOS footprint sampling | ±0.0073 | ±0.0077 | ±0.0039 | ±0.0070 |
COSMOS limited sample size | ±0.0009 | ±0.0017 | ±0.0018 | ±0.0030 |
COSMOS photometric calibration errors | ±0.0030 | ±0.0040 | ±0.0039 | ±0.0059 |
COSMOS hidden variables | ±0.0066 | ±0.0066 | ±0.0066 | ±0.0066 |
COSMOS errors in matching | ±0.0073 | ±0.0073 | ±0.0073 | ±0.0073 |
COSMOS single-bin Δzi uncertainty | ±0.013 | ±0.013 | ±0.011 | ±0.014 |
metacalibration | ||||
COSMOS final Δzi, tomographic uncertainty | −0.006 ± 0.020 | −0.014 ± 0.021 | +0.018 ± 0.018 | −0.018 ± 0.022 |
WZ final Δzi | +0.007 ± 0.026 | −0.023 ± 0.017 | +0.003 ± 0.014 | – |
Combined final Δzi | −0.001 ± 0.016 | −0.019 ± 0.013 | +0.009 ± 0.011 | −0.018 ± 0.022 |
im3shape | ||||
COSMOS final Δzi, tomographic uncertainty | +0.001 ± 0.020 | −0.014 ± 0.021 | +0.008 ± 0.018 | −0.057 ± 0.022 |
WZ final Δzi | +0.008 ± 0.026 | −0.031 ± 0.017 | −0.010 ± 0.014 | – |
Combined final Δzi | +0.004 ± 0.016 | −0.024 ± 0.013 | −0.003 ± 0.011 | −0.057 ± 0.022 |
Value . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . |
---|---|---|---|---|
zPZ range | 0.20–0.43 | 0.43–0.63 | 0.63–0.90 | 0.90–1.30 |
COSMOS footprint sampling | ±0.0073 | ±0.0077 | ±0.0039 | ±0.0070 |
COSMOS limited sample size | ±0.0009 | ±0.0017 | ±0.0018 | ±0.0030 |
COSMOS photometric calibration errors | ±0.0030 | ±0.0040 | ±0.0039 | ±0.0059 |
COSMOS hidden variables | ±0.0066 | ±0.0066 | ±0.0066 | ±0.0066 |
COSMOS errors in matching | ±0.0073 | ±0.0073 | ±0.0073 | ±0.0073 |
COSMOS single-bin Δzi uncertainty | ±0.013 | ±0.013 | ±0.011 | ±0.014 |
metacalibration | ||||
COSMOS final Δzi, tomographic uncertainty | −0.006 ± 0.020 | −0.014 ± 0.021 | +0.018 ± 0.018 | −0.018 ± 0.022 |
WZ final Δzi | +0.007 ± 0.026 | −0.023 ± 0.017 | +0.003 ± 0.014 | – |
Combined final Δzi | −0.001 ± 0.016 | −0.019 ± 0.013 | +0.009 ± 0.011 | −0.018 ± 0.022 |
im3shape | ||||
COSMOS final Δzi, tomographic uncertainty | +0.001 ± 0.020 | −0.014 ± 0.021 | +0.008 ± 0.018 | −0.057 ± 0.022 |
WZ final Δzi | +0.008 ± 0.026 | −0.031 ± 0.017 | −0.010 ± 0.014 | – |
Combined final Δzi | +0.004 ± 0.016 | −0.024 ± 0.013 | −0.003 ± 0.011 | −0.057 ± 0.022 |
The COSMOS validation also yields an estimate of ni(z) by a weighted average of the rescaled pC30(z)’s of the matches (or, equivalently, of samples drawn from them). Fig. 4 plots these resampled-COSMOS estimates along with the original |$n^i_{\rm PZ}(z)$| from bpz. Here, it is apparent that in some bins, these two estimates differ by more than just a simple shift in redshift – the shapes of the ni(z) distributions differ significantly. In Section 6.2, we demonstrate that these differences do not bias our cosmological inferences.

The redshift distributions ni(z) derived from three different methods are plotted for each of the four WL metacalibration source bin populations i = 1,…,4. The top (bottom) figure shows the first and third (second and fourth) tomographic redshift bins. The clustering methodology (WZ) can only constrain ni(z) for 0.15 < z < 0.9, and the normalization of the distribution is arbitrary for the bins extending beyond this range. The band around the COSMOS ni(z) depicts the uncertainties as described in Section 4 and Table 2, and the error bars on WZ are statistical noise. There is some significant disagreement in the shapes of the distributions, particularly in n2(z). We demonstrate in Section 6.2 that this does not bias the DES Y1 cosmological inferences.
In the following subsections, we determine several contributions to the uncertainty of these Δzi. All of these are presented for the metacalibration sample binned by bpz redshift estimates. For im3shape galaxies with bpz, we use the same uncertainties. Results for DNF are in Appendix C.
From the resampling procedure, we also determine common metrics on the photo-z performance in Section 4.7.
4.2 Sample variance contribution
The first contribution to the uncertainty in the COSMOS Δzi’s is from sample variance from the small angular size of the COSMOS2015 catalogue. Any attempt at analytic estimation of this uncertainty would be complicated by the reweighting/sampling procedure that alters the native n(z) of the COSMOS line of sight, so we instead estimate the covariance matrix of the Δzi by repeating our procedures on different realizations of the COSMOS field in the Buzzard simulated galaxy catalogues.
The resampling procedure of Section 4.1 is repeated using a fixed single draw of 200 000 galaxies from a Buzzard simulated Y1 WL sample (Section 2.3) as catalogue A, and 367 randomly placed COSMOS-shaped cutouts from the Buzzard truth catalogue, i.e. a catalogue with noiseless flux information, as catalogue B. Each of these yields an independent n(z) of the matched COSMOS catalogues (cf. Fig. 5), and consequently an independent sample variance realization of the Δzi. There are significant correlations between the Δzi bins, especially bins 1 and 2, as shown in Section 6. The diagonal elements are listed as ‘COSMOS footprint sampling’ in Table 2.

Redshift distributions of the full simulated lensing sample from the Buzzard catalogue (grey) and two examples of samples from COSMOS-sized footprints in the Buzzard catalogues that have been resampled and weighted to match the full distribution (blue and orange).

Correlation coefficients of error on Δzi due to sample variance in COSMOS-resamplings between our four source redshift bins. Shown is the correlation matrix for the metacalibration sample binned by bpz.
Since we use the same subset of the Buzzard lensing sample for each of the COSMOS-like resamplings, this variance estimate does not include the uncertainty due to the limited subsample size of 200 000 galaxies. We estimate the latter effect by resampling of the Δzi in this sample, and find it to be subdominant (|$\sigma_{\Delta z}^i <0.003$| in all redshift bins, ‘limited sample size’ in Table 2).
4.3 Photometric calibration uncertainty
The griz DECam photometry of the COSMOS field has uncertainties in its zero-point due to errors in the SLR-based calibration. While the Y1 catalogue averages over the SLR errors of many fields, the validation is sensitive to the single realization of SLR errors in the COSMOS field. We estimate the distribution of zero-point errors by comparing the SLR zero-points in the Y1 catalogue to those derived from the superior ‘forward global calibration module’ (FGCM) and reddening correction applied to three years’ worth of DES exposures by Burke et al. (2017). In this, we only use regions with Galactic extinction E(B − V) < 0.1, since the COSMOS field has relatively low extinction and strong reddening might cause larger differences between the FGCM and SLR calibration. The root-mean-square zero-point offsets between SLR and FGCM calibration are between 0.007 (z) and 0.017 (g).
We estimate the impact on Δzi by drawing 200 mean-subtracted samples of photometric offsets from the observed (FGCM-SLR) distribution, applying each to the COSMOS fluxes, and repeating the derivation of Δzi as per Section 4.1. Table 2 lists the uncertainty of the Δzi of each of the four tomographic bins due to those, which are 0.003–0.006.
4.4 Hidden-variable uncertainty
We have matched COSMOS galaxies to the shear catalogue galaxies by their griz fluxes and by their estimated pre-seeing size. This set of parameters is likely not completely predictive of a galaxy’s selection and weight in our shear catalogue. Other morphological properties (such as the steepness of its profile) probably matter and do correlate with redshift (e.g. Soo et al. 2017). In addition, the matching in size is only done in 85 per cent of cases to begin with Section 4.1.
To estimate the effect of any variables hidden to our matching algorithm, we repeat the process while ignoring the size variable. We find changes in Δzi for metacalibration to be (+0.010, +0.015, +0.009, +0.014) in the four bins. Soo et al. (2017) found that the single morphological parameter that provides the greatest improvement in σ68 and outlier fraction is galaxy size. Since we therefore expect the size to have the strongest influence on both lensing and redshift, and we are correcting for size, we estimate the potential influence of any further variables as no more than half of the size effect. We do not assume that these systematic errors found in simulations are exactly equal in the data – rather, we only assume that the two are of similar size, and thus use the rms of offsets found in the simulation as the width of a Gaussian systematic uncertainty on the data. We take half of the quadratic mean of the shifts in the four redshift bins, ±0.0066, as our estimate of the hidden-variable uncertainty in each bin. These biases are likely to be correlated between bins. In Section 4.6, we describe a modification to our single-bin uncertainties that accounts for potential correlations.
4.5 Systematic errors in matching
Even in the absence of the above uncertainties, the resampling algorithm described above might not quite reproduce the true redshift distribution of the input sample. The matching algorithm may not, for example, pick a COSMOS galaxy which is an unbiased estimator of the target galaxy’s redshift, especially given the sparsity and inhomogeneous distribution of the COSMOS sample in the four- to five-dimensional space of griz fluxes and size.
We estimate the size of this effect on Δzi using the mean offset in binned mean true redshift of the 367 realizations of resampled COSMOS-like catalogues in the Buzzard simulations (see Section 4.2) from the binned mean true redshift of the underlying Buzzard shape sample.
We find differences in mean true redshift of sample A matched B of (0.0027, 0.0101, 0.0094, 0.004) for the four redshift bins. Since the simulation is not fully realistic, we do not attempt to correct the result of our resampling with these values. Rather, we take them as indicators of possible systematic uncertainties of the resampling algorithm. Following the argument in Section 4.4,we thus use the quadratic mean of these values (0.0073) as a systematic uncertainty in each bin.
4.6 Combined uncertainties and correlation between redshift bins
The final uncertainties on the Δzi are estimated by adding in quadrature the contributions listed above, yielding the ‘COSMOS total Δzi uncertainty’ in Table 2. These values are derived independently for each redshift bin, but it is certain that the Δzi have correlated errors, e.g. from sample variance as shown in Fig. 6, and such correlations should certainly be included in the inference of cosmological parameters. The values of the off-diagonal elements of the combined COSMOS Δz covariance matrix, are, however, difficult to estimate with any precision. In Appendix A, we demonstrate that by increasing the diagonal elements of the covariance matrix by a factor (1.6)2 and nulling the off-diagonal elements, we can ensure that any inferences based on the Δzi are conservatively estimated for any reasonable values of the off-diagonal elements. We therefore apply a factor of 1.6 to all of the single-bin uncertainties in deriving the ‘COSMOS final Δzi’ constraints for metacalibration and im3shape given in Table 2.
4.7 Standard photo-z performance metrics
Although not a critical input to the cosmological tests of DES Collaboration et al. (2017), we determine here some standard metrics of photo-z performance. We define the residual R as the difference between the mean of the pPZ(z) using the MOF photometry and a random draw from the COSMOS pC30(z) matched during resampling. We use a random draw from pC30(z) rather than the peak, so that uncertainty in these ‘truth’ z’s is included in the metrics. Because the width of pC30(z) is much smaller than that of pPZ(z), this does not affect the results significantly.
We define σ68(R) as the 68 per cent spread of R around its median. In this section, σ68(R) measures the departure of the mean of pPZ(z) from the true z, whereas the σ68 in Fig. 2 is a measure of width of pPZ(z) independent of any truth redshifts. We also measure the outlier fraction, defined as the fraction of data for which |R| > 2 × σ68. If the redshift distribution were Gaussian, the outlier fraction would be 5 per cent, and this metric is a measure of the tails of the R distribution.
We calculate the uncertainties on these metrics from sample variance, COSMOS photometric calibration uncertainty, and selection of the lensing sample by hidden variables (cf. Sections 4.2–4.4). We add each of the these uncertainties in quadrature in each tomographic bin, and highlight that the largest source of uncertainty is due to sample variance.
Table 3 presents the metric values and uncertainties of the galaxies in each redshift bin, using the metacalibration sample and binning.
Common performance metrics and uncertainties measured using bpz point predictions and draws from the rescaled COSMOS2015 PDFs. The quantity σ68(R) is the 68 per cent spread of the residual distribution R, about the median. The outlier fraction is defined as the fraction of galaxies with griz redshift estimates than 2 × σ68(R) from the COSMOS2015 value.
Metric . | 0.20 < z < 0.43 . | 0.43 < z < 0.63 . | 0.63 < z < 0.90 . | 0.90 < z < 1.30 . |
---|---|---|---|---|
bpz metacalibration binning, MOF pPZ(z) | ||||
σ68(R) | 0.12 ± 0.01 | 0.16 ± 0.01 | 0.12 ± 0.01 | 0.17 ± 0.01 |
Outlier fraction per cent | 3.3 ± 0.5 | 3.6 ± 0.8 | 6.1 ± 0.4 | 6.6 ± 0.5 |
DNF metacalibration | ||||
σ68(R) | 0.10 ± 0.01 | 0.16 ± 0.01 | 0.16 ± 0.01 | 0.21 ± 0.01 |
Outlier fraction per cent | 5.0 ± 0.4 | 3.8 ± 0.8 | 8.6 ± 0.6 | 7.5 ± 0.5 |
Metric . | 0.20 < z < 0.43 . | 0.43 < z < 0.63 . | 0.63 < z < 0.90 . | 0.90 < z < 1.30 . |
---|---|---|---|---|
bpz metacalibration binning, MOF pPZ(z) | ||||
σ68(R) | 0.12 ± 0.01 | 0.16 ± 0.01 | 0.12 ± 0.01 | 0.17 ± 0.01 |
Outlier fraction per cent | 3.3 ± 0.5 | 3.6 ± 0.8 | 6.1 ± 0.4 | 6.6 ± 0.5 |
DNF metacalibration | ||||
σ68(R) | 0.10 ± 0.01 | 0.16 ± 0.01 | 0.16 ± 0.01 | 0.21 ± 0.01 |
Outlier fraction per cent | 5.0 ± 0.4 | 3.8 ± 0.8 | 8.6 ± 0.6 | 7.5 ± 0.5 |
Common performance metrics and uncertainties measured using bpz point predictions and draws from the rescaled COSMOS2015 PDFs. The quantity σ68(R) is the 68 per cent spread of the residual distribution R, about the median. The outlier fraction is defined as the fraction of galaxies with griz redshift estimates than 2 × σ68(R) from the COSMOS2015 value.
Metric . | 0.20 < z < 0.43 . | 0.43 < z < 0.63 . | 0.63 < z < 0.90 . | 0.90 < z < 1.30 . |
---|---|---|---|---|
bpz metacalibration binning, MOF pPZ(z) | ||||
σ68(R) | 0.12 ± 0.01 | 0.16 ± 0.01 | 0.12 ± 0.01 | 0.17 ± 0.01 |
Outlier fraction per cent | 3.3 ± 0.5 | 3.6 ± 0.8 | 6.1 ± 0.4 | 6.6 ± 0.5 |
DNF metacalibration | ||||
σ68(R) | 0.10 ± 0.01 | 0.16 ± 0.01 | 0.16 ± 0.01 | 0.21 ± 0.01 |
Outlier fraction per cent | 5.0 ± 0.4 | 3.8 ± 0.8 | 8.6 ± 0.6 | 7.5 ± 0.5 |
Metric . | 0.20 < z < 0.43 . | 0.43 < z < 0.63 . | 0.63 < z < 0.90 . | 0.90 < z < 1.30 . |
---|---|---|---|---|
bpz metacalibration binning, MOF pPZ(z) | ||||
σ68(R) | 0.12 ± 0.01 | 0.16 ± 0.01 | 0.12 ± 0.01 | 0.17 ± 0.01 |
Outlier fraction per cent | 3.3 ± 0.5 | 3.6 ± 0.8 | 6.1 ± 0.4 | 6.6 ± 0.5 |
DNF metacalibration | ||||
σ68(R) | 0.10 ± 0.01 | 0.16 ± 0.01 | 0.16 ± 0.01 | 0.21 ± 0.01 |
Outlier fraction per cent | 5.0 ± 0.4 | 3.8 ± 0.8 | 8.6 ± 0.6 | 7.5 ± 0.5 |
5 COMBINED CONSTRAINTS
To supplement the constraints on Δzi derived above using the COSMOS2015 photo-z’s, we turn to the ‘correlation redshift’ methodology (Newman 2008; Ménard et al. 2013; Schmidt et al. 2013), whereby one measures the angular correlations between the unknown sample (the WL sources) and a population of objects with relatively well-determined redshifts. In our case, the known population are the redMaGiC galaxies, selected precisely so that their griz colours yield high-accuracy photometric redshift estimates.
An important complication of applying WZ to DES Y1 is that we do not have a sufficient sample of galaxies with known redshift available that spans the redshift range of the DES Y1 lensing source galaxies – the redMaGiC galaxies do not extend beyond 0.2 < z < 0.9. Constraints on the mean redshift of a source population can still be derived in this case, but only by assuming a shape for the n(z) distribution, whose mean is then determined by the clustering signal in a limited redshift interval. A mismatch in shape between the assumed and true n(z) is a source of systematic uncertainty in such a WZ analysis. One of the main results of Gatti et al. (2017), which describes the implementation and full estimation of uncertainties of the WZ method for DES Y1 source galaxies, is that while this systematic uncertainty needs to be accounted for, it is not prohibitively large. This statement is validated in Gatti et al. (2017) for the degree of mismatch between the true n(z) and the n(z) found in a number of photometric redshift methods applied to simulated galaxy catalogues. The redshift distributions of the DES weak lensing sources as estimated by bpz, as far as we can judge this from the comparison with the COSMOS estimates of their true n(z), show a similar level of mismatch to the truth. The systematic uncertainty budget derived in Gatti et al. (2017) is therefore applicable to the data. We do not, however, attempt to correct the systematic offsets in WZ estimates of Δzi introduced due to this effect – for this, we would require the galaxy populations and photometric measurements in the simulations to be perfectly realistic.
The method is applied to DES Y1 data in Davis et al. (2017). A similar analysis was performed on the DES SV data set in Davis et al. (2018). The resultant estimates of Δzi are listed in Table 2 and plotted in Fig. 7. The full ni(z)’s estimated from the WZ method are plotted in Fig. 4. Note that the WZ method obtains no useful constraint for bin 4 because the redMaGiC sample is confined to z < 0.9 and thus has little overlap with bin 4. Due to the lack of independent confirmation, the redshift calibration of this bin should be used with greater caution – in DES Collaboration et al. (2017) and Gruen et al. (2017), we indeed show that constraints do not significantly shift when the bin is removed from the analysis.

Constraints on the shifts Δzi applied to the metacalibrationnPZ(z) distributions for the weak lensing source galaxies are plotted for three different validation techniques. Shifts derived from resampling the COSMOS 30-band redshifts are described in this paper, and agree well with those derived (for bins 1–3 only) using angular correlations between the source population and redMaGiC galaxies (WZ) by Davis et al. (2017) (COSMOS constraints plotted here have been expanded as per Appendix A to include the effects of poorly known correlation between bins). These are also consistent with the weak lensing shear-ratio tests conducted by Prat et al. (2017). The final validation constraints on Δzi are taken as the combination of the COSMOS and WZ results for each redshift bin (where available), and yield the 68 per cent confidence intervals denoted by the black points and error bars in the one-dimensional marginal plots. The dashed lines at Δzi = 0 indicate no mean shift from the bpz posteriors – the validation processes yield shifts that are non-zero at ≈1σ level.
In the three lower redshift bins, the COSMOS and WZ validation methods generate estimates of Δzi that are fully consistent. Indeed even their ni(z) curves show qualitative agreement. We therefore proceed to combine their constraints on Δzi to yield our most accurate and reliable estimates. The statistical errors of the COSMOS and WZ methods are uncorrelated (sample variance in the COSMOS field versus shot noise in the measurements of angular correlations in the wide field). The dominant systematic errors of the two methods should also be uncorrelated, e.g. shortcomings in our resampling for COSMOS versus uncertainties in the bias evolution of source galaxies for WZ. We are therefore confident that we can treat the COSMOS and WZ constraints as independent, and we proceed to combine them by multiplying their respective one-dimensional Gaussian distributions for each Δzi, i.e. inverse-variance weighting. In bin 4, the final constraints are simply the COSMOS constraints since WZ offers no information.
The resultant constraints, listed for both metacalibration and im3shape catalogues in Table 2, are the principal result of this work, and are adopted as input to the cosmological inferences of Troxel et al. (2017) and DES Collaboration et al. (2017). The adopted 68 per cent-confidence ranges for each Δzi are denoted by the grey bands in the one-dimensional marginal plots of Fig. 7.
One relevant question is whether our calibration finds that significant non-zero shifts are required to correct the photo-z estimates of the mean redshift. For the fiducial metacalibration bpz, this is not the case: the |$\chi^2=\sum_i(\Delta z^i/\sigma_{\Delta z^i})^2$| is 3.5 with 4 bins. However, the combined Δz4 is non-zero at 2.6σ for im3shape bpz and the Δz2 is non-zero at 3.3σ for metacalibration DNF, indicating that there are significant alterations being made to some of the |$n_{\rm PZ}^i(z)$| estimates.
A further check of the accuracy of our ni(z) estimation is presented by Prat et al. (2017) using the ratios of lensing shear on the different source bins induced by a common set of lens galaxies. Initially proposed as a cosmological test (Jain & Taylor 2003), the shear ratio is in fact much less sensitive to cosmological parameters than to potential errors in either the calibration of the shear measurement or the determination of the ni(z). We plot in Fig. 7 the constraints on Δzi inferred by Prat et al. (2017) after marginalization over the estimated errors in shear calibration and assuming a fixed Λ cold dark matter cosmology with Ωm = 0.3. The shear-ratio test is fully consistent with the COSMOS and WZ estimates of Δzi, though we should keep in mind that this test is also dependent on the validity of the shear calibration and some other assumptions in the analysis, and importantly is covariant with the WZ method, because both methods rely on correlation functions as measured with respect to the same galaxy samples.
6 USE FOR COSMOLOGICAL INFERENCE
The final rows of Table 2 provide the prior on errors in the redshift distributions used during inference of cosmological parameters for the DES Y1 data, under the assumption that errors in the ni(z) resulting from the photo-z analysis follow equation (1). Determination of redshift distributions is and will continue to be one of the most difficult tasks for obtaining precision cosmology from broad-band imaging surveys such as DES, so it is important to examine the potential impact of assumptions in our analysis choices. Further, we wish to identify areas where our methodology can be improved and thereby increase the precision and accuracy of future cosmological analyses.
6.1 Dependence on COSMOS2015 redshifts
First, we base our COSMOS validation on the COSMOS2015 redshift catalogue derived from fitting spectral templates to 30-band fluxes. Our COSMOS validation rests on the assumption that Laigle et al. (2016) have correctly estimated the redshift posteriors of their sources. Overall, redshift biases in the COSMOS2015 redshifts are significant, unrecognized sources of error in our cosmological inferences if they approach or exceed the δz ≈ 0.01–0.02 range of uncertainty in our Δzi constraints. More precisely, this bias must accrue to the portion of the COSMOS2015 catalogue that is bright enough to enter the DES Y1 shear catalogues.
For the subset of their sources with spectroscopic redshifts, Laigle et al. (2016) report that galaxies in the magnitude interval 22 < i < 23 have ‘catastrophic’ disagreement between photo-z and spectroscopic z for only 1.7 per cent (0.6 per cent) for star-forming (quiescent) galaxies (their table 4). This is the magnitude range holding the 50 per cent completeness threshold of the DES Y1 shear catalogues. Brighter bins have lower catastrophic-error rates, and only about 5 per cent of weight in the metacalibration lensing catalogue is provided by galaxies fainter than i = 23. It would thus be difficult for these catastrophic errors to induce photo-z errors of 0.01 or more.
About 30 per cent of the galaxies used for the COSMOS weak lensing validation have spectroscopic redshifts from the latest 20,000 I < 22.5 selected zCOSMOS DATA Release DR3, covering 1.7 deg2 of the COSMOS field to z < 1.2 (Lilly et al. 2009b). We can thus use this subsample as an additional test of this statement. In all redshift bins, the shifts in the mean redshift estimated using this spectroscopic subset are very similar (less than 1σ of our error estimate) to the corresponding shifts estimated with photometric redshifts in the full sample. The difference between the 30-band (corrected) photometric mean redshifts and the corresponding spectroscopic redshifts for this subset is also within our error estimates. These tests indicate that the potential (unknown) biases in the 30-band photometric redshifts are smaller than other sources of uncertainty in the mean redshifts used for our WL analysis.
Of greater concern is the potential for bias in the portion of the DES detection regime for which spectroscopic validation of COSMOS2015 photo-z’s is not possible. Neither we nor Laigle et al. (2016) have direct validation of this subsample, so we are relying on the success of their template-based method and broad spectral coverage in the spectroscopic regime to extend into the non-spectroscopic regime. Our confidence is boosted, however, by the agreement in Δzi between the COSMOS validation and the independent WZ validation in bins 1, 2, and 3.
Finally, we note that we have also attempted to validate the photo-z distributions using only the galaxies with spectroscopic redshifts in the COSOMOS field, and find consistent, albeit uncompetitive results. The number of galaxies with spectra (20k) is an order of magnitude less than those with reliable photometric redshifts which increases statistical uncertainties, cosmic variance uncertainties and uncertainties from data reweighting.
6.2 Insensitivity to ni(z) shape
Equation (1) assumes that the only errors in the |$n_{\rm PZ}^i(z)$| distributions take the form of a translation of the distribution in redshift. We do not expect that errors in the photo-z distribution actually take this form; rather we assume that the shape of ni(z) has little impact on our cosmological inference as long as the mean of the distribution is conserved – and our methodology forces the mean of the ni(z) to match that derived from the COSMOS2015 resampling. The validity of this assumption can be tested by assuming that any errors in the shifted-bpzni(z) from equation (1) are akin to the difference between these distributions and |$n^i_{\rm COSMOS}(z)$| derived from the resampled COSMOS catalogues during the validation process of Section 4.1. We produce a simulated data vector for the DES Y1 cosmology analysis of DES Collaboration et al. (2017) from a noiseless theoretical prediction using the |$n^i_{\rm COSMOS}(z)$| distributions. We then fit this data using a model that assumes the shifted bpz distributions. The best-fitting cosmological parameters depart from those in the input simulation by less than ten per cent of the uncertainty of DES Collaboration et al. (2017). We therefore confirm that the detailed shape of ni(z) is not important to the Y1 analysis.
6.3 Depth variation
A third assumption in our analysis is that the ni(z) are the same for all portions of the DES Y1 catalogue footprint (aside, of course, from the intrinsic density fluctuations that we wish to measure). This is not the case: the failure to apply SLR adjustment to our fluxes in bpz (Section 3.1.2) means that we have not corrected our Metacal fluxes for Galactic extinction, and therefore have angular variations in survey depth and photo-z assignments. Even without this error, we would have significant depth fluctuations because of variation in the number and quality of exposures on different parts of the survey footprint.
Appendix B provides an approximate quantification of the impact of ni(z) inhomogeneities on our measurements of the two-point correlation functions involving the shear catalogue. There we conclude that the few per cent fluctuations in survey depth and colour calibration that exist in our source catalogues should not significantly influence our cosmological inferences, as long as we use the source-weighted mean ni(z) over the survey footprint. Both the COSMOS and WZ validation techniques produce source-weighted estimates of Δzi, as required.
7 CONCLUSIONS AND FUTURE PROSPECTS
We have estimated redshift distributions and defined tomographic bins of source galaxies in DES Y1 lensing analyses from photometric redshifts based on their griz photometry. While we use traditional photo-z methods in these steps, we independently determine posterior probability distributions for the mean redshift of each tomographic bin that are then used as priors for subsequent lensing analyses.
The method for determining these priors developed in this paper is to match galaxies with COSMOS2015 30-band photometric redshift estimates to DES Y1 lensing source galaxies, selecting and weighting the former to resemble the latter in their griz flux and pre-seeing size measurements. The mean COSMOS2015 photo-z of the former sample is our estimate of the mean redshift of the latter. We determine uncertainties in this estimate, which we find have comparable, dominant contributions from
sample variance in COSMOS, i.e. the scatter in measured mean redshift calibration due to the limited footprint of the COSMOS field,
the influence of morphological parameters such as galaxy size on the lensing source sample selection, and
systematic mismatches of the original and matched sample in the algorithm we use.
A significant reduction of the overall uncertainty of the mean redshift priors derived in this work would thus only be possible with considerable additional observations and algorithmic advances.
Subdominant contributions, in descending order, are due to
errors in photometric calibration of the griz data in the COSMOS field and
the finite subsample size from the DES Y1 shear catalogues that we use for resampling.
The COSMOS2015 30-band photometric estimates of the mean redshifts, supplemented by consistent measures by means of angular correlation against DES redMaGiC galaxies in all but the highest redshift bins (Cawthon et al. in preparation; Davis et al. 2017; Gatti et al. 2017), have allowed us to determine the mean redshifts of four bins of WL source galaxies to 68 per cent CL accuracy ±0.011–0.022, independent of the original bpz redshift estimates used to define the bins and the nominal |$n_{\rm PZ}^i(z)$| distributions. These redshift uncertainties are a highly subdominant contributor to the error budget of the DES Y1 cosmological parameter determinations of DES Collaboration et al. (2017) when marginalizing over the full set of nuisance parameters. Likewise, the methodology of marginalizing over the mean redshift uncertainty only, rather than over the full shape of the ni(z), biases our analyses at less than 10 per cent of their uncertainty. Thus, the methods and approximations herein are sufficient for the Y1 analyses.
DES is currently analysing survey data covering nearly four times the area used in the Y1 analyses of this paper and DES Collaboration et al. (2017), and there are ongoing improvements in depth, calibration, and methodology. Thus, we expect >2× reduction in the statistical and systematic uncertainties in future cosmological constraints, compared to the Y1 work. Uncertainties in ni(z) will become the dominant source of error in future analyses of DES and other imaging surveys, without substantial improvement in the methodology presented here. We expect that the linear-shift approximation in equation (1) will no longer suffice for quantifying the validation of the ni(z). Significant improvement will be needed in some combination of: spectroscopic and/or multiband photometric validation data; photo-z methodology; redshift range, bias constraints, and statistical errors of WZ measurements; and treatment of survey inhomogeneity. The redshift characterization of broad-band imaging surveys is a critical and active area of research, and will remain so in the years to come.
ACKNOWLEDGEMENTS
Support for DG was provided by NASA through Einstein Postdoctoral Fellowship grant number PF5-160138 awarded by the Chandra X-ray Center, which is operated by the Smithsonian Astrophysical Observatory for NASA under contract NAS8-03060.
Funding for the DES Projects has been provided by the U.S. Department of Energy, the U.S. National Science Foundation, the Ministry of Science and Education of Spain, the Science and Technology Facilities Council of the United Kingdom, the Higher Education Funding Council for England, the National Center for Supercomputing Applications at the University of Illinois at Urbana-Champaign, the Kavli Institute of Cosmological Physics at the University of Chicago, the Center for Cosmology and Astro-Particle Physics at the Ohio State University, the Mitchell Institute for Fundamental Physics and Astronomy at Texas A&M University, Financiadora de Estudos e Projetos, Fundação Carlos Chagas Filho de Amparo à Pesquisa do Estado do Rio de Janeiro, Conselho Nacional de Desenvolvimento Científico e Tecnológico and the Ministério da Ciência, Tecnologia e Inovação, the Deutsche Forschungsgemeinschaft, and the Collaborating Institutions in the DES.
The Collaborating Institutions are Argonne National Laboratory, the University of California at Santa Cruz, the University of Cambridge, Centro de Investigaciones Energéticas, Medioambientales y Tecnológicas-Madrid, the University of Chicago, University College London, the DES-Brazil Consortium, the University of Edinburgh, the Eidgenössische Technische Hochschule Zürich, Fermi National Accelerator Laboratory, the University of Illinois at Urbana-Champaign, the Institut de Ciències de l’Espai (IEEC/CSIC), the Institut de Física d’Altes Energies, Lawrence Berkeley National Laboratory, the Ludwig-Maximilians Universität München and the associated Excellence Cluster Universe, the University of Michigan, the National Optical Astronomy Observatory, the University of Nottingham, The Ohio State University, the University of Pennsylvania, the University of Portsmouth, SLAC National Accelerator Laboratory, Stanford University, the University of Sussex, Texas A&M University, and the OzDES Membership Consortium.
Based in part on observations at Cerro Tololo Inter-American Observatory, National Optical Astronomy Observatory, which is operated by the Association of Universities for Research in Astronomy under a cooperative agreement with the National Science Foundation.
The DES data management system is supported by the National Science Foundation under grant numbers AST-1138766 and AST-1536171. The DES participants from Spanish institutions are partially supported by MINECO under grants AYA2015-71825, ESP2015-88861, FPA2015-68048, SEV-2012-0234, SEV-2016-0597, and MDM-2015-0509, some of which include ERDF funds from the European Union. IFAE is partially funded by the CERCA programme of the Generalitat de Catalunya. Research leading to these results has received funding from the European Research Council under the European Union’s Seventh Framework Programme (FP7/2007-2013) including ERC grant agreements 240672, 291329, and 306478. We acknowledge support from the Australian Research Council Centre of Excellence for All-sky Astrophysics, through project number CE110001020.
This manuscript has been authored by Fermi Research Alliance, LLC under Contract No. DE-AC02-07CH11359 with the U.S. Department of Energy, Office of Science, Office of High Energy Physics. The United States Government retains and the publisher, by accepting the article for publication, acknowledges that the United States Government retains a non-exclusive, paid-up, irrevocable, world-wide license to publish or reproduce the published form of this manuscript, or allow others to do so, for United States Government purposes.
Based in part on zCOSMOS observations carried out using the Very Large Telescope at the ESO Paranal Observatory under Programme ID: LP175.A-0839.
Footnotes
While there is Y- band data available, due to its lower depth, strong wavelength overlap with z, and incomplete coverage, we did not use it for photo-z estimation.
The outlier fraction of |$7.85\hbox{ per cent}$| quoted in Cool et al. (2013) includes all objects that lie more than δz > 0.025 from their true redshift. The difference in template photometry caused by such a small change in redshift is well within the scatter of our computed DES-template offsets. Of greater concern is the fraction of objects with large redshift differences, which is <4 per cent.
REFERENCES
APPENDIX A
COMPENSATION FOR UNKNOWN COVARIANCES IN TOMOGRAPHIC ANALYSES
There are four parameters Δxi in our model for errors in the redshift distribution, which will be used when constraining some parameter(s) π of the cosmological model. While we have produced reliable bounds |$\sigma^2_\mathrm{ i}$| of the variance of each of these, we have less knowledge of the off-diagonal elements rijσiσj of the covariance matrix C of the Δxi – perhaps we know only that |rij| ≤ r. We wish to make estimates of π and the uncertainty σπ that we are sure do not underestimate the true error, for any allowed values of the rij, in analyses that combine these redshift bins. We show here this can be done by amplifying the diagonal elements of C by a factor f2, while setting the off-diagonal elements to zero (cf. also Zuntz et al. 2017, their appendix D).
In the case of our Δzi, we have N = 4, and we estimate that correlation coefficients between bins should be modest, |rij| ≤ 0.5 (see Fig. 6). Then, equation (A10) implies that inflating the individual bins’ errors by |$f=\sqrt{2.5}\approx 1.6$| will yield a conservative estimate of the impact of redshift uncertainties on any parameter π.
APPENDIX B
Effect of ni(z) inhomogeneities
The other DES Y1 cosmological observable using the source population is the two-point correlation function of shear ξγ(θ). The shear γ is a two-component field, and there are two non-trivial correlation functions ξ±, or equivalently the spin field can be decomposed into E- and B-mode components. Guzik & Bernstein (2005) analyse the influence of multiplicative inhomogeneities on the full E/B field, and demonstrate that such systematic errors shift power between E and B modes at a level comparable to the change in the E mode. Here, we will consider a simplified scalar version of the Guzik & Bernstein (2005) formalism, which we can think of as quantifying the E-mode errors due to inhomogeneity. If these are small, we do not have to worry about effects on B modes either.
The rms fluctuation in source mean redshift induced by failure to apply the SLR adjustment (Section 3.1.2) is δz ≲ 0.01. We estimate the effect of variations in survey depth by removing sources above an i-band MOF magnitude mlim from the matched COSMOS sample. The derivative of ⟨z⟩ with respect to mlim is below 0.05 mag−1 in the relevant range of mlim for all four source bins. At the variation of depth present in DES Y1 (0.25 mag rms, Drlica-Wagner et al. 2017), this leads to an rms fluctuation in source mean redshift of δz ≲ 0.012. Jointly, the rms due to both effects, which may be partly correlated, are δz ≲ 0.02 rms.
We expect the source density and inverse critical density (i.e. δ and ε) to scale no faster than linearly with the mean redshift of the sample, and the lowest redshift bin has z ≈ 0.3, so ⟨ε2⟩ ≲ (δz/z)2 ≈ 0.004. Thus, we estimate an overall scaling error of the ξγ measurements at roughly this level.
The most accurately measured combination of cosmological parameters in DES Y1 data is S8 = σ8(Ωm/0.3)0.5, which is determined to a fractional accuracy of |${\approx}3.5\hbox{ per cent}$| (DES Collaboration et al. 2017). Since |$\xi_+\propto S_8^2$|, roughly, its error due to uncorrected Galactic extinction is estimated to be ≈8 × smaller than the uncertainty level in the DES Y1 analyses.
APPENDIX C
Calibration of DNF ni(z)
The COSMOS validation procedure of Section 4.1 was repeated for the DNF photo-z’s in the same way as for bpz, as was the WZ validation. The resultant Δzi are shown in Table C1 and the ni(z) and photo-z precision metrics are plotted in Fig. C1. Note that we do not require agreement between the values for DNF and those for bpz, because they apply to different binnings of the source galaxies.

Top panel: as Fig. 2 showing the average width of the posterior distributions of DNF photometric redshifts. Bottom panel: the ni(z) for galaxies with bin assignments and estimated using DNF photo-z’s rather than bpz. The error bars correspond to the standard deviation of the individual source’s σ68 around the average. The bottom panel is otherwise equivalent to Fig. 4.
Values of and error contributions to photo-z shift parameters of DNF ni(z).
Value . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . |
---|---|---|---|---|
zPZ range | 0.20–0.43 | 0.43–0.63 | 0.63–0.90 | 0.90–1.30 |
COSMOS footprint sampling | ±0.0071 | ±0.0075 | ±0.0053 | ±0.0080 |
COSMOS limited sample size | ±0.0015 | ±0.0014 | ±0.0016 | ±0.0039 |
COSMOS photometric calibration errors | ±0.0023 | ±0.0029 | ±0.0046 | ±0.0045 |
COSMOS hidden variables | ±0.0030 | ±0.0080 | ±0.0090 | ±0.0030 |
COSMOS errors in matching | ±0.0069 | ±0.0069 | ±0.0069 | ±0.0069 |
COSMOS single-bin Δzi uncertainty | ±0.011 | ±0.013 | ±0.013 | ±0.013 |
metacalibration | ||||
COSMOS final Δzi, tomographic uncertainty | −0.024 ± 0.017 | −0.042 ± 0.021 | +0.006 ± 0.021 | +0.038 ± 0.020 |
WZ final Δzi | +0.003 ± 0.014 | −0.037 ± 0.014 | +0.005 ± 0.019 | – |
Combined final Δzi | −0.008 ± 0.011 | −0.039 ± 0.012 | +0.006 ± 0.014 | +0.038 ± 0.020 |
Value . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . |
---|---|---|---|---|
zPZ range | 0.20–0.43 | 0.43–0.63 | 0.63–0.90 | 0.90–1.30 |
COSMOS footprint sampling | ±0.0071 | ±0.0075 | ±0.0053 | ±0.0080 |
COSMOS limited sample size | ±0.0015 | ±0.0014 | ±0.0016 | ±0.0039 |
COSMOS photometric calibration errors | ±0.0023 | ±0.0029 | ±0.0046 | ±0.0045 |
COSMOS hidden variables | ±0.0030 | ±0.0080 | ±0.0090 | ±0.0030 |
COSMOS errors in matching | ±0.0069 | ±0.0069 | ±0.0069 | ±0.0069 |
COSMOS single-bin Δzi uncertainty | ±0.011 | ±0.013 | ±0.013 | ±0.013 |
metacalibration | ||||
COSMOS final Δzi, tomographic uncertainty | −0.024 ± 0.017 | −0.042 ± 0.021 | +0.006 ± 0.021 | +0.038 ± 0.020 |
WZ final Δzi | +0.003 ± 0.014 | −0.037 ± 0.014 | +0.005 ± 0.019 | – |
Combined final Δzi | −0.008 ± 0.011 | −0.039 ± 0.012 | +0.006 ± 0.014 | +0.038 ± 0.020 |
Values of and error contributions to photo-z shift parameters of DNF ni(z).
Value . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . |
---|---|---|---|---|
zPZ range | 0.20–0.43 | 0.43–0.63 | 0.63–0.90 | 0.90–1.30 |
COSMOS footprint sampling | ±0.0071 | ±0.0075 | ±0.0053 | ±0.0080 |
COSMOS limited sample size | ±0.0015 | ±0.0014 | ±0.0016 | ±0.0039 |
COSMOS photometric calibration errors | ±0.0023 | ±0.0029 | ±0.0046 | ±0.0045 |
COSMOS hidden variables | ±0.0030 | ±0.0080 | ±0.0090 | ±0.0030 |
COSMOS errors in matching | ±0.0069 | ±0.0069 | ±0.0069 | ±0.0069 |
COSMOS single-bin Δzi uncertainty | ±0.011 | ±0.013 | ±0.013 | ±0.013 |
metacalibration | ||||
COSMOS final Δzi, tomographic uncertainty | −0.024 ± 0.017 | −0.042 ± 0.021 | +0.006 ± 0.021 | +0.038 ± 0.020 |
WZ final Δzi | +0.003 ± 0.014 | −0.037 ± 0.014 | +0.005 ± 0.019 | – |
Combined final Δzi | −0.008 ± 0.011 | −0.039 ± 0.012 | +0.006 ± 0.014 | +0.038 ± 0.020 |
Value . | Bin 1 . | Bin 2 . | Bin 3 . | Bin 4 . |
---|---|---|---|---|
zPZ range | 0.20–0.43 | 0.43–0.63 | 0.63–0.90 | 0.90–1.30 |
COSMOS footprint sampling | ±0.0071 | ±0.0075 | ±0.0053 | ±0.0080 |
COSMOS limited sample size | ±0.0015 | ±0.0014 | ±0.0016 | ±0.0039 |
COSMOS photometric calibration errors | ±0.0023 | ±0.0029 | ±0.0046 | ±0.0045 |
COSMOS hidden variables | ±0.0030 | ±0.0080 | ±0.0090 | ±0.0030 |
COSMOS errors in matching | ±0.0069 | ±0.0069 | ±0.0069 | ±0.0069 |
COSMOS single-bin Δzi uncertainty | ±0.011 | ±0.013 | ±0.013 | ±0.013 |
metacalibration | ||||
COSMOS final Δzi, tomographic uncertainty | −0.024 ± 0.017 | −0.042 ± 0.021 | +0.006 ± 0.021 | +0.038 ± 0.020 |
WZ final Δzi | +0.003 ± 0.014 | −0.037 ± 0.014 | +0.005 ± 0.019 | – |
Combined final Δzi | −0.008 ± 0.011 | −0.039 ± 0.012 | +0.006 ± 0.014 | +0.038 ± 0.020 |
Author notes
Einstein fellow.