ABSTRACT

We introduce the Planck  SZiFi catalogues, a new set of 10 catalogues of galaxy clusters detected through their thermal Sunyaev–Zeldovich (tSZ) signature. The catalogues are produced by applying the SZiFi cluster finder to the Planck PR3 temperature data down to a signal-to-noise threshold of 5. They span three frequency channel combinations (100–857, 100–545, and 100–353 GHz) and seven of them are constructed by spectrally deprojecting the Cosmic Infrared Background (CIB). This approach allows us, for the first time in the context of cluster finding, to carefully assess the impact of the cluster-correlated CIB on the recovered cluster tSZ observables, which we find to be negligible. In addition, we quantify the impact of the relativistic corrections to the tSZ signal, finding them to be at the 5–10 per cent level for the cluster tSZ amplitude but negligible for the signal-to-noise ratio. We compile our catalogues into a single Planck  SZiFi master catalogue containing a total of 1499 detections. We cross-match the master catalogue with several external tSZ and X-ray cluster catalogues, setting a lower bound on the purity of our baseline iMMF6 catalogue of 96 per cent and 99 per cent at a minimum signal-to-noise ratios of 5 and 6, respectively. We validate our cluster detection pipeline by applying it to synthetic observations, recovering cluster number counts for which we are able to produce a theoretical prediction that accurately describes them. This validation exercise indicates that our catalogues are well-suited for cosmological inference. The Planck  SZiFi master catalogue will become publicly available at https://github.com/inigozubeldia/szifi/tree/main/planck_szifi_master_catalogue.

1 INTRODUCTION

Galaxy clusters detected through the thermal Sunyaev–Zeldovich (tSZ) effect (Sunyaev & Zeldovich 1972; see Carlstrom, Holder & Reese 2002; Mroczkowski, Nagai et al. 2019, for reviews) are a powerful cosmological probe from which constraints on cosmological parameters such as the matter density parameter |$\Omega _{\mathrm{m}}$|⁠, the amplitude of matter clustering |$\sigma _8$|⁠, the equation of state of dark energy, and the sum of the neutrino masses can be obtained (e.g. Battye & Weller 2003; Allen, Evrard & Mantz 2011; Weinberg et al. 2013). Over the past 15 yr, a number of galaxy cluster catalogues with |$\sim 10^2$||$10^3$| objects have been constructed using data from the Planck experiment, the Atacama Cosmology Telescope (ACT), and the South Pole Telescope (SPT; e.g. Vanderlinde et al. 2010; Hasselfield et al. 2013; Planck Collaboration XXVII 2016; Hilton et al. 2021; Bleem et al. 2024; Klein, Mohr & Davies 2024). Some of these catalogues have been subsequently used in cosmological number-count analyses (e.g. Vanderlinde et al. 2010; Hasselfield et al. 2013; Planck Collaboration XXIV 2016; Zubeldia & Challinor 2019; Bocquet et al. 2024b; Lee, Battye & Bolliet 2025).

With their large sky coverage and broad frequency range, the Planck temperature maps remain a competitive data set for tSZ cluster finding, allowing for the probing of lower cluster masses at low redshifts compared to what is enabled by ACT and SPT (see e.g. fig. 9 of Bleem et al. 2024). The Planck Collaboration released a series of three cluster catalogues: the Planck ESZ, PSZ1, and PSZ2 catalogues (Planck Collaboration I 2012; Planck Collaboration XXIX 2014; Planck Collaboration XXVII 2016), which have been widely used in the literature and employed in several cosmological number-count analyses (e.g. Planck Collaboration XXIV 2016; Salvati, Douspis & Aghanim 2018; Zubeldia & Challinor 2019). These catalogues, however, were constructed using all the Planck High Frequency Instrument (HFI) frequency channels without an assessment of contamination from correlated foregrounds. These foregrounds include the Cosmic Infrared Background (CIB), which has been shown to have the potential to introduce biases in the cluster tSZ observables (see Zubeldia, Chluba & Battye 2023b). In addition, the approach used to estimate the noise covariance in the cluster finding process has been shown to lead to a bias in the cluster tSZ observables (the ‘covariance bias’) and to a loss of signal-to-noise ratio (Zubeldia et al. 2023a). Finally, since Planck probes the most massive, nearby clusters, modifications to the tSZ signal from relativistic temperature corrections (Challinor & Lasenby 1998; Sazonov & Sunyaev 1998; Itoh, Kohyama & Nozawa 1998; Chluba et al. 2012) may be relevant (e.g. Remazeilles et al. 2019; Rotti et al. 2021; Perrott 2024). These relativistic corrections were, however, ignored in the official Planck analyses.

In this work, we introduce the Planck  SZiFi catalogues, a new set of Planck cluster catalogues constructed to address these issues and provide a Planck galaxy cluster sample produced independently from the official ones. For cluster finding, we use the Planck PR3 maps, as opposed to the PR2 ones used for the last official Planck catalogue. We search for clusters using SZiFi,1the Sunyaev–Zeldovich iterative Finder, a publicly available enhanced implementation of the multifrequency matched filter (MMF) cluster finding algorithm that has been extensively tested with Planck-like synthetic observations (Zubeldia et al. 2023a, b). We detect cluster candidates down to a signal-to-noise threshold of 5. We estimate the MMF noise covariance iteratively, removing the detections from the noise covariance estimate. This procedure eliminates the covariance bias from the cluster tSZ observables and boosts the signal-to-noise ratio to the correct level. In addition, we carefully assess the impact of CIB contamination by (i) obtaining catalogues for several combinations of the Planck HFI frequency channels and (ii) employing, for the first time on real data, the spectrally constrained MMFs of Zubeldia et al. (2023b), which we apply in combination with the moment expansion approach of Chluba et al. (2013). We produce a total of 10 catalogues, each obtained with a different frequency and/or CIB deprojection combination, and compile them into a single Planck  SZiFi master catalogue. We then cross-match the master catalogue with several existing tSZ and X-ray cluster catalogues containing significantly more clusters than were available when the official Planck catalogues were published. Furthermore, for the first time in the context of tSZ cluster detection, we quantify the impact of the relativistic corrections to the tSZ signal on the cluster tSZ observables. Finally, we validate our cluster detection pipeline by applying it to synthetic Planck-like observations.

This paper is structured as follows. First, in Section 2 we present the data sets that we use for both cluster detection and the confirmation of cluster candidates. Then, in Sections 3 and 4, we describe, respectively, our cluster detection and confirmation pipelines. Next, in Section 5 we discuss the properties of our Planck  SZiFi cluster catalogues, describing the cluster sky position, signal-to-noise and redshift distributions, the purity of the catalogues, and the impact of iterative noise covariance estimation. We then compare one of our catalogues with the official Planck MMF3 catalogue in Section 6 and discuss the impact of the CIB and of the tSZ relativistic corrections in Sections 7 and 8, respectively. Finally, in Section 9 we validate our cluster detection pipeline with synthetic observations and we conclude in Section 10. In a series of appendices, we offer a brief description of the entries in the Planck  SZiFi master catalogue (Appendix  A) and further discussion on the validation of our assessment of CIB contamination (Appendix  B) and of our cluster detection pipeline (Appendix  C).

2 DATA

2.1 Planck data

We search for galaxy clusters using the six Planck HFI frequency maps, which have bandpasses centred at 100, 143, 217, 353, 545, and 857 GHz. In particular, we use the 2018 PR3 temperature maps (Planck Collaboration III 2020), which are publicly available from the Planck Legacy Archive2 (PLA). We do not use the three Planck Low Frequency Instrument (LFI) frequency maps due to their poor spatial resolution and higher instrumental noise.

In our analysis, we apply the Planck PR2 Galactic masks leaving 80 per cent and 70 per cent of the sky unmasked, the former for cluster detection and the latter imposed in order to obtain the final master catalogue (see Section 3). We also use the Planck PR2 temperature point-source masks for all the six HFI channels, which mask point sources detected at more than 5 |$\sigma$| significance for each channel. All the masks are also available from the PLA.

Finally, in the construction of our matched filters used for cluster detection we use the Planck 2018 Reduced Instrument MOdel (RIMO) beams and effective band transmission profiles (Planck Collaboration III 2020), which are also available from the PLA.

2.2 Cluster confirmation data

With the goal of confirming whether our detections correspond to real galaxy clusters and of assigning redshift measurements to them, we cross-match our master catalogue with a number of existing tSZ and X-ray cluster catalogues, which we refer to as our ‘external catalogues’ (see Section 4). We consider the following catalogues and meta-catalogues:

  • The MCSZ meta-catalogue of tSZ-detected clusters, which is publicly available in the M2C Galaxy Cluster Database.3 It contains a total of 2981 objects and was constructed from the Planck ESZ, PSZ1, and PSZ2 catalogues (Planck Collaboration I 2012; Planck Collaboration XXIX 2014; Planck Collaboration XXVII 2016), the SPT SPT2500 and SPT-ECS catalogues (Bleem et al. 2015, 2020), and the ACT 2008 catalogues of Marriage et al. (2011), Hasselfield et al. (2013). It includes additional validation and redshift information of the Planck clusters from a number of post-Planck sources (Canary Islands Observatories: Barrena et al. 2018; Aguado-Barahona et al. 2019; Streblyanska et al. 2019; Barrena et al. 2020; MegaCam and CFHT: van der Burg et al. 2016, 2018; SDSS: Streblyanska et al. 2018; and various other observatories: Amodeo et al. 2018; Burenin et al. 2018; Zohren et al. 2019; Zaznobin et al. 2019, 2020).

  • The MCXC-II meta-catalogue of X-ray-detected clusters (Sadibekova et al. 2024, updated from the original MCXC meta-catalogue of Piffaretti et al. 2011), also available in the M2C Galaxy Cluster Database. This meta-catalogue contains a total of 2221 objects and is based on the ROSAT All Sky Survey-based catalogues NORAS (Böhringer et al. 2000), REFLEX (Böhringer et al. 2000; Chon & Böhringer 2012), REFLEX II (Chon & Böhringer 2012), BCS (Ebeling et al. 1998, 2000), SGP (Cruddace et al. 2002), NEP (Henry et al. 2006), MACS (Ebeling et al. 2007; Mantz et al. 2010; Mann & Ebeling 2012; Repp & Ebeling 2018), CIZA (Ebeling, Mullis & Tully 2002; Kocevski et al. 2007), and RXGCC (Xu et al. 2022), as well as on the serendipitous catalogues 160SD (Mullis et al. 2003), 400SD (Burenin et al. 2007), SHARC (Burke et al. 2003; Romer et al. 2000), WARPS (Perlman et al. 2002; Horner et al. 2008), and EMSS (Gioia & Luppino 1994; Henry 2004).

  • The ACT DR5 catalogue of Hilton et al. (2018),4 containing 4195 tSZ detections.

  • The CODEX catalogue of X-ray detected clusters (Finoguenov et al. 2020). We only consider the subsample with ‘clean = True’, which contains a total of 2815 objects.

  • The RASS-MCMF catalogue of X-ray-detected clusters (Klein et al. 2023). We consider only the 99 per cent purity subsample, which contains a total of 5506 objects and includes revised redshifts for 5 clusters in the PSZ2 catalogue.

  • The SRG/eROSITA First All-Sky Survey (eRASS1) catalogue of X-ray detected clusters of Bulbul et al. (2024). We consider only the cosmology subsample, which has an estimated 95 per cent purity and contains 5259 objects.

We note that the advantage of working with meta-catalogues, such as the MCSZ and the MCXC-II meta-catalogues, is the identification of common clusters in the constituent catalogues and the homogenisation of key quantities such as the redshift and the mass proxy (SZ mass for the MCSZ and X-ray luminosity for the MCXC-II).

Furthermore, in order to assess the presence of spurious detections in our master catalogue, we cross-match it with the Second Planck Catalogue of Compact Sources (Planck Collaboration XXVI 2016) and with the Planck Catalogue of Galactic cold clumps (GCCs; Planck Collaboration XXVIII 2016), both of which are available from the PLA.

3 CLUSTER DETECTION

3.1 Overview

We detect galaxy clusters blindly by applying the tSZ cluster finder SZiFi to the Planck PR3 HFI frequency maps. SZiFi (Zubeldia et al. 2023a, b) is a publicly available enhanced implementation of the MMF cluster finding method (Herranz et al. 2002; Melin, Bartlett & Delabrouille 2006), a method successfully used for cluster finding by the Planck, ACT, and SPT collaborations (Planck: Planck Collaboration I 2012; Planck Collaboration XXIX 2014; Planck Collaboration XXVII 2016; ACT: Marriage et al. 2011; Hasselfield et al. 2013; Hilton et al. 2021; SPT: Vanderlinde et al. 2010; Bleem et al. 2015, 2020, 2022). SZiFi incorporates several novel features, most notably iterative noise covariance estimation and the ability to spectrally deproject foregrounds, both of which are used in this work. It is described in detail in Zubeldia et al. (2023a, b), where its performance was extensively tested with Planck-like synthetic observations.

Our cluster detection pipeline can be summarized as follows. First, the Planck frequency maps are tessellated into a set of non-overlapping ‘selection tiles’. Next, for each selection tile, an MMF is constructed and applied to the corresponding frequency map cut-outs (‘frequency fields’), producing a ‘tile catalogue’ by selecting the MMF signal-to-noise peaks greater than a threshold of |$q_{\mathrm{th}}=4$|⁠. This step requires estimating the noise covariance, which is done iteratively, although we also save the detections obtained in the zero iteration, which form a non-iterative tile catalogue. Each tile catalogue includes, for every detection, its signal-to-noise, which we denote by |$q_{\mathrm{obs}}$| following the official Planck analysis, which can be used as a mass observable (or mass proxy) in a cosmological analysis (e.g. Planck Collaboration XXIV 2016). It also includes estimates of the detection Compton-y parameter value at its centre |$\hat{y}_0$|⁠, angular size |$\hat{\theta }_{500}$|⁠, and sky coordinates. In a final step, the (iterative and non-iterative) tile catalogues are merged into a single (iterative and non-iterative) output catalogue. This detection pipeline is described in detail in Section 3.3.

In order to assess the impact of contamination due to signals correlated with the cluster tSZ signal (in particular, the CIB), we consider a set of 10 different MMFs formed by various combinations of frequency channels and spectral deprojections. The choices of MMFs and the Spectral Energy Distributions (SEDs) that are deprojected are described in Section 3.2. For each MMF, the detection pipeline is applied independently, producing one iterative and one non-iterative catalogue per MMF. These individual catalogues are then mutually cross-matched and compiled into a single ‘master catalogue’ in a final step, as described in Section 3.4. The master catalogue is then cross-matched against external catalogues, as explained in Section 4. It is important to note, however, that each of the catalogues in the master catalogue should be regarded an independent catalogue in its own right, with a well-defined selection.

3.2 Matched filters used and CIB deprojection

Any signal that is spatially correlated with the cluster tSZ signal has the potential to bias the cluster tSZ observables delivered by our cluster detection pipeline (e.g. |$q_{\mathrm{obs}}$| and |$\hat{y}_0$|⁠), and therefore the cluster number counts. This bias can, in turn, potentially propagate to number-count cosmological constraints (see e.g. Zubeldia et al. 2023b).5

The CIB, formed by the cumulative dust emission from star-forming galaxies (see e.g. Lagache, Puget & Dole 2005), is a prominent example of an extragalactic signal that is spatially correlated with the tSZ field. This correlation has indeed been measured and has been shown to bias measurements of the tSZ power spectrum and of the cross-correlation of the tSZ field with other tracers of the density field (e.g. Planck Collaboration XXIII 2016; Pandey et al. 2019; Yan et al. 2019; Sánchez et al. 2023; McCarthy & Hill 2024a, b; Coulton et al. 2024). Dust emission has also been measured at the location of galaxy clusters in stacking analyses using IRAS and Planck data (Montier & Giard 2005; Giard et al. 2008; Planck Collaboration XLIII 2016; Planck Collaboration XXIII 2016; Melin et al. 2018). The contribution to the CIB that is spatially correlated with a given cluster sample (the ‘cluster-correlated CIB’) therefore has the potential to bias the cluster tSZ observables.

This possibility has been discussed in two detailed analyses: Melin et al. (2018) and Zubeldia et al. (2023b). In Melin et al. (2018), clusters with dust emission from a model calibrated to Planck data were injected into Planck data and used to assess the impact on the retrieved cluster number counts and cosmological constraints. The impact was found to be small, with about 10 per cent fewer detections in the number counts between |$z=0.4$| and |$z=0.8$| relative to the case with no dust contamination and negligible changes in the derived cosmological parameters. On the other hand, Zubeldia et al. (2023b) performed a simulation-based analysis, using Planck-like synthetic maps from the Websky simulation (Stein, Alvarez & Bond 2019; Stein et al. 2020). They reported a more significant impact on the cluster tSZ observables and, consequently, on the retrieved cluster number counts, suggesting that further analysis of the Planck data may be required.

In order to assess the potential contamination of the cluster observables due to the cluster-correlated CIB, following Zubeldia et al. (2023b) we consider a set of different MMFs constructed with various combinations of frequency channels and spectral deprojections, the latter achieved through the use of spectrally constrained MMFs. The choice of channel combinations is informed by the fact that, at the Planck frequencies, the CIB signal strongly increases with frequency. Based on this, we consider three frequency combinations: all six HFI channels, the five lowest-frequency channels, and the four lowest-frequency channels. Regarding spectral deprojection, as demonstrated in Zubeldia et al. (2023b), spectrally constrained MMFs, which zero or ‘deproject’ the contribution of one or several foregrounds with given SEDs, can be highly effective at suppressing the CIB-induced bias from the cluster tSZ observables and the cluster number counts. These MMFs are particularly powerful in combination with the moment expansion approach to first order (Chluba et al. 2013; Chluba, Hill & Abitbol 2017; Rotti & Chluba 2021; Vacher et al. 2022). In this approach, an estimate for the foreground SED is deprojected together with its derivative with respect to one or several parameters parameterizing it. Doing this mitigates the impact of assuming a deprojection SED that does not perfectly match the true foreground SED, as shown in Zubeldia et al. (2023b) for Planck-like synthetic observations (see, in particular, their fig. 9). This is particularly important for the CIB, as its SED is relatively poorly constrained and varies from cluster to cluster and, on average, as a function of redshift.

Taking these considerations into account, we use the following 10 MMFs, all of which are constructed with iterative noise covariance estimation (i.e. they are ‘iterative’, hence the letter ‘i’ in the following acronyms):

  • iMMF6: Standard iterative matched filter (iMMF) using all six HFI channels.

  • iMMF5: Standard iterative matched filter (iMMF) using the five lowest-frequency HFI channels.

  • iMMF4: Standard iterative matched filter (iMMF) using the four lowest-frequency HFI channels.

  • sciMMF6: Spectrally constrained iterative matched filter (sciMMF) using all six HFI channels, deprojecting a model for the cluster-correlated CIB SED (see below).

  • sciMMF5: Spectrally constrained iterative matched filter (sciMMF) using the five lowest-frequency HFI channels, deprojecting the same model for the cluster-correlated CIB SED.

  • sciMMF4: Spectrally constrained iterative matched filter (sciMMF) using the four lowest-frequency HFI channels, deprojecting the same model for the cluster-correlated CIB SED.

  • sciMMF6 |$\boldsymbol {\beta }$|: Spectrally constrained iterative matched filter (sciMMF) using all HFI channels, deprojecting the same model for the cluster-correlated CIB SED and its first-order moment with respect to the SED parameter |$\beta$| (see below).

  • sciMMF6 |$\boldsymbol {\beta }_T$|: Spectrally constrained iterative matched filter (sciMMF) using all HFI channels, deprojecting the same model for the cluster-correlated CIB SED and its first-order moment with respect to the SED parameter |$\beta _T$| (see below).

  • sciMMF5 |$\boldsymbol {\beta }$|: As sciMMF6 |$\beta$|⁠, but using only the five lowest-frequency HFI channels.

  • sciMMF5 |$\boldsymbol {\beta }_T$|: As sciMMF6 |$\beta _T$|⁠, but using only the five lowest-frequency HFI channels.

A summary of the frequency channel and spectral deprojection combinations of our 10 iterative MMFs can be found in Table 1. We stress that each combination leads to its own catalogue of detections. In addition to these iterative MMFs, we also consider their non-iterative counterparts in order to assess the impact of iterative noise covariance estimation and for better comparison with the official Planck catalogues.

Table 1.

The 10 iterative MMFs used in this work with their corresponding frequency channel and spectral deprojection combination. We obtain a cluster catalogue with each of these MMFs.

MMFFreq. channelsSpec. deprojection
iMMF6100–857 GHz (6)None
iMMF5100–545 GHz (5)None
iMMF4100–353 GHz (4)None
sciMMF6100–857 GHz (6)CIB
sciMMF5100–545 GHz (5)CIB
sciMMF4100–353 GHz (4)CIB
sciMMF6 |$\beta$|100–857 GHz (6)CIB + moment w.r.t. |$\beta$|
sciMMF6 |$\beta _T$|100–857 GHz (6)CIB + moment w.r.t. |$\beta _T$|
sciMMF5 |$\beta$|100–545 GHz (5)CIB + moment w.r.t. |$\beta$|
sciMMF5 |$\beta _T$|100–545 GHz (5)CIB + moment w.r.t. |$\beta _T$|
MMFFreq. channelsSpec. deprojection
iMMF6100–857 GHz (6)None
iMMF5100–545 GHz (5)None
iMMF4100–353 GHz (4)None
sciMMF6100–857 GHz (6)CIB
sciMMF5100–545 GHz (5)CIB
sciMMF4100–353 GHz (4)CIB
sciMMF6 |$\beta$|100–857 GHz (6)CIB + moment w.r.t. |$\beta$|
sciMMF6 |$\beta _T$|100–857 GHz (6)CIB + moment w.r.t. |$\beta _T$|
sciMMF5 |$\beta$|100–545 GHz (5)CIB + moment w.r.t. |$\beta$|
sciMMF5 |$\beta _T$|100–545 GHz (5)CIB + moment w.r.t. |$\beta _T$|
Table 1.

The 10 iterative MMFs used in this work with their corresponding frequency channel and spectral deprojection combination. We obtain a cluster catalogue with each of these MMFs.

MMFFreq. channelsSpec. deprojection
iMMF6100–857 GHz (6)None
iMMF5100–545 GHz (5)None
iMMF4100–353 GHz (4)None
sciMMF6100–857 GHz (6)CIB
sciMMF5100–545 GHz (5)CIB
sciMMF4100–353 GHz (4)CIB
sciMMF6 |$\beta$|100–857 GHz (6)CIB + moment w.r.t. |$\beta$|
sciMMF6 |$\beta _T$|100–857 GHz (6)CIB + moment w.r.t. |$\beta _T$|
sciMMF5 |$\beta$|100–545 GHz (5)CIB + moment w.r.t. |$\beta$|
sciMMF5 |$\beta _T$|100–545 GHz (5)CIB + moment w.r.t. |$\beta _T$|
MMFFreq. channelsSpec. deprojection
iMMF6100–857 GHz (6)None
iMMF5100–545 GHz (5)None
iMMF4100–353 GHz (4)None
sciMMF6100–857 GHz (6)CIB
sciMMF5100–545 GHz (5)CIB
sciMMF4100–353 GHz (4)CIB
sciMMF6 |$\beta$|100–857 GHz (6)CIB + moment w.r.t. |$\beta$|
sciMMF6 |$\beta _T$|100–857 GHz (6)CIB + moment w.r.t. |$\beta _T$|
sciMMF5 |$\beta$|100–545 GHz (5)CIB + moment w.r.t. |$\beta$|
sciMMF5 |$\beta _T$|100–545 GHz (5)CIB + moment w.r.t. |$\beta _T$|

Following Planck Collaboration (2014), we assume that, at the Planck frequencies, the CIB SED at a given redshift can be described with a modified blackbody,

(1)

where |$\nu$| is the observation frequency, |$\beta$| is the dust emissivity index, |$T_0$| is the dust temperature at |$z=0$|⁠, |$\alpha$| is the dust evolution slope, and |$B_\nu (T)$| is Planck’s function. We further define the dust temperature parameter as |$\beta _T \equiv T_0^{-1}$|⁠.

For each of our sciMMFs, we evaluate the CIB SED and, if relevant, its derivatives with respect to |$\beta$| or |$\beta _T$|⁠, at |$\beta =1.75$|⁠, |$T_0=24.4$| K, and |$\alpha =0.36$|⁠. These parameter values are taken to be consistent with their Planck constraints in Planck Collaboration (2014), which were obtained assuming the same SED model as adopted here. In addition, we evaluate the CIB SED at |$z=0.2$|⁠, a redshift that is representative of the Planck cluster sample (see Section 5). We note, in particular, that even if the CIB peaks at higher redshifts, it is the part that is spatially correlated with our cluster sample that must be removed, as any uncorrelated contribution only adds variance to the cluster observables (see Zubeldia et al. 2023b). We also note that we deproject the moment with respect to |$\beta _T$|⁠, rather than |$T_0$|⁠, as |$\beta _T$| has been shown to be a better parametrization than |$T_0$| regarding moment deprojection (Chluba et al. 2017). Finally, in combination with the moment expansion approach, our sciMMFs have been shown to be effectively immune to the choice of the parameter values in the deprojection SED, provided that they are within a reasonable range. In particular, in Zubeldia et al. (2023b) it was shown for Planck-like synthetic data based on the Websky simulation, in which the CIB has |$\beta =1.6$| and |$T_0=20.7$| K, that evaluating |$\beta$| across a range from 1.4 to 1.8 resulted in negligible changes in the recovered signal-to-noise values for sciMMF6 |$\beta$|⁠, remaining unbiased, and in very small changes for sciMMF6 |$\beta _T$| (see their fig. 9). Similarly, evaluating |$T_0$| across a range from 16.8 K to 24.8 K resulted in negligible changes in the signal-to-noise for both sciMMF6 |$\beta _T$| and sciMMF6 |$\beta$|⁠, both remaining unbiased.

Spectral deprojection and/or the use of fewer frequency channels leads to a signal-to-noise penalty relative to the optimal case, which is given by iMMF6, with fewer and less significant detections: foreground mitigation is, therefore, potentially achieved at a cost. This signal-to-noise penalty is quantified in Section 5.

3.3 Cluster detection pipeline

In this section, we describe how we use SZiFi to produce each of our catalogues using the Planck PR3 HFI maps. We refer the reader to Zubeldia et al. (2023a), Zubeldia et al. (2023b) for further details about the code and, especially, for detailed discussions of its novel elements (iterative noise covariance estimation, foreground spectral deprojection), as well as for tests of its performance with synthetic data.

3.3.1 Sky tessellation

SZiFi starts by tessellating the sky into a set of non-overlapping selection tiles, each of which is processed independently. As well as allowing for local noise estimation, with the noise and foregrounds in the Planck maps varying significantly across the sky, this procedure also enables for the use of the flat-sky approximation when analysing the maps in harmonic space. We choose the tessellation scheme provided by a HEALPix pixelation (Górski et al. 2005) with |$N_{\mathrm{side}}=8$|⁠, which leads to a total of 768 tiles, each with an area of 53.7 deg|$^2$| and a typical extent of |$53.7^{1/2}$| deg = 7.3 deg. This scheme guarantees that the entire sky is covered and that each point in the sky belongs to only one tile, avoiding the possibility of double-counting of pixels.

Taking this tessellation scheme into account, SZiFi extracts square cut-outs (hereafter, ‘frequency fields’) of 14.8 deg |$\times$| 14.8 deg from the Planck HFI frequency maps centred at the centre of each selection tile. These fields are obtained as equirectangular (or plate carrée) projections using the healpy6 library. Each frequency field consists of 1024 |$\times$| 1024 pixels with a pixel size of 0.867 arcmin. The chosen cut-out size ensures that every tile is fully contained within its corresponding field, regardless of its shape. We also obtain cut-outs from two masks (‘mask fields’): (i) the Planck PR2 80 per cent Galactic mask and (ii) a point-source mask constructed as the union of the Planck PR2 point-source masks for all HFI channels. The mask fields have the same size and pixelation as the frequency fields. Taking the Galactic mask into account, there is a total of 627 tiles that remain not fully masked.

In order to suppress spectral leakage when taking Fourier transforms, each Galactic mask field is apodized with the ‘smooth’ scheme of pymaster,7 which zeroes all pixels closer to 2.5 times the apodization scale to a masked pixel, then convolves the resulting mask with a Gaussian kernel with its standard deviation equal to the apodization scale, and finally zeroes all the originally masked pixels. We choose an apodization scale of 12 arcmin, noting that the edges of the mask fields are also apodized by this procedure.

3.3.2 Field pre-processing and noise covariance estimation

Consider a given selection tile. SZiFi then proceeds by applying the corresponding point-source mask to each frequency field, masking bright point sources that may lead to spurious detections. The masked regions are then inpainted with the real-space diffusive inpainting algorithm of Gruetjen et al. (2017), using a total 100 iterations, after which the apodized Galactic mask is applied. Next, a Fourier-space top-hat filter is imposed to the frequency fields, with a minimum multipole |$l_{\mathrm{min}} = 100$| and a maximum multipole |$l_{\mathrm{max}} = 2500$|⁠, selecting the scales relevant for cluster finding that are not noise-dominated.

Next, SZiFi estimates the noise cross-channel power spectrum matrix (‘noise covariance’), an essential element in an MMF. Here, by noise we mean instrumental noise as well as foreground signals (the CMB, the CIB, the kSZ signal, Galactic foregrounds, etc.). For each tile, this is done with the inpainted, apodized frequency fields using the MASTER algorithm (Hivon et al. 2002) as implemented in pymaster (Alonso et al. 2019). The cross power spectra are estimated in a set of 51 equally spaced bands centred between |$l = 48.6$| and |$l = 2519.9$|⁠. We properly account for the multipole coupling due to the Galactic mask by multiplying each pseudo power spectrum by the inverse of the mask coupling matrix, which is computed with pymaster for each tile.

In this work, we use the iterative noise covariance estimation approach introduced in Zubeldia et al. (2023a), with one iteration. In this approach, in the initial iteration, the noise covariance is computed using the input frequency maps, as we just described. Once the first, non-iterative tile catalogue is obtained (as detailed below), the same noise covariance estimation pipeline is applied to frequency maps in which the detections above a signal-to-noise threshold |$q_{\mathrm{mask}} = 4$| have been masked out. This procedure leads to an updated, iterative tile catalogue. In Section 3.3.5, we offer a brief review of the benefits of iterative noise estimation.

3.3.3 Matched filtering

For each tile, once the noise covariance is estimated SZiFi constructs an MMF for each of a set of ‘search’ angular scales, assuming (i) a spatial and a spectral template for the expected cluster tSZ signal and (ii) a spectral template (SED) to be deprojected, if relevant.

More precisely, an MMF assumes that the frequency fields at pixel |$\boldsymbol {x}$|⁠, |$\boldsymbol {d} (\boldsymbol {x})$|⁠, can be written as

(2)

Here, |$\boldsymbol {y} (\boldsymbol {x})$| is the signal that is being targeted, i.e. the galaxy cluster tSZ signal. It is assumed that its ith component can be written as |$y_i (\boldsymbol {x}) = a_{i} (y \ast b_i)(\boldsymbol {x})$|⁠, where |$a_{i}$| is the tSZ SED at frequency channel i, |$y(\boldsymbol {x})$| is the Compton-y map describing the spatial variation of the tSZ signal, |$b_i$| is the instrument beam at frequency channel i and |$\ast$| denotes convolution. In addition, |$\boldsymbol {n} (\boldsymbol {x})$| is some additive ‘noise’ that also includes all the other components in the maps. These components (the CMB, the kinetic Sunyaev–Zeldovich signal, the CIB, Galactic foregrounds, etc.) may or may not be correlated with the tSZ signal. It is further assumed that, for a single galaxy cluster, its Compton-y map |$y (\boldsymbol {x})$| can be written as |$y (\boldsymbol {x}) = y_0 y_{\mathrm{t}} (\boldsymbol {x} ; \theta _{500}, \boldsymbol {\theta }_\mathrm{c})$|⁠, where |$y_0$| is an amplitude parameter, which we take to be the value of the cluster’s Compton-y parameter at the cluster centre, and |$y_{\mathrm{t}} (\boldsymbol {x} ; \theta _{500}, \boldsymbol {\theta }_\mathrm{c})$| is a spatial template. This template depends on the cluster angular size |$\theta _{500}$|⁠, defined, as is customary, as the angle subtended by |$R_{500}$|⁠, the radius within which the mean density is equal to 500 times the critical density at the cluster’s redshift. The template also depends on the sky coordinates of the cluster centre, |$\boldsymbol {\theta }_\mathrm{c}$|⁠.

At given input values for |$\theta _{500}$| and |$\boldsymbol {\theta }_\mathrm{c}$|⁠, a general MMF estimator for |$y_0$|⁠, spectrally deprojecting |$N_{\mathrm{dep}}$| components, can then be written, assuming the flat-sky approximation, as (Zubeldia et al. 2023b)

(3)

where

(4)

and where we have left the dependencies of |$\boldsymbol {d}$|⁠, |$y_{\mathrm{t}}$|⁠, and |$\bf{C}$| on multipole |$\boldsymbol {l}$| implicit to avoid clutter in the notation. Here, |$\bf{C}$| is the noise covariance matrix; |$\bf{A} = [\boldsymbol {a} \, \, \boldsymbol {b}_1 \dots \boldsymbol {b}_{N_{\mathrm{dep}}} ]$| is a |$N_{\mathrm{f}} \times N_{\mathrm{dep}} + 1$| mixing matrix where |$N_{\mathrm{f}}$| is the number of frequency channels, |$\boldsymbol {a}$| is the tSZ SED, and |$\boldsymbol {b}_i$| is the SED of the ith component to be deprojected, noting that each row in the matrix is rescaled for each multipole |$\boldsymbol {l}$| by the Fourier transform of the beam; |$\boldsymbol {c}^T = [1 \, \, 0 \dots 0]$| is a vector of dimension |$N_{\mathrm{dep}} + 1$| that ensures that the correct component (the tSZ signal) is retrieved; and |$y_{\mathrm{t}} (\boldsymbol {l} ; \theta _{500}, \boldsymbol {\theta }_\mathrm{c})$| is the Fourier transform of the cluster spatial template. Note that in the absence of any spectral deprojection, |$\bf{A} = [\boldsymbol {a}]$| and this general MMF reduces to the standard MMF (Herranz et al. 2002; Melin et al. 2006) used widely for cluster detection.

The variance of this general MMF estimator is given by |$\sigma _{\mathrm{f}}(\theta _{500}) = N^{-1}$| (Zubeldia et al. 2023b), which allows to define the signal-to-noise ratio as

(5)

At a given angular scale |$\theta _{500}$|⁠, a map of |$\hat{y}_0$| and q estimates can be produced by placing the template at the centre of every pixel. This operation can be thought of as a convolution, and it is implemented as such in SZiFi, boosting significantly the computational performance (see Zubeldia et al. 2023a for more details).

As in Planck Collaboration XXVIII (2016), we take the cluster spatial template to be the Compton-y signal due to the Arnaud et al. (2010) pressure profile, setting the concentration to |$c_{500} = c = 1.177$| also following Planck Collaboration XXVIII (2016). SZiFi integrates the pressure profile numerically up to a radius of |$5 R_{500}$|⁠, at which it is truncated (as in Planck Collaboration XXVIII 2016). For the spectral template |$\boldsymbol {a}$|⁠, we assume the non-relativistic tSZ SED, integrated within each frequency channel weighted by the channel’s effective band transmission profile. When deprojecting the CIB and its moments, we assume the modified blackbody SED described in Section 3.2, also integrated for each channel weighted by the effective band transmission profile.

We produce |$\hat{y}_0$| and q maps at a set of 25 search angular scales logarithmically spaced between |$\theta _{500} = 0.5$| arcmin and |$\theta _{500}=32$| arcmin. The use of several angular scales allows to optimize the detection of clusters with different angular sizes. The chosen range of angular scales is the same as that used in Planck Collaboration XXVIII (2016) and is designed to span the angular sizes of most clusters that are expected to be detected by Planck.

In summary, the matched filtering step produces, for each selection tile, a three-dimensional distribution for |$\hat{y}_0$| and q, |$\hat{y}_0 (\boldsymbol {x} ; \theta _{500})$| and |$q (\boldsymbol {x} ; \theta _{500})$|⁠, spanning the two sky coordinates |$\boldsymbol {x}$| and the angular scale |$\theta _{500}$|⁠. We note that SZiFi also supports extracting |$\hat{y}_0$| and q at a set of input angular scales and sky positions via what is known as its fixed mode. We make use of this functionality in Sections 7 and 8.

3.3.4 Peak finding

Next, SZiFi turns the three-dimensional signal-to-noise distribution into a catalogue of cluster candidates for each tile. In order to achieve this, first, for each angular scale, the signal-to-noise map is masked with a ‘peak-finding mask’ that is defined as the union of the unapodized Galactic mask, the point-source mask, and the tessellation mask, the latter being a binary mask zeroing all the pixels falling outside the corresponding HEALPix tile. The tessellation mask ensures that each point in the sky is only covered once for the purposes of cluster detection, avoiding double-counting of pixels. We note that, thanks to the nature of our tessellation scheme, the tessellation mask ensures that all the pixels falling within it are far away from the edges of the square field used for the construction of the MMF (see Fig. 1). This suppresses the bias in the MMF observables that can arise in detections near the edges. An alternative solution to this problem was followed in Planck Collaboration XXVII (2016), in which the original tessellation of the sky was updated by recentering the tiles at the locations of the detected cluster candidates.

Illustration of the MMF cluster detection pipeline as implemented in SZiFi for one of our selection tiles and our iMMF6 filter. The left column shows the six Planck HFI frequency fields, from 100 GHz (top), where the CMB dominates, to 857 GHz (bottom), where dust becomes dominant. The fields are $14.8 \times 14.8$ deg across; note the point-source mask. In the MMF, these fields are convolved, in an optimal way, with the cluster template for each channel for a set of search angular scales $\theta _{500}$. Here, the middle column shows the template for $\theta _{500} = 5$ arcmin. The output of this matched filtering operation is a signal-to-noise map for each angular scale, shown on the right for $\theta _{500} = 5$ arcmin and after the selection mask has been applied. Note that this mask includes the tessellation mask that corresponds to a HEALPix tessellation with $N_{\mathrm{side}} = 8$ and prevents double-counting of pixels. The detections for this tile (three of them) are shown as the red circles, with their radius corresponding to the angular scale at which they are detected.
Figure 1.

Illustration of the MMF cluster detection pipeline as implemented in SZiFi for one of our selection tiles and our iMMF6 filter. The left column shows the six Planck HFI frequency fields, from 100 GHz (top), where the CMB dominates, to 857 GHz (bottom), where dust becomes dominant. The fields are |$14.8 \times 14.8$| deg across; note the point-source mask. In the MMF, these fields are convolved, in an optimal way, with the cluster template for each channel for a set of search angular scales |$\theta _{500}$|⁠. Here, the middle column shows the template for |$\theta _{500} = 5$| arcmin. The output of this matched filtering operation is a signal-to-noise map for each angular scale, shown on the right for |$\theta _{500} = 5$| arcmin and after the selection mask has been applied. Note that this mask includes the tessellation mask that corresponds to a HEALPix tessellation with |$N_{\mathrm{side}} = 8$| and prevents double-counting of pixels. The detections for this tile (three of them) are shown as the red circles, with their radius corresponding to the angular scale at which they are detected.

A first tile catalogue of cluster candidates is then formed by the local peaks of the three-dimensional masked signal-to-noise distribution |$q (\boldsymbol {x} ; \theta _{500})$| that are above a selection threshold |$q_{\mathrm{th}}$|⁠, which we choose to be |$q_{\mathrm{th}} = 4$|⁠. These local peaks are found with a real-space peak-finding filter that identifies a peak as a (three-dimensional) pixel with a signal-to-noise value larger than all its neighbouring pixels. We denote the signal-to-noise ratio of each detection with |$q_{\mathrm{obs}}$|⁠. The corresponding values of |$\hat{y}_0$| and of the filter angular scale |$\theta _{500}$| are assigned as the detection estimates for |$y_0$| and |$\theta _{500}$|⁠, and the pixel coordinates of the peak as the sky coordinates of the detection. A final ‘selection mask’ is then imposed on the detected cluster candidates. This selection mask is a union of two masks. The first one is the union of the unapodized Galactic mask with the point-source mask increased by zeroing all the pixels closer to masked pixels by a buffer distance of 10 arcmin. This buffering procedure minimizes the chances of spurious detections close to the mask edges due to ringing. The second mask is the tessellation mask.

As discussed in Zubeldia et al. (2021), these ‘blind’ measurements of |$y_0$|⁠, |$\theta _{500}$| and the sky position can be seen as maximum-likelihood estimates and are therefore, in general, biased (except the two sky coordinates, due to symmetry). In addition, for each cluster, the blind or ‘optimal’ signal-to-noise |$q_{\mathrm{obs}}$| is biased high relative to the signal-to-noise ratio that would be extracted at the cluster’s true sky position and angular size, as it is obtained via a maximization procedure over noisy data. This ‘optimization bias’ was first discussed in Vanderlinde et al. (2010) and subsequently studied in detail in Zubeldia et al. (2021). As shown, e.g. in Zubeldia et al. (2021), it can be modelled with a simple prescription, which we follow in the theoretical description of our validation catalogues in Section 5.3.

Fig. 1 illustrates our matched filtering and peak finding process for a reference selection tile (with a HEALPix index of 13) for our iMMF6 filter. The maps on the left correspond to the point-source-masked frequency fields shown, from top to bottom, in increasing frequency. They are convolved by the MMF template at |$\theta _{500} = 5$| arcmin (maps in the middle of the figure), producing a signal-to-noise map (rightmost map) that we show masked by the corresponding selection mask. Note, in particular, the shape of the HEALPix selection tile and how the point-source-masked region has increased in size. In this map detections above a signal-to-noise threshold of 5 (three of them) are shown as the red circles, with the radius of each circle corresponding to the filter angular scale at which the detection was found. We note that this signal-to-noise map corresponds to the second (and final) iteration of the noise covariance estimation.

3.3.5 Iterative noise covariance estimation

Once a first, non-iterative tile catalogue is constructed, SZiFi proceeds by producing a ‘cluster mask’ in which all the pixels falling within |$3 \hat{\theta }_{500}$| arcmin of the estimated centre of each cluster candidate with |$q_{\mathrm{opt}}$| above a masking threshold are zeroed. We choose this threshold to be |$q_{\mathrm{mask}} = 4$|⁠, i.e. equal to the detection threshold. We note that in the construction of the cluster mask, SZiFi also includes cluster candidates which fall within the buffered Galactic + point-source mask, but outside the tessellation mask. This is done because these detections also contribute to the noise covariance estimate, as the tessellation mask is only imposed in the peak-finding step.

After computing the cluster mask, SZiFi multiplies it by the point-source mask, leading to a new ‘point source’ mask. The pipeline is then re-run from the pre-processing step of Section 3.3.2 using the new cluster + point-source mask instead of the original point-source mask in the covariance estimation step. This procedure generates a new noise covariance estimate, which is then used to construct a new set of MMFs that in turn lead to an updated, iterative tile catalogue. We note that the original peak-finding and selection masks are used: the cluster candidates are only masked out for noise estimation purposes. In order to allow for better comparison with the official Planck catalogues we also save each non-iterative tile catalogue (see Section 6).

As shown in Zubeldia et al. (2023a), iterative noise covariance estimation solves two different issues that arise if the noise covariance estimate is simply taken to be the covariance of the frequency maps (our first step in the iterative approach), both of which arise due to the presence of the tSZ signal in what is regarded as the noise in the data. First, if the tSZ signal of the detections is present in the maps used for covariance estimation, the noise covariance is overestimated. This translates into a loss of signal-to-noise ratio, with fewer and less significant detections. Secondly, a negative bias in the tSZ observables (⁠|$q_{\mathrm{obs}}$| and |$\hat{y}_0$|⁠) arises due to the presence of the signal in the (noisy) covariance estimate that is obtained from the same data on which the MMF is subsequently applied. This ‘covariance bias’ is analogous to the so-called Internal Linear Combination (ILC) bias that is encountered in CMB component separation with ILC methods (e.g. Delabrouille et al. 2009; see Coulton et al. 2024 for a bias mitigation approach in the context of power spectrum estimation with parallels to our masking algorithm). As discussed in Zubeldia et al. (2023a), iterative noise covariance estimation with just one iteration effectively removes both issues, boosting the detection signal-to-noise ratio to the correct level and eliminating the covariance bias. We note that the official Planck catalogues were obtained without iterative noise covariance estimation, simply taking the noise covariance to be equal to the covariance of the data (see e.g. Planck Collaboration XXVII 2016).

3.3.6 Tile catalogue merging

Once the iterative tile catalogues are produced, SZiFi merges them into a single survey-wide catalogue for each MMF, assigning them global sky coordinates (Galactic longitude and latitude, as well as equatorial coordinates).

As a final step, SZiFi merges detections lying within a given angular distance of each other into a single one. This merging step solves two problems. First, our peak-finding algorithm, described in Section 3.3.4, sometimes leads to several detections located close to each other that are associated to a single true object, as it finds local peaks in the signal-to-noise three-dimensional distribution, which is noisy. In addition, clusters lying very close to the border between two selection tiles may be detected in both tiles (or in more than two, if the cluster is near the intersection between four tiles).

The following merging algorithm is applied. First, the detection with the highest signal-to-noise ratio in the full catalogue is identified. All the detections falling within an angular distance of its estimated angular scale |$\hat{\theta }_{500}$| are considered as belonging to the same object and dropped from the catalogue. This process is repeated until the lowest signal-to-noise detection in the catalogue is reached. Apart from a few cases (see Sections 4.3 and 9.3), this algorithm successfully merges all detections belonging to the same object. In addition, as we show in Section 9 by applying SZiFi to our synthetic Planck-like maps, this algorithm leads to a cluster catalogue that we can successfully model theoretically. This would not be the case if, e.g. it led to many instances of merging of detections corresponding to different clusters.

This merging algorithm is also applied to the non-iterative tile catalogues, producing survey-wide non-iterative catalogues. Hereafter, we will refer to these catalogues using the same naming convention used for their iterative counterparts but dropping the initial ‘i’ (e.g. MMF6 instead of iMMF6).

Our merging algorithm is similar to the one used for the construction of the official Planck catalogues, although a fixed merging angular separation of 10 arcmin was employed then (e.g. Planck Collaboration XXVIII 2016). It differs significantly, however, from the friends-of-friends merging algorithm used in Zubeldia et al. (2023a, b). We have verified that applying this friends-of-friends merging algorithm with a linking length of |$\theta _{\mathrm{id}} =10$| arcmin to our catalogues leads to a very small (⁠|$\mathcal {O}(1)$|⁠) change in the number of clusters in the output catalogues.

After this merging step, we impose a new selection threshold of |$q_{\mathrm{th}} = 5$|⁠.

3.4 Construction of the master catalogue

After obtaining our 10 iterative catalogues of cluster candidates and their 10 non-iterative counterparts, we mutually cross-match them against each other and compile them into a single ‘master catalogue’, assigning a unique name to each detection. This master catalogue is designed to allow for the retrieval of each individual catalogue while making it easy to access the cluster observable values for each detection for all the filters with which it was detected.

3.4.1 Cross-matching algorithm

The construction of the master catalogue is based on the following cross-matching algorithm. Consider two catalogues, catalogue A and catalogue B. The algorithm starts by finding the object in catalogue A with the largest value of the signal-to-noise, which we denote with a. It then finds all the objects in catalogue B lying within a given cross-matching angular distance |$\theta _{\mathrm{match}}$| of a. If there is only one such object, that object is identified as the same object as a, and is removed from catalogue B. If there are multiple such objects, the object with the largest value of the signal-to-noise ratio is identified as the same object as a, and similarly removed from catalogue B. If there are none, a is labelled as having no counterpart in catalogue B. In any of the three cases, a is then removed from catalogue A. The process is then repeated until the last object in catalogue A has been reached. In the last step, any remaining objects in catalogue B are labelled as having no counterpart in catalogue A.

This cross-matching algorithm returns two cross-matched catalogues with the same number of objects in them, |$A_{\mathrm{C}}$| and |$B_{\mathrm{C}}$|⁠. In them, each entry contains the original entries in A and B, respectively, for each cross-matched object, and a null entry in either |$A_{\mathrm{C}}$| or |$B_{\mathrm{C}}$| if the given object has not been found a counterpart in A or B, respectively.

We adopt a cross-matching distance of |$\theta _{\mathrm{match}} = 10$| arcmin, which is comparable with the Planck HFI beams. Given the low number density of our detections in the sky, we argue that this simple cross-matching algorithm is well-suited for our catalogues. Indeed, our iMMF6 catalogue within the cosmology mask (see below for how the mask is defined), which contains the largest number of detections, features 833 detections over 58.65 per cent of the sky (see below), with 29.04 deg|$^2$| per cluster on average. We also argue that our algorithm is similarly well-suited for cross-matching with the external catalogues (see Section 4), for which the number density is, on average, at most |$\sim 10$| times larger than in our iMMF6 catalogue.

3.4.2 Master catalogue

In order to construct the master catalogue, we start by cross-matching the iMMF6 catalogue, which features the largest number of detections, with its non-iterative counterpart, the MMF6 catalogue, creating a first master catalogue containing both the successful cross-matches and the remaining detections. We then sequentially cross-match the resulting master catalogue with the remaining catalogues, in the following order, with the non-iterative catalogues cross-matched immediately after their iterative counterparts: sciMMF6, iMMF5, sciMMF5, sciMMF6 |$\beta _T$|⁠, iMMF4, sciMMF4, sciMMF6 |$\beta$|⁠, sciMMF5 |$\beta _T$|⁠, sciMMF5 |$\beta$|⁠.

The master catalogue is formed by a total of 1499 detections. Each detection in it contains entries for the cluster observables (⁠|$q_{\mathrm{obs}}$|⁠, |$\hat{y}_{0}$|⁠, |$\hat{\theta }_{500}$|⁠, and sky coordinates) for all 10 iterative and 10 non-iterative catalogues. Each entry contains the corresponding value for all the catalogues in which the detection is found, and a null value otherwise. Each detection also contains an entry for its ‘master sky coordinates’, which are the detection sky coordinates for the first catalogue in our cross-matching procedure in which the detection appears.

After constructing the master catalogue, we apply a more restrictive Galactic mask to it, the Planck PR2 70 per cent Galactic mask, flagging all the detections falling outside of it. We refer to the union of this mask with the buffered Galactic + point-source mask that was employed for cluster finding as the ‘cosmology mask’. It leaves 58.65 per cent of the sky unmasked and can be seen as the grey area in Fig. 2. Within it, the master catalogue contains a total of 1152 detections. In the following (Sections 57), we will always consider our catalogues restricted to the area left unmasked by the cosmology mask.

Detections in the iMMF6 catalogue in Galactic coordinates (light red points) shown over the MMF noise $\sigma _{\mathrm{f}}$ of each selection tile evaluated at $\theta _{500}=4$ arcmin. The cosmology mask is shown as the grey region. We note that $\sigma _{\mathrm{f}}$ includes contributions both from instrumental noise and foregrounds (e.g. the CMB, the CIB, the kSZ effect, and Galactic foregrounds).
Figure 2.

Detections in the iMMF6 catalogue in Galactic coordinates (light red points) shown over the MMF noise |$\sigma _{\mathrm{f}}$| of each selection tile evaluated at |$\theta _{500}=4$| arcmin. The cosmology mask is shown as the grey region. We note that |$\sigma _{\mathrm{f}}$| includes contributions both from instrumental noise and foregrounds (e.g. the CMB, the CIB, the kSZ effect, and Galactic foregrounds).

As already noted, we construct this master catalogue for convenience, as it allows for easy access to the cluster observables obtained with different filters for objects detected with more than one filter, as well as for easy cross-matching with the external catalogues. However, each of our 10 iterative catalogues, as well as each of their non-iterative counterparts, must be thought of as an independent catalogue in its own right, with a simple, well-defined tSZ selection. Each catalogue remains unaffected by the cross-matching algorithm and can be fully retrieved from the master catalogue. The master catalogue as such, on the other hand, presents a more complicated selection function, as it is formed by 20 different catalogues, each with its own selection. We therefore recommend to use the individual catalogues, and not the full master catalogue, for any application (e.g. a cosmological analysis).

4 CLUSTER CONFIRMATION

4.1 Cross-matching with the external catalogues and redshift assignment

We cross-match the Planck  SZiFi master catalogue with the external catalogues described in Section 2 with the goal of determining whether each detection in our master catalogue corresponds to a real galaxy cluster and, if so, of assigning a redshift measurement to it. We recall that these are the MCSZ meta-catalogue, the MCXC-II meta-catalogue, the ACT DR5 catalogue, the CODEX catalogue, the RASS-MCMF catalogue, and the eRASS1 catalogue. We perform the cross-matching using the master sky coordinates of each detection and applying a modified version of the algorithm described in Section 3.4.1. In particular, in the instances for which there are several objects in a given external catalogue within 10 arcmin of one of our detections, we assign the closest object, as opposed to the one with the highest signal-to-noise ratio, as the signal-to-noise ratio is not defined, in general, for the objects in the external catalogues. We assign a cross-matching flag to each successful cross-match in the master catalogue, as well as the name(s) of its counterpart(s) in the external catalogues. Of the 1152 detections in the master catalogue (within the cosmology mask), we find 869 successful cross-matches with the MCSZ meta-catalogue, 486 with the MCXC-II meta-catalogue, 294 with the ACT DR5 catalogue, 286 with the CODEX catalogue, 699 with the RASS-MCMF catalogue, and 309 with the eRASS1 catalogue. All together, we are able to find positive cross-matches for 957 of the 1152 objects in the master catalogue within the cosmology mask. For reference, for our baseline catalogue, iMMF6, we find a total of 812 successful cross-matches out of a total of 861 detections within the cosmology mask, 773 of which are with the MCSZ meta-catalogue, 449 with the MCXC-II meta-catalogue, 254 with the ACT DR5 catalogue, 248 with the CODEX catalogue, 618 with the RASS-MCMF catalogue, and 254 with the eRASS1 catalogue.

We consider all the cross-matched objects in our master catalogue to be real galaxy clusters, except for the objects that have only been cross-matched with an unconfirmed detection in the Planck PSZ1 or PSZ2 catalogues, which are contained in the MCSZ meta-catalogue. In particular, for the PSZ1 catalogue, we consider an unconfirmed detection to be an object of class 3, and for the PSZ2 catalogue, an object of class -1, in both cases provided that the object has not been validated after the catalogue release, as reported in the MCSZ meta-catalogue. Within the cosmology mask we find 0 and 26 such cross-matches with the PSZ1 and PSZ2 catalogues, respectively (0 and 11 of them part of the iMMF6 catalogue, respectively). We introduce a confirmation flag that is set to 1 for all objects with a positive cross-match except for these cross-matches with unconfirmed Planck detections, and to 0 for the remaining objects. We note that all the cross-matches in the master catalogue with only one object in either any of the SPT catalogues hosted in the MCSZ meta-catalogue or in the ACT DR5 catalogue are cross-matches with a confirmed cluster in those catalogues.

In addition, we assign redshift measurements as follows. For every successful cross-match, if there is only a redshift measurement in one of the external catalogues with which the cross-match has been successful, we assign it as the cluster’s redshift. For detections with cross-matches with available redshifts in more than one external catalogue, we adopt the following assignment hierarchy, giving preference to spectroscopic measurements over photometric ones: spec-z in MCSZ, spec-z in MCXC-II, spec-z in RASS-MCMF, spec-z in ACT DR5, spec-z in CODEX, photo-z in MCSZ, photo-z in MCXC-II, photo-z in ACT DR5, photo-z in RASS-MCMF, photo-z in eRASS1. We break this hierarchy for four clusters, PSZ2 G109.86 + 27.94, PSZ2 G181.71−68.65, PSZ2 G281.09-42.51, and PSZ2 G287.00–35.24, in their PSZ2 names, for which updated, significantly different redshift values have been published (see table A1 of Klein et al. 2023). We assign these updated redshift values for these four clusters, instead of those in the MCSZ meta-catalogue. Within the cosmology mask, we are able to assign redshift values to 921 out of a total of 1152 detections (793 out of 861 for iMMF6).

To each cluster with an available redshift measurement we also assign a redshift type flag, identifying the type of redshift measurement (spectroscopic, photometric, or unspecified). Within the cosmology mask, 781 of the redshift measurements in our master catalogue are spectroscopic (689 for iMMF6), 136 photometric (97 for iMMF6), and the remaining 4 of an unspecified type (also 4 for iMMF6).

4.2 Cross-matching with Planck GCCs and point sources

GCCs can be detected by tSZ cluster detection methods, appearing as spurious detections (see e.g. Planck Collaboration XXVII 2016). In order to assess the contamination of the master catalogue by GCCs, we cross-match it with the Planck Catalogue of cold clumps (Planck Collaboration XXVIII 2016), which contains 13 242 objects. We do this using the same cross-matching algorithm employed to cross-match the master catalogue with the external catalogues (Section 4.1). We find 17 cross-matches within the cosmology mask (4 for iMMF6), and flag them in the master catalogue, additionally setting their confirmation flag to −1, indicating that they are likely false detections. Five of these objects (three for iMMF6) also correspond to positive cross-matches with our external catalogues. We have decided to set their confirmation flag to −1 and leave their detailed study to further work. Taking into account these objects and the cross-matches with only unconfirmed PSZ2 objects, within the cosmology mask we are left with a total of 926 confirmed objects in the master catalogue (798 for iMMF6), 916 of them with a redshift measurement (790 for iMMF6). We discuss the confirmed detections without a redshift measurement in Section 5.3.

Moreover, bright millimetre point sources can also be spuriously detected as tSZ detections. We further cross-match the master catalogue with the Second Planck catalogue of Compact Sources (Planck Collaboration XXVI 2016). We find no cross-matches, which confirms that the point-source masking strategy followed in our cluster finding pipeline successfully mitigates against the presence of bright point sources in the Planck maps.

4.3 Inspection of non-cross-matched detections

With the goal of trying to shed light on the nature of the detections in the master catalogue without a positive cross-match with any of the external catalogues nor with the GCC catalogue (183 in total in the master catalogue, 48 in iMMF6), we have inspected the MMF signal-to-noise maps at which each of them is detected. We have found two kinds of unconfirmed detections that we regard true false detections and flagged them as such in the master catalogue by setting their confirmation flag to −1.

The first kind of true false detections is formed by detections made at the border of a given selection tile on a large angular scale, usually the largest one considered (32 arcmin), and that are near a confirmed detection in the neighbouring tile but at an angular distance greater than 10 arcmin. These detections can be explained as follows. Suppose there is a cluster near the edge of tile B, appearing therefore both in the field of tile B and in that of its neighbouring tile, tile A. Suppose also that this cluster is detected in tile B. The cluster will probably also be detectable in the signal-to-noise maps of tile A before the peak-finding mask of tile A is applied in order to restrict peak finding to tile A. If the cluster is significant enough, the region around it above which the signal-to-noise is greater than the detection threshold in the field of tile A may reach tile A. This is more likely to happen at a large search angular scale. In that case, the cluster will also be detected in tile A, typically at a large angular scale and with a poorly estimated sky position. We have identified 16 such detections in our master catalogue (with the iMMF6 catalogue having 5 of them).

The second kind of true false detection is analogous to the first kind, but applies to detections lying near the edge of a point-source-masked region, typically also found at large angular scales. These unconfirmed detections probably correspond to true objects lying inside the region masked out by the buffered point-source mask, being detected at large angular scales on the edge of the point-source masked region. We have found 25 such detections in our master catalogue (with the iMMF6 catalogue having 7 of them).

We have performed a similar inspection analysis to our validation catalogues, finding in them a comparable number of true false detections of the two kinds described here (see Section 9.3).

In addition, we have found two non-cross-matched detections each of which is probably associated with another cross-matched detection separated from it by slightly more than the cross-matching distance of 10 arcmin. As a consequence, we have flagged these detections as true false detections, setting their confirmation flag to −1; we note that none of them is present in iMMF6. Finally, we have identified two selection tiles (number 436 and 437) featuring significant Fourier ringing that leads to a number of non-cross-matched detections (13 in total, 12 of them present in iMMF6). These tiles lie near the Galactic plane and are significantly masked out by the Galactic mask, which may be responsible for the ringing. We have decided to also flag all these detections as true false detections, setting their confirmation flag to −1, and advise against the use of these two selection tiles in a cosmological analysis.

Taking into account these 56 true false detections (24 for iMMF6), within the cosmology mask we are left with a total of 127 non-cross-matched detections that are not true false detections (confirmation flag of 0; 24 for iMMF6). Adding them to their cross-matched counterparts leads to a total of 1078 objects with confirmation flag of 0 or 1 (833 for iMMF6). Of these, 926 are confirmed objects (confirmation flag of 1; 798 for iMMF6), while 152 are unconfirmed ones (confirmation flag of 0; 35 for iMMF6). Assuming that all the objects in the external catalogues, we have deemed to be real clusters are indeed so, this sets an empirical lower bound on the purity of the master catalogue of 85.9 per cent at |$q_{\mathrm{obs}} > 5$| (95.8 per cent for iMMF6; see Section 5.3 for further discussion).

5 THE Planck SZiFi GALAXY CLUSTER CATALOGUES: PROPERTIES

In this section we introduce our new Planck  SZiFi catalogues, describing their properties. We recall that there are a total of 10 catalogues, each obtained with one of our 10 iterative MMFs (see Section 3.2 and Table 1), in addition to their 10 non-iterative counterparts. The catalogues are all compiled for convenience into a single master catalogue, as described in Section 4. The master catalogue will become publicly available at this link and a brief description of its entries can be found in Table A1. We emphasise that the master catalogue is produced and made available for convenience, but that the different catalogues that it contains must be thought of as independent catalogues in their own right, each with a simple, well-defined tSZ selection. The master catalogue itself, in fact, presents a more complicated tSZ selection, as it is the union of 20 different catalogues, each with its own tSZ selection. We, therefore, recommend to use the individual catalogues individually for any application (e.g. cosmological analysis) and not the full master catalogue. In particular, for these reasons, in this section we will study the properties of our individual catalogues, and not those of the master catalogue.

Here, first, in Section 5.1, we study the sky distribution of our detections. Next, in Section 5.2 we describe the distribution across both signal-to-noise and redshift and quantify how the signal-to-noise values of our detections change with the MMF employed. Then, in Section 5.3, we study the confirmation and redshift assignment fraction and, finally, in Section 5.4 we analyse the impact of iterative noise covariance estimation.

Hereafter, when analysing the catalogues in the master catalogue, we will only consider the detections within the cosmology mask (see Section 3.4) and with a confirmation flag of 0 or 1, discarding the detections that we have concluded to be false (see Section 4.3). We will also place a particular focus on the iterative catalogues, considering the non-iterative catalogues only when assessing the impact of iterative noise estimation and for comparison with the Planck MMF3 catalogue, which was constructed without iterative noise covariance estimation (see Section 6).

5.1 Sky distribution

Fig. 2 shows the sky position, in Galactic coordinates, of the detections in the iMMF6 catalogue (light red points), our baseline catalogue, plotted over the MMF noise |$\sigma _{\mathrm{f}}$| for each selection tile at |$\theta _{500} = 4$| arcmin. We recall that |$\sigma _{\mathrm{f}}$| includes contributions due to both noise and foregrounds; see Section 3.3.3 for how it is defined. The cosmology mask, which leaves a total of 58.65 per cent of the sky unmasked, is shown as the grey area. Its two components (Galactic and point-source) are clearly visible. The impact of the Planck scanning strategy is also visible, with lower MMF noise |$\sigma _{\mathrm{f}}$| and, therefore, more detections near the ecliptic poles. The other catalogues feature a similar sky distribution that is not shown here for conciseness.

5.2 Signal-to-noise ratio and redshift distribution

Fig. 3 shows the number counts in the 10 iterative catalogues in the master catalogue, both across signal-to-noise |$q_{\mathrm{obs}}$| (for all the detections in each catalogue) and across redshift (for the confirmed detections with an available redshift measurement). We have binned the detections across signal-to-noise ratio in 10 bins logarithmically spaced between |$q_{\mathrm{obs}} = 5$| and |$q_{\mathrm{obs}} = 50$|⁠, and across redshift in 10 bins linearly spaced between |$z=0$| and |$z=1$|⁠. The total number of detections in each catalogue is reported above the panel showing its corresponding number counts across signal-to-noise ratio, whereas the associated total number of confirmed detections with a redshift measurement is shown above its corresponding redshift counts panel.

Number counts in our 10 iterative cluster catalogues, across both signal-to-noise $q_{\mathrm{obs}}$ and redshift z (left and right panel for each catalogue, respectively). The total number of detections in each catalogue is reported above the panel showing its corresponding counts across signal-to-noise ratio, whereas the total number of confirmed detections with a redshift measurement is shown above the corresponding panel for the redshift counts. The iMMF6 and the sciMMF5 $\beta$ catalogues contain, respectively, the largest and smallest number of clusters.
Figure 3.

Number counts in our 10 iterative cluster catalogues, across both signal-to-noise |$q_{\mathrm{obs}}$| and redshift z (left and right panel for each catalogue, respectively). The total number of detections in each catalogue is reported above the panel showing its corresponding counts across signal-to-noise ratio, whereas the total number of confirmed detections with a redshift measurement is shown above the corresponding panel for the redshift counts. The iMMF6 and the sciMMF5 |$\beta$| catalogues contain, respectively, the largest and smallest number of clusters.

As expected, the number counts decrease with signal-to-noise for every catalogue. This is also the case for the official Planck catalogues (Planck Collaboration I 2012; Planck Collaboration XXIX 2014; Planck Collaboration XXVII 2016), as well as for other tSZ catalogues (e.g. Hilton et al. 2018; Bleem et al. 2022), and is a consequence of the steepness of the mass dependence of the halo mass function. The redshift distribution, on the other hand, peaks around |$z \simeq 0.2$|⁠, as it does for the official Planck catalogues (Planck Collaboration I 2012; Planck Collaboration XXIX 2014; Planck Collaboration XXVII 2016).

Spectral deprojection of the CIB and/or the use of fewer frequency channels results in fewer detections in every signal-to-noise or redshift bin. There are two contributions to this effect. First, there is a signal-to-noise penalty due to using fewer channels and/or spectral deprojection (see e.g. Zubeldia et al. 2023b). This penalty increases as fewer channels are used and/or more SEDs (including the moments) are deprojected. Secondly, the amplitude of potential biases in the signal-to-noise due to cluster-correlated emission, such as the CIB, may depend on the channel and/or deprojection combination (see below and also Zubeldia et al. 2023b). As expected, the catalogue with the highest number of detections is the iMMF6 catalogue, which is constructed using all 6 HFI channels and without any spectral deprojection. It contains 833 detections, 790 of them with a redshift measurement. On the other hand, the catalogue with the lowest number of detections is the sciMMF5 |$\beta$| catalogue, constructed using the 5 lowest-frequency HFI channels and spectrally deprojecting the CIB and its first-order moment with respect to |$\beta$|⁠. It contains a total of 277 detections, 263 of them with a redshift measurement.

The change in the number counts when different MMFs are used can be understood as the result of the change in the signal-to-noise of each detection (or missed detection, if the signal-to-noise falls below the detection threshold for any given MMF). This is illustrated in Fig. 4, which depicts the signal-to-noise for the detections in each catalogue (vertical axis) plotted against the corresponding signal-to-noise values in the iMMF6 catalogue (horizontal axis) for the common detections between each pair of catalogues. It is apparent that CIB spectral deprojection and/or the use of fewer channels reduces the signal-to-noise for any detection.

Signal-to-noise values for the detections in each of our catalogues (identified by the title above each panel) plotted against the corresponding values in the iMMF6 catalogue for the common detections in each pair of catalogues. As expected, the iMMF6 catalogue leads to the highest signal-to-noise measurements, with the signal-to-noise decreasing with the use of fewer frequency channels and/or the application of more stringent spectral deprojection. The changes in the signal-to-noise seen here are responsible for the changes in the number counts observed in Fig. 3.
Figure 4.

Signal-to-noise values for the detections in each of our catalogues (identified by the title above each panel) plotted against the corresponding values in the iMMF6 catalogue for the common detections in each pair of catalogues. As expected, the iMMF6 catalogue leads to the highest signal-to-noise measurements, with the signal-to-noise decreasing with the use of fewer frequency channels and/or the application of more stringent spectral deprojection. The changes in the signal-to-noise seen here are responsible for the changes in the number counts observed in Fig. 3.

As mentioned above, there are two contributions to this signal-to-noise change: (i) the signal-to-noise penalty and (ii) different sensitivity to correlated emission, such as the CIB. These two different contributions are difficult to disentangle by considering the signal-to-noise measurements alone. We set out to do so by using the |$\hat{y}_0$| MMF measurements instead, as we describe in detail in Sections 7 and 8.

We note that we do not provide any integrated Compton-y measurements for the detections in our catalogues, regarding |$\hat{y}_0$| and |$q_{\mathrm{obs}}$| as the natural output quantities of our MMFs. However, in Section 7 we use derived integrated Compton-y measurements (specifically, integrated within |$\theta _{500}$|⁠), obtained assuming the Arnaud et al. (2010) profile, in order to estimate the cluster electron temperatures. These derived measurements can be easily obtained from our catalogues using the scaling relations presented in Section 7 (specifically, equations 7 and 8). In addition, in Section 9, we assess how these derived integrated measurements relate to their true values.

5.3 Catalogue confirmation: purity and redshifts

The upper panel of Fig. 5 shows the fraction of detections in our catalogues that we have confirmed as real galaxy clusters in our confirmation procedure (see Section 4) as a function of the minimum signal-to-noise |$q_{\mathrm{th}}$| for six of our catalogues. As expected, the confirmation fraction increases with signal-to-noise. For most MMFs, including the baseline catalogue iMMF6, it goes up from about 0.95 at |$q_{\mathrm{th}}=5$| to 1 at |$q_{\mathrm{th}}=8$|⁠. The most notable exception is the iMMF4 catalogue, for which the confirmation fraction starts at a lower value of 0.9 at |$q_{\mathrm{th}}=5$|⁠, but similarly rises to 1 at |$q_{\mathrm{th}}=8$|⁠. Note, in particular, that spectral deprojection does not degrade the confirmation fraction significantly, with our sciMMFs having a confirmation fraction that is comparable to those for iMMF6 and iMMF5. The number of unconfirmed detections in each catalogue as a function of |$q_{\mathrm{th}}$|⁠, i.e. the number of detections associated with each confirmation fraction, is shown in the lower panel of Fig. 5. For each catalogue it is approximately a power law of |$q_{\mathrm{th}}$|⁠.

Fraction of confirmed detections (upper panel) and number of unconfirmed detections (lower panel) as a function of minimum signal-to-noise $q_{\mathrm{th}}$ for six of our catalogues. Compare with the analogous results for the validation catalogues obtained from our synthetic Planck-like observations (Fig. 13).
Figure 5.

Fraction of confirmed detections (upper panel) and number of unconfirmed detections (lower panel) as a function of minimum signal-to-noise |$q_{\mathrm{th}}$| for six of our catalogues. Compare with the analogous results for the validation catalogues obtained from our synthetic Planck-like observations (Fig. 13).

Assuming that every object in the external catalogues deemed to be a real cluster (all the objects in the external catalogues except a subset of the unconfirmed detections of the PSZ1 and PSZ2 catalogues, see Section 4) is indeed one, the confirmation fraction sets a lower bound on the fraction of true detections in the catalogue, a quantity typically referred to as the catalogue purity. While we have been conservative in our choice of external catalogues, these catalogues very likely present a non-zero level of contamination. This implies that a small fraction of our confirmed detections may be false detections. Our confirmation fractions therefore constitute approximate lower bounds on the purity of our catalogues.

In particular, there is a small set of confirmed objects in our catalogues that is only positively cross-matched with unconfirmed objects in the PSZ2 catalogue of the type not discarded in our confirmation procedure. Though appearing as confirmed objects in our catalogues, none of these objects have a redshift measurement associated with them. For reference, for the iMMF6 catalogue there are 8 such objects, 6 of which with |$q_{obs} < 6$| and the most significant one with |$q_{obs} = 6.71$|⁠. As the PSZ2 catalogue is expected to be highly correlated with our catalogues, there is a possibility that these objects are false detections. We leave a more detailed study of these detections to further work.

The values of the confirmation fraction as a function of |$q_{\mathrm{th}}$|⁠, as well as its behaviour across different MMFs, are broadly similar to what we observe for our validation catalogues, produced by applying our cluster finding pipeline to synthetic sky maps (see Section 9, and, in particular, Fig. 13). The associated numbers of unconfirmed detections are also similar. We note that this similarity should not be over interpreted, as we do not claim our synthetic data to be a perfect description of the Planck sky, particularly regarding, e.g. point sources that may be spuriously detected by our pipeline.

Upper panel: Number counts in our iMMF6 and MMF6 catalogues (solid blue and dashed orange histograms, respectively) across signal-to-noise $q_{\mathrm{obs}}$. We recall that the iMMF6 catalogue is the catalogue obtained with all 6 HFI channels applying iterative noise covariance estimation, whereas the MMF6 catalogue is its non-iterative counterpart. Lower panel: Signal-to-noise measurements in the iMMF6 catalogue (vertical axis), plotted against the corresponding values in the MMF6 catalogue (horizontal axis) for the common detections. As expected, iterative noise covariance estimation results in an increase in the signal-to-noise ratio of each detection due to the correct estimation of the noise covariance and to the removal of the negative covariance bias. This, in turn, leads to a larger number of objects in each signal-to-noise bin as seen in the upper panel.
Figure 6.

Upper panel: Number counts in our iMMF6 and MMF6 catalogues (solid blue and dashed orange histograms, respectively) across signal-to-noise |$q_{\mathrm{obs}}$|⁠. We recall that the iMMF6 catalogue is the catalogue obtained with all 6 HFI channels applying iterative noise covariance estimation, whereas the MMF6 catalogue is its non-iterative counterpart. Lower panel: Signal-to-noise measurements in the iMMF6 catalogue (vertical axis), plotted against the corresponding values in the MMF6 catalogue (horizontal axis) for the common detections. As expected, iterative noise covariance estimation results in an increase in the signal-to-noise ratio of each detection due to the correct estimation of the noise covariance and to the removal of the negative covariance bias. This, in turn, leads to a larger number of objects in each signal-to-noise bin as seen in the upper panel.

Number counts in our SZiFi MMF6 catalogue and in the official Planck MMF3 catalogue (solid blue and dashed orange histograms, respectively) binned as a function of both signal-to-noise ratio (upper left panel) and redshift (upper right panel), for the common mask. In the upper right panel, only the clusters with $q_{\mathrm{obs}} > 7$ are shown, to ensure that all the clusters in the sample have an available redshift measurement. The lower panels show the signal-to-noise values in the SZiFi MMF6 and the MMF3 catalogues for the common detections in both catalogues with $q_{\mathrm{obs}} > 5$ (their values on the lower left panel, their ratio on the lower right panel). We recall that our MMF6 catalogue is obtained without iterative noise covariance estimation, and therefore can be suitably compared with the MMF3 catalogue.
Figure 7.

Number counts in our SZiFi MMF6 catalogue and in the official Planck MMF3 catalogue (solid blue and dashed orange histograms, respectively) binned as a function of both signal-to-noise ratio (upper left panel) and redshift (upper right panel), for the common mask. In the upper right panel, only the clusters with |$q_{\mathrm{obs}} > 7$| are shown, to ensure that all the clusters in the sample have an available redshift measurement. The lower panels show the signal-to-noise values in the SZiFi MMF6 and the MMF3 catalogues for the common detections in both catalogues with |$q_{\mathrm{obs}} > 5$| (their values on the lower left panel, their ratio on the lower right panel). We recall that our MMF6 catalogue is obtained without iterative noise covariance estimation, and therefore can be suitably compared with the MMF3 catalogue.

First column: Measurements of the MMF amplitude parameter $\hat{y}_0$ extracted at the average sky position across all 10 iterative MMFs and at a fixed angular scale $\theta _{500}=5$ arcmin for the common detections in the 10 MMFs (247 detections). In each panel, the vertical axis corresponds to the panel’s MMF (indicated in the axis label), whereas the horizontal axis corresponds to iMMF6. The data points are colour-coded by their detection signal-to-noise ratio for iMMF6. It is apparent that CIB deprojection and/or the use of fewer frequency channels has little impact on $\hat{y}_0$ (see Section 7.3). Second column: As the first column but produced assuming a relativistic tSZ SED (see Section 8). Note that there are barely any changes with respect to the first column. This is due to $\hat{y}_0$ changing by a very similar amount for all MMFs (see Fig. 10). Third and fourth columns: As the first column but for our randomized and correlated Websky catalogues, respectively. These two catalogues are obtained by applying the cluster detection and $\hat{y}_0$ re-extraction pipelines to our two sets of Planck-like synthetic maps, artificially removing the spatial correlation between the CIB and the tSZ signal for one of them and leaving it untouched for the other one (see Section 7).
Figure 8.

First column: Measurements of the MMF amplitude parameter |$\hat{y}_0$| extracted at the average sky position across all 10 iterative MMFs and at a fixed angular scale |$\theta _{500}=5$| arcmin for the common detections in the 10 MMFs (247 detections). In each panel, the vertical axis corresponds to the panel’s MMF (indicated in the axis label), whereas the horizontal axis corresponds to iMMF6. The data points are colour-coded by their detection signal-to-noise ratio for iMMF6. It is apparent that CIB deprojection and/or the use of fewer frequency channels has little impact on |$\hat{y}_0$| (see Section 7.3). Second column: As the first column but produced assuming a relativistic tSZ SED (see Section 8). Note that there are barely any changes with respect to the first column. This is due to |$\hat{y}_0$| changing by a very similar amount for all MMFs (see Fig. 10). Third and fourth columns: As the first column but for our randomized and correlated Websky catalogues, respectively. These two catalogues are obtained by applying the cluster detection and |$\hat{y}_0$| re-extraction pipelines to our two sets of Planck-like synthetic maps, artificially removing the spatial correlation between the CIB and the tSZ signal for one of them and leaving it untouched for the other one (see Section 7).

Distribution of the estimated Compton-y-weighted electron temperatures $T_Y$ for our rSZ cluster subsample, which contains a total of 244 clusters (see Section 8).
Figure 9.

Distribution of the estimated Compton-y-weighted electron temperatures |$T_Y$| for our rSZ cluster subsample, which contains a total of 244 clusters (see Section 8).

Ratio between the measurements of the MMF amplitude parameter $\hat{y}_0$ for the rSZ sample extracted assuming a relativistic tSZ SED and their non-relativistic counterparts, as a function of the Compton-y-weighted electron temperature estimate $T_Y$, for our 10 MMFs. As expected, assuming a relativistic tSZ SED leads to $\hat{y}_0$ measurements that are greater than their non-relativistic counterparts, an effect that increases with the assumed temperature $T_Y$ (see Section 8.2).
Figure 10.

Ratio between the measurements of the MMF amplitude parameter |$\hat{y}_0$| for the rSZ sample extracted assuming a relativistic tSZ SED and their non-relativistic counterparts, as a function of the Compton-y-weighted electron temperature estimate |$T_Y$|⁠, for our 10 MMFs. As expected, assuming a relativistic tSZ SED leads to |$\hat{y}_0$| measurements that are greater than their non-relativistic counterparts, an effect that increases with the assumed temperature |$T_Y$| (see Section 8.2).

As Fig. 10, but for the signal-to-noise $q_{\mathrm{obs}}$. Assuming a relativistic tSZ SED produces $q_{\mathrm{obs}}$ values that are fully in agreement with those obtained with the non-relativistic SED. The impact of the tSZ relativistic corrections on the signal-to-noise measurements in our catalogues is therefore negligible.
Figure 11.

As Fig. 10, but for the signal-to-noise |$q_{\mathrm{obs}}$|⁠. Assuming a relativistic tSZ SED produces |$q_{\mathrm{obs}}$| values that are fully in agreement with those obtained with the non-relativistic SED. The impact of the tSZ relativistic corrections on the signal-to-noise measurements in our catalogues is therefore negligible.

In red, number counts in our validation MMF6 and iMMF6 catalogues (left and right panels, respectively) as a function of both signal-to-noise ratio and redshift (upper and lower panels, respectively); only confirmed (i.e. true) detections are shown. Theoretical predictions for the counts are shown both if the optimization bias is taken into account and if it is neglected (blue and orange points, respectively, with the error bars corresponding to their Poisson errors). The difference between the observed and predicted number counts are also shown (small panels below each main panel). If the optimization bias is taken into account the theoretical prediction provides an excellent description of the observed counts for the iterative (iMMF6) catalogue across both signal-to-noise ratio and redshift.
Figure 12.

In red, number counts in our validation MMF6 and iMMF6 catalogues (left and right panels, respectively) as a function of both signal-to-noise ratio and redshift (upper and lower panels, respectively); only confirmed (i.e. true) detections are shown. Theoretical predictions for the counts are shown both if the optimization bias is taken into account and if it is neglected (blue and orange points, respectively, with the error bars corresponding to their Poisson errors). The difference between the observed and predicted number counts are also shown (small panels below each main panel). If the optimization bias is taken into account the theoretical prediction provides an excellent description of the observed counts for the iterative (iMMF6) catalogue across both signal-to-noise ratio and redshift.

Upper panel: Fraction of true detections (or purity) in our validation catalogues as a function of the signal-to-noise threshold $q_{\mathrm{th}}$. Lower panel: Associated number of false detections. As expected, the purity increases quickly with $q_{\mathrm{th}}$, being 100 per cent at $q_{\mathrm{th}} > 7$.
Figure 13.

Upper panel: Fraction of true detections (or purity) in our validation catalogues as a function of the signal-to-noise threshold |$q_{\mathrm{th}}$|⁠. Lower panel: Associated number of false detections. As expected, the purity increases quickly with |$q_{\mathrm{th}}$|⁠, being 100 per cent at |$q_{\mathrm{th}} > 7$|⁠.

Finally, we note that the estimated purity of the individual cluster catalogues is higher than that of the master catalogue as a whole, which within the cosmology mask has a confirmation fraction of 85.9 per cent at |$q_{\mathrm{th}}=5$| (see Section 4). This is due to the individual catalogues sharing many confirmed detections but proportionally fewer unconfirmed detections, suggesting that a significant fraction of the unconfirmed detections may be false detections. This lower purity must be taken into account if the whole master catalogue is used for any application, something that we discourage in the first place, as it presents a complicated selection function.

5.4 Impact of iterative noise covariance estimation

With the goal of investigating the impact of iterative noise covariance estimation, we show the number counts in the iMMF6 and MMF6 catalogues as a function of signal-to-noise ratio in the upper panel of Fig. 6. We recall that the only difference between these two catalogues is that the iMMF6 catalogue is constructed with iterative noise covariance estimation, MM6 being its non-iterative counterpart. As expected, iterative noise covariance estimation increases the number counts in every signal-to-noise bin. This effect can be understood as a positive change in the signal-to-noise value of each detection. This change can be clearly seen in the lower panel of Fig. 6, which shows the signal-to-noise values for the iMMF6 catalogue plotted against those in the MMF6 catalogue, for the common detections in both catalogues. There it is apparent that iterative noise covariance estimation increases the signal-to-noise value of all detections, with the magnitude of the boost increasing with signal-to-noise value. These results are in agreement with the findings for synthetic observations reported in Zubeldia et al. (2023a, see, in particular, their figs 6 and 7).

There are two contributions to this increase in the signal-to-noise value. First, iterative noise covariance estimation prevents the overestimation of noise covariance due to the presence of the signal, thereby boosting the signal-to-noise value. Moreover, it eliminates the negative covariance bias, further enhancing the signal-to-noise value (see Section 3.3.5 and Zubeldia et al. 2023a).

6 COMPARISON WITH THE Planck MMF3 CATALOGUE

In this section, we compare our MMF6 catalogue with the Planck cluster catalogue that was used in the latest official Planck cosmological analysis, the MMF3 catalogue, which forms part of the Planck PSZ2 catalogue (Planck Collaboration XXVII 2016).

The MMF3 catalogue was obtained using a similar detection pipeline to the one employed in this work. The sky was tessellated into tiles and a matched filter was applied to each tile for a set of angular scales (see Planck Collaboration XXVII 2016). The same cluster profile was used for the matched filter template, the same range of angular scales was considered, and a standard MMF was employed, using the 6 HFI channels, as in our MMF6 catalogue. However, the MMF3 catalogue used the Planck PR2 HFI frequency maps instead of those from PR3 used in this work. In addition, a different tessellation scheme, a less aggressive point-source mask, and a slightly different Galactic mask were applied, and a different multipole range was used in the matched filtering process (all the modes in the cut-outs, instead of |$l_{\mathrm{min}} = 100$| and |$l_{\mathrm{max}} = 2500$| as used here). The noise covariance was estimated non-iteratively assuming it to be equal to the covariance of the data. Despite these differences, the MMF3 catalogue can therefore be most suitably compared with our MMF6 catalogue, as, we recall, the MMF6 catalogue was also obtained using a standard MMF for all 6 HFI channels and without iterative noise covariance estimation.

Fig. 7 shows the number counts as a function of signal-to-noise ratio and redshift in our MMF6 catalogue and in the MMF3 catalogue (including unconfirmed detections) for the union of our cosmology mask and the PSZ2 survey mask (solid blue histogram for MMF6, dashed orange histogram for MMF3). The counts are similar, with the MMF6 catalogue having slightly more detections in the first two signal-to-noise bins. The lower panels show the signal-to-noise values for the common detections in both catalogues (values on the lower left panel and their ratio on the lower right panel). We find 496 common detections with |$q_{\mathrm{obs}} > 5$| in both catalogues (shown here) and 572 common detections if we cross-match the MMF6 catalogue with the full MMF3 catalogue, which was constructed down to |$q_{\mathrm{obs}} = 4.5$|⁠. There is very good agreement in the values between the two catalogues, with the only notable difference being slightly larger signal-to-noise values for MMF3 at the high signal-to-noise end. As expected, there is some scatter. This scatter can be attributed to differences in the maps used (PR2 versus PR3) and in the detection pipeline, such as the tessellation scheme, the bins in Fourier space used for noise covariance estimation, and the multipole range used for matched filtering. This scatter explains why, within the common mask, the MMF6 catalogue includes some detections missing in the MMF3 catalogue and vice versa: near the detection threshold, some objects fluctuate above it in one catalogue and below it in the other one.

Regarding the redshift distribution, the upper right panel of Fig. 7 shows the number counts in the MMF6 and MMF3 catalogues (solid blue and dashed orange histograms, respectively) across redshift for detections above a signal-to-noise threshold of |$q_{\mathrm{th}} = 7$|⁠. We have imposed |$q_{\mathrm{th}} = 7$| to ensure that all the detections in both catalogues have a redshift measurement (see Fig. 5). The redshift distributions for MMF6 and MMF3 are consistent with each other.

7 IMPACT OF THE CIB

In this section, we investigate the impact of the cluster-correlated CIB on our cluster catalogues. We first explain how we set out to do so in Section 7.1. Next, we describe the synthetic observations that we use for comparison with our real measurements in Section 7.2, and we discuss our findings in Section 7.3.

7.1 Re-extracting |$\hat{y}_0$|

As discussed in Section 3.2, any signal that is spatially correlated with the cluster tSZ signal, such as the CIB, can bias the inferred values of the cluster observables. This bias may, in turn, affect number-count cosmological constraints (see also Melin et al. 2018; Zubeldia et al. 2023b). Here, we assess the magnitude of the bias caused by the cluster-correlated CIB (the ‘CIB bias’) by considering the MMF amplitude parameter |$\hat{y}_0$| instead of our preferred cluster observable, the signal-to-noise |$q_{\mathrm{obs}}$|⁠.

The reason for this approach is that |$q_{\mathrm{obs}}$| suffers from a signal-to-noise penalty when fewer frequency channels are used and/or when the CIB is spectrally deprojected, as discussed in Section 5.2. These two effects are difficult to disentangle when considering the signal-to-noise alone. Ignoring selection effects, the value of |$\hat{y}_0$|⁠, however, is not affected by the signal-to-noise penalty: only its measurement uncertainty is. This property makes |$\hat{y}_0$| a more suitable observable for investigating the impact of the CIB bias on our cluster catalogues.

The |$\hat{y}_0$| estimate delivered by the cluster detection pipeline for each cluster is, however, expected to be biased. Indeed, as noted in Section 3.3.4, |$\hat{y}_0$| can be thought of a maximum-likelihood estimate. Maximum-likelihood estimates are, in general, biased (see, e.g. Zubeldia et al. 2021, and also Section 3.3.4); for |$\hat{y}_0$|⁠, this maximum-likelihood bias is analogous to the optimization bias that affects the signal-to-noise (see Sections 3.3.4 and 9 for brief discussions on the optimization bias). In addition, |$\hat{y}_0$| is expected to suffer from a Malmquist-type bias, since the sample is selected on the signal-to-noise, which is strongly correlated with |$\hat{y}_0$|⁠. When comparing |$\hat{y}_0$| values from two different MMFs for a given cluster detected with both MMFs, this bias is expected to be stronger for the MMF featuring a lower signal-to-noise ratio (e.g. due to using fewer channels), as the cluster will lie closer to the selection threshold for that MMF than for the other one.

Due to the presence of these biases in the ‘blind’ |$\hat{y}_0$| measurements, we decide to consider only the clusters detected across all 10 iterative MMFs. We refer to this subsample of the master catalogue, containing a total of 247 clusters, as the ‘CIB subsample’. For the detections in this CIB subsample, we re-extract |$\hat{y}_0$| at the average sky position across the 10 MMFs and at a fixed angular scale of |$\theta _{500} = 5\,$| arcmin. This re-extraction is performed using SZiFi’s fixed mode with iterative noise covariance estimation and all other pipeline specifications set to match those used for cluster detection (see Section 3.3). The re-extraction is expected to reduce both the maximum-likelihood bias and the Malmquist-type bias.

However, these ‘fixed’ |$y_0$| measurements will still be biased due to the mismatch between the angular scale and sky position used for MMF extraction and their true values, an effect commonly known as miscentring for the sky position. In addition, any significant differences between the cluster profile assumed in the MMF and the true cluster profile will also introduce a bias. It is reasonable to assume, however, that these biases will be consistent across all MMFs and that the differences in the |$\hat{y}_0$| measurements across the various MMFs will be dominated by correlated emission, with a potential additional contribution from the Malmquist-type bias and other potential selection effects, which we will quantify with synthetic observations. These differences across the different MMFs will be the focus of the remainder of this section.

7.2 Synthetic observations

We apply our cluster detection and |$\hat{y}_0$| re-extraction pipelines to two sets of synthetic Planck-like maps: one in which the CIB is spatially correlated with the tSZ signal and another one in which this correlation is artificially removed. We do this for two reasons. First, the synthetic observations in which the CIB is not correlated with the tSZ signal provide a way of quantifying the magnitude of the selection biases in our fixed |$\hat{y}_0$| measurements. Secondly, the synthetic observations for which the CIB is correlated with the tSZ signal provide a prediction for the magnitude of the CIB-induced bias on the fixed |$\hat{y}_0$| measurements that can then be contrasted with the real fixed |$\hat{y}_0$| measurements.

We use the same synthetic maps that were used to assess CIB contamination of MMF cluster detection in Zubeldia et al. (2023b). These maps consist of full-sky temperature maps at the six frequency channels of the Planck HFI. They are pixelized using the HEALPix pixelization scheme (Górski et al. 2005) with |$N_{\mathrm{side}} = 2048$| that used in the real Planck HFI maps. Each temperature map contains the following components, all of which are convolved with the corresponding Planck RIMO beam and the HEALPix pixel transfer function (except the instrumental noise, which is only convolved with the pixel transfer function):

  • tSZ: The Compton-y map is taken from the Websky simulation and is rescaled at each frequency by the non-relativistic tSZ SED. Websky8 (Stein et al. 2019, 2020) is an all-sky second-order Lagrangian perturbation theory lightcone simulation with a minimum halo mass |$M_{200} \approx 1.4 \times 10^{12} M_{\odot }$|⁠. Since the Websky Compton-y map has |$N_{\mathrm{side}} = 4096$|⁠, we degrade it to |$N_{\mathrm{side}} = 2048$| with the HEALPix ud_grade function after beam convolution. We degrade in a similar way all the Websky maps that we employ (kSZ, lensed CMB, and CIB). We neglect the effect of relativistic temperature corrections to the tSZ signal (e.g. Itoh et al. 1998; Challinor & Lasenby 1998; Sazonov & Sunyaev 1998; Chluba et al. 2012), which we will quantify with the real Planck data in Section 8.

  • kSZ: The kinetic SZ (kSZ) signal is also taken from the Websky simulation and includes both the late-time and the reionisation contributions (Stein et al. 2020).

  • Lensed CMB: The lensed CMB is also taken from the Websky simulation.

  • CIB: The CIB is also taken from the Websky simulation, which provides a map for each of the Planck HFI frequencies. At the Planck frequencies, the Websky CIB emission from redshift z is given by a modified blackbody (see equation 1), with emissivity index |$\beta = 1.6$|⁠, dust temperature at |$z=0$|  |$T_0 = 20.7$| K, and |$\alpha =0.2$| (Stein et al. 2020).

  • Instrumental noise: We generate a full-sky realization of Gaussian white noise with noise levels, for increasing frequencies, of 77.4 |$\mu$|K arcmin, 33 |$\mu$|K arcmin, 46.8 |$\mu$|K arcmin, 153.6 |$\mu$|K arcmin, 46.8 kJy sr|$^{-1}$| arcmin, and 43.2 kJy sr|$^{-1}$| arcmin (Planck Collaboration VIII 2016). The noise is uncorrelated between different channels, and, as noted above, is convolved only by the HEALPix pixel transfer function, and not by the instrument beam.

As in Zubeldia et al. (2023b), our maps do not contain Galactic foregrounds and radio point sources. However, the Websky CIB includes bright point-like sources that can significantly contaminate cluster detection. In order to minimize their impact, we construct a point-source mask by running SZiFi as a point-source finder on each of the six synthetic frequency maps, producing a point-source catalogue for each frequency map. SZiFi is applied using iterative noise covariance estimation, and a signal-to-noise selection threshold of 5 is imposed. We then merge the individual frequency catalogues to produce a final point-source catalogue. Finally, a binary point-source mask is constructed by zeroing all the pixels within a masking distance of each detected point source. We use twice the beam FWHM of the channel in which each point source was detected as the masking angular distance.

We apply our SZiFi-based cluster detection pipeline to these synthetic frequency maps, using our SZiFi point-source mask and the Planck PR2 80 per cent Galactic mask that we also used for cluster detection in the real Planck maps. We use the same SZiFi configuration (multipole ranges, matched filter template, etc.) as in our analysis of the real Planck maps (see Section 3.3) and produce catalogues for our 10 iterative MMFs (see Section 3.2). We then re-extract |$\hat{y}_0$| for the common detections across all 10 MMFs at the average MMF sky position and at |$\theta _{500} = 5$| arcmin, as we did for the real Planck detections in the CIB subsample.

We perform this cluster detection and |$\hat{y}_0$| re-extraction procedure twice, producing two sets of catalogues. In the first instance, the Websky CIB is left as it is, spatially correlated with the tSZ signal, producing our ‘correlated Websky catalogues’. In the second instance, we artificially remove this correlation, producing our ‘randomized Websky catalogues’. Since SZiFi processes each selection tile independently, we remove the correlation by assigning to each tile the CIB cut-out corresponding to another randomly chosen tile, instead of the CIB cut-out associated with it. For reference, there are a total of 290 common detections across the correlated Websky catalogues and a total of 428 across the randomized ones.

We note that, for our synthetic simulations, the CIB is the only component that can potentially bias our cluster tSZ measurements. Indeed, although the Websky kSZ field is spatially correlated with the tSZ one, it averages out for our clusters due to being proportional to the cluster velocity along the line of sight. This was numerically verified in Zubeldia et al. (2023a, b), where it was shown that, for Websky, removing the CIB correlation leads to unbiased cluster tSZ observables.

7.3 Results and discussion

The first column of Fig. 8 shows the re-extracted fixed |$\hat{y}_0$| measurements in the CIB subsample for the real data. In each panel, the vertical axis corresponds to |$\hat{y}_0$| for the panel’s MMF (indicated in the panel’s vertical axis label), while the horizontal axis corresponds to the same quantity for iMMF6. The data points are colour-coded by their detection signal-to-noise for iMMF6. It can be seen that CIB deprojection and/or the use of fewer frequency channels have little impact on |$\hat{y}_0$|⁠. A small shift is observed for sciMMF5 and sciMMF5 |$\beta$|⁠, and perhaps also for sciMMF5 |$\beta _T$| and iMMF4, at large |$\hat{y}_0$| values. These shifts, however, are of low significance due to the small number of clusters at the high |$\hat{y}_0$| end of the distribution.

These findings imply that CIB contamination of the |$\hat{y}_0$| and signal-to-noise |$q_{\mathrm{obs}}$| measurements in our catalogues is expected to be minimal. In particular, they imply that the catalogue featuring the largest overall signal-to-noise ratio, the iMMF6 catalogue, contains little CIB contamination. Therefore, it may be possible to use it for cosmological inference without modelling this contamination, although further careful assessment at the likelihood analysis level is required.

The third and fourth columns of Fig. 8 are analogous to the first column but show instead the re-extracted fixed measurements for the randomized and correlated synthetic Websky catalogues, respectively. The distributions of the data points for the randomized catalogues are similar to those for the real Planck data seen in the first column. At large |$\hat{y}_0$| values, most MMFs produce |$\hat{y}_0$| measurements that agree with the iMMF6 ones. Small shifts are observed, as for the real data, for sciMMF5 and sciMMF5 |$\beta$|⁠, and perhaps also for sciMMF5 |$\beta _T$| and iMMF4.

We attribute the shifts at the large |$\hat{y}_0$| end for the randomized sciMMF5, sciMMF5 |$\beta$|⁠, sciMMF5 |$\beta _T$|⁠, and iMMF4 catalogues to a selection effect, as they are also obtained from synthetic observations in which the tSZ signal is not correlated with any other signals. Indeed, even though these |$\hat{y}_0$| measurements are not the blind, maximum-likelihood ones, they are extracted from the same maps. Therefore, their fluctuations are also expected to be correlated with the detection signal-to-noise and, as a consequence, they are also expected to be affected by the sample selection. Notably, very similar shifts are observed for the real catalogues obtained with the same MMFs (first column of Fig. 8). This indicates that the shifts seen in these real catalogues can be attributed, at least partially, to this selection effect.

A selection effect may also explain the shapes of the data point distribution at the low |$\hat{y}_0$| end in the two last rows of the randomized CIB column in Fig. 8, with larger |$\hat{y}_0$| measurements for those MMFs than for iMMF6. Note that similar shapes are also observed for the real measurements. As the low |$\hat{y}_0$| end corresponds to objects lying very close to the detection threshold (see the colour code), we attribute these shapes to the Malmquist-type bias mentioned in Section 7.1: for our CIB subsample, those MMFs result in significantly lower signal-to-noise values than iMMF6 does, and hence their |$\hat{y}_0$| measurements are more strongly Malmquist-biased, i.e. larger.

To further investigate the origin of these features in the randomized case (shifts at large |$\hat{y}_0$| values, distribution shapes at low |$\hat{y}_0$| values), we extract |$\hat{y}_0$| from our Websky synthetic maps for a mass-selected cluster sample. In this case, both features disappear, allowing us to fully attribute them to the sample selection. The results for our mass-selected sample are discussed in more detail in Appendix  B.

On the other hand, as seen in the fourth column of Fig. 8, when the Websky CIB is spatially correlated with the tSZ signal there are significant shifts in the extracted |$\hat{y}_0$| values compared to the randomized case. These shifts result from the spatial correlation between the Websky CIB and tSZ fields, which biases |$\hat{y}_0$| for iMMF6. This bias is reduced or completely removed by using fewer frequency channels and/or spectral deprojection. Indeed, in Zubeldia et al. (2023b) it was shown, using the same Planck-like synthetic observations, that spectrally constrained MMFs are highly effective at suppressing the CIB bias, especially when the first-order moment with respect to either |$\beta$| or |$\beta _T$| is deprojected, delivering CIB-free signal-to-noise ratio and Compton-y measurements (see their figs 5, 9, and A2). It must be noted, however, that the significant difference between these shifts and those observed in the real measurements (first column of Fig. 8) indicates that the Websky CIB model around the large-mass, low-redshift clusters detected by Planck is not an accurate description of reality, but that is has a much larger amplitude than we measure from the real Planck data. This puts a cautionary note on the use of simulations for the calibration of the mass-observable relation of tSZ cluster observables, with, e.g. empirical calibration with lensing observations being a more robust alternative (see e.g. Zubeldia & Challinor 2019; Bocquet et al. 2024b).

Finally, we note that for the real re-extracted fixed |$\hat{y}_0$| measurements there is excellent agreement between iMMF6 and iMMF5 (first column of Fig. 8). This agreement implies that the |$\sim 10$| per cent absolute calibration uncertainty of the Planck 857 GHz channel (Planck Collaboration VIII 2016) has a negligible impact on our catalogues. This is something that we have verified for the first time regarding cluster detection, it not having been assessed in the official Planck analyses, which only used all HFI channels (e.g. Planck Collaboration I 2012; Planck Collaboration XXIX 2014; Planck Collaboration XXVII 2016).

8 IMPACT OF THE TSZ RELATIVISTIC CORRECTIONS

The Planck  SZiFi catalogues have been produced ignoring the relativistic corrections to the tSZ SED, i.e. assuming that the electrons in the intracluster medium (ICM) of the detected clusters move at non-relativistic speeds. Indeed, in our MMFs, the tSZ SED is taken to be the non-relativistic SED, which is independent of electron temperature (see Section 3.3.3). However, it is well known that electrons in the ICM of massive clusters, such as those detectable by Planck, have typical temperatures of a few keV (e.g. Refregier et al. 2000; Nagai, Kravtsov & Vikhlinin 2007). This implies that they move at a fraction of the speed of light, making the tSZ relativistic corrections potentially relevant to cosmological analyses (e.g. Remazeilles et al. 2019; Rotti et al. 2021).

In this section, we investigate the impact of this assumption on our catalogues. Specifically, we quantify the impact of the tSZ relativistic corrections on the cluster observables |$\hat{y}_0$| and |$q_{\mathrm{obs}}$|⁠. In addition, we assess whether the tSZ relativistic corrections contribute in any significant way to the small shifts observed at large |$\hat{y}_0$| values in our re-extracted fixed |$\hat{y}_0$| measurements for the CIB subsample (see Section 7). Our methods are presented in Section 8.1, followed by our results in Section 8.2.

8.1 Obtaining cluster temperatures and re-extracting |$\hat{y}_0$|

We re-extract |$\hat{y}_0$| and |$q_{\mathrm{obs}}$| for the clusters in the CIB subsample with a redshift measurement (referred to as the ‘rSZ subsample’) assuming the relativistic tSZ SED. There are 244 clusters in this rSZ subsample out of a total of 247 clusters in the CIB subsample. We use the same re-extraction pipeline as in our assessment of CIB contamination (Section 7), except that the tSZ SED is no longer non-relativistic.

Since the relativistic tSZ SED depends on the electron temperature (Itoh et al. 1998; Challinor & Lasenby 1998; Sazonov & Sunyaev 1998), we first need a temperature estimate for each cluster. We obtain these temperature estimates using our blind iMMF6 |$\hat{y}_0$| and |$\hat{\theta }_{500}$| measurements and a set of scaling relations. Specifically, we use the following Compton-y–temperature scaling relation from Lee et al. (2022), which has been calibrated with hydrodynamical simulations:9

(6)

Here, |$T_Y$| is the Compton-y-weighted electron temperature, which is the most suitable integrated temperature regarding the tSZ relativistic corrections (Lee et al. 2020, 2022); |$Y_{500,\mathrm{physical}}$| is the cluster’s Compton-y parameter integrated within the cluster’s angular scale |$\theta _{500}$|⁠, in physical units; and |$E(z) = H(z)/H_0$| is the dimensionless expansion rate. The amplitude and exponent parameters, |$A(z)$| and |$B(z)$|⁠, are evaluated at each cluster’s redshift by linearly interpolating the values obtained in Lee et al. (2022) at four redshifts, |$\boldsymbol {z} = (0, 0.25, 0.5, 1, 1.5)$|⁠, at which |$A(z)$| and |$B(z)$| were found to be |$\boldsymbol {A} = (3.123, 2.839, 2.584, 2.113)$| and |$\boldsymbol {B} = (0.364, 0.369, 0.363, 0.361)$|⁠, respectively.

For the Arnaud et al. (2010) profile assumed for cluster detection and for given values of |$y_0$| and |$\theta _{500}$|⁠, the cluster’s Compton-y parameter integrated within |$\theta _{500}$| in angular units, |$Y_{500,\mathrm{angular}}$|⁠, can be written as

(7)

where |$I=0.06728$| is a numerical factor that is independent of the cosmological model, the cluster mass and redshift, and that we compute numerically. In physical units, the integrated parameter, |$Y_{500,\mathrm{physical}}$|⁠, is then given by

(8)

where |$D_{\mathrm{A}}$| is the angular diameter distance to the cluster’s redshift, and where we note that this expression is valid if |$Y_{500,\mathrm{angular}}$| is expressed in arcmin|$^2$|⁠.

For each cluster in the rSZ sample, we obtain an estimate for |$Y_{500,\mathrm{physical}}$| using equations (7) and (8). We use the cluster’s blind |$y_0$| and |$\theta _{500}$| measurements, as obtained with iMMF6, since this MMF provides the highest signal-to-noise ratio. As we show with synthetic observations in Section 9, these |$Y_{500,\mathrm{physical}}$| estimates are highly correlated with their true values, although they present a Malmquist-type bias that we ignore here. Any differences between the true cluster profile and the Arnaud et al. (2010) profile that we assume will cause an additional bias. We then use the scaling relation of equation (6) in order to obtain an estimate of the Compton-y-weighted temperature for each cluster. We evaluate |$E(z)$| and |$D_{\mathrm{A}}$| assuming the same cosmological model that was used to compute the cluster comoving distances in Section 5.1.

The temperature distribution that we obtain for the rSZ subsample is shown in Fig. 9. As expected for the low-redshift, massive clusters that Planck detects, the ICM electrons have temperatures of a few keV, with a sample mean temperature of |$\left\langle T_{Y} \right\rangle = 5.419 \pm 0.081\, {\rm keV}$| that is consistent with the one found in Erler et al. (2018), where we note that the quoted uncertainty is the empirical standard deviation on the mean. We note, however, that these temperature estimates must not be interpreted as accurate measurements, rather as estimates, better than the zero-temperature assumption underlying the non-relativistic tSZ SED. Indeed, they are obtained assuming a specific cluster profile and a simulation-calibrated Compton-y–temperature scaling relation, and are expected to be Malmquist-biased. Therefore, we have not attempted to marginalize over any measurement or modelling uncertainties and have not assigned measurement errors to them.

We then re-extract |$\hat{y}_{0}$| and |$q_{\mathrm{obs}}$| with the re-extraction pipeline of Section 7, i.e. using the MMF-averaged sky position and a fixed angular scale of |$\theta _{500} = 5$| arcmin. However, for each cluster we now construct the MMFs with the relativistic tSZ SED evaluated at the cluster’s |$T_Y$| estimate. We compute the relativistic tSZ SED with the szpack10 package (Chluba et al. 2012, 2013), which we have interfaced with SZiFi. In particular, we use a combination of analytic approximations at the low temperature (⁠|$\lesssim 3\, {\rm keV}$|⁠) end, while above this temperature a pre-computed basis is used in order to avoid convergence issues inherent to the asymptotic expansion approach (see Chluba et al. 2012 for a detailed discussion).

We note that we have followed this simulation-based approach, instead of, e.g. using existing temperature measurements, in order to (i) have a set of homogeneous temperature estimates and (ii) for them to be estimates of the Compton-y-weighted temperature, which is the most suitable temperature for computing the tSZ relativistic corrections (Lee et al. 2020, 2022).

8.2 Results and discussion

Fig. 10 shows the ratio between the re-extracted fixed |$\hat{y}_0$| measurements obtained with the relativistic tSZ SED and their non-relativistic counterparts, plotted against the electron temperature |$T_Y$| estimates for the rSZ subsample and all our MMFs. As expected, this ratio increases with |$T_Y$|⁠. Also as expected, the relativistic |$\hat{y}_0$| measurements take larger values than their non-relativistic counterparts. This is due to the fact that, at the Planck HFI frequencies, both below and above the tSZ null, the relativistic rSZ SED has a lower amplitude than the non-relativistic one, as comparatively more photons are scattered into higher frequencies, above 857 GHz. At a fixed signal, this decrease in the amplitude of the SED is compensated by a larger estimated Compton-y value (see e.g. Remazeilles et al. 2019).

The shifts in the |$\hat{y}_0$| measurements seen in Fig. 10 are at the 5–10 per cent level for most clusters across all MMFs, indicating that the |$\hat{y}_0$| measurements in our catalogues are probably biased by about 5–10 per cent due to the assumption of the non-relativistic tSZ SED for cluster detection (a bias that adds to the other biases mentioned above). Interestingly, these shifts only depend mildly on the MMF used, at the 1–2 per cent level.

Despite |$\hat{y}_0$| changing by about 5–10 per cent, the signal-to-noise |$q_\mathrm{obs}$| exhibits no systematic shift at all.11 This can be seen in Fig. 11, which is the signal-to-noise analogue to Fig. 10. In it, it is apparent that the |$q_\mathrm{obs}$| values are insensitive to the change in the tSZ SED for all MMFs, with any shifts being constrained to less than 1 per cent in |$q_\mathrm{obs}$|⁠. We recall that our temperature estimates are not to be regarded as accurate measurements, but rather as estimates. Their range, however, spans over a factor of 4 in temperature, and no temperature dependence is seen in Fig. 11. It follows that our findings regarding the signal-to-noise ratio would not change significantly if our temperature estimates were, say, a factor of 2 away from the true electron temperatures. We can, therefore, safely conclude that the tSZ relativistic corrections have a negligible impact on the signal-to-noise measurements in our catalogues, even if the inferred |$\hat{y}_0$| values are noticeably biased. This result is particularly relevant for cosmological number-count analyses, in which the signal-to-noise ratio is typically used as the cluster tSZ observable. We note that the tSZ relativistic corrections will, however, add some scatter to the signal-to-noise values, which could be decreased through the use of a relativistic SED for cluster extraction, as performed here for the tSZ subsample.

Finally, the second column of Fig. 8 compares the relativistic-SED re-extracted fixed |$\hat{y}_0$| measurements for each MMF with those for iMMF6. The panels are virtually identical to those in the first column, which correspond to re-extraction with a non-relativistic SED. This was expected given the minimal variation in the shifts of the |$\hat{y}_0$| measurements across different MMFs seen in Fig. 10. Thus, the tSZ relativistic corrections seem an unlikely explanation for the small shifts in the |$\hat{y}_0$| measurements at large |$\hat{y}_0$| values observed for some MMFs.

This is the first time that the significance of the tSZ relativistic corrections on the cluster signal-to-noise measurements of tSZ-selected samples is quantified with real data. Indeed, this issue was not investigated in the official Planck analyses (e.g. Planck Collaboration I 2012; Planck Collaboration XXIX 2014; Planck Collaboration XXVII 2016), nor for the ACT and SPT catalogues (Bleem et al. 2015; Hilton et al. 2018; Bleem et al. 2020, 2022, 2024). In Hilton et al. (2018), a scaling relation including a term accounting for the tSZ relativistic corrections was proposed, but its significance was not quantified with data.

9 VALIDATION OF THE CLUSTER DETECTION PIPELINE

In this section, we describe how we validate our SZiFi-based cluster detection pipeline with synthetic data. In particular, we apply it to synthetic Planck-like frequency maps, producing a ‘validation catalogue’ for each of the 10 MMFs considered in our analysis of the Planck data (Section 9.1; see also Section 3.2). We place a particular emphasis in testing our ability to theoretically predict the cluster counts in our validation catalogues as a function of signal-to-noise ratio and redshift (Section 9.2), as this is an important step forward towards using our real catalogues in a cosmological number-count analysis. We also quantify the purity of the validation catalogues, assessing the number and nature of false detections (Section 9.3), and investigate how the inferred cluster observables (⁠|$\hat{y}_0$|⁠, |$\hat{\theta }_{500}$|⁠, integrated Compton-y, and cluster sky coordinates) are related to their true values (Sections 9.4 and 9.5). Throughout this section, we focus on our validation iMMF6 and MMF6 catalogues, analysing the other validation catalogues in Appendix  C.

9.1 Synthetic maps and cluster catalogues

9.1.1 Synthetic input catalogue

We generate a synthetic cluster catalogue by obtaining samples from the halo mass function using the synthetic catalogue generator of the cosmocnc12 package (Zubeldia & Bolliet 2024; Zubeldia 2024). We use the Tinker et al. (2008) halo mass function, computed within cosmocnc using the cosmopower13 power spectra emulator (Spurio Mancini et al. 2022; Bolliet et al. 2024), and assume a flat |$\Lambda$|CDM cosmological model with |$\Omega _{\mathrm{c}} = 0.26603$|⁠, |$\Omega _{\mathrm{b}} = 0.04897$|⁠, |$h = 0.674$|⁠, |$A_s= 2.08467 \times 10^{-9}$|⁠, |$n_s = 0.96$|⁠, and |$\sum m_\nu = 0.06$| eV. The parameter values are taken to be consistent with the Planck CMB constraints (Planck 2018 results VI 2020). We assume a minimum mass of |$M_{500,\mathrm{min}} = 4 \times 10^{14} M_{\odot }$| and a maximum mass of |$M_{500,\mathrm{max}} = 10^{16} M_{\odot }$|⁠, and minimum and maximum redshifts |$z_{\mathrm{min}} = 0.01$| and |$z_{\mathrm{max}} = 1$|⁠, respectively. The redshift limits and the lower mass limit are chosen so that most clusters to be detected in Planck data are spanned (see, e.g. fig. 1 of Planck 2015 results XXIV 2016).

The synthetic catalogue is obtained by first calculating the mean total number of clusters in the full sky within the mass and redshift range considered, |$\bar{N}_{\mathrm{tot}}$|⁠. This is done by numerically integrating the halo mass function multiplied by the differential volume element. The actual realisation of the total number of clusters, |$N_{\mathrm{tot}}$|⁠, is then obtained as a Poisson draw from |$\bar{N}_{\mathrm{tot}}$|⁠. We obtain |$N_{\mathrm{tot}} = 8683$|⁠. A total of 8683 mass–redshift pairs are then drawn from the product of the halo mass function with the differential volume element, and a random sky position is then assigned to each of them. We refer the reader to Zubeldia & Bolliet (2024) for further details about the catalogue generator.

9.1.2 Synthetic sky maps

We use our synthetic cluster catalogue to produce a full-sky Compton-y map by painting in a Compton-y profile for each cluster. We assume the same cluster profile that is used in our MMFs for cluster finding, namely the Compton-y profile due to the Arnaud et al. (2010) pressure profile truncated at |$5\theta _{500}$|⁠. We evaluate the profile at its best-fitting parameter values and at a biased mass |$\beta _{\mathrm{SZ}} M_{500}$|⁠, where |$\beta _{\mathrm{SZ}} = 0.62$| is the so-called hydrostatic mass bias (hereafter, the ‘SZ mass bias’, also denoted with |$1-b$| in the literature), which accounts for the difference between the X-ray-derived hydrostatic masses underlying the Arnaud et al. (2010) profile and the true halo mass. Its chosen value is consistent with that which is found to reconcile the Planck MMF3 cosmology sample with the Planck CMB (Planck 2015 results XXIV 2016). We use a HEALPix pixellation with |$N_{\mathrm{SIDE}} = 2048$| that is identical to that of the Planck HFI maps.

Next, we generate a synthetic tSZ signal map for each observation frequency channel. We do this by convolving our Compton-y map with the instrument beam for each channel and rescaling it by the non-relativistic tSZ SED at each channel effective frequency. We assume six frequency channels at 100, 143, 217, 353, 545, and 857 GHz, taking the band transmission profiles to be delta functions and Gaussian beams with FWHMs of 9.68, 7.3, 5.02, 4.94, 4.83, and 4.64 arcmin (taken from Planck 2015 results VIII 2016).

We then add noise and beam-convolved foregrounds to our synthetic tSZ maps. We use the same synthetic Planck-like noise and foreground maps that were used in Section 7, recalling that the latter comprise three foregrounds: the lensed CMB, the kSZ and the CIB, all taken from the Websky simulation. By construction, the CIB maps are not spatially correlated with our synthetic tSZ signal.

9.1.3 Validation cluster catalogues

We apply our SZiFi-based cluster finding pipeline to our synthetic sky maps. We use exactly the same configuration that was used in the processing of the real Planck maps (see Section 3) except in the modelling of the beams (using Gaussian beams instead of the RIMO ones) and of the band transmission profiles (assuming delta function profiles). We use the same Galactic mask that was employed for our analysis of the real Planck maps (the Planck PR2 80 per cent Galactic mask) and the point-source mask that we constructed for the Websky CIB in Section 7. We impose a final signal-to-noise threshold of |$q_{\mathrm{th}} = 5$|⁠.

We produce a non-iterative and an iterative catalogue for each of the 10 MMFs considered in our real analysis (see Section 3.2), referring to these catalogues as our validation catalogues. We then cross-match each validation catalogue with the input synthetic catalogue. We use the cross-matching algorithm described in Section 3.4, which here is used to assess whether each detection corresponds to a real cluster (if it has a counterpart in the input catalogue) or is a false detection (if it has no counterpart).

As for our real catalogues (see Section 4.3), after cross-matching the validation catalogues with the input catalogue we manually inspect all the unconfirmed detections, removing what we deem to be true false detections. As discussed in Section 4.3, these are (i) large-angular-scale detections located very close to the tile edges that have not been successfully merged with the corresponding detection in the neighbouring tile with which they form a single object, or (ii) large-angular-scale detections very close to point-source-masked regions. For reference, we find 5 detections of the first kind and 3 of the second kind out of a total of 42 unconfirmed detections in our validation iMMF6 catalogue, 5 of the first kind and 4 of the second kind out of a total of 44 in our validation iMMF5 catalogue, and 6 of the first kind and 3 of the second kind out of a total of 62 in our validation iMMF4 catalogue. These numbers are comparable to those in our real Planck catalogues (see Section 4.3).

9.2 Number counts

We first consider the number counts in our validation catalogues as a function of signal-to-noise and redshift, assessing whether we can produce an accurate theoretical prediction for them. We compute our theoretical number-count prediction with the cosmocnc package, which for a given cluster mass observable (e.g. the tSZ signal-to-noise ratio) can evaluate the cluster number counts as a function of the observable and of redshift. We assume the same halo mass function that we used to generate our input catalogue, and evaluate it at the input cosmology.

Our mass observable is the tSZ signal-to-noise ratio, |$q_{\mathrm{obs}}$|⁠. Following Zubeldia et al. (2021), its expected value at given mass |$M_{500}$| and redshift z, |$\bar{q}_{\mathrm{obs}} (M_{500},z)$|⁠, can be written as

(9)

where |$\bar{q} (M_{500},z)$| is the mean signal-to-noise extracted at the true sky position and angular size of the cluster and f is the number of degrees of freedom over which the signal-to-noise is maximized in the cluster detection process. There are |$f=3$| degrees of freedom: the cluster’s two sky coordinates and its angular scale |$\theta _{500}$|⁠. As discussed in Zubeldia et al. (2021), neglecting this degree-of-freedom correction by setting |$f = 0$| (as done, e.g. in Planck 2015 results XXIV 2016) leads to an ‘optimisation bias’ in the modelling of the clusters’ signal-to-noise measurements (see also Vanderlinde et al. 2010, where this correction was proposed for the first time in a heuristic manner, noting that it was applied in all subsequent SPT number count analyses, e.g. in Bocquet et al. 2024b). For the Arnaud et al. (2010) pressure profile |$\bar{q} (M_{500},z)$| can be related to the cluster mass and redshift with the scaling relation

(10)

where |$y_0$| is the central Compton-y value of the cluster and |$\sigma _{\mathrm{f}}$| is the MMF noise evaluated at the cluster’s angular scale |$\theta _{500}$| (see Section 3.3.3). |$y_0$|⁠, in turn, is given by

(11)

where |$A_{\mathrm{SZ}}=-4.3054$|⁠, |$\alpha _{\mathrm{SZ}}=1.12$|⁠, |$h_{70} = h/0.7$| and |$E(z) = H(z)/H_0$|⁠. All the cosmology-dependent quantities used to compute |$\theta _{500}$| and |$E(z)$| are evaluated at the input cosmology.

Since there is no intrinsic scatter, as the only source of scatter in our synthetic maps is that due to the instrumental noise and foregrounds, within the cosmocnc framework |$q_{\mathrm{obs}}$| is said to be linked to the cluster mass and redshift with a one-layer hierarchical model. In the only layer, the mass observable |$q_{\mathrm{obs}}$| has a mean given by equation (9) and unit-variance Gaussian scatter, as the signal-to-noise is defined by normalising the MMF amplitude parameter |$\hat{y}_0$| by its standard deviation (see equation 5).

Fig. 12 presents histograms of the number counts as a function of |$q_{\mathrm{obs}}$| (upper panels) and redshift (lower panels) for our MMF6 (non-iterative) and iMMF6 (iterative) catalogues (left and right panels, respectively). Only the true detections, i.e. those with a counterpart in the input catalogue, are shown. Fig. 12 also shows the corresponding cosmocnc predictions made assuming the input halo mass function and the scaling relation of equations (9)–(11), both if the optimization bias is accounted for by setting |$f=3$| (blue points) and also if it is ignored by setting |$f=0$| instead (orange points). For each predicted value, we also show the associated Poisson error bar, given by its square root. We stress that these are predictions and not fits to the observed counts. The difference between the observed and predicted number counts |$\Delta N$| are also shown (small panels below each main panel).

As it is apparent in Fig. 12, our prediction with the optimization bias taken into account successfully describes the number counts in the iMMF6 validation catalogue, both across signal-to-noise ratio and redshift. For its non-iterative counterpart, the observed counts fall systematically below the predicted ones by a small amount (⁠|$\sim 1\, \sigma$|⁠). This is due to the negative covariance bias in the signal-to-noise measurements that we briefly discuss in Section 3.3.5 and is investigated in detail in Zubeldia et al. (2023a). Note, in particular, that this effect is not due to the noise covariance in the non-iterative case being overestimated, as the same (overestimated) noise covariance is used in the theoretical prediction (in particular, in equation 10).

It is also clear in Fig. 12 that the optimization bias is a very significant effect at the low signal-to-noise end. While adopting the simple degree-of-freedom correction of equation (5) results in a good prediction for the observed counts, neglecting the optimization bias leads to a significant under prediction of the observed counts. This number-count under prediction at low signal-to-noise ratio spreads over a broad redshift range, as can be seen in the lower panels.

Similar conclusions can be drawn by simply considering the total number of clusters in the catalogue. The iterative catalogue contains a total of 785 clusters. If the optimization bias is taken into account |$818 \pm 29$| clusters are predicted (1.14 |$\sigma$| away from the observed value), whereas if it is ignored, |$688 \pm 26$| clusters are predicted instead (⁠|$-3.73$||$\sigma$| away from the observed value). On the other hand, the non-iterative catalogue consists of 719 clusters. Its corresponding theoretical predictions are |$780 \pm 28$| if the optimization bias is taken into account, and |$654 \pm 25$| otherwise, values which are, respectively, 2.17 |$\sigma$| and |$-2.60$||$\sigma$| away from the observed value.

We note that the mitigation of both biases (optimization and covariance) following the same modelling approach as here was already demonstrated in Zubeldia et al. (2023a) by comparing the retrieved signal-to-noise values with their ‘true’ values (see, in particular, their figs 8 and 9). However, the validation exercise presented here goes further, confronting the recovered number counts with a theoretical prediction, something that was not done in Zubeldia et al. (2023a).

In summary, when the noise covariance is estimated iteratively and the optimization bias is taken into account, our model successfully describes the observed counts in our validation catalogue (iMMF6 catalogue) as a function of both signal-to-noise ratio and redshift. This constitutes an important stepping stone towards using our real iMMF6 catalogue for cosmological inference. In Appendix  C we consider our remaining validation catalogues, similarly finding good agreement between the observed counts and their theoretical prediction (see, in particular, Fig. C1).

9.3 Purity

The upper panel of Fig. 13 shows the fraction of confirmed detections in our validation iMMF6, iMMF5, and iMMF4 catalogues as a function of signal-to-noise threshold. As the catalogue against which the validation catalogues have been cross-matched is the input cluster catalogue, this fraction corresponds to the fraction of true detections in the catalogue, also known as catalogue purity. The lower panel, on the other hand, shows the associated number of unconfirmed detections, which in this case corresponds to the number of false detections.

The overall picture is similar to that for our real Planck catalogues (see Section 5.3). The purity increases quickly with signal-to-noise from about 96 per cent for iMMF6 and iMMF5 and about 90 per cent for iMMF4 at |$q_{\mathrm{th}}=5$| to over 99 per cent for all catalogues at |$q_{\mathrm{th}}=6$| and 100 per cent for all catalogues at |$q_{\mathrm{th}}=7$|⁠. At |$q_{\mathrm{th}}=5$|⁠, the purity for iMMF4 is significantly lower than for iMMF5 and iMMF6, but this difference is reduced significantly at |$q_{\mathrm{th}}=5.5$|⁠. As it was the case for the unconfirmed detections in the real Planck catalogues, for each validation catalogue the number of false detections is approximately a power law of |$q_{\mathrm{th}}$|⁠.

9.4 Cluster observables

Fig. 14 shows the inferred values of four cluster observables for the true detections in our validation iMMF6 catalogue (vertical axes) plotted against their true values in the synthetic data (horizontal axes). In particular, the upper left panel corresponds to the cluster angular scale, whose inferred value for each cluster, |$\hat{\theta }_{500}$|⁠, corresponds to the MMF angular scale at which the signal-to-noise ratio is maximized. The inferred angular scale is found to be correlated with its true value, but with significant scatter. This is a consequence of the signal-to-noise depending weakly on the angular scale for low-significance detections that dominate the sample. Notably, there is a set of clusters that are detected at the smallest search angular scale, |$\hat{\theta }_{500}=0.5$| arcmin, and that appear as outliers in the |$\hat{\theta }_{500}$||$\theta _{500}$| distribution. These detections correspond to clusters that are not resolved by the cluster-finding algorithm for which the signal-to-noise slowly keeps increasing for arbitrarily small values of |$\hat{\theta }_{500}$|⁠.

Inferred cluster observable values plotted against their corresponding true values in the iMMF6 validation catalogue. The observables shown are the cluster angular scale $\theta _{500}$ (upper left panel), the cluster central Compton-y  $y_0$ (which acts as the MMF amplitude parameter; upper right panel), and the cluster integrated Compton-y parameter, both in angular and physical units ($Y_{500,\mathrm{angular}}$ and $Y_{500,\mathrm{physical}}$; lower left and right panels, respectively).
Figure 14.

Inferred cluster observable values plotted against their corresponding true values in the iMMF6 validation catalogue. The observables shown are the cluster angular scale |$\theta _{500}$| (upper left panel), the cluster central Compton-y  |$y_0$| (which acts as the MMF amplitude parameter; upper right panel), and the cluster integrated Compton-y parameter, both in angular and physical units (⁠|$Y_{500,\mathrm{angular}}$| and |$Y_{500,\mathrm{physical}}$|⁠; lower left and right panels, respectively).

The upper right panel is an analogous plot for the MMF amplitude, i.e. the central cluster Compton-y value. Its inferred value, |$\hat{y}_0$|⁠, which corresponds to the MMF amplitude evaluated at the inferred angular scale |$\hat{\theta }_{500}$|⁠, is similarly found to correlate with its true value |$y_0$|⁠. Here, there is also significant scatter, mostly caused by the scatter in |$\hat{\theta }_{500}$|⁠. Indeed, all detections have a signal-to-noise greater than 5, which implies that the noise on |$\hat{y}_0$| at a given angular scale is 20 per cent of its value or less. The outlier clusters of the upper left panel translate here into clusters with very large values of |$\hat{y}_0$|⁠, with |$\hat{y}_0 \sim 10^{-2}$|⁠. This can be thought of as an artefact of the detection method: for unresolved clusters, as |$\hat{\theta }_{500}$| goes to arbitrarily small scales (bound here by the lowest search angular scale), |$\hat{y}_0$| takes arbitrarily large values, so that the signal-to-noise ratio is (approximately) preserved, with it varying very slowly as a function of |$\hat{\theta }_{500}$|⁠.

The lower panels are analogous plots for Compton-y parameter integrated within |$\hat{\theta }_{500}$|⁠, expressed both in angular units, |$Y_{500,\mathrm{angular}}$| (lower left panel), and in physical units, |$Y_{500,\mathrm{physical}}$| (lower right panel). For each cluster, we use equations (7) and (8), which assume the Arnaud et al. (2010) profile, to obtain inferred values for |$Y_{500,\mathrm{angular}}$| and |$Y_{500,\mathrm{physical}}$|⁠, |$\hat{Y}_{500,\mathrm{angular}}$| and |$\hat{Y}_{500,\mathrm{physical}}$|⁠, from the corresponding |$\hat{\theta }_{500}$| and |$\hat{y}_0$| measurements.

As for |$\hat{\theta }_{500}$| and |$\hat{y}_0$|⁠, |$\hat{Y}_{500,\mathrm{physical}}$|⁠, |$\hat{Y}_{500,\mathrm{angular}}$| are found to correlate with their corresponding true values. For |$\hat{Y}_{500,\mathrm{angular}}$|⁠, which can be thought of as being measured more directly by the MMF, a Malmquist-type bias can be clearly observed at the low |$\hat{Y}_{500,\mathrm{angular}}$| end. This selection effect is also visible in the lower right panel, where it is spread over a broader |$\hat{Y}_{500,\mathrm{physical}}$| range. Note that for both |$\hat{Y}_{500,\mathrm{angular}}$| and |$\hat{Y}_{500,\mathrm{physical}}$| there is significant scatter, which is dominated by the uncertainty in the angular scale, as for |$\hat{y}_0$|⁠. Also note that for both integrated quantities there are no outliers, indicating that these quantities are much less affected than |$\hat{\theta }_{500}$| and |$\hat{y}_0$| by the artefact of the detection method described above.

Similar results are obtained for the other MMFs; we have omitted these for conciseness.

9.5 Sky position uncertainty

We quantify the uncertainty in the inferred sky positions of the clusters in our validation catalogues by computing, for each cluster, the angular distance |$\Delta \theta$| between the inferred position and its true sky position. We then model the resulting |$\Delta \theta$| distribution (the ‘miscentring distribution’) as a Rayleigh distribution for each catalogue, fitting for its scale parameter, denoted with |$\sigma _{\mathrm{misc}}$|⁠. This model assumes that the uncertainty in the inferred values for the two cluster sky coordinates follows a Gaussian distribution. We perform the fit in a Bayesian way by assuming each cluster to be statistically independent from each other and to follow the same Rayleigh distribution. Under these assumptions, the likelihood for the |$\Delta \theta$| vector |$\boldsymbol {\Delta } \boldsymbol {\theta }$| at a given value of |$\sigma _{\mathrm{misc}}$| can be written as

(12)

where |$\mathcal {R} (\Delta \theta ; \sigma _{\mathrm{misc}})$| denotes a Rayleigh distribution with scale parameter |$\sigma _{\mathrm{misc}}$| and the product runs over all the clusters in the catalogue. Assuming a flat prior on |$\sigma _{\mathrm{misc}}$|⁠, equation (12) can be thought of a posterior for |$\sigma _{\mathrm{misc}}$|⁠. We use this posterior to infer the value of |$\sigma _{\mathrm{misc}}$| by evaluating it on a |$\sigma _{\mathrm{misc}}$| grid and, in turn, numerically integrating the evaluated posterior, obtaining the inferred mean and standard deviation of |$\sigma _{\mathrm{misc}}$|⁠. We do this taking into account only the clusters with |$\Delta \theta < 3.5$| arcmin, as we find that the clusters with a larger positional uncertainty lead to significantly worse fits to the observed miscentring distribution.

Fig. 15 shows the observed miscentring distribution for the iMMF6 validation catalogue (blue histogram), together with the Rayleigh distribution fit for the mean of the inferred value of the scale parameter, which we find to be |$\sigma _{\mathrm{misc}} = 1.001 \pm 0.018$| arcmin (orange curve). The Rayleigh distribution provides a reasonably good fit to the observed distribution, the only significant departure being a heavier tail in the observed distribution at large values of |$\Delta \theta$| (⁠|$\Delta \theta \gtrsim 3$| arcmin).

Distribution of the angular distance between the inferred cluster sky position and the input value $\Delta \theta$ for the clusters in the iMMF6 validation catalogue (blue histogram) together with its Rayleigh distribution fit (orange curve), which has a scale parameter $\sigma _{\mathrm{misc}} = 1.001 \pm 0.018$.
Figure 15.

Distribution of the angular distance between the inferred cluster sky position and the input value |$\Delta \theta$| for the clusters in the iMMF6 validation catalogue (blue histogram) together with its Rayleigh distribution fit (orange curve), which has a scale parameter |$\sigma _{\mathrm{misc}} = 1.001 \pm 0.018$|⁠.

Similar fits are obtained for our other validation catalogues. The scale parameter values that we find are (in arcmin): |$\sigma _{\mathrm{misc}} = 0.993 \pm 0.018$| (sciMMF6 |$\beta$|⁠), |$\sigma _{\mathrm{misc}} = 1.034 \pm 0.021$| (sciMMF6 |$\beta$|⁠), |$\sigma _{\mathrm{misc}} = 1.048 \pm 0.020$| (sciMMF6 |$\beta _T$|⁠), |$\sigma _{\mathrm{misc}} = 1.013 \pm 0.019$| (iMMF5), |$\sigma _{\mathrm{misc}} 1.023 \pm 0.020$| (sciMMF5), |$\sigma _{\mathrm{misc}} = 1.003 \pm 0.036$| (sciMMF5 |$\beta$|⁠), |$\sigma _{\mathrm{misc}} = 1.040 \pm 0.035$| (sciMMF5 |$\beta _T$|⁠), |$\sigma _{\mathrm{misc}} = 1.029 \pm 0.024$| (iMMF4), |$\sigma _{\mathrm{misc}} = 1.061 \pm 0.030$| (sciMMF4). Note that there is little variation in the miscentring scale parameter across our different filters.

We note that our procedure likely underestimates the position uncertainty, as the clusters in our synthetic data have the same spherically-symmetric profile. The positional errors should also be interpreted as being with respect to the true tSZ peak and not with respect to other cluster centres (e.g. the peak of the gravitational potential). We also note that more complex models for the miscentring distribution can be considered, e.g. models in which the distribution depends on the cluster signal-to-noise ratio (see e.g. Hilton et al. 2018; Bocquet et al. 2024a). These models have the potential to provide a better fit to the observed distribution. We leave the exploration of these models to further work.

10 SUMMARY

In this work, we have introduced the Planck  SZiFi galaxy cluster catalogues, a set of 10 catalogues obtained by applying the SZiFi tSZ cluster finder to the Planck PR3 HFI temperature maps, detecting objects down to a signal-to-noise threshold of 5. Each catalogue is produced with a different frequency channel and CIB deprojection combination (see Table 1). In particular, three of them, the iMMF6 (our baseline catalogue), iMMF5, and iMMF4 catalogues are constructed with a standard iterative MMF: iMMF6 using all HFI channels, iMMF5 excluding the highest-frequency channel, and iMMF4 excluding the two highest-frequency channels. The remaining seven catalogues are obtained for the same three channel combinations, but using SZiFi’s spectrally constrained MMFs instead, deprojecting the CIB and, for four of them, also the first-order moment with respect to the spectral parameter |$\beta$| or |$\beta _T$|⁠. In addition, each catalogue has a non-iterative counterpart that is obtained without iterative noise covariance estimation.

We have compiled all the catalogues into a single Planck  SZiFi master catalogue containing a total of 1499 detections, which we have cross-matched with a set of six existing tSZ and X-ray-selected catalogues and meta-catalogues including a significant number of post-Planck detections. The master catalogue will become publicly available at this link and a brief description of its entries can be found in Table A1. We emphasize, however, that although the master catalogue is produced and made available for convenience, the individual catalogues it contains must be thought of as independent catalogues in their own right, each with a simple, well-defined tSZ selection. The master catalogue, on the other hand, has a more complicated tSZ selection, as well as a significantly lower purity (see Section 5.3). Due to these issues, we strongly recommend to use the individual catalogues (e.g. our baseline catalogue, iMMF6) for any practical applications (e.g. a cosmological analysis), and not the full master catalogue.

The Planck  SZiFi catalogues present two key improvements relative to the official Planck catalogues. First, the MMFs with which they are obtained are constructed using an iterative estimate of the noise covariance. This is achieved by masking the detections made in a first run of the cluster finder from the input temperature maps and re-estimating the covariance with the masked maps. As was shown in Zubeldia et al. (2023b) and is discussed here, using a non-iterative covariance estimate, as done in the official Planck catalogues, leads to a bias in the cluster tSZ observables (signal-to-noise ratio and Compton-y estimate |$\hat{y}_0$|⁠) and to a loss of signal-to-noise ratio. Both effects disappear with our iterative procedure. The second key improvement is the deprojection of the CIB in 7 of the 10 SZiFi catalogues (both iterative and non-iterative) through the use of spectrally constrained MMFs. This constitutes the first application of spectrally constrained MMFs to real data and the first time that any foreground is deprojected in the context of tSZ cluster finding.

Our baseline catalogue, the iMMF6 catalogue, contains the largest number of detections, with 833 of them in total, 790 of which have an available redshift measurement. The redshift distribution peaks at about |$z=0.2$| (see Fig. 3). It has an empirical lower bound on the purity of 96 per cent at a signal-to-noise ratio of 5, 99 per cent at 6, and 100 per cent at 7 (see Fig. 5). As fewer channels are used and/or more SEDs are deprojected, fewer objects are detected, a consequence of the expected associated signal-to-noise penalty (see Figs 3 and 4).

We have compared our non-iterative MMF6 catalogue with the official Planck MMF3 catalogue, as both are obtained using the same frequency channels and the same standard MMF (albeit with several differences, such as the multipole range used and the use of a different Planck data release). Both catalogues are broadly in agreement, featuring similar number counts both as a function of signal-to-noise ratio and redshift, and similar signal-to-noise values for the common detections between the two (see Fig. 7).

We have carefully assessed the impact of the cluster-correlated CIB on the retrieved cluster observables by comparing the Compton-y amplitude |$\hat{y}_0$| values that we have re-extracted with our 10 different MMFs at a fixed angular scale of 5 arcmin for the clusters detected with all 10 MMFs (our CIB subsample). This comparison procedure was chosen in order to minimize selection effects. The differences in these fixed |$\hat{y}_0$| measurements across the different MMFs are very similar to those observed for synthetic data in which the correlation between the tSZ and the CIB fields is artificially removed (see Fig. 8). These findings have led us to conclude that the cluster-correlated CIB likely has a negligible impact on the cluster tSZ observables for the clusters in our catalogues. They also suggest that the impact of the cluster-correlated CIB on cosmological constraints obtained with our baseline iMM6 catalogue (as well as, perhaps, with the MMF3 catalogue) may be negligible (see e.g. Rotti et al. (2021), where the CIB was proposed as a possible explanation for the low SZ mass bias parameter |$1-b$| required to reconcile the Planck MMF3 number counts with the Planck primary CMB power spectra.) This would make the iMMF6 catalogue the most suitable one for a cosmological analysis, as it provides the greatest statistical power. However, this is a possibility that ought to be investigated rigorously at the cosmological analysis level. We note that this assessment is the first of its kind for a Planck cluster sample, not having been performed in the official Planck analyses.

In addition, for the first time in the context of cluster finding, we have quantified the impact of the relativistic corrections to the tSZ SED. We have done so by re-extracting |$\hat{y}_0$| for a cluster subsample (the rSZ subsample) assuming a relativistic tSZ SED, using electron temperature estimates obtained with a simulation-calibrated Compton-y–temperature scaling relation. We have found a significant |$\sim 5$|–10 per cent change on the |$\hat{y}_0$| values but a negligible impact on the signal-to-noise values (see Figs 10 and 11). These findings allow us to conclude that the tSZ relativistic corrections will be negligible in any cosmological analysis of our catalogues in which the signal-to-noise ratio is used as the tSZ mass observable.

Finally, we have validated our SZiFi-based cluster detection pipeline by applying it to synthetic Planck-like data for our 10 MMFs. Most notably, we have been able to produce a theoretical prediction for the number counts in the validation catalogues across both signal-to-noise and redshift that successfully describes them (see Figs 12 and C1). Whilst using a relatively idealized set-up (painting spherically symmetric profiles of massive clusters and employing the CIB from the Websky simulation), this validation exercise, which, to the authors’ knowledge, is the first of its kind for tSZ-detected clusters, constitutes an important stepping stone towards using our catalogues in a cosmological number-count analysis.

We have not attempted to relate the signal-to-noise ratio or |$\hat{y}_0$| measurements in our catalogues to the cluster mass, something that we leave for further work. We also leave to further work the study of other potentially significant effects different from those considered here. These include, e.g. contamination from radio emission, which could be assessed in a similar way to how we quantified the impact of the cluster-correlated CIB, but focusing on the lower frequency channels. Finally, given their high purity (greater than 97 per cent for a signal-to-noise threshold of 5.5 for most catalogues) and the fact that we can successfully produce a theory prediction for our validation catalogues, we note that our Planck  SZiFi catalogues are well suited for a cosmological number-count analysis. We are currently working on one, which we intend to publish within the next year.

ACKNOWLEDGEMENTS

The authors would like to thank Anthony Challinor, Erik Rosenberg, Boris Bolliet, Joseph J. Mohr, Sebastian Bocquet, Matthias Klein, Lindsey Bleem, Nicholas Battaglia, and Fiona McCarthy for useful discussions, and Elizabeth Lee for providing the cluster Compton-y–temperature scaling relation parameter values. The authors would also like to thanks the Aspen Center for Physics, where part of this work was completed.

ÍZ acknowledges support from the STFC (grant numbers ST/W000977/1 and ST/T000414/1). JC was supported by the ERC Consolidator Grant CMBSPEC (no. 725456) and by the Royal Society as a Royal Society University Research Fellow at the University of Manchester, UK (no. URF/R/191023).

This work was performed using resources provided by the Cambridge Service for Data Driven Discovery (CSD3) operated by the University of Cambridge Research Computing Service (csd3.cam.ac.uk), provided by Dell EMC and Intel using Tier-2 funding from the Engineering and Physical Sciences Research Council (capital grant EP/T022159/1), and DiRAC funding from the Science and Technology Facilities Council (dirac.ac.uk), within the DiRAC Cosmos dp002 project.

In this project the authors have made use of the M2C Galaxy Cluster Database, constructed as part of the ERC project M2C (The Most Massive Clusters across cosmic time, ERC Advanced Grant No. 340519).

DATA AVAILABILITY

The Planck  SZiFi master catalogue will become publicly available at this link. Before we make it available, it will be shared upon reasonable request. The synthetic data used throughout this work will also be shared upon reasonable request. The cluster finder used in this work, SZiFi, is already publicly available at github.com/inigozubeldia/szifi.

Footnotes

5

The empirical calibration of the mass-to-tSZ observable scaling relation with, e.g. lensing data may mitigate against this potential bias, at the expense of potentially delivering biased scaling relation parameters if the tSZ observable is interpreted as being exclusively due to the tSZ effect. We leave an exploration of this bias-mitigating approach to future work.

9

We thank Elizabeth Lee for providing the fit parameter values for averaging over |$R_{500}$|⁠, which were not included in Lee et al. (2022). We have also corrected a typo in the normalization of equation (16) of Lee et al. (2022).

11

A similar conclusion has also been reached by A. Rotti (private communication).

REFERENCES

Aguado-Barahona
 
A.
,
Barrena
 
R.
,
Streblyanska
 
A.
,
Ferragamo
 
A.
,
Rubiño-Martín
 
J. A.
,
Tramonte
 
D.
,
Lietzen
 
H.
,
2019
,
A&A
,
631
,
A148
 

Allen
 
S. W.
,
Evrard
 
A. E.
,
Mantz
 
A. B.
,
2011
,
ARA&A
,
49
,
409
 

Alonso
 
D.
,
Sanchez
 
J.
,
Slosar
 
A.
,
LSST Dark Energy Science Collaboration
,
2019
,
MNRAS
,
484
,
4127
 

Amodeo
 
S.
 et al. ,
2018
,
ApJ
,
853
,
36
 

Arnaud
 
M.
,
Pratt
 
G. W.
,
Piffaretti
 
R.
,
Böhringer
 
H.
,
Croston
 
J. H.
,
Pointecouteau
 
E.
,
2010
,
A&A
,
517
,
A92
 

Barrena
 
R.
 et al. ,
2018
,
A&A
,
616
,
A42
 

Barrena
 
R.
 et al. ,
2020
,
A&A
,
638
,
A146
 

Battye
 
R. A.
,
Weller
 
J.
,
2003
,
Phys. Rev. D
,
68
,
083506
 

Bleem
 
L. E.
, et al. ,
2015
,
ApJS
,
216
,
27
 

Bleem
 
L. E.
 et al. ,
2020
,
ApJS
,
247
,
25
 

Bleem
 
L. E.
 et al. ,
2022
,
ApJS
,
258
,
36
 

Bleem
 
L. E.
 et al. ,
2024
,
Open J. Astrophys.
,
7
,
13
 

Bocquet
 
S.
 et al. ,
2024a
,
Phys. Rev. D
,
110
,
083509
 

Bocquet
 
S.
 et al. ,
2024b
,
Phys. Rev. D
,
110
,
083510
 

Böhringer
 
H.
 et al. ,
2000
,
ApJS
.
129
,
435
 

Bolliet
 
B.
,
Spurio Mancini
 
A.
,
Hill
 
J. C.
,
Madhavacheril
 
M.
,
Jense
 
H. T.
,
Calabrese
 
E.
,
Dunkley
 
J.
,
2024
,
MNRAS
,
531
,
1351
 

Bulbul
 
E.
 et al. ,
2024
,
A&A
,
685
,
A106
 

Burenin
 
R. A.
,
Vikhlinin
 
A.
,
Hornstrup
 
A.
,
Ebeling
 
H.
,
Quintana
 
H.
,
Mescheryakov
 
A.
,
2007
,
ApJS
,
172
,
561
 

Burenin
 
R. A.
 et al. ,
2018
,
Astron. Lett.
,
44
,
297
 

Burke
 
D. J.
,
Collins
 
C. A.
,
Sharples
 
R. M.
,
Romer
 
A. K.
,
Nichol
 
R. C.
,
2003
,
MNRAS
,
341
,
1093
 

Carlstrom
 
J. E.
,
Holder
 
G. P.
,
Reese
 
E. D.
,
2002
,
ARA&A
,
40
,
643
 

Challinor
 
A.
,
Lasenby
 
A.
,
1998
,
ApJ
,
499
,
1
 

Chluba
 
J.
,
Nagai
 
D.
,
Sazonov
 
S.
,
Nelson
 
K.
,
2012
,
MNRAS
,
426
,
510
 

Chluba
 
J.
,
Switzer
 
E.
,
Nelson
 
K.
,
Nagai
 
D.
,
2013
,
MNRAS
,
430
,
3054
 

Chluba
 
J.
,
Hill
 
J. C.
,
Abitbol
 
M. H.
,
2017
,
MNRAS
,
472
,
1195
 

Chon
 
G.
,
Böhringer
 
H.
,
2012
,
A&A
,
538
,
A35
 

Coulton
 
W.
 et al. ,
2024
,
Phys. Rev. D
,
109
,
063530
 

Cruddace
 
R.
 et al. ,
2002
,
ApJS
,
140
,
239
 

Delabrouille
 
J.
,
Cardoso
 
J. F.
,
Le Jeune
 
M.
,
Betoule
 
M.
,
Fay
 
G.
,
Guilloux
 
F.
,
2009
,
A&A
,
493
,
835
 

Ebeling
 
H.
,
Edge
 
A. C.
,
Bohringer
 
H.
,
Allen
 
S. W.
,
Crawford
 
C. S.
,
Fabian
 
A. C.
,
Voges
 
W.
,
Huchra
 
J. P.
,
1998
,
MNRAS
,
301
,
881
 

Ebeling
 
H.
,
Edge
 
A. C.
,
Allen
 
S. W.
,
Crawford
 
C. S.
,
Fabian
 
A. C.
,
Huchra
 
J. P.
,
2000
,
MNRAS
,
318
,
333
 

Ebeling
 
H.
,
Mullis
 
C. R.
,
Tully
 
R. B.
,
2002
,
ApJ
,
580
,
774
 

Ebeling
 
H.
,
Barrett
 
E.
,
Donovan
 
D.
,
Ma
 
C. J.
,
Edge
 
A. C.
,
van Speybroeck
 
L.
,
2007
,
ApJ
,
661
:
L33
 

Erler
 
J.
,
Basu
 
K.
,
Chluba
 
J.
,
Bertoldi
 
F.
,
2018
,
MNRAS
,
476
,
3360
 

Finoguenov
 
A.
 et al. ,
2020
,
A&A
,
638
,
A114
 

Giard
 
M.
,
Montier
 
L.
,
Pointecouteau
 
E.
,
Simmat
 
E.
,
2008
,
A&A
,
490
,
547
 

Gioia
 
I. M.
,
Luppino
 
G. A.
,
1994
,
ApJS
,
94
,
583
 

Górski
 
K. M.
,
Hivon
 
E.
,
Banday
 
A. J.
,
Wandelt
 
B. D.
,
Hansen
 
F. K.
,
Reinecke
 
M.
,
Bartelmann
 
M.
,
2005
,
ApJ
,
622
,
759
 

Gruetjen
 
H. F.
,
Fergusson
 
J. R.
,
Liguori
 
M.
,
Shellard
 
E. P. S.
,
2017
,
Phys. Rev. D
,
95
,
043532
 

Hasselfield
 
M.
 et al. ,
2013
,
J. Cosmol. Astropart. Phys.
,
7
,
008
 

Henry
 
J. P.
,
2004
,
ApJ
,
609
,
603
 

Henry
 
J. P.
,
Mullis
 
C. R.
,
Voges
 
W.
,
Böhringer
 
H.
,
Briel
 
U. G.
,
Gioia
 
I. M.
,
Huchra
 
J. P.
,
2006
,
ApJS
,
162
,
304
 

Herranz
 
D.
,
Sanz
 
J. L.
,
Hobson
 
M. P.
,
Barreiro
 
R. B.
,
Diego
 
J. M.
,
Martínez-González
 
E.
,
Lasenby
 
A. N.
,
2002
,
MNRAS
,
336
,
1057
 

Hilton
 
M.
, et al. ,
2018
,
ApJS
,
235
,
20
 

Hilton
 
M.
 et al. ,
2021
,
ApJS
,
253
,
3
 

Hivon
 
E.
,
Górski
 
K. M.
,
Netterfield
 
C. B.
,
Crill
 
B. P.
,
Prunet
 
S.
,
Hansen
 
F.
,
2002
,
ApJ
,
567
,
2
 

Horner
 
D. J.
,
Perlman
 
E. S.
,
Ebeling
 
H.
,
Jones
 
L. R.
,
Scharf
 
C. A.
,
Wegner
 
G.
,
Malkan
 
M.
,
Maughan
 
B.
,
2008
,
ApJS
,
176
,
374
 

Itoh
 
N.
,
Kohyama
 
Y.
,
Nozawa
 
S.
,
1998
,
ApJ
,
502
,
7
 

Klein
 
M.
,
Hernández-Lang
 
D.
,
Mohr
 
J. J.
,
Bocquet
 
S.
,
Singh
 
A.
,
2023
,
MNRAS
,
526
,
3757
 

Klein
 
M.
,
Mohr
 
J. J.
,
Davies
 
C. T.
,
2024
,
A&A
,
690
,
A322
 

Kocevski
 
D. D.
,
Ebeling
 
H.
,
Mullis
 
C. R.
,
Tully
 
R. B.
,
2007
,
ApJ
,
662
,
224
 

Lagache
 
G.
,
Puget
 
J.-L.
,
Dole
 
H.
,
2005
,
ARA&A
,
43
,
727
 

Lee
 
E.
,
Chluba
 
J.
,
Kay
 
S. T.
,
Barnes
 
D. J.
,
2020
,
MNRAS
,
493
,
3274
 

Lee
 
E.
 et al. ,
2022
,
MNRAS
,
517
,
5303
 

Lee
 
E.
,
Battye
 
R.
,
Bolliet
 
B.
,
2025
,
MNRAS
,
in press
 

Mann
 
A. W.
,
Ebeling
 
H.
,
2012
,
MNRAS
,
420
,
2120
 

Mantz
 
A.
,
Allen
 
S. W.
,
Ebeling
 
H.
,
Rapetti
 
D.
,
Drlica-Wagner
 
A.
,
2010
,
MNRAS
,
406
,
1773
 

Marriage
 
T. A.
 et al. ,
2011
,
ApJ
,
737
,
61
 

McCarthy
 
F.
,
Hill
 
J. C.
,
2024a
,
Phys. Rev. D
,
109
,
023528
 

McCarthy
 
F.
,
Hill
 
J. C.
,
2024b
,
Phys. Rev. D
,
109
,
023529
 

Melin
 
J. B.
,
Bartlett
 
J. G.
,
Delabrouille
 
J.
,
2006
,
A&A
,
459
,
341
 

Melin
 
J. B.
,
Bartlett
 
J. G.
,
Cai
 
Z. Y.
,
De Zotti
 
G.
,
Delabrouille
 
J.
,
Roman
 
M.
,
Bonaldi
 
A.
,
2018
,
A&A
,
617
,
A75
 

Montier
 
L. A.
,
Giard
 
M.
,
2005
;
A&A
,
439
,
35
 

Mroczkowski
 
T.
,
Nagai
 
D.
 et al. ,
2019
,
Space Sci. Rev.
,
215
:
17
 

Mullis
 
C. R.
 et al. ,
2003
,
ApJ
,
594
,
154
 

Nagai
 
D.
,
Kravtsov
 
A. V.
,
Vikhlinin
 
A.
,
2007
,
ApJ
,
668
,
1
 

Pandey
 
S.
 et al. ,
2019
,
Phys. Rev. D
,
100
,
063519
 

Perlman
 
E. S.
,
Horner
 
D. J.
,
Jones
 
L. R.
,
Scharf
 
C. A.
,
Ebeling
 
H.
,
Wegner
 
G.
,
Malkan
 
M.
,
2002
,
ApJS
,
140
,
265
 

Perrott
 
Y.
,
2024
,
PASA
,
41
,
e087
 

Piffaretti
 
R.
,
Arnaud
 
M.
,
Pratt
 
G. W.
,
Pointecouteau
 
E.
,
Melin
 
J. B.
,
2011
,
A&A
,
534
,
A109
 

Planck Collaboration XXIX
,
2014
,
A&A
,
571
,
A29
 

Planck Collaboration VIII
,
2016
,
A&A
,
594
,
A8
 

Planck Collaboration XXIII
,
2016
,
A&A
,
594
,
A23
 

Planck Collaboration XXIV
,
2016
,
A&A
,
594
,
A24
 

Planck Collaboration XXVI
,
2016
,
A&A
,
594
,
A26
 

Planck Collaboration XXVII
,
2016
,
A&A
,
594
,
A27
 

Planck Collaboration XXVIII,
 
2016
,
A&A
,
594
,
A28
 

Planck Collaboration III
,
2020
,
A&A
,
641
,
A3
 

Planck Collaboration VI
,
2020
,
A&A
,
641
,
A6
 

Planck Collaboration XXX
,
2014
,
A&A
,
571
,
A30
 

Planck Collaboration I
,
2012
,
A&A
,
543
,
A102
 

Planck Collaboration XLIII
,
2016
,
A&A
,
596
,
A104
 

Refregier
 
A.
,
Komatsu
 
E.
,
Spergel
 
D. N.
,
Pen
 
U.-L.
,
2000
,
Phys. Rev. D
,
61
,
123001
 

Remazeilles
 
M.
,
Bolliet
 
B.
,
Rotti
 
A.
,
Chluba
 
J.
,
2019
,
MNRAS
,
483
,
3459
 

Repp
 
A.
,
Ebeling
 
H.
,
2018
,
MNRAS
,
479
,
844
 

Romer
 
A. K.
 et al. ,
2000
,
ApJS
,
126
,
209
 

Rotti
 
A.
,
Chluba
 
J.
,
2021
,
MNRAS
,
500
,
976
 

Rotti
 
A.
,
Bolliet
 
B.
,
Chluba
 
J.
,
Remazeilles
 
M.
,
2021
,
MNRAS
,
503
,
5310
 

Sadibekova
 
T.
,
Arnaud
 
M.
,
Pratt
 
G. W.
,
Tarrío
 
P.
,
Melin
 
J. B.
,
2024
,
A&A
,
688
,
A187
 

Salvati
 
L.
,
Douspis
 
M.
,
Aghanim
 
N.
,
2018
,
A&A
,
614
,
A13
 

Sánchez
 
J.
 et al. ,
2023
,
MNRAS
,
522
,
3163
 

Sazonov
 
S. Y.
,
Sunyaev
 
R. A.
,
1998
,
ApJ
,
508
,
1
 

Spurio Mancini
 
A.
,
Piras
 
D.
,
Alsing
 
J.
,
Joachimi
 
B.
,
Hobson
 
M. P.
,
2022
,
MNRAS
,
511
,
1771
 

Stein
 
G.
,
Alvarez
 
M. A.
,
Bond
 
J. R.
,
2019
,
MNRAS
,
483
,
2236
 

Stein
 
G.
,
Alvarez
 
M. A.
,
Bond
 
J. R.
,
van Engelen
 
A.
,
Battaglia
 
N.
,
2020
,
J. Cosmology Astropart. Phys.
,
2020
,
012
 

Streblyanska
 
A.
,
Barrena
 
R.
,
Rubiño-Martín
 
J. A.
,
van der Burg
 
R. F. J.
,
Aghanim
 
N.
,
Aguado-Barahona
 
A.
,
Ferragamo
 
A.
,
Lietzen
 
H.
,
2018
,
A&A
,
617
,
A71
 

Streblyanska
 
A.
,
Aguado-Barahona
 
A.
,
Ferragamo
 
A.
,
Barrena
 
R.
,
Rubiño-Martín
 
J. A.
,
Tramonte
 
D.
,
Genova-Santos
 
R. T.
,
Lietzen
 
H.
,
2019
,
A&A
,
628
,
A13
 

Sunyaev
 
R. A.
,
Zeldovich
 
Y. B.
,
1972
,
Comment Astrophys. Space Phys.
,
4
,
173

Tinker
 
J.
,
Kravtsov
 
A. V.
,
Klypin
 
A.
,
Abazajian
 
K.
,
Warren
 
M.
,
Yepes
 
G.
,
Gottlöber
 
S.
,
Holz
 
D. E.
,
2008
,
ApJ
,
688
,
709
 

Vacher
 
L.
,
Chluba
 
J.
,
Aumont
 
J.
,
Rotti
 
A.
,
Montier
 
L.
,
2022
,
preprint
()

van der Burg
 
R. F. J.
 et al. ,
2016
,
A&A
,
587
,
A23
 

van der Burg
 
R. F. J.
,
McGee
 
S.
,
Aussel
 
H.
,
Dahle
 
H.
,
Arnaud
 
M.
,
Pratt
 
G. W.
,
Muzzin
 
A.
,
2018
,
A&A
,
618
,
A140
 

Vanderlinde
 
K.
 et al. ,
2010
,
ApJ
,
722
,
1180
 

Weinberg
 
D. H.
,
Mortonson
 
M. J.
,
Eisenstein
 
D. J.
,
Hirata
 
C.
,
Riess
 
A. G.
,
Rozo
 
E.
,
2013
,
Phys. Rep.
,
530
,
87
 

Xu
 
W.
,
Ramos-Ceja
 
M. E.
,
Pacaud
 
F.
,
Reiprich
 
T. H.
,
Erben
 
T.
,
2022
,
A&A
,
658
,
A59
 

Yan
 
Z.
,
Hojjati
 
A.
,
Tröster
 
T.
,
Hinshaw
 
G.
,
van Waerbeke
 
L.
,
2019
,
ApJ
,
884
:
139
 

Zaznobin
 
I. A.
 et al. ,
2019
,
Astron. Lett.
,
45
,
49
 

Zaznobin
 
I. A.
 et al. ,
2020
,
Astron. Lett.
,
46
,
79
 

Zohren
 
H.
,
Schrabback
 
T.
,
van der Burg
 
R. F. J.
,
Arnaud
 
M.
,
Melin
 
J.-B.
,
van den Busch
 
J. L.
,
Hoekstra
 
H.
,
Klein
 
M.
,
2019
,
MNRAS
,
488
,
2523
 

Zubeldia
 
Í.
,
2024
, in
Mayet
 
F.
,
Catalano
 
A.
,
Macías-Pérez
 
J.F.
,
Perotto
 
L.
, eds,
mm Universe 2023 - Observing the Universe at mm Wavelengths, Vol. 293
.
EPJ Web of Conferences
,
France
, p.
00053
 

Zubeldia
 
Í.
,
Bolliet
 
B.
,
2024
,
J. Cosmol. Astropart. Phys.
,
2024
,
018
 

Zubeldia
 
Í.
,
Challinor
 
A.
,
2019
,
MNRAS
,
489
,
401
 

Zubeldia
 
Í.
,
Rotti
 
A.
,
Chluba
 
J.
,
Battye
 
R.
,
2021
,
MNRAS
,
507
,
4852
 

Zubeldia
 
Í.
,
Rotti
 
A.
,
Chluba
 
J.
,
Battye
 
R.
,
2023a
,
MNRAS
,
522
,
4766
 

Zubeldia
 
Í.
,
Chluba
 
J.
,
Battye
 
R.
,
2023b
,
MNRAS
,
522
,
5123
 

APPENDIX A: THE Planck  SZiFi MASTER CATALOGUE: CATALOGUE ENTRIES

In Table A1, we offer a brief description of all the entries in the Planck  SZiFi master catalogue. We recall that the master catalogue must be seen as a container of the 10 Planck  SZiFi catalogues (together with their 10 non-iterative counterparts), each of which has its own tSZ selection using a different MMF. We strongly recommend to use the individual catalogues (e.g. our baseline catalogue, the iMMF6 catalogue) for any practical application instead of the full master catalogue, as they have a simple, well-defined tSZ selection, as well as a significantly higher purity than the full master catalogue (see Section 4).

Table A1.

Description of the entries in the Planck  SZiFi master catalogue.

EntryDescription
nameName of the detection.
SNR_nameMMFDetection signal-to-noise |$q_{\mathrm{obs}}$| for the MMF nameMMF, where nameMMF can be: iMMF6, iMMF5, iMMF4,
 sciMMF6, sciMMF5, sciMMF4, sciMMF6_beta, sciMMF6_betaT, sciMMF5_beta, sciMMF5_betaT. Add
 noit before nameMMF to retrieve the non-iterative signal-to-noise.
y0_nameMMFDetection |$\hat{y}_0$|⁠.
theta500_nameMMFDetection angular scale |$\hat{\theta }_{500}$| in acrmin.
tile_nameMMFDetection selection tile.
glon_nameMMFDetection Galactic longitude.
glat_nameMMFDetection Galactic latitude.
thetax_nameMMFDetection x-coordinate in the local selection tile frame in rad.
thetay_nameMMFDetection y-coordinate in the local selection tile frame in rad.
SNR_fixed_nameMMFSNR extracted at angular scale of 5 arcmin and mean sky position (see Section 7).
y0_fixed_nameMMF|$\hat{y}_0$| extracted at angular scale of 5 arcmin and mean sky position.
SNR_fixed_rel_nameMMFSNR extracted at angular scale of 5 arcmin and mean sky position assuming the relativistic tSZ
 SED (see Section 8).
y0_fixed_rel_nameMMF|$\hat{y}_0$| extracted at angular scale of 5 arcmin and mean sky position assuming the relativistic tSZ SED.
glonGalactic longitude, assigned following the master catalogue construction hierarchy.
glatGalactic latitude, assigned following the master catalogue construction hierarchy.
raRight ascension, obtained from glon and glat.
decDeclination, obtained from glon and glat.
cosmology_maskWhether the detection is contained within the cosmology mask (1 if so, -1 otherwise).
cross-matchedWhether cross-match has been found with the external catalogues (1 if so, -1 otherwise).
confirmedWhether the detection is a confirmed cluster (1), an unconfirmed object (0), or a confirmed false
 detection (-1); see Section 4.
flag_gccWhether a cross-match has been found with a Planck GCC (1 if so, -1 otherwise).
redshiftCluster redshift.
redshift_typeRedshift type (spec, phot, or -1).
temperatureCompton-y-weighted temperature estimate in keV (see Section 8).
name_mcszName in the MCSZ meta-catalogue.
name_mcxcName in the MCXC-II meta-catalogue.
name_actdr5Name in the ACT DR5 catalogue.
name_codexName in the CODEX catalogue.
name_rass-mcmfName in the RASS-MCMF catalogue.
name_erassName in the eRASS1 catalogue.
EntryDescription
nameName of the detection.
SNR_nameMMFDetection signal-to-noise |$q_{\mathrm{obs}}$| for the MMF nameMMF, where nameMMF can be: iMMF6, iMMF5, iMMF4,
 sciMMF6, sciMMF5, sciMMF4, sciMMF6_beta, sciMMF6_betaT, sciMMF5_beta, sciMMF5_betaT. Add
 noit before nameMMF to retrieve the non-iterative signal-to-noise.
y0_nameMMFDetection |$\hat{y}_0$|⁠.
theta500_nameMMFDetection angular scale |$\hat{\theta }_{500}$| in acrmin.
tile_nameMMFDetection selection tile.
glon_nameMMFDetection Galactic longitude.
glat_nameMMFDetection Galactic latitude.
thetax_nameMMFDetection x-coordinate in the local selection tile frame in rad.
thetay_nameMMFDetection y-coordinate in the local selection tile frame in rad.
SNR_fixed_nameMMFSNR extracted at angular scale of 5 arcmin and mean sky position (see Section 7).
y0_fixed_nameMMF|$\hat{y}_0$| extracted at angular scale of 5 arcmin and mean sky position.
SNR_fixed_rel_nameMMFSNR extracted at angular scale of 5 arcmin and mean sky position assuming the relativistic tSZ
 SED (see Section 8).
y0_fixed_rel_nameMMF|$\hat{y}_0$| extracted at angular scale of 5 arcmin and mean sky position assuming the relativistic tSZ SED.
glonGalactic longitude, assigned following the master catalogue construction hierarchy.
glatGalactic latitude, assigned following the master catalogue construction hierarchy.
raRight ascension, obtained from glon and glat.
decDeclination, obtained from glon and glat.
cosmology_maskWhether the detection is contained within the cosmology mask (1 if so, -1 otherwise).
cross-matchedWhether cross-match has been found with the external catalogues (1 if so, -1 otherwise).
confirmedWhether the detection is a confirmed cluster (1), an unconfirmed object (0), or a confirmed false
 detection (-1); see Section 4.
flag_gccWhether a cross-match has been found with a Planck GCC (1 if so, -1 otherwise).
redshiftCluster redshift.
redshift_typeRedshift type (spec, phot, or -1).
temperatureCompton-y-weighted temperature estimate in keV (see Section 8).
name_mcszName in the MCSZ meta-catalogue.
name_mcxcName in the MCXC-II meta-catalogue.
name_actdr5Name in the ACT DR5 catalogue.
name_codexName in the CODEX catalogue.
name_rass-mcmfName in the RASS-MCMF catalogue.
name_erassName in the eRASS1 catalogue.
Table A1.

Description of the entries in the Planck  SZiFi master catalogue.

EntryDescription
nameName of the detection.
SNR_nameMMFDetection signal-to-noise |$q_{\mathrm{obs}}$| for the MMF nameMMF, where nameMMF can be: iMMF6, iMMF5, iMMF4,
 sciMMF6, sciMMF5, sciMMF4, sciMMF6_beta, sciMMF6_betaT, sciMMF5_beta, sciMMF5_betaT. Add
 noit before nameMMF to retrieve the non-iterative signal-to-noise.
y0_nameMMFDetection |$\hat{y}_0$|⁠.
theta500_nameMMFDetection angular scale |$\hat{\theta }_{500}$| in acrmin.
tile_nameMMFDetection selection tile.
glon_nameMMFDetection Galactic longitude.
glat_nameMMFDetection Galactic latitude.
thetax_nameMMFDetection x-coordinate in the local selection tile frame in rad.
thetay_nameMMFDetection y-coordinate in the local selection tile frame in rad.
SNR_fixed_nameMMFSNR extracted at angular scale of 5 arcmin and mean sky position (see Section 7).
y0_fixed_nameMMF|$\hat{y}_0$| extracted at angular scale of 5 arcmin and mean sky position.
SNR_fixed_rel_nameMMFSNR extracted at angular scale of 5 arcmin and mean sky position assuming the relativistic tSZ
 SED (see Section 8).
y0_fixed_rel_nameMMF|$\hat{y}_0$| extracted at angular scale of 5 arcmin and mean sky position assuming the relativistic tSZ SED.
glonGalactic longitude, assigned following the master catalogue construction hierarchy.
glatGalactic latitude, assigned following the master catalogue construction hierarchy.
raRight ascension, obtained from glon and glat.
decDeclination, obtained from glon and glat.
cosmology_maskWhether the detection is contained within the cosmology mask (1 if so, -1 otherwise).
cross-matchedWhether cross-match has been found with the external catalogues (1 if so, -1 otherwise).
confirmedWhether the detection is a confirmed cluster (1), an unconfirmed object (0), or a confirmed false
 detection (-1); see Section 4.
flag_gccWhether a cross-match has been found with a Planck GCC (1 if so, -1 otherwise).
redshiftCluster redshift.
redshift_typeRedshift type (spec, phot, or -1).
temperatureCompton-y-weighted temperature estimate in keV (see Section 8).
name_mcszName in the MCSZ meta-catalogue.
name_mcxcName in the MCXC-II meta-catalogue.
name_actdr5Name in the ACT DR5 catalogue.
name_codexName in the CODEX catalogue.
name_rass-mcmfName in the RASS-MCMF catalogue.
name_erassName in the eRASS1 catalogue.
EntryDescription
nameName of the detection.
SNR_nameMMFDetection signal-to-noise |$q_{\mathrm{obs}}$| for the MMF nameMMF, where nameMMF can be: iMMF6, iMMF5, iMMF4,
 sciMMF6, sciMMF5, sciMMF4, sciMMF6_beta, sciMMF6_betaT, sciMMF5_beta, sciMMF5_betaT. Add
 noit before nameMMF to retrieve the non-iterative signal-to-noise.
y0_nameMMFDetection |$\hat{y}_0$|⁠.
theta500_nameMMFDetection angular scale |$\hat{\theta }_{500}$| in acrmin.
tile_nameMMFDetection selection tile.
glon_nameMMFDetection Galactic longitude.
glat_nameMMFDetection Galactic latitude.
thetax_nameMMFDetection x-coordinate in the local selection tile frame in rad.
thetay_nameMMFDetection y-coordinate in the local selection tile frame in rad.
SNR_fixed_nameMMFSNR extracted at angular scale of 5 arcmin and mean sky position (see Section 7).
y0_fixed_nameMMF|$\hat{y}_0$| extracted at angular scale of 5 arcmin and mean sky position.
SNR_fixed_rel_nameMMFSNR extracted at angular scale of 5 arcmin and mean sky position assuming the relativistic tSZ
 SED (see Section 8).
y0_fixed_rel_nameMMF|$\hat{y}_0$| extracted at angular scale of 5 arcmin and mean sky position assuming the relativistic tSZ SED.
glonGalactic longitude, assigned following the master catalogue construction hierarchy.
glatGalactic latitude, assigned following the master catalogue construction hierarchy.
raRight ascension, obtained from glon and glat.
decDeclination, obtained from glon and glat.
cosmology_maskWhether the detection is contained within the cosmology mask (1 if so, -1 otherwise).
cross-matchedWhether cross-match has been found with the external catalogues (1 if so, -1 otherwise).
confirmedWhether the detection is a confirmed cluster (1), an unconfirmed object (0), or a confirmed false
 detection (-1); see Section 4.
flag_gccWhether a cross-match has been found with a Planck GCC (1 if so, -1 otherwise).
redshiftCluster redshift.
redshift_typeRedshift type (spec, phot, or -1).
temperatureCompton-y-weighted temperature estimate in keV (see Section 8).
name_mcszName in the MCSZ meta-catalogue.
name_mcxcName in the MCXC-II meta-catalogue.
name_actdr5Name in the ACT DR5 catalogue.
name_codexName in the CODEX catalogue.
name_rass-mcmfName in the RASS-MCMF catalogue.
name_erassName in the eRASS1 catalogue.

APPENDIX B: FIXED COMPTON-y RESULTS FOR A MASS-SELECTED CLUSTER SAMPLE

In this appendix, we offer further investigation on the origin of the two features seen in the re-extracted fixed |$\hat{y}_0$| measurements for the randomized Websky catalogues that are discussed in Section 7 (see, in particular, Fig. 8). We recall that |$\hat{y}_0$| is the MMF amplitude parameter and that these features are (i) a small shift in |$\hat{y}_0$| for some MMFs for large |$\hat{y}_0$| values and (ii) the shape of the |$\hat{y}_0$| distribution at the low-|$\hat{y}_0$| end. We do this by extracting |$\hat{y}_0$| from our synthetic Websky-based CIB-randomized maps for all our 10 iterative MMFs for a mass-selected cluster sample, instead of the sample analysed in Section 7. In particular, we consider the Websky halo catalogue with |$M_{500} > 6 \times 10^{14} M_{\odot }$|⁠, which contains 557 objects. We extract |$\hat{y}_0$| using the SZiFi-based re-extraction pipeline presented in Section 7 at the true halo sky positions and at a fixed angular size of 5 arcmin, the latter following what was done in Section 7.

Our measurements are presented in Fig. B1, with each panel showing the |$\hat{y}_0$| measurements for the panel’s MMF on the vertical axis and those for iMMF6 on the horizontal axis. No evidence for either of the features is seen here, with the measurements for each MMF being consistent with those for iMMF6 across the full |$\hat{y}_0$| range. Therefore, we can conclusively attribute the two features to the selection of the sample analysed in Section 7.

Re-extracted fixed measurements of the MMF amplitude parameter $\hat{y}_0$ obtained from our Planck-like Websky-based synthetic maps with the randomized CIB for a mass selected sample. In each panel, the $\hat{y}_0$ measurements for the panel’s MMF are plotted against those for iMMF6 (vertical and horizontal axes, respectively). Excellent agreement across all MMFs is observed for the full $\hat{y}_0$ range. Compare with the analogous measurements for a tSZ-selected sample in Fig. 8.
Figure B1.

Re-extracted fixed measurements of the MMF amplitude parameter |$\hat{y}_0$| obtained from our Planck-like Websky-based synthetic maps with the randomized CIB for a mass selected sample. In each panel, the |$\hat{y}_0$| measurements for the panel’s MMF are plotted against those for iMMF6 (vertical and horizontal axes, respectively). Excellent agreement across all MMFs is observed for the full |$\hat{y}_0$| range. Compare with the analogous measurements for a tSZ-selected sample in Fig. 8.

APPENDIX C: VALIDATION OF CLUSTER DETECTION PIPELINE FOR ALL MMFS

Fig. C1 shows the number counts across both signal-to-noise ratio and redshift for the confirmed clusters in all our iterative validation catalogues (red histograms) together with the corresponding theoretical prediction with the optimization bias taken into account (blue data points). As was the case for the iMMF6 validation catalogue (see Fig. 12), our theoretical model provides, overall, a good description of the observed counts.

Number counts a function of both signal-to-noise ratio and redshift for all our iterative validation catalogues (red histograms), shown together with their corresponding theoretical prediction with the optimization bias taken into account (blue data points). Only the confirmed (i.e. true) detections in the catalogues are considered.
Figure C1.

Number counts a function of both signal-to-noise ratio and redshift for all our iterative validation catalogues (red histograms), shown together with their corresponding theoretical prediction with the optimization bias taken into account (blue data points). Only the confirmed (i.e. true) detections in the catalogues are considered.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.