Modelling baryonic physics in future weak lensing surveys Free

Basic information for the hydrodynamical simulations used in this work.

Simulation	Box length	Total	DM particle	Initial gas	Force softening	Cosmology
		particle #	mass	particle mass	length
OWLS	100 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 512³	\|$4.06 \times 10^8 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$8.66 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$\|	0.78 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP3
MassiveBlack-II	100 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1972³	\|$1.1 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$2.2 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.85 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP7
Illustris	75 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1820³	\|$4.41 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$8.87 \times 10^5 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.4 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP7
Eagle	67.77 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1504³	\|$6.57 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$1.23 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.8 \|$\ h^{-1}\, \mathrm{kpc}$\|	Planck2013
Horizon-AGN	100 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1024³	\|$1.1 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$2.2 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.85 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP7

Simulation	Box length	Total	DM particle	Initial gas	Force softening	Cosmology
		particle #	mass	particle mass	length
OWLS	100 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 512³	\|$4.06 \times 10^8 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$8.66 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$\|	0.78 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP3
MassiveBlack-II	100 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1972³	\|$1.1 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$2.2 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.85 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP7
Illustris	75 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1820³	\|$4.41 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$8.87 \times 10^5 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.4 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP7
Eagle	67.77 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1504³	\|$6.57 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$1.23 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.8 \|$\ h^{-1}\, \mathrm{kpc}$\|	Planck2013
Horizon-AGN	100 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1024³	\|$1.1 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$2.2 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.85 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP7

Table 1.

Basic information for the hydrodynamical simulations used in this work.

Simulation	Box length	Total	DM particle	Initial gas	Force softening	Cosmology
		particle #	mass	particle mass	length
OWLS	100 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 512³	\|$4.06 \times 10^8 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$8.66 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$\|	0.78 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP3
MassiveBlack-II	100 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1972³	\|$1.1 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$2.2 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.85 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP7
Illustris	75 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1820³	\|$4.41 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$8.87 \times 10^5 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.4 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP7
Eagle	67.77 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1504³	\|$6.57 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$1.23 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.8 \|$\ h^{-1}\, \mathrm{kpc}$\|	Planck2013
Horizon-AGN	100 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1024³	\|$1.1 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$2.2 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.85 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP7

Simulation	Box length	Total	DM particle	Initial gas	Force softening	Cosmology
		particle #	mass	particle mass	length
OWLS	100 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 512³	\|$4.06 \times 10^8 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$8.66 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$\|	0.78 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP3
MassiveBlack-II	100 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1972³	\|$1.1 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$2.2 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.85 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP7
Illustris	75 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1820³	\|$4.41 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$8.87 \times 10^5 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.4 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP7
Eagle	67.77 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1504³	\|$6.57 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$1.23 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.8 \|$\ h^{-1}\, \mathrm{kpc}$\|	Planck2013
Horizon-AGN	100 \|$\ h^{-1}\, \mathrm{Mpc}$\|	2 × 1024³	\|$1.1 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$\|	\|$2.2 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$\|	1.85 \|$\ h^{-1}\, \mathrm{kpc}$\|	WMAP7

2.1 OWLS simulation suite

The OWLS simulations are a large suite of cosmological hydrodynamical simulations with varying implementations of subgrid physics to enable investigations of the effects of altering or adding a single physical process on the total matter distribution (Schaye et al. 2010). Here we adopt nine different baryonic simulations from OWLS. We refer readers to van Daalen et al. (2011) for a more detailed description.

REF: The baseline simulation that contains many of the physical processes known to be important for galaxy formation except for the active galactic nucleus (AGN) feedback mechanism. REF includes prescriptions of radiative cooling and heating for 11 different elements, star formation assuming the Chabrier (2003) stellar initial mass function (IMF), stellar evolution, mass-loss, chemical enrichment, and SN feedback in kinetic form (wind mass loading factor η = 2 and initial wind velocity v_w = 600 km s⁻¹ ; all together |$\eta v_{\rm w}^2$| determines the energy injected into the winds per unit stellar mass). The other eight hydro simulations are based on REF, with modifications indicated below.
NOSN: Exclude SN feedback.
NOZCOOL: Exclude metal-line cooling. Only assume primordial abundances when computing cooling rates.
NOSN_NOZCOOL: Exclude both SN feedback and metal-line cooling.
WML1V848: Adopt the same SN feedback energy per unit stellar mass as for REF, but reduce the mass loading factor by a factor of 2 (η = 1) and increase the wind velocity by a factor of |$\sqrt{2}$| (v_w = 848 km s⁻¹).
WDENS: Adopt the same SN feedback energy per unit stellar mass as that of REF, but let η and v_w depend on gas density (⁠|$v_{\rm w} \propto n_{\rm H}^{1/6}$|⁠; |$\eta \propto n_{\rm H}^{-1/3}$|⁠).
WML4: Double SN feedback per unit stellar mass by increasing the mass loading factor by a factor of 2 (η = 4).
DBLIMFV1618: Once the gas reaches a certain pressure threshold, 10 per cent of the star formation activity follows a top-heavy IMF. In this case, more high-mass stars are produced, which leads to higher SN energy feedback.
AGN: In addition to physics included in the REF model, add a subgrid model for BH evolution and AGN feedback following the prescription of Booth & Schaye (2009). BHs inject 1.5 per cent of the rest-mass energy of the accreted gas into the surrounding matter in the form of heat.

The simulation cube for OWLS is |$L = 100 \ h^{-1}\, \mathrm{Mpc}$| in comoving scale on a side. The OWLS-DMO simulation contains 512³ collisionless DM particles; the nine hydro simulations contain an additional 512³ particles in the form of collisional gas or collisionless stars to capture the baryonic processes. The DM and (initial) gas particle masses are |$\approx 4.06 \times 10^8$| and |$8.66 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$|⁠, respectively. The gravitational softening length is |$\epsilon \approx 0.78 \ h^{-1}\, \mathrm{kpc}$| in comoving scale, and is limited to a maximum physical scale of |$2 \ h^{-1}\, \mathrm{kpc}$|⁠. The cosmological parameters used in the simulation are based on WMAP3 results (Spergel et al. 2007): {Ω_m, Ω_b, Ω_Λ, σ₈, n_s, h} = {0.238, 0.0418, 0.762, 0.74, 0.951, 0.73}.

The OWLS simulation sets are not specifically fine-tuned to match with key observables. As indicated in McCarthy et al. (2017), the original OWLS models underpredict the abundance of |$M_* \lt 10^{11} \, \mathrm{M}_{\odot }$| galaxies at the present day due to overly efficient stellar feedback (see their fig. 1). The successor BAHAMAS simulation lowers the wind velocity v_w from 600 to 300 km s⁻¹ in order to provide a better fit to the observed abundance of low-to-intermediate-mass galaxies.⁸

2.2 Eagle simulation

The Eagle simulation (Schaye et al. 2015) is conducted in a cubic periodic box of side length |$L = 67.77 \ h^{-1}\, \mathrm{Mpc}$| (comoving). There are 1504³ DM particles in both hydrodynamical and DMO simulations, and an approximately equal number of baryonic particles in the hydrodynamical run. The mass of each DM particle is |$6.57 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$| and the initial baryonic mass resolution is |$1.23 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$|⁠. The gravitational softening length is |$\epsilon = 1.8 \ h^{-1}\, \mathrm{kpc}$| in comoving units (The EAGLE team 2017). The cosmological parameters used in Eagle are consistent with Planck 2013 results (Planck Collaboration XVI 2014): {Ω_m, Ω_b, Ω_Λ, σ₈, n_s, h} = {0.307, 0.04825, 0.693, 0.8288, 0.9611, 0.6777}.

The subgrid physics used in Eagle is based on OWLS. The physical models include radiative cooling and photoionization heating; star formation associated with stellar mass-loss and energy feedback; BH mergers, gas accretion, and AGN feedback. The most important changes compared to OWLS are: star-forming feedback energy changing in terms of thermal form rather than kinetic; accounting for angular momentum during the accretion of gas onto BHs; inclusion of a metallicity-dependence in the star formation law. In contrast to many hydrodynamical simulations, Eagle employs stellar and AGN feedback only in thermal form, which captures the collective effects of mechanisms such as stellar winds, radiation pressure, SN feedback, and radio- and quasar-mode AGN feedback. One major improvement in the treatment of thermal feedback is that it can be performed without turning off radiative cooling and hydrodynamical forces.

The galaxy stellar mass function of Eagle matches extremely well with observations at z = 0.1, because its stellar and AGN feedback related parameters are specifically calibrated at each resolution to reproduce this observable (see Crain et al. 2015; Schaye et al. 2015 for details of calibration philosophy).

2.3 MassiveBlack-II simulation

The MassiveBlack-II (hereafter MB2) simulation is a high-resolution ΛCDM cosmological simulation (Khandai et al. 2015). Both DMO (Tenneti et al. 2015) and hydrodynamical MB2 simulations are conducted in a cubic simulation box with sides of length |$L = 100 \ h^{-1}\, \mathrm{Mpc}$| in comoving scale. There are 1972³ DM particles in both the MB2-hydro and MB2-DMO simulations, with an additional 1972³ initial number of gas particles in the hydro run. The mass of each DM particle is |$1.1 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$| and the initial baryonic mass resolution is |$2.2 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$|⁠. The gravitational softening length is |$\epsilon = 1.85 \ h^{-1}\, \mathrm{kpc}$| in comoving units. The cosmological parameters in MB2 are consistent with WMAP7 results (Komatsu et al. 2011): {Ω_m, Ω_b, Ω_Λ, σ₈, n_s, h} = {0.275, 0.046, 0.725, 0.816, 0.968, 0.701}.

The subgrid models of baryonic physics in MB2 include a multiphase interstellar medium model with star formation and associated feedback by SN and stellar winds (Springel & Hernquist 2003); BH accretion, merger, and associated AGN feedback in quasar-mode (Di Matteo, Springel & Hernquist 2005; Springel, Di Matteo & Hernquist 2005).

The AGN feedback efficiency of MB2 is relatively weak compared with other hydrodynamical simulations that have AGN subgrid physics involved in this work. One outcome of this is that MB2 overpredicts the abundance of massive galaxies at low redshift (Khandai et al. 2015).

2.4 Illustris simulation

The Illustris simulation (Vogelsberger et al. 2014) is carried out in a cubic periodic box with sides of length |$L = 75 \ h^{-1}\, \mathrm{Mpc}$| (comoving). We download the highest resolution snapshot data from the public release website (Nelson et al. 2015) to calculate power spectra for both hydrodynamical and DMO runs. There are 1820³ DM particles in both hydrodynamical and DMO simulations, and an approximately equal number of baryonic particles in the hydrodynamical run. The mass of each DM particle is |$4.41 \times 10^6 \ h^{-1}\, \mathrm{M}_{\odot }$| and the initial baryonic mass resolution is |$8.87 \times 10^5 \ h^{-1}\, \mathrm{M}_{\odot }$|⁠. The gravitational softening length is |$\epsilon = 1.4 \ h^{-1}\, \mathrm{kpc}$| in comoving units. The cosmological parameters adopted in Illustris are consistent with WMAP7 results (Komatsu et al. 2011): {Ω_m, Ω_b, Ω_Λ, σ₈, n_s, h} = {0.2726, 0.0456, 0.7274, 0.809, 0.963, 0.704}.

Illustris incorporates a broad range of galaxy formation physics (Vogelsberger et al. 2013): gas cooling in primordial and metal-lines; stellar evolution associated with chemical enrichment and stellar mass-loss; kinetic stellar feedback driven by SN; BH accretion, merging, and related AGN feedback in terms of quasar- and radio-modes as well as associated radiative electromagnetic feedback.

Illustris is run using the moving-mesh-based code arepo (Springel 2010), which is more efficient in cooling compared with classical particle-based SPH codes (e.g. Springel 2005). The energy input from feedback is designed to be strong to avoid efficient stellar mass build-up. Even with this setting, Illustris still overshoots the observed low-redshift stellar mass function on both high- and low-mass ends. The radio-mode AGN feedback is also too violent for the gas component, under predicting the baryon content in lower redshift high-mass haloes where the radio-mode feedback is the dominant heating channel (Genel et al. 2014; Haider et al. 2016). The successor IllustrisTNG simulation replaces the intense thermal energy dump of radio-mode feedback with kinematic kicks to heat up affected gas particles (Weinberger et al. 2018).

2.5 Horizon-AGN simulation

The Horizon-AGN (Dubois et al. 2014) is carried out in a cubic periodic box of side length |$L = 100 \ h^{-1}\, \mathrm{Mpc}$| (comoving). There are 1024³ DM particles in both the DMO and hydrodynamical runs, with the DM particle mass of |$9.9 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$| for the DMO run, and |$8.3 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$| for the hydrodynamical run. The initial gas particle mass is about |$1 \times 10^7 \ h^{-1}\, \mathrm{M}_{\odot }$|⁠. The cosmological parameters used in the simulation are compatible with WMAP7 cosmology (Komatsu et al. 2011): {Ω_m, Ω_b, Ω_Λ, σ₈, n_s, h} = {0.272, 0.045, 0.728, 0.81, 0.967, 0.704}.

Subgrid physics models for a variety of baryonic physics effects are implemented in Horizon-AGN. Gas is allowed to cool down to 10⁴ K via transition lines of hydrogen and helium as well as metals using the Sutherland & Dopita 1993 model. When the hydrogen number density exceeds a threshold of |$0.1\,\rm {H}\,\rm {cm}^{-3}$|⁠, star formation is triggered following a random Poisson process (Shandarin & Zeldovich 1989; Rasera & Teyssier 2006). SN feedback is taken into account assuming an IMF with a low-mass cut-off at 0.1 M_⊙ and a high-mass cut-off at 100 M_⊙. Chemical enrichment happens along with SN explosions and stellar winds. The AGN feedback is modelled in a combination of two different modes: the kinematic radio mode when |$\dot{M}_{\rm BH} / \dot{M}_{\rm Edd} \lt 0.01$| and the thermal quasar mode otherwise (Dubois et al. 2012).

Although Horizon-AGN is not specifically tuned to reproduce the galaxy stellar mass function at local Universe, it shows reasonable consistency with observations, with slight overproduction of galaxies at the low-mass end (Kaviraj et al. 2017).

2.6 Comparison of power spectra in hydrodynamical versus DMO simulations

From the snapshot data release of Eagle, MB2, and Illustris, we calculate the matter power spectra as detailed in Appendix A. For OWLS and Horizon-AGN simulations, we use the computed results from van Daalen et al. (2011) and Chisari et al. (2018), respectively. Power spectra from DMO simulations, with the same initial condition as their paired hydrodynamical simulations, are also computed in order to perform a fair comparison across simulations with different cosmological parameters and with reduced cosmic variance. For each paired simulation set, only a single realization was available to construct the power spectrum ratio.

Fig. 1 shows the z = 0 ratio of power spectra from different hydrodynamical simulations with respect to their counterpart DMO simulations. The thin lines indicate the nine different baryonic scenarios in the OWLS simulation suite. For the eight baryonic scenarios without AGN feedback, the common feature is a rapid increase in power on small scales. The power enhancement is due to efficient cooling of gas which eventually leads to the formation of galaxies within haloes, and further concentrates the DM distribution (Blumenthal et al. 1986). Simulations without SN feedback (NOSN, NOSN_NOZCOOL) tend to have an even stronger increase in power compared to the reference simulation REF due to the enhanced cooling effect. When adding AGN feedback to REF, the power is suppressed dramatically, with 1 per cent reduction for |$k \approx 0.3 \ h^{-1}\, \mathrm{Mpc}$| and exceeding 10 per cent for |$k \gtrsim 2 \ h^{-1}\, \mathrm{Mpc}$| (van Daalen et al. 2011). The suppression of power is due to baryons being pushed outward by the energetic AGN feedback processes.

$The ratios of the matter power spectra in different hydrodynamical simulations with respect to their counterpart DMO simulations at z = 0. The thick lines show results for the Eagle, MB2, and Illustris simulations, while the thin lines indicate the nine different baryonic scenarios in OWLS simulation suite. The grey vertical line separates between regions where the data points come from direct measurement ($k \lesssim 30 \ h^{-1}\, \mathrm{Mpc}$) and from extrapolation with a quadratic spline fit ($k \gtrsim 30 \ h^{-1}\, \mathrm{Mpc}$; see Appendix B for further details).$

Figure 1.

The ratios of the matter power spectra in different hydrodynamical simulations with respect to their counterpart DMO simulations at z = 0. The thick lines show results for the Eagle, MB2, and Illustris simulations, while the thin lines indicate the nine different baryonic scenarios in OWLS simulation suite. The grey vertical line separates between regions where the data points come from direct measurement (⁠|$k \lesssim 30 \ h^{-1}\, \mathrm{Mpc}$|⁠) and from extrapolation with a quadratic spline fit (⁠|$k \gtrsim 30 \ h^{-1}\, \mathrm{Mpc}$|⁠; see Appendix B for further details).

The thick lines represent power spectra ratio for Eagle, MB2, Illustris, and Horizon-AGN simulations. Although they all involve a broad range of astrophysical processes that are believed to be relevant to galaxy formation, the resulting power spectra show significant differences. The feedback mechanism in Illustris drastically suppresses the power by 35 per cent at |$k \approx 5 \ h^{-1}\, \mathrm{Mpc}$|⁠. Eagle reaches its maximum suppression of power of 20 per cent at |$k \approx 20 \ h^{-1}\, \mathrm{Mpc}$|⁠. A similar trend is also observed in Horizon-AGN, but it reaches its minimum amplitude reduction of 10 per cent at |$k \approx 10 \ h^{-1}\, \mathrm{Mpc}$|⁠. Going towards higher k, we start to see that the ratio curves bend upward and keep increasing beyond k of |$30 \ h^{-1}\, \mathrm{Mpc}$|⁠. The MB2 power spectrum behaves relatively similar to DMO, but still the baryonic prescription prevents the power spectrum ratio from growing too quickly compared to the OWLS scenarios without AGN feedback, which suffer from severe overcooling effect (e.g. Tornatore et al. 2003; McCarthy et al. 2011).

The input cosmologies (⁠|$\boldsymbol p_{\rm co, sim}$|⁠) for the five simulation suites are different. In order to predict matter power spectra with baryonic effects for arbitrary cosmological parameters, we take the power spectrum ratios shown Fig. 1 and apply the following equation:

\begin{eqnarray*} P_{\delta }^{\rm hydro}(k, z\ |\ \boldsymbol p_{\rm co}) = \frac{ P_{\delta }^{\rm hydro, sim}(k, z\ |\ \boldsymbol p_{\rm co, sim}) }{ P_{\delta }^{\rm DMO, sim}(k, z\ |\ \boldsymbol p_{\rm co, sim}) } P_{\delta }^{\rm theory}(k, z\ |\ \boldsymbol p_{\rm co}) \ , \end{eqnarray*}

(1)

where |$P_{\delta }^{\rm hydro, sim}(k, z\ |\ \boldsymbol p_{\rm co, sim})$| denotes the hydrodynamical run from a given simulation; |$P_{\delta }^{\rm DMO, sim}(k, z\ |\ \boldsymbol p_{\rm co, sim})$| is the corresponding DMO run; |$P_{\delta }^{\rm theory}(k, z\ |\ \boldsymbol p_{\rm co})$| is the theoretical power spectrum calculated from halofit (Takahashi et al. 2012) or hmcode (Mead et al. 2015), which are calibrated by DMO simulations.

Equation (1) illustrates the most important assumption in this work: we assume that baryonic effects on the power spectrum can be represented as a fractional change in the power spectrum, and that this fractional change is independent of cosmology. The cosmology enters our analysis only through the theoretical power spectrum |$P_{\delta }^{\rm theory}(k, z\ |\ \boldsymbol p_{\rm co})$|⁠. This is a reasonable assumption. According to van Daalen, McCarthy & Schaye (2019), the power spectrum ratio remains more or less the same when varying cosmologies (see their fig. 6).

3 LIKELIHOOD ANALYSIS METHODOLOGY

Here we present our methodology in estimating the cosmological constraining power for an LSST-like survey. We start by describing the theoretical models used in the work, our mock observations, the covariance matrix constructed for an LSST-like survey, and finally the likelihood formalism used in estimating the posterior distribution of cosmological parameters. The cosmological model considered in our likelihood simulation is flat wCDM, with varying cosmological parameters |$\boldsymbol p_{\rm co}=\lbrace \Omega _{\mathrm{m}},\ \sigma _8,\ \Omega _{\mathrm{b}},\ n_{\mathrm{s}},\ w_0,\ w_{\rm a},\ h\rbrace$|⁠.

3.1 Theoretical models

We rely on two main theoretical models to fit our mock observables in this work. The first one is the Takahashi et al. (2012) version of halofit. It adopts empirically motivated functional forms to characterize the variation of power spectra with cosmology. Having been calibrated with high-resolution N-body simulations, it provides an accurate prediction of the non-linear matter spectrum with 5 per cent precision at |$k \le 1 \ h^{-1}\, \mathrm{Mpc}$| and 10 per cent at |$1 \le k \le 30 \ h^{-1}\, \mathrm{Mpc}$| within the redshift range of 0 ≤ z ≤ 10.

The second fitting routine is hmcode, constructed by M15. It utilizes the halo-model formalism to describe the cosmological change of power spectra via physically motivated parameters. hmcode has prescriptions for capturing the impact of baryons on the matter power spectrum via two free parameters: the amplitude of the concentration–mass relation (A; see equation 14 in M15), and a halo bloating parameter (η₀; see equations 26 and 29 in M15) controlling the change of dark matter halo profiles in a halo mass-dependent way to account for different feedback energy levels. When allowing A and η₀ to vary, it can successfully fit the power spectra from various baryonic scenarios of OWLS (M15). When fixing A = 3.13 and η₀ = 0.6044, hmcode functions as a regular DMO-based emulator, which is calibrated with high-resolution N-body simulations to an accuracy of |$\approx 5{{\ \rm per\ cent}}$| at |$k \le 10 \ h^{-1}\, \mathrm{Mpc}$| for z ≤ 2. We note that the |$\approx 5{{\ \rm per\ cent}}$| discrepancy between the DMO mode of hmcode and halofit is non-negligible within LSST statistics. We therefore construct two sets of mock observables based on each theoretical model.

3.2 Mock observational data

We rely on four hydrodynamical simulations: Eagle, MB2, Illustris, and Horizon-AGN to construct mock observables, and investigate the performances of the PCA method (Eifler et al. 2015, hereafter E15) and the halo model approach (Mead et al. 2015, hereafter M15) on mitigating baryonic effects. These methods will be described in more detail in Section 4. For simplicity, besides baryonic effects, our mock data vectors do not include any other source of noise or systematics.

We consider tomographic weak lensing shear power spectra as the summary statistics. These are defined as

\begin{eqnarray*} C^{ij} (\ell) = \frac{9H_0^4 \Omega _m^2}{4c^4} \int _0^{\chi _{\rm h}} {\rm d} \chi \frac{g^{i}(\chi) g^{j}(\chi)}{a^2(\chi)} P_{\delta } \left(\frac{\ell }{f_K(\chi)},\chi \right) \ . \end{eqnarray*}

(2)

Here C^ij(l) is the convergence power spectrum for tomographic bin combination {i, j} at angular wavenumber l, χ is the comoving distance, χ_h is the comoving horizon distance, f_K(χ) is the comoving angular diameter distance (set to χ since we assume a flat universe), a(χ) is the scale factor, and P_δ is the 3D matter power spectra. The lens efficiency in the i-th tomographic interval is defined as

\begin{eqnarray*} g^i(\chi) = \int _{\chi }^{\chi _{\rm h}} {\rm d} \chi ^{\prime } n^{i}(\chi ^{\prime }) \frac{f_K(\chi ^{\prime }-\chi)}{f_K(\chi ^{\prime })} \end{eqnarray*}

(3)

with nⁱ(χ^′(z)) being the redshift distribution of source galaxies in tomographic bin i. The overall source redshift distribution is parametrized in the form of

\begin{eqnarray*} n(z) \propto z^{\alpha } \exp \left[ - \left(\frac{z}{z_0} \right)^\beta \right]\ , \end{eqnarray*}

(4)

where α = 1.27, β = 1.02, and z₀ = 0.5 following table 2 in Chang et al. (2013), which mimics an LSST cosmic shear source galaxy sample after deblending. The number density of source galaxies is 26 arcmin⁻².

We perform a tomographic analysis by dividing the sources into 10 tomographic bins with equal total number of galaxies in each bin. We also smooth the redshift distribution with a Gaussian kernel to characterize potential photo-z uncertainties. Fig. 2 shows the exact redshift distribution in each bin. This results in 55 unique combinations of auto- and cross-correlation shear tomographic power spectra. For each of the tomographic power spectra, we consider 18 equally spaced logarithmic bins in angular wavenumber ℓ ranging from 23 to 2060. This results in a total of 55 × 18 = 990 data points in our data vector. For the main analysis of this paper, we adopt an upper limit of ℓ_max ≈ 2000. This limit is driven by the resolution of the hydrodynamical simulations used in this work. We refer readers to Appendix B for further details on how we extrapolate power spectra to perform the integration to derive C^ij(ℓ), and how the decision on the ℓ_max ≈ 2000 cut is made.

Figure 2.

The normalized galaxy number density split into 10 Gaussian tomographic photo-z bins as shaded regions from blue (low z) to green (high z). For comparison, we show the true underlying redshift distribution as a solid blue line.

The fiducial cosmology |$\boldsymbol p_{\rm co, fid}$| of the data vectors is set to be consistent with the Planck 2015 (TT+TE+EE+lowP and assuming ΛCDM) results (Planck Collaboration XIII 2016) as summarized in Table 2.

Table 2.

Fiducial cosmology, minimum and maximum of the flat prior on the cosmological parameters, and halo-structural parameters in hmcode.

Parameter	Fiducial	Prior
Ω_m	0.3156	Flat (0.05, 0.6)
σ₈	0.831	Flat (0.5, 1.1)
n_s	0.9645	Flat (0.84, 1.06)
w₀	−1.0	Flat (−2.1, 0.0)
w_a	0.0	Flat (−2.6, 2.6)
Ω_b	0.0049	Flat (0.04, 0.055)
h₀	0.6727	Flat (0.4, 0.9)
A	–	Flat (0.5, 10)
η₀	–	Flat (0.1, 1.2)

Parameter	Fiducial	Prior
Ω_m	0.3156	Flat (0.05, 0.6)
σ₈	0.831	Flat (0.5, 1.1)
n_s	0.9645	Flat (0.84, 1.06)
w₀	−1.0	Flat (−2.1, 0.0)
w_a	0.0	Flat (−2.6, 2.6)
Ω_b	0.0049	Flat (0.04, 0.055)
h₀	0.6727	Flat (0.4, 0.9)
A	–	Flat (0.5, 10)
η₀	–	Flat (0.1, 1.2)

Table 2.

Fiducial cosmology, minimum and maximum of the flat prior on the cosmological parameters, and halo-structural parameters in hmcode.

Parameter	Fiducial	Prior
Ω_m	0.3156	Flat (0.05, 0.6)
σ₈	0.831	Flat (0.5, 1.1)
n_s	0.9645	Flat (0.84, 1.06)
w₀	−1.0	Flat (−2.1, 0.0)
w_a	0.0	Flat (−2.6, 2.6)
Ω_b	0.0049	Flat (0.04, 0.055)
h₀	0.6727	Flat (0.4, 0.9)
A	–	Flat (0.5, 10)
η₀	–	Flat (0.1, 1.2)

Parameter	Fiducial	Prior
Ω_m	0.3156	Flat (0.05, 0.6)
σ₈	0.831	Flat (0.5, 1.1)
n_s	0.9645	Flat (0.84, 1.06)
w₀	−1.0	Flat (−2.1, 0.0)
w_a	0.0	Flat (−2.6, 2.6)
Ω_b	0.0049	Flat (0.04, 0.055)
h₀	0.6727	Flat (0.4, 0.9)
A	–	Flat (0.5, 10)
η₀	–	Flat (0.1, 1.2)

Our mock data vectors for various baryonic physics scenarios are computed with the P_δ term in equation (2) generated from equation (1). Since halofit and hmcode (in DMO mode) agree at the level of |$\lesssim 5{{\ \rm per\ cent}}$| to |$k = 10 \ h^{-1}\, \mathrm{Mpc}$|⁠, and |$\lesssim 10{{\ \rm per\ cent}}$| out to |$k \le 100 \ h^{-1}\, \mathrm{Mpc}$| (see fig. 4 of M15), we create two sets of Eagle/MB2/Illustris/Horizon-AGN data vectors, with |$P_{\delta }^{\rm theory}(k, z\ |\ \boldsymbol p_{\rm co, fid})$| generated from halofit or hmcode, and incorporate the baryonic features through the power spectrum ratio. Throughout our experiment, when relying on halofit or hmcode as the theoretical model to perform fitting, we use the same fitting function to generate the mock observational data vectors for the fiducial cosmology. This way, when comparing the performance of different baryonic mitigation schemes, if one of the methods fails to recover the fiducial cosmological parameters, we can be assured that this failure is purely because of that method’s inability to mitigate the modification of the matter power spectrum due to baryonic physics, not because of an inherent discrepancy between the mock data and the DMO matter power spectrum model.

In Fig. 3 we show the ratio of baryonic to DMO |$C^{00}(\ell , \boldsymbol p_{\rm co, fid})$| shear power spectrum for various simulations. The thin lines indicate the nine baryonic scenarios from the OWLS simulation suite. The thick lines represent the Eagle/MB2/Illustris/Horizon-AGN universes, which are the data vectors that we will use for the LSST-like experiment. One can see that in this lowest tomographic bin, even for large scales at ℓ ≈ 100, the baryonic scenario of Illustris already causes a deviation from DMO at the 5 per cent level, with even more severe suppressions at smaller angular scales. For higher redshift tomographic bins, the deviations between hydrodynamical and DMO simulations are less severe. Semboloni et al. (2011) showed that a scale cut of ℓ_max ≈ 500 would be needed to avoid w₀ bias for a Euclid-like survey if the baryonic scenario of our Universe is like OWLS-AGN. When applying the traditional way of mitigating baryonic uncertainty by omitting small-scale information, we would need to discard a considerable amount of data before we can rely on DMO-based theoretical model to achieve an unbiased cosmological inference.

Figure 3.

The ratio of tomographic shear power spectra of different hydrodynamical simulations with respect to their counterpart DMO simulations for the lowest auto-correlation tomographic bin with the cosmology set at the Planck 2015 result (Table 2). The thick lines represent the cases for Eagle/MB2/Illustris/Horizon-AGN simulations, while the thin lines indicate the nine different baryonic scenarios in OWLS simulation suite.

One subtle feature shown in Fig. 3 is that there is a small but noticeable large-scale excess of power (<0.4 per cent) in the Horizon-AGN simulation. This is because the power spectrum ratio between hydrodynamical and DMO runs of Horizon-AGN has <0.1 per cent excess at large scales (see Fig. 1), even though they share the same initial conditions. The true cause of this subtle excess is not clear. After exploring, Chisari et al. (2018) concluded that this may originate from the box being too small to reach the linear regime at large scales. However, the other simulations studied here are similar in size and do not exhibit this feature.

3.3 Covariance matrix

We generate the analytical covariance matrix of tomographic shear power spectra using cosmolike (Eifler et al. 2014; Krause & Eifler 2017). Briefly, our covariance matrix contains both Gaussian and non-Gaussian parts. The Gaussian covariance matrix contains contributions from cosmic variance and shape noise, derived under the assumption that the 4pt-function of the shear field can be expressed in terms of 2pt-functions (Hu & Jain 2004; Takada & Bridle 2007). The non-Gaussian part is given by the convergence trispectrum derived using the halo model (Cooray & Sheth 2002), which contains one-, two-, three-, and four-halo terms and a halo sample variance term characterizing the scatter of halo number density due to large-scale density fluctuations (Cooray & Hu 2001; Sato et al. 2009; Takada & Jain 2009). The exact equations of our implementation can be found in the appendix of Krause & Eifler (2017).

We assume 18 000 |$\rm {deg}^2$| as the survey area in our covariance matrix and adopt the same redshift distribution and source galaxy number density (⁠|$26\,\rm {arcmin}^{-2}$|⁠) as depicted in Fig. 2. The shape noise is set to be σ_ϵ = 0.26 in each ellipticity component.

3.4 Likelihood formalism

Given a data vector |$\boldsymbol D$| (at some fiducial cosmology and with baryonic effects from Eagle/MB2/Illustris/Horizon-AGN), one can infer the corresponding posterior probability distribution of cosmological parameters |$\boldsymbol p_{\rm co}$| and potential nuisance parameters |$\boldsymbol p_{\rm nu}$| via Bayes’ theorem:

\begin{eqnarray*} P(\boldsymbol p_{\rm co}, \boldsymbol p_{\rm nu}| \boldsymbol D) \propto L(\boldsymbol D| \boldsymbol p_{\rm co}, \boldsymbol p_{\rm nu}) P_r (\boldsymbol p_{\rm co}, \boldsymbol p_{\rm nu}), \end{eqnarray*}

(5)

where |$P_r (\boldsymbol p_{\rm co}, \boldsymbol p_{\rm nu})$| denotes the prior probability distribution and |$L(\boldsymbol D| \boldsymbol p_{\rm co}, \boldsymbol p_{\rm nu})$| is the likelihood. In this work, we assume a Gaussian likelihood function for the observables,

\begin{eqnarray*} L(\boldsymbol D| \boldsymbol p_{\rm co}, \boldsymbol p_{\rm nu}) \propto \exp \left (-\frac{1}{2} \underbrace{\left[ (\boldsymbol D-\boldsymbol M)^t \, \mathbf {C}^{-1} \, (\boldsymbol D-\boldsymbol M) \right]}_{\chi ^2(\boldsymbol p_{\rm co}, \boldsymbol p_{\rm nu})} \right). \end{eqnarray*}

(6)

We further assume that the covariance |$\mathbf {C}$| is constant in parameter space for simplicity (but see Eifler, Schneider & Hartlap 2009; Morrison & Schneider 2013 for likelihood analysis with cosmology-dependent covariance matrix). As described in Section 3.1, the model vector |$\boldsymbol M$| may be derived based on halofit which is a pure function of cosmology |$\boldsymbol M= \boldsymbol M(\boldsymbol p_{\rm co})$|⁠, or it can be a function of some nuisance parameters |$\boldsymbol M= \boldsymbol M(\boldsymbol p_{\rm co}, \boldsymbol p_{\rm nu})$| as well, with factors that are known to affect |$\boldsymbol D$| absorbed in |$\boldsymbol p_{\rm nu}$|⁠. For example, in hmcode, we have A and η₀ acting as nuisance parameters to account for the baryonic effects (see Section 3.1 for details). The final posterior distribution on cosmological parameters then can be derived by marginalizing over all other nuisance parameters in the model

\begin{eqnarray*} P(\boldsymbol p_{\rm co}| \boldsymbol D) \propto \int {\rm d} \boldsymbol p_{\rm nu}\ P(\boldsymbol p_{\rm co}, \boldsymbol p_{\rm nu}| \boldsymbol D). \end{eqnarray*}

(7)

We use the python emcee package (Foreman-Mackey et al. 2013), which relies on the algorithm of Goodman et al. (2010) to sample the parameter space spanned by |$\boldsymbol p_{\rm co}$| ({Ω_m, σ₈, Ω_b, n_s, w₀, w_a, h₀}) as well as |$\boldsymbol p_{\rm nu}$| (if needed depending on the model). Altogether, we have conducted ∼250 likelihood simulations to present the results for this paper. The MCMC (Markov Chain Monte Carlo) chains contain ∼200 000 to 400 000 MCMC steps (after discarding 100 000 steps as burn-in phase), depending on the dimension of the parameter space that ranges from 7 to 16. For simplicity, we assume flat priors for all of our parameters, with their minimum and maximum values summarized in Table 2. For likelihood simulations with informative priors based on Planck, we refer readers to E15. Informative priors help to better constrain n_s, Ω_b, and h, to which cosmic shear is not very sensitive.

We will present in Section 4 on how we implement various baryonic mitigation schemes in the likelihood analysis. But before that, in Fig. 4 we show the posterior distribution of cosmological parameters derived from our LSST likelihood simulation, when naively applying the halofit model on fitting the data vectors contaminated with baryonic effects from Eagle/MB2/Horizon-AGN/Illustris simulations. For ease of visualization, we only show posteriors in the subspace of four cosmological parameters out of seven in total. Depending on the intensity of baryonic feedback as reflected in the ratio of hydrodynamical to DMO power spectra shown in Fig. 1, the resulting cosmology constraints can be severely biased in the case of Illustris (2σ–13σ depending on cosmological parameters) or at 1σ–2σ level in the other three cases. We note that the degree of bias depends on the ℓ_max used in the analysis. Fig. 4 presents the result when applying a cut at ℓ_max ≈ 2000 on |$\boldsymbol D$|⁠, which is the default setting in the paper. In Section 5.4, we will show how this result changes when extending data vectors to ℓ_max ≈ 5000.

Figure 4.

Cosmological parameter constraints for an LSST-like weak lensing survey with data vectors generated using various baryonic physics scenarios: pure DM (grey/solid) and the Eagle (blue/solid), MB2 (red/dashed), Illustris (yellow/dot–dashed), and Horizon-AGN (black/dotted) hydrodynamical simulations. In all cases, baryonic physics was ignored during the likelihood analysis, hence providing a worst-case scenario for biases due to baryonic physics. The analyses are carried out assuming non-informative priors on the parameters. Here, and in all such 2D posterior plots below, the contours depict the 68 per cent confidence levels. Depending on the intensity of the baryonic feedback, the resulting posterior distributions can be significantly away from the fiducial cosmology (marked in grey lines).

4 METHODS OF MITIGATING BARYONIC EFFECTS

In this section, we describe the methods used to mitigate the impact of baryonic physics on the cosmological parameter estimates from weak lensing. The methods can be classified into two categories: PCA-based methods and the halo-model-based approach. We discuss several PCA-based methods that are minor variants of each other in Sections 4.1–4.3. The halo-model based approach is described in Section 4.4. Throughout the work, we use the nine OWLS simulations as our ‘training sample’ to construct PCs for the PCA-based methods, and use the four mock data vectors constructed from Eagle/MB2/Illustris/Horizon-AGN simulations as ‘test sample’ to test methods listed in Table 3.

Table 3.

Summary of baryonic physics mitigation techniques. The first column is the label of each method, which we refer to in the text and plots throughout the work. The second column has simple descriptions that highlight the essential elements of each method. The third column presents the exact |$\boldsymbol \chi ^2$| equations that go into the likelihood analysis. Finally, the last column provides a section number where more information can be found for each method.

Method	Brief description	χ² equation	Section
			reference
A	PCA in difference matrix, with exclusion	\|$[(\boldsymbol D-\boldsymbol M)_{\rm pc, cut}]^{\rm t}\ \mathbf {C}^{-1}_{\rm pc, cut}\ [(\boldsymbol D-\boldsymbol M)_{\rm pc, cut}]$\|	Section 4.1.1
B	PCA in difference matrix, with marginalization	\|$[\boldsymbol D-\boldsymbol M_B(\boldsymbol p_{\rm co}, \boldsymbol Q)]^{\rm t}\ \mathbf {C}^{-1}\ [\boldsymbol D-\boldsymbol M_B(\boldsymbol p_{\rm co}, \boldsymbol Q)]$\|	Section 4.1.2
C	PCA in \|$\mathbf {L}^{-1}$\| weighted difference matrix, with exclusion	\|$[\mathbf {U}_{\rm ch} \mathbf {P}\mathbf {U}_{\rm ch}^{\rm t} \mathbf {L}^{-1}(\boldsymbol D-\boldsymbol M)]^{\rm t}\ \mathbf {I}\ [\mathbf {U}_{\rm ch} \mathbf {P}\mathbf {U}_{\rm ch}^{\rm t} \mathbf {L}^{-1}(\boldsymbol D-\boldsymbol M)]$\|	Section 4.2
D	PCA in fractional difference matrix, with marginalization	\|$[\boldsymbol D-\boldsymbol M_R(\boldsymbol p_{\rm co}, {\boldsymbol Q})]^{\rm t}\ \mathbf {C}^{-1}\ [\boldsymbol D-\boldsymbol M_R(\boldsymbol p_{\rm co}, {\boldsymbol Q})]$\|	Section 4.3
M	Halo model parameter marginalization	\|$[\boldsymbol D-\boldsymbol M_{\rm HMcode}(\boldsymbol p_{\rm co}, A, \eta _0)]^{\rm t}\ \mathbf {C}^{-1}\ [\boldsymbol D-\boldsymbol M_{\rm HMcode}(\boldsymbol p_{\rm co}, A, \eta _0)] \!\!\!\!\!\!$\|	Section 4.4

Method	Brief description	χ² equation	Section
			reference
A	PCA in difference matrix, with exclusion	\|$[(\boldsymbol D-\boldsymbol M)_{\rm pc, cut}]^{\rm t}\ \mathbf {C}^{-1}_{\rm pc, cut}\ [(\boldsymbol D-\boldsymbol M)_{\rm pc, cut}]$\|	Section 4.1.1
B	PCA in difference matrix, with marginalization	\|$[\boldsymbol D-\boldsymbol M_B(\boldsymbol p_{\rm co}, \boldsymbol Q)]^{\rm t}\ \mathbf {C}^{-1}\ [\boldsymbol D-\boldsymbol M_B(\boldsymbol p_{\rm co}, \boldsymbol Q)]$\|	Section 4.1.2
C	PCA in \|$\mathbf {L}^{-1}$\| weighted difference matrix, with exclusion	\|$[\mathbf {U}_{\rm ch} \mathbf {P}\mathbf {U}_{\rm ch}^{\rm t} \mathbf {L}^{-1}(\boldsymbol D-\boldsymbol M)]^{\rm t}\ \mathbf {I}\ [\mathbf {U}_{\rm ch} \mathbf {P}\mathbf {U}_{\rm ch}^{\rm t} \mathbf {L}^{-1}(\boldsymbol D-\boldsymbol M)]$\|	Section 4.2
D	PCA in fractional difference matrix, with marginalization	\|$[\boldsymbol D-\boldsymbol M_R(\boldsymbol p_{\rm co}, {\boldsymbol Q})]^{\rm t}\ \mathbf {C}^{-1}\ [\boldsymbol D-\boldsymbol M_R(\boldsymbol p_{\rm co}, {\boldsymbol Q})]$\|	Section 4.3
M	Halo model parameter marginalization	\|$[\boldsymbol D-\boldsymbol M_{\rm HMcode}(\boldsymbol p_{\rm co}, A, \eta _0)]^{\rm t}\ \mathbf {C}^{-1}\ [\boldsymbol D-\boldsymbol M_{\rm HMcode}(\boldsymbol p_{\rm co}, A, \eta _0)] \!\!\!\!\!\!$\|	Section 4.4

Table 3.

Summary of baryonic physics mitigation techniques. The first column is the label of each method, which we refer to in the text and plots throughout the work. The second column has simple descriptions that highlight the essential elements of each method. The third column presents the exact |$\boldsymbol \chi ^2$| equations that go into the likelihood analysis. Finally, the last column provides a section number where more information can be found for each method.

Method	Brief description	χ² equation	Section
			reference
A	PCA in difference matrix, with exclusion	\|$[(\boldsymbol D-\boldsymbol M)_{\rm pc, cut}]^{\rm t}\ \mathbf {C}^{-1}_{\rm pc, cut}\ [(\boldsymbol D-\boldsymbol M)_{\rm pc, cut}]$\|	Section 4.1.1
B	PCA in difference matrix, with marginalization	\|$[\boldsymbol D-\boldsymbol M_B(\boldsymbol p_{\rm co}, \boldsymbol Q)]^{\rm t}\ \mathbf {C}^{-1}\ [\boldsymbol D-\boldsymbol M_B(\boldsymbol p_{\rm co}, \boldsymbol Q)]$\|	Section 4.1.2
C	PCA in \|$\mathbf {L}^{-1}$\| weighted difference matrix, with exclusion	\|$[\mathbf {U}_{\rm ch} \mathbf {P}\mathbf {U}_{\rm ch}^{\rm t} \mathbf {L}^{-1}(\boldsymbol D-\boldsymbol M)]^{\rm t}\ \mathbf {I}\ [\mathbf {U}_{\rm ch} \mathbf {P}\mathbf {U}_{\rm ch}^{\rm t} \mathbf {L}^{-1}(\boldsymbol D-\boldsymbol M)]$\|	Section 4.2
D	PCA in fractional difference matrix, with marginalization	\|$[\boldsymbol D-\boldsymbol M_R(\boldsymbol p_{\rm co}, {\boldsymbol Q})]^{\rm t}\ \mathbf {C}^{-1}\ [\boldsymbol D-\boldsymbol M_R(\boldsymbol p_{\rm co}, {\boldsymbol Q})]$\|	Section 4.3
M	Halo model parameter marginalization	\|$[\boldsymbol D-\boldsymbol M_{\rm HMcode}(\boldsymbol p_{\rm co}, A, \eta _0)]^{\rm t}\ \mathbf {C}^{-1}\ [\boldsymbol D-\boldsymbol M_{\rm HMcode}(\boldsymbol p_{\rm co}, A, \eta _0)] \!\!\!\!\!\!$\|	Section 4.4

Method	Brief description	χ² equation	Section
			reference
A	PCA in difference matrix, with exclusion	\|$[(\boldsymbol D-\boldsymbol M)_{\rm pc, cut}]^{\rm t}\ \mathbf {C}^{-1}_{\rm pc, cut}\ [(\boldsymbol D-\boldsymbol M)_{\rm pc, cut}]$\|	Section 4.1.1
B	PCA in difference matrix, with marginalization	\|$[\boldsymbol D-\boldsymbol M_B(\boldsymbol p_{\rm co}, \boldsymbol Q)]^{\rm t}\ \mathbf {C}^{-1}\ [\boldsymbol D-\boldsymbol M_B(\boldsymbol p_{\rm co}, \boldsymbol Q)]$\|	Section 4.1.2
C	PCA in \|$\mathbf {L}^{-1}$\| weighted difference matrix, with exclusion	\|$[\mathbf {U}_{\rm ch} \mathbf {P}\mathbf {U}_{\rm ch}^{\rm t} \mathbf {L}^{-1}(\boldsymbol D-\boldsymbol M)]^{\rm t}\ \mathbf {I}\ [\mathbf {U}_{\rm ch} \mathbf {P}\mathbf {U}_{\rm ch}^{\rm t} \mathbf {L}^{-1}(\boldsymbol D-\boldsymbol M)]$\|	Section 4.2
D	PCA in fractional difference matrix, with marginalization	\|$[\boldsymbol D-\boldsymbol M_R(\boldsymbol p_{\rm co}, {\boldsymbol Q})]^{\rm t}\ \mathbf {C}^{-1}\ [\boldsymbol D-\boldsymbol M_R(\boldsymbol p_{\rm co}, {\boldsymbol Q})]$\|	Section 4.3
M	Halo model parameter marginalization	\|$[\boldsymbol D-\boldsymbol M_{\rm HMcode}(\boldsymbol p_{\rm co}, A, \eta _0)]^{\rm t}\ \mathbf {C}^{-1}\ [\boldsymbol D-\boldsymbol M_{\rm HMcode}(\boldsymbol p_{\rm co}, A, \eta _0)] \!\!\!\!\!\!$\|	Section 4.4

4.1 PCA in difference matrix

4.1.1 Summary of the PCA framework (method A)

The original framework for using PCA to mitigate the impact of baryonic physics for weak lensing is described in Eifler et al. (2015). The essential idea is that even though hydrodynamical simulations with different baryonic prescriptions predict a range of variations on the matter power spectra (Fig. 1), we can still extract the common features of those diversity using PCA, and build an empirical model to mitigate baryonic uncertainty based on these hydrodynamical simulations. Below we provide a step-by-step description of the PCA framework.

First, we collect the tomographic shear power spectra constructed from the nine OWLS simulations as our training sample, and label these nine data vectors as |$\boldsymbol B_1, ..., \boldsymbol B_9$|⁠. Next we build a difference matrix |${\boldsymbol \Delta }(\boldsymbol p_{\rm co})$| with dimension of N_data × N_sim = 990 × 9. Each column records the deviation between the baryonic data vector and the DMO model vector |$\boldsymbol M$| at any arbitrary cosmology (recomputed for each MCMC step) in terms of their difference

\begin{eqnarray*} {\boldsymbol \Delta }(\boldsymbol p_{\rm co}) = \begin{bmatrix} {|} & {\,\,\,\,\,\,|} &\,& {\,\,\,\,\,\,|}\\ \boldsymbol B_1-\boldsymbol M& \quad\boldsymbol B_2-\boldsymbol M& \quad\ldots & \quad\boldsymbol B_9-\boldsymbol M \\ {|} & {\,\,\,\,\,\,|} &\,& {\,\,\,\,\,\,|}\\ \end{bmatrix} _{N_\text{data} \times N_\text{sim}} \ . \end{eqnarray*}

(8)

The left-hand panel of Fig. 6 provides a visualization of the entries of the difference vectors used to construct |$\boldsymbol \Delta$|⁠. Here notice that both |$\boldsymbol B_x(\boldsymbol p_{\rm co})$| and |$\boldsymbol M(\boldsymbol p_{\rm co})$| are functions of cosmology, and therefore so is |${\boldsymbol \Delta }$|⁠. We refer readers to Appendix C for details of how we compute the baryon-contaminated data vectors at different |$\boldsymbol p_{\rm co}$|⁠.

The second step is to perform the PCA on the difference matrix, with the goal of identifying the few dominant PCs that signify the directions of largest discrepancy between the baryonic and DMO data vectors from the nine OWLS simulations. To find the PCs, we apply the (full) singular value decomposition (SVD) on |${\boldsymbol \Delta }$|⁠,

\begin{eqnarray*} {\boldsymbol \Delta } = \mathbf {U}\ \boldsymbol \Sigma \ \mathbf {V}^{\rm t}. \end{eqnarray*}

(9)

As shown in Fig. 5, SVD decomposes |$\boldsymbol \Delta$| into the product of three matrices. Both |$\mathbf {U}$| and |$\mathbf {V}$| are square unitary matrices with dimensions of N_data × N_data (990 × 990) and N_sim × N_sim (9 × 9) respectively. The upper N_sim × N_sim (9 × 9) block of |$\boldsymbol \Sigma$| is a diagonal matrix consisting of N_sim (9) positive real singular values σ₁...σ₉ arranged in descending order, and the remaining N_data − N_sim (981) rows have only zeros (indicated by the dashed square). The N_data (990) columns of |$\mathbf {U}$| are eigenvectors of |$\boldsymbol \Delta \boldsymbol \Delta ^{\rm t}$|⁠, with eigenvalues in the diagonal entries of |$\boldsymbol \Sigma \boldsymbol \Sigma ^{\rm t}$|⁠.

$We perform SVD on the difference matrix $\boldsymbol \Delta$ built based on the nine baryonic scenarios of OWLS (see equation 8). $\mathbf {U}$ is a unitary matrix with columns that form an orthonormal basis set to span the 990-dimensional space of our data vector. Among them, the first nine PCs of $\mathbf {U}$ form a complete description of the modifications of the data vector due to baryonic physics in the nine OWLS hydro simulations. We will test whether these nine PCs can also describe the impact of baryonic physics in the Eagle/MB2/Illustris/Horizon-AGN simulations.$

Figure 5.

We perform SVD on the difference matrix |$\boldsymbol \Delta$| built based on the nine baryonic scenarios of OWLS (see equation 8). |$\mathbf {U}$| is a unitary matrix with columns that form an orthonormal basis set to span the 990-dimensional space of our data vector. Among them, the first nine PCs of |$\mathbf {U}$| form a complete description of the modifications of the data vector due to baryonic physics in the nine OWLS hydro simulations. We will test whether these nine PCs can also describe the impact of baryonic physics in the Eagle/MB2/Illustris/Horizon-AGN simulations.

The first nine eigenvectors constitute a set of orthogonal PCs in order of decreasing importance according to the amount of variation they capture in the different training vectors. The right-hand panel of Fig. 6 shows these nine PC modes in projection on the C⁰⁰ tomographic bin. The PC modes span a nine-dimensional subspace within the 990 dimensional space which covers entirely the degrees of freedom to explain baryonic uncertainties in the nine OWLS hydro simulations. In other words, any given |$\boldsymbol B_x(\boldsymbol p_{\rm co})-\boldsymbol M(\boldsymbol p_{\rm co})$|⁠, can be described with nine free parameters via

\begin{eqnarray*} \boldsymbol B_x(\boldsymbol p_{\rm co})-\boldsymbol M(\boldsymbol p_{\rm co}) = \sum _{n = 1}^{9} Q_n\ {\mathbf {PC}}_n (\boldsymbol p_{\rm co}), \end{eqnarray*}

(10)

with Q_n being the amplitude of PC_n. The remaining 981 columns of |$\mathbf {U}$| are silent orthogonal vectors which extends |$\mathbf {U}$| into a unitary matrix. With nine baryonic scenarios as our training sample, we have at most nine independent PCs to describe modifications to the observables due to baryonic physics. One of the goals of this work is to understand how effectively the PCA basis can describe baryonic physics scenarios in other more recent hydrodynamical simulations.

$Left: The difference vectors, $\boldsymbol B-\boldsymbol M$, from the OWLS simulation set used to construct PCs as input in columns of equation (8). The thicker lines indicate the difference vectors for Eagle/MB2/Horizon-AGN/Illustris simulations as our test set. Right: The PC modes constructed from the OWLS simulation set in projection on the difference vector space for the tomographic bin C00. The goal of this work is to check whether these PC modes can flexibly describe the baryonic physics scenarios in the test set hydrodynamical simulations.$

Figure 6.

Left: The difference vectors, |$\boldsymbol B-\boldsymbol M$|⁠, from the OWLS simulation set used to construct PCs as input in columns of equation (8). The thicker lines indicate the difference vectors for Eagle/MB2/Horizon-AGN/Illustris simulations as our test set. Right: The PC modes constructed from the OWLS simulation set in projection on the difference vector space for the tomographic bin C⁰⁰. The goal of this work is to check whether these PC modes can flexibly describe the baryonic physics scenarios in the test set hydrodynamical simulations.

The third step is to transform everything to PC basis, and mitigate baryonic uncertainty by excluding PC modes. In PC basis, our data and model vectors are defined as

\begin{eqnarray*} \boldsymbol D_{\rm pc} = \mathbf {U}^{\rm t} \boldsymbol D \end{eqnarray*}

(11a)

\begin{eqnarray*} \boldsymbol M_{\rm pc} = \mathbf {U}^{\rm t} \boldsymbol M, \end{eqnarray*}

(11b)

and the covariance matrix is

\begin{eqnarray*} \mathbf {C}_{\rm pc} = \mathbf {U}^{\rm t}\ \mathbf {C}\ \mathbf {U}. \end{eqnarray*}

(12)

Viewing from PC coordinate, the majority of the baryonic uncertainties between |$\boldsymbol D_{\rm pc}$| and |$\boldsymbol M_{\rm pc}$| would be absorbed in the first N elements. We can then directly cut the data vector |$\boldsymbol D_{\rm pc}$| to obtain a shorter vector |$\boldsymbol D_{\rm pc, cut}$|⁠, and do the same to the model vector |$\boldsymbol M_{\rm pc} \rightarrow \boldsymbol M_{\rm pc, cut}$| to avoid modelling challenges on these data points.

When doing MCMC analysis, we modify the original equation (8) from E15 to properly account for the change of covariance matrix due to loss of information after PC mode removal. We cut the corresponding rows and columns on |$\mathbf {C}_{\rm pc}$|⁠, and use the corresponding sub-matrix, |$\mathbf {C}_{\rm pc, cut}$| to calculate the inverse covariance |$\mathbf {C}^{-1}_{\rm pc, cut}$| for |$\boldsymbol D_{\rm pc, cut}$|⁠. The χ² equation can then be written as:

\begin{eqnarray*} \chi ^{\prime 2}(\boldsymbol p_{\rm co}) = (\boldsymbol D-\boldsymbol M)_{\rm pc, cut}^{\rm t}\ \mathbf {C}^{-1}_{\rm pc, cut}\ (\boldsymbol D-\boldsymbol M)_{\rm pc, cut}, \end{eqnarray*}

(13)

and the likelihood equation:

\begin{eqnarray*} L(\boldsymbol D| \boldsymbol p_{\rm co}) \propto \exp \left (-\frac{1}{2} \underbrace{\left[ (\boldsymbol D-\boldsymbol M)_{\rm pc, cut}^{\rm t}\ \mathbf {C}^{-1}_{\rm pc, cut}\ (\boldsymbol D-\boldsymbol M)_{\rm pc, cut} \right]}_{\chi ^{\prime 2}(\boldsymbol p_{\rm co})} \right). \nonumber \\ \end{eqnarray*}

(14)

4.1.2 The marginalization version of the PCA framework (method B)

We refer to ‘PC marginalization’ as a method that includes (up to nine) amplitudes of PCs as free parameters to parametrize the impact of baryonic physics on the tomographic shear power spectra. As shown in equation (10), the current nine PCs fully span the baryonic degrees of freedom in the nine OWLS simulations. We can further check whether they are also effective in describing the impact of baryonic physics on the observables in our test set of hydrodynamic simulations by building a new model with the following parametric form:

\begin{eqnarray*} \boldsymbol M_B(\boldsymbol p_{\rm co}, \boldsymbol Q) = \boldsymbol M(\boldsymbol p_{\rm co}) + \sum _{n = 1}^{m} Q_n\ {\mathbf {PC}}_n (\boldsymbol p_{\rm co}), \end{eqnarray*}

(15)

where m ≦ 9 and |$\boldsymbol Q= \lbrace Q_1, Q_2, ... , Q_m \rbrace$| are free parameters in addition to the cosmological parameters. The likelihood function for the cosmological parameters can be derived by marginalizing over the amplitude parameters:

\begin{eqnarray*} L(\boldsymbol D| \boldsymbol p_{\rm co}) &\propto& \int \mathrm{ d}\boldsymbol Q \times \exp \left (-\frac{1}{2} \left[ (\boldsymbol D-\boldsymbol M_B(\boldsymbol p_{\rm co}, \boldsymbol Q))^{\rm t}\ \mathbf {C}^{-1}\right.\right.\nonumber \\ &&\times \,(\boldsymbol D-\boldsymbol M_B(\boldsymbol p_{\rm co}, \boldsymbol Q)) \big] \Big). \end{eqnarray*}

(16)

Theoretically, one can prove that the likelihood functions of equation (14) (method A) and equation (16) return identical results if the priors on the PC amplitudes are uninformative. We will provide comparisons of the posterior distributions of |$\boldsymbol p_{\rm co}$| in Section 5.1 and further comment on both methods there.

4.2 Noise-weighted PCA – Cholesky decomposition (method C)

As noted at the end of section 2.2 of E15, performing PCA on the difference matrix |$\boldsymbol \Delta$| (equation 8) is not necessarily the most optimal choice. They suggested an option of conducting the PCA on the ‘noise’-weighted |$\boldsymbol \Delta$|⁠. As a result of re-weighting, the derived PCs would be more sensitive in accounting for deviations in data vectors due to baryonic physics at well-measured data points, where larger weighting factors are applied. Therefore, when doing PC mode removal, we tend to more effectively remove baryonic physics degrees of freedom that impact better-measured (lower noise) scales, which may more effectively reduce cosmological parameter biases.

To find the weights, we first decompose our covariance matrix by applying a Cholesky decomposition

\begin{eqnarray*} \mathbf {C}= \mathbf {L}\mathbf {L}^{\rm t}, \end{eqnarray*}

(17)

where |$\mathbf {L}$| is a lower triangular matrix with real and positive diagonal entries. We can then weight our |$\boldsymbol D$| and |$\boldsymbol M$| vectors as

\begin{eqnarray*} \boldsymbol D_{\rm ch} &=& \mathbf {L}^{-1}\boldsymbol D,\nonumber \\ \boldsymbol M_{\rm ch} &=& \mathbf {L}^{-1}\boldsymbol M. \end{eqnarray*}

(18)

After this transformation, our new data vector |$\boldsymbol D_{\rm ch}$| has an identity covariance matrix

, which can be easily proved as follows:

In other words, after applying equation (18), we not only re-weight but also decorrelate the data vector.

Similar to equation (8), we build the new difference matrix as

\begin{eqnarray*} && {\boldsymbol \Delta _{\rm ch}}(\boldsymbol p_{\rm co}) = \begin{bmatrix} {\,|} &\,& {\,\,\,\,\,\,\,|} \nonumber \\ \boldsymbol B_{\rm 1, ch}-\boldsymbol M_{\rm ch} & \quad \ldots & \quad \boldsymbol B_{\rm 9, ch}-\boldsymbol M_{\rm ch} \\ {\,|} &\,& {\,\,\,\,\,\,\,|} \nonumber \\ \end{bmatrix} _{N_{\rm data} \times N_{\rm sim}} \nonumber \\ && = \mathbf {L}^{-1}{\boldsymbol \Delta }(\boldsymbol p_{\rm co}) = \mathbf {U}_{\rm ch} \ \boldsymbol \Sigma _{\rm ch}\ \mathbf {V}_{\rm ch}^{\rm t} \ . \end{eqnarray*}

(20)

Here each of the OWLS training data vectors is weighted by |$\mathbf {L}^{-1}$| as |$\boldsymbol B_{\rm x, ch} = \mathbf {L}^{-1}\boldsymbol B_{\rm x}$|⁠. The |$\boldsymbol \Delta _{\rm ch}$| matrix is equivalent to performing a |$\mathbf {L}^{-1}$| matrix transformation on |$\boldsymbol \Delta$| shown in equation (8). We can then apply SVD to derive the PC basis set as stored in the |$\mathbf {U}_{\rm ch}(\boldsymbol p_{\rm co})$| matrix. The first nine PCs form natural bases to span the weighted difference vector for various baryonic effects

\begin{eqnarray*} \boldsymbol B_{\rm ch} - \boldsymbol M_{\rm ch} = \mathbf {L}^{-1}(\boldsymbol B- \boldsymbol M) = \sum _{n = 1}^{9} Q_n\ {\mathbf {PC}}_n. \end{eqnarray*}

(21)

In Fig. 7, we show the |$\boldsymbol B_{\rm ch} - \boldsymbol M_{\rm ch} = \mathbf {L}^{-1}(\boldsymbol B-\boldsymbol M)$| vectors in our lowest tomographic bin, at |$\boldsymbol p_{\rm co, fid}$|⁠. The thicker lines represent our four test simulations; the thinner lines are for the nine baryonic scenarios in OWLS, which compose the columns of |$\boldsymbol \Delta _{\rm ch}$| in equation (20). Comparing with the left-hand panel of Fig. 6, one can see that after re-weighting by |$\mathbf {L}^{-1}$|⁠, we more strongly emphasize baryonic fluctuations at smaller scales, so the PCs should also be more effective in accounting for small-scale baryonic features.⁹

$The discrepancy between baryon-contaminated data vectors and model in terms of $\boldsymbol B_{\rm ch} - \boldsymbol M_{\rm ch}$ for various hydrodynamical simulations in the lowest tomographic bin. This is similar to the left-hand panel of Fig. 6, but here shows results for the case when applying Cholesky decomposition on our $\boldsymbol B$ and $\boldsymbol M$ vectors. The nine OWLS baryonic scenarios (thinner lines) compose columns of $\boldsymbol \Delta _{\rm ch}$, which are used to build PCs. These PCs are used to span the variation of Eagle/MB2/Illustris/Horizon-AGN simulations in $\boldsymbol D_{\rm ch} - \boldsymbol M_{\rm ch}$ space. After Cholesky decomposition, the largest data-model inconsistency shifts to smaller scales compared with the upper panel of Fig. 6, indicating the PCs trained from $\boldsymbol \Delta _{\rm ch}$ are more efficient at describing small-scale variations in the matter power spectrum due to baryonic physics compared with performing PCA on $\boldsymbol \Delta$.$

Figure 7.

The discrepancy between baryon-contaminated data vectors and model in terms of |$\boldsymbol B_{\rm ch} - \boldsymbol M_{\rm ch}$| for various hydrodynamical simulations in the lowest tomographic bin. This is similar to the left-hand panel of Fig. 6, but here shows results for the case when applying Cholesky decomposition on our |$\boldsymbol B$| and |$\boldsymbol M$| vectors. The nine OWLS baryonic scenarios (thinner lines) compose columns of |$\boldsymbol \Delta _{\rm ch}$|⁠, which are used to build PCs. These PCs are used to span the variation of Eagle/MB2/Illustris/Horizon-AGN simulations in |$\boldsymbol D_{\rm ch} - \boldsymbol M_{\rm ch}$| space. After Cholesky decomposition, the largest data-model inconsistency shifts to smaller scales compared with the upper panel of Fig. 6, indicating the PCs trained from |$\boldsymbol \Delta _{\rm ch}$| are more efficient at describing small-scale variations in the matter power spectrum due to baryonic physics compared with performing PCA on |$\boldsymbol \Delta$|⁠.

Similar to Section 4.1.1, to perform the PC mode removal, we transform everything to the PC basis:

\begin{eqnarray*} \boldsymbol D_{\rm ch, pc} = \mathbf {U}_{\rm ch}^{\rm t} \boldsymbol D_{\rm ch} \end{eqnarray*}

(22a)

\begin{eqnarray*} \boldsymbol M_{\rm ch, pc} = \mathbf {U}_{\rm ch}^{\rm t} \boldsymbol M_{\rm ch} \end{eqnarray*}

(22b)

and then cut all the elements from the data and model vectors and the covariance matrix for the PC modes that are to be removed. Here since |$\mathbf {C}_{\rm ch}$| is an identity matrix, it and its inverse |$\mathbf {C}^{-1}_{\rm ch, pc}$| in the PC basis remain the same after coordinate transformation. The final PC mode removal χ² equation becomes:

\begin{eqnarray*} \chi ^2_{\rm ch}(\boldsymbol p_{\rm co}) &=& (\boldsymbol D_{\rm ch}-\boldsymbol M_{\rm ch})_{\rm pc, cut}^{\rm t}\ \mathbf {C}^{-1}_{\rm ch, pc\ cut}\ (\boldsymbol D_{\rm ch}-\boldsymbol M_{\rm ch})_{\rm pc, cut} \nonumber \\ &=& (\boldsymbol D_{\rm ch}-\boldsymbol M_{\rm ch})_{\rm pc, cut}^{\rm t}\ (\boldsymbol D_{\rm ch}-\boldsymbol M_{\rm ch})_{\rm pc, cut} \end{eqnarray*}

(23)

The marginalization version of method C can be viewed as the following. By reorganizing equation (21), we can build a baryonic model generator as

\begin{eqnarray*} \boldsymbol M_C(\boldsymbol p_{\rm co}, \boldsymbol Q) = \boldsymbol M(\boldsymbol p_{\rm co}) + \mathbf {L}\sum _{n = 1}^{m} Q_n\ {\mathbf {PC}}_n (\boldsymbol p_{\rm co}), \end{eqnarray*}

(24)

where m ≦ 9. The cosmological parameter dependence comes in through the DMO model vector, while the amplitudes of PCs are used as higher order correction for baryonic effects.

4.3 PCA in fractional difference matrix (method D)

Instead of using the difference matrix |$\boldsymbol \Delta$| to perform PCA, Mohammed & Gnedin (2018) identified PCs based on the fractional difference matrix |$\boldsymbol R$| defined as

\begin{eqnarray*} && {\boldsymbol R} = \begin{bmatrix}{|} & {\,\,\,\,\,\,|} &\,& {\,\,\,\,\,\,|}\\ \frac{\boldsymbol B_1-\boldsymbol M}{\boldsymbol M} & \quad \frac{\boldsymbol B_2-\boldsymbol M}{\boldsymbol M} & \quad \ldots & \quad \frac{\boldsymbol B_9-\boldsymbol M}{\boldsymbol M} \\ {|} & {\,\,\,\,\,\,|} &\,& {\,\,\,\,\,\,|}\\ \end{bmatrix}_{990 \times 9} \nonumber \\ && = \mathbf {U}_{\rm R} \ \boldsymbol \Sigma _{\rm R}\ \mathbf {V}_{\rm R}^{\rm t} \ . \end{eqnarray*}

(25)

One fundamental difference between the fractional difference matrix |$\boldsymbol R$| and the difference matrices |$\boldsymbol \Delta$| or |$\boldsymbol \Delta _{\rm chy}$| is that |$\boldsymbol R$| does not depend on cosmology, given our assumption of equation (C2). After the |$\mathbf {U}_{\rm R}$| is derived by SVD analysis, a model for the observables with baryonic physics degrees of freedom spanned by OWLS can be built as

\begin{eqnarray*} \boldsymbol M_R(\boldsymbol p_{\rm co}, \boldsymbol Q) = \boldsymbol M(\boldsymbol p_{\rm co})\ \left[ 1 + \sum _{n = 1}^{m} Q_n\ {\mathbf {PC}}_n \right], \end{eqnarray*}

(26)

where m ≦ 9, and |${\boldsymbol Q} = \lbrace Q_1, Q_2, ... , Q_m \rbrace$| are the free parameters controlling the amplitudes of PCs, and PC₁–PC₉ are in the first nine columns of |$\mathbf {U}_{\rm R}$|⁠. Similar to the methodology in Section 4.1.2, the likelihood function for the cosmological parameters can be derived by marginalizing over the amplitude parameters:

\begin{eqnarray*} L(\boldsymbol D| \boldsymbol p_{\rm co}) &\propto& \int d\boldsymbol Q \times \exp \left (-\frac{1}{2} \left[ (\boldsymbol D-\boldsymbol M_R(\boldsymbol p_{\rm co}, \boldsymbol Q))^{\rm t}\ \mathbf {C}^{-1}\right.\right.\nonumber \\ &&\times \,(\boldsymbol D-\boldsymbol M_R(\boldsymbol p_{\rm co}, \boldsymbol Q)) \big] \bigg). \end{eqnarray*}

(27)

Similar to the concept mentioned in Section 4.2, performing PCs on the matrix |$\boldsymbol R$| can be viewed as putting the weight of |$1/\boldsymbol M$| into the PCA analysis. Since |$\boldsymbol M$| decreases with increasing ℓ, and the overall amplitude of |$\boldsymbol M$| increases towards higher redshift, after taking its inverse, we upweight data points at smaller scales and lower redshift. The fractional difference vectors of OWLS that go into columns of |$\boldsymbol R$| are plotted in Fig. 3. The PCs derived from |$\boldsymbol R$| are expected to be more efficient in accounting for smaller scale and lower redshift variation of the observables due to baryonic physics.

4.4 |$\rm{\small HMCODE}$| (method M)

Finally, we compare the above PCA-based methods with the halo model-based approach proposed by Mead et al. (2015), hmcode. hmcode utilizes two halo profile-related parameters to capture the impact of baryonic physics on the matter power spectrum: the amplitude of the concentration–mass relation (A) and a halo bloating parameter (η₀) controlling the (mass-dependent) change of halo profiles. We refer readers back to Section 3.1 for a brief summary of this approach.

There exists some level of degeneracy between A and η₀, as shown in fig. 6 of M15. Thus, when implementing the likelihood analysis, one can either vary both of the parameters, or change only the single parameter A while fixing

\begin{eqnarray*} \eta _0 = 0.98 - 0.12 A. \end{eqnarray*}

(28)

For example, Joudaki et al. (2017) applied only varying A to marginalize over baryonic physics in CFHTLenS cosmic shear, while MacCrann et al. (2017) and Troxel et al. (2018) varied both parameters to marginalize over baryonic physics in the DES.

Equation (28) is derived based on the OWLS simulation suite. We will test whether it remains valid for the baryonic physics scenarios in Eagle/MB2/Illustris/Horizon-AGN for our forecasted scenario with LSST-like statistical power. Also, we will compare the performances of hmcode (marginalization over halo model parameters) with the above PCA-based methods.

5 PERFORMANCES OF BARYONIC MITIGATION TECHNIQUES

In this section, we present our simulated likelihood analysis for the different baryonic mitigation schemes listed in Table 3. We refer readers back to Section 3 for a description of the simulated likelihood analysis set-up.

Ideally, we need a baryonic physics mitigation strategy that can reduce the biases in cosmological parameters due to inaccuracies in theoretical modelling (as demonstrated in Fig. 4) to a level that is much smaller than the statistical uncertainties. In addition, we hope that the increase in statistical errors on cosmological parameters due to the additional nuisance parameters will be as small as possible. Throughout this section, we will use a criterion of bias <0.5σ (where σ represents the marginalized statistical error) for individual cosmological parameters to evaluate whether a method is effective in mitigating the uncertainties due to baryonic physics under various baryonic scenarios. We also compare their performance based on the degradation of cosmological constraining power through the size of the 1D marginalized uncertainties on cosmological parameters.

5.1 PC mode exclusion versus marginalizing over PC amplitude

We start by presenting the results for methods A and B (see Table 3) with their PCs described using the same difference matrix |$\boldsymbol \Delta$|⁠. In method A, we modify the data vector by excluding the first few PC modes and modify the covariance self-consistently as well. In method B, the data vector and covariance matrix are unmodified, but we introduce free parameters describing the PC amplitudes to marginalize over in the likelihood analysis.

Mathematically, methods A and B are equivalent if no priors are set on the baryonic physics parameters. From an information perspective, when removing data points (and the corresponding covariance elements), we lose all of the information that can constrain the amplitudes of the excluded PC modes. Thus, this should be equivalent to marginalizing over PC amplitudes with uninformative priors.

In Fig. 8, we use simulated likelihood analyses based on Horizon-AGN (yellow dot–dashed & black dotted) and Illustris (blue solid and red dashed) to demonstrate the excellent consistency between PC mode exclusion and PC amplitude marginalization. For the case of Illustris, large residual biases still exist after performing the baryonic physics mitigation. We will discuss this issue in Section 5.3. Although not shown, we have also confirmed the consistency between methods A and B for Eagle and MB2.

Figure 8.

Comparison of the posterior distributions of cosmological parameters between baryonic physics mitigation techniques A and B listed in Table 3. The yellow dot–dashed and black dotted contours indicate the 1σ contours of the posterior probability distributions obtained from methods A and B, respectively, for the Horizon-AGN simulation after excluding or marginalizing over the first PC modes. Similarly, the blue solid and red dashed contours indicate the case for Illustris after excluding or marginalizing over three PC modes. The excellent match between the posterior probability distributions for cosmological parameters between methods A and B confirms that the PC exclusion formula shown in equation (13) is conceptually equivalent to marginalizing over PC amplitudes.

In conclusion, we have demonstrated with examples that the PC exclusion formula shown in equation (13) gives consistent results as when marginalizing over PC amplitudes with an uninformative prior. Method B can provide baryonic information through the constrained PC amplitudes, which can be used as a standard to quantify baryonic effects. So far, we allow the PC amplitudes to vary from (−∞, ∞). Reducing the prior ranges on PC amplitudes could potentially increase the constraining power on cosmology if we can develop a consistent way of setting the priors on PC amplitudes, given our knowledge of baryonic physics. The downside of method B is that it requires running longer MCMC chains to ensure convergence due to an increase in the dimensionality of parameter space. Therefore if one does not care to learn about baryonic physics, and would simply like to marginalize over it, we recommend method A.

5.2 Comparison between various PC construction methods

Here we compare the performances of the PCA-based methods listed Table 3. We have already shown in Section 5.1 that PC mode exclusion (method A) is equivalent to marginalizing over PC amplitudes (method B), so here we only compare methods A, C, and D.

The fundamental difference between these PCA methods is the way the PCs are constructed from the training simulations, which affects their efficiency in describing how baryonic physics modifies the data vectors on larger or smaller scales. We refer readers back to Section 4 for more details about this formalism. Briefly, when PCs are derived from |$\boldsymbol \Delta$| (method A; equation 8), they are most efficient in describing the difference vector |$\boldsymbol D-\boldsymbol M$|⁠. For PCs trained from |$\boldsymbol \Delta _{\rm chy}$| (method C; equation 20), they are most efficient in describing the noise-weighted difference, |$\boldsymbol D_{\rm chy}-\boldsymbol M_{\rm chy} = \mathbf {L}^{-1}(\boldsymbol D-\boldsymbol M)$|⁠, due to baryonic physics. Finally, PCs trained from |$\boldsymbol R$| (method D; equation 25) are most efficient in describing variations in the fractional difference |$\frac{\boldsymbol D-\boldsymbol M}{\boldsymbol M}$| from baryonic effects.

Fig. 9 shows the median of the marginalized 1D posteriors of cosmological parameters under different baryonic physics mitigation techniques for data vectors derived from our four test simulations. The lower and upper error bars represent for the 16th and 84th quantiles of the 1D marginalized posterior distribution. The x-axes indicate numbers of PC modes excluded or numbers of marginalization parameters used in the analysis. We select some cases from Fig. 9 and present their 1D and 2D posteriors in Fig. 10. The brown crosses in Fig. 9 indicate the case when no baryonic physics mitigation scheme is applied. One can see that the deviation from the fiducial cosmological parameters exceeds 1σ for all of our test baryonic scenarios. This is also shown in Fig. 4 on the 2D posterior contours. The blue-circle, red-triangle, and yellow-square markers indicate the results of performing baryonic physics mitigation by PCA-based methods A, C, and D, respectively. When the modifications of the data vectors due to baryonic physics are relatively weak as in MB2/Eagle/Horizon-AGN, we find that removing up to two PC modes is sufficient to marginalize baryonic bias to within 1σ¹⁰ for the cosmological parameters presented here. For the Illustris simulation, due to its strong baryonic feedback, we need to remove up to six PCs for the 1σ posteriors to include the fiducial cosmological parameters.

Figure 9.

Marginalized 1D constraints on cosmological parameters when using different baryonic physics mitigation techniques from Table 3. Each panel is for a different input data vector based on a different hydrodynamical simulation as explained in the plot title. The grey dashed horizontal lines indicate the fiducial cosmological values. The marker position, the lower and upper error bars indicate the median, the 16th and the 84th percentiles of marginalized 1D posteriors. The brown crosses indicate the results when fitting the data vectors with the DMO-based emulator (halofit) without applying any baryonic physics mitigation technique. The blue circles, red triangles and yellow squares show the results when applying PCA-based methods A, C, and D, respectively, with their positions in the x-direction indicating how many PC modes are excluded or numbers of marginalization parameters used when doing the analysis. The black pentagons located at x = 1 indicate the result when only marginalizing over A in hmcode (with η₀ fixed via equation 28). The black pentagons located at x = 2 are the results when marginalizing over both A and η₀ in hmcode. For PCA-based methods, we find that the 1σ posteriors start to enclose the fiducial cosmology after removing two PC modes for MB2/Eagle/Horizon-AGN, while excluding six PC modes is required for more extreme baryonic scenarios of Illustris. When using hmcode to perform marginalization, except for the Illustris simulation for which marginalizing over A alone is enough, generally it is required to vary both A and η₀ to mitigate baryonic effects to within 1σ.

Figure 10.

The 2D posterior distributions on cosmological parameters for some selected cases shown in Fig. 9. Each panel is for a different input data vector based on a different baryonic physics scenario as labelled in the legend. The legend also describes which baryonic mitigation techniques are applied, and how many PC modes are excluded or the hmcode parameters marginalized over.

5.2.1 Method C is superior to methods A and D

In Fig. 11 we plot the w₀ bias (in colour-filled markers; defined as |w_0,best fit − w_0,fid|, with w_0,best fit being the median value of the marginalized posterior distribution of w₀) and the 0.5σ error of w₀ (in open markers; with σ defined as the half difference between the 16th and 84th percentile of the 1D marginalized posterior of w₀) for various baryonic physics mitigation schemes. For a method to be effective in mitigating baryonic-induced parameter biases, we require that the bias be below the 0.5σ errors. For all baryonic physics scenarios, we observe that at fixed number of excluded PC modes, the biases of method C (red-filled triangles) are nearly always smaller than methods A (blue-filled circles) and D (yellow-filled squares). If focusing on the lower left panel of Fig. 10, using Illustris when removing three PC modes as an examples, one can see that the 2D 1σ posteriors of method C (red dot–dashed curves) enclose the fiducial cosmology, while the posteriors of method A (light blue solid curves) are several σ away. Based on these, we conclude that PCs build from |$\boldsymbol \Delta _{\rm chy}$| are potentially more effective than others to mitigate baryonic effects.

The w0 bias and statistical uncertainty under various baryonic physics mitigation techniques listed in Table 3. The darker coloured-filled markers indicate the level of w0 bias, defined as |w0,best fit − w0,fid|. The fainter unfilled markers indicate the 0.5σ statistical uncertainty, with 1σ defined as the half difference between the 16th and 84th quantiles of the marginalized 1D w0 posterior distribution. We adopt a criterion of residual bias <0.5σ error in this work when determining how many PC modes are required to mitigate biases due to baryonic physics. The four main lessons from this plot are that: (i) Of various PCA methods, at fixed number of excluded PC modes, the biases of method C are nearly always smaller than methods A and D, indicating method C is the most efficient PCA method. (ii) For MB2/Eagle/Horizon-AGN simulations, removing ≥2 PC modes is enough to mitigate baryonic physics-induced bias to 0.5σ. For the Illustris simulation, all PCA methods fail to pass the bias <0.5σ criteria even after nine PC modes are removed. (iii) No matter which PCA method (A, C, or D) is applied, after removing ≥6 PC modes, the statistical errors on w0 converge to similar values. (iv) hmcode works particularity well for the Illustris simulation. For MB2/Eagle/Horizon-AGN, marginalizing over both A and η0 is required to safely mitigate baryons to 0.5σ.

Figure 11.

The w₀ bias and statistical uncertainty under various baryonic physics mitigation techniques listed in Table 3. The darker coloured-filled markers indicate the level of w₀ bias, defined as |w_0,best fit − w_0,fid|. The fainter unfilled markers indicate the 0.5σ statistical uncertainty, with 1σ defined as the half difference between the 16th and 84th quantiles of the marginalized 1D w₀ posterior distribution. We adopt a criterion of residual bias <0.5σ error in this work when determining how many PC modes are required to mitigate biases due to baryonic physics. The four main lessons from this plot are that: (i) Of various PCA methods, at fixed number of excluded PC modes, the biases of method C are nearly always smaller than methods A and D, indicating method C is the most efficient PCA method. (ii) For MB2/Eagle/Horizon-AGN simulations, removing ≥2 PC modes is enough to mitigate baryonic physics-induced bias to 0.5σ. For the Illustris simulation, all PCA methods fail to pass the bias <0.5σ criteria even after nine PC modes are removed. (iii) No matter which PCA method (A, C, or D) is applied, after removing ≥6 PC modes, the statistical errors on w₀ converge to similar values. (iv) hmcode works particularity well for the Illustris simulation. For MB2/Eagle/Horizon-AGN, marginalizing over both A and η₀ is required to safely mitigate baryons to 0.5σ.

To understand why method C performs better, we can go back to the χ² equation when both |$\boldsymbol D$| and |$\boldsymbol M$| are set at |$\boldsymbol p_{\rm co, fid}$|⁠:

|$\left\langle \chi _{\rm bary}^2 \right\rangle$| quantifies the amount of χ² caused by baryonic uncertainties. The noise term in our likelihood simulation is zero by construction. Our goal is to reduce |$\left\langle \chi _{\rm bary}^2 \right\rangle$| to avoid bias in cosmological parameters due to baryonic physics. From equation (29), one can see that when doing PC mode exclusion in |$\boldsymbol D_{\rm ch}-\boldsymbol M_{\rm ch}$| (with PCs constructed in |$\boldsymbol \Delta _{\rm chy}$|⁠), there is a direct connection in reducing |$\left\langle \chi _{\rm bary}^2 \right\rangle$|⁠, while when doing PC mode exclusion in |$\boldsymbol D-\boldsymbol M$| (with PCs constructed in |$\boldsymbol \Delta$|⁠), the covariance matrix in between makes the reduction of baryonic uncertainties less direct.

5.2.2 Error bars converge for all PCA methods

Going back to Fig. 11, and focusing on the trend in the 0.5σ error bars of w₀ shown in open fainter coloured markers. Generally, error bars grow as more PC modes are excluded (see also Fig. 10 for the growth of error ellipse on 2D posteriors). The size of the error bars varies among the different PCA methods when fewer PC modes are excluded, but eventually converges/saturates to similar error bar sizes when excluding ≳6 PC modes, independent of how PCs are constructed. This means that the PCs fully absorb the range of matter power spectrum modifications due to baryonic physics across the nine OWLS simulation, characterizing them using six dominant degrees of freedom; the last three PC modes are subjected to very small singular values (σ as depicted in Fig. 5) such that only a tiny amount of baryonic fluctuation would be projected on them. In principle, including more training samples with different features would enrich the PC pool, increasing the number of effective degrees of freedom to characterize other possible baryonic scenarios.

5.3 PCA framework versus hmcode

We now move to a more detailed comparison of the two main ways to marginalize over baryonic uncertainties, namely the PCA-based methods and the halo-model-based approach. Since we already compared in Section 5.2 that of all PCA methods listed in Table 3, method C is more efficient than the other two in mitigating biases in cosmological parameters due to baryonic physics. In the following, we will use method C as a representative for PCA-based methods, and compare it with hmcode (method M).

5.3.1 Comparison on the effectiveness (criterion: bias <0.5σ)

We begin by discussing the performance of hmcode when using only one (A) versus two (both A and η₀) parameters to marginalize over baryonic physics. When only the parameter A is used, hmcode sets the η₀ value via equation (28). Going back to Fig. 11, with w₀ as an example, the pentagons at an x-axis value of 1 indicate the bias (black-filled) and 0.5σ error (grey-open) of only varying A in hmcode. Similarly, the pentagons at an x-axis value of 2 indicate the option for hmcode varying both A and η₀. For the Illustris simulation, both options can successfully mitigate the baryonic bias on w₀ to within our 0.5σ criterion. However, apart from Illustris, for the baryonic scenarios of MB2, Eagle, and Horizon-AGN, we find that varying only A while setting η₀ following equation (28) is not sufficient to mitigate baryonic bias. This implies that the current empirical relation described in equation (28) may not be precise enough for MB2/Eagle/Horizon-AGN-like data vectors with LSST-like statistical power. We therefore recommend that the extra freedom carried by η₀ is needed for upcoming weak lensing surveys to effectively mitigate the impact of baryonic physics on cosmological weak lensing measurements. This is even more true in light of recent findings that indicate that our Universe is not like Illustris, for which the AGN feedback is known to be too strong such that the baryon fractions in massive haloes are too low compared with observations (Haider et al. 2016). In Fig. D1 of Appendix D, we also provide similar bias and error plots for other cosmological parameters: Ω_m, σ₈, and w_a. The same conclusion holds for HMcode on these cosmological parameters (as shown in the filled and open pentagons), except for the w_a constraint for Illustris (Fig. D1 l), where varying only A is not enough to mitigate w_a bias to within 0.5σ.

For the PCA-based method C, as indicated in red triangles of Fig. 11 for w₀ and Fig. D1 for Ω_m, σ₈, and w_a, we find that removing ≥3 PC modes is sufficient to mitigate baryonic uncertainties to within 0.5σ for all cosmological parameters considered here, if our Universe has a baryonic physics scenarios like MB2/Eagle/Horizon-AGN.

For the case of the Illustris simulation, we find that the PCA method fails to mitigate baryonic biases to within 0.5σ for w₀ and Ω_m (Fig. D1d), even after nine PC modes are removed, but just passes the threshold for σ₈ (Fig. D1h) and w_a (Fig. D1 l) after removing seven PC modes. We note that this is likely not a major concern as the baryonic effects of Illustris are unrealistically large, and the next-generation IllustrisTNG hydrodynamical simulation (Pillepich et al. 2018; Springel et al. 2018) will address the defects of the old version.

We provide a summary of the results from the above discussion in Table 4. In Appendix E, we further provide the χ² values computed at the best-fitted cosmological parameters from various baryon mitigation models.

Table 4.

Summary of the effectiveness of baryonic physics mitigation methods in reducing biases to within 0.5σ for various cosmological parameters under different baryonic scenarios. A cosmological parameter is struck out if a mitigation method fails to pass our criterion of bias <0.5σ, where σ represents the marginalized statistical error (see Section 5.3.1 for detail).

5.3.2 Comparison on the level of degradation on cosmology

We now compare the error bars on cosmological parameter constraints between PCA method C and hmcode, on baryonic scenarios of MB2/Eagle/Horizon-AGN, where both methods successfully mitigate the baryonic biases to within 0.5σ. The pink open triangles in Fig. 11 indicate the 0.5σ error of w₀ under method C, and the grey open pentagons indicate the same for hmcode. Besides w₀, other cosmological parameters are also shown in Fig. D1. As discussed in Section 5.2.2, one nice feature of the PCA method is that the error bars converge to a certain level when excluding ≥6 PC modes. We find that the converged error bars for method C generally are smaller than those for hmcode, even though hmcode only utilizes two parameters to marginalize over baryonic physics uncertainties while the PCA method needs three parameters to mitigate baryonic effects to 0.5σ in the case of MB2/Eagle/Horizon-AGN. A similar result can be seen from the 1σ 2D posteriors shown in Fig. 10.

5.3.3 Baryonic feature constraint from hmcode

In Fig. 12, we plot the 2D posterior distributions on A and η₀ for various baryonic scenarios in coloured contours, along with equation (28) shown in the black line. We can see that although relying on this A–η₀ relationship is not effective enough to mitigate baryonic bias in most of baryonic recipes under LSST-like survey, the suggested relationship is still good enough to pass the 68 per cent contours in all cases. Therefore, instead of following a fixed relationship like equation (28) or allowing both A and η₀ to vary unboundedly, setting an A-dependent prior on η₀ may help recover some cosmological constraining power while still reducing biases in cosmological parameters when using hmcode.

The 2D constraints on the hmcode halo structure parameters A and η0 from our simulated likelihood analysis for baryonic scenarios of Eagle (blue/solid), MB2 (red/dotted), Illustris (green/dot–dashed), and Horizon-AGN (yellow/dashed). The black line plots the relationship between A and η0 that is used to provide single-parameter fit in hmcode. Both 68 per cent and 95 per cent confidence levels are shown.

Figure 12.

The 2D constraints on the hmcode halo structure parameters A and η₀ from our simulated likelihood analysis for baryonic scenarios of Eagle (blue/solid), MB2 (red/dotted), Illustris (green/dot–dashed), and Horizon-AGN (yellow/dashed). The black line plots the relationship between A and η₀ that is used to provide single-parameter fit in hmcode. Both 68 per cent and 95 per cent confidence levels are shown.

In Fig. 13, we compare the power spectra generated from hmcode at the best-fitted values of A, η₀ (cross symbols in Fig. 12) to the original power spectra derived directly from the hydrodynamical simulations at redshift z = 0, 1, 2, 3. We note that this is not a fair comparison because the underlying P_δ(k, z) is not constrainable from the projected tomographic power spectra, unless the tomographic bins are fine enough to recover the full 3D information.¹¹ Here we simply use these plots to understand the effects of hmcode parameters on P_δ(k, z). First, hmcode does not have degrees of freedom for the cooling feature of hydro simulations, which leads to a turn-over in the power spectrum ratio at |$k \gtrsim 10 \ h^{-1}\, \mathrm{Mpc}$|⁠. This is expected as according to M15, the halo-model power is accurate to ≈5 per cent only for |$k \le 10 \ h^{-1}\, \mathrm{Mpc}$| and z ≤ 2. Because of this limitation, hmcode tends to produce a shallower suppression of power for a given k (when |$k \le 10 \ h^{-1}\, \mathrm{Mpc}$|⁠) compared with MB2/Eagle/Horizon-AGN, in order to compensate for the lack of cooling prescription at |$k \gt 10 \ h^{-1}\, \mathrm{Mpc}$|⁠. Secondly, the redshift evolution of hmcode power spectra is too monotonic, lacking freedom to capture the complicated evolutionary pattern that generally exists in hydrodynamical simulations. The redshift evolution patterns can be very different for various baryonic scenarios. hmcode’s inability to model redshift evolution may be due to the fact that only two nuisance parameters are involved in describing the complex scale and redshift dependences of baryonic effects seen in the simulations. One straightforward suggestions is to add redshift dependence to A and η₀. Further development of halo model approaches to account for the modification of the matter power spectrum for |$k \gt 10 \ h^{-1}\, \mathrm{Mpc}$| is also needed. Although the current model does not describe all the complexity of possible modifications of P(k) due to baryonic physics, we can still use hmcode to gain insight into the strength of feedback from the constrained values of A and η₀. As shown in Fig. 12, the Illustris-like universe tends to have small A and large η₀.

Comparisons of power spectra generated from hmcode at the best-fitting A and η0 values (solid lines) and power spectra directly derived from hydrodynamical simulations (dotted or dashed lines) at z = 0, 1, 2, 3. The discrepancy indicates that hmcode lacks degrees of freedom to account for the cooling effect at high k, and that it is too simplified to capture the complex redshift evolution patterns present in hydrodynamical simulations.

Figure 13.

Comparisons of power spectra generated from hmcode at the best-fitting A and η₀ values (solid lines) and power spectra directly derived from hydrodynamical simulations (dotted or dashed lines) at z = 0, 1, 2, 3. The discrepancy indicates that hmcode lacks degrees of freedom to account for the cooling effect at high k, and that it is too simplified to capture the complex redshift evolution patterns present in hydrodynamical simulations.

5.4 Pushing to even smaller angular scales: ℓ_max of 5000

Until now, all elements of our analysis have been based on mock tomographic shear data vectors with ℓ_max ≈ 2000, which is a conservative choice under the limitation that we lack accurate power spectra at |$k \gt 30 \ h^{-1}\, \mathrm{Mpc}$|⁠. The ℓ_max ≈ 2000 cut assures that various extrapolation curves on P_δ(k) ratio out to |$k \gt 30 \ h^{-1}\, \mathrm{Mpc}$| would not cause significant change on the resulting C^ij(ℓ) data vector (see Appendix B for details on how the scale cut limit is determined).

To further test the limits of the proposed baryonic mitigation techniques, we generate mock C^ij(ℓ) data vectors with ℓ_max ≈ 5000 (based on the quadratic extrapolation trends derived by fitting the P_δ(k) ratio in |$k \in [10, 30] \ h^{-1}\, \mathrm{Mpc}$|⁠, see the red curve in Fig. B1 as a demonstration), and then perform the same simulated likelihood analyses with mitigation techniques described in Sections 3 and 4. The only difference is that we append three extra data points that with equal logarithmic spacing in ℓ ∈ [2060, 5000] to the original data vector |$\boldsymbol D$| in each tomographic bin. The new length of |$\boldsymbol D$| is thus extended to 55 × (18 + 3) = 1155 data points (see Section 3.2 for the original format of |$\boldsymbol D$|⁠). The covariance matrix is also updated accordingly.

The dark grey contours in Fig. 14 indicate the 2D posterior distributions of the cosmological parameters, when no baryonic physics mitigation technique is applied. Compared with the similar plot shown in Fig. 4, but for ℓ_max ≈ 2000, the biases on cosmological parameters increases to 2σ–19σ for the various cosmological parameters in an Illustris-like universe, and around 1.5σ–6σ for the other cases. This amount of bias is consistent with fig. 5 of E15, who showed the posterior distributions for ℓ_max ∼ 5000 for the OWLS baryonic physics scenarios for an LSST-like likelihood simulations.

Figure 14.

Similar to Fig. 4, but for the cases when pushing our mock observables towards ℓ_max ≈ 5000.

Since we showed in Section 5.2 that method C is the most efficient of the PCA-based methods, we only run simulated likelihood analyses with PCA-based method C, compared with method M using hmcode for ℓ_max ≈ 5000. In Fig. 15, we plot the marginalized w₀ bias (colour-filled symbols) and 0.5σ w₀ uncertainty (open symbols) as a function of the number of excluded PC modes in method C (blue diamonds) and hmcode (yellow hexagons). (The red triangles and black pentagons are simply copies of the data points shown in Fig. 11, to enable easier comparison of results with ℓ_max of 2000 versus 5000.) The bias and error plots for Ω_m, σ₈, and w_a are also provided in Fig. D1.

$The w0 bias and uncertainty for the baryonic physics mitigation methods C (as our representative for PCA-based method) and M (halo model-based method/hmcode) (see Table 3 for details). The darker coloured-filled markers indicate the level of w0 bias, defined as |w0,best fit − w0,fid|. The fainter coloured-open markers indicate the 0.5σ w0 uncertainty, with 1σ defined as the half difference between the 16th and 84th percentile of the marginalized 1D w0 posterior distribution. The four key results on this plot are (i) the PCA method (blue diamonds) mitigates bias to within 0.5σ for the milder baryonic physics scenarios – MB2, Eagle, and Horizon-AGN – after excluding more than three PC modes, but fails for the Illustris scenario. (ii) When marginalizing over both A and η0, hmcode(yellow hexagons) mitigates w0 to within 0.5σ for all baryonic scenarios. (iii) For baryonic scenarios of MB2 and Eagle, the cases for which both methods work, the error bars for the PCA method (light blue open diamonds) converge to smaller values compared with hmcode(yellow open hexagons). (iv) Including more small-scale data in the analysis reduces the statistical error to $\sim 20{{\ \rm per\ cent}}$ for PCA method (light blue open diamonds versus pink open triangles) and to about $12\!-\!30{{\ \rm per\ cent}}$ for hmcode (yellow open hexagons versus grey open pentagons). (v) Including more training simulations in PCA improves reducing the w0 bias induced by neglecting baryonic effects to $\sim 15{{\ \rm per\ cent}}$ for the case of Illustris (brown filled stars versus blue filled diamonds), although the improvement on residual bias is not reaching our criterion of <0.5σ.$

Figure 15.

The w₀ bias and uncertainty for the baryonic physics mitigation methods C (as our representative for PCA-based method) and M (halo model-based method/hmcode) (see Table 3 for details). The darker coloured-filled markers indicate the level of w₀ bias, defined as |w_0,best fit − w_0,fid|. The fainter coloured-open markers indicate the 0.5σ w₀ uncertainty, with 1σ defined as the half difference between the 16th and 84th percentile of the marginalized 1D w₀ posterior distribution. The four key results on this plot are (i) the PCA method (blue diamonds) mitigates bias to within 0.5σ for the milder baryonic physics scenarios – MB2, Eagle, and Horizon-AGN – after excluding more than three PC modes, but fails for the Illustris scenario. (ii) When marginalizing over both A and η₀, hmcode(yellow hexagons) mitigates w₀ to within 0.5σ for all baryonic scenarios. (iii) For baryonic scenarios of MB2 and Eagle, the cases for which both methods work, the error bars for the PCA method (light blue open diamonds) converge to smaller values compared with hmcode(yellow open hexagons). (iv) Including more small-scale data in the analysis reduces the statistical error to |$\sim 20{{\ \rm per\ cent}}$| for PCA method (light blue open diamonds versus pink open triangles) and to about |$12\!-\!30{{\ \rm per\ cent}}$| for hmcode (yellow open hexagons versus grey open pentagons). (v) Including more training simulations in PCA improves reducing the w₀ bias induced by neglecting baryonic effects to |$\sim 15{{\ \rm per\ cent}}$| for the case of Illustris (brown filled stars versus blue filled diamonds), although the improvement on residual bias is not reaching our criterion of <0.5σ.

Similar to Section 5.3, we rely on the bias <0.5σ criterion to validate the effectiveness of baryonic physics mitigation methods, with the results summarized in Table 5. First of all, for hmcode, varying only A is not sufficient to mitigate the bias to within 0.5σ for the Illustris simulation, which hmcode is particularly good at describing. Both A and η₀ must be varied to meet our criterion for MB2 and Eagle. For Horizon-AGN and Illustris, hmcode works well for some cosmological parameters, while it fails for the others. For the PCA method, it still works for baryonic scenarios of MB2/Eagle/Horizon-AGN when pushing to ℓ_max ≈ 5000, but continues to fail to meet our criterion for the Illustris scenario.

Table 5.

http://www.astro-wise.org/projects/KIDS/

Similar to Table 4, but now for the likelihood simulations with mock observables pushing to ℓ_max ≈ 5000.

In terms of degradation on cosmological parameter constraints after marginalization, for the cases of MB2 and Eagle, the scenarios in which both PCA and hmcode succeed in mitigating the bias to within 0.5σ, we see that PCA method yields smaller converged error bars (light blue open diamonds) compared with hmcode using two parameters (yellow open hexagons) to do marginalization.

Does extending the data vectors to ℓ_max ≈ 5000 help to better constrain cosmological parameters compared with ℓ_max ≈ 2000? As shown in Fig. 11, for the PCA method, we observe that the converged w₀ errors for the ℓ_max ≈ 5000 cases (light blue open diamonds) are smaller by |$\sim 20{{\ \rm per\ cent}}$| compared with the errors for ℓ_max ≈ 2000 (pink open triangles). For the cases of hmcode when varying both A and η₀, the w₀ errors reduce by |$\sim 12{{\ \rm per\ cent}}$| for MB2, |$\sim 18{{\ \rm per\ cent}}$| for Eagle, and |$\sim 30{{\ \rm per\ cent}}$| for Horizon-AGN, after extending data points to ℓ_max ≈ 5000 (yellow open hexagons) from ℓ_max ≈ 2000 (grey open pentagons). This means that we do benefit from additional constraining power when including more small-scale data in the analysis, if the baryonic physics effect in our Universe is near the physics implemented in Eagle/MB2/Horizon-AGN.

5.5 Including more AGN prescriptions in the training set

The reason why the PCA method fails to mitigate the impact of baryonic physics on the matter power spectrum in Illustris is that the PCs built from the current training set do not capture the strong variation with k to explain its intense feedback feature. As also discussed in Mohammed & Gnedin (2018), it is better to have a training set that comprises adequately exotic but reasonable models. Of the nine training OWLS simulations, only the OWLS-AGN contains an AGN feedback prescription, and we rely on this single AGN model to explain Illustris. However, this shortcoming can be fixed by incorporating more training simulations into the PCA, so that the resulting PCs will include more degrees of freedom to explain the broader range of outcomes due to baryonic physics.

We try to address the above Illustris problem by including the baryonic scenarios of MB2/Eagle/Horizon-AGN in our training set, and then build a |${\boldsymbol \Delta _{\rm ch}}$| matrix with 12 columns to extend the capability of the derived PCs. In the bottom left panel of Fig. 15, we plot the marginalized w₀ bias (brown filled stars) and error (light brown open stars) for Illustris simulation with PCs trained from the 12 baryonic scenarios, and a scale cut at ℓ_max ≈ 5000. (The results for other cosmological parameters can be found in the last column of Fig. D1 as well.) With this expanded training set, the PCA method now reduces the w₀ bias from 1.5σ (blue filled diamonds) to 0.8σ (brown filled stars), which is an improvement but still does not enable us to meet our criterion of bias <0.5σ.

The error bars when using 12 simulations in the training set converge after removing ≥6 PC modes. Notice that the converged errors become bigger when the PCs are trained from 12 simulations rather than just the 9 OWLS simulations. By including more simulations to construct the PCs, we also enlarge the range of baryonic uncertainties, which is a trade-off to ensure a more effective removal of biases due to baryonic physics for a broad range of baryonic physics scenarios. However, we can also imagine trying to rely on external information from independent observations to rule out baryonic scenarios that fail to describe our Universe. By carefully controlling the uncertainty range of the training set, we could potentially improve the cosmological constraining power after mitigation.

6 SUMMARY AND DISCUSSION

We have explored the two major approaches to mitigate uncertainties in cosmic shear tomographic power spectra due to baryonic physics, with the goal of understanding their performance on cosmological constraints for the upcoming LSST survey. The first approach is the PCA-based analysis proposed by E15. Based on a set of training hydrodynamical simulations with various baryonic prescriptions (spanned by OWLS in this work), a difference matrix (equation 8) is computed. Its columns are filled with difference vectors between these hydro and DMO simulations. PCA is then performed on the difference matrix to find dominant PC modes that can be used to model baryonic effects in other hydro simulations. The second approach is the halo model-based method coded in the package of hmcode by M15, which utilizes two halo structural parameters (A and η₀) related to the halo concentration–mass relation to marginalize over baryonic uncertainties.

We examine the basics of the PCA formalism and provide a modification to properly account for the change of covariance matrix after removal of PC modes. Under the new formalism, we demonstrate that PC mode removal is equivalent to marginalization over PC amplitudes (see Section 5.1). Instead of difference matrices, we also investigate PCA on other kinds of matrix forms with their columns filled with the fractional difference (equation 25) or noise-weighted difference vectors (equation 20) to quantify deviations in the matter power spectrum due to baryonic physics. The derived PC bases from different matrices vary in their efficiency in explaining baryon fluctuations at different angular scales. Difference matrix PCs can more effectively account for large-scale baryonic fluctuations, fractional difference matrix PCs are more effective at describing the small-scale fluctuations, and noise-weighted difference matrix PCs most effectively describe the scales at which the S/N is maximal. We find that performing PCA on the noise-weighted difference matrix, with the weighting factor derived via performing Cholesky decomposition on the covariance matrix (Section 4.2), is the most efficient way to mitigate the impact of baryonic physics on inferred cosmological parameters (Section 5.2). Therefore, for future application on real data, we recommend applying the noise-weighted PCA technique. It should be noted that except for method D, the current PCs are wiggling slightly in their directions at each MCMC step, when cosmology changes. If we would like to quantify baryon physics via PC amplitudes, we hope the constrained PC amplitudes are subjected to a fixed set of PCs. A more complete design of PCA algorithm therefore would be an iteration process. We will first use the current setting to find the best-fitted cosmology, and once the |$\boldsymbol p_{\rm co, best\ fit}$| is determined, we will fix PC basis at |$\boldsymbol p_{\rm co, best\ fit}$|⁠, and constrain the posteriors of PC amplitudes subjected to this PC set.

We apply both the PCA and hmcode techniques on mock shear tomographic data vectors (C^ij(ℓ)) with baryonic physics scenarios of MB2/Eagle/Illustris/Horizon-AGN. We test whether these mitigation techniques can reduce the bias in cosmological parameters induced by neglecting baryonic effects to within 0.5σ. With a scale cut at ℓ_max ≈ 2000, and for milder baryonic physics scenarios like MB2/Eagle/Horizon-AGN, both methods succeed in mitigating the impact of baryonic effects on the inferred cosmological parameters. For the PCA method, we find that excluding 3 PC modes is sufficient to mitigate the bias to within 0.5σ for Ω_m, σ₈, w₀, and w_a. For hmcode, we find that it is safer to vary both A and η₀ when performing marginalization, rather than varying only one of them and having the other follow the suggested relation in M15, at least at the level of LSST statistical power. For the Illustris scenario, only hmcode is sufficient to mitigate the bias to within 0.5σ. The PCA method fails to pass our criterion even after removing nine PC modes. With a more aggressive ℓ_max of 5000, the PCA methods still work for MB2/Eagle/Horizon-AGN, but fail for Illustris. hmcode remains sufficient for MB2/Eagle after marginalizing over two parameters but only works partially on some of the cosmological parameters for Horizon-AGN and Illustris, as summarized in Table 5.

We found that hmcode is most effective at mitigating the impact of baryonic physics for a strong feedback scenario like Illustris, because HMcode is designed to describe the impact of baryonic physics on the matter power spectrum for k ≤ 10 h Mpc⁻¹, where the main feature is the suppression of power due to feedback (Fig. 1). hmcode and halo model-based approaches in general have the advantage over PCA that they have cosmology dependence built in. Although the current version of hmcode lacks the complexity to fully describe various baryonic scenarios (Fig. 13), it provides a good summary of the level of feedback strength through two nuisance parameters (Fig. 12). Future improvements of the halo model to smaller scales of k ≥ 10 h Mpc⁻¹, as well as adding parameters to allow additional freedom in the redshift evolution of baryonic physics effects, may constrain halo structural parameters and baryonic power spectra ratio curve together with cosmology. Exploring the prior ranges on halo model parameters also help to improve cosmological parameter constraint. For example, we can use the posterior constraints from realistic hydrodynamical simulations as shown in Fig. 12 to narrow down the allowed ranges of A and η₀. Joint constraints from galaxy–galaxy lensing together with cosmic shear may also provide additional information from the data itself on the halo structure parameters (Zentner et al. 2008).

There are several advantages of the PCA method. First, it successfully mitigates quite general baryonic fluctuations and complex redshift evolution patterns, when collecting several representative training hydrodynamical simulations to conduct PCA. The complex baryonic behaviours as well as the redshift evolution would then be naturally absorbed in only a few dominant PC modes, and we can use the amplitudes of these PC modes to perform marginalization (or, equivalently, PC mode exclusion). Secondly, the PCA method efficiently accounts for baryonic uncertainties without losing too much cosmological constraining power. As discussed in Section 5.3.2, whenever both methods are successful in removing baryonic bias, the error bars are generally smaller for PCA methods compared with the errors of hmcode. It is quite important to note that even if we do not know in advance how many PC modes must be excluded to safely remove baryonic bias in our Universe, excluding all effective PC modes does not unacceptably increase the errors, which saturate at a certain limit. The maximum number of effective PC modes one can remove is equal to the total number of training simulations used in the PCA (see Section 4.1.1 for detail). Finally, the PCA method has significant flexibility to make adjustments as our knowledge of baryonic physics improves. For example, in Section 5.4 we tried to improve the bias mitigation of the Illustris simulation by including more realistic baryonic scenarios with AGN prescriptions in our training set, which enriches the space of possible baryonic uncertainties that the PCs can describe. After the inclusion of MB2/Eagle/Horizon-AGN as well as the original nine OWLS scenarios in our training set, we can further decrease the residual cosmological parameter bias compared with the results when only using the nine OWLS simulations as training set. The cost is that we lose some constraining power. The flexibility of the PCA framework makes it easy to adjust the model based on changes in our knowledge of baryonic physics, and allows us to regulate errors by controlling the input training simulations in the PCA.

There are several aspects regarding the PCA framework that we do not explore within this work. First, our training hydro simulations are all run under the flat ΛCDM model, and we assume that the baryonic fluctuations, as quantified in terms of power spectrum ratios between hydrodynamical and DMO simulations, remain fixed when cosmology changes. In reality, baryonic and cosmological effects vary jointly. Currently, there is no easy way to investigate this assumption, but future fast hydrodynamical simulations under development would be an ideal tool to systematically study this issue. Secondly, we adopted a power-law extrapolation scheme for P_δ(k) ratio at k ≥ 30 h Mpc⁻¹ (see Appendix B). The most relevant physics that governs the high k behaviour is the cooling and inner stellar density profile of galaxies. Current large-volume cosmological hydrodynamical simulations lack the resolution to resolve the physics of galaxy formation to galaxy centres. We rely on subgrid models of feedback to avoid the overcooling of gas and to mitigate the differences between the observed and simulated galaxies, but discrepancies still exist (Stinson et al. 2010; Bottrell et al. 2017; Furlong et al. 2017). This implies that P_δ(k) in the high k regime is still highly uncertain. Does the P_δ(k) ratio continue the trend of increasing monotonically? Or should it reach a saturation point at some high k regime? How to properly propagate the uncertainties of the poorly understood small-scale P_δ(k) ratio into the errors of integrated C^ij(ℓ) which in turn affects the derived PCs? These questions require higher resolution hydrodynamical simulations to further address. Finally, we have briefly demonstrated in Section 5.5 that depending on the training simulation set, the derived PCs carry different abilities to mitigate baryonic effects, and differ in the final constraining power. It would be worthwhile to systematically investigate various possible combinations of the training simulations, to find a most effective set of PCs that are able to span a wide enough range of baryonic uncertainties but with less degradation on constraining power.

In future extensions of this work, we will apply the PCA framework to a configuration-space tomographic shear analysis on real data to constrain the baryonic feature of our Universe and compare it with hydrodynamical simulations. We aim to develop a consistent way of quantifying priors of PC amplitudes, which would provide us with more constraining power on cosmological parameters by shrinking the allowed range of baryonic physics modifications of the matter power spectrum. We will also develop a PCA tool for joint analysis of galaxy–galaxy lensing and galaxy clustering observables. The full 3 × 2-point analysis then can be self-consistently analysed within the PCA framework to increase the constraining power on cosmology while safely marginalizing over baryonic physics.

ACKNOWLEDGEMENTS

We thank Alex Hall for reviewing this paper and providing useful suggestions to improve the manuscript. We thank Sukhdeep Singh, François Lanusse, Qirong Zhu, Arya Farahi, Hy Trac, Phil Bull, Tiziana Di Matteo, Alexander Mead, Irshad Mohammed, and Ananth Tenneti for many constructive discussions and feedback. RM and HH are supported by the Department of Energy Cosmic Frontier program, grant DE-SC0010118. Part of the research was carried out at the Jet Propulsion Laboratory, California Institute of Technology, under a contract with the National Aeronautics and Space Administration and is supported by NASA ROSES ATP 16-ATP16-0084 grant.

Footnotes

1

2

http://hsc.mtk.nao.ac.jp/ssp/

3

www.darkenergysurvey.org/

4

http://www.lsst.org/lsst

5

sci.esa.int/euclid/

6

http://wfirst.gsfc.nasa.gov/

7

https://github.com/alexander-mead/HMcode

8

Due to the low resolution of BAHAMAS, we are not able to include it as one of the hydrodynamical scenarios in this work (see Appendix B2 for details of our resolution requirement).

9

Although we plot |$\boldsymbol D_{\rm ch} - \boldsymbol M_{\rm ch}$| versus ℓ in Fig. 7, we note that actually the new data points are not strictly functions of the original ℓ because of the non-zero off-diagonal terms in |$\mathbf {L}^{-1}$|⁠. However, our take-away point from Fig. 7 still holds due to the fact that our covariance matrix is dominated by Gaussian noise, and thus the off-diagonal terms in |$\mathbf {L}^{-1}$| are small.

10

The 1σ criterion is a looser condition than the 0.5σ constraint we will later use for defining acceptability; we use this looser condition here as we are just trying to compare the basic behaviour of the different PCA methods here. Once we are comparing the best-performing PCA methods with hmcode, we will consistently impose a 0.5σ constraint on both.

11

If using hmcode to directly fit the 3D matter power spectrum including baryonic effects at some specific redshift, according to M15, by adjusting A(z), and η₀(z), hmcode has enough degrees of freedom to match the baryonic power spectra from the OWLS simulations to |$k \gtrsim 10 \ h^{-1}\, \mathrm{Mpc}$|⁠.

12

Chisari et al. (2018) have estimated the effect of cosmic variance on the power spectrum ratio (see their fig. 5) by subsampling the Horizon-AGN simulation with a volume that is eight times smaller than our setting here. So the expected 1σ error due to cosmic variance in their case should be on the order of 1 per cent (⁠|$\sqrt{8}$| times larger than our setting). The total spread of their subsampled power spectra ratios is consistent with our derivation within 3σ.

13

We refer readers to this link for more detail about Chi-square difference test.

REFERENCES

Abbott

T. M. C.

et al. .,

2018

,

Phys. Rev. D

,

98

,

043526

10.1103/PhysRevD.98.043526

10.1016/S0370-1573(02)00135-7

Bernardeau

F.

,

Colombi

S.

,

Gaztañaga

E.

,

Scoccimarro

R.

,

2002

,

Phys. Rep.

,

367

,

1

Blumenthal

G. R.

,

Faber

S. M.

,

Flores

R.

,

Primack

J. R.

,

1986

,

ApJ

,

301

,

27

10.1086/163867

10.1111/j.1365-2966.2009.15043.x

Booth

C. M.

,

Schaye

J.

,

2009

,

MNRAS

,

398

,

53

Bottrell

C.

,

Torrey

P.

,

Simard

L.

,

Ellison

S. L.

,

2017

,

MNRAS

,

467

,

2879

10.1093/mnras/stx276

Chabrier

G.

,

2003

,

PASP

,

115

,

763

10.1086/376392

Chang

C.

et al. .,

2013

,

MNRAS

,

434

,

2121

10.1093/mnras/stt1156

Chisari

N. E.

et al. .,

2018

,

MNRAS

,

480

,

3962

10.1093/mnras/sty2093

Cooray

A.

,

Hu

W.

,

2001

,

ApJ

,

554

,

56

10.1086/321376

10.1016/S0370-1573(02)00276-4

Cooray

A.

,

Sheth

R.

,

2002

,

Phys. Rep.

,

372

,

1

Copeland

D.

,

Taylor

A.

,

Hall

A.

,

2018

,

MNRAS

,

480

,

2247

10.1093/mnras/sty2001

Crain

R. A.

et al. .,

2015

,

MNRAS

,

450

,

1937

10.1093/mnras/stv725

Cui

W.

,

Borgani

S.

,

Murante

G.

,

2014

,

MNRAS

,

441

,

1769

10.1093/mnras/stu673

Dai

B.

,

Feng

Y.

,

Seljak

U.

,

2018

,

J. Cosmol. Astropart. Phys.

,

11

,

009

Di Matteo

T.

,

Springel

V.

,

Hernquist

L.

,

2005

,

Nature

,

433

,

604

Dubois

Y.

et al. .,

2014

,

MNRAS

,

444

,

1453

10.1093/mnras/stu1227

10.1111/j.1365-2966.2011.20236.x

Dubois

Y.

,

Devriendt

J.

,

Slyz

A.

,

Teyssier

R.

,

2012

,

MNRAS

,

420

,

2662

10.1111/j.1365-2966.2011.19502.x

Eifler

T.

,

2011

,

MNRAS

,

418

,

536

10.1051/0004-6361/200811276

Eifler

T.

,

Schneider

P.

,

Hartlap

J.

,

2009

,

A&A

,

502

,

721

Eifler

T.

,

Krause

E.

,

Schneider

P.

,

Honscheid

K.

,

2014

,

MNRAS

,

440

,

1379

10.1093/mnras/stu251

Eifler

T.

,

Krause

E.

,

Dodelson

S.

,

Zentner

A. R.

,

Hearin

A. P.

,

Gnedin

N. Y.

,

2015

,

MNRAS

,

454

,

2451

10.1093/mnras/stv2000

Foreman-Mackey

D.

,

Hogg

D. W.

,

Lang

D.

,

Goodman

J.

,

2013

,

PASP

,

125

,

306

10.1086/670067

Furlong

M.

et al. .,

2017

,

MNRAS

,

465

,

722

10.1093/mnras/stw2740

Genel

S.

et al. .,

2014

,

MNRAS

,

445

,

175

10.1093/mnras/stu1654

Giblin

B.

et al. .,

2018

,

MNRAS

,

480

,

5529

10.1093/mnras/sty2271

10.1088/0067-0049/194/2/46

Gnedin

N. Y.

,

Kravtsov

A. V.

,

Rudd

D. H.

,

2011

,

ApJS

,

194

,

46

Goodman

J.

et al. .,

2010

,

Commun. Appl. Math. Comput. Sci.

,

5

,

65

10.2140/camcos.2010.5.65

Haider

M.

,

Steinhauser

D.

,

Vogelsberger

M.

,

Genel

S.

,

Springel

V.

,

Torrey

P.

,

Hernquist

L.

,

2016

,

MNRAS

,

457

,

3024

10.1093/mnras/stw077

Harnois-Déraps

J.

,

van Waerbeke

L.

,

Viola

M.

,

Heymans

C.

,

2015

,

MNRAS

,

450

,

1212

10.1093/mnras/stv646

Hearin

A. P.

,

Zentner

A. R.

,

Ma

Z.

,

2012

,

J. Cosmol. Astropart. Phys.

,

4

,

034

10.1088/0004-637X/715/1/104

Heitmann

K.

,

White

M.

,

Wagner

C.

,

Habib

S.

,

Higdon

D.

,

2010

,

ApJ

,

715

,

104

10.1088/0004-637X/780/1/111

Heitmann

K.

,

Lawrence

E.

,

Kwan

J.

,

Habib

S.

,

Higdon

D.

,

2014

,

ApJ

,

780

,

111

10.1103/PhysRevD.70.043009

Hu

W.

,

Jain

B.

,

2004

,

Phys. Rev. D

,

70

,

043009

10.1016/j.astropartphys.2005.02.006

Huterer

D.

,

Takada

M.

,

2005

,

Astropart. Phys.

,

23

,

369

Joudaki

S.

et al. .,

2017

,

MNRAS

,

465

,

2033

10.1093/mnras/stw2665

Kaviraj

S.

et al. .,

2017

,

MNRAS

,

467

,

4739

10.1093/mnras/stx126

Khandai

N.

,

Di Matteo

T.

,

Croft

R.

,

Wilkins

S.

,

Feng

Y.

,

Tucker

E.

,

DeGraf

C.

,

Liu

M.-S.

,

2015

,

MNRAS

,

450

,

1349

10.1093/mnras/stv627

Kitching

T. D.

,

Verde

L.

,

Heavens

A. F.

,

Jimenez

R.

,

2016

,

MNRAS

,

459

,

971

10.1093/mnras/stw707

10.1088/0067-0049/192/2/18

Komatsu

E.

et al. .,

2011

,

ApJS

,

192

,

18

Krause

E.

,

Eifler

T.

,

2017

,

MNRAS

,

470

,

2100

10.1093/mnras/stx1261

Krause

E.

et al. .,

2017

, preprint (arXiv:1706.09359)

Lawrence

E.

et al. .,

2017

,

ApJ

,

847

,

50

10.3847/1538-4357/aa86a9

MacCrann

N.

et al. .,

2017

,

MNRAS

,

465

,

2567

10.1093/mnras/stw2849

10.1146/annurev-astro-081817-051928

Mandelbaum

R.

,

2018

,

ARA&A

,

56

,

393

10.1111/j.1365-2966.2012.20879.x

Martizzi

D.

,

Teyssier

R.

,

Moore

B.

,

Wentz

T.

,

2012

,

MNRAS

,

422

,

3081

10.1111/j.1365-2966.2010.18033.x

McCarthy

I. G.

,

Schaye

J.

,

Bower

R. G.

,

Ponman

T. J.

,

Booth

C. M.

,

Dalla Vecchia

C.

,

Springel

V.

,

2011

,

MNRAS

,

412

,

1965

McCarthy

I. G.

,

Schaye

J.

,

Bird

S.

,

Le Brun

A. M. C.

,

2017

,

MNRAS

,

465

,

2936

10.1093/mnras/stw2792

Mead

A. J.

,

Peacock

J. A.

,

Heymans

C.

,

Joudaki

S.

,

Heavens

A. F.

,

2015

,

MNRAS

,

454

,

1958

10.1093/mnras/stv2036

Mead

A. J.

,

Heymans

C.

,

Lombriser

L.

,

Peacock

J. A.

,

Steele

O. I.

,

Winther

H. A.

,

2016

,

MNRAS

,

459

,

1468

10.1093/mnras/stw681

Mohammed

I.

,

Gnedin

N. Y.

,

2018

,

ApJ

,

863

,

173

10.3847/1538-4357/aad3b1

Mohammed

I.

,

Martizzi

D.

,

Teyssier

R.

,

Amara

A.

,

2014

, preprint (arXiv:1410.6826)

Morrison

C. B.

,

Schneider

M. D.

,

2013

,

J. Cosmol. Astropart. Phys.

,

11

,

009

Mummery

B. O.

,

McCarthy

I. G.

,

Bird

S.

,

Schaye

J.

,

2017

,

MNRAS

,

471

,

227

10.1093/mnras/stx1469

Navarro

J. F.

,

Frenk

C. S.

,

White

S. D. M.

,

1996

,

ApJ

,

462

,

563

10.1086/177173

10.1016/j.ascom.2015.09.003

Nelson

D.

et al. .,

2015

,

Astron. Comput.

,

13

,

12

10.1088/0004-637X/806/2/186

Osato

K.

,

Shirasaki

M.

,

Yoshida

N.

,

2015

,

ApJ

,

806

,

186

10.1046/j.1365-8711.2000.03779.x

Peacock

J. A.

,

Smith

R. E.

,

2000

,

MNRAS

,

318

,

1144

Perlmutter

S.

et al. .,

1999

,

ApJ

,

517

,

565

10.1086/307221

Pillepich

A.

et al. .,

2018

,

MNRAS

,

473

,

4077

10.1093/mnras/stx2656

10.1051/0004-6361/201321591

Planck Collaboration XVI

,

2014

,

A&A

,

571

,

A16

10.1051/0004-6361/201525830

Planck Collaboration XIII

,

2016

,

A&A

,

594

,

A13

10.1051/0004-6361:20053116

Rasera

Y.

,

Teyssier

R.

,

2006

,

A&A

,

445

,

1

Riess

A. G.

et al. .,

1998

,

AJ

,

116

,

1009

10.1086/300499

Rudd

D. H.

,

Zentner

A. R.

,

Kravtsov

A. V.

,

2008

,

ApJ

,

672

,

19

10.1086/523836

10.1088/0004-637X/701/2/945

Sato

M.

,

Hamana

T.

,

Takahashi

R.

,

Takada

M.

,

Yoshida

N.

,

Matsubara

T.

,

Sugiyama

N.

,

2009

,

ApJ

,

701

,

945

10.1111/j.1365-2966.2009.16029.x

Schaye

J.

et al. .,

2010

,

MNRAS

,

402

,

1536

Schaye

J.

et al. .,

2015

,

MNRAS

,

446

,

521

10.1093/mnras/stu2058

Schneider

A.

,

Teyssier

R.

,

2015

,

J. Cosmol. Astropart. Phys.

,

12

,

049

10.1088/1475-7516/2019/03/020

Schneider

A.

,

Teyssier

R.

,

Stadel

J.

,

Chisari

N. E.

,

Le Brun

A. M. C.

,

Amara

A.

,

Refregier

A.

,

2019

,

J. Cosmol. Astropart. Phys.

,

2019

,

020

10.1046/j.1365-8711.2000.03715.x

Seljak

U.

,

2000

,

MNRAS

,

318

,

203

10.1111/j.1365-2966.2011.19385.x

Semboloni

E.

,

Hoekstra

H.

,

Schaye

J.

,

van Daalen

M. P.

,

McCarthy

I. G.

,

2011

,

MNRAS

,

417

,

2020

Semboloni

E.

,

Hoekstra

H.

,

Schaye

J.

,

2013

,

MNRAS

,

434

,

148

10.1093/mnras/stt1013

10.1103/RevModPhys.61.185

Shandarin

S. F.

,

Zeldovich

Y. B.

,

1989

,

Rev. Mod. Phys.

,

61

,

185

10.1103/PhysRevLett.107.271301

Simpson

F.

,

James

J. B.

,

Heavens

A. F.

,

Heymans

C.

,

2011

,

Phys. Rev. Lett.

,

107

,

271301

10.1103/PhysRevD.88.083510

PubMed

Simpson

F.

,

Heavens

A. F.

,

Heymans

C.

,

2013

,

Phys. Rev. D

,

88

,

083510

10.1046/j.1365-8711.2003.06503.x

Smith

R. E.

et al. .,

2003

,

MNRAS

,

341

,

1311

Spergel

D. N.

et al. .,

2007

,

ApJS

,

170

,

377

10.1086/513700

Springel

V.

et al. .,

2018

,

MNRAS

,

475

,

676

10.1093/mnras/stx3304

10.1111/j.1365-2966.2005.09655.x

Springel

V.

,

2005

,

MNRAS

,

364

,

1105

10.1111/j.1365-2966.2009.15715.x

Springel

V.

,

2010

,

MNRAS

,

401

,

791

10.1046/j.1365-8711.2003.06206.x

Springel

V.

,

Hernquist

L.

,

2003

,

MNRAS

,

339

,

289

10.1111/j.1365-2966.2005.09238.x

Springel

V.

,

Di Matteo

T.

,

Hernquist

L.

,

2005

,

MNRAS

,

361

,

776

10.1111/j.1365-2966.2010.17187.x

Stinson

G. S.

,

Bailin

J.

,

Couchman

H.

,

Wadsley

J.

,

Shen

S.

,

Nickerson

S.

,

Brook

C.

,

Quinn

T.

,

2010

,

MNRAS

,

408

,

812

Sutherland

R. S.

,

Dopita

M. A.

,

1993

,

ApJS

,

88

,

253

10.1086/191823

10.1088/1367-2630/9/12/446

Takada

M.

,

Bridle

S.

,

2007

,

New J. Phys.

,

9

,

446

10.1103/PhysRevD.87.123504

Takada

M.

,

Hu

W.

,

2013

,

Phys. Rev. D

,

87

,

123504

10.1111/j.1365-2966.2009.14504.x

Takada

M.

,

Jain

B.

,

2009

,

MNRAS

,

395

,

2065

10.1088/0004-637X/761/2/152

Takahashi

R.

,

Sato

M.

,

Nishimichi

T.

,

Taruya

A.

,

Oguri

M.

,

2012

,

ApJ

,

761

,

152

10.1103/PhysRevD.98.083514

Taylor

P. L.

,

Bernardeau

F.

,

Kitching

T. D.

,

2018

,

Phys. Rev. D

,

98

,

083514

Tenneti

A.

,

Mandelbaum

R.

,

Di Matteo

T.

,

Kiessling

A.

,

Khandai

N.

,

2015

,

MNRAS

,

453

,

469

10.1093/mnras/stv1625

The EAGLE team

,

2017

, preprint (arXiv:1706.09899)

10.5281/zenodo.1059119

10.1046/j.1365-8711.2003.06631.x

Tornatore

L.

,

Borgani

S.

,

Springel

V.

,

Matteucci

F.

,

Menci

N.

,

Murante

G.

,

2003

,

MNRAS

,

342

,

1025

10.1103/PhysRevD.98.043528

Troxel

M. A.

et al. .,

2018

,

Phys. Rev. D

,

98

,

043528

10.1111/j.1365-2966.2011.18981.x

van Daalen

M. P.

,

Schaye

J.

,

Booth

C. M.

,

Dalla Vecchia

C.

,

2011

,

MNRAS

,

415

,

3649

van Daalen

M. P.

,

McCarthy

I. G.

,

Schaye

J.

,

2019

, preprint (arXiv:1906.00968)

Velliscig

M.

,

van Daalen

M. P.

,

Schaye

J.

,

McCarthy

I. G.

,

Cacciato

M.

,

Le Brun

A. M. C.

,

Dalla Vecchia

C.

,

2014

,

MNRAS

,

442

,

2641

10.1093/mnras/stu1044

Vogelsberger

M.

,

Genel

S.

,

Sijacki

D.

,

Torrey

P.

,

Springel

V.

,

Hernquist

L.

,

2013

,

MNRAS

,

436

,

3031

10.1093/mnras/stt1789

Vogelsberger

M.

et al. .,

2014

,

MNRAS

,

444

,

1518

10.1093/mnras/stu1536

10.1016/j.physrep.2013.05.001

Weinberg

D. H.

,

Mortonson

M. J.

,

Eisenstein

D. J.

,

Hirata

C.

,

Riess

A. G.

,

Rozo

E.

,

2013

,

Phys. Rep.

,

530

,

87

Weinberger

R.

et al. .,

2018

,

MNRAS

,

479

,

4056

10.1093/mnras/sty1733

10.1103/PhysRevD.77.043507

Zentner

A. R.

,

Rudd

D. H.

,

Hu

W.

,

2008

,

Phys. Rev. D

,

77

,

043507

10.1103/PhysRevD.87.043509

Zentner

A. R.

,

Semboloni

E.

,

Dodelson

S.

,

Eifler

T.

,

Krause

E.

,

Hearin

A. P.

,

2013

,

Phys. Rev. D

,

87

,

043509

APPENDIX A: POWER SPECTRUM COMPUTATION

Here we describe the practical implementation of our power spectrum computation from the simulation snapshots.

A1 The power spectrum estimator

The matter density field in the Universe can be quantified via the overdensity δ(x), defined as |$\delta ({\bf \boldsymbol{ x}}) = \frac{\rho ({\bf \boldsymbol{ x}}) - \bar{\rho }}{\bar{\rho }}$|⁠, where ρ(x) specifies the density function at position x and |$\bar{\rho }$| is the global mean density. We first estimate δ(x) on a uniform grid of 1024 cells across a side of the simulation box with the particle deposition step carried out via nearest grid point assignment. Our estimator is

\begin{eqnarray*} \hat{\delta }({\bf \boldsymbol{ x}}) = \delta ({\bf \boldsymbol{ x}}) W({\bf \boldsymbol{ x}}) = \int _{\rm \mathit{ V}_{box}} \mathrm{ d} {\bf \boldsymbol{ x}^{\prime }} \delta ({\bf \boldsymbol{ x}^{\prime }}) W({\bf \boldsymbol{ x}-\boldsymbol{ x}^{\prime }}). \end{eqnarray*}

(A1)

Here the mass assignment function can be described by W(x) = ∏_iW(x_i), with

\begin{eqnarray*} W(x_i)=\left\lbrace \begin{array}{@{}l@{\quad }l@{}}1/ \Delta L & \mbox{for}\,\,\,\,|x_i| \lt \Delta L /2\\ 0 & \mbox{else} \end{array}\right., \end{eqnarray*}

(A2)

where the grid cell side length ΔL = L_box/1024, and the index i is the axis label of the Cartesian coordinate system. We then perform a discrete Fourier transform on |$\hat{\delta }({\bf \boldsymbol{ x}})$| to derive its Fourier transformation pair |$\hat{\delta }({\bf \boldsymbol{ k}})$|⁠:

\begin{eqnarray*} \hat{\delta }({\bf \boldsymbol{ k}}) = \delta ({\bf \boldsymbol{ k}}) W({\bf \boldsymbol{ k}}). \end{eqnarray*}

(A3)

After the Fourier transform, the convolution operation in equation (A1) becomes a simple product, with W(k) being the Fourier space mass assignment window function:

\begin{eqnarray*} W({\bf \boldsymbol{ k}}) = \prod _{i} W(k_i)= \prod _{i} \frac{\sin {(\frac{k_i \Delta L }{2})}}{ (\frac{k_i \Delta L }{2})}, \end{eqnarray*}

(A4)

where k_i (i = x, y, z) is the i-th component of |$\bf \boldsymbol{ k}$|⁠. The Fourier transformation pair of δ(x) then can be computed by

\begin{eqnarray*} \delta ({\bf \boldsymbol{ k}}) = \frac{\hat{\delta }({\bf \boldsymbol{ k}})}{W({\bf \boldsymbol{ k}})}. \end{eqnarray*}

(A5)

We choose the convention of Fourier transform as δ(k) = ∫dx δ(x)e^{−ik · x}. Under this convention, the power spectrum can be estimated by averaging over all modes |$\bf \boldsymbol{ k}$| with a length of k:

\begin{eqnarray*} \hat{P}_{\delta }(k) = \frac{1}{\rm \mathit{ V}_{box}} \left\langle |\delta ({\bf \boldsymbol{ k}})|^2 \right\rangle _{k = |\bf \boldsymbol{ k}|}, \end{eqnarray*}

(A6)

with V_box being the box size of simulation.

The raw estimation of |$\hat{P}_{\delta }(k)$| above is known to be affected by discreteness effects, which contributes into the power through a constant amplitude called shot noise

\begin{eqnarray*} P_{\rm shot} = {\rm \mathit{ V}_{box}} / N_{\rm eff}. \end{eqnarray*}

(A7)

Here N_eff is the effective number of particles, which accounts for their difference in mass:

\begin{eqnarray*} N_{\rm eff} = \frac{ (\Sigma _{i}^{N} m_i)^2 }{ \Sigma _{i}^{N} m_i^{2} }, \end{eqnarray*}

(A8)

where m_i is the individual particle mass, and N is the total number of particles. For DMO simulations where all tracer particles have equal mass, N_eff = N. The final power spectrum P_δ(k) used throughout this work is derived after subtraction of the shot-noise term:

\begin{eqnarray*} P_{\delta }(k) = \hat{P}_{\delta }(k) - P_{\rm shot}. \end{eqnarray*}

(A9)

A2 The accuracy of power spectrum

On large scales, due to the limited size of simulation boxes, the statistical uncertainties of the estimated matter power spectra are dominated by cosmic variance. The contribution of cosmic variance on P_δ(k) can be estimated by (Takada & Hu 2013)

\begin{eqnarray*} \sigma ^2(k) = 2 \frac{P_{\delta }^2(k)}{N_{\rm modes}(k)}. \end{eqnarray*}

(A10)

N_modes is total number of modes available in the bin range [k − Δk/2, k + Δk/2]:

\begin{eqnarray*} N_{\rm modes}(k) &=& \frac{1}{(2 \pi)^3} V_{\rm box} \int _{k-\Delta k/2}^{k+\Delta k/2} 4 \pi k^2\ {\rm d}k \nonumber \\ &\approx& \frac{1}{2 \pi ^2} V_{\rm box} k^2 \Delta k. \end{eqnarray*}

(A11)

The small-scale error of power spectra is mostly caused by the simulation resolution. For DMO simulations, Heitmann et al. (2010) pointed out that the P_δ(k) values of different resolutions are within 1 per cent agreement for k < k_Ny/2, where the Nyquist wavenumber is set by the inter-particle separation on the initial grid:

\begin{eqnarray*} k_{\rm Ny} = \frac{\pi N_{\rm p}}{\rm L_{box}}, \end{eqnarray*}

(A12)

with N_p being the cube-root of the total number of particles used in simulations. For smaller scales at k > k_Ny/2, we expect a suppression of power for scales around k ∼ k_Ny followed by a steep rise of power at even larger k (see fig. 8 of Heitmann et al. 2010 and fig. A3 of van Daalen et al. 2011). According to Heitmann et al. (2010), the suppression of power is due to discreteness effects in sampling small-scale fluctuations. When fewer low-mass haloes are resolved, P_δ(k) is suppressed at small scales. The steep rise of power is believed to be caused by incorrect shot-noise subtraction. Shot noise should be scale-dependent at small scales rather than a simple constant as in equation (A7).

For hydrodynamical simulations, the convergence properties are more difficult to systematically quantify due to the interplay between resolution effects and galaxy formation physics. For example, as the resolution increases, more lower mass haloes are resolved, leading to an increased power of SN feedback. Subgrid parameters regulating SN feedback then must be modified to account for this effect in order to match the observables like galaxy stellar mass function. Typically subgrid prescriptions are designed to reach some level of convergence with the variation of resolution. However, the functionality of such self-regulation is rather limited. Eagle is currently the only hydro simulation with its subgrid model parameters re-calibrated to match observations when the resolution is changed (see fig. 7 of Schaye et al. 2015). In the left-hand panel of fig. A2 in van Daalen et al. (2011), they showed a convergence comparison between the power spectra of OWLS-REF hydro simulations with k_Ny of 16 (the current OWLS resolution used in this work) and 32, respectively. The two baryonic power spectra agree to within |$\sim 10{{\ \rm per\ cent}}$| out to k ≈ 40 h Mpc⁻¹. We note that this statement only applies to OWLS-REF. There is no general rule on the behaviour of convergence for hydro simulations, given that the galaxy observations are not yet converged, and that the subgrid prescriptions are all different.

APPENDIX B: POWER SPECTRUM RATIO

The power spectrum ratio between hydrodynamical and DMO simulations is an important quantity in this work. We rely on it in equation (1) to derive mock observables at different cosmology, as well as to build difference matrices to perform PCA. Here we discuss the validity of our estimates of this ratio over the range of scales used throughout this work, and describe how we perform extrapolation to smaller scales than those that are well-described in the simulation.

B1 Discussion on the convergence of the power spectrum ratio

The ratios of matter power spectra that we use in this work are accurate to k ≲ 30 h Mpc⁻¹ for Eagle/MB2/Illustris/Horizon-AGN and of k ≲ 10 h Mpc⁻¹ for OWLS. Below we will justify this claim.

We have discussed the statistical uncertainty of P_δ(k) due to cosmic variance in Section A2. Based on the first-order error propagation, |${\rm Var}[\frac{X}{Y}] = \frac{\overline{X}^2}{\overline{Y}^2} \left(\frac{\sigma ^2_{X}}{\overline{X}^2} + \frac{\sigma ^2_Y}{\overline{Y}^2} - 2 \frac{{\rm Cov}[X,Y]}{\overline{X}\ \overline{Y}}\right)$|⁠, the cosmic variance contribution to the uncertainty in the power spectrum ratio is

\begin{eqnarray*} {\rm Var}\left[ \frac{P_{\rm \delta , hydro}(k)}{P_{\rm \delta , DMO}(k)}\right] &=& \frac{1}{P_{\rm \delta , DMO}^2(k)} \sigma ^2_{\rm hydro}(k) + \frac{P_{\rm \delta , hydro}^2(k)}{P_{\rm \delta , DMO}^4(k)} \sigma ^2_{\rm DMO}(k)\nonumber \\ &&- \,2 \frac{P_{\rm \delta , hydro}(k)}{P_{\rm \delta , DMO}^3(k)} {\rm Cov}\left[P_{\rm \delta , hydro}(k),P_{\rm \delta , DMO}(k)\right],\nonumber \\ \end{eqnarray*}

(B1)

where the variance of the power spectrum |$\sigma ^2_{\rm hydro/DMO}(k)$| is expressed in equation (A10).

The hydrodynamical and DMO runs are set at exactly the same initial conditions, and baryonic effects are negligible on large scales. When P_δ,hydro(k) ≈ P_δ,DMO(k) and Cov[P_δ,hydro(k), P_δ,DMO(k)] ≈ 1, as is the case at small k, we expect the variance of the power spectrum ratio in equation (B1) to approach zero.

On small scales, as discussed in Section A2, P_δ(k) would achieve |$1{{\ \rm per\ cent}}$| convergence out to k ≈ k_Ny/2. Hence, we expect the uncertainty in the power spectrum ratio due to limited simulation resolution to be |$\lt 2{{\ \rm per\ cent}}$| to k ≈ k_Ny/2, where k_Ny/2 is ∼30 h Mpc⁻¹ for Eagle/MB2/Illustris/Horizon-AGN and ∼10 h Mpc⁻¹ for OWLS.

For the cosmic variance contribution on small scales, we can derive an upper limit by setting Cov[P_δ,hydro(k), P_δ,DMO(k)] to zero in equation (B1), and with the |$\sigma _{\rm hydro/DMO}^2(k)$| estimated using equations (A10) and (A11):

\begin{eqnarray*} {\rm Var}\left[ \frac{P_{\rm \delta , hydro}(k)}{P_{\rm \delta , DMO}(k)}\right]_{\rm upper} \rightarrow \frac{8\pi ^2 }{V_{\rm box} k^2 \Delta k} \frac{P_{\rm \delta , hydro}^2(k)}{P_{\rm \delta , DMO}^2(k)}. \end{eqnarray*}

(B2)

For k ≈ 10 h Mpc⁻¹, |$L_{\rm box} = 100 \ h^{-1}\, \mathrm{Mpc}$|⁠, Δk = 0.1, and |$\frac{P_{\rm \delta , hydro}}{P_{\rm \delta , DMO}} \approx 0.9$|⁠, the estimated variance of the power spectrum ratio is ∼0.0005. Thus the 1σ uncertainty in the power spectrum ratio due to cosmic variance is expected be |$\lesssim 0.3{{\ \rm per\ cent}}$|⁠.¹²

If we naively derive the power spectrum ratio by using the raw data points of |$P_{\delta , \rm DMO}(k)$| and |$P_{\delta , \rm hydro}(k)$| at k > k_Ny/2, the derived ratio will be overestimated due to the underestimation of the denominator of |$P_{\delta , \rm DMO}(k)$| at scales of several times k_Ny, and underestimated towards even higher k due to the overestimation of |$P_{\delta , \rm DMO}(k)$| (see Section A2). We will introduce our extrapolation scheme below to avoid the biases.

B2 Power spectrum ratio extrapolation scheme

For scales below k < 0.1 h Mpc⁻¹, we simply let the ratio curve asymptotically approach one, which is a justifiable assumption since we know that baryons hardly modify the matter power spectrum on large scales. For small scales, as shown in Fig. 1, the power spectrum ratio between hydrodynamical and DMO simulations tends to increase after k ≳ 20 h Mpc⁻¹. This increase is caused by cooling effects in hydrodynamical simulation. In order to capture this physical effect, we make use of data points in k ∈ [10, 30] h Mpc⁻¹, perform a smooth quadric spline fitting in |$\log (k) - \log (P_{\delta \rm , bary }(k) / P_{\delta \rm , DMO }(k))$| space, and extrapolating the fitted trend out to k > 30 h Mpc⁻¹. Fig. 1 shows the extrapolation curves for all baryonic scenarios.

The above extrapolation scheme holds for Eagle/MB2/IIIustris/Horizon-AGN simulations, where we have reliable power spectrum ratios in the range k ∈ [10, 30] h Mpc⁻¹. For the OWLS simulation set, as discussed above, the ratio data is only well-determined to k < 10 h Mpc⁻¹. The black line in Fig. B1 indicates the naively derived power spectrum ratio from the raw data of OWLS-AGN and OWLS-DMO. In the case of OWLS-AGN, we do need data points slightly beyond k of 10 h Mpc⁻¹ to capture the transition from suppression to enhancement of power. We therefore still make use of the raw data points in the range of k ∈ [10, 30] h Mpc⁻¹, where the uncertainties may |$\gt 2{{\ \rm per\ cent}}$| but not that worse, to perform extrapolation spline line fitting, and the resulting curve is indicated in the red-dashed curve of Fig. B1. The raw data curve is slightly higher than the extrapolating curve when k is large. This is resulting from the underestimation of the DMO power spectrum as discussed in Section A2 and Section B. The extrapolation based on fitting data at k ∈ [10, 30] h Mpc⁻¹ may exhibit uncertainties in the final extrapolation slope at high k. Therefore, we try to explore such uncertainties by setting slightly different extrapolating parameters that are shown as upper (yellow line) and lower (brown dot–dashed) bounds in Fig. B1. Finally, we also check the simplest constant extrapolation scheme as shown in the dark blue line. We will explore the effects of different extrapolation schemes on the resulting tomographic shear power spectra later.

Power spectrum ratio between OWLS-AGN and OWLS-DMO at z = 0, assuming different extrapolation schemes above k > 10 h Mpc−1. The black line shows the raw data calculated by simply taking ratio from the raw OWLS power spectra from van Daalen et al. (2011). The red dashed line indicates the extrapolation scheme by extending a quadric spline fitted curve using the raw data points in k ∈ [10, 30] h Mpc−1. The yellow and brown dot–dashed lines indicate the two possible upper and lower bounds one may derive, if there is some uncertainty in the raw data points k ∈ [10, 30] h Mpc−1. The dark blue line plots a pure flat extrapolation. The light blue dash–dot–dotted line indicates the case of extrapolation when using data points in k ∈ [3, 10] h Mpc−1, which are believed to be well-measured.

Figure B1.

Power spectrum ratio between OWLS-AGN and OWLS-DMO at z = 0, assuming different extrapolation schemes above k > 10 h Mpc⁻¹. The black line shows the raw data calculated by simply taking ratio from the raw OWLS power spectra from van Daalen et al. (2011). The red dashed line indicates the extrapolation scheme by extending a quadric spline fitted curve using the raw data points in k ∈ [10, 30] h Mpc⁻¹. The yellow and brown dot–dashed lines indicate the two possible upper and lower bounds one may derive, if there is some uncertainty in the raw data points k ∈ [10, 30] h Mpc⁻¹. The dark blue line plots a pure flat extrapolation. The light blue dash–dot–dotted line indicates the case of extrapolation when using data points in k ∈ [3, 10] h Mpc⁻¹, which are believed to be well-measured.

If only using data points around k < 10 h Mpc⁻¹ to perform smooth extrapolation, one would fail to capture the cooling effect that typically exists in hydrodynamical simulations at high k. The light blue dash–dot–dotted line of Fig. B1 indicates such an extrapolation based on the data points for k ∈ [3, 10] h Mpc⁻¹. This sets a very important requirement for our method. If we really want to use the PCA framework to achieve better cosmological parameter constraints by including more small scale information, the simulations that are used to build the PC basis must also have high enough resolution to construct a reasonable extrapolation down to the scale that goes into cosmological analysis. This is the reason why we avoid using the Rudd, Zentner & Kravtsov (2008) and Gnedin, Kravtsov & Rudd (2011) simulations to build our PC basis as in the previous work of E15. The half-Nyquist wavenumbers of these two simulations are too low to capture the up-turn in the power spectrum ratio, given the angular scales of ℓ ≈ 2000 used in this work.

We now justify how the choice of angular scale cut at ℓ ≈ 2000 is made. In Fig. B2 we present the computed tomographic shear power spectra in our lowest redshift bin, for various extrapolation schemes on the power spectra ratio shown in Fig. B1. The vertical grey line indicates the angular scale cut of ℓ = 2060 we have adopted. One can see that the C^ij(ℓ) ratio only differs mildly at this scale. Therefore, although our current extrapolation scheme may lead to a considerable error in the P_δ(k) ratio, after the integration process, such error propagation in C^ij(ℓ) ratio is estimated to be within 10 per cent when making an ℓ cut at ≈2000. We make a conservative choice of cutting in ℓ such that the final result is not too sensitive to our extrapolation scheme.

Figure B2.

The ratio of tomographic shear power spectrum between OWLS-AGN and OWLS-DMO simulations for our lowest tomographic bin. Curves in different colours represent different P_δ(k) ratio extrapolation schemes as coloured in Fig. B1. The vertical dashed line indicates the angular scale of ℓ = 2060, where the cut is made for all of tomographic bins in our data vector. This cut is chosen such that the derived C^ij(ℓ) curve would not be too sensitive on different P_δ(k) ratio extrapolation schemes.

APPENDIX C: COMPUTING BARYON-CONTAMINATED DATA VECTORS AT VARYING COSMOLOGY

In this appendix, we describe in detail how we compute baryon-contaminated data vectors as a function of cosmology. This procedure is needed to build the difference matrix (equation 8), weighted difference matrix (equation 20), or ratio matrix (equation 25) when doing PCA.

To produce a baryon-contaminated vector |$\boldsymbol B_x$| at cosmology |$\boldsymbol p_{\rm co}$|⁠, in principle we should rely on equation (1) to generate the matter power spectrum for that cosmology, and integrate it to derive the tomographic shear data vector

\begin{eqnarray*} && {C^{ij}_{{\rm hydro, }x} (\ell |\ \boldsymbol p_{\rm co})} \nonumber \\ && {\quad = \frac{9H_0^4 \Omega _\mathrm{ m}^2}{4c^4} \int _0^{\chi _{\rm h}} {\rm d} \chi \frac{g^{i}(\chi) g^{j}(\chi)}{a^2(\chi)} P_{\delta }^{{\rm hydro, }x} \left(\frac{\ell }{f_K(\chi)},\chi |\ \boldsymbol p_{\rm co}\right).}\nonumber \\ \end{eqnarray*}

(C1)

However, to increase the computational speed, we approximate this step by

\begin{eqnarray*} \boldsymbol B_x(\boldsymbol p_{\rm co}) = C^{ij}_{{\rm hydro,} x}(\boldsymbol p_{\rm co}) = \frac{C^{ij}_{{\rm hydro,} x}(\boldsymbol p_{\rm co, fid})}{C^{ij}_{\rm theory}(\boldsymbol p_{\rm co, fid})} C^{ij}_{\rm theory}(\boldsymbol p_{\rm co}), \end{eqnarray*}

(C2)

where |$C^{ij}_{{\rm hydro,} x}(\boldsymbol p_{\rm co, fid})$| is pre-computed using equation (C1) setting at |$\boldsymbol p_{\rm co, fid}$| and stored. |$C^{ij}_{\rm theory}(\boldsymbol p_{\rm co})$| is our model vector |$\boldsymbol M(\boldsymbol p_{\rm co})$| generated from halofit. Approximating equation (C1) by equation (C2) avoids the need to integrate nine times when constructing the nine columns of |$\boldsymbol \Delta (\boldsymbol p_{\rm co})$|/|$\boldsymbol \Delta _{\rm chy}(\boldsymbol p_{\rm co})$|/|$\boldsymbol R(\boldsymbol p_{\rm co})$| at each MCMC step. In using equation (C2), we basically assume the quantity |${}[\frac{C^{00}_{\rm hydro}(\boldsymbol p_{\rm co})}{C^{00}_{\rm DMO}(\boldsymbol p_{\rm co})}] / [\frac{C^{00}_{\rm hydro}(\boldsymbol p_{\rm co, fid})}{C^{00}_{\rm DMO}(\boldsymbol p_{\rm co, fid})}] \approx 1$| at various |$\boldsymbol p_{\rm co}$|⁠. To check the validity, we compute all the elements in this quantity using equation (C1) and plot it in Fig. C1, with |$\boldsymbol p_{\rm co}$| set at different values of Ω_m or σ₈, while keeping the rest of the cosmological parameters the same as |$\boldsymbol p_{\rm co, fid}$|⁠. As shown, the C⁰⁰ ratio curves are within 0.25 per cent of 1 for various |$\boldsymbol p_{\rm co}$| demonstrated here.

$The ratio of tomographic power spectra ratio at bin C00 between hydrodynamical and DMO simulations evaluated at various $\boldsymbol p_{\rm co}$ versus $\boldsymbol p_{\rm co, fid}$, i.e. ${}[\frac{C^{00}_{\rm hydro}(\boldsymbol p_{\rm co})}{C^{00}_{\rm DMO}(\boldsymbol p_{\rm co})}] / [\frac{C^{00}_{\rm hydro}(\boldsymbol p_{\rm co, fid})}{C^{00}_{\rm DMO}(\boldsymbol p_{\rm co, fid})}]$. Here different curves indicate changes of Ωm or σ8 to values shown in the legend, while keeping the remaining cosmological parameters the same as $\boldsymbol p_{\rm co, fid}$. The fact that all ratio curves are ≈1 to within 0.25 per cent indicates the validity of using equation (C2) as our approximation.$

Figure C1.

The ratio of tomographic power spectra ratio at bin C⁰⁰ between hydrodynamical and DMO simulations evaluated at various |$\boldsymbol p_{\rm co}$| versus |$\boldsymbol p_{\rm co, fid}$|⁠, i.e. |${}[\frac{C^{00}_{\rm hydro}(\boldsymbol p_{\rm co})}{C^{00}_{\rm DMO}(\boldsymbol p_{\rm co})}] / [\frac{C^{00}_{\rm hydro}(\boldsymbol p_{\rm co, fid})}{C^{00}_{\rm DMO}(\boldsymbol p_{\rm co, fid})}]$|⁠. Here different curves indicate changes of Ω_m or σ₈ to values shown in the legend, while keeping the remaining cosmological parameters the same as |$\boldsymbol p_{\rm co, fid}$|⁠. The fact that all ratio curves are ≈1 to within 0.25 per cent indicates the validity of using equation (C2) as our approximation.

APPENDIX D: Constraints on cosmological parameters of Ω_m, σ₈, and w_a

In this appendix, we provide constraints on cosmological parameters of Ω_m, σ₈, and w_a after applying baryon mitigation techniques using hmcode or various PCA methods. The solid markers indicate the amount of residual bias after mitigation, and the open markers indicate the 0.5σ errors on the marginalized 1D posteriors. We rely on this plot to check whether a mitigation method can successfully reduce the baryonic physics-induced bias to within 0.5σ for different cosmological parameters and baryonic scenarios. The results are briefly summarized in Tables 4 and 5. We refer readers back to discussions at Sections 5.3 and 5.4 for details.

Similar to Fig. 15, but for cosmological parameters of Ωm (first row), σ8 (second row), and wa (third row), for various baryonic scenarios categorized in each different column.

Figure D1.

Similar to Fig. 15, but for cosmological parameters of Ω_m (first row), σ₈ (second row), and w_a (third row), for various baryonic scenarios categorized in each different column.

APPENDIX E: GOODNESS OF FIT FOR BARYON MITIGATION MODELS

In this appendix, we summarize the fitting quality for various baryon mitigation methods. In Table E1, we provide the χ² values computed at the best-fittng (fiducial) cosmology for hmcode and the PCA method C when applied on each baryonic scenario for the ℓ_max ≈ 2000 likelihood simulations. Here we define our best-fitted parameters to be the median value at the marginalized 1D posterior distribution.

Table E1.

Goodness of fit for various baryon mitigation models.

	Eagle	MB2	Horizon-AGN	Illustris
No-mitigation at \|$\boldsymbol p_{\rm co, bestfit}$\| (⁠\|$\boldsymbol p_{\rm co, fid}$\|⁠)	2.68 (30.92)	0.93 (5.71)	3.85 (103.2)	107.5 (3258)
hmcode – A	2.09	0.74	2.34	4.84
hmcode – A, η₀	0.91	0.29	1.69	4.80
PCA ex1 at \|$\boldsymbol p_{\rm co, bestfit}$\| (⁠\|$\boldsymbol p_{\rm co, fid}$\|⁠)	1.39 (3.25)	0.70 (5.11)	0.75 (6.75)	11.91 (40.0)
PCA ex2	0.41 (0.39)	0.37 (0.81)	0.45 (2.72)	9.68 (34.2)
PCA ex3	0.28 (0.24)	0.28 (0.32)	0.26 (2.65)	3.08 (12.0)
PCA ex4	0.15 (0.07)	0.18 (0.30)	0.20 (1.38)	2.06 (4.89)
PCA ex5	0.13 (0.04)	0.14 (0.07)	0.16 (0.64)	1.65 (2.55)
PCA ex6	0.09 (0.03)	0.11 (0.06)	0.20 (0.59)	1.58 (2.18)
PCA ex7	0.11 (0.03)	0.12 (0.06)	0.19 (0.54)	1.50 (2.00)
PCA ex8	0.09 (0.03)	0.08 (0.06)	0.15 (0.53)	1.44 (2.00)
PCA ex9	0.07 (0.03)	0.10 (0.05)	0.15 (0.43)	1.37 (2.00)

	Eagle	MB2	Horizon-AGN	Illustris
No-mitigation at \|$\boldsymbol p_{\rm co, bestfit}$\| (⁠\|$\boldsymbol p_{\rm co, fid}$\|⁠)	2.68 (30.92)	0.93 (5.71)	3.85 (103.2)	107.5 (3258)
hmcode – A	2.09	0.74	2.34	4.84
hmcode – A, η₀	0.91	0.29	1.69	4.80
PCA ex1 at \|$\boldsymbol p_{\rm co, bestfit}$\| (⁠\|$\boldsymbol p_{\rm co, fid}$\|⁠)	1.39 (3.25)	0.70 (5.11)	0.75 (6.75)	11.91 (40.0)
PCA ex2	0.41 (0.39)	0.37 (0.81)	0.45 (2.72)	9.68 (34.2)
PCA ex3	0.28 (0.24)	0.28 (0.32)	0.26 (2.65)	3.08 (12.0)
PCA ex4	0.15 (0.07)	0.18 (0.30)	0.20 (1.38)	2.06 (4.89)
PCA ex5	0.13 (0.04)	0.14 (0.07)	0.16 (0.64)	1.65 (2.55)
PCA ex6	0.09 (0.03)	0.11 (0.06)	0.20 (0.59)	1.58 (2.18)
PCA ex7	0.11 (0.03)	0.12 (0.06)	0.19 (0.54)	1.50 (2.00)
PCA ex8	0.09 (0.03)	0.08 (0.06)	0.15 (0.53)	1.44 (2.00)
PCA ex9	0.07 (0.03)	0.10 (0.05)	0.15 (0.43)	1.37 (2.00)

Table E1.