-
PDF
- Split View
-
Views
-
Cite
Cite
Enrique Paillas, Claudia D. P. Lagos, Nelson Padilla, Patricia Tissera, John Helly, Matthieu Schaller, Baryon effects on void statistics in the EAGLE simulation, Monthly Notices of the Royal Astronomical Society, Volume 470, Issue 4, October 2017, Pages 4434–4452, https://doi.org/10.1093/mnras/stx1514
- Share Icon Share
Abstract
Cosmic voids are promising tools for cosmological tests due to their sensitivity to dark energy, modified gravity and alternative cosmological scenarios. Most previous studies in the literature of void properties use cosmological N-body simulations of dark matter (DM) particles that ignore the potential effect of baryonic physics. Using a spherical underdensity finder, we analyse voids using the mass field and subhalo tracers in the Evolution and Assembly of Galaxies and their Environment (EAGLE) simulations, which follow the evolution of galaxies in a Λ cold dark matter universe with state-of-the-art subgrid models for baryonic processes in a (100 cMpc)3 volume. We study the effect of baryons on void statistics by comparing results with DM-only simulations that use the same initial conditions as EAGLE. When identifying voids in the mass field, we find that a DM-only simulation produces 24 per cent more voids than a hydrodynamical one due to the action of galaxy feedback polluting void regions with hot gas, specially for small voids with rvoid ≤ 10 Mpc. We find that the way in which galaxy tracers are selected has a strong impact on the inferred void properties. Voids identified using galaxies selected by their stellar mass are larger and have cuspier density profiles than those identified by galaxies selected by their total mass. Overall, baryons have minimal effects on void statistics, as void properties are well captured by DM-only simulations, but it is important to account for how galaxies populate DM haloes to estimate the observational effect of different cosmological models on the statistics of voids.
1 INTRODUCTION
In the year 1998, two independent groups of astronomers reported evidence for an accelerated expansion of the Universe, using Type Ia supernovae as distance indicators (Riess et al. 1998; Perlmutter et al. 1999). These results strongly suggest the requirement of a non-zero cosmological constant Λ in Einstein's field equations. Together with the fact that as much as 85 per cent of the matter in the Universe appears to be dark (only interacting through gravity), the so-called Λ cold dark matter (ΛCDM) scenario has remained popular for being one of the simplest cosmological models that provides a reasonably good account of many key properties of the Cosmos, such as the expansion history of the Universe (Perlmutter et al. 1999), the large-scale structure in the distribution of galaxies (e.g. Cole et al. 2005; Eisenstein et al. 2005) and the existence and structure of the cosmic microwave background (O'Dwyer et al. 2004; Hinshaw et al. 2013; Planck Collaboration XVI 2014).
The ΛCDM model assumes that general relativity is the correct theory of gravity on cosmological scales. However, alternative approaches that modify the standard theory of gravity have been proposed (e.g. Dvali, Gabadadze & Porrati 2000; Maartens 2004; Carroll et al. 2005). One such example is f (R), a family of gravity theories that modify Einstein's Theory of General Relativity by replacing the Ricci scalar R in the Einstein–Hilbert action by an algebraic function of it (Carroll et al. 2005).
Since the Solar system and cosmological tests set tight constraints on the feasibility of modifications to gravity (e.g. Gu 2011; Guo 2014), f (R) models have incorporated ‘chameleon’ mechanisms that have the effect of removing deviations with respect to general relativity in regions where the gravitational potential is deep enough, such as in our Galaxy (Brax et al. 2008). Modified gravity models can produce expansion histories very similar to ΛCDM, so it becomes necessary to find alternative ways to test whether they can constitute a better match to our Universe.
A promising way to constrain modified gravity models consists in studying regions of low density, where the chameleon mechanism is effectively suppressed. Cosmic voids are the most prominent underdense regions in our Universe: these are vast volumes with very low galaxy and mass densities, surrounded by the walls and filaments of the large-scale cosmic web. Previous theoretical studies have shown that voids might occupy more than 50 per cent of the total volume of the Universe (El-Ad & Piran 1997; Plionis & Basilakos 2002; Cautun et al. 2014), and their low matter density makes them useful tools for cosmological tests due to their sensitivity to dark energy (Li 2011; Bos et al. 2012; Pisani et al. 2015; Demchenko et al. 2016), modified gravity (Li, Zhao & Koyama 2012; Clampitt, Cai & Li 2013; Cai, Padilla & Li 2015; Zivick et al. 2015; Achitouv 2016) and alternative cosmological scenarios (Barreira et al. 2015; Massara et al. 2015; Banerjee & Dalal 2016).
Most studies of voids in the literature are based on cosmological N-body simulations that follow the gravitational interaction of dark matter (DM). The baryonic component of the Universe has been so far ignored in these works, and although DM accounts for a large fraction of the mass of the Universe, baryons play an important role in a cosmological context.
Sawala et al. (2015) showed that because of cosmic reionization, most haloes below |$\rm {3 \times 10^9\ \mathrm{M}_{{\odot }}}$| do not contain observable galaxies. This breaks the assumptions of the commonly used abundance matching method, which relies on the premise that structure formation can be represented by DM-only simulations and that every halo hosts a galaxy. Velliscig et al. (2014) found that gas expulsion and the associated DM expansion induced by supernova-driven winds are important for haloes with masses |$\rm {M_{200} \le 10^{13}\ \mathrm{M}_{{\odot }}}$|, lowering their masses by up to 20 per cent relative to a DM-only model. Schaller et al. (2015a) found that the reduction in mass can be as large as 30 per cent of haloes with |$\rm {M_{200} \le 10^{11}\ \mathrm{M}_{{\odot }}}$|. They also found that baryons can affect the inner density profile of DM haloes, leading to cuspier profiles in the centre due to the presence of stars. Feedback mechanisms triggered by baryons can expell gas from galaxies, even polluting voids with processed material, as suggested by Haider et al. (2016). One would intuitively expect that if feedback processes are strong enough, voids would be more polluted with baryons and hence their properties might differ from their DM-only counterparts. It might also be the case that cooling alters the spatial distribution of gas around voids. This modification of the mass distribution could have consequences for the weak-lensing signal measured around voids.
In this work, we study the effects of baryonic physics on the properties of cosmic voids. We search for voids using the mass field and subhalo tracers in the Evolution and Assembly of Galaxies and their Environment (EAGLE) simulations (Crain et al. 2015; Schaye et al. 2015), a suite of cosmological, hydrodynamical N-body simulations that follow the evolution of baryonic and DM particles in a ΛCDM universe. EAGLE consists of multiple simulations that were run with different mass resolutions, volumes and physical models (the largest simulation consisting on a 100 Mpc on a side comoving box), being one of the first projects that allows the study of baryonic processes in a large simulated volume. The hydrodynamical simulations in the suite have counterparts that were run with the same initial conditions but only following the evolution of DM. By comparing these different models we can study the role of baryons in the context of the cosmic web.
EAGLE presents a unique opportunity to explore the effect of baryons on void regions for various reasons. The simulations implement state-of-the-art subgrid models that follow star formation and feedback processes from stars and active galactic nuclei (AGN). These subgrid models, together with a high-mass resolution, allow us to trace the distribution of gas and DM at different scales in detail, which is a key point in our study given the discussion above.
Another advantage of the EAGLE simulations is that they reproduce the present day stellar mass function, galaxy sizes and many other properties of galaxies and the intergalactic medium with very good precision (e.g. Furlong et al. 2015; Schaye et al. 2015; Bahé et al. 2016; Segers et al. 2016; Trayford et al. 2016). This good agreement with observations becomes important when trying to extrapolate results inferred from voids identified in these simulations to the real Universe. With EAGLE, we can also mimic some of the selections of galaxies in the observations that are used to find voids. In fact, we will show that voids found using galaxies selected by their stellar mass are different than those found if galaxies are selected by their total mass.
We identify voids using a modified version of the algorithm presented in Padilla, Ceccarelli & Lambas (2005, from hereon mP05), which searches for spherical underdense regions in simulations, either using the mass field or halo tracers. Many void finders have been presented in the literature, which use different tracers to define voids (e.g. Arbabi-Bidgoli & Müller 2002; Hoyle & Vogeley 2002; Plionis & Basilakos 2002; Colberg et al. 2005; Brunino et al. 2007; Platen, van de Weygaert & Jones 2007; Neyrinck 2008). Watershed-based methods, such as the Watershed Void Finder (Platen et al. 2007) and ZOBOV (Neyrinck 2008) are suitable when studying the spatial structure of the cosmic web in detail, as they are parameter free and they do not make assumptions about void shapes or topology. Voids identified with these finders usually exhibit a smooth transition from the underdense region of a void to the average density. Other algorithms, such as the one we employ in this work, search for spherical regions that satisfy some density criteria. These void regions are usually characterized by a fast transition to the average density, and as a consequence, they are suitable for weak-lensing studies (e.g. Sánchez et al. 2017). In particular, mP05 has been shown to effectively capture differences between ΛCDM and f (R) models using void statistics in Cai et al. (2015). A comprehensive comparison of different void finding methods can be found in Colberg et al. (2008).
It is important to mention that here we adopt a cosmological model (ΛCDM) and study the effects of baryons on that particular model. It is difficult to predict what the impact of baryon effects would be if we adopted a different cosmology, but future works could expand on this matter.
The paper is organized as follows: In Section 2, we describe the EAGLE simulations, the void finding algorithm and our methodology to identify voids in the simulations. In Section 3, we visually explore the effects on baryons on the large-scale distribution of matter in the simulations, emphasizing on the effects of galaxy feedback on void regions. We present the main results of void statistics in Sections 4, 5 and 6. Conclusions and discussions about the work are presented in Section 7. In the appendix, we show the evolution of some of the results found in previous sections at higher redshift, and we explain the methods employed for the calibration of the void finder and the error estimation in detail.
2 SIMULATIONS AND METHODS
2.1 Simulation overview
2.1.1 The EAGLE project
EAGLE (Schaye et al. 2015) is a project of the Virgo Consortium,1 consisting in a suite of cosmological, hydrodynamic N-body simulations of a flat ΛCDM universe, with parameters inferred from the Planck data (Planck Collaboration XVI 2014); ΩΛ = 0.693, Ωm = 0.307, Ωb = 0.048, σ8 = 0.8288, ns = 0.9611 and H0 = 67.77 km s− 1 Mpc− 1. The main Eagle simulation, referred to as L0100N1504, consists of a (100 cMpc)3 volume, initially containing 15043 gas particles with an initial mass of |$\rm {1.81\times 10^6\ \mathrm{M}_{{\odot }}}$| and the same number of DM particles, with a resolution of |$\rm {9.70\times 10^6\ \mathrm{M}_{{\odot }}}$|.
Initial conditions were generated at z = 127, using second-order Lagrangian perturbation theory (Jenkins 2010). They were evolved using a parallel N-body smooth particle hydrodynamics (SPH) code, an extensively modified version of gadget-3, a more computationally efficient version of gadget-2, described in Springel (2005). The SPH implementation in EAGLE is referred to as Anarchy (Dalla Vecchia, in preparation. See also Schaller et al. 2015b), and it is based on the general formalism described by Hopkins (2013), with improvements to the kernel functions (Dehnen & Aly 2012) and viscosity terms (Cullen & Dehnen 2010). Anarchy alleviates the problems associated with standard SPH in modelling contact discontinuities and fluid instabilities.
Subgrid schemes are applied to model astrophysical phenomena below the scales of the resolution of the simulation. These include models for radiative cooling and photoheating, star formation, stellar mass loss and metal enrichment, stellar feedback from massive stars, black hole growth and feedback from AGN. See Crain et al. (2015) for a description of how the free parameters associated with feedback are calibrated. Each of these subgrid models are summarized as follows:
Cooling and photoheating is implemented following Wiersma, Schaye & Smith (2009a). Under the assumption of ionization equilibrium, exposure to the cosmic microwave background and the ultraviolet and X-ray backgrounds from galaxies and quasars (Haardt & Madau 2001), the abundance of 11 elements that dominate the cooling rates is tracked, tabulated using cloudy (Ferland et al. 1998).
Gas particles are stochastically converted into stars following Schaye & Dalla Vecchia (2008), using the metallicity-dependent density threshold of Schaye (2004). The star formation rate per unit mass of these particles is calculated with the gas pressure using an analytical formula that reproduces the observed Kennicutt–Schmidt law (Kennicutt 1998) in disc galaxies (Schaye & Dalla Vecchia 2008), where the lowest possible gas pressure is set by a polytropic equation of state, P ∝ ρ4/3. A Chabrier (2003) initial mass function (IMF) in the range 0.1-100 M⊙ is adopted, where each particle represents a single age stellar population.
It is assumed that stars with an initial mass above 6 M⊙ explode as supernovae after 3 × 107 yr, transferring the energy from the explosions as heat to the surrounding gas. A fraction of the surrounding gas gets an instant raise in temperature of 107.5 K. This thermal energy is stochastically distributed among gas particles neighbouring the explosion event, without any preferential direction (Dalla Vecchia & Schaye 2012). The probability of energy injection depends on the metallicity and density of the local environment in which the star particle formed. The simulated stellar populations release metals to their surrounding environment via Type Ia supernovae, winds, supernovae from massive stars, and AGB stars, following the methodology discussed in Wiersma et al. (2009b).
The gas particles at the centre of the potential well of haloes that reach masses above |$10^{10}\ \rm {h^{-1}\mathrm{M}_{{\odot }}}$| are converted into black hole particles of |$10^{5}\ \rm {h^{-1}\mathrm{M}_{{\odot }}}$| (Springel, Di Matteo & Hernquist 2005). These black holes can accrete mass based on the modified Bondi–Hoyle model of Rosas-Guevara et al. (2016), as well as undergo merging with other black holes (Booth & Schaye 2009). A fraction of 0.015 of the rest-mass energy from the accreted mass is returned to the surrounding medium as energy. AGN feedback is implemented thermally, similar to stellar evolution feedback. Gas particles receive AGN feedback energy stochastically, having their temperatures raised instantly by 108.5 K.
In this manuscript, we will also use the distribution of subhaloes identified in the simulation to define voids. These subhaloes correspond to gravitationally bound substructures that are located within a major DM halo. Virialized haloes are identified by applying the friends-of-friends method. Substructures within haloes are identified using subfind (Springel et al. 2001; Dolag et al. 2009). We refer to these substructures as subhaloes.
Our results are only valid for voids found using this subhalo finder. Other finders include rockstar (Behroozi, Wechsler & Wu 2013a), ahf (Gill, Knebe & Gibson 2004; Knollmann & Knebe 2009) bdm (Klypin & Holtzman 1997) and others, but it is beyond the scope of this work to study the impact of the subhalo finding algorithm on the statistics of voids.
2.1.2 Simulations used in this work
As mentioned in the previous section, EAGLE is a suite that consists of simulations run with different resolutions, volumes and physical models. Here, we list the simulations that are used in this work. Note that only the first two simulations listed here are used for constructing void catalogues, while the later are used for visualization purposes.
Simulations used for constructing void catalogues:
Ref-L0100N1504: the main EAGLE simulation. It has a (100 cMpc)3 volume, initially containing 15043 gas particles with an initial mass of |$\rm {1.81\times 10^6\ \mathrm{M}_{{\odot }}}$| and the same number of DM particles, with a mass of |$\rm {9.70\times 10^6\ \mathrm{M}_{{\odot }}}$|. The physical model this simulation was run with is referred to as the ‘reference’ EAGLE model, and other simulations run with the same subgrid physics and numerical recipes also include the prefix ‘Ref’ next to its name.
DM-L0100N1504: a simulation run with the same initial conditions as Ref-L0100N1504, but that only includes DM particles. It contains 15043 DM particles with a mass of |$\rm {1.44\times 10^6\ \mathrm{M}_{{\odot }}}$|.
Simulations used for visualization and calibration of the void finding algorithm:
Ref-L0025N0376: a smaller simulation that has a (25 cMpc)3 volume, initially containing 3763 gas particles and an equal number of DM particles, with a resolution of 1.81 × 106 and |$\rm {9.70\times 10^6\ \mathrm{M}_{{\odot }}}$|, respectively.
DM-L0025N0376: a simulation run with the same initial conditions as Ref-L0025N0376, but that only includes DM particles. It has a (25 cMpc)3 volume, initially containing 3763 DM particles with a resolution of |$\rm {1.44\times 10^6\ \mathrm{M}_{{\odot }}}$|.
NoFeedback-L0025N0376: a simulation run with a physical variation of the reference model that suppresses stellar and AGN feedback.
We search for voids on Ref-L0100N1504 and DM-L0100N1504. As these simulations share the same initial conditions, the comparison of their voids provides us with a direct assessment of the effects of baryons. We make use of the smaller simulations Ref-L0025N0376, DM-L0025N0376 and No Feedback-L0025N0376 to visually explore the simulation volume and calibrate our void finding algorithm. Note that we do not perform an analysis of void statistics on these smaller simulations, since the statistics become restrictively poor as only a handful of voids are identified in their limited volume.
2.2 Void identification
2.2.1 Void finding algorithm
We use an extensively modified version of the void finder presented in Padilla et al. (2005), with improvements on its computational efficiency, its convergence for different mass resolutions and box sizes, and adapted to run on parallel computers. Voids can be identified using the mass field or halo tracers in the simulation. This modified version of the void finder will be presented in detail in Paillas & Padilla (in preparation). Here, we briefly describe the algorithm:
It searches for low-density regions in the simulation by constructing a rectangular grid, and calculating the number of tracers that fall in each cell of the grid. The centre of a cell that is empty of tracers is considered to be a prospective void centre.
It measures the underdensity in spheres of increasing radius about prospective void centres until some integrated density threshold, Δvoid, is reached. Here, we adopt a value of Δvoid = −0.8. When the mass field is used for identification, Δvoid corresponds to a mass density. When haloes or subhaloes are used as tracers, the threshold corresponds to a number density. Only the largest sphere about any one centre is kept, and the void radius is defined as the radius of that underdense sphere.
It rejects all spheres whose centres overlap with a larger neighbouring sphere by more than a given percentage of the sum of their radii. Increasing this percentage will also increase the number of voids in the sample, but it could eventually lead to larger covariances in the results. In this work, we construct samples in which we allow two adjacent voids to overlap between a 40 and 20 per cent of the sum of their radii.
The value of Δvoid = −0.8 is commonly used in the literature of void studies, both in observations and theoretical works. It is motivated by a linear theory argument presented in Blumenthal et al. (1992), who showed that the void regions we observe in the present time probably correspond to regions with mass densities around 20 per cent of the mean density in the Universe. Choosing a lower density threshold would result in a catalogue with fewer, more extreme underdense regions.
On average, void regions in the Universe exhibit spherical symmetry when stacked, a fact that is used as an argument for constraining cosmological models via the Alcock–Paczynski test (e.g. Sutter et al. 2014; Hamaus et al. 2016). However, individual void shapes can be far from spherical. Since with our finder we search for spherical regions that satisfy an integrated density criteria, it is not suitable for studies that require a detailed description of the geometry of individual voids. Nevertheless, the regions it identifies show a strong density contrast and a fast transition to the average density, which boosts the weak-lensing signal measured around them. This information has been proven to be useful for testing modified gravity models in N-body simulations as in Cai et al. (2015). Watershed-based void finders have also been used to disentangle gravity models using void statistics as in Zivick et al. (2015), who used the Void Identification and Examination toolkit (Sutter et al. 2015) to differentiate models of f (R) gravity from CDM cosmology. Bose et al. (in preparation) will compare different void finding techniques in the context of differentiating between f (R) gravity models, in an attempt to study the impact of the different assumptions that each algorithm uses to define void regions.
Step (iii) of the algorithm naturally replaces voids of comparable sizes with similar centres by a larger void that occupies the volume of all the smaller voids together. However, as discussed above, individual void shapes in the simulation can deviate significantly from spherical symmetry, and it is therefore necessary to allow a certain degree of overlap between adjacent spheres to sample the low-density regions of the cosmic web to a reasonably good extent. This is something that has to be taken into account during the calculation of errors, since two voids with similar centres may represent only one physical void in the simulation, which could artificially lower the statistical errors. In this work, we try different degrees of overlap to study the impact of this parameter on the results.
It is also important to mention that the grid that is constructed in step (i) of the algorithm determines the resolution at which void centres are identified. Increasing the resolution of this grid also increases the probability of finding the optimal centre of a void in the simulation. However, it also increases the number of spurious void centres that arise because of shot noise. This in turn increases the computational time used by the finder, so there is a trade-off between the optimal grid resolution and the computational resources that are available. If the resolution at which a void centre is identified is poor, the void radius is also going to be affected, since the integrated density threshold about that centre will change. Since identifying voids using subhalo tracers is not computationally expensive, we choose a cell size such that the error associated with the radius of the smallest void in our sample is smaller than 10 per cent (consequently, errors for larger voids in the catalogue will be smaller than this). Identifying voids using the mass field is more computationally expensive (roughly 40 000 h of CPU time), so we cannot arbitrarily increase the resolution of the grid. We instead perform convergence tests to choose this parameter, as described in Appendix A. The minimum void radius for each void sample can be found in Table 1 .
A summary of some basic quantities associated with void catalogues obtained in Ref-L0100N1504 and DM-L0100N1504 (top and bottom tables, respectively) at z = 0.0 The first column shows the type of tracer that was used for void identification: subhaloes selected by their stellar mass, subhaloes selected by their total mass or the mass field (individual particles). The remaining columns, from left to right, show the total number of voids, maximum, minimum and mean void radius for each sample.
Void sample . | Nvoid . | rmax . | rmin . | rmean . |
---|---|---|---|---|
(Mpc) . | (Mpc) . | (Mpc) . | ||
Ref-L0100N1504 . | ||||
40 per cent overlap | ||||
Mstel subhalo selection | 709 | 24.3 | 4.9 | 7.0 |
Mtot subhalo selection | 708 | 21.4 | 4.9 | 6.7 |
Mass field | 7695 | 18.3 | 1.4 | 2.5 |
30 per cent overlap | ||||
Mstel subhalo selection | 485 | 24.3 | 4.9 | 7.1 |
Mtot subhalo selection | 470 | 21.4 | 4.9 | 6.8 |
Mass field | 5680 | 18.3 | 1.4 | 2.5 |
20 per cent overlap | ||||
Mstel subhalo selection | 341 | 24.3 | 4.9 | 7.1 |
Mtot subhalo selection | 330 | 21.4 | 4.9 | 6.9 |
Mass field | 4255 | 18.3 | 1.4 | 2.5 |
DM-L0100N1504 . | ||||
40 per cent overlap | ||||
Mtot subhalo selection | 693 | 21.0 | 4.9 | 6.7 |
Mass field | 9536 | 18.6 | 1.4 | 2.4 |
30 per cent overlap | ||||
Mtot subhalo selection | 466 | 21.0 | 4.9 | 6.8 |
Mass field | 6785 | 18.6 | 1.4 | 2.4 |
20 per cent overlap | ||||
Mtot subhalo selection | 346 | 21.0 | 4.9 | 6.8 |
Mass field | 5021 | 18.6 | 1.4 | 2.4 |
Void sample . | Nvoid . | rmax . | rmin . | rmean . |
---|---|---|---|---|
(Mpc) . | (Mpc) . | (Mpc) . | ||
Ref-L0100N1504 . | ||||
40 per cent overlap | ||||
Mstel subhalo selection | 709 | 24.3 | 4.9 | 7.0 |
Mtot subhalo selection | 708 | 21.4 | 4.9 | 6.7 |
Mass field | 7695 | 18.3 | 1.4 | 2.5 |
30 per cent overlap | ||||
Mstel subhalo selection | 485 | 24.3 | 4.9 | 7.1 |
Mtot subhalo selection | 470 | 21.4 | 4.9 | 6.8 |
Mass field | 5680 | 18.3 | 1.4 | 2.5 |
20 per cent overlap | ||||
Mstel subhalo selection | 341 | 24.3 | 4.9 | 7.1 |
Mtot subhalo selection | 330 | 21.4 | 4.9 | 6.9 |
Mass field | 4255 | 18.3 | 1.4 | 2.5 |
DM-L0100N1504 . | ||||
40 per cent overlap | ||||
Mtot subhalo selection | 693 | 21.0 | 4.9 | 6.7 |
Mass field | 9536 | 18.6 | 1.4 | 2.4 |
30 per cent overlap | ||||
Mtot subhalo selection | 466 | 21.0 | 4.9 | 6.8 |
Mass field | 6785 | 18.6 | 1.4 | 2.4 |
20 per cent overlap | ||||
Mtot subhalo selection | 346 | 21.0 | 4.9 | 6.8 |
Mass field | 5021 | 18.6 | 1.4 | 2.4 |
A summary of some basic quantities associated with void catalogues obtained in Ref-L0100N1504 and DM-L0100N1504 (top and bottom tables, respectively) at z = 0.0 The first column shows the type of tracer that was used for void identification: subhaloes selected by their stellar mass, subhaloes selected by their total mass or the mass field (individual particles). The remaining columns, from left to right, show the total number of voids, maximum, minimum and mean void radius for each sample.
Void sample . | Nvoid . | rmax . | rmin . | rmean . |
---|---|---|---|---|
(Mpc) . | (Mpc) . | (Mpc) . | ||
Ref-L0100N1504 . | ||||
40 per cent overlap | ||||
Mstel subhalo selection | 709 | 24.3 | 4.9 | 7.0 |
Mtot subhalo selection | 708 | 21.4 | 4.9 | 6.7 |
Mass field | 7695 | 18.3 | 1.4 | 2.5 |
30 per cent overlap | ||||
Mstel subhalo selection | 485 | 24.3 | 4.9 | 7.1 |
Mtot subhalo selection | 470 | 21.4 | 4.9 | 6.8 |
Mass field | 5680 | 18.3 | 1.4 | 2.5 |
20 per cent overlap | ||||
Mstel subhalo selection | 341 | 24.3 | 4.9 | 7.1 |
Mtot subhalo selection | 330 | 21.4 | 4.9 | 6.9 |
Mass field | 4255 | 18.3 | 1.4 | 2.5 |
DM-L0100N1504 . | ||||
40 per cent overlap | ||||
Mtot subhalo selection | 693 | 21.0 | 4.9 | 6.7 |
Mass field | 9536 | 18.6 | 1.4 | 2.4 |
30 per cent overlap | ||||
Mtot subhalo selection | 466 | 21.0 | 4.9 | 6.8 |
Mass field | 6785 | 18.6 | 1.4 | 2.4 |
20 per cent overlap | ||||
Mtot subhalo selection | 346 | 21.0 | 4.9 | 6.8 |
Mass field | 5021 | 18.6 | 1.4 | 2.4 |
Void sample . | Nvoid . | rmax . | rmin . | rmean . |
---|---|---|---|---|
(Mpc) . | (Mpc) . | (Mpc) . | ||
Ref-L0100N1504 . | ||||
40 per cent overlap | ||||
Mstel subhalo selection | 709 | 24.3 | 4.9 | 7.0 |
Mtot subhalo selection | 708 | 21.4 | 4.9 | 6.7 |
Mass field | 7695 | 18.3 | 1.4 | 2.5 |
30 per cent overlap | ||||
Mstel subhalo selection | 485 | 24.3 | 4.9 | 7.1 |
Mtot subhalo selection | 470 | 21.4 | 4.9 | 6.8 |
Mass field | 5680 | 18.3 | 1.4 | 2.5 |
20 per cent overlap | ||||
Mstel subhalo selection | 341 | 24.3 | 4.9 | 7.1 |
Mtot subhalo selection | 330 | 21.4 | 4.9 | 6.9 |
Mass field | 4255 | 18.3 | 1.4 | 2.5 |
DM-L0100N1504 . | ||||
40 per cent overlap | ||||
Mtot subhalo selection | 693 | 21.0 | 4.9 | 6.7 |
Mass field | 9536 | 18.6 | 1.4 | 2.4 |
30 per cent overlap | ||||
Mtot subhalo selection | 466 | 21.0 | 4.9 | 6.8 |
Mass field | 6785 | 18.6 | 1.4 | 2.4 |
20 per cent overlap | ||||
Mtot subhalo selection | 346 | 21.0 | 4.9 | 6.8 |
Mass field | 5021 | 18.6 | 1.4 | 2.4 |
2.2.2 Finding voids in EAGLE
We run our void finder on Ref-L0100N1504 and DM-L0100N1504 using the mass field (individual particles) and subhalo tracers. When using the mass field, calculating the integrated density profile for each prospective void centre is computationally demanding due to the large number of particles in these simulations. We therefore find the prospective void centres using all the particles in the simulations, but we select a random subsample of 1 per cent of the total amount of particles to compute the integrated density profile for each centre. To study the effect of this subsampling on the structures that are identified, we calibrate the void finder in Ref-L0025N0376, finding that using this percentage of particles produces relative errors in the radius of the voids of less than 10 percent. Considering that L0100N1504 is 64 times bigger in volume than L0025N0376, we expect that voids found in L0100N1504 using 1 per cent of the particles have errors around 1 per cent in the radius compared to those that would be measured using all the particles in the simulation, because of shot noise alone.
When using subhalo tracers, we construct the following subhalo samples to identify voids:
All subhaloes in Ref-L0100N1504 with stellar mass ≥108 M⊙. This results in 40 076 subhaloes.
The 40 076 subhaloes with the largest total mass in Ref-L0100N1504. The total mass of a subhalo is the sum of all the mass contained in gas, DM, stellar and black hole particles. This sample was constructed to have the same number density of objects as sample (i), but since the scatter between the total and stellar mass can be large (Guo et al. 2016), the lowest stellar mass in the sample is |$\rm {M_{stel}} = 6 \times 10^7\ \rm {\mathrm{M}_{{\odot }}}$|, while the lower total mass is |$\rm {M_{stel}} = 1.6 \times 10^{10}\ \rm {\mathrm{M}_{{\odot }}}$|. The stellar mass distribution of this sample peaks at |$\rm {M_{stel}} = 2.8 \times 10^9\ \rm {\mathrm{M}_{{\odot }}}$| and tappers off at lower masses.
The 40 076 subhaloes with most total mass in DM-L0100N1504. Since this simulation does not include baryons, only DM contributes to the total mass of a subhalo. The lowest subhalo mass in the sample is 2 × 1010 M⊙.
While finding voids using a sample of subhaloes selected by total mass is more related to the approach taken in cosmological simulations, a stellar mass cut is more related to observations. The cut in stellar mass can be compared to a cut in magnitude, as used, for example, by Ceccarelli et al. (2013) with the Sloan Digital Sky Survey (SDSS) main galaxy sample.
As mentioned in the previous section, we can obtain void samples with different degrees of overlap from our algorithm. To test the effect of the choice of this parameter on the results, we construct void catalogues with 40, 30 and 20 per cent of overlap for each of the samples described above.
In Fig. 1, we show a 1.5 Mpc thick slice along the z-axis of a snapshot of DM-L0100N1504 at z = 0.0. Our algorithm successfully identifies underdense structures across the simulation. Regions that look underdense but that are not enclosed by a circle could still be identified as voids whose centres do not fall within the ranges of the slice (which is the case for some of the obvious underdense regions in the figure), or they could have been erased by the overlapping criterion. At the same time, some white circles appear to enclose high-density regions, which is mostly caused by a projection effect due to the fact that the radius of the smallest voids is comparable to the thickness of the slice.

A 1.5 Mpc thick slice along the z-coordinate of DM-L0100N1504 showing the distribution of DM particles in the simulation. A grid with (1200)2 cells on the xy plane was constructed, and the number of particles that fall in each cell is shown by the colourmap, using a logarithmic contrast. Voids identified in the mass field that have centres within the slice are shown by white circles, where the radius of the circle corresponds to the physical radius of the void. We show a void sample with 20 per cent of overlap. Regions that look underdense but do not feature a circle in the figure can still be embedded in voids that have centres outside the slice, or the void associated with them could have been deleted by the overlapping criteria adopted. At the same time, circles that appear to surround overdense regions mainly arise because of projection effects.
3 VISUAL INSPECTION OF BARYON EFFECTS
As a motivation for the following sections that will focus on void statistics, we make use of the small EAGLE simulations to visually explore the effects of baryons on the large-scale distribution of matter. In Fig. 2 , we compare the distribution of gas and DM particles (left-hand and right-hand panels, respectively) in Ref-L0025N0376 at z = 0.0. The distribution of baryons looks more diffuse, and the high-density regions are less concentrated than in the DM distribution. This could be due to different physical mechanisms implemented in the subgrid models of EAGLE, such as feedback from supernovae events, AGN feedback and gas cooling, all of which inject or remove energy from the gas and affect its spatial distribution. This feature has also been observed by Haider et al. (2016) in the Illustris cosmological simulation (Vogelsberger et al. 2014).

A 2.0 Mpc thick slice along the z-coordinate of Ref-L0025N0376 at z = 0, showing the distribution of gas (left-hand panel) and DM (right-hand panel) particles in the simulation. A grid with (1200)2 cells on the xy plane was constructed, and the number of particles that fall in each cell is shown by the colourmap, using a logarithmic contrast. The distribution of gas appears to be more diffuse, with wider filaments and less clumpy regions than the DM distribution. This is due to different baryonic processes that take place within these regions, such as cooling and photoheating, stellar evolution and AGN feedback, all of which affect the properties of gas particles.
In Fig. 3 , we compare the distribution of gas particles in Ref-L0025N0376 (left-hand panel) and NoFeedback-L0025N0376 (right-hand panel) at z = 0.0, where the colourmap now shows the temperature of gas particles. As mentioned previously, NoFeedback-L0025N0376 is a simulation that was run with a physical variation of the EAGLE code, in which stellar evolution feedback and AGN feedback have been suppressed. Although the large-scale gas distribution is similar in both simulations, some clear signatures of baryon effects can be spotted in Ref-L0025N0376, which shows presence of hot gas in the surroundings of high-density regions, as can be seen in the centre of the simulation. The fact that this supply of hot gas is not present in NoFeedback-L0025N0376 suggests that this gas comes from winds expelled from galaxies due to supernovae and AGN feedback.

A 2.0 Mpc thick slice along the z-coordinate showing the distribution of gas particles in Ref-L0025N0376 (left-hand panel) and NoFeedback_Anarchy-L0025N0376 (right-hand panel) at z = 0. A colourmap with a logarithmic contrast shows the temperature of these gas particles. NoFeedback_Anarchy-L0025N0376 is a physical variation of EAGLE which supresses stellar and AGN feedback in the simulation. The simulation with baryonic physics exhibits an overabundance of particles with high temperatures in regions surrounding the dense clumps of matter where haloes are formed. Shock-like structures suggest that part of this mass is being ejected from these haloes into the more underdense regions via feedback events.
Fig. 4 shows the metallicity of gas particles in Ref-L0025N0376. It can be noticed that regions with high metallicity match with regions of high temperature in Fig. 3, and it is interesting to see that the hot gas winds spotted in Fig. 3 show a large metal content, which confirms the picture that this gas corresponds to material that was processed in galaxies. These figures indicate that baryon effects trigger a complex and rich exchange of material between galaxies and the circumgalactic/intergalactic media.

A 2.0 Mpc thick slice along the z-axis of Ref-L0025N0376, showing the distribution of gas particles. A colourmap with a logarithmic contrast shows the metallicity Z of these particles. Regions with Z = 10− 6 have particles with zero metallicity. By comparing with Fig. 3, it can be noticed that particles that are expelled from dense regions in the form of winds have a high metallicity, suggesting that they correspond to processed material being blown away from haloes via SNe or AGN feedback.
In principle, the effects mentioned above could have an impact on void properties. The most intuitive way in which this could happen is when identifying voids using the mass field in the simulations. If feedback mechanisms inject mass in regions that would otherwise be empty, the properties of the voids identified near these regions could change. The voids could either shrink or have their centres displaced, and in a more extreme case a void could disappear from a particular region of the simulation if feedback mechanisms pollute that region with enough mass.
Baryon effects could also affect the properties of voids identified using subhalo tracers. As can be seen in Figs 3 and 4, feedback mechanisms can remove matter from galaxies and expell it to the surrounding medium. The most direct consequence of this would be a change in the mass of subhaloes that undergo these feedback events. Since we select the subhalo tracers using a stellar mass and a total mass cut, a change in the subhalo mass function could result in a population of voids with different properties.
An additional effect that could be present in the analysis of voids identified using subhalo tracers is the scatter in the stellar mass–subhalo mass relation, which naturally arises in a hydrodynamical simulation. Notice that this is the only baryon effect that would also be present to some degree in a semi-analytic galaxy catalogue.
4 VOID ABUNDANCE
As mentioned in Section 2.2.2, for each simulation we obtain voids identified in the mass and the subhalo fields. For the latter case, two different selection cuts were used in Ref-L0100N1504: a stellar mass (Mstel) and a total mass (Mtot) cut.
4.1 Voids identified using the mass field
In Fig. 5 , we show the abundance of voids identified using the mass field in snapshots of Ref-L0100N1504 (black solid line) and DM-L0100N1504 (red dashed line) at z = 0.0, adopting 10 bins in void radius.

Upper panels: the cumulative distribution of sizes for voids identified with mass in Ref-L0100N1504 (black solid line) and DM-L0100N1504 (red dashed line) at z = 0.0. Lower panels: the ratio between each of the lines in the upper panel and the black solid line. The left-hand, middle and right-hand panels show three independent void samples in which two adjacent voids can only have centres separated by more than 40, 30 and 20 per cent of the sum of their radii, respectively. DM-L0100N1504 produces a larger amount of voids with 1.5 < rvoid < 5 Mpc than Ref-L0100N1504. The action of feedback mechanisms (SNe and/or AGN) in Ref-L010015N04 polutes underdense regions with processed material, suppressing the number of small voids identified in this simulation.
The total number of voids is larger when more overlap is allowed between adjacent voids. This is expected, as when two voids share more volume than what is allowed by this parameter, only the largest one is kept in the catalogue. The abundance of voids with 40 per cent overlap is higher than the catalogues with 30 and 20 per cent of overlap by a factor of 1.5 and 2.3, respectively.
When considering void catalogues with 40 per cent of overlap, a quick look at Table 1 reveals that 9536 voids were identified in the DM-only simulation and 7695 in the simulation with baryonic physics. This corresponds to roughly a 24 per cent difference in abundance, and similar values are found for the samples using 30 and 20 per cent of overlap. The DM-only simulation produces a larger number of voids with 1.5 < rvoid < 5 Mpc, which could be a hint of baryonic processes adding (or retaining) mass in regions that would otherwise be more underdense if these mechanisms were not present. DM-L0100N1504 also seems to produce a larger quantity of voids with rvoid ∼ 10 Mpc, although the associated statistical uncertainties prevent us from drawing robust conclusions about this difference. A larger simulation volume is needed in order to study this effect with higher significance.
In principle, an injection of mass into an underdense region would tend to shrink the void if we keep the centre fixed, since the integrated density criteria about that centre would be satisfied at a smaller radius. We must also remember that our algorithm imposes a minimum void radius cut. The motivation behind that threshold is that below some point, spherical regions cannot longer be resolved. If these void regions shrink below the minimum radius adopted, they are removed from the void catalogue, which could explain the difference in the total number of voids between Ref-L0100N1504 and DM-L0100N1504.
Another possible scenario is that feedback mechanisms produce a displacement of the centre of the void. We remind the reader that our algorithm grows spheres about similar centres in the simulation, until the Δ = −0.8 density threshold is satisfied. The overlapping criteria ensures that if two spheres have similar centres but different radii, only the largest sphere is identified as a void. The location and size of this largest sphere can change depending on how these feedback mechanisms are affecting the environment, and a change in the position of a void can also affect smaller voids that surround it. Under this scenario, we could have a case in which a void in DM-L0100N1504 grows very large and the overlapping criteria deletes some of the smaller voids surrounding it. In Ref-L01001N504, this void could be smaller, which could result in the survival of the smallest voids that were deleted in the DM-only case, thus producing a difference in void abundance as seen in Fig. 5 .
4.2 Voids identified using subhaloes
In the upper panels of Fig. 6 , we show the cumulative distribution of sizes for voids identified using subhaloes in snapshots of Ref-L0100N1504 and DM-L0100N1504 at z = 0.0, using 10 bins in void radius. The black solid line shows voids traced by subhaloes selected by their total mass in Ref-L100N1504. The long-dashed blue line shows to voids traced by subhaloes selected by their stellar mass in Ref-L0100N1504. The red dashed line shows voids identified by subhaloes selected by their total mass in DM-L0100N1504. The left-hand, middle and right-hand panels show void samples where two adjacent spheres cannot have centres closer than 40, 30 and 20 per cent of the sum their radii, respectively. The lower subpanels show the ratio between each curve and the black solid line. Grey regions show 1σ Poisson errors for the black solid line.

Upper panels: the cumulative distribution of sizes for voids identified with subhaloes selected by their total mass in Ref-L0100N1504 (black solid line), by their stellar mass in Ref-L0100N1504 (blue, long-dashed line), by their stellar mass in Ref-L100N1504, but after shuffling the stellar masses of all subhaloes in the simulation (cyan, dot–dashed line) and by their total mass in DM-L0100N1504 (red dashed line). All measurements correspond to z = 0.0. Lower panels: the ratio between each of the lines in the upper panel and the black solid line. The left-hand, middle and right-hand panels show three independent void samples in which two adjacent voids can only have centres separated by more than 40, 30 and 20 per cent of the sum of their radii, respectively. There are no significant differences between the reference and the DM-only models when the same total mass selection cut is used to find voids. However, voids traced by subhaloes selected by their stellar mass are systematically larger than those produced by a total mass cut.
Here, the sample with 40 per cent of overlap has roughly twice as many voids as the 20 per cent sample, and nearly 1.5 times more than the 30 per cent sample, similar to what is found using the mass field.
By comparing the black and red lines, we see that differences in the abundance of voids in Ref-L0100N1504 and DM-L0100N1504 are consistent within the errors when subhaloes are selected by their total mass in the simulation with baryonic physics. However, the blue line shows that voids traced by subhaloes selected by their stellar mass are slightly larger than those traced by total mass-selected subhaloes, and even though the total number of voids is roughly equal in all cases, they show an overabundance of more than 30 per cent for rvoid > 10 Mpc for the sample with 40 per cent of overlap. This difference is smaller when less overlap is allowed between voids, which can also be noticed by looking at Table 1, which summarizes the values for the total number of voids, the maximum, minimum and mean void radius for each sample. It is, however, interesting to see that the trends in Fig. 6 are similar regardless of the value of the overlap parameter.
As shown by previous theoretical studies, there is a scatter in the relation between the host subhalo mass and the stellar mass of galaxies (e.g. Contreras et al. 2015; Guo et al. 2016). For instance, a subhalo with low stellar mass can still contain a substantial amount of DM. This implies that selecting subhaloes in EAGLE by their stellar or their total mass can produce a catalogue of objects with different properties. If these two subhalo samples have a different spatial distribution, they might trace the cosmic web differently, which could translate in voids with different properties. To explore this in further detail, we construct a subhalo subsample in which we shuffle the stellar masses of all the subhaloes in Ref-L0100N1504 in 40 bins of total mass. We then construct a new subhalo sample selected by stellar mass with the shuffled catalogue. This way, we obtain a void population almost exactly equal to the one traced by total mass-selected subhaloes, as shown by the cyan dot–dashed line in Fig. 6. This indicates that the relation between total and stellar subhalo mass is driving the differences between the black and the blue curves in Fig. 6.
Abundance matching results show that the galaxy stellar mass–halo mass relation becomes tighter at high stellar masses (≳ 109 M⊙; e.g. Moster et al. 2010; Behroozi, Wechsler & Conroy 2013b; Mitchell et al. 2016). We attempted to adopt a higher stellar mass cut at the time of finding the voids. However, the statistics become restrictively poor, preventing us from reaching robust conclusions for the differences in the void populations caused by the structures used to trace the cosmic web. Such an analysis would require a larger simulated volume. Nevertheless, wide-area optical and near-IR redshift surveys, such as the SDSS and upcoming deep surveys, such as WAVES (Driver et al. 2016), are complete down to stellar masses even lower than we have adopted here, and thus we consider it pertinent to carefully investigate the results of our void finding algorithm using galaxies with stellar masses >108 M⊙.
As discussed above, the fact that voids are larger when traced by subhaloes selected by their stellar mass suggests that these subhaloes have a different spatial distribution than those selected by their total mass. In Fig. B1 of the appendix, we show the two-point autocorrelation function for the two sets of tracers, finding that subhaloes selected by their stellar mass are more strongly clustered than total mass-selected subhaloes. A stronger clustering signal indicates that these objects have a stronger bias with respect to the underlying mass distribution, as they live in higher density peaks of the cosmic web. This produces a catalogue of voids with larger sizes. We remind the reader that our void finding algorithm grows spheres around prospective void centres until the integrated density profile is equal to 20 per cent of the mean number density of subhaloes in the simulation. If the subhalo tracers are preferentially located in higher density peaks, those spheres will be able to grow more before satisfying the density criteria, compared to the case where the spatial distribution of the tracers is more homogeneous.
Given that a significant contribution to the signal of clustering and weak-lensing studies in future surveys will come from z ∼ 0.5, we also obtain void catalogues for snapshots of Ref-L100N1504 at z = 0.5 and z = 1.0, in order to see if the differences in void abundance evolve as a function of time. Figures can be found in Appendix C, where we show that the trends found at z = 0 are persistent at higher redshift.
5 VOID PROFILES
5.1 Void density profiles
An important statistical tool to characterize void regions is their radial density profile. It is defined as the spherically averaged relative deviation of density around a void centre, compared to the mean value |$\overline{\rho }$| across the simulation, |$\rho _{{\rm r}} /\overline{\rho }$|. This profile can be computed in a differential or an integrated way.
When computing the subhalo density profile, wi = 1, as every subhalo is assigned an equal weight in the calculation. When computing the mass density profile wi corresponds to the mass of the particle i, which has a value of 9.70 × 106 M⊙ if it is a DM particle, but can vary if it is a gas, star or black hole particle (gas particles initially weigh 1.81 × 106 M⊙, but they can later gain mass via different processes; see Schaye et al. 2015 for more details). We re-scale each void-to-tracer distance by the void radius before stacking the profiles.
In the upper panels of Fig. 7 , we show the differential subhalo (left) and mass (right) density profiles for voids identified using subhalo tracers in Ref-L0100N1504 and DM-L0100N1504. The black lines show voids identified using subhaloes selected by their total mass in Ref-L0100N1504. Voids identified using subhaloes selected by their stellar mass in Ref-L0100N1504 are shown by the blue long-dashed lines, while voids identified by subhaloes in DM-L100N1504 are represented by the red dashed lines. The lower panels show the absolute difference between each line and the black solid line. The error bars show the scatter around the mean for the black and blue lines. Error bars for the red line are not shown for clarity, but their sizes are similar to the other two, as we have explicitly checked. These profiles correspond to the void sample with 30 per cent of overlap. We have checked that results for the samples with 40 and 20 per cent of overlap show the same trends, with only minor differences in the size of the error bars due to the different number of voids.

Upper panels: subhalo number (left) and mass (right) density profiles for voids traced by: subhaloes selected by their total mass in Ref-L0100N1504 (black solid line), subhaloes selected by their stellar mass in Ref-L0100N1504 (blue long-dashed line), subhaloes selected by their total mass in DM-L0100N1504 (red dashed line). The green dot–dashed line shows the density profile in DM-L0100N1504 for voids traced by total mass-selected subhaloes in Ref-L0100N1504. The magenta solid line in the right-hand panel shows the mass density profile for total mass-selected subhaloes in Ref-L0100N1504, computed using only the gas particles in the simulation. All measurements correspond to z = 0. Lower panels: the absolute difference between each of the lines in the upper panels and the black solid curve of the upper panel. Error bars show the dispersion on the mean velocity profile for each radial bin. There are differences in the profiles of reference and DM-only model voids, even when the same total mass selection cut is applied for the tracers, this being more notorious at the peak of the profile. However, these differences are due to the different spatial distribution of the most massive subhaloes in the simulations. This effect is removed by using the same void centres in both simulations to calculate the profiles, as shown by the green dot–dashed line. Voids traced by stellar mass-selected subhaloes show a significantly higher peak, reflecting an overabundance of tracers near and within the walls of void regions.
The black and the red lines compare results in the simulations with and without baryonic physics, respectively, for voids identified using subhaloes tracers selected by their total mass. Looking at the subhalo number density profile (left-hand panel), very small differences are observed between these two, although they are encompassed within the range of the statistical errors. It is important to stress, however, that it is expected that these two profiles are not identical, even though the initial conditions of both simulations are the same and the subhaloes were selected in the same way. Schaller et al. (2015a) used the EAGLE simulations to show that the mass of a subhalo in a hydrodynamical simulation can change substantially compared to its DM-only counterpart due to baryonic processes. Therefore, the most massive subhaloes in Ref-L0100N1504 and DM-L0100N1504 are located in different places in both simulations, and as a consequence, the voids traced by them can correspond to different structures. To remove this effect, we take the void centres found by total mass-selected subhaloes in Ref-L0100N1504, and compute their subhalo number density profile in DM-L0100N1504. This is represented by the green dot–dashed line, which falls right on top of the black solid line. As shown in the lower panel, these two curves have absolute differences very close to zero, which means that the main effect produced by baryons on the subhalo density profile of voids comes from a change in the subhalo mass function.
It is interesting to notice that voids traced by stellar mass-selected subhaloes show a significantly more pronounced overdense ridge compared to the samples selected by total mass. This is in agreement with the discussion of the previous section, in which we commented on how selecting subhaloes by their stellar mass produces a distribution of tracers that are more strongly clustered. The fact that these subhaloes have a stronger clustering signal implies that they are preferentially located in overdense environments, which shows up as an excess in amplitude at the void ridge when compared to voids traced by less-clustered subhaloes.
Although the subhalo number density profile is in principle something that can be observed in the real Universe through large-scale galaxy surveys, we must keep in mind that subhaloes form in overdense peaks of the mass density field, and as such, they only provide a biased view of the mass distribution around and within voids. The right-hand panel shows the mass density profile for the same voids discussed above. These density profiles were computed taking into account all the mass in the simulation, be it contained in gas, stars, DM or black hole particles. The overall shape of the profiles is similar to those computed using subhaloes, which means that subhaloes trace the mass distribution in voids to a reasonable extent. We have to keep in mind that these voids were selected using a subhalo number density threshold, so no explicit condition regarding the distribution of mass in these voids was imposed for their construction. In spite of this, some interesting differences with respect to the halo density profile can be noted. When looking at the mass distribution inside voids, we see that they are less empty of matter than what can be inferred by looking at the distribution of subhaloes alone. The lowest mass density is found at the void centres, reaching values as low as 20 per cent of the mean mass density in the simulation, while subhalo densities reach slightly lower values around 10 per cent of the mean density.
Another difference that is observed is that the discrepancy between voids identified by total mass-selected subhaloes and stellar mass-selected subhaloes now extends below r/rvoid ∼ 1. This confirms the picture that these two void populations are distinct and they are being traced by different structures in the simulation.
Even though these mass profiles were computed using the mass contained both in baryons and DM, we must keep in mind that DM accounts for nearly 85 per cent of the total mass in the simulation. For this reason, the black and blue lines are mostly dominated by the DM distribution within and around voids in Ref-L0100N1504. Even though baryon effects such as galaxy feedback might also affect the distribution of DM, their most direct impact is expected to be seen in the distribution of gas. In order to study this we computed the mass density profile of voids using only gas particles. We do this for voids identified with total mass-selected subhaloes. This is shown by the magenta solid line in the right-hand panel of Fig. 7. We find an excess of gas density below r/rvoid < 1 compared to the distribution dominated by DM, which could be indicative of baryonic feedback processes polluting voids with pre-processed gas close to their walls. In this sense, the voids we find in the simulation contain a little more gas than DM, which is consistent with the picture of SNe and AGN winds injecting mass in void regions, as discussed in Section 3.
We have calculated the covariance matrices of these density profiles, and we do not find strong off-diagonal correlations among different bins, which means that we can compare the differences between lines and compare them with the error bars with confidence. More details can be found in Appendix D, where we also compute the reduced X2 statistic to have a more quantitative measure of the statistical similarity between the profiles.
5.2 Void velocity profiles
The large-scale structure of the Universe is far from static. Matter continuously flows in and out of void regions, and previous studies have suggested that there is a rich interchange of material between voids and other components of the cosmic web (e.g. Padilla et al. 2005; Cai et al. 2015; Haider et al. 2016). The study of void velocity profiles is of great value for understanding void dynamics and evolution, as well as the relation between voids and the large-scale environment that surrounds them. It can also provide key information regarding redshift-space distortions around voids (Cai et al. 2016), which is essential for the interpretation of methods such as the AP test and the stacking of voids to get an ISW signal.
Fig. 8 shows the velocity profiles for voids identified in Ref-L0100N1504 and DM-L0100N1504. We observe that there is a coherent outflow of subhaloes across a wide range of distances from the void centre. The outflow peaks near the void radius, consistent with the strong abundance of subhaloes in that region. The profile reaches velocities around 60 km s−1, and at r/rvoid > 1 it decreases, converging to zero for larger distances.

Upper panel: radial velocities of subhaloes with respect to the centre of voids traced by: subhaloes selected by their total mass in Ref-L0100N1504 (black solid line), subhaloes selected by their stellar mass in Ref-L0100N1504 (blue long-dashed line), subhaloes selected by their total mass in DM-L0100N1504 (red dashed line). The green dot–dashed line shows the velocity profile in DM-L0100N1504 for voids traced by total mass-selected subhaloes in Ref-L0100N1504. All measurements correspond to z = 0. Lower panels: the absolute difference between each of the lines in the upper panels and the black solid curve. Error bars show the dispersion on the mean velocity profile for each radial bin. There are small differences between the reference and DM-only voids when the same total mass selection cut is applied for the tracers. This time, however, these differences cannot be reconciled by using the same centres in both simulations to compute the profiles (green dot–dashed line).
Very small differences, within the size of the errorbars, are found between voids identified by subhaloes selected by their total mass in the simulations with and without baryonic physics (solid black and dashed red lines, respectively). This is in agreement with previous sections, which have shown that the abundance and density profiles of these voids are fairly similar.
The blue long-dashed line shows the velocity profile for voids traced by stellar mass-selected subhaloes. The outflow velocities of these voids are consistent with the ones found by voids traced by total mass-selected subhaloes near r/rvoid = 1, but between 1 < r/rvoid < 5 the velocities are lower by about 5-10 km s− 1. We only show the error bars for the black line for clarity, but errors associated with the red and blue lines are of a similar magnitude.
It appears that the choice of tracer that is used for void identification has a strong impact not only on the size and density profile of a void, but also on its dynamics. In principle, this is something to be expected, since the dynamics of a void directly depend on the distribution of matter around it. Nevertheless, the trends observed in Fig. 8 need to be interpreted with care, given the small magnitudes of the velocity difference that is observed. Moreover, we have noticed that the differences between these profiles are sensitive to the overlap parameter of the void sample. The samples with 30 and 20 per cent of overlap have larger statistical errors and the differences between the black and blue line appear to be less significant. A larger simulated volume is needed to study this feature in greater detail.
One particular feature that calls our attention in these profiles is that we do not observe inflow velocities at any distance from the void centre. While some voids might be located in regions that are currently expanding, others might reside in overdense environments that are collapsing or will eventually collapse, which would be translated in material flowing into the void. In the next section, we discuss the existence of these two void populations in the context of EAGLE.
6 VOIDS AND THEIR LARGE-SCALE ENVIRONMENT
As we have seen in previous sections, the mass or halo density in voids increases towards the void walls. The steepness of this increase holds important information about the dynamics of the void, as the total mass contained in their walls might overcompensate or undercompensate the mass that is missing inside the void. Under this context, some voids might be located in undercompensated regions that expand as the Universe evolves, while others might have overcompensating walls that will eventually make the void collapse. This has lead to a classification of voids into two distinct populations, void-in-cloud and void-in-void (Sheth & van de Weygaert 2004; Ceccarelli et al. 2013; Cai et al. 2014; Hamaus, Sutter & Wandelt 2014), depending on whether they are located in a large-scale environment that is collapsing or expanding, respectively.
Ceccarelli et al. (2013) classify voids into void-in-cloud and void-in-void depending on whether the accumulated overdensity at r/rvoid = 3 is above or below 0, respectively. Hamaus et al. (2014) and Cai et al. (2015) have shown that voids can also be separated in these different populations depending on their size. Voids with larger (effective) radii are more likely to be void-in-void, while void-in-cloud normally correspond to smaller voids.
We have chosen not to distinguish between these different types of voids in previous sections due to the restricted volume of the simulations we are analysing. We must keep in mind that the largest void in our sample is comparable in size to the smallest voids in the 1 Gpc h−1 simulation box used in Cai et al. (2015) and Hamaus et al. (2014). The differentiation of these two void classes might depend on the large-scale modes present in any given simulation, and the size of EAGLE prohibits us from studying the large-radius end of the void size distribution. It is, nevertheless, still interesting to explore whether we can identify distinct void populations in EAGLE, as a way to improve our understanding between the void identification process and its relation to the size, resolution and tracer sampling of a simulation. Motivated by the results of previous sections, it is also important to determine the effect of baryons in the two void classes, in particular regarding the differences in void properties that arise as a result of using different subhalo samples to identify voids. We have to take this into account because due to simulation size, possible biases in our void samples could be present in the fraction of void-in-voids as a function of void radius.
In Fig. 9 , we show integrated subhalo number density profiles for voids in Ref-L0100N1504. The thin grey lines show individual profiles for voids identified using subhalo tracers selected by their total mass. The mean of all these profiles is shown by the solid black line, with error bars that show the scatter around the mean. For reference, the upper and lower horizontal dashed lines show the mean density and 20 per cent of the mean density of subhaloes in the simulation. As can be noted, individual void profiles can differ significantly. Some voids appear to be embedded in highly overdense environments, reaching values well above the mean density, while others have profiles that slowly increase towards the mean. It can be noticed that Δ = 0.2 at r/rvoid = 1, which corresponds to the density threshold imposed for void identification. High values of Δ are found near the centres of some voids. This is due to the presence of a small number of subhaloes in inner void regions that increase the integrated density well above the mean at such distances from the void centre. Some of these voids satisfy the density criteria at distances smaller than rvoid, but our algorithm guarantees that only the largest sphere satisfying the density criteria about any one centre is classified as a void.

Integrated density profile for voids in Ref-L0100N1504 at z = 0.0. The thin grey lines show individual profiles for voids identified using subhalo tracers selected by their total mass. The solid black line shows the average of these profiles. The dashed black lines above and below 1 show the average profile for voids classified as void-in-cloud and void-in-void, respectively, depending on whether their integrated density at r/rvoid = 3 is above or below 1. Blue dashed lines show the same results for voids identified using stellar mass-selected subhaloes. For reference, the horizontal dashed lines show the mean density of subhaloes and 20 per cent of the mean density. Error bars show the scatter around the mean of each profile.
When averaging over all curves, these voids show a profile that rises steeply out to r/rvoid ∼ 1 and then converges slowly to the mean density for larger distances, as seen in the previous section. However, this profile arises after averaging over very distinct curves. Following Ceccarelli et al. (2013), we separate voids in two different samples, depending on whether their integrated density is above or below 1 at r/rvoid = 3. The profile for these two samples are shown by the dashed and long-dashed black lines above and below the mean, respectively. These profiles resemble two distinct void populations: void-in-cloud are embedded in dense environments and have prominent ridges that overcompensate the lack of matter in their interiors. Void-in-void, on the other hand, show a profile that slowly increases towards the mean. They are undercompensated and do not feature an overdense ridge, which suggests that these voids may be located in an expanding large-scale environment.
The dashed and long-dashed blue lines show these same profiles for voids identified using subhalo tracers selected by their stellar mass. While the void-in-void population seems to be rather unaffected by this different selection for the subhaloes, the void-in-cloud population shows a profile that reaches higher amplitudes between 1 < r/rvoid < 4. This suggests that the differences observed in the abundance and density profiles of voids in the previous section come mainly from voids embedded in overdense environments.
In Fig. 10 , we show the subhalo velocity profiles for these two void populations. We notice that voids embedded in overdense environments show strong inflow velocities across a wide range of distances from the void centre. Voids that resemble the void-in-void population, on the other hand, exhibit strong outflow velocities, indicating that subhaloes are being evacuated from these regions. While these trends are consistent with what has been found in previous works (e.g. Ceccarelli et al. 2013; Paz et al. 2013), the interpretation of these results must be taken with care due to the small volume of the simulation. It might well be the case that many of the voids that resemble void-in-void populations in this study may actually be located in regions that we would otherwise identify as overdense if the volume of EAGLE was larger. It is, however, interesting to find that we can identify different kind of voids in such a small simulation box. The extent to which this difference is physical, and how this differentiation depends on the properties of the simulation will need to be answered in future works, as this area still remains largely unexplored.

Radial subhalo velocity profile for voids in Ref-L0100N1504. The solid black line shows the mean profile for voids identified using subhalo tracers selected by their total mass. The long-dashed and short-dashed lines show the mean profiles for voids classified as void-in-cloud and void-in-void, respectively. Blue lines show results for voids identified using stellar mass-selected subhaloes. Error bars show the scatter around the mean of each profile.
7 CONCLUSIONS AND DISCUSSION
We have analysed cosmic voids the EAGLE simulations, using subhalo tracers and the mass field for void identification. We have studied the impact of baryonic physics on void statistics by comparing void populations in the main hydrodynamical EAGLE simulation (Ref-L0100N1504) and its DM-only counterpart (DM-L0100N1504), which was run using the same initial conditions but only following the evolution of DM. EAGLE provides a unique opportunity to test the effects of baryons on void properties due to its high resolution and detailed treatment of baryonic physics, using subgrid models that follow star formation, radiative cooling, stellar feedback from massive stars, black hole growth and AGN feedback.
We define voids as spherical regions that have integrated density profiles equal to 20 per cent of the mean mass/subhalo density in the simulation. We have used a modified version of the spherical underdensity void finder presented in Padilla et al. (2005). When finding voids in the simulation with baryons using subhalo tracers, we used two different samples of subhaloes: the first one consists in a sample of subhaloes selected by stellar mass (|$\rm {M_{stel} \ge 10^8\ \mathrm{M}_{{\odot }}}$|). The second sample consists in a set of subhaloes with the same number density, but selected by their total mass. In DM-L0100N1504, subhaloes are selected by their total mass, and the number density of subhaloes remains fixed and equal to the other two samples.
We have calculated the void abundance, density and velocity profiles for all the samples described above. We find effects originating from feedback, which expels gas into voids and, more importantly, from the stochasticity in the stellar mass that manages to form in subhaloes, that is from the scatter of the stellar mass–DM mass relation.
Our main results can be summarized as follows:
DM-L0100N1504 produces about 24 per cent more voids than Ref-L0100N1504 when the mass field is used to identify voids. However, this difference is mainly comes from small voids with 1.5 < rvoid < 5 Mpc (Fig. 5). The action of feedback mechanisms from supernovae and AGN in Ref-L010015N04 pollutes void regions with hot gas, which causes some of these voids to shrink in size compared to their DM-only counterpart. If these regions shrink enough, they do not enter our sample of voids selected by our algorithm because their sizes fall below the minimum void size selection, which results in the hydrodynamical simulation having a lower overall abundance of voids than the DM-only simulation. This effect is also observed for voids with rvoid ∼ 10 Mpc, although the associated statistical uncertainties are large. A bigger simulated volume is needed to test this effect with higher significance.
There are no significant differences in the abundance of voids identified using subhaloes in Ref-L0100N1504 and DM-L0100N1504 when subhaloes are selected by their total mass in both simulations. However, selecting subhaloes by their stellar mass tends to produce slightly larger voids (Fig. 6). This discrepancy arises because a stellar mass cut tends to select subhaloes that have a stronger bias with respect to the underlying matter distribution. These subhaloes are located in higher density peaks of the cosmic web than those selected by their total mass, which in the end results in larger voids. These differences are also seen for snapshots of EAGLE corresponding to z = 0.5 and z = 1.0
When subhaloes are selected by their total mass, small differences are seen in the subhalo density profiles of voids in Ref-L0100N1504 and DM-L0100N1504 (Fig. 7). These differences are mainly caused by the effect of galaxy feedback on the subhalo mass function of the simulations. When taking this effect into account by computing the density profile with subhaloes of DM-L0100N1504 for voids identified in Ref-L0100N1504, the profiles are almost identical. When subhaloes are selected by their stellar mass, voids show density profiles with a more pronounced peak, reflecting an overabundance of subhaloes around void walls. This confirms the differences found in void abundance, namely voids traced by galaxies selected by their stellar mass are larger due to the fact that these tracers are more strongly clustered than those selected by their total mass. These differences are also seen at z = 0.5 and z = 1.0.
The void mass density profiles show similar features as the profiles measured using subhaloes, meaning that subhaloes correctly capture the features of the mass distribution within and around voids with reasonable accuracy for a biased tracer of the density field. By measuring the void gas density profile we find an excess of gas near void walls, suggestive of the action of feedback mechanisms polluting voids with hot gas coming from galaxy winds.
No significant differences are found in the velocity profiles of voids in the simulation with baryons and its DM-only counterpart, if the subhaloes are selected by their total mass. For the case where subhaloes are selected by their stellar mass, subhaloes evacuate void regions at slightly lower (5–10 km s−1) velocities than in the total mass case. The void velocity profiles appear to be very sensitive to the overlap that is allowed between voids in our algorithm, while the associated statistical uncertainties are still too large to establish a robust conclusion about this trend.
We are able to identify two different voids populations in EAGLE: voids embedded in underdense large-scale environments that appear to be expanding, and voids in contracting dense environments. These resemble the void-in-void and void-in-cloud populations found in previous works. The effects of baryons appear to be more significant for the void-in-cloud, as these voids show distinct density and velocity profiles when the subhaloes by which they are traced are selected by their stellar or total mass. The fraction of void-in-void that we find is probably affected by the restricted number of large-scale modes that are present in these simulations. Nevertheless, this raises many questions about the relation between this void classification and the properties of the simulations in which these structures are identified.
It appears that void properties are well captured by DM-only simulations, with baryons only adding second-order effects, which are less important than those so far reported for alternative cosmologies.
When identifying voids in the mass field, we find that a DM-only simulation tends to produce larger voids than a hydrodynamical one due to the effects of galaxy feedback. While this is an interesting result that sheds light on the impact of baryonic processes on the large-scale distribution of matter, it does not directly have an impact on observational studies of voids identified in galaxy surveys. We do find, however, that care must be taken with the galaxy tracers that are used to find and characterize voids, since they can have a strong impact on the properties that are inferred. The different results from stellar mass and subhalo mass cuts indicate that this check needs to be done for each void search, mimicking the number density of the tracer sample. Moreover, the bias of the galaxy sample needs to be reproduced by the haloes used to identify voids in a mock sample. The fact that these effects are also present at higher redshifts in the simulations suggests that they should be considered when studying voids in current and future large-scale galaxy surveys, especially in the context of studies that aim to constrain cosmological models from void statistics. The results of this work are relevant in particular to spherical-based void finders, which are more suitable in the case of weak-lensing studies. It is worth noting, however, that it is difficult to predict whether the differences that we find in this work can be extrapolated to other void finders, especially to those that are not spherical based. Nevertheless, this opens the possibility for future works to expand on this matter.
Given the size of the EAGLE simulation, we are not able to make predictions for voids larger than ∼ 25 Mpc, which is comparable to voids identified in the main galaxy sample of SDSS. This is the best that can be done including all baryonic physics to date, but since we have found that the actual gas distribution is not as important as the galaxy population that is used to identify voids, in Paillas et al. (in preparation) we will look at voids in different semi-analytic models of galaxy formation, which will be relevant to many more surveys. It is also worth noting that it is difficult to predict whether the differences that we find in this work can be extrapolated to other void finders.
Considering the results obtained in this study, a suitable avenue for future works would be to combine DM-only simulations with abundance matching or semi-analytical models to populate the DM haloes with galaxies in order to characterize voids closely to what observers would do.
Acknowledgments
We are grateful for the helpful comments and suggestions made by the anonymous referee. We thank Carlton Baugh and Joop Schaye for insightful discussions and comments. EP is supported by Proyecto Financiamiento BASAL PFB06. CDPL is funded by a Discovery Early Career Researcher Award (DE150100618). NP is supported by ‘Centro de Astronomía y Tecnologías Afines’ BASAL PFB-06 and by Fondecyt Regular 1150300. PBT acknowledges support from Fondecyt 1150334 and UNAB project 677-15. EP and NP acknowledge support for exchanges between PUC and Durham University from the Newton-CONICYT fund project DPI20140114. Parts of this research were conducted by the Australian Research Council Centre of Excellence for All-sky Astrophysics (CAASTRO) through project number CE110001020. This work used the DiRAC Data Centric system at Durham University, operated by the Institute for Computational Cosmology on behalf of the STFC DiRAC HPC Facility (www.dirac.ac.uk). This equipment was funded by BIS National E-infrastructure capital grant ST/K00042X/1, STFC capital grants ST/H008519/1 and ST/K00087X/1, STFC DiRAC Operations grant ST/K003267/1 and Durham University. DiRAC is part of the National E-Infrastructure. Part of the analysis was done on the Geryon cluster at the Centre for Astro- Engineering UC. The Anillo ACT-86, FONDEQUIP AIC-57 and QUIMAL 130008 provided funding for several improvements to the Geryon cluster. This work was supported in part by the Science and Technology Facilities Council (grant number ST/F001166/1), European Research Council (grant numbers GA 267291 ‘Cosmiway’) and CONICYT PIA Anillo ACT- 1417.
REFERENCES
APPENDIX A: CONVERGENCE TESTS
As mentioned in Section 2.2.1, our void finding algorithm starts by constructing a rectangular grid and computing the number of particles (or haloes) that fall on each cell of the grid. Only empty cells are considered as prospective void centres. Naturally, decreasing the size of the grid cells (i.e. increasing the grid resolution) increases the number of prospective centres, but spurious void centres also appear more frequently. These spurious centres are eventually removed in further steps of the algorithm, but they increase the computational time of the void identification process. When identifying voids using subhalo tracers, this increase in computational time is negligible, since the number of subhaloes in EAGLE is small and the void finder can be run in a matter of seconds. In this case, we can increase the resolution of the grid a lot and we are still able to run the algorithm in a reasonably short time. When identifying voids in the mass field, however, the computational time for void identification is of the order of 40 000 h of CPU time due to the large number of particles in EAGLE, so we cannot arbitrarily increase the resolution of the grid. We are interested to know to what extent the identified void sample changes depending on the resolution of the grid. The expectation is that above a given resolution, the number of voids increases only due to spurious void centres that are removed after applying steps (ii) and (iii) of our algorithm.
We run a convergence test in Ref-L0025N0376 to address this issue. Since this simulation has the same resolution and subgrid physics as Ref-L0100N1504, these tests are also applicable to void samples in the big simulation. Fig. A1 shows the abundance of voids identified using the mass field in Ref-L0025N0376 as a function of the cell size of the grid used in the first step of the void finder. The solid black, dashed red and dot–dashed blue lines show results for void samples with overlaps of 20, 30 and 40 per cent, respectively. The lower horizontal axis shows the average number of particles that fall in each cell of the rectangular grid, while the upper horizontal axis shows the size of the cells in Mpc. The abundance of voids is not severely affected when reducing the size of the grid cells below ∼ 0.3 Mpc. Moreover, we have explicitly examined the void size distribution for each of these samples, finding that the dispersion in the mean void radius is less than 2 per cent. Taking this into consideration, we choose to construct a grid with a cell resolution of 0.3 Mpc to search for voids in Ref-L0100N1504 using the mass field .

Number of voids traced by mass in Ref-L0100N1504 at z = 0, as a function of the size of the cell used to smooth the density field in the first step of the void finding algorithm (Section 2.2.1). The lower horizontal axis shows the average number of particles that fall in each cell of the rectangular grid, while the upper horizontal axis shows the size of the cells in Mpc. The solid black, dashed red and dot–dashed blue lines show results for void samples with overlaps of 20, 30 and 40 per cent, respectively. These voids achieve convergence in their abundance, and we therefore use a cell size equal to 0.3 Mpc to identify voids traced by mass in Ref-L0100N1504.
APPENDIX B: CLUSTERING MEASUREMENTS
The two-point correlation function (or simply the autocorrelation function) ξ(r) is among the most widely used statistical tools to study the spatial distribution of galaxies in the Universe. It measures the excess probability of finding a pair of points separated by a distance r, compared to a random distribution (Peebles 1980).
In Fig. B1, we show an estimation of the two-point correlation function for the subhalo samples used to identify voids in Ref-L0100N1504 at z = 0.0, calculated using 30 bins of distance between 0 and 20 Mpc. The black solid and blue dashed lines show measurements of the autocorrelation function for subhaloes that were selected by their total or stellar mass, respectively. Error bars show bootstrap errors, computed using 40 000 bootstrap re-samples. Each bootstrap re-sample is constructed by sampling with replacement 40 076 galaxies from the original set. For each of these re-samples, the autocorrelation function is computed, and the error bars are obtained from the standard deviation of the distribution of correlation functions. We observe that subhaloes selected by their stellar mass are more strongly clustered than those selected by their total mass over a wide range of scales. This suggests that the former are more likely to be located in denser environments of the cosmic web such as walls and filaments. In this scenario, voids traced by these subhaloes will be larger than voids traced by a more homogeneous distribution of points, since the density criteria used for the definition of a void will be satisfied at a larger radius, as explained in Section 4.2. This explains the increased size of voids traced by stellar mass-selected subhaloes seen in Fig. 6, as well as a higher peak of the density profile for these voids in Fig. 7 .

Two-point autocorrelation function of subhaloes used to trace voids Ref-L0100N1504 at z = 0. The black-solid and blue-dashed lines show measurements for subhaloes selected by their total or stellar mass, respectively. Stellar mass-selected subhaloes show stronger clustering at all scales considered, which explains the increased size of the voids they trace.
APPENDIX C: EVOLUTION WITH REDSHIFT
In previous sections we have shown that when galaxies are used as tracers for voids, differences in the void abundance and void density profile arise, in part, due to how galaxies of a given mass populate haloes of a given mass and environment. From an observational point of view, it is important to consider that much of the signal from future large-scale surveys, such as LSST (LSST Science Collaboration 2009), EUCLID (Laureijs et al. 2011) or eBOSS (Dawson et al. 2016), will come from galaxies at different redshift ranges. For example, over the time between z ∼ 0.6 and z = 0.0, the relation between halo mass and stellar mass might have changed (e.g. Moster, Naab & White 2013). Moreover, given that future large-scale clustering and weak-lensing studies will heavily rely on galaxies detected near z ∼ 0.5, it is important to check whether our results change at earlier epochs in the simulation.
In Fig. C1, we show the distribution of sizes for voids identified using subhalo tracers in snapshots of Ref-L0100N1504 at z = 0, z = 0.5 and z = 1.0, shown by the solid black, long-dashed magenta and dashed orange lines, respectively. For better clarity, the upper panel only shows the abundance for voids traced by subhaloes selected by their stellar mass, while the lower panels show the ratio between the abundance of these voids and those traced by subhaloes selected by their total mass .

Top panel: cumulative distribution of sizes for voids traced by subhaloes selected by their stellar mass in Ref-L100N1504. Measurements for z = 0, z = 0.5 and z = 1.0 are shown by the solid black, long-dashed magenta and dashed orange lines, respectively. While the total number of voids remains similar in all three cases, their size increases as a function of time as more matter gets accreted into filaments and haloes. Lower panel: abundance ratio between voids traced by subhaloes selected by their stellar/total mass in Ref-L0100N1504. Irrespectively of the redshift considered, voids traced by subhaloes selected by their stellar mass are larger than the other void samples.
We observe that while the total number of voids is fairly similar in all the samples, the voids identified at higher redshift are somewhat smaller, although the associated uncertainties are still large. The interpretation of this difference in void size is not straightforward, as there are many mechanisms that interplay. These voids at higher redshift were identified using the same integrated density threshold Δ = −0.8 as in the previous sections. Whether this threshold should be modified according to the growth of fluctuations is still a subject of debate in the field. Moreover, in Section 6 we showed that different void populations can be identified in EAGLE, some of them being embedded in large-scale environments that are expanding, while others are contracting. These and other factors might affect the distribution of void sizes at different redshifts. A detailed understanding of this evolution is outside the scope of this work, and a larger simulated volume is required to avoid possible biases in the determination of the large-scale environment of the largest voids in the sample.
Focusing on the lower panel of Fig. C1, we observe that selecting subhaloes by their stellar mass produces larger voids than the total mass selection, irrespectively of the redshift. This indicates that the differences observed in the void size distributions in Section 4 are persistent at z = 0.5 and z = 1.0.
Fig. C2 shows the absolute difference in the density profiles between voids traced by stellar mass-selected subhaloes and total mass-selected subhaloes in Ref-L0100N1504. Again we find at the three redshifts considered, voids traced by subhaloes selected by their stellar mass always show density profiles with a higher peak, reflecting a greater abundance of subhaloes around and within the void ridges. A small shift in the radius at which we get the largest differences between the two samples above is obtained with redshift, but the measurements are too noisy to make a conclusive statement about this feature .

Absolute difference in the subhalo density profiles between voids traced by stellar mass-selected subhaloes and total mass-selected subhaloes. Measurements for z = 0, z = 0.5 and z = 1.0 are shown by the solid black, long-dashed magenta and dashed orange lines, respectively. Irrespectively of the redshift considered, voids traced by subhaloes selected by their stellar mass show a greater abundance of tracers around the void ridges.
APPENDIX D: ERROR ESTIMATIONS
In Section 5 we showed measurements of the radial density and velocity profile for voids identified using subhaloes in the EAGLE simulations. The error bars in Figs 7 and 8 show the scatter around the mean density profiles, obtained using a jackknife re-sampling of all the voids used in the stacked (averaged) profiles. Due to the sensitivity of the profiles to the number and the location of the bins, there could be correlated errors among values from different bins, which could result in an underestimation of the true errors of the profiles. Thus, even though two profiles may show differences that are bigger than the error bars, these differences could still not be statistically significant when the correlated errors are taken into account. To explore this in further detail we compute the correlation matrices of all the curves shown in Figs 7 and 8. In Fig. D1, we show the correlation matrix for the subhalo density profile of voids traced by subhaloes selected by their total mass (black, solid line in the left-hand panel of Fig. 7). Correlation matrices of the remaining profiles are very similar and are not shown here. It appears that there are no strong off-diagonal correlations among bins, which means that it is safe to compare the differences in void profiles with the error bars shown in there .

The correlation matrix of the subhalo density profile measured for voids traced by total mass-selected subhaloes in Ref-L0100N1504 (black solid line in Fig. 7. The colourmap shows the Pearson product–moment correlation coefficients for each bin in the density profile. There are not many strong off-diagonal correlations among different bins, which means that we can compare the profiles in Section 5 with the error bars shown in there to get an estimate of the level at which these differences are statistically significant.
Fig. D2 shows the value of X2 when comparing different subhalo density profiles (Fig. 7), but only using the values of the profile up to r/rvoid. Thus, the last value at the right-hand end of the curves show the obtained X2 using all the bins in the calculation. The different lines compare the density profile of voids traced by subhaloes selected by their total mass in Ref-L0100N1504. The blue long-dashed line shows a comparison with voids traced by stellar mass-selected subhaloes, while the red dashed and the green dot–dashed lines show comparisons with voids traced by total mass-selected subhaloes in DM-L0100N1504, and voids found in Ref-L0100N1504 but with their profiles measured in Ref-L0100N1504. The horizontal dashed lines shows values of X2 where the null hypothesis can be rejected with a 70 (bottom) and 90 (top) per cent confidence level. The differences between the profiles of voids traced by total mass-selected subhaloes and those traced by stellar mass-selected subhaloes in Ref-L0100N1504 are statistically significant with a 70 per cent confidence level (blue, long-dashed line). The same holds true for a comparison between voids in Ref-L0100N1504 and DM-L0100N1504 when the same total mass selection cut is used for the tracers (red dashed line). When the same void centres are used to compute the density profile in Ref-L0100N1504 and DM-L0100N1504, very small differences are detected, but these are statistically significant to a 75 per cent confidence level .

The value of the reduced χ2 statistic (D1) comparing the density profiles of Fig. 7, as obtained when considering only the values of the profiles up to r/rvoid. The different lines compare the density profile inferred from subhaloes selected by their total mass with the profile obtained from: blue, long-dashed line: subhaloes selected by their stellar mass in Ref-L0100N1504. Red dashed line: subhaloes selected by their total mass in DM-L0100N1504. Green, dot–dashed line: subhaloes selected by their total mass in Ref-L0100N1504, but having their profiles measured in DM-L0100N1504. The null hypothesis is that each pair of profiles come from the same distribution. The top and bottom constant, dashed lines show values at which the null hypothesis can be rejected with a 90 and 70 per cent level of confidence, respectively.