ABSTRACT

The distribution of ionized hydrogen during the epoch of reionization (EoR) has a complex morphology. We propose to measure the 3D topology of ionized regions using the Betti numbers. These quantify the topology using the number of components, tunnels, and cavities in any given field. Based on the results for a set of reionization simulations we find that the Betti numbers of the ionization field show a characteristic evolution during reionization, with peaks in the different Betti numbers characterizing different stages of the process. The shapes of their evolutionary curves can be fitted with simple analytical functions. We also observe that the evolution of the Betti numbers shows a clear connection with the percolation of the ionized and neutral regions and differs between different reionization scenarios. Through these properties, the Betti numbers provide a more useful description of the topology than the widely studied Euler characteristic or genus. The morphology of the ionization field will be imprinted on the redshifted 21-cm signal from the EoR. We construct mock image cubes using the properties of the low-frequency element of the future Square Kilometre Array and show that we can extract the Betti numbers from such data sets if an observation time of 1000 h is used. Even for a much shorter observation time of 100 h, some topological information can be extracted for the middle and later stages of reionization. We also find that the topological information extracted from the mock 21-cm observations can put constraints on reionization models.

1 INTRODUCTION

The 21-cm signal produced by the hyperfine transition of neutral hydrogen is one of the most promising signals for probing the period from the history of our Universe when it transitioned from a cold and neutral state to a hot and ionized state (e.g. Furlanetto, Oh & Briggs 2006; Morales & Wyithe 2010; Zaroubi 2013). This transition period is known as the epoch of reionization (EoR), and ended around redshift z ≈ 5.5–6 (Keating et al. 2019; Kulkarni et al. 2019; Nasir & D’Aloisio 2020). The 21-cm signal produced during EoR will be redshifted and found at the low frequency end of the radio band. It traces the distribution of neutral hydrogen gas present in the intergalactic medium (IGM). Observing this signal will therefore shed light not only on the progress of reionization but also on a range of other issues such as the formation of first compact structures (e.g. Barkana 2016).

The global signal experiment, Experiment to Detect the Global EoR Signature (EDGES; Bowman & Rogers 2010), has claimed a detection of the sky averaged 21-cm signal from z ≈ 17 (Bowman et al. 2018). This detection is debated as its nature questions our current understanding of the early Universe (e.g. Hills et al. 2018; Singh & Subrahmanyan 2019). See Tashiro, Kadota & Silk (2014), Barkana (2018), Fialkov, Barkana & Cohen (2018), Muñoz & Loeb (2018), Feng & Holder (2018), and Fialkov & Barkana (2019) for few possible theories that can explain the Bowman et al. (2018) observation. Other global experiments, such as the Large-aperture Experiment to Detect the Dark Age (LEDA; Price et al. 2018) the Dark Ages Radio Explorer (DARE; Burns et al. 2017) and SARAS2 (Singh et al. 2017) are also trying to detect the global 21-cm signal. As we wait for an independent experiment to confirm the Bowman et al. (2018) observation, we will assume the standard cosmology in this study.

There also exists numerous efforts to detect the fluctuations in the signal using interferometric radio telescopes, such as the Low Frequency Array (LOFAR; e.g. van Haarlem et al. 2013), the Murchison Widefield Array (MWA; e.g. Wayth et al. 2018), and Hydrogen Epoch of Reionization Array (HERA; e.g. Deboer et al. 2017). Even though the fluctuations have not been detected, upper limits have been placed on the 21-cm power spectrum (e.g. Mertens et al. 2020; Trott et al. 2020) that have started to put constraints on the reionization process (e.g. Ghara et al. 2020, 2021; Greig et al. 2020a, b; Mondal et al. 2020). In the near future, the low frequency element of the Square Kilometre Array (SKA-Low; e.g. Mellema et al. 2013; Koopmans et al. 2015) will start observing the 21-cm signal produced during the EoR. SKA-Low is expected to be sensitive enough to not only detect the 21-cm power spectrum but also to produce images (e.g. Mellema et al. 2015; Giri 2019).

The 21-cm signal produced during EoR is highly non-Gaussian and therefore cannot be completely characterized by the 21-cm power spectrum (e.g. Majumdar et al. 2018). Therefore, we need to explore other summary statistics that are sensitive to the non-Gaussian information contained in the signal. Examples of such summary statistics that have been studied previously are the bispectrum (Majumdar et al. 2018; Watkinson et al. 2019), the position-dependent power spectrum (Giri et al. 2019b), size distributions (Kakiichi et al. 2017; Giri et al. 2018a, 2019a; Busch et al. 2020), and fractal dimensions (Bandyopadhyay, Choudhury & Seshadri 2017).

Yet another probe of the non-Gaussian information in the 21-cm signal is a summary statistics that describes the topology of the signal. This topology is quite sensitive to the properties of ioninzing sources (e.g. Hutter et al. 2021). Already Gnedin (2000) used the topology of the H ii (or ionized) regions to heuristically describe the EoR to consist of three stages, namely the preoverlapoverlap, and post-overlap stage. Subsequent studies quantified the topology using the genus g or its equivalent the Euler characteristic χ (χ = 2 − 2g) of the neutral hydrogen distribution (e.g. Gleser et al. 2006; Lee et al. 2008) or the ionization fraction field (Friedrich et al. 2011). These works could build on the extensive literature about the use of χ (or g) on the large-scale structure (e.g. Gott, Dickinson & Melott 1986; Gott et al. 1992; Matsubara 1994; Matsubara & Yokoyama 1996; Schmalzing & Buchert 1997; Park, Kim & Gott 2005).

The Euler characteristic is also equivalent to the third of the Minkowski functionals V3 and some works have considered the full set of these topological quantities (V0V3; Friedrich et al. 2011; Yoshiura et al. 2017; Chen et al. 2019). Based on the evolution of the Minkowski functionals the latter study proposed a five stage description of the reionization process.

The Euler characteristic (χ) for a 3D data set can be written as
(1)
where the terms on the right-hand side are the number of isolated objects, tunnels, and cavities. These terms are also known as the Betti numbers. In this paper we explore the information contained in the Betti numbers of the distribution of ionized regions during reionization. It is obvious that they contain more information compared to χ (e.g. van de Weygaert et al. 2011; Chingangbam et al. 2012; Park et al. 2013). Specifically, they allow us to study the hierarchical aspect of the topological features of ionized bubbles as reionization proceeds. We consider how the Betti numbers evolve during the different stages of reionization, how they can describe the state of the IGM, and whether the additional information they contain makes them more useful than the more commonly studied Euler characteristic/genus.

During reionization the distribution of ionized bubbles will imprint its topology on the observable 21-cm signal. Therefore, we will also discuss the prospects of measuring the Betti numbers of 3D structures in the upcoming 21-cm observations with SKA-Low, considering both resolution and noise effects. As far as we know there exists very few work that considered the Betti numbers in the context of reionization. Elbers & Van De Weygaert (2019) studied the Betti numbers in heuristic models of reionization that they called the H ii bubble network. Kapahtia et al. (2018, 2021) and Kapahtia, Chingangbam & Appleby (2019) only studied the Betti numbers for 2D 21-cm images. However, reionization is a 3D process and SKA-Low will produce 3D data cubes consisting of images at a range of frequencies. We will therefore study the three Betti numbers associated with the 3D topology.

There exists even more metrics to study the topology, such as the persistence homology (e.g. Zomorodian & Carlsson 2005; Edelsbrunner & Harer 2008) that is gaining attention in cosmology (e.g. van de Weygaert et al. 2010, 2011; Sousbie 2011; Sousbie, Pichon & Kawahara 2011; Pranav et al. 2017; Makarenko et al. 2018; Wilding et al. 2020). In persistence homology, the evolution of each κ-dimensional holes in the analysed field is tracked. Elbers & Van De Weygaert (2019) have used persistence homology to study the H ii bubble network. We defer the exploration of persistence homology of 21-cm data sets to future studies.

The paper is structured as follows. In the next section, we describe the topological concepts used throughout the paper. Section 3 explains explains our large-scale reionization simulations and the methods used to construct the 21-cm signal from simulation outputs. In Sections 4 and 5, we discuss the properties and evolution of the Betti numbers of the ionization and 21-cm signal fields, respectively. We discuss the detectability of the Betti numbers in 21-cm observations of upcoming radio telescopes in Section 6. In the final section, we summarize our findings.

2 TOPOLOGICAL MEASURES

In this section, we describe the methods we use to measure the topology of our data. In Section 2.1, we explain the metric we will use to quantify the topology of any field. The fields we analyse in this study are given in form of digital data. Therefore, we define an estimator for measuring the topology of digital data in Section 2.2.

2.1 Betti numbers

In this study, we quantify the topology of structures in our field with the Betti numbers (βκ) that are topological invariant quantities (Betti 1870). This means they remain unchanged under transformations, such as translation, rotation, and deformation. βκ is defined as the number of κ-dimensional holes in the structure we analyse. Our data sets are 3D and therefore their topology can be defined with three Betti numbers where each have a simple and intuitive meaning. The zeroth Betti number (β0), first Betti number (β1), and second Betti number (β2) probe the number of isolated objects, tunnels, and cavities, respectively.

In its simplest form, the reionization process can be pictured as a network formed by bubbles of ionized hydrogen gas. In Fig. 1 we show examples of structures formed by spherical bubbles. The left-hand and right-hand panels show projections of 3D data sets containing one and five spherical bubbles, respectively. As they both consist of a single connected structure, both data sets have β0 = 1. The data set in the left-hand panel has no tunnel, hence its β1 = 0. The five bubbles in the right-hand panel form a torus, which contains a tunnel, giving β1 = 1. The projection does not reveal the inner structure of the spheres. If we assume that the single sphere in the left-hand panel has a hollow centre, then this single cavity will give β2 = 1. If we assume that the spheres in the right-hand panel are all filled, this gives β2 = 0.

Projection of two different configurations of spheres where the one of the left is hollow. These configurations have Betti numbers βκ = (1, 0, 1) (left) and βκ = (1, 1, 0) (right), giving Euler characteristics χ = 2 and χ = 0, respectively.
Figure 1.

Projection of two different configurations of spheres where the one of the left is hollow. These configurations have Betti numbers βκ = (1, 0, 1) (left) and βκ = (1, 1, 0) (right), giving Euler characteristics χ = 2 and χ = 0, respectively.

In previous works, the Euler characteristics (χ) that describes the integral of the Gaussian curvature over the surface of the structure has been used to study the topology of reionization (e.g. Friedrich et al. 2011; Yoshiura et al. 2017). χ is related to the Betti numbers as (e.g. Pranav et al. 2019),
(2)
(also see equation 1). As the Betti numbers are components of χ, two data sets can have the same χ but different sets of Betti numbers. For example, the data set shown in the right-hand panel of Fig. 1 has βκ = {1, 1, 0} so χ = 0, which is the same as for the case when there is no structure. Such a situation is quite plausible in the bubble network produced by reionization and therefore Betti numbers should be better than χ for distinguishing the topologies of different reionization models. We refer the readers to Hatcher (2002) and Edelsbrunner & Harer (2008, 2010) for more mathematically rigorous descriptions of these topological quantities.

2.2 Estimators for digital data

All the fields that we work with in this study are digital data, which is a discrete, discontinuous representation of a given field. In order to capture the topology of the structures in our field, we need to decompose the structures into well-defined geometrical components in such a way that the topological properties of the structure are retained. We achieve this by constructing a cubical complex (e.g. Kaczynski, Mischaikow & Mrozek 2004; Wagner, Chen & Vuçini 2012), which is composed of points, line segments, squares, and cubes, for any given field. We refer the readers to Wagner et al. (2012) and Giri et al. (2019a) for more information about our method to construct and store our cubical complex.

2.2.1 Betti numbers

We follow the method described in Gonzalez-Lorenzo et al. (2016) to estimate the Betti numbers. Each isolated object is a group of connected pixels in our data set. We use the connected-component labelling module in scikit-image package1 (van der Walt et al. 2014). We refer inquisitive readers to Fiorio & Gustedt (1996) and Wu, Otoo & Shoshani (2005) for the algorithm used to label all the connected regions. This method needs less computation time compared to the classical Hoshen–Kopelman algorithm (Hoshen & Kopelman 1976). It is similar to the friends-of-friend algorithm used to find clusters in N-body simulations (e.g. Press & Davis 1982; Davis et al. 1985). The number of regions labelled gives the value of β0.

From the Alexander duality of any topological complex of three dimensions, we can find the β2 by estimating the β0 of the complementary field of the data set (e.g. Hatcher 2002; Gonzalez-Lorenzo et al. 2016). The remaining first Betti number β1 is not calculated directly, but derived from the Euler characteristics (χ) of the data set (as described in Section 2.2.2) according to β1 = β0 + β2 − χ. This method of calculating the β1 was introduced by Gonzalez-Lorenzo et al. (2016). We have tested our estimator on a Gaussian random field (GRF). The results are presented in Appendix  A.

2.2.2 Euler characteristics

Once we have constructed the cubical complex for any given digital data, we can estimate the Euler characteristic χ as
(3)
where nd is the dimension of the data set. κi is the number of cubes of dimension i in the constructed cubical complex. In our data set, dimensions of points, line segments, squares, and cubes are 0, 1, 2, and 3, respectively. See appendix  A of Giri et al. (2019a) for more details.

3 SIMULATION OF 21 CM SIGNAL

To test the usefulness of the Betti numbers we make use of simulated data. This section describes our methodology for simulating reionization and the 21-cm signal. Section 3.1 describes the large-scale reionization simulations we use and Section 3.2 outlines how we calculate the observable 21-cm signal from them. Finally we explain how we include telescope effects, such as the limited angular resolution and telescope noise in Section 3.3.

3.1 Reionization simulations

We use two different types of reionization simulations for our study, fully numerical radiative transfer simulations, and so-called semi-numerical simulations that rely on sophisticated form of photon counting.

3.1.1 Fully numerical simulations

The fully numerical simulations use two steps. We first simulate the evolution of the matter distribution using the N-body code CUBEP3M (Harnois-Déraps et al. 2013). The N-body simulation is run in a cosmological volume of 714 comoving Mpc (cMpc) with 69123 particles. We use such a large cosmological volume as they are important to capture the large-scale fluctuations (Giri et al. 2019b; Ghara et al. 2020) and the large-scale topological structure (Giri et al. 2019a) and are also a match to the Field of View of SKA-Low. See Iliev et al. (2014) for a discussion about the importance of the size of cosmological volumes for reionization simulations. Our N-body simulation run outputs snapshots at each 11.5 Myr. We obtain the density field by assigning mass to a cubic grid with 300 cells along each direction by smoothing with a smooth-particle-hydrodynamics kernel (Monaghan 1992; Iliev et al. 2014). The cosmological parameters used here are Ωm = 0.27, Ωk = 0, Ωb = 0.044, h = 0.7, ns = 0.96, and σ8 = 0.8. These values are consistent with the results from Wilkinson Microwave Anisotropy Probe (WMAP; Komatsu et al. 2011) and Planck (Planck Collaboration 2016, 2018) collaborations.

The reionization process is driven by sources of ionizing photons that form due to the collapse of baryonic matter in dark matter haloes. We determine the position and mass of dark matter haloes using the spherical overdensity halo-finder discussed in Watson et al. (2013). We identify haloes with masses |$M_\mathrm{halo} \ge 10^9 \, \mathrm{M}_\odot$|⁠. In these haloes, gas can cool through excitation of atomic hydrogen and they are relatively unaffected by radiative feedback (Sullivan, Iliev & Dixon 2018). We follow the convention in Dixon et al. (2016) and call these haloes high-mass atomic-cooling haloes (HMACHs). We model the sources by assigning the ionizing photon production rate, |$\dot{N}_\gamma$|⁠, to be proportional to the total mass in haloes within that cell, M. |$\dot{N}_\gamma$| is given as
(4)
where ζ, mp, and fcoll are the source efficiency, mass of proton, and total collapsed mass fraction in HMACHs, respectively.

To simulate the state of gas in the intergalactic medium, we next post-process the snapshots from CUBEP3M with the fully numerical radiative transfer code C2-RAY (Mellema et al. 2006a). C2RAY employs the short characteristics ray tracing method to solve the radiative transfer equation (Raga et al. 1999; Lim & Mellema 2003). The ray-tracing is performed up to a comoving distance of 71 Mpc from each source. This limit is meant to approximate the effect of the presence of optically thick absorbers in the IGM that are unresolved in our N-body simulation. This value is consistent with the measurement of the mean free path by Songaila & Cowie (2010). For more details about the simulations we refer the reader to Iliev et al. (2006), Mellema et al. (2006b), and Dixon et al. (2016).

In this work, we have run two scenarios (FN1 and FN2) using the methods described above. FN1 and FN2 use ζ values of 50 and 40, respectively. We show the history of these reionization scenarios in Fig. 2. As the sources are less efficient in FN2 compared to the FN1 scenario, reionization is delayed in the latter. The values of the source parameters are listed in Table 1. The Thompson scattering optical depth τ calculated from these simulations are within the 95 per cent credible interval of the constraints from Planck Collaboration (2018). The estimated τ are also given in Table 1.

The volume weighted average ionization fraction of our reionization simulations varying over the redshifts.
Figure 2.

The volume weighted average ionization fraction of our reionization simulations varying over the redshifts.

Table 1.

Reionization simulation parameters of various source models. We give the ionizing source parameters that are ionizing efficiency (ζ), minimum mass (Mmin), and mean free path of ionizing photons (Rmfp). The Thompson scattering optical depth (τ) derived from each simulations are also given here.

LabelCodeBox size (Mpc)MeshζMmin(M)Rmfp (Mpc)τ
FN1CUBEP3M + C2RAY714300350109710.058
FN2CUBEP3M + C2RAY714300340109710.050
SN121cmfast714300350109710.064
SN221cmfast714300350108710.084
SN321cmfast714300350109200.062
LabelCodeBox size (Mpc)MeshζMmin(M)Rmfp (Mpc)τ
FN1CUBEP3M + C2RAY714300350109710.058
FN2CUBEP3M + C2RAY714300340109710.050
SN121cmfast714300350109710.064
SN221cmfast714300350108710.084
SN321cmfast714300350109200.062
Table 1.

Reionization simulation parameters of various source models. We give the ionizing source parameters that are ionizing efficiency (ζ), minimum mass (Mmin), and mean free path of ionizing photons (Rmfp). The Thompson scattering optical depth (τ) derived from each simulations are also given here.

LabelCodeBox size (Mpc)MeshζMmin(M)Rmfp (Mpc)τ
FN1CUBEP3M + C2RAY714300350109710.058
FN2CUBEP3M + C2RAY714300340109710.050
SN121cmfast714300350109710.064
SN221cmfast714300350108710.084
SN321cmfast714300350109200.062
LabelCodeBox size (Mpc)MeshζMmin(M)Rmfp (Mpc)τ
FN1CUBEP3M + C2RAY714300350109710.058
FN2CUBEP3M + C2RAY714300340109710.050
SN121cmfast714300350109710.064
SN221cmfast714300350108710.084
SN321cmfast714300350109200.062

3.1.2 Semi-numerical simulations

The fully numerical simulations are computationally expensive. Therefore, to study the impact of source parameters on the topology measured with the Betti numbers, we use the fast semi-numerical code 21cmfast (Mesinger, Furlanetto & Cen 2011). In this code the density field is constructed by evolving the initial density field using second-order Lagrangian perturbation theory (2LPT; Scoccimarro 1998). 21cmfast builds the ionization field using the excursion-set approach developed in Furlanetto, Zaldarriaga & Hernquist (2004). A cell |$\pmb {x}$| in the simulation volume is labeled as ionized when it satisfies the following condition,
(5)
where |$f_\mathrm{coll}(\pmb {x}, R_\mathrm{s}, M_\mathrm{min})$| is the fraction of matter inside a sphere of radius Rs that has collapsed into haloes of mass larger than Mmin (e.g. Bond et al. 1991; Sheth & Tormen 1999). ζ is again the ionizing efficiency describing the amount of ionizing photons produced by the collapsed mass. This efficiency factor is in principle the same as the one defined in equation (4).

We have run three semi-numerical reionization scenarios (SN1, SN2, and SN3) by varying the source and sink parameters in 21cmfast. The values of the parameters are given in Table 1. We also show the reionzation history of these models in Fig. 2. Even though scenario SN1 has the same parameters as FN1, their reionization histories are not identical. This is because SN1 does not explicitly include the photon losses due to recombinations whereas FN1 does. In 21cmfast these losses are assumed to be accounted for in the value of ζ [see for e.g. equation (2) in Greig & Mesinger (2015)].

3.2 Redshifted 21 cm signal

When the 21-cm signal is observed with an interferometry-based radio telescope, the recorded observable is known as the differential brightness temperature. The observed differential brightness temperature δTb of the 21-cm signal, which is given as (e.g. Mellema et al. 2013)
(6)
where xHI and δ are the fraction of neutral hydrogen and the density fluctuation, respectively. Ts is the excitation temperature of the two spin states of the neutral hydrogen, known as the spin temperature. TCMB(z) is the cosmic microwave background (CMB) temperature at redshift z and Yp is the primordial helium abundance.

Here we will work with the assumption of high-spin temperature TsTCMB. During the EoR this is a reasonable assumption as even relatively low levels of X-ray radiation can raise the gas temperature above the CMB temperature (e.g. Pritchard & Furlanetto 2007; Pritchard & Loeb 2012). In the high-spin temperature limit, δTb is independent of Ts.

We construct the 21-cm signal at a given redshift z from equation (6) using the corresponding 3D neutral fraction and matter density cubes from the C2-RAY and CUBEP3M simulations. Each of these 3D data sets represents the characteristics of the signal at the corresponding redshift z and therefore called coeval cubes (CCs). We transform these CCs into image cubes by assuming one of the axes to be the frequency axis and the other two position in the sky. This procedure implies that we neglect the two line-of-sight effects: redshift space distortions (RSD; see e.g. Jensen et al. 2013) and the light-cone (LC) effect (see e.g. Datta et al. 2012) that would introduce minor distortions along the line of sight. We use our publicly available analysis code, tools21cm2 (Giri, Mellema & Jensen 2020a), to simulate the redshifted 21-cm signal cubes.

3.3 Mock SKA-Low observation

In this section, we explain our methods for constructing mock observation with SKA-Low.

3.3.1 Limited resolution

As our simulated 21-cm signal cubes are represented as digital data, they have an intrinsic resolution below which we do not have any information. The simulation resolution of our data set is Δx ≈ 2.4 cMpc. This simulation resolution corresponds to different angular scales on the sky at different z, which can be given as
(7)
where Dc(z) is the comoving distance to redshift z. Similarly the frequency scales also vary over redshifts, which can be given as
(8)
where ν0 is the rest frequency of the 21-cm line, H(z) the Hubble parameter at redshift z and c the speed of light. For z = 8, these equations give values Δθ = 0.881 arcmin and Δν = 0.133 MHz.

Apart from the simulation resolution, we will also be limited by the resolution of the telescope. As SKA-Low is an interferometry based radio telescope, it observes the sky in so-called uv space that corresponds to the Fourier transform of the sky. We refer the readers to Rohlfs & Wilson (2013) for details about radio telescope observations.

A radio interometric telescope comprises of radio antennae placed at different locations. Each antennae pair forms a baseline and it observes the signal at an angular scale proportional to the length of the baseline. For observing the 21-cm signal this angular scale can be written as
(9)
where B is the length of the baseline in cm. The first generation of SKA-Low will be sparsely filled with antennae outside a diameter of 2 km around the centre (e.g. Mellema et al. 2013). Therefore, a 21-cm data set produced with baselines longer than 2 km will be quite noisy. In this work we will approximate the synthesized beam of SKA-Low by a Gaussian with a full width half-maximum (FWHM) given by equation (9) with B = 2 km. For z = 8 this gives Δθtelescope ≈ 3 arcmin.

An interferometric radio telescope also has a smallest baseline that sets the largest scales that it can image. To implement this effect, we subtract its average value from each slice of the 21-cm signal data set along the frequency (or redshift) axis. This effect means that interferometric images lack absolute calibration of the signal. Finally, to attain isotropic resolution in our data set we convolve the signal along the frequency axis with a tophat filter of the same physical width as the Gaussian beam. For z = 8 this gives Δν ≈ 0.5 MHz.

3.3.2 System noise

The 21-cm observations with SKA-Low will also be prone to instrumental noise. In order to simulate this noise, we follow the approach in Ghara et al. (2017) and Giri, Mellema & Ghara (2018b). We model the effects using the currently available configuration3 of the first phase of SKA-Low that has 512 antennae stations each with a diameter of 35 m. We present the telescope parameters that we use in this study in Table 2.

Table 2.

The parameters used in this study to model the telescope properties.

ParametersValues
System temperature (Tsys)|$60 (\frac{\nu }{300\mathrm{MHz}})^{-2.55}$| K
Effective collecting area (AD)962 m2
Critical frequency (νc)110 MHz
Declination−30°
Observation hour per day6 h
Signal integration time10 s
ParametersValues
System temperature (Tsys)|$60 (\frac{\nu }{300\mathrm{MHz}})^{-2.55}$| K
Effective collecting area (AD)962 m2
Critical frequency (νc)110 MHz
Declination−30°
Observation hour per day6 h
Signal integration time10 s
Table 2.

The parameters used in this study to model the telescope properties.

ParametersValues
System temperature (Tsys)|$60 (\frac{\nu }{300\mathrm{MHz}})^{-2.55}$| K
Effective collecting area (AD)962 m2
Critical frequency (νc)110 MHz
Declination−30°
Observation hour per day6 h
Signal integration time10 s
ParametersValues
System temperature (Tsys)|$60 (\frac{\nu }{300\mathrm{MHz}})^{-2.55}$| K
Effective collecting area (AD)962 m2
Critical frequency (νc)110 MHz
Declination−30°
Observation hour per day6 h
Signal integration time10 s

We explore two observation time (tobs). One is an optimistic case of tobs  = 1000 h and other is a pessimistic case of tobs  = 100 h. At simulation resolution, the system noise at 1000 and 100 h will be very high. As the this noise is uncorrelated while the signal is correlated, the signal-to-noise ratio (SNR) will go up when the resolution is decreased. As discussed earlier, SKA-Low will have a limited resolution defined by the maximum baseline. Previous studies have shown that we can get interesting image data with 1000 h observation at a resolution corresponding to B = 2 km (e.g. Mellema et al. 2015; Ghara et al. 2017; Giri et al. 2018b, 2019a). However the SNR will be much lower for a 100 h observation at the same resolution. To achieve a similar SNR, we reduce the resolution by a factor of 2 for the 100 h mock observations.

We would like to note that the noise model we use assumes perfect calibration. In reality it is not always possible to reach this theoretical noise level. Therefore, to achieve the same SNR values as in our example with the real SKA-Low might require longer observation times or use of a lower resolution.

4 BETTI NUMBERS OF IONIZATION FIELD

The formation, merger, and evolution of ionized bubbles during reionization results in a complex morphology of H ii regions. In this section we investigate the Betti numbers estimated from the ionization field in our simulations.

4.1 Evolution during reionization

Fig. 3 shows the evolution of Betti numbers estimated from the ionization field of our FN1 simulations by defining structures using a threshold of |$x_\mathrm{H\, \rm {ii}}=0.5$|⁠. These Betti numbers trace the evolution topology of H ii regions during reionization. Here and in the remainder of this paper we plot the evolution of the Betti numbers against the fraction of our simulation volume that has been ionized, rather than time or redshift. Using the ionized volume fraction |$x_\mathrm{H\, \rm {ii}}^\mathrm{V}$| connects the evolution of the Betti numbers to the different stages of reionization and also makes it easier to compare the results of different reionization scenarios in which reionization may have evolved differently.

Evolution of the Betti numbers estimated from the $x_\mathrm{H\, \rm {ii}}$ field of the FN1 simulation. The black solid lines are the fits to the curves. See text for a description of these fits and Table 3 for the fit parameters.
Figure 3.

Evolution of the Betti numbers estimated from the |$x_\mathrm{H\, \rm {ii}}$| field of the FN1 simulation. The black solid lines are the fits to the curves. See text for a description of these fits and Table 3 for the fit parameters.

When reionization starts we see an increase in β0. This increase in β0 is due to the appearance of ionized bubbles around the sources emitting ionizing photons. As time passes, more sources form and ionize the hydrogen gas in the IGM around them. In the initial stages of reionization we not only see appearance of new ionized bubbles but also overlap of bubbles. A comparison between the value of β0 and the number of sources shows that such overlap occurs right from the start of reionization. The total number of bubbles will decrease when multiple ionized bubbles overlap. When the rate by which H ii regions start to overlap becomes larger than the formation rate of new isolated ionized bubbles, the value of β0 will start to decrease. In our model, we observe this turn-around at |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\approx 0.2$|⁠. The value of β0 keeps decreasing as reionization proceeds and converges to unity at around |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\approx 0.7$|⁠.

At |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\approx 0.1$|⁠, β1 starts to increase. β1 measures the number of tunnels and these tunnels can be formed by overlapping bubbles (see Section 2.1 for a simple example). Therefore, the growth of β1 is a sign of substantial bubble overlap modulating the topology of the ionization field. As reionization progresses, the number of tunnels increase until it reaches a maximum value beyond which β1 starts to decrease. In the FN1 simulation this peak is reached around |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\approx 0.55$|⁠. Unlike β0, the value of β1 remains much larger than 1 until the end of reionization.

Cavities inside H ii regions start forming during the middle stages of reionization, as indicated by the slow rise of β2, starting around |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\approx 0.2$|⁠. We can visualize these cavities as isolated neutral regions. As reionization progresses initially large connected neutral regions breakup and form multiple isolated neutral regions. The value of β2 attains a maximum around |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\approx 0.9$| after which it falls to zero once reionization completes. In any inside-out reionization scenario such as FN1, these later isolated neutral regions trace the cosmic voids. Giri et al. (2019a) noted that the number of isolated neutral regions in the later stages of reionization are relatively rare compared to the number of isolated ionized regions at the start of reionization. This is confirmed by comparing the maximum values of the β0 and β2 curves, which shows that the latter is about 3.5 times lower.

From the three curves we obtain three clear maxima, as well as the crossing points between β0 and β1 as well as between β1 and β2. As β1 enters the calculation of the Euler characteristic χ with a different sign than β0 and β2, the above evolutionary patterns cannot be distinguished when considering χ. It is for example impossible to separate the early rise of the number of tunnels and the decrease of the number of isolated regions due to overlap. At |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\approx 0.35$| β0 ≈ β1. However the small positive contribution from β2 at that time will push the location of χ = 0 to a slightly later stage.

4.2 Connection between Betti numbers and percolation

These crossing points of Betti curves appear to be connected to the formation of the percolation clusters. Furlanetto & Oh (2016) showed that a large network of overlapping ionized bubbles spanning the entire simulation volume forms during the early stages of reionization. This network is known as the percolation cluster. A similar percolation event4 takes place towards the end of reionization when the last network of neutral regions spanning the entire volume disappears. As pointed out by Furlanetto & Oh (2016) this type of percolation behaviour is a hallmark of phase transitions.

Fig. 4 shows the percolation curves, i.e. the volume fraction of H ii regions contained in the percolation cluster, for all our simulations. Here we focus on the percolation curve of the FN1 scenario; the percolation curves of other scenarios are discussed in Section 4.5. We see that the appearance of the percolation cluster coincides with the time when β1 surpasses β0. Likewise we find that the disappearance of the neutral percolation cluster coincides with the time when β1 ≈ β2. It was recently proposed by Santos et al. (2019) and Bobrowski & Skraba (2020) that such a connection between Betti numbers and percolation may be universal.

Percolation curves for the ionized percolation cluster. The number on the ordinate, V∞/Vtot, is the fraction of the total ionized volume contained in the percolation cluster. The different curves show the evolution of this quantity for our five different reionization scenarios.
Figure 4.

Percolation curves for the ionized percolation cluster. The number on the ordinate, V/Vtot, is the fraction of the total ionized volume contained in the percolation cluster. The different curves show the evolution of this quantity for our five different reionization scenarios.

4.3 Parametrization of the evolution of Betti numbers

The evolution curves for each of the Betti numbers for scenario FN1 have strikingly regular shapes. After inspection of the other scenarios we find that the shapes appear to be independent of the source model driving the reionization process. We will discuss these additional reionization scenarios in Section 4.5. The curves for β0 and β2 seem to match a lognormal shape, whereas β1 appears to have a Gaussian shape. We therefore propose the following parametric forms for each Betti number,
(10)
(11)
(12)
where Ai, μi, and σi are the fitting parameters. Ai models the height of the curves. μi and σi define the peak position and width of the curves. We show the fits to the Betti numbers curve with black solid lines in Fig. 3. The fit parameters are listed in Table 3. Here and in the rest of this paper we use 10-fold cross validation to estimate the goodness of the fit (e.g. Hastie, Tibshirani & Friedman 2009; Giri et al. 2020b).
Table 3.

The values of parameters when we model the Betti numbers using equations (10), (11), and (12). The fit parameters for |$x_\mathrm{H\, \rm {ii}}$| are Betti number curves estimated from the field at simulation resolution. The fit parameters for δTb are Betti number curves estimated from the mock SKA-Low 21-cm images. We give the fit parameters corresponding to mock observations at tobs = 1000 h and B = 2 km. The fit parameters given in brackets correspond to mock observations at tobs = 100 h and B = 1 km.

Fit|$\pmb {x_\mathrm{H\, \rm {ii}}}$||$\pmb {\delta T_\mathrm{b}}$|
ModelsParametersβ0β1β2β0β1β2
FN1Ai1409188641550 (–)512 (443)92 (80)
μi0.250.550.100.27 (–)0.61 (0.61)0.04 (0.03)
σi0.710.150.430.55 (–)0.16 (0.18)0.33 (0.34)
FN2Ai14432092476156 (–)562 (500)90 (80)
μi0.270.550.110.29 (–)0.63 (0.62)0.04 (0.03)
σi0.750.140.460.72 (–)0.15 (0.18)0.38 (0.36)
SN1Ai11951652112161 (–)470 (371)43 (45)
μi0.210.560.060.24 (–)0.61 (0.63)0.03 (0.05)
σi0.630.160.370.69 (–)0.16 (0.15)0.26 (0.45)
SN2Ai1351216822366 (–)461 (342)43 (25)
μi0.250.560.080.31 (–)0.60 (0.64)0.02 (0.05)
σi0.690.150.410.64 (–)0.19 (0.16)0.19 (0.25)
SN3Ai12062048800138 (–)689 (688)130 (200)
μi0.210.550.090.24 (–)0.61 (0.67)0.03 (0.05)
σi0.640.150.480.68 (–)0.16 (0.11)0.19 (0.45)
Fit|$\pmb {x_\mathrm{H\, \rm {ii}}}$||$\pmb {\delta T_\mathrm{b}}$|
ModelsParametersβ0β1β2β0β1β2
FN1Ai1409188641550 (–)512 (443)92 (80)
μi0.250.550.100.27 (–)0.61 (0.61)0.04 (0.03)
σi0.710.150.430.55 (–)0.16 (0.18)0.33 (0.34)
FN2Ai14432092476156 (–)562 (500)90 (80)
μi0.270.550.110.29 (–)0.63 (0.62)0.04 (0.03)
σi0.750.140.460.72 (–)0.15 (0.18)0.38 (0.36)
SN1Ai11951652112161 (–)470 (371)43 (45)
μi0.210.560.060.24 (–)0.61 (0.63)0.03 (0.05)
σi0.630.160.370.69 (–)0.16 (0.15)0.26 (0.45)
SN2Ai1351216822366 (–)461 (342)43 (25)
μi0.250.560.080.31 (–)0.60 (0.64)0.02 (0.05)
σi0.690.150.410.64 (–)0.19 (0.16)0.19 (0.25)
SN3Ai12062048800138 (–)689 (688)130 (200)
μi0.210.550.090.24 (–)0.61 (0.67)0.03 (0.05)
σi0.640.150.480.68 (–)0.16 (0.11)0.19 (0.45)
Table 3.

The values of parameters when we model the Betti numbers using equations (10), (11), and (12). The fit parameters for |$x_\mathrm{H\, \rm {ii}}$| are Betti number curves estimated from the field at simulation resolution. The fit parameters for δTb are Betti number curves estimated from the mock SKA-Low 21-cm images. We give the fit parameters corresponding to mock observations at tobs = 1000 h and B = 2 km. The fit parameters given in brackets correspond to mock observations at tobs = 100 h and B = 1 km.

Fit|$\pmb {x_\mathrm{H\, \rm {ii}}}$||$\pmb {\delta T_\mathrm{b}}$|
ModelsParametersβ0β1β2β0β1β2
FN1Ai1409188641550 (–)512 (443)92 (80)
μi0.250.550.100.27 (–)0.61 (0.61)0.04 (0.03)
σi0.710.150.430.55 (–)0.16 (0.18)0.33 (0.34)
FN2Ai14432092476156 (–)562 (500)90 (80)
μi0.270.550.110.29 (–)0.63 (0.62)0.04 (0.03)
σi0.750.140.460.72 (–)0.15 (0.18)0.38 (0.36)
SN1Ai11951652112161 (–)470 (371)43 (45)
μi0.210.560.060.24 (–)0.61 (0.63)0.03 (0.05)
σi0.630.160.370.69 (–)0.16 (0.15)0.26 (0.45)
SN2Ai1351216822366 (–)461 (342)43 (25)
μi0.250.560.080.31 (–)0.60 (0.64)0.02 (0.05)
σi0.690.150.410.64 (–)0.19 (0.16)0.19 (0.25)
SN3Ai12062048800138 (–)689 (688)130 (200)
μi0.210.550.090.24 (–)0.61 (0.67)0.03 (0.05)
σi0.640.150.480.68 (–)0.16 (0.11)0.19 (0.45)
Fit|$\pmb {x_\mathrm{H\, \rm {ii}}}$||$\pmb {\delta T_\mathrm{b}}$|
ModelsParametersβ0β1β2β0β1β2
FN1Ai1409188641550 (–)512 (443)92 (80)
μi0.250.550.100.27 (–)0.61 (0.61)0.04 (0.03)
σi0.710.150.430.55 (–)0.16 (0.18)0.33 (0.34)
FN2Ai14432092476156 (–)562 (500)90 (80)
μi0.270.550.110.29 (–)0.63 (0.62)0.04 (0.03)
σi0.750.140.460.72 (–)0.15 (0.18)0.38 (0.36)
SN1Ai11951652112161 (–)470 (371)43 (45)
μi0.210.560.060.24 (–)0.61 (0.63)0.03 (0.05)
σi0.630.160.370.69 (–)0.16 (0.15)0.26 (0.45)
SN2Ai1351216822366 (–)461 (342)43 (25)
μi0.250.560.080.31 (–)0.60 (0.64)0.02 (0.05)
σi0.690.150.410.64 (–)0.19 (0.16)0.19 (0.25)
SN3Ai12062048800138 (–)689 (688)130 (200)
μi0.210.550.090.24 (–)0.61 (0.67)0.03 (0.05)
σi0.640.150.480.68 (–)0.16 (0.11)0.19 (0.45)

The fact that simple functional forms can be used to describe the evolution of the Betti numbers for reionization is another advantage over the Euler characteristic for which no simple fit appears to exist.

4.4 Dependence on simulation resolution

The topology of H ii regions is sensitive to the resolution of our simulation volume. Friedrich et al. (2011) showed the impact of simulation resolution on the Euler characteristics. In this section, we study how the Betti numbers depend on the simulation resolution.

We show the Betti numbers curves estimated from the FN1 simulation volumes at different resolutions in Fig. 5. The solid curve represents the Betti numbers estimated from the data set at the intrinsic resolution of our simulation (Δx ≈ 2.4 Mpc). The dashed and dash–dotted lines represent their values estimated after four-cell and eight-cell smoothing, respectively. The four-cell and eight-cell smoothing approximately correspond to the resolution of SKA-Low observations at z ≈ 9 when maximum baselines of 2 and 1 km are used, respectively.

Evolution of the Betti numbers estimated from the $x_\mathrm{H\, \rm {ii}}$ field of the FN1 simulation at various resolutions. The solid, dashed, and dash–dotted line-styles give Betti numbers estimated from data set at simulation resolution, four-cell smoothing, and eight-cell smoothing, respectively. The latter two approximately correspond to the resolution achieved by SKA-Low observations with a maximum baseline of 2 and 1 km, respectively, for redshift z ≈ 9.
Figure 5.

Evolution of the Betti numbers estimated from the |$x_\mathrm{H\, \rm {ii}}$| field of the FN1 simulation at various resolutions. The solid, dashed, and dash–dotted line-styles give Betti numbers estimated from data set at simulation resolution, four-cell smoothing, and eight-cell smoothing, respectively. The latter two approximately correspond to the resolution achieved by SKA-Low observations with a maximum baseline of 2 and 1 km, respectively, for redshift z ≈ 9.

When we fit these curves with the parametric form given in equations (10), (11), and (12), we find that the peak positions and widths of the curve do not change as the resolution degrades. The resolution only affects the amplitude parameter Ai for all the Betti numbers curve. This decrease in amplitude is expected as the number of features contained in the data set decreases when the resolution is degraded. The fact that the peak positions and widths are less affected adds to the attractiveness of the Betti numbers as topological quantities.

4.5 Comparing the topology of different reionization scenarios

In this section we compare the topology of various reionization models using the Betti numbers. The top panels of Fig. 6 show the Betti curves for our fully numerical simulations. The FN1 and FN2 simulations use source efficiencies ζ = 50 and 40, respectively. Therefore, reionization is delayed in FN2 (see Fig. 2). However, even though the reionization history is different the peak positions of the Betti curves are quite similar. The β0 gives the number of isolated ionized regions and at early times, this number will be proportional to the number of active ionizing sources and not the efficiency of those sources. Note, however, that β0 will not be exactly equal to the number of sources as some regions are formed by clustered sources. Fig. 2 shows that FN2 reaches a similar stage Δz ≈ 1 later than FN1. The number density of ionizing sources will not differ that much between those two epochs. Therefore, the β0 curves for both these models are quite similar. However, the topology does differ, as evidenced by the β1 and β2 curves. The FN2 simulation contains more tunnels during the middle stages of reionization and more cavities from about the middle until near the end of reionization. Using all the curves together, we can distinguish between these two models.

Evolution of the Betti numbers estimated from the $x_\mathrm{H\, \rm {ii}}$ field of different simulations. The top and bottom panels show the Betti numbers for our fully numerical and semi-numerical simulation models, respectively.
Figure 6.

Evolution of the Betti numbers estimated from the |$x_\mathrm{H\, \rm {ii}}$| field of different simulations. The top and bottom panels show the Betti numbers for our fully numerical and semi-numerical simulation models, respectively.

To test the robustness of our conclusions, we also study the topology of reionization in a set of semi-numerical simulations. As semi-numerical simulations are fast, we can easily vary the simulation parameters and observe its impact on the Betti numbers. In the bottom panel of Fig. 6, we show the Betti numbers curves for our three semi-numerical simulations (see Table 1). SN1 and SN3 use the same Mmin value (⁠|$10^9 \, \mathrm{M}_\odot$|⁠) for dark matter haloes, which means that reionization is driven by the same source population. Therefore, the β0 curves for SN1 and SN3 overlap with each other. The Mmin used in SN2 is |$10^8 \, \mathrm{M}_\odot$|⁠. As a result SN2 contains more ionizing sources compared to SN1 and SN3. We see the imprint of this on the β0 curve that reaches a higher amplitude for the SN2 simulation.

The SN1 and SN2 simulations assume Rmfp to be 71 Mpc where as SN3 simulation assumes it to be 20 Mpc. SN1 and SN3 otherwise have the same source parameters. Rmfp describes the largest distance ionizing photons can travel from a source. At the early times, the value of Rmfp does not have a significant impact on the topology as most H ii regions are smaller than Rmfp. However, once larger regions form during the middle and late stages of reionization, a smaller value for Rmfp will inhibit the growth of ionized regions. This will allow more tunnels to survive, as evidenced by the difference between the β1 curves of SN1 and SN3. The largest impact of Rmfp is associated with the final phases of reionization as can be seen from the β2 curves. The values are much higher for SN3, indicating that imposing a short mean free path leads to the formation of many more small neutral islands.

As pointed out in Section 4, the rise of β1 in simulation FN1 marks the appearance of the percolation cluster. The percolation curves for all our reionization models can be found in Fig. 4. We find that for all models, percolation happens roughly at the epoch when β1 crosses β0 confirming our hypotesis.

The Betti numbers curves from all our reionization models, both fully numerical and semi-numerical, show the characteristic forms defined in Section 4. We present the values of the fit parameters for all models in Table 3. We conclude that these fitting formulae equations (10), (11), and (12) provide a useful description of the evolution of Betti numbers in reionization simulations.

5 BETTI NUMBERS OF 21-CM SIGNAL

In this section we will study the evolution of Betti numbers estimated from the 21-cm signal (δTb). As explained in Section 3.2, this signal depends on neutral fraction |$\big (x_\mathrm{H\, \rm {i}}(\pmb {r}) = 1-x_\mathrm{H\, \rm {ii}}(\pmb {r}) \big)$|⁠, density, and spin temperature fields (see equation 6). For our study we assume the Universe has been heated before reionization begins causing the spin temperature factor to saturate to 1 as (TsTCMB). Even with this assumption, the δTb will still contain properties of both the ionization and density fields.

We first study the topology of structures by putting a threshold (⁠|$\delta T^\mathrm{th}_\mathrm{b}$|⁠) on the δTb field. We measure the topology of structures with pixel values below a given value of |$\delta T^\mathrm{th}_\mathrm{b}$|⁠. In Fig. 7, we show colour plots of the Betti numbers estimated from the δTb field of the FN1 simulation against the values of |$\delta T^\mathrm{th}_\mathrm{b}$| (y-axis) and |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}$| (x-axis). For high values of |$\delta T^\mathrm{th}_\mathrm{b}$|⁠, the topology is mostly defined by the fluctuations in the density field. However the Betti numbers will not exactly trace the topology of the density field as reionization will reduce the signal in the (predominantly) dense regions. For low |$\delta T^\mathrm{th}_\mathrm{b}$| value, the structures trace the topology of ionization field. For |$\delta T^\mathrm{th}_\mathrm{b} \lesssim 10$| mK, the Betti numbers estimated from δTb fields are similar to that estimated from |$x_\mathrm{H\, \rm {ii}}$| field of FN1 simulation.

Contour plots of the values of β0 (left-hand panel), β1 (middle panel), and β2 (right-hand panel) of the δTb field from the FN1 simulation as a function of global ionization fraction $x^\mathrm{v}_\mathrm{H\, \rm {ii}}$ and threshold value ($\delta T^\mathrm{th}_\mathrm{b}$).
Figure 7.

Contour plots of the values of β0 (left-hand panel), β1 (middle panel), and β2 (right-hand panel) of the δTb field from the FN1 simulation as a function of global ionization fraction |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}$| and threshold value (⁠|$\delta T^\mathrm{th}_\mathrm{b}$|⁠).

For different reionization scenarios we will obtain different colour plots from the one shown in Fig. 7. However, it is difficult to compare scenarios with these plots as they will be affected by instrumental limitations such as lack of an absolute calibration of the signal, limited resolution, and noise. In the next section, we propose a method to explore the topological properties of reionization using mock 21-cm observations for the future SKA-Low telescope.

6 BETTI NUMBERS OF 21-CM IMAGES OBSERVED WITH RADIO TELESCOPE

As discussed in the previous section, the structures in the ionization field will be imprinted on the 21-cm signal. Therefore, we can study the morphology of H ii regions if we can identify these regions in 21-cm images. As the future SKA-Low has been designed to be able to produce such images (Mellema et al. 2015), we will consider mock SKA-Low observations in this section and study our ability to estimate the Betti numbers from such observations.

Identifying the ionized (or neutral) regions in 21-cm images is a non-trivial problem. Giri et al. (2018b) compared various methods taken from the field of image processing and found that the superpixel method works the best in identifying ionized (or neutral) regions in 21-cm images. In the superpixel method, connected pixels with similar properties are grouped together. These pixel groups are known as superpixels. We then construct the histogram from the mean intensity values of all the superpixels. From this histogram we find the threshold for the superpixels that classifies them into ionized and neutral superpixels. See Giri et al. (2018b) for a detailed description of the method. Note that the identification of the ionized regions at the same time yields an estimate for the fraction of the observed volume that is ionized, thus allowing to determine the Betti numbers as a function of ionized volume fraction rather than redshift.

In Section 6.1, we consider mock SKA-Low observations with infinite observation time as we want to first learn what we can achieve in this most optimal case with our method. In this situation, the 21-cm images will be free from noise and only be affected by the limited resolution and the lack of absolute calibration. In Section 6.2 we study the impact of telescope noise on our analysis strategy.

6.1 Limited resolution

Fig. 8 shows the impact of SKA-Low resolution on the estimated Betti numbers. The solid and dashed lines represent the Betti numbers calculated from the |$x_\mathrm{H\, \rm {ii}}$| data sets degraded to a resolution corresponding to a maximum baseline (B) of 2 and 1 km, respectively. These curves differ slightly from the ones shown in Fig. 5 as the resolution corresponding to a certain value of B is redshift dependent. Even though the peak amplitude decreases as the resolution is degraded, the peak position of all the Betti number curves estimated from |$x_\mathrm{H\, \rm {ii}}$| does not change. This behaviour agrees with our findings in Section 4.4.

The solid and dashed (dash–dotted and dotted) represent Betti numbers curve estimated from the $x_\mathrm{H\, \rm {ii}}$ (δTb) fields from simulation FN1 at resolutions corresponding to a maximum baseline B of 2 and 1 km, respectively.
Figure 8.

The solid and dashed (dash–dotted and dotted) represent Betti numbers curve estimated from the |$x_\mathrm{H\, \rm {ii}}$|Tb) fields from simulation FN1 at resolutions corresponding to a maximum baseline B of 2 and 1 km, respectively.

The dash–dotted and dotted lines in Fig. 8 represent the Betti numbers estimated from the δTb fields at a resolution corresponding to B  = 2 and 1 km, respectively. The peak position of β0 for the δTb field observed at B = 2 km matches the peak positions of the β0 curves derived from the |$x_\mathrm{H\, \rm {ii}}$| fields. However, for the lower resolution case of B = 1 km the β0 derived from the δTb fields attains its peak at a much earlier epoch. This is because the ionized regions are not very large at these early times and for the lower resolution the superpixel method struggles to separate the ionization and density fluctuations. This effect is in fact also present for the B = 2 km case where it causes the non-smooth increase of β0.

The middle panel of Fig. 8 shows that the β1 curves for δTb field are slightly biased towards larger |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}$| compared to the ones for |$x_\mathrm{H\, \rm {ii}}$| field. We also observe that the peak amplitude of β1 curves for δTb field decreases and the width of the curve increases compared to the ones for |$x_\mathrm{H\, \rm {ii}}$| field. At early times (⁠|$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\lesssim 0.2$|⁠), the β1 determined from |$x_\mathrm{H\, \rm {ii}}$| field is close to zero for all our reionization scenarios (see Section 4.5). However our method gives non-zero values for β1 determined from δTb field. This anomaly is again caused as the superpixel method struggles to separate ionization and density fluctuations during these early stages of reionization.

The right-most panel of Fig. 8 shows the β2 curves for δTb field. These curves are also slightly biased towards larger |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}$| compared to the ones for |$x_\mathrm{H\, \rm {ii}}$| field. Even though the resolution corresponding to B = 1 km smooths away more features compared to B = 2 km, the superpixel method identifies similar number of neutral regions as these neutral regions are relatively large structures (Giri et al. 2019a). Therefore, the β2 curves for both resolution cases overlap with each other.

6.2 Telescope noise

The SKA-Low observations will contain instrumental noise, which we model using the method described in Section 3.3.2. Here we study the Betti numbers estimated from noisy 21-cm images. We consider two observation times, which are tobs = 1000 h and 100 h. The 21-cm images observed at tobs = 1000 h is degraded to a resolution corresponding to B = 2 km. In order to get similar SNR, we use B = 1 km for the 21-cm images observed with tobs = 100 h.

In the left-hand-most, middle, and right-hand-most panels of Fig. 9, we show from left to right the β0, β1 and β2 curves estimated from δTb field. As can be seen from these curves our method can reconstruct the topology of reionization most reliably for epochs |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\gtrsim 0.4$| from the noisy 21-cm images. For earlier times we find noisy Betti curves, which is what we expect for data sets in which the noise dominates over the signal. In Section 4, we found that the topology of reionization during the early stages is mostly traced by the β0 values. These are very difficult to extract from the noisy 21-cm images as most ionized regions are small-scale features that are indistinguishable from noise. The β1 and β2 values are more sensitive to the topology of reionization during the middle and late stages of reionization, respectively. We observe good reconstruction of β1 and β2 curves in Fig. 9 derived from noisy 21-cm images, with the exception of the β1 values for the B = 2 km, 1000 h observation that are severely underestimated.

Betti numbers estimated from the δTb field from simulation FN1 when observed with SKA-Low. The solid and dashed lines represent Betti numbers estimated from noiseless δTb observed at resolution corresponding to a maximum SKA-Low baseline of 2 and 1 km, respectively. The dash–dotted and dotted lines represent Betti numbers estimated from 1000 and 100 h observation of δTb observed at resolution corresponding to a maximum SKA-Low baseline B of 2 and 1 km, respectively.
Figure 9.

Betti numbers estimated from the δTb field from simulation FN1 when observed with SKA-Low. The solid and dashed lines represent Betti numbers estimated from noiseless δTb observed at resolution corresponding to a maximum SKA-Low baseline of 2 and 1 km, respectively. The dash–dotted and dotted lines represent Betti numbers estimated from 1000 and 100 h observation of δTb observed at resolution corresponding to a maximum SKA-Low baseline B of 2 and 1 km, respectively.

As the β0 curves at |$x^\mathrm{v}_\mathrm{H\, \rm {ii}} \lesssim 0.4$| are noisy, we fit equation (10) to the these curves for |$x^\mathrm{v}_\mathrm{H\, \rm {ii}} \gtrsim 0.4$| only. We list the fit parameter values in Table 3. Even though we cannot see any clear peak position in the β0 curves, the fit for the mock observations tobs = 1000 h does give us good values for μ0 and σ0. Even though the curve derived for tobs = 100 h seems to resemble the one for tobs = 1000 h, our cross-validation technique shows that no reasonable fit can be made. We therefore conclude that to study the β0 evolution from 21-cm images, a long observation time is needed.

Similarly, we fitted parameters of equations (11) and (12) to the β1 and β2 curves for |$x^\mathrm{v}_\mathrm{H\, \rm {ii}} \gtrsim 0.4$|⁠. These fit parameters are also given in Table 3. The peak position and width of the β1 curves derived from the noisy 21-cm images match quite well with the ones for the noiseless case. However, their amplitude is lower than for the ideal case. The β2 curves derived from noisy 21-cm images almost overlap with the curves derived from the noiseless cases, leading to very similar fitting parameters for β2 for all cases, irrespective of observation time. This indicates that we can study the topology of reionization during its middle and late stages even with 21-cm observations with relatively short observation times.

6.3 Comparing different reionization models

In this section, we compare Betti number curves for all our reionization models when observed with SKA-Low. The top panel of Fig. 10 shows the curves derived from mock observations at tobs = 1000 h and B = 2 km. We find that we can extract the correct evolution the topology of reionization in all the models. These curves are more reliable at epochs |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\gtrsim 0.4$|⁠. Using all the Betti numbers together, we can distinguish between the models. In Table 3, we give the parameter values while fitting all these curves with equations (10), (11), and (12). The derived μ and σ values are quite close to the values derived from the |$x_\mathrm{H\, \rm {ii}}$| field.

Betti numbers estimated from the δTb field for all our reionization models when observed with SKA-Low. The top and bottom panels represent the Betti number curves extracted from mock observations of tobs = 1000 h (B = 2 km) and tobs = 100 h (B = 1 km), respectively. Different models are marked with different line-styles that are shown in the legend.
Figure 10.

Betti numbers estimated from the δTb field for all our reionization models when observed with SKA-Low. The top and bottom panels represent the Betti number curves extracted from mock observations of tobs = 1000 h (B = 2 km) and tobs = 100 h (B = 1 km), respectively. Different models are marked with different line-styles that are shown in the legend.

We also compare the Betti number curves for all our reionization models in our more pessimistic scenario. The bottom panel of Fig. 10 gives the curves derived from mock observations at tobs = 100 h and B = 1 km. Even though the curves are noisy, we can discern the expected evolution of the Betti numbers for each model. These curves are more reliable at epochs |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\gtrsim 0.5$|⁠, making it difficult to derive a fit for the evolution of β0. In Table 3 contains the fit parameters for the β1 and β2 curves. As the late neutral islands tend to be large, we find that we can measure the topology of reionization at late times quite well. As noted above, this can even be done using relatively short observations times with SKA-Low.

7 SUMMARY

In this work we explored a new method to study the 3D topology of H ii regions during reionization, namely the three Betti numbers. In the perspective of the distribution of ionized regions the zeroth Betti number (β0) gives the number of isolated H ii regions, the first Betti number (β1) gives the number of neutral tunnels through ionized regions, and the second Betti number (β2) gives the number of neutral islands embedded in ionized regions.

We investigated the evolution of the Betti numbers in a small set of fully numerical and semi-numerical simulations and found a characteristic pattern in which each of the Betti numbers first rises and then falls, each attaining a maximum value at different stages of reionization. When reionization starts, most of the ionized bubbles are isolated and β0 is the largest of the three numbers. It initially rises as the number of ionizing sources, including individual and clustered sources, grows. However, as existing ionized regions grow and new ones continue to form, they will start to overlap, leading eventually to a decrease of β0.

When substantial overlapping of ionized bubbles starts, more and more of these connected regions will be crossed by neutral tunnels and we consequently observe a rise in the values of β1. The time when the decreasing β0 and the increasing β1 cross corresponds to the appearance of the ionized percolation cluster (Furlanetto & Oh 2016). β1 reaches its maximum value at the middle stages of reionization. During this stage, isolated neutral regions, some of them the remnants of broken up tunnels, become more common and therefore the value of the second Betti number (β2) starts to increase. The time when the β1 and β2 curves cross corresponds to the disappearance of the neutral percolation cluster. During the later stages of reionization β2 reaches its peak value after which it drops as the last neutral islands disappear.

We find that the characteristic evolution of the Betti numbers with the volume-averaged ionization fraction (⁠|$x^{\rm v}_{\rm H\, \rm {ii}}$|⁠) of the Universe is quite well fitted by simple functions, a normal or Gaussian function for β1, and lognormal functions for β0 and β2. Each of these is determined by three parameters specifying the peak amplitude (Ai), peak position (μi), and width (σi) of the curves. Using our suite of reionization simulations, we showed that we can distinguish reionization models by their sets of Betti curves.

On the basis of these results we conclude that the Betti numbers provide a better way to describe the topology of reionization than the much better studied genus or Euler characteristic. Not only do the Betti numbers represent a set of intuitively clear morphological concepts in the context of ionized regions, they also display characteristic evolutionary patterns in which they each distinguish different stages of reionization, and which also appear to fit simple functional forms. Furthermore, their evolution shows a connection with the percolation process characteristic of reionization. As the Euler characteristic can easily be calculated from the Betti numbers, there appears to be no advantage to studying the topology of reionization with it.

The redshifted 21-cm signal probes the distribution of neutral hydrogen gas during reionization. Observations with the future SKA-Low will deliver 3D tomographic data sets consisting of sky images of the 21-cm signal at different frequencies. We explored the prospect of deriving the topology of H ii regions from such data sets. To identify ionized regions in the 21-cm data we use the superpixels method introduced in Giri et al. (2018b). We find that we can retrieve the topology of reionization quite well for middle and late stages of reionization (⁠|$x^\mathrm{v}_\mathrm{H\, \rm {ii}} \gtrsim 0.4$|⁠). At early times, most H ii regions are small and difficult to identify in the mock 21-cm data sets due to a combination of low resolution and instrumental noise. We tested our method for two observation times, 1000 and 100 h. For both cases we could retrieve the evolution of β1 and β2 better than the one for β0. We therefore expect that β1 and β2 in practice will be more robust measures of topology than β0.

All Betti numbers curves derived from the mock 21-cm observations are noisy due to the imperfections of the observational data that do not allow a perfect identification of H ii regions. However when we fit β1 and β2 curves with our three parameter models, the intrinsic values of the model parameters are retrieved quite well. Retrieving the model parameters for β0 is more challenging for the reasons explained above.

We have also compared the topologies determined from mock 21-cm observations constructed from our suite of reionization simulations. We can clearly distinguish between these models for a 1000 h observation with SKA-Low. Determining the evolution of the Betti numbers with a 100 h observation is more difficult. However, we can still distinguish between our models using β1 and β2. As isolated neutral regions are relatively large structures (Giri et al. 2019a), the values of β2 estimated from the mock observations are the least affected by limited resolution and noise.

In this paper we have explored a small set of five reionization simulations, produced by two different codes. One might wonder about the validity of some of our conclusions, e.g. regarding the shapes of the Betti curves. If reionization proceeded inside-out we expect the Betti curves to always show the behaviour we have found: each Betti number peaks at a different time, in the order β0, β1, β2. The reason for this is that for inside-out reionization there will be a strong correlation between the local density and the redshift of reionization. The areas of highest density reionize first and the ones of lowest density reionize last. This implies that different stages of reionization can approximately be connected to different threshold values in the density field, where everything above that threshold value is ionized. In this case we expect the evolution of the Betti numbers of ionized regions to approximately follow the Betti curves for different threshold values in a GRF shown in Fig. A1. This means that each Betti number will peak at a different time, in the order β0, β1, β2, just as we have found for all our models. As reionization is a radiative transfer process, the correlation between density and redshift of reionziation will never be perfect, which is why the actual curves are not identical to the more or less Gaussian curves from Fig. A1. Why the evolutionary Betti curves would favour lognormal shapes for β0 and β2 and a Gaussian shape for β1 is harder to explain. We consider our proposed fitting functions to be a useful working hypothesis until cases are found where this approach breaks down.

Our study has used a number of simplifying assumptions which we would like to briefly discuss here. The first is that we have calculated the differential brightness temperature assuming the high-spin temperature limit. If X-ray heating is not as efficient as generally assumed, this assumption may not be globally valid during the earlier stages of reionization. However, as we have seen, the early stages of reionization will anyway be challenging to study topologically as it will be difficult to identify ionized regions in the data.

We have also neglected the various line-of-sight effects when constructing our mock 21-cm observations. These are the LC effect (e.g. Datta et al. 2012) and RSD (e.g. Jensen et al. 2013). The former effect is caused by the evolution of the signal along the frequency direction. If we would analyse 21-cm data sets with a large frequency range (≳ 10 MHz), we would see more overlapping regions at the high frequency end than at the low frequency end. This will affect the topology of H ii regions and therefore the Betti numbers. However, this can be avoided by choosing a narrower frequency range. The RSD effect displaces the signal from its cosmological redshift due to peculiar velocities of gas. This process mostly alters the amplitude of the signal and has a relatively minor impact on the morphology of H ii regions (Giri et al. 2018a) Therefore, we expect only a minor impact on the Betti numbers, specially at the resolution of the telescope. We defer a detailed study of the impact of these line-of-sight effects to the future.

A potentially larger challenge is the impact from bright foreground signals that are many times brighter than the 21-cm signal we want to analyse (Jelic et al. 2008). A good foreground mitigation strategy is crucial to extract the 21-cm signal from radio observation. However, these methods are not perfect and therefore will leave residuals in the 21-cm images. If our structure identification methods struggles to distinguish between these residual foreground and the H ii regions, then studying the topology of reionization would become difficult. In such a case, we have to develop a foreground mitigation method that preserves the topological information. A detailed study of the impact of foreground residual is beyond the scope of this paper.

ACKNOWLEDGEMENTS

We acknowledge Rien van de Weygaert for useful discussion. GM is supported in part by Swedish Research Council grant 2016-03581. The computations were enabled by resources provided by the Swedish National Infrastructure for Computing (SNIC) at PDC partially funded by the Swedish Research Council through grant agreement no. 2018-05973. We also have used results obtained using the Partnership for Advanced Computing in Europe (PRACE) Research Infrastructure resources Curie based at the Très Grand Centre de Calcul (TGCC) operated by CEA near Paris, France, and Marenostrum based in the Barcelona Supercomputing Center, Spain. Time on these resources was awarded by PRACE under PRACE4LOFAR grants 2012061089 and 2014102339 as well as under the Multi-scale Reionization grants 2014102281 and 2015122822.

DATA AVAILABILITY

The data underlying this article will be shared on reasonable request to the corresponding author.

Footnotes

3

The configuration is described in document SKA-TEL-SKO-0000557 Rev 1 and the positions of the individual stations in document SKA-TEL-SKO-0000422 Rev 2, both retrievable at https://astronomers.skatelescope.org/documents/

4

This should perhaps be called a depercolation event as the neutral percolation cluster disappears.

REFERENCES

Bandyopadhyay
B.
,
Choudhury
T. R.
,
Seshadri
T. R.
,
2017
,
MNRAS
,
466
,
2302

Barkana
R.
,
2016
,
Phys. Rep.
,
645
,
1

Barkana
R.
,
2018
,
Nature
,
555
,
71

Betti
E.
,
1870
,
Annali di Matematica Pura ed Applicata (1867-1897)
,
4
,
140

Bobrowski
O.
,
Skraba
P.
,
2020
,
Phys. Rev. E
,
101
,
32304

Bond
J. R.
,
Cole
S.
,
Efstathiou
G.
,
Kaiser
N.
,
1991
,
ApJ
,
379
,
440

Bowman
J. D.
,
Rogers
A. E. E.
,
2010
,
Nature
,
468
,
796

Bowman
J. D.
,
Rogers
E. E.
,
Monsalve
R. A.
,
Mozdzen
T. J.
,
Mahesh
N.
,
2018
,
Nat. Publ. Group
,
555
,
67

Burns
J. O.
et al. ,
2017
,
ApJ
,
844
,
33

Busch
P.
,
Eide
M. B.
,
Ciardi
B.
,
Kakiichi
K.
,
2020
,
MNRAS
,
498
,
4533

Chen
Z.
,
Xu
Y.
,
Wang
Y.
,
Chen
X.
,
2019
,
ApJ
,
885
,
23

Chingangbam
P.
,
Park
C.
,
Yogendran
K. P.
,
van de Weygaert
R.
,
2012
,
ApJ
,
755
,
122

Datta
K. K.
,
Mellema
G.
,
Mao
Y.
,
Iliev
I. T.
,
Shapiro
P. R.
,
Ahn
K.
,
2012
,
MNRAS
,
424
,
1877

Davis
M.
,
Efstathiou
G.
,
Frenk
C. S.
,
White
S. D. M.
,
1985
,
ApJ
,
292
,
371

Deboer
D. R.
et al. ,
2017
,
Publications of the Astronomical Society of the Pacific
,
129
,
045001

Dixon
K. L.
,
Iliev
I. T.
,
Mellema
G.
,
Ahn
K.
,
Shapiro
P. R.
,
2016
,
MNRAS
,
456
,
3011

Doroshkevich
A. G.
,
1970
,
Astrophysics
,
6
,
320

Edelsbrunner
H.
,
Harer
J.
,
2008
,
Contemp. Math.
,
453
,
257

Edelsbrunner
H.
,
Harer
J.
,
2010
,
Computational Topology: An Introduction
.
American Mathematical Soc
,
Providence, Rhode Island

Elbers
W.
,
Van De Weygaert
R.
,
2019
,
MNRAS
,
486
,
1523

Feng
C.
,
Holder
G.
,
2018
,
ApJ
,
858
,
L17

Fialkov
A.
,
Barkana
R.
,
2019
,
MNRAS
,
486
,
1763

Fialkov
A.
,
Barkana
R.
,
Cohen
A.
,
2018
,
Phys. Rev. Lett.
,
121
,
11101

Fiorio
C.
,
Gustedt
J.
,
1996
,
Theor. Comput. Sci.
,
154
,
165

Friedrich
M. M.
,
Mellema
G.
,
Alvarez
M. A.
,
Shapiro
P. R.
,
Iliev
I. T.
,
2011
,
MNRAS
,
413
,
1353

Furlanetto
S. R.
,
Oh
S. P.
,
2016
,
MNRAS
,
457
,
1813

Furlanetto
S. R.
,
Zaldarriaga
M.
,
Hernquist
L.
,
2004
,
AJ
,
613
,
1

Furlanetto
S. R.
,
Oh
S. P.
,
Briggs
F. H.
,
2006
,
Phys. Rep.
,
433
,
181

Ghara
R.
,
Choudhury
T. R.
,
Datta
K. K.
,
Choudhuri
S.
,
2017
,
MNRAS
,
464
,
2234

Ghara
R.
et al. ,
2020
,
MNRAS
,
493
,
4728

Ghara
R.
,
Giri
S. K.
,
Ciardi
B.
,
Mellema
G.
,
Zaroubi
S.
,
2021
,
MNRAS
,
503
,
4551

Giri
S. K.
,
2019
,
PhD thesis
,
Department of Astronomy, Stockholm University

Giri
S. K.
,
Mellema
G.
,
Dixon
K. L.
,
Iliev
I. T.
,
2018a
,
MNRAS
,
473
,
2949

Giri
S. K.
,
Mellema
G.
,
Ghara
R.
,
2018b
,
MNRAS
,
479
,
5596

Giri
S. K.
,
Mellema
G.
,
Aldheimer
T.
,
Dixon
K. L.
,
Iliev
I. T.
,
2019a
,
MNRAS
,
489
,
1590

Giri
S. K.
,
D’Aloisio
A.
,
Mellema
G.
,
Komatsu
E.
,
Ghara
R.
,
Majumdar
S.
,
2019b
,
J. Cosmol. Astropart. Phys.
,
2019
,
058

Giri
S. K.
,
Mellema
G.
,
Jensen
H.
,
2020a
,
J. Open Source Softw.
,
5
,
2363

Giri
S. K.
,
Zackrisson
E.
,
Binggeli
C.
,
Pelckmans
K.
,
Cubo
R.
,
2020b
,
MNRAS
,
491
,
5277

Gleser
L.
,
Nusser
A.
,
Ciardi
B.
,
Desjacques
V.
,
2006
,
MNRAS
,
370
,
1329

Gnedin
N. Y.
,
2000
,
ApJ
,
535
,
530

Gonzalez-Lorenzo
A.
, ,
Juda
M.
,
Bac
A.
,
Mari
J. L.
,
Real
P.
,
2016
,
in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
,
Berlin
. p.
130

Gott
J. R. I.
,
Dickinson
M.
,
Melott
A. L.
,
1986
,
ApJ
,
306
,
341

Gott
J. R. I.
,
Mao
S.
,
Park
C.
,
Lahav
O.
,
1992
,
ApJ
,
385
,
26

Greig
B.
,
Mesinger
A.
,
2015
,
MNRAS
,
449
,
4246

Greig
B.
,
Trott
C. M.
,
Barry
N.
,
Mutch
S. J.
,
Pindor
B.
,
Webster
R. L.
,
Stuart
J.
,
Wyithe
B.
,
2020a
,
MNRAS
,
500
,
5322

Greig
B.
et al. ,
2020b
,
MNRAS
,
501
,
1

Harnois-Déraps
J.
,
Pen
U.-L.
,
Iliev
I. T.
,
Merz
H.
,
Emberson
J. D.
,
Desjacques
V.
,
2013
,
MNRAS
,
436
,
540

Hastie
T.
,
Tibshirani
R.
,
Friedman
J.
,
2009
,
The Elements of Statistical Learning: Data Mining, Inference, and Prediction
.
Springer Science & Business Media
,
New York

Hatcher
A.
,
2002
,
Algebraic Topology
.
Cambridge Univ. Press
,
Cambridge

Hills
R.
,
Kulkarni
G.
,
Meerburg
P. D.
,
Puchwein
E.
,
2018
,
Nature
,
564
,
E32

Hoshen
J.
,
Kopelman
R.
,
1976
,
Phys. Rev. B
,
14
,
3438

Hutter
A.
, , ,
Dayal
P.
,
Yepes
G.
,
Gottlöber
S.
,
Legrand
L.
,
Ucci
G.
,
2021
,
MNRAS
,
503
,
3698

Iliev
I. T.
,
Mellema
G.
,
Pen
U. L.
,
Merz
H.
,
Shapiro
P. R.
,
Alvarez
M. A.
,
2006
,
MNRAS
,
369
,
1625

Iliev
I. T.
,
Mellema
G.
,
Ahn
K.
,
Shapiro
P. R.
,
Mao
Y.
,
Pen
U. L.
,
2014
,
MNRAS
,
439
,
725

Jelic
V.
et al. ,
2008
,
MNRAS
,
389
,
1319

Jensen
H.
et al. ,
2013
,
MNRAS
,
435
,
460

Kaczynski
T.
,
Mischaikow
K.
,
Mrozek
M.
,
2004
,
Computational Homology, Vol. 157 of Applied Mathematical Sciences
.
Springer-Verlag
,
New York
, p.
35

Kakiichi
K.
et al. ,
2017
,
MNRAS
,
471
,
1936

Kapahtia
A.
,
Chingangbam
P.
,
Appleby
S.
,
Park
C.
,
2018
,
J. Cosmol. Astropart. Phys.
,
2018
,
011

Kapahtia
A.
,
Chingangbam
P.
,
Appleby
S.
,
2019
,
J. Cosmol. Astropart. Phys.
,
2019
,
053

Kapahtia
A.
,
Chingangbam
P.
,
Ghara
R.
,
Appleby
S.
,
Choudhury
T. R.
,
2021
,
J. Cosmol. Astropart. Phys.
,
2021
,
26

Keating
L. C.
,
Weinberger
L. H.
,
Kulkarni
G.
,
Haehnelt
M. G.
,
Chardin
J.
,
Aubert
D.
,
2019
,
MNRAS
,
491
,
1736

Komatsu
E.
et al. ,
2011
,
ApJS
,
192
,
18

Koopmans
L.
et al. ,
2015
,
in Advancing Astrophysics with the Square Kilometre Array (AASKA14)
. http://pos.sissa.it/

Kulkarni
G.
,
Keating
L. C.
,
Haehnelt
M. G.
,
Bosman
S. E. I.
,
Puchwein
E.
,
Chardin
J.
,
Aubert
D.
,
2019
,
MNRAS
,
485
,
L24

Lee
K.
,
Cen
R.
,
Gott III
J. R.
,
Trac
H.
,
2008
,
ApJ
,
675
,
8

Lim
A.
,
Mellema
G.
,
2003
,
A&A
,
405
,
189

Majumdar
S.
,
Pritchard
J. R.
,
Mondal
R.
,
Watkinson
C. A.
,
Bharadwaj
S.
,
Mellema
G.
,
2018
,
MNRAS
,
476
,
4007

Makarenko
I.
,
Shukurov
A.
,
Henderson
R.
,
Rodrigues
L. F. S.
,
Bushby
P.
,
Fletcher
A.
,
2018
,
MNRAS
,
475
,
1843

Matsubara
T.
,
1994
,
ApJ
,
434
,
L43

Matsubara
T.
,
Yokoyama
J.
,
1996
,
ApJ
,
463
,
409

Mellema
G.
,
Iliev
I. T.
,
Alvarez
M. A.
,
Shapiro
P. R.
,
2006a
,
New A
,
11
,
374

Mellema
G.
,
Iliev
I. T.
,
Pen
U. L.
,
Shapiro
P. R.
,
2006b
,
MNRAS
,
372
,
679

Mellema
G.
et al. ,
2013
,
Exp. Astron.
,
36
,
235

Mellema
G.
,
Koopmans
L.
,
Shukla
H.
,
Datta
K. K.
,
Mesinger
A.
,
Majumdar
S.
,
2015
,
in Advancing Astrophysics with the Square Kilometre Array (AASKA14)
,
Proceeding of science
, Giardini Naxos, Italy. p.
10
http://pos.sissa.it/

Mertens
F. G.
et al. ,
2020
,
MNRAS
,
493
,
1662

Mesinger
A.
,
Furlanetto
S.
,
Cen
R.
,
2011
,
MNRAS
,
411
,
955

Monaghan
J. J.
,
1992
,
ARA&A
,
30
,
543

Mondal
R.
et al. ,
2020
,
MNRAS
,
498
,
178

Morales
M. F.
,
Wyithe
J. S. B.
,
2010
,
ARA&A
,
48
,
127

Muñoz
J. B.
,
Loeb
A.
,
2018
,
Nature
,
557
,
684

Nasir
F.
,
D’Aloisio
A.
,
2020
,
MNRAS
,
494
,
3080

Park
C.
,
Kim
J.
,
Gott
J.
,
Richard
I.
,
2005
,
ApJ
,
633
,
1

Park
C.
et al. ,
2013
,
J. Korean Astron. Soc.
,
46
,
125

Planck Collaboration
,
2016
,
Astronomy & Astrophysics
,
594
,
A13

Planck Collaboration
,
2018
,
A&A
,
641
,
67

Pranav
P.
,
Edelsbrunner
H.
,
Van De Weygaert
R.
,
Vegter
G.
,
Kerber
M.
,
Jones
B. J. T.
,
Wintraecken
M.
,
2017
,
MNRAS
,
465
,
4281

Pranav
P.
et al. ,
2019
,
MNRAS
,
485
,
4167

Press
W. H.
,
Davis
M.
,
1982
,
ApJ
,
259
,
449

Price
D.
et al. ,
2018
,
MNRAS
,
478
,
4193

Pritchard
J. R.
,
Furlanetto
S. R.
,
2007
,
MNRAS
,
376
,
1680

Pritchard
J. R.
,
Loeb
A.
,
2012
,
Reports on Progress in Physics
,
75
,
086901

Raga
A.
,
Mellema
G.
,
Arthur
S.
,
Binette
L.
,
Ferruit
P.
,
Steffen
W.
,
1999
,
Rev. Mex. Astron. Astrofis.
,
35
,
123

Rohlfs
K.
,
Wilson
T.
,
2013
,
Tools of Radio Astronomy
.
Springer Science & Business Media
,
Berlin

Santos
F. A. N.
,
Raposo
E. P.
,
Coutinho-Filho
M. D.
,
Copelli
M.
,
Stam
C. J.
,
Douw
L.
,
2019
,
Phys. Rev. E
,
100
,
032414

Schmalzing
J.
,
Buchert
T.
,
1997
,
ApJ
,
482
,
L1

Scoccimarro
R.
,
1998
,
MNRAS
,
299
,
1097

Sheth
R. K.
,
Tormen
G.
,
1999
,
MNRAS
,
308
,
119

Singh
S.
,
Subrahmanyan
R.
,
2019
,
ApJ
,
880
,
26

Singh
S.
et al. ,
2017
,
ApJ
,
845
,
L12

Songaila
A.
,
Cowie
L. L.
,
2010
,
ApJ
,
721
,
1448

Sousbie
T.
,
2011
,
MNRAS
,
414
,
350

Sousbie
T.
,
Pichon
C.
,
Kawahara
H.
,
2011
,
MNRAS
,
414
,
384

Sullivan
D.
,
Iliev
I. T.
,
Dixon
K. L.
,
2018
,
MNRAS
,
473
,
38

Tashiro
H.
,
Kadota
K.
,
Silk
J.
,
2014
,
Phys. Rev. D
,
90
,
83522

Tomita
H.
,
1986
,
Progress of Theoretical Physics
,
76
,
952
, doi:

Trott
C. M.
et al. ,
2020
,
MNRAS
,
493
,
4711

van de Weygaert
R.
,
Platen
E.
,
Vegter
G.
,
Eldering
B.
,
Kruithof
N.
,
2010
, in
Mostafavi
M. A.
, ed.,
Alpha Shape Topology of the Cosmic Web, In 2010 International Symposium on Voronoi Diagrams in Science and Engineering
,
IEEE
,
Quebec, Canada
, p.
224

van de Weygaert
R.
et al. ,
2011
, in
Gavrilova
M. L.
,
Tan
C. J, K.
,
Mostavi
M. A.
, eds,
Alpha, Betti and the Megaparsec Universe: On the Topology of the Cosmic Web, Vol. 6970
, Springer,
Berlin Heidelberg
, p.
60

van der Walt
S.
et al. ,
2014
,
PeerJ
,
2
,
e453

van Haarlem
M. P.
et al. ,
2013
,
A&A
,
556
,
A2

Wagner
H.
,
Chen
C.
,
Vuçini
E.
,
2012
, in
Peikert
R.
,
Hauser
H.
,
Carr
H.
,
Fuchs
R.
, eds,
Topological Methods in Data Analysis and Visualization II: Theory, Algorithms, and Applications
.
Springer
,
Berlin, Heidelberg
, p.
91

Watkinson
C. A.
,
Giri
S. K.
,
Ross
H. E.
,
Dixon
K. L.
,
Iliev
I. T.
,
Mellema
G.
,
Pritchard
J. R.
,
2019
,
MNRAS
,
482
,
2653

Watson
W. A.
,
Iliev
I. T.
,
D’Aloisio
A.
,
Knebe
A.
,
Shapiro
P. R.
,
Yepes
G.
,
2013
,
MNRAS
,
433
,
1230

Wayth
R. B.
et al. ,
2018
,
Publ. Astron. Soc. Aust.
,
35
,
33

Wilding
G.
,
Nevenzeel
K.
,
van de Weygaert
R.
,
Vegter
G.
,
Pranav
P.
,
Jones
B. J. T.
,
Efstathiou
K.
,
Feldbrugge
J.
,
2020
,
preprint (arXiv:2011.12851)

Wu
K.
,
Otoo
E.
,
Shoshani
A.
,
2005
,
Medical Imaging 2005: Image Processing
,
Proceedings of the SPIE
,
Vol. 5747
. p.
1965
, doi:

Yoshiura
S.
,
Shimabukuro
H.
,
Takahashi
K.
,
Matsubara
T.
,
2017
,
MNRAS
,
465
,
394

Zaroubi
S.
,
2013
,
The Epoch of Reionization
.
Springer
,
Berlin, Heidelberg
, p.
45

Zomorodian
A.
,
Carlsson
G.
,
2005
,
Discrete Comput. Geom.
,
33
,
249

APPENDIX A: TOPOLOGY OF GAUSSIAN RANDOM FIELD

In this section we use a GRF to validate our Betti numbers estimator for digital data. Our GRF is constructed from a (300)3 data cube filled with Gaussian random numbers. Due to discretization, our data cube will not be a perfect GRF. Therefore, following Park et al. (2013) we smooth this data with a Gaussian filter with a FWHM of five cell widths.

We identify structures by putting a threshold on the intensity values. All the pixels below the threshold constitute the structure whose topology we want to study. For any field |$\mathcal {A}$| the thresholds will scan through all the intensity values. Therefore, we will get a set of Betti numbers corresponding to a set of threshold values. The set of thresholds ν is defined such that |$\mathcal {A} = \sqrt{\left\langle \mathcal {A}^2 \right\rangle } \nu + \left\langle \mathcal {A} \right\rangle$|⁠.

The solid curve in Fig. A1 shows the Euler characteristic (χ) curve of the GRF. The χ curve is an even function with two peaks and one trough that follows a fixed analytical form (see Doroshkevich 1970; Tomita 1986). Giri et al. (2019a) already showed that the χ curve derived by our estimator for digital data matches this analytical form.

The Euler characteristics χ (solid, black) and Betti numbers β0 (dashed, blue), β1 (dotted, red) and β2 (dash–dotted, green) of a Gaussian random field calculated for various threshold values ν.
Figure A1.

The Euler characteristics χ (solid, black) and Betti numbers β0 (dashed, blue), β1 (dotted, red) and β2 (dash–dotted, green) of a Gaussian random field calculated for various threshold values ν.

Fig. A1 also shows the Betti numbers βκ as a function of threshold ν as determined with our estimator (see Section 2.2). The dashed, dotted, and dash–dotted curve are β0(ν), β1(ν), and β2(ν), respectively. We see that the left peak of the χ(ν) curve is caused by the peak in β0 or a large number of components compared to tunnels and cavities. Similarly, the right peak in χ is associated with a large value of β2 or a large number of cavities compared to components and tunnels. The trough in the χ(ν) curve is due to the value of β1 or a large number of tunnels compared to components and cavities. The Betti curves for a GRF estimated with our method agree with those found in previous works. As far as we know no analytical forms exist for these βκ(ν) curves. See Park et al. (2013) and Pranav et al. (2019) for a more extensive description of the Betti numbers of GRFs.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.