Measuring the topology of reionization with Betti numbers

ABSTRACT

The distribution of ionized hydrogen during the epoch of reionization (EoR) has a complex morphology. We propose to measure the 3D topology of ionized regions using the Betti numbers. These quantify the topology using the number of components, tunnels, and cavities in any given field. Based on the results for a set of reionization simulations we find that the Betti numbers of the ionization field show a characteristic evolution during reionization, with peaks in the different Betti numbers characterizing different stages of the process. The shapes of their evolutionary curves can be fitted with simple analytical functions. We also observe that the evolution of the Betti numbers shows a clear connection with the percolation of the ionized and neutral regions and differs between different reionization scenarios. Through these properties, the Betti numbers provide a more useful description of the topology than the widely studied Euler characteristic or genus. The morphology of the ionization field will be imprinted on the redshifted 21-cm signal from the EoR. We construct mock image cubes using the properties of the low-frequency element of the future Square Kilometre Array and show that we can extract the Betti numbers from such data sets if an observation time of 1000 h is used. Even for a much shorter observation time of 100 h, some topological information can be extracted for the middle and later stages of reionization. We also find that the topological information extracted from the mock 21-cm observations can put constraints on reionization models.

galaxies: high-redshift, intergalactic medium, cosmology: theory, dark ages, reionization, first stars, early Universe

1 INTRODUCTION

The 21-cm signal produced by the hyperfine transition of neutral hydrogen is one of the most promising signals for probing the period from the history of our Universe when it transitioned from a cold and neutral state to a hot and ionized state (e.g. Furlanetto, Oh & Briggs 2006; Morales & Wyithe 2010; Zaroubi 2013). This transition period is known as the epoch of reionization (EoR), and ended around redshift z ≈ 5.5–6 (Keating et al. 2019; Kulkarni et al. 2019; Nasir & D’Aloisio 2020). The 21-cm signal produced during EoR will be redshifted and found at the low frequency end of the radio band. It traces the distribution of neutral hydrogen gas present in the intergalactic medium (IGM). Observing this signal will therefore shed light not only on the progress of reionization but also on a range of other issues such as the formation of first compact structures (e.g. Barkana 2016).

The global signal experiment, Experiment to Detect the Global EoR Signature (EDGES; Bowman & Rogers 2010), has claimed a detection of the sky averaged 21-cm signal from z ≈ 17 (Bowman et al. 2018). This detection is debated as its nature questions our current understanding of the early Universe (e.g. Hills et al. 2018; Singh & Subrahmanyan 2019). See Tashiro, Kadota & Silk (2014), Barkana (2018), Fialkov, Barkana & Cohen (2018), Muñoz & Loeb (2018), Feng & Holder (2018), and Fialkov & Barkana (2019) for few possible theories that can explain the Bowman et al. (2018) observation. Other global experiments, such as the Large-aperture Experiment to Detect the Dark Age (LEDA; Price et al. 2018) the Dark Ages Radio Explorer (DARE; Burns et al. 2017) and SARAS2 (Singh et al. 2017) are also trying to detect the global 21-cm signal. As we wait for an independent experiment to confirm the Bowman et al. (2018) observation, we will assume the standard cosmology in this study.

There also exists numerous efforts to detect the fluctuations in the signal using interferometric radio telescopes, such as the Low Frequency Array (LOFAR; e.g. van Haarlem et al. 2013), the Murchison Widefield Array (MWA; e.g. Wayth et al. 2018), and Hydrogen Epoch of Reionization Array (HERA; e.g. Deboer et al. 2017). Even though the fluctuations have not been detected, upper limits have been placed on the 21-cm power spectrum (e.g. Mertens et al. 2020; Trott et al. 2020) that have started to put constraints on the reionization process (e.g. Ghara et al. 2020, 2021; Greig et al. 2020a, b; Mondal et al. 2020). In the near future, the low frequency element of the Square Kilometre Array (SKA-Low; e.g. Mellema et al. 2013; Koopmans et al. 2015) will start observing the 21-cm signal produced during the EoR. SKA-Low is expected to be sensitive enough to not only detect the 21-cm power spectrum but also to produce images (e.g. Mellema et al. 2015; Giri 2019).

The 21-cm signal produced during EoR is highly non-Gaussian and therefore cannot be completely characterized by the 21-cm power spectrum (e.g. Majumdar et al. 2018). Therefore, we need to explore other summary statistics that are sensitive to the non-Gaussian information contained in the signal. Examples of such summary statistics that have been studied previously are the bispectrum (Majumdar et al. 2018; Watkinson et al. 2019), the position-dependent power spectrum (Giri et al. 2019b), size distributions (Kakiichi et al. 2017; Giri et al. 2018a, 2019a; Busch et al. 2020), and fractal dimensions (Bandyopadhyay, Choudhury & Seshadri 2017).

Yet another probe of the non-Gaussian information in the 21-cm signal is a summary statistics that describes the topology of the signal. This topology is quite sensitive to the properties of ioninzing sources (e.g. Hutter et al. 2021). Already Gnedin (2000) used the topology of the H ii (or ionized) regions to heuristically describe the EoR to consist of three stages, namely the preoverlap, overlap, and post-overlap stage. Subsequent studies quantified the topology using the genus g or its equivalent the Euler characteristic χ (χ = 2 − 2g) of the neutral hydrogen distribution (e.g. Gleser et al. 2006; Lee et al. 2008) or the ionization fraction field (Friedrich et al. 2011). These works could build on the extensive literature about the use of χ (or g) on the large-scale structure (e.g. Gott, Dickinson & Melott 1986; Gott et al. 1992; Matsubara 1994; Matsubara & Yokoyama 1996; Schmalzing & Buchert 1997; Park, Kim & Gott 2005).

The Euler characteristic is also equivalent to the third of the Minkowski functionals V₃ and some works have considered the full set of these topological quantities (V₀–V₃; Friedrich et al. 2011; Yoshiura et al. 2017; Chen et al. 2019). Based on the evolution of the Minkowski functionals the latter study proposed a five stage description of the reionization process.

The Euler characteristic (χ) for a 3D data set can be written as

$$\begin{eqnarray*} \chi =N_\mathrm{o}-N_\mathrm{t}+N_\mathrm{c}, \end{eqnarray*}$$

(1)

where the terms on the right-hand side are the number of isolated objects, tunnels, and cavities. These terms are also known as the Betti numbers. In this paper we explore the information contained in the Betti numbers of the distribution of ionized regions during reionization. It is obvious that they contain more information compared to χ (e.g. van de Weygaert et al. 2011; Chingangbam et al. 2012; Park et al. 2013). Specifically, they allow us to study the hierarchical aspect of the topological features of ionized bubbles as reionization proceeds. We consider how the Betti numbers evolve during the different stages of reionization, how they can describe the state of the IGM, and whether the additional information they contain makes them more useful than the more commonly studied Euler characteristic/genus.

During reionization the distribution of ionized bubbles will imprint its topology on the observable 21-cm signal. Therefore, we will also discuss the prospects of measuring the Betti numbers of 3D structures in the upcoming 21-cm observations with SKA-Low, considering both resolution and noise effects. As far as we know there exists very few work that considered the Betti numbers in the context of reionization. Elbers & Van De Weygaert (2019) studied the Betti numbers in heuristic models of reionization that they called the H ii bubble network. Kapahtia et al. (2018, 2021) and Kapahtia, Chingangbam & Appleby (2019) only studied the Betti numbers for 2D 21-cm images. However, reionization is a 3D process and SKA-Low will produce 3D data cubes consisting of images at a range of frequencies. We will therefore study the three Betti numbers associated with the 3D topology.

There exists even more metrics to study the topology, such as the persistence homology (e.g. Zomorodian & Carlsson 2005; Edelsbrunner & Harer 2008) that is gaining attention in cosmology (e.g. van de Weygaert et al. 2010, 2011; Sousbie 2011; Sousbie, Pichon & Kawahara 2011; Pranav et al. 2017; Makarenko et al. 2018; Wilding et al. 2020). In persistence homology, the evolution of each κ-dimensional holes in the analysed field is tracked. Elbers & Van De Weygaert (2019) have used persistence homology to study the H ii bubble network. We defer the exploration of persistence homology of 21-cm data sets to future studies.

The paper is structured as follows. In the next section, we describe the topological concepts used throughout the paper. Section 3 explains explains our large-scale reionization simulations and the methods used to construct the 21-cm signal from simulation outputs. In Sections 4 and 5, we discuss the properties and evolution of the Betti numbers of the ionization and 21-cm signal fields, respectively. We discuss the detectability of the Betti numbers in 21-cm observations of upcoming radio telescopes in Section 6. In the final section, we summarize our findings.

2 TOPOLOGICAL MEASURES

In this section, we describe the methods we use to measure the topology of our data. In Section 2.1, we explain the metric we will use to quantify the topology of any field. The fields we analyse in this study are given in form of digital data. Therefore, we define an estimator for measuring the topology of digital data in Section 2.2.

2.1 Betti numbers

In this study, we quantify the topology of structures in our field with the Betti numbers (β_κ) that are topological invariant quantities (Betti 1870). This means they remain unchanged under transformations, such as translation, rotation, and deformation. β_κ is defined as the number of κ-dimensional holes in the structure we analyse. Our data sets are 3D and therefore their topology can be defined with three Betti numbers where each have a simple and intuitive meaning. The zeroth Betti number (β₀), first Betti number (β₁), and second Betti number (β₂) probe the number of isolated objects, tunnels, and cavities, respectively.

In its simplest form, the reionization process can be pictured as a network formed by bubbles of ionized hydrogen gas. In Fig. 1 we show examples of structures formed by spherical bubbles. The left-hand and right-hand panels show projections of 3D data sets containing one and five spherical bubbles, respectively. As they both consist of a single connected structure, both data sets have β₀ = 1. The data set in the left-hand panel has no tunnel, hence its β₁ = 0. The five bubbles in the right-hand panel form a torus, which contains a tunnel, giving β₁ = 1. The projection does not reveal the inner structure of the spheres. If we assume that the single sphere in the left-hand panel has a hollow centre, then this single cavity will give β₂ = 1. If we assume that the spheres in the right-hand panel are all filled, this gives β₂ = 0.

Figure 1.

Projection of two different configurations of spheres where the one of the left is hollow. These configurations have Betti numbers β_κ = (1, 0, 1) (left) and β_κ = (1, 1, 0) (right), giving Euler characteristics χ = 2 and χ = 0, respectively.

Open in new tab Download slide

In previous works, the Euler characteristics (χ) that describes the integral of the Gaussian curvature over the surface of the structure has been used to study the topology of reionization (e.g. Friedrich et al. 2011; Yoshiura et al. 2017). χ is related to the Betti numbers as (e.g. Pranav et al. 2019),

$$\begin{eqnarray*} \chi = \beta _0 - \beta _1 + \beta _2 \ \end{eqnarray*}$$

(2)

(also see equation 1). As the Betti numbers are components of χ, two data sets can have the same χ but different sets of Betti numbers. For example, the data set shown in the right-hand panel of Fig. 1 has β_κ = {1, 1, 0} so χ = 0, which is the same as for the case when there is no structure. Such a situation is quite plausible in the bubble network produced by reionization and therefore Betti numbers should be better than χ for distinguishing the topologies of different reionization models. We refer the readers to Hatcher (2002) and Edelsbrunner & Harer (2008, 2010) for more mathematically rigorous descriptions of these topological quantities.

2.2 Estimators for digital data

All the fields that we work with in this study are digital data, which is a discrete, discontinuous representation of a given field. In order to capture the topology of the structures in our field, we need to decompose the structures into well-defined geometrical components in such a way that the topological properties of the structure are retained. We achieve this by constructing a cubical complex (e.g. Kaczynski, Mischaikow & Mrozek 2004; Wagner, Chen & Vuçini 2012), which is composed of points, line segments, squares, and cubes, for any given field. We refer the readers to Wagner et al. (2012) and Giri et al. (2019a) for more information about our method to construct and store our cubical complex.

2.2.1 Betti numbers

We follow the method described in Gonzalez-Lorenzo et al. (2016) to estimate the Betti numbers. Each isolated object is a group of connected pixels in our data set. We use the connected-component labelling module in scikit-image package¹ (van der Walt et al. 2014). We refer inquisitive readers to Fiorio & Gustedt (1996) and Wu, Otoo & Shoshani (2005) for the algorithm used to label all the connected regions. This method needs less computation time compared to the classical Hoshen–Kopelman algorithm (Hoshen & Kopelman 1976). It is similar to the friends-of-friend algorithm used to find clusters in N-body simulations (e.g. Press & Davis 1982; Davis et al. 1985). The number of regions labelled gives the value of β₀.

From the Alexander duality of any topological complex of three dimensions, we can find the β₂ by estimating the β₀ of the complementary field of the data set (e.g. Hatcher 2002; Gonzalez-Lorenzo et al. 2016). The remaining first Betti number β₁ is not calculated directly, but derived from the Euler characteristics (χ) of the data set (as described in Section 2.2.2) according to β₁ = β₀ + β₂ − χ. This method of calculating the β₁ was introduced by Gonzalez-Lorenzo et al. (2016). We have tested our estimator on a Gaussian random field (GRF). The results are presented in Appendix A.

2.2.2 Euler characteristics

Once we have constructed the cubical complex for any given digital data, we can estimate the Euler characteristic χ as

$$\begin{eqnarray*} \chi = \sum ^{n_\mathrm{d}}_{i=0} (-1)^i \kappa _i , \end{eqnarray*}$$

(3)

where n_d is the dimension of the data set. κ_i is the number of cubes of dimension i in the constructed cubical complex. In our data set, dimensions of points, line segments, squares, and cubes are 0, 1, 2, and 3, respectively. See appendix A of Giri et al. (2019a) for more details.

3 SIMULATION OF 21 CM SIGNAL

To test the usefulness of the Betti numbers we make use of simulated data. This section describes our methodology for simulating reionization and the 21-cm signal. Section 3.1 describes the large-scale reionization simulations we use and Section 3.2 outlines how we calculate the observable 21-cm signal from them. Finally we explain how we include telescope effects, such as the limited angular resolution and telescope noise in Section 3.3.

3.1 Reionization simulations

We use two different types of reionization simulations for our study, fully numerical radiative transfer simulations, and so-called semi-numerical simulations that rely on sophisticated form of photon counting.

3.1.1 Fully numerical simulations

The fully numerical simulations use two steps. We first simulate the evolution of the matter distribution using the N-body code CUBEP³M (Harnois-Déraps et al. 2013). The N-body simulation is run in a cosmological volume of 714 comoving Mpc (cMpc) with 6912³ particles. We use such a large cosmological volume as they are important to capture the large-scale fluctuations (Giri et al. 2019b; Ghara et al. 2020) and the large-scale topological structure (Giri et al. 2019a) and are also a match to the Field of View of SKA-Low. See Iliev et al. (2014) for a discussion about the importance of the size of cosmological volumes for reionization simulations. Our N-body simulation run outputs snapshots at each 11.5 Myr. We obtain the density field by assigning mass to a cubic grid with 300 cells along each direction by smoothing with a smooth-particle-hydrodynamics kernel (Monaghan 1992; Iliev et al. 2014). The cosmological parameters used here are Ω_m = 0.27, Ω_k = 0, Ω_b = 0.044, h = 0.7, n_s = 0.96, and σ₈ = 0.8. These values are consistent with the results from Wilkinson Microwave Anisotropy Probe (WMAP; Komatsu et al. 2011) and Planck (Planck Collaboration 2016, 2018) collaborations.

The reionization process is driven by sources of ionizing photons that form due to the collapse of baryonic matter in dark matter haloes. We determine the position and mass of dark matter haloes using the spherical overdensity halo-finder discussed in Watson et al. (2013). We identify haloes with masses |$M_\mathrm{halo} \ge 10^9 \, \mathrm{M}_\odot$|⁠. In these haloes, gas can cool through excitation of atomic hydrogen and they are relatively unaffected by radiative feedback (Sullivan, Iliev & Dixon 2018). We follow the convention in Dixon et al. (2016) and call these haloes high-mass atomic-cooling haloes (HMACHs). We model the sources by assigning the ionizing photon production rate, |$\dot{N}_\gamma$|⁠, to be proportional to the total mass in haloes within that cell, M. |$\dot{N}_\gamma$| is given as

$$\begin{eqnarray*} \dot{N}_\gamma = \zeta \frac{ M \Omega _\mathrm{b}}{m_\mathrm{p}\Omega _\mathrm{m} }\frac{1}{f_\mathrm{coll}} \frac{\mathrm{d}f_\mathrm{coll}}{\mathrm{d}t}, \end{eqnarray*}$$

(4)

where ζ, m_p, and f_coll are the source efficiency, mass of proton, and total collapsed mass fraction in HMACHs, respectively.

To simulate the state of gas in the intergalactic medium, we next post-process the snapshots from CUBEP³M with the fully numerical radiative transfer code C²-RAY (Mellema et al. 2006a). C²RAY employs the short characteristics ray tracing method to solve the radiative transfer equation (Raga et al. 1999; Lim & Mellema 2003). The ray-tracing is performed up to a comoving distance of 71 Mpc from each source. This limit is meant to approximate the effect of the presence of optically thick absorbers in the IGM that are unresolved in our N-body simulation. This value is consistent with the measurement of the mean free path by Songaila & Cowie (2010). For more details about the simulations we refer the reader to Iliev et al. (2006), Mellema et al. (2006b), and Dixon et al. (2016).

In this work, we have run two scenarios (FN1 and FN2) using the methods described above. FN1 and FN2 use ζ values of 50 and 40, respectively. We show the history of these reionization scenarios in Fig. 2. As the sources are less efficient in FN2 compared to the FN1 scenario, reionization is delayed in the latter. The values of the source parameters are listed in Table 1. The Thompson scattering optical depth τ calculated from these simulations are within the 95 per cent credible interval of the constraints from Planck Collaboration (2018). The estimated τ are also given in Table 1.

$The volume weighted average ionization fraction of our reionization simulations varying over the redshifts.$

Figure 2.

The volume weighted average ionization fraction of our reionization simulations varying over the redshifts.

Open in new tab Download slide

Table 1.

Open in new tab

Reionization simulation parameters of various source models. We give the ionizing source parameters that are ionizing efficiency (ζ), minimum mass (M_min), and mean free path of ionizing photons (R_mfp). The Thompson scattering optical depth (τ) derived from each simulations are also given here.

Label	Code	Box size (Mpc)	Mesh	ζ	M_min(M_⊙)	R_mfp (Mpc)	τ
FN1	CUBEP³M + C²RAY	714	300³	50	10⁹	71	0.058
FN2	CUBEP³M + C²RAY	714	300³	40	10⁹	71	0.050
SN1	21cmfast	714	300³	50	10⁹	71	0.064
SN2	21cmfast	714	300³	50	10⁸	71	0.084
SN3	21cmfast	714	300³	50	10⁹	20	0.062

Label	Code	Box size (Mpc)	Mesh	ζ	M_min(M_⊙)	R_mfp (Mpc)	τ
FN1	CUBEP³M + C²RAY	714	300³	50	10⁹	71	0.058
FN2	CUBEP³M + C²RAY	714	300³	40	10⁹	71	0.050
SN1	21cmfast	714	300³	50	10⁹	71	0.064
SN2	21cmfast	714	300³	50	10⁸	71	0.084
SN3	21cmfast	714	300³	50	10⁹	20	0.062

Table 1.

Open in new tab

Label	Code	Box size (Mpc)	Mesh	ζ	M_min(M_⊙)	R_mfp (Mpc)	τ
FN1	CUBEP³M + C²RAY	714	300³	50	10⁹	71	0.058
FN2	CUBEP³M + C²RAY	714	300³	40	10⁹	71	0.050
SN1	21cmfast	714	300³	50	10⁹	71	0.064
SN2	21cmfast	714	300³	50	10⁸	71	0.084
SN3	21cmfast	714	300³	50	10⁹	20	0.062

Label	Code	Box size (Mpc)	Mesh	ζ	M_min(M_⊙)	R_mfp (Mpc)	τ
FN1	CUBEP³M + C²RAY	714	300³	50	10⁹	71	0.058
FN2	CUBEP³M + C²RAY	714	300³	40	10⁹	71	0.050
SN1	21cmfast	714	300³	50	10⁹	71	0.064
SN2	21cmfast	714	300³	50	10⁸	71	0.084
SN3	21cmfast	714	300³	50	10⁹	20	0.062

3.1.2 Semi-numerical simulations

The fully numerical simulations are computationally expensive. Therefore, to study the impact of source parameters on the topology measured with the Betti numbers, we use the fast semi-numerical code 21cmfast (Mesinger, Furlanetto & Cen 2011). In this code the density field is constructed by evolving the initial density field using second-order Lagrangian perturbation theory (2LPT; Scoccimarro 1998). 21cmfast builds the ionization field using the excursion-set approach developed in Furlanetto, Zaldarriaga & Hernquist (2004). A cell |$\pmb {x}$| in the simulation volume is labeled as ionized when it satisfies the following condition,

$$\begin{eqnarray*} f_\mathrm{coll}(\pmb {x}, R_\mathrm{s}, M_\mathrm{min}) \gt \zeta ^{-1} \ , \end{eqnarray*}$$

(5)

where |$f_\mathrm{coll}(\pmb {x}, R_\mathrm{s}, M_\mathrm{min})$| is the fraction of matter inside a sphere of radius R_s that has collapsed into haloes of mass larger than M_min (e.g. Bond et al. 1991; Sheth & Tormen 1999). ζ is again the ionizing efficiency describing the amount of ionizing photons produced by the collapsed mass. This efficiency factor is in principle the same as the one defined in equation (4).

We have run three semi-numerical reionization scenarios (SN1, SN2, and SN3) by varying the source and sink parameters in 21cmfast. The values of the parameters are given in Table 1. We also show the reionzation history of these models in Fig. 2. Even though scenario SN1 has the same parameters as FN1, their reionization histories are not identical. This is because SN1 does not explicitly include the photon losses due to recombinations whereas FN1 does. In 21cmfast these losses are assumed to be accounted for in the value of ζ [see for e.g. equation (2) in Greig & Mesinger (2015)].

3.2 Redshifted 21 cm signal

When the 21-cm signal is observed with an interferometry-based radio telescope, the recorded observable is known as the differential brightness temperature. The observed differential brightness temperature δT_b of the 21-cm signal, which is given as (e.g. Mellema et al. 2013)

$$\begin{eqnarray*} \delta T_\mathrm{b} (\pmb {x},z) & \approx & 27 x_\mathrm{HI} (\pmb {x},z) \big (1 + \delta (\pmb {x},z)\big)\left(\frac{1+z}{10} \right)^\frac{1}{2} \left(1 -\frac{T_\mathrm{CMB}(z)}{T_\mathrm{s}(\pmb {x},z)} \right)\nonumber \\ &&\left(\frac{\Omega _\mathrm{b}}{0.044}\frac{h}{0.7}\right) \left(\frac{\Omega _\mathrm{m}}{0.27} \right)^{-\frac{1}{2}} \left(\frac{1-Y_\mathrm{p}}{1-0.248}\right) \mathrm{mK}\ , \end{eqnarray*}$$

(6)

where x_HI and δ are the fraction of neutral hydrogen and the density fluctuation, respectively. T_s is the excitation temperature of the two spin states of the neutral hydrogen, known as the spin temperature. T_CMB(z) is the cosmic microwave background (CMB) temperature at redshift z and Y_p is the primordial helium abundance.

Here we will work with the assumption of high-spin temperature T_s ≫ T_CMB. During the EoR this is a reasonable assumption as even relatively low levels of X-ray radiation can raise the gas temperature above the CMB temperature (e.g. Pritchard & Furlanetto 2007; Pritchard & Loeb 2012). In the high-spin temperature limit, δT_b is independent of T_s.

We construct the 21-cm signal at a given redshift z from equation (6) using the corresponding 3D neutral fraction and matter density cubes from the C²-RAY and CUBEP³M simulations. Each of these 3D data sets represents the characteristics of the signal at the corresponding redshift z and therefore called coeval cubes (CCs). We transform these CCs into image cubes by assuming one of the axes to be the frequency axis and the other two position in the sky. This procedure implies that we neglect the two line-of-sight effects: redshift space distortions (RSD; see e.g. Jensen et al. 2013) and the light-cone (LC) effect (see e.g. Datta et al. 2012) that would introduce minor distortions along the line of sight. We use our publicly available analysis code, tools21cm² (Giri, Mellema & Jensen 2020a), to simulate the redshifted 21-cm signal cubes.

3.3 Mock SKA-Low observation

In this section, we explain our methods for constructing mock observation with SKA-Low.

3.3.1 Limited resolution

As our simulated 21-cm signal cubes are represented as digital data, they have an intrinsic resolution below which we do not have any information. The simulation resolution of our data set is Δx ≈ 2.4 cMpc. This simulation resolution corresponds to different angular scales on the sky at different z, which can be given as

$$\begin{eqnarray*} \Delta \theta (z) = \frac{\Delta x}{D_\mathrm{c}(z)}\, , \end{eqnarray*}$$

(7)

where D_c(z) is the comoving distance to redshift z. Similarly the frequency scales also vary over redshifts, which can be given as

$$\begin{eqnarray*} \Delta \nu = \frac{\nu _0 H(z) \Delta x}{c(1+z)^2}\, , \end{eqnarray*}$$

(8)

where ν₀ is the rest frequency of the 21-cm line, H(z) the Hubble parameter at redshift z and c the speed of light. For z = 8, these equations give values Δθ = 0.881 arcmin and Δν = 0.133 MHz.

Apart from the simulation resolution, we will also be limited by the resolution of the telescope. As SKA-Low is an interferometry based radio telescope, it observes the sky in so-called uv space that corresponds to the Fourier transform of the sky. We refer the readers to Rohlfs & Wilson (2013) for details about radio telescope observations.

A radio interometric telescope comprises of radio antennae placed at different locations. Each antennae pair forms a baseline and it observes the signal at an angular scale proportional to the length of the baseline. For observing the 21-cm signal this angular scale can be written as

$$\begin{eqnarray*} \Delta \theta _\mathrm{telescope} (z) = \frac{21(1+z)}{B}, \end{eqnarray*}$$

(9)

where B is the length of the baseline in cm. The first generation of SKA-Low will be sparsely filled with antennae outside a diameter of 2 km around the centre (e.g. Mellema et al. 2013). Therefore, a 21-cm data set produced with baselines longer than 2 km will be quite noisy. In this work we will approximate the synthesized beam of SKA-Low by a Gaussian with a full width half-maximum (FWHM) given by equation (9) with B = 2 km. For z = 8 this gives Δθ_telescope ≈ 3 arcmin.

An interferometric radio telescope also has a smallest baseline that sets the largest scales that it can image. To implement this effect, we subtract its average value from each slice of the 21-cm signal data set along the frequency (or redshift) axis. This effect means that interferometric images lack absolute calibration of the signal. Finally, to attain isotropic resolution in our data set we convolve the signal along the frequency axis with a tophat filter of the same physical width as the Gaussian beam. For z = 8 this gives Δν ≈ 0.5 MHz.

3.3.2 System noise

The 21-cm observations with SKA-Low will also be prone to instrumental noise. In order to simulate this noise, we follow the approach in Ghara et al. (2017) and Giri, Mellema & Ghara (2018b). We model the effects using the currently available configuration³ of the first phase of SKA-Low that has 512 antennae stations each with a diameter of 35 m. We present the telescope parameters that we use in this study in Table 2.

Table 2.

Open in new tab

The parameters used in this study to model the telescope properties.

Parameters	Values
System temperature (T_sys)	\|$60 (\frac{\nu }{300\mathrm{MHz}})^{-2.55}$\| K
Effective collecting area (A_D)	962 m²
Critical frequency (ν_c)	110 MHz
Declination	−30°
Observation hour per day	6 h
Signal integration time	10 s

Parameters	Values
System temperature (T_sys)	\|$60 (\frac{\nu }{300\mathrm{MHz}})^{-2.55}$\| K
Effective collecting area (A_D)	962 m²
Critical frequency (ν_c)	110 MHz
Declination	−30°
Observation hour per day	6 h
Signal integration time	10 s

Table 2.

Open in new tab

The parameters used in this study to model the telescope properties.

Parameters	Values
System temperature (T_sys)	\|$60 (\frac{\nu }{300\mathrm{MHz}})^{-2.55}$\| K
Effective collecting area (A_D)	962 m²
Critical frequency (ν_c)	110 MHz
Declination	−30°
Observation hour per day	6 h
Signal integration time	10 s

Parameters	Values
System temperature (T_sys)	\|$60 (\frac{\nu }{300\mathrm{MHz}})^{-2.55}$\| K
Effective collecting area (A_D)	962 m²
Critical frequency (ν_c)	110 MHz
Declination	−30°
Observation hour per day	6 h
Signal integration time	10 s

We explore two observation time (t_obs). One is an optimistic case of t_obs = 1000 h and other is a pessimistic case of t_obs = 100 h. At simulation resolution, the system noise at 1000 and 100 h will be very high. As the this noise is uncorrelated while the signal is correlated, the signal-to-noise ratio (SNR) will go up when the resolution is decreased. As discussed earlier, SKA-Low will have a limited resolution defined by the maximum baseline. Previous studies have shown that we can get interesting image data with 1000 h observation at a resolution corresponding to B = 2 km (e.g. Mellema et al. 2015; Ghara et al. 2017; Giri et al. 2018b, 2019a). However the SNR will be much lower for a 100 h observation at the same resolution. To achieve a similar SNR, we reduce the resolution by a factor of 2 for the 100 h mock observations.

We would like to note that the noise model we use assumes perfect calibration. In reality it is not always possible to reach this theoretical noise level. Therefore, to achieve the same SNR values as in our example with the real SKA-Low might require longer observation times or use of a lower resolution.

4 BETTI NUMBERS OF IONIZATION FIELD

The formation, merger, and evolution of ionized bubbles during reionization results in a complex morphology of H ii regions. In this section we investigate the Betti numbers estimated from the ionization field in our simulations.

4.1 Evolution during reionization

Fig. 3 shows the evolution of Betti numbers estimated from the ionization field of our FN1 simulations by defining structures using a threshold of |$x_\mathrm{H\, \rm {ii}}=0.5$|⁠. These Betti numbers trace the evolution topology of H ii regions during reionization. Here and in the remainder of this paper we plot the evolution of the Betti numbers against the fraction of our simulation volume that has been ionized, rather than time or redshift. Using the ionized volume fraction |$x_\mathrm{H\, \rm {ii}}^\mathrm{V}$| connects the evolution of the Betti numbers to the different stages of reionization and also makes it easier to compare the results of different reionization scenarios in which reionization may have evolved differently.

$Evolution of the Betti numbers estimated from the $x_\mathrm{H\, \rm {ii}}$ field of the FN1 simulation. The black solid lines are the fits to the curves. See text for a description of these fits and Table 3 for the fit parameters.$

Figure 3.

Evolution of the Betti numbers estimated from the |$x_\mathrm{H\, \rm {ii}}$| field of the FN1 simulation. The black solid lines are the fits to the curves. See text for a description of these fits and Table 3 for the fit parameters.

Open in new tab Download slide

When reionization starts we see an increase in β₀. This increase in β₀ is due to the appearance of ionized bubbles around the sources emitting ionizing photons. As time passes, more sources form and ionize the hydrogen gas in the IGM around them. In the initial stages of reionization we not only see appearance of new ionized bubbles but also overlap of bubbles. A comparison between the value of β₀ and the number of sources shows that such overlap occurs right from the start of reionization. The total number of bubbles will decrease when multiple ionized bubbles overlap. When the rate by which H ii regions start to overlap becomes larger than the formation rate of new isolated ionized bubbles, the value of β₀ will start to decrease. In our model, we observe this turn-around at |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\approx 0.2$|⁠. The value of β₀ keeps decreasing as reionization proceeds and converges to unity at around |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\approx 0.7$|⁠.

At |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\approx 0.1$|⁠, β₁ starts to increase. β₁ measures the number of tunnels and these tunnels can be formed by overlapping bubbles (see Section 2.1 for a simple example). Therefore, the growth of β₁ is a sign of substantial bubble overlap modulating the topology of the ionization field. As reionization progresses, the number of tunnels increase until it reaches a maximum value beyond which β₁ starts to decrease. In the FN1 simulation this peak is reached around |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\approx 0.55$|⁠. Unlike β₀, the value of β₁ remains much larger than 1 until the end of reionization.

Cavities inside H ii regions start forming during the middle stages of reionization, as indicated by the slow rise of β₂, starting around |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\approx 0.2$|⁠. We can visualize these cavities as isolated neutral regions. As reionization progresses initially large connected neutral regions breakup and form multiple isolated neutral regions. The value of β₂ attains a maximum around |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\approx 0.9$| after which it falls to zero once reionization completes. In any inside-out reionization scenario such as FN1, these later isolated neutral regions trace the cosmic voids. Giri et al. (2019a) noted that the number of isolated neutral regions in the later stages of reionization are relatively rare compared to the number of isolated ionized regions at the start of reionization. This is confirmed by comparing the maximum values of the β₀ and β₂ curves, which shows that the latter is about 3.5 times lower.

From the three curves we obtain three clear maxima, as well as the crossing points between β₀ and β₁ as well as between β₁ and β₂. As β₁ enters the calculation of the Euler characteristic χ with a different sign than β₀ and β₂, the above evolutionary patterns cannot be distinguished when considering χ. It is for example impossible to separate the early rise of the number of tunnels and the decrease of the number of isolated regions due to overlap. At |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\approx 0.35$| β₀ ≈ β₁. However the small positive contribution from β₂ at that time will push the location of χ = 0 to a slightly later stage.

4.2 Connection between Betti numbers and percolation

These crossing points of Betti curves appear to be connected to the formation of the percolation clusters. Furlanetto & Oh (2016) showed that a large network of overlapping ionized bubbles spanning the entire simulation volume forms during the early stages of reionization. This network is known as the percolation cluster. A similar percolation event⁴ takes place towards the end of reionization when the last network of neutral regions spanning the entire volume disappears. As pointed out by Furlanetto & Oh (2016) this type of percolation behaviour is a hallmark of phase transitions.

Fig. 4 shows the percolation curves, i.e. the volume fraction of H ii regions contained in the percolation cluster, for all our simulations. Here we focus on the percolation curve of the FN1 scenario; the percolation curves of other scenarios are discussed in Section 4.5. We see that the appearance of the percolation cluster coincides with the time when β₁ surpasses β₀. Likewise we find that the disappearance of the neutral percolation cluster coincides with the time when β₁ ≈ β₂. It was recently proposed by Santos et al. (2019) and Bobrowski & Skraba (2020) that such a connection between Betti numbers and percolation may be universal.

$Percolation curves for the ionized percolation cluster. The number on the ordinate, V∞/Vtot, is the fraction of the total ionized volume contained in the percolation cluster. The different curves show the evolution of this quantity for our five different reionization scenarios.$

Figure 4.

Percolation curves for the ionized percolation cluster. The number on the ordinate, V_∞/V_tot, is the fraction of the total ionized volume contained in the percolation cluster. The different curves show the evolution of this quantity for our five different reionization scenarios.

Open in new tab Download slide

4.3 Parametrization of the evolution of Betti numbers

The evolution curves for each of the Betti numbers for scenario FN1 have strikingly regular shapes. After inspection of the other scenarios we find that the shapes appear to be independent of the source model driving the reionization process. We will discuss these additional reionization scenarios in Section 4.5. The curves for β₀ and β₂ seem to match a lognormal shape, whereas β₁ appears to have a Gaussian shape. We therefore propose the following parametric forms for each Betti number,

$$\begin{eqnarray*} \beta _0(x^\mathrm{v}_\mathrm{H\, \rm {ii}}) = A_0 \exp {\left[-\frac{1}{2}\left(\frac{\ln x^\mathrm{v}_\mathrm{H\, \rm {ii}} - \ln \mu _0}{\sigma _0} \right)^2 \right]} \ , \end{eqnarray*}$$

(10)

$$\begin{eqnarray*} \beta _1(x^\mathrm{v}_\mathrm{H\, \rm {ii}}) = A_1 \exp {\left[-\frac{1}{2}\left(\frac{x^\mathrm{v}_\mathrm{H\, \rm {ii}} - \mu _1}{\sigma _1} \right)^2 \right]} \ , \end{eqnarray*}$$

(11)

$$\begin{eqnarray*} \beta _2(x^\mathrm{v}_\mathrm{H\, \rm {ii}}) = A_2 \exp {\left[-\frac{1}{2}\left(\frac{\ln (1-x^\mathrm{v}_\mathrm{H\, \rm {ii}}) - \ln \mu _2}{\sigma _2} \right)^2 \right]} \ , \end{eqnarray*}$$

(12)

where A_i, μ_i, and σ_i are the fitting parameters. A_i models the height of the curves. μ_i and σ_i define the peak position and width of the curves. We show the fits to the Betti numbers curve with black solid lines in Fig. 3. The fit parameters are listed in Table 3. Here and in the rest of this paper we use 10-fold cross validation to estimate the goodness of the fit (e.g. Hastie, Tibshirani & Friedman 2009; Giri et al. 2020b).

Table 3.

Open in new tab

The values of parameters when we model the Betti numbers using equations (10), (11), and (12). The fit parameters for |$x_\mathrm{H\, \rm {ii}}$| are Betti number curves estimated from the field at simulation resolution. The fit parameters for δT_b are Betti number curves estimated from the mock SKA-Low 21-cm images. We give the fit parameters corresponding to mock observations at t_obs = 1000 h and B = 2 km. The fit parameters given in brackets correspond to mock observations at t_obs = 100 h and B = 1 km.

	Fit		\|$\pmb {x_\mathrm{H\, \rm {ii}}}$\|			\|$\pmb {\delta T_\mathrm{b}}$\|
Models	Parameters	β₀	β₁	β₂	β₀	β₁	β₂
FN1	A_i	1409	1886	415	50 (–)	512 (443)	92 (80)
	μ_i	0.25	0.55	0.10	0.27 (–)	0.61 (0.61)	0.04 (0.03)
	σ_i	0.71	0.15	0.43	0.55 (–)	0.16 (0.18)	0.33 (0.34)
FN2	A_i	1443	2092	476	156 (–)	562 (500)	90 (80)
	μ_i	0.27	0.55	0.11	0.29 (–)	0.63 (0.62)	0.04 (0.03)
	σ_i	0.75	0.14	0.46	0.72 (–)	0.15 (0.18)	0.38 (0.36)
SN1	A_i	1195	1652	112	161 (–)	470 (371)	43 (45)
	μ_i	0.21	0.56	0.06	0.24 (–)	0.61 (0.63)	0.03 (0.05)
	σ_i	0.63	0.16	0.37	0.69 (–)	0.16 (0.15)	0.26 (0.45)
SN2	A_i	1351	2168	223	66 (–)	461 (342)	43 (25)
	μ_i	0.25	0.56	0.08	0.31 (–)	0.60 (0.64)	0.02 (0.05)
	σ_i	0.69	0.15	0.41	0.64 (–)	0.19 (0.16)	0.19 (0.25)
SN3	A_i	1206	2048	800	138 (–)	689 (688)	130 (200)
	μ_i	0.21	0.55	0.09	0.24 (–)	0.61 (0.67)	0.03 (0.05)
	σ_i	0.64	0.15	0.48	0.68 (–)	0.16 (0.11)	0.19 (0.45)

	Fit		\|$\pmb {x_\mathrm{H\, \rm {ii}}}$\|			\|$\pmb {\delta T_\mathrm{b}}$\|
Models	Parameters	β₀	β₁	β₂	β₀	β₁	β₂
FN1	A_i	1409	1886	415	50 (–)	512 (443)	92 (80)
	μ_i	0.25	0.55	0.10	0.27 (–)	0.61 (0.61)	0.04 (0.03)
	σ_i	0.71	0.15	0.43	0.55 (–)	0.16 (0.18)	0.33 (0.34)
FN2	A_i	1443	2092	476	156 (–)	562 (500)	90 (80)
	μ_i	0.27	0.55	0.11	0.29 (–)	0.63 (0.62)	0.04 (0.03)
	σ_i	0.75	0.14	0.46	0.72 (–)	0.15 (0.18)	0.38 (0.36)
SN1	A_i	1195	1652	112	161 (–)	470 (371)	43 (45)
	μ_i	0.21	0.56	0.06	0.24 (–)	0.61 (0.63)	0.03 (0.05)
	σ_i	0.63	0.16	0.37	0.69 (–)	0.16 (0.15)	0.26 (0.45)
SN2	A_i	1351	2168	223	66 (–)	461 (342)	43 (25)
	μ_i	0.25	0.56	0.08	0.31 (–)	0.60 (0.64)	0.02 (0.05)
	σ_i	0.69	0.15	0.41	0.64 (–)	0.19 (0.16)	0.19 (0.25)
SN3	A_i	1206	2048	800	138 (–)	689 (688)	130 (200)
	μ_i	0.21	0.55	0.09	0.24 (–)	0.61 (0.67)	0.03 (0.05)
	σ_i	0.64	0.15	0.48	0.68 (–)	0.16 (0.11)	0.19 (0.45)

Table 3.

Open in new tab

	Fit		\|$\pmb {x_\mathrm{H\, \rm {ii}}}$\|			\|$\pmb {\delta T_\mathrm{b}}$\|
Models	Parameters	β₀	β₁	β₂	β₀	β₁	β₂
FN1	A_i	1409	1886	415	50 (–)	512 (443)	92 (80)
	μ_i	0.25	0.55	0.10	0.27 (–)	0.61 (0.61)	0.04 (0.03)
	σ_i	0.71	0.15	0.43	0.55 (–)	0.16 (0.18)	0.33 (0.34)
FN2	A_i	1443	2092	476	156 (–)	562 (500)	90 (80)
	μ_i	0.27	0.55	0.11	0.29 (–)	0.63 (0.62)	0.04 (0.03)
	σ_i	0.75	0.14	0.46	0.72 (–)	0.15 (0.18)	0.38 (0.36)
SN1	A_i	1195	1652	112	161 (–)	470 (371)	43 (45)
	μ_i	0.21	0.56	0.06	0.24 (–)	0.61 (0.63)	0.03 (0.05)
	σ_i	0.63	0.16	0.37	0.69 (–)	0.16 (0.15)	0.26 (0.45)
SN2	A_i	1351	2168	223	66 (–)	461 (342)	43 (25)
	μ_i	0.25	0.56	0.08	0.31 (–)	0.60 (0.64)	0.02 (0.05)
	σ_i	0.69	0.15	0.41	0.64 (–)	0.19 (0.16)	0.19 (0.25)
SN3	A_i	1206	2048	800	138 (–)	689 (688)	130 (200)
	μ_i	0.21	0.55	0.09	0.24 (–)	0.61 (0.67)	0.03 (0.05)
	σ_i	0.64	0.15	0.48	0.68 (–)	0.16 (0.11)	0.19 (0.45)

	Fit		\|$\pmb {x_\mathrm{H\, \rm {ii}}}$\|			\|$\pmb {\delta T_\mathrm{b}}$\|
Models	Parameters	β₀	β₁	β₂	β₀	β₁	β₂
FN1	A_i	1409	1886	415	50 (–)	512 (443)	92 (80)
	μ_i	0.25	0.55	0.10	0.27 (–)	0.61 (0.61)	0.04 (0.03)
	σ_i	0.71	0.15	0.43	0.55 (–)	0.16 (0.18)	0.33 (0.34)
FN2	A_i	1443	2092	476	156 (–)	562 (500)	90 (80)
	μ_i	0.27	0.55	0.11	0.29 (–)	0.63 (0.62)	0.04 (0.03)
	σ_i	0.75	0.14	0.46	0.72 (–)	0.15 (0.18)	0.38 (0.36)
SN1	A_i	1195	1652	112	161 (–)	470 (371)	43 (45)
	μ_i	0.21	0.56	0.06	0.24 (–)	0.61 (0.63)	0.03 (0.05)
	σ_i	0.63	0.16	0.37	0.69 (–)	0.16 (0.15)	0.26 (0.45)
SN2	A_i	1351	2168	223	66 (–)	461 (342)	43 (25)
	μ_i	0.25	0.56	0.08	0.31 (–)	0.60 (0.64)	0.02 (0.05)
	σ_i	0.69	0.15	0.41	0.64 (–)	0.19 (0.16)	0.19 (0.25)
SN3	A_i	1206	2048	800	138 (–)	689 (688)	130 (200)
	μ_i	0.21	0.55	0.09	0.24 (–)	0.61 (0.67)	0.03 (0.05)
	σ_i	0.64	0.15	0.48	0.68 (–)	0.16 (0.11)	0.19 (0.45)

The fact that simple functional forms can be used to describe the evolution of the Betti numbers for reionization is another advantage over the Euler characteristic for which no simple fit appears to exist.

4.4 Dependence on simulation resolution

The topology of H ii regions is sensitive to the resolution of our simulation volume. Friedrich et al. (2011) showed the impact of simulation resolution on the Euler characteristics. In this section, we study how the Betti numbers depend on the simulation resolution.

We show the Betti numbers curves estimated from the FN1 simulation volumes at different resolutions in Fig. 5. The solid curve represents the Betti numbers estimated from the data set at the intrinsic resolution of our simulation (Δx ≈ 2.4 Mpc). The dashed and dash–dotted lines represent their values estimated after four-cell and eight-cell smoothing, respectively. The four-cell and eight-cell smoothing approximately correspond to the resolution of SKA-Low observations at z ≈ 9 when maximum baselines of 2 and 1 km are used, respectively.

$Evolution of the Betti numbers estimated from the $x_\mathrm{H\, \rm {ii}}$ field of the FN1 simulation at various resolutions. The solid, dashed, and dash–dotted line-styles give Betti numbers estimated from data set at simulation resolution, four-cell smoothing, and eight-cell smoothing, respectively. The latter two approximately correspond to the resolution achieved by SKA-Low observations with a maximum baseline of 2 and 1 km, respectively, for redshift z ≈ 9.$

Figure 5.

Evolution of the Betti numbers estimated from the |$x_\mathrm{H\, \rm {ii}}$| field of the FN1 simulation at various resolutions. The solid, dashed, and dash–dotted line-styles give Betti numbers estimated from data set at simulation resolution, four-cell smoothing, and eight-cell smoothing, respectively. The latter two approximately correspond to the resolution achieved by SKA-Low observations with a maximum baseline of 2 and 1 km, respectively, for redshift z ≈ 9.

Open in new tab Download slide

When we fit these curves with the parametric form given in equations (10), (11), and (12), we find that the peak positions and widths of the curve do not change as the resolution degrades. The resolution only affects the amplitude parameter A_i for all the Betti numbers curve. This decrease in amplitude is expected as the number of features contained in the data set decreases when the resolution is degraded. The fact that the peak positions and widths are less affected adds to the attractiveness of the Betti numbers as topological quantities.

4.5 Comparing the topology of different reionization scenarios

In this section we compare the topology of various reionization models using the Betti numbers. The top panels of Fig. 6 show the Betti curves for our fully numerical simulations. The FN1 and FN2 simulations use source efficiencies ζ = 50 and 40, respectively. Therefore, reionization is delayed in FN2 (see Fig. 2). However, even though the reionization history is different the peak positions of the Betti curves are quite similar. The β₀ gives the number of isolated ionized regions and at early times, this number will be proportional to the number of active ionizing sources and not the efficiency of those sources. Note, however, that β₀ will not be exactly equal to the number of sources as some regions are formed by clustered sources. Fig. 2 shows that FN2 reaches a similar stage Δz ≈ 1 later than FN1. The number density of ionizing sources will not differ that much between those two epochs. Therefore, the β₀ curves for both these models are quite similar. However, the topology does differ, as evidenced by the β₁ and β₂ curves. The FN2 simulation contains more tunnels during the middle stages of reionization and more cavities from about the middle until near the end of reionization. Using all the curves together, we can distinguish between these two models.

$Evolution of the Betti numbers estimated from the $x_\mathrm{H\, \rm {ii}}$ field of different simulations. The top and bottom panels show the Betti numbers for our fully numerical and semi-numerical simulation models, respectively.$

Figure 6.

Evolution of the Betti numbers estimated from the |$x_\mathrm{H\, \rm {ii}}$| field of different simulations. The top and bottom panels show the Betti numbers for our fully numerical and semi-numerical simulation models, respectively.

Open in new tab Download slide

To test the robustness of our conclusions, we also study the topology of reionization in a set of semi-numerical simulations. As semi-numerical simulations are fast, we can easily vary the simulation parameters and observe its impact on the Betti numbers. In the bottom panel of Fig. 6, we show the Betti numbers curves for our three semi-numerical simulations (see Table 1). SN1 and SN3 use the same M_min value (⁠|$10^9 \, \mathrm{M}_\odot$|⁠) for dark matter haloes, which means that reionization is driven by the same source population. Therefore, the β₀ curves for SN1 and SN3 overlap with each other. The M_min used in SN2 is |$10^8 \, \mathrm{M}_\odot$|⁠. As a result SN2 contains more ionizing sources compared to SN1 and SN3. We see the imprint of this on the β₀ curve that reaches a higher amplitude for the SN2 simulation.

The SN1 and SN2 simulations assume R_mfp to be 71 Mpc where as SN3 simulation assumes it to be 20 Mpc. SN1 and SN3 otherwise have the same source parameters. R_mfp describes the largest distance ionizing photons can travel from a source. At the early times, the value of R_mfp does not have a significant impact on the topology as most H ii regions are smaller than R_mfp. However, once larger regions form during the middle and late stages of reionization, a smaller value for R_mfp will inhibit the growth of ionized regions. This will allow more tunnels to survive, as evidenced by the difference between the β₁ curves of SN1 and SN3. The largest impact of R_mfp is associated with the final phases of reionization as can be seen from the β₂ curves. The values are much higher for SN3, indicating that imposing a short mean free path leads to the formation of many more small neutral islands.

As pointed out in Section 4, the rise of β₁ in simulation FN1 marks the appearance of the percolation cluster. The percolation curves for all our reionization models can be found in Fig. 4. We find that for all models, percolation happens roughly at the epoch when β₁ crosses β₀ confirming our hypotesis.

The Betti numbers curves from all our reionization models, both fully numerical and semi-numerical, show the characteristic forms defined in Section 4. We present the values of the fit parameters for all models in Table 3. We conclude that these fitting formulae equations (10), (11), and (12) provide a useful description of the evolution of Betti numbers in reionization simulations.

5 BETTI NUMBERS OF 21-CM SIGNAL

In this section we will study the evolution of Betti numbers estimated from the 21-cm signal (δT_b). As explained in Section 3.2, this signal depends on neutral fraction |$\big (x_\mathrm{H\, \rm {i}}(\pmb {r}) = 1-x_\mathrm{H\, \rm {ii}}(\pmb {r}) \big)$|⁠, density, and spin temperature fields (see equation 6). For our study we assume the Universe has been heated before reionization begins causing the spin temperature factor to saturate to 1 as (T_s ≫ T_CMB). Even with this assumption, the δT_b will still contain properties of both the ionization and density fields.

We first study the topology of structures by putting a threshold (⁠|$\delta T^\mathrm{th}_\mathrm{b}$|⁠) on the δT_b field. We measure the topology of structures with pixel values below a given value of |$\delta T^\mathrm{th}_\mathrm{b}$|⁠. In Fig. 7, we show colour plots of the Betti numbers estimated from the δT_b field of the FN1 simulation against the values of |$\delta T^\mathrm{th}_\mathrm{b}$| (y-axis) and |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}$| (x-axis). For high values of |$\delta T^\mathrm{th}_\mathrm{b}$|⁠, the topology is mostly defined by the fluctuations in the density field. However the Betti numbers will not exactly trace the topology of the density field as reionization will reduce the signal in the (predominantly) dense regions. For low |$\delta T^\mathrm{th}_\mathrm{b}$| value, the structures trace the topology of ionization field. For |$\delta T^\mathrm{th}_\mathrm{b} \lesssim 10$| mK, the Betti numbers estimated from δT_b fields are similar to that estimated from |$x_\mathrm{H\, \rm {ii}}$| field of FN1 simulation.

$Contour plots of the values of β0 (left-hand panel), β1 (middle panel), and β2 (right-hand panel) of the δTb field from the FN1 simulation as a function of global ionization fraction $x^\mathrm{v}_\mathrm{H\, \rm {ii}}$ and threshold value ($\delta T^\mathrm{th}_\mathrm{b}$).$

Figure 7.

Contour plots of the values of β₀ (left-hand panel), β₁ (middle panel), and β₂ (right-hand panel) of the δT_b field from the FN1 simulation as a function of global ionization fraction |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}$| and threshold value (⁠|$\delta T^\mathrm{th}_\mathrm{b}$|⁠).

Open in new tab Download slide

For different reionization scenarios we will obtain different colour plots from the one shown in Fig. 7. However, it is difficult to compare scenarios with these plots as they will be affected by instrumental limitations such as lack of an absolute calibration of the signal, limited resolution, and noise. In the next section, we propose a method to explore the topological properties of reionization using mock 21-cm observations for the future SKA-Low telescope.

6 BETTI NUMBERS OF 21-CM IMAGES OBSERVED WITH RADIO TELESCOPE

As discussed in the previous section, the structures in the ionization field will be imprinted on the 21-cm signal. Therefore, we can study the morphology of H ii regions if we can identify these regions in 21-cm images. As the future SKA-Low has been designed to be able to produce such images (Mellema et al. 2015), we will consider mock SKA-Low observations in this section and study our ability to estimate the Betti numbers from such observations.

Identifying the ionized (or neutral) regions in 21-cm images is a non-trivial problem. Giri et al. (2018b) compared various methods taken from the field of image processing and found that the superpixel method works the best in identifying ionized (or neutral) regions in 21-cm images. In the superpixel method, connected pixels with similar properties are grouped together. These pixel groups are known as superpixels. We then construct the histogram from the mean intensity values of all the superpixels. From this histogram we find the threshold for the superpixels that classifies them into ionized and neutral superpixels. See Giri et al. (2018b) for a detailed description of the method. Note that the identification of the ionized regions at the same time yields an estimate for the fraction of the observed volume that is ionized, thus allowing to determine the Betti numbers as a function of ionized volume fraction rather than redshift.

In Section 6.1, we consider mock SKA-Low observations with infinite observation time as we want to first learn what we can achieve in this most optimal case with our method. In this situation, the 21-cm images will be free from noise and only be affected by the limited resolution and the lack of absolute calibration. In Section 6.2 we study the impact of telescope noise on our analysis strategy.

6.1 Limited resolution

Fig. 8 shows the impact of SKA-Low resolution on the estimated Betti numbers. The solid and dashed lines represent the Betti numbers calculated from the |$x_\mathrm{H\, \rm {ii}}$| data sets degraded to a resolution corresponding to a maximum baseline (B) of 2 and 1 km, respectively. These curves differ slightly from the ones shown in Fig. 5 as the resolution corresponding to a certain value of B is redshift dependent. Even though the peak amplitude decreases as the resolution is degraded, the peak position of all the Betti number curves estimated from |$x_\mathrm{H\, \rm {ii}}$| does not change. This behaviour agrees with our findings in Section 4.4.

$The solid and dashed (dash–dotted and dotted) represent Betti numbers curve estimated from the $x_\mathrm{H\, \rm {ii}}$ (δTb) fields from simulation FN1 at resolutions corresponding to a maximum baseline B of 2 and 1 km, respectively.$

Figure 8.

The solid and dashed (dash–dotted and dotted) represent Betti numbers curve estimated from the |$x_\mathrm{H\, \rm {ii}}$| (δT_b) fields from simulation FN1 at resolutions corresponding to a maximum baseline B of 2 and 1 km, respectively.

Open in new tab Download slide

The dash–dotted and dotted lines in Fig. 8 represent the Betti numbers estimated from the δT_b fields at a resolution corresponding to B = 2 and 1 km, respectively. The peak position of β₀ for the δT_b field observed at B = 2 km matches the peak positions of the β₀ curves derived from the |$x_\mathrm{H\, \rm {ii}}$| fields. However, for the lower resolution case of B = 1 km the β₀ derived from the δT_b fields attains its peak at a much earlier epoch. This is because the ionized regions are not very large at these early times and for the lower resolution the superpixel method struggles to separate the ionization and density fluctuations. This effect is in fact also present for the B = 2 km case where it causes the non-smooth increase of β₀.

The middle panel of Fig. 8 shows that the β₁ curves for δT_b field are slightly biased towards larger |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}$| compared to the ones for |$x_\mathrm{H\, \rm {ii}}$| field. We also observe that the peak amplitude of β₁ curves for δT_b field decreases and the width of the curve increases compared to the ones for |$x_\mathrm{H\, \rm {ii}}$| field. At early times (⁠|$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\lesssim 0.2$|⁠), the β₁ determined from |$x_\mathrm{H\, \rm {ii}}$| field is close to zero for all our reionization scenarios (see Section 4.5). However our method gives non-zero values for β₁ determined from δT_b field. This anomaly is again caused as the superpixel method struggles to separate ionization and density fluctuations during these early stages of reionization.

The right-most panel of Fig. 8 shows the β₂ curves for δT_b field. These curves are also slightly biased towards larger |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}$| compared to the ones for |$x_\mathrm{H\, \rm {ii}}$| field. Even though the resolution corresponding to B = 1 km smooths away more features compared to B = 2 km, the superpixel method identifies similar number of neutral regions as these neutral regions are relatively large structures (Giri et al. 2019a). Therefore, the β₂ curves for both resolution cases overlap with each other.

6.2 Telescope noise

The SKA-Low observations will contain instrumental noise, which we model using the method described in Section 3.3.2. Here we study the Betti numbers estimated from noisy 21-cm images. We consider two observation times, which are t_obs = 1000 h and 100 h. The 21-cm images observed at t_obs = 1000 h is degraded to a resolution corresponding to B = 2 km. In order to get similar SNR, we use B = 1 km for the 21-cm images observed with t_obs = 100 h.

In the left-hand-most, middle, and right-hand-most panels of Fig. 9, we show from left to right the β₀, β₁ and β₂ curves estimated from δT_b field. As can be seen from these curves our method can reconstruct the topology of reionization most reliably for epochs |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\gtrsim 0.4$| from the noisy 21-cm images. For earlier times we find noisy Betti curves, which is what we expect for data sets in which the noise dominates over the signal. In Section 4, we found that the topology of reionization during the early stages is mostly traced by the β₀ values. These are very difficult to extract from the noisy 21-cm images as most ionized regions are small-scale features that are indistinguishable from noise. The β₁ and β₂ values are more sensitive to the topology of reionization during the middle and late stages of reionization, respectively. We observe good reconstruction of β₁ and β₂ curves in Fig. 9 derived from noisy 21-cm images, with the exception of the β₁ values for the B = 2 km, 1000 h observation that are severely underestimated.

Figure 9.

Betti numbers estimated from the δT_b field from simulation FN1 when observed with SKA-Low. The solid and dashed lines represent Betti numbers estimated from noiseless δT_b observed at resolution corresponding to a maximum SKA-Low baseline of 2 and 1 km, respectively. The dash–dotted and dotted lines represent Betti numbers estimated from 1000 and 100 h observation of δT_b observed at resolution corresponding to a maximum SKA-Low baseline B of 2 and 1 km, respectively.

Open in new tab Download slide

As the β₀ curves at |$x^\mathrm{v}_\mathrm{H\, \rm {ii}} \lesssim 0.4$| are noisy, we fit equation (10) to the these curves for |$x^\mathrm{v}_\mathrm{H\, \rm {ii}} \gtrsim 0.4$| only. We list the fit parameter values in Table 3. Even though we cannot see any clear peak position in the β₀ curves, the fit for the mock observations t_obs = 1000 h does give us good values for μ₀ and σ₀. Even though the curve derived for t_obs = 100 h seems to resemble the one for t_obs = 1000 h, our cross-validation technique shows that no reasonable fit can be made. We therefore conclude that to study the β₀ evolution from 21-cm images, a long observation time is needed.

Similarly, we fitted parameters of equations (11) and (12) to the β₁ and β₂ curves for |$x^\mathrm{v}_\mathrm{H\, \rm {ii}} \gtrsim 0.4$|⁠. These fit parameters are also given in Table 3. The peak position and width of the β₁ curves derived from the noisy 21-cm images match quite well with the ones for the noiseless case. However, their amplitude is lower than for the ideal case. The β₂ curves derived from noisy 21-cm images almost overlap with the curves derived from the noiseless cases, leading to very similar fitting parameters for β₂ for all cases, irrespective of observation time. This indicates that we can study the topology of reionization during its middle and late stages even with 21-cm observations with relatively short observation times.

6.3 Comparing different reionization models

In this section, we compare Betti number curves for all our reionization models when observed with SKA-Low. The top panel of Fig. 10 shows the curves derived from mock observations at t_obs = 1000 h and B = 2 km. We find that we can extract the correct evolution the topology of reionization in all the models. These curves are more reliable at epochs |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\gtrsim 0.4$|⁠. Using all the Betti numbers together, we can distinguish between the models. In Table 3, we give the parameter values while fitting all these curves with equations (10), (11), and (12). The derived μ and σ values are quite close to the values derived from the |$x_\mathrm{H\, \rm {ii}}$| field.

Figure 10.

Betti numbers estimated from the δT_b field for all our reionization models when observed with SKA-Low. The top and bottom panels represent the Betti number curves extracted from mock observations of t_obs = 1000 h (B = 2 km) and t_obs = 100 h (B = 1 km), respectively. Different models are marked with different line-styles that are shown in the legend.

Open in new tab Download slide

We also compare the Betti number curves for all our reionization models in our more pessimistic scenario. The bottom panel of Fig. 10 gives the curves derived from mock observations at t_obs = 100 h and B = 1 km. Even though the curves are noisy, we can discern the expected evolution of the Betti numbers for each model. These curves are more reliable at epochs |$x^\mathrm{v}_\mathrm{H\, \rm {ii}}\gtrsim 0.5$|⁠, making it difficult to derive a fit for the evolution of β₀. In Table 3 contains the fit parameters for the β₁ and β₂ curves. As the late neutral islands tend to be large, we find that we can measure the topology of reionization at late times quite well. As noted above, this can even be done using relatively short observations times with SKA-Low.

7 SUMMARY

In this work we explored a new method to study the 3D topology of H ii regions during reionization, namely the three Betti numbers. In the perspective of the distribution of ionized regions the zeroth Betti number (β₀) gives the number of isolated H ii regions, the first Betti number (β₁) gives the number of neutral tunnels through ionized regions, and the second Betti number (β₂) gives the number of neutral islands embedded in ionized regions.

We investigated the evolution of the Betti numbers in a small set of fully numerical and semi-numerical simulations and found a characteristic pattern in which each of the Betti numbers first rises and then falls, each attaining a maximum value at different stages of reionization. When reionization starts, most of the ionized bubbles are isolated and β₀ is the largest of the three numbers. It initially rises as the number of ionizing sources, including individual and clustered sources, grows. However, as existing ionized regions grow and new ones continue to form, they will start to overlap, leading eventually to a decrease of β₀.

When substantial overlapping of ionized bubbles starts, more and more of these connected regions will be crossed by neutral tunnels and we consequently observe a rise in the values of β₁. The time when the decreasing β₀ and the increasing β₁ cross corresponds to the appearance of the ionized percolation cluster (Furlanetto & Oh 2016). β₁ reaches its maximum value at the middle stages of reionization. During this stage, isolated neutral regions, some of them the remnants of broken up tunnels, become more common and therefore the value of the second Betti number (β₂) starts to increase. The time when the β₁ and β₂ curves cross corresponds to the disappearance of the neutral percolation cluster. During the later stages of reionization β₂ reaches its peak value after which it drops as the last neutral islands disappear.

We find that the characteristic evolution of the Betti numbers with the volume-averaged ionization fraction (⁠|$x^{\rm v}_{\rm H\, \rm {ii}}$|⁠) of the Universe is quite well fitted by simple functions, a normal or Gaussian function for β₁, and lognormal functions for β₀ and β₂. Each of these is determined by three parameters specifying the peak amplitude (A_i), peak position (μ_i), and width (σ_i) of the curves. Using our suite of reionization simulations, we showed that we can distinguish reionization models by their sets of Betti curves.

On the basis of these results we conclude that the Betti numbers provide a better way to describe the topology of reionization than the much better studied genus or Euler characteristic. Not only do the Betti numbers represent a set of intuitively clear morphological concepts in the context of ionized regions, they also display characteristic evolutionary patterns in which they each distinguish different stages of reionization, and which also appear to fit simple functional forms. Furthermore, their evolution shows a connection with the percolation process characteristic of reionization. As the Euler characteristic can easily be calculated from the Betti numbers, there appears to be no advantage to studying the topology of reionization with it.

The redshifted 21-cm signal probes the distribution of neutral hydrogen gas during reionization. Observations with the future SKA-Low will deliver 3D tomographic data sets consisting of sky images of the 21-cm signal at different frequencies. We explored the prospect of deriving the topology of H ii regions from such data sets. To identify ionized regions in the 21-cm data we use the superpixels method introduced in Giri et al. (2018b). We find that we can retrieve the topology of reionization quite well for middle and late stages of reionization (⁠|$x^\mathrm{v}_\mathrm{H\, \rm {ii}} \gtrsim 0.4$|⁠). At early times, most H ii regions are small and difficult to identify in the mock 21-cm data sets due to a combination of low resolution and instrumental noise. We tested our method for two observation times, 1000 and 100 h. For both cases we could retrieve the evolution of β₁ and β₂ better than the one for β₀. We therefore expect that β₁ and β₂ in practice will be more robust measures of topology than β₀.

All Betti numbers curves derived from the mock 21-cm observations are noisy due to the imperfections of the observational data that do not allow a perfect identification of H ii regions. However when we fit β₁ and β₂ curves with our three parameter models, the intrinsic values of the model parameters are retrieved quite well. Retrieving the model parameters for β₀ is more challenging for the reasons explained above.

We have also compared the topologies determined from mock 21-cm observations constructed from our suite of reionization simulations. We can clearly distinguish between these models for a 1000 h observation with SKA-Low. Determining the evolution of the Betti numbers with a 100 h observation is more difficult. However, we can still distinguish between our models using β₁ and β₂. As isolated neutral regions are relatively large structures (Giri et al. 2019a), the values of β₂ estimated from the mock observations are the least affected by limited resolution and noise.

In this paper we have explored a small set of five reionization simulations, produced by two different codes. One might wonder about the validity of some of our conclusions, e.g. regarding the shapes of the Betti curves. If reionization proceeded inside-out we expect the Betti curves to always show the behaviour we have found: each Betti number peaks at a different time, in the order β₀, β₁, β₂. The reason for this is that for inside-out reionization there will be a strong correlation between the local density and the redshift of reionization. The areas of highest density reionize first and the ones of lowest density reionize last. This implies that different stages of reionization can approximately be connected to different threshold values in the density field, where everything above that threshold value is ionized. In this case we expect the evolution of the Betti numbers of ionized regions to approximately follow the Betti curves for different threshold values in a GRF shown in Fig. A1. This means that each Betti number will peak at a different time, in the order β₀, β₁, β₂, just as we have found for all our models. As reionization is a radiative transfer process, the correlation between density and redshift of reionziation will never be perfect, which is why the actual curves are not identical to the more or less Gaussian curves from Fig. A1. Why the evolutionary Betti curves would favour lognormal shapes for β₀ and β₂ and a Gaussian shape for β₁ is harder to explain. We consider our proposed fitting functions to be a useful working hypothesis until cases are found where this approach breaks down.

Our study has used a number of simplifying assumptions which we would like to briefly discuss here. The first is that we have calculated the differential brightness temperature assuming the high-spin temperature limit. If X-ray heating is not as efficient as generally assumed, this assumption may not be globally valid during the earlier stages of reionization. However, as we have seen, the early stages of reionization will anyway be challenging to study topologically as it will be difficult to identify ionized regions in the data.

We have also neglected the various line-of-sight effects when constructing our mock 21-cm observations. These are the LC effect (e.g. Datta et al. 2012) and RSD (e.g. Jensen et al. 2013). The former effect is caused by the evolution of the signal along the frequency direction. If we would analyse 21-cm data sets with a large frequency range (≳ 10 MHz), we would see more overlapping regions at the high frequency end than at the low frequency end. This will affect the topology of H ii regions and therefore the Betti numbers. However, this can be avoided by choosing a narrower frequency range. The RSD effect displaces the signal from its cosmological redshift due to peculiar velocities of gas. This process mostly alters the amplitude of the signal and has a relatively minor impact on the morphology of H ii regions (Giri et al. 2018a) Therefore, we expect only a minor impact on the Betti numbers, specially at the resolution of the telescope. We defer a detailed study of the impact of these line-of-sight effects to the future.

A potentially larger challenge is the impact from bright foreground signals that are many times brighter than the 21-cm signal we want to analyse (Jelic et al. 2008). A good foreground mitigation strategy is crucial to extract the 21-cm signal from radio observation. However, these methods are not perfect and therefore will leave residuals in the 21-cm images. If our structure identification methods struggles to distinguish between these residual foreground and the H ii regions, then studying the topology of reionization would become difficult. In such a case, we have to develop a foreground mitigation method that preserves the topological information. A detailed study of the impact of foreground residual is beyond the scope of this paper.

ACKNOWLEDGEMENTS

We acknowledge Rien van de Weygaert for useful discussion. GM is supported in part by Swedish Research Council grant 2016-03581. The computations were enabled by resources provided by the Swedish National Infrastructure for Computing (SNIC) at PDC partially funded by the Swedish Research Council through grant agreement no. 2018-05973. We also have used results obtained using the Partnership for Advanced Computing in Europe (PRACE) Research Infrastructure resources Curie based at the Très Grand Centre de Calcul (TGCC) operated by CEA near Paris, France, and Marenostrum based in the Barcelona Supercomputing Center, Spain. Time on these resources was awarded by PRACE under PRACE4LOFAR grants 2012061089 and 2014102339 as well as under the Multi-scale Reionization grants 2014102281 and 2015122822.

DATA AVAILABILITY

The data underlying this article will be shared on reasonable request to the corresponding author.

Footnotes

https://scikit-image.org/

https://tools21cm.readthedocs.io/

The configuration is described in document SKA-TEL-SKO-0000557 Rev 1 and the positions of the individual stations in document SKA-TEL-SKO-0000422 Rev 2, both retrievable at https://astronomers.skatelescope.org/documents/

This should perhaps be called a depercolation event as the neutral percolation cluster disappears.

REFERENCES

Bandyopadhyay

Choudhury

T. R.

Seshadri

T. R.

2017

MNRAS

466

2302

Month:	Total Views:
May 2021	6
June 2021	82
July 2021	42
August 2021	13
September 2021	30
October 2021	33
November 2021	15
December 2021	25
January 2022	15
February 2022	15
March 2022	14
April 2022	8
May 2022	21
June 2022	3
July 2022	18
August 2022	20
September 2022	24
October 2022	19
November 2022	11
December 2022	12
January 2023	17
February 2023	8
March 2023	46
April 2023	34
May 2023	9
June 2023	20
July 2023	13
August 2023	19
September 2023	20
October 2023	7
November 2023	21
December 2023	7
January 2024	39
February 2024	11
March 2024	15
April 2024	14
May 2024	30
June 2024	15
July 2024	20
August 2024	16

Article Contents

Measuring the topology of reionization with Betti numbers

ABSTRACT

1 INTRODUCTION

2 TOPOLOGICAL MEASURES

2.1 Betti numbers

2.2 Estimators for digital data

2.2.1 Betti numbers

2.2.2 Euler characteristics

3 SIMULATION OF 21 CM SIGNAL

3.1 Reionization simulations

3.1.1 Fully numerical simulations

3.1.2 Semi-numerical simulations

3.2 Redshifted 21 cm signal

3.3 Mock SKA-Low observation

3.3.1 Limited resolution

3.3.2 System noise

4 BETTI NUMBERS OF IONIZATION FIELD

4.1 Evolution during reionization

4.2 Connection between Betti numbers and percolation

4.3 Parametrization of the evolution of Betti numbers

4.4 Dependence on simulation resolution

4.5 Comparing the topology of different reionization scenarios

5 BETTI NUMBERS OF 21-CM SIGNAL

6 BETTI NUMBERS OF 21-CM IMAGES OBSERVED WITH RADIO TELESCOPE

6.1 Limited resolution

6.2 Telescope noise

6.3 Comparing different reionization models

7 SUMMARY

ACKNOWLEDGEMENTS

DATA AVAILABILITY

Footnotes

REFERENCES

APPENDIX A: TOPOLOGY OF GAUSSIAN RANDOM FIELD

Citations

Views

Altmetric

Email alerts

Astrophysics Data System

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only