-
PDF
- Split View
-
Views
-
Cite
Cite
E C Elson, A J Baker, S L Blyth, On the uncertainties of results derived from H i spectral line stacking experiments, Monthly Notices of the Royal Astronomical Society, Volume 486, Issue 4, July 2019, Pages 4894–4903, https://doi.org/10.1093/mnras/stz1178
- Share Icon Share
ABSTRACT
We present the results of a set of mock experiments aimed at quantifying the accuracy of results derived from H i spectral line stacking experiments. We focus on the effects of spatial and spectral aperture sizes and redshift uncertainties on co-added H i spectra, and by implication on the usefulness of results from H i line spectral stacking experiments. Using large spatial apertures to extract constituent galaxy spectra yields co-added spectra with high levels of contamination and with relatively low signal-to-noise ratios (S/Ns). These properties are also affected by the size of the spectral aperture as well as H i redshift uncertainties of galaxies. When redshift uncertainties are high, S/N decreases while the contamination level remains roughly constant. Using small spectral apertures in the presence of large H i redshift uncertainties can yield significant decreases in S/N without the expected decrease in the amount of contaminant flux. Our simulations show that a co-added spectrum rarely yields an accurate measure of the total H i mass of a galaxy sample. Total mass is generally over/underestimated for large/small spatial apertures, regardless of spectral aperture size. Our findings strongly suggest that any co-added H i galaxy spectrum needs to be fully modelled in the ways presented in this paper in order to apply accurate corrections for flux contamination and derive realistic uncertainties in total H i galaxy mass. Failing to do so will result in unreliable inferences of galaxy and cosmological parameters, such as ΩH i.
1 INTRODUCTION
Neutral atomic hydrogen (H i) is the dominant phase of the interstellar medium at z = 0. Constraints on the redshift evolution of the cosmic H i density, ΩH i, can be used to model the processes by which gas is accreted on to galaxies and subsequently processed and expelled. The cycle of gas in galaxies (including the time it spends as H i) is one of the main drivers of their formation and evolution.
Observations of H i in and around galaxies have traditionally been limited to the nearby Universe. In the very local Universe, direct detections of H i in galaxies can be used to construct the H i mass function (e.g. Zwaan et al. 2005; Martin et al. 2010) and hence measure ΩH i. Owing to the intrinsic faintness of the H i emission line, directly detecting it in galaxies at distances of more than a few hundred megaparsecs requires extremely long integration times. At high redshifts (z ≳ 1.5), measurements of ΩH i are derived from observations of damped Lyman α absorbers (DLAs; e.g. Wolfe et al. 1986, 1995; Storrie-Lombardi, McMahon & Irwin 1996; Péroux et al. 2005). Sánchez-Ramírez et al. (2016) used a sample of 742 DLAs to measure the continuous redshift evolution of ΩH i. By combining their measurements with those at z = 0, they derive a factor of ∼4 decrease in ΩH i from z = 5 to 0. They do, however, point out the large uncertainties on ΩH i over the intermediate redshift range 0.1 ≲ z ≲ 1.6, where it is still poorly constrained by DLAs. At the low end of this range, stacking of H i data from existing radio telescopes has been used to measure ΩH i (e.g. Chengalur, Braun & Wieringa 2001; Lah et al. 2007, 2009; Delhaize et al. 2013; Rhee et al. 2013, 2016). A larger fraction of this redshift range will be probed by means of H i detections of individual galaxies, and spectral line stacking, in the forthcoming Looking At the Distant Universe with the MeerKAT Array survey (LADUMA; Blyth et al. 2016), to be carried out on the 64-element MeerKAT array.
Results derived from H i spectral line stacking experiments can be highly uncertain. For single-dish observations of H i in the nearby Universe as well as interferometric observations of galaxies at higher redshifts, source confusion greatly limits the accuracy with which the average H i content of a sample can be determined by means of stacking. As an example, for the redshift range 0.04 ≲ z ≲ 0.13, Delhaize et al. (2013) estimate any one galaxy in their 2dFGRS sample observed with the Parkes Telescope to be confused, on average, with seven others. Unfortunately, quantifying such uncertainties is not always possible when considering only the observational data. Jones et al. (2015, 2016) use the Arecibo Legacy Fast Alfa (ALFALFA) survey correlation function to develop a computationally inexpensive method of predicting the amount of confused flux in a co-added H i spectrum. They conclude that stacking in deep SKA-precursor H i surveys will be only mildly affected by source confusion if their target synthesized beam size of 10 arcsec can be achieved. In Elson, Blyth & Baker (2016), we presented a set of synthetic data cubes containing model galaxies with realistic spatial and spectral H i distributions based on the catalogue of evaluated galaxy properties from Obreschkow & Meyer (2014). We used them to carry out several mock H i stacking experiments based on noise-free cubes for low- and high-redshift galaxy samples, and consistently found large fractions of contaminant emission due to source confusion in all co-added spectra.
In addition to quantifying the total amount of confused H i emission in a co-added spectrum, our simulated data cubes can be used to gain further insights into the extent to which confusion depends on various user-specified extraction parameters, as well as parameters not controlled by the user. In this work, we focus on the sorts of H i stacking experiments that will be applied to the LADUMA H i data. We do this by considering a noise-filled version of the high-redshift (0.7 < z < 0.758) cube presented in Elson et al. (2016), which we use to produce a suite of co-added spectra. The co-adds differ in the ways in which their constituent spectra are generated. We use spatial apertures of varying size, vary the spectral range over which individual spectra are extracted, and simulate the effects of offsets between optical and H i redshifts. All of these factors play a role in determining the amount of contaminant flux in a co-added spectrum, as well as its shape and signal-to-noise ratio (S/N). Our aim in this work is to quantify these various characteristics of a co-added spectrum as functions of the above-mentioned factors.
The layout of this paper is as follows. In Section 2, we present the details of the data cube used for our stacking experiments. In Section 3, we describe the various types of co-added spectra we produce in terms of various combinations of aperture sizes and redshift offsets. We discuss the various characteristics of our co-added spectra in Section 4, and present our conclusions in Section 5. Throughout this work, we assume a ΛCDM cosmology with a Hubble constant H0 = 67.3 km s−1 Mpc−1, ΩΛ = 0.685, and ΩM = 0.315 (Planck Collaboration XVI 2014).
2 SYNTHETIC DATA CUBES
In this work, we make use of the synthetic data products from Elson et al. (2016), where we presented a set of methods for converting mock galaxy catalogues into realistic data cubes containing H i line emission as well as telescope noise and beam effects. The cubes are based on the catalogue of evaluated galaxy properties from Obreschkow & Meyer (2014). For millions of galaxies in the redshift range z < 1.2, the catalogue presents detailed H i properties as well as auxiliary optical properties. The catalogue is based on the SKA Simulated Skies semi-analytic simulations, and therefore on the physical models described in Obreschkow et al. (2009a,b,c). These models are able to assign realistic masses and sizes to H i discs, allowing us to evaluate the characteristic properties of their H i emission lines. In Elson et al. (2016), we used noise-free cubes to carry out a series of mock H i stacking experiments, in order to quantify the rate of source confusion as a function of spatial resolution. In this work, we use an almost identical synthetic data cube, but we include noise. The reader is referred to Elson et al. (2016) for the full details of the properties of the synthetic H i data cubes, as well as the methods used to create them.
We choose to focus on the sort of H i image cube that will typically be produced by the LADUMA survey. LADUMA will carry out H i spectral line stacking experiments using data cubes with a frequency-dependent spatial resolution of ∼10–20 arcsec. It was shown in Elson et al. (2016) that H i stacking experiments carried out at 0.7 < z < 0.758 will not be dominated by source confusion when the spatial resolution of the data is 18 arcsec. Rather, ∼69 per cent of the flux in a stacked spectrum will come from the target galaxies of interest. Here we generate a synthetic cube spanning a sky area of 1.4○ × 1.4○ and a redshift range 0.7 < z < 0.758. This cube contains 52 662 galaxies with a total H i mass of |$3.53\times 10^{13}\, \mathrm{ M}_{\odot }$|. The distributions of the H i and stellar masses of these galaxies are (statistically) identical to those shown in fig. 13 of Elson et al. (2016), to which the reader is referred. The cube has a spatial resolution of 15 arcsec,1 and spatial and spectral pixel sizes of 3 arcsec and 26 kHz, respectively. As mentioned above, the main difference between the cubes used in this work and the cubes used in Elson et al. (2016) is the inclusion of noise. Here we use noise that is Gaussian distributed in each channel, with a standard deviation of 28.8 |$\mu$|Jy beam−1. This noise level assumes an estimated system equivalent flux density of 560 Jy for the 64-dish MeerKAT array’s UHF receivers at a frequency of ∼820 MHz, for a fiducial integration time of 1000 h and a channel width of 26 kHz. Fig. 1 shows the total intensity map for the noise-free version of our synthetic H i cube.

Left-hand panel: Total intensity map for a noise-free version of our synthetic H i data cube, generated by integrating H i emission over the full spectral range of the cube. The map spans 1.4○ × 1.4○ and contains H i line emission from 52 662 galaxies in the redshift range 0.7–0.758. The spatial resolution is 15 arcsec. Right-hand panel: Zoom in of the spatial region delimited by the white square in the left-hand panel. Emission from only 30 consecutive channels is shown, corresponding to a velocity range of ∼300 km s−1. H i flux densities in units of Jy beam−1 to which the colours in each panel correspond are shown in the colour bars.
3 MOCK STACKING EXPERIMENTS
In this work, we focus on quantifying the effects of aperture size, redshift offsets, and spectral extraction ranges of constituent spectra on the characteristics of co-added H i spectra. We do not extract the spectra of all 52 662 galaxies in our synthetic cube. Rather, we consider only those galaxies with stellar mass greater than |$10^{10}\, \mathrm{ M}_{\odot }$|. This cut results in approximately2 2650–3000 individual spectra contributing to the various co-added spectra.
In Figs 2, 3, and 4, co-added spectra are decomposed as in Elson et al. (2016). For any co-added spectrum, the black histogram represents the galaxy-averaged co-added signal per channel coming from target galaxies, non-target galaxies, and noise. The blue histogram represents the co-added mass coming from galaxies only, excluding noise. The green histogram represents the co-added mass coming only from target galaxies, containing zero contaminant signal. The red histogram represents contaminant mass from nearby galaxies that are spatially and/or spectrally confused with the target galaxies. The reader is referred to Elson et al. (2016) for a detailed description of how the mass contributions from target galaxies, nearby neighbours, and distant neighbours are precisely and reliably calculated for a spectrum extracted from the cube.

Co-added H i spectra based on the long spectra (230 channels) generated for galaxies with |$M_*\ge 10^{10}\, \mathrm{ M}_{\odot }$| in our synthetic H i data cube. From left to right, the constituent spectra are generated using aperture sizes of 15, 30, and 45 arcsec, respectively. The co-adds shown in the top row are based on constituent spectra for which the central channel corresponds exactly to the systemic velocity, Vsys, of the galaxy. Rows 2, 3, and 4 show co-adds based on constituent spectra for which the central channel corresponds to a velocity equal to |$V_{\mathrm{ sys}}+\delta _z^i$|, where |$\delta _z^i$| are galaxy-specific velocity offsets drawn from Gaussian distributions with means of 0 km s−1 and standard deviations of 50, 100, and 250 km s−1, respectively. In each panel, the black histogram represents the galaxy-averaged co-added signal coming from target galaxies, non-target galaxies, and noise. The blue histogram represents the co-added signal coming only from galaxies (targets and non-targets). The green histogram represents the co-added signal for the target galaxies only. The red histogram represents the co-added signal from galaxies that are spatially and/or spectrally confused with the target galaxies (i.e. contaminant emission). The RMS value of the reference spectrum is represented by the horizontal magenta line. The values shown in the right third of each panel give the mass of a particular co-added mass component in units of |$10^9\, \mathrm{ M}_{\odot }$| (left column) and the ratio of that mass to the co-added galaxy mass (right column). From top to bottom, the values correspond to the co-adds represented by the black, blue, green, and red histograms. Given in the left third of each panel is the RMS variation of the reference spectrum (i.e. the magenta line) in units of |$10^{8}\, \mathrm{ M}_{\odot }$| for a single channel, the signal-to-noise ratio for the total co-added signal, the number of galaxies contributing to the co-add, and the true average H i galaxy mass in units of |$10^9\, \mathrm{ M}_{\odot }$| as calculated from the evaluated H i masses in the Obreschkow & Meyer (2014) catalogue. The vertical dashed lines in each panel delimit a spectral range of ±300 km s−1 about the centre of the co-add.

Co-added H i spectra based on the mid-length spectra generated for galaxies with |$M_*\ge 10^{10}\, \mathrm{ M}_{\odot }$| in our synthetic H i data cube. Notation is as in Fig. 2. Constituent spectra (such as those in this figure) based on a variable spectral aperture size yield co-adds that have a variable number of galaxies contributing to the flux in each channel. Because only the most massive galaxies span a velocity range of up to ∼600 km s−1, very few galaxies contribute to the channels near the edges of the co-adds. At these extreme velocities, small-number statistics yield highly variable co-added fluxes.
Note that throughout this work, we typically refer to the co-added flux coming from target galaxies, non-target galaxies, and noise as the total co-added flux or mass. We do this simply to emphasize the fact that our black histograms in Figs 2, 3, and 4 are based on all of the flux (including noise) in our simulated cube. In contrast, the other histograms are based on noise-free versions of our simulated cube. However, in all of our co-adds, it is always the galaxy-averaged H i mass in a channel that is shown. In other words, the total amount of co-added flux in a particular channel is always divided by the number of galaxies that contributed flux to that channel.
3.1 Co-add variations
3.1.1 Aperture size
An H i spectrum for a single galaxy is usually generated from a small subvolume spanning several spatial pixels and channels in an H i cube. The emission in each channel of the subvolume is spatially integrated to yield flux density as a function of frequency (i.e. a spectrum). The manner in which a subvolume is specified for a particular galaxy varies from study to study. In this work, we use the size of the Gaussian beam to specify the spatial extents of the subvolumes. We use square spatial apertures with side lengths of 15, 30, and 45 arcsec, corresponding to 1, 2, and 3 half-power widths of the fiducial Gaussian beam.3 Given the 3 arcsec spatial pixel scale of our synthetic cube, the apertures have side lengths of 5, 10, and 15 pixels.
3.1.2 Spectral range
We specify the spectral extents of the subvolumes in three ways. The first method uses a fixed width of 230 channels, corresponding to a velocity range of ∼2184 km s−1. At the mean redshift of the cube, the frequency width Δν = 26 kHz of a channel corresponds to a velocity width Δv = 9.49 km s−1. These spectra are hereafter referred to as ‘long’ spectra. The other two methods incorporate a measure of the H i line width of each galaxy. To more closely mimic real data analysis procedures, rather than using the evaluated H i line width from the Obreschkow & Meyer (2014) catalogue (which would not be available a priori for an observed galaxy), we use the evaluated absolute Vega R-band magnitude (corrected for intrinsic dust extinction) together with the Tully–Fisher relation (Tully & Fisher 1977) from Verheijen (2001) to calculate each galaxy’s H i line width, corrected for the inclination of the disc. Fig. 5 shows the distribution of evaluated (log10W50, MR) pairs for all galaxies in our synthetic cube, with the Verheijen (2001) relation overlaid. In order to obtain the line-of-sight H i line width of a galaxy, W50 must be scaled by sin (i), where i is the inclination of the galaxy. In practice, the inclination can be inferred from the optical morphology of a galaxy. However, galaxies will not be spatially resolved at high redshifts. We therefore use the Verheijen (2001) Tully–Fisher relation to extract two W50-based spectral ranges for each galaxy: one based on the assumption that i = 90 deg, and the other based on the evaluated inclination of the galaxy. These spectra are hereafter referred to as the ‘mid-length’ and ‘short’ spectra, respectively.

Distribution of (log10W50, MR) ordered pairs for all galaxies in our synthetic cube. W50 measures have been corrected for the evaluated inclinations of the galaxies. Overlaid as a grey line is the R-band Tully–Fisher relation for the full H i sample of Verheijen (2001).
3.1.3 Redshift uncertainty
Once the spectral range of a subvolume is specified, the exact spectral location (i.e. channel) in the H i cube about which to centre the subvolume must also be specified. If we assume that the H i redshift of each target galaxy is known with zero uncertainty, then we can place the spectral centre of the subvolume precisely at the corresponding channel in the H i cube. In practice, however, an accurate measure of the H i redshift of a distant galaxy is seldom known. For galaxies that fall below the H i sensitivity threshold of a survey, this is always the case. It is for this reason that the optical redshift of a galaxy is typically used as a proxy for its H i redshift, which in turn allows for the spectral range of the galaxy to be specified.
However, optical and H i galaxy redshifts are known to be typically offset from one another. In their study of the impact of redshift uncertainties on spectral line stacking, Maddox et al. (2013) compare various sets of optical and H i galaxy redshifts. They find the differences between optical redshifts from the MPA-JHU catalogue and ALFALFA H i redshifts for 6419 galaxies to be well modelled as the sum of two Gaussians with means of 1.94 and −1.53 km s−1, and standard deviations of 11.54 and 35.56 km s−1, respectively. They use these Gaussian offsets to carry out some simple stacking experiments. They also use offsets as large as 250 km s−1, inspired by the redshift survey of the Great Observatories Origins Deep Survey (GOODS) South field conducted by Balestra et al. (2010) at low spectral resolution. For galaxies with more than one spectrum, Balestra et al. (2010) find the accuracy of a single measurement to be ± 255 km s−1.
In this work, we model the uncertainties in H i redshift as being Gaussian distributed about 0 km s−1, with standard deviations σz = 50, 100, and 250 km s−1. Given |$\delta _z^i$| as the redshift uncertainty of galaxy i (drawn from one of these three distributions) that has a true H i redshift |$z^i_{\mathrm{true}}$|, the spectral centre of its corresponding subvolume is placed at the channel corresponding to |$z^i_{\mathrm{true}}+\delta _z^i$|. Note that because the random redshift uncertainties (i.e. |$\delta _z^i$|) are Gaussian distributed (with a mean of zero), they can be positive or negative. We also generate a version of each spectrum that is based on a redshift uncertainty of 0 km s−1.
In total, we produce 3 × 3 × 4 = 36 co-added spectra in this work. Figs 2, 3, and 4 show the co-adds based on the long, mid-length, and short spectra, respectively. In each of the figures, columns 1, 2, and 3 correspond to the co-adds based on aperture sizes of 15, 30, and 45 arcsec, respectively. Row 1 in each of the figures corresponds to co-adds based on spectra that have redshift uncertainties of 0 km s−1, whereas rows 2, 3, and 4 correspond to spectra assuming the Gaussian-distributed redshift uncertainties with standard deviations σz = 25, 50, and 100 km s−1.
4 DISCUSSION
The various co-adds shown in Figs 2, 3, and 4 contain a wealth of information on the effects that spatial and spectral aperture sizes and redshift uncertainties have on the characteristics of co-added spectra. Throughout this work, when we state a co-added mass, we refer specifically to an integral of the average co-added mass per channel over the velocity range (−300)–(300) km s−1 about the stack centre. This is the maximum velocity range that the H i line emission from a galaxy is expected to span.
4.1 Purity and S/N
If the appropriate corrections are not applied, a co-added spectrum will seldom provide an accurate measure of the total H i mass of a galaxy sample. This limitation is due primarily to source confusion in the H i data cube, but also to the specific methodology used to generate the co-add.
In this work, we use the term ‘purity’ to refer to the fractional contribution of target galaxies to the total mass in a co-added spectrum. Purity is inversely related to the amount of contaminant flux in a stacked spectrum. The other very important characteristic of a co-added spectrum is its S/N. Indeed, the main goal of stacking is to combine many low-S/N spectra into a single average spectrum with higher S/N. In this work, we conservatively define S/N for a co-added spectrum as the ratio of the total flux integrated over the central ±300 km s−1 of the co-add to N × RMSco-add, where RMSco-add is the RMS value of a corresponding reference co-added spectrum and N is the number of channels making up the central ±300 km s−1 of the co-add. This conservative definition of S/N does not have its denominator scaling as |$\sqrt{N}$|, as other expressions for integrated signal to noise typically do. Rather, our definition specifies the total integrated signal as a function of the maximum possible amount of integrated noise. S/N ratios presented in this work will therefore be lower than those based on the more standard definition, presented in other studies. For each constituent galaxy spectrum extracted from the cube, another subvolume of the same size is extracted at a position offset by 150 arcsec in both right ascension and declination.4 This serves as the reference spectrum for the galaxy. All of the galaxy reference spectra are stacked to yield the final reference spectrum for a co-add. The RMS value of the reference spectrum (i.e. RMSco-add) is shown as the horizontal magenta line in all the panels of Figs 2, 3, and 4.
Fig. 6 shows S/N as a function of purity for all 36 of the co-added spectra shown in Figs 2, 3, and 4. The black, blue, and red symbols represent the co-adds based on long, mid-length, and short spectra (i.e. Figs 2, 3, and 4). The colour of a symbol is therefore indicative of the spectral aperture size used to generate the co-add. The size of any symbol is proportional to the size of the spatial aperture used to generate the co-add (i.e. 15, 30, or 45 arcsec). Finally, the type (or shape) of a symbol is indicative of the H i redshift uncertainties of the constituent spectra used to generate a particular co-added spectrum.

Signal-to-noise ratio (S/N) as a function of purity (fractional contribution of target galaxies to total co-added signal) for the co-added spectra shown in Figs 2, 3, and 4. The black, blue, and red symbols represent the long, mid-length, and short spectra total mass co-adds (i.e. black histograms in Figs 2, 3, and 4), respectively. The size of a symbol is proportional to the size of the aperture used to extract the spectra, and the type of symbol represents the standard deviation of the Gaussian-distributed redshift offsets applied to them. Both purity and S/N are affected by choices of spatial and spectral aperture sizes as well as redshift uncertainties.
Fig. 6 clearly shows both purity and S/N to be affected by the choices of spatial and spectral aperture sizes as well as redshift uncertainties. Furthermore, the two quantities are linked to one another in a roughly linear fashion. Digging deeper, we see that the first common characteristic shared by all sets of spectra is the manner in which the purity of a spectrum increases with decreasing spatial aperture size. All co-adds based on 15 arcsec (i.e. 1 beamwidth) spatial apertures have purity ⪆0.8. Notably, for all co-adds, purity does not vary significantly with increasing redshift offset: for given spatial and spectral aperture sizes, purity decreases by approximately 5 per cent as redshift uncertainties increase. A second characteristic shared by all sets of co-added spectra is the slightly counter-intuitive manner in which S/N (like purity) generally decreases significantly with increasing spatial aperture size. Put differently, more co-added mass does not result in a higher S/N, at least not according to the way we define S/N in this work. As expected, S/N also decreases with increasing redshift offsets. The typical shape of the co-added profile degrades significantly (becomes more spectrally extended) with increasing redshift offset as well; however, for a given typical redshift offset, the shape varies relatively little with increasing spatial aperture size.
Fig. 6 also illustrates the undesirable effects on co-added spectra that can arise from using small spectral apertures when redshift uncertainties are high. Considering only the 12 small symbols in Fig. 6 (corresponding to 15 arcsec apertures), the black symbols represent the co-adds based on long (velocity length: 2184 km s−1) constituent spectra. Clearly, purity and S/N are roughly constant regardless of H i redshift uncertainties. It is only when H i redshift uncertainties are of the order of 250 km s−1 that S/N noticeably decreases. However, the situation is different for the co-adds based on mid-length and short constituent spectra (blue and red symbols, respectively). These co-adds all span a narrow range in purity, yet a large S/N range. Specifically, S/N decreases as H i redshift uncertainties increase. This is due to the manner in which the H i emission from a galaxy can be completely ‘missed’ if a mid-length or short spectral aperture is placed at the wrong position in the cube (due to large H i redshift uncertainties). A detailed example of how spectral ranges and redshift offsets conspire to decrease the purity and S/N of constituent spectra is presented in Appendix A.
H i redshift offsets clearly play an important role in setting the optimal spectral apertures sizes to use in an H i stacking experiment. If H i redshift offsets are known or expected to be ≥100 km s−1, there is no benefit in using very short spectral apertures [velocity length: W50/sin (i)] over mid-length spectral apertures (velocity length: W50). This behaviour therefore eliminates the need for estimates of galaxy inclinations from optical imaging. However, given the manner in which S/N decreases significantly when H i redshift uncertainties are high (≥100 km s−1), a good strategy for maximizing the S/N of a stacked spectrum is to use long spectral apertures (not based on W50 measures), at the expense of lowering the purity of the co-added spectrum.
Clearly, the optimal combination of spatial and spectral aperture sizes in the presence of non-zero H i redshift offsets needs to be uniquely determined for a given H i stacking experiment. Our simulations and the methods presented in this work can be used to make such determinations to a high level of accuracy.
4.2 Average H i mass
While purity and S/N are two important characteristics of a co-added spectrum, the other very important property is the level of accuracy with which the average H i galaxy mass can be determined from the co-add. Relatively few of the co-adds shown in Figs 2, 3, and 4 yield an accurate measure of the true (evaluated) average H i galaxy mass, |$\langle \mathrm{\mathit{ M}_{H\,\small{I}}}\rangle _\mathrm{true}\approx 3.3\times 10^9\, \mathrm{ M}_{\odot }$|.
Fig. 7 shows the average H i mass measured from a co-added spectrum, 〈MH i〉co-add, relative to 〈MH i〉true as a function of purity. Co-adds based on large spatial apertures with H i redshift uncertainties ≤50 km s−1 overestimate 〈MH i〉true by factors of ∼1.4–1.7. This overestimate is expected given the high level of contamination (i.e. low purity) of these co-adds. For H i redshift uncertainties ≥100 km s−1, 〈MH i〉co-add/〈MH i〉true approaches unity. However, this convergence is a serendipitous result of the manner in which the large redshift uncertainties degrade the shape and lower the S/N of the co-added spectrum; it does not suggest that overestimates of 〈MH i〉true can be compensated for by large uncertainties in one’s optical redshift catalogue.

Ratio of co-added H i mass to true H i mass, 〈MH i〉co-add/〈MH i〉true, as a function of purity for the co-added spectra shown in Figs 2, 3, and 4. The black, blue, and red symbols represent the long, mid-length, and short spectra total mass co-adds (i.e. black histograms in Figs 2, 3, and 4), respectively. The size of a symbol is proportional to the size of the aperture used to extract the spectra, and the type of symbol represents the standard deviation of the Gaussian-distributed redshift offsets applied to them. A co-added spectrum seldom yields an accurate measure of the true H i mass of a galaxy sample. The size of the spatial aperture used to extract the constituent spectra largely determines the total amount of flux in the corresponding stacked spectrum.
〈MH i〉co-add/〈MH i〉true ratios for co-added spectra based on small (15 arcsec) spatial apertures are all too low by factors of ∼0.4–0.8, despite these co-adds having the highest purities. This combination of high purity and 〈MH i〉co-add/〈MH i〉true < 1 is a result of the manner in which a spatial aperture smaller than three times the half-power width of the resolution element naturally probes only a subset of the spatial area over which the H i flux of a point source5 is spread. Co-added spectra based on small spatial apertures will never recover all target galaxy flux. To achieve full recovery, spatial apertures of size three times or more the half-power width of the resolution element are required. However, as mentioned above, these co-added spectra are highly contaminated, thereby yielding 〈MH i〉co-add/〈MH i〉true > 1.
Co-added spectra based on 30 arcsec spatial apertures yield 〈MH i〉co-add/〈MH i〉true similar to those based on the 45 arcsec spatial apertures. When H i redshift uncertainties are small (≤50 km s−1), 〈MH i〉true is overestimated by factors of ∼1.2–1.4. When H i redshift uncertainties are high (≥100 km s−1), 〈MH i〉true is underestimated.
Finally, in all of the co-adds presented in Figs 2, 3, and 4, the total co-added mass including noise (i.e. the black histograms) is slightly less than the total co-added noise-free mass (i.e. the blue histograms). In Appendix B, we discuss how this negative contribution of noise to the total co-added mass is entirely consistent with the Gaussian properties of the noise.
The results presented in this section show it to be impossible to generate a co-added spectrum that has simultaneously high purity, high S/N, and 〈MH i〉co-add/〈MH i〉true ≈ 1. This result again points to the unavoidable need to fully model an H i stacking experiment in the ways presented in this paper, in order to accurately quantify the characteristics of the co-added spectrum and, very importantly, to apply the necessary corrections to any parameters inferred from it (e.g. average H i galaxy mass, ΩH i, etc.).
5 CONCLUSIONS
In this work, we have presented the details of a suite of H i spectral line stacking experiments based on synthetic data cubes that mimic the characteristics of observations that will be carried out as part of the MeerKAT deep H i survey, LADUMA. We have used various combinations of spatial and spectral aperture sizes as well as H i redshift uncertainties to generate co-added spectra. We analyse the properties of the co-adds in light of the specific details of how they were generated.
All co-added spectra contain a significant amount of contaminant emission due to source confusion. The amount of contaminant emission rises quickly with increasing spatial aperture size. Spectral aperture size also affects the contamination level. H i redshift uncertainties have a relatively small effect on the contamination level, yet significantly impact the S/N of a co-added spectrum. Using small spectral apertures in the presence of large H i redshift uncertainties can lead to a reduction in S/N by a factor of ∼2 without lowering the contamination level. A very clear conclusion drawn from our results is the fact that a co-added spectrum does not produce a reliable measure of the total H i mass of a galaxy sample. Regardless of spectral aperture size, large/small spatial apertures always over/underestimate the total H i mass. H i redshift uncertainties need to be carefully considered since they may lower the difference between the true H i mass and the co-added mass, yet in a highly unreliable manner. Our results suggest that H i spectral line stacking experiments based on LADUMA imaging will yield co-added spectra with a contamination level of up to 20 per cent and with S/N (using our new conservative definition) of up to ∼3.6
Given that almost all co-added H i spectra contain a non-negligible component of contaminant emission, the appropriate corrections must be made to the total co-added mass in order to retrieve an accurate measure of the contribution from target galaxies alone. Results based on mock stacking experiments such as ours provide a means of doing so in a reliable manner. The scientific impact of H i stacking experiments based on high-redshift imaging from forthcoming facilities such as MeerKAT, ASKAP, and ultimately the Square Kilometre Array will be significantly determined by the extent to which we can quantify and correct for the various observational and systematic uncertainties present in quantities derived from co-added spectra.
ACKNOWLEDGEMENTS
ECE acknowledges the financial assistance of the South African Radio Astronomy Observatory (SARAO) towards this research (http://www.ska.ac.za). This work is based on research supported in part by the National Research Foundation of South Africa (Grant Number 115238). AJB acknowledges support from a Fulbright scholarship during the early stages of this work. All authors sincerely thank the anonymous referee for insightful feedback that certainly improved the overall quality of this work.
Footnotes
15 arcsec corresponds to the central frequency, ν = 820 MHz, of the redshift range z = 0.7–0.758.
The exact number of individual spectra extracted from the cube depends on the relevant proximities of the galaxies to the edges of the cube. More specifically, the proximity of the subvolume (containing the H i line emission of a galaxy) to the edges of the cube is dependent on the choice of sizes of the spatial and spectral apertures used to extract it.
The width of the fiducial beam is calculated using the central frequency ν = 820 MHz of the redshift range z = 0.7–0.758.
For reference: the typical separation between galaxies in our cube with |$M_*\ge 10^{10}\, \mathrm{ M}_{\odot }$| is ∼0.55 deg.
All high-redshift galaxies observed with MeerKAT will be spatially unresolved, so the great majority of their flux will be spatially distributed over the area of the synthesized beam.
In this work, we define S/N as the ratio of the total co-added flux to co-added noise over the central ±300 km s−1 of a stacked profile. Alternative definitions (e.g. relying on the peak co-added flux) will deliver higher S/N but still need to be validated by simulations like those used in this paper.
REFERENCES
APPENDIX A: SPECTRAL RANGE
When a galaxy spectrum is extracted from a data cube, some estimate of its spectral extent is valuable. Extracting an unnecessarily long spectrum will lead to the inclusion of unwanted contaminant emission from neighbour galaxies. A spectral range matching that of the galaxy is expected to minimize the level of contamination in the spectrum. The Tully–Fisher (TF) relation can be used to estimate the spectral extent of a galaxy. However, if we are contending with large H i redshift uncertainties, extracting galaxy spectra over a small spectral range can be extremely detrimental to the purity and S/N of the co-add to which they contribute.
In Fig. A1, each panel shows two spectra associated with one of two different target galaxies, extracted in several ways from our synthetic H i data cube. In each panel, the green spectrum represents emission from the target galaxy, whereas the grey spectrum is the combined emission from the target and other neighbour galaxies (noise is excluded from this example). The solid and red-dotted vertical lines represent the TF spectral ranges of the galaxy based on (1) an assumed inclination of 90 deg and (2) the evaluated (true) inclination.

Various spectra associated with each of two different target galaxies. All spectra are extracted using a spatial aperture size of 30 arcsec. The top two panels show the spectra of the first galaxy, assuming redshift offsets of 0 and 150 km s−1. The bottom two panels show the spectra of the second galaxy, assuming redshift offsets of 0 and −150 km s−1. In each panel, the green spectrum represents emission from the target galaxy, whereas the grey spectrum is the combined emission from the target and other neighbour galaxies (noise is excluded from this example). The black-dashed vertical lines delimit a spectral range of ±300 km s−1 about the centre of the spectrum. The solid and red-dotted vertical lines represent the TF spectral ranges of the galaxy based on (1) an assumed inclination of 90 deg and (2) the evaluated (true) inclination, respectively. The actual (evaluated) galaxy inclination is given in the top left of each panel. A combination of a small spectral extraction range and a large redshift offset can yield an extracted spectrum that completely ‘misses’ the emission from a target galaxy of interest, thereby yielding a high level of contamination in the extracted spectrum.
The first (top) panel shows the spectrum of the first galaxy, extracted using a redshift offset of 0 km s−1. It is clear that the two TF-based specifications of the spectral range are very similar to one another and that they both do a good job of excluding the majority of the contaminant emission present within the full (grey) spectrum. Importantly, they also both neatly capture all of the target galaxy emission. Panel 2 shows spectra for the same galaxy as in panel 1, this time assuming a redshift offset of 150 km s−1. Due to the large offset, large fractions of target galaxy emission are shifted rightward of the spectral ranges expected to contain the target galaxy emission. Worse still is the fact that additional contaminant emission enters both of the TF-based spectral ranges from the left. The purity of this spectrum is therefore doubly affected, in a detrimental way, by the non-zero redshift offset and the narrow spectral ranges specified by the TF relation.
The third and fourth panels of Fig. A1 show the unshifted and shifted versions of the spectra for a second target galaxy from our synthetic cube. This time the galaxy spans a much smaller spectral range. The full (grey) spectrum is also highly contaminated by emission from neighbour galaxies. For a redshift uncertainty of 0 km s−1 (third panel), both TF-based specifications of the target galaxy spectral range do a good job of cutting out almost all of the contaminant emission in the full spectrum. For the case of a −150 km s−1 redshift offset (fourth panel), the situation is entirely different. The large offset shifts approximately half of the target galaxy emission leftward of the wider TF spectral range (solid red lines), while also adding a lot of contamination from the right. For the narrower TF spectral range based on the evaluated inclination of the galaxy (red-dotted lines), all of the target galaxy emission is shifted leftward beyond its limits, while more contamination is again added from the right. In this case, the combination of a small spectral extraction range and a large redshift uncertainty leads to the galaxy being completely ‘missed’, with its extracted spectrum containing only noise. If redshift offsets are high, the spectral extraction range for each galaxy needs to be kept large enough to ensure that the emission from the target is indeed captured, albeit at the expense of more contamination being introduced. Simulations such as ours should be used to find the optimal method(s) of specifying the spectral range.
APPENDIX B: CO-ADDED NOISE
For each of the co-adds shown in Figs 2, 3 and 4, the total co-added mass (i.e. black histogram) is less than the co-added galaxy mass (blue histogram). In other words, the noise in our synthetic cube makes negative contributions to the total co-added mass. This result, although unexpected, is indeed statistically correct and is a consequence of the fact that extracting subvolumes from the full-size cube at the positions of the |$M_*\ge 10^{10}\, \mathrm{ M}_{\odot }$| galaxies constitutes a single realization of the many ways in which a collection of subvolumes can be extracted. Repeating such an experiment many times using different sets of constituent spectra does indeed demonstrate that noise on average makes zero contribution to the total co-added mass.
In this section, we demonstrate this fact by carrying out three experiments in which we use our synthetic noise cube to generate 1000 co-adds each consisting of 2700 constituent spectra. Each constituent spectrum is based on a subvolume with pixel dimensions N × N × 230. For our three experiments, we use N = 5, 10, and 15 to match the spatial pixel dimensions of our long spectra based on 15, 30, and 45 arcsec spatial apertures. The three panels in Fig. B1 show the distributions of the sums of the 1000 co-adds for each of our N = 5, 10, and 15 experiments. The co-add sums are clearly approximately Gaussian distributed with a mean very close to 0 Jy. In each panel of Fig. B1, the mean co-add sum is marked by the black-dotted vertical line, while the red-dotted vertical lines mark the ±1σ values. The (negative) contributions from noise to the total masses in the co-adds shown in Figs 2, 3, and 4 are entirely consistent with the results presented in Fig. B1. The most directly comparable cases are the co-adds based on long spectra with redshift offsets of 0 km s−1 (i.e. top row of Fig. 2). For the 15, 30, and 45 arcsec co-adds, the respective noise contributions are |$(2.54\!-\!2.70)\times 10^9\, \mathrm{ M}_{\odot }$| = |$-1.6\times 10^8\, \mathrm{ M}_{\odot }$|, |$(4.87\!-\!5.27)\times 10^9\, \mathrm{ M}_{\odot }$| = |$-4.0\times 10^8\, \mathrm{ M}_{\odot }$|, and |$(5.78\!-\!6.35)\times 10^9\, \mathrm{ M}_{\odot }$| = |$-4.8\times 10^8\, \mathrm{ M}_{\odot }$|. These contributions are marked by the blue-dashed vertical lines in Fig. B1. In each case, the contribution is contained well within one standard deviation of the mean sum of the 1000 co-adds generated from our noise cube.

Distributions of summed H i mass for 1000 co-adds, each consisting of 2700 constituent subvolumes extracted from our smoothed noise cube. The subvolumes have pixel dimensions of N × N × 230, where N = 5, 10, and 15 (top to bottom panels). The mean, μ, of each distribution is marked by the black-dashed line, and μ ± 1σ are marked by the red-dashed lines. The solid blue line in each panel represents total H i mass contributed by noise in our co-added spectra shown in the top row of Fig. 2. In all cases, the (negative) amount of co-added noise is contained well within a single standard deviation of the mean sum of a co-add extracted from the noise cube.
For each of the distributions shown in Fig. B1, the mean co-add sum (indicated by the black-dashed line) is slightly offset from zero. However, the smoothed noise cube from which all of the constituent spectra were extracted has a mean voxel value μ = −5.06277 × 10−9 Jy beam−1. This very small, yet negative, mean value yields the observed non-zero mean co-add sums shown in the panels of Fig. B1. For example, for the middle panel, N × N × 230 × μ = −1.16 × 10−4 Jy beam−1, which does indeed convert almost exactly to the mean co-added mass of |$=-0.146\times 10^9\, \mathrm{ M}_{\odot }$|. This suggests that when given a real cube that has its aggregate line emission dominated by noise (which may also include unsubtracted continuum residuals), co-added fluxes should be appropriately offset in order to account for biases introduced by the noise statistics (specifically a non-zero noise level).