Abstract

Acoustic waves travelling through the early Universe imprint a characteristic scale in the clustering of galaxies, QSOs and intergalactic gas. This scale can be used as a standard ruler to map the expansion history of the Universe, a technique known as baryon acoustic oscillations (BAO). BAO offer a high-precision, low-systematics means of constraining our cosmological model. The statistical power of BAO measurements can be improved if the ‘smearing’ of the acoustic feature by non-linear structure formation is undone in a process known as reconstruction. In this paper, we use low-order Lagrangian perturbation theory to study the ability of 21-cm experiments to perform reconstruction and how augmenting these surveys with galaxy redshift surveys at relatively low number densities can improve performance. We find that the critical number density which must be achieved in order to benefit 21-cm surveys is set by the linear theory power spectrum near its peak, and corresponds to densities achievable by upcoming surveys of emission line galaxies such as eBOSS and DESI. As part of this work, we analyse reconstruction within the framework of Lagrangian perturbation theory with local Lagrangian bias, redshift–space distortions, |${\boldsymbol {k}}$|-dependent noise and anisotropic filtering schemes.

1 INTRODUCTION

In the last decade, it has been realized that the large-scale structure in the Universe can be used as a tool for measuring its expansion history with high accuracy and low systematics. One of the premier methods for measuring the distance-scale and expansion rate uses the baryon acoustic oscillation (BAO) ‘feature’ as a calibrated, standard ruler (see Particle Data Group 2014, for a review). Non-linear evolution of the large-scale structure in the Universe damps the acoustic oscillations in the power spectrum at late times (as has been extensively discussed in the literature, e.g. Bharadwaj 1996; Taylor & Hamilton 1996; Meiksin, White & Peacock 1999; Crocce & Scoccimarro 2008; Padmanabhan & White 2009; McCullagh & Szalay 2012; Tassev & Zaldarriaga 2012a; Schmittfull et al. 2015). The modes responsible for the broadening of the peak are of quite long wavelength (Eisenstein, Seo & White 2007a) and, as pointed out by Eisenstein et al. (2007b), these modes are also generally well measured by a survey aiming to do BAO. Thus, the impact of the non-linear evolution can be modelled and reduced by a process known as reconstruction (Eisenstein et al. 2007b; Padmanabhan et al. 2012). Reconstruction greatly improves fits to the distance scale using the BAO feature (Seo et al. 2010a; Anderson et al. 2014).

Traditionally, BAOs have been measured in galaxy surveys (e.g. Anderson et al. 2014, for the most recent detections), or in the intergalactic medium (Busca et al. 2013; Kirkby et al. 2013; Slosar et al. 2013; Delubac et al. 2015) either directly or in cross-correlation with QSOs (Font-Ribera et al. 2014). In recent years, technological advances have made it feasible to use 21-cm surveys to measure large-scale structure and, in principle, the BAO scale at redshifts z ∼ 1–2. In advance of a detection a wide variety of technologies are being investigated, ranging from large arrays of dishes (e.g. BAORadio,1 HIRAX, SKA1-MID2), to large single dishes (e.g. FAST3), to arrays of antenna tiles (e.g. BAOBAB4), to arrays in the focal plane of a large reflector (e.g. GBT-HIM, Parkes, BINGO5), to arrays of cylindrical reflectors (e.g. CHIME,6 ORT,7 Tianlai8). Each of these approaches has its advantages and difficulties. One difficulty that they all share in using redshifted 21-cm emission as a cosmological probe is that the signal is dwarfed by foreground9 contamination (Furlanetto, Oh & Briggs 2006; Chang et al. 2010). This renders many longer wavelength modes in such experiments unusable, and this can have a significant impact on the ability of such experiments to measure BAO (see Seo & Hirata 2015, for further discussion).

The literature contains very different estimates for the ultimate impact of foregrounds, depending upon assumptions about how well one can model the instrument. Some authors claim that foregrounds can be removed down to k ≃ 0.02 h Mpc−1 (Shaw et al. 2014, 2015) while others claim that all line-of-sight modes with k < 0.1 h Mpc−1 and modes with k < 0.6 k would be significantly contaminated (e.g. Liu, Parsons & Trott 2014; Pober 2015, building upon Datta, Bowman & Carilli 2010; Vedantham, Udaya Shankar & Subrahmanyan 2012; Morales et al. 2012; Parsons et al. 2012). We shall consider a range of possibilities motivated by these investigations.

Not surprisingly, the loss of long-wavelength modes causes a significant reduction in the BAO signal and, even more dramatically, in the ability to perform reconstruction on such a survey. We note that at high redshift and large scales, we expect the k-modes to be almost independent of one another, and thus there is no interpolation or filtering scheme that can compensate for a lost mode.10 If the 21-cm survey is unable to measure a given mode of the density field it must be supplied by other means. In this paper, we investigate whether a very sparse tracer of the density field, such as QSOs or emission line galaxies (ELGs), can be used to recover some of the missing, large-scale modes and improve reconstruction.

Specifically, we investigate how reconstruction is affected by missing modes using low-order Lagrangian perturbation theory. We extend this theory, in the context of reconstruction, to include filtering and missing modes (see also Seo et al. 2015, for similar topics) and anisotropic noise. Missing modes can be thought of as modes with infinite noise. A second sample can be used to ‘fill in’ the modes missed by a 21-cm survey, so that the noise in that region of k-space is set by the properties of the second sample. To be concrete, we consider QSOs and ELGs as tracers of the high-z, large-scale density field, since surveys covering large areas of sky with spectroscopic redshifts of objects in the appropriate redshift range are in progress. As an example, the eBOSS11 survey (Dawson et al. 2015) will obtain spectroscopic redshifts for more than 500 000 QSOs with 0.9 < z < 2.2 over 7500 deg2 of sky (Myers et al. 2015), significantly extending the existing samples in this redshift range. It will also measure redshifts for 190 000 ELGs in the range 0.7 < z < 1.1 over 750–1500 deg2. In the future DESI12 will generate samples of ELGs (and QSOs) with even higher number density and wider redshift coverage.

As might be expected, there is little gain in using tracers whose shot-noise exceeds their clustering power on the scales relevant for computing the displacements. Which tracers are useful in this context thus depends on the modes which a 21-cm survey is unable to access. In its standard configuration, QSOs are shot-noise limited at all k for the eBOSS surveys, though one could imagine similar surveys which could go deeper. Similarly the Ly α forest measured by BOSS and eBOSS provides good sampling, but long-wavelength modes along the line of sight can be contaminated by continuum modelling. Perhaps the best choice is the eBOSS ELG survey near z ≃ 1 which will provide a useful sample to augment reconstruction or, in the future, the DESI sample where even the lower density (but higher bias), high-z tail will be of use.

The outline of the paper is as follows. In Section 2, we use low-order Lagrangian perturbation theory to characterize reconstruction in the presence of noise, and then consider missing modes as modes with infinite noise. To avoid modes with infinite noise from erasing all the effects of reconstruction, we introduce a Wiener filter into the reconstruction scheme. In Section 3, we generalize our treatment to include anisotropic noise (such as the ‘wedge’ for 21-cm experiments; Datta et al. 2010) and redshift–space distortions and bias, and then show that the addition of even sparsely sampled objects to data where modes were previously missing can improve reconstruction. We show how this depends upon the noise of the added field and the geometry of the ‘wedge’. We conclude in Section 4. A discussion of Eulerian perturbation theory is provided in an appendix, as it is a particularly physical way of viewing reconstruction and seeing how the missing modes limit the effects of reconstruction.

2 RECONSTRUCTION WITH NOISE

Lagrangian perturbation theory has proven particularly useful as an approximate, analytic model of reconstruction (Noh, White & Padmanabhan 2009; Padmanabhan, White & Cohn 2009; Tassev & Zaldarriaga 2012b; Seo et al. 2015; White 2015). In this section, we review the Lagrangian framework, the reduction in signal-to-noise ratio that arises from non-linear structure formation and how to model reconstruction within this formalism. We begin with reconstruction for an isotropic system, to set notation and identify a few key features when noise is included. The generalization to include anisotropic noise and redshift–space distortions is presented in Section 3.

2.1 Review: peak broadening in Lagrangian perturbation theory

The modification of the power spectrum by non-linear evolution can be seen in the Lagrangian approach as follows (e.g. Matsubara 2008; Padmanabhan et al. 2009). If we denote a particle's initial, Lagrangian, position by |${\boldsymbol {q}}$| and its final, Eulerian, position by |${\boldsymbol {x}}$| then the Lagrangian displacement is defined through |${\boldsymbol {x}}={\boldsymbol {q}}+{\Psi }({\boldsymbol {q}})$|⁠. Assuming a local, Lagrangian bias defined by |$F[\delta _L({\boldsymbol {q}})]$| the density is (Matsubara 2008)
(1)
with
(2)
An unbiased tracer of the density field has F[λ] ∝ δ(D)(λ), while a tracer with linear bias b has the average of F′ over the Gaussian distribution of δL equal to b − 1 (Matsubara 2008). To leading order in the linear density contrast, δL,
(3)
(Zel'dovich 1970). Using the cumulant theorem and the fact that δL, and hence Ψ, is Gaussian one obtains (see Appendix A) a damping of the oscillations (e.g. Bharadwaj 1996; Taylor & Hamilton 1996; Meiksin et al. 1999; Eisenstein et al. 2007a; Crocce & Scoccimarro 2008; Matsubara 2008)
(4)
(We have assumed a scale-independent bias as appropriate for large scales and PL is the linear dark matter power spectrum.) The damping of the linear power spectrum (or equivalently the smoothing of the correlation function) reduces the contrast of the feature and the precision with which the size of ruler may be measured. The damping scale is set by the mean-squared Zel'dovich displacement of particles,
(5)
As we shall see, reconstruction decreases Σ2 (and also undoes the shift of the peak position from its linear value – see Padmanabhan et al. 2009; Sherwin & Zaldarriaga 2012 for further discussion). We will be studying the effects of reconstruction by considering changes to |$\mathcal {D}(k)$|⁠.

2.2 Review: reconstruction algorithm

Reconstruction ‘undoes’ the effects of non-linearity using the measured large-scale density field to infer the shifts that galaxies have undergone due to gravitational instability. The algorithm devised by Eisenstein et al. (2007b) consists of the following steps.

  • Smooth the halo, galaxy or 21-cm density field with a kernel W (see below) to filter out small scale (high k) modes, which are difficult to model. Divide the amplitude of the overdensity by an estimate of the large-scale bias, b, to obtain a proxy for the overdensity field: |$\delta (\boldsymbol {x})$|⁠.

  • Compute the shift, |$\boldsymbol {s}$|⁠, from the smoothed density field in redshift–space using the Zeldovich approximation (this field obeys |$\nabla \cdot \boldsymbol {R}\boldsymbol {s}=-\delta$| with |$R_{ij}=\delta _{ij}+(f/b)\hat{z}_i\hat{z}_j$|⁠). Once |$\boldsymbol {s}$| is obtained, multiply the line-of-sight component by 1 + f to approximately account for redshift–space distortions (see below for further discussion).

  • Move the galaxies by |$\boldsymbol {s}$| and compute the ‘displaced’ density field, δd.

  • Shift an initially spatially uniform distribution of particles by |$\boldsymbol {s}$| to form the ‘shifted’ density field, δs.

  • The reconstructed density field is defined as δr ≡ δd − δs with power spectrum |$P_{\rm rec}(k)\propto \langle | \delta _r^2|\rangle$|⁠.

Following Eisenstein et al. (2007b), we use a Gaussian smoothing of scale R for W, specifically W(k) = exp [−(kR)2/2]. Our R is canonically defined for a Gaussian smoothing, but alternative definitions of R exist in the literature,13 so care must be taken in comparisons. We take R = 10 h−1Mpc unless otherwise noted. Throughout we shall assume that the fiducial cosmology, bias and |$f=\text{d}\ln D/\text{d}\ln a\simeq \Omega _m^{0.55}$| are properly known during reconstruction. Various tests of the Eisenstein et al. (2007b) reconstruction algorithm and sensitivity to parameter choices have been performed in the literature. We refer the reader to Seo et al. (2010a), Padmanabhan et al. (2012), Xu et al. (2013), Burden et al. (2014), Tojeiro et al. (2014), Vargas-Magaña et al. (2014), Seo et al. (2015) which also contain useful details on the specific implementations. Note that we have chosen to perform ‘anisotropic reconstruction’, in which the shifted and displaced fields both include the factor of 1 + f in the line-of-sight direction as Lagrangian perturbation theory seems to model this algorithm better (White 2015). Our implementation of anisotropic reconstruction follows White (2015) and differs slightly from that in Seo et al. (2015). We correct for redshift–space distortions (to linear order) in defining |$\boldsymbol {s}$| in terms of the observed density field but include the factor of 1 + f in the line-of-sight component for both the displaced and shifted fields. We shall make further comparison with Seo et al. (2015) later.

With these definitions the displaced field is14
(6)
while the ‘shifted’ density field is
(7)
and hence
(8)
The reconstructed power spectrum is |$P(k)\propto \langle | \delta _r^2|\rangle$|⁠, which can be evaluated through use of the cumulant theorem.

2.3 Reconstruction with noise

In current galaxy redshift surveys targeted at BAO, the number density of tracers is such that noise is relatively insignificant. However, this need not be the case in general. When noise is present in the observed field, it will also contribute to the shifted field (White 2010). Assuming that the signal approaches linearity on large scales, for sufficiently large smoothing the general expression for the shifted field obeys
(9)
where we have written the noise contribution |$\delta _N({\boldsymbol {k}})$| explicitly.15
In equation (9) the weight, |$\mathcal {S}(k)$|⁠, applied to the density field is a combination of the usual Gaussian smoothing kernel and a noise-suppression factor,
(10)
Here, PL and PN are inputs to the algorithm, taken to be theoretical, linear power spectra estimates for the signal and noise, respectively. We shall assume these are the ‘true’ signal and noise spectra, since the procedure can be iterated.
We have not attempted to optimize fN. Instead we have chosen it to be a Wiener filter which acts to suppress noise in |${\boldsymbol {s}}$| from modes where PN is as large (or larger) than the cosmological signal.
(11)
If such a factor fN is not included then a very noisy density field will lead to almost random ‘shifts’ which wash out the structure rather than reconstructing it. A Wiener filter was also suggested by Seo & Hirata (2015, equation 17), though they in practice used a Gaussian with a modified smoothing instead.

2.4 Isotropic example

The effects of the noise PN and the filtering fN can be seen clearly in a simplified isotropic example, where |$P_N({\boldsymbol {k}})= P_N(k)$|⁠, redshift–space distortions are neglected, and the bias b = 1. In this limit, the leading terms of the reconstructed power spectrum are |$P_{\rm rec}({\boldsymbol {k}})=\mathcal {D}({\boldsymbol {k}})P_L(k) + \dots$| with16
(12)
Here
(13)
and Σ2 is the unreconstructed damping (equation 5). This is equivalent to, but written slightly differently than, previous results (see later). This form isolates both the damping |$\Sigma _{ss}^2$| remaining after reconstruction (which also contains the full effect of the noise in this approximation) and the damping Σ2 in the absence of reconstruction. With no filter (i.e. fN = 1), increasing the noise clearly increases |$\Sigma _{ss}^2$|⁠, increasing the overall damping of the reconstructed linear power spectrum.17

Taking fN to be the Wiener filter, 1/(1 + PN/PL), the contributions |$\mathcal {S}^2[P_L + P_N]$| in the damping scale in equation (13) go to zero when |$P_N({\boldsymbol {k}})$| is large (⁠|$\mathcal {S}P_L$| also tends to zero in this limit). The very noisy modes thus do not contribute significantly to |$\Sigma _{ss}^2$| or to |$\Sigma _s^2$|⁠, i.e. to reconstruction. Specifically, |$\Sigma _{ss}^2$| no longer increases with increasing noise as it would if no filter were applied. In the examples below, we shall take the noise to be Poisson shot noise, |$P_N=b^{-2}\bar{n}^{-1}$|⁠, for tracers with number density |$\bar{n}$| and linear bias b. In fact, we shall typically quote the noise levels in terms of an effective number density.

A filter can not only ensure that noisy modes do not destroy reconstruction for all modes; it also modifies reconstruction for the remaining modes, as does the form and level of noise. These effects can be similar to those due to changing the Gaussian smoothing scale R. (Studies changing R include Padmanabhan et al. 2012, Burden et al. 2014, Seo et al. 2015; as mentioned earlier Seo & Hirata (2015) trade a change in R for a filter.) Altering the filter, Gaussian smoothing or noise level does not necessarily have a uniform effect on the reconstruction of different modes: a given change may increase damping for some modes and decrease it for others. In Fig. 1, we show the change in damping, |$\mathcal {D}(k)$|⁠, when changing the smoothing R (top panel, fixing fN = 1 by setting PN = 0) as well as changing the noise level at fixed R (bottom panel). In the latter case, we show the results for two filters: fN = 1 and fN = 1/(1 + PN/PL). Modes at low k and high k have different trends in how well they are reconstructed as the smoothing increases or the filter changes. In particular with a noise-dependent filter and smaller R (R < 10 h−1Mpc, not shown) increasing noise can increase |$\mathcal {D}$| for some low values of k. To summarize, the combination of noise, filter and smoothing affects different k-modes differently, and increasing noise can sometimes improve reconstruction of a specific k-mode. This suggests that a series of filters, each optimized to a specific window and with an optimized shape, could improve reconstruction over what is possible with a single smoothing scale. We shall leave such investigation to a future paper. Henceforth we fix |$f_N = 1/(1+P_N({\boldsymbol {k}})/P_L({\boldsymbol {k}}))$|⁠.

The leading coefficient $\mathcal {D}(k)$ of the reconstructed power spectrum, $P_{\rm rec}(k)={\mathcal {D}}(k)P_L(k)+\dots$, for different smoothings and filters. The larger $\mathcal {D}(k)$, the better the reconstruction for the mode k. Lines at the far right of each plot are ordered as in the legend. Top: the change in ${\mathcal {D}}(k)$ with changes in the Gaussian smoothing scale R. For filtering scale below ∼10 h−1Mpc, $\mathcal {D}(k)$ is larger at large k and smaller at low k than for R > 10 h−1Mpc. Bottom: holding the smoothing R = 10 h−1Mpc, ${\mathcal {D}}(k)$ for different filters and four different choices of isotropic Poisson noise, corresponding to four densities. (Densities are quoted in units of h3 Mpc−3.) Similar to changing the smoothing scale R above, increasing the noise level (decreasing the density $b^2 \bar{n}$) can improve reconstruction [increase ${\mathcal {D}}(k)$] for some filters and values of k and degrade reconstruction for others. This can be seen in Fig. 2 below. Also shown is the unreconstructed signal (magenta dashed line) in the absence of noise. Here and hereon, redshift z = 1 is assumed.
Figure 1.

The leading coefficient |$\mathcal {D}(k)$| of the reconstructed power spectrum, |$P_{\rm rec}(k)={\mathcal {D}}(k)P_L(k)+\dots$|⁠, for different smoothings and filters. The larger |$\mathcal {D}(k)$|⁠, the better the reconstruction for the mode k. Lines at the far right of each plot are ordered as in the legend. Top: the change in |${\mathcal {D}}(k)$| with changes in the Gaussian smoothing scale R. For filtering scale below ∼10 h−1Mpc, |$\mathcal {D}(k)$| is larger at large k and smaller at low k than for R > 10 h−1Mpc. Bottom: holding the smoothing R = 10 h−1Mpc, |${\mathcal {D}}(k)$| for different filters and four different choices of isotropic Poisson noise, corresponding to four densities. (Densities are quoted in units of h3 Mpc−3.) Similar to changing the smoothing scale R above, increasing the noise level (decreasing the density |$b^2 \bar{n}$|⁠) can improve reconstruction [increase |${\mathcal {D}}(k)$|] for some filters and values of k and degrade reconstruction for others. This can be seen in Fig. 2 below. Also shown is the unreconstructed signal (magenta dashed line) in the absence of noise. Here and hereon, redshift z = 1 is assumed.

Now we are in a position to describe the effects of missing modes, i.e. modes which are not observed, on reconstruction. First of all, these modes are not present for measuring the power spectrum, which decreases the number of modes which can be averaged over and thus increases the error in a given k-bin (perhaps to infinity). Their absence also weakens the ability to reconstruct the rest of the modes. By treating missing modes as modes with infinite noise in equation (12), they can be seen to contribute to the original broadening (Σ2 is unchanged) but not to the reconstructed factors |$\Sigma _{ss}^2$| and |$\Sigma _s^2$|⁠. Before we explore this further we generalize our treatment to include anisotropic reconstruction and redshift–space distortions.

3 ANISOTROPIC RECONSTRUCTION

3.1 Formalism

Including redshift–space distortions in our theory, the displacements Ψ and |${\boldsymbol {s}}$| are increased in the line-of-sight direction by a factor 1 + f as noted in Section 2.2. The undamped signal is modified from b2PL to (b + fμ2)2PL, where we have again assumed linear bias b. In addition, the noise power spectra appearing in the expressions below are to be interpreted as the raw noise power divided by [1 + βμ2]2,
(14)
since we ‘remove’ redshift–space distortions when first computing |${\boldsymbol {s}}$| from our smoothed density field. This additional factor also propagates into the Wiener filter expression and thus |$\mathcal {S}({\boldsymbol {k}})$|⁠. The noise power, PN, may depend on the direction of |${\boldsymbol {k}}$| beyond that given by the 1 + βμ2 term; for example the 21-cm wedge corresponds to direction dependent missing modes (i.e. modes with very large noise).
The reconstructed power spectrum thus reads
(15)
with damping
(16)
(17)
The damping terms again depend upon the linear power spectrum and the (now possibly anisotropic) noise, and are given by18
(18)
These expressions are equivalent to the more familiar forms
(19)
where we have taken b = 1 and defined
with |$\Sigma ^2_{sd,\perp }=(1/2)(\Sigma ^2_{dd,\perp }+\Sigma ^2_{ss,\perp })$| and |$\Sigma ^2_{sd,\parallel }$| defined similarly. Note that for isotropic |$\mathcal {S}$| and PN the parallel components are simply (1 + f)2 times the perpendicular components and the coefficients of the exponentials become
(20)
In the appropriate limits these equations then agree with those in Matsubara (2008), White (2010), Carlson, Reid & White (2013), Seo et al. (2015). Note that in the notation of the appendix of Seo et al. (2015) we have chosen κ = b + fμ2 and λd = λs = f. It is closest to their ‘rec-iso’ case (in which they choose λs = 0).

3.2 Example: 21-cm wedge

As our motivating example we consider a density field with a noise level and different missing mode scenarios inspired by 21-cm experiments. We will take as a representative 21-cm noise level that corresponding to the forecast for a CHIME-like experiment in Chang et al. (2008), assuming that thermal noise dominates (see also Seo et al. 2010b; Seo & Hirata 2015). The CHIME-like noise can be expressed in terms of an effective number density of tracers (of unit bias)
(21)
With an eye to using ELGs as the example of a ‘helper’ tracer we will focus on redshift z = 1. In this case |$\bar{n}\simeq 3\times 10^{-3}\,h^3\,{\rm Mpc}^{-3}$|⁠.
For missing modes, we consider scenarios where we include only modes with |k| > kcut which lie outside a ‘wedge’, |k|/k > μmin. For definiteness, and because it covers the wide range of opinions in the literature, we consider kcut = 0, 0.02, 0.1 h Mpc−1. For μmin the edge of the wedge is given by (e.g. Pober 2015; Seo & Hirata 2015; Shaw et al. 2015, and references therein)
(22)
where χ is the comoving distance to redshift z, E(z) = H(z)/H0 and we have assumed a spatially flat Universe. The appropriate prefactor in front of |$\mathcal {R}$| and how far into the wedge it is possible to work is a matter of debate – we shall take this value as illustrative though higher values are possible. If the wedge cannot be penetrated then all modes with |μ| ≡ |k|/k < μmin
(23)
are lost. For a ΛCDM model at z = 1, and our choice of edge, μmin ≃ 0.56. Thus slightly more than half of all modes would be lost to the wedge at z = 1. A graphical representation of the modes which are lost (or kept) can be seen in Fig. 4 below.

We show in the top panel of Fig. 2 the change in |$\mathcal {D}(k)$|⁠, for k = k or k = k when certain modes are omitted from the reconstruction (the black solid and dashed lines are the comparison case of no missing modes). As already noted by Seo & Hirata (2015), the effect on reconstruction is dramatic. Missing modes thus significantly compromise the ability of a 21-cm survey to measure the distance scale.

The damping factor, $\mathcal {D}(k)$, for a survey where 21-cm modes have k∥ and μ = |k∥|/k restricted to the ranges shown. The black lines (upper lines at high k) have no missing modes. Colored lines include the effect of the wedge with μmin = 0.56 and/or a cut in k∥; lines for k∥ ≥ 0.1 hMpc−1, μ ≥ 0, 0.56 are degenerate. The noise of the 21-cm modes, i.e. modes not in the wedge, is taken to be equivalent to $b^2\,\bar{n}\simeq 3\times 10^{-3}\,h^3{\rm Mpc}^{-3}$ (see text). Two directions are shown: solid lines are along the line of sight (k∥ = k, k⊥ = 0) and dashed lines are transverse (k⊥ = k, k∥ = 0). Solid lines are ordered at large k according to k∥ ≥ 0, 0.02, 0.1, top to bottom, with μ ≥ 0 higher than μ ≥ 0.56 for each k∥ cut. Top: $\mathcal {D}(k)$ for modes which are present (obeying k∥ and μmin cut), PN = ∞ or $b^2 \bar{n}\rightarrow 0$ for missing modes. Bottom: the same cuts in k∥, μ, as above, but replacing the missing modes with an ELG survey with number density 3 × 10−4 h3 Mpc−3. For ${\mathcal {D}}(k=k_\perp )$, the line order at large k top to bottom is the same as in the legend. Note in the top figure that with our approximations a small range of k∥ is improved when other modes are completely left out of reconstruction, an example of increasing noise increasing $\mathcal {D}(k)$ for some k and suggesting that a different smoothing may help with better recovering those components.
Figure 2.

The damping factor, |$\mathcal {D}(k)$|⁠, for a survey where 21-cm modes have k and μ = |k|/k restricted to the ranges shown. The black lines (upper lines at high k) have no missing modes. Colored lines include the effect of the wedge with μmin = 0.56 and/or a cut in k; lines for k ≥ 0.1 hMpc−1, μ ≥ 0, 0.56 are degenerate. The noise of the 21-cm modes, i.e. modes not in the wedge, is taken to be equivalent to |$b^2\,\bar{n}\simeq 3\times 10^{-3}\,h^3{\rm Mpc}^{-3}$| (see text). Two directions are shown: solid lines are along the line of sight (k = k, k = 0) and dashed lines are transverse (k = k, k = 0). Solid lines are ordered at large k according to k ≥ 0, 0.02, 0.1, top to bottom, with μ ≥ 0 higher than μ ≥ 0.56 for each k cut. Top: |$\mathcal {D}(k)$| for modes which are present (obeying k and μmin cut), PN = ∞ or |$b^2 \bar{n}\rightarrow 0$| for missing modes. Bottom: the same cuts in k, μ, as above, but replacing the missing modes with an ELG survey with number density 3 × 10−4h3 Mpc−3. For |${\mathcal {D}}(k=k_\perp )$|⁠, the line order at large k top to bottom is the same as in the legend. Note in the top figure that with our approximations a small range of k is improved when other modes are completely left out of reconstruction, an example of increasing noise increasing |$\mathcal {D}(k)$| for some k and suggesting that a different smoothing may help with better recovering those components.

3.3 Filling in the wedge

The top panel of Fig. 2 demonstrates that loss of line-of-sight or wedge modes at low k significantly weakens the ability of a 21-cm experiment to constrain the distance scale. However, the modes which are missing are of very long wavelength, near the peak of the CDM power spectrum, and thus can be measured relatively well even by quite sparse tracers. The combined density field then has a noise level which is set by the 21-cm survey at high k and the ‘filler’ survey at lower k.

As a particular example we shall consider using ELGs19 as measured for example by eBOSS or DESI. Table 8 of Dawson et al. (2015) lists the ELG number density for the DECam ELG sample as ∼(1.5–3) × 10−4h3Mpc−3 over the range 0.7 < z < 1 and across 1000 deg2. We shall take the upper end of this range as an optimistic example and assume the ELGs are unbiased (b = 1). This gives the ‘filler’ data a number density an order of magnitude lower than the effective number density for the 21-cm data. The amplitudes of the shot noise and cosmological power, as a function of k, are compared in Fig. 3.

The matter power spectrum, at z = 1 and 1.5, compared to the QSO shot-noise ($1/b^2\bar{n}$; dotted line), the ELG shot-noise (dot–dashed) and the noise expected for a 21-cm survey such as CHIME (dashed line). Note that the QSO shot-noise exceeds the cosmological power for all k. For z = 1 we plot the ELG shot-noise for an eBOSS-like sample, assuming b = 1. At z = 1.5 we plot the shot-noise assuming constant clustering and DESI-like number densities.
Figure 3.

The matter power spectrum, at z = 1 and 1.5, compared to the QSO shot-noise (⁠|$1/b^2\bar{n}$|⁠; dotted line), the ELG shot-noise (dot–dashed) and the noise expected for a 21-cm survey such as CHIME (dashed line). Note that the QSO shot-noise exceeds the cosmological power for all k. For z = 1 we plot the ELG shot-noise for an eBOSS-like sample, assuming b = 1. At z = 1.5 we plot the shot-noise assuming constant clustering and DESI-like number densities.

While in reality one would estimate the density field from the combination of the surveys in an inverse variance manner, and the transition in the noise is likely to be smooth with |${\boldsymbol {k}}$|⁠, we shall instead take the noise to be |$1/\bar{n}_{21}$| where our 21-cm survey has data, sharply transitioning to |$1/\bar{n}_{\text{ELG}}$| where our 21-cm survey does not. (Our formalism can handle an arbitrary |$P_N({\boldsymbol {k}})$|⁠.) The lower panel of Fig. 2 shows the improvement that such a survey combination would make in reconstruction – in addition to allowing a measurement of modes inside the foreground-dominated region and so lowering the sample variance for those ks.

Fig. 4 shows the full |$\mathcal {D}(k_\perp ,k_\parallel )$| for four cases: no reconstruction, reconstruction using only 21-cm data, reconstruction with the addition of the ELGs and reconstruction using a photometric redshift sample (see below). Even missing the modes indicated, a comparison of the top left and right-hand panels shows that reconstruction improves the signal to noise of the acoustic signature. However, the improvement that comes from including the missing modes by using ELG data is dramatic, as a comparison of the bottom-left and top-right panels shows. We discuss the lower right panel below.

The damping function, $\mathcal {D}({\boldsymbol {k}})$, with 21-cm data restricted to k∥ ≥ 0.02 and μ ≥ 0.56 with an effective density $b^2\bar{n} = 3\times 10^{-3}\,h^3\,{\rm Mpc}^{-3}$. Top left: no reconstruction. Top right: reconstruction only with 21-cm data. Bottom left: reconstruction using ELGs with $b^2\bar{n}=3\times 10^{-4}\,h^3\,{\rm Mpc}^{-3}$ to fill in the missing modes. Bottom right: reconstruction using a photometric redshift survey with $b^2\bar{n}=3\times 10^{-3}\,h^3\,{\rm Mpc}^{-3}$ to fill in the modes with k∥ < 0.03 h Mpc−1. To be able to access such modes requires photo-z precision of δz/(1 + z) ∼ 0.01 (see text). Note that in spite of the wedge, non-missing modes along k⊥ are much better reconstructed because of redshift–space distortions along k∥.
Figure 4.

The damping function, |$\mathcal {D}({\boldsymbol {k}})$|⁠, with 21-cm data restricted to k ≥ 0.02 and μ ≥ 0.56 with an effective density |$b^2\bar{n} = 3\times 10^{-3}\,h^3\,{\rm Mpc}^{-3}$|⁠. Top left: no reconstruction. Top right: reconstruction only with 21-cm data. Bottom left: reconstruction using ELGs with |$b^2\bar{n}=3\times 10^{-4}\,h^3\,{\rm Mpc}^{-3}$| to fill in the missing modes. Bottom right: reconstruction using a photometric redshift survey with |$b^2\bar{n}=3\times 10^{-3}\,h^3\,{\rm Mpc}^{-3}$| to fill in the modes with k < 0.03 h Mpc−1. To be able to access such modes requires photo-z precision of δz/(1 + z) ∼ 0.01 (see text). Note that in spite of the wedge, non-missing modes along k are much better reconstructed because of redshift–space distortions along k.

We find that the combination of experiments does better than either does alone. The improvement over a 21-cm experiment which cannot measure some modes is demonstrated above. Although we do not show it here, the combination of the 21-cm data and the ELGs provides slightly better reconstruction than can be obtained with a survey of only ELGs (unless their number density can be increased by an order of magnitude). The damping scales, Σs and Σss, are common to all modes and benefit from the regions of k-space with lower noise. For our fiducial ELG number density the improvement is not dramatic since the noise is subdominant to the signal power for a broad range of k, however the situation changes if such high number densities cannot easily be obtained over the desired redshift range.

To further explore the parameter space, Fig. 5 shows the damping function along or transverse to the line of sight (with k = 0.15 h Mpc−1) as a function of tracer noise (specifically |$b^2\bar{n}$|⁠) for different choices of the missing modes. One can see that once the number density of the tracer has |$b^2\bar{n}\simeq 10^{-4}\,h^3\,{\rm Mpc}^{-3}$| or larger it is able to compensate for the modes missed by the 21-cm survey and improve the performance of reconstruction. This characteristic number density is given by the cosmological power (PL) at the scales where the 21-cm survey is missing modes, which is close to the peak of the power spectrum.

The damping function at k = 0.15 h Mpc−1 as a function of ELG density for different wedge shapes (as indicated in the figure). Solid lines show $\mathcal {D}$ for k∥ = k = 0.15 h Mpc−1 while dashed lines show k⊥ = k = 0.15 h Mpc−1. The small density, i.e. large noise, limits asymptotically reach the $\mathcal {D}(k)$ where only the 21-cm measurements are available. (If a mode is missing entirely when only 21-cm measurements are available, then it will not contribute before or after reconstruction.) In spite of the wedge, modes along k⊥ are much better reconstructed because of redshift–space distortions along k∥. For ${\mathcal {D}}(k=k_\perp )$, the line order at small density top to bottom is the same as in the legend. For ${\mathcal {D}}(k=k_\parallel )$, at small density the line order top to bottom is in increasing k∥ cut, with μ ≥ 0, 0.56 top to bottom for each k. The line for the constraint k∥ ≥ 0.1 h Mpc−1 is the same for μ ≥ 0, 0.56.
Figure 5.

The damping function at k = 0.15 h Mpc−1 as a function of ELG density for different wedge shapes (as indicated in the figure). Solid lines show |$\mathcal {D}$| for k = k = 0.15 h Mpc−1 while dashed lines show k = k = 0.15 h Mpc−1. The small density, i.e. large noise, limits asymptotically reach the |$\mathcal {D}(k)$| where only the 21-cm measurements are available. (If a mode is missing entirely when only 21-cm measurements are available, then it will not contribute before or after reconstruction.) In spite of the wedge, modes along k are much better reconstructed because of redshift–space distortions along k. For |${\mathcal {D}}(k=k_\perp )$|⁠, the line order at small density top to bottom is the same as in the legend. For |${\mathcal {D}}(k=k_\parallel )$|⁠, at small density the line order top to bottom is in increasing k cut, with μ ≥ 0, 0.56 top to bottom for each k. The line for the constraint k ≥ 0.1 h Mpc−1 is the same for μ ≥ 0, 0.56.

So far our discussion has assumed that the ‘filler’ sample comes from a spectroscopic survey. However, if only the low k modes are needed, samples and surveys with high-precision photometric redshifts could be used instead. The lower right panel of Fig. 4 shows an optimistic case where a photometric redshift survey can fill in all of the modes with k < 0.03 h Mpc−1. (This includes some modes already obtained from 21-cm outside the wedge, but adds information inside the wedge.) Translating a redshift uncertainty of δz into a comoving distance uncertainty of δχ = [c/H(z)]δz, to probe k = 0.03 h Mpc−1 requires δz/(1 + z) < 0.01 at z = 1. Such photo-z precision is in principle achievable, given enough filters. As shown in Fig. 4 lower right, recovering k < 0.03 h Mpc−1 (for all k) can have a large impact on reconstruction. Conversely, if only k < 0.01 h Mpc−1 can be recovered then the gain is minimal.

4 DISCUSSION

The study of large-scale structure has taught us a great deal about the Universe in which we live and provides tight constraints on fundamental physics. One of the key observables in large-scale structure are the so-called BAO, which provide a standard ruler enabling the measurement of the expansion history of the Universe with low systematics and high precision. The BAO signal is degraded by non-linear evolution, but the degradation can to some degree be overcome by density field reconstruction.

While 21-cm experiments can in principle measure large-scale structure efficiently at very high redshifts, where galaxy redshift surveys become increasingly expensive, they may suffer from large foreground contamination in a ‘wedge’ in k-space. The modes lost due to foregrounds have an impact on BAO measurements but an even larger impact upon reconstruction (as recently emphasized by Seo & Hirata 2015). In this paper, we have demonstrated, using an analytic model based on low-order Lagrangian perturbation theory, that even a relatively sparse tracer of the density field can ‘fill in’ the missing modes. The combined data can be more powerful than either of its parts: reaping the benefits of full k-coverage and low shot-noise. This allows density field reconstruction and tight measurements at smaller scales, which can be especially useful at higher z.

We have presented formulae for the efficacy of reconstruction in the case of biased tracers of the density field, including redshift–space distortions, anisotropic noise and noise-filtering during reconstruction. The missing modes can be modelled as infinite noise, while the effects of having two different samples can be modelled via anisotropic noise. These extend the formulae in the literature, but agree with those formulae in the appropriate limits. We find that the details of how BAO are predicted to be damped in Lagrangian perturbation theory depend upon the shape of the filter applied to estimate the shift field. If this finding holds up in simulations, it opens the possibility of ‘shaping’ the filter to improve the performance of reconstruction.

While even a very noisy tracer of the density field can be used to measure the power spectrum if enough modes can be averaged together, our intuition tells us that reconstruction – which depends on the 3D density field and not just its power spectrum – requires noise per k-mode less than the cosmological signal. Our calculation quantifies and supports this intuition, and we show that to measure the low-k modes missed by a 21-cm survey tracers with |$b^2\bar{n}\sim 10^{-3}\hbox{--}10^{-4}\,h^3\,{\rm Mpc}^{-3}$| are required.

Of the existing surveys that cover large cosmological volumes at high redshift, the QSO surveys are currently too sparse to be of significant benefit. An increase in number density, however, would lead to improved performance. Some of the low k modes could, in principle, be filled in with a photometric sample with excellent photometric redshifts (obtained, perhaps, using multiple medium bands). In the near term the most promising tracer, in the redshift range z > 1 where most future 21-cm surveys will be operating, are ELGs such as will be measured by eBOSS or DESI. If eBOSS can achieve its forecast number densities it would significantly improve reconstruction for 21-cm surveys which overlap in volume. In future even the higher z tail of the DESI ELG sample (with its higher bias) would be beneficial to 21-cm surveys which are unable to work deep into the foreground wedge.

We would like to thank Marcel Schmittfull for helpful conversations on acoustic oscillations and reconstruction, and Josh Dillon, Daniel Eisenstein, Marcel Schmittfull, Uros Seljak and Hee-Jong Seo for helpful feedback on the draft, and the anonymous referee for additional helpful suggestions. This work was begun at the Aspen Center for Physics, which is supported by National Science Foundation grant PHY-1066293. We thank the Center for its hospitality. T-CC acknowledges support from MoST grant 103-2112-M-001-002-MY3. This work made extensive use of the NASA Astrophysics Data System and of the astro-ph preprint archive at arXiv.org.

1

Ansari et al. (2012)

3

Nan et al. (2011)

4

Pober et al. (2013)

7

Ali & Bharadwaj (2014)

9

The foregrounds have been best studied in the context of 21-cm studies of the epoch of reionization, i.e. at lower frequencies than we consider. However, the signal and foregrounds scale in a similar manner with frequency so that many of the results carry over with minimal modification – see e.g. Pober (2015) for a recent discussion.

10

See however Zhu et al. (2015), which appeared as we were finishing this paper, for a method using the effect of long-wavelength modes on short-wavelength modes.

13

For example, Padmanabhan et al. (2009) use W(k) = exp [−(kR)2/4] in a similar analytic approach, while Burden et al. (2014); Vargas-Magaña et al. (2014) use the definition here.

14

See White (2015), section 3.1, for a discussion of the use of |${\boldsymbol {s}}({\boldsymbol {x}})$| versus |${\boldsymbol {s}}({\boldsymbol {q}})$|⁠.

15

Note that there is an implicit requirement on the smoothing |$\mathcal {S}({\boldsymbol {k}})$|⁠: when |$\mathcal {S}$| acts on the observed field it must suppress the non-linear power.

16

The propagator is |$\sqrt{\mathcal {D}}$|⁠.

17

Note that the presence of noise means that the shifted field also has a noise component δr = δd − δs + δN so that to leading order there is a term proportional to noise |$P(k) = {\mathcal {D}}({\boldsymbol {k}}) P_L(k) + {\mathcal {D}}_N(({\boldsymbol {k}}) P_N({\boldsymbol {k}})+ \dots$|⁠, where |${\mathcal {D}}_N(k) = 1 + \dots$|⁠, i.e. the leading contribution to the noise is not damped by reconstruction.

18

The generalization to noise which depends on kx and ky separately is straightforward.

19

Surveys of QSOs currently cover more area and have a wider redshift overlap with planned 21-cm experiments, but are very sparse. Assuming a fiducial b(z) = 0.53 + 0.29(1 + z)2 (Croom et al. 2005) and the number densities from Dawson et al. (2015) and Myers et al. (2015) we find that PNPL for all k, see Fig. 3. While this still allows high-precision measurements of P(k), given enough volume, it leads to poor reconstruction.

20

For |${\boldsymbol {k}}\propto \hat{k}_z$| the |${\boldsymbol {k}}_1$| modes with large contributions seem closer to the |$\hat{k}_{1,z}$| axis than for |${\boldsymbol {k}}\propto \hat{k}_{1,x}$|⁠. We thank the anonymous referee for pointing this out.

REFERENCES

Ali
S. S.
Bharadwaj
S.
2014
JA&A
35
157

Anderson
L.
et al.
2014
MNRAS
441
24

Ansari
R.
Campagne
J.-E.
Colom
P.
Magneville
C.
Martin
J.-M.
Moniez
M.
Rich
J.
Yèche
C.
2012
C. R. Phys.
13
46

Bharadwaj
S.
1996
ApJ
460
28

Burden
A.
Percival
W. J.
Manera
M.
Cuesta
A. J.
Vargas Magana
M.
Ho
S.
2014
MNRAS
445
3152

Busca
N. G.
et al.
2013
A&A
552
A96

Carlson
J.
Reid
B.
White
M.
2013
MNRAS
429
1674

Chang
T.-C.
Pen
U.-L.
Peterson
J. B.
McDonald
P.
2008
Phys. Rev. Lett.
100
091303

Chang
T.-C.
Pen
U.-L.
Bandura
K.
Peterson
J. B.
2010
Nature
466
463

Crocce
M.
Scoccimarro
R.
2008
Phys. Rev. D
77
023533

Croom
S. M.
et al.
2005
MNRAS
356
415

Datta
A.
Bowman
J. D.
Carilli
C. L.
2010
ApJ
724
526

Dawson
K. S.
et al.
2015
preprint (arXiv:e-prints)

Delubac
T.
et al.
2015
A&A
574
A59

Eisenstein
D. J.
Seo
H.-J.
White
M.
2007a
ApJ
664
660

Eisenstein
D. J.
Seo
H.-J.
Sirko
E.
Spergel
D. N.
2007b
ApJ
664
675

Font-Ribera
A.
et al.
2014
J. Cosmol. Astropart. Phys.
5
27

Furlanetto
S. R.
Oh
S. P.
Briggs
F. H.
2006
Phys. Rep.
433
181

Kirkby
D.
et al.
2013
J. Cosmol. Astropart. Phys.
3
24

Liu
A.
Parsons
A. R.
Trott
C. M.
2014
Phys. Rev. D
90
023019

McCullagh
N.
Szalay
A. S.
2012
ApJ
752
21

Matsubara
T.
2008
Phys. Rev. D
77
063530

Meiksin
A.
White
M.
Peacock
J. A.
1999
MNRAS
304
851

Morales
M. F.
Hazelton
B.
Sullivan
I.
Beardsley
A.
2012
ApJ
752
137

Myers
A. D.
et al.
2015
ApJS
221
27

Nan
R.
et al.
2011
Int. J. Mod. Phys. D
20
989

Noh
Y.
White
M.
Padmanabhan
N.
2009
Phys. Rev. D
80
123501

Padmanabhan
N.
White
M.
2009
Phys. Rev. D
80
063508

Padmanabhan
N.
White
M.
Cohn
J. D.
2009
Phys. Rev. D
79
063523

Padmanabhan
N.
Xu
X.
Eisenstein
D. J.
Scalzo
R.
Cuesta
A. J.
Mehta
K. T.
Kazin
E.
2012
MNRAS
427
2132

Parsons
A. R.
Pober
J. C.
Aguirre
J. E.
Carilli
C. L.
Jacobs
D. C.
Moore
D. F.
2012
ApJ
756
165

Particle Data Group
: Olive K. A.
2014
Chin. Phys. C
38
090001

Pober
J. C.
2015
MNRAS
447
1705

Pober
J. C.
et al.
2013
AJ
145
65

Schmittfull
M.
Feng
Y.
Beutler
F.
Sherwin
B.
Yat Chu
M.
2015
Phys. Rev. D
92
123522

Seo
H.-J.
Hirata
C. M.
2015
preprint (arXiv:e-prints)

Seo
H.-J.
et al.
2010a
ApJ
720
1650

Seo
H.-J.
Dodelson
S.
Marriner
J.
Mcginnis
D.
Stebbins
A.
Stoughton
C.
Vallinotto
A.
2010b
ApJ
721
164

Seo
H.-J.
Beutler
F.
Ross
A. J.
Saito
S.
2015
preprint (arXiv:e-prints)

Shaw
J. R.
Sigurdson
K.
Pen
U.-L.
Stebbins
A.
Sitwell
M.
2014
ApJ
781
57

Shaw
J. R.
Sigurdson
K.
Sitwell
M.
Stebbins
A.
Pen
U.-L.
2015
Phys. Rev. D
91
083514

Sherwin
B. D.
Zaldarriaga
M.
2012
Phys. Rev. D
85
103523

Slosar
A.
et al.
2013
J. Cosmol. Astropart. Phys.
4
26

Tassev
S.
Zaldarriaga
M.
2012a
J. Cosmol. Astropart. Phys.
4
13

Tassev
S.
Zaldarriaga
M.
2012b
J. Cosmol. Astropart. Phys.
10
6

Taylor
A. N.
Hamilton
A. J. S.
1996
MNRAS
282
767

Tojeiro
R.
et al.
2014
MNRAS
440
2222

Vargas-Magaña
M.
et al.
2014
MNRAS
445
2

Vedantham
H.
Udaya Shankar
N.
Subrahmanyan
R.
2012
ApJ
745
176

White
M.
2010
preprint (arXiv:e-prints)

White
M.
2015
MNRAS
450
3822

Xu
X.
Cuesta
A. J.
Padmanabhan
N.
Eisenstein
D. J.
McBride
C. K.
2013
MNRAS
431
2834

Zel'dovich
Y. B.
1970
A&A
5
84

Zhu
H.-M.
Pen
U.-L.
Yu
Y.
Er
X.
Chen
X.
2015
preprint (arXiv:e-prints)

APPENDIX A: DERIVING THE DAMPING

Starting with the expression for |$\delta ({\boldsymbol {k}})$| in the text (equation 2) we can write (setting bias b = 1, or F[λ]∝δ(D)(λ))
(A1)
Employing the cumulant theorem for a Gaussian variable (〈exp [iX]〉 = exp [−(1/2)〈X2〉]), making a change of variables |$\text{d}{\boldsymbol {q}}_1\,\text{d}{\boldsymbol {q}}_2\rightarrow \text{d}\boldsymbol {Q}\,\text{d}{\boldsymbol {q}}$| with |${\boldsymbol {q}}={\boldsymbol {q}}_1-{\boldsymbol {q}}_2$| and |$2\boldsymbol {Q}={\boldsymbol {q}}_1+{\boldsymbol {q}}_2$| and using translational symmetry we have
(A2)
where we have defined |$\xi _{ij}({\boldsymbol {q}})=\langle \Psi _i({\boldsymbol {q}})\Psi _j(0)\rangle$| and assumed |${\boldsymbol {k}}\ne 0$|⁠. Extracting the q-independent piece of the integral and expanding the exponential gives
(A3)
The Fourier transform of the |$k_ik_j\xi _{ij}({\boldsymbol {q}})$| term can easily be shown to be PL(k) to lowest order,
(A4)
(A5)
(A6)
while the exponential is the source of the damping term of the BAO features (or the broadening of the acoustic peak in the correlation function)
(A7)

APPENDIX B: RECONSTRUCTION IN EULERIAN PERTURBATION THEORY

Most of the reconstruction literature either prescribes its implementation on a data set or interprets this implementation within Lagrangian perturbation theory. However, recently Schmittfull et al. (2015) developed a theory of reconstruction based on Eulerian perturbation theory and introduced several new reconstruction schemes. One advantage of the Eulerian formulation, especially in the present context, is that it is naturally expressed in terms of the Fourier space density fields which are measured by 21-cm experiments. A disadvantage of the Eulerian schemes is the increased difficulty of including redshift–space distortions. In this appendix, we consider the impact of missing modes upon reconstruction in the Eulerian scheme to build intuition about their impact. We restrict ourselves to real-space measures, set the bias to 1 and the noise to zero for modes which have measurements. This preserves the main features of the problem while simplifying the presentation.

Heuristically, reconstruction works by ‘undoing’ some of the nonlinear time evolution which decreases and shifts the BAO peak. For Eulerian reconstruction, Schmittfull et al. (2015) approximate the initial density field by Taylor expanding the observed field at late times, specifically subtracting |$\Delta t\, \dot{\delta }$|⁠. Using the continuity equation to express |$\dot{\delta }$| in terms of the divergence of (1 + δ)v, and with an appropriate choice of Δt some algebra gives (Schmittfull et al. 2015)
(B1)
Here, |${{\boldsymbol {s}}}({\boldsymbol {k}})=-i{\boldsymbol {k}}/k^2\ \delta _{\rm obs}({\boldsymbol {k}})W(k)$| is the shift vector and |$\delta _R({\boldsymbol {k}})=\delta _{\rm obs}({\boldsymbol {k}})W(k)$|⁠, with a smoothing kernel W(k) (usually a Gaussian). In Fourier space, this becomes
(B2)
where by changing the kernel, |$\mathcal {K}$|⁠, we can now describe more general reconstruction methods.
The reconstructed power spectrum is |$P_{\rm rec}(k)\propto \langle | \delta _r^2|\rangle$| as before. The leading contribution to enhancing the BAO features, i.e. to reconstruction, was found by Schmittfull et al. (2015) to be (their equation 69):
(B3)
This three-point term can be seen to explicitly enhance the oscillations over those in Pobs(k), restoring some of the signal that is damped by non-linear evolution and bringing the result closer to PL(k).
In more detail,
(B4)
where
(B5)
is the usual perturbation theory kernel. For the Schmittfull et al. (2015) EGS reconstruction method
(B6)
and, defining |$\mu =\hat{k}_1\cdot \hat{k}$|⁠,
(B7)

Fig. B1 shows the contribution of different modes to |$D_3({\boldsymbol {k}}_1,-{\boldsymbol {k}}_1,{\boldsymbol {k}})P_L(k_1)k_{1,\perp }$|⁠. The modes which contribute most to D3 contribute the most to restoring the BAO signal, i.e. to reconstruction. Two different modes with k = 0.15 h Mpc−1 are shown, along the |$\hat{k}_{1,z}$| and |$\hat{k}_{1,x}$| axes (the latter shown in the |$\hat{k}_{1,z} -\hat{k}_{1,x}$| plane). Equal area in this plot gets equal weight in the integral, so that the relative importance of different modes can be read off more easily.20

Two slices in the $\hat{k}_{1,x},\hat{k}_{1,z}$ plane of the integrand $D_3({\boldsymbol {k}_1,-k_1,k}) P_L(k_1)k_{1,\perp }$ in equation (B3). Its integral is the dominant contribution in reconstruction of the linear power spectrum $P_L({\boldsymbol {k}})$, as noted in Schmittfull et al. (2015). The top figure has the contribution for observed mode ${\boldsymbol {k}}=0.15\hat{k}_\parallel =0.15\hat{k}_z$, and the bottom figure corresponds to ${\boldsymbol {k}}=0.15\hat{k}_x$. The measure of integration is dk∥dk⊥ = dk∥dkx in this plane, thus area is a measure of the size of the contribution to reconstruction. Note that the colour scale is logarithmic in this plot, unlike the earlier plots.
Figure B1.

Two slices in the |$\hat{k}_{1,x},\hat{k}_{1,z}$| plane of the integrand |$D_3({\boldsymbol {k}_1,-k_1,k}) P_L(k_1)k_{1,\perp }$| in equation (B3). Its integral is the dominant contribution in reconstruction of the linear power spectrum |$P_L({\boldsymbol {k}})$|⁠, as noted in Schmittfull et al. (2015). The top figure has the contribution for observed mode |${\boldsymbol {k}}=0.15\hat{k}_\parallel =0.15\hat{k}_z$|⁠, and the bottom figure corresponds to |${\boldsymbol {k}}=0.15\hat{k}_x$|⁠. The measure of integration is dkdk = dkdkx in this plane, thus area is a measure of the size of the contribution to reconstruction. Note that the colour scale is logarithmic in this plot, unlike the earlier plots.

As before, if an interferometer does not measure a given |${\boldsymbol {k}}_1$| mode, it will not contribute to D3 but it will contribute to the damping of the signal in Pobs. A simple way to see the effect of missing modes to reconstruction of a mode |${\boldsymbol {k}}$| is to ask what impact removing contributions from |${\boldsymbol {k}}_1$| and |${\boldsymbol {k}}\pm {\boldsymbol {k}}_1$| would have. Visually it is clear that the modes where the colour scale is white or yellow contribute the majority of D3, so losing these modes has the largest detrimental effect on reconstruction. This agrees with our the intuition obtained from the Lagrangian theory exposition in the main text.