-
PDF
- Split View
-
Views
-
Cite
Cite
Seshadri Nadathur, Paul Carter, Will J Percival, A Zeldovich reconstruction method for measuring redshift space distortions using cosmic voids, Monthly Notices of the Royal Astronomical Society, Volume 482, Issue 2, January 2019, Pages 2459–2470, https://doi.org/10.1093/mnras/sty2799
- Share Icon Share
ABSTRACT
Redshift space distortions (RSD) in the void–galaxy correlation ξs provide information on the linear growth rate of structure in low density environments. Accurate modelling of these RSD effects can also allow the use of voids in competitive Alcock–Paczynski measurements. Linear theory models of ξs are able to provide extremely good descriptions of simulation data on all scales provided the real space void positions are known. However, by reference to simulation data we demonstrate the failure of the assumptions implicit in current models of ξs for voids identified directly in redshift space, as would be simplest using real observational data. To overcome this problem we instead propose using a density-field reconstruction method based on the Zeldovich approximation to recover the real space void positions from redshift space data. We show that this recovers the excellent agreement between theory and data for ξs. Performing the reconstruction requires an input cosmological model so, to be self-consistent, we have to perform reconstruction for every model to be tested. We apply this method to mock galaxy and void catalogues in the Big MultiDark N-body simulation and consistently recover the fiducial growth rate to a precision of |$3.4{{\ \rm per\ cent}}$| using the simulation volume of (2.5 h−1Gpc)3.
1 INTRODUCTION
Galaxy redshift surveys provide 3D redshift-space maps of the large-scale structure of the Universe that contain anisotropies along the line-of-sight direction thanks to two effects: redshift space distortions (RSD) (Kaiser 1987) due to gravitationally induced peculiar velocities, and the Alcock–Paczynski (AP) effect (Alcock & Paczynski 1979) due to differential geometrical stretching along and transverse to the line of sight if the cosmological model used to translate redshifts to distances is incorrect. Measurement of these anisotropies therefore provides two correlated pieces of cosmological information. RSDs depend on the amplitude of the peculiar velocities and thus, in linear models, on the linear growth rate of structure f(z), commonly parametrized in the combination f(z)σ8(z), while the AP effect constrains the combination H(z)DA(z), where H(z) is the Hubble parameter and DA(z) the angular diameter distance at redshift z. The RSD and AP effects are in general degenerate. The Baryon Acoustic Oscillation (BAO) peak provides a clear, sharp feature in the galaxy clustering whose position is unaffected by RSD and this breaks the degeneracy and allows measurement of the AP effect. In other scenarios, it is difficult to distinguish the AP effect without an accurate model for RSD (and vice versa).
The theory of RSD in the galaxy clustering has been much studied (e.g. Scoccimarro 2004; Matsubara 2008; Taruya, Nishimichi & Saito 2010; Jennings, Baugh & Pascoli 2011; Reid & White 2011) but is in general complicated by significant non-linear contributions even at quite large scales, and therefore requires sophisticated modelling. A recently proposed promising alternative has been to use RSD in the cross-correlation of galaxies with cosmic void centres in order to measure the growth rate (e.g. Paz et al. 2013; Hamaus et al. 2015; Cai et al. 2016; Achitouv 2017; Achitouv et al. 2017; Hamaus et al. 2017; Hawken et al. 2017; Nadathur & Percival 2017). Voids have already proved useful cosmological tools in other scenarios, including for studying the integrated Sachs–Wolfe secondary anisotropies in the cosmic microwave background (e.g. Granett, Neyrinck & Szapudi 2008; Hotchkiss et al. 2015; Nadathur & Crittenden 2016; Kovács et al. 2017), weak gravitational lensing (Krause et al. 2013; Melchior et al. 2014; Clampitt & Jain 2015; Sánchez et al. 2017), and the thermal Sunyaev–Zeldovich effect (Alonso et al. 2018). A potential advantage of the use of voids for RSD studies is that galaxy dynamics in low density regions, where velocities are dominated by coherent bulk flows, can be modelled by linear theory alone. If the RSD effect around voids can be accurately modelled, they can be used in AP tests, as proposed by Lavaux & Wandelt (2012), who argue that under certain circumstances voids may even outperform the BAO with future survey data. Some preliminary studies using voids in this way exist (Sutter et al. 2012; Mao et al. 2017) but current constraints are weak due to the difficulty of disentangling the AP and RSD effects.
A major advance in the modelling of the void–galaxy correlation in redshift space, |$\xi ^s(\mathbf {s})$|, was made by Cai et al. (2016), who provided a linear RSD model that matched simulation data well except in the deep interior regions of voids. Nadathur & Percival (2017) extended this model by the addition of terms that are generally small for the Kaiser (1987) theory for the galaxy autocorrelation, but cannot be neglected within voids. The resulting completely linear model was shown to match simulation data extremely well on all scales, provided the real space positions of the voids could be determined. This opens the possibility of using voids to measure possible environment dependence of the growth rate, as well as to constrain cosmological parameters through a precise AP test.
However, one issue is that all current observational studies of RSD in the void–galaxy correlation (Paz et al. 2013; Hamaus et al. 2015; Achitouv et al. 2017; Hamaus et al. 2017; Hawken et al. 2017) make use of voids identified directly from redshift space galaxy data, in which case the real space void positions cannot be known. Chuang et al. (2017) have recently pointed out that this raises difficulties for the theoretical modelling of |$\xi ^s(\mathbf {s})$|, in general introducing an unknown velocity bias term on small scales. This is a consequence of a general theorem due to Seljak (2012), and applies to RSD in other data sets as well, in particular the Lyman α forest.
In this work we elaborate on this problem by explicitly outlining several key common assumptions required for the derivation of all current models for |$\xi ^s(\mathbf {s})$|. These assumptions can be summarized as: the conservation of void numbers in redshift space, the invariance of void centre positions under the redshift space mapping, and the isotropy of the real space galaxy density and velocity fields around the void centres. These assumptions all necessarily hold if the positions of real space voids are known, but we demonstrate using simulation data that all of them fail for voids identified directly in the redshift space galaxy distribution. Our demonstration makes use of the watershed void-finding algorithm of Neyrinck (2008), but we argue that these fundamental issues will also affect any other void finding routines. Thus changes to either the theoretical modelling or to the data analysis are required for a fully consistent analysis.
We propose to achieve the latter: specifically, we use an algorithm to reconstruct the real space void positions from redshift space galaxy data based on the Zeldovich approximation (Zel’dovich 1970), and now commonly used for BAO analyses (Eisenstein et al. 2007; Padmanabhan et al. 2012). The principle behind this method is to approximately reconstruct the real space galaxy distribution before performing the void-finding step. We apply this method to mock galaxy and void catalogues based on an N-body simulation and demonstrate that it reproduces the agreement between theory and data previously seen for real-space selected voids by Nadathur & Percival (2017). We discuss the role of the assumed cosmological parameters in the reconstruction and show how the method can self-consistently be used to measure the growth rate f. The reconstruction also provides a method to measure the real space galaxy density profile of voids.
The layout of the paper is as follows. Section 2 describes the simulation data used, the creation of mock catalogues and the methods used to measure the correlation function. In Section 3 we outline the derivation of the theoretical model for |$\xi ^s(\mathbf {s})$|, discuss the assumptions made in the derivation of this and all other models, and show that these do not hold for voids in redshift space. Section 4 outlines the reconstruction technique we introduce to recover the real space void positions, and evaluate its performance on the simulation data. In Section 5 we provide an algorithm to self-consistently perform the reconstruction and measure the growth rate when f is unknown, and show that this reproduces the fiducial value in our simulation. The Appendix provides a demonstration of the biases that can otherwise be introduced if the reconstruction procedure is not used. Finally, we sum up and draw conclusions in Section 6.
2 DATA
2.1 Simulation and mocks
Simulation results in this paper make use of mock galaxy and void catalogues derived from the Big MultiDark (BigMD) N-body simulation (Klypin et al. 2016) from the MultiDark simulation project (Prada et al. 2012). The BigMD simulation evolves 38403 particles in a box of side L = 2500 h−1Mpc using the gadget-2 (Springel 2005) and Adaptive Refinement Tree (Kravtsov, Klypin & Khokhlov 1997; Gottloeber & Klypin 2008) codes. The cosmological parameters used for the simulation were ΩM = 0.307, ΩB = 0.048, |$\Omega _\Lambda =0.693$|, ns = 0.95, σ8 = 0.825, and h = 67.8, and initial conditions for the simulation were set using the Zeldovich approximation at starting redshift zi = 100.
We use a halo catalogue created for the z = 0.52 snapshot using the Bound Density Maximum algorithm (Klypin & Holtzman 1997; Riebe et al. 2013), and populate these haloes with mock galaxies using the Halo Occupation Distribution (HOD) model of Zheng, Coil & Zehavi (2007), with parameters taken from Manera et al. (2013) and designed to approximately reproduce the clustering and mean number density for galaxies in the Baryon Oscillation Spectroscopic Survey (BOSS) CMASS galaxy sample. Details of the algorithm, the assignment of central and satellite galaxies, and HOD model parameters used are described more fully in Nadathur, Hotchkiss & Crittenden (2017). This is the same mock galaxy catalogue as used by Nadathur & Percival (2017).
Dark matter (DM) densities in the simulation are measured using a 23503 DM density grid generated from the full particle output of the simulation using a cloud-in-cell interpolation scheme. By comparing the power spectra of the clustering of DM and the mock galaxies at large scales, k ≲ 0.05 hMpc−1, we determine the linear bias value for the galaxy mocks, b = 1.87. This is taken as the fiducial bias value in subsequent calculations.
Finally, using the redshift space galaxy field we also recover the ‘pseudoreal space’ galaxy positions by using the reconstruction method described in detail in Section 4.
2.2 Void catalogues
We identify voids in the galaxy mocks described above through use of the zobov watershed void-finding algorithm (Neyrinck 2008). The zobov algorithm uses a Voronoi tessellation field estimator technique to reconstruct the galaxy density field from the discrete distribution, and then identifies local minima in this field and the watershed basins around them, to form a non-overlapping set of voids corresponding to local density depressions. We define each individual density basin as a distinct void, without any additional merging of neighbouring regions. A fuller description of the algorithm and void properties can be found in Nadathur (2016) and Nadathur et al. (2017).
We characterize each void by an effective spherical radius, |$R_\mathrm{v}=\left(3V/4\pi \right)^{1/3}$|, where V is the total volume of the void – note however that voids are in general of arbitrary shape and far from spherical. We assign the centre of each void to the centre of the largest sphere completely empty of galaxies that can be inscribed within the void (Nadathur & Hotchkiss 2015b; Nadathur et al. 2017). We apply the void-finding algorithm separately to the galaxy mocks in real space, redshift space, and the reconstructed pseudoreal space. The size distributions of the resulting void populations are shown in Fig. 1, and the total void numbers are summarized in Table 1.

The number of voids in the simulation as a function of void size Rv, for voids found using galaxy positions in real space, redshift space, and reconstructed pseudoreal space. Inset: The fractional difference in void numbers relative to real space voids, with the corresponding Poisson errors. The redshift space mapping leads to large changes at all void sizes. The reconstruction method recovers true void numbers to within |$2{{\ \rm per\ cent}}$| in each Rv bin over the scales of interest (see text), and to better than |$1{{\ \rm per\ cent}}$| overall.
Total numbers of voids recovered in each galaxy population and differences relative to real space.
. | Nv . | |$(N_\mathrm{v}-N_\mathrm{v}^\mathrm{real})/N_\mathrm{v}^\mathrm{real}$| (per cent) . |
---|---|---|
Real space | 32 742 | 0 |
Redshift space | 29 925 | −8.6 |
Reconstructed space | 32 913 | +0.5 |
. | Nv . | |$(N_\mathrm{v}-N_\mathrm{v}^\mathrm{real})/N_\mathrm{v}^\mathrm{real}$| (per cent) . |
---|---|---|
Real space | 32 742 | 0 |
Redshift space | 29 925 | −8.6 |
Reconstructed space | 32 913 | +0.5 |
Total numbers of voids recovered in each galaxy population and differences relative to real space.
. | Nv . | |$(N_\mathrm{v}-N_\mathrm{v}^\mathrm{real})/N_\mathrm{v}^\mathrm{real}$| (per cent) . |
---|---|---|
Real space | 32 742 | 0 |
Redshift space | 29 925 | −8.6 |
Reconstructed space | 32 913 | +0.5 |
. | Nv . | |$(N_\mathrm{v}-N_\mathrm{v}^\mathrm{real})/N_\mathrm{v}^\mathrm{real}$| (per cent) . |
---|---|---|
Real space | 32 742 | 0 |
Redshift space | 29 925 | −8.6 |
Reconstructed space | 32 913 | +0.5 |
The model for the RSD in the void–galaxy correlation is not expected to hold for all voids, in particular because galaxy velocities around small voids will be dominated by the local environment of structures outside the void rather than the void itself. We therefore restrict our attention to large voids. To achieve this, we apply the cut described in Nadathur & Percival (2017) and remove all voids in each sample with effective radius Rv smaller than the median void size. Given the distributions shown in Fig. 1, this corresponds to selecting Rv ≳ 43 h−1Mpc, with slight variations for the different populations. All correlation function measurements presented in this work are calculated for void populations after the Rv cut has been applied. Note that as the cut is determined by the median void size in each sample, the fractional differences shown in Table 1 are unchanged by its application.
2.3 Correlation function measurement and covariance matrix
The cross-correlation functions in equation (2) are measured separately for each void catalogue with each galaxy population. In what follows, we will refer to the cross-correlations of voids with galaxies in real space, redshift space, and reconstructed pseudoreal space using the notation ξr, ξs, and ξp, respectively. The particular void catalogue used in each case will be clear from the context.
3 THEORY
3.1 Linear RSD model for the void–galaxy correlation
A theoretical model for void–galaxy correlation in redshift space was derived by Nadathur & Percival (2017), extending that of Cai et al. (2016), and providing a model that works on all scales including right into the void centres. In the following, we sketch out the main features of this model, while referring readers to Nadathur & Percival (2017) for full details.
Nadathur & Percival (2017) showed that for the case of voids selected in real space, for which the assumptions above are valid, equation (17) provides an excellent fit to the measured redshift space void–galaxy correlation on all separation scales. We use this theoretical model for the main results presented in this paper. The fits with an earlier model by Cai et al. (2016) are considered in the Appendix.
3.2 Complications for voids based on redshift-space galaxies
A number of theoretical models have now been proposed for RSD in the void–galaxy correlation function ξs (e.g. Paz et al. 2013; Cai et al. 2016; Achitouv 2017; Nadathur & Percival 2017). They differ in derivation, and consequently they can lead to very different final predictions – see Nadathur & Percival (2017) for a discussion and comparison to simulation data.
Nevertheless, all models to date have been based on the same fundamental common assumptions highlighted in the previous section. These can be explicitly stated as
The number of voids is conserved under the redshift space mapping (equation 7)
Void positions are invariant under the redshift space mapping, so that the transformation |$\boldsymbol {r}\rightarrow \boldsymbol {s}$| depends on galaxy velocities only (equation 8)
The average radial outflow velocity around voids is isotropic (equation 9)
The real space correlation is isotropic, |$\xi ^r(\boldsymbol {r})=\xi ^r(r)$|
These conditions will necessarily hold if the real space void positions are known – (i) and (ii) by construction, and (iii) and (iv) because there is no preferred direction along which any anisotropy could be introduced.
However, all observational studies of the void–galaxy correlation until now have used voids identified directly in the redshift space galaxy distribution (Paz et al. 2013; Hamaus et al. 2016; Achitouv et al. 2017; Hamaus et al. 2017; Hawken et al. 2017). In this scenario, each of these assumptions is violated, as we show below, leading to questions about the validity of the models.
The violation of void conservation in redshift space compared to real space, when using the same void-finding routine, has been previously noted (Nadathur & Hotchkiss 2014; Zhao et al. 2016; Chuang et al. 2017). Comparison of the void populations in our simulation shows that the total number of voids changes by |${\sim }10{{\ \rm per\ cent}}$| under the redshift space mapping (Table 1). Fig. 1 shows that differences in void numbers apply over the entire range of void sizes Rv and are not limited to the smallest voids. In fact, the fractional difference relative to the number of real space voids is largest at large Rv, where it can be as much as |$40-50{{\ \rm per\ cent}}$|.
Assumption (ii) is equivalent to assuming that the void positions do not suffer RSD themselves. This has been shown to be incorrect for voids identified in redshift space galaxies by Chuang et al. (2017), who directly measure the quadrupole of the void autocorrelation function. In principle, it is possible to accommodate motion of the void centres by relaxing assumption (ii) and modifying equation (8) to account for void velocities: this possibility is discussed by Cai et al. (2016). However, as pointed out by Chuang et al. (2017), in general void-finding constitutes a non-linear transformation of the underlying galaxy (and thus, matter) density field, and therefore necessarily leads to a void velocity bias different from unity (Seljak 2012).3 As the action of the void-finder algorithm cannot be described by a simple mathematical model, the exact form of the non-linear transformation is in general unknown and therefore so is the void velocity bias.
Fig. 2 shows the average radial component of the dark matter velocity field around void centres as a function of the transverse and line-of-sight distances from the centre, |$v_r^\mathrm{DM}(\sigma ,\pi)$|, for different void populations.4 As expected, the velocity field is isotropic for real space voids. However, assumption (iii) clearly fails when voids are identified in the redshift space galaxy distribution. The physical reason for this is easy to understand intuitively: underdensities with higher line-of-sight outflow velocities will appear to have a lower central galaxy density in redshift space, and will therefore be preferentially selected by any void-finding algorithm, leading to the anisotropy in the stacked velocity profiles.

The average radial component of the dark matter velocity outflow around void centres, |$v_r^\mathrm{DM}(\sigma ,\pi)$|, as a function of real space distances from the void centre transverse to and parallel to the line-of-sight direction. Velocities are shown for voids identified using galaxy positions in real space (left-hand panel), redshift space (centre panel), and reconstructed pseudoreal space (right-hand panel). The lines show contours of |$v^\mathrm{DM}_r=100,\, 140,\, 180,\, 220\,\,\mathrm{kms}^{-1}$| (the final contour only appears in the central panel). Contour levels were chosen to highlight differences between panels.
Note that the assumption of isotropy in the average velocity profile, vr = vr(r), is separate from and more fundamental than the additional assumption of linear dynamics in equation (10). Indeed, the latter assumption has been dropped in some works (Achitouv 2017) which nevertheless still assume isotropy.
The final assumption common to all models of RSD in the void–galaxy correlation is the isotropy of the correlation in real space, |$\xi ^r(\boldsymbol {r})$|. To test this, we cross-correlate voids with the real space galaxy positions, and determine the quadrupole |$\xi ^r_2(r)$|. The results are shown in Fig. 3. For real space voids, |$\xi ^r_2(r)=0$| as required. However, when voids are identified using the redshift space galaxy positions, |$\xi ^r_2(r)$| is significantly non-zero over a range of scales. This is due to two contributing physical effects that are easily understood. First, as assumption (ii) is violated and the redshift-space void positions have RSD themselves, their cross-correlation with a tracer without RSD (in this case the real space galaxy field) will still show an anisotropy. Secondly, void-finding in redshift space preferentially selects regions with larger outflow velocities along the line-of-sight direction as shown by Fig. 2. This exaggerates apparent underdensities along the line of sight; conversely in real space the contours of ξr for such regions will be squashed along the line of sight, corresponding to |$\xi ^r_2\gt 0$|.

Real space quadrupole |$\xi ^r_2(r)$| of the void–galaxy cross-correlation, for different void populations. The blue (yellow) shaded regions indicate the 1σ error range around the measured quadrupoles for real space (redshift space) voids. Data points with error bars indicate the measurements for voids found in the reconstructed pseudoreal space galaxy distribution. For real space and reconstructed voids |$\xi ^r_2(r)=0$| as required for the validity of the model, but this is not the case if voids are identified using redshift space galaxies.
The results of this section show that all four of the fundamental assumptions required by all models of the void–galaxy correlation including RSD proposed so far are violated for voids found by applying the void-finding algorithm directly on the redshift space galaxy distribution; thus none of the models currently existing in the literature are applicable to this scenario. The physical explanation for these results shows that the same problems will exist for any other void-finding algorithms applied directly to redshift space data.
Modifying the theoretical framework to account for redshift-space void selection appears a highly challenging task without a mathematical model of the action of the void-finder. On the other hand, Nadathur & Percival (2017) showed that when the real space void positions are known and thus all the assumptions discussed are valid, the simple model of equation (17) already provides an extremely good fit to simulation data. Therefore in the next section we describe an approach to reconstruct the real space void positions from redshift space galaxy data, rather than to alter the RSD model.
4 RECONSTRUCTION METHOD
In this section we describe the reconstruction method to recover the real space void positions from redshift space data. We first describe the physical basis of our method and the algorithm used, and then evaluate its effect on void-finding and the void–galaxy correlation.
4.1 Understanding reconstruction
The goal of the reconstruction method described in the following is to recover the real space galaxy field using only the redshift space position information that would be available from survey data, and subtracting the effects of linear RSD from the galaxy positions. We will refer to the recovered galaxy positions as being in reconstructed or pseudoreal space. We then perform the void-finding step on the pseudoreal space galaxy field. To the extent that the RSD removal is successful, the voids thus obtained will match the true real space void positions, which are required for successful modelling of the anisotropic void–galaxy correlation.
We then measure the cross-correlation of these reconstructed void positions with the original redshift space galaxy field to obtain |$\xi ^s(\boldsymbol {s})$|. The cross-correlation of the void positions with the pseudoreal space galaxies gives |$\xi ^p(\boldsymbol {r})$|, which we will show is a very good approximation of the true real space correlation |$\xi ^r(\boldsymbol {r})$|. This provides a key input required for the theoretical modelling in redshift space, as well as being a quantity of interest in its own right (Pisani et al. 2014).
The method used to solve for the galaxy displacement field is based on the concept of density field reconstruction (Eisenstein et al. 2007), which is used frequently in the analysis of the BAO in the 2-point autocorrelation function of large redshift surveys (e.g. Padmanabhan et al. 2012; Anderson et al. 2013; Ross et al. 2015; Alam et al. 2017; Carter et al. 2018). The method makes use of first-order Lagrangian perturbation theory to make an estimate of the linear displacement field from the observed evolved density field, and moves the overdensity back along these vectors to approximately recover the linear overdensity field (Padmanabhan et al. 2012).
Unlike in the case of the BAO reconstruction, we are only interested in the removal of the linear RSD contribution (Kaiser 1987) and not the additional linearization of the evolved density field. For the BAO method, the extra step is required to enhance the BAO peak that has been smoothed by non-linear density evolution and bulk flows, and leads to an improvement in precision of the BAO measurement of the order of ∼2 at z ∼ 0 (Ross et al. 2015).
Other algorithms to recover the real space galaxy field from redshift space information have previously been studied (e.g. Wang et al. 2009, 2012; Kitaura et al. 2012, 2016; Shi et al. 2016). Our method differs from these in its relative simplicity, as we only attempt to remove the effects of coherent large-scale flows and thus the linear RSD shifts. We show below that this is sufficient for our purposes, and leave to future work the investigation of improvements that may be obtained from more sophisticated approaches.
4.2 Algorithm
To compute the overdensity field we assign galaxies to a grid using a cloud-in-cell (CIC) algorithm. The resulting δg is then smoothed by convolving with a Gaussian filter in Fourier space, |$S(k)=e^{{-(kR_s)^2}/2}$|. The smoothing length Rs must be chosen to be large enough to remove small-scale non-linearities but small enough to optimally capture information on the large-scale bulk flows. For BAO studies, typical smoothing lengths are Rs = 10−15 h−1Mpc (Padmanabhan et al. 2012; Burden et al. 2014; Achitouv & Blake 2015; Vargas-Magaña et al. 2017). Our fiducial results use a smoothing scale Rs = 10 h−1Mpc. We discuss the optimal choice of Rs further in Section 4.5.
In order to apply reconstruction we require input values for the galaxy bias b and the growth rate f. For the fiducial model, these values are b = 1.87 and f = 0.761, determined appropriately for our galaxy mocks and the simulation redshift. However, part of the motivation for measuring the void–galaxy correlation in redshift space is precisely to determine the true value of f. In Section 4.6 we therefore consider the effects on ξs of performing the reconstruction with the wrong assumed cosmology, and in Section 5 we outline a non-circular, self-consistent method for the measurement of f where we allow the model to be tested to dictate what value of f we should assume in the reconstruction step, and show that when applied to our simulation data it correctly returns the fiducial growth rate after marginalization over b.
It is worth commenting on the fact that for the reconstruction we assume the validity of a constant linear galaxy bias b. Within voids it has been argued that such a bias relationship does not hold (Neyrinck et al. 2014; Nadathur & Hotchkiss 2015a,b; Nadathur & Percival 2017) and we do not assume a linear bias for the determination of δ(r) in Section 3.1. This failure of the linear bias within voids can be explained purely as a statistical selection effect in the presence of stochasticity in the bias relationship, due to the restriction to regions specifically chosen to have very few galaxy tracers, and thus low δg. On the other hand, when considering all values of δg over the entire simulation box, on average the linear bias relationship 〈δ|δg〉 = δg/b remains a good approximation. The use of the CIC grid assignment and the Gaussian smoothing in reconstruction also help to suppress statistical deviations from the mean relationship.
4.3 Effect of reconstruction on void-finding
Table 1 shows that the total number of voids identified in the reconstructed pseudoreal space galaxy field agrees with the number of real space voids to within |$0.5{{\ \rm per\ cent}}$| when the reconstruction is performed in the fiducial cosmology. The size distribution of these voids also closely matches that of real space voids, as shown in Fig. 1. The fractional difference in void numbers, |$\Delta N_\mathrm{v}/N_\mathrm{v}^\mathrm{real}$|, is almost constant as a function of void size Rv, with the largest differences arising at the smallest void sizes. For the larger voids we consider for correlation function measurements, Rv ≳ 43 h−1Mpc, the difference in numbers is |$\lt \! 2{{\ \rm per\ cent}}$| in each individual Rv bin, and |$0.5{{\ \rm per\ cent}}$| overall. We therefore conclude that the reconstruction of redshift space data does conserve void numbers to a very good approximation, and thus that assumption (i) of Section 3.2 is valid.
The third panel in Fig. 2 shows the radial velocity outflow patterns around reconstructed void positions. In contrast to the case of redshift space voids, these profiles are isotropic and very similar to those for the true real space voids. Thus reconstruction is able to recover the validity of the assumption (iii), that |$v_\mathrm{ r}(\boldsymbol {r})=v_\mathrm{ r}(r)$|.
Finally, we test the isotropy of the cross-correlation of reconstructed void positions with the real space galaxy distribution, ξr(r). Fig. 3 shows the quadrupole |$\xi ^r_2(r)$| for this correlation (the data points with error bars), which is consistent with zero to good approximation. This verifies assumption (iv) made in Section 3.2.
The validity of assumption (ii) remains to be tested. In the following we will do this indirectly, by comparing the measured redshift space correlation of reconstructed voids to the theory derived from using this assumption. Note however that if residual RSD effects were present in the reconstructed void positions, these would also contribute to |$\xi ^r_2(r)\ne 0$|. Thus Fig. 3 already gives good reason to be confident that assumption (ii) is also valid.
4.4 Effect of reconstruction on ξs
We now compare the model of Section 3.1 to the measured redshift space correlation function multipoles |$\xi ^s_0$| and |$\xi ^s_2$| for reconstructed voids.
To calculate the theory model using equations (17) and (14) requires specifying three functions in real space: ξr(r), δ(r), and σv(r). The correlation with real space galaxies is not known; however, the reconstruction provides a practical alternative, which is to directly measure the correlation of voids with the pseudoreal space galaxy field, |$\xi ^p_0(r)$|. We tested the differences between |$\xi ^r_0$| and |$\xi ^p_0$| in the simulation data, and found that they were small, meaning that the method allows accurate reconstruction of the true real-space void galaxy density profile (Pisani et al. 2014). We use the measured |$\xi ^p_0(r)$| as the input for the theory predictions.
The functions δ(r) and σv(r) cannot be directly determined from galaxy survey data, but must be calibrated from mock void and galaxy catalogues in simulations. We measure the stacked average profiles of δ(r) and σv(r) for the real and reconstructed void populations and find only small differences, which when propagated through the theory lead to negligible differences in the predictions for |$\xi ^s_0(s)$| and |$\xi ^s_2(s)$|. We therefore fix the functions to match the measured values for real space voids, as done by Nadathur & Percival (2017).
Fig. 4 shows the predictions for |$\xi ^s_0$| and |$\xi ^s_2$| for the fiducial growth rate f = 0.761, together with the measured multipoles for cross-correlation of each of the real space, redshift space, and reconstructed void populations with redshift space galaxies. For visual clarity, the monopoles are shown as the difference with respect to the real space monopole |$\xi ^r_0(r)$|. Physical explanations for the various features seen – in particular the negative dip and positive peak in the quadrupole – were discussed by Nadathur & Percival (2017). The correlation functions for real space and reconstructed voids agree very well with each other: this is evidence that the reconstruction is successfully recovering the real space void positions. The model of equation (17) also provides an excellent description of the data: the reduced χ2 value for fits to |$\xi ^s_0(s)$| and |$\xi ^s_2(s)$| for reconstructed voids is 1.01 for the fiducial f.

Measured multipoles for the void–galaxy correlation with galaxies in redshift space (data points with error bars), compared to the fiducial theory prediction from equation (17) (solid line). Voids were identified from the redshift-space galaxy data using the reconstruction method described in Section 4. Left-hand panel: monopole, shown as difference to the real space monopole for clarity. Right-hand panel: Quadrupole. Shaded regions in both panels show the corresponding measured multipoles and 1σ error range for voids identified in redshift space without reconstruction.
Fig. 4 also shows that the void–galaxy correlation is very different for redshift-space voids, with particularly striking differences for the quadrupole. |$\xi ^s_0$| and |$\xi ^s_2$| for redshift space voids match neither the data for real space voids, nor the theory predictions. In the Appendix we consider a different theory model and show that it also fails to describe the data for redshift space voids. This is unsurprising given the discussion in Section 3.2: the use of redshift space voids invalidates the key assumptions made in deriving all theoretical models for ξs.
4.5 Choice of smoothing scale
Our implementation of the reconstruction algorithm requires a choice of width Rs for the Gaussian filter used to smooth the galaxy density field. Reconstruction implementations for BAO measurements typically use a fixed smoothing scale Rs ≃ 10−15 h−1Mpc (Padmanabhan et al. 2012; Burden et al. 2014; Achitouv & Blake 2015). Rs must be large enough to ensure the validity of the Zeldovich approximation used for equation (22), but the appropriate value is not known a priori. If Rs is too large, some of the contributions to equation (22) from density fluctuations on linear scales are missed, so the reconstruction will fail to remove all of the linear RSD from the galaxy distribution and the reconstructed voids will not match the true real space voids. If Rs is too small, the reconstruction picks up contributions from small-scale density fluctuations for which equation (22) is not accurate; again the reconstruction of void positions will fail to satisfy the conditions discussed in Section 3.2.
To explore this choice, we empirically tested the performance of five different values of the smoothing scale in the range 5 ≤ Rs ≤ 15 h−1Mpc. For each choice we re-ran the reconstruction and the void-finding, and measured the quadrupole of the cross-correlation of these voids with the reconstructed galaxy field, |$\xi ^p_2(r)$|. If the reconstruction procedure successfully removes all anisotropic effects and recovers the real space void positions, this quadrupole should be |$\xi ^{p,\mathrm{th}}_2(r)=0$|. On the other hand, failure to reproduce any of the key conditions of Section 3.2 would be expected to introduce a non-zero quadrupole.
Fig. 5 shows the results for different Rs. The fiducial choice of smoothing scale, Rs = 10 h−1Mpc, results in a quadrupole consistent with zero, as already discussed. As expected, larger values of Rs indeed fail to remove all of the anisotropy, giving results intermediate between those of real space and redshift space voids seen in Fig. 3. For Rs < 10 h−1Mpc, the reconstruction overcorrects in the opposite direction, leading to |$\xi ^p_2\lt 0$| at scales r ∼ 40 h−1Mpc. Fits to these data of the expected isotropic model |$\xi ^{p,\mathrm{th}}_2(r)=0$| give reduced χ2 values of 2.0, 1.3, 0.8, 1.3, and 2.3 for |$R_s=5,\,\,7.5,\,\,10,\, 12.5$|, and 15 h−1Mpc, respectively. Based on these results, we set Rs = 10 h−1Mpc for all the other reconstruction results reported in this paper.

The quadrupole |$\xi ^p_2(r)$| for the correlation of reconstructed void and galaxy positions, for different choices of the fixed smoothing scale Rs used in the reconstruction procedure (see text). The choice Rs = 10 h−1Mpc results in an isotropic correlation function consistent with expectation |$\xi ^{p,\mathrm{th}}_2(r)=0$|, but residual anisotropies are seen for other values.
It should be noted that the need to choose Rs in this manner is a limitation of current reconstruction algorithms that use a fixed smoothing scale. Iterative determination of the appropriate smoothing scale (e.g. Schmittfull, Baldauf & Zaldarriaga 2017; Hada & Eisenstein 2018) provides a promising alternative that might lead to further improvements, but we defer such investigations to future work.
4.6 Dependence of reconstruction on assumed cosmology
In addition to Rs, implementation of the reconstruction algorithm requires the specification of parameters f and b, the growth rate and linear galaxy bias. The growth rate is cosmology-dependent and is not known a priori for survey data; indeed an important motivation for measuring the void–galaxy correlation is to determine f. We therefore consider the effects on ξs of performing the reconstruction with the wrong input growth rate.
For each input value of f, we re-perform the reconstruction of the pseudoreal space galaxy field and the subsequent void-finding, and then measure the cross-correlation of these voids with the redshift space galaxies, ξs. The left-hand panel of Fig. 6 shows the variation of the resultant quadrupole, |$\xi ^{s,\mathrm{obs}}_2$|, for selected values of f. A significant trend can be seen in the height of the peak in |$\xi ^{s,\mathrm{obs}}_2$| in the range 30 ≲ s ≲ 60 h−1Mpc, as well as a smaller trend in the depth of the dip at s ∼ 25 h−1Mpc. This trend can be heuristically understood as follows. When the assumed f is smaller than the true growth rate, the reconstruction fails to remove all of the RSD in the void positions. Thus |$\xi ^{s,\mathrm{obs}}_2$| is closer to that for the redshift space voids shown in Fig. 4 (which corresponds to the limit of assuming f = 0 in the reconstruction). In particular this means a smaller dip and a larger peak in the quadrupole. Conversely, if the assumed f > ftrue, void positions are overcorrected and an opposite RSD effect is introduced in |$\xi ^{s,\mathrm{obs}}_2$|, increasing the depth of the dip and decreasing the height of the peak.

Left-hand panel: Effect on the measured redshift space quadrupole |$\xi ^s_2(s)$| when the assumed growth rate f is changed in the reconstruction procedure prior to void-finding. Error bars are omitted for clarity. Right-hand panel: Effect on the theoretical predicted quadrupole from changes in f. For these plots the assumed bias is kept fixed at the fiducial value, b = 1.87.
The right-hand panel of Fig. 6 shows the corresponding changes to the predicted quadrupole |$\xi ^{s,\mathrm{th}}_2$| as f is changed. Note that each prediction accounts for not only the direct effect of changes in f, but also the smaller effect of changes in the monopole |$\xi ^p_0(s)$|, which is determined separately for each reconstruction. Importantly, the trend of changes in the peak height around s ∼ 50 h−1Mpc is opposite to that in |$\xi ^{s,\mathrm{obs}}_2$|, i.e. larger assumed f leads to a higher predicted peak but a lower observed one. Only for values of the assumed growth rate close to ftrue do the theory and data match.
Fig. 7 shows the corresponding effect on the observed monopole, |$\xi ^{s,\mathrm{obs}}_0(s)$|. To visually highlight the differences, the same fiducial function |$\xi ^r_0(s)$| has been subtracted from each observed monopole. Changes to the f assumed in reconstruction produce smaller changes in |$\xi ^{s,\mathrm{obs}}_0(s)$| than in the quadrupole, and over a more restricted range of scales s. For the predicted monopole, the effects of changing the measured input monopole |$\xi ^p_0$| with each reconstruction are larger than for the quadrupole. Interestingly, they also act in the opposite direction to the effect of changing f. As a result, the predictions for |$\xi ^{s,\mathrm{th}}_0(s)$| have a very small dependence on the assumed f, and so are not shown.

Effect on the measured redshift space monopole |$\xi ^s_0(s)$| when the assumed growth rate f is changed in the reconstruction procedure prior to void-finding. For visual purposes, the data is shown as the difference relative to a fixed fiducial function |$\xi ^r_0(s)$|. Error bars are omitted for clarity. The assumed bias is kept fixed, b = 1.87.
5 RSD FITTING AND RESULTS
The results of Section 4.6 suggest that the reconstruction method can be used to measure the true growth rate f from measurement of the void–galaxy correlation ξs. To ensure that this process is performed self-consistently, we adopt the following algorithm:
To perform the reconstruction, assume a set of cosmological parameters; for our plane-parallel simulation setup, this requires a choice of the pair (f, b);
For each such choice, reconstruct the pseudoreal space galaxy field, run the void-finding step, and apply the void size cut to obtain the sample for cross-correlation;
Cross-correlate these void positions with the reconstructed galaxy positions to measure |$\xi ^p_0(r)$|, and with the redshift space galaxy positions to measure |$\xi ^s_0(s)$| and |$\xi ^s_2(s)$|;
Using the measured |$\xi ^p_0(r)$| and the assumed f, obtain the predicted |$\xi ^{s,\mathrm{th}}(\mathbf {s})$| using the dispersion model of equation (17), and decompose it into multipoles;
Obtain the χ2 for the fit of this model to the data (equation 5);
Return to step (i) and repeat for a different choice of (f, b).
For the purposes of this work, (f, b) are the only relevant parameters and so a simple grid search over parameter space is sufficient. We assume flat prior ranges of f ∈ (0.61, 0.91) and b ∈ (1.77, 1.95). For simplicity, we keep δ(r) and σv(r) fixed to the values determined for the real space voids: empirically we found that these change negligibly for reconstructed voids and do not depend on the choice of (f, b) in the reconstruction. We also keep the covariance matrix |$\mathsf {\boldsymbol C}_{ij}$| fixed, under the assumption that it does not depend strongly on reconstruction parameters.
For application of this technique to actual survey data, other cosmological parameters will need to be included, in particular Ωm, σ8, and parameters α⊥ and α|| to account for Alcock–Paczynski distortions. This will also require testing the dependence of δ(r) and σv(r) on Ωm and σ8, which is beyond the scope of this work.
The method outlined here results in changes to both theory model and data as the parameters are changed. This is necessary to self-consistently fit for the growth rate; however, an unfortunate and unavoidable consequence is that the resultant likelihood surface is noisy. Fig. 8 shows the likelihood for the recovered growth rate f after marginalization over the bias, corresponding to a mean recovered value |$f=0.744^{+0.027}_{-0.024}$| (|$68{{\ \rm per\ cent}}$| c.l.). This is consistent with the fiducial value ffid(z = 0.52) = 0.761 for the simulation, and corresponds to an |${\sim }3.4{{\ \rm per\ cent}}$| measurement of the growth rate. The noise introduced by the reconstruction is reflected in the fact that the likelihood does not behave as a smooth function for small changes in f, and the best-fitting value is slightly lower, fbest = 0.732. This can be contrasted with the smoother likelihood surface obtained by Nadathur & Percival (2017) when the real space void positions were assumed to be known exactly and reconstruction was not required.

The likelihood for the growth rate f, after marginalization over b, obtained from fits to the redshift space multipoles |$\xi ^s_0(s)$| and |$\xi ^s_2(s)$| of the void–galaxy correlation, using the reconstruction procedure described in the text. The recovered mean value is |$f=0.744^{+0.027}_{-0.024}$| (68 per cent c.l.), consistent with the fiducial value ffid = 0.761 (vertical dashed line).
6 DISCUSSION
Redshift space distortions to the void–galaxy correlation are extremely interesting for three reasons: they can provide a measure of the linear growth rate complementary to that from galaxy clustering, that is sensitive to environmental variations in f in low density regions as is expected in some modified gravity scenarios; they can be accurately modelled on all separation scales – at least when the real space void positions can be determined – using linear theory alone; and the success of linear models for the RSD in ξs potentially allows the use of voids in Alcock–Paczynski tests of cosmology that probe a different and wider range of scales than those using the BAO peak. Thus, RSD measurements using voids will therefore be important tools in the analysis of large-scale structure data from future galaxy redshift surveys.
However, while linear models of ξs describe simulation data well, their applicability to the case where voids are identified directly from redshift space galaxy positions was questioned by Chuang et al. (2017). We addressed this from first principles, identifying four key assumptions common to all existing theoretical models of ξs and explicitly showing that each of them is violated if void-finding is performed on the redshift space galaxy field. Modifying the theory to account for this appears to be a hard problem. Instead we have proposed a practical approach: applying the reconstruction technique to obtain real space voids from redshift space data, based on using the Zeldovich approximation to approximately recover the real space galaxy distribution. If this step is performed prior to void-finding, the validity of the model assumptions is restored, and the linear theory model (Nadathur & Percival 2017) provides an excellent fit to simulation data on all scales.
Performing the reconstruction requires input values for the cosmological parameters, in particular the growth rate f, which we wish to measure. In order to make our analysis non-circular, we need to re-perform reconstruction for every model tested, assuming that model represents the true cosmology. We find that different choices of f affect the reconstructed data and theory in different ways, so a simple self-consistent algorithm can be used to measure f from the reconstructed data. The technique also allows us to easily measure the real space void galaxy density profile. Applying this method to mock data from the Big MultiDark simulation, we were able to recover a |$3.4{{\ \rm per\ cent}}$| measurement of the growth rate at z ∼ 0.5 after marginalization over the bias, in good agreement with the fiducial value. The reconstruction process introduces noise, however, and so the recovered likelihood is not smooth for small changes in the model.
The use of some method to recover real space voids from redshift space data is necessary for the consistent application of any theoretical models to measurements of the void–galaxy correlation. The reconstruction method outlined here works sufficiently well, but further improvements are still possible. The choice of a fixed smoothing scale Rs is not ideal: the implementation of an iterative determination of the best smoothing scale (Schmittfull et al. 2017; Hada & Eisenstein 2018) would be better. It is also possible that other methods of estimating the velocity field from galaxy data, particularly Bayesian methods (Jasche & Wandelt 2013; Leclercq, Jasche & Wandelt 2015; Kitaura et al. 2016), could improve the correspondence between the reconstructed and true real space voids, leading to better agreement with theory.
The additional computational expense of performing the reconstruction is comparable to that of the tessellation required for the void-finding. For current data volumes, these steps can be run on a standard PC in a few minutes. Repeated iteration in order to derive the likelihood as described in Section 5 is feasible using parallel computing on a cluster. Unfortunately, as discussed in the Appendix, this additional expense is necessary for a self-consistent analysis.
A generalization of the method presented here to apply to survey data requires the introduction of several more free parameters than were considered for the simulation case, in particular the Alcock–Paczynski parameters and Ωm. This would be expected to lead to a degradation of the constraints obtained on f, but we leave a fuller investigation to subsequent work.
ACKNOWLEDGEMENTS
We thank Andreu Font-Ribera for useful discussions. SN acknowledges funding from the Marie Skłodowska-Curie Actions under the H2020 Framework of the European Commission, project 660053 cosmovoid, and a Dennis Sciama fellowship from the ICG. PC and WJP acknowledge support from the European Research Council through the Darksurvey grant 614030. The BigMultiDark simulations were performed on the SuperMUC supercomputer at the LeibnizRechenzentrum in Munich, using resources awarded to PRACE project number 2012060963. This work made use of the UK SCIAMA High Performance Computing cluster supported by the ICG, SEPNet, and the University of Portsmouth.
Footnotes
An alternative approach is taken by Pollina et al. (2016), who start by assuming such a linear bias relationship holds and then fit for b taken as a free parameter. The recovered bias is strongly scale-dependent (see their Figure 4), meaning that a single bias value does not fit over all scales.
Chuang et al. (2017) show that on very large scales, s ≳ 150 h−1Mpc, the void density field δv reduces to a quasi-linear transformation of the galaxy density field δg, and thus the void velocity bias on these scales is approximately unity. However, on the scales s ≲ 100 h−1Mpc relevant to the void–galaxy correlation this is not the case.
We show profiles of |$v_r^\mathrm{DM}$| rather than the average galaxy velocity as the latter are noisier close to the void centres due to low galaxy numbers. However, the strong anisotropy pattern for redshift space voids is present in both.
REFERENCES
APPENDIX: FITTING AN INCOMPLETE MODEL USING REDSHIFT SPACE VOIDS
The reconstruction method introduced in this paper in order to approximately recover real space void positions from redshift space galaxy data is more involved and computationally expensive than simply applying a void finder directly in redshift space, as been performed in numerous studies (Paz et al. 2013; Hamaus et al. 2016; Achitouv et al. 2017; Hamaus et al. 2017; Hawken et al. 2017). We have argued in Section 3.2 that the use of such redshift space voids is inconsistent with the assumptions made in the derivation of all theory models of the void–galaxy correlation that have thus far been published in the literature, as none of these has included a model for the void-finding in the theory derivation. We also showed in Section 4 that using redshift space voids without reconstruction leads to large discrepancies between observed correlation multipoles and the predictions of the dispersion model of equation (17).
Nevertheless, the possibility remains that the use of an earlier theoretical model that does not include all of the terms argued as important by Nadathur & Percival (2017) might by coincidence match the observed void–galaxy correlation for redshift space voids. In this Appendix, we consider the fit of the model of Cai et al. (2016) – that has recently been used in an analysis of BOSS data by Hamaus et al. (2017) – to redshift-space voids, to demonstrate that it does not match the data at the level required to make robust measurements.
In Fig. A1 we show the model prediction of equation (A2) for the fiducial value β = 0.407 for our simulation and mock galaxy sample. The data points in the figure show the measured quadrupole for the correlation of voids defined directly in redshift space with the redshift space galaxy field. To enable a direct like-for-like comparison with the results of Hamaus et al. (2017), we define the void centre positions to be the location of the Voronoi-volume-weighted barycentre of void member galaxy positions. (This results in slight differences relative to the |$\xi ^s_2(s)$| shown for the same redshift voids in Fig. 4, but does not affect the conclusions.) The covariance matrix has been re-estimated for this centre definition as well.

The measured redshift space quadrupole |$\xi ^s_2(s)$| for voids in redshift space (data points with error bars) together with the theory predictions for the alternative RSD model of Cai et al. (2016) & Hamaus et al. (2017) for different values of β ≡ f/b. To enable a like-for-like comparison with Hamaus et al. (2017), for this figure void centres are taken to be the barycentre positions (see text).
It can be seen that the measured and predicted quadrupoles share some qualitative features, in particular the absence of the strong negative ‘dip’ feature seen for real space and reconstructed voids (Fig. 4). However, the fiducial model provides an extremely poor quantitative fit to the data: the reduced χ2 for the fit is ∼9. This is as expected given the discussion in Section 3.2: the assumptions on which this model is based are not valid for redshift space voids, therefore any similarities can only be due to coincidence.
If the poor χ2 value is ignored and a fitting procedure for β is carried out, the resultant best-fitting value is β = 0.673 ± 0.016, which is very strongly biased relative to the fiducial β = 0.407. It is therefore clear that this method cannot be used to self-consistently measure the growth rate.
It should be noted that the volume of our simulation box, (2.5 h−1Gpc)3, is much larger than the available BOSS survey volume, and that the errors on our measurements of the correlation function multipoles are consequently much smaller than those currently achievable with survey data. Fig. A1 suggests that given the much larger BOSS error bars, and the much wider radial binning used by Hamaus et al. (2017), apparent consistency between the measured |$\xi ^s_2$| and this model is possible by coincidence. However, the much more precise measurements that will be possible with future survey data from DESI and Euclid will necessitate the use of the self-consistent reconstruction procedure.