An efficient and robust method to estimate halo concentration based on the method of moments

ABSTRACT

We propose an efficient and robust method to estimate the halo concentration based on the first moment of the density distribution, which is |$R_1\equiv \int _0^{r_{\rm vir}}4\pi r^3\rho (r)\mathrm{ d}r/M_{\rm vir}/r_{\rm vir}$|⁠. We find that R₁ has a monotonic relation with the concentration parameter of the Navarro–Frenk–White (NFW) profile, and that a cubic polynomial function can fit the relation with an error |$\lesssim 3~{{\ \rm per\ cent}}$|⁠. Tests on ideal NFW haloes show that the conventional NFW profile fitting method and the V_max/V_vir method produce biased halo concentration estimation by |$\approx 10~{{\ \rm per\ cent}}$| and |$\approx 30~{{\ \rm per\ cent}}$|⁠, respectively, for haloes with 100 particles. In contrast, the systematic error for our R₁ method is smaller than 0.5 per cent even for haloes containing only 100 particles. Convergence tests on realistic haloes in N-body simulations show that the NFW profile fitting method underestimates the concentration parameter for haloes with ≲300 particles by |$\gtrsim 20~{{\ \rm per\ cent}}$|⁠, while the error for the R₁ method is |$\lesssim 8~{{\ \rm per\ cent}}$|⁠. We also show other applications of R₁, including estimating V_max and the Einasto concentration c_e ≡ r_vir/r₋₂. The calculation of R₁ is efficient and robust, and we recommend including it as one of the halo properties in halo catalogues of cosmological simulations.

methods: statistical, galaxies: haloes, dark matter, large-scale structure of Universe

1 INTRODUCTION

Dark matter haloes, as the building blocks of the cosmic structures in our Universe, are virialized objects formed by gravitational instability. The assembly of haloes proceeds hierarchically, where small haloes are formed early and merge with each other to form larger ones. The assembly history of dark matter haloes correlates strongly to the halo structure, and many semi-analytical models have been proposed to explain this correlation and to predict halo structure from its assembly history (e.g. Navarro, Frenk & White 1997; Wechsler et al. 2002; Zhao et al. 2003a, 2009; Lu et al. 2006; Correa et al. 2015; Diemer & Joyce 2019). In addition, galaxies are born in the centres of dark matter haloes and evolve following the halo assembly process (see Mo, Van den Bosch & White 2010, for a review), so that dark matter haloes and galaxies are tightly related to each other. This motivates many attempts to model the galaxy–halo connection to understand galaxy formation and evolution (see Baugh 2006; Mo, Van den Bosch & White 2010; Wechsler & Tinker 2018, for reviews). Clearly, it is of paramount importance to accurately characterize the structure of dark matter haloes.

Numerical N-body simulations with collisionless cold dark matter particles provide an essential tool to study the structure of dark matter haloes. To begin with, dark matter haloes are identified with a halo-finding algorithm, such as the Friends-of-Friends (FoF) algorithm (e.g. Huchra & Geller 1982; Davis et al. 1985). Based on the spherical collapse model, a dark matter halo is defined as the collection of particles within a radius within which the mean density reaches some chosen value. This radius is usually referred to as the virial radius, r_vir, and the total mass enclosed is the halo mass, M_vir. The radial density profiles of dark matter haloes are found to be well described by the universal Navarro–Frenk–White (NFW) function,

$$\begin{eqnarray} \rho (r) = \frac{\rho _0}{r/r_{\mathrm{s}}(1 + r/r_{\mathrm{s}})^2}\, , \end{eqnarray}$$

(1)

specified by the two free parameters, r_s and ρ₀, or equivalently, M_vir and the halo concentration, c ≡ r_vir/r_s (Navarro, Frenk & White 1997).

However, the determination of the concentration parameter for simulated haloes is not straightforward. Many methods have been used to estimate the concentration parameters of simulated haloes from the spatial distribution of dark matter particles (e.g. Jing 2000; Bullock et al. 2001; Klypin et al. 2001; Wechsler et al. 2002; Zhao et al. 2003a, b, 2009; Duffy et al. 2008; Klypin, Trujillo-Gomez & Primack 2011; Klypin et al. 2016). One approach is to sample the radial density distribution of simulated haloes in discrete bins and fit it with the NFW profile (e.g. Bhattacharya et al. 2013). This method has several shortcomings. First, the estimated concentration is subject to the choice of discrete bins. The use of large bin sizes tends to smooth the gradient of the radial profile and cause an underestimate of the halo concentration, while the use of too small bins can introduce too much noise. In general, it is difficult to find an optimal binning strategy, particularly when a halo is only sparsely sampled. Secondly, the fitting method relies on the prior choice of the halo profile, which is the NFW profile in this case. Therefore, any deviations from the NFW profile will make the output concentration biased (Einasto 1965; Navarro et al. 2004; Wang et al. 2020b). Finally, the fitting procedure is relatively time-consuming, making it difficult to do for the large number of haloes found in large cosmological simulations.

To overcome some of the issues in the NFW profile fitting, other estimators of halo concentration have been proposed. One example is the method based on V_max/V_vir (Klypin et al. 2001, 2016; Klypin, Trujillo-Gomez & Primack 2011; Prada et al. 2012), where |$V_{\rm max}=\tt {MAX}(\sqrt{GM(\lt r)/r})$| is the maximum of the circular velocity as a function of r, and |$V_{\rm vir}=\sqrt{GM(\lt r_{\rm vir})/r_{\rm vir}}$| is the virial velocity. This quantity is closely related to the halo concentration for an NFW halo. However, this relation is ill-defined for haloes with concentration below 2.16 (Klypin et al. 2001), since by definition V_max cannot be smaller than V_vir. The halo concentration can also be inferred from r_f/r_vir, where r_f is the radius within which the mass is fM_vir, with 0 < f < 1 (e.g. Lang, Holley-Bockelmann & Sinha 2015). In addition, there are also methods based on the integrated mass profile (Poveda-Ruiz, Forero-Romero & Muñoz-Cuartas 2016) and the Voronoi Tessellation (Lang, Holley-Bockelmann & Sinha 2015).

In this paper, we propose an efficient and robust method to estimate the concentration parameter of NFW haloes based on the first moment of the density distribution. Section 2 introduces the data we use in this study. Section 3 introduces three different methods to estimate the halo concentration, and we test their performance in Section 4. Section 5 shows the mass–concentration relation obtained from the ELUCID simulation, to demonstrate the application of our method to large cosmological simulations. Section 6 discusses other applications of R₁, including estimating V_max of haloes and the Einasto concentration parameter. Finally, Section 7 summarizes our main results. Throughout this paper, we use ‘log’ to denote 10-based logarithm.

2 DATA

The estimation of halo concentration is subject to the sampling effect, where low-mass haloes are poorly sampled in N-body simulations, and the force softening effect, which can smooth the density profile in the inner region and cause an underestimation of the halo concentration (e.g. Power et al. 2003; Ludlow, Schaye & Bower 2019; Mansfield & Avestruz 2021). To separate the impact of these two effects, we use two different data sets to test the performances of different methods: ideal haloes generated from NFW profiles with different halo parameters, and realistic haloes selected from N-body simulations of different resolutions and force softening lengths. We also apply our method to a large N-body simulation to demonstrate its capability of recovering halo concentrations in large cosmological simulations.

2.1 Ideal NFW haloes

We generate ideal NFW haloes using the halofactory package based on Eddington’s inversion method (see appendix A for details). Here we use haloes with four different concentrations, c = 1, 5, 10, and 20, a range sufficient to cover haloes in cosmological N-body simulations. For each concentration, we generate individual haloes using different numbers of particles, ranging from ∼100 to ≳10⁴, to test the robustness of a given concentration estimator. For a given particle number, we use 10 000 random realizations to evaluate the statistical uncertainties.

2.2 N-body simulations

2.2.1 IllustrisTNG-Dark

The IllustrisTNG project consists of several dark-matter-only and hydrodynamical simulations (Pillepich et al. 2018; Nelson et al. 2019). Here we use the dark-matter-only simulations with different resolutions and gravitational softening lengths to test the impact on the concentration estimation. The information of these simulations is summarized in Table 1 (Nelson et al. 2019). It is noteworthy that TNG100-1-Dark and TNG100-3-Dark have identical initial conditions, but different mass resolutions and gravitational softening lengths. The IllustrisTNG project was based on a cosmology consistent with the results in Planck Collaboration XIII (2016), where |$\Omega _\Lambda =0.6911$|⁠, Ω_m = 0.3089, σ₈ = 0.8159, n_s = 0.9667, and h = 0.6774. Dark matter haloes are identified using the FoF algorithm with a linking length that is 0.2 times the mean inter-particle distance (Davis et al. 1985), and their masses are assigned as the total dark matter mass enclosed within the aperture where the mean overdensity is 200 times the critical density. This mass is denoted as M_200c, and the corresponding radius and concentration are denoted as r_200c and c_200c, respectively. The halo centre is specified as the location of the particle with the minimal gravitational potential. Substructures are identified with the subfind algorithm (Springel et al. 2001).

Table 1.

Open in new tab

Summary of the N-body simulations used in this study.

Simulation	L_box[h⁻¹Mpc]	Particle number	Particle mass [h⁻¹ M_⊙]	Gravitational softening length [h⁻¹ kpc]
TNG50-1-Dark	35	2160³	3.7 × 10⁵	0.2
TNG100-1-Dark	75	1820³	6.0 × 10⁶	0.5
TNG100-3-Dark	75	455³	3.8 × 10⁸	2.0
ELUCID	500	3072³	3.1 × 10⁸	3.5

Simulation	L_box[h⁻¹Mpc]	Particle number	Particle mass [h⁻¹ M_⊙]	Gravitational softening length [h⁻¹ kpc]
TNG50-1-Dark	35	2160³	3.7 × 10⁵	0.2
TNG100-1-Dark	75	1820³	6.0 × 10⁶	0.5
TNG100-3-Dark	75	455³	3.8 × 10⁸	2.0
ELUCID	500	3072³	3.1 × 10⁸	3.5

Table 1.

Open in new tab

Summary of the N-body simulations used in this study.

Simulation	L_box[h⁻¹Mpc]	Particle number	Particle mass [h⁻¹ M_⊙]	Gravitational softening length [h⁻¹ kpc]
TNG50-1-Dark	35	2160³	3.7 × 10⁵	0.2
TNG100-1-Dark	75	1820³	6.0 × 10⁶	0.5
TNG100-3-Dark	75	455³	3.8 × 10⁸	2.0
ELUCID	500	3072³	3.1 × 10⁸	3.5

Simulation	L_box[h⁻¹Mpc]	Particle number	Particle mass [h⁻¹ M_⊙]	Gravitational softening length [h⁻¹ kpc]
TNG50-1-Dark	35	2160³	3.7 × 10⁵	0.2
TNG100-1-Dark	75	1820³	6.0 × 10⁶	0.5
TNG100-3-Dark	75	455³	3.8 × 10⁸	2.0
ELUCID	500	3072³	3.1 × 10⁸	3.5

2.2.2 ELUCID

The ELUCID¹ simulation (Wang et al. 2013, 2014, 2016; Tweed et al. 2017) is a constrained simulation, run with a memory-optimized version of gadget-2 (Springel et al. 2005) known as l-gadget, to reconstruct the density field and formation history of our local Universe based on the Sloan Digital Sky Survey DR7 (York et al. 2000). It is thus one particular realization of the structure formation model in question. This simulation has 3072³ dark matter particles, each with a mass of |$3.09\times 10^8h^{-1}\rm M_\odot$|⁠, in a box with a side length of |$500h^{-1}\rm Mpc$|⁠. This simulation assumes a Lambda cold dark matter cosmology with Ω_m = 0.258, |$\Omega _\Lambda =0.742$|⁠, σ₈ = 0.80, n_s = 0.96, and h = 0.72. The information of the ELUCID simulation is summarized in Table 1 (Wang et al. 2016). ELUCID uses the same procedure as IllustrisTNG to identify and define dark matter haloes (see Section 2.2.1). The large volume and relatively high resolution of the ELUCID simulation allow us to investigate the mass–concentration relation over a large halo mass range.

3 METHOD

Here we introduce three methods to estimate halo concentration: two commonly used methods and our R₁ method. In addition, three other methods are discussed in Appendix C together with their performance on ideal NFW haloes.

3.1 The R₁ method

The total mass of a dark matter halo is expressed as

$$\begin{eqnarray} M_{\rm vir} = \int _0^{r_{\rm vir}}4\pi r^2 \rho (r)\mathrm{ d}r\, . \end{eqnarray}$$

(2)

Here ρ(r) is the radial density profile and r_vir is the halo radius, which is usually defined as the radius within which the enclosed mean density just exceeds some chosen value. The dimensionless first moment of the density distribution, R₁, can be defined as

$$\begin{eqnarray} R_1 = {1\over M_{\rm vir}r_{\rm vir}}\int _0^{r_{\rm vir}} 4\pi r^3\rho (r)\mathrm{ d}r\, , \end{eqnarray}$$

(3)

which can be expressed analytically for an NFW profile as

$$\begin{eqnarray} R_1 = \frac{c-2 \ln (1 + c) + c/(1 +c)}{c\left[\ln (1 + c)-c/(1 + c)\right]}\, . \end{eqnarray}$$

(4)

Despite the complicated functional form, the relation between R₁ and c is actually quite simple, as shown in Fig. 1. We fit both R₁(c) and c(R₁) with third-order polynomial functions:

$$\begin{eqnarray} \log R_1 &=& a_1(\log c)^3 + a_2(\log c)^2 + a_3\log c + a_4, \\ \log c &=& b_1(\log R_1)^3 + b_2(\log R_1)^2 + b_3\log R_1 + b_4, \\ a_1 &=& 0.0198,~~a_2=-0.086, ~~a_3 = -0.090, ~~a_4 = -0.230, \\ b_1 &=& -34.01,~~b_2=-43.91, ~~b_3 = -23.49, ~~b_4 = -3.48.\\ \end{eqnarray}$$

(5)

The bottom panels of Fig. 1 show the fractional difference between the relation in equation (4) and the fitting formula of equation (5). The fractional deviation is |$\lesssim 0.1~{{\ \rm per\ cent}}$| for the R₁–c relation and |$\lesssim 3~{{\ \rm per\ cent}}$| for the c–R₁ relation. We note that the relation between R₁ and c depends neither on cosmology nor on the threshold density chosen to define dark matter haloes.

$The relation between c and R1 for an NFW profile. Top panels: the circles are the analytical results obtained through equation (4), and the dashed lines are the fitted third-order polynomial functions in equation (5). Bottom panels: the fractional difference between the relation in equation (4) and the third-order polynomial fits.$

Figure 1.

The relation between c and R₁ for an NFW profile. Top panels: the circles are the analytical results obtained through equation (4), and the dashed lines are the fitted third-order polynomial functions in equation (5). Bottom panels: the fractional difference between the relation in equation (4) and the third-order polynomial fits.

Open in new tab Download slide

3.2 The NFW profile fitting method

The halo concentration can also be estimated by fitting the density distribution with an NFW profile (e.g. Bhattacharya et al. 2013). One can start with the cumulative mass distribution for an NFW halo:

$$\begin{eqnarray} M(\lt r) = \frac{m(cr/r_{\rm vir})}{m(c)}M_{\rm vir}\, , \end{eqnarray}$$

(6)

where

$$\begin{eqnarray} m(x) = \ln (1 + x)-x/(1+ x)\, . \end{eqnarray}$$

(7)

The optimal concentration can be found by minimizing the χ² defined as

$$\begin{eqnarray} \chi ^2 = \sum _i\frac{\left(M_i^{\rm sim}-M_i\right)^2}{\left(M_i^{\rm sim}\right)^2/n_i}, \end{eqnarray}$$

(8)

where M_i = M(< r_i) − M(< r_{i − 1}) is the mass within the i-th radial bin according to the NFW profile, |$M_i^{\rm sim}$| is the total mass of particles in the same radial bin for the simulated halo, and n_i is the number of particles in that bin. Here we take 20 equally spaced radial bins from 0.01r_vir to r_vir on the logarithmic scale. Clearly, the result of the NFW profile fitting method is subject to the choice of binning. In Appendix B, we test the performance with three different binning strategies and adopt the best one here to compare with the other two methods.

3.3 The V_max/V_vir method

For NFW haloes, the concentration parameter is also related to the ratio between the maximum circular velocity and the virial velocity,

$$\begin{eqnarray} \frac{V_{\rm max}}{V_{\rm vir}} = \frac{{{\tt MAX}}(V_{\rm circ}(r))}{V_{\rm circ}(r_{\rm vir})}\, , \end{eqnarray}$$

(9)

where |$V_{\rm circ}(r) = \sqrt{GM(\lt r)/r}$|⁠, and the relation is

$$\begin{eqnarray} \frac{V_{\rm max}}{V_{\rm vir}} = \left[\frac{0.216 c}{\ln (1 + c)-c/(1+ c)}\right]^{1/2} \end{eqnarray}$$

(10)

(e.g. Klypin et al. 2001). Note that this relation is only applicable for c ≳ 2.16 since V_max ≥ V_vir by definition.

4 TESTING THE PERFORMANCE OF THE HALO CONCENTRATION ESTIMATORS

4.1 Tests on ideal NFW profile

We first test the performance of the three concentration estimation methods on ideal NFW haloes generated from the halofactory package (see Appendix A). The results are presented in Fig. 2. The four columns are for four different input halo concentrations, from c = 1 to c = 20, and the three rows present the results for three different methods. In each panel, the red solid line shows the median fractional deviation of the concentration parameter estimated from 10 000 halo realizations as a function of particle number, and the magenta dashed lines and the cyan dotted lines show the 16th–84th and 2.5th–97.5th percentile ranges, respectively.

$The fractional difference between the input and the estimated concentrations with the NFW fitting method (top panels), the Vmax/Vvir method (middle panels), and the R1 method (bottom panels) for ideal NFW haloes generated with halofactory as a function of the number of particles in the halo. The red solid, magenta dashed, and cyan dotted lines show the 50th, 16th–84th, and 2.5th–97.5th percentiles. This figure shows that the R1 method gives an unbiased and less uncertain estimation of the input halo concentration compared with the other two methods. The middle left panel is empty since the Vmax/Vvir method cannot be applied to haloes with c < 2.16.$

Figure 2.

The fractional difference between the input and the estimated concentrations with the NFW fitting method (top panels), the V_max/V_vir method (middle panels), and the R₁ method (bottom panels) for ideal NFW haloes generated with halofactory as a function of the number of particles in the halo. The red solid, magenta dashed, and cyan dotted lines show the 50th, 16th–84th, and 2.5th–97.5th percentiles. This figure shows that the R₁ method gives an unbiased and less uncertain estimation of the input halo concentration compared with the other two methods. The middle left panel is empty since the V_max/V_vir method cannot be applied to haloes with c < 2.16.

Open in new tab Download slide

First, when the particle number is sufficiently large (≳10⁴), all three methods perform equally well and the fractional deviation of the halo concentration estimation for 95 per cent of halo realizations is within |$\pm 5~{{\ \rm per\ cent}}$|⁠. Secondly, when the particle number decreases to a few hundred, the NFW profile fitting method tends to underestimate halo concentration by |$\approx 10~{{\ \rm per\ cent}}$|⁠. We note that this result is subject to the choice of binning (see Appendix B). The V_max/V_vir method tends to overestimate the halo concentration by |$\approx 30~{{\ \rm per\ cent}}$|⁠, which was already noted in previous studies (see Poveda-Ruiz, Forero-Romero & Muñoz-Cuartas 2016). In contrast, the fractional deviation of the median value for our R₁ method is less than 0.5 per cent. Thirdly, the distribution of the estimated concentration broadens with decreasing particle numbers. When only 100 particles are used, the width of the 16th–84th percentiles is about 0.76c − 1.03c for the NFW fitting method, 0.91c − 1.06c for the V_max/V_vir method, and 0.63c − 0.91c for our R₁ method. Therefore, the R₁ method also yields the smallest variance among all three methods when haloes are poorly sampled. We also note that the V_max/V_vir method is not applicable to haloes with c ≲ 2.16. In addition, Appendix C presents the performance of three other concentration estimation methods. Our test results show that their performances are poorer than the R₁ method, even though some of them are more difficult to obtain from simulation data.

Finally, Appendix D shows the distributions of the logarithmic deviation of halo concentration estimated with our R₁ method. These distributions can be described by Gaussian functions, and the scatter decreases with increasing particle number and decreasing input concentration. A fitting function is provided to describe the dependence of the scatter on the particle number and the input concentration (see equation D2).

4.2 Impact of resolution

We have already shown that our R₁ method outperforms the other two methods in halo concentration estimation using ideal NFW haloes. However, low-mass haloes in realistic N-body simulations are not only poorly sampled, but also subject to the force softening used to avoid unphysical gravity when two particles are too close to each other (e.g. Power et al. 2003; Ludlow, Schaye & Bower 2019). In addition, haloes in simulations are not spherically symmetric and are not perfectly relaxed (e.g. Jing 2000; Jing & Suto 2002). While it is not a priori clear what the true concentration is for haloes in simulations, we can investigate which method gives the best convergence with the numerical resolution.

Here we use three simulations from the IllustrisTNG suite, which are TNG50-1-Dark, TNG100-1-Dark, and TNG100-3-Dark, to test the impact of numerical resolution on the estimation of halo concentration. Note again that the latter two simulations use identical initial conditions and simulation code, but different mass resolutions and gravitational softening lengths (see Table 1). Fig. 3 shows the mass–concentration relation obtained from these three simulations, and the top and bottom panels show results obtained from the NFW profile fitting method and our R₁ method, respectively. First, both methods produce nearly identical median mass–concentration relations and the 16th–84th percentiles in TNG100-1-Dark, whose particle mass is about |$6.0\times 10^6h^{-1}\rm M_\odot$|⁠. Secondly, the NFW profile fitting method underestimates the concentration of |$10^{11}h^{-1}\rm M_\odot$| haloes (≲300 particles) by |$\gtrsim 20~{{\ \rm per\ cent}}$| in TNG100-3-Dark, whose particle mass is about |$3.8\times 10^8h^{-1}\rm M_\odot$|⁠. In contrast, the R₁ method yields nearly identical mass–concentration relations across the entire mass range in these two simulations, and the fractional deviation for low-mass haloes is |$\lesssim 8~{{\ \rm per\ cent}}$| between the two simulations. Note that a |$10^{11}h^{-1}\rm M_\odot$| halo in TNG100-3-Dark is represented by only ≲300 particles. Finally, the mass–concentration relations obtained from TNG50-1-Dark by the two methods are similar to each other, and to those obtained from TNG100-1-Dark. The discrepancy at the massive end owes to cosmic variance, since the two simulations have different box sizes and initial conditions. In Appendix G, we show that the concentration parameter c_200c can also be obtained from R₁ with integrating only to r_500c, which is commonly used in observation.

Figure 3.

The mass–concentration relation for TNG50-1-Dark (cyan error bars), TNG100-1-Dark (red lines), and TNG100-3-Dark (blue lines), where the 16th–50th–84th percentiles are presented. The upper panel shows the result obtained with the NFW profile fitting method, and the lower panel shows the result obtained with our R₁ method. The NFW profile fitting method underestimates halo concentration for low-resolution simulations, while this effect is marginal for our R₁ method.

Open in new tab Download slide

Fig. 4 compares the halo concentration estimated with the NFW profile fitting method and our R₁ method in the three TNG-Dark simulations, where the open circles and error bars show the median and the 16th–84th percentiles, respectively. First, both methods yield similar concentrations for massive haloes and low-concentration low-mass haloes. Secondly, the NFW fitting method produces lower concentrations than our R₁ method for high-concentration low-mass haloes, and the discrepancy is larger in lower resolution simulations. Combined with the results in Fig. 3, we infer that the NFW profile fitting method tends to underestimate the concentration of high-concentration low-mass haloes in low-resolution simulations for two reasons. The first one is that the NFW profile fitting method tends to underestimate halo concentration for poorly sampled haloes, as shown in Fig. 2, but this effect becomes marginal once more than a few thousand particles are sampled. The second reason is that, for a given simulation volume, the force softening length is larger in lower resolution runs, and so is a larger fraction of the virial radius in lower mass haloes, and therefore has a large impact on the central mass profile. This will consequently cause the underestimation of halo concentration in the NFW profile fitting method. A common strategy to tackle this problem is to exclude particles below the convergence radius during the fitting, where the convergence radius is defined such that the two-body dynamical relaxation time-scale of the particles within this radius is comparable to the age of the universe (Power et al. 2003; Duffy et al. 2008; Correa et al. 2015). However, the convergence radius is about 0.1r_200c for haloes with a few hundred particles (Ludlow, Schaye & Bower 2019), and excluding particles within this radius will cause systematic underestimations of the concentration parameter by |$\approx 20~{{\ \rm per\ cent}}\!-\!50~{{\ \rm per\ cent}}$| for haloes with c ≈ 10–20 for the NFW fitting method (see Appendix B). In contrast, the R₁ method is less affected by the inclusion, since it gives more weight to the outer region of dark matter haloes.

Figure 4.

Top panels: the probability distribution function of halo concentration estimated with our R₁ method in three TNG-Dark simulations. Bottom panels: comparison of halo concentration estimated from the NFW profile fitting method and our R₁ method in different halo mass bins. When combined with Fig. 3, this figure demonstrates that, compared with our R₁ method, the NFW fitting method underestimates halo concentration for haloes sampled with small numbers (≲300) of particles, especially for high-concentration haloes.

Open in new tab Download slide

Finally, there is still a noticeable discrepancy between these two methods for high-concentration haloes with |$M_{\rm 200c}\approx 10^{11}h^{-1}\rm M_\odot$| in TNG100-1-Dark and TNG50-1-Dark, where a |$10^{11}\,h^{-1}\,\rm M_\odot$| halo is well represented by ≳2.7 × 10⁵ particles. In Appendix E, we find that these haloes deviate from the NFW profile due to the stripping of mass in the outskirts, and the mass distribution recovered from both concentrations matches the data equally well, despite the |$\gtrsim 10~{{\ \rm per\ cent}}$| systematics in the values of the estimated concentration. Nevertheless, these haloes constitute only a small portion of all haloes in the given mass bin, as one can see from the histogram in the top panels of Fig. 4.

5 THE MASS–CONCENTRATION RELATION IN THE ELUCID SIMULATION

It has been shown in Section 4 that our R₁ method outperforms the conventional method in the halo concentration estimation on both ideal NFW haloes and realistic haloes in N-body simulations, and it can give unbiased estimation of the concentration parameter for haloes with more than 200 particles. For this reason, we apply the R₁ method to the ELUCID simulation and infer the mass–concentration relation for haloes with 11 ≲ log (M_200c/[h⁻¹M_⊙]) ≲ 15. Notably, a |$10^{11}h^{-1}\rm M_\odot$| halo is only represented by about 300 particles in ELUCID.

Fig. 5 shows the median mass–concentration relation in ELUCID, as well as the 16th–84th percentiles. Here relaxed and unrelaxed haloes are separated according to the criterion in Neto et al. (2007), which is

$$\begin{eqnarray} {\rm Relaxed~halos:} \Delta &\lt& 0.07r_{\rm 200c}\\ {\rm Unrelaxed~halos:} \Delta &\gt& 0.07r_{\rm 200c} \\ \Delta &=& \Vert \mathbf {r}_{\rm min-pot}-\mathbf {r}_{\rm com}\Vert , \end{eqnarray}$$

(11)

Figure 5.

The mass–concentration relation in the ELUCID simulation for relaxed (blue error bars) and unrelaxed (red error bars) haloes, and the 16th–84th percentiles. The grey colour scale encodes the number density of dark matter haloes. The criterion for separating relaxed and unrelaxed haloes is shown in equation (11). The solid lines are the predictions of different semi-analytical models: Bullock + 01 (Bullock et al. 2001), Zhao + 09 (Zhao et al. 2009), Prada + 12 (Prada et al. 2012), Correa + 15 (Correa et al. 2015), Ludlow + 16 (Ludlow et al. 2016), Diemer + 19 (Diemer & Joyce 2019), and Ishiyama + 21 (Ishiyama et al. 2021).

Open in new tab Download slide

where r_min-pot is the position of the particle with the minimal gravitational potential, and r_com is the centre of mass of all dark matter particles within r_200c. Note that Neto et al. (2007) use two additional conditions to select haloes in equilibrium. They require that the mass fraction in substructures is lower than a threshold value and that the ratio between the kinetic energy and the potential energy is lower than a threshold. Here we use only the criterion in equation (11), for three reasons. First, as shown in Neto et al. (2007), equation (11) alone can select most of the haloes in equilibrium (see their fig. 2). Secondly, equation (11) is the simplest criterion to implement in N-body simulations, whereas the other two criteria require either identifying substructures or calculating the gravitational potential for each particle. Thirdly, the other two criteria suffer from some ambiguities. For instance, the substructure mass fraction is subject to the substructure finder used (e.g. van den Bosch & Jiang 2016) and to the resolution of the simulation (e.g. van den Bosch et al. 2018). Besides, the exact value of the virial ratio for selecting haloes in equilibrium is still under debate, as many argued that the surface pressure and even the non-spherical shape of haloes should be taken into account (e.g. Davis, D’Aloisio & Natarajan 2011; Klypin et al. 2016). Here one can see that the concentration parameter decreases from ≈8 to ≈4 with increasing mass for relaxed haloes, and from ≈4 to ≈2 for unrelaxed ones. It has already been noted in previous studies that unrelaxed haloes exhibit lower concentration than relaxed ones (e.g. Jing 2000; Neto et al. 2007; Duffy et al. 2008; Child et al. 2018). A detailed analysis in Wang et al. (2020a) reveals that a sudden halo–halo merger event will reduce the concentration dramatically, and the concentration parameter will gradually increase during the subsequent secular evolution. For comparison, the solid lines show the mass–concentration relations given by seven different semi-analytical models with the same cosmology and halo definition.² Our results are broadly consistent with these models.

In addition to the median mass–concentration relation, the distribution of concentration at given halo masses also carries important information. Fig. 6 shows the distribution of the logarithmic halo concentration, log c, for relaxed and unrelaxed haloes in four narrow halo mass bins. For each halo population in a given mass bin, we fit the distribution of log c to a Gaussian function. Each distribution is thus described by three parameters: F as the fraction of the target halo population among all haloes in the same mass bin, μ as the mean of the Gaussian function, and σ as the standard deviation. The fitting functions are shown in blue and red solid lines in Fig. 6 for relaxed and unrelaxed haloes, respectively. One can see that the Gaussian model describes the distribution quite well.

Figure 6.

The distribution of the logarithmic halo concentration in four mass bins for relaxed (blue) and unrelaxed (red) haloes, where the criterion to separate these two populations is shown in equation (11). The distribution of both halo populations are fitted with a Gaussian function as shown in solid lines. The best-fitting parameters are shown on the panel.

Open in new tab Download slide

Fig. 7 shows the halo mass dependence of these fitting parameters. First of all, the unrelaxed haloes only amounts to about 5 per cent of all haloes with |$M_{\rm 200c}\approx 10^{11}h^{-1}\rm M_\odot$|⁠, and this fraction increases to about 15 per cent for |$10^{14}h^{-1}\rm M_\odot$| haloes. The positive correlation between the unrelaxed halo fraction and halo mass is expected, since the halo merger rate is positively correlated to halo mass (e.g. Fakhouri & Ma 2008). Secondly, the mean logarithmic concentration declines with increasing halo mass for both relaxed and unrelaxed haloes, with a constant gap of about 0.28 dex. Finally, the scatter in the distribution of log c for relaxed and unrelaxed haloes are about 0.12 and 0.19 dex, respectively, with a weak dependence on halo mass.

$The halo mass dependence of the fitting parameters for relaxed (blue) and unrelaxed (red) haloes, where the left panel shows the halo fraction F, the middle panel the mean log c, and the right panel the standard deviation of log c.$

Figure 7.

The halo mass dependence of the fitting parameters for relaxed (blue) and unrelaxed (red) haloes, where the left panel shows the halo fraction F, the middle panel the mean log c, and the right panel the standard deviation of log c.

Open in new tab Download slide

6 OTHER APPLICATIONS OF R₁

6.1 Estimating V_max from R₁

The maximum circular velocity, V_max, is not only a proxy of halo concentration, but also a commonly adopted quantity to connect galaxies with their dark matter haloes (Reddick et al. 2013; Matthee et al. 2017; Zehavi et al. 2019). It is thus important to be able to obtain V_max efficiently and robustly for a large sample of simulated haloes in order to investigate the galaxy–halo connection using large cosmological simulations. To this end, we derive V_max from R₁ according to equations (4) and (10). Fig. 8 compares the V_max/V_vir calculated from equation (9) and derived from R₁, where one can see they match quite well. We note that there is a small discrepancy for low-mass haloes with high V_max/V_vir, which has the same origin as the discrepancy for low-mass haloes with high concentrations shown in Fig. 4 (see also Appendix E). Nevertheless, these haloes only account for a small portion of all haloes at the given halo mass bin, as shown in the top panels of Fig. 8. The relative rank is well preserved, as indicated by high Spearman’s rank correlation coefficients (≳0.92).

Top panels: the probability distribution of Vmax/Vvir estimated from R1 in three TNG-Dark simulations. Bottom panels: comparison of Vmax/Vvir calculated from equation (9) and from our R1 method in three TNG-Dark simulations. Spearman’s rank correlation coefficients are labelled on the bottom panels.

Figure 8.

Top panels: the probability distribution of V_max/V_vir estimated from R₁ in three TNG-Dark simulations. Bottom panels: comparison of V_max/V_vir calculated from equation (9) and from our R₁ method in three TNG-Dark simulations. Spearman’s rank correlation coefficients are labelled on the bottom panels.

Open in new tab Download slide

It should be noted that R₁ is defined only for main haloes.³ For a satellite subhalo contained in a host halo, one can trace its main-branch progenitor to the snapshot prior to the infall into its host halo and calculate its R₁ to derive V_max. This is similar to the calculation of V_peak, which is the peak value of V_max on the main branch and serves as a better proxy in subhalo abundance matching than V_max (Reddick et al. 2013). However, it is unclear whether or not environmental effects prior to the infall of haloes can break the relation between V_max and R₁. To test the validity of the R₁ method for subhaloes, we compare results between pre-infall haloes at a given redshift, defined as haloes that will become subhaloes in the subsequent snapshot, and the results are presented in Appendix H. There one can see that the V_max–R₁ relation does not depend on whether or not haloes are soon falling into other haloes to become a satellite, indicating that the R₁ method can also be used to estimate V_peak for subhaloes.

6.2 Estimating the Einasto concentration from R₁

It has been suggested that the radial density distribution of dark matter haloes in N-body simulations is better fitted with a three-parameter Einasto profile (Navarro et al. 2004; Gao et al. 2008; Wang et al. 2020b), which has the form

$$\begin{eqnarray} \rho (r) = \rho _{-2}\exp \left\lbrace -\frac{2}{\alpha }\left[\left(\frac{r}{r_{-2}}\right)^\alpha -1\right]\right\rbrace \, , \end{eqnarray}$$

(12)

where ρ₋₂, α, and r₋₂ are free parameters. Gao et al. (2008) found that there is a universal relationship between α and the peak height ν given by

$$\begin{eqnarray} \alpha = 0.155 + 0.0095v^2, ~\nu = \delta _{\rm crit}(z)/\sigma (M_{\rm vir}, z) , \end{eqnarray}$$

(13)

where the peak height ν is defined as the ratio between the critical overdensity δ_crit(z) for collapse at redshift z and the linear rms fluctuation at z within spheres containing mass M_vir. We note that the value of ν is determined by redshift and halo mass for a given cosmology. The typical value of α is between 0.15 and 0.3. The concentration parameter for the Einasto profile is defined as

$$\begin{eqnarray} c_{\rm e} \equiv r_{\rm vir}/r_{-2}\, . \end{eqnarray}$$

(14)

Therefore, at a given redshift of z, the halo mass M_vir and the Einasto concentration c_e together determine the halo density profile, with the parameter α determined by equation (13).

Fig. 9 shows the relation between R₁ and the Einasto concentration c_e for 0.15 ≤ α ≤ 0.3 in circles. And the solid lines are the fitting function,

$$\begin{eqnarray} c_e &=& d_1x^3 + d_2x^2 + d_3x + d_4\\ x &=& {R_1\over \alpha ^{0.95}} + e_1\alpha ^3 + e_2\alpha ^2 + e_3\alpha + e_4 \\ d_1 &=& -5.45,~~d_2 = 14.72,~~d_3= -18.70,~~d_4 = 9.07 \\ e_1 &=& 191.32,~~e_2=-173.00,~~e_3= 57.78,~~e_4= -8.06 \end{eqnarray}$$

(15)

The bottom panel shows the fractional residual, from which one can see that this fitting function is accurate to |$\lesssim 5~{{\ \rm per\ cent}}$| for c_e ≳ 3 and |$\lesssim 10~{{\ \rm per\ cent}}$| for c_e ≲ 3.

Figure 9.

The relation between R₁ and the concentration parameter of the Einasto profile, c_e, for different values of α (see equation 12). The solid lines are the fitting function in equation (15).

Open in new tab Download slide

7 SUMMARY

Estimating the concentration parameter and related quantities of simulated dark matter haloes in large numerical N-body simulations is a critical step to study halo structure and understand its relation to the halo assembly history and to the properties of galaxies that form in them. A reliable and efficient method is needed to estimate these quantities for large cosmological simulations that include haloes with a wide range of particle numbers. To this end, we propose an efficient and robust method to estimate the halo concentration and related quantities using the first moment of the density distribution. Our main results are summarized as follows:

We find that the first moment of the density distribution, defined as |$R_1=\int _0^{r_{\rm vir}}4\pi r^3\rho (r)\mathrm{ d}r^3/M_{\rm vir}/r_{\rm vir}$|⁠, has a simple, monotonic relation with the halo concentration for NFW haloes. A cubic polynomial function can describe this relation to |$\lesssim 3~{{\ \rm per\ cent}}$| accuracy (see Fig. 1).
Testing on ideal NFW haloes, we find that the NFW profile fitting method and the V_max/V_vir method introduce |$\approx 10~{{\ \rm per\ cent}}$| and |$\approx 30~{{\ \rm per\ cent}}$| systematics for haloes with 100 particles. In contrast, the bias introduced by the R₁ method is smaller than 0.5 per cent. The R₁ method yields the smallest variance among all the three methods.
Testing on realistic haloes in N-body simulations of different resolutions, we find that the NFW fitting method underestimates the concentration parameter of haloes with ≲300 particles by |$\gtrsim 20~{{\ \rm per\ cent}}$|⁠, due to the poor sampling and the large gravitational softening length. In contrast, such effects only introduce |$\lesssim 8~{{\ \rm per\ cent}}$| systematics in the R₁ method (see Figs 3 and 4).
We apply the R₁ method to the ELUCID N-body simulation and obtain the mass–concentration relation across four orders of magnitude of halo mass, separately for relaxed and unrelaxed haloes. We find that the distributions of the logarithmic concentration, log c, for both populations can be described by a Gaussian function. We find that the fraction of unrelaxed haloes ranges from |$\approx 5~{{\ \rm per\ cent}}$| to |$\approx 15~{{\ \rm per\ cent}}$| from |$10^{11}$| to |$10^{14}\,h^{-1}\,\rm M_\odot$|⁠. The mean logarithmic concentration declines monotonically with halo mass for both relaxed and unrelaxed haloes, and there is a constant difference of ≈0.28 dex between unrelaxed haloes of lower concentration and relaxed ones with higher concentration. The standard deviations of the logarithmic concentration for relaxed and unrelaxed haloes are ≈0.12 and ≈0.19 dex, respectively, with a weak dependence on halo mass. (see Figs 5, 6, and 7).
The maximum circular velocity, V_max, of simulated haloes can be derived from R₁ efficiently. The V_max–R₁ relation is not affected by whether or not the halo in question is about to be accreted by another halo and to become a subhalo (see Fig. 8 and Appendix H).
We find a fitting function for the relation between R₁ and the Einasto concentration c_e = r_vir/r₋₂ with 0.15 ≤ α ≤ 0.3, and the fractional deviation is |$\lesssim 5~{{\ \rm per\ cent}}$| for c ≳ 3 and |$\lesssim 10~{{\ \rm per\ cent}}$| for c ≲ 3 (see Fig. 9).

The concentration parameter and related structural quantities of dark matter haloes play an important role in the study of dark matter haloes and the modelling of the galaxy–halo connection. However, because of the uncertainty and tedium in their estimations, many cosmological simulations run in large boxes with relatively low resolutions avoid providing these quantities. The R₁ method proposed here can fill the gap, as it provides an accurate proxy for the concentration parameter for both NFW and Einasto haloes. Its estimation is both straightforward and efficient, thus suitable for large cosmological simulations, such as MillenniumTNG (Bose et al. 2023) and FLAMINGO (Schaye et al. 2023). We suggest that this quantity be provided in simulated halo catalogues along with other important halo properties.

ACKNOWLEDGEMENTS

Kai Wang thanks Fangzhou Jiang for his helpful comments and suggestions. The authors acknowledge the Tsinghua Astrophysics High-Performance Computing platform at Tsinghua University for providing computational and data storage resources that have contributed to the research results reported within this paper. This work is supported by the National Science Foundation of China (NSFC) Grant No. 12125301, 12192220, 12192222, and the science research grants from the China Manned Space Project with NO. CMS-CSST-2021-A07. YC is supported by China Postdoctoral Science Foundation Grant No. 2022TQ0329 and NSFC Grant No. 12192224.

The computation in this work is supported by the HPC toolkits hipp (Chen & Wang 2023) and pyhipp,⁴ipython (Perez & Granger 2007), matplotlib (Hunter 2007), numpy (van der Walt, Colbert & Varoquaux 2011), scipy (Virtanen et al. 2020), and astropy (Astropy Collaboration 2013, 2018, 2022). This research used NASA’s Astrophysics Data System for bibliographic information. The authors thank ELUCID collaboration for making their data products publicly available.⁵

DATA AVAILABILITY

The data underlying this article will be shared on reasonable request to the corresponding author.

Footnotes

https://www.elucid-project.com/

All these models are implemented in the colossus package (Diemer 2018), except Zhao + 09 (http://202.127.29.4/dhzhao/mandc.html) and Correa + 15 (https://www.camilacorrea.com/code/commah/).

In principle the R₁ method can also be used for stripped satellite subhaloes provided the core survives. The integral in equation (3) should then be stopped before the virial radius at some |$R_\Delta$| with Δ > 200 (see Appendix G).

https://github.com/ChenYangyao/pyhipp

https://www.elucid-project.com/

https://github.com/ChenYangyao/halofactory

We refer to Prof. Martin Weinberg’s lecture note for the detailed derivation in https://courses.umass.edu/astron850-mdw/eddington.pdf.

References

Astropy Collaboration

2013

A&A

558

A33

Month:	Total Views:
December 2023	8
January 2024	108
February 2024	30
March 2024	40
April 2024	44
May 2024	37
June 2024	25
July 2024	22
August 2024	22
September 2024	49
October 2024	27
November 2024	19
December 2024	30
January 2025	36
February 2025	42
March 2025	46
April 2025	47
May 2025	15

Article Contents

An efficient and robust method to estimate halo concentration based on the method of moments Open Access

ABSTRACT

1 INTRODUCTION

2 DATA

2.1 Ideal NFW haloes

2.2 N-body simulations

2.2.1 IllustrisTNG-Dark

2.2.2 ELUCID

3 METHOD

3.1 The R1 method

3.2 The NFW profile fitting method

3.3 The Vmax/Vvir method

4 TESTING THE PERFORMANCE OF THE HALO CONCENTRATION ESTIMATORS

4.1 Tests on ideal NFW profile

4.2 Impact of resolution

5 THE MASS–CONCENTRATION RELATION IN THE ELUCID SIMULATION

6 OTHER APPLICATIONS OF R1

6.1 Estimating Vmax from R1

6.2 Estimating the Einasto concentration from R1

7 SUMMARY

ACKNOWLEDGEMENTS

DATA AVAILABILITY

Footnotes

References

APPENDIX A: GENERATE DARK MATTER HALOES WITH HaloFactory

APPENDIX B: IMPACT OF BINNING ON NFW PROFILE FITTING

APPENDIX C: OTHER METHODS TO ESTIMATE HALO CONCENTRATION

APPENDIX D: UNCERTAINTIES OF THE R1 METHOD

APPENDIX E: DENSITY PROFILE FOR LOW-MASS AND HIGH-CONCENTRATION HALOES

APPENDIX F: IMPACT OF COSMOLOGY ON THE MASS–CONCENTRATION RELATION

APPENDIX G: ESTIMATING c200c FROM c500c

APPENDIX H: Vmax ESTIMATION FOR PRE-INFALL HALOES

Citations

Views

Altmetric

Email alerts

Astrophysics Data System

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only

An efficient and robust method to estimate halo concentration based on the method of moments

3.1 The R₁ method

3.3 The V_max/V_vir method

6 OTHER APPLICATIONS OF R₁

6.1 Estimating V_max from R₁

6.2 Estimating the Einasto concentration from R₁

APPENDIX D: UNCERTAINTIES OF THE R₁ METHOD

APPENDIX G: ESTIMATING c_200c FROM c_500c

APPENDIX H: V_max ESTIMATION FOR PRE-INFALL HALOES