Action-based distribution function modelling for constraining the shape of the Galactic dark matter halo

Bayesian prior and posterior distributions of our model parameters and the best-fitting parameters in our fiducial analysis with RR Lyrae stars.

Parameter	Quantity	Prior distribution	Note	Posterior distribution	Best fitting
	MW potential
Free	M_bulge (bulge mass)	M_bulge = (8.9 ± 0.89) × 10⁹M_⊙	(1)	[8.33, 9.23, 10.09] × 10⁹M_⊙	9.53 × 10⁹ M_⊙
Free	M_thin (thin disc mass)	M_thin = (35 ± 10) × 10⁹M_⊙	(2)	[34.39, 36.53, 38.55] × 10⁹M_⊙	37.22 × 10⁹ M_⊙
Free	M_thick (thick disc mass)	M_thick = (6 ± 3) × 10⁹M_⊙	(2)	[4.45, 6.21, 8.59] × 10⁹M_⊙	7.30 × 10⁹M_⊙
Free	\|$R_\mathrm{d}^\mathrm{thin}$\| (thin disc scale radius)	\|$R_\mathrm{d}^\mathrm{thin}= (2.6\pm 0.5) \, \mathrm{kpc}$\|	(2)	\|$[2.58, 2.68, 2.80] \, \mathrm{kpc}$\|	\|$2.63 \, \mathrm{kpc}$\|
Free	\|$R_\mathrm{d}^\mathrm{thick}$\| (thick disc scale radius)	\|$R_\mathrm{d}^\mathrm{thick}= (2.0\pm 0.2) \, \mathrm{kpc}$\|	(2)	\|$[1.76, 1.96, 2.15] \, \mathrm{kpc}$\|	\|$1.94 \, \mathrm{kpc}$\|
Fixed	\|$z_\mathrm{d}^\mathrm{thin}$\| (thin disc scale height)	\|$z_\mathrm{d}^\mathrm{thin}= 0.3 \, \mathrm{kpc}$\| (fixed)	(1)	\|$0.3 \, \mathrm{kpc}$\| (fixed)	...
Fixed	\|$z_\mathrm{d}^\mathrm{thick}$\| (thick disc scale height)	\|$z_\mathrm{d}^\mathrm{thick}= 0.9 \, \mathrm{kpc}$\| (fixed)	(1)	\|$0.9 \, \mathrm{kpc}$\| (fixed)	...
Free	q (DM density flattening)	Flat at 0.2 < U ≤ 0.5 (oblate)	(3)	[0.983, 0.993, 0.998]	0.996
Free	γ (DM inner density slope)	Flat at 0 < γ < 1.9	...	[0.785, 0.982, 1.209]	0.738
Free	a (DM scale radius)	Flat at −∞ < log₁₀a < ∞	...	\|$[10.29, 12.49, 16.66] \, \mathrm{kpc}$\|	\|$10.46 \, \mathrm{kpc}$\|
Free	log₁₀ρ₀ (DM normalization)	Flat at −∞ < log₁₀ρ₀ < ∞	...	[6.89, 7.21, 7.44]	7.44
...	Thick to thin disk ratio	Σ_thick(R₀)/Σ_thin(R₀) = (0.12 ± 0.04)	(2)	[0.070, 0.104, 0.139]	0.120
...	Total stellar mass	Equation (9)	(2)	[50.13, 52.32, 54.24] × 10⁹M_⊙	54.04 × 10⁹M_⊙
...	Dark matter concentration c′	\|$\ln c^{\prime }_\mathrm{v} = \ln (r_{94}/r_{-2}) = 2.56 \pm 0.272$\|	(2)	[18.45, 19.55, 20.89]	18.87
	Stellar halo DF
Free	a_in (inner velocity anisotropy)	Flat at 0 < a_in < 1.	(4)	[0.234, 0.404, 0.649]	0.331
Free	b_in (inner velocity anisotropy)	Flat at 0 < b_in < 1.	(4)	[0.789, 0.879, 0.921]	0.890
Free	a_out (outer velocity anisotropy)	Flat at 0 < a_out < 1.	(4)	[0.292, 0.342, 0.383]	0.346
Free	b_out (outer velocity anisotropy)	Flat at 0 < b_out < 1.	(4)	[0.792, 0.836, 0.861]	0.842
Free	log₁₀J₀ (Action scale in DF)	Flat at −∞ < log₁₀J₀ < ∞	...	[3.02, 3.22, 3.59]	3.25
Free	Γ (inner halo density index)	Flat at 0 ≤ Γ < 2.8	...	[0.49, 1.47, 2.47]	1.50
Fixed	B (outer halo density index)	Fixed to B = 5	(5)	5 (fixed)	...
Fixed	κ (halo rotation)	Fixed to κ = 0 (non-rotating)	(6)	0 (fixed)	...
Fixed	J_ϕ,0 (action scale in DF)	Fixed to J_ϕ,0 = const.	(6)	const. (fixed)	...
Free	log₁₀η (outlier fraction)	Flat at −∞ < log₁₀η < −2	...	[−7.26, −4.65, −3.21]	−3.99

Parameter	Quantity	Prior distribution	Note	Posterior distribution	Best fitting
	MW potential
Free	M_bulge (bulge mass)	M_bulge = (8.9 ± 0.89) × 10⁹M_⊙	(1)	[8.33, 9.23, 10.09] × 10⁹M_⊙	9.53 × 10⁹ M_⊙
Free	M_thin (thin disc mass)	M_thin = (35 ± 10) × 10⁹M_⊙	(2)	[34.39, 36.53, 38.55] × 10⁹M_⊙	37.22 × 10⁹ M_⊙
Free	M_thick (thick disc mass)	M_thick = (6 ± 3) × 10⁹M_⊙	(2)	[4.45, 6.21, 8.59] × 10⁹M_⊙	7.30 × 10⁹M_⊙
Free	\|$R_\mathrm{d}^\mathrm{thin}$\| (thin disc scale radius)	\|$R_\mathrm{d}^\mathrm{thin}= (2.6\pm 0.5) \, \mathrm{kpc}$\|	(2)	\|$[2.58, 2.68, 2.80] \, \mathrm{kpc}$\|	\|$2.63 \, \mathrm{kpc}$\|
Free	\|$R_\mathrm{d}^\mathrm{thick}$\| (thick disc scale radius)	\|$R_\mathrm{d}^\mathrm{thick}= (2.0\pm 0.2) \, \mathrm{kpc}$\|	(2)	\|$[1.76, 1.96, 2.15] \, \mathrm{kpc}$\|	\|$1.94 \, \mathrm{kpc}$\|
Fixed	\|$z_\mathrm{d}^\mathrm{thin}$\| (thin disc scale height)	\|$z_\mathrm{d}^\mathrm{thin}= 0.3 \, \mathrm{kpc}$\| (fixed)	(1)	\|$0.3 \, \mathrm{kpc}$\| (fixed)	...
Fixed	\|$z_\mathrm{d}^\mathrm{thick}$\| (thick disc scale height)	\|$z_\mathrm{d}^\mathrm{thick}= 0.9 \, \mathrm{kpc}$\| (fixed)	(1)	\|$0.9 \, \mathrm{kpc}$\| (fixed)	...
Free	q (DM density flattening)	Flat at 0.2 < U ≤ 0.5 (oblate)	(3)	[0.983, 0.993, 0.998]	0.996
Free	γ (DM inner density slope)	Flat at 0 < γ < 1.9	...	[0.785, 0.982, 1.209]	0.738
Free	a (DM scale radius)	Flat at −∞ < log₁₀a < ∞	...	\|$[10.29, 12.49, 16.66] \, \mathrm{kpc}$\|	\|$10.46 \, \mathrm{kpc}$\|
Free	log₁₀ρ₀ (DM normalization)	Flat at −∞ < log₁₀ρ₀ < ∞	...	[6.89, 7.21, 7.44]	7.44
...	Thick to thin disk ratio	Σ_thick(R₀)/Σ_thin(R₀) = (0.12 ± 0.04)	(2)	[0.070, 0.104, 0.139]	0.120
...	Total stellar mass	Equation (9)	(2)	[50.13, 52.32, 54.24] × 10⁹M_⊙	54.04 × 10⁹M_⊙
...	Dark matter concentration c′	\|$\ln c^{\prime }_\mathrm{v} = \ln (r_{94}/r_{-2}) = 2.56 \pm 0.272$\|	(2)	[18.45, 19.55, 20.89]	18.87
	Stellar halo DF
Free	a_in (inner velocity anisotropy)	Flat at 0 < a_in < 1.	(4)	[0.234, 0.404, 0.649]	0.331
Free	b_in (inner velocity anisotropy)	Flat at 0 < b_in < 1.	(4)	[0.789, 0.879, 0.921]	0.890
Free	a_out (outer velocity anisotropy)	Flat at 0 < a_out < 1.	(4)	[0.292, 0.342, 0.383]	0.346
Free	b_out (outer velocity anisotropy)	Flat at 0 < b_out < 1.	(4)	[0.792, 0.836, 0.861]	0.842
Free	log₁₀J₀ (Action scale in DF)	Flat at −∞ < log₁₀J₀ < ∞	...	[3.02, 3.22, 3.59]	3.25
Free	Γ (inner halo density index)	Flat at 0 ≤ Γ < 2.8	...	[0.49, 1.47, 2.47]	1.50
Fixed	B (outer halo density index)	Fixed to B = 5	(5)	5 (fixed)	...
Fixed	κ (halo rotation)	Fixed to κ = 0 (non-rotating)	(6)	0 (fixed)	...
Fixed	J_ϕ,0 (action scale in DF)	Fixed to J_ϕ,0 = const.	(6)	const. (fixed)	...
Free	log₁₀η (outlier fraction)	Flat at −∞ < log₁₀η < −2	...	[−7.26, −4.65, −3.21]	−3.99

Note. (1) – The same prior as in McMillan (2017). (2) – The prior is taken from the data compiled in Bland-Hawthorn & Gerhard (2016). See Section 5.2. (3) – U = (2/|$\pi$|⁠)arctan(q) (equation 8). The range of 0.2 < U ≤ 0.5 corresponds to 0.3249 < q ≤ 1. (4) – See equation (14). (5) – We fix the parameter B = 5, as it cannot be well constrained by our inner halo sample. (6) – (κ, J_ϕ,0) are fixed in this paper, except for the mock analysis with mock data generated from a cosmological simulation (see Appendix D2.4).

Table 1.

Bayesian prior and posterior distributions of our model parameters and the best-fitting parameters in our fiducial analysis with RR Lyrae stars.

Parameter	Quantity	Prior distribution	Note	Posterior distribution	Best fitting
	MW potential
Free	M_bulge (bulge mass)	M_bulge = (8.9 ± 0.89) × 10⁹M_⊙	(1)	[8.33, 9.23, 10.09] × 10⁹M_⊙	9.53 × 10⁹ M_⊙
Free	M_thin (thin disc mass)	M_thin = (35 ± 10) × 10⁹M_⊙	(2)	[34.39, 36.53, 38.55] × 10⁹M_⊙	37.22 × 10⁹ M_⊙
Free	M_thick (thick disc mass)	M_thick = (6 ± 3) × 10⁹M_⊙	(2)	[4.45, 6.21, 8.59] × 10⁹M_⊙	7.30 × 10⁹M_⊙
Free	\|$R_\mathrm{d}^\mathrm{thin}$\| (thin disc scale radius)	\|$R_\mathrm{d}^\mathrm{thin}= (2.6\pm 0.5) \, \mathrm{kpc}$\|	(2)	\|$[2.58, 2.68, 2.80] \, \mathrm{kpc}$\|	\|$2.63 \, \mathrm{kpc}$\|
Free	\|$R_\mathrm{d}^\mathrm{thick}$\| (thick disc scale radius)	\|$R_\mathrm{d}^\mathrm{thick}= (2.0\pm 0.2) \, \mathrm{kpc}$\|	(2)	\|$[1.76, 1.96, 2.15] \, \mathrm{kpc}$\|	\|$1.94 \, \mathrm{kpc}$\|
Fixed	\|$z_\mathrm{d}^\mathrm{thin}$\| (thin disc scale height)	\|$z_\mathrm{d}^\mathrm{thin}= 0.3 \, \mathrm{kpc}$\| (fixed)	(1)	\|$0.3 \, \mathrm{kpc}$\| (fixed)	...
Fixed	\|$z_\mathrm{d}^\mathrm{thick}$\| (thick disc scale height)	\|$z_\mathrm{d}^\mathrm{thick}= 0.9 \, \mathrm{kpc}$\| (fixed)	(1)	\|$0.9 \, \mathrm{kpc}$\| (fixed)	...
Free	q (DM density flattening)	Flat at 0.2 < U ≤ 0.5 (oblate)	(3)	[0.983, 0.993, 0.998]	0.996
Free	γ (DM inner density slope)	Flat at 0 < γ < 1.9	...	[0.785, 0.982, 1.209]	0.738
Free	a (DM scale radius)	Flat at −∞ < log₁₀a < ∞	...	\|$[10.29, 12.49, 16.66] \, \mathrm{kpc}$\|	\|$10.46 \, \mathrm{kpc}$\|
Free	log₁₀ρ₀ (DM normalization)	Flat at −∞ < log₁₀ρ₀ < ∞	...	[6.89, 7.21, 7.44]	7.44
...	Thick to thin disk ratio	Σ_thick(R₀)/Σ_thin(R₀) = (0.12 ± 0.04)	(2)	[0.070, 0.104, 0.139]	0.120
...	Total stellar mass	Equation (9)	(2)	[50.13, 52.32, 54.24] × 10⁹M_⊙	54.04 × 10⁹M_⊙
...	Dark matter concentration c′	\|$\ln c^{\prime }_\mathrm{v} = \ln (r_{94}/r_{-2}) = 2.56 \pm 0.272$\|	(2)	[18.45, 19.55, 20.89]	18.87
	Stellar halo DF
Free	a_in (inner velocity anisotropy)	Flat at 0 < a_in < 1.	(4)	[0.234, 0.404, 0.649]	0.331
Free	b_in (inner velocity anisotropy)	Flat at 0 < b_in < 1.	(4)	[0.789, 0.879, 0.921]	0.890
Free	a_out (outer velocity anisotropy)	Flat at 0 < a_out < 1.	(4)	[0.292, 0.342, 0.383]	0.346
Free	b_out (outer velocity anisotropy)	Flat at 0 < b_out < 1.	(4)	[0.792, 0.836, 0.861]	0.842
Free	log₁₀J₀ (Action scale in DF)	Flat at −∞ < log₁₀J₀ < ∞	...	[3.02, 3.22, 3.59]	3.25
Free	Γ (inner halo density index)	Flat at 0 ≤ Γ < 2.8	...	[0.49, 1.47, 2.47]	1.50
Fixed	B (outer halo density index)	Fixed to B = 5	(5)	5 (fixed)	...
Fixed	κ (halo rotation)	Fixed to κ = 0 (non-rotating)	(6)	0 (fixed)	...
Fixed	J_ϕ,0 (action scale in DF)	Fixed to J_ϕ,0 = const.	(6)	const. (fixed)	...
Free	log₁₀η (outlier fraction)	Flat at −∞ < log₁₀η < −2	...	[−7.26, −4.65, −3.21]	−3.99

Parameter	Quantity	Prior distribution	Note	Posterior distribution	Best fitting
	MW potential
Free	M_bulge (bulge mass)	M_bulge = (8.9 ± 0.89) × 10⁹M_⊙	(1)	[8.33, 9.23, 10.09] × 10⁹M_⊙	9.53 × 10⁹ M_⊙
Free	M_thin (thin disc mass)	M_thin = (35 ± 10) × 10⁹M_⊙	(2)	[34.39, 36.53, 38.55] × 10⁹M_⊙	37.22 × 10⁹ M_⊙
Free	M_thick (thick disc mass)	M_thick = (6 ± 3) × 10⁹M_⊙	(2)	[4.45, 6.21, 8.59] × 10⁹M_⊙	7.30 × 10⁹M_⊙
Free	\|$R_\mathrm{d}^\mathrm{thin}$\| (thin disc scale radius)	\|$R_\mathrm{d}^\mathrm{thin}= (2.6\pm 0.5) \, \mathrm{kpc}$\|	(2)	\|$[2.58, 2.68, 2.80] \, \mathrm{kpc}$\|	\|$2.63 \, \mathrm{kpc}$\|
Free	\|$R_\mathrm{d}^\mathrm{thick}$\| (thick disc scale radius)	\|$R_\mathrm{d}^\mathrm{thick}= (2.0\pm 0.2) \, \mathrm{kpc}$\|	(2)	\|$[1.76, 1.96, 2.15] \, \mathrm{kpc}$\|	\|$1.94 \, \mathrm{kpc}$\|
Fixed	\|$z_\mathrm{d}^\mathrm{thin}$\| (thin disc scale height)	\|$z_\mathrm{d}^\mathrm{thin}= 0.3 \, \mathrm{kpc}$\| (fixed)	(1)	\|$0.3 \, \mathrm{kpc}$\| (fixed)	...
Fixed	\|$z_\mathrm{d}^\mathrm{thick}$\| (thick disc scale height)	\|$z_\mathrm{d}^\mathrm{thick}= 0.9 \, \mathrm{kpc}$\| (fixed)	(1)	\|$0.9 \, \mathrm{kpc}$\| (fixed)	...
Free	q (DM density flattening)	Flat at 0.2 < U ≤ 0.5 (oblate)	(3)	[0.983, 0.993, 0.998]	0.996
Free	γ (DM inner density slope)	Flat at 0 < γ < 1.9	...	[0.785, 0.982, 1.209]	0.738
Free	a (DM scale radius)	Flat at −∞ < log₁₀a < ∞	...	\|$[10.29, 12.49, 16.66] \, \mathrm{kpc}$\|	\|$10.46 \, \mathrm{kpc}$\|
Free	log₁₀ρ₀ (DM normalization)	Flat at −∞ < log₁₀ρ₀ < ∞	...	[6.89, 7.21, 7.44]	7.44
...	Thick to thin disk ratio	Σ_thick(R₀)/Σ_thin(R₀) = (0.12 ± 0.04)	(2)	[0.070, 0.104, 0.139]	0.120
...	Total stellar mass	Equation (9)	(2)	[50.13, 52.32, 54.24] × 10⁹M_⊙	54.04 × 10⁹M_⊙
...	Dark matter concentration c′	\|$\ln c^{\prime }_\mathrm{v} = \ln (r_{94}/r_{-2}) = 2.56 \pm 0.272$\|	(2)	[18.45, 19.55, 20.89]	18.87
	Stellar halo DF
Free	a_in (inner velocity anisotropy)	Flat at 0 < a_in < 1.	(4)	[0.234, 0.404, 0.649]	0.331
Free	b_in (inner velocity anisotropy)	Flat at 0 < b_in < 1.	(4)	[0.789, 0.879, 0.921]	0.890
Free	a_out (outer velocity anisotropy)	Flat at 0 < a_out < 1.	(4)	[0.292, 0.342, 0.383]	0.346
Free	b_out (outer velocity anisotropy)	Flat at 0 < b_out < 1.	(4)	[0.792, 0.836, 0.861]	0.842
Free	log₁₀J₀ (Action scale in DF)	Flat at −∞ < log₁₀J₀ < ∞	...	[3.02, 3.22, 3.59]	3.25
Free	Γ (inner halo density index)	Flat at 0 ≤ Γ < 2.8	...	[0.49, 1.47, 2.47]	1.50
Fixed	B (outer halo density index)	Fixed to B = 5	(5)	5 (fixed)	...
Fixed	κ (halo rotation)	Fixed to κ = 0 (non-rotating)	(6)	0 (fixed)	...
Fixed	J_ϕ,0 (action scale in DF)	Fixed to J_ϕ,0 = const.	(6)	const. (fixed)	...
Free	log₁₀η (outlier fraction)	Flat at −∞ < log₁₀η < −2	...	[−7.26, −4.65, −3.21]	−3.99

Note. (1) – The same prior as in McMillan (2017). (2) – The prior is taken from the data compiled in Bland-Hawthorn & Gerhard (2016). See Section 5.2. (3) – U = (2/|$\pi$|⁠)arctan(q) (equation 8). The range of 0.2 < U ≤ 0.5 corresponds to 0.3249 < q ≤ 1. (4) – See equation (14). (5) – We fix the parameter B = 5, as it cannot be well constrained by our inner halo sample. (6) – (κ, J_ϕ,0) are fixed in this paper, except for the mock analysis with mock data generated from a cosmological simulation (see Appendix D2.4).

3.1 Models for the gravitational potential

We model Φ(R, z) following the parametrization by McMillan (2017), except that we allow the DM distribution to be oblate-axisymmetric rather than spherical. First, we briefly summarize the functional forms assumed for the baryonic mass components (the bulge, stellar discs, and gas discs) which are identical to McMillan (2017). In addition, most of the parameters of the baryonic potential are also fixed to the best-fitting parameters in McMillan (2017). We then describe our assumed DM distribution model.

3.1.1 The bulge

For the bulge, we adopt a function of the form,

$$\begin{eqnarray*} \rho _\mathrm{bulge}(R,z) = \frac{c_b \times M_\mathrm{bulge}}{(1 + (r^{\prime }/0.075 \, \mathrm{kpc}))^{1.8}} \exp \left[ -\left(\frac{r^{\prime }}{2.1 \, \mathrm{kpc}} \right)^{2} \right], \end{eqnarray*}$$

(3)

where (r′)² = R² + (z/0.5)². The total bulge mass is M_bulge, and c_b is a normalization constant. M_bulge is treated as a free parameter, with a prior of M_bulge = (8.90 ± 0.89) × 10⁹M_⊙ (see Table 1).

3.1.2 The thin and thick stellar discs

We adopt exponential density profiles for the thin and thick discs of the form,

$$\begin{eqnarray*} \rho _{\operatorname{stellar-disc}}(R,z) = \frac{M_\mathrm{disc}}{4 \pi R_\mathrm{d}^2 z_\mathrm{d}} \exp \left[ - \frac{|z|}{z_\mathrm{d}} - \frac{R}{R_\mathrm{d}} \right]. \end{eqnarray*}$$

(4)

The scale heights |$z_\mathrm{d}=0.3\, \mathrm{kpc}$| for the thin disc and |$z_\mathrm{d}=0.9\, \mathrm{kpc}$| for the thick disc are fixed following McMillan (2017). We treat the mass and the scale radius of these components, |$(M_\mathrm{thin}, R_\mathrm{d}^\mathrm{thin}, M_\mathrm{thick}, R_\mathrm{d}^\mathrm{thick})$|⁠, as parameters. The prior on these parameters is shown in Table 1.

We set an additional prior on the ratio of surface densities of thin and thick discs evaluated at Solar cylinder:

$$\begin{eqnarray*} \Sigma _\mathrm{thick} (R_0) / \Sigma _\mathrm{thin} (R_0) =(0.12\pm 0.04), \end{eqnarray*}$$

(5)

which is taken from Bland-Hawthorn & Gerhard (2016). We note that McMillan (2017) put a prior on the ratio of densities (rather than surface densities) of thin and thick discs, |$\rho _{\operatorname{stellar-disc}}^\mathrm{thick}(R_0, 0) / \rho _{\operatorname{stellar-disc}}^\mathrm{thin} (R_0, 0) =0.12\pm 0.012.$| However, we choose the surface density ratio, because this quantity is more robustly estimated in the literature (Bland-Hawthorn & Gerhard 2016).

3.1.3 The atomic and molecular gas discs

We adopt a density profile for the atomic (H i) and molecular (H₂) gas discs of the form,

$$\begin{eqnarray*} \rho _{\operatorname{gas-disc}}(R,z) = \frac{\Sigma _0}{4 z_\mathrm{d}} \exp \left[ - \frac{R_\mathrm{m}}{R} - \frac{R}{R_\mathrm{d}} \right] \mathrm{sech}^2\left[ \frac{z}{2 z_\mathrm{d}} \right], \end{eqnarray*}$$

(6)

and fix all the parameters as in table 1 of McMillan (2017).

3.1.4 The dark matter halo

We adopt a flattened, generalized NFW density profile ρ_DM(R, z) with a suppression at large radii given by

$$\begin{eqnarray*} \rho _\mathrm{DM}(R,z) = \rho _{0} \frac{a^3}{m^{\gamma } (a +m)^{3-\gamma }} \exp \left[ -\left(\frac{m}{400 \, \mathrm{kpc}} \right)^{6} \right], \end{eqnarray*}$$

(7)

where m² = R² + (z/q)². The four free parameters are the density flattening parameter q (or an auxiliary parameter U as mentioned below), the density normalization ρ₀, the scale length a, and the inner density slope γ. We note that the DM density is suppressed at large Galactocentric radii so that the total mass of the DM halo is finite. However, the way we suppress the DM density is not very important in our study, because we only use inner halo stars for our inference and therefore we can only constrain the DM density profile in the inner halo.

In this paper, we use q when we interpret our result. However, we internally use an auxiliary variable U

$$\begin{eqnarray*} U = \frac{2}{\pi }\mathrm{arctan}(q), \end{eqnarray*}$$

(8)

when fitting our model to the data (following Bowden et al. 2016). This is because U is mathematically a better way of expressing the density flattening. We note that oblate models have 0 < U < 0.5 (0 < q < 1), prolate models have 0.5 < U < 1 (1 < q < ∞), and spherical models have U = 0.5 (q = 1).

As we shall describe in Section 3.1.5, we do not allow prolate DM distributions. Thus, we use a flat prior for U: 0.2 < U ≤ 0.5. (This range of U corresponds to 0.3249 < q ≤ 1.) The prior for (γ, a, ρ₀) are shown in Table 1.

We use similar definitions for the concentration parameter, virial mass, and virial radius to those in McMillan (2017), by evaluating these quantities by spherically averaging the non-spherical density. Namely, for a given set of parameters (ρ₀, γ, a, q), we define two radii r₂₀₀ and r₉₄ such that the mean density within a sphere of r₂₀₀ and r₉₄ is 200 and 94 times the critical density (⁠|$\rho _\mathrm{crit} = 137.55 {\,\rm M_\odot} \, \mathrm{kpc}^{-3})$|⁠; see Binney & Tremaine 2008), respectively. The virial mass M₂₀₀ and M₉₄ are defined as the enclosed mass within the sphere of radius r₂₀₀ and r₉₄, respectively. We define 〈ρ(r)〉 as the mean density within a spherical shell centred at a radius r; and we define the radius r₋₂ such that dln 〈ρ〉/dln r = −2. The concentration parameter is defined as |$c^{\prime }_\mathrm{v} = r_{94}/r_{-2}$|⁠.

Following McMillan (2017), we set a prior on |$\log _{10} c^{\prime }_\mathrm{v}$|⁠, as shown in Table 1. Also following McMillan (2017), who uses ‘abundance matching’ of galaxy stellar masses with halo virial masses, M₂₀₀, from cosmological simulations, we set the prior on the total stellar mass M_star (the sum of the bulge, thin disc, and thick disc) to be given by

$$\begin{eqnarray*} \log _{10}M_\mathrm{star} = \log _{10} \left\lbrace \frac{ M_{200} \times 2 \times 0.0351}{ \left(\frac{M_{200}}{M_\mathrm{knee}}\right)^{-1.376} +\left(\frac{M_{200}}{M_\mathrm{knee}}\right)^{ 0.608} } \right\rbrace \pm 0.20 , \end{eqnarray*}$$

(9)

with M_knee = 10^11.59 M_⊙. In other words, the prior distribution of the logarithm of the stellar mass is centred at a value given by a function of the dark (virial) mass determined from abundance matching with a scatter of 0.20 dex (a scatter of |${\sim}58{{\ \rm per\ cent}}$| in the stellar mass).

3.1.5 A note on the oblate assumption of the dark matter halo

Throughout this paper, we only consider q ≤ 1, because the current version of AGAMA (Vasiliev 2019) that is used in our paper only allows computation of orbital actions in oblate (or spherical) potentials. This limitation is inherent to the Stäckel fudge method (Binney 2012) and stems from a more complex orbital structure of prolate potentials, which support two different long-axis tube orbit classes.

In computing the radial and vertical action of a given star with the Stäckel fudge method, it is essential to define an adequate ellipsoidal coordinate (λ, μ) in the (R, z) plane, such that the orbital motion of the star in (R, z) plane is approximately bounded by two λ = constant curves and two μ = constant curves (Binney 2012; Sanders & Binney 2015a, 2016; Vasiliev 2019). When the MW potential is oblate axisymmetric, such an ellipsoidal coordinate has the foci on the z-axis (i.e. prolate coordinate system is needed for an oblate potential). However, if the MW potential is prolate axisymmetric, due to the orbital geometry allowed in the prolate systems, the foci are located on the R-axis. The current version of AGAMA only supports the ellipsoidal coordinate for which foci are on the z-axis (i.e. valid only in oblate potentials). Therefore, using the current version of AGAMA to compute radial or vertical action in prolate system is mathematically incorrect. We note that some studies including Posti & Helmi (2019) did not take this limitation into account although they used AGAMA.

3.2 Distribution function model

We assume that the DF of the stellar halo is given by the sum of two components, the main component and the outlier component:

$$\begin{eqnarray*} f(\rm{\boldsymbol {x}},\rm{\boldsymbol {v}}) = (1-\eta) f_\mathrm{main}(\rm{\boldsymbol {J}}[\rm{\boldsymbol {x}},\rm{\boldsymbol {v}}]) + \eta f_\mathrm{outlier}(\rm{\boldsymbol {x}},\rm{\boldsymbol {v}}). \end{eqnarray*}$$

(10)

Here, |$\rm{\boldsymbol {J}}=(J_r, J_z, J_\phi)$| are the radial, vertical, and azimuthal action, respectively (Binney & Tremaine 2008). Each of the DFs f, f_main, and f_outlier is separately normalized to unity when integrated over |$(\rm{\boldsymbol {x}},\rm{\boldsymbol {v}})$|⁠. The parameter η describes the fraction of outlier stars which is assumed to be small. We adopt a flat prior for log₁₀η at log₁₀η < −2.

3.2.1 Action-based distribution function

The main component has an analytic form given by

$$\begin{eqnarray*} f_\mathrm{main}(J_r, J_\phi , J_z) &=& \frac{C_A}{(2\pi J_0)^3} \left(\frac{h(\rm{\boldsymbol {J}})}{J_0} \right)^{-\Gamma } \left[ 1 + \left(\frac{g(\rm{\boldsymbol {J}})}{J_0}\right) \right]^{(\Gamma - B)}\nonumber \\&&\times \, \left[ 1+\kappa \tanh \left(\frac{J_\phi }{J_{\phi ,0}} \right) \right] . \end{eqnarray*}$$

(11)

Here, the normalization factor C_A is defined such that the integral of f_main over the entire 6D phase-space is unity. This double power-law model was proposed by Posti et al. (2015), and is flexible enough to reproduce the broken power-law density profile of the halo (e.g. Deason et al. 2014), as well as the radially varying flattening and velocity anisotropy.

The parameter J₀ implicitly determines the break radius, where these halo properties gradually change from the inner to the outer asymptotic values (the relation between J₀ and the actual radius depends on the potential). We adopt a flat prior on log₁₀J₀.

The parameter Γ(<3) and B(>3) govern the inner and outer density slope of the stellar halo, respectively.³ For the prior on Γ, we adopt a flat prior at Γ at 0 ≤ Γ < 2.8. The parameter B is fixed to B = 5 in our analysis (cf. Binney & Wong 2017), because it is difficult to constrain B (or, roughly speaking the outer density profile) with our inner halo data at |$5 \lesssim r/\, \mathrm{kpc}\lesssim 27.5$|⁠.

We note that

$$\begin{eqnarray*} h(\rm{\boldsymbol {J}}) = h_r J_r + h_z J_z + h_\phi |J_\phi |, \end{eqnarray*}$$

(12)

and

$$\begin{eqnarray*} g(\rm{\boldsymbol {J}}) = g_r J_r + g_z J_z + g_\phi |J_\phi |, \end{eqnarray*}$$

(13)

are functions that govern the velocity anisotropy in the inner and outer part of the halo. The coefficients in |$h(\rm{\boldsymbol {J}})$| and |$g(\rm{\boldsymbol {J}})$| are subject to the following constraints: 0 < h_i (i = r, z, ϕ), h_r + h_z + h_ϕ = 3, 0 < g_i (i = r, z, ϕ), g_r + g_z + g_ϕ = 3. To handle these six variables (with four degrees of freedom) easily, in our analysis, we introduce four independent parameters, (a_in, b_in, a_out, b_out), such that

$$\begin{eqnarray*} \left\lbrace \begin{array}{l} h_r = 3 \sqrt{a_\mathrm{in}} (1-b_\mathrm{in}) \\ h_z = 3 \sqrt{a_\mathrm{in}} b_\mathrm{in} \\ h_\phi = 3 (1 - \sqrt{a_\mathrm{in}}) \\ \end{array}\right. \,\,\,\, \left\lbrace \begin{array}{l} g_r = 3 \sqrt{a_\mathrm{out}} (1-b_\mathrm{out}) \\ g_z = 3 \sqrt{a_\mathrm{out}} b_\mathrm{out} \\ g_\phi = 3 (1 - \sqrt{a_\mathrm{out}}) . \end{array}\right. \end{eqnarray*}$$

(14)

Instead of setting a prior on (h_r, h_z, h_ϕ) and (g_r, g_z, g_ϕ), we set a uniform prior between 0 and 1 on (a_in, b_in, a_out, b_out). This prior is mathematically equivalent to sampling points uniformly from a 2D region enclosed by an equilateral triangle. With this prior, we can sample (h_r, h_z, h_ϕ) and (g_r, g_z, g_ϕ) in an unbiased manner.

The parameter κ determines the net rotation of the stellar halo and J_ϕ,0 determines the scale of the angular momentum under which the rotation is suppressed. In the fiducial analysis with the Gaia RR Lyrae stars, we set κ = 0 and set J_ϕ,0 to be some constant. This is because we observe little net rotation for 1022 RR Lyrae stars within our survey volume with full 3D velocity data.⁴

To summarize, in analysing the Gaia RR Lyrae sample, we treat the following parameters as free parameters for the action-based DF f_main: The ‘break radius’ action J₀, the parameter governing the inner stellar halo density Γ, and the quantities governing the velocity anisotropy (a_in, b_in, a_out, b_out). Other parameters are fixed, as shown in Table 1.

3.2.2 Simple distribution function for the outlier population

Based on some tests with cosmological simulations, we found that f_main is flexible enough to capture the DF of the inner stellar halo. In reality, however, we expect that a small fraction of our sample stars may not be well described by f_main. For example, we noticed that some objects in our RR Lyrae star catalogue have very large tangential velocities, probably because they are misclassified as RR Lyrae stars (e.g. due to blending with nearby sources; see Section 2.2 of Iorio & Belokurov 2019 and references therein). Such a star would deteriorate the fit of the gravitational potential, because even a single star with extremely large velocity requires a very massive DM halo (since we assume that all the stars in our sample are bound to the MW).⁵ In order to handle these outlier stars, we introduce |$f_\mathrm{outlier}(\rm{\boldsymbol {x}}, \rm{\boldsymbol {v}})$| given by

$$\begin{eqnarray*} f_\mathrm{outlier}(\rm{\boldsymbol {x}}, \rm{\boldsymbol {v}}) = \left\lbrace \begin{array}{l{@}{\quad}l}0, & (\text{if}\ \rm{\boldsymbol {x}}\, \rm{is\, outside \,survey \,volume})\\ \frac{C_B}{\left(2\pi \sigma _\mathrm{outlier}^2 \right)^{\frac{3}{2}} } & \exp \left[ -\frac{|\rm{\boldsymbol {v}}|^2}{2 \sigma _\mathrm{outlier}^2} \right], \text{(otherwise)} \end{array}\right. \end{eqnarray*}$$

(15)

with |$\sigma _\mathrm{outlier}=1000 \, \mathrm{km\ s}^{-1}$|⁠. The normalization factor C_B is the reciprocal of the survey volume so that the integration of f_outlier over the 6D phase-space accessible to the survey is unity.

3.3 Selection function model

We model the sample selection function in equation (1) as

$$\begin{eqnarray*} S (\mathcal {D}, \ell , b) = \left\lbrace \begin{array}{l{@}{\quad}l} 1, &(\mathcal {D}_\mathrm{min}(b) \le \mathcal {D}\le \mathcal {D}_\mathrm{max}, |b| \ge 20^\circ), \\ 0, &(\text{otherwise}). \end{array}\right. \end{eqnarray*}$$

(16)

Here, |$\mathcal {D}_\mathrm{min}(b)$| and |$\mathcal {D}_\mathrm{max}$| are the minimum and maximum distance moduli for each line of sight, and are given by

$$\begin{eqnarray*} \mathcal {D}_\mathrm{min}(b) = 5 \log _{10} \left(\frac{5 \, \mathrm{kpc}/ |\sin b|}{10 \, \mathrm{pc}} \right), \end{eqnarray*}$$

(17)

$$\begin{eqnarray*} \mathcal {D}_\mathrm{max} = 5 \log _{10} \left(\frac{20 \, \mathrm{kpc}}{10 \, \mathrm{pc}} \right) = 16.505. \end{eqnarray*}$$

(18)

In this selection function model, it is implicitly assumed that the completeness of the RR Lyrae stars is 100 per cent in the survey volume. However, our result is unaffected as long as the completeness is constant within the survey volume. Indeed, according to fig. 13 of Mateu et al. (2020), the completeness of the RR Lyrae sample in Gaia DR2 at |b| > 20^○ is almost insensitive to the G-magnitude at 14 < G < 18, where most of our sample reside. Thus, the simple selection function model in the above equations is reasonable for our analysis.

3.4 Error model

Throughout this paper, we use primed variables to denote the true (error-free) quantities. We assume that the observational errors on |$(\ell , b, \mathcal {D}, \mu _{\alpha *}, \mu _\delta , v_{\mathrm{los}})$| are either negligible (δ functions) or Gaussian distributed:

$$\begin{eqnarray*} \mathrm{Pr}(\ell | \ell ^{\prime }, M) = \delta (\ell - \ell ^{\prime }), \end{eqnarray*}$$

(19)

$$\begin{eqnarray*} \mathrm{Pr}(b | b^{\prime }, M) = \delta (b - b^{\prime }), \end{eqnarray*}$$

(20)

$$\begin{eqnarray*} \mathrm{Pr}(\mathcal {D}| \mathcal {D}^{\prime }, M) = \frac{1}{\sqrt{2\pi } \sigma _{\mathcal {D}}} \exp \left[ - \frac{(\mathcal {D}-\mathcal {D}^{\prime })^2}{2 \sigma _{\mathcal {D}}^2} \right], \end{eqnarray*}$$

(21)

$$\begin{eqnarray*} \mathrm{Pr}(\rm{\boldsymbol \mu } | \rm{\boldsymbol \mu ^{\prime }}, M) = \frac{1}{ 2\pi |\Sigma _\mu |^{1/2}} \exp \left[ - \frac{1}{2} (\rm{\boldsymbol \mu }-\rm{\boldsymbol \mu ^{\prime }})^\mathrm{T} \Sigma _\mu ^{-1} (\rm{\boldsymbol \mu }-\rm{\boldsymbol \mu ^{\prime }}) \right], \end{eqnarray*}$$

(22)

$$\begin{eqnarray*} \mathrm{Pr}(v_{\mathrm{los}}| v_{\mathrm{los}}^{\prime }, M) = \frac{1}{\sqrt{2\pi } \sigma _\mathrm{v}} \exp \left[ - \frac{(v_{\mathrm{los}}-v_{\mathrm{los}}^{\prime })^2}{2 \sigma _\mathrm{v}^2} \right]. \end{eqnarray*}$$

(23)

Here, M indicates our model, which includes the model for the observational errors. As mentioned in Section 2.3.2, we assume that |$\sigma _{\mathcal {D}}=0.240$| is identical for all the RR Lyrae stars. We fully take into account the correlated uncertainties in |$\rm{\boldsymbol \mu } = (\mu _{\alpha *}, \mu _\delta)$|⁠, and Σ_μ is the covariance matrix. We note that equation (23) is still valid even if the v_los is not available, because we can set a large value of σ_v in such a case (as mentioned in McMillan & Binney 2013).

4 LIKELIHOOD OF THE STELLAR HALO DATA

As mentioned in Section 3, the objective of our analysis is to fit the kinematic data for RR Lyrae stars with a DF model. Here, we derive the likelihood for the RR Lyrae data given a set of model parameters.

The likelihood function we adopt is similar to those that have already been derived and discussed in previous studies (most notably McMillan & Binney 2012, 2013; Trick et al. 2016). However, these previous derivations ignore, or do not properly consider, the observational errors in distances to stars. We propose a new approach to handling the distance errors by taking advantage of the fact that all the RR Lyrae stars in our sample have approximately the same error on distance modulus.

For completeness, we start our discussion from the case where the stellar sample is error-free. Then, we proceed to a more realistic case in which observational errors (including distance errors and missing line-of-sight velocities) are taken into account.

4.1 Formulation with error-free data

In the absence of the observational errors, given the model M, the probability that ith star is found in a Cartesian phase-space volume |$\mathrm{d}^3\rm{\boldsymbol {x}} \mathrm{d}^3\rm{\boldsymbol {v}}$| centred at |$(\rm{\boldsymbol {x}}_i, \rm{\boldsymbol {v}}_i)$| is expressed as

$$\begin{eqnarray*} \mathrm{Pr} (\rm{\boldsymbol {x}}_i, \rm{\boldsymbol {v}}_i | M) \mathrm{d}^3\rm{\boldsymbol {x}} \mathrm{d}^3\rm{\boldsymbol {v}} &=& \frac{f(\rm{\boldsymbol {x}}_i, \rm{\boldsymbol {v}}_i | M) S(\rm{\boldsymbol {x}}_i) \mathrm{d}^3\rm{\boldsymbol {x}} \mathrm{d}^3\rm{\boldsymbol {v}} }{\int \mathrm{d}^3\rm{\boldsymbol {x}} \mathrm{d}^3\rm{\boldsymbol {v}} \,\, f(\rm{\boldsymbol {x}}, \rm{\boldsymbol {v}} | M) S(\rm{\boldsymbol {x}}) } \nonumber \\ &=& \frac{f(\rm{\boldsymbol {x}}_i, \rm{\boldsymbol {v}}_i | M) S(\rm{\boldsymbol {x}}_i) \left| \frac{\partial (\rm{\boldsymbol {x}}, \rm{\boldsymbol {v}})}{\partial \rm{\boldsymbol {u}}} \right|_{i} \mathrm{d}^6\rm{\boldsymbol {u}} }{\int \mathrm{d}^3\rm{\boldsymbol {x}} \mathrm{d}^3\rm{\boldsymbol {v}} \,\, f(\rm{\boldsymbol {x}}, \rm{\boldsymbol {v}} | M) S(\rm{\boldsymbol {x}}) } . \end{eqnarray*}$$

(24)

Here, |$\rm{\boldsymbol {u}}$| is the observable vector defined in equation (2). The function |$S(\rm{\boldsymbol {x}})$| denotes the selection function of the survey, which depends on position only. The Jacobian is given by

$$\begin{eqnarray*} \left| \frac{\partial (\rm{\boldsymbol {x}}, \rm{\boldsymbol {v}})}{\partial \rm{\boldsymbol {u}}} \right| = \frac{\ln 10}{5} k^2 d^5 \cos b, \end{eqnarray*}$$

(25)

where |$k=4.74047 \, \mathrm{km\ s}^{-1}(\, \mathrm{mas\ yr}^{-1})^{-1}$| and d is the heliocentric distance (in |$\, \mathrm{kpc}$|⁠) corresponding to the distance modulus |$\mathcal {D}$|⁠. The subscript i in the Jacobian in equation (24) implies that the quantity is evaluated at |$(\rm{\boldsymbol {x}}_i, \rm{\boldsymbol {v}}_i)$|⁠.

4.2 Formulation with observational errors

In the presence of the observational errors, the expression for the probability |$\mathrm{Pr} (\rm{\boldsymbol {x}}_i, \rm{\boldsymbol {v}}_i | M)$| becomes more complicated, as pointed out by Trick et al. (2016).⁶ We introduce a function |$E(\rm{\boldsymbol {u}} | \rm{\boldsymbol {u}}^{\prime } , M)$| that denotes the probability that a star’s observable vector is |$\rm{\boldsymbol {u}}$| given its true vector |$\rm{\boldsymbol {u}}^{\prime }$| and the model M. (Remember our notation that a primed quantity such as |$\rm{\boldsymbol {u}}^{\prime }$| denotes the true value of an unprimed quantity; see Section 3.4.)

Given the model M, the probability that ith star is found in a observable phase-space volume |$\mathrm{d}^6\bar{\rm{\boldsymbol {u}}}$| centred at the observed value |$\bar{\rm{\boldsymbol {u}}}_i$| is expressed as

$$\begin{eqnarray*} &&{\mathrm{Pr} (\bar{\rm{\boldsymbol {u}}}_i | M) \mathrm{d}^6\bar{\rm{\boldsymbol {u}}}} \nonumber \\ &&{\,\, =\frac{ \mathrm{d}^6 \bar{\rm{\boldsymbol {u}}} \,\, S(\rm{\boldsymbol {x}}(\bar{\rm{\boldsymbol {u}}}_i)) \int \mathrm{d}^6 \rm{\boldsymbol {u}}^{\prime } \,\, E(\bar{\rm{\boldsymbol {u}}}_i | \rm{\boldsymbol {u}}^{\prime } , M) f(\rm{\boldsymbol {x}}^{\prime } (\rm{\boldsymbol {u}}^{\prime }) , \rm{\boldsymbol {v}}^{\prime } (\rm{\boldsymbol {u}}^{\prime }) |M) \left| \frac{\partial (\rm{\boldsymbol {x}}^{\prime }, \rm{\boldsymbol {v}}^{\prime })}{\partial \rm{\boldsymbol {u}}^{\prime }} \right| }{ \int \mathrm{d}^6 \rm{\boldsymbol {u}} \int \mathrm{d}^6 \rm{\boldsymbol {u}}^{\prime } \,\, E(\rm{\boldsymbol {u}} | \rm{\boldsymbol {u}}^{\prime } , M) f(\rm{\boldsymbol {x}}^{\prime } (\rm{\boldsymbol {u}}^{\prime }) , \rm{\boldsymbol {v}}^{\prime } (\rm{\boldsymbol {u}}^{\prime }) |M) S(\rm{\boldsymbol {x}}(\rm{\boldsymbol {u}})) \left| \frac{\partial (\rm{\boldsymbol {x}}^{\prime }, \rm{\boldsymbol {v}}^{\prime })}{\partial \rm{\boldsymbol {u}}^{\prime }} \right| }. }\nonumber\\ \end{eqnarray*}$$

(26)

For brevity, we use the simplification |$f(\rm{\boldsymbol {u}}^{\prime } | M) \equiv f(\rm{\boldsymbol {x}}^{\prime } (\rm{\boldsymbol {u}}^{\prime }) , \rm{\boldsymbol {v}}^{\prime } (\rm{\boldsymbol {u}}^{\prime }) |M)$|⁠. Then, we obtain an expression for |$\mathrm{Pr} (\bar{\rm{\boldsymbol {u}}}_i | M)$|⁠:

$$\begin{eqnarray*} \mathrm{Pr} (\bar{\rm{\boldsymbol {u}}}_i | M) =\frac{ S(\rm{\boldsymbol {x}}(\bar{\rm{\boldsymbol {u}}}_i)) \int \mathrm{d}^6 \rm{\boldsymbol {u}}^{\prime } \,\, E(\bar{\rm{\boldsymbol {u}}}_i | \rm{\boldsymbol {u}}^{\prime } , M) f(\rm{\boldsymbol {u}}^{\prime } | M) \left| \frac{\partial (\rm{\boldsymbol {x}}^{\prime }, \rm{\boldsymbol {v}}^{\prime })}{\partial \rm{\boldsymbol {u}}^{\prime }} \right| }{ \int \mathrm{d}^6 \rm{\boldsymbol {u}} \int \mathrm{d}^6 \rm{\boldsymbol {u}}^{\prime } \,\, E(\rm{\boldsymbol {u}} | \rm{\boldsymbol {u}}^{\prime } , M) f(\rm{\boldsymbol {u}}^{\prime } | M) S(\rm{\boldsymbol {x}}(\rm{\boldsymbol {u}})) \left| \frac{\partial (\rm{\boldsymbol {x}}^{\prime }, \rm{\boldsymbol {v}}^{\prime })}{\partial \rm{\boldsymbol {u}}^{\prime }} \right| } . \nonumber\\ \end{eqnarray*}$$

(27)

As seen from this expression, the integral of E × f over |$\rm{\boldsymbol {u}}^{\prime }$| is a convolution of the DF with the error kernel E, basically smearing out the true DF according to the uncertainty described by E. Also, we note that the selection function S acts on the blurred stellar distribution and therefore the argument of S is not |$\rm{\boldsymbol {x}}^{\prime }$| but |$\rm{\boldsymbol {x}}$|⁠.

4.3 Evaluation of equation (27) for the RR Lyrae sample

The result of equation (27) is generally applicable to both 6D data and 5D data, including our RR Lyrae sample with missing v_los. This is because 5D data without v_los data is equivalent to 6D data with large observational errors in v_los (McMillan & Binney 2013). In the following, we will show how to evaluate |$\mathrm{Pr} (\bar{\rm{\boldsymbol {u}}}_i | M)$| in equation (27) for our RR Lyrae sample.

4.3.1 Denominator in equation (27)

By using the selection function model (Section 2.3.1) and the error model (Section 3.4), the denominator of equation (27) is given by

$$\begin{eqnarray*} A &=& \int \mathrm{d}^6 \rm{\boldsymbol {u}}^{\prime } f(\rm{\boldsymbol {u}}^{\prime } | M) \left| \frac{\partial (\rm{\boldsymbol {x}}, \rm{\boldsymbol {v}})}{\partial \rm{\boldsymbol {u}}^{\prime }} \right| \int \mathrm{d}^6 \rm{\boldsymbol {u}} \,\, E(\rm{\boldsymbol {u}} | \rm{\boldsymbol {u}}^{\prime } , M) S(\mathcal {D}, \ell , b) \nonumber \\ &=& \int \mathrm{d}^6 \rm{\boldsymbol {u}}^{\prime } f(\rm{\boldsymbol {u}}^{\prime } | M) \left| \frac{\partial (\rm{\boldsymbol {x}}, \rm{\boldsymbol {v}})}{\partial \rm{\boldsymbol {u}}^{\prime }} \right| \int \mathrm{d}^6 \rm{\boldsymbol {u}} \,\, S(\mathcal {D}, \ell , b) \mathrm{Pr}(\mathcal {D}| \mathcal {D}^{\prime }, M) \nonumber \\ &&\times \delta (\ell -\ell ^{\prime }) \delta (b-b^{\prime }) \mathrm{Pr}(v_{\mathrm{los}}| v_{\mathrm{los}}^{\prime }, M) \mathrm{Pr}(\rm{\boldsymbol \mu }|\rm{\boldsymbol \mu ^{\prime }}, M). \end{eqnarray*}$$

(28)

With equations (19)–(22), the integration over (ℓ, b, μ_ℓ*, μ_b) reduces to unity. By using equation (23) and assuming σ_v → ∞, the integration over v_los also reduces to unity. Thus, we obtain

$$\begin{eqnarray*} A &=& \int _{\text{footprint}} \mathrm{d}^2 [\ell ^{\prime }, (\sin b^{\prime })] \int \mathrm{d}^4 [\mathcal {D}^{\prime }, \rm{\boldsymbol \mu ^{\prime }}, v_{\mathrm{los}}^{\prime }] \,\, f(\rm{\boldsymbol {u}}^{\prime } |M) \nonumber \\ &&\times \frac{\ln 10}{5} k^2 (d^{\prime })^5 \int _{\mathcal {D}_\mathrm{min}}^{\mathcal {D}_\mathrm{max}} \mathrm{d} \mathcal {D}\,\, \mathrm{Pr}(\mathcal {D}|\mathcal {D}^{\prime },M) \nonumber \\ &=& \int _{\text{footprint}} \mathrm{d}^2 [\ell ^{\prime }, (\sin b^{\prime })] \int \mathrm{d}^4 [\mathcal {D}^{\prime }, \rm{\boldsymbol \mu ^{\prime }}, v_{\mathrm{los}}^{\prime }] \,\, f(\rm{\boldsymbol {u}}^{\prime } |M) \nonumber \\ &&\times \frac{\ln 10}{5} k^2 (d^{\prime })^5 \frac{1}{2}\! \left[ \mathrm{erf} \left(\frac{\mathcal {D}_\mathrm{max} - \mathcal {D}^{\prime }}{\sqrt{2} \sigma _{\mathcal {D}}} \right) + \mathrm{erf} \left(\frac{\mathcal {D}^{\prime } - DM_\mathrm{min}}{\sqrt{2} \sigma _{\mathcal {D}}} \right) \right]\!.\nonumber\\ \end{eqnarray*}$$

(29)

Here, |$(\mathcal {D}_\mathrm{min}, \mathcal {D}_\mathrm{max})$| are defined in Section 3.3. An intuitively understandable expression for A can be obtained by performing the integration over |$(\mu _{\alpha *}^{\prime }, \mu _\delta ^{\prime }, v_{\mathrm{los}}^{\prime })$|⁠:

$$\begin{eqnarray*} A &=& \int _{\text{footprint}} \mathrm{d}^2 [\ell ^{\prime }, (\sin b^{\prime })] \int \mathrm{d} \mathcal {D}^{\prime } \,\, \rho (\mathcal {D}^{\prime }, \ell ^{\prime }, b^{\prime } |M)\frac{\ln 10}{5} (d^{\prime })^3 \nonumber \\ &&\times \, \frac{1}{2} \left[ \mathrm{erf} \left(\frac{\mathcal {D}_\mathrm{max} - \mathcal {D}^{\prime }}{\sqrt{2} \sigma _{\mathcal {D}}} \right) + \mathrm{erf} \left(\frac{\mathcal {D}^{\prime } - \mathcal {D}_\mathrm{min}}{\sqrt{2} \sigma _{\mathcal {D}}} \right) \right]. \end{eqnarray*}$$

(30)

Here, |$\rho (\mathcal {D}^{\prime }, \ell ^{\prime }, b^{\prime } |M) = \int \mathrm{d}^3 \rm{\boldsymbol {v}}^{\prime } \,\, f(\rm{\boldsymbol {u}}^{\prime } |M)$| is the (normalized) stellar density. In the limit of |$\sigma _{\mathcal {D}} \rightarrow 0$|⁠, the factor |$\frac{1}{2}[\mathrm{erf}(.)+\mathrm{erf}(.)]$| is unity and thus A is the mass enclosed in the survey volume, as pointed out by Trick et al. (2016). In the presence of non-zero |$\sigma _{\mathcal {D}}$|⁠, A can be interpreted as the mass enclosed in the survey volume which is blurred by the distance errors. Practically, we find that the integration over |$\mathcal {D}^{\prime }$| needs to be performed at |$\mathcal {D}_\mathrm{min}-4\sigma _{\mathcal {D}} \lt \mathcal {D}^{\prime } \lt \mathcal {D}_\mathrm{max}+4\sigma _{\mathcal {D}}$| in equations (29) and (30).

In general, |$\sigma _{\mathcal {D}}$| is non-zero and each star has a different value of |$\sigma _{\mathcal {D}}$|⁠. In such a case, we need to evaluate A for each star, which is computationally very expensive. This is why |$\sigma _{\mathcal {D}}$| is neglected (or explicitly set to be zero) in previous studies (McMillan & Binney 2013; Trick et al. 2016). However, if |$\sigma _{\mathcal {D}}$| is approximately the same for the entire sample, which is the case for our RR Lyrae stars, we need to evaluate A only once for a given model, by assuming a single value for |$\sigma _{\mathcal {D}}$|⁠. With this prescription, we can dramatically reduce the computational cost while keeping our likelihood evaluation more precise. The derivation of equations (29) and (30) is the most important improvement we have over the previous formulation by McMillan & Binney (2013) and Trick et al. (2016).

4.3.2 Numerator in equation (27)

The numerator of equation (27) can be numerically evaluated using Monte Carlo integration.

If we had 6D data for the RR Lyrae stars, the evaluation would be relatively easy, because we would only need to sample from the error distribution of |$\rm{\boldsymbol {u}}^{\prime }$| to perform numerical integration. In such a case, we first randomly draw N_MC realizations of the observable vector |$\rm{\boldsymbol {u}}^{\prime }_{ij} =(\ell ^{\prime }, b^{\prime }, \mathcal {D}^{\prime }, \mu _{\alpha *}^{\prime }, \mu _\delta ^{\prime }, v_{\mathrm{los}}^{\prime })_{ij}$| (j = 1,.., N_MC) from the corresponding error distributions, defined in equations (19)–(23), centred around |$\bar{\rm{\boldsymbol {u}}}_i$|⁠. Then, by using these N_MC realizations of |$\lbrace \rm{\boldsymbol {u}}^{\prime }_{ij} \rbrace$|⁠, we can evaluate the equation (27):

$$\begin{eqnarray*} \mathrm{(6D)} \,\, \mathrm{Pr} (\bar{\rm{\boldsymbol {u}}}_i | M) = \frac{1}{A} \frac{ S(\rm{\boldsymbol {x}}(\bar{\rm{\boldsymbol {u}}}_i)) }{N_\mathrm{MC}} \sum _{j=1}^{N_\mathrm{MC}} { f(\rm{\boldsymbol {u}}^{\prime }_{ij} | M) \left| \frac{\partial (\rm{\boldsymbol {x}}^{\prime }, \rm{\boldsymbol {v}}^{\prime })}{\partial \rm{\boldsymbol {u}}^{\prime }} \right|_{ij} } . \end{eqnarray*}$$

(31)

We use equation (31) to evaluate the model likelihood when we analyse the mock 6D data in Appendix D.

For our 5D RR Lyrae sample, the above-mentioned sampling method needs a modification, because the 5D data lack in v_los. For example, if we naively sample |$v_{\mathrm{los}}^{\prime }$| from a very wide distribution (assuming large σ_v in equation 23), a large fraction of the sampled phase-space coordinate |$\rm{\boldsymbol {u}}^{\prime }_{ij}$| corresponds to unbound stars. To achieve computational efficiency, we adopt an importance sampling for |$v_{\mathrm{los}}^{\prime }$| using a Cauchy distribution. Namely, for the ith star, we first draw N_MC samples from a Cauchy distribution. The scale parameter for the Cauchy distribution is fixed to |$150 \, \mathrm{km\ s}^{-1}$| and the location parameter is set to be the Solar reflex motion in the direction of the ith star, |$-(\rm{\boldsymbol {v}}_\odot \cdot \rm{\boldsymbol {e}}_{\mathrm{los},i})$|⁠. The other 5D phase-space coordinates are drawn in the same manner as before, using the error distribution given in equations (19)–(22). For each realization of the observable vector, |$\rm{\boldsymbol {u}}^{\prime }_{ij} =(\ell ^{\prime }, b^{\prime }, DM^{\prime }, \mu _{\alpha *}^{\prime }, \mu _\delta ^{\prime }, v_{\mathrm{los}}^{\prime })_{ij}$|⁠, we assign a weight

$$\begin{eqnarray*} w_{ij} = \pi (150 \, \mathrm{km\ s}^{-1}) \left[ 1 + \left(\frac{{v_{\mathrm{los}}^{\prime }}_{ij} - (- \rm{\boldsymbol {v}}_\odot \cdot \rm{\boldsymbol {e}}_{\mathrm{los},i})}{150 \, \mathrm{km\ s}^{-1}} \right)^2 \right] , \end{eqnarray*}$$

(32)

which is the reciprocal of the probability density of the above-mentioned Cauchy distribution.⁷ Here, |$\pi$| ≃ 3.14 is a mathematical constant. Finally, by using the realizations of |$\rm{\boldsymbol {u}}^{\prime }_{ij}$| and the weight w_ij, we evaluate the equation (27):

$$\begin{eqnarray*} \mathrm{(5D)} \,\, \mathrm{Pr} (\bar{\rm{\boldsymbol {u}}}_i | M) = \frac{1}{A} \frac{ S(\rm{\boldsymbol {x}}(\bar{\rm{\boldsymbol {u}}}_i)) }{N_\mathrm{MC}} \sum _{j=1}^{N_\mathrm{MC}} { w_{ij} f(\rm{\boldsymbol {u}}^{\prime }_{ij} | M) \left| \frac{\partial (\rm{\boldsymbol {x}}^{\prime }, \rm{\boldsymbol {v}}^{\prime })}{\partial \rm{\boldsymbol {u}}^{\prime }} \right|_{ij} } . \end{eqnarray*}$$

(33)

4.4 Likelihood of the RR Lyrae stars

By using the expressions above, the logarithmic likelihood of the entire observed data set from the RR Lyrae sample given a model M can be expressed as

$$\begin{eqnarray*} \text{(6D)} \sum _{i=1}^{N_\mathrm{RRL}} \ln \mathrm{Pr} (\bar{\rm{\boldsymbol {u}}}_i | M) &=& - N_\mathrm{RRL} \ln A + \sum _{i=1}^{N_\mathrm{RRL}} \nonumber \\ &&\times \ln\,\left[\frac{1}{{N_\mathrm{MC}}} \sum _{j=1}^{N_\mathrm{MC}} { f(\rm{\boldsymbol {u}}^{\prime }_{ij} | M) \left| \frac{\partial (\rm{\boldsymbol {x}}^{\prime }, \rm{\boldsymbol {v}}^{\prime })}{\partial \rm{\boldsymbol {u}}^{\prime }} \right|_{ij} } \right]\!\! , \nonumber\\ \end{eqnarray*}$$

(34)

in the case of 6D data and

$$\begin{eqnarray*} &&{\text{(5D)} \sum _{i=1}^{N_\mathrm{RRL}} \ln \mathrm{Pr} (\bar{\rm{\boldsymbol {u}}}_i | M) = - N_\mathrm{RRL} \ln A + \sum _{i=1}^{N_\mathrm{RRL}}} \nonumber\\ &&{\quad\times \,\ln \left[ \frac{1}{{N_\mathrm{MC}}} \sum _{j=1}^{N_\mathrm{MC}} { w_{ij} f(\rm{\boldsymbol {u}}^{\prime }_{ij} | M) \left| \frac{\partial (\rm{\boldsymbol {x}}^{\prime }, \rm{\boldsymbol {v}}^{\prime })}{\partial \rm{\boldsymbol {u}}^{\prime }} \right|_{ij} } \right] ,} \end{eqnarray*}$$

(35)

in the case of 5D data. Here, we assume that our RR Lyrae sample stars are complete within our survey volume and thus |$S(\rm{\boldsymbol {x}}(\bar{\rm{\boldsymbol {u}}}_i)) =1$| for all the stars (i = 1,..., N_RRL).

In our analysis, however, we slightly modify this likelihood and adopt a total likelihood given by

$$\begin{eqnarray*} \ln L_\mathrm{RRL}(\lbrace \bar{\rm{\boldsymbol {u}}} \rbrace | M) = \frac{N_\mathrm{RRL,eff}}{N_\mathrm{RRL}} \sum _{i=1}^{N_\mathrm{RRL}} \ln \mathrm{Pr} (\bar{\rm{\boldsymbol {u}}}_i | M) , \end{eqnarray*}$$

(36)

with N_RRL,eff = 1000. This modification is a necessary compromise between the numerical accuracy requirements and limited computational resources, as described in the following subsection.

4.4.1 Numerical evaluation of the total likelihood

Our goal is to combine the likelihood of the RR Lyrae data given model M, ln L(Data(RRL)|M), with the likelihood functions of the circular velocity data and the vertical force data for our Markov chain Monte Carlo (MCMC) analysis. Thus, we need to be careful so that the numerical noise in the likelihood will not seriously affect our inference of the model parameters. Here, we focus on the 5D case and we describe two important numerical techniques to achieve our goal.

The first technique is related to the Monte Carlo integration of equation (33). The precision of this integration is determined by the number of Monte Carlo samples, N_MC. Ideally, it is desirable to set N_MC as large as possible to minimize the numerical noise in |$\mathrm{Pr} (\bar{\rm{\boldsymbol {u}}}_i | M)$|⁠. However, this requires a large computational cost although we are not specifically interested in the absolute value of |$\mathrm{Pr} (\bar{\rm{\boldsymbol {u}}}_i | M)$|⁠. Rather, we are more interested in the relative value of |$\mathrm{Pr} (\bar{\rm{\boldsymbol {u}}}_i | M)$| for different models (e.g. M = M₁ and M = M₂). Thus, following McMillan & Binney (2013), we use the same set of sampling points |$\rm{\boldsymbol {u}}^{\prime }_{ij}$| and weights w_ij throughout our MCMC analysis. We find that N_MC = 100 is enough for our purpose.

The second technique is related to the evaluation of A in equation (35). As discussed in Section 4.3.1, the value of A is common for all the RR Lyrae stars. If the fractional error in A is ϵ (ϵ ≪ 1), then this error results in an error of δ(log₁₀L(Data(RRL)|M)) = (1/ln 10)N_RRL,effϵ = 0.43N_RRL,effϵ (see discussion in McMillan & Binney 2013). If we require a tolerance of δ(log₁₀L) < 0.5, then the fractional error in A has to satisfy ϵ < (0.5/ln 10)/N_RRL,eff = 1/(0.87N_RRL,eff). With unlimited computational resources, we could have set N_RRL,eff = 16197(= N_RRL). However, the evaluation of A (see equation 29) is computationally challenging, because it involves 6D integration in the phase-space of observable quantities and the conversion of observable quantities into actions. We use an adaptive multidimensional integration package, cubature (https://github.com/stevengj/cubature), to evaluate A, and find that even with this sophisticated package, the integration does not converge within 10 min per model⁸ if we set N_RRL,eff = N_RRL. After some experiments, we find that setting N_RRL,eff = 1000 (or N_RRL,eff ≤ 3000) is a reasonable choice for our analysis in terms of the computational speed and numerical accuracy. Mathematically, setting N_RRL,eff < N_RRL is equivalent to assigning a weight of (N_RRL,eff/N_RRL) to each of our RR Lyrae stars. As a result, the constraining power from our RR Lyrae star sample is reduced, as if we only had N_RRL,eff stars in our catalogue.

5 ANALYSIS

From Bayes’ theorem, the posterior distribution of the model parameters M given the data D is expressed as

$$\begin{eqnarray*} \mathrm{Pr} (M | D) = \frac{\mathrm{Pr} (D| M) \mathrm{Pr} (M)}{\mathrm{Pr} (D)} , \end{eqnarray*}$$

(37)

where the Bayesian evidence, Pr(D), can be considered as a constant in our analysis. In this section, we discuss the total likelihood Pr(D|M), our choice of the prior Pr(M), and provide a description of the implementation of our Bayesian analysis.

5.1 Bayesian likelihood

5.1.1 Likelihood of the circular velocity data

The circular velocity at radius R is given by

$$\begin{eqnarray*} v_\mathrm{circ}^\mathrm{model} (R) = \left[ R \left(\frac{\partial \Phi (R,z)}{\partial R} \right) \right]^{1/2}_{z=0} . \end{eqnarray*}$$

(38)

By using the measured circular velocity and the associated random error {v_circ(R_circ,i) ± σ_circ,rand(R_circ,i)} at radius {R_circ,i} (for i = 1, ⋅⋅⋅, N_circ) taken from Eilers et al. (2019), the logarithmic likelihood of the circular velocity data is given by

$$\begin{eqnarray*} \ln L_\mathrm{circ} = - \sum _{i=1}^{N_\mathrm{circ}} \frac{1}{2} \left(\frac{v_\mathrm{circ}(R_{\mathrm{circ},i}) - v_\mathrm{circ}^\mathrm{model} (R_{\mathrm{circ},i})}{\sigma _\mathrm{circ,rand}(R_{\mathrm{circ},i})} \right)^2 . \end{eqnarray*}$$

(39)

5.1.2 Likelihood of the vertical force data

The vertical force at |$(R,z)=(R, 1.1 \, \mathrm{kpc})$| is given by

$$\begin{eqnarray*} K_{z,1.1 \, \mathrm{kpc}}^\mathrm{model} (R) = \left[ - \frac{\partial \Phi (R,z)}{\partial z} \right]_{z=1.1 \, \mathrm{kpc}} . \end{eqnarray*}$$

(40)

By using the measured vertical force and the associated error {K_z(R_Kz,i) ± σ_Kz(R_Kz,i)} at radius {R_Kz,i} (for i = 1, ⋅⋅⋅, N_Kz) taken from Bovy & Rix (2013), the logarithmic likelihood of the vertical force data is expressed as

$$\begin{eqnarray*} \ln L_\mathrm{Kz} = - \sum _{i=1}^{N_\mathrm{Kz}} \frac{1}{2} \left(\frac{K_z(R_{\mathrm{Kz},i}) - K_z^\mathrm{model} (R_{\mathrm{Kz},i})}{\sigma _\mathrm{Kz}(R_{\mathrm{Kz},i})} \right)^2 . \end{eqnarray*}$$

(41)

5.1.3 Total likelihood of the data

Given the model M, the logarithmic likelihood of the entire data D is expressed as

$$\begin{eqnarray*} \ln \mathrm{Pr}(D|M) = \ln L_\mathrm{circ} + \ln L_\mathrm{Kz} + \ln L_\mathrm{RRL} , \end{eqnarray*}$$

(42)

where L_circ, L_Kz, and L_RRL are defined in equations (39), (41), and (36), respectively. We note that we reduce the weight from RR Lyrae data by a factor (N_RRL,eff/N_RRL) = 1000/16197 when we compute L_RRL due to our computational limitation (see Section 4.4). Note that we also did an additional test without using the RR Lyrae data, by removing the last term in equation (42). This additional analysis clearly shows that the RR Lyrae data are essential for constraining the flattening of the DM halo which is completely unconstrained by the other two data sets (see Appendix C for details). This additional analysis also shows that the inclusion of the RR Lyrae data in the likelihood function affects several other model parameters slightly and significantly narrows the posterior distribution of the halo concentration parameter.

5.2 Bayesian prior

Our Bayesian prior for the MW potential and the stellar halo DF is described in Sections 3.1 and 3.2, respectively. These priors are summarized in Table 1. We note that the prior distributions for the parameters of the model potential are mostly taken from Bland-Hawthorn & Gerhard (2016) and McMillan (2017). We have tried various prior distributions and confirmed that the choice of the prior distribution does not change the main conclusion of our paper, especially the flattening of the DM halo.

5.3 Markov chain Monte Carlo analysis

To estimate the model parameters, we first search for the maximum-likelihood parameters with a Nelder-Mead optimization package constrNMPy (https://github.com/alexblaessle/constrNMPy) with some reasonable tolerance level. Then we use the resultant parameters as the initial condition of the Bayesian MCMC analysis. We use a package emcee (Foreman-Mackey et al. 2013) for the MCMC analysis. We use (2 × N_free) walkers (where N_free is the number of free parameters) and run the MCMC for several thousand steps. We discard initial half of the chain for burn-in and analysed the remaining chain.

The analysis code is written in python, and it is developed from an example code in AGAMA (https://github.com/GalacticDynamics-Oxford/Agama/blob/master/py/example_df_fit.py). In Appendix D, we validate our method with mock data sets.

6 RESULTS

In this section, we describe the results of our Bayesian MCMC analysis. The posterior distribution is summarized in Table 1. The corner plots are given in Appendix B.

6.1 Comparison of the input data and our model

To check the performance of our method, we first compare the input data and our model predictions.

6.1.1 Circular velocity

Fig. 2 shows the radial profile of the circular velocity v_circ(R) and the contribution from baryonic and DM components sampled from our posterior distribution. We can see that our model properly fits the rotation curve data from Eilers et al. (2019).

Figure 1.

Distribution of the 16 197 stars in our RR Lyrae sample on the sky (left-hand panel), in the Galactocentric Cartesian (x, z)-plane (middle panel) and (y, z)-plane (right-hand panel). Our sample does not include stars associated with obvious substructure, such as the Large and Small Magellanic Clouds (two small holes in the left-hand panel at around 270^○ < ℓ < 315^○, −50^○ < b < −25^○). In the middle- and right-hand panels, the Solar location is marked with the red ⊙ symbol. Our sample selection criteria only select stars that are confined within the region enclosed by the solid blue curve.

Figure 2.

The radial profile of the circular velocity v_circ(R), along with its contribution from baryon and DM. The grey shaded region corresponds to the central 68 percentile of the posterior distribution of our model, while the magenta solid lines cover the central 95 percentile. The blue data points with error bar are taken from Eilers et al. (2019), which are one of the input data sets for our fit.

The baryonic contribution of the circular velocity is widely discussed in terms of the disc maximality (Sackett 1997; Bovy & Rix 2013). We found that the ratio of the circular velocity contribution from the thin and thick stellar discs to the total circular velocity is

$$\begin{eqnarray*} \left. \rm{(Disc maximality)} \equiv \frac{V_\mathrm{circ, stellar\rm{-}disc}}{V_\mathrm{circ, total}} \right|_{R = 2.2 \langle R_\mathrm{d} \rangle } = 0.70_{-0.03}^{+0.02}, \end{eqnarray*}$$

(43)

when measured at 2.2 times the mass-weighted disc scale radius

$$\begin{eqnarray*} \langle R_\mathrm{d} \rangle = \frac{M_\mathrm{thin} R_\mathrm{d}^\mathrm{thin} + M_\mathrm{thick} R_\mathrm{d}^\mathrm{thick} }{M_\mathrm{thin} + M_\mathrm{thick}} . \end{eqnarray*}$$

(44)

The disc maximality is almost unchanged (by just ∼1 per cent level) if we include the contribution from the atomic and molecular gas discs that are fixed in our analysis. This fraction is noticeably smaller than the value expected for the so-called maximal discs with 0.85 ± 0.10 (Sackett 1997). Also, a similar quantity for the DM halo is estimated to be

$$\begin{eqnarray*} \left. \frac{V_\mathrm{circ, DM}}{V_\mathrm{circ, total}} \right|_{R = 2.2 \langle R_\mathrm{d} \rangle } = 0.59 \pm 0.03 . \end{eqnarray*}$$

(45)

Therefore, our analysis suggests that the rotation support of the inner MW by the DM halo is not negligible.

We note that the v_circ data used in our study (Eilers et al. 2019) were also analysed in Eilers et al. (2019) and de Salas et al. (2019) to estimate the DM density profile. Both of these studies found a reasonable model that fits the input data. de Salas et al. (2019) pointed out that a good fit to the v_circ data can be achieved by assuming different functional forms of the baryonic potential. This result suggests that a wide variety of baryonic models can explain the v_circ data equally well. Therefore, it is not surprising that we arrived at a good fit to the v_circ data. de Salas et al. (2019) also found that the posterior distribution of the parameters in their baryonic model potentials, such as the mass and the scale radius of the bulge or the stellar disc, are dominated by the prior distribution. This result implies that the circular velocity data alone are not good enough to constrain all the parameters of the potential. Similar to their finding, we found that the posterior distributions of the parameters for the bulge and the thick disc are dominated by the priors when we used the v_circ data plus K_z data (with or without the RR Lyrae data) (see Fig. C1). Our result confirms that a proper modelling of the baryonic mass model and a well-determined prior information on the parameters for the baryonic mass model are essential to make an inference on the MW potential.

6.1.2 Vertical force

Fig. 3 shows the radial profile of the vertical force |$K_{z,1.1\, \mathrm{kpc}}$| sampled from our posterior distribution. We can see that our model properly fits the data points of Bovy & Rix (2013). Our posterior distribution suggests that the local value of |$K_{z,1.1\, \mathrm{kpc}}$| measured at |$R=R_0=8.178 \, \mathrm{kpc}$| (Gravity Collaboration 2019) is |$K_{z,1.1\, \mathrm{kpc}}(R_0) = (72.7 \pm 1.4) / (2\pi G {\,\rm M_\odot} \, \mathrm{pc}^{-2})$|⁠, which is consistent with the classical measurement of |$(71 \pm 6) / (2\pi G {\,\rm M_\odot} \, \mathrm{kpc}^{-2})$| by Kuijken & Gilmore (1991).

$The radial profile of the vertical force $K_{z, 1.1 \, \mathrm{kpc}}$ measured at $z=1.1 \, \mathrm{kpc}$. The grey shaded region corresponds to the central 68 percentile of the posterior distribution of our model, while the magenta solid lines cover the central 95 percentile. The blue data points with error bar are taken from Bovy & Rix (2013), which are one of the input data sets for our fit.$

Figure 3.

The radial profile of the vertical force |$K_{z, 1.1 \, \mathrm{kpc}}$| measured at |$z=1.1 \, \mathrm{kpc}$|⁠. The grey shaded region corresponds to the central 68 percentile of the posterior distribution of our model, while the magenta solid lines cover the central 95 percentile. The blue data points with error bar are taken from Bovy & Rix (2013), which are one of the input data sets for our fit.

6.1.3 Proper motion distribution

In the left-hand hand column of Fig. 4, we show the statistical properties of the proper motion distribution, |$(\rho _{\mu _{\ell *}, \mu _{b}}, \sigma _{\mu _{\ell *}}, \sigma _{\mu _{b}})$|⁠, as a function of (ℓ, b) for our RR Lyrae sample. In computing these quantities, we first divide the RR Lyrae star sample into 72 × 18 cells with a size of (Δℓ, Δb) = (5^○, 10^○). Then, we analyse the proper motion distribution in each cell to evaluate |$(\rho _{\mu _{\ell *}, \mu _{b}}, \sigma _{\mu _{\ell *}}, \sigma _{\mu _{b}})$|⁠.⁹

$A visualization of the proper motion distribution of our RR Lyrae sample (left-hand column), the average prediction of our models constructed from the MCMC chain (middle column), and the normalized residual between data and model (right-hand column). In the top row, we show the result for the Pearson correlation coefficient $\rho _{\mu _{\ell *}, \mu _b}$ for the distribution of 2D proper motion (μℓ*, μb). As shown by the arrows in the top-middle panel, the quadrupole pattern of the $\rho _{\mu _{\ell *}, \mu _b}$ on the sky corresponds to a preferential motion of the RR Lyrae stars with radial orbits. In the middle row, we show the result for the dispersion $\sigma _{\mu _{\ell *}}$ of the distribution of μℓ,*. In the bottom row, we show the result for the dispersion $\sigma _{\mu _{b}}$ of the distribution of μb.$

Figure 4.

A visualization of the proper motion distribution of our RR Lyrae sample (left-hand column), the average prediction of our models constructed from the MCMC chain (middle column), and the normalized residual between data and model (right-hand column). In the top row, we show the result for the Pearson correlation coefficient |$\rho _{\mu _{\ell *}, \mu _b}$| for the distribution of 2D proper motion (μ_ℓ*, μ_b). As shown by the arrows in the top-middle panel, the quadrupole pattern of the |$\rho _{\mu _{\ell *}, \mu _b}$| on the sky corresponds to a preferential motion of the RR Lyrae stars with radial orbits. In the middle row, we show the result for the dispersion |$\sigma _{\mu _{\ell *}}$| of the distribution of μ_ℓ,*. In the bottom row, we show the result for the dispersion |$\sigma _{\mu _{b}}$| of the distribution of μ_b.

In the middle column of Fig. 4, we also derive the same statistical properties by using the posterior distribution. First, we randomly select 160 models from our MCMC chain. For each model, we generate a large enough sample of mock stars from the DF, and add the observational uncertainty. Then we randomly select 16197 error-added mock stars within the survey volume as in our RR Lyrae sample. We compute |$(\rho _{\mu _{\ell *}, \mu _{b}}, \sigma _{\mu _{\ell *}}, \sigma _{\mu _{b}})$| for each model, and average these quantities over 160 models along each line of sight.

In the right-hand column of Fig. 4, we show the normalized difference between the data and our models for each of the statistical properties of the proper motion distribution. Here, we first computed the average and the dispersion of the above-mentioned 160 models. Then, for each line of sight, we computed the residual between the data and average divided by the dispersion.

We see that the overall trend in the proper motion distribution is well recovered by our models. At −90^○ < ℓ < 90^○ and −90^○ < b < 90^○, we can see the quadrupole pattern in the correlation coefficient |$\rho _{\mu _{\ell *}, \mu _{b}}$| in the top row of Fig. 4 for both the RR Lyrae sample and our models. This pattern is known to be a characteristic of a radially biased velocity distribution (Iorio & Belokurov 2019). To illustrate this feature, we put four arrows in the top middle panel that describe how radial-orbit stars approximately move when seen from the Sun. The orientation of these arrows matches the quadrupole pattern of |$\rho _{\mu _{\ell *}, \mu _{b}}$|⁠. We note that it is the first time this proper motion distribution is successfully fit and recovered by a DF model. The model distributions for the dispersion |$\sigma _{\mu _l*}$| and |$\sigma _{\mu _b}$| in the next two rows also show very good overall agreement.

6.2 Comparison of the external data and our DF model

In this section, we compare our results with other independent data sets that are not used to constrain our model. This comparison serves to test the predictive power of our model.

In Fig. 5, we show the radial profile of the 3D velocity dispersion (σ_r, σ_θ, σ_ϕ) and the velocity anisotropy |$\beta = 1 - (\sigma _\theta ^2+\sigma _\phi ^2)/(2\sigma _r^2)$| for halo K giants obtained by the LAMOST survey (Bird et al. 2019). These data (shown as coloured open symbols) are not used in our analysis, but are compared with our model predictions (solid and dashed lines).

$Top panel: The radial profile of the velocity dispersion σr (red), σθ (green), and σϕ (blue) as a function of r predicted by our models. Bottom panel: The corresponding radial profile of velocity anisotropy $\beta (r) = 1 - (\sigma _\theta ^2+\sigma _\phi ^2)/(2\sigma _r^2)$. In both panels, the coloured dashed lines bracket the central 95 percentile of the posterior distribution of our model. The central 68 percentile of σr and β are also shown by the grey shaded region. We do not show 68 per cent region for σθ and σϕ for clarity. Open symbols are the velocity dispersions and the velocity anisotropy of K giants (Bird et al. 2019), which are not used in our fit but are shown for reference.$

Figure 5.

Top panel: The radial profile of the velocity dispersion σ_r (red), σ_θ (green), and σ_ϕ (blue) as a function of r predicted by our models. Bottom panel: The corresponding radial profile of velocity anisotropy |$\beta (r) = 1 - (\sigma _\theta ^2+\sigma _\phi ^2)/(2\sigma _r^2)$|⁠. In both panels, the coloured dashed lines bracket the central 95 percentile of the posterior distribution of our model. The central 68 percentile of σ_r and β are also shown by the grey shaded region. We do not show 68 per cent region for σ_θ and σ_ϕ for clarity. Open symbols are the velocity dispersions and the velocity anisotropy of K giants (Bird et al. 2019), which are not used in our fit but are shown for reference.

For this analysis, we first randomly select 160 models from our MCMC chain. For each model, we generate a large enough sample of mock stars from the DF, without adding any observational error. We select those error-free 6D mock data within our survey volume and compute (σ_r, σ_θ, σ_ϕ, β for each model. In Fig. 5, we plot the radial profile of these quantities.

The radial profiles of (σ_r, σ_θ, σ_ϕ, β) for our models and those of K giants are broadly consistent with each other. In particular, both our models and the K giants data suggest highly radially biased velocity distribution with β ≳ 0.75 at |$10 \lesssim r/\, \mathrm{kpc}\lesssim 22$|⁠. (We note that Bird et al. 2019 mentioned that β(r) of K giants shows a mild drop at |$r \gtrsim 25 \, \mathrm{kpc}$| due to the presence of substructure.) The high value of β is consistent with the result in Belokurov et al. (2018), in which work they proposed that the inner halo is dominated by the stellar debris of a radial merger with a massive satellite (now referred to as ‘Gaia-Sausage’ or ‘Gaia-Enceladus’) about 8−10 Gyr ago (see also Helmi et al. 2018).

We note that our estimate of σ_r is systematically offset from the observed trend of K giants at |$15 \, \mathrm{kpc}\lt r$|⁠. We do not have a clear understanding of this discrepancy, but it might be related to the fact that we did not use v_los information in our analysis. At large r, estimating σ_r without using v_los data (using only proper motion data) becomes difficult, because (i) the proper motions contribute little to v_los and (ii) σ_r is approximately equal to the line-of-sight velocity dispersion at large r. In this regard, it is worth noting that in the near future surveys such as DESI (Dark Energy Spectroscopic Instrument) are planning to measure v_los for a large number of RR Lyrae stars (Allende Prieto et al. 2020), which may be useful to improve our modelling of the stellar halo.

Our estimate of the velocity anisotropy 0.7 ≲ β ≲ 0.9 is significantly larger than the reported value of β ≲ 0.3 for K giants in SDSS catalogue (Das & Binney 2016) or blue horizontal branch (BHB) stars in SDSS catalogue (Das et al. 2016). One possible explanation for this discrepancy is that they took into account the metallicity dependence of the DF while we do not. However, this does not fully explain the discrepancy. It has been known that the metal-rich part of the stellar halo shows higher value of β (e.g. Deason, Belokurov & Evans 2011; Hattori et al. 2013; Kafle et al. 2013), but even for the metal-rich part of the DF Das & Binney (2016) suggest β ≃ 0.3 in the inner halo, which is much smaller than our estimate of β for the entire RR Lyrae population. Another possible explanation is that the proper motion data used in Das & Binney (2016) and Das et al. (2016) may not be accurate enough to estimate β. For example, if the proper motion errors in SDSS were underestimated, then the velocity ellipsoid could have been sphericalized (due to insufficient deconvolution of the proper motion error), which could result in smaller value of β. Yet another possibility is a counterintuitive scenario that the velocity distribution depends on the stellar type. Although we do not aggressively advocate this possibility, it may be worth noting that Utkin & Dambis (2020) recently claimed that the value of β for BHB stars is typically smaller than that of RR Lyrae stars.

6.3 Dark matter distribution

We now discuss the properties of the halo DM distribution within |$r \lesssim 30 \, \mathrm{kpc}$| as inferred from our analysis. Table 2 summarizes the characteristic parameters of the DM density profile derived from our analysis. The correlations between some of these quantities are shown in Figs B1 and B2.

Table 2.

Summary of the DM properties of the Milky Way.

Quantities	[16, 50, 84] percentiles
ρ_DM,⊙ [\|${\,\rm M_\odot} \, \mathrm{pc}^{-3}$\|]	\|${0.00881, 0.00901, 0.00919} {\,\rm M_\odot} \, \mathrm{pc}^{-3}$\|
ρ_DM,⊙ [GeV cm⁻³]	[0.335, 0.342, 0.349] GeV cm⁻³
M₂₀₀	[0.678, 0.730, 0.776] × 10¹² M_⊙
M₉₄	[0.774, 0.837, 0.894] × 10¹² M_⊙
r₂₀₀	\|$[180.52, 185.03, 188.84] \, \mathrm{kpc}$\|
r₉₄	\|$[242.71, 249.08, 254.64] \, \mathrm{kpc}$\|
c′ = r₉₄/r₋₂	[18.45, 19.55, 20.89]
r₋₂	\|$[11.69, 12.72, 13.73] \, \mathrm{kpc}$\|
a	\|$[10.29, 12.49, 16.66] \, \mathrm{kpc}$\|
γ	[0.785, 0.982, 1.209]
q	[0.983, 0.993, 0.998]
\|$M_\mathrm{DM}(r\lt 20 \, \mathrm{kpc})$\|	[0.132, 0.134, 0.137] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 50 \, \mathrm{kpc})$\|	[0.311, 0.322, 0.330] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 100 \, \mathrm{kpc})$\|	[0.497, 0.523, 0.543] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 200 \, \mathrm{kpc})$\|	[0.711, 0.759, 0.798] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 300 \, \mathrm{kpc})$\|	[0.845, 0.907, 0.960] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 20 \, \mathrm{kpc})$\|	[0.182, 0.186, 0.191] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 50 \, \mathrm{kpc})$\|	[0.361, 0.374, 0.384] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 100 \, \mathrm{kpc})$\|	[0.547, 0.575, 0.598] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 200 \, \mathrm{kpc})$\|	[0.761, 0.811, 0.852] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 300 \, \mathrm{kpc})$\|	[0.895, 0.959, 1.015] × 10¹²M_⊙

Quantities	[16, 50, 84] percentiles
ρ_DM,⊙ [\|${\,\rm M_\odot} \, \mathrm{pc}^{-3}$\|]	\|${0.00881, 0.00901, 0.00919} {\,\rm M_\odot} \, \mathrm{pc}^{-3}$\|
ρ_DM,⊙ [GeV cm⁻³]	[0.335, 0.342, 0.349] GeV cm⁻³
M₂₀₀	[0.678, 0.730, 0.776] × 10¹² M_⊙
M₉₄	[0.774, 0.837, 0.894] × 10¹² M_⊙
r₂₀₀	\|$[180.52, 185.03, 188.84] \, \mathrm{kpc}$\|
r₉₄	\|$[242.71, 249.08, 254.64] \, \mathrm{kpc}$\|
c′ = r₉₄/r₋₂	[18.45, 19.55, 20.89]
r₋₂	\|$[11.69, 12.72, 13.73] \, \mathrm{kpc}$\|
a	\|$[10.29, 12.49, 16.66] \, \mathrm{kpc}$\|
γ	[0.785, 0.982, 1.209]
q	[0.983, 0.993, 0.998]
\|$M_\mathrm{DM}(r\lt 20 \, \mathrm{kpc})$\|	[0.132, 0.134, 0.137] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 50 \, \mathrm{kpc})$\|	[0.311, 0.322, 0.330] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 100 \, \mathrm{kpc})$\|	[0.497, 0.523, 0.543] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 200 \, \mathrm{kpc})$\|	[0.711, 0.759, 0.798] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 300 \, \mathrm{kpc})$\|	[0.845, 0.907, 0.960] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 20 \, \mathrm{kpc})$\|	[0.182, 0.186, 0.191] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 50 \, \mathrm{kpc})$\|	[0.361, 0.374, 0.384] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 100 \, \mathrm{kpc})$\|	[0.547, 0.575, 0.598] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 200 \, \mathrm{kpc})$\|	[0.761, 0.811, 0.852] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 300 \, \mathrm{kpc})$\|	[0.895, 0.959, 1.015] × 10¹²M_⊙

Table 2.

Summary of the DM properties of the Milky Way.

Quantities	[16, 50, 84] percentiles
ρ_DM,⊙ [\|${\,\rm M_\odot} \, \mathrm{pc}^{-3}$\|]	\|${0.00881, 0.00901, 0.00919} {\,\rm M_\odot} \, \mathrm{pc}^{-3}$\|
ρ_DM,⊙ [GeV cm⁻³]	[0.335, 0.342, 0.349] GeV cm⁻³
M₂₀₀	[0.678, 0.730, 0.776] × 10¹² M_⊙
M₉₄	[0.774, 0.837, 0.894] × 10¹² M_⊙
r₂₀₀	\|$[180.52, 185.03, 188.84] \, \mathrm{kpc}$\|
r₉₄	\|$[242.71, 249.08, 254.64] \, \mathrm{kpc}$\|
c′ = r₉₄/r₋₂	[18.45, 19.55, 20.89]
r₋₂	\|$[11.69, 12.72, 13.73] \, \mathrm{kpc}$\|
a	\|$[10.29, 12.49, 16.66] \, \mathrm{kpc}$\|
γ	[0.785, 0.982, 1.209]
q	[0.983, 0.993, 0.998]
\|$M_\mathrm{DM}(r\lt 20 \, \mathrm{kpc})$\|	[0.132, 0.134, 0.137] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 50 \, \mathrm{kpc})$\|	[0.311, 0.322, 0.330] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 100 \, \mathrm{kpc})$\|	[0.497, 0.523, 0.543] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 200 \, \mathrm{kpc})$\|	[0.711, 0.759, 0.798] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 300 \, \mathrm{kpc})$\|	[0.845, 0.907, 0.960] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 20 \, \mathrm{kpc})$\|	[0.182, 0.186, 0.191] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 50 \, \mathrm{kpc})$\|	[0.361, 0.374, 0.384] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 100 \, \mathrm{kpc})$\|	[0.547, 0.575, 0.598] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 200 \, \mathrm{kpc})$\|	[0.761, 0.811, 0.852] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 300 \, \mathrm{kpc})$\|	[0.895, 0.959, 1.015] × 10¹²M_⊙

Quantities	[16, 50, 84] percentiles
ρ_DM,⊙ [\|${\,\rm M_\odot} \, \mathrm{pc}^{-3}$\|]	\|${0.00881, 0.00901, 0.00919} {\,\rm M_\odot} \, \mathrm{pc}^{-3}$\|
ρ_DM,⊙ [GeV cm⁻³]	[0.335, 0.342, 0.349] GeV cm⁻³
M₂₀₀	[0.678, 0.730, 0.776] × 10¹² M_⊙
M₉₄	[0.774, 0.837, 0.894] × 10¹² M_⊙
r₂₀₀	\|$[180.52, 185.03, 188.84] \, \mathrm{kpc}$\|
r₉₄	\|$[242.71, 249.08, 254.64] \, \mathrm{kpc}$\|
c′ = r₉₄/r₋₂	[18.45, 19.55, 20.89]
r₋₂	\|$[11.69, 12.72, 13.73] \, \mathrm{kpc}$\|
a	\|$[10.29, 12.49, 16.66] \, \mathrm{kpc}$\|
γ	[0.785, 0.982, 1.209]
q	[0.983, 0.993, 0.998]
\|$M_\mathrm{DM}(r\lt 20 \, \mathrm{kpc})$\|	[0.132, 0.134, 0.137] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 50 \, \mathrm{kpc})$\|	[0.311, 0.322, 0.330] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 100 \, \mathrm{kpc})$\|	[0.497, 0.523, 0.543] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 200 \, \mathrm{kpc})$\|	[0.711, 0.759, 0.798] × 10¹²M_⊙
\|$M_\mathrm{DM}(r\lt 300 \, \mathrm{kpc})$\|	[0.845, 0.907, 0.960] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 20 \, \mathrm{kpc})$\|	[0.182, 0.186, 0.191] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 50 \, \mathrm{kpc})$\|	[0.361, 0.374, 0.384] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 100 \, \mathrm{kpc})$\|	[0.547, 0.575, 0.598] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 200 \, \mathrm{kpc})$\|	[0.761, 0.811, 0.852] × 10¹²M_⊙
\|$M_\mathrm{total}(r\lt 300 \, \mathrm{kpc})$\|	[0.895, 0.959, 1.015] × 10¹²M_⊙

6.3.1 Dark matter density flattening

Fig. 6(a) shows the posterior distribution of the DM density flattening q. We can see that the posterior distribution is strongly peaked near q = 1. Since q = 1 is the upper boundary of the prior distribution, we cannot rule out the possibility that the DM density is prolate. The fact that 99 per cent of the posterior distribution of q is located above q = 0.963 strongly disfavours even a moderately flattened DM halo. It is worth noting that the shape of the posterior distribution of q shown in Fig. 6(a) is naturally expected if the shape of the DM halo is actually nearly spherical. For example, in Appendix D1.1, we see that the posterior distributions of q derived from our mock analysis look very similar to Fig. 6(a), when the mock data is generated from an MW model with q = 0.996 (see Fig. D3c–f).

$The posterior distribution of the DM density profile. Panel (a): the probability DF of the density flattening q. The three dashed vertical lines at q > 0.98 corresponds to (16, 50, 84) percentiles of the distribution. The lower 1 and 5 percentiles are located at q = 0.963 and 0.973, respectively. The posterior distribution rules out oblate dark halo models with q < 0.963 with a confidence level of $99{{\ \rm per\ cent}}$. We note that the parameter range at q > 1 is not explored in this paper. Panel (b): the density profile ρDM evaluated on the Galactic plane (R, z) = (r, 0). The shaded region corresponds to the central 68 per cent of the distribution, while the magenta solid lines denote the central 95 per cent of the distribution. The ranges of literature value of the local DM density ρDM(R0, 0) estimated from ‘local’ measurements and estimated from ‘global’ modelling of the Milky Way are shown by the blue and red vertical bands at R ≃ R0, respectively. (Literature data are taken from de Salas & Widmark 2020.) Panel (c): the same as panel (b), but for the density profile ρDM(R = R0, z) evaluated at the Solar cylinder as a function of the distance z from the Galactic plane. We note that the horizontal axis of this plot is linear at $z\lt 1 \, \mathrm{kpc}$ and logarithmic at $1 \, \mathrm{kpc}\lt z$. Panel (d): the logarithmic density slope dln ρDM/dln r evaluated on the Galactic plane (R, z) = (r, 0).$

Figure 6.

The posterior distribution of the DM density profile. Panel (a): the probability DF of the density flattening q. The three dashed vertical lines at q > 0.98 corresponds to (16, 50, 84) percentiles of the distribution. The lower 1 and 5 percentiles are located at q = 0.963 and 0.973, respectively. The posterior distribution rules out oblate dark halo models with q < 0.963 with a confidence level of |$99{{\ \rm per\ cent}}$|⁠. We note that the parameter range at q > 1 is not explored in this paper. Panel (b): the density profile ρ_DM evaluated on the Galactic plane (R, z) = (r, 0). The shaded region corresponds to the central 68 per cent of the distribution, while the magenta solid lines denote the central 95 per cent of the distribution. The ranges of literature value of the local DM density ρ_DM(R₀, 0) estimated from ‘local’ measurements and estimated from ‘global’ modelling of the Milky Way are shown by the blue and red vertical bands at R ≃ R₀, respectively. (Literature data are taken from de Salas & Widmark 2020.) Panel (c): the same as panel (b), but for the density profile ρ_DM(R = R₀, z) evaluated at the Solar cylinder as a function of the distance z from the Galactic plane. We note that the horizontal axis of this plot is linear at |$z\lt 1 \, \mathrm{kpc}$| and logarithmic at |$1 \, \mathrm{kpc}\lt z$|⁠. Panel (d): the logarithmic density slope dln ρ_DM/dln r evaluated on the Galactic plane (R, z) = (r, 0).

Recently, Wegg et al. (2019) estimated the shape of the DM density profile by applying the axisymmetric form of the Jeans equations for the kinematic data for 15651 RR Lyrae stars at |$r\lt 20 \, \mathrm{kpc}$|⁠. They also find that the shape of the DM halo is nearly spherical, with the density flattening of q = 1.00 ± 0.09, which is consistent with our result.

6.3.2 Dark matter density profile

Fig. 6(b) and (c) show the DM density profile evaluated at |$(R,z)=(R, 0 \, \mathrm{kpc})$| and at (R, z) = (R₀, z), respectively. (We note |$R_0 = 8.178 \, \mathrm{kpc}$| is assumed; Gravity Collaboration 2019.) These profiles are sampled from the posterior distribution. We can see that our model puts a tight constraint on the radial and vertical density profiles. However, we need to be careful in interpreting this result. First, 99 per cent of the posterior distribution is distributed at 0.963 ≤ q ≤ 1, so the DM density profiles sampled from our posterior distribution are very close to spherical. Secondly, we use halo tracers that are distributed at |$5 \lesssim r/\, \mathrm{kpc}\lesssim 27.5$|⁠. Thus, the inference of the DM density outside this range is less reliable. Third, the seemingly small variation in the density profile at large R and large |z| are probably because we fix the outer density slope of the DM to be ≃ (− 3) in the outer halo.

In spite of the above-mentioned complexities, we think our estimate of ρ_DM(R, 0) is reliable for |$1 \lesssim R/\, \mathrm{kpc}\lesssim 30$| based on our mock analysis in Appendix D. This can be understood in the following manner. Typical halo stars have very radially elongated orbits (β ≃ 0.8; see Fig. 5), indicating that they probably have relatively small pericentric radii. Therefore, many halo stars in our sample have orbits that are affected by the DM distribution in the inner few kpc and therefore their kinematics, even at large radii, reflect this information. To be specific, based on the best-fit DF model, we found that 20 per cent of the RR Lyrae stars in our sample have apocenter radius larger than |$r = 29.7 \, \mathrm{kpc}$| and that 20 per cent of the stars have pericenter radius smaller than |$r = 0.4 \, \mathrm{kpc}$|⁠. For our mock analysis with smooth-halo mock data (Appendix D1), within |$0.4 \lt r/\, \mathrm{kpc}\lt 29.7$|⁠, we can constrain the DM density with less than 30 per cent uncertainty. This result supports the idea that we can constraint the DM profile outside |$5 \lt r/\, \mathrm{kpc}\lt 27.5$|⁠.

6.3.3 Dark matter density slope

In our analysis, the inner density slope (−γ) of the DM halo is a free parameter, while the outer density slope is fixed to be (−3). The central 68 per cent of the posterior distribution of γ is distributed at 0.785 < γ < 1.209 (see Table 2), which is consistent with a cusped NFW profile with γ = 1 (Navarro et al. 1997). Fig. 6(d) shows the DM logarithmic density slope dln ρ_DM/dln r as a function of Galactocentric radius r evaluated at (R, z) = (r, 0). As we can see from this figure, the logarithmic density slope is approximately (−1) at |$r \simeq 1 \, \mathrm{kpc}$|⁠.

6.3.4 Other constraints on the dark matter distribution

Table 2 summarizes the key properties of the DM density profile derived from our analysis. For example, our result constraints the local DM density to, |$\rho _{\mathrm{DM},\odot } = (9.01^{+0.18}_{-0.20})\times 10^{-3}{\,\rm M_\odot} \, \mathrm{pc}^{-3}$|⁠, which is equivalent to |$0.342^{+0.007}_{-0.007}$| GeVcm⁻³. The quoted uncertainty in ρ_DM,⊙ in our analysis only takes into account the random error. We note that de Salas et al. (2019) pointed out that the use of different functional forms of the baryonic potential could cause systematic shifts in ρ_DM,⊙. For reference, de Salas et al. (2019) assumed a generalized-NFW model for the DM density profile, and estimated |$\rho _{\mathrm{DM},\odot } = 7.89^{+0.74}_{-0.71}\times 10^{-3}{\,\rm M_\odot} \, \mathrm{pc}^{-3}$| and |$10.18^{+0.89}_{-0.95}\times 10^{-3}{\,\rm M_\odot} \, \mathrm{pc}^{-3}$| for their baryonic model B1 and B2, respectively. Apart from the systematic error due to the choice of our baryonic mass model, our measurement is slightly smaller than previous measurements of ρ_DM,⊙ = (11–|$16)\times 10^{-3} {\,\rm M_\odot} \, \mathrm{pc}^{-3}$| derived from analyses of Solar-neighbour stars (Bienaymé et al. 2014; McKee, Parravano & Hollenbach 2015; de Salas & Widmark 2020), while it is consistent with previous measurements of ρ_DM,⊙ = (8–|$13)\times 10^{-3} {\,\rm M_\odot} \, \mathrm{pc}^{-3}$| derived from analyses of global modelling of the Milky Way such as rotation curve (Piffl et al. 2014; de Salas et al. 2019; Cautun et al. 2020; de Salas & Widmark 2020). For the various measurements of ρ_DM,⊙ before and after the advent of Gaia data, we refer readers to reviews by Read (2014) and de Salas & Widmark (2020), respectively.

We also derive the enclosed mass of the DM and the enclosed mass of the DM plus baryons at various radii r. For example, our result indicates that the DM mass within the virial radius r₂₀₀ is |$M_{200} = 0.730^{+0.046}_{-0.052} \times 10^{12} {\rm {\,\rm M_\odot}}$|⁠, which is consistent with some recent studies (see Wang et al. 2020 for a review), but slightly lower than (0.90 ± 0.13) × 10¹²M_⊙ recently found by Vasiliev et al. (2021). However, we note that this result may be dominated by our prior on M_star/M₂₀₀, since our sample is distributed in the inner part of the halo at |$5 \lesssim r/\, \mathrm{kpc}\lesssim 27.5$|⁠.

7 DISCUSSION

7.1 Comparison with other studies

We have analysed the kinematics of RR Lyrae stars to estimate the DM density distribution, especially focusing on the flattening q of the DM halo within 30 kpc. Our result indicates that q > 0.963 with 99 per cent confidence level, which is consistent with a nearly spherical DM halo, although we cannot currently explore the possibility that q > 1 (prolate). Here, we compare our result with previous studies.

7.1.1 Previous results with stellar streams

Koposov et al. (2010) modelled GD-1 stellar stream (Grillmair & Dionatos 2006) and estimated the flattening of the Galactic potential. They found that the DM potential’s flattening is q_Φ > 0.89 with 90 per cent confidence level. According to a relationship between the density flattening and the potential flattening,¹⁰ their constraint corresponds to a density flattening of q > 0.68 with 90 per cent confidence level. Bovy et al. (2016) measured the DM density flattening to be |$q=1.3^{+0.5}_{-0.3}$| when using GD-1 stream (at |$r=14 \, \mathrm{kpc}$|⁠) and q = 0.93 ± 0.16 when using Pal 5 stream (at |$r=19 \, \mathrm{kpc}$|⁠). By combining these two data sets, they estimated the global value of q to be q = 1.05 ± 0.14. More recently, Malhan & Ibata (2019) analysed the GD-1 stream by using the astrometric data from Gaia DR2 to estimate |$q = 0.82^{+0.25}_{-0.13}$|⁠. Although the statistical uncertainties are relatively large, all of the above-mentioned results using the GD-1 stream are consistent with a nearly spherical DM halo.

Law & Majewski (2010) modelled the Sagittarius stellar stream to conclude that the best-fitting Galactic DM halo model is oblate-triaxial with the major axis lying in the Galactic disc plane about 7^○ to the Galactocentric y-axis, the intermediate axis perpendicular to the stellar disc, and the short axis lying 7^○ from the Galactic x-axis. The DM density distribution is flattened such that the axis lengths of the density distribution along Galactocentric x, y, z are given by x: y: z = 0.44: 1: 0.97. This model of the halo is strongly disfavoured since simulations show that stellar discs perpendicular to the intermediate axis of such a halo are violently unstable (Debattista et al. 2013). In addition, Pearson et al. (2015) argued that the triaxial potential model by Law & Majewski (2010) is not a good approximation at least at |$r\lt 20 \, \mathrm{kpc}$| since it would cause too much dispersal and thickening to the Pal 5 stellar stream.

Recently Vasiliev et al. (2021) constructed a halo model that fits Gaia DR2 proper motion data as well as all available radial velocity data with a time-dependent Galactic halo model that includes the reflex motion resulting from the gravitational perturbation by the Large Magellanic Cloud (LMC). These authors find that the models that fit the Sagittarius stream best, include deformation to the MW DM distribution such that the halo is oblate with an axial ratio R: z ≃ 1: 0.6 and aligned with the disc in the inner part of the halo, but becoming triaxial (twisted and then prolate-triaxial) and misaligned with the disc beyond |${\sim}50 \, \mathrm{kpc}$|⁠.

7.1.2 Previous results with field halo tracers

Loebman et al. (2014) applied the axisymmetric Jeans equations to kinematic data for field halo stars from SDSS and estimated the DM halo’s density flattening to be q ≃ 0.4 ± 0.1. This estimate is significantly smaller than most other studies (except for the recent work based on the Sagittarius stream when influenced by the LMC; Vasiliev et al. 2021). Interestingly, they also found that their halo sample has a radial velocity dispersion |$\sigma _r \simeq 141 \, \mathrm{km\ s}^{-1}$| across the survey volume (⁠|$d \lesssim 10 \, \mathrm{kpc}$|⁠), while we find |$\sigma _r \simeq 180 \, \mathrm{km\ s}^{-1}$| near the Sun dropping to |$\sigma _r \simeq 160 \, \mathrm{km\ s}^{-1}$| at R ∼ 20 kpc. If our RR Lyrae sample and their halo sample trace the same population of halo stars, this disagreement might arise from two sources. First Loebman et al. (2014) used a proper motion sample derived from SDSS and POSS which has significantly larger errors than Gaia DR2 proper motions. Second, they used photometric distances to their field star sample which are less accurate than distances to the Gaia RRLyrae sample.

Wegg et al. (2019) applied a similar axisymmetric Jeans equation formalism to an RR Lyrae sample from Gaia DR2 and estimated the DM density flattening to be q ≃ 1.00 ± 0.09. Their sample overlaps significantly with our RR Lyrae sample, and their spatial selection cut is similar to ours. The fact that our result is also consistent with a spherical DM halo provides strong evidence that the DM halo within r < 30 kpc is not highly oblate (however see Section 7.2.1 and Appendix D2 for the effects of disequilibrium).

It is worth mentioning that Posti & Helmi (2019) modelled the kinematics of globular clusters with an action-based DF model and estimated q = 1.3 ± 0.25 (prolate). They used AGAMA to compute the orbital actions. However, the method to compute actions that is implemented in AGAMA is inapplicable to prolate potentials, but the package does not explicitly forbid their use (see Section 3.1.5 for more details on this point). In this regard, the validity of their analysis is questionable.

7.1.3 Prediction from numerical simulations

As mentioned in Section 1, numerous cosmological hydrodynamical simulations over the past 15 years have predicted that the DM halo of MW-sized galaxies have oblate axisymmetric shapes within the inner (0.15–0.3)r₂₀₀. The most recent value of the mean flattening based on several thousand galaxies from the Illustris simulations being 〈q〉 = 0.79 ± 0.15 (Chua et al. 2019) within ∼0.15r₂₀₀ ∼ 30 kpc. This is much flatter than the q value obtained from our analysis, which excludes an oblate halo with q < 0.963 with a confidence level of 99 per cent. This could either imply a tension between the predictions of cosmological hydrodynamical simulations and our results or it could imply that some of our assumptions, principally, the assumption of dynamical equilibrium (as discussed in Section 7.2.1 and Appendix D2), could be in doubt. Additional applications of this method to mock data from cosmological simulations and haloes that are out of equilibrium, e.g. due to the interaction with the LMC, are in progress to quantify the effects of disequilibrium and to better assess the source of this disagreement (de Salas et al., in preparation).

7.2 Some issues in our analysis

7.2.1 Dynamical disequilibrium

We have assumed that the MW is in dynamical equilibrium. However, this assumption might be too simplistic. For example, Iorio & Belokurov (2019) pointed out that the RR Lyrae sample used here shows a triaxial spatial distribution within r < 30 kpc that has its principal axes tilted relative to the principal axes of the Galactic potential, and is possibly misaligned with the Galactic disc. This triaxial distribution of RR Lyrae stars in the inner stellar halo is thought to have been deposited by a highly radial accretion event referred to as the ‘Gaia–Sausage’ (Belokurov et al. 2018) or ‘Gaia–Enceladus’ (Helmi et al. 2018). Additional evidence for disequilibrium comes from the observation of two prominent substructures in the RRLyrae sample, the Hercules–Aquila Cloud and the Virgo Overdensity (Simion, Belokurov & Koposov 2019), which might be related to the same accretion event (cf. Naidu et al. 2021).

In this regard, it is worth mentioning that we also applied our code to a mock data set generated from one galaxy m12m from the Fire-2 Latte cosmological hydrodynamical suite of simulations (Wetzel et al. 2016; Hopkins et al. 2018; Sanderson et al. 2020). Like the real MW, this galaxy is not in perfect dynamical equilibrium and includes halo substructure. Our modelling of this galaxy results in an overestimate of the value of q (see Fig. D7). This is in contrast with our analysis with mock data sets generated from smooth, equilibrium halo models, which successfully recovers the input values of q with no obvious systematic bias (see Fig. D3). Thus, if the RR Lyrae stars in the inner halo used in our analysis and Wegg et al. (2019) are significantly out of equilibrium, the assumption of dynamical equilibrium could have resulted in an overestimate of q. While analyses of several more similar cosmological simulations are needed to assess how much of an overestimate can be expected from disequilibrium our analysis of galaxy m12m suggests an inflation of Δq ∼ 0.1−0.2 implying that the true flattening could be closer to q ∼ 0.75−0.90.

Erkal, Belokurov & Parkin (2020) argued that the perturbation from LMC is not negligible at |$r \gtrsim 30 \, \mathrm{kpc}$|⁠. If the LMC’s perturbation is strong, our DF fitting method might result in a biased estimate of q (see also Petersen & Peñarrubia 2021). However, the inner part of the halo is less affected by such a perturbation (Garavito-Camargo et al. 2020), so our analysis may not be seriously affected by LMC’s perturbation. Recently, Vasiliev et al. (2021) modelled the dynamics of the MW, LMC, and the Sagittarius dwarf galaxy. They also estimated the radial profile of the DM halo’s shape based on the morphology of the Sagittarius stream. In principle, it is possible to formulate how LMC affects the DF, but this is beyond the scope of this paper (see Deason et al. 2021).

7.2.2 More general shapes for the dark matter halo

Throughout this paper, we have assumed that q is constant as a function of radius. However, if q changes as a function of r, as predicted by cosmological hydrodynamical simulations (e.g. Zemp et al. 2012) our estimate of q might be biased. In principle, we can relax the assumption of constant q with some extra parameters, such as the inner and outer values of q and the transition radius.

We also note that, it is possible to estimate the triaxiality of the DM halo by implementing a fast algorithm to compute actions in a general triaxial potentials (including prolate potentials, with long axis oriented perpendicular to the disc plane), such as the method of Sanders & Binney (2015a). The fact that our posterior distribution of q is peaked at q = 1, the upper boundary of the currently explored range of q, points to the need for future investigations of prolate and triaxial halo shapes.

7.2.3 Metallicity dependence of the distribution function

There is some observational evidence that the DF of the stellar halo depends on the metallicity (Carollo et al. 2007, 2010; Deason et al. 2011; Hattori et al. 2013; Kafle et al. 2013; Das & Binney 2016; Bird et al. 2019; Carollo & Chiba 2021; Iorio & Belokurov 2021). In this paper, we do not take into account the metallicity dependence, because it would increase the number of free parameters. Our sample is confined to the inner halo (r ≲ 27.5), where relatively metal-rich halo stars ([Fe/H]>−2) dominate, therefore the DF is probably most representative of metal-rich stars. However, if we were to apply our method to a sample of stars in a larger volume (say |$r \lesssim 100 \, \mathrm{kpc}$|⁠), the metallicity dependence would be more important.

7.3 Other studies of distribution function fitting

As mentioned in Section 1, there have been several previous efforts to use the DF to construct models of the MW (both global models and models of individual components). We briefly summarize some of these other DF-based studies to put our work in the context.

McMillan & Binney (2012), McMillan & Binney (2013) formulated a Bayesian way of estimating the parameters of the DF of the stellar disc when the MW potential is given. In these works, they introduced some important ideas regarding DF fitting, such as (i) an efficient algorithm to compute the relative likelihood of the model given observational data with or without missing data; and (ii) the effect of the observational selection function. They used mock data sets of disc stars to show that the MW potential can be reliably measured with the DF fitting method. In our paper, we have followed the formulation by McMillan & Binney (2013).

In both McMillan & Binney (2012) and McMillan & Binney (2013), the distance uncertainty was not properly taken into account in evaluating the normalization factor of the DF. The effect of the distance uncertainty in the normalization was first rigorously formulated by Trick et al. (2016), although they did not use their rigorous formulation for their mock analysis. Instead, they used a simplified formulation that is equivalent to the formulation by McMillan & Binney (2013). Yet, they showed that the DF fitting can be used to estimate the MW potential within |${\sim}4 \, \mathrm{kpc}$| from the Sun with a reasonable size of mock data set. Our work adopted the formalism of Trick et al. (2016) and introduced a practical way of computing the normalization factor rigorously for the first time.

Using simple mock data sets with no observational errors, Ting et al. (2013) demonstrated that simultaneous fitting of the stellar disc DF and the MW potential can reconstruct the input model accurately. Bovy & Rix (2013) applied this methodology to 16269 disc stars in SDSS/SEGUE. They divided the sample into mono-abundance populations (MAPs) with similar chemistry ([Fe/H] and [α/Fe]). For each MAP, they simultaneously fit the parameters of the DF and the MW potential, which they used to estimate the vertical force K_z at |$z=1.1 \, \mathrm{kpc}$| at a representative R that is determined by the spatial distribution of the MAP. They further used the profile of the |$K_z(R, z=1.1 \, \mathrm{kpc})$| and an additional data set of the terminal velocity to estimate the MW potential. Their DF-based modelling approach in the first half of Bovy & Rix (2013) is close to ours, and their analysis of v_circ data and K_z data in the second half of Bovy & Rix (2013) corresponds to our additional analysis in Appendix C.

Similar analyses were also done for more extended sample of stars that include both halo and disc stars. Piffl et al. (2014) used ∼200000 giants from RAVE survey to find the maximum-likelihood model of the MW. They introduced a composite DF model consisting of a disc-like DF and a halo-like DF, although their sample is dominated by the disc stars. Binney & Piffl (2015) and Cole & Binney (2017) extended the work of Piffl et al. (2014) and modelled both the stellar disc and the dark halo with action-based DF models that self-consistently generated gravitational potential. Most importantly, Binney & Piffl (2015) and Cole & Binney (2017) did not directly model the density profile of the DM halo, but they modelled the DF of the DM halo instead. One advantage of using the DF of the DM is that they can handle the deformation of the DM halo by the baryonic distribution, as discussed in Piffl, Penoyre & Binney (2015).

Although the above-mentioned studies fit both the DF and the MW potential, there have been some studies in which the MW potential was fixed and the DFs of the tracers were investigated. Sanders & Binney (2015b) used the position-velocity data and [Fe/H] data for nearby disc stars from Geneva-Copenhagen Survey and SEGUE survey and analysed the structure of the MW disc with an action-based DF. Das & Binney (2016) and Das et al. (2016) used K giants and blue horizontal branch stars from SEGUE survey to analyse the MW stellar halo with an action-based DF. Binney & Wong (2017) analysed the position-velocity data for 157 globular clusters in the MW, by using an action-based DF model consisting of a disc-like rotating DF and a halo-like non-rotating DF.

8 CONCLUSIONS

In this paper, we have combined proper-motion data from Gaia DR2 for halo RR Lyrae stars within |$d \le 20 \, \mathrm{kpc}$| from the Sun (Iorio & Belokurov 2019), circular velocity data for red giants in the disc plane from Gaia DR2 (Eilers et al. 2019), and the vertical force data from SDSS/SEGUE (Bovy & Rix 2013) to constrain the 3D shape of the Galactic DM halo, by assuming that the stellar halo can be described by an analytic DF model and that the MW is oblate-axisymmetric.

Our method is based on the DF fitting formulation that was pioneered by McMillan & Binney (2012), McMillan & Binney (2013), and elaborated upon by Ting et al. (2013) and Trick et al. (2016). Our most important contribution to the DF fitting formalism is the introduction of a new way to handle the distance uncertainty of sample stars.

The main results of our modelling of the MW halo can be summarized as follows:

In total, 99 per cent of the posterior distribution of q = c/a (minor-to-major axial ratio of the DM density) is located at q > 0.963 (see Fig. 6a). We emphasize that we only explored oblate models with q ≤ 1 due to the limitations in the way we compute orbital actions of halo stars.
Our estimated value of q > 0.963 implies a nearly spherical DM halo within |$r \lesssim 30 \, \mathrm{kpc}$| and strongly disfavours a very flattened DM halo. This may be in conflict with recent ΛCDM cosmological simulations that predict 〈q〉 = 0.79 ± 0.15 (Chua et al. 2019) within |$0.15r_{200} ({\sim}30 \, \mathrm{kpc})$|⁠.
While validation tests of our code with with mock data created from smooth, equilibrium galactic models recover the values of q to high accuracy (see Fig. D3), our test with mock data generated from a galaxy (m12m) from the Fire-2 Latte cosmological hydrodynamical suite of simulations yields q values overestimated by ∼0.1−0.2 (see Fig. D7). This implies that if the MW halo is not in dynamical equilibrium as we have assumed, our estimate of q is an overestimate.
Our derived DF is a good match to the proper motion distribution in (l, b) and the derived correlation coefficient |$\rho _{\mu _{\ell *}\mu _{b}}$| accurately models, for the first time, the quadrupole feature characteristic of the radially anisotropic distribution observed in the data (see Fig. 4). The derived DF also provides an estimate of the radial, azimuthal and polar velocity dispersion profiles (σ_r(r), σ_ϕ(r), σ_θ(r)) and velocity anisotropy β(r) that are a good match to observed velocity dispersion and anisotropy profiles of K giant stars (Bird et al. 2019), which were not used in our analysis (see Fig. 5).
Our result puts a tight constraint on the local DM density: ρ_DM,⊙ = |$0.00901^{+0.00018}_{-0.00020} {\,\rm M_\odot} \, \mathrm{pc}^{-3}$|⁠, or |$0.342^{+0.007}_{-0.007}$| GeVcm⁻³ (see Fig. 6c and Table 2), which is consistent with other recent estimates (e.g. see reviews by Read 2014; Bland-Hawthorn & Gerhard 2016; de Salas & Widmark 2020).
Our result favours a cuspy DM halo with inner density slope |$(-\gamma) = -\left(0.982^{+0.227}_{-0.197} \right)$|⁠, which is consistent with an NFW profile (see Fig. 6b–d and Table 2).

ACKNOWLEDGEMENTS

The authors thank the referee for thorough reading and constructive comments that improved the manuscript. KH thanks Giuliano Iorio for kindly sharing the clean sample of RR Lyrae stars in Iorio & Belokurov (2019), and Robyn Sanderson and Andrew Wetzel for kindly sharing the Latte simulations. KH thanks Sergey Koposov for useful conversations and for his support. KH thanks Pablo F. de Salas for frequent discussions that improved KH’s analysis code. KH thanks AH for the support during this work. KH is supported by JSPS KAKENHI Grant Numbers JP21K13965 and JP21H00053. KH and MV thank members of the stellar haloes group at the University of Michigan for continued camaraderie and stimulating discussion. MV and KH were supported by NASA-ATP award NNX15AK79G. MV is also supported by NASA-ATP award 80NSSC20K0509. This work was supported by a Michigan Institute for Computational Discovery and Engineering (MICDE) catalyst grant for FY2019. EV acknowledges support from STFC via the Consolidated grant to the Institute of Astronomy. Some part of this research was started at the KITP workshop ‘Dynamical Models for Stars and Gas in Galaxies in the Gaia Era’ held at the Kavli Institute for Theoretical Physics. This work has made use of data from the European Space Agency (ESA) mission Gaia (http://www.cosmos.esa.int/gaia), processed by the Gaia Data Processing and Analysis Consortium (DPAC, http://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for the DPAC has been provided by national institutions, in particular the institutions participating in the Gaia Multilateral Agreement.

We used the following packages: AGAMA (Vasiliev 2019), constrNMPy (https://github.com/alexblaessle/constrNMPy), corner.py (Foreman-Mackey 2016), cubature (https://github.com/stevengj/cubature), emcee (Foreman-Mackey et al. 2013), gizmo_read (https://bitbucket.org/awetzel/gizmo_read), matplotlib (Hunter 2007), numpy (van der Walt, Colbert & Varoquaux 2011), PyGaia (https://github.com/agabrown/PyGaia), and scipy (Jones, Oliphant & Peterson 2001).

DATA AVAILABILITY

The MCMC chains obtained from our analysis are available upon reasonable request.

Footnotes

1

While other methods to measure the DM halo’s shape have been proposed, they have not been used very extensively. For example, Olling & Merrifield (2000) used the flaring of the H_I gas disc to determine the flattening of the DM halo, and Gnedin et al. (2005) proposed the use of proper motions of hypervelocity stars to derive the triaxiality of the halo.

2

Since the fraction of excluded stars is negligible, it is safe to regard our sample as a kinematically unbiased sample.

3

In the case of self-consistent DF model (which is not the case for our application), there is a relationship such that the density behaves as ρ ∝ r^−γ with γ = (4Γ − 6)/(Γ − 1) in the inner region and behaves as ρ ∝ r^−β with β = (B + 3)/2 in the outer region.

4

In Appendix D, we fix κ = 0 and set J_ϕ,0 = constant for our analysis with mock data created from smooth halo models. In contrast, we treat (κ, J_ϕ,0) as free parameters when we analyse the mock data created from one of the cosmological hydrodynamical simulations (m12m), since the stellar halo shows a net rotation.

5

For example, even if (N − 1) stars perfectly follow an action-based model M, an addition of a single unbound star would make the total likelihood of the model M zero. This is because action |$\rm{\boldsymbol {J}}$| is not defined for an unbound star and therefore |$f(\rm{\boldsymbol {x}}, \rm{\boldsymbol {v}})=0$| for the unbound star if f is action-based.

6

However, we note that the Jacobian factor is missing in equations 15–16 (in their section 2.7) of Trick et al. (2016).

7

The reason for using a Cauchy distribution is related to this weight. If we were to use a Gaussian distribution or any thin-tailed distribution, this weight would become very large for a Monte Carlo sample with large |$|{v_{\mathrm{los}}^{\prime }}_{ij} - (- \rm{\boldsymbol {v}}_\odot \cdot \rm{\boldsymbol {e}}_{\mathrm{los},i})|$|⁠. This means that a small number of such Monte Carlo samples may dominate the integral, causing a large systematic error in the Monte Carlo integration. This problem is minimized if we use a fat-tailed distribution such as Cauchy distribution. Note that the above-mentioned problem could be minimized if we could use a large enough number of Monte Carlo points N_MC, which is impractical in our case due to the computational cost.

8

If we are to run MCMC for 5000 steps (with a single MCMC ‘walker’), we need to evaluate the likelihood 5000 times. This would take ∼1 month assuming 10 min per model for evaluating likelihood.

9

We note that these quantities are derived from the covariance matrix of the proper motion distribution along each line of sight, which is different from the covariance matrix of the observational error for each star Σ_μ used in equation (22).

10

For a potential model with a nearly flat rotation curve, the density flattening q and the potential flattening |$q_\Phi$| are related by |$q^2 \simeq 2 q_\Phi ^4 - q_\Phi ^2$| (see equation 2.72b of Binney & Tremaine 2008).

11

Bovy & Rix (2013) derived the vertical force at |$|z|=1.1 \, \mathrm{kpc}$| by analysing the kinematics of disc stars with an assumption that the Galactic DM halo is spherical. Thus, one might worry whether using Bovy & Rix (2013)’s vertical force data introduces a bias such that the posterior distribution of q prefers a spherical DM halo. The additional analysis presented here shows that we do not need to worry about this potential source of bias.

12

We assume that RR Lyrae stars have a colour (V − I) = 0.6 and V-band absolute magnitude M_V = 0.6 (e.g. Yepez et al. 2018). We derive (G − V) and M_G = M_V + (G − V) with PyGaia (https://github.com/agabrown/PyGaia).

13

Table 2 of Sanderson et al. (2020) suggests that m12m’s stellar disc has a scale radius of |${\sim}3.2 \, \mathrm{kpc}$| when they fit the stellar disc at |$|z|\lt 0.3 \, \mathrm{kpc}$| and |$6 \, \mathrm{kpc}\lt R\lt 12 \, \mathrm{kpc}$| with a single exponential disc model. The same table also suggests that 90 percent of the disc stars are located at |$|z| \lt 2.3 \, \mathrm{kpc}$|⁠. If we assume a single-component stellar disc with a constant scale height of z_d, this result means that |$1-\exp (-2.3 \, \mathrm{kpc}/ z_\mathrm{d}) = 0.9$|⁠, or |$z_\mathrm{d} \simeq 1 \, \mathrm{kpc}$|⁠. These scale radius and scale height are consistent with those of our ‘component 2’, which is the dominant component at |$|z|\lt 0.3 \, \mathrm{kpc}$| and |$6 \, \mathrm{kpc}\lt R\lt 12 \, \mathrm{kpc}$|⁠.

REFERENCES

Allende Prieto

C.

et al. ,

2020

,

Res. Notes Am. Astron. Soc.

,

4

,

188

10.3847/2515-5172/abc1dc

Belokurov

V.

,

Erkal

D.

,

Evans

N. W.

,

Koposov

S. E.

,

Deason

A. J.

,

2018

,

MNRAS

,

478

,

611

10.1093/mnras/sty982

10.1051/0004-6361/201424478

Bienaymé

O.

et al. ,

2014

,

A&A

,

571

,

A92

10.1111/j.1365-2966.2012.21757.x

Binney

J.

,

2012

,

MNRAS

,

426

,

1324

Binney

J.

,

Piffl

T.

,

2015

,

MNRAS

,

454

,

3653

10.1093/mnras/stv2225

Binney

J.

,

Tremaine

S.

,

2008

,

Galactic Dynamics: Second Edition

.

Princeton Univ. Press

,

Princeton, NJ

Binney

J.

,

Wong

L. K.

,

2017

,

MNRAS

,

467

,

2446

10.1093/mnras/stx234

Bird

S. A.

,

Xue

X.-X.

,

Liu

C.

,

Shen

J.

,

Flynn

C.

,

Yang

C.

,

2019

,

AJ

,

157

,

104

10.3847/1538-3881/aafd2e

10.1146/annurev-astro-081915-023441

Bland-Hawthorn

J.

,

Gerhard

O.

,

2016

,

ARA&A

,

54

,

529

10.1088/0004-637X/795/1/95

Bovy

J.

,

2014

,

ApJ

,

795

,

95

10.1088/0004-637X/779/2/115

Bovy

J.

,

Rix

H.-W.

,

2013

,

ApJ

,

779

,

115

10.3847/1538-4357/833/1/31

Bovy

J.

,

Bahmanyar

A.

,

Fritz

T. K.

,

Kallivayalil

N.

,

2016

,

ApJ

,

833

,

31

Bowden

A.

,

Belokurov

V.

,

Evans

N. W.

,

2015

,

MNRAS

,

449

,

1391

10.1093/mnras/stv285

Bowden

A.

,

Evans

N. W.

,

Williams

A. A.

,

2016

,

MNRAS

,

460

,

329

10.1093/mnras/stw994

Carollo

D.

,

Chiba

M.

,

2021

,

ApJ

,

908

,

191

10.3847/1538-4357/abd7a4

10.1088/0004-637X/712/1/692

Carollo

D.

et al. ,

2007

,

Nature

,

450

,

1020

Carollo

D.

et al. ,

2010

,

ApJ

,

712

,

692

Cautun

M.

et al. ,

2020

,

MNRAS

,

494

,

4291

10.1093/mnras/staa1017

Chua

K. T. E.

,

Pillepich

A.

,

Vogelsberger

M.

,

Hernquist

L.

,

2019

,

MNRAS

,

484

,

476

10.1093/mnras/sty3531

Cole

D. R.

,

Binney

J.

,

2017

,

MNRAS

,

465

,

798

10.1093/mnras/stw2775

Das

P.

,

Binney

J.

,

2016

,

MNRAS

,

460

,

1725

10.1093/mnras/stw744

Das

P.

,

Williams

A.

,

Binney

J.

,

2016

,

MNRAS

,

463

,

3169

10.1093/mnras/stw2167

10.1088/1475-7516/2019/10/037

de Salas

P. F.

,

Widmark

A.

,

2020

,

preprint (arXiv:2012.11477)

de Salas

P. F.

,

Malhan

K.

,

Freese

K.

,

Hattori

K.

,

Valluri

M.

,

2019

,

J. Cosmol. Astropart. Phys.

,

2019

,

037

10.1111/j.1365-2966.2010.17785.x

Deason

A. J.

,

Belokurov

V.

,

Evans

N. W.

,

2011

,

MNRAS

,

411

,

1480

10.1088/0004-637X/787/1/30

Deason

A. J.

,

Belokurov

V.

,

Koposov

S. E.

,

Rockosi

C. M.

,

2014

,

ApJ

,

787

,

30

Deason

A. J.

et al. ,

2021

,

MNRAS

,

501

,

5964

10.1093/mnras/staa3984

Debattista

V. P.

,

Moore

B.

,

Quinn

T.

,

Kazantzidis

S.

,

Maas

R.

,

Mayer

L.

,

Read

J.

,

Stadel

J.

,

2008

,

ApJ

,

681

,

1076

10.1086/587977

Debattista

V. P.

,

Roškar

R.

,

Valluri

M.

,

Quinn

T.

,

Moore

B.

,

Wadsley

J.

,

2013

,

MNRAS

,

434

,

2971

10.1093/mnras/stt1217

Dubinski

J.

,

1994

,

ApJ

,

431

,

617

10.1086/174512

Eadie

G.

,

Jurić

M.

,

2019

,

ApJ

,

875

,

159

10.3847/1538-4357/ab0f97

Eilers

A.-C.

,

Hogg

D. W.

,

Rix

H.-W.

,

Ness

M. K.

,

2019

,

ApJ

,

871

,

120

10.3847/1538-4357/aaf648

Erkal

D.

et al. ,

2019

,

MNRAS

,

487

,

2685

10.1093/mnras/stz1371

Erkal

D.

,

Belokurov

V. A.

,

Parkin

D. L.

,

2020

,

MNRAS

,

498

,

5574

10.1093/mnras/staa2840

Fellhauer

M.

et al. ,

2006

,

ApJ

,

651

,

167

10.1086/507128

Foreman-Mackey

D.

,

2016

,

J. Open Source Softw.

,

1

,

24

10.21105/joss.00024

Foreman-Mackey

D.

,

Hogg

D. W.

,

Lang

D.

,

Goodman

J.

,

2013

,

Publ. Astron. Soc. Pac.

,

125

,

306

10.1086/670067

10.1051/0004-6361/202039657

Gaia Collaboration

,

2021

,

A&A

,

649

,

A1

Garavito-Camargo

N.

,

Besla

G.

,

Laporte

C. F. P.

,

Price-Whelan

A. M.

,

Cunningham

E. C.

,

Johnston

K. V.

,

Weinberg

M. D.

,

Gomez

F. A.

,

2020

,

ApJ

,

919

,

27

Gibbons

S. L. J.

,

Belokurov

V.

,

Evans

N. W.

,

2014

,

MNRAS

,

445

,

3788

10.1093/mnras/stu1986

Gnedin

O. Y.

,

Gould

A.

,

Miralda-Escudé

J.

,

Zentner

A. R.

,

2005

,

ApJ

,

634

,

344

10.1086/496958

10.1051/0004-6361/201935656

Gravity Collaboration

,

2019

,

A&A

,

625

,

L10

Grillmair

C. J.

,

Dionatos

O.

,

2006

,

ApJ

,

643

,

L17

10.1086/505111

10.1088/2041-8205/763/1/L17

Hattori

K.

,

Yoshii

Y.

,

Beers

T. C.

,

Carollo

D.

,

Lee

Y. S.

,

2013

,

ApJ

,

763

,

L17

Helmi

A.

,

2004

,

ApJ

,

610

,

L97

10.1086/423340

10.1038/s41586-018-0625-x

Helmi

A.

,

Babusiaux

C.

,

Koppelman

H. H.

,

Massari

D.

,

Veljanoski

J.

,

Brown

A. G. A.

,

2018

,

Nature

,

563

,

85

PubMed

Hogg

D. W.

,

Eilers

A.-C.

,

Rix

H.-W.

,

2019

,

AJ

,

158

,

147

10.3847/1538-3881/ab398c

Hopkins

P. F.

et al. ,

2018

,

MNRAS

,

480

,

800

10.1093/mnras/sty1690

Hunter

J. D.

,

2007

,

Comput. Sci. Eng.

,

9

,

90

10.1109/MCSE.2007.55

Iorio

G.

,

Belokurov

V.

,

2019

,

MNRAS

,

482

,

3868

10.1093/mnras/sty2806

Iorio

G.

,

Belokurov

V.

,

2021

,

MNRAS

,

502

,

5686

10.1093/mnras/stab005

Jing

Y. P.

,

Suto

Y.

,

2002

,

ApJ

,

574

,

538

10.1086/341065

Johnston

K. V.

,

Zhao

H.

,

Spergel

D. N.

,

Hernquist

L.

,

1999

,

ApJ

,

512

,

L109

10.1086/311876

Johnston

K. V.

,

Law

D. R.

,

Majewski

S. R.

,

2005

,

ApJ

,

619

,

800

10.1086/426777

Jones

E.

,

Oliphant

T.

,

Peterson

P.

,

2001

,

SciPy: Open Source Scientific Tools for Python

.

Kafle

P. R.

,

Sharma

S.

,

Lewis

G. F.

,

Bland -Hawthorn

J.

,

2013

,

MNRAS

,

430

,

2973

10.1093/mnras/stt101

Kazantzidis

S.

,

Kravtsov

A. V.

,

Zentner

A. R.

,

Allgood

B.

,

Nagai

D.

,

Moore

B.

,

2004

,

ApJ

,

611

,

L73

10.1086/423992

10.1088/2041-8205/720/1/L62

Kazantzidis

S.

,

Abadi

M. G.

,

Navarro

J. F.

,

2010

,

ApJ

,

720

,

L62

10.1088/0004-637X/712/1/260

Koposov

S. E.

,

Rix

H.-W.

,

Hogg

D. W.

,

2010

,

ApJ

,

712

,

260

10.1146/annurev.astro.42.053102.134024

Kormendy

J.

,

Kennicutt Robert

C. J.

,

2004

,

ARA&A

,

42

,

603

Kuijken

K.

,

Gilmore

G.

,

1991

,

ApJ

,

367

,

L9

10.1086/185920

10.1088/0004-637X/803/2/80

Küpper

A. H. W.

,

Balbinot

E.

,

Bonaca

A.

,

Johnston

K. V.

,

Hogg

D. W.

,

Kroupa

P.

,

Santiago

B. X.

,

2015

,

ApJ

,

803

,

80

10.1088/0004-637X/714/1/229

Law

D. R.

,

Majewski

S. R.

,

2010

,

ApJ

,

714

,

229

10.1051/0004-6361/201832727

Lindegren

L.

et al. ,

2018

,

A&A

,

616

,

A2

10.1088/0004-637X/794/2/151

Loebman

S. R.

et al. ,

2014

,

ApJ

,

794

,

151

10.1088/0004-637X/814/1/13

McKee

C. F.

,

Parravano

A.

,

Hollenbach

D. J.

,

2015

,

ApJ

,

814

,

13

McMillan

P. J.

,

2017

,

MNRAS

,

465

,

76

10.1093/mnras/stw2759

10.1111/j.1365-2966.2011.19879.x

McMillan

P. J.

,

Binney

J.

,

2012

,

MNRAS

,

419

,

2251

McMillan

P. J.

,

Binney

J. J.

,

2013

,

MNRAS

,

433

,

1411

10.1093/mnras/stt814

Malhan

K.

,

Ibata

R. A.

,

2019

,

MNRAS

,

486

,

2995

10.1093/mnras/stz1035

Mateu

C.

,

Holl

B.

,

De Ridder

J.

,

Rimoldini

L.

,

2020

,

MNRAS

,

496

,

3291

10.1093/mnras/staa1676

Naidu

R. P.

et al. ,

2021

,

preprint (arXiv:2103.03251)

Navarro

J. F.

,

Frenk

C. S.

,

White

S. D. M.

,

1997

,

ApJ

,

490

,

493

10.1086/304888

Nitschai

M. S.

,

Cappellari

M.

,

Neumayer

N.

,

2020

,

MNRAS

,

494

,

6001

10.1093/mnras/staa1128

10.1046/j.1365-8711.2000.03053.x

Olling

R. P.

,

Merrifield

M. R.

,

2000

,

MNRAS

,

311

,

361

10.1088/0004-637X/799/1/28

Pearson

S.

,

Küpper

A. H. W.

,

Johnston

K. V.

,

Price-Whelan

A. M.

,

2015

,

ApJ

,

799

,

28

10.1038/s41550-020-01254-3

Petersen

M. S.

,

Peñarrubia

J.

,

2021

,

Nat. Astron.

,

5

,

251

Piffl

T.

et al. ,

2014

,

MNRAS

,

445

,

3133

10.1093/mnras/stu1948

Piffl

T.

,

Penoyre

Z.

,

Binney

J.

,

2015

,

MNRAS

,

451

,

639

10.1093/mnras/stv938

10.1051/0004-6361/201833355

Posti

L.

,

Helmi

A.

,

2019

,

A&A

,

621

,

A56

Posti

L.

,

Binney

J.

,

Nipoti

C.

,

Ciotti

L.

,

2015

,

MNRAS

,

447

,

3060

10.1093/mnras/stu2608

Prada

J.

,

Forero-Romero

J. E.

,

Grand

R. J. J.

,

Pakmor

R.

,

Springel

V.

,

2019

,

MNRAS

,

490

,

4877

10.1093/mnras/stz2873

10.1088/0954-3899/41/6/063101

Read

J. I.

,

2014

,

J. Phys. G Nucl. Phys.

,

41

,

063101

Reid

M. J.

,

Brunthaler

A.

,

2004

,

ApJ

,

616

,

872

10.1086/424960

Sackett

P. D.

,

1997

,

ApJ

,

483

,

103

10.1086/304223

Sanderson

R. E.

et al. ,

2020

,

ApJS

,

246

,

6

10.3847/1538-4365/ab5b9d

Sanders

J. L.

,

Binney

J.

,

2013

,

MNRAS

,

433

,

1826

10.1093/mnras/stt816

Sanders

J. L.

,

Binney

J.

,

2015a

,

MNRAS

,

447

,

2479

10.1093/mnras/stu2598

Sanders

J. L.

,

Binney

J.

,

2015b

,

MNRAS

,

449

,

3479

10.1093/mnras/stv578

Sanders

J. L.

,

Binney

J.

,

2016

,

MNRAS

,

457

,

2107

10.1093/mnras/stw106

10.1111/j.1365-2966.2010.16253.x

Schönrich

R.

,

Binney

J.

,

Dehnen

W.

,

2010

,

MNRAS

,

403

,

1829

Simion

I. T.

,

Belokurov

V.

,

Koposov

S. E.

,

2019

,

MNRAS

,

482

,

921

10.1093/mnras/sty2744

Ting

Y.-S.

,

Rix

H.-W.

,

Bovy

J.

,

van de Ven

G.

,

2013

,

MNRAS

,

434

,

652

10.1093/mnras/stt1053

10.3847/0004-637X/830/2/97

Trick

W. H.

,

Bovy

J.

,

Rix

H.-W.

,

2016

,

ApJ

,

830

,

97

Utkin

N. D.

,

Dambis

A. K.

,

2020

,

MNRAS

,

499

,

1058

10.1093/mnras/staa2819

10.1111/j.1365-2966.2009.16192.x

Valluri

M.

,

Debattista

V. P.

,

Quinn

T.

,

Moore

B.

,

2010

,

MNRAS

,

403

,

525

10.1111/j.1365-2966.2011.19853.x

Valluri

M.

,

Debattista

V. P.

,

Quinn

T. R.

,

Roškar

R.

,

Wadsley

J.

,

2012

,

MNRAS

,

419

,

1951

van der Walt

S.

,

Colbert

S. C.

,

Varoquaux

G.

,

2011

,

Comput. Sci. Eng.

,

13

,

22

10.1109/MCSE.2011.37

Vasiliev

E.

,

2019

,

MNRAS

,

482

,

1525

10.1093/mnras/sty2672

Vasiliev

E.

,

Belokurov

V.

,

Erkal

D.

,

2021

,

MNRAS

,

501

,

2279

10.1093/mnras/staa3673

10.1007/s11433-019-1541-6

Wang

W.

,

Han

J.

,

Cautun

M.

,

Li

Z.

,

Ishigaki

M. N.

,

2020

,

Sci. China Phys., Mech., Astron.

63

,

109801

Wegg

C.

,

Gerhard

O.

,

Bieth

M.

,

2019

,

MNRAS

,

485

,

3296

10.1093/mnras/stz572

10.3847/2041-8205/827/2/L23

Wetzel

A. R.

,

Hopkins

P. F.

,

Kim

J.-h.

,

Faucher-Giguère

C.-A.

,

Kereš

D.

,

Quataert

E.

,

2016

,

ApJ

,

827

,

L23

10.1088/0067-0049/197/2/30

Yepez

M. A.

,

Arellano Ferro

A.

,

Muneer

S.

,

Giridhar

S.

,

2018

,

Rev. Mex. Astron. Astrofis.

,

54

,

15

Zemp

M.

,

Gnedin

O. Y.

,

Gnedin

N. Y.

,

Kravtsov

A. V.

,

2011

,

ApJS

,

197

,

30

10.1088/0004-637X/748/1/54

Zemp

M.

,

Gnedin

O. Y.

,

Gnedin

N. Y.

,

Kravtsov

A. V.

,

2012

,

ApJ

,

748

,

54

Zhu

Q.

,

Marinacci

F.

,

Maji

M.

,

Li

Y.

,

Springel

V.

,

Hernquist

L.

,

2016

,

MNRAS

,

458

,

1559

10.1093/mnras/stw374

APPENDIX A: COORDINATE SYSTEM

We adopt a right-hand Galactocentric Cartesian coordinate system (x, y, z), in which the (x, y)-plane is the Galactic disc plane. The x-axis is directed from the Sun to the Galactic centre, the y-axis is parallel to the direction of the Galactic rotation at the Solar position (i.e. the direction of ℓ = 90^○ as seen from the Sun), and the z-axis is perpendicular to the Galactic disc. The position of the Sun is assumed to be |$\rm{\boldsymbol {x}}_\odot = (x_\odot ,y_\odot ,z_\odot) = (-R_0,0,0)$|⁠, with |$R_0 = 8.178 \, \mathrm{kpc}$| (Gravity Collaboration 2019). The velocity of the Sun with respect to the Galactic rest frame is assumed to be |$\rm{\boldsymbol {v}}_\odot = (v_{x,\odot },v_{y,\odot },v_{z,\odot }) = (11.10, 247.30, 7.25) \, \mathrm{km\ s}^{-1}$| (Reid & Brunthaler 2004; Schönrich, Binney & Dehnen 2010). We also define a Galactocentric spherical coordinate system (r, ϕ, θ) and a Galactocentric cylindrical coordinate system (R, ϕ, z), such that (x, y, z) = (rcos θcos ϕ, rcos θsin ϕ, rsin θ) = (Rcos ϕ, Rsin ϕ, z). Also, for each 3D location with respect to the Sun, we define the line-of-sight unit vector |$\rm{\boldsymbol {e}}_\mathrm{los}$|⁠.

APPENDIX B: FULL RESULT OF OUR MCMC ANALYSIS

Figs B1 and B2 show corner plots of the posterior distributions resulting from our fiducial analysis. The quantities shown in Fig. B1 are the raw variables used in our MCMC, while the quantities shown in Fig. B2 are more physically meaningful quantities.

$The corner plot of all the parameters in our analysis of Gaia RR Lyrae stars. The parameters shown here are the parameters for the baryonic mass distribution (Mbulge/(1010 M⊙), $R_\mathrm{d}^\mathrm{thin}$, Mthin/(1010M⊙), $R_\mathrm{d}^\mathrm{thick}$, Mthick/(1010M⊙)), the parameters for the DM mass distribution (log10ρ0, log10a, γ, U), and the parameters for the DF (Γ, ain, bin, aout, bout, log10J0, log10η). The top panel of each column shows the the posterior distribution of each parameter (thick solid histograms), and the corresponding prior distribution (thin-dashed histograms).$

Figure B1.

The corner plot of all the parameters in our analysis of Gaia RR Lyrae stars. The parameters shown here are the parameters for the baryonic mass distribution (M_bulge/(10¹⁰ M_⊙), |$R_\mathrm{d}^\mathrm{thin}$|⁠, M_thin/(10¹⁰M_⊙), |$R_\mathrm{d}^\mathrm{thick}$|⁠, M_thick/(10¹⁰M_⊙)), the parameters for the DM mass distribution (log₁₀ρ₀, log₁₀a, γ, U), and the parameters for the DF (Γ, a_in, b_in, a_out, b_out, log₁₀J₀, log₁₀η). The top panel of each column shows the the posterior distribution of each parameter (thick solid histograms), and the corresponding prior distribution (thin-dashed histograms).

$The same as Fig. B1, but expressed by more physically meaningful quantities. The parameters shown here are the parameters for the baryonic mass distribution (Mbulge/(1010 M⊙), $R_\mathrm{d}^\mathrm{thin}$, Mthin/(1010 M⊙), $R_\mathrm{d}^\mathrm{thick}$, Mthick/(1010 M⊙), Mstar/(1010 M⊙)) and the parameters for the DM mass distribution (M200/(1012 M⊙), r200, M94/(1012M⊙), r94, c′, r−2, a, γ, q).$

Figure B2.

The same as Fig. B1, but expressed by more physically meaningful quantities. The parameters shown here are the parameters for the baryonic mass distribution (M_bulge/(10¹⁰M_⊙), |$R_\mathrm{d}^\mathrm{thin}$|⁠, M_thin/(10¹⁰ M_⊙), |$R_\mathrm{d}^\mathrm{thick}$|⁠, M_thick/(10¹⁰ M_⊙), M_star/(10¹⁰M_⊙)) and the parameters for the DM mass distribution (M₂₀₀/(10¹² M_⊙), r₂₀₀, M₉₄/(10¹²M_⊙), r₉₄, c′, r₋₂, a, γ, q).

In Fig. B1, the top panel of each column, also shows the prior distribution for each parameter (dashed blue histrograms). By comparing the posterior (solid black histograms) and prior distributions, we can see that the bulge and thick-disc parameters are essentially determined by the prior distributions, while the thin-disc and DM-halo parameters are constrained by the data.

APPENDIX C: COMPARISON WITH A POTENTIAL-ONLY FIT

In our fiducial model, we estimated the MW potential by using the circular velocity data, the vertical force data, and the kinematic data for RR Lyrae stars. The circular velocity and vertical force data only constrain the gravitational potential close to the disc plane, while the RR Lyrae constrain the global potential and the DF of RR Lyrae. To understand how much the RR Lyrae data contribute to our inference, we performed a similar analysis without using the RR Lyrae data. Namely, instead of the logarithmic likelihood function given in equation (42), we used a logarithmic likelihood given by

$$\begin{eqnarray*} \ln \mathrm{Pr}(D|M) &= \ln L_\mathrm{circ} + \ln L_\mathrm{Kz}. \end{eqnarray*}$$

(C1)

We used the same prior distribution as in our fiducial analysis. In Fig. C1, the green dotted histograms show the posterior distribution derived from this additional analysis, while the black solid histogram show the posterior distribution of our fiducial analysis.

Figure C1.

The posterior distribution of our analyses with and without using RR Lyrae data. The solid black histograms show the posterior distributions from our fiducial analysis, in which we used all three kinds of data. The dotted green histograms show the posterior distributions for a test in which we used the circular velocity data and vertical force data only (see Appendix C for details). The parameters shown here are the same as in Fig. B2. We see that the DM’s flattening parameter q can be well constrained if we use the RR Lyrae data in our inference. We note that the area below each histogram is normalized to unity, except for the green dotted histogram for q, which is multiplied by 100 for greater visibility.

On the one hand, the posterior distributions of parameters (a, γ, c′, q) and quantities characterizing the DM halo (e.g. virial mass and virial radius) are improved (become narrower) by using the RR Lyrae data. This improvement is most significant for the density flattening q of the DM halo. As we can see from the bottom right-hand panel, q is poorly constrained when we use the circular velocity data and the vertical force data only. In contrast, when we also use the RR Lyrae data, the posterior distribution of q is highly peaked. Based on this additional analysis, we can conclude that the posterior distribution of q in our fiducial analysis is peaked near q = 1 primarily because of the kinematic data of RR Lyrae stars, and not because of the circular velocity data or the vertical force data. This result is reassuring, because our main motivation in this paper is to constrain q by using halo tracers.¹¹

On the other hand, many parameters for the baryonic components, such as the bulge or the thick disc, show no significant improvement when we additionally use the RR Lyrae data. (The only exception is the mass of the thin disc M_thin, whose posterior distribution is peaked at higher value when RR Lyrae data are additionally used. As a result, the total stellar mass M_star = M_bulge + M_thin + M_thick is also peaked at higher value.) This result indicates that many baryonic parameters are almost exclusively determined by the circular velocity data, the vertical force data, and our prior distribution. Intriguingly, de Salas et al. (2019) also found that their posterior distribution for the parameters of the baryonic model is dominated by their prior when they analysed the same v_circ data. Our result confirms the result of de Salas et al. (2019) even in the presence of additional data such as K_z data or the kinematic data of halo stars. This result indicates that precise knowledge of the baryonic potential is essential for our understanding of the Galactic DM distribution.

APPENDIX D: VALIDATION OF OUR METHOD WITH MOCK DATA

To validate our method and results for the Gaia RR Lyrae sample, we perform similar analyses with eight mock data sets summarized in Table D1. For each of the models below, we generate a 5D mock data set without v_los and a 6D mock data set. The number of mock stars and the survey selection function are identical to those for the RR Lyrae sample.

Table D1.

Summary of mock data used for validation and results.

Base model	True DM density flattening	Data type	ρ_DM(R, 0, 0)	ρ_DM(R₀, 0, z)	q
Analytic DF model + analytic potential model	q_true = 0.6	5D	Fig. D1(a)	Fig. D2(a)	Fig. D3(a)
Analytic DF model + analytic potential model	q_true = 0.8	5D	Fig. D1(b)	Fig. D2(b)	Fig. D3(b)
Analytic DF model + analytic potential model	q_true = 0.996	5D	Fig. D1(c)	Fig. D2(c)	Fig. D3(c)
Analytic DF model + analytic potential model	q_true = 0.6	6D	Fig. D1(d)	Fig. D2(d)	Fig. D3(d)
Analytic DF model + analytic potential model	q_true = 0.8	6D	Fig. D1(e)	Fig. D2(e)	Fig. D3(e)
Analytic DF model + analytic potential model	q_true = 0.996	6D	Fig. D1(f)	Fig. D2(f)	Fig. D3(f)
m12m galaxy	Fig. D4	5D	Fig. D5(a)	Fig. D6(a)	Fig. D7(a)
m12m galaxy	Fig. D4	6D	Fig. D5(b)	Fig. D6(b)	Fig. D7(b)

Base model	True DM density flattening	Data type	ρ_DM(R, 0, 0)	ρ_DM(R₀, 0, z)	q
Analytic DF model + analytic potential model	q_true = 0.6	5D	Fig. D1(a)	Fig. D2(a)	Fig. D3(a)
Analytic DF model + analytic potential model	q_true = 0.8	5D	Fig. D1(b)	Fig. D2(b)	Fig. D3(b)
Analytic DF model + analytic potential model	q_true = 0.996	5D	Fig. D1(c)	Fig. D2(c)	Fig. D3(c)
Analytic DF model + analytic potential model	q_true = 0.6	6D	Fig. D1(d)	Fig. D2(d)	Fig. D3(d)
Analytic DF model + analytic potential model	q_true = 0.8	6D	Fig. D1(e)	Fig. D2(e)	Fig. D3(e)
Analytic DF model + analytic potential model	q_true = 0.996	6D	Fig. D1(f)	Fig. D2(f)	Fig. D3(f)
m12m galaxy	Fig. D4	5D	Fig. D5(a)	Fig. D6(a)	Fig. D7(a)
m12m galaxy	Fig. D4	6D	Fig. D5(b)	Fig. D6(b)	Fig. D7(b)

Table D1.

Summary of mock data used for validation and results.

Base model	True DM density flattening	Data type	ρ_DM(R, 0, 0)	ρ_DM(R₀, 0, z)	q
Analytic DF model + analytic potential model	q_true = 0.6	5D	Fig. D1(a)	Fig. D2(a)	Fig. D3(a)
Analytic DF model + analytic potential model	q_true = 0.8	5D	Fig. D1(b)	Fig. D2(b)	Fig. D3(b)
Analytic DF model + analytic potential model	q_true = 0.996	5D	Fig. D1(c)	Fig. D2(c)	Fig. D3(c)
Analytic DF model + analytic potential model	q_true = 0.6	6D	Fig. D1(d)	Fig. D2(d)	Fig. D3(d)
Analytic DF model + analytic potential model	q_true = 0.8	6D	Fig. D1(e)	Fig. D2(e)	Fig. D3(e)
Analytic DF model + analytic potential model	q_true = 0.996	6D	Fig. D1(f)	Fig. D2(f)	Fig. D3(f)
m12m galaxy	Fig. D4	5D	Fig. D5(a)	Fig. D6(a)	Fig. D7(a)
m12m galaxy	Fig. D4	6D	Fig. D5(b)	Fig. D6(b)	Fig. D7(b)

Base model	True DM density flattening	Data type	ρ_DM(R, 0, 0)	ρ_DM(R₀, 0, z)	q
Analytic DF model + analytic potential model	q_true = 0.6	5D	Fig. D1(a)	Fig. D2(a)	Fig. D3(a)
Analytic DF model + analytic potential model	q_true = 0.8	5D	Fig. D1(b)	Fig. D2(b)	Fig. D3(b)
Analytic DF model + analytic potential model	q_true = 0.996	5D	Fig. D1(c)	Fig. D2(c)	Fig. D3(c)
Analytic DF model + analytic potential model	q_true = 0.6	6D	Fig. D1(d)	Fig. D2(d)	Fig. D3(d)
Analytic DF model + analytic potential model	q_true = 0.8	6D	Fig. D1(e)	Fig. D2(e)	Fig. D3(e)
Analytic DF model + analytic potential model	q_true = 0.996	6D	Fig. D1(f)	Fig. D2(f)	Fig. D3(f)
m12m galaxy	Fig. D4	5D	Fig. D5(a)	Fig. D6(a)	Fig. D7(a)
m12m galaxy	Fig. D4	6D	Fig. D5(b)	Fig. D6(b)	Fig. D7(b)

To generate these mock data, we prepare three smooth stellar halo models constructed from analytic DFs embedded in analytic potential models, as detailed in Appendix D1.1. We also generated a mock date set from a realistic Milky-Way-like galaxy m12m from the Fire-2, Latte cosmological hydrodynamical simulation suite (Wetzel et al. 2016; Hopkins et al. 2018; Sanderson et al. 2020), as detailed in Appendix D2.4.

D1 Validation with ‘smooth halo’ mock data sets

D1.1 Mock data for smooth stellar halo models

We construct three smooth stellar halo models. We assume that the halo stars are test particles moving in an analytic potential composed of baryonic component and DM component. The functional form of the model potential (bulge, thin/thick/gas discs and DM halo) and the functional form of the stellar halo DF are identical to those in Section 3. We adopt the same gas disc model as in McMillan (2017), and we adopt three sets of parameters for bulge, thin disc, thick disc, DM halo, and stellar halo DF model:

the best-fitting parameters shown in Table 1 (with q = 0.996);
the same as the best-fitting parameters but with q = 0.8; and
the same as the best-fitting parameters but with q = 0.6.

In other words, these models are different from each other only in terms of the DM density flattening q.

For each model, we evaluate the circular velocity |$v_\mathrm{circ}^\mathrm{model}(R)$| at the same radii as in the data of Eilers et al. (2019) and take into account the same amount of observational error. Also, for each model, we evaluate the vertical force |$K_{z,1.1 \, \mathrm{kpc}}^\mathrm{model}(R)$| at the same locations as in the data of Bovy & Rix (2013) and take into account the same amount of fractional error. The mock data of circular velocity and vertical force are used to aid our DF fitting.

To generate mock RR Lyrae stars, we first sample mock halo stars from the above-mentioned three input models. We then add Gaia DR2-like proper motion errors that depend on the Gaia’s G-band photometry. For this purpose, we assume that all the mock stars are RR Lyrae stars with G-band absolute magnitude¹²M_G = −0.1376, and compute Gaia DR2-like proper motion error by using a formula described in equation (16) in Sanderson et al. (2020). We also add a distance modulus uncertainty of 0.240 (mimicking RR Lyrae stars), and an optimistic |$5 \, \mathrm{km\ s}^{-1}$| error on v_los.

From this sample of error-added mock stars, we randomly choose N = 16197 stars by using the spatial selection function defined in Section 3.3. In the following, we use three sets of ‘6D mock data’ and three sets of ‘5D mock data’ for which we mask v_los.

D1.2 Radial dark matter density

Fig. D1 shows the reconstructed DM density |$\rho _\mathrm{DM}(R, z=0 \, \mathrm{kpc})$|⁠. We see that the central 68 percentile region (grey shaded region) traces the true DM profile (dashed line) for all mock data.

$The DM density profile $\rho (R,z=0 \, \mathrm{kpc})$ reconstructed from our mock analysis. The input value of the flattening, qtrue, is shown in each panel. The shaded region and the region enclosed by solid lines cover the central 68 and 95 percentiles of the posterior distribution. Also, the dashed line corresponds to the true profile of the input model. In panels (a)–(c), the results are shown for DF fitting with 5D data (without vlos data). In panels (d)–(f), the results are shown for DF fitting with 6D data. The agreement between the true profile and the posterior profile indicates that our method can recover the DM profile even if we lack the vlos data.$

Figure D1.

The DM density profile |$\rho (R,z=0 \, \mathrm{kpc})$| reconstructed from our mock analysis. The input value of the flattening, q_true, is shown in each panel. The shaded region and the region enclosed by solid lines cover the central 68 and 95 percentiles of the posterior distribution. Also, the dashed line corresponds to the true profile of the input model. In panels (a)–(c), the results are shown for DF fitting with 5D data (without v_los data). In panels (d)–(f), the results are shown for DF fitting with 6D data. The agreement between the true profile and the posterior profile indicates that our method can recover the DM profile even if we lack the v_los data.

Although the spatial distribution of our mock sample stars is limited by the survey volume, the inferred density profile traces the actual density profile even beyond the survey volume. Given that the inner density slope is allowed to vary freely, it is intriguing that the DF fitting can recover well the radial DM density profile at |$1\lt R/\, \mathrm{kpc}\lt 5$|⁠. The almost perfect reconstruction of the outer density profile at |$R\gtrsim 30 \, \mathrm{kpc}$| can be partly explained by the fact that we fixed the outer density slope to be the correct value (−3).

D1.3 Vertical DM density

Fig. D2 shows the reconstructed vertical profile of the DM density ρ_DM(R = R₀, z) evaluated at the Solar cylinder. We see that the true profile (dashed line) matches well with the central 68 percentile region (grey shaded region) for all the mock data sets. By comparing panels (a)–(c) and (d)–(f), we see that the uncertainty in ρ_DM(R = R₀, z) is larger for models with more flattened DM haloes (with smaller q_true). This trend may be explained by the fact that a more flattened DM distribution makes it harder to disentangle the dynamical contribution from DM and the baryonic discs.

Figure D2.

The same as Fig. D1 but for the reconstructed DM density profile ρ(R₀, z) of the mock data.

D1.4 Dark matter density flattening

Fig. D3 shows the posterior distribution of q. We see that the posterior distribution is centred around q_true, for both 5D and 6D mock data. This result implies that the lack of v_los is not a problem in inferring q as long as we handle the missing v_los properly.

Figure D3.

The posterior distribution of the DM density flattening q for the mock data. We note that the range of horizontal axis is different in each panel.

It is not surprising that the uncertainty in q is larger if we use 5D data instead of 6D data. This result indicates that the added information from v_los improves the inference on q, which is motivation to obtain v_los for a large number of Galactic RR Lyrae stars and other halo tracers (e.g. BHB and K-giants).

We note that the uncertainty in q is smaller when q_true is larger. For example, for the 5D mock data sets, the difference between the 16th and 50th percentiles of the posterior distribution is Δq = 0.068, 0.042, and 0.014 for q_true = 0.6, 0.8, and 0.996, respectively. This trend may be understood by considering that it is more difficult to distinguish the dynamical contributions from the baryonic disc and the DM halo if the DM halo is more flattened.

In this paper, we do not allow q > 1, because AGAMA can currently compute the actions only in oblate potentials. In Fig. D3(c)–(f), where q_true = 0.996, the posterior distribution is skewed because our method cannot explore the parameter space at q > 1. It is worth noting that the posterior distribution of q is peaked at 0.99 < q < 1, consistent with q_true = 0.996. This result is intriguing when we interpret our results with our fiducial model for which the posterior distribution is peaked at q = 1, (see Fig. 6a). Based on our analysis of mock data, our results with Gaia RR Lyrae stars might indicate that the Galactic DM halo is very close to spherical.

D2 Validation with realistic mock data generated from the m12m galaxy from the Latte simulations

D2.1 True shape of the dark matter halo of galaxy m12m

We first derive the true shape of the m12m’s DM halo with an iterative ‘S1 method’ in Zemp et al. (2011). Fig. D4 shows the minor-to-major axial ratio (c/a) and the intermediate-to-major axial ratio (b/a) of the DM density at each ellipsoidal radius m defined by

$$\begin{eqnarray*} m^2 = x_a^2 + \left(\frac{x_b}{b/a}\right)^2 + \left(\frac{x_c}{c/a}\right)^2 . \end{eqnarray*}$$

(D1)

Here, (a, b, c) (a ≥ b ≥ c > 0) are the length scales of the three principal axes of the DM density distribution and (x_a, x_b, x_c) are the spatial coordinates along (a, b, c)-axes, respectively. It turns out that the minor axis (c-axis) of m12m DM halo is approximately parallel to the z-axis within |$m\lt 100 \, \mathrm{kpc}$|⁠. At |$m\lt 8 \, \mathrm{kpc}$|⁠, we see that (b/a, c/a) ≃ (0.8, 0.65), so the DM halo is triaxial. At |$m\gt 12 \, \mathrm{kpc}$|⁠, in contrast, we see that (b/a, c/a) ≃ (0.95, 0.65), indicating that the DM halo becomes nearly oblate-axisymmetric with q = (c/a) = 0.65 beyond ∼12kpc.

$The minor-to-major axial ratio (c/a) and the intermediate-to-major axial ratio (b/a) as a function of the ellipsoidal radius m from the centre of m12m galaxy. We see that the inner part ($m\lt 8\, \mathrm{kpc}$) of the DM halo is triaxial, while it becomes nearly axisymmetric at $m\gt 12 \, \mathrm{kpc}$.$

Figure D4.

The minor-to-major axial ratio (c/a) and the intermediate-to-major axial ratio (b/a) as a function of the ellipsoidal radius m from the centre of m12m galaxy. We see that the inner part (⁠|$m\lt 8\, \mathrm{kpc}$|⁠) of the DM halo is triaxial, while it becomes nearly axisymmetric at |$m\gt 12 \, \mathrm{kpc}$|⁠.

D2.2 Stellar distribution in m12m galaxy

The distribution of disc stars in m12m galaxy is nearly axisymmetric with no hint of a central bar. We fit the stellar distribution by a linear combination of three exponential discs, each of which can be described by equation (4). We refer to these three components as components 1, 2, and 3. The three parameters for component 1, the total mass, scale radius, and scale height are |$(M_\mathrm{disc,1}, R_\mathrm{d,1}, z_\mathrm{d,1}) = (2.30\times 10^{10}{\,\rm M_\odot} , 0.653 \, \mathrm{kpc}, 0.286 \, \mathrm{kpc})$|⁠. Based on the short scale radius, the component 1 can be interpreted as the pseudo-bulge of the galaxy. (We note that pseudo-bulges have an exponential radial density profiles as shown in Kormendy & Kennicutt 2004, unlike the MW bulge model in our fiducial analysis.) The three parameters for components 2 and 3 are |$(M_\mathrm{disc,2}, R_\mathrm{d,2}, z_\mathrm{d,2}) = (8.27\times 10^{10}{\,\rm M_\odot} , 3.0 \, \mathrm{kpc}, 0.89 \, \mathrm{kpc})$| and |$(M_\mathrm{disc,3}, R_\mathrm{d,3}, z_\mathrm{d,3}) = (2.28\times 10^{10}{\,\rm M_\odot} , 5.8 \, \mathrm{kpc}, 2.78 \, \mathrm{kpc})$|⁠, respectively. Based on the relative size of the scale heights, the components 2 and 3 can be interpreted as the thin and thick discs of m12m galaxy. We note that these structural parameters are at least partially consistent with the results in Sanderson et al. (2020).¹³

We use the functional forms of the above-mentioned three-component stellar distribution model in the mock analysis. We chose to treat (M_{disc, 1}, M_{disc, 2}, R_{d, 2}, M_{disc, 3}, R_{d, 3}) as the free parameters to be determined through mock analysis. We chose to fix (R_{d, 1}, z_{d, 1}, z_{d, 2}, z_{d, 3}) to their correct parameters mentioned in the previous paragraph. We note that these choices are designed to mimic our fiducial analysis. For example, we fixed z_{d, 2} in the mock analysis because the component 2 can be interpreted as the ‘thin disc’ of m12m and because the scale height of the MW’s thin disc is fixed in our fiducial analysis.

We also note that we adopted Bayesian priors for (M_{disc, 1}, M_{disc, 2}, R_{d, 2}, M_{disc, 3}, R_{d, 3}) that are similar to the ones shown in Table 1. For example, we applied a Bayesian prior of M_{disc, 1} = (2.30 ± 0.23) × 10¹⁰M_⊙, because the component 1 can be interpreted as the ‘bulge’ of m12m and because we adopted a 10 per cent uncertainty on the mass of the MW’s bulge in our fiducial analysis.

D2.3 Gas distribution in m12m galaxy

The distribution of the gas in m12m galaxy is approximately axisymmetric, with some hint of flocculent spiral-like structures. We use the distribution of the gas particles in m12m galaxy and compute the azimuthally averaged potential by using the AGAMA’s ‘CylSpline expansion’ scheme. Throughout the mock analysis, we use this ground-truth potential as the gas potential of m12m galaxy. This is motivated by the fact that we fixed the gas potential in our fiducial analysis.

D2.4 Mock data generated from Latte simulation

By using the star, gas, and DM particles in m12m galaxy, we evaluate the circular velocity curve and the radial profile of the vertical force at |$z=1.1 \, \mathrm{kpc}$| above the disc plane. We use these quantities to generate mock data for circular velocity curves and vertical force profiles akin to those shown in Figs 2 and 3 used in modelling the real RR Lyrae data.

To generate mock samples mimicking the RR Lyrae stars, we first select old halo stars in m12m with metallicity [Fe/H] < −1.5 and age |$\tau \gt 8 \, \mathrm{Gyr}$|⁠. For these old halo stars, we add mock observational errors assuming that all the stars are RR Lyrae stars. As in Section D1.1, we prepare a 6D mock data set and a 5D mock data set without v_los data whose spatial distribution is defined in Section 3.3.

We use these mock halo catalogues along with the mock circular velocity curve and vertical force, to infer the DM density distribution in m12m in exactly the same manner as done for the Gaia RR Lyrae sample in the main body of this paper.

D2.5 Radial dark matter density of m12m galaxy

Fig. D5 shows the posterior distribution of the DM density profile at |$(R,z)=(R,0 \, \mathrm{kpc})$|⁠. We see that ρ_DM(R, 0) is successfully reconstructed for |$1 \lesssim R \lesssim 100 \, \mathrm{kpc}$|⁠. Given that we do not use mock halo stars within |$5 \, \mathrm{kpc}$| from the galactic centre, the successful recovery of the inner density profile of the DM halo is quite promising.

$The same as Fig. D1 but using 5D and 6D mock data sets generated from m12m galaxy. The dashed line shows the true density profile $\rho _\mathrm{DM}(R, z=0 \, \mathrm{kpc})$, which is estimated from spherical harmonic expansion of the DM particles in m12m galaxy.$

Figure D5.

The same as Fig. D1 but using 5D and 6D mock data sets generated from m12m galaxy. The dashed line shows the true density profile |$\rho _\mathrm{DM}(R, z=0 \, \mathrm{kpc})$|⁠, which is estimated from spherical harmonic expansion of the DM particles in m12m galaxy.

It seems to be challenging for our method to recover the DM density profile within |$1 \, \mathrm{kpc}$| from the galaxy. For example, Fig. D5 shows that the reconstructed profile has a steeper density slope than the true profile at |$R \lesssim 1 \, \mathrm{kpc}$|⁠. This is not surprising, as we have seen in Fig. D1 that the reconstructed density profile at |$R\lesssim 1 \, \mathrm{kpc}$| is associated with large uncertainty even if the mock data are generated from a smooth halo model. This problem might be resolved by future access to halo stars within |$5 \, \mathrm{kpc}$| from the Galactic centre, however this is observationally challenging.

D2.6 Vertical dark matter density of m12m galaxy

Fig. D6 shows the posterior distribution of the DM density profile at (R, z) = (R₀, z). We see that the global shape of ρ_DM(R₀, z) is more or less well recovered, although there is some offset such that the reconstructed profile shows a lower density at low-|z| and the ‘knee’ of the recovered density profile occurs at slightly larger |z| than it should.

Figure D6.

The same as Fig. D2 but using 5D and 6D mock data sets generated from m12m galaxy in Latte simulation. The dashed line shows the true density profile ρ_DM(R₀, z), which is estimated from spherical harmonic expansion of the DM particles in m12m galaxy.

D2.7 Dark matter density flattening of m12m galaxy

Fig. D7 shows the posterior distribution of q. The three vertical dashed lines shows the (16,50,84)th percentiles of the distribution. We see that the posterior distribution has a width of ∼0.1 around its peak (⁠|$q=0.765^{+0.111}_{-0.110}$| and |$q=0.818^{+0.103}_{-0.099}$| for 5D and 6D data, respectively), which is comparable to the result for the smooth-halo mock data analysis with q_true = 0.6 (see Fig. D3a–d).

Figure D7.

The same as Fig. D3 but using 5D and 6D mock data sets generated from m12m galaxy. The shaded vertical region at 0.55 < q < 0.7 corresponds to the true density flattening as inferred from the radial profile of (c/a) in Fig. D4.

The true flattening of m12m DM halo is 0.55 < q < 0.7 (see the axial ratio (c/a) in Fig. D4). Thus, the median of the posterior distributions seen in Fig. D3 is systematically larger than it should be by ∼0.1−0.2. To understand the origin of this systematic offset, we show in Fig. D8 the correlation between the total stellar mass M_star and the DM flattening q. The positive correlation between (q, M_star) can be intuitively understood: If M_star is overestimated, then the overall potential will become more flattened unless the DM halo becomes ‘less flattened’ to compensate. Thus, in our Bayesian analysis, the increase of M_star is balanced by the increase of q. We speculate that our model favours a parameter set such that the stellar mass is larger than it should be, and that results in an overestimation of q. We have confirmed that our estimate of q in m12m’s DM halo can be improved (but not perfectly) if we fix the baryonic potential to the ground-truth potential computed from the particle distribution in m12m galaxy. At this moment, it is unclear why large (q, M_star) is favoured in our analysis. It might be due to the disequilibrium of the halo, or due to the slight triaxiality of the DM halo (see Fig. D4). In any case, the lesson learned from this analysis is that it is important to have strong constraints on the baryonic mass distribution in order to accurately estimate q.

Figure D8.

The correlation between the total stellar mass M_star and the DM density flattening q seen in the posterior distribution of the mock analysis of m12m galaxy. The correct value of q is 0.55 < q < 0.7, which is shown by a grey vertical band in the left-hand panels. The correct value of M_star is M_star = 1.28 × 10¹¹M_⊙, which is shown as the horizontal solid line in lower left-hand corner and as the vertical solid line in the lower right-hand corner of each panel.