ABSTRACT

We investigate the structure of our Galaxy’s young stellar disc by fitting the distribution functions (DFs) of a new family to 5D Gaia data for a sample of |$47\, 000$| OB stars. Tests of the fitting procedure show that the young disc’s DF would be strongly constrained by Gaia data if the distribution of Galactic dust were accurately known. The DF that best fits the real data accurately predicts the kinematics of stars at their observed locations, but it predicts the spatial distribution of stars poorly, almost certainly on account of errors in the best-available dust map. We argue that dust models could be greatly improved by modifying the dust model until the spatial distribution of stars predicted by a DF agreed with the data. The surface density of OB stars is predicted to peak at |$R\simeq 5.5\, \mathrm{kpc}$|⁠, slightly outside the reported peak in the surface density of molecular gas; we suggest that the latter radius may have been underestimated through the use of poor kinematic distances. The velocity distributions predicted by the best-fitting DF for stars with measured line-of-sight velocities v reveal that the outer disc is disturbed at the level of |$10\, \mathrm{km\, s}^{-1}$| in agreement with earlier studies, and that the measured values of v have significant contributions from the orbital velocities of binaries. Hence the outer disc is colder than it is sometimes reported to be.

1 INTRODUCTION

Over the last two decades our Galaxy has been the target of intense observational activity not only on account of its interest as our home but also because of its cosmological significance: it is a uniquely accessible example of the type of galaxy that currently dominates the cosmic star-formation rate. Data from massive photometric (Schmidt et al. 2005; Skrutskie et al. 2006; Kaiser et al. 2010) spectroscopic (Steinmetz et al. 2006; De Silva et al. 2015; Majewski et al. 2017) and astrometric (Gaia Collaboration 2016) surveys are now in hand and we need to synthesize these data into a coherent physical picture of our archetypal Galaxy.

The stellar distribution of our Galaxy is dominated by the disc. Over the last half century, it has become conventional to decompose the disc into several components. On the largest scale, Gilmore & Reid (1983) pointed out that the disc is split into thin and thick components. More recently, in light of spectroscopic data rather than star-counts, the view is gaining ground that the fundamental division is between a disc of very old stars with [α/Fe] larger than the Sun and a continuously forming disc of stars with ‘normal’ values of [α/Fe] (Hayden et al. 2015; Bland-Hawthorn et al. 2019). The former, very old disc has a scale height |$z_0\gt 0.7\, \mathrm{kpc}$| at all radii, while the latter disc has a much smaller |$(\sim 0.3\, \mathrm{kpc}$|⁠) scale height, except possibly beyond the solar radius R0, where it probably flares. The thin, or ‘α-normal’ disc is naturally divided into sub-discs comprising stars of similar ages because the α-normal disc has formed continuously over |$8-10\, \mathrm{Gyr}$| and older stellar cohorts now have larger random velocities and vertical scale heights. It is also thought that older cohorts have smaller radial scale lengths.

Recently (Li & Binney 2022), we investigated the structure of our Galaxy’s stellar halo by modelling the distribution of stars identified as RR-Lyrae variables in data from the Pan-STARRS survey (Kaiser et al. 2010). In that paper, we developed a technique for modelling a Galactic component using the 5D data for stars that is available in enormous quantities from the Gaia mission (Gaia Collaboration 2021, and references therein). In this paper, we apply this technique to objects that are likely OB stars, which may be considered tracers of the youngest cohort in the disc. The structure of this component is intrinsically of great interest, but modelling it forces one to engage with problems that do not arise when modelling the RR-Lyrae population.

We model a component by fitting to the data a parametrized distribution function (DF) f(J) that is a function of the action integrals within a given model of the Galaxy’s gravitational potential. This potential itself emerges from fitting DFs for several components, both stellar and dark, to 6D Gaia data (Binney & Vasiliev 2022, hereafter BV2022). The potential is that co-operatively generated by the DFs of dark matter and all the major stellar components together with a gas disc of pre-determined structure. The models are constructed and analysed using the agama software library (Vasiliev 2019).

When we lack data for one of the six phase-space coordinates, as we do for nearly all the ∼1.3 billion stars monitored by Gaia, fitting a DF to data is more computationally demanding because one has marginalize over the unknown line-of-sight velocity v of each star. Against this significant disadvantage may be set two substantial advantages: (i) in the Early Third Data Release (EDR3) from Gaia (Gaia Collaboration 2021) contains ∼1.3 × 109 stars whereas Gaia’s Radial Velocity Sample RVS (Gaia Collaboration 2018) contains only ∼7 × 106 stars, and (ii) while the selection function (SF) relevant to 5D data is complex, it is much better known than that of the RVS (Boubert et al. 2021; Everall et al. 2021). Hence, when using 5D data significance can be attached to the density of stars in real space in a way that is not possible when using 6D data: in that case only densities in velocity space are significant.

The Galaxy’s young stellar disc is perhaps the hardest component to observe because it is largely confined to a thin layer around the Galactic plane and is therefore heavily obscured, even fairly close to the Sun. We will find that our limited knowledge of the distribution of dust through the disc severely limits our ability to determine the structure of the young disc.

This paper is structured as follows. Section 2 explains how we extracted a sample of OB stars from Gaia EDR3. In Section 3, we outline the approach to modelling 5D data that was explained in Li & Binney (2022); Appendix  A gives greater detail. Section 4 specifies the structure of the DF f(J) that we fit to our tracers of the young disc. Section 5.1 outlines the self-consistent Galaxy model and the dust model in which we place models of the young disc. In Section 6, we use mock data to discover how accurately we can determine the structure of the young disc from a sample of OB stars. In Section 7 we fit DFs to our sample of OB stars and conclude that the limited spatial extent of the sample, combined with defects in dust models, severely limit our ability to determine the large-scale structure of the young disc. In Section 7.1, we investigate how well the best-fitting models account for the data. We find that while the models reproduce the observed kinematics well, defects of the dust models prevent the DFs reproducing the distribution of stars on the sky. In Section 7.2, we compare the predictions of our models to 6D data from the LAMOST spectroscopic survey. We confirm the conclusion of Eilers et al. (2020) that stars at |$R\sim 12\, \mathrm{kpc}$| in the anticentre direction are moving systematically outwards as a consequence of large-scale disturbance of the disc. We argue that the orbital velocities of binaries make a significant contribution to the measured line-of-sight velocities of OB stars, which is why several studies have found a puzzling increase with Galactocentric radius in the in-plane velocity dispersion of young stars. Section 8 sums up and identifies useful next steps.

2 A SAMPLE OF OB STARS

We selected OB stars from the intersection of three catalogues: Gaia’s (Gaia Collaboration 2016) early third data release (Gaia Collaboration 2021), the 2MASS infrared catalogue (Skrutskie et al. 2006), and the Starhorse catalogue (Anders et al. 2019, 2021),1 which provides Bayesian fits of distances and astrophysical parameters to all stars in EDR3 brighter than G = 18.5. EDR3, which was released in December of 2020, contains precision astrometry and photometry for 1.5 billion stars. Parallaxes and proper motions have typical uncertainties of |$0.07\,$|mas and |$0.07\,$|mas|$\, {\rm yr}^{-1}$| at |$G = 17\,$|mag and |$0.5\,$|mas and |$0.5\,$|mas|$\, {\rm yr}^{-1}$| at |$G = 20\,$| mag (Gaia Collaboration 2021). EDR3 gives magnitudes in red and blue passbands Grp and Gbp in addition to the broad-band G magnitudes.

We start by selecting stars that satisfy three basic criteria (Gaia Collaboration 2018):
(1)
where AG and E(GbpGrp) are the star’s extinction and colour from the Starhorse catalogue. In order to avoid contamination by red giants and red clump stars, 2MASS photometry (Skrutskie et al. 2006) is now used to make a further selection. We consider only stars with photometric flag AAA that are blue enough to satisfy
(2)
Then, we exclude A stars by restricting the sample to stars with |$T_{\rm eff}\gt 10\, 000\,$|K in the Starhorse catalogue.

Finally, we exclude stars lying within |$3\, \mathrm{kpc}$| of the Galactic centre to avoid the barred region of the Galaxy. However, on account of the high extinction towards the centre and the density profile of the young disc, few stars are picked even in the range |$R\in (3,5)\, \mathrm{kpc}$|⁠. The final catalogue of OB stars comprises |$46\, 916$| stars. They all have apparent magnitudes brighter than G = 16.65. Fig. 1 shows their spatial distribution.

The spatial distribution of our sample of OB stars projected on to the xy plane (upper panel) and xz plane (lower panel).
Figure 1.

The spatial distribution of our sample of OB stars projected on to the xy plane (upper panel) and xz plane (lower panel).

2.1 Selecting OB stars from a model

Up to an overall normalization, the DF of the OB stars will differ negligibly from the DF of the young disc. So the sampling algorithm built into agama can be used to pick a sample of OB stars distributed throughout the Galaxy. Absolute magnitudes are assigned to these stars by sampling the luminosity function of OB stars, and using their locations x and a dust model, we compute their apparent magnitudes G. The red curve in Fig. 2 shows the G-band luminosity function that we have used. It was obtained from the PARSEC stellar evolutionary tracks (Bressan et al. 2012),2 which yield the number of stars expected in a given range of absolute magnitudes per unit mass of a population with a given star-formation history. We assumed a constant star-formation rate over the last Gyr. Using this procedure we obtained luminosity functions for the Grp, Gbp, J, H, and Ks bands in addition to the G band.

The G-band luminosity function for OB stars extracted from the PARSEC stellar evolutionary tracks.
Figure 2.

The G-band luminosity function for OB stars extracted from the PARSEC stellar evolutionary tracks.

We assign each star observational uncertainties based on its apparent magnitude3 and scatter its phase-space coordinates by these errors. Then a mock star enters the catalogue if (cf equation 1)
(3)
These criteria are simple because EDR3 contains essentially all stars brighter than G = 16.65. The restriction on δ arises because we use a dust model (Green 2018; Green et al. 2019), that was developed from photometry taken in Hawii (Kaiser et al. 2010).

3 FORMALISM

Our approach to model fitting is that developed in McMillan & Binney (2012, 2013) and implemented in the case of 5D data by Li & Binney (2022). It is based on an algorithm for determining the likelihood of data given a gravitational potential Φ(x) a DF f(J) and a selection function S(x, M) that gives the probability that a star with absolute magnitude M located at x will enter the catalogue that is being modelled. The likelihoods are used to drive a Markov-chain Monte Carlo (MCMC) exploration of the parameter space of DFs. Two aspects of the present problem require additions to the formulae in Li & Binney (2022): (i) whereas RR-Lyrae stars can be assumed to have a unique absolute magnitude, the absolute magnitudes of our OB stars span a non-negligible range (M, M+), and (ii) each star has an extinction A derived from its location x. Appendix  A derives the additional formulae.

4 DISTRIBUTION FUNCTION

BV2022 introduced a new family of DFs for disc components.
(4)
where the functions fr and fz are
(5)
This definition involves a function
(6)
of Jϕ and six parameters pr, pz, Jr0, Jz0, Jϕ0, Jv0. The parameter pr determines how rapidly the radial velocity dispersion σR declines outwards, while pz determines how rapidly σz declines with distance from the plane. The constants Jr0 and Jz0 set, respectively, the radial and vertical velocity dispersions. Changes to the constant Jv0 modify the central velocity dispersion of the component, which is not of interest for this study.
The form proposed by BV2022 for fϕ(Jϕ) yields a stellar surface density that declines monotonically with radius. Since OB stars have formed recently from molecular gas, and the surface density of gas declines steeply inside the giant molecular ring, i.e. inside |$R\sim 5\, \mathrm{kpc}$|⁠, we multiply the BV2022 form
(7)
of fϕ by
(8)
where Jtaper is a parameter that determines the radius at which the surface density peaks and Jtrans determines the steepness of the surface density’s decline interior to the peak. In the definition (7) of fϕ, M is the disc’s mass, Jϕ0 sets the disc’s asymptotic scale length, and
(9)
where Jd0 is a constant. Modifying Jd changes the disc’s central density, which is not of interest here.

We refer readers to BV2022 for more information on the physical significance of the DF’s parameters. Those that are of concern here are pr, pz, Jr0, Jz0, Jϕ0, Jtaper, and Jtrans. We fix |$J_{\rm v0}=10\, \mathrm{kpc}\, \mathrm{km\, s}^{-1}$| and |$J_{\rm d0}=20\, \mathrm{kpc}\, \mathrm{km\, s}^{-1}$|⁠.

5 POTENTIAL AND DUST MODELS

Given a DF of the form f(J), any observable can be predicted once the Galaxy’s gravitational potential and its distribution of absorbing dust are specified.

5.1 The gravitational potential

The upper panel in Fig. 3 shows the circular speed of the gravitational potential that we have used. The blue dashed line represents all the spheroidal components including dark halo, bulge, and stellar halo. The black dotted line denotes the disc components. This potential was obtained by relaxing to self-consistency an axisymmetric model Galaxy that is defined by distribution functions for dark matter, a bulge, a stellar halo, four superposed stellar discs, and a gas disc. Table 1 lists key characteristics of this model. Further information for the parameters generating this model can be found in the online supplementary material. The lower panel in Fig. 3 shows the vertical density distributions of the components at the solar radius. The dot, dashed, and dash-dotted grey lines represent young, middle-age, and old components of the thin disc, respectively. The solid grey line shows the complete thin disc. The red and magenta lines denote thick disc and stellar halo, respectively. The dark magenta line shows the vertical density of the dark halo.

The upper panel shows the circular speed vc of the potential we use. The blue dots are observational estimates obtained by Eilers et al. (2019) from red giant stars. The lower panel shows the vertical density distributions at the solar radius for the components.
Figure 3.

The upper panel shows the circular speed vc of the potential we use. The blue dots are observational estimates obtained by Eilers et al. (2019) from red giant stars. The lower panel shows the vertical density distributions at the solar radius for the components.

Table 1.

The disc mass and densities at solar radius.

Mdisc5.48 × 1010 M
ρ(R0, z = 0)0.095 Mpc−3
ρ(R0, z = 1)0.005 Mpc−3
Mdisc5.48 × 1010 M
ρ(R0, z = 0)0.095 Mpc−3
ρ(R0, z = 1)0.005 Mpc−3
Table 1.

The disc mass and densities at solar radius.

Mdisc5.48 × 1010 M
ρ(R0, z = 0)0.095 Mpc−3
ρ(R0, z = 1)0.005 Mpc−3
Mdisc5.48 × 1010 M
ρ(R0, z = 0)0.095 Mpc−3
ρ(R0, z = 1)0.005 Mpc−3

5.2 Observational dust model

Several groups have recently developed models of the Galactic distribution of dust (Sale & Magorrian 2014; Lenz, Hensley & Doré 2017; Green 2018; Green et al. 2019). The green curves in Figs 4 and 5 show extinction as a function of distance along eight lines of sight according to the Bayestar2019 model from the most recent of these studies. The left-hand panel of Fig. 4 shows the line of sight (ℓ = 0, b = 10) towards the Galactic centre and that (ℓ = 180, b = 10) towards the anticentre, while the right-hand panel shows the lines of sight vertically downwards and upwards. The upper panels of Fig. 5 are for lines of sight at |$45\,$|deg to the centre and anticentre directions while the bottom left-hand panel is for the direction that in an axisymmetric Galaxy would be equivalent to that of the middle right-hand panel. Actually, the asymptotic extinction is about six times larger at |$\ell =135\,$|deg than at |$\ell =225\,$|deg. In each panel, the cyan line shows the extinction towards extra-Galactic objects from Schlegel et al. (1998). Ideally, each green curve would asymptote at large distances to its cyan line. Unfortunately, the reality falls well short of this ideal with the value from Green et al. (2019) systematically smaller than that from Schlegel et al. (1998).

Estimates of extinction versus distance (i) in the centre and anticentre directions at $b=10\,$ deg (left-hand panel) and vertically up or down (right-hand panel). The green and orange lines are from the Bayestar2019 (Green et al. 2019) and toy models, respectively. The dotted cyan lines show the asymptotic values from Schlegel, Finkbeiner & Davis (1998).
Figure 4.

Estimates of extinction versus distance (i) in the centre and anticentre directions at |$b=10\,$| deg (left-hand panel) and vertically up or down (right-hand panel). The green and orange lines are from the Bayestar2019 (Green et al. 2019) and toy models, respectively. The dotted cyan lines show the asymptotic values from Schlegel, Finkbeiner & Davis (1998).

As Fig. 4 for lines of sight at $b=10\,$ deg and various longitudes.
Figure 5.

As Fig. 4 for lines of sight at |$b=10\,$| deg and various longitudes.

5.3 Toy dust model

Since Figs 4 and 5 suggest that even the best current extinction map is likely far from the truth, we have investigated how results obtained from mock data are affected by use of an incorrect dust model. For these tests, we used a toy dust distribution assembled by adding a spiral distribution and a local bubble to an underlying axisymmetric distribution of dust. The latter is
(10)
where |$R_s=3\, \mathrm{kpc}$| and |$z_s=0.1\, \mathrm{kpc}$| are scale length and height of the dust disc, and ρ0 = 5 is a scale density. We add a four-arm spiral pattern to this axisymmetric density distribution by multiplying it by
(11)
where β = 6, m = 4 and
(12)
with α = 4, ϕ0 = 135°, and |$R=\sqrt{x^2+y^2}$|⁠.
To simulate the local bubble we further multiply the above density by
(13)
where s is heliocentric distance and |$\sigma =0.2\, \mathrm{kpc}$| sets the scale of the bubble.

The extinction is then the integral |$A=\int {\rm d}s\, \rho (R,z,\phi)$| of ρ(R, z, ϕ) along the line of sight and we use this extinction to assign apparent magnitudes to mock stars proposed by agama. When analysing the resulting mock catalogue, we use the dust model from Green et al. (2019), which is in this case inaccurate.

The orange curves in Figs 4 and 5 show an example of extinctions from this model alongside the empirical extinctions from Green et al. (2019) and Schlegel et al. (1998). The most serious defect of the toy model is a failure to generate radically different extinctions at the longitudes |$\ell =180\pm 45\,$| deg. This failure is attributable to its spherical local bubble rather than a cavity that is bounded by a very lumpy wall (Zucker et al. 2022). However, even a simplistic toy model of the dust enables us to probe the consequences of fitting data using a defective dust model.

6 TESTS

We generated ten mock catalogues as described in Section 2.1. The parameters used to generate these catalogues are given in the top row of Table 2. Then the computer explored the likelihood in the 7D parameter space with coordinates (Jr0, Jz0, ln Jϕ0, pr, pz, ln Jtapper, ln Jtrans). Thus we were holding fixed the parameters Jd0 and Jv0 that only affect a model in the region interior to all the data. In some runs these two parameters took the values used to generate the mock data, while in other runs they were fixed at values that differed from those that generated the data. These tests showed insensitivity of results to the values of these two parameters.

Table 2.

Summary of test results. Actions are given in |$\, \mathrm{kpc}\, \mathrm{km\, s}^{-1}$|⁠.

ParametersJr0Jz0ln Jϕ0prpzln Jtaperln Jtrans
Input values2056.910.350.356.914.50
Single catalogue|$20.21^{+0.65}_{-0.66}$||$4.97^{+0.09}_{-0.09}$||$6.89^{+0.02}_{-0.01}$||$0.27^{+0.05}_{-0.05}$||$0.30^{+0.02}_{-0.03}$||$6.90^{+0.01}_{-0.01}$||$4.53^{+0.10}_{-0.12}$|
Mean of 10 catalogues19.07 ± 0.885.01 ± 0.106.91 ± 0.010.31 ± 0.060.33 ± 0.036.90 ± 0.014.42 ± 0.09
Wrong dust model|$16.87^{+1.15}_{-1.82}$||$4.18^{+0.13}_{-0.12}$||$6.76^{+0.03}_{-0.03}$||$0.48^{+0.14}_{-0.05}$||$0.39^{+0.04}_{-0.04}$||$7.01^{+0.02}_{-0.02}$||$5.11^{+0.12}_{-0.19}$|
ParametersJr0Jz0ln Jϕ0prpzln Jtaperln Jtrans
Input values2056.910.350.356.914.50
Single catalogue|$20.21^{+0.65}_{-0.66}$||$4.97^{+0.09}_{-0.09}$||$6.89^{+0.02}_{-0.01}$||$0.27^{+0.05}_{-0.05}$||$0.30^{+0.02}_{-0.03}$||$6.90^{+0.01}_{-0.01}$||$4.53^{+0.10}_{-0.12}$|
Mean of 10 catalogues19.07 ± 0.885.01 ± 0.106.91 ± 0.010.31 ± 0.060.33 ± 0.036.90 ± 0.014.42 ± 0.09
Wrong dust model|$16.87^{+1.15}_{-1.82}$||$4.18^{+0.13}_{-0.12}$||$6.76^{+0.03}_{-0.03}$||$0.48^{+0.14}_{-0.05}$||$0.39^{+0.04}_{-0.04}$||$7.01^{+0.02}_{-0.02}$||$5.11^{+0.12}_{-0.19}$|
Table 2.

Summary of test results. Actions are given in |$\, \mathrm{kpc}\, \mathrm{km\, s}^{-1}$|⁠.

ParametersJr0Jz0ln Jϕ0prpzln Jtaperln Jtrans
Input values2056.910.350.356.914.50
Single catalogue|$20.21^{+0.65}_{-0.66}$||$4.97^{+0.09}_{-0.09}$||$6.89^{+0.02}_{-0.01}$||$0.27^{+0.05}_{-0.05}$||$0.30^{+0.02}_{-0.03}$||$6.90^{+0.01}_{-0.01}$||$4.53^{+0.10}_{-0.12}$|
Mean of 10 catalogues19.07 ± 0.885.01 ± 0.106.91 ± 0.010.31 ± 0.060.33 ± 0.036.90 ± 0.014.42 ± 0.09
Wrong dust model|$16.87^{+1.15}_{-1.82}$||$4.18^{+0.13}_{-0.12}$||$6.76^{+0.03}_{-0.03}$||$0.48^{+0.14}_{-0.05}$||$0.39^{+0.04}_{-0.04}$||$7.01^{+0.02}_{-0.02}$||$5.11^{+0.12}_{-0.19}$|
ParametersJr0Jz0ln Jϕ0prpzln Jtaperln Jtrans
Input values2056.910.350.356.914.50
Single catalogue|$20.21^{+0.65}_{-0.66}$||$4.97^{+0.09}_{-0.09}$||$6.89^{+0.02}_{-0.01}$||$0.27^{+0.05}_{-0.05}$||$0.30^{+0.02}_{-0.03}$||$6.90^{+0.01}_{-0.01}$||$4.53^{+0.10}_{-0.12}$|
Mean of 10 catalogues19.07 ± 0.885.01 ± 0.106.91 ± 0.010.31 ± 0.060.33 ± 0.036.90 ± 0.014.42 ± 0.09
Wrong dust model|$16.87^{+1.15}_{-1.82}$||$4.18^{+0.13}_{-0.12}$||$6.76^{+0.03}_{-0.03}$||$0.48^{+0.14}_{-0.05}$||$0.39^{+0.04}_{-0.04}$||$7.01^{+0.02}_{-0.02}$||$5.11^{+0.12}_{-0.19}$|

The SLSQP method in the scipy (Virtanen et al. 2020) package was first used to find the maximum of the likelihood and then the emcee package (Foreman-Mackey et al. 2013) was used to run a Markov Chain Monte Carlo exploration starting from the maximum likelihood. We used 35 walkers for each parameter over a total of 500 steps with the first 100 steps treated as burn-in.4 Fig. 6 shows a typical set of projections of the posterior probability distribution. The blue lines in each panel indicate the true parameter values while the dashed lines in the histograms show the 1-σ uncertainties from the 16th and 84th percentiles. The second row of Table 2 lists these uncertainties of the posterior distribution yielded by one catalogue, while the table’s third row gives the means and standard deviations of the means of the ten posterior distributions. The standard deviations are close to the 1-σ uncertainties from individual posterior distributions, but in most cases are slightly larger, as would be expected if the distributions had longer tails than a Gaussian.

Posterior probability distributions from a mock catalogue.
Figure 6.

Posterior probability distributions from a mock catalogue.

Only pr and pz have true values that lie outside the 1-σ range inferred from a single catalogue and even these true values lie within the 1-σ ranges inferred from ten catalogues. Thus the MCMC runs are delivering consistent results. Moreover, the precision with which parameters can be recovered is impressive – the uncertainty of the actions other than Jr0 is |$\lesssim 2$| per cent, while that of Jr0 is better than 4 per cent.

6.1 Correlations between parameters

Fig. 6 shows just two significant correlations between parameters, namely between Jr0 and pr and between Jz0 and pz. Jr0 sets the magnitude of the radial velocity dispersion σR, while pr sets the radial gradient of σR. Jz0 and pz similarly set the magnitude and radial gradient of σz. In the upper panel of Fig. 7 we plot σR versus R for three DFs. Also plotted as a chained black line is the radial density of mock stars in the catalogue. We see that very similar values of σR are obtained within the well sampled radial range using values of pr that range from 0 up to 0.6 by compensating for the change in pr by a change in Jr0. Interior to the well sampled range the three models predict very different values for σR, so they could be distinguished if the catalogue contained a significant number of stars at |$R\lt 3\, \mathrm{kpc}$|⁠. The lower panel of Fig. 7 makes the same point in the context of σz. Hence the strong degeneracies evident in Fig. 6 reflect the lack of stars at |$R\lesssim 3\, \mathrm{kpc}$|⁠.

Full lines: the radial and vertical velocity dispersions σR and σz as functions of radius R in the Galactic plane for several models. Broken lines: the surface density Σ(R) of OB stars – the densities of all models considered are indistinguishable.
Figure 7.

Full lines: the radial and vertical velocity dispersions σR and σz as functions of radius R in the Galactic plane for several models. Broken lines: the surface density Σ(R) of OB stars – the densities of all models considered are indistinguishable.

The uncertainties in Jz0 and pz are smaller than those in Jr0 and pr because the greatest differences in R are achieved towards the anticentre, where vR dominates v, which is not in the catalogue, while vz dominates vb, which is in the catalogue.

6.2 Impact of the dust model

Now we explore how using an incorrect dust model affects the posterior distribution of Jϕ0 and Jtaper. Fig. B1 shows the posterior probabilities obtained when a mock catalogue created using the toy dust model of Section 5.3 is analysed using the dust model of Green et al. (2019). The bottom row of Table 2 lists the corresponding statistics.

The formal uncertainties on the parameters, ln Jϕ0, ln Jtaper, and ln Jtrans that determine the stellar surface density increase only moderately but the true value of ln Jϕ0 now lies 0.12 above its 1-σ range, while the true values of ln Jtaper and ln Jtrans now lie 0.08 and 0.42 below their 1-σ ranges. The lower left-hand panel of Fig. 5 shows that around |$\ell =135\,$|deg, the toy model predicts much lower extinction than the Green et al. (2019) model. Low extinction enhances the number of mock stars, and to account for these stars in the presence of the larger observed extinction, the disc must be imagined more extended than it is. Hence, the recovered value of Jϕ0 should be larger than the true value. That this is not the case must be attributed to the unrealistically large values of Jtaper and Jtrans: increasing Jtaper pushes outwards the flat part of the surface density, and increasing Jtrans extends the flat section.

The formal uncertainties on Jr0,Jz0, pr, and pz all increase significantly and the true values of both Jr0 and Jz0 now lie above their 1-σ ranges, while the true values of pr and pz still lie within their 1-σ ranges.

This experiment strongly suggests (i) that the formal uncertainties on the recovered parameters of the DF materially underestimate the true uncertainties because the latter are dominated by the uncertain distribution of dust, and (ii) that degeneracies between the parameters make it hard to predict how recovered parameters will be affected by a change in the dust model.

7 FITS TO GAIA DATA

When the code was used for an MCMC search of fits to the real data, the chain showed a long-term drift towards larger Jr0 and smaller Jϕ0. In light of this drift, the MCMC chain was extended to 1500 steps, three times the number of steps used for the mock data, but without convincing evidence emerging that the chain’s long-term drift had ceased. As discussed below, we think this poor performance of the code on real data is a consequence of our use of an erroneous dust model. On account of the latter, no DF gives a convincing fit to all the data. Different DFs fit different parts of the data and the code wanders from one unsatisfactory DF to another without finding a convincing maximum in the data’s likelihood.

Here we present results from a near-converged section of ∼300 steps. These results illustrate the quality of fit to the data that can be achieved with the present dust model, and provide an indication of the parameter choices that might be found when a satisfactory dust model becomes available. Fig. 8 shows posterior distributions from the chosen section of the chain, while Table 3 gives the means and 1-σ uncertainties of these distributions. It should be borne in mind that the true uncertainties will be significantly larger than those quoted here on account of the premature truncation of the MCMC chain.

Posterior probability distributions from the real catalogue.
Figure 8.

Posterior probability distributions from the real catalogue.

Table 3.

Statistics of the probability distributions from fits to Gaia OB stars. Jd0 and Jv0 are kept fixed at |$20\, \mathrm{km\, s}^{-1}$| and |$10\, \mathrm{km\, s}^{-1}$|⁠, respectively.

ParametersJr0Jz0ln Jϕ0prpzln Jtaperln JtransJd0Jv0
Fitted result|$47.09^{+0.77}_{-0.87}$||$5.90^{+0.03}_{-0.03}$||$6.77^{+0.01}_{-0.01}$||$-1.48^{+0.02}_{-0.02}$||$-1.40^{+0.03}_{-0.02}$||$7.05^{+0.03}_{-0.01}$||$6.74^{+0.02}_{-0.01}$|2010
ParametersJr0Jz0ln Jϕ0prpzln Jtaperln JtransJd0Jv0
Fitted result|$47.09^{+0.77}_{-0.87}$||$5.90^{+0.03}_{-0.03}$||$6.77^{+0.01}_{-0.01}$||$-1.48^{+0.02}_{-0.02}$||$-1.40^{+0.03}_{-0.02}$||$7.05^{+0.03}_{-0.01}$||$6.74^{+0.02}_{-0.01}$|2010
Table 3.

Statistics of the probability distributions from fits to Gaia OB stars. Jd0 and Jv0 are kept fixed at |$20\, \mathrm{km\, s}^{-1}$| and |$10\, \mathrm{km\, s}^{-1}$|⁠, respectively.

ParametersJr0Jz0ln Jϕ0prpzln Jtaperln JtransJd0Jv0
Fitted result|$47.09^{+0.77}_{-0.87}$||$5.90^{+0.03}_{-0.03}$||$6.77^{+0.01}_{-0.01}$||$-1.48^{+0.02}_{-0.02}$||$-1.40^{+0.03}_{-0.02}$||$7.05^{+0.03}_{-0.01}$||$6.74^{+0.02}_{-0.01}$|2010
ParametersJr0Jz0ln Jϕ0prpzln Jtaperln JtransJd0Jv0
Fitted result|$47.09^{+0.77}_{-0.87}$||$5.90^{+0.03}_{-0.03}$||$6.77^{+0.01}_{-0.01}$||$-1.48^{+0.02}_{-0.02}$||$-1.40^{+0.03}_{-0.02}$||$7.05^{+0.03}_{-0.01}$||$6.74^{+0.02}_{-0.01}$|2010

In Fig. 8, the histograms are less Gaussian than the corresponding histograms in Fig. 6, and in the off-diagonal panels the regions of high likelihood are typically much more elongated than the corresponding regions. We attribute both phenomena to the failure of the chain to converge on account of the poor dust model.

7.1 Fit quality

A natural first question is whether the models with the largest recovered likelihoods give an acceptable account of the data. Figs 9 and 10 are histograms of velocity in the b and ℓ directions, respectively for stars binned by radius. The red histograms are for real stars and the blue histograms are for a sample of mock stars drawn from the model that gives the real data the highest likelihood. While the red and blue histograms agree moderately well in Fig. 9 for vb, they are disturbingly different in Fig. 10 for v. Figs 11 and 12 shed light on these differences by showing analogous histograms when we use the same model to choose a velocity for each real star. Now the real and mock distributions agree at least as well in v as in vb. Evidently, the clash in Fig. 10 arises because the real and mock stars have systematically different locations; at a given place, the model predicts velocities accurately, but it makes poor choices for the locations of stars.

Histograms of vb for real data (red) and for mock stars drawn from the best-fitting model. For each bin we give the median R value of the mock stars.
Figure 9.

Histograms of vb for real data (red) and for mock stars drawn from the best-fitting model. For each bin we give the median R value of the mock stars.

As Fig. 9 but for vℓ.
Figure 10.

As Fig. 9 but for v.

As Fig. 9 except that the blue histogram is computed from velocities assigned by sampling the best-fitting model at the locations of real rather than mock stars.
Figure 11.

As Fig. 9 except that the blue histogram is computed from velocities assigned by sampling the best-fitting model at the locations of real rather than mock stars.

As Fig. 11 for the vℓ component.
Figure 12.

As Fig. 11 for the v component.

Fig. 13 drives this point home by showing the real and mock distributions of stars in the Galactocentric angle ϕ. In general, the real distribution is narrower and shifted to larger ϕ than the mock distribution. Since differential rotation of the disc makes v a strong function of ϕ, different distributions in ϕ inevitably leads to a different distributions in v.

The distribution of ϕ for both best-fitting mock and real catalogue stars. The bins are the same as for Figs 10 and 9.
Figure 13.

The distribution of ϕ for both best-fitting mock and real catalogue stars. The bins are the same as for Figs 10 and 9.

The prime suspect for causing these differences in the distribution of ϕ between real and mock stars must be the dust model. Indeed, our axisymmetric DF has very little capacity to modify the distribution in ϕ. Figs 4 and 5 by contrast demonstrate that dust violently breaks the symmetry in ϕ. If the dust model breaks this symmetry incorrectly, the mock stars will not reproduce the observed bias in ϕ, and the predicted values of v will be wrongly distributed.

Of course dust also has a strong influence on the radial distribution of catalogued stars. The red and blue histograms in Fig. 14 show the real and mock distributions in R. The mock distribution is broader than the real one because it extends to smaller radii. This finding suggests that the extinctions provided by the dust model tend to be too small in directions towards the Galactic Centre.

The distribution in R of the real stars (red) and mock stars (blue) drawn from the best-fitting model.
Figure 14.

The distribution in R of the real stars (red) and mock stars (blue) drawn from the best-fitting model.

Changing the radial distribution of OB stars is very much within the scope of the DF, so it is perhaps puzzling that the best-fitting model does not reproduce the observed distribution in R better. The answer may be that when the DF changes the radial distribution of stars, for example by changing the value of Jtaper, it will also change the kinematics. So the need to fit the observed kinematics can push the DF to predict (correctly) an abundance of stars at small radii that do not feature in the data, but do in a mock catalogue drawn using extinctions that are too small.

The curve Fig. 15 shows the surface density of the best-fitting model. The density peaks at |$R\sim 5.5\, \mathrm{kpc}$|⁠, which is now thought to be close to the bar’s corotation radius (Pérez-Villegas et al. 2017; Binney 2020; Chiba, Friske & Schönrich 2021). The density drops faster from the peak inwards than outwards. The blue dots in Fig. 15 show the surface density of the young disc estimated by Xiang et al. (2018) using a catalogue from the LAMOST survey. They found a significant peak at the solar radius which they suggests reflects the Local Arm but they were unable to probe the disc interior to the Sun on account of LAMOST’s bias towards the anticentre. Bovy et al. (2016) used red clump stars in APOGEE to assess the structure of mono-abundance populations (MAPs). The metal-rich, low-α population to which the OB stars belong, was found to have a ‘broken’ exponential radial profile, with the break at |$R=7\, \mathrm{kpc}$|⁠. Mackereth et al. (2017), using red giants in APOGEE as the tracers, found that the youngest population has a break radius at |$R\simeq 8\, \mathrm{kpc}$|⁠. The surface density of our best-fitting model shown in Fig. 15 is broadly in line with these earlier results, although its break radius is somewhat smaller.

The curve shows the surface density of the best-fitting model of the young disc. The blue data points show the surface densities derived by Xiang et al. (2018) from LAMOST data. The normalization of the model is reduced by a factor 3 to facilitate the comparison of data and model.
Figure 15.

The curve shows the surface density of the best-fitting model of the young disc. The blue data points show the surface densities derived by Xiang et al. (2018) from LAMOST data. The normalization of the model is reduced by a factor 3 to facilitate the comparison of data and model.

The predicted peak in the surface density of OB stars at |$R\simeq 5.5\, \mathrm{kpc}$| falls outside the reported peak at |$R=4-5\, \mathrm{kpc}$| in the surface density of H2 (Heyer & Dame 2015). While studies of the stellar disc are liable to overestimate the radius of the peak stellar density because the innermost young disc is hidden behind the abundant dust near the plane at |$R\lesssim 6\, \mathrm{kpc}$|⁠, the radius at which the density of H2 peaks may have been underestimated. Indeed, hydrodynamical models (e.g. Sormani, Binney & Magorrian 2015) predict the density of gas to be low inside the bar’s corotation radius, which is currently believed to be as large as |$6\, \mathrm{kpc}$| (Pérez-Villegas et al. 2017; Binney 2020; Chiba et al. 2021). A circular-speed curve is required to convert an observed distribution of emission-line intensity in longitude and velocity into a plot of surface density versus radius. Perhaps when the emission-line data are re-analysed using a circular-speed curve extracted from Gaia, the H2 distribution will be found to peak at a larger radius.

7.2 Comparison of the LAMOST

Our model predicts distributions of space velocities and in this section we compare these predictions with distributions inferred by combining spectroscopic measurements of v with EDR3 astrometry. Specifically, many of the OB stars we have identified in the intersection of EDR3, 2MASS, and the Starhorse catalogue were observed by LAMOST (Xiang et al. 2021) so have measured values of v. We select stars within |$\left| b\right| \lt 5\,$| deg and |$\epsilon _v\lt 50\, \mathrm{km\, s}^{-1}$| to exclude stars with poorly determined v. We also shift the LAMOST values of v by |$4.54\, \mathrm{km\, s}^{-1}$| as suggested by Schönrich & Aumer (2017), Anguiano et al. (2018). After such corrections, LAMOST’s values of v agree well with the values measured for the same stars by APOGEE and Galah.

Fig. 16 shows the distributions of the corrected values of v when the stars are divided into four bins in R with the distributions of v obtained by sampling the best-fitting model at the locations of observed stars. The observed and mock distributions have similar shapes but are offset from one another by an amount that grows with RR0. In principle, these offsets could arise through biased distances for the stars causing Galactic rotation to make erroneous contributions to the mock values of v. We found that systematically increasing distances by 10 per cent shifted even the mock histogram for the furthest bin by only |$\sim 1.5\, \mathrm{km\, s}^{-1}$|⁠, so neither erroneous distances nor faulty data reduction can explain the measured offsets.

Observed and mock histograms of v∥ in four bins in R.
Figure 16.

Observed and mock histograms of v in four bins in R.

Fig. 17 compares observed and mock distributions of vR at the locations of LAMOST stars. This comparison is more sensitive to adopted distances than the above comparison of v values because vR has contributions from v and vb, which are proportional to distance. These contributions are small, however, because all stars lie near the anticentre. The mock distributions from our axisymmetric model are inevitably symmetric in vR, so the comparison highlights the asymmetry in the observed distributions. The asymmetry is negligible for the nearest sample because the U component of the solar velocity is chosen to eliminate this asymmetry in the wider populations of disc stars. The observed systematic increase in asymmetry with RR0 must arise from some combination of spiral structure and large-scale distortion of the disc by the Sgr Dwarf galaxy, which has been extensively discussed in connection with the Galactic warp and the phase spiral (Jiang & Binney 2000; Antoja et al. 2018; Binney & Schönrich 2018; Bland-Hawthorn et al. 2019; Laporte et al. 2019). Fig. 17 is consistent with what one would expect from the map ov vR deduced by Eilers et al. (2020) from data for red giant stars.

Orange histograms: the distributions of vR predicted by the best-fitting model in the plane at $R=8.75,\, 9.25,\, 9.75,$ and $10.25\, \mathrm{kpc}$. Grey histograms: distributions of vR for the OB stars in the LAMOST catalogue that lie in these spatial bins. The number of stars in each bin is 2097, 3572, 4268, and 3215.
Figure 17.

Orange histograms: the distributions of vR predicted by the best-fitting model in the plane at |$R=8.75,\, 9.25,\, 9.75,$| and |$10.25\, \mathrm{kpc}$|⁠. Grey histograms: distributions of vR for the OB stars in the LAMOST catalogue that lie in these spatial bins. The number of stars in each bin is 2097, 3572, 4268, and 3215.

Fig. 18 compares our model’s predictions (curves) for the velocity dispersions σz and σR with results from OB stars in LAMOST. The data for σz (red line and points) are in reasonable concordance, although the LAMOST data show a weaker outward decline than the model predicts. Fig. 19 provides evidence that the steeper gradient of the model is required by the EDR3 data by comparing the model prediction (dashed line) with the dispersion in vb, which for these low-latitude stars differs little from vz.

The curves show the profiles of the velocity dispersions σz (red) and σR (black) predicted by the best-fitting model. The dots show values derived from OB stars in LAMOST.
Figure 18.

The curves show the profiles of the velocity dispersions σz (red) and σR (black) predicted by the best-fitting model. The dots show values derived from OB stars in LAMOST.

The curve shows σz as a function of R in the Galactic plane from the best-fitting model. The dots show vb for real stars in Gaia EDR3 with $|b|\lt 5\,$ deg.
Figure 19.

The curve shows σz as a function of R in the Galactic plane from the best-fitting model. The dots show vb for real stars in Gaia EDR3 with |$|b|\lt 5\,$| deg.

In Fig. 18, the black points and full curve for σR are starkly incompatible. This conflict signals that the vR distribution has been broadened as well as shifted to negative values. The points in the upper panel of Fig. 20 show the ratio σRϕ as a function of R for LAMOST stars, while the black line shows the prediction of the best-fitting model. The data points climb away from the model’s line as RR0 grows. A classical result of stellar dynamics is that σRϕ is largely set by the shape of the circular-speed curve: when the latter is flat, any thin-disc population will have σRϕ ≃ √2 (e.g. Binney & Tremaine 2008). The LAMOST data imply values of σRϕ that reach well above 2. No equilibrium model could match these values.

σR/σϕ and σz/σϕ plotted against R for OB stars in LAMOST using Starhorse distances.
Figure 20.

σRϕ and σzϕ plotted against R for OB stars in LAMOST using Starhorse distances.

7.3 Impact of binaries

We now ask whether binary stars can explain the anomalously large values of σRϕ plotted in Fig. 20.

Massive stars usually have a massive binary companion, but we find that only ∼10 per cent of the OB stars in our sample have binary companions resolved by Gaia. Taking the angular resolution of Gaia to be |$0.1\,$|arcsec, it follows that the majority of these stars must be in binaries with separations
(14)
Consequently, their orbital velocities satisfy
(15)
The least massive B star has |$M\sim 3.8\, {\rm M}_\odot$| (Binney & Merrifield 1998), so binary velocities in excess of |$10\, \mathrm{km\, s}^{-1}$| must be common.
The velocities of binaries will be isotropically distributed, so the mean-square value of the component along the line of sight is |$v_{\parallel \rm b}^2=v_{\rm b}^2/3$|⁠. The mean-square velocity of the primary is |$M_2^2/(M_1+M_2)^2$| times this, so smaller by a factor of at least 4. If for simplicity we assume that light from the primary dominates the spectrum, the contribution to the measured value of |$v_\parallel ^2$| is
(16)
The distribution of binary separations determined by Duchêne & Kraus (2013) and Gravity Collaboration (2018) in a study of the Orion nebula yields |$\big \langle 1/r\big \rangle \simeq 5\, \hbox{AU}^{-1}$|⁠, so
(17)
Given that the product of masses in this equation is of the order of unity and that σR is dominated by v, it is very much to be expected that binaries will cause the measured value of σRϕ to be substantially higher than expected in a model that ignores binaries. Fig. 18 suggests that binaries set a floor value, |$\sigma _R\simeq 20\, \mathrm{km\, s}^{-1}$|⁠, to σR, which is smaller by a factor ∼2 than the value suggested by the calculation above. Our value of 〈1/r〉 may well be too large because it is dominated by the small fraction of very close binaries, and is correspondingly uncertain.

8 CONCLUSIONS

We extracted a sample of |$\sim 47\, 000$| OB stars from the intersection of Gaia EDR3 with the 2MASS and Pan-STARRS surveys. For this sample, we have photometry plus 5D astrometry. We used the algorithm developed in Li & Binney (2022) in connection with similar data for a sample of RR-Lyrae stars to fit DFs of the form f(J) to the OB population. Since the surface density of the young disc is expected to peak at a radius of a few kpc, we introduced two new parameters to the disc DFs proposed by BV2022. Small extensions of the algorithm were required to deal with (i) the finite spread in luminosity of the sampled stars, and (ii) extinction by dust. Tests of the algorithm on similar pseudo-data showed that the parameters of the true DF can be recovered with remarkable precision when the distribution of dust is accurately known. When a significantly incorrect dust distribution is used to analyse the data, the true parameters can lie several sigmas above or below the probable range returned by the algorithm.

When the algorithm is run on the real data using the dust model of Green et al. (2019), the most probable DF yields pseudo-data that are not correctly distributed on the sky. The source of this discrepancy is almost certainly imperfections of the dust model used. Since the distribution of velocities depends quite strongly on location, discrepancies in the spatial distribution of stars automatically leads to discrepancies between the predicted and measured velocity distributions. When the most probable DF is used to sample velocities at the observed locations of stars rather than at the locations of pseudo-stars, the observed and predicted velocity distributions agree well. These tests show that a sample of OB stars like the present one would pin down the phase-space structure of the young stellar disc with impressive accuracy if the 3D distribution of dust were accurately known. The discrepancies between the distributions of pseudo-stars and real stars constitute clear evidence that the best current dust model is significantly flawed.

Although the functional form of the DF provides great flexibility in the radial distribution of stars, the best-fitting DF predicts a distribution of OB stars that extends to smaller radii than the observed sample. Since the kinematics of a stellar disc are not independent of its surface-density profile, the MCMC search may favour discs that extend unexpectedly far in because the kinematics of such discs may fit the data better. Moreover, a dust model that included more dust interior to the Sun would make the data consistent with more extended discs. Hence, we consider it likely that there is more dust at |$R\lesssim 7\, \mathrm{kpc}$| than the Bayestar2019 model envisages.

The most probable DF predicts the distributions of the space velocities of OB stars. We have compared these predictions with the distributions of the sub-sample of EDR3 OB stars that have measured line-of-sight velocities because they fell within the LAMOST survey. The predicted and measured dispersions σz agree fairly well, although the measured values of σz are flat beyond |$R\sim 10\, \mathrm{kpc}$| while the predicted values decrease monotonically outwards. The measured values of σR do not decrease outwards as the model predicts – they even increase slightly. Moreover, the median value of vR drifts upwards from near zero at R0 to |$\overline{v}_R\sim 10\, \mathrm{km\, s}^{-1}$| at |$R\sim 12\, \mathrm{kpc}$|⁠. We concluded that this trend in |$\overline{v}_R$| is not an artefact induced by erroneous distances. In particular, it is associated with similar offsets between the measured line-of-sight velocities and those predicted by the model. It seems that some combination of spiral structure and disturbance by the gravitational field of the Sgr dwarf galaxy is changing velocities by |$\gtrsim 10\, \mathrm{km\, s}^{-1}$| between here and |$R\sim 12\, \mathrm{kpc}$|⁠.

The ratio σRϕ, which is strongly constrained by the shape of the circular-speed curve, is unexpectedly large for the LAMOST stars. We argued that this result arises naturally from the majority of the stars being in binaries too tight for Gaia to resolve. The line-of-sight velocity of such a star will differ from its barycentric velocity by a fraction of the binary velocity because the LAMOST data for most stars are based on a single epoch. The EDR3 proper motion, by contrast, is obtained by fitting a curve through the star’s observed positions at ∼40 epochs, so will be barely affected by their binary nature. The line-of-sight velocity of distant LAMOST OB stars feeds strongly into vR rather than vϕ because they are located close towards the anticentre. Hence binaries will push σRϕ above the value predicted for single stars.

The key to improving our understanding of the Galaxy’s young disc is construction of a better map of the distribution of dust. Traditionally dust is mapped by measuring extinctions for stars with known distances and the advent of vast number of precise parallaxes for stars within a few kiloparsecs of the Sun has breathed new life into this field (Bailer-Jones 2011; Sale & Magorrian 2014; Green et al. 2019; Lallement et al. 2022). The major problems now are (i) the determination of extinctions for large numbers of stars, and (ii) synthesizing the extinction measures into a coherent picture of the dust distribution. This work suggests an approach to dust mapping that dispenses with extinctions measures of individual stars. The stars of a stellar populations that is more than |$\sim 200\, \mathrm{Myr}$| old have to be fairly smoothly distributed in phase space. Moreover, the population’s velocity distribution at one location strongly constrains the population’s DF, and therefore its spatial distribution. Moreover, the velocity distribution at a location can be determined without knowledge of the column of dust through which we see that location because obscuration is velocity-independent. Hence, an exercise along the present lines could yield fairly firm predictions for the spatial distribution of stars, and a dust model could be strongly constrained by comparing this predicted distribution in (ℓ, b, ϖ, μ, μb, G, colours) with the observed star density.

A great advantage of such an approach is that it could exploit all the |$\gt 1.3\,$| billion stars tracked by Gaia, and thus produce dust maps of unprecedented spatial resolution. To implement this idea, it is necessary both to confront the computational challenge of simultaneously exploring high-dimensional parameter spaces of DFs and dust models, and to obtain a good model of Gaia’s selection function. There is an excellent prospect that implementation will soon be feasible.

SUPPORTING INFORMATION

src.zip

Please note: Oxford University Press is not responsible for the content or functionality of any supporting materials supplied by the authors. Any queries (other than missing material) should be directed to the corresponding author for the article.

ACKNOWLEDGEMENTS

We thank the anonymous referee for the suggestions to improve this paper. CL and JB are supported by the UK Science and Technology Facilities Council under grant number ST/N000919/1. JB also acknowledges support from the Leverhulme Trust through an Emeritus Fellowship.

This work presents results from the European Space Agency (ESA) space mission Gaia. Gaia data are being processed by the Gaia Data Processing and Analysis Consortium (DPAC). Funding for the DPAC is provided by national institutions, in particular the institutions participating in the Gaia MultiLateral Agreement (MLA). The Gaia mission website is https://www.cosmos.esa.int/gaia. The Gaia archive website is https://archives.esac.esa.int/gaia.

This publication makes use of data products from the Two Micron All Sky Survey, which is a joint project of the University of Massachusetts and the Infrared Processing and Analysis Center/California Institute of Technology, funded by the National Aeronautics and Space Administration and the National Science Foundation.

DATA AVAILABILITY

The agama  source codes and model parameter file are available in the online supplementary material. The data underlying this article will be shared on reasonable request to the corresponding author.

Footnotes

1

Gaia@AIP Services at https://gaia.aip.de/

3

In fact, this is not true especially for the proper motions because of the different celestial frames of Gaia bright and faint stars, respectively (Cantat-Gaudin & Brandt 2021). However, since the Astrometry Spread Function module in Gaia-verse package (Everall et al. 2021) only works for GaiaDR2 now, the simulated astrometric solutions are deviated from those in Gaia EDR3. As a result, we use some simple randomized errors based on apparent magnitudes instead.

4

The convergence was tested by computing the autocorrelation time recommended by emcee. The average autocorrelation time for all the parameters are about 25 which means the burn-in steps are reasonable. We also test a longer chain with N = 3000 which is more than 100 times the autocorrelation time. It yielded similar results to those obtained with N = 500 and in the interests of economy we use N = 500 here.

6

The overall count of 77 × 35 = 2695 evaluations of the DF for each star.

REFERENCES

Anders
F.
et al. ,
2019
,
A&A
,
628
,
A94

Anders
F.
et al. ,
2022
,
A&A
,
658
,
A91

Anguiano
B.
et al. ,
2018
,
A&A
,
620
,
A76

Antoja
T.
et al. ,
2018
,
Nature
,
561
,
360

Bailer-Jones
C. A. L.
,
2011
,
MNRAS
,
411
,
435

Binney
J.
,
2020
,
MNRAS
,
495
,
895

Binney
J.
,
Merrifield
M.
,
1998
,
Galactic Astronomy
.
Princeton Univ. Press
,
Princeton

Binney
J.
,
Schönrich
R.
,
2018
,
MNRAS
,
481
,
1501

Binney
J.
,
Tremaine
S.
,
2008
,
Galactic Dynamics: Second Edition
.
Princeton Univ. Press
,
Princeton

Binney
J.
,
Vasiliev
E.
,
2022
,
preprint (arXiv:2206.03523)

Bland-Hawthorn
J.
et al. ,
2019
,
MNRAS
,
486
,
1167

Boubert
D.
,
Everall
A.
,
Fraser
J.
,
Gration
A.
,
Holl
B.
,
2021
,
MNRAS
,
501
,
2954

Bovy
J.
,
Rix
H.-W.
,
Schlafly
E. F.
,
Nidever
D. L.
,
Holtzman
J. A.
,
Shetrone
M.
,
Beers
T. C.
,
2016
,
ApJ
,
823
,
30

Bressan
A.
,
Marigo
P.
,
Girardi
L.
,
Salasnich
B.
,
Dal Cero
C.
,
Rubele
S.
,
Nanni
A.
,
2012
,
MNRAS
,
427
,
127

Cantat-Gaudin
T.
,
Brandt
T. D.
,
2021
,
A&A
,
649
,
A124

Chiba
R.
,
Friske
J. K. S.
,
Schönrich
R.
,
2021
,
MNRAS
,
500
,
4710

De Silva
G. M.
et al. ,
2015
,
MNRAS
,
449
,
2604

Duchêne
G.
,
Kraus
A.
,
2013
,
ARA&A
,
51
,
269

Eilers
A.-C.
,
Hogg
D. W.
,
Rix
H.-W.
,
Ness
M. K.
,
2019
,
ApJ
,
871
,
120

Eilers
A.-C.
,
Hogg
D. W.
,
Rix
H.-W.
,
Frankel
N.
,
Hunt
J. A. S.
,
Fouvry
J.-B.
,
Buck
T.
,
2020
,
ApJ
,
900
,
186

Everall
A.
,
Boubert
D.
,
Koposov
S. E.
,
Smith
L.
,
Holl
B.
,
2021
,
MNRAS
,
502
,
1908

Foreman-Mackey
D.
,
Hogg
D. W.
,
Lang
D.
,
Goodman
J.
,
2013
,
PASP
,
125
,
306

Gaia Collaboration
,
2016
,
A&A
,
595
,
A1

Gaia Collaboration
,
2018
,
A&A
,
616
,
A11

Gaia Collaboration
,
2021
,
A&A
,
649
,
A1

Gilmore
G.
,
Reid
N.
,
1983
,
MNRAS
,
202
,
1025

Gravity Collaboration
,
2018
,
A&A
,
620
,
A116

Green
G. M.
,
2018
,
J. Open Source Softw.
,
3
,
695

Green
G. M.
,
Schlafly
E.
,
Zucker
C.
,
Speagle
J. S.
,
Finkbeiner
D.
,
2019
,
ApJ
,
887
,
93

Hayden
M. R.
et al. ,
2015
,
ApJ
,
808
,
132

Heyer
M.
,
Dame
T. M.
,
2015
,
ARA&A
,
53
,
583

Jiang
I.-G.
,
Binney
J.
,
2000
,
MNRAS
,
314
,
468

Kaiser
N.
et al. ,
2010
, in
Stepp
L. M.
,
Gilmozzi
R.
,
Hall
H. J.
, eds,
Proc. SPIE Conf. Ser. Vol. 7733, Ground-based and Airborne Telescopes III
.
SPIE
,
Bellingham
, p.
77330E

Lallement
R.
,
Vergely
J. L.
,
Babusiaux
C.
,
Cox
N. L. J.
,
2022
,
A&A
,
661
,
 A147

Laporte
C. F. P.
,
Minchev
I.
,
Johnston
K. V.
,
Gómez
F. A.
,
2019
,
MNRAS
,
485
,
3134

Lenz
D.
,
Hensley
B. S.
,
Doré
O.
,
2017
,
ApJ
,
846
,
38

Li
C.
,
Binney
J.
,
2022
,
MNRAS
,
510
,
4706

Mackereth
J. T.
et al. ,
2017
,
MNRAS
,
471
,
3057

Majewski
S. R.
et al. ,
2017
,
AJ
,
154
,
94

McMillan
P. J.
,
Binney
J.
,
2012
,
MNRAS
,
419
,
2251

McMillan
P. J.
,
Binney
J. J.
,
2013
,
MNRAS
,
433
,
1411

Pérez-Villegas
A.
,
Portail
M.
,
Wegg
C.
,
Gerhard
O.
,
2017
,
ApJ
,
840
,
L2

Sale
S. E.
,
Magorrian
J.
,
2014
,
MNRAS
,
445
,
256

Schlegel
D. J.
,
Finkbeiner
D. P.
,
Davis
M.
,
1998
,
ApJ
,
500
,
525

Schmidt
B. P.
,
Keller
S. C.
,
Francis
P. J.
,
Bessell
M. S.
,
2005
,
American Astronomical Society Meeting Abstracts #206
,
15.09

Schönrich
R.
,
2012
,
MNRAS
,
427
,
274

Schönrich
R.
,
Aumer
M.
,
2017
,
MNRAS
,
472
,
3979

Skrutskie
M. F.
et al. ,
2006
,
AJ
,
131
,
1163

Sormani
M. C.
,
Binney
J.
,
Magorrian
J.
,
2015
,
MNRAS
,
454
,
1818

Steinmetz
M.
et al. ,
2006
,
AJ
,
132
,
1645

Vasiliev
E.
,
2019
,
MNRAS
,
482
,
1525

Virtanen
P.
et al. ,
2020
,
Nat. Methods
,
17
,
261

Xiang
M.
et al. ,
2018
,
ApJS
,
237
,
33

Xiang
M.
,
Rix
H.-W.
,
Ting
Y.-S.
,
Zari
E.
,
El-Badry
K.
,
Yuan
H.-B.
,
Cui
W.-Y.
,
2021
,
ApJS
,
253
,
22

Zucker
C.
et al. ,
2022
,
Nature
,
601
,
334

APPENDIX A: FORMULAE FOR COMPUTING LIKELIHOODS

Let f(w) be the DF normalized such that |$\int {\rm d}^6{\bf w}\, f=1$|⁠, and Φ(M) be the population’s luminosity function. Then the probability that a randomly chosen survey star is located within the phase-space volume d6w around the phase-space location w = (x, v) and has absolute magnitude in (M, M + dM) is
(A1)
where S(w, M) is the probability that a star at w of absolute magnitude M enters the survey. The denominator
(A2)
is the probability that a star randomly chosen from the population will appear in the catalogue. It ensures that |$P({\bf w}|f\rm{ \& in survey})$| is a correctly normalized probability density.
On account of observational errors, we should maximize not |$P({\bf w},M|f\rm{ \& in survey})$| but the related probability density
(A3)
that the catalogue will list a star of absolute magnitude M at w that has true phase-space location w′. Here we assume that the distribution of observational errors G is a multivariate Gaussian with kernel K:
(A4)
where n = dim(w). Thus f should be chosen to maximize
(A5)
In practice it is convenient to work with sky coordinates, which do not comprise a system of canonical coordinates for phase space. Specifically, we use Galactic longitude and latitude ℓ, b, distance s, the proper motions |$\mu _\ell =\dot{\ell }\cos b$| and |$\mu _b=\dot{b}$|⁠, the line-of-sight velocity v. Forming these into the vector u = (ℓ, b, s, μ, μb, v), we have that the element of phase-space volume d6w is related to d6u by (e.g McMillan & Binney 2012)
(A6)
so
(A7)
where w′ is understood to be a function of u′. One advantage of working with sky coordinates is that the matrix K then simplifies. Most of its off-diagonal elements vanish and Kℓℓ and Kbb become very large because the sky positions of stars have negligible uncertainty. This being so G may be approximated by the product of a 4 × 4 matrix |$\widetilde{{\bf K}}$| and Dirac δ-functions in ℓ − ℓ′ and bb′ so the integrals over these coordinates can be trivially executed. Otherwise we neglect correlations by approximating |$\widetilde{{\bf K}}$| by a diagonal matrix,
(A8)
Conversion of heliocentric coordinates into phase-space coordinates requires knowledge of the Sun’s Galactocentric position and velocity. We use Galactocentric Cartesian coordinates (x, y, z) in which the Sun’s position vector is (− 8.2, 0, 0). Heliocentric distances are denoted by s and Galactocentric distances by r. From Schönrich (2012) we take the Sun’s Galactocentric velocity V to be
(A9)

In practice S depends not on M but on the apparent magnitude m = M + μ + A, where |$\mu =5\log _{10}(s/10\, \mathrm{pc})$| is the distance modulus and A(x) is the extinction.

A1 Evaluating the probabilities

Evaluation of the quality of a model requires execution of two distinct numerical tasks. One is computation of PS by integrating the product of the DF and the survey’s selection function through phase space. Introducing angle-action variables |$(\theta ,{\bf J})$|⁠, we have
(A10)
Following McMillan & Binney (2013), we execute this integral by the Monte Carlo principle:
(A11)
where the points xi are randomly sampled according to the probability density fs. We take fs to be a function of J only, so we can write
(A12)
Poisson noise is minimized if f(J)/fs(J) ≃ 1, and in our case this can be achieved by taking fs to be the first DF we try. If the coordinates |$(\theta _i,{\bf J}_i)$| of the points that sample fs and the resulting values of the ratio |$S(\theta _i,{\bf J}_i)/f_{\rm s}({\bf J}_i)$| are stored, the quality of any subsequently proposed DF can be computed cheaply merely by evaluating it at the Ji.
We use Gauss–Legendre integration with N = 20 nodes to evaluate the second integral for a k-th star:
(A13)
where x are fixed positions given the sampling density and |$M_{k}^{+}$| and |$M_{k}^{-}$| are, respectively, the upper and lower limit of the absolute magnitude, respectively:
(A14)
where |$m_{k}^{+}=21$| and |$m_{k}^{-}=5$| are the apparent magnitude limits of the Gaia survey, Ak is the G-band extinction. Note that these are the astrometric solutions’ limits rather than the detection limits of the Gaia survey. Additionally, if the computed absolute magnitude limits exceed the boundaries in the LF which is shown in Fig. 2, the boundary value will then be used to replace the computed absolute magnitude limits in the integral. Since we ignore the colour criteria in the SF, we only need to deal with LF in Gaia G band in the normalization factor.
We are concerned with the case that v has not been measured. Then the error ellipsoid becomes a section of a 4D cylinder, the cross-sections of which are 3D ellipsoids spanned by the measured quantities (s, μ, μb). We need to integrate through only that part of the cylinder for which the Galactocentric speed v is less than the escape speed because the DF vanishes for v(v) > vesc. The strategy we adopted is to obtain from quadpy5n = 77 locations xi and weights Ai for 3D integration with weight function |$\mathrm{e}^{-|{\bf x}|^2}$|⁠. Then with G(u|K) now denoting a 3D Gaussian distribution, we have at each i that
(A15)
where V0 is a normalizing velocity, v± are the values of v at which v = vesc and
(A16)
The integral over v was executed by Gauss–Legendre integration with unit weight function using 35 integrand evaluations.6 The number of node for disc stars are more than 1.5 times that in the RR-Lyrae work, which is because the line-of-sight distribution for disc stars are not regularly due to the SF and extinction on the plane. The constant V0 is the same for all stars and DFs, so it plays no role in the optimization and can be set to unity.

APPENDIX B: TEST RESULT FOR WRONG DUST MODEL

Fig. B1 shows the posterior distributions when the Green et al. (2019) dust model is used to analyse mock data produced using the toy dust model of Section 5.3.

Posterior probability distributions obtained when a mock catalogue created using the toy dust model is analysed using the Bayestar2019 dust model.
Figure B1.

Posterior probability distributions obtained when a mock catalogue created using the toy dust model is analysed using the Bayestar2019 dust model.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary data