Abstract

We present a 3D Bayesian method to model the kinematics of strongly lensed galaxies from spatially resolved emission-line observations. This technique enables us to simultaneously recover the lens–mass distribution and the source kinematics directly from the 3D data cube. We have tested this new method with simulated OSIRIS observations for nine star-forming lensed galaxies with different kinematic properties. The simulated rotation curves span a range of shapes that are prototypes of different morphological galaxy types, from dwarf to massive spiral galaxies. We have found that the median relative accuracy on the inferred lens and kinematic parameters are at the level of 1 and 2 per cent, respectively. We have also tested the robustness of the technique against different inclination angles, signal-to-noise ratios, the presence of warps, or non-circular motions and we have found that the accuracy stays within a few per cent in most cases. This technique represents a significant step forward with respect to the methods used until now, as the lens parameters and the kinematics of the source are derived from the same 3D data. This enables us to study the possible degeneracies between the two and estimate the uncertainties on all model parameters consistently.

1 INTRODUCTION

Measuring the content of baryons and dark matter within galaxies, and its evolution with redshift, is a key test of galaxy formation models. In the context of ΛCDM cosmology, current numerical simulations have not yet produced consistent predictions on the fraction of dark matter within young galaxies. In particular, the amount of dark matter fraction within the stellar half-mass radius has been shown to be strongly dependent on the implementation of feedback processes (e.g. Wu et al. 2014; Remus et al. 2017; Teklu et al. 2018). For example, different simulations (e.g. Lovell et al. 2018; Teklu et al. 2018) have resulted in dark matter fraction at |$z$| ∼ 2 that can differ by almost one order of magnitude. Numerous physical mechanisms, such as the mass accretion history, the initial mass function, dynamical instabilities, and adiabatic contraction, determine the relative contribution of baryons and dark matter within a galaxy (e.g. Blumenthal et al. 1986; Dutton et al. 2011; Courteau & Dutton 2015; Zolotov et al. 2015). For this reason, quantifying the amount of dark matter from kinematical measurements provides a strong constraint on galaxy formation models (Rubin, Ford & Thonnard 1978; van Albada et al. 1985).

From an observational perspective, a number of observations have revealed that a significant number of high-redshift galaxies are a disc-dominated system (e.g. Förster Schreiber et al. 2006, 2009; Wisnioski et al. 2015; Mason et al. 2017). However, while in the Local Universe the flatness of the rotation curves and the matter content of disc galaxies is a well-established fact, at high redshift it is currently a matter of debate, with rotation curves having flat (Di Teodoro, Fraternali & Miller 2016), rising (Tiley et al. 2016), or declining shapes (Genzel et al. 2017; Lang et al. 2017). The declining rotation curves for six star-forming galaxies at redshift between 0.8 and 2.3 have been explained, for example, by Lang et al. (2017) and Genzel et al. (2017) as an indication of baryon-dominated systems, with a fraction of dark matter lower than 0.2. On the other hand, Di Teodoro et al. (2016), Di Teodoro et al. (2018), and Mason et al. (2017) have derived rotation curves and velocity dispersion values in agreement with those of local star-forming galaxies (Epinat et al. 2010).

The partition of the matter content between dark matter, stars, and gas within a galaxy is provided, also, by studying the evolution of the stellar mass Tully–Fisher relation (TFR; Tully & Fisher 1977), which correlates the stellar mass to the rotation velocity, a tracer of the total dynamical mass. A change in the normalization with redshift might, for example, indicate a redistribution of the total mass between visible and dark matter.

Even if the TFR has been explored at redshifts between 0 and 4 by numerous studies, there is no consensus whether it evolves (e.g. Puech et al. 2008; Straatman et al. 2017; Turner et al. 2017; Übler et al. 2017) or not (e.g. Miller et al. 2011, 2012; Di Teodoro et al. 2016; Harrison et al. 2017) with redshift.

The diverging results on the kinematics of high-redshift galaxies and, as a consequence of their matter content, can be ascribed to the different methods used to overcome the observational limitations. The study of the kinematics is mainly hampered by two factors: low spatial resolutions and low signal-to-noise ratios (SNRs). Seeing-limited observations are typically characterized by an effective spatial resolution of 5 kpc at the redshifts of the sources, |$z$| ∼ 1–2 (e.g. Förster Schreiber et al. 2009; Swinbank et al. 2017), while a handful of adaptive optic (AO) observations have achieved higher resolutions of ∼1–1.6 kpc (Molina et al. 2017; Förster Schreiber et al. 2018). Furthermore, because of cosmological surface-brightness dimming, only the bright central regions of galaxies can be observed, especially with AO. Although AO observations are characterized by a better spatial resolution with respect to seeing-limited observations, they have a worse sensitivity and a data binning is often required to increase the SNR.

One of the consequences of limited spatial resolution is to smooth out the measured rotation velocity via the so-called beam-smearing effect and can result in an overestimation of the velocity dispersion (e.g. Wright et al. 2009; Newman et al. 2013; Di Teodoro & Fraternali 2015). This effect can also lead to a misclassification of objects: for example, Newman et al. (2013) have shown that the fraction of dispersion-dominated galaxies in the SINS/zC-SINF surveys (Cresci et al. 2009; Förster Schreiber et al. 2009; Genzel et al. 2011) drops from 41 per cent at a seeing-limited resolution to 6–9 per cent when galaxies are observed in the AO mode.

The observational limitations imposed by low resolution and SNR can be successfully overcome by targeting strongly gravitationally lensed galaxies. Strong gravitational lensing offers the opportunity to study high-redshift galaxies at a much higher physical resolution and SNR in their source plane (e.g. Nesvadba et al. 2006; Swinbank et al. 2007). Furthermore, the magnifying power of gravitational lensing opens the possibility to study galaxies in the low-stellar-mass range of 5 × 108|$5\times 10^9 \, \mathrm{M}_\odot$| (e.g. Jones et al. 2010a; Leethochawalit et al. 2016; Mason et al. 2017), which is instead not easily achievable by surveys targeting unlensed galaxies (e.g. Förster Schreiber et al. 2006; Swinbank et al. 2012a).

It was only in recent years that the potential of gravitational lensing has started to be exploited: for example, Stark et al. (2008) have studied the kinematics of a lensed galaxy at a resolution of 120 pc at |$z$| = 3.07. The analysis of two larger samples then followed this study: Jones et al. (2010a) have analysed six lensed galaxies in the redshift range 1.7–3.1, and Livermore et al. (2015) have further extended this sample to 17 targets with redshift from 1 to 4. Leethochawalit et al. (2016) have analysed 15 lensed galaxies at |$z$| ∼ 2. Regarding the galaxy population properties, Jones et al. (2010a) and Leethochawalit et al. (2016) have used different methods to distinguish well-ordered velocity fields from disturbed/merging kinematics and obtained a different classification for similar ranges of redshift and stellar mass: 36 per cent of the galaxies in the Leethochawalit et al. (2016) sample are rotationally dominated and as many as 66 per cent in the Jones et al. (2010a) sample, as confirmed by Livermore et al. (2015).

So far, most of the analysis aimed at studying the kinematics of lensed sources with optical emission lines has been characterized by the following features:

  • the lens–mass model is derived from high-spatial-resolution-imaging data (e.g. from HST images; Stark et al. 2008; Jones et al. 2010b, 2013; Shirazi et al. 2014; Livermore et al. 2015; Leethochawalit et al. 2016; Yuan et al. 2017);

  • the kinematic modelling is done either by delensing the 3D Integral Field Unit (IFU) data (e.g. Jones et al. 2013; Livermore et al. 2015) and deriving the velocity and dispersion maps with a Gaussian fit to the emission lines in the source plane, or by deriving the moment maps in the image plane and then delensing these maps to the source plane (e.g. Jones et al. 2010b; Leethochawalit et al. 2016). In both cases, the lens model is kept fixed.

  • A functional form, usually an arctangent function, is used to fit the delensed velocity field and derive the rotation curve.

Recent studies based on molecular line observations have used a similar approach (e.g. Dye et al. 2015; Rybak et al. 2015b; Swinbank et al. 2015). One first derives the lens–mass distribution from the radio continuum, observed in the same bands as the molecular lines. Then, this model is used to derive the 3D-line data and calculate the corresponding moment maps in the source plane. Finally, kinematic parameters are derived either by applying the kinemetry method (Krajnović et al. 2006) to both the first- and second-moment maps (Rybak et al. 2015b) or by applying a dynamical model to the first-moment map (Dye et al. 2015; Swinbank et al. 2015).

All these approaches are suboptimal mainly for two reasons: first, if the lens model is kept fixed, it is not possible to quantify any degeneracy between the lens–mass parameters and the source kinematic properties. Secondly, the kinematic fitting is done on the reconstructed source rather than on the data. However, on the source plane, pixels are correlated, the noise properties not fully characterized, and the effective resolution changes with position according to the lensing magnification. As a consequence, one introduces systematic errors in the derivation of the kinematic properties of the source, which may be difficult to quantify.

Recently, Patrício et al. (2018) have applied a forward modelling approach that partly overcomes some of the above issues by deriving the velocity map directly on the image plane through a Gaussian fitting to the emission lines. However, similarly to the techniques described above, this method is not ideal, as it relies on a fixed lens model derived from a separate HST observation and it performs a kinematic modelling of the 2D velocity map, instead of the full 3D data cube.

Finally, other studies have been focusing on sources that are not significantly lensed (i.e. only weakly distorted), so that the kinematic analysis can be done directly on the image plane without having to reconstruct the unlensed emission (e.g. Mason et al. 2017; Di Teodoro et al. 2018; Girard et al. 2018). However, even small distortions of the observed axis ratio, due to lensing, could affect the capability to recover the kinematic parameters accurately. Mason et al. (2017) have tried to correct for this effect using a global value for the magnification factor.

In this paper, we present a novel Bayesian three-dimensional and pixellated approach, which, applied either to IFU or interferometric data, enables us to simultaneously reconstruct both the lensing mass distribution and the kinematics of the source. Our method represents a significant improvement over those described above as it is not affected by differential magnification nor poor understanding of the errors and resolution properties of the reconstructed unlensed plane. Our technique does not require the use of high-resolution imaging data for the derivation of the lens parameters, as these are derived from the same 3D data used for the modelling of the kinematics of the background galaxy. Since the lens parameters and the source are inferred simultaneously from the same data set, our method is not affected by differential magnification. Moreover, the kinematics of the background galaxy is not obtained by fitting on the source plane, but directly the lensed data in a hierarchical Bayesian fashion, where the kinematics on the source plane is essentially a hyper-parameter (i.e. parameter defining the prior) of the model. The main novelty of our procedure is that a modified tilted-ring kinematic model is an extra constraint for a pixellated source reconstruction. Furthermore, the derivation of the lens–mass model and the source kinematics is done simultaneously, allowing us to quantify possible degeneracies and to estimate the errors on all model parameters using a Bayesian approach. Finally, our 3D approach enables us to describe the kinematics of the source minimizing the influence of the beam-smearing effect.

This paper is organized as follows. In Section 2, we describe in detail the method used for the lens modelling and the derivation of the kinematics. In Section 3, we present the IFU-simulated data sets. In Section 4, we describe the modelling strategy and the assumptions applied to model the simulated data sets, which are then used in Section 5 to test our method under different observational set-ups. The robustness and the limits of the technique are summarized in Section 6, where we also list future developments and applications.

2 METHOD DESCRIPTION

This section describes the core features of our method, which is an extension of the technique developed by Vegetti & Koopmans (2009) to the 3D-domain. In particular, we present the statistical framework that allows us to reconstruct the background source, its kinematics, and the lens–mass distribution. The lens–mass parametrization is described in Section 2.2, while the details of the kinematic model used to describe the lensed source are given in Section 2.3.

2.1 Source reconstruction

In the following, we indicate with |$\mathbf {{\boldsymbol{s}}}$| and |$\mathbf {{\boldsymbol{d}}}$| the 3D pixellated surface brightness distribution of the source and the data in the image plane, respectively. We refer the reader to Fig. 1 for a schematic representation of the source and image planes.

A schematic view of the source and lens planes. On the upper panel, the lensed data for three representative spectral channels and the respective regular grid on the image plane. For each spectral channel, the position $\vec{x}$ of a subset of $\mathrm{\text{$N$}_{s}}$ pixels in the image plane corresponds to a position $\vec{y}$ on the source plane (lower panel), through the lens equation $\vec{y}=\vec{x}-\vec{\alpha }\left(\vec{x}\right)$. The points $\vec{y}$ are the vertices of a triangular adaptive grid on the source plane.
Figure 1.

A schematic view of the source and lens planes. On the upper panel, the lensed data for three representative spectral channels and the respective regular grid on the image plane. For each spectral channel, the position |$\vec{x}$| of a subset of |$\mathrm{\text{$N$}_{s}}$| pixels in the image plane corresponds to a position |$\vec{y}$| on the source plane (lower panel), through the lens equation |$\vec{y}=\vec{x}-\vec{\alpha }\left(\vec{x}\right)$|⁠. The points |$\vec{y}$| are the vertices of a triangular adaptive grid on the source plane.

Given a set of |$\mathrm{N_{ch}}$| observed spectral channels, the source and data vectors, |${{\boldsymbol{s}}}$| and |${{\boldsymbol {d}}}$|⁠, have a total of |$\mathrm{N_{ch}}$| components, |$\boldsymbol {s}_{\mathrm{i}..\mathrm{N_{ch}}}$| and |$\boldsymbol {d}_{\mathrm{i}..\mathrm{N_{ch}}}$|⁠, respectively, each representing the surface brightness distribution in one channel i:
(1)
(2)
For each ith spectral channel, the surface brightness distribution of the lensed cube |$\boldsymbol {d}_{\mathrm{i}}$|⁠, its noise |$\boldsymbol {n}_{\mathrm{i}}$|⁠, and the relative unknown source surface brightness distribution |$\boldsymbol {s}_{\mathrm{i}}$| are related to each other by the following set of linear equations:
(3)
where |$\rm {\boldsymbol{\sf M=B\, L}}$| is the response operator that depends on the lensing operator |${\boldsymbol{\sf L}}$| and the point spread function (PSF) operator |${\boldsymbol{\sf B}}$| (for interferometric data, |${\boldsymbol{\sf B}}$| is the Fourier transfer operator). The lensing operator |${\boldsymbol{\sf L}}$| is a non-linear function of the lens mass–density distribution parameters |$\boldsymbol{\eta _{\mathrm{lens}}}$| (see Section 2.2 for their definition) via the lensing potential |$\psi \left(\boldsymbol{\eta _{\mathrm{lens}}}\right)$|⁠.

The method used for the source reconstruction is grid-based in the sense that the background–source surface brightness distribution is reconstructed on a triangular adaptive grid defined by a Delaunay tessellation. The source grid automatically adapts with the lensing magnification, so that there is a high pixel-density in the high-magnification regions close to the caustics. The vertices of the triangular grid are obtained by casting back to the source plane a subset |$\mathrm{N_{s}}$| of the |$\mathrm{N_{d}}$| image–plane pixels via the lens equation. We determine the surface brightness at each source-plane pixel by interpolating between the values at the vertices of the triangles. We reconstruct each channel on the same triangulation.

As both |$\boldsymbol{\eta _{\mathrm{lens}}}$| and the source |$\boldsymbol {s}_{\mathrm{i}}$| are unknown, equation (3) is ill-defined and cannot be simply inverted. Therefore, we derive a penalty function defined in the context of three levels of Bayesian inference, which are described below.

2.1.1 First level of inference – linear optimization

Using Bayes’ rule, the most probable a posteriori source, |$\mathbf {\text{$\boldsymbol{s}$}_{\rm MP}}$|⁠, given the data and a lens–mass model, is derived by maximizing the following posterior probability (Suyu et al. 2006; Vegetti & Koopmans 2009):
(4)
Here, the matrix |${\boldsymbol{\sf R}}$| is the source regularization form (variance, gradient, or curvature), with a strength set by the regularization level vector λ (see Koopmans 2005; Vegetti & Koopmans 2009, for further details). The regularization level vector λ has |$\mathrm{\text{$N$}_{ch}}$| components, so that the user can choose whether the level of regularization is constant across the spectral channels or not. For simplicity, all equations below and above assume a constant value of λ as a function of frequency. |$\boldsymbol{\eta _{\mathrm{kin}}}$| in equation (4) are the source kinematic parameters, as defined in Section 2.3. The remaining terms in equation (4) are the likelihood function |$P\left(\mathbf {{\boldsymbol{d}}}|\mathbf {{\boldsymbol{s}}}, \boldsymbol{\eta _{\mathrm{lens}}}\right)$|⁠, the prior on the source surface brightness distribution |$P\left(\mathbf {{\boldsymbol{s}}} | \lambda , \boldsymbol{\eta _{\mathrm{kin}}}, \mathbf {R}\right)$|⁠, and the evidence |$P\left(\mathbf {{\boldsymbol{d}}}| \lambda , \boldsymbol{\eta _{\mathrm{lens}}},\boldsymbol{\eta _{\mathrm{kin}}}, {\boldsymbol{\sf R}}\right)$|⁠, which is irrelevant for the maximization of the posterior, but plays an important role at the third level of inference (see Section 2.4). Under the assumption of Gaussian noise, the likelihood can be expressed as follows:
(5)
where ZD is the normalization, and ED is half the standard χ2 and is given by
(6)
Above, |${\boldsymbol{\sf C}}_{\mathrm{d\, i}}$| is the covariance matrix of the data for the ith spectral channel. Since the noise is assumed to be uncorrelated, it is a diagonal matrix.
In our implementation, the source prior assumes a quadratic form, which peaks at a source kinematic model, |$\mathbf {\text{$\boldsymbol{s}$}_{\mathrm{kin}}\left(\boldsymbol{\eta _{\mathrm{kin}}}\right)}$|⁠. The prior probability distribution is, therefore, expressed as
(7)
where the quadratic functional ER and the normalization ZR are given, respectively, by
(8)
and
(9)
|${\boldsymbol{\sf H}}_{\mathrm{R}}$| in the above equation is a block matrix made up of the Nch matrices |${\boldsymbol{\sf H}}_{\mathrm{R, i}}$| (see equation (8)) and it is defined as the Hessian of ER, |${\boldsymbol{\sf H}}_{\mathrm{R}}=\nabla \nabla E_{\mathrm{R}}={\boldsymbol{\sf R}}^{\intercal }{\boldsymbol{\sf R}}$|⁠. The most probable surface brightness is obtained by maximizing the posterior probability in equation (4), i.e. by solving the following set of linear equations:
(10)
The form of these equations differs from those derived in Vegetti & Koopmans (2009) in the last term on the right-hand side, |$\lambda {\boldsymbol{\sf H}}_{\mathrm{R i}}\boldsymbol {s}_{\mathrm{kin\, i}}$|⁠. This term is due to a different assumption about the peak of the source prior, which is equal to zero in Vegetti & Koopmans (2009) and |$\mathbf {\text{$\boldsymbol{s}$}_{\mathrm{kin}}}$| here.

2.1.2 Second level of inference – non-linear optimization

To infer the kinematic parameters |$\boldsymbol{\eta _{\mathrm{kin}}}$|⁠, the lens parameters |$\boldsymbol{\eta _{\mathrm{lens}}}$|⁠, and the optimal level of regularization λ, we maximize the following posterior probability:
(11)
Assuming a prior that is flat in logλ, |$\boldsymbol{\eta _{\mathrm{lens}}}$|⁠, and |$\boldsymbol{\eta _{\mathrm{kin}}}$|⁠, equation (11) can be expressed as
(12)
If we assume a Gaussian noise and a quadratic form of regularization, equation (12) can be written as
(13)
In the above equation, E = ED + λER, |${\boldsymbol{\sf H}}_{\mathrm{E}}$| is its Hessian, and |$\mathbf {\text{$\boldsymbol{s}$}_{MP}}$| is the most probable solution that maximizes the posterior, |$\nabla E \left(\mathbf {\text{$\boldsymbol{s}$}_{MP}}\right) = 0$|⁠.

The expression for the posterior probability, equation (13), differs from that derived by Suyu et al. (2006) and Vegetti & Koopmans (2009) for the multiplications/summation by/over |$\mathrm{\text{$N$}_{ch}}$| and the presence of the term |$E_{\mathrm{s}}\left(\mathbf {\text{$\boldsymbol{s}$}_{kin}}\right)$|⁠, which is the main novelty of our approach. This allows us to derive the kinematic parameters of the source, while retaining the flexibility of a pixellated source surface brightness distribution, simultaneously infer the lens-mass distribution, and take advantage of the extra constraints provided by the velocity channels.

2.1.3 Third level of inference – model comparison

At the third level of inference, to compare and rank different models, we calculate the marginalized Bayesian evidence that is a measure of the probability of the data given by the model. In our case, this marginalized evidence can be expressed as the integral of the normalization factor in equation (4) over the lens parameters |$\boldsymbol{\eta _{\mathrm{lens}}}$|⁠, the kinematic parameters |$\boldsymbol{\eta _{\mathrm{kin}}}$|⁠, and the source regularization λ, such that
(14)
This integral is calculated numerically with multinest (Feroz, Hobson & Bridges 2009; Feroz et al. 2013), which is a Nested Sampling-based method improving on the original idea by Skilling (2004). As a by-product of this evidence calculation, we also obtain the posterior distributions of the model parameters, allowing us to estimate their statistical uncertainties and degeneracies (see Section 5).

2.2 Lens–mass model

The lens parameters |$\boldsymbol{\eta _{\mathrm{lens}}}$| that define the lensing operator |${\boldsymbol{\sf L}}$| are as follows: κ0, q, γ, x0, y0, θ, Γsh, θsh. These parameters describe a projected mass density profile as a cored elliptical power-law distribution with the contribution of an external shear component of strength Γsh and position angle θsh. The dimensionless projected mass density profile is defined as
(15)
Here, κ0 is the surface mass–density normalization, q is the projected flattening, γ is the density slope, x0 and y0 define the centre of the mass distribution, rc is the core radius, and θ is the position angle of the major axis. The Einstein radius for this density profile is defined as
(16)
In the following sections, we assume that the mass distribution has a negligible core radius of 10−4 arcsec.

2.3 Source kinematic model

We build the kinematic model using a modified version of the building-model function of 3Dbarolo (Di Teodoro & Fraternali 2015). To simulate the gas emission from a rotating galaxy, the 3Dbarolo algorithm uses a stochastic function that populates a 6D domain (three spatial and three spectral dimensions) with emitting gas clouds, which allow us to build the line profiles. The rotating galaxy is modelled as a series of concentric circular rings using the so-called tilted-ring model (Rogstad, Lockhart & Wright 1974). On each ring, the positions of the clouds are chosen randomly in such a way that, on average, the clouds become uniformly distributed over its surface. Each ring is described by the following parameters:

  • the coordinates of the centre xs, ys;

  • the inclination angle i, defined such that i = 90° for an edge-on galaxy and i = 0° for a face-on one;

  • the position angle PA, defined as the angle between the north direction of the sky and the projected major axis of the receding half of the rings measured counterclockwise;

  • the face-on gas column density Σ;

  • the systemic velocity Vsys;

  • the rotation velocity Vrot;

  • the velocity dispersion σgas.

The projected velocity along the line-of-sight Vlos at a particular radius R is defined by
(17)
where ϕ is the azimuthal angle in the plane of the galaxy. To build the 3D model, at each radius, the positions of the clouds are then rotated and projected into the plane of the sky with an orientation with respect to the observer defined by both the position and the inclination angles at that radius. As in 3Dbarolo, to obtain the velocity profile at each location, the clouds are divided into subclouds. Each of these subclouds has a velocity which is drawn from a Gaussian distribution with a dispersion of |$\sigma ^2= \sigma ^2_{\mathrm{gas}}+\sigma ^2_{\mathrm{instr}}$|⁠. Here, σinstr takes into account the instrumental broadening.

Unlike 3Dbarolo , our implementation does not allow for a variation of all the parameters ring by ring. Instead, we make the following assumptions: (i) all the rings have the same centre coordinates and systemic velocity (in Section 5.10, we explicitly test the effects of this assumption); (ii) the radial variations of the inclination and position angles are described by a polynomial of deg from 0 to 3; (iii) the radial variations of the rotation velocity and velocity dispersion are described by functional forms. The use of functional forms for the rotation velocity and velocity dispersion allows us to reduce the number of free parameters. Our kinematic model |$\mathbf {\text{$\boldsymbol{s}$}_{\mathrm{kin}}}$| is, therefore, defined by the following set of parameters |$\boldsymbol{\eta _{\mathrm{kin}}}=\lbrace R_{\mathrm{ext}}, \Sigma , x_{\mathrm{s}}, y_{\mathrm{s}}, V_{\mathrm{sys}}, i, PA, V_{\mathrm{rot}}, \sigma _{\mathrm{gas}}\rbrace$|⁠. Rext is the radial extension and is a fixed parameter chosen by the user. In Section 4, we describe the assumptions made to estimate Rext for the simulated data analysed in this paper. Following 3Dbarolo, the surface density of the gas Σ is also not a free parameter; instead, we impose a pixel-by-pixel normalization, which is given by the surface brightness distribution obtained from the lens modelling of the zeroth-moment map. The advantage of using a spatially changing normalization is that it allows us to take into account for possible asymmetries in the ionized or molecular gas distribution, as it is typical for high-redshift galaxies given the presence of massive star-forming regions (e.g. Genzel et al. 2011; Swinbank et al. 2012b; Livermore et al. 2015). Therefore, the presence of clumpy emissions or holes should not affect the derived kinematics, because this normalization ensures that the kinematics is independent of the gas distribution (e.g. Lelli et al. 2012; Di Teodoro & Fraternali 2015).

By construction, the kinematic model is built on a Cartesian grid, defined by a pixel scale and dimensions chosen by the user. Since the source reconstruction is determined on a Delaunay tessellation (see Section 2.1), we then map this model at the positions of the triangle vertices.

2.3.1 Rotation velocity and velocity dispersion curves

We have implemented three empirical functions to describe the rotation curves: the arctangent function (Courteau 1997), the hyperbolic tangent function (Andersen et al. 2001), and a multiparameter function (Rix et al. 1997). These are expressed by the following three expressions, respectively:
(18)
(19)
(20)
Rt is the turnover radius between the inner rising and outer part of the curve. Vt is the asymptotic velocity for the arctangent and hyperbolic tangent functions, and the velocity scale for the multiparameter one. ξ defines the sharpness of the turnover while β specifies the power-law behaviour of the curve at large radii. The arctangent function has mainly been used to model the kinematics of high-redshift galaxies (e.g. Swinbank et al. 2012a; Leethochawalit et al. 2016). However, it is not flexible enough to reproduce the different kinds of observed rotation curves, especially in the inner regions where a bump can be present. For this reason, we prefer the multiparameter function which is, by definition, more flexible.
To describe the velocity dispersion profile, the user can choose from a power-law, a linear, or an exponential function:
(21)
(22)
(23)

2.4 Optimization scheme

We infer the unknown parameters |$\boldsymbol{\eta _{\mathrm{lens}}}$|⁠, λ, |$\boldsymbol{\eta _{\mathrm{kin}}}$| and the source |$\mathbf {{\boldsymbol{s}}}$| with an optimization scheme, which is divided into the following four stages (see also Fig. 2 for a schematic view):

  • To find a good initial guess for the lens model parameters, |$\boldsymbol{\eta _{\mathrm{lens}}}$|⁠, we start by modelling the zeroth-moment map of the data. This optimization is performed in three separate substeps. First, λ is kept fixed at a relatively large value, such that the source model remains relatively smooth, and |$P\left(\boldsymbol{\eta _{\mathrm{lens}}}|\lambda , \mathbf {{\boldsymbol{d}}},{\boldsymbol{\sf R}}\right)$| is maximized relatively to |$\boldsymbol{\eta _{\mathrm{lens}}}$|⁠. Secondly, the lens parameters are kept fixed at the most probable values found at the previous step, while |$P\left(\lambda |\boldsymbol{\eta _{\mathrm{lens}}}, \mathbf {{\boldsymbol{d}}},{\boldsymbol{\sf R}}\right)$| is optimized for the source regularization level λ. Finally, |$P\left(\boldsymbol{\eta _{\mathrm{lens}}}|\lambda , \mathbf {{\boldsymbol{d}}},{\boldsymbol{\sf R}}\right)$| is maximized again for the lens parameters with a source regularization level fixed to the most probable value determined in the previous stage. At every point of the non-linear mass-model optimization, the corresponding most probable source surface brightness distribution |$\mathbf {\text{$\boldsymbol{s}$}_{\rm MP}}$| is obtained by solving the linear system (10) with |$\mathbf {\text{$\boldsymbol{s}$}_{kin}}=0$|⁠.

  • We now model the entire 3D data cube. Assuming the values of |$\boldsymbol{\eta _{\mathrm{lens}}}$| found in step (i), we infer the optimal regularization constant λ and |$\boldsymbol{\eta _{\mathrm{kin}}}$|⁠, maximizing equation (13) by varying first the kinematic parameters that define |$\mathbf {\text{$\boldsymbol{s}$}_{kin}}$|⁠, then the source regularization level λ, and finally the kinematic parameters again. At this stage, the user can choose between a value of the regularization level λ that is the same for all of the spectral channels or a value that varies channel by channel.

  • We repeat the process described in (i), using equations (11) and (10) with |$\boldsymbol{\eta _{\mathrm{kin}}}$| equal to the value found in (ii).

  • Finally, the lens parameters, λ, and only the kinematic parameters that describe the rotation velocity and velocity dispersion, i.e. Vrot, σgas, are simultaneously left free to vary, starting from the values of the parameters found at the previous steps. As for the last two, at this stage, we focus on the 3D data cube.

A schematic overview of the four-step optimization scheme used to infer the unknown parameters $\boldsymbol{\eta _{\mathrm{lens}}}$, λ, $\boldsymbol{\eta _{\mathrm{kin}}}$. The four boxes represent the points (i)–(iv) in Section 2.4. An initial estimate of the lens parameters is obtained by fitting the zeroth-moment map, while for the successive steps the full 3D data cube is used.
Figure 2.

A schematic overview of the four-step optimization scheme used to infer the unknown parameters |$\boldsymbol{\eta _{\mathrm{lens}}}$|⁠, λ, |$\boldsymbol{\eta _{\mathrm{kin}}}$|⁠. The four boxes represent the points (i)–(iv) in Section 2.4. An initial estimate of the lens parameters is obtained by fitting the zeroth-moment map, while for the successive steps the full 3D data cube is used.

The analysis described at the points (ii) and (iii) is repeated if a visual inspection of the residuals reveals a mismatch between the model and the data. All of the optimization steps described above are done with a non-linear optimizer (i.e. a Downhill–Simplex with Simulated Annealing; Press et al. 1992). As discussed in Section 2.1.3, the calculation of the Bayesian evidence with multinest allows us to explore the parameter space and obtain the posterior distributions of the parameters. In this case, both the kinematic and lens parameters are simultaneously changed.

3 IFU MOCK DATA

To investigate the ability of our new modelling technique to recover reliable lens and kinematic parameters, we simulate nine observations of H  α emission from star-forming lensed galaxies at redshifts between ∼1.3 and ∼2.4. In particular, we use the technical features of the OSIRISspectrograph (Larkin et al. 2006). We have chosen to focus on OSIRIS because it has the typical characteristics of a near-infrared integral field spectrometer mounted on an 8–10 m telescope in terms of spatial resolution, AO performances, and spectral resolution, with a typical channel width of 30–40 km s−1.

To simulate the lensed data, we first build a cube from a rotating galaxy (Section 3.1), we then lens it forward using the lens–mass model described in Section 2.2. Finally, we convolve the lensed cube with a spatial PSF and add the noise (Section 3.2).

3.1 Simulated sources

The lensed sources have redshifts between 1.3 and 2.4 (column 2 in Table 1) which results in their  Hα emission line falling in the H or K filters (column 4 in Table 1). The total H α fluxes (column 7 in Table 1) have values typical of star-forming galaxies at |$z$| ∼ 1–2 (e.g. Förster Schreiber et al. 2009; Livermore et al. 2015). The average resolving power is ∼3400 corresponding to |${\sim }6\,\,{\mathring{\rm A} }$| in these bands. The cube of the rotating galaxy is built using 3Dbarolo(Di Teodoro & Fraternali 2015). Input values for the geometrical and kinematical parameters that define the inclination i and position angle PA and the rotation velocity Vrot and dispersion σgas are listed in Table 1. The sources have an extension of ∼5–8 kpc along the major axis, as typical of |$z$| ∼ 1–2 galaxies (e.g. Wisnioski et al. 2015; Leethochawalit et al. 2016; Genzel et al. 2017; Patrício et al. 2018).

Table 1.

Observational and physical properties for the nine mock systems. Top table: Column 1: name of the data set. Column 2: redshift of the source. Column 3: redshift of the lens. Columns 4–5: OSIRIS filter and the corresponding FOV. Column 6: FWHM for the core + halo PSF (see Section 3.1). Middle table: Kinematic parameters used to create the source. Bottom table: Lens parameters used to lens the source and to create the observed mock data.

Observation set-up
Mock data set|$z$|source|$z$|lensFilterFOVFWHMFHαtexp
arcsecarcsec10−18 erg s−1 cm−2ks
M12.050.881Kn13.6 × 6.40.17 + 0.951514.4
M22.190.191Kn24.5 × 6.40.17 + 0.952014.4
M32.150.722Kn24.5 × 6.40.17 + 0.953314.4
M42.260.191Kn34.8 × 6.40.20 + 0.601512.6
M51.340.410Hn24.5 × 6.40.20 + 0.60610.8
M62.360.881Kn34.8 × 6.40.20 + 0.60612.6
M71.340.410Hn24.5 × 6.40.20 + 0.60910.8
M82.190.191Kn24.5 × 6.40.17 + 0.952014.4
M91.340.410Hn24.5 × 6.40.20 + 0.601012.4
Input kinematic parameters
Mock data setiPAVtRtβξσ0R0σ1
°°km s−1kpckm s−1kpckm s−1
M172.0265.0120.02.030.0−1.5
M252.0100.0223.01.015.01.225.0
M364.023.0157.227.41.1393.729.0
M459.0145.073.75.520.2450.146.0−1.19
M568.0280.0151.42.1734.026.0
M665.045.0219.70.650.565.638.0
M740.0280.0151.42.1734.026.0
M880.0100.0223.01.015.01.225.0
M968.0280.0/-3.75151.42.1734.026.0
Input lens parameters
Mock data setκ0θqγΓshθsh
arcsec°°
M11.44−12.720.822.06−0.03913.33
M21.33157.950.932.280.050174.45
M31.000.000.992.000.24038.00
M41.33157.950.932.280.050174.45
M50.8171.200.842.000.09634.40
M61.44−12.720.822.06−0.03913.33
M70.8171.200.842.000.09634.40
M81.33157.950.932.280.050174.45
M90.8171.200.842.000.09634.40
Observation set-up
Mock data set|$z$|source|$z$|lensFilterFOVFWHMFHαtexp
arcsecarcsec10−18 erg s−1 cm−2ks
M12.050.881Kn13.6 × 6.40.17 + 0.951514.4
M22.190.191Kn24.5 × 6.40.17 + 0.952014.4
M32.150.722Kn24.5 × 6.40.17 + 0.953314.4
M42.260.191Kn34.8 × 6.40.20 + 0.601512.6
M51.340.410Hn24.5 × 6.40.20 + 0.60610.8
M62.360.881Kn34.8 × 6.40.20 + 0.60612.6
M71.340.410Hn24.5 × 6.40.20 + 0.60910.8
M82.190.191Kn24.5 × 6.40.17 + 0.952014.4
M91.340.410Hn24.5 × 6.40.20 + 0.601012.4
Input kinematic parameters
Mock data setiPAVtRtβξσ0R0σ1
°°km s−1kpckm s−1kpckm s−1
M172.0265.0120.02.030.0−1.5
M252.0100.0223.01.015.01.225.0
M364.023.0157.227.41.1393.729.0
M459.0145.073.75.520.2450.146.0−1.19
M568.0280.0151.42.1734.026.0
M665.045.0219.70.650.565.638.0
M740.0280.0151.42.1734.026.0
M880.0100.0223.01.015.01.225.0
M968.0280.0/-3.75151.42.1734.026.0
Input lens parameters
Mock data setκ0θqγΓshθsh
arcsec°°
M11.44−12.720.822.06−0.03913.33
M21.33157.950.932.280.050174.45
M31.000.000.992.000.24038.00
M41.33157.950.932.280.050174.45
M50.8171.200.842.000.09634.40
M61.44−12.720.822.06−0.03913.33
M70.8171.200.842.000.09634.40
M81.33157.950.932.280.050174.45
M90.8171.200.842.000.09634.40
Table 1.

Observational and physical properties for the nine mock systems. Top table: Column 1: name of the data set. Column 2: redshift of the source. Column 3: redshift of the lens. Columns 4–5: OSIRIS filter and the corresponding FOV. Column 6: FWHM for the core + halo PSF (see Section 3.1). Middle table: Kinematic parameters used to create the source. Bottom table: Lens parameters used to lens the source and to create the observed mock data.

Observation set-up
Mock data set|$z$|source|$z$|lensFilterFOVFWHMFHαtexp
arcsecarcsec10−18 erg s−1 cm−2ks
M12.050.881Kn13.6 × 6.40.17 + 0.951514.4
M22.190.191Kn24.5 × 6.40.17 + 0.952014.4
M32.150.722Kn24.5 × 6.40.17 + 0.953314.4
M42.260.191Kn34.8 × 6.40.20 + 0.601512.6
M51.340.410Hn24.5 × 6.40.20 + 0.60610.8
M62.360.881Kn34.8 × 6.40.20 + 0.60612.6
M71.340.410Hn24.5 × 6.40.20 + 0.60910.8
M82.190.191Kn24.5 × 6.40.17 + 0.952014.4
M91.340.410Hn24.5 × 6.40.20 + 0.601012.4
Input kinematic parameters
Mock data setiPAVtRtβξσ0R0σ1
°°km s−1kpckm s−1kpckm s−1
M172.0265.0120.02.030.0−1.5
M252.0100.0223.01.015.01.225.0
M364.023.0157.227.41.1393.729.0
M459.0145.073.75.520.2450.146.0−1.19
M568.0280.0151.42.1734.026.0
M665.045.0219.70.650.565.638.0
M740.0280.0151.42.1734.026.0
M880.0100.0223.01.015.01.225.0
M968.0280.0/-3.75151.42.1734.026.0
Input lens parameters
Mock data setκ0θqγΓshθsh
arcsec°°
M11.44−12.720.822.06−0.03913.33
M21.33157.950.932.280.050174.45
M31.000.000.992.000.24038.00
M41.33157.950.932.280.050174.45
M50.8171.200.842.000.09634.40
M61.44−12.720.822.06−0.03913.33
M70.8171.200.842.000.09634.40
M81.33157.950.932.280.050174.45
M90.8171.200.842.000.09634.40
Observation set-up
Mock data set|$z$|source|$z$|lensFilterFOVFWHMFHαtexp
arcsecarcsec10−18 erg s−1 cm−2ks
M12.050.881Kn13.6 × 6.40.17 + 0.951514.4
M22.190.191Kn24.5 × 6.40.17 + 0.952014.4
M32.150.722Kn24.5 × 6.40.17 + 0.953314.4
M42.260.191Kn34.8 × 6.40.20 + 0.601512.6
M51.340.410Hn24.5 × 6.40.20 + 0.60610.8
M62.360.881Kn34.8 × 6.40.20 + 0.60612.6
M71.340.410Hn24.5 × 6.40.20 + 0.60910.8
M82.190.191Kn24.5 × 6.40.17 + 0.952014.4
M91.340.410Hn24.5 × 6.40.20 + 0.601012.4
Input kinematic parameters
Mock data setiPAVtRtβξσ0R0σ1
°°km s−1kpckm s−1kpckm s−1
M172.0265.0120.02.030.0−1.5
M252.0100.0223.01.015.01.225.0
M364.023.0157.227.41.1393.729.0
M459.0145.073.75.520.2450.146.0−1.19
M568.0280.0151.42.1734.026.0
M665.045.0219.70.650.565.638.0
M740.0280.0151.42.1734.026.0
M880.0100.0223.01.015.01.225.0
M968.0280.0/-3.75151.42.1734.026.0
Input lens parameters
Mock data setκ0θqγΓshθsh
arcsec°°
M11.44−12.720.822.06−0.03913.33
M21.33157.950.932.280.050174.45
M31.000.000.992.000.24038.00
M41.33157.950.932.280.050174.45
M50.8171.200.842.000.09634.40
M61.44−12.720.822.06−0.03913.33
M70.8171.200.842.000.09634.40
M81.33157.950.932.280.050174.45
M90.8171.200.842.000.09634.40

In the sections below, we provide more details on each model. In general, to check whether the functional forms in equations (18)–(23) are flexible enough to reproduce a variety of realistic kinematical scenarios, we have considered different input rotation curves of varying complexity and different shapes. In particular, the mock data M1 and M2 are created and modelled with the same functional forms implemented in our code (Sections 5.15.2). The mock data M3 are created with a different functional form (Section 5.3), while the simulated data M4, M5, and M6 have rotation curves derived from real observed galaxies (Sections 5.45.6). The rotation curves of M1 and M4 are typical of dwarf galaxies, the rotation curves of M2 and M5 are prototypes of spirals, while those of M3 and M6 are typical of massive spirals with a prominent bulge. We have included dwarf galaxy kinematics to test if our code is able to recover the shape of the rotation curve when the turning point is not reached and only the increasing part is observable. Finally, the mock data M7, M8, and M9 are used to test the limits of our modelling technique. The aim is to quantify the minimum and maximum inclination angles that allow us to reliably recover the kinematics (M7 and M8, Sections 5.75.8), as well as the minimum warp in the position angle that can be detected given the angular resolution of the data (M9, Section 5.9).

3.2 Simulated observations

We generate the simulated lensed data using a set of lens parameters |$\boldsymbol{\eta _{\mathrm{lens}}}$| (see Table 1) that we have derived from the lens modelling of a set of real galaxy-scale lenses from the SLACS (Bolton et al. 2006) and SHARP (Lagattuta et al. 2012) surveys. This choice is motivated by the fact that in this paper we are only focussing on galaxy-scale lenses. For the analysis of galaxy-cluster lenses, more complicated mass distributions would have to be considered. We plan to investigate this issue further in a following paper. Using the lens equation, we lens forward the source surface brightness to the image plane for each spectral channel of the source cube. The simulated data sets have a spatial pixel scale in the image plane of 0.1 arcsec. Taking the OSIRIS characteristics, the field of view (FOV, column 5 in Table 1) varies between 3.6 × 6.4 arcsec2 and 4.8 × 6.4 arcsec2 depending on the filter (column 4 in Table 1). We convolve the lensed cube with a spatial PSF, g, which is described by the combination of two Gaussians
(24)
This assumption allows us to take into account for the effects of the AO system, in the sense that the light of the source is divided between a diffraction limited-core, gdif, and a seeing-limited halo, gseeing, for a given strehl of the AO correction S (Law, Steidel & Erb 2006). In particular, M1 to M3 and M8 are simulated using an FWHM of about 0.17 arcsec for gdif, an FWHM of about 0.95 arcsec for gseeing, and S = 0.2; for M4 to M7 and M9, we use a full width at half maximum (FWHM) equal to 0.2 and 0.6 arcsec for gdif and gseeing, respectively, and S = 0.24. All values of the PSF parameters are typical of OSIRIS observations (e.g. Stark et al. 2008; Jones et al. 2010a; Wisnioski et al. 2011). The effect of the spectral resolution is included on the plane of the source as described in Section 2.3.
To simulate a realistic noise distribution, we create the sky-subtracted data, |$\mathbf {{\boldsymbol{d}}}$|⁠, using the simulation method by Law et al. (2006), which was specifically designed for OSIRIS observations. The value of |$\mathbf {{\boldsymbol{d}}}$| is then obtained as
(25)
where the noise |$\mathbf {{\boldsymbol{n}}}$| is a value extracted from a Gaussian distribution with a dispersion given by the sum in quadrature of the counts from the observed target, |$\mathbf {{\boldsymbol{t}}}$|⁠, and the background |$\mathbf {\text{$\boldsymbol{t}$}_{\mathrm{BG}}}$|⁠. For our mock observations, |$\mathbf {{\boldsymbol{t}}}= \sum _{\mathrm{i}}^{\mathrm{Nch}} {\boldsymbol{\sf M}}\boldsymbol {s}_{\mathrm{i}}$|⁠.

As explained in detail by Law et al. (2006), the background count rate |$\mathbf {\text{$\boldsymbol{t}$}_{\mathrm{BG}}}$| is a function of the wavelength and takes into account the Mauna Kea near-IR sky brightness spectrum, the telescope emissivity, and the AO system emissivity. We have taken into account the updated characteristics of the telescope and OSIRIS spectrograph relatively to those used by Law et al. (2006): improved grating efficiency (∼0.78; Mieda et al. 2014) and the halved read-out noise given by the installation of a new detector (T. Jones, private communications). The exposure times used for the simulated data sets M1–M9 are listed in the eighth column of Table 1 and they are typical of data containing star-forming lensed galaxies (Livermore et al. 2015; Leethochawalit et al. 2016). The resulting mock data have a median SNR of ∼14 (see Fig. A1 in Appendix  A).

4 MODELLING STRATEGY

In this section, we describe how we build the kinematic prior |$\mathbf {\text{$\boldsymbol{s}$}_{kin}}$| and derive the best kinematic parameters |$\boldsymbol{\eta _{\mathrm{kin}}}$| (we refer to Section 2.3 for a definition). In particular, we discuss the assumptions made to define the radial extension Rext, the centre and the systemic velocity, and the initial conditions for the geometrical and kinematic parameters for the specific data analysed in this paper. These assumptions can change depending on, e.g. the data quality of the observations, previous estimates of the kinematic and/or geometric parameters, and the accuracy of the redshift of the source.

The first step is to define the radial extension and the effective resolution on which |$\mathbf {\text{$\boldsymbol{s}$}_{kin}}$| is sampled. From the reconstruction of the zeroth moment, we first derive an SNR map on the reconstructed source, by propagating the observational noise of the data. We then define the radial extension Rext of the kinematic model as the radius along the apparent major axis of the galaxy at which the SNR ∼ 3. The kinematic models are built using a ring width that is half the size of the pixels on the image plane. We have explicitly verified that these choices do not influence the recovered kinematic parameters. The Cartesian grid is then mapped on to a triangular adaptive grid, with triangles of average dimensions between ∼10−3 to ∼10−1 arcsec (this is set by a combination of the pixel scale on the image plane and the lensing magnification).

To reduce the number of free kinematic parameters during the optimization, we chose to keep the centre of the source galaxy fixed at the flux-weighted average position of the zeroth-moment map (these differ by at most by 1 per cent from the correct values). The systemic velocity is also kept fixed at zero km s−1. When dealing with real data, one will be able to estimate its value from the global velocity profile, where the latter is obtained from the source intensity in each spectral channel of the data cube or other independent estimations. In Section 5.10, we discuss the results obtained by changing the centre and systemic velocity from the true values. The free kinematic parameters are then |$\boldsymbol{\eta _{\mathrm{kin}}}=\lbrace i, PA, V_{\mathrm{rot}}, \sigma _{\mathrm{gas}}\rbrace$|⁠.

Since the geometrical and kinematic parameters are coupled and degenerate (see equation (17)), they need to be initialized with educated guess values. In this paper, we estimate the geometrical parameters (i and PA) by applying 3Dbarolo to the 3D source derived from the lens parameters inferred at point (i) of Section 2.1. We set the initial values for Vt and Rt that define the rotation curve to the arbitrary, but observationally motivated, values of 100 km s−1 and 1 kpc, respectively (e.g. Jones et al. 2010a; Livermore et al. 2015). For the multiparameter function, we set β = 0.2 and ξ = 10.0 as initial guesses. The choice of the functional form is arbitrary, but it should be noted that the multiparameter function is the most flexible one and it reproduces the arctangent function for ξ = 1.1. Furthermore, as demonstrated in Section 5.2, a wrong choice of the functional form for the rotation curve leads to systematic image residuals, indicating that a different choice should be made. The initial value for σ0 is set to 30 km s−1, while initial guesses for the other parameters that define the velocity dispersion functions are chosen such that σ(Rext) − σ0 is not larger than 20 km s−1, as it is typical for the ionized gas of star-forming galaxies (e.g. Epinat et al. 2010; Di Teodoro et al. 2016; Mason et al. 2017).

4.1 Functional forms for the rotation velocity

Here, we briefly summarize the functional forms used to create and model the rotation velocities that define |$\mathbf {\text{$\boldsymbol{s}$}_{kin}}$| (see the second and third columns in Table 2). For the background galaxy of the simulated data M1, we assume a hyperbolic tangent function for the rotation velocity (blue squares in Fig. 3). The data are then modelled assuming the same parametric form that we have used to create them. These simulated data represent, therefore, a zeroth-order test of our modelling technique.

Rotation curves and velocity dispersions for the mock data set M1–M9. The blue squares are the input rotation curves and velocity dispersion profiles used to create the cubes containing the sources. These are then lensed forward to build the lensed mock data. The dashed red lines are the functional forms that best describe the kinematic priors, while the solid blue line for M3–M7 and M9 shows the fit to the input data using the same functional forms as those used for the kinematic priors at the 3D lens modelling stage. The orange bands for Vrot and σgas are obtained by error propagation from the uncertainties of the parameters that defined the rotation curves and velocity dispersion profiles, while the light orange bands for σgas take into account also the contribution from the spectral resolution (see Section 5 for further details). In the velocity dispersion profile of M3, the orange band is too thin (0.25 km s−1) to be visible, see the Discussion in Section 5.3 for further details.
Figure 3.

Rotation curves and velocity dispersions for the mock data set M1–M9. The blue squares are the input rotation curves and velocity dispersion profiles used to create the cubes containing the sources. These are then lensed forward to build the lensed mock data. The dashed red lines are the functional forms that best describe the kinematic priors, while the solid blue line for M3–M7 and M9 shows the fit to the input data using the same functional forms as those used for the kinematic priors at the 3D lens modelling stage. The orange bands for Vrot and σgas are obtained by error propagation from the uncertainties of the parameters that defined the rotation curves and velocity dispersion profiles, while the light orange bands for σgas take into account also the contribution from the spectral resolution (see Section 5 for further details). In the velocity dispersion profile of M3, the orange band is too thin (0.25 km s−1) to be visible, see the Discussion in Section 5.3 for further details.

Table 2.

For each model in column 1, we show the assumptions on the input (second column) and recovered (third column) shapes for the rotation curves. The fourth and fifth columns show the median uncertainties on the lens and kinematic parameters. The sixth and seventh columns show the relative accuracies for the lens and kinematic parameters, respectively.

Mock data setInput RCModel RCMedian uncertaintyMedian uncertaintyMedian accuracyMedian accuracy
on |$\boldsymbol{\eta _{\mathrm{lens}}}$|on |$\boldsymbol{\eta _{\mathrm{kin}}}$|on |$\boldsymbol{\eta _{\mathrm{lens}}}$|on |$\boldsymbol{\eta _{\mathrm{kin}}}$|
 per cent per cent per cent per cent
M1ArctangentArctangent273.62.3
M2HyperbolicHyperbolic5131.32.8
M3Equation (26)Multiparameter532.81.7
M4NGC 2976Multiparameter591.53.0
M5NGC 3198Hyperbolic380.50.5
M6NGC 6674Multiparameter691.01.1
M7NGC 3198Hyperbolic862.43.1
 + low inclination
M8HyperbolicHyperbolic370.72.0
 + large inclination
M9NGC 3198Hyperbolic671.90.5
 + warp
Mock data setInput RCModel RCMedian uncertaintyMedian uncertaintyMedian accuracyMedian accuracy
on |$\boldsymbol{\eta _{\mathrm{lens}}}$|on |$\boldsymbol{\eta _{\mathrm{kin}}}$|on |$\boldsymbol{\eta _{\mathrm{lens}}}$|on |$\boldsymbol{\eta _{\mathrm{kin}}}$|
 per cent per cent per cent per cent
M1ArctangentArctangent273.62.3
M2HyperbolicHyperbolic5131.32.8
M3Equation (26)Multiparameter532.81.7
M4NGC 2976Multiparameter591.53.0
M5NGC 3198Hyperbolic380.50.5
M6NGC 6674Multiparameter691.01.1
M7NGC 3198Hyperbolic862.43.1
 + low inclination
M8HyperbolicHyperbolic370.72.0
 + large inclination
M9NGC 3198Hyperbolic671.90.5
 + warp
Table 2.

For each model in column 1, we show the assumptions on the input (second column) and recovered (third column) shapes for the rotation curves. The fourth and fifth columns show the median uncertainties on the lens and kinematic parameters. The sixth and seventh columns show the relative accuracies for the lens and kinematic parameters, respectively.

Mock data setInput RCModel RCMedian uncertaintyMedian uncertaintyMedian accuracyMedian accuracy
on |$\boldsymbol{\eta _{\mathrm{lens}}}$|on |$\boldsymbol{\eta _{\mathrm{kin}}}$|on |$\boldsymbol{\eta _{\mathrm{lens}}}$|on |$\boldsymbol{\eta _{\mathrm{kin}}}$|
 per cent per cent per cent per cent
M1ArctangentArctangent273.62.3
M2HyperbolicHyperbolic5131.32.8
M3Equation (26)Multiparameter532.81.7
M4NGC 2976Multiparameter591.53.0
M5NGC 3198Hyperbolic380.50.5
M6NGC 6674Multiparameter691.01.1
M7NGC 3198Hyperbolic862.43.1
 + low inclination
M8HyperbolicHyperbolic370.72.0
 + large inclination
M9NGC 3198Hyperbolic671.90.5
 + warp
Mock data setInput RCModel RCMedian uncertaintyMedian uncertaintyMedian accuracyMedian accuracy
on |$\boldsymbol{\eta _{\mathrm{lens}}}$|on |$\boldsymbol{\eta _{\mathrm{kin}}}$|on |$\boldsymbol{\eta _{\mathrm{lens}}}$|on |$\boldsymbol{\eta _{\mathrm{kin}}}$|
 per cent per cent per cent per cent
M1ArctangentArctangent273.62.3
M2HyperbolicHyperbolic5131.32.8
M3Equation (26)Multiparameter532.81.7
M4NGC 2976Multiparameter591.53.0
M5NGC 3198Hyperbolic380.50.5
M6NGC 6674Multiparameter691.01.1
M7NGC 3198Hyperbolic862.43.1
 + low inclination
M8HyperbolicHyperbolic370.72.0
 + large inclination
M9NGC 3198Hyperbolic671.90.5
 + warp

The mock data M2 were built assuming an arctangent function for the rotation velocity (blue squares in Fig. 3). The data are modelled twice, once with the same functional form that we have used as input and the other with a hyperbolic tangent function.

We have created the simulated data M3 using the following functional form for the rotation velocity (blue squares in Fig. 3):
(26)
where V1 is the contribution from an isothermal dark-matter halo while V2 represents the contribution from a Sérsic profile with a Sérsic index n = 1 (Freeman 1970). Since our lens modelling code does not include the functional form expressed by equation (26), we model these mock data using the flexible multiparameter function (dashed red line in Fig. 3). After checking that the input rotation curve is well reproduced (see the Discussion in Section 5.3 for further details), we fit the input 1D rotation curve with the multiparameter function (blue solid line in Fig. 3).1 By comparing the results of this fit with the kinematic parameters derived by the 3D lens modelling code, we can then quantify the accuracy of our technique and study the systematic errors that may derive from the choice of the kinematic functional forms.

We have created the mock data sets M4, M5, and M6 using the rotation curves measured for three low-redshift galaxies: NGC 2976, NGC 3198, and NGC 6674 from Lelli, McGaugh & Schombert (2016). For these sources, the values of the rotation parameters listed in Table 1 are obtained by fitting the input data points (blue squares in Fig. 3) with one of the functional forms implemented in our code (blue solid line in Fig. 3). As for M2, this fitting allows us to evaluate an accuracy of the inferred kinematic parameters which is independent of the choice of the parametrization. As discussed in Sections 5.45.6, these mock data are then modelled assuming the multiparameter function for M4 and M6 and the hyperbolic tangent function for M5.

The simulated data M7, M8, and M9 are built to test the limits of our method. M7 and M8 have the same kinematics of M5 and M2 but an inclination angle of 40° and 80°, respectively. M9 has the same kinematics of M5, but it has a strong warp, which causes a change of 30° of its position angle across the galaxy.

5 RESULTS

To test the ability of our method to recover the input lens and kinematic parameters, we model the nine mock data sets introduced in Section 3.2 with the new modelling technique described in Section 2.1. All assumptions made during the modelling procedure were discussed in Section 4.

We obtain the uncertainties on the inferred parameters using multinest (see Section 2.4). For each parameter, we adopt priors that are flat in the intervals |$[\boldsymbol{\overline{\eta }_{\mathrm{lens/kin}}}-0.2\boldsymbol{\overline{\eta }_{\mathrm{lens/kin}}}, \boldsymbol{\overline{\eta }_{\mathrm{lens/kin}}}+0.2\boldsymbol{\overline{\eta }_{\mathrm{lens/kin}}}]$|⁠, where |$\boldsymbol{\overline{\eta }_{\mathrm{lens/kin}}}$| are the best-fitting parameters, inferred from the non-linear optimization (see Section 2.1). To be conservative, we report as errors in the parameters the sum in quadrature of the following two contributions: the 1σ uncertainties on the posterior distributions derived by multinest and the difference between the maximum a posteriori parameter values obtained by multinest, and the non-linear optimizer. This difference arises because, as discussed above, the geometrical parameters that describe the source, i.e. PA and i, are kept fixed during the optimization, while they are left free to vary during the multinest exploration of the parameter space. In most cases, the discrepancy is smaller than 5 per cent, while it reaches a level of 20 per cent in one case that will be discussed separately (see Section 5.6). For the mock data M1, M2, and M8, for which we have used the same functional forms to create and model the data, these errors only account for the statistical uncertainties, while for the other models they also provide an estimate of the systematic errors, related to the choice of parametrization. The median relative uncertainties on |$\boldsymbol{\eta _{\mathrm{lens}}}$| and |$\boldsymbol{\eta _{\mathrm{kin}}}$| for each model are listed in Table 2 (fourth and fifth columns, respectively), while the sixth and seventh columns in Table 2 list the relative accuracies.2

Figs. 5 (for M1) and B1B8 in Appendix  B (for M2–M8) show the comparison between mock observations and best-fitting models. We plot the contour levels of the input source (first column), the simulated lensed data (second column), the inferred lensed model (third column), the normalized image residuals (fourth column), the reconstructed source (fifth column), and the contour levels of the kinematic model (sixth column), for a selected number of spectral channels. We present the input and recovered rotation curves and velocity dispersion profiles with blue squares and dashed red lines, respectively, in Fig. 3. The orange band for the rotation velocities denotes the uncertainties εp, obtained after propagating the uncertainties on the parameters that describe the profiles (see Table 3). To take into account the contribution to the velocity dispersion uncertainties due to the spectral resolution, we compute the uncertainties on the values of σ(R) as |$\sqrt{\epsilon _{\mathrm{p}}^2+\epsilon _{\mathrm{cw}}^2}$| (light orange bands in Fig. 3). εp (orange bands in Fig. 3) has the same meaning defined above for Vrot(R), while εcw is obtained as the FWHM of the channel width divided by 3 × 2.355. The factor of 3 is obtained after testing the effect of the spectral resolution on the recovered velocity dispersion with mock data. We list the inferred lens and kinematic parameters in Table 3. These values should be compared with those reported in Table 1.

Table 3.

Top table: Recovered kinematic parameters that best describe the prior for the nine sources. Bottom table: Recovered lens parameters for the nine mock data sets. The uncertainties are derived using a multinest method, as explained in Section 5.

Recovered kinematic parameters
Mock data setiPAVtRtβξσ0R0σ1
°°km s−1kpckm s−1kpckm s−1
M174.0 ± 1.6260.0 ± 3.2115.0 ± 4.81.86 ± 0.2328.9 ± 3.7−1.1 ± 0.23
M254.4 ± 0.1102.6 ± 2.7219.3 ± 2.20.97 ± 0.0913.4 ± 2.61.15 ± 0.2128.5 ± 4.6
M363.2 ± 0.424.6 ± 1.5155.4 ± 4.227.1 ± 5.21.09 ± 0.0295.8 ± 13.125.9 ± 0.25
M460.0 ± 1.8150.0 ± 5.672.9 ± 5.65.23 ± 0.530.25 ± 0.0351.4 ± 5.343.9 ± 6.8−1.13 ± 0.20
M570.7 ± 5.9282.9 ± 2.5151.2 ± 13.92.07 ± 0.1831.8 ± 3.226.2 ± 2.1
M662.0 ± 3.345.0 ± 4.2220.9 ± 1.20.75 ± 0.130.57 ± 0.034.80 ± 1.7042.6 ± 8.5
M740.2 ± 0.5281.4 ± 2.7151.2 ± 11.42.09 ± 0.1341.1 ± 4.624.9 ± 1.8
M881.2 ± 2.597.5 ± 3.8219.7 ± 2.41.07 ± 0.0913.9 ± 1.01.12 ± 0.1227.8 ± 2.8
M967.6 ± 6.3277.8 ± 2.7/−3.3 ± 1.9152.4 ± 7.82.23 ± 0.1838.2 ± 5.825.9 ± 2.6
Recovered lens parameters
Mock data setκ0θqγΓshθsh
arcsec°°
M11.43 ± 0.01−15.6 ± 0.50.79 ± 0.022.05 ± 0.01−0.044 ± 0.00410.2 ± 0.8
M21.34 ± 0.05155.8 ± 4.90.95 ± 0.062.25 ± 0.100.056 ± 0.011173.38 ± 0.11
M30.97 ± 0.04−0.08 ± 0.010.98 ± 0.012.04 ± 0.060.257 ± 0.02539.1 ± 0.07
M41.31 ± 0.09160.50 ± 6.50.95 ± 0.012.31 ± 0.160.051 ± 0.009173.8 ± 6.8
M50.78 ± 0.0369.6 ± 0.40.83 ± 0.062.03 ± 0.040.096 ± 0.00234.3 ± 3.2
M61.43 ± 0.01−15.65 ± 2.70.82 ± 0.032.08 ± 0.12−0.037 ± 0.00114.76 ± 1.19
M70.83 ± 0.0472.1 ± 6.20.81 ± 0.071.96 ± 0.090.093 ± 0.00233.3 ± 3.1
M81.32 ± 0.02153.7 ± 4.50.96 ± 0.032.28 ± 0.020.056 ± 0.004173.2 ± 4.1
M90.82 ± 0.0772.6 ± 4.40.82 ± 0.021.99 ± 0.130.101 ± 0.00433.9 ± 1.7
Recovered kinematic parameters
Mock data setiPAVtRtβξσ0R0σ1
°°km s−1kpckm s−1kpckm s−1
M174.0 ± 1.6260.0 ± 3.2115.0 ± 4.81.86 ± 0.2328.9 ± 3.7−1.1 ± 0.23
M254.4 ± 0.1102.6 ± 2.7219.3 ± 2.20.97 ± 0.0913.4 ± 2.61.15 ± 0.2128.5 ± 4.6
M363.2 ± 0.424.6 ± 1.5155.4 ± 4.227.1 ± 5.21.09 ± 0.0295.8 ± 13.125.9 ± 0.25
M460.0 ± 1.8150.0 ± 5.672.9 ± 5.65.23 ± 0.530.25 ± 0.0351.4 ± 5.343.9 ± 6.8−1.13 ± 0.20
M570.7 ± 5.9282.9 ± 2.5151.2 ± 13.92.07 ± 0.1831.8 ± 3.226.2 ± 2.1
M662.0 ± 3.345.0 ± 4.2220.9 ± 1.20.75 ± 0.130.57 ± 0.034.80 ± 1.7042.6 ± 8.5
M740.2 ± 0.5281.4 ± 2.7151.2 ± 11.42.09 ± 0.1341.1 ± 4.624.9 ± 1.8
M881.2 ± 2.597.5 ± 3.8219.7 ± 2.41.07 ± 0.0913.9 ± 1.01.12 ± 0.1227.8 ± 2.8
M967.6 ± 6.3277.8 ± 2.7/−3.3 ± 1.9152.4 ± 7.82.23 ± 0.1838.2 ± 5.825.9 ± 2.6
Recovered lens parameters
Mock data setκ0θqγΓshθsh
arcsec°°
M11.43 ± 0.01−15.6 ± 0.50.79 ± 0.022.05 ± 0.01−0.044 ± 0.00410.2 ± 0.8
M21.34 ± 0.05155.8 ± 4.90.95 ± 0.062.25 ± 0.100.056 ± 0.011173.38 ± 0.11
M30.97 ± 0.04−0.08 ± 0.010.98 ± 0.012.04 ± 0.060.257 ± 0.02539.1 ± 0.07
M41.31 ± 0.09160.50 ± 6.50.95 ± 0.012.31 ± 0.160.051 ± 0.009173.8 ± 6.8
M50.78 ± 0.0369.6 ± 0.40.83 ± 0.062.03 ± 0.040.096 ± 0.00234.3 ± 3.2
M61.43 ± 0.01−15.65 ± 2.70.82 ± 0.032.08 ± 0.12−0.037 ± 0.00114.76 ± 1.19
M70.83 ± 0.0472.1 ± 6.20.81 ± 0.071.96 ± 0.090.093 ± 0.00233.3 ± 3.1
M81.32 ± 0.02153.7 ± 4.50.96 ± 0.032.28 ± 0.020.056 ± 0.004173.2 ± 4.1
M90.82 ± 0.0772.6 ± 4.40.82 ± 0.021.99 ± 0.130.101 ± 0.00433.9 ± 1.7
Table 3.

Top table: Recovered kinematic parameters that best describe the prior for the nine sources. Bottom table: Recovered lens parameters for the nine mock data sets. The uncertainties are derived using a multinest method, as explained in Section 5.

Recovered kinematic parameters
Mock data setiPAVtRtβξσ0R0σ1
°°km s−1kpckm s−1kpckm s−1
M174.0 ± 1.6260.0 ± 3.2115.0 ± 4.81.86 ± 0.2328.9 ± 3.7−1.1 ± 0.23
M254.4 ± 0.1102.6 ± 2.7219.3 ± 2.20.97 ± 0.0913.4 ± 2.61.15 ± 0.2128.5 ± 4.6
M363.2 ± 0.424.6 ± 1.5155.4 ± 4.227.1 ± 5.21.09 ± 0.0295.8 ± 13.125.9 ± 0.25
M460.0 ± 1.8150.0 ± 5.672.9 ± 5.65.23 ± 0.530.25 ± 0.0351.4 ± 5.343.9 ± 6.8−1.13 ± 0.20
M570.7 ± 5.9282.9 ± 2.5151.2 ± 13.92.07 ± 0.1831.8 ± 3.226.2 ± 2.1
M662.0 ± 3.345.0 ± 4.2220.9 ± 1.20.75 ± 0.130.57 ± 0.034.80 ± 1.7042.6 ± 8.5
M740.2 ± 0.5281.4 ± 2.7151.2 ± 11.42.09 ± 0.1341.1 ± 4.624.9 ± 1.8
M881.2 ± 2.597.5 ± 3.8219.7 ± 2.41.07 ± 0.0913.9 ± 1.01.12 ± 0.1227.8 ± 2.8
M967.6 ± 6.3277.8 ± 2.7/−3.3 ± 1.9152.4 ± 7.82.23 ± 0.1838.2 ± 5.825.9 ± 2.6
Recovered lens parameters
Mock data setκ0θqγΓshθsh
arcsec°°
M11.43 ± 0.01−15.6 ± 0.50.79 ± 0.022.05 ± 0.01−0.044 ± 0.00410.2 ± 0.8
M21.34 ± 0.05155.8 ± 4.90.95 ± 0.062.25 ± 0.100.056 ± 0.011173.38 ± 0.11
M30.97 ± 0.04−0.08 ± 0.010.98 ± 0.012.04 ± 0.060.257 ± 0.02539.1 ± 0.07
M41.31 ± 0.09160.50 ± 6.50.95 ± 0.012.31 ± 0.160.051 ± 0.009173.8 ± 6.8
M50.78 ± 0.0369.6 ± 0.40.83 ± 0.062.03 ± 0.040.096 ± 0.00234.3 ± 3.2
M61.43 ± 0.01−15.65 ± 2.70.82 ± 0.032.08 ± 0.12−0.037 ± 0.00114.76 ± 1.19
M70.83 ± 0.0472.1 ± 6.20.81 ± 0.071.96 ± 0.090.093 ± 0.00233.3 ± 3.1
M81.32 ± 0.02153.7 ± 4.50.96 ± 0.032.28 ± 0.020.056 ± 0.004173.2 ± 4.1
M90.82 ± 0.0772.6 ± 4.40.82 ± 0.021.99 ± 0.130.101 ± 0.00433.9 ± 1.7
Recovered kinematic parameters
Mock data setiPAVtRtβξσ0R0σ1
°°km s−1kpckm s−1kpckm s−1
M174.0 ± 1.6260.0 ± 3.2115.0 ± 4.81.86 ± 0.2328.9 ± 3.7−1.1 ± 0.23
M254.4 ± 0.1102.6 ± 2.7219.3 ± 2.20.97 ± 0.0913.4 ± 2.61.15 ± 0.2128.5 ± 4.6
M363.2 ± 0.424.6 ± 1.5155.4 ± 4.227.1 ± 5.21.09 ± 0.0295.8 ± 13.125.9 ± 0.25
M460.0 ± 1.8150.0 ± 5.672.9 ± 5.65.23 ± 0.530.25 ± 0.0351.4 ± 5.343.9 ± 6.8−1.13 ± 0.20
M570.7 ± 5.9282.9 ± 2.5151.2 ± 13.92.07 ± 0.1831.8 ± 3.226.2 ± 2.1
M662.0 ± 3.345.0 ± 4.2220.9 ± 1.20.75 ± 0.130.57 ± 0.034.80 ± 1.7042.6 ± 8.5
M740.2 ± 0.5281.4 ± 2.7151.2 ± 11.42.09 ± 0.1341.1 ± 4.624.9 ± 1.8
M881.2 ± 2.597.5 ± 3.8219.7 ± 2.41.07 ± 0.0913.9 ± 1.01.12 ± 0.1227.8 ± 2.8
M967.6 ± 6.3277.8 ± 2.7/−3.3 ± 1.9152.4 ± 7.82.23 ± 0.1838.2 ± 5.825.9 ± 2.6
Recovered lens parameters
Mock data setκ0θqγΓshθsh
arcsec°°
M11.43 ± 0.01−15.6 ± 0.50.79 ± 0.022.05 ± 0.01−0.044 ± 0.00410.2 ± 0.8
M21.34 ± 0.05155.8 ± 4.90.95 ± 0.062.25 ± 0.100.056 ± 0.011173.38 ± 0.11
M30.97 ± 0.04−0.08 ± 0.010.98 ± 0.012.04 ± 0.060.257 ± 0.02539.1 ± 0.07
M41.31 ± 0.09160.50 ± 6.50.95 ± 0.012.31 ± 0.160.051 ± 0.009173.8 ± 6.8
M50.78 ± 0.0369.6 ± 0.40.83 ± 0.062.03 ± 0.040.096 ± 0.00234.3 ± 3.2
M61.43 ± 0.01−15.65 ± 2.70.82 ± 0.032.08 ± 0.12−0.037 ± 0.00114.76 ± 1.19
M70.83 ± 0.0472.1 ± 6.20.81 ± 0.071.96 ± 0.090.093 ± 0.00233.3 ± 3.1
M81.32 ± 0.02153.7 ± 4.50.96 ± 0.032.28 ± 0.020.056 ± 0.004173.2 ± 4.1
M90.82 ± 0.0772.6 ± 4.40.82 ± 0.021.99 ± 0.130.101 ± 0.00433.9 ± 1.7

In Fig. 4, we show the comparison between the input and recovered flat velocities Vflat and average velocity dispersions 〈σgas〉. The values of Vflat are obtained as the average of the rotation velocities at large radii, while the 〈σgas〉 are obtained by averaging the values of σgas(R) starting from R = 0 kpc. The error bars in Fig. 4 take into account the contribution of both the uncertainties on the values of Vrot and σgas at each radius, as showed by the orange and light orange bands in Fig. 3, and the standard deviation. The flat part of the rotation curves is correctly reproduced for all mock data sets. In particular, with our technique, we are able to recover Vflat not only for the galaxies for which the input rotation curves are described by functional forms, but also for the rotation curves taken from real galaxies. This test ensures that the functional forms implemented in our code are flexible enough to reproduce the variety of shape of rotation curves (from dwarf to massive galaxies). Even if the details in the inner region could be lost (see Section 5.3), the physical parameters that depend on Vflat can be exactly recovered. The values of 〈σgas〉 are recovered with a great accuracy, even if they are more affected by the spectral resolution.

Left: Recovered versus input values of Vflat for the nine mock data sets. Some points are shifted both on x- and y-axis by the same quantity for a better visualization of all the points. The green line represents a 1:1 relation between quantities on x- and y-axis. Right: Same as in the left-hand panel, but for 〈σgas〉. The errors bars take into account both the standard uncertainties due to error propagation and the standard deviation due to fact that these are averaged quantities.
Figure 4.

Left: Recovered versus input values of Vflat for the nine mock data sets. Some points are shifted both on x- and y-axis by the same quantity for a better visualization of all the points. The green line represents a 1:1 relation between quantities on x- and y-axis. Right: Same as in the left-hand panel, but for 〈σgas〉. The errors bars take into account both the standard uncertainties due to error propagation and the standard deviation due to fact that these are averaged quantities.

The rows show some representative channel maps for the simulated data set M1. Column 1: Input source. Column 2: Mock lensed data. Column 3: Lensed model and the corresponding critical curves. Column 4: Normalized residuals obtained as the ratio between the difference of the data and the model and the corresponding noise. Column 5: Reconstructed source and caustics. Column 6: Kinematic prior used to constrain the source reconstruction. The contour levels in the first and sixth columns are set at n = 0.1, 0.2, 0.4, 0.6, 0.8 times the value of the maximum flux.
Figure 5.

The rows show some representative channel maps for the simulated data set M1. Column 1: Input source. Column 2: Mock lensed data. Column 3: Lensed model and the corresponding critical curves. Column 4: Normalized residuals obtained as the ratio between the difference of the data and the model and the corresponding noise. Column 5: Reconstructed source and caustics. Column 6: Kinematic prior used to constrain the source reconstruction. The contour levels in the first and sixth columns are set at n = 0.1, 0.2, 0.4, 0.6, 0.8 times the value of the maximum flux.

Below, we provide a detailed discussion on the modelling results for each of the nine simulated data sets. The reader not interested in the details can skip to Section 5.10, where we address some key issues related to radial motions, SNRs, and changes in the centre coordinates and systemic velocities and to Section 6 where we summarize the results of our tests.

5.1 Mock data set M1

The simulated data M1 were created assuming an arctangent function for the rotation velocity and a dispersion curve that is linearly declining from a value of 30 km s−1 at R = 0 kpc to 21 km s−1 at R = 5.9 kpc. The source position angle also changes linearly from 270° in the inner regions to 260° in the outer areas.

We model the simulated data with the same functional forms used to create them. Since the small change in the position angle is not detectable by a visual analysis of the zeroth-moment map, resulting from the first step of the optimization scheme (see Section 2.4), we decided to keep it fixed to the constant value of 260° during the following steps. We have found that this assumption does not significantly influence the derived kinematics. The inferred parameters that define the rotation velocity and the velocity dispersion have median relative uncertainties of 7 per cent and are within 2σ from the input values. The inferred lens parameters, characterized by median relative uncertainties of 2 per cent, are within 1σ from the input values, with the only exceptions of the lens and external-shear position angles θ and θsh that differ by 5.7σ and 3.9σ, respectively, from the input values. This result is related to the fact that the lens is very close to being spherical and the shear strength almost negligible.

5.2 Mock data set M2

We have created the simulated data M2 using a hyperbolic tangent function for the source rotation curve and an exponential function for its velocity dispersion.

First, we model this data set assuming the same functional forms used as input. This choice produces normalized residuals that are of the order of 0.5 per cent (see the fourth column in Fig. B1). The inferred lens parameters have a median relative uncertainty of 5 per cent, while the recovered kinematic parameters have median relative uncertainties of 13 per cent (Table 3). The largest contribution to the kinematic uncertainties comes from the velocity dispersion parameters, due to the limited spectral resolution (channel width of ∼36.8 km s−1) of these data (see the orange band in Fig. 3). The input lens and the kinematic parameters are within the 1σ uncertainties from the recovered values. To test our capability to distinguish between different forms of parametrization, we have also modelled this data set with an arctangent function. We have found that under this assumption the residuals get worse (see the third column in Fig. 6), reaching a maximum value of ∼6σ. We have then computed the marginalized Bayesian evidence to compare and rank these two models. As anticipated in Section 2.4, the marginalized evidence in equation (14) allows us to quantify how well a model mi is supported by the data against another model mj. This quantification is expressed in terms of the Bayes factor, Δlog Eij = log Ei − log Ej. We find that the Bayes factor is of the order of 1400 against the arctangent model, meaning that the hyperbolic tangent function for the rotation curve is largely supported by the data. We can conclude, therefore, that the data contain enough information for us to infer the most suitable shape.

Same as Fig. B1, for a rotation velocity described by an arctangent function.
Figure 6.

Same as Fig. B1, for a rotation velocity described by an arctangent function.

5.3 Mock data set M3

The lens system M3 was created assuming a rotation curve for the background galaxy described by the functional form in equation (26). This function is not implemented in our code. The velocity dispersion was assumed to be constant.

We model these simulated data using the multiparameter function given by equation (20), which is the most flexible function that we have implemented. We find that the lens parameters are recovered with a median relative uncertainty of 5 per cent and are within 1σ from the input values. Our constraints on the lens–mass model are therefore not significantly influenced by our assumptions on the source prior. The inferred parameters that define the rotation curve (Vt, Rt, β, ξ) in Table 3 should be compared with those reported in Table 1, obtained by fitting the input 1D rotation curve (blue squares in Fig. 3) using the multiparameter function (solid blue line in Fig. 3). The inferred kinematic parameters have median uncertainties of 3 per cent and they are within 2σ from the fitted values. The only exception is the recovered velocity dispersion that is more than 3σ away. However, this discrepancy is due to an underestimation of the uncertainties that do not include the systematic errors introduced by the spectral resolution. If we take into account the uncertainties due to spectral resolution, εcw in Section 5, the recovered dispersion profile is in agreement within 1σ with the input profile (see the light orange band in Fig. 3). We find that both the fit to the input rotation curve (solid blue line in Fig. 3) and the rotation curve derived from our lens modelling technique (red dashed line in Fig. 3) are a poor description of the inner regions of the real curve (see the blue squares in Fig. 3). Despite its flexibility, the multiparameter function does not allow us to correctly reproduce the peculiarity of the data in the central regions. Correspondingly, we find that the overall fit to the data has systematic residuals that reach maximum values of ∼5σ (see the third column in Fig. B2). However, the rotation velocity at the outer regions is well reproduced, ensuring that even if the details of the inner regions are lost, the physical parameters that depend on the asymptotic velocity (e.g. angular momentum and dynamical mass) can still be recovered with good accuracy (see Fig. 4).

5.4 Mock data set M4

The input values of the rotation velocity for this system are taken from the rotation curve of the nearby dwarf galaxy NGC 2976 (Lelli et al. 2016). This choice allows us to test whether the assumed functional forms are good enough to reproduce real rotational curves. A linear equation describes the velocity dispersion curve.

During the modelling phase, we use the multiparameter function, equation (20), to describe the rotation velocity, while for the velocity dispersion we use the same functional form used as input. As for the simulated data M3, to have a quantitative estimate of the accuracy on the derived kinematics, we first fit the input 1D rotation curve with the same functional form used in the 3D lens modelling process (solid blue line in Fig. 3). The recovered kinematic parameters have a median relative uncertainty of 9 per cent, while the lens parameters have a median relative uncertainty of 5 per cent (Table 3). The inferred lens parameters are within the 2σ errors from the input values. The kinematic parameters are within 1σ from the values derived by fitting the 1D rotation curve.

5.5 Mock data set M5

As for the simulated data set M4, we create M5 using the rotation curve of a real galaxy as input, in this case NGC 3198 (Lelli et al. 2016). The input functional form for the velocity dispersion is an exponential function, equation (23), with σ1 = 0.0 km s−1.

At the modelling stage, we use the hyperbolic tangent function for the rotation curve and an exponential function with σ1 fixed at zero km s−1 for the velocity dispersion. As for the simulated data M3 and M4, we start by fitting the 1D rotation curve with the same functional form used for the 3D lens modelling. We find that the hyperbolic tangent function provides a good enough description of the data. The normalized residuals (fourth column in Fig. B4), indeed, do not show any systematic features, usually indicative of wrong assumptions in the building of the prior (e.g. see the models M2 and M3 in Sections 5.2 and 5.3). The recovered lens and kinematic parameters have median relative uncertainties of 3 and 8 per cent, respectively, and they are within 1σ from the input values.

5.6 Mock data set M6

The simulated data M6 were created using the rotation curve of the nearby galaxy NGC 6674 (Lelli et al. 2016), while for the velocity dispersion curve we have used a constant value of 38 km s−1. When modelling the data, the prior is built assuming the multiparameter function for the rotation curve, while the dispersion is assumed to be constant.

The input lens parameters (Table 1) are within the 1σ uncertainties from the recovered values (Table 3). The kinematic parameters inferred by the 3D lens modelling technique are within 1σ from the values derived by fitting the 1D rotation curve directly. We find that the inferred lens and kinematic parameters have a median relative uncertainty of 6 and 9 per cent, respectively. In particular, the velocity dispersion, σ0, has an uncertainty of 20 per cent. The major contribution to this error is the difference between the maximum a posteriori parameter value of 51.1 km s−1 obtained by multinest and the corresponding value of 42.6 km s−1 obtained by the non-linear optimizer (see Section 5). However, given the channel width of 33.9 km s−1 for this system, we believe the discrepancy not to be significant.

5.7 Mock data set M7

The derivation of the rotation curve for low-inclination galaxies is challenging for any kinematic-fitting algorithm. For example, for i = 40°, an error as small as ±5° in the estimation of the inclination angle can lead to a significant underestimation/overestimation of the rotation velocity as large as 10 per cent. To test the reliability of our modelling technique when the background source is a low-inclination galaxy, we have created the mock data M7 with the same lens and kinematic parameters of M5, but with an inclination angle for the source of 40°, instead of 68°.

As described above, we first model the zeroth-moment map and then use the recovered lens model parameters to derive a 3D model of the source. The latter is then analysed with 3Dbarolo to obtain initial guesses for the source geometrical parameters. For the mock data M7, this results in a value of i = 41.5°, in close agreement with the input value. Subsequently, by applying our 3D lens modelling analysis to the full lensed data cube, we derive an inclination angle of 40.2°. The inferred lens and kinematic parameters have median relative uncertainties of 8 and 6 per cent, respectively, differing by 1σ and 2σ from the input values. We can conclude, therefore, that the accurate reconstruction of the zeroth-moment map allows us to obtain a reasonable initial estimate of the inclination and, as a consequence, the kinematic properties of the galaxy are correctly recovered.

5.8 Mock data set M8

Di Teodoro & Fraternali (2015) have shown that for large inclinations, i > 75°, the inner points of the rotation curve can be underestimated and that this effect can be more significant for a flat rather than for a solid-body rotation curve. This is due to the fact that 3Dbarolo works ring by ring. However, we note that for a value of the inclination angle >75°, the errors on the inclination have little impact on the derived rotation curve, due to the sinusoidal dependence between the line-of-sight velocity and the inclination, see equation (17).

These mock data were built to study how an extreme value of the inclination angle affects the reconstruction of the source kinematics. For this reason, we have built the mock data M8 using the same lens and kinematic models as those used for M2, but assuming an inclination angle of 80°. In particular, we focus on M2 because it has a flat rotation curve.

The inferred values that describe the rotation velocity differ by ∼4 per cent from the input values, reproducing very well also the inner regions of the curve (see the red dashed line in Fig. 3). We can conclude, therefore, that the use of functional forms for the rotation velocity allows us to reproduce the inner regions of an edge-on galaxy better than a ring by ring method. Moreover, the inferred lens and kinematic parameters are characterized by median relative uncertainties of 3 and 7 per cent, respectively, and they are within 2σ from the input values.

5.9 Mock data set M9

The input lens and kinematic models are the same as those used to create the simulated data M5, but the input geometry of the source is different. In particular, the position angle has a strong warp, and it decreases linearly from a value of 280° at R = 0 kpc to 250° at the outermost radius.

Interestingly, we find that the presence of the warp is already revealed at the first step of our optimization strategy (Fig. 7), where we focus on the lens modelling of the zeroth moment [i.e. point (i), Section 2.4]. From a 3Dbarolo analysis of the derived source cube, we then obtain a position angle that changes linearly with a slope of 2.6° kpc−1 from an inner value of 278°. We then apply our 3D lens modelling technique with a position angle that changes linearly. The two parameters that describe this change are left free to vary, starting from the initial guesses found by 3Dbarolo. The best-fitting slope that describes the variation of the PA is 3.3° kpc−1, which differs by ∼6 per cent from the input value of 3.5° kpc−1. The inferred value of PA at R = 0 kpc is 277.8°, differing by ∼1 per cent from the input value of 280.0°. The inferred kinematic parameters have a median uncertainty of 7 per cent, while the lens parameters have a median uncertainty of 6 per cent. Both the lens and the kinematic parameters are within 1σ from the input values.

Zeroth-moment map for the reconstructed source M9, resulting after the first step [i.e. point (i), Section 2.4] of our optimization strategy.
Figure 7.

Zeroth-moment map for the reconstructed source M9, resulting after the first step [i.e. point (i), Section 2.4] of our optimization strategy.

5.10 Further tests

Observational evidence seems to indicate that physical processes such as disc turbulence, gas accretion, and subsequent disc instabilities are more prevalent at high redshift (e.g. Förster Schreiber et al. 2006; Law et al. 2009; Wisnioski et al. 2015). As a consequence, the contribution of non-rotational components could have a significant impact on the kinematic of high-redshift galaxies. The presence of significant radial motions, which are not included in the current analysis, could, in principle, limit the applicability of our technique to high-|$z$| galaxies. To quantify this issue, we have created a simulated data set that has the same lens and kinematic models as M1 but includes radial motions of 25 km s−1. We note that this value is larger than what is typically observed at low redshift (i.e. ≲10 km s−1; Trachternach et al. 2008), while, to date, there are no studies of radial motion in high-redshift galaxies. We have compared the lensed images of these simulated data with those of M1 for different values of angular resolution. Even in the ideal case of no observational noise, we have found no significant discrepancy, with a relative difference between the two lensed images of the order of ∼2 per cent, for the highest angular resolution case. We can conclude that, although non-circular motions could contribute to the overall kinematics, they are mostly not detectable at the current angular and spectral resolution.

To test how the SNR of the data affects the accuracy with which the lens–mass and source–kinematic parameters are recovered, we have re-simulated the mock data M1 with five different noise levels, obtained using different exposure times (see Fig. A2 in Appendix  A). As shown in the left-hand panel of Fig. 8, for an SNR3 below ∼3, the relative difference between the input and the recovered values is higher than 30 per cent both for the lens (orange circles) and for the kinematic parameters (magenta empty squares). If we focus on the relative difference between the input and the output Vflat (green triangles on the right-hand panel of Fig. 8), we conclude that we are still able to recover it with an accuracy of the order of 90 per cent even for the mock data set with the lowest SNR. The relative differences between the input and the recovered values of 〈σgas〉 (blue empty diamonds on the right-hand panel of Fig. 8) are smaller than ∼30 per cent for an SNR larger than 3.

Left: Relative difference between the input and the recovered lens (orange circles) and kinematic (magenta empty squares) parameters as a function of the SNR for different data-quality realizations of M1 mock data set. Right: Same as in the left-hand panel but for Vflat (green triangles) and 〈σgas〉 (blue empty diamonds).
Figure 8.

Left: Relative difference between the input and the recovered lens (orange circles) and kinematic (magenta empty squares) parameters as a function of the SNR for different data-quality realizations of M1 mock data set. Right: Same as in the left-hand panel but for Vflat (green triangles) and 〈σgas〉 (blue empty diamonds).

So far, we have modelled all the simulated data sets with a fixed systemic velocity of zero km s−1 and the centre of the kinematic prior set at the flux-weighted average position of the zeroth-moment map. However, for high-redshift galaxies, one expects an error on the centre of the order of 5–10 per cent and on the systemic velocity of the order of the channel width. We have, therefore, repeated the analysis of M1 by fixing the coordinates of the centre to values that are offset from the real ones by 10 per cent and with values of the systemic velocity that are offset by the channel width velocity from the real value. We have found that an incorrect choice of the systemic velocity does not affect our results significantly, while changes in the centre have a significant effect only if both coordinates are shifted by 10 per cent relative to the real values in the same direction.

Because of the small FOV of many IFU instruments, complete imaging of the lensed emission is often not possible, instead, only a limited part of the arc is observed. Here, we show how this observational limitation can strongly affect the reconstruction of the source morphology. The first row of Fig. 9 shows the emission of a background disc galaxy as lensed by a power-law mass distribution. In the same figure, we present the source that is reconstructed by considering an increasingly smaller part of the data. In particular, on the second row, we have modelled the entire set of images, on the third one we have excluded the counter image, on the fourth we have modelled only a small region of the main arc and the counter image, while on the fifth one only a small region of the main arc was taken into account. We find that one can safely ignore the counter image only if all images are modelled (see the third row of Fig. 9). However, failure to observe the entire main arc leads to an incorrect source reconstruction and a wrong estimate of both the centre of the galaxy and its extension, strongly affecting the derived kinematics (see the fifth row of Fig. 9). In this case, including the counter image is fundamental for a better reconstruction of the source (see the fourth row in Fig. 9). This result is due to the extension of the source, such that different regions of the galaxy are lensed into different regions and/or number of the images, depending on the lens system configuration. We conclude, therefore, that the analysis of the full set of lensed images (obtainable, for example, with mosaic observations) is crucial for a reliable derivation of the source properties.

First row: on the left side the input lensed data obtained by lensing forward the disc galaxy on the right side. From second to fifth rows: reconstruction of the source (right-hand panels) obtained by modelling different parts of the mock lensed data (left-hand panels). Second row: all data are considered for the source reconstruction. Third row: the counter image is excluded. Fourth row: the counter image and only part of the arc are modelled. Fifth row: only part of the arc is modelled to reconstruct the source.
Figure 9.

First row: on the left side the input lensed data obtained by lensing forward the disc galaxy on the right side. From second to fifth rows: reconstruction of the source (right-hand panels) obtained by modelling different parts of the mock lensed data (left-hand panels). Second row: all data are considered for the source reconstruction. Third row: the counter image is excluded. Fourth row: the counter image and only part of the arc are modelled. Fifth row: only part of the arc is modelled to reconstruct the source.

6 SUMMARY AND CONCLUSION

In this paper, we have presented a new method to model the kinematics of strong gravitationally lensed galaxies from 3D emission-line observations. The technique is entirely Bayesian: a Bayesian penalty function allows us to simultaneously infer the lens–mass and the source–kinematic properties from the same 3D data cube, while the Bayesian marginalized evidence enables us to rank and compare different lens and kinematic models. This new approach is also grid-based and hierarchical. The source is reconstructed on a magnification-dependent Delaunay tessellation, and its kinematics represent the hyper-parameter of the source prior. The primary focus of this method is to provide a robust approach for studying the resolved kinematic properties of high-redshift lensed galaxies. In this respect, it represents a significant improvement over past and recent approaches. Indeed, since the kinematic model is a hyper-parameter of the reconstructed source, the inferred kinematic properties are not influenced by the poor understanding of the errors and spatial resolution on the unlensed plane. Furthermore, the lens–mass distribution is derived consistently from the same 3D data cube.

To test the capabilities of this new method in inferring the correct model parameters, we have studied a sample of nine simulated lensed galaxies as they would be observed with the OSIRIS spectrograph. These galaxies are characterized by a variety of rotation curves and geometries. In particular, we have focused on rotation curves that are described either by different functional forms (i.e. simulated data M1–M3) or derived from real galaxies (i.e. data sets M4–M6). We have found that this variety of shapes for the input rotation curves (from solid-body to flat rotation curves) is robustly recovered. In particular, the median relative accuracy on the inferred lens and kinematic parameters are at the level of ∼1 and ∼2 per cent, respectively.

Focussing on the extreme cases of a low-inclination (40°, M7) and edge-on (80°, M8) galaxy, we have also studied how the inclination of the source affects the accuracy of the reconstructions. We have found that the kinematic parameters can be recovered with a median accuracy of 1 and 2 per cent, respectively, if a reasonable initial estimate of the inclination can be obtained from the analysis of the zeroth-moment map. We have then tested the capability of our code to identify the presence of a warp. We have concluded that warps as large as 30 deg can significantly affect the lensed data. However, we are still able to recover the kinematic parameters with an accuracy of 3 per cent. The rotation curves in all cases are accurately reconstructed, therefore the most important physical parameters of the source galaxy (e.g. the dynamical mass, the angular momentum) can be correctly inferred.

We have also investigated the effect of increasing noise and concluded that the parameters are recovered with an accuracy better than 30 per cent whenever the SNR is higher than ∼3. The flat part of the rotation velocity is recovered with an accuracy of the order of 90 per cent even when the SNR is ∼3. Finally, we have examined the effect of strong radial motions and found it to be irrelevant for the typical angular and spectral resolution of IFU observations. From these extensive tests, we can conclude that the method presented in this paper offers a novel and robust way to study the gas kinematics of high-redshift lensed galaxies using data from the last generation of IFUs. Taking advantages of strong gravitational lensing, we can study the kinematic properties of galaxies at |$z$| ∼ 2–3 with spatial resolutions and SNR not achievable for unlensed galaxies, even with current observational technique. Moreover, gravitational lensing offers the unique opportunity to study galaxies in the low-stellar-mass range, which is almost impossible for studies targeting unlensed galaxies.

In this paper, we have focused on galaxy-scale lenses. The formalism of our method is also applicable to cluster lenses, although with a more complex parametrization of the lensing potential. However, as the mass distribution of galaxy clusters is more complicated, we expect in this case larger uncertainties on both the lens parameters and the source kinematics.

We have also only focused on optical observations. However, our technique, being an extension to the 3D domain of the methods developed by Vegetti & Koopmans (2009) and Rybak et al. (2015a), can also be used for the modelling of emission lines with radio interferometric observations. While our current implementation does not include functional forms for the analysis of radial motions and/or outflows, we plan to implement these in a follow-up work. In the near future, we will apply our novel technique to the analysis of both ionized and molecular gas emission lines to study the extended physical and kinematical properties of high-redshift lensed galaxies.

The code presented in this paper is not publicly available; however, the reader interested in using the code can contact the first author.

ACKNOWLEDGEMENTS

We thank an anonymous referee for a very careful report and for his/her comments. We thank Cristopher Fassnacht and Tucker Jones for their helpful comments and suggestions on the mock data. Simona Vegetti has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement no. 758853). Enrico Di Teodoro acknowledges the support of the Australian Research Council through grant DP160100723.

Footnotes

1

The fitting of the input rotation curves for M3 to M6 was done using the Python package Scipy.optimize.

2

The median accuracies on |$\boldsymbol{\eta _{\mathrm{lens}}}$| and |$\boldsymbol{\eta _{\mathrm{kin}}}$| are calculated taking into account the relative difference between the input (Table 1) and recovered parameters (Table 3) as: (input-recovered)/input.

3

The values of the SNR are calculated as median of the SNRi for each spectral channel i of the data cube. The latter is calculated as SNR|$_{\mathrm{i}}=\sum _{\mathrm{k}}S_{\mathrm{k i}}/\sqrt{N_{\mathrm{k i}}^2}$|⁠, where the sum is over the pixels of the |$\mathrm{i\text{}}$|th channel, Ski is the signal at |$\mathrm{k\text{}}$|th pixel, and Nki is its noise.

REFERENCES

Andersen
D. R.
,
Bershady
M. A.
,
Sparke
L. S.
,
Gallagher
J. S.
III
,
Wilcots
E. M.
,
2001
,
ApJ
,
551
,
L131

Blumenthal
G. R.
,
Faber
S. M.
,
Flores
R.
,
Primack
J. R.
,
1986
,
ApJ
,
301
,
27

Bolton
A. S.
,
Burles
S.
,
Koopmans
L. V. E.
,
Treu
T.
,
Moustakas
L. A.
,
2006
,
ApJ
,
638
,
703

Courteau
S.
,
1997
,
AJ
,
114
,
2402

Courteau
S.
,
Dutton
A. A.
,
2015
,
ApJ
,
801
,
L20

Cresci
G.
et al. ,
2009
,
ApJ
,
697
,
115

Di Teodoro
E. M.
et al. ,
2018
,
MNRAS
,
476
,
804

Di Teodoro
E. M.
,
Fraternali
F.
,
2015
,
MNRAS
,
451
,
3021

Di Teodoro
E. M.
,
Fraternali
F.
,
Miller
S. H.
,
2016
,
A&A
,
594
,
A77

Dutton
A. A.
et al. ,
2011
,
MNRAS
,
416
,
322

Dye
S.
et al. ,
2015
,
MNRAS
,
452
,
2258

Epinat
B.
,
Amram
P.
,
Balkowski
C.
,
Marcelin
M.
,
2010
,
MNRAS
,
401
,
2113

Feroz
F.
,
Hobson
M. P.
,
Bridges
M.
,
2009
,
MNRAS
,
398
,
1601

Feroz
F.
,
Hobson
M. P.
,
Cameron
E.
,
Pettitt
A. N.
,
2013
, preprint (arXiv:1306.2144)

Förster Schreiber
N. M.
et al. ,
2006
,
ApJ
,
645
,
1062

Förster Schreiber
N. M.
et al. ,
2009
,
ApJ
,
706
,
1364

Förster Schreiber
N. M.
et al., 2018, preprint (arXiv:1802.07276)

Freeman
K. C.
,
1970
,
ApJ
,
160
,
811

Genzel
R.
et al. ,
2011
,
ApJ
,
733
,
101

Genzel
R.
et al. ,
2017
,
Nature
,
543
,
397

Girard
M.
et al. ,
2018
,
A&A
,
613
,
A72

Harrison
C. M.
et al. ,
2017
,
MNRAS
,
467
,
1965

Jones
T. A.
,
Swinbank
A. M.
,
Ellis
R. S.
,
Richard
J.
,
Stark
D. P.
,
2010a
,
MNRAS
,
404
,
1247

Jones
T.
,
Ellis
R.
,
Jullo
E.
,
Richard
J.
,
2010b
,
ApJ
,
725
,
L176

Jones
T.
,
Ellis
R. S.
,
Richard
J.
,
Jullo
E.
,
2013
,
ApJ
,
765
,
48

Koopmans
L. V. E.
,
2005
,
MNRAS
,
363
,
1136

Krajnović
D.
,
Cappellari
M.
,
de Zeeuw
P. T.
,
Copin
Y.
,
2006
,
MNRAS
,
366
,
787

Lagattuta
D. J.
,
Vegetti
S.
,
Fassnacht
C. D.
,
Auger
M. W.
,
Koopmans
L. V. E.
,
McKean
J. P.
,
2012
,
MNRAS
,
424
,
2800

Lang
P.
et al. ,
2017
,
ApJ
,
840
,
92

Larkin
J.
et al. ,
2006
,
New Astron. Rev
,
50
,
362

Law
D. R.
,
Steidel
C. C.
,
Erb
D. K.
,
2006
,
AJ
,
131
,
70

Law
D. R.
,
Steidel
C. C.
,
Erb
D. K.
,
Larkin
J. E.
,
Pettini
M.
,
Shapley
A. E.
,
Wright
S. A.
,
2009
,
ApJ
,
697
,
2057

Leethochawalit
N.
,
Jones
T. A.
,
Ellis
R. S.
,
Stark
D. P.
,
Richard
J.
,
Zitrin
A.
,
Auger
M.
,
2016
,
ApJ
,
820
,
84

Lelli
F.
,
Verheijen
M.
,
Fraternali
F.
,
Sancisi
R.
,
2012
,
A&A
,
537
,
A72

Lelli
F.
,
McGaugh
S. S.
,
Schombert
J. M.
,
2016
,
AJ
,
152
,
157

Livermore
R. C.
et al. ,
2015
,
MNRAS
,
450
,
1812

Lovell
M. R.
, et al. ,
2018
,
MNRAS
,
481
,
1950

Mason
C. A.
et al. ,
2017
,
ApJ
,
838
,
14

Mieda
E.
et al. ,
2014
,
PASP
,
126
,
250

Miller
S. H.
,
Bundy
K.
,
Sullivan
M.
,
Ellis
R. S.
,
Treu
T.
,
2011
,
ApJ
,
741
,
115

Miller
S. H.
,
Ellis
R. S.
,
Sullivan
M.
,
Bundy
K.
,
Newman
A. B.
,
Treu
T.
,
2012
,
ApJ
,
753
,
74

Molina
J.
,
Ibar
E.
,
Swinbank
A. M.
,
Sobral
D.
,
Best
P. N.
,
Smail
I.
,
Escala
A.
,
Cirasuolo
M.
,
2017
,
MNRAS
,
466
,
892

Nesvadba
N. P. H.
et al. ,
2006
,
ApJ
,
650
,
661

Newman
S. F.
et al. ,
2013
,
ApJ
,
767
,
104

Patrício
V.
et al. ,
2018
,
MNRAS
,
477
,
18

Press
W. H.
,
Teukolsky
S. A.
,
Vetterling
W. T.
,
Flannery
B. P.
,
1992
,
Numerical Recipes in C. The Art of Scientific Computing
,
Cambridge Univ. Press
,
New York
.

Puech
M.
et al. ,
2008
,
A&A
,
484
,
173

Remus
R.-S.
,
Dolag
K.
,
Naab
T.
,
Burkert
A.
,
Hirschmann
M.
,
Hoffmann
T. L.
,
Johansson
P. H.
,
2017
,
MNRAS
,
464
,
3742

Rix
H.-W.
,
Guhathakurta
P.
,
Colless
M.
,
Ing
K.
,
1997
,
MNRAS
,
285
,
779

Rogstad
D. H.
,
Lockhart
I. A.
,
Wright
M. C. H.
,
1974
,
ApJ
,
193
,
309

Rubin
V. C.
,
Ford
W. K.
Jr.
,
Thonnard
N.
,
1978
,
ApJ
,
225
,
L107

Rybak
M.
,
McKean
J. P.
,
Vegetti
S.
,
Andreani
P.
,
White
S. D. M.
,
2015a
,
MNRAS
,
451
,
L40

Rybak
M.
,
Vegetti
S.
,
McKean
J. P.
,
Andreani
P.
,
White
S. D. M.
,
2015b
,
MNRAS
,
453
,
L26

Shirazi
M.
,
Vegetti
S.
,
Nesvadba
N.
,
Allam
S.
,
Brinchmann
J.
,
Tucker
D.
,
2014
,
MNRAS
,
440
,
2201

Skilling
J.
,
2004
, in
Fischer
R.
,
Preuss
R.
,
Toussaint
U. V.
, eds,
AIP Conf. Proc. Vol. 735, Bayesian Inference and Maximum Entropy Methods in Science and Engineering
.
Am. Inst. Phys
,
New York
, p.
395

Stark
D. P.
,
Swinbank
A. M.
,
Ellis
R. S.
,
Dye
S.
,
Smail
I. R.
,
Richard
J.
,
2008
,
Nature
,
455
,
775

Straatman
C. M. S.
et al. ,
2017
,
ApJ
,
839
,
57

Suyu
S. H.
,
Marshall
P. J.
,
Hobson
M. P.
,
Blandford
R. D.
,
2006
,
MNRAS
,
371
,
983

Swinbank
A. M.
et al. ,
2015
,
ApJ
,
806
,
L17

Swinbank
A. M.
et al. ,
2017
,
MNRAS
,
467
,
3140

Swinbank
A. M.
,
Bower
R. G.
,
Smith
G. P.
,
Wilman
R. J.
,
Smail
I.
,
Ellis
R. S.
,
Morris
S. L.
,
Kneib
J.-P.
,
2007
,
MNRAS
,
376
,
479

Swinbank
A. M.
,
Sobral
D.
,
Smail
I.
,
Geach
J. E.
,
Best
P. N.
,
McCarthy
I. G.
,
Crain
R. A.
,
Theuns
T.
,
2012a
,
MNRAS
,
426
,
935

Swinbank
A. M.
,
Smail
I.
,
Sobral
D.
,
Theuns
T.
,
Best
P. N.
,
Geach
J. E.
,
2012b
,
ApJ
,
760
,
130

Teklu
A. F.
,
Remus
R.-S.
,
Dolag
K.
,
Arth
A.
,
Burkert
A.
,
Obreja
A.
,
Schulze
F.
,
2018
,
ApJ
,
854
,
L28

Tiley
A. L.
et al. ,
2016
,
MNRAS
,
460
,
103

Trachternach
C.
,
de Blok
W. J. G.
,
Walter
F.
,
Brinks
E.
,
Kennicutt
R. C.
Jr
,
2008
,
AJ
,
136
,
2720

Tully
R. B.
,
Fisher
J. R.
,
1977
,
A&A
,
54
,
661

Turner
O. J.
,
Harrison
C. M.
,
Cirasuolo
M.
,
McLure
R. J.
,
Dunlop
J.
,
Swinbank
A. M.
,
Tiley
A. L.
,
2017
, preprint (arXiv:1711.03604)

Übler
H.
et al. ,
2017
,
ApJ
,
842
,
121

van Albada
T. S.
,
Bahcall
J. N.
,
Begeman
K.
,
Sancisi
R.
,
1985
,
ApJ
,
295
,
305

Vegetti
S.
,
Koopmans
L. V. E.
,
2009
,
MNRAS
,
392
,
945

Wisnioski
E.
et al. ,
2011
,
MNRAS
,
417
,
2601

Wisnioski
E.
et al. ,
2015
,
ApJ
,
799
,
209

Wright
S. A.
,
Larkin
J. E.
,
Law
D. R.
,
Steidel
C. C.
,
Shapley
A. E.
,
Erb
D. K.
,
2009
,
ApJ
,
699
,
421

Wu
X.
,
Gerhard
O.
,
Naab
T.
,
Oser
L.
,
Martinez-Valpuesta
I.
,
Hilz
M.
,
Churazov
E.
,
Lyskova
N.
,
2014
,
MNRAS
,
438
,
2701

Yuan
T.
et al. ,
2017
,
ApJ
,
850
,
61

Zolotov
A.
et al. ,
2015
,
MNRAS
,
450
,
2327

APPENDIX A: SNR

In Fig. A1, we show the SNR for the same spectral channels shown in Figs. 5 and B1B8. For each spectral channel, the noise is added considering the procedure described in Section 3.2 and the exposure time listed in Table 1.

SNR for M1–M9. The spectral channels are the same shown in Fig. 5 for M1 and B1–B8 for M2–M9.
Figure A1.

SNR for M1–M9. The spectral channels are the same shown in Fig. 5 for M1 and B1B8 for M2–M9.

In Fig. A2, we show the SNR for the same spectral channels shown in Fig. 5 and for the different values of data quality, as described in Section 5.10 and summarized in Fig. 8. The six different noise levels are obtained by considering six values of the exposure times for M1: 14.4 ks, 7.2 ks, 3.6 ks, 1.8 ks, 900 s, and 600 s.

SNR for the data-quality tests described in Section 5.10 and in Fig. 8. The different qualities of the data are obtained by using different exposure times, as indicated in the titles. The spectral channels are the same as shown in Fig. 5.
Figure A2.

SNR for the data-quality tests described in Section 5.10 and in Fig. 8. The different qualities of the data are obtained by using different exposure times, as indicated in the titles. The spectral channels are the same as shown in Fig. 5.

APPENDIX B: MOCK data set M2–M9

As for Fig. 5 (for M1), we show in Figs. B1B8 the input and best-fitting models for M2–M8. For a selected number of spectral channels, we plot in the first column the contour levels of the input source, in the second column the simulated lensed data, in the third column the inferred lensed model, in the fourth column the normalized image residuals, in the fifth column the reconstructed source, and in the sixth column the contour levels of the kinematic model.

Same as Fig. 5 for the simulated data set M2, with n = {0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8}.
Figure B1.

Same as Fig. 5 for the simulated data set M2, with n = {0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8}.

M3: same as Fig. 5 for the simulated data set M3, with n = {0.1, 0.2, 0.4, 0.6, 0.8}.
Figure B2.

M3: same as Fig. 5 for the simulated data set M3, with n = {0.1, 0.2, 0.4, 0.6, 0.8}.

Same as Fig. 5 for the simulated data set M4, with n = {0.01, 0.05, 0.1, 0.2, 0.4, 0.6, 0.8}.
Figure B3.

Same as Fig. 5 for the simulated data set M4, with n = {0.01, 0.05, 0.1, 0.2, 0.4, 0.6, 0.8}.

Same as Fig. 5 for the simulated data set M5, with n = {0.0250.05, 0.1, 0.2, 0.3, 0.4, 0.6, 0.8}.
Figure B4.

Same as Fig. 5 for the simulated data set M5, with n = {0.0250.05, 0.1, 0.2, 0.3, 0.4, 0.6, 0.8}.

Same as Fig. 5 for the simulated data set M6, with n = {0.025, 0.05, 0.1, 0.2, 0.4, 0.6, 0.8}.
Figure B5.

Same as Fig. 5 for the simulated data set M6, with n = {0.025, 0.05, 0.1, 0.2, 0.4, 0.6, 0.8}.

Same as Fig. 5 for the simulated data set M7, with n = {0.09, 0.18, 0.36, 0.54, 0.64, 0.72}.
Figure B6.

Same as Fig. 5 for the simulated data set M7, with n = {0.09, 0.18, 0.36, 0.54, 0.64, 0.72}.

Same as Fig. 5 for the simulated data set M8, with n = {0.1, 0.2, 0.4, 0.6, 0.8}.
Figure B7.

Same as Fig. 5 for the simulated data set M8, with n = {0.1, 0.2, 0.4, 0.6, 0.8}.

Same as Fig. 5 for the simulated data set M9, with n = {0.1, 0.2, 0.4, 0.6, 0.8}.
Figure B8.

Same as Fig. 5 for the simulated data set M9, with n = {0.1, 0.2, 0.4, 0.6, 0.8}.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://dbpia.nl.go.kr/journals/pages/open_access/funder_policies/chorus/standard_publication_model)