An introduction to bayesian spatial smoothing methods for disease mapping: modeling county firearm suicide mortality rates

Gause, Emma L; Schumacher, Austin E; Ellyson, Alice M; Withers, Suzanne D; Mayer, Jonathan D; Rowhani-Rahbar, Ali

doi:10.1093/aje/kwae005

Abstract

This article introduces bayesian spatial smoothing models for disease mapping—a specific application of small area estimation where the full universe of data is known—to a wider audience of public health professionals using firearm suicide as a motivating example. Besag, York, and Mollié (BYM) Poisson spatial and space–time smoothing models were fitted to firearm suicide counts for the years 2014-2018. County raw death rates in 2018 ranged from 0 to 24.81 deaths per 10 000 people. However, the highest mortality rate was highly unstable, based on only 2 deaths in a population of approximately 800, and 80.5% of contiguous US counties experienced fewer than 10 firearm suicide deaths and were thus suppressed. Spatially smoothed county firearm suicide mortality estimates ranged from 0.06 to 4.05 deaths per 10 000 people and could be reported for all counties. The space–time smoothing model produced similar estimates with narrower credible intervals as it allowed counties to gain precision from adjacent neighbors and their own counts in adjacent years. bayesian spatial smoothing methods are a useful tool for evaluating spatial health disparities in small geographies where small numbers can result in highly variable rate estimates, and new estimation techniques in R software have made fitting these models more accessible to researchers.

bayesian spatial smoothing, disease mapping, firearm suicide, spatial statistics, small area estimation

Introduction

Bayesian spatial smoothing models are useful tools for investigating rare outcomes or small geographies. These methods allow researchers to more precisely evaluate health outcomes to identify areas of higher or lower risk. While the value of this approach for small area estimation using complex survey data,¹^,² or with incomplete data collection,³ has been well-recognized, it is also useful for disease mapping, a specific application of small area estimation where the full universe of data is known. Employing spatial smoothing models for disease mapping is valuable to (1) more robustly evaluate geographical health disparities when small numbers are an issue, and (2) present reliable rates for all geographies without suppression.

Firearm suicide deaths are a motivating example. Nationally, firearm suicide deaths are not rare. Suicide is the tenth leading cause of death in the United States and has risen to become the fourth leading cause among those in midlife aged 35-54 years.⁴ Firearms are the most common means of suicide death, accounting for half of these deaths. Firearms are by far the most lethal means of suicide⁵; although only about 4% of suicide attempts involve a firearm, approximately 90% of these attempts result in death.⁴ However, small numbers present challenges when examining county-level rates, particularly in rural counties, which typically experience higher rates of firearm suicide compared with urban areas.⁶^,⁷ County-level analyses are crucial for firearm injury research to understand heterogeneity below the state level in firearm policy effects, take advantage of local natural experiments, and isolate risk and protective factors varying across space.

Traditional approaches to dealing with small numbers have been to combine or increase the level of geography under investigation,⁶^,^-⁹ aggregate over multiple years,¹⁰^,¹¹ or simply focus on metropolitan areas.¹²^,^-¹⁴ While these strategies circumvent the issue of small numbers, they introduce other sources of bias, including but not limited to measurement error or pure specification bias for data pooled across populations, potential ecological bias when large areas are used to make inference at a smaller scale, selection bias and a loss of generalizability when conditioning on areas included in analysis, and an inability to understand local variability or investigate rural areas.¹⁵

Mathematically, small numbers result in highly variable rates. Rates calculated from small numbers can be more extreme than rates calculated with larger denominators because the addition or subtraction of a single event has a much greater impact on the rate estimate. The Centers for Disease Control and Prevention (CDC) considers rates calculated from fewer than 20 occurrences to be unreliable and prohibits the public reporting of rates calculated with fewer than 10 occurrences due to the potential identifiability risk.¹⁶^,¹⁷ In 2019, 2535 counties had fewer than 10 firearm suicide deaths, meaning a firearm suicide mortality rate could not be reported for 80.7% of US counties.¹⁸ Notably, to overcome these issues the CDC has integrated spatial smoothing methodologies into their Web-based Injury Statistics Query and Reporting System (WISQARS) fatal injury data dashboard making spatially smoothed estimates easily obtainable.

This article introduces the utility of spatial smoothing methods for mapping rare outcomes or small geographies using county-level firearm suicide as an illustrative example. These methods are becoming more common in research and public health practice. Users or consumers of the resulting modeled estimates must be aware of how the data were produced to avoid misinterpretation or misuse. This article presents the theory behind spatial smoothing for disease mapping, specification of the model, when they are appropriate to implement, and how to interpret the resulting estimates.

Methods

Data

Mortality data for 2014-2018 were obtained from the CDC National Vital Statistics System restricted Detailed Mortality files.¹⁹ Firearm suicides were classified by International Classification of Diseases, Tenth Revision, codes, and deaths were included if the primary or any of the additional cause of death fields indicated a firearm suicide (Appendix S1, Table S1, available at https://doi.org/10.1093/aje/kwae005). The count of firearm suicide deaths in each year was summed using county of occurrence—where the death took place. We included deaths in the contiguous United States (excluding Alaska, Hawaii, and US territories). Population data were obtained from the CDC Race-Bridged population files; the distribution of population counts across counties in 2018 can be found in Figure S1. Raw mortality rates were calculated for each year by dividing the count of firearm suicides by the county population to understand the spatial distribution of real-world values, suppressing rates for areas with fewer than 10 deaths.

Statistical approach

We first present an example of a spatial smoothing model for county-level firearm suicide mortality rates in a single year, and then extend this method to illustrate space–time smoothing of rates over several years. The following models were fitted on the full, complete mortality data for each county, without suppression. Accompanying code used to present these examples is available in Appendix S2.

Bayesian vs frequentist statistics

The spatial smoothing method outlined here uses a bayesian approach. In bayesian statistics, as opposed to frequentist statistics, the observed data—here, firearm suicide counts—are considered known or fixed, and the parameters being estimated—such as latent county firearm suicide rates—are random unknowns. The model uses the observed data inputs, their spatial structure, a specified likelihood, and model prior specifications to estimate posterior probability distributions of the mortality rates for each area, from which one can extract both the most likely estimate as well as its 95% credible interval as a measure of uncertainty.

Spatial structure and the neighbor matrix

Spatial smoothing methods are attractive because geography acts as a proxy for unmeasured covariates which make adjacent areas more similar to each other than to those farther away; they do not require the inclusion of carefully selected covariates to produce valid estimates. This follows Tobler’s First Law of Geography: “Everything is related to everything else, but near things are more related than distant things.”²⁰^(p236) Individual counties’ firearm suicide mortality estimates gain precision by leveraging outcome data from their neighbors. This analysis followed a common approach wherein county neighbors were defined by their adjacency using a binary indicator of 1 if the counties share a common border and 0 otherwise. The neighbor matrix is the array of 0 and 1 values classifying every county pair in the analysis as either neighbors or not neighbors.

The neighbor matrix informs the spatial random effects—we used intrinsic conditional autoregressive (ICAR) random effects as they are computationally efficient, depending only on local spatial autocorrelation from immediate neighbors.²¹ These spatial random effects allow the raw county estimate for each area to be weighted towards the mean of the rates in adjacent counties, thus smoothing the estimate towards rates in neighboring areas. The degree of smoothing is dependent upon the variability of the estimate in that county based on the number of expected outcomes, so counties with small numbers are more affected by the influence of their neighbors. ICAR random effects can be thought of as a spatial version of the temporal first order random walk.

Likelihood, prior distributions, and hyperparameters

Bayesian inference requires specification of two main components: (1) the likelihood, which is a probability distribution from which the data are assumed to be drawn, and (2) prior probability distributions for all model parameters. Prior distributions can be informative—based on previous data or expert opinion—or weakly/noninformative, which forces the model to depend almost entirely on the data observed. Weakly/noninformative priors are therefore most appropriate with large datasets. Prior distributions rely on modeler-specified hyperparameters to incorporate this prior knowledge (or lack thereof). Hyperparameters should be carefully chosen as they can have a potentially large effect on model results, especially when using small datasets.

The prior distributions are combined with a specified likelihood for the observed data to calculate a posterior distribution for the random parameters. It is the resulting posterior distributions that are used for inference via point estimates and measures of uncertainty. Point estimates are commonly calculated as the mean or median of the posterior distributions for each parameter of interest. Uncertainty is typically summarized with posterior credible intervals, which, for a specified probability (eg, 95%), give a range of values for which the posterior probability of a parameter being in this range is equal to the specified probability. Different prior distributions lead to different shapes of posterior distributions, which can affect both point estimates and credible interval widths.

Priors are one of the most challenging elements of the model to understand and implement—readers should consult a statistician if they have information from prior studies or trials to inform the model. Most tools for statistical modeling include recommended priors in their documentation, which can be used as a default, but sensitivity analyses should be done using different priors to examine their impact on results.

Model specification

In our model to estimate smoothed firearm suicide rates, we used a Poisson likelihood appropriate for count data. The spatial smoothing model inputs included county firearm suicide counts as the outcome, a population offset to allow for calculation of rates, the neighbor matrix, and prior distribution hyperparameters. The model itself is made up of an intercept, a spatial random effect, an unstructured random error term, and a population offset. For each county i, the expected count of firearm suicide deaths, |$E(y)$|⁠, is estimated as:

$$ \log \left(E\left({y}_i\right)\right)=\log \left({p}_i\right)+\mathrm{\alpha} +{S}_i+{e}_i $$

$$ {S}_i \mid \left\{{S}_j={s}_j,j\sim i\right\},{\mathrm{\sigma}}_{\mathrm{s}}^2\sim N\left({\overline{s}}_i,\frac{\sigma_{\mathrm{s}}^2}{m_i}\right) $$

$$ {\overline{s}}_i=\frac{1}{m_i}\ {\sum}_{\left(j\sim i\right)}{s}_j $$

$$ {e}_i\ {\sim}_{iid}\ N\left(0,{\mathrm{\sigma}}_e^2\right) $$

where α is a fixed intercept, |$p$| is the population offset, |${S}_i$| are spatial random effects conditional on the neighbors (the effect of the weighted mean of the neighbors), and |${e}_i$| are the unstructured “independent and identically distributed” (IID) random effects for each area, or the deviation in area i from the overall mean. These IID random effects |${e}_i$| are normally distributed around zero with a variance of |${\mathrm{\sigma}}_e^2$|⁠. The spatial random effects |${S}_i$| are conditional on the neighbors such that |$j$| is a neighbor of |$i$|⁠, |${\overline{s}}_i$| is the mean of the neighbors for area i, |${m}_i$| is the number of neighbors, and |${\mathrm{\sigma}}_s^2$| is the smoothing parameter. The magnitude of the smoothing parameter is dependent on the variance so that more variable estimates are subject to more smoothing.

The combination of spatial ICAR and IID normal random effects is known as the Besag, York, and Mollié (BYM) model²² and is argued to have the most robust spatial structure of commonly used spatial models.²³ We used the BYM2 reparameterization of the BYM model,²⁴ which specifies the total variance of the spatial and nonspatial random effects along with the proportion of this variance that is spatial. This aids computational efficiency and model interpretability by ensuring the random effects are on the same scale.

This analysis used an uninformative prior on the intercept, along with penalized complexity (PC) priors for the BYM2 random effects.²⁵ PC priors are commonly recommended to aid interpretation of hyperparameters.²⁶ For modeling, PC prior hyperparameter values were chosen to be relatively uninformative and favor a simpler model fit, which means the posterior probability distribution used for model inference will be informed mostly by the observed data.

After exponentiating, the model output provides an estimate of the firearm suicide count for each county in the dataset with corresponding posterior distributions, from which a rate can easily be estimated using the offset.

Measuring uncertainty

Smoothed county-specific firearm suicide mortality rates were estimated as the median of the posterior distributions for each county. Uncertainty was measured by calculating 95% credible intervals, which are similar to but have a different interpretation compared with frequentist confidence intervals. The 95% credible intervals were calculated by identifying the 2.5th and 97.5th percentiles of the smoothing model’s posterior distribution for each county. The posterior probability that the latent mortality rate is in this interval is equal to 95%; thus, the credible interval gives a plausible range of values for the county-specific firearm suicide mortality rate. The width of the 95% credible interval is a useful metric for understanding the magnitude of the uncertainty in the modeled estimates to compare across counties.

Extending to a space–time framework

While it is possible to calculate annual smoothed mortality rates separately for each year of data, a more efficient method for estimating a time-series is to model all years by including an additional random effect for year. This allows borrowing strength over time to improve yearly estimates. Here we model time as a random walk of order 1,²⁷ which allows each year to be influenced only by the data from the previous year.

The resulting equation for the expected count of firearm suicide deaths |$E(y)$|⁠, for each county i, in each year r, becomes:

$$ \log \left(E\left({y}_{it}\right)\right)=\log \left({p}_{it}\right)+\mathrm{\alpha} +{S}_i+{e}_i+{\mathrm{\omega}}_t+{\mathrm{\phi}}_t $$

$$ {S}_i \mid \left\{{S}_j={s}_j,j\sim i\right\},{\mathrm{\sigma}}_{\mathrm{s}}^2\sim N\left({\overline{s}}_i,\frac{\sigma_{\mathrm{s}}^2}{m_i}\right) $$

$$ {\overline{s}}_i=\frac{1}{m_i}\ {\sum}_{\left(j\sim i\right)}{s}_j $$

$$ {e}_i\ {\sim}_{iid}\ N\left(0,{\mathrm{\sigma}}_e^2\right) $$

$$ {\mathrm{\omega}}_t\ {\sim}_{iid}\ N\left(0,{\mathrm{\sigma}}_{\mathrm{\omega}}^2\right) $$

$$ {\mathrm{\phi}}_t-{\mathrm{\phi}}_{t-1}\sim N\left(0,{\mathrm{\sigma}}_t^2\right). $$

This model builds on the spatial smoothing equation above with the addition of two temporal terms: an unstructured temporal error term |${\mathrm{\omega}}_t$| and the |${\mathrm{\phi}}_t$| temporal first-order random walk term for each time t. |${\mathrm{\omega}}_t$| represents the deviation at time t from the overall mean. |${\mathrm{\phi}}_t$| is conditional on its immediate preceding neighbor in time and the amount of smoothing is governed by the smoothing parameter |${\mathrm{\sigma}}_{\mathrm{t}}^2.$| Similar to the spatial random effects, the magnitude of temporal smoothing depends on the variance of values over time. It is important to note that this model only has separate spatial and temporal effects and does not include a spatiotemporal interaction, which would allow spatial effects to vary across time and/or temporal effects to vary across space. Models with spatiotemporal interactions can be computationally prohibitive to fit and are beyond the scope of this paper, but further information can be found from other sources.²⁷

Computation

Estimation of the posterior distribution can be done via various statistical methods. While certain models allow the prior distribution to be chosen such that the form of the posterior distribution can be calculated directly, this is often not the case in practice due to the complexity of models used for small area estimation. Thus, modelers must use other methods to approximate and sample from the posterior distribution. One of the most common general methodologies is Markov chain Monte Carlo (MCMC).²⁸ MCMC methods sample a chain of autocorrelated realizations from the posterior distribution in a specific way allowing for full exploration of the posterior. Traditional MCMC approaches can be quite computationally intensive for complex spatial models, which is often a barrier for many practitioners. Another approach, and the one used in this paper, is integrated nested Laplace approximation (INLA),²⁹ which is typically much faster and less computationally intensive because it approximates the necessary integrals for calculating posterior distributions in strategic ways. The INLA package²⁶^,³⁰ in R (R Foundation for Statistical Computing, Vienna, Austria) implements this method for a large class of models—latent Gaussian models—which includes the vast majority of spatial smoothing models most practitioners might want to fit. Furthermore, it has excellent documentation for new users, making it accessible to a wide audience.

All analyses were performed in R, version 3.6.2,³¹ using the INLA,²⁶^,³⁰ dplyr,³² sf,³³ sp,³⁴^,³⁵ and spdep³⁵^,³⁶ packages. This analysis is considered nonhuman subjects research and did not require IRB review. For a deeper look into the statistical underpinnings and mathematical structure behind these bayesian spatial smoothing methods, please see prior work.²⁶^,³⁷^,^-⁴⁰

Results

All 3084 US contiguous counties were included in the analysis; 114 108 firearm suicide deaths were recorded during the 5-year study period of 2014–2018. In 2018, county-level raw (unsmoothed) death rates ranged from zero to 24.81 deaths per 10 000 people (Figure 1). However, this highest mortality rate was based on only 2 deaths in a population of approximately 800; 2484 counties experienced fewer than 10 firearm suicide deaths in 2018—80.5% of counties—and thus were required to be suppressed. An additional 324 counties experienced fewer than 20 deaths, meaning that based on CDC guidelines it would not be possible to report a reliable firearm suicide mortality rate for 91.1% of contiguous US counties in 2018.

Figure 1

Raw county-level firearm suicide mortality rates with counties experiencing fewer than 10 deaths suppressed, United States, 2018.

Open in new tab Download slide

After the spatial and space–time smoothing models were employed, it was possible to report a reliable firearm suicide mortality rate for every county for the years 2014-2018. The 2018 spatially smoothed county-level firearm suicide rates ranged from 0.06 to 4.05 deaths per 10 000 people (Figure 2). Both the highest and lowest raw rates were smoothed towards the center of the data. Smoothing is more apparent for extreme high rates as they are often highly variable, but even low rates are smoothed towards the middle, particularly counties with zero deaths. These smoothed firearm suicide mortality rates can be interpreted as the estimated “true” risk, or the underlying firearm suicide rate for a hypothetical infinite superpopulation living in each county, and thus even if a county experienced zero deaths, the smoothed estimated death risk can never be zero since we assume it is impossible to have zero risk.

Figure 2

Spatially smoothed county-level firearm suicide mortality rates, United States, 2018.

Open in new tab Download slide

Smoothed firearm suicide mortality rate estimates may be more precise than the raw rates but are still calculated with uncertainty. The widths of the 95% credible intervals, a measure of this uncertainty, ranged from 0.06 up to 9.46 deaths per 10 000 people, with a median of 1.09 (Figure 3). As expected, the results from the space–time smoothing model (Figure S2) were more precise than spatial smoothing alone because county estimates gained information both from adjacent spatial neighbors and their own mortality counts in the prior year. The median width of the credible intervals for the 2018 space–time smoothed mortality rate estimates was 0.77 and ranged from 0.3 to 3.9 deaths per 10 000 (Figure S3). Rate estimates between the spatial and space–time models differed slightly for 2018 due to the influence of additional data from adjacent years, with a median difference of 0.3 and ranging from −1.06 to 1.5 deaths per 10 000. Estimated space–time smoothed firearm suicide rates from 2014 through 2018 are shown for a random sample of 12 counties (Figure 4).

Figure 3

Width of the 95% credible intervals of spatially smoothed firearm suicide mortality rate estimates, United States, 2018.

Open in new tab Download slide

Figure 4

Annual estimated space–time smoothed firearm suicide mortality rates with 95% credible intervals, in a sample of 12 US counties.

Open in new tab Download slide

Total running time for the R script used to calculate the raw and smoothed firearm suicide rate estimates for every contiguous US county in 2018 was 3 minutes and 12 seconds on a 2018 MacBook Pro (Apple Inc., Cupertino, CA) with 16GB of SDRAM. Run time may vary based on computer infrastructure and resolution of geographic information system data used for the neighbor matrix, but with the R-INLA package, computation time is no longer a major barrier for this type of complex bayesian analysis.

Discussion

This firearm suicide example illustrates the two primary benefits of spatial smoothing methods for disease mapping. First, it allowed us to more reliably compare smoothed firearm suicide mortality rates across different geographic contexts. Second, we were able to estimate and display precise mortality rates for areas experiencing 20 or fewer deaths—the majority of US counties. Since this model-based smoothing approach pools and weights data from surrounding counties, the rate estimates do not pose the same concern for identifiability in reporting.

Smoothed firearm suicide mortality rates illustrate the geographic variability in risk of firearm suicide across the United States (Figure 2), as well as their general trends over time (Figure 4). The smoothed estimates clearly show the lowest rates of firearm suicide in the Mid-Atlantic Coast, parts of the Midwest near Wisconsin and Iowa, and most of mid and Southern California, while the highest rates tend to occur in the Mountain West and Southwest. Almost all counties show a gradually increasing rate of firearm suicide mortality over time. This pattern matches the steady increase in all suicide rates, which have continued to climb over the past 20 years.⁴¹^,⁴²

While spatial and space–time smoothed models have many favorable qualities, they require informed decision making and have several potential drawbacks. The decision to smooth rates is an example of the well-known statistical trade-off between variance and bias; bias is introduced by smoothing over some real local heterogeneity to increase precision and make comparisons across geographies more reliable. In space–time models, rates over time are also smoothed by borrowing information from adjacent years and thus real, informative spikes or dips across time can be flattened out as well. A misuse of spatially smoothed estimates would be for detection of statistically significant hotspots either over space or time. Some of the apparent clustering of firearm suicide mortality rates by geography is in fact due to the spatial smoothing approach. By design, these methods produce estimates in adjacent areas that are more similar to each other, and thus should never be used as inputs for cluster or hotspot detection analyses.

The goals of spatial smoothing for disease mapping are related to variance correction and not prediction. In this application the full universe of data is known, as is the case when using vital statistics or comprehensive registry data. The raw mortality rates are accurate, as they reflect real counts experienced in each geography, but they are not precise or reliable because the potentially high variance implicit in small number calculations creates unstable rates. While these methods allow us to compare rates of firearm suicide deaths across the entire country over time, individual areas should supplement their understanding of firearm suicide risk with local knowledge whenever possible.

Unlike other nonspatial small area estimation methods, the use of covariates in these spatial smoothing models is not necessary as the influence of the neighbors accounts for unmeasured covariates. However, their inclusion may aid in increasing precision, particularly when data are sparse or when there are strong known outcome-covariate relationships that do not follow a spatial pattern and can be measured well ecologically. This example of firearm suicide deaths includes sparse data, sometimes clustered in particularly rural areas such as the Mountain West, where highly variable-rate counties are unable to gain much precision from their equally variable neighbors, leading to higher uncertainty reflected in wider credible intervals. This level of uncertainty may be unacceptable depending on the intended application for the smoothed estimates. Including additional years of data in a space–time model can help increase precision by incorporating more information for these sparse estimates. Additionally, given the heterogeneous firearm-related legal landscape across states and the known association between permissive firearm laws and higher mortality rates, future studies may consider including indicators of state policies or simply state fixed effects. However, the spatial smoothing approach is still preferred over nonspatial approaches as adjacency to states with permissive firearm laws and even the occurrence of nearby gun shows has been found to increase risk of firearm-related deaths across state borders, even within states with more a more restrictive legal landscape.⁴³^,⁴⁴

How to classify neighbors is also a crucial consideration as adjacency can be defined in several ways, which may be more or less appropriate for each specific research question. This analysis used a straightforward method of classifying neighbors by shared administrative boundaries. The resulting sparse, symmetrical, zero/one neighbor matrix allows for faster computation¹⁷ but may not reflect the true nature of between-county interaction as the simple structure allows only directly adjacent counties to share information. Neighbors can also be assigned using a distance threshold from the center of each area—either Euclidean or road network travel—or even by manually and painstakingly assigning neighbors using multiple criteria such as distance, contiguity, landscape features influencing contact, known social/economic networks, or other measures relevant to a particular analysis. Without local knowledge of the complex interdependencies among counties, the continuity approach is the simplest and most used. However, it may not work well for all counties, particularly for adjacent counties with very dissimilar contexts. It is also incapable of incorporating noncontiguous areas such as Alaska, Hawaii, and Puerto Rico, which is particularly unfortunate for firearm injury studies as these communities have unique legal and cultural landscapes and understudied vulnerable populations.

Spatial disease mapping is more accessible to researchers than ever. Traditional bayesian spatial smoothing techniques required time and computationally intensive MCMC iterations to create the posterior distributions used for inference. However, INLA has been found to perform well in fractions of the time and computational effort. Spatial smoothing using the R-INLA package can be a simple, straightforward, and computationally feasible method for most researchers familiar with R. Public health scientists may consider adding these methodologies to their toolkit, particularly when interested in rare outcomes or areas smaller than states.

In conclusion, with improvements made to data storage and computation, the ability to assess local mortality trends on a wide scale has increased and the demand for estimates below the state level has grown. Smaller areas have less susceptibility to ecological bias than larger aggregations because there is less within-area heterogeneity,¹⁵ but the reliability of these local estimates is a concern when based on small numbers. A bayesian approach leveraging the inherent spatial structure of outcomes can better identify true underlying risks in small geographies to identify genuine geographical disparities for more robust research.

Supplementary material

Supplementary material is available at the American Journal of Epidemiology online.

Funding

This work was supported by funds from the State of Washington to the Firearm Injury and Policy Research Program.

Conflict of interest

The authors declare no conflicts of interest.

Data availability

The mortality data used in this analysis is restricted-use and cannot be shared publicly. It was obtained from the National Center for Health Statistics, National Vital Statistics System. The R code for this analysis can be found in the manuscript supplement, and is also posted on GitHub: https://github.com/Epi-Emma/Firearm_Suicide_Disease_Mapping

REFERENCES

1.

Wakefield

J

,

Fuglstad

GA

,

Riebler

A

, et al.

Estimating under-five mortality in space and time in a developing world context

.

Stat Methods Med Res.

2019

;

28

(

9

):

2614

-

2634

. https://doi.org/10.1177/0962280218767988

2.

Haviland

MJ

,

Gause

E

,

Rivara

FP

, et al.

Assessment of county-level proxy variables for household firearm ownership

.

Prev Med.

2021

;

148

:

106571

. https://doi.org/10.1016/j.ypmed.2021.106571

3.

Datta

A

,

Lin

W

,

Rao

A

, et al.

bayesian estimation of MSM population size in Côte d’Ivoire

.

Statistics and Public Policy.

2019

;

6

(

1

):

1

-

13

. https://doi.org/10.1080/2330443X.2018.1546634

Google Scholar

Crossref

WorldCat

4.

Conner

A

,

Azrael

D

,

Miller

M

.

Suicide case-fatality rates in the United States, 2007 to 2014: a nationwide population-based study

.

Ann Intern Med.

2019

;

171

(

12

):

885

-

895

. https://doi.org/10.7326/M19-1324

5.

Anestis

MD

,

Selby

EA

,

Butterworth

SE

.

Rising longitudinal trajectories in suicide rates: the role of firearm suicide rates and firearm legislation

.

Prev Med.

2017

;

100

:

159

-

166

. https://doi.org/10.1016/j.ypmed.2017.04.032

6.

Branas

CC

,

Nance

ML

,

Elliott

MR

, et al.

Urban–rural shifts in intentional firearm death: different causes, same results

.

Am J Public Health.

2004

;

94

(

10

):

1750

-

1755

. https://doi.org/10.2105/AJPH.94.10.1750

7.

Mohatt

NV

,

Kreisel

CJ

,

Hoffberg

AS

, et al.

A systematic review of factors impacting suicide risk among rural adults in the United States

.

J Rural Health.

2021

;

37

(

3

):

565

-

575

. https://doi.org/10.1111/jrh.12532

8.

Siegel

M

,

Solomon

B

,

Knopov

A

, et al.

The impact of state firearm laws on homicide rates in suburban and rural areas compared to large cities in the United States, 1991-2016

.

J Rural Health.

2020

;

36

(

2

):

255

-

265

. https://doi.org/10.1111/jrh.12387

9.

Siegel

M

,

Ross

CS

,

King

C

.

The relationship between gun ownership and firearm homicide rates in the United States, 1981–2010

.

Am J Public Health.

2013

;

103

(

11

):

2098

-

2105

. https://doi.org/10.2105/AJPH.2013.301409

10.

Kalesan

B

,

Galea

S

.

Patterns of gun deaths across US counties 1999–2013

.

Ann Epidemiol.

2017

;

27

(

5

):

302

-

307.e3

. https://doi.org/10.1016/j.annepidem.2017.04.004

11.

Stansfield

R

,

Semenza

D

.

Licensed firearm dealer availability and intimate partner homicide: a multilevel analysis in sixteen states

.

Prev Med.

2019

;

126

:

105739

. https://doi.org/10.1016/j.ypmed.2019.05.027

12.

Crifasi

CK

,

Merrill-Francis

M

,

McCourt

A

, et al.

Association between firearm laws and homicide in urban counties

.

J Urban Health.

2018

;

95

(

3

):

383

-

390

. https://doi.org/10.1007/s11524-018-0273-3

13.

Kegler

SR

,

Dahlberg

LL

,

Vivolo-Kantor

AM

.

A descriptive exploration of the geographic and sociodemographic concentration of firearm homicide in the United States, 2004-2018

.

Prev Med.

2021

;

153

:

106767

. https://doi.org/10.1016/j.ypmed.2021.106767

14.

Miller

M

,

Warren

M

,

Hemenway

D

, et al.

Firearms and suicide in US cities

.

Inj Prev.

2015

;

21

(

e1

):

e116

-

e119

. https://doi.org/10.1136/injuryprev-2013-040969

15.

Wakefield

J

.

Ecologic studies revisited

.

Annu Rev Public Health.

2008

;

29

(

1

):

75

-

90

. https://doi.org/10.1146/annurev.publhealth.29.020907.090821

16.

Quick

H

.

Estimating county-level mortality rates using highly censored data from CDC WONDER

.

Prev Chronic Dis.

2019

;

16

(

180441

):E76. https://doi.org/10.5888/pcd16.180441

Google Scholar

OpenURL Placeholder Text

WorldCat

17.

Khana

D

,

Rossen

LM

,

Hedegaard

H

, et al.

A Baysian spatial and temporal modeling approach to mapping geographic variation in mortality rates for subnational areas with R-INLA

.

J Data Sci.

2018

;

16

(

1

):

147

-

182

.

Google Scholar

PubMed

OpenURL Placeholder Text

WorldCat

18.

Centers for Disease Control and Prevention

. WISQARS. WISQARSTM—Web-based Injury Statistics Query and Reporting System,

Accessed November 11, 2021

. https://www.cdc.gov/injury/wisqars/index.html

19.

National Center for Health Statistics

. Detailed Multiple Cause of Death Research Files 2012-2018. [Restricted-Use].

Issued June 22, 2021

. Requested from: https://www.cdc.gov/nchs/nvss/nvss-restricted-data.htm

20.

Tobler

WR

.

A computer movie simulating urban growth in the Detroit region

.

Econ Geogr.

1970

;

46

:

234

-

240

. https://doi.org/10.2307/143141

Google Scholar

Crossref

WorldCat

21.

Banerjee

S

,

Carlin

B

,

Gelfand

A

.

Hierarchical Modeling and Analysis for Spatial Data

.

New York, NY

;

CRC Press

,

2014

. https://doi.org/10.1201/b17115

22.

Besag

J

,

York

J

,

Mollié

A

.

bayesian image restoration, with two applications in spatial statistics

.

Ann Inst Stat Math.

1991

;

43

(

1

):

1

-

20

. https://doi.org/10.1007/BF00116466

Google Scholar

Crossref

WorldCat

23.

Richardson

S

,

Thomson

A

,

Best

N

, et al.

Interpreting posterior relative risk estimates in disease-mapping studies

.

Environ Health Perspect.

2004

;

112

(

9

):

1016

-

1025

. https://doi.org/10.1289/ehp.6740

24.

Riebler

A

,

Sørbye

SH

,

Simpson

D

, et al.

An intuitive bayesian spatial model for disease mapping that accounts for scaling

.

Stat Methods Med Res.

2016

;

25

(

4

):

1145

-

1165

. https://doi.org/10.1177/0962280216660421

25.

Simpson

D

,

Rue

H

,

Riebler

A

, et al.

Penalising model component complexity: a principled, practical approach to constructing priors

.

Statist Sci.

2017

;

32

(

1

):

1

-

28

. https://doi.org/10.1214/16-STS576

Google Scholar

OpenURL Placeholder Text

WorldCat

26.

Rue

H

,

Riebler

A

,

Sørbye

SH

, et al.

bayesian computing with INLA: a review

.

Annu Rev Stat Appl.

2017

;

4

(

1

):

395

-

421

. https://doi.org/10.1146/annurev-statistics-060116-054045

Google Scholar

Crossref

WorldCat

27.

Rue

H

,

Held

L

.

Gaussian Markov Random Fields: Theory and Applications

.

CRC Press

,

New York

;

2005

. https://doi.org/10.1201/9780203492024

28.

Gamerman

D

,

Lopes

HF

.

Markov Chain Monte Carlo: Stochastic Simulation for bayesian Inference

. 2nd ed.

New York, NY

;

Chapman & Hall/CRC

,

2006

. https://doi.org/10.1201/9781482296426

29.

Held

L

,

Natário

I

,

Fenton

SE

, et al.

Towards joint disease mapping

.

Stat Methods Med Res.

2005

;

14

(

1

):

61

-

82

. https://doi.org/10.1191/0962280205sm389oa

30.

Rue

H

,

Martino

S

,

Chopin

N

.

Approximate bayesian inference for latent Gaussian models using integrated nested Laplace approximations (with discussion)

.

J R Stat Soc.

2009

;

71

(

2

):

319

-

392

. https://doi.org/10.1111/j.1467-9868.2008.00700.x

Google Scholar

Crossref

WorldCat

31.

R Core Team

.

R: A Language and Environment for Statistical Computing

. Vienna, Austria: R Foundation for Statistical Computing;

2019

.

32.

Wickham

H

,

François

R

,

Henry

L

,

Müller

K

. dplyr: A Grammar of Data Manipulation.

2020

. Accessed February 14, 2024. https://CRAN.R-project.org/package=dplyr

33.

Pebesma

E

.

Simple features for R: standardized support for spatial vector data

.

The R Journal.

2018

;

10

(

1

):

439

-

446

. https://doi.org/10.32614/RJ-2018-009

Google Scholar

Crossref

WorldCat

34.

Pebesma

E

,

Bivand

R

.

Classes and methods for spatial data in R

.

The R Journal

2005

;

5

(

2

):9-13.

Google Scholar

OpenURL Placeholder Text

WorldCat

35.

Bivand

R

,

Brown

E

,

Gomez-Rubio

V

.

Applied Spatial Data Analysis With R

. 2nd ed.

New York, NY

:

Springer

;

2013

.

36.

Bivand

R

,

Wong

D

.

Comparing implementations of global and local indicators of spatial association

.

TEST.

2018

;

27

(

3

):

716

-

748

. https://doi.org/10.1007/s11749-018-0599-x

Google Scholar

Crossref

WorldCat

37.

Wakefield

J

,

Best

N

,

Waller

L

. bayesian approaches to disease mapping. In:

Elliott

P

,

Wakefield

J

,

Best

N

,

Briggs

D

, eds.

Spatial Epidemiology: Methods and Applications

. Oxford, UK:

Oxford University Press

;

2000

:

104

-

127

. https://doi.org/10.1093/acprof:oso/9780198515326.003.0007

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

38.

Wakefield

J

.

Disease mapping and spatial regression with count data

.

Biostatistics.

2007

;

8

(

2

):

158

-

183

. https://doi.org/10.1093/biostatistics/kxl008

39.

Bivand

RS

,

Gómez-Rubio

V

,

Rue

H

.

Spatial data analysis with R-INLA with some extensions

.

J Stat Soft.

2015

;

63

(

20

):1-31. https://doi.org/10.18637/jss.v063.i20

Google Scholar

OpenURL Placeholder Text

WorldCat

40.

Schrödle

B

,

Held

L

.

A primer on disease mapping and ecological regression using INLA

.

Comput Stat.

2011

;

26

(

2

):

241

-

258

. https://doi.org/10.1007/s00180-010-0208-2

Google Scholar

Crossref

WorldCat

41.

Hedegaard

H

,

Curtin

S

,

Warner

M

. Suicide rates in the United States continue to increase.

CDC National Center for Health Statistics

;

2018

.

Accessed June 19, 2020

. https://www.cdc.gov/nchs/products/databriefs/db309.htm

42.

Steelesmith

DL

,

Fontanella

CA

,

Campo

JV

, et al.

Contextual factors associated with county-level suicide rates in the United States, 1999 to 2016

.

JAMA Netw Open.

2019

;

2

(

9

):e1910936. https://doi.org/10.1001/jamanetworkopen.2019.10936

Google Scholar

OpenURL Placeholder Text

WorldCat

43.

Kaufman

EJ

,

Morrison

CN

,

Branas

CC

, et al.

State firearm laws and interstate firearm deaths from homicide and suicide in the United States: a cross-sectional analysis of data by county

.

JAMA Intern Med.

2018

;

178

(

5

):

692

-

700

. https://doi.org/10.1001/jamainternmed.2018.0190

44.

Matthay

EC

,

Galin

J

,

Rudolph

KE

, et al.

In-state and interstate associations between gun shows and firearm deaths and injuries: a quasi-experimental study

.

Ann Intern Med.

2017

;

167

(

12

):

837

-

844

. https://doi.org/10.7326/M17-1792

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://dbpia.nl.go.kr/journals/pages/open_access/funder_policies/chorus/standard_publication_model)

Download all slides

Month:	Total Views:
February 2024	30
March 2024	19
April 2024	28
May 2024	28
June 2024	26
July 2024	203
August 2024	58
September 2024	49
October 2024	32
November 2024	19
December 2024	29
January 2025	35
February 2025	25
March 2025	65
April 2025	81
May 2025	8

Article Contents

An introduction to bayesian spatial smoothing methods for disease mapping: modeling county firearm suicide mortality rates

Abstract

Introduction

Methods

Data

Statistical approach

Bayesian vs frequentist statistics

Spatial structure and the neighbor matrix

Likelihood, prior distributions, and hyperparameters

Model specification

Measuring uncertainty

Extending to a space–time framework

Computation

Results

Discussion

Supplementary material

Funding

Conflict of interest

Data availability

REFERENCES

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Looking for your next opportunity?

Article Contents

An introduction to bayesian spatial smoothing methods for disease mapping: modeling county firearm suicide mortality rates

Abstract

Introduction

Methods

Data

Statistical approach

Bayesian vs frequentist statistics

Spatial structure and the neighbor matrix

Likelihood, prior distributions, and hyperparameters

Model specification

Measuring uncertainty

Extending to a space–time framework

Computation

Results

Discussion

Supplementary material

Funding

Conflict of interest

Data availability

REFERENCES

Supplementary data

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Looking for your next opportunity?

This Feature Is Available To Subscribers Only