Abstract

Siblings share many environments and much of their genetics. Yet, siblings turn out different. Intelligence and education are influenced by birth order, with earlier-born siblings outperforming later-borns. We investigate whether birth order differences in education are caused by biological differences present at birth, that is, genetic differences or in utero differences. Using family data that spans two generations, combining registry, survey, and genotype information, this study is based on the Norwegian Mother, Father, and Child Cohort Study (MoBa). We show that there are no genetic differences by birth order as captured by polygenic scores (PGSs) for educational attainment. Earlier-born have lower birth weight than later-born, indicating worse uterine environments. Educational outcomes are still higher for earlier-born children when we adjust for PGSs and in utero variables, indicating that birth order differences arise postnatally. Finally, we consider potential environmental influences, such as differences according to maternal age, parental educational attainment, and sibling genetic nurture. We show that birth order differences are not biological in origin, but pinning down their specific causes remains elusive.

Significance Statement

Birth order is associated with educational outcomes, with earlier-born siblings outperforming later-borns. Using genetic data and data on education from a large Norwegian cohort, we find that these differences are not due to genetic differences or in utero differences. Later-born children even have higher birth weight, which is positively associated with education. When we take indicators of genetic and in utero differences into account, earlier-born siblings still do better in school. This establishes that birth order differences result from factors after birth, consistent with explanations that focus on home environments.

Introduction

Sibling differences receive less attention than sibling similarities, although they make up an important part of the picture of social inequality. Even for socio-economic outcomes, inequality within families (i.e. among adult siblings) can be as large as inequality between families (1). While evidence indicates a large role for environmental influences in producing differences among siblings in the same family, documenting specific, systematic influences has been more elusive. Developmental scientists have talked about a “gloomy prospect” in which nearly all influence in and by social environments is idiosyncratic (2, 3).

Birth order has long been offered as an example of a systematic source of environmental differentiation within families. Empirically, firstborn siblings have slightly higher intelligence (4–6), educational achievement (7, 8), and income (9, 10) than their siblings born later. These differences are routinely interpreted as reflecting causal mechanisms related to childhood experiences. For example, some work suggests that parental resources are more diluted for later-born siblings as a result of competing demands for parental attention (11, 12), while other work considers whether the presence of older siblings adversely affects the cognitive environment in which younger siblings are raised (13, 14).

Even though birth order differences in achievements are routinely presumed to reflect different social environments between siblings, there are several reasons to worry they may be caused biologically.

For one, biological differences may be induced by fertility decision-making. Genetic differences among siblings have been described as a “lottery” (15–17), and parents may be more likely to have an additional child if their already-born children evince desirable traits consistent with a favorable draw from this “lottery” (18). Later-born children may exhibit a ”regression to the mean" phenomenon of less propitious genetic endowments for these same traits. When deciding whether to have another child, parents have information on behavior and traits of their current child(ren) that are correlated with genotype.

Predictive information about a child's later cognitive ability is measurable at least as early as 10 months (19). Consistent with the findings of another study (20), in our data, we find that the polygenic score (PGS) for educational attainment is inversely associated with a number of behaviors at age 2 to 3 (e.g, . “Can't sit still,” “Gets in fights”) including some behaviors net of parent's PGSs (Figure S1, Supplementary Material). Thus, observable child traits exist at a quite early age that correlate with the genetic endowments associated with educational attainment, and in theory parents’ observation of these traits may influence some of their decisions on whether to have additional children.

If parents adjust their fertility-behavior dependent on the first child's behaviors in childhood and those behaviors are partly influenced by genetics, this could lead to differences in genetics between siblings according to birth order. Studies show that parents adjust their parenting according to child genotype too, engaging in more cognitively stimulating activities with children with higher PGS for education (21, 22). Thus gene–environment correlations could exacerbate any genetic or other biological differences present at birth.

Biological differences could also result from involuntary processes. Miscarriage occurs in 8% to 30% of all pregnancies. The risk of miscarriage increases with maternal age, but also, net of mother's age, studies have found that number of previous pregnancies is associated with likelihood of miscarriage, albeit not in a consistent direction (23, 24). Most miscarriages happen early in the pregnancy, with the main factors being genetic abnormalities (i.e. chromosomal aberrations like autosomal trisomy) and uterine malformation (25). Changing risk of miscarriage by either factor over pregnancies, or their combination, could result in survivorship-based biological differences by birth order in live births.

Maternal age also may influence in utero environments (26, 27), such as the rate of antibody attacks (28). Beyond this, maternal nutrition, stress, medical professional visits, and other health-related behaviors may vary according to birth order (29), causing differences in birth weight and other outcomes, which influence intelligence and educational attainments later in life (30, 31). Other conditions, such as preeclampsia, have been linked to birth order (32), health, and cognition, although often not in ways consistent with the expectation of firstborn advantage (33, 34).

For that matter, mutations also rise with parental age, in particular with paternal age at conception (26). There is evidence that advanced paternal age increases the probability of offspring intelligence disorder, schizophrenia, and autism (35–37), as well as the rate of several other health-related traits (38). A consequence of these many unknowns may be that there are biological mechanisms that produce differences between siblings present at birth, which we are still learning about. These can result in biological differences by birth order, albeit not necessarily a causal effect of birth order as such.

Against this backdrop it is important to recognize the limited research designs on which a considerable portion of the existing literature on birth order and educational outcomes is based. Many studies do not compare siblings in the same family, leaving genetic differences across families as an uncontrolled confounder (6). Birth-order studies often define birth order by the rearing family or by siblings who share a mother (7, 39), so that genetic differences by birth order may result from paternal genetic differences by siblings who do not share a father. Even when studies intend to include only full siblings, this is usually based only on maternal self-report. Sometimes mothers are unsure themselves who the father is (40) or misreport paternity for other reasons. For half-siblings with different fathers, paternal genetic differences could be associated with birth order, as mothers who have children with multiple partners typically have children with less educated fathers for their later-born children (41, 42).

Our study assesses birth order differences in educational outcomes within families using genetically confirmed full siblings. We do so with data that combines registry, survey, and genotype information from families in Norway (see Methods). At stake is that, if birth order differences are not due to postbirth environments, this would imply that using birth order as an exemplar of a documented systematic effect of family environments on sibling differentiation is incorrect, furthering contentions that intrafamilial environments are far more inert than many suppose. Moreover, if differences were specifically genetic, it would also undermine various causal inference strategies that effectively assume genetic differences among full siblings are independent (18, 43). We examine whether birth order effects on education are influenced by genetic differences as measured by PGSs for educational attainment. We also consider broader prebirth biological differences by looking at whether there are birth order differences in birth weight and birth length, as indicators of a combination of genetic and in utero factors. Furthermore, we look at nontransmitted alleles of parents, whose influence on development is sometimes referred to as “genetic nurture.”

In all, we address four research questions:

  1. Is birth order associated with educational outcomes? Based on previous research, we anticipate birth order differences in which earlier-borns fare better than later-borns. We make use of the full population of Norway to answer this question. We have two outcomes for two different generations, educational achievement measured at age 30 available for the “adult generation,” and national test scores performed in fifth, eighth, and ninth grade available for the “child generation.” This allows us to investigate if birth order differences are robust over time and educational outcomes, as well as whether birth order differences are present from childhood or if they arise at later ages.

  2. Is there a birth-order difference in genetics, as measured by PGSs for educational attainment? We again, here, examine data from both the adult and child generation of genotyped MoBa participants.

  3. Are there differences in birth weight and birth length between birth orders? Our data here are the the child generation in the MoBa-sample.

  4. Does the putatively socially based effect of birth order remain after accounting for genetic differences and in utero variables? Here, we use both the adult and child generation data in the MoBa, but while we can adjust for PGS and in utero variables in the child generation, we do not have the latter in the adult generation and so adjust for PGS only

Our analyses below of these four questions are consistent with the conclusion that birth order differences are not biological in origin. After presenting these, we also investigate whether our data permit any further illumination of the mechanisms behind birth order differences. We consider whether birth order interact with PGSs, vary by family background, and include both nontransmitted and sibling alleles.

Results

We begin by analyzing the full population of Norway using administrative data to see whether there are birth order differences among children and adults. To better parallel subsequent analyses with genomic data, we exclude participants whose parents were not born in Norway (see Methods). For the population of children we study (b. 1994 to 2009), we examined performance on national tests by computing the child's mean score of tests in three subjects measured at three times (grades five, eight, and nine; standardized within subject, age, and cohort; N = 301,795). For the adult population (b. 1945 to 1988), our outcome is completed years of education at age 30 (standardized within cohort, N = 2,067,878). We use family-level fixed effects when estimating the bivariate association between birth order and these outcomes, meaning we are comparing siblings within the same family. We control for sex and maternal age (44), and run the models separately by sibship size.

Consistent with other studies, we find that firstborn siblings have better educational outcomes than their later-born siblings. Figure 1 shows the magnitude of these associations. The top panel shows lower test scores for each successive birth order for all family sizes from two to five siblings. Most of these differences are present in the first test scores we observe (fifth grade), but the gaps do grow modestly from the last scores in our data (ninth grade; see Figure S2, Supplementary Material). The bottom panel shows similar patterns for educational attainment in the adult population. Figure 1 also shows that birth order differences increase when adjusted for maternal age.

Birth order and educational differences in the population. (A) and (B) Results from family-fixed effects linear regression models run separately by sibship size, with controls for sex and maternal age and cluster-robust standard errors. Firstborns serve as the reference category. All point estimates presented with 95% CI. In (A) children generation in the population (N = 301,795), where the outcome is the mean of national test score standardized within test, year of test, and birth cohort. (B) Adult generation in the population (N = 2,067,878), where the outcome is educational attainment at age 30, standardized within birth cohort.
Fig. 1.

Birth order and educational differences in the population. (A) and (B) Results from family-fixed effects linear regression models run separately by sibship size, with controls for sex and maternal age and cluster-robust standard errors. Firstborns serve as the reference category. All point estimates presented with 95% CI. In (A) children generation in the population (N = 301,795), where the outcome is the mean of national test score standardized within test, year of test, and birth cohort. (B) Adult generation in the population (N = 2,067,878), where the outcome is educational attainment at age 30, standardized within birth cohort.

PGS

Figure 2 shows the relationship between birth order and the PGS for educational attainment among families with two and three children, which are the vast majority of sibships in Norway. Sample sizes for this and subsequent analyses using genetic data are much smaller than those used in the previous figure as these data are only available for a portion of the population (see Methods). We provide separate results for the child generation (panels A and B; N = 24,507; 2,705) and the adult generation (panels C and D, N = 43,316; 3,897). Within each generation, we provide estimates for models comparing siblings both between families and within families. In addition to sex and cohort, between-family models adjust for the 10 principal components of the GWAS data to address potential confounding by ancestral differences (i.e. population stratification).

PGS and birth order. (A), (B), (C), and (D) Association between educational attainment PGS and birth order. Results from linear regression models run separately by sibship size, with controls for sex and mothers age. Firstborns serve as the reference category. All point estimates presented with 95% CI. In (A) and (C), between-family estimate, adjusted for 10 principal components. In (B) and (D), family-level fixed effects models with cluster-robust standard errors. In (A) and (B), children generation in the MoBa-sample (N = 24,507 (A); 2,705 (B)); (C) and (D) adult generation in the MoBa sample (N = 43,316 (C); 3,897 (D)).
Fig. 2.

PGS and birth order. (A), (B), (C), and (D) Association between educational attainment PGS and birth order. Results from linear regression models run separately by sibship size, with controls for sex and mothers age. Firstborns serve as the reference category. All point estimates presented with 95% CI. In (A) and (C), between-family estimate, adjusted for 10 principal components. In (B) and (D), family-level fixed effects models with cluster-robust standard errors. In (A) and (B), children generation in the MoBa-sample (N = 24,507 (A); 2,705 (B)); (C) and (D) adult generation in the MoBa sample (N = 43,316 (C); 3,897 (D)).

In all panels in Figure 2, CIs overlap zero for birth orders two and three, meaning that we find no differences in the PGS for educational attainment among different birth orders. Consequently, the genetic differences captured by that PGS cannot explain the observed relationship between birth order and educational outcomes.

In the introduction, we raised the possibility genetically influenced child behaviors could influence parents’ subsequent fertility decision-making. Considering this empirically, we looked specifically at whether a child's PGS for educational attainment predicted whether parents had another child. It did not (Table S1, Supplementary Material). While there are observable child behaviors in 2- to 3-y-old children that are correlated with PGSs for educational attainment, we do not find any evidence that a child's PGS influences their parents’ subsequent fertility.

Birth weight and birth length

In Figure 3, we turn to birth order differences in birth weight and birth length, available in the child generation of the sample only (N = 2,705). We adjust for sex, gestational age, and the PGS for education.

Birth length, birth weight, and birth order. (A) and (B) Results from family-level fixed effects linear regressions run separately by sibship size, with dummies for birth order. Children generation of the MoBa-sample (N = 2,705). Cluster robust standard errors, 95% CI. Firstborns serve as the reference category. In (A), birth length, with different control variables, pink point: sex; orange triangle: sex, gestational age, and educational attainment PGS; blue rectangle: sex, gestational age, educational attainment PGS, and mothers age at birth. In (B), birth weight, with controls, pink point: sex; orange triangle: sex, gestational age, educational attainment PGS, and birth length; blue rectangle: sex, gestational age, educational attainment PGS, birth length, and mothers age at birth.
Fig. 3.

Birth length, birth weight, and birth order. (A) and (B) Results from family-level fixed effects linear regressions run separately by sibship size, with dummies for birth order. Children generation of the MoBa-sample (N = 2,705). Cluster robust standard errors, 95% CI. Firstborns serve as the reference category. In (A), birth length, with different control variables, pink point: sex; orange triangle: sex, gestational age, and educational attainment PGS; blue rectangle: sex, gestational age, educational attainment PGS, and mothers age at birth. In (B), birth weight, with controls, pink point: sex; orange triangle: sex, gestational age, educational attainment PGS, and birth length; blue rectangle: sex, gestational age, educational attainment PGS, birth length, and mothers age at birth.

On average, later-born children are both longer and heavier than firstborn children. Differences in length account for roughly half of the difference in weight. Further analyses indicate that the differences in mean length or weight are not due to differences only in especially low- or high-weight births (see Figure S3, Supplementary Material). We find nearly identical results in the between-family models (see Figure S4, Supplementary Material). If higher birth weight indicates an advantaged prenatal environment, then it appears that later born children are actually advantaged in this regard relative to their first born siblings, despite their lower ultimate achievement.

Maternal age and birth spacing

We considered also the possibility that birth order differences were due to maternal age, as maternal age has been variously posited to influence cognitive development via both biological and social mechanisms. To confound an observed firstborn advantage, mothers age would need to be inversely associated with test scores. When we simultaneously model within- and between-family effects, we find maternal age to be positively associated with test scores between families and even more so within families (see Table S2, Supplementary Material). Consequently, as noted, the birth order differences presented in Figure 1 actually increase after accounting for maternal age. Within families, sibling differences in maternal age are equivalent to the birth spacing intervals. Our finding is, thus that birth order differences get smaller as both maternal age and the spacing between births increases. While this pattern is consistent with explanations of birth order differences rooted in differential parental investment and overall intellectual climate in the family, it is not consistent with ideas of there being in utero, mutational, or other biological advantages to being born to a younger mother.

Educational achievement and attainment with controls

In Figure 4, we show birth order differences within families after adjusting for all the aforementioned measures simultaneously: sex, gestational age, birth weight, birth length, and mothers age at birth, as well the PGS for education. We show results for birth order differences in educational achievement (children, panel A, N = 2,705) and educational attainment at age 30 (adults, panel B, N = 3,365). In all models, point estimates are negative for later-born children compared to firstborn. Although controlling for mothers' age at birth increases the uncertainty of estimates and CIs overlap zero (blue rectangle in Figure 4), all point estimates remain negative, and some again increase.

Educational achievement/attainment and birth order. (A) and (B) Results from family-level fixed effects linear regressions run separately by sibship size, with dummies for birth order where firstborns are the reference. Cluster robust standard errors, 95% CI. In (A), children generation in the MoBa-sample (N = 2,705) with mean of national test score as the outcome. Controls for sex (black circle); sex, gestational age, and educational attainment PGS (yellow triangle); sex, gestational age, birth weight, birth length, and educational attainment PGS (blue rectangle). In (B), adult generation in the MoBa-sample (N = 3,365) with educational attainment at age 30 as the outcome. Controls for sex (black circle); sex, educational attainment PGS (yellow triangle); and sex, educational attainment PGS, and mothers age (blue rectangle).
Fig. 4.

Educational achievement/attainment and birth order. (A) and (B) Results from family-level fixed effects linear regressions run separately by sibship size, with dummies for birth order where firstborns are the reference. Cluster robust standard errors, 95% CI. In (A), children generation in the MoBa-sample (N = 2,705) with mean of national test score as the outcome. Controls for sex (black circle); sex, gestational age, and educational attainment PGS (yellow triangle); sex, gestational age, birth weight, birth length, and educational attainment PGS (blue rectangle). In (B), adult generation in the MoBa-sample (N = 3,365) with educational attainment at age 30 as the outcome. Controls for sex (black circle); sex, educational attainment PGS (yellow triangle); and sex, educational attainment PGS, and mothers age (blue rectangle).

Taken together, these analyses rule out various scenarios by which birth order differences in educational outcomes are either confounded or the result of genetic or prenatal causes. The availability of multigenerational genotyped data with extensive phenotyping does not contradict an understanding of birth order differences as originating in postnatal environments.

Interrogating environmental origins

Given this, we next turned to see what additional clues the genomic data may offer for how these environmental influences operate. First, we examined whether birth order differences varied by PGS. Muslimova et al. (45) posit that birth order differences may be strongest among children with the highest genetic score for achievement, as these children may be able to capitalize best on increased parental investment. In the child sample, we observed no difference (see Table S3 and Figure S5, Supplementary Material). In the adult sample, we find a borderline statistically significant pattern opposite the expected direction: a smaller firstborn advantage among participants with higher PGSs. If that is true, higher genetic potential could mitigate the processes that produce birth order differences. Either way, there is no indication in our data that firstborn advantage only or more strongly exists among those with higher PGSs.

Second, we also considered whether, net of PGS, birth order differences are moderated by family background. Contradictory hypotheses have been proposed to this end. One is that advantaged environments would have greater disparities in effective parental investment, implying larger birth order differences in those families. The other is that advantaged environments would have more compensatory investment in lower-performing children, yielding smaller birth order differences (46). Our results here were consistent with the former of these scenarios (Table S4, Supplementary Material): differences were larger in families in which parents had higher educational attainment, including within-families and net of PGS. But, these differences by family background were small compared to the overall birth order difference (Figure S6, Supplementary Material)

Third, we examined the influence of other family member's PGSs on child achievement, net of the child's own score. Recent findings of “genetic nurture” have documented relationships between nontransmitted parental alleles and child attainment and achievement (47–49). Given that mothers are typically more involved in childrearing than fathers—not infrequently to a substantial degree—the finding of some past research that maternal nontransmitted alleles matter more for attainments than paternal nontransmitted alleles would not be surprising (50). We did not find this pattern in our data (see Table S6, Supplementary Material). Also, when we include the PGS for the siblings, the sibling score accounts for more than half of the magnitude of the difference in achievement that had been attributed to parental PGSs. Net of one another, the relationship between sibling PGS and achievement is nearly a quarter as large as that of a child's own score (Table S5, Supplementary Material). These results point to the importance of considering that genetic nurture may reflect sibling influence to a greater extent than has been previously appreciated.

Discussion

Our starting point for this investigation was the common presumption that birth order differences in educational outcomes reflect social environmental rather than genetic or prenatal mechanisms. Specifically regarding genetic differences, there are multiple potential reasons for genetic variation by birth order. Historically, the problem has been exacerbated by many studies using between-family samples and methods (51), but the possibility of genetic differences remains even in studies that compare siblings in the same family. The only way to assess decisively whether genetic differences may confound birth order studies is with data that contains genetic and comprehensive phenotypic information on many siblings. Using data that spans two generations of Norwegian siblings, we show no genetic differences by birth order as captured by PGSs for educational attainment. Similar preliminary findings have recently been reported using data from the United Kingdom (45).

We also examined indicators of possible in utero origins of birth order differences: birth weight, birth length, and maternal age. Our results suggest that none of these provide any leverage for explaining the robust advantage that earlier-born children have in test scores and ultimate educational attainment in our data. Indeed, later-born children are advantaged in birth weight and birth length, consistent with some earlier research (29, 52, 53), and so if anything they might have better in utero environments, and yet have worse educational outcomes.

These results strengthen the conclusion that birth order differences in educational outcomes mainly originate in postbirth environments (6, 39). Another implication of our findings is that various methodologies that use siblings in causal inferences, i.e. studies using PGSs within-family to alleviate population stratification (54, 55), and social science studies using siblings to capture the omnibus effect (of social background), are not confounded in ways they would be if birth order differences were due to genetic differences.

Given that birth order differences arise within families but seem unrelated to genetic factors and do not appear to be due to in utero environments, they constitute an interesting case of social inequality free from many sources of potential confounding. Understanding how they come about offers a window that may be more generally instructive for mechanisms that lead to inequalities among people with similar familial and sociodemographic backgrounds. To this end, we find that the birth order differences in test scores are already mostly present at our earliest time of observation (fifth grade, age 9/10) and grow only modestly from then to ninth grade. This clarifies that the environmental differences associated with birth order differences largely manifest in the first decade of life. The difference in educational attainment among adults is similar in magnitude, suggesting that these differences resulting from childhood environments have lifelong effects.

The two main environmental theories on birth order effects are linked to parental resources and sibling interactions. According to the resource dilution theory (12, 56, 57), economic and parental resources deplete as more household members arrive. In societies where resources are abundant like Norway, cultural resources are thought to be more important than economic (1). According to the theory, as earlier-born receive more of the parental cultural resources such as personal attention and help with homework than later-borns do, they excel in educational performance. While this pattern is consistent with the finding that birth order differences decline with increased spacing between births, it is not consistent with our results that maternal genetic endowments do not matter anymore than paternal endowments, given that mothers usually contribute more to child-rearing than fathers. Other studies have found stronger effects of maternal genetics (50), so more research will be needed to reach any decisive conclusion.

As for sibling interactions, these have been most prominently raised in variations of “confluence theory,” which proposes that siblings generally have negative effects on one another's cognitive development (13, 14, 58). Because firstborns have less exposure to being reared with siblings, and may also benefit from the opportunity to teach younger siblings, the negative influence of siblings is proposed to be least for firstborns. Our finding that the apparent influence of parental alleles shrinks markedly when sibling PGSs are included, and that sibling PGSs are significant associated with child outcomes net of a child's own score, indicates that some of what has hitherto been called “genetic nurture” could in fact be “sibling genetic nurture.” Recent enthusiasm for using sibling-based GWAS to purge GWAS studies of indirect genetic effects assumes that these indirect genetic effects are primarily due to parents (59). Because the approach does not account for indirect effects via siblings, the possibility that sibling genetic effects on achievement may be larger than commonly supposed (60) is important to resolve for evaluating the success of sibling-based GWAS.

Sibling effects have received less attention as compared to parental resources when it comes to explaining birth order effects. That said, Gibbs et al. (61) propose a conditional resource dilution (CRD) model, opening up for institutional- and family-level variation influencing birth order effects. In the CRD, brothers and sisters, and especially older siblings, may provide, rather than compete for resources. “Sibling genetic nurture” raises the possibility that these intersibling consequences for familial environments vary by sibling genotype, with genetic factors increasing achievement potential having positive spillovers for siblings.

Our study has some limitations. First, PGSs are based on common alleles, and we are not able to tell if there are differences in rarer variants. That said, If rarer variants in the form of de novo mutations were importantly associated with birth order, the most obvious explanation would have involved parental age, and this is contradicted by our findings regarding maternal age. Second, our proxies for in utero environments are partial, such that unmeasured consequential differences might still exist. Similarly, even though postnatal accounts of birth order differences focus on social environments, our design does rule out postnatal differences due to physiological causes or other, nonsocial phenomena. Third, the sample size for the within-family analyses is limited, as we have relatively few families with two or more full siblings. Statistical power curtails our present capacity to draw more decisive conclusions for some of our analyses, and we cannot rule out type II errors. Fourth, PGSs for educational attainment still only account for a limited portion of the overall heritability of educational attainment, and they will likely improve in the future. Last, we have no direct measures of parental and sibling behaviors, which obviously thwarts the ability to interrogate either as putative causes. Amidst the excitement for what incorporating genomic information may bring to the social sciences, we should not lose sight that the available phenotyping of population datasets often still leaves much to be desired.

In addition, even though birth order differences seem like a unary puzzle, the importance of different environmental mechanisms may shift in different societal and historical contexts. For the cohorts in our child sample, the vast majority of Norwegian children attend child care from age one, with the overall attendance being 90% for children aged one to five in 2010 (62). In such a context, mothers may be spending less time with their children than perhaps at any point in history, which might mean our data could be unpropitious for observing a maternal role in birth order effects. Scandinavian data has often been used in research on family dynamics research because of these countries’ population registers. Societies with exceptional data may also be exceptional in other respects. Moreover, Scandinavian population register data sources do not contain large-scale data on actual behaviors, resources, and practices within families, and there may simply be no way of getting to the bottom of birth order effects without such richer data. While our results show that birth order differences are largely environmental, the question of what specific environments factors are involved remains far from resolved.

Materials and Methods

Population data and variables from administrative registries

The data in this paper are from several sources. We begin with data from administrative registries covering the full population of Norway. The registries are of very high quality, and do not suffer from attrition, and have few registration errors. From the Central Population Registry we identify all family linkages and demographic variables, like birth cohort, sibship size, birth order, and mothers age at birth. We identify sibship size and birth order according to birth year within each mother. We also identify any sibships with multiple births, and exclude them as the assignment of birth order is less clear cut, and family dynamics may be different in multiple birth sibships. To better mirror our genomic sample we remove individuals born outside of Norway. Current genomic methods do not allow for us to include persons of non-European ancestry, and we restrict the data based on the population registry based on this related criteria, i.e. we only include Norwegian-born to Norwegian parents. For the adult part of the population, we restrict birth cohorts to 1945 to 1988.

We link the Central Population Registry to the National Educational database (NuDB) (63), also covering the full population. For the child generation in our analyses, we use standardized tests as the outcome. From NuDB, we use data from national tests conducted in fifth, eighth, and ninth grade in reading, mathematics, and English. We standardize each test to a z-score within test, year of test assessment, and birth cohort. Thereafter, we calculate a mean of all available test scores for each child, which serves as our outcome variable for the children in the sample. For the adult generation in our analyses, the outcome is educational attainment at age 30 following the International Standard Classification of Education (ISCED 2011) (63, 64). The variable is continuous as we transform it to being measured in years, using how many years it takes to complete the level of education attained following normal progression, according to ISCED. We standardize this variable too to compare effect sizes between outcomes. After restricting our sample and removing people with missing information etc., our population-based data has an N of 301,795 for the child birth cohorts, and 2,067,878 for the adult birth cohorts.

The Norwegian Mother, Father, and Child Cohort Study (MoBa)

The prepared population data are linked to The Norwegian Mother, Father, and Child Cohort Study (MoBa) (65, 66), which we use for the analysis with genomic and in utero variables. MoBa is a population-based pregnancy cohort study conducted by the Norwegian Institute of Public Health. Participants were recruited from all over Norway from 1999 to 2008, with the sample unit being pregnancy. The women consented to participation in 41% of the pregnancies. The cohort includes 114,500 children, 95,200 mothers, and 75,200 fathers. The current study is based on version 12 of the quality-assured data files. The establishment of MoBa and initial data collection was based on a license from the Norwegian Data Protection Agency and approval from The Regional Committees for Medical and Health Research Ethics. The MoBa cohort is now based on regulations related to the Norwegian Health Registry Act. The current study was approved by The Regional Committees for Medical and Health Research Ethics. Some 98,110 individuals in around 32,000 trios are currently genotyped. This interim release is known as “MoBa Genetics,” and is comprised of several separately genotyped and imputed batches due to partial funding for genotyping on a per-project basis. The current release is a merger of all subprojects after quality control and imputation. Details are available here: https://github.com/folkehelseinstituttet/mobagen.

MoBa-participants are sampled on pregnancy independent of previous pregnancies (67), meaning that children in MoBa could have siblings younger or older not included in the sample. However, as we identify family linkages and birth order from the population registry, we observe full sibship sizes and birth orders independently of the information in MoBa. The sampling of births within birth order is, therefore, random. As we construct sibship size variables from registries with observations from the birth of the parents and up until 2018, we are confident that we observe completed fertility histories for the vast majority of the sample, as most have reached ages where fertility rates are very low. For the adult generation in the MoBa-sample, we use the central population registry to link them to their parents (i.e. the grandparents of children in MoBa). As the pregnancies sampled for the MoBa are over the same cohort window (1999 to 2008), many of the coming parents sampled to the MoBa happens to be siblings too. Thus, we can establish sibship size, birth order, and other demographic information from the Central Population Registry, and use that to investigate birth order differences taking into account genomic information also for the adults in the MoBa-sample. Our observational window from the NuDB register is up until 2018. In a few cases the parents in MoBa have yet to have reached age 30 at our latest observation year 2018. Here, we take the latest observed age available, the lowest age being 27.

Variables from MoBa

MoBa contains genomic information for parents and children, which allows us to create PGSs for each individual based on a Genome-Wide-Association study (GWAS) for educational attainment of 1.1 million people (68). For the MoBa-sample, we conducted quality control using PLINK (v 1.90). Before we perform quality control in PLINK, we remove families with any individuals born in countries outside of Europe, The United States, Canada, New Zealand, or Australia. We also remove families with any multipartnered fertility parents in MoBa. Thresholds for genotyping call rate were set to 98%, minor allele frequency (MAF) 5%, and deviations from Hardy–Weinberg equilibrium was < 10–4 with mid P-value adjustment. We remove individuals with poor genotype quality with a threshold missing rate of 5%, as well as those with heterozygosity rates which deviate ± 3 SDs from the sample mean. We remove ancestry outliers in plink with an identity-by-state binominal test (PPC) set to 0.05, MAF 0.01, and compare the 1 to 5 nearest neighbors in the data. We remove families with a z-score for any neighbor below 4 SDs. We use KING software (69) to confirm that siblings are genetic siblings based on identity-by descent.

We use PRSice (70) to make PGSs for parents and children. We use all available SNPs, genome-wide significant or not, and clump with a 250-kb window. Clump-r2 was set to 0.1, excluding SNPs with higher linkage disequilibrium. The PGS is standardized.

In addition to genomic information, MoBa has several survey waves. Birth length and birth weight were self-reported from the mother when the child was 6 months old. Questionnaires and instrument documentation for the MoBa is available here: https://www.fhi.no/en/studies/moba/for-forskere-artikler/questionnaires-from-moba/

After doing quality control on the genomic data, and removing people according to what we have just described, 79,057 individuals are left in the sample, 29,815 children, 24,032 fathers, and 25,210 mothers. However, while we use this sample for between-family analysis in some figures, we rely mostly on within-family analysis, i.e. family fixed-effects with clustering on mothers and robust standard errors. Here, our sample size is smaller, as at least two children born to the same mother are needed to estimate the models, and we need both siblings to be genotyped (children generation in MoBa N = 2,705, adult generation in MoBa N = 3,897).

Funding

This research is part of the OPENFLUX project that has received funding from the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme (ERC consolidator grant agreement no. 818420). The Norwegian Mother, Father, and Child Cohort Study is supported by the Norwegian Ministry of Health and Care Services and the Ministry of Education and Research. This research is part of the HARVEST collaboration, supported by the Research Council of Norway (#229624). We also thank the NORMENT Centre for providing the genotype data, funded by the Research Council of Norway (#223273), the South East Norway Health Authority and the KG Jebsen Stiftelsen.

Classification

Social sciences, genetics.

ACKNOWLEDGEMENTS

We are grateful to all the participating families in Norway who take part in this on-going cohort study. We thank the Norwegian Institute of Public Health (NIPH) for generating high-quality genomic data. We thank Per Minor Magnus, Pål Njølstad, and Ole Andreassen who headed the forementioned projects. We further thank the Center for Diabetes Research, the University of Bergen for providing the genotype data and performing quality control and imputation of the data funded by the ERC AdG project SELECTionPREDISPOSED, Stiftelsen Kristian Gerhard Jebsen, Trond Mohn Foundation, the Research Council of Norway, the Novo Nordisk Foundation, the University of Bergen, and the Western Norway health Authorities (Helse Vest). Isungset thank Robbee Wedow for making genomics less hard.

Authors' Contributions

M.A.I. and J.F. designed and performed the research, M.A.I. analyzed the data; and M.A.I., J.F., O.A.A., and T.H.L. wrote the paper.

Data Availability

The consent given by the participants does not open for storage of data on an individual level in repositories or journals. Researchers who want access to data sets for replication should submit an application to [email protected]. Access to data sets requires approval from The Regional Committee for Medical Research Ethics in Norway and a formal contract with MoBa. Code is available at https://osf.io/fj3nb.

Notes

Competing Interest: The authors declare no competing interest.

References

1.

Conley
D
.
2005
.
The pecking order
.
New York (NY)
:
Vintage Books
.

2.

Turkheimer
E
.
2016
.
Weak genetic explanation 20 years later
.
Perspect Psychol Sci
.
11
:
24
28
.

3.

Plomin
R
,
Daniels
D
.
1987
.
Why are children in the same family so different from one another?
.
Behav Brain Sci Behav Brain Sci
.
10
:
1
16
.

4.

Rohrer
JM
,
Egloff
B
,
Schmukle
SC
.
2015
.
Examining the effects of birth order on personality
.
Proc Natl Acad Sci
.
112
:
14224
14229
.

5.

Belmont
L
,
Marolla
FA
.
1973
.
Birth order, family size, and intelligence: a study of a total population of 19-year-old men born in the Netherlands is presented
.
Science
.
182
:
1096
1101
.

6.

Kristensen
P
,
Bjerkedal
T
.
2007
.
Explaining the relation between birth order and intelligence
.
Science
.
316
:
1717
1717
.

7.

Black
SE
,
Devereux
PJ
,
Salvanes
KG
.
2005
.
The more the merrier? The effect of family size and birth order on children's education
.
Q J Econ
.
120
:
669
700
.

8.

Esposito
L
,
Kumar
SM
,
Villaseñor
A
.
2020
.
The importance of being earliest: birth order and educational outcomes along the socioeconomic ladder in Mexico
.
J. Popul. Econ.
33
:
1069
1099
.

9.

Behrman
JR
,
Taubman
P
.
1986
.
Birth order, schooling, and earnings
.
J Labor Econ
.
4
:
121
150
.

10.

Bertoni
M
,
Brunello
G
.
2016
.
Later-borns don't give up: the temporary effects of birth order on European earnings
.
Demography
.
53
:
449
470
.

11.

Lehmann
J-YK
,
Nuevo-Chiquero
A
,
Vidal-Fernandez
M
.
2018
.
The early origins of birth order differences in children's outcomes and parental behavior
.
J Hum Resour
.
53
:
123
156
.

12.

Blake
J
.
1989
.
Family size and achievement
.
Berkeley (CA)
:
University of California Press
.

13.

Zajonc
R
,
Markus
G
.
1975
.
Birth order and intellectual development
.
Psychol Rev
.
82
:
74
88
.

14.

Zajonc
RB
.
1983
.
Validating the confluence model
.
Psychol Bull
.
93
:
457
480
.

15.

Hyeokmoon
K
,
Martschenko
DO
,
Harden
KP
,
Diprete
TA
,
Koellinger
PD
.
2020
.
Genetic fortune : winning or losing education, income, and health
.
Tinbergen Institute Discussion Papers 20-053/V
.
Amsterdam
:
Tinbergen Institute
.

16.

Harden
KP
.
2021
.
The genetic lottery - why DNA matters for social equality
.
Princeton (NJ)
:
Princeton University Press
.

17.

Engzell
P
,
Tropf
FC
.
2019
.
Heritability of education rises with intergenerational mobility
.
Proc Natl Acad Sci
.
116
:
25386
25388
.

18.

Björklund
A
,
Salvanes
KG
.
2010
.
Education and family background. Mechanisms and policies
.
IZA Discuss Pap
.
3
:
201
247
.. DOI: .

19.

Tucker-Drob
E
,
Rhemtulla
M
,
Harden
K
,
Turkheimer
E
,
Fask
D
.
2011
.
Emergence of a gene-by-socioeconomic status interaction on infant mental ability from 10 months to 2 years
.
Psychol Sci
.
22
:
125
133
.

20.

Jansen
PR
et al.
2018
.
Polygenic scores for schizophrenia and educational attainment are associated with behavioural problems in early childhood in the general population
.
J Child Psychol Psychiatry Allied Discip
.
59
:
39
47
.

21.

Wertz
J
et al.
2019
.
Using DNA from mothers and children to study parental investment in children's educational attainment
.
Child Dev
.
00
:
cdev.13329
.

22.

Breinholt
A
,
Conley
D
,
Parenting
C-D
.
2020
.
Differential early childhood investment by offspring genotype
.
NBER working paper 28217
.
Cambridge (MA)
:
National Bureau of Economic Research
.

23.

Linnakaari
R
et al.
2019
.
Trends in the incidence, rate and treatment of miscarriage - nationwide register-study in Finland, 1998-2016
.
Hum Reprod
.
34
:
2120
2128
.

24.

Cohain
JS
,
Buxbaum
RE
,
Mankuta
D
.
2017
.
Spontaneous first trimester miscarriage rates per woman among parous women with 1 or more pregnancies of 24 weeks or more
.
BMC Preg Childbirth
.
17
:
1
7
.

25.

García-Enguídanos
A
,
Calle
ME
,
Valero
J
,
Luna
S
,
Domínguez-Rojas
V
.
2002
.
Risk factors in miscarriage: a review
.
Eur J Obstet Gynecol Reprod Biol
.
102
:
111
119
.

26.

Malaspina
D
et al.
2005
.
Paternal age and intelligence: Implications for age-related genomic changes in male germ cells
.
Psychiatr Genet
.
15
:
117
125
.

27.

Almond
D
,
Currie
J
.
2011
.
Killing me softly: the fetal origins hypothesis
.
J Econ Perspect
.
25
:
153
172
.

28.

Foster
JW
,
Archer
SJ
.
1979
.
Birth order and intelligence: an immunological interpretation
.
Percept Mot Skills
.
48
:
79
93
.

29.

Brenøe
AA
,
Molitor
R
.
2017
.
Birth order and health of newborns: what can we learn from Danish registry data?
.
J Popul Econ
.
31
:
363
395
.

30.

Black
SE
,
Devereux
PJ
,
Salvanes
KG
.
2007
.
From the cradle to the labor market? The effect of birth weight on adult outcomes
.
Q J Econ
.
122
. DOI:
10.3386/w11796
.

31.

Conley
D
,
Bennett
NG
.
2000
.
Is biology destiny? Birth weight and life chances
.
Am Sociol Rev
.
65
:
409
439
.

32.

Young
BC
,
Levine
RJ
,
Karumanchi
SA
.
2010
.
Pathogenesis of preeclampsia
.
Ann Rev Pathol Mech Dis
.
5
:
173
192
.

33.

Gumusoglu
SB
,
Chilukuri
ASS
,
Santillan
DA
,
Santillan
MK
,
Stevens
HE
.
2020
.
Neurodevelopmental outcomes of prenatal preeclampsia exposure
.
Trends Neurosci
.
43
:
253
268
.

34.

Wu
CS
et al.
2009
.
Health of children born to mothers who had preeclampsia: a population-based cohort study
.
Am J Obstet Gynecol
.
201
:
269.e1
269.e10
.

35.

Kong
A
et al.
2012
.
Rate of de novo mutations and the importance of father's age to disease risk
.
Nature
.
488
:
471
475
.

36.

Taylor
JL
et al.
2019
.
Paternal-age-related de novo mutations and risk for five disorders
.
Nat Commun
.
10
:
1
9
.

37.

Goldmann
JM
,
Veltman
JA
,
Gilissen
C
.
2019
.
De novo mutations reflect development and aging of the human germline
.
Trends Genet
.
35
:
828
839
.

38.

Sartorius
GA
,
Nieschlag
E
.
2009
.
Paternal age and reproduction
.
Hum Reprod Update
.
16
:
65
79
.

39.

Barclay
K
.
2015
.
Birth order and educational attainment: evidence from fully adopted sibling groups
.
Intelligence
.
48
:
109
122
.

40.

Bellis
MA
,
Hughes
K
,
Hughes
S
,
Ashton
JR
.
2005
.
Measuring paternal discrepancy and its public health consequences
.
J Epidemiol Commun Health
.
59
:
749
754
.

41.

Lillehagen
M
,
Isungset
MA
.
2020
.
New partner, new order? Multipartnered fertility and birth order effects on educational achievement
.
Demography
.
57
:
1625
1646
.

42.

Thomson
E
,
Lappegård
T
,
Carlson
M
,
Evans
A
,
Gray
E
.
2014
.
Childbearing across partnerships in Australia, the United States, Norway, and Sweden
.
Demography
.
51
:
485
508
.

43.

Davies
NM
et al.
2019
.
Within family Mendelian randomization studies
.
Hum Mol Genet
.
28
:
R170
R179
.

44.

Kravdal
Ø
.
2019
.
Taking birth year into account when analysing effects of maternal age on child health and other outcomes: the value of a multilevelmultiprocess model compared to a sibling model
.
Demogr Res
.
40
:
1249
1290
.

45.

Muslimova
D
,
Van Kippersluis
H
,
Rietveld
CA
,
Von Hinke
S
,
Meddens
SFW
.
2020
.
Dynamic complementarity in skill production : evidence from genetic endowments and birth order
.
DIAL working papers issue: 1
.
Dynamics of Inequality across the lifecourse (DIAL)
,
Turku, Finland
.
1
46
.

46.

Price
J
.
2008
.
Parent-child quality time: does birth order matter?
.
J Hum Resour
.
43
:
240
265
.

47.

Isungset
MA
et al.
2021
.
Social and genetic effects on educational performance in early adolescence
.
NBER working paper no. 28498
.
Cambridge (MA)
:
National Bureau of Economic Research
.
1
27
.

48.

Bates
TC
et al.
2018
.
The nature of nurture: using a virtual-parent design to test parenting effects on children's educational attainment in genotyped families
.
Twin Res Hum Genet
.
21
:
73
83
.

49.

Wang
B
et al.
2021
.
Robust genetic nurture effects on education: a systematic review and meta-analysis based on 38,654 families across 8 cohorts
.
Am J Hum Genet
.
108
:
1780
1791
.

50.

Kong
A
et al.
2018
.
The nature of nurture: effects of parental genotypes
.
Science
.
359
:
424
428
.

51.

Rodgers
JL
.
2001
.
What causes birth order-intelligence patterns? The admixture hypothesis, revived
.
Am Psychol
.
56
:
505
510
.

52.

Magnus
P
,
Berg
K
,
Bjérkedal
T
.
1985
.
The association of parity and birth weight: testing the sensitization hypothesis
.
Early Hum Dev
.
12
:
49
54
.

53.

Swamy
GK
,
Edwards
S
,
Gelfand
A
,
James
SA
,
Miranda
ML
.
2012
.
Maternal age, birth order, and race: differential effects on birthweight
.
J Epidemiol Commun Health
.
66
:
136
142
.

54.

Morris
TT
,
Davies
NM
,
Hemani
G
,
Smith
GD
.
2020
.
Population phenomena inflate genetic associations of complex social traits
.
Sci Adv
.
6
:
eaay0328
.

55.

Harden
KP
,
Koellinger
PD
.
2020
.
Using genetics for social science
.
Nat Hum Behav
.
4
:
567
576
.

56.

Blake
J
.
1981
.
Family size and the quality of children
.
Demography
.
18
:
421
442
.

57.

Blake
J
.
1986
.
Number of siblings, family background, and the process of educational attainment
.
Soc Biol
.
33
:
5
21
.

58.

Zajonc
RB
.
1976
.
Family configuration and intelligence
.
Science
.
192
:
227
236
.

59.

Howe
LJ
et al.
2022
.
Within-sibship genome-wide association analyses decrease bias in estimates of direct genetic effects
.
Nat Genet
,
54
:
581
592
.. https://www.nature.com/articles/s41588-022-01062-7.

60.

Young
AI
et al.
2020
.
Mendelian imputation of parental genotypes for genome-wide estimation of direct and indirect genetic effects
.
bioRxiv
.
1
19
.

61.

Gibbs
BG
,
Workman
J
,
Downey
DB
.
2016
.
The (Conditional) resource dilution model: state- and community-level modifications
.
Demography
.
53
:
723
748
.

62.

Kitterød
RH
,
Nymoen
EH
,
Lyngstad
J
.
Endringer i bruk av barnetilsyn fra 2002 til 2010. Tabellrapport (Changes in use of child care from 2002 to 2010)
.
Akersveien 26
,
0177 Oslo, Norway
.

63.

Barrabés
N
,
Østli
GK
.
2012
.
Norwegian standard classification of education 2016
.
Oslo
:
Statistics Norway's Information Centre
.

64.

UNESCO Institute for Statistics
.
2012
.
The international standard classification of education (ISCED)
. DOI:.

65.

Magnus
P
et al.
2016
.
Cohort profile update: the Norwegian Mother and Child Cohort Study (MoBa)
.
Int J Epidemiol
.
45
:
382
388
.

66.

Paltiel
L
et al.
2014
.
The biobank in the Norwegian Mother and Child Cohort Study
.
Nor Epidemiol
.
16
:
29
35
.

67.

Rønningen
KS
et al.
2006
.
The biobank of the Norwegian Mother and Child Cohort Study: a resource for the next 100 years
.
Eur J Epidemiol
.
21
:
619
625
.

68.

Lee
JJ
et al.
2018
.
Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals
.
Nat Genet
.
50
:
1112
1121
.

69.

Manichaikul
A
et al.
2010
.
Robust relationship inference in genome-wide association studies
.
Bioinformatics
.
26
:
2867
2873
.

70.

Choi
SW
,
Mak
TS-H
,
O'Reilly
PF
.
2020
.
Tutorial: a guide to performing polygenic risk score analyses
.
Nat Protoc
.
15
:
2759
2772
.

This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact [email protected]
Editor: Karen E Nelson
Karen E Nelson
Editor
Search for other works by this author on:

Supplementary data