-
PDF
- Split View
-
Views
-
Cite
Cite
Sumi N Wren, Chris A McLinden, Debora Griffin, Shao-Meng Li, Stewart G Cober, Andrea Darlington, Katherine Hayden, Cristian Mihele, Richard L Mittermeier, Michael J Wheeler, Mengistu Wolde, John Liggio, Aircraft and satellite observations reveal historical gap between top–down and bottom–up CO2 emissions from Canadian oil sands, PNAS Nexus, Volume 2, Issue 5, May 2023, pgad140, https://doi.org/10.1093/pnasnexus/pgad140
- Share Icon Share
Abstract
Measurement-based estimates of greenhouse gas (GHG) emissions from complex industrial operations are challenging to obtain, but serve as an important, independent check on inventory-reported emissions. Such top–down estimates, while important for oil and gas (O&G) emissions globally, are particularly relevant for Canadian oil sands (OS) operations, which represent the largest O&G contributor to national GHG emissions. We present a multifaceted top–down approach for estimating CO2 emissions that combines aircraft-measured CO2/NOx emission ratios (ERs) with inventory and satellite-derived NOx emissions from Ozone Monitoring Instrument (OMI) and TROPOspheric Ozone Monitoring Instrument (TROPOMI) and apply it to the Athabasca Oil Sands Region (AOSR) in Alberta, Canada. Historical CO2 emissions were reconstructed for the surface mining region, and average top–down estimates were found to be >65% higher than facility-reported, bottom–up estimates from 2005 to 2020. Higher top–down vs. bottom–up emissions estimates were also consistently obtained for individual surface mining and in situ extraction facilities, which represent a growing category of energy-intensive OS operations. Although the magnitudes of the measured discrepancies vary between facilities, they combine such that the observed reporting gap for total AOSR emissions is ≥(31 ± 8) Mt for each of the last 3 years (2018–2020). This potential underestimation is large and broadly highlights the importance of continued review and refinement of bottom–up estimation methodologies and inventories. The ER method herein offers a powerful approach for upscaling measured facility-level or regional fossil fuel CO2 emissions by taking advantage of satellite remote sensing observations.
In many countries, emissions reporting at the facility level is the foundation upon which emissions reduction targets are established and is used to track progress on meeting national emissions reduction commitments. Independent verification of reported greenhouse gas (GHG) emissions with “top–down” estimates based on atmospheric measurements is not routinely undertaken. A method for the estimation of fossil fuel CO2 emissions that combines aircraft and satellite observations is developed and applied to oil sands operations, which represent a major source of Canadian CO2 emissions. A large and long-term discrepancy between top–down and bottom–up CO2 emissions is shown, underscoring the importance of independent checks on reported emissions for industrial sources and the vital role atmospheric measurements can play in supporting national GHG reduction commitments.
Introduction
Anthropogenic emissions of greenhouse gases (GHG), particularly CO2, have unequivocally warmed the atmosphere, ocean, and land (1). In response, the international community has established conventions and treaties such as the United Nations Framework Convention on Climate Change (UNFCCC) and a related series of Conference of the Parties (COP) meetings seeking to reduce GHG emissions and limit future warming to 1.5°C relative to preindustrial levels, with the goal of mitigating the severity of climate change impacts (2). As a signatory to the UNFCCC and COP 21 (the Paris Agreement), Canada recently updated its nationally determined contribution (NDC) to a 2030 GHG emissions target of 40–45% below 2005 levels and has committed to net-zero GHG emissions by 2050 (3). An important aspect of these global initiatives is the accurate accounting of GHG emissions by country and economic sector, as a means of tracking mitigation efforts and demonstrating progress toward targets.
In Canada, facilities emitting >10 kt CO2eq year−1 are required to calculate and report their emissions to Canada's Greenhouse Gas Reporting Program (GHGRP), using methods consistent with guidelines developed by the Intergovernmental Panel on Climate Change (IPCC) and adopted by the UNFCCC (4). Industrial facilities reporting to emissions programs such as the GHGRP rely upon “bottom–up” methods such as material balance and use process-specific emission factors and extensive primary data on material input where possible. However, for various reasons (e.g. undersampling of sources and abnormal operating conditions), emission factors and activity data used in bottom–up estimates may be outdated or unrepresentative, and not all sources may be accounted for (i.e. unidentified sources and the presence of “super-emitters”) (5–7). These issues are particularly relevant for oil and gas (O&G) sector emissions, which consist of numerous complex and intermittent sources. Indeed, recent studies have demonstrated significant discrepancies between top–down and bottom–up emissions of methane (CH4) for the O&G sector at large, in Canada and elsewhere (5, 8, 9). The advancement of top–down methodologies for the independent estimation of emissions from large industrial facilities is essential to addressing these issues, and the resulting emissions estimates can function as a complement to traditional bottom–up estimates.
The oil sands (OS) in northern Alberta, Canada, are a complex mixture of sand, water, clay, and bitumen, representing the third largest oil reserves in the world, with proven recoverable reserves of 165 billion barrels (bbl) covering 142,000 km2 over three distinct areas: Athabasca, Cold Lake, and Peace River (10). Only 20% of OS reserves are shallow enough to be mined using surface mining techniques, while the remainder are in deeper deposits that require thermal in situ extraction methods (10, 11). The extraction, separation, and upgrading processes are fuel intensive (12), resulting in higher GHG emissions intensities for OS production compared with conventional crude oil production, although there is considerable variation among estimates (12–14). The scale and complexity of OS operations makes estimating emissions using bottom–up approaches challenging which is consistent with observations of larger top–down vs. bottom–up discrepancies for GHGs and other OS pollutants (15–20). GHG emissions from OS operations, which are dominated by CO2 from fuel combustion, have been growing over the past two decades. Given that the OS make up a large fraction of the national GHG emission total (e.g. in 2020 contributing to nearly half of the O&G sector total and 12% of the national total of 672 Mt CO2eq) (21), a small fraction of unaccounted for OS CO2 could represent a large overall emission and thus the benefit of top–down GHG emissions estimates is particularly relevant.
Reported CO2 emissions are considered to have small uncertainties because they are usually based on material balance for which data on material input (i.e. fuel volume and fuel carbon content) are reasonably well constrained and because fugitive emissions, which are difficult to quantify for many pollutants, are typically small by comparison (21). Nonetheless, recent top–down CO2 emissions estimates for individual OS mining operations, made by scaling up hourly CO2 emission rates derived from aircraft measurements, were found to be higher than GHGRP-reported estimates (17). However, scaling up emission rates derived from short-duration sampling campaigns to annual values (20) can be challenging; therefore, a better understanding of the magnitude and persistence of the observed emissions gap is needed. In the case of OS operations, top–down emissions estimates for in situ operations have never been made, but are especially important given the projected expansion of in situ projects (22) and their high emissions intensity (14, 23). As a result, total OS CO2 emissions are uncertain.
Here, we present a multifaceted approach for reconstructing historic OS CO2 emissions up to the present time, using measured CO2/NOx emission ratios (ERs) together with bottom–up (inventory) and satellite-derived NOx emissions from Ozone Monitoring Instrument (OMI) and TROPOspheric Ozone Monitoring Instrument (TROPOMI). Since nitrogen oxides (NOx = NO + NO2) are coemitted with CO2 in combustion processes, a top–down estimation methodology using CO2/NOx ERs rather than absolute CO2 emission rates has the potential to be more robust and allows for historical NOx emissions data to be used to reconstruct past CO2 emissions. This study yields the first comprehensive top–down estimates of total Athabasca Oil Sands Region (AOSR) OS operations CO2 emissions, which suggest that emissions from both surface mining and in situ operations are underestimated and that this gap has persisted for at least the last 15 years. The results underscore the importance of the independent evaluation of reported emissions with atmospheric measurements and the continuous advancement of both top–down and bottom–up emissions methodologies.
Results and discussion
Overview of the ER approach
Aircraft measurements of CO2 and NOx around surface mining and in situ facilities in Alberta, Canada (Fig. 1), were undertaken in the spring and summer of 2018 (Materials and methods). Syncrude Mildred Lake (SML), Suncor (SUN), and CNRL Horizon (HORZ) are the main AOSR surface mining facilities, with both mining (crude bitumen production) and upgrading (synthetic crude oil production) on site, while the “OTHER” surface mining facilities (Suncor Fort Hills - FHS, CNRL Muskeg River & Jackpine - MKJ, Imperial Kearl Lake - IKL, and Syncrude Aurora - AUR) are mining-only operations. Together, SML, SUN, and HORZ account for the majority of CO2 emissions reported to the GHGRP by surface mining facilities (∼75% in 2018 (24)). AOSR in situ facilities use steam-assisted gravity drainage (SAGD) technology exclusively, while the Cold Lake region in situ facilities use either cyclic steam stimulation (CSS) or SAGD technology. Both SAGD and CSS involve the injection of high-temperature, high-pressure steam into wells that are drilled into the OS reservoirs in order to reduce the bitumen's viscosity such that it can be pumped to the surface for further processing (25). A brief overview of the key processes involved in OS extraction can be found elsewhere (12, 25). See Supplementary Section 1 for additional details on OS emissions reporting.

Overview of study area and emissions ratio approach. A) Google Earth image of the Athabasca and Cold Lake OS operations. The small blue polygons define the areas associated with individual facilities, used for the TROPOMI NO2 retrievals. The square yellow box defines the area associated with the surface mining facilities, used for the OMI NO2 retrievals (the “OMI spatial domain”). (B) The OMI spatial domain, showing the locations of the seven surface mining (white text) and three in situ (gray text) facilities. C) Schematic drawing illustrating how two measured ERs (ERC and ERT) are scaled with two separate NOx emissions estimates (satellite-derived, SAT, or inventory, INV) to yield four top–down CO2 emissions estimates. For surface mining facilities, the ERC is derived from measured stack (ERstack) and mining (ERmine) ERs. NPRI, National Pollutant Release Inventory; AEIR, Alberta Annual Emissions Inventory Report; GHGRP, Greenhouse Gas Reporting Program. Map: Google Earth © 2020 (image credit: Landsat/Copernicus).
The aircraft measurements were used to derive CO2/NOx ERs which were then scaled by annual NOx emissions to obtain annual CO2 emissions (Fig. 1C), utilizing an approach similar to that attempted recently for limited urban regions (26–29) and power plants (30). These previous studies have relied on CO2/NOx ERs from emissions databases (26, 28, 29) (which typically use bottom–up methods) and from continuous emissions monitoring (CEMS) of stack emissions (30). Here, we derived top–down facility-level CO2/NOx ERs from aircraft measurements using two different methods, one based on mass balance calculations (ERT) and the other on direct CO2/NOx mixing ratio correlations (ERC) (described below). We then combined the measured ERs with either inventory (31, 32) or satellite-derived NOx emissions to obtain four separate top–down CO2 emissions estimates (Fig. 1C):
In Eq. 1, NOx,tot are the total annual NOx emissions reported to the relevant inventories or estimated by satellite (OMI or TROPOMI), expressed as NO2 (see schematic Fig. 1C); i describes the NOx emissions used (INV or SAT, where SAT = OMI or TROPOMI), and j describes the ER used (T or C, where T = ERT and C = ERC). In the following, the term “inventory NOx” is used to describe the sum of facility-reported emissions from the National Pollutant Release Inventory (NPRI (31)) and Alberta's Annual Emissions Inventory Report (AEIR (32)) as defined in Materials and methods. The key underlying assumptions in the ER approach are that (i) OS CO2 emissions are proportional to NOx emissions at the facility level, (ii) NOx emissions are well constrained, and (iii) the ER has minimal intraannual variability. The first assumption is supported by Fig. S1 which demonstrates that GHGRP-reported CO2 emissions are proportional to inventory NOx emissions for surface mining and in situ facilities. The latter assumptions are addressed below.
OS NOx emissions
The OMI was used to derive total NOx emissions from 2005 to 2020 for a domain centered over the entire AOSR surface mining region (the “OMI spatial domain,” see Fig. 1B and Table S2), while the TROPOMI was used to derive facility-specific NOx emissions from 2018 to 2021 (blue polygons in Fig. 1A and B and Table S3) (Materials and methods). As discussed in McLinden et al. (33) and shown in Fig. 2A, the OMI-derived NOx emissions are within uncertainties of the corresponding inventory NOx emissions. The inventory and TROPOMI-derived NOx emissions agree within uncertainties for individual surface mining facilities (Fig. S5A), and the sum of TROPOMI-derived NOx emissions of all surface mining facilities agrees to within <10% of the respective inventory total during 2018–2020 (Fig. 2A). The in situ facilities typically have much smaller NOx emissions (<1 kt year−1), close to the detection limit for TROPOMI-derived NOx emissions, which have uncertainties on the same order of magnitude (Fig. S5A). Therefore, TROPOMI-derived NOx emissions cannot be used at the facility-level for individual in situ facilities. However, the sum of TROPOMI-derived and inventory NOx emissions for the AOSR in situ facilities is in reasonable agreement (Fig. S5B). Thus, the top–down satellite observations serve to independently validate the inventory NOx emissions, and overall, the comparisons here and previous work (33–36) suggest that OS NOx emissions are indeed well constrained, likely due to the use of CEMS for NOx. To account for potential uncertainties in both data sets and to more fully capture the range of possible CO2 emissions, annual inventory and satellite-derived (OMI or TROPOMI) NOx emissions are each used separately as scaling factors (in conjunction with the ERs; Fig. 1C) in the determination of annual CO2 emissions from OS facilities.

OS NOx emissions and CO2/NOx ERs. A) Annual inventory and satellite-derived NOx emissions for the OMI spatial domain. OMI-derived NOx (yellow line, with shading to represent uncertainty) and the sum of TROPOMI-derived NOx for surface mining facilities (solid blue squares) and for all facilities in the OMI domain (open yellow squares), with error bars to represent uncertainty. The OMI-derived NOx emissions are three year running averages for all emissions within the OMI domain. Inventory NOx is the sum of the NPRI and AEIR facility-reported NOx emissions. B) Average 2018 correlation (ERC) and TERRA-derived (ERT) ERs for OS surface mining facilities (left axis) and SAGD in situ facilities (right axis). OTHER, nonupgrading mining operations (FHS, MKJ, and IKL). C, D) Background-corrected CO2 as a function of background-corrected NOx (= NO + NO2) colored by SO2 C) and BC D) for the 2018 June 6 box flight around the SUN surface mining facility, showing the isolated stack ER (ERstack—pink line) and mine ER (ERmine—green line) for that flight.
OS ERs
Mass balance-derived ERs (ERT)
Background-subtracted CO2 and NOx data from box flights around OS facilities were used as inputs into a Top–down Emissions Rate Retrieval Algorithm (TERRA) (37), which yielded facility-specific hourly CO2 (in t CO2 h−1) and NOx (in kg NO2 h−1) emission rates for surface mining (Table S4) and in situ (Table S5) facilities (Materials and methods, Supplementary Section 3, and Fig. S6). The largest average hourly CO2 emission rates are associated with the surface mining facilities that include bitumen upgrading: SML (1,940 ± 500 t h−1, N = 3), SUN (1,060 ± 220 t h−1, N = 3), and HORZ (860 ± 290 t h−1, N = 4), all similar in magnitude to those measured in 2013 (17) and larger than those measured for SAGD in situ facilities in this study, which had an average of 420 ± 90 t h−1 over 10 flights (although it should be noted that this SAGD average encompasses facilities of varying scale) (38). TERRA-derived CO2/NOx ERs (ERT) for surface mining (Table S4) and in situ (Table S5) facilities were calculated from the ratio of the hourly CO2 and NOx emissions rates and expressed as ppm CO2 per ppb NOx to facilitate comparison below (Materials and methods). The ERT is effectively a site-wide ER capturing overall emissions from the whole facility, as opposed to a component-based ER from a single source (i.e. individual stacks and mine vehicle fleet). While there is considerable variation in the measured hourly emission rates for a given facility/type of facility (Tables S4 and S5), the calculated ERT values show less variation, supporting the use of ERs rather than emission rates to upscale emissions. The mean ERT for the surface mining facilities are SML = 0.8 ± 0.1 ppm ppb−1, SUN = 0.5 ± 0.1 ppm ppb−1, HORZ = 1.1 ± 0.2 ppm ppb−1, and “OTHER” (average of IKL and FHS) = 0.7 ± 0.1 ppm ppb−1 (Fig. 2B). The ERT values for SAGD in situ facilities (mean = 3.4 ± 0.9 ppm ppb−1) are found to be larger than those for surface mining facilities.
Correlation ERs (ERC)
In addition to, and independent of, the mass balance approach for ER determination above, distinct CO2/NOx ERs were isolated directly from the aircraft CO2/NOx mixing ratio correlations (Materials and methods and Supplementary Section 4), as demonstrated in Fig. 2C for two distinct ERs from a flight around SUN, each representing a different source (see also Fig. S7). One of the ERs (slope of 0.98 with R2 = 0.9, pink line Fig. 2C) was from a plume characterized by high CO2 and high SO2 and was attributed to stack emissions during upgrading. The highest CO2 and SO2 concentrations were measured at elevated altitude, supporting the attribution of this ER to an elevated stack source (ERstack). The other ER was from broader CO2 enhancements on either side of the main CO2 plume characterized by high relative NOx and high black carbon (BC) concentrations (slope of 0.19 with R2 = 0.2, green line Fig. 2D). This ER was attributed to emissions from the mining fleet and associated ground-based mining operations which are more spatially diffuse (ERmine). The NOx concentrations associated with this ER decreased with altitude, further supporting attribution to a ground source.
Based on similar spatial–temporal characteristics, isolated ERs were attributed to stack (ERstack) or mining-related (ERmine) emissions for flights around surface mining facilities (Table S6), as summarized here and discussed in greater detail in Supplementary Section 4.1. The three facilities with surface mining and upgrading operations, SUN, SML, and HORZ, were the targets of the largest number of flights, and their mean ERstack were measured to be 1.1 ± 0.1 ppm ppb−1 (N = 3), 1.4 ± 0.3 ppm ppb−1 (N = 3), and 1.2 ± 0.1 ppm ppb−1 (N = 4), respectively. Flights around the mining-only (“OTHER”) facilities showed much sharper CO2 plumes from nonupgrading combustion sources such as boilers and furnaces on top of a broader CO2 enhancement (i.e. mine emissions) and yielded a mean ERstack of 2.7 ± 0.4 ppm ppb−1 (N = 4, only one flight per facility). It was not possible to confidently isolate a mine ER for all flights due to the lower associated CO2 and NOx enhancements, and correlation coefficients were typically lower than those associated with the ERstack (R2 ranging from 0.15 to 0.90). However, it was possible to isolate a total of 15 ERmine from 14 flights in the surface mining region, which were all similar and yielded a mean ERmine of 0.25 ± 0.06 ppm ppb−1. This ERmine was applied to all surface mining facilities in subsequent calculations.
Distinct CO2/NOx ERs were also obtained from box flights made around surface mining facilities during an earlier Oil Sands Campaign in 2013 (Table S7 and Fig. S8). The mean 2013 ERstack (SUN = 1.2 ± 0.1 ppm ppb−1 and SML = 1.4 ± 0.2 ppm ppb−1) was found to be the same within uncertainties as the mean 2018 ERstack, for each of the two major upgrading facilities (SUN and SML), and the mean ERmine across all 2013 surface mining flights (0.24 ± 0.03 ppm ppb−1) is almost identical to the 2018 value. The ERmine is further constrained by CO2–NOx correlations measured during elevated NOx events at the Oski-ôtin ground monitoring site in Fort McKay from 2013 to 2016 (39). These events were more frequently detected in winter with a lower, more stable boundary layer and thus were attributed to mine emissions from the nearby OS operations, with a median (interquartile range) CO2/NOx ER of 0.33 ppm ppb−1 (0.22–0.43 ppm ppb−1), consistent with the ERmine obtained here.
Although technological advances and fleet turnover (affecting both stack and mine emissions) might be expected to impact measured ERs over time (12, 13), the similarity in the derived ERstack and ERmine values 5 years apart, and between aircraft and ground measurements, suggests that the CO2–NOx ERs have remained stable, and therefore, the 2018 values are used below to estimate emissions for the recent past. This apparent stability is also consistent with the linear relationship between inventory NOx and GHGRP-reported CO2 over the past decade for each surface mining facility in Fig. S1A.
In order to compare the correlation ER with that derived using TERRA, for each surface mining facility, a single, correlation-based composite ER (correlation ERC) was obtained by weighting the mean stack and mine ERs using inventory NOx,stack (NPRI (31)) and NOx,fleet (AEIR (32)) emissions (Eq. 5 in Materials and methods). This ERC is then directly comparable to ERT and can be scaled to CO2 emissions using total facility NOx emissions. Like the ERT, the ERC can be considered a facility-level ER. Note that ERC does not take into account transient emissions associated with flaring, which were observed during four flights (Fig. S9; see discussion in Supplementary Section 4.1), or fugitive emissions (i.e. from tailings ponds), both of which are expected to be small (<5% of total emissions) by comparison (19, 40). The resulting facility-specific ERC are listed in Table S9 (2018–2020) and are displayed in Fig. 2B (2018).
Distinct CO2/NOx ERs were also isolated from flights around SAGD in situ facilities (see example flight around FBG in Fig. S10). The main source of CO2 emissions from in situ facilities is likely to be the steam generators used in the extraction process, which account for more than 90% of the energy used (41). Clearly defined CO2 plumes were intercepted by the aircraft, and the CO2/NOx correlations in the plumes were stronger (higher R2) than those observed for the surface mining facilities (due to their smaller footprint and the absence of diffuse ground source emissions). The single ER (or mean ER in the case of two plumes) was assigned as that facility's ERC (Table S8), which is comparable to its respective ERT.
Comparisons of ERC and ERT
The facility-wide CO2/NOx ERs derived from pollutant correlations (ERC) and TERRA (i.e. mass balance) calculations (ERT) for the three surface mining facilities with upgrading (SML, SUN, and HORZ), the “OTHER” surface mining-only facilities, and the SAGD in situ facilities are shown in Fig. 2B for 2018. The surface mining ERC and ERT agree within uncertainties for a given facility, with the exception of HORZ which had a larger spread in individual flight ERT values (ranging from 0.7 to 1.6 ppm ppb−1) compared with SUN and SML (see additional discussion in Supplementary Section 4.1). However, the good overall agreement suggests that the correlation ER approach effectively captures the facility-level CO2/NOx ER, indirectly validating both the magnitudes of the measured ERstack and ERmine and their weighting.
Likewise, for SAGD in situ facilities, the individual ERC are the same within uncertainties as the corresponding ERT (Fig. S11) for all but one facility (CCL) (addressed in Supplementary Section 4.2), and the means are in excellent agreement (mean ERT = 3.4 ± 0.9 ppm ppb−1 vs. mean ERC = 3.6 ± 0.4 ppm ppb−1). Moreover, the spread in ERs for in situ facilities (both ERC and ERT) is narrow, which is consistent with the existence of a qualitative, linear relationship between GHGRP-reported CO2 and inventory NOx for AOSR SAGD facilities (Fig. S1C) and further supports the use of the mean ERs in top–down estimates (i.e. for emissions from nonstudied SAGD facilities or for the AOSR SAGD as a whole).
ERs were predominantly obtained during warm months in this study. However, the ERs (and the CO2 emissions) are not expected to exhibit a strong seasonal dependence since the majority of emissions arise from stationary fuel combustion, which should be insensitive to ambient temperature. Limited cold season ER measurements are also broadly consistent with the warm season ERs (Supplementary Section 5). In terms of production and/or activity, OS facilities operate continuously throughout the year, and operational considerations which might impact the facility-wide ERs (e.g. shutdowns and maintenance) are usually random or intermittent. The spread in the measured ERs and their uncertainties likely capture some of this variability. See Supplementary Section 4.1 for further comparison to literature ERs and Supplementary Section 5 for further discussion.
Top–down CO2 emissions for individual OS facilities
Four separate top–down annual CO2 emissions estimates (EINV,C, EINV,T, ETROP,C, and ETROP,T) for individual surface mining facilities were made for 2018–2021 by scaling the derived ERC and ERT, by either inventory or TROPOMI-derived NOx emissions according to Eq. 1 (and Materials and methods) and as illustrated schematically in Fig. 1C. Given the large relative uncertainties associated with TROPOMI-derived NOx emissions for individual in situ facilities, CO2 emissions estimates at the facility-level were made using inventory NOx only. The relevant inputs and the resulting emissions estimates are provided in Table S9 for surface mining and Table S10 for in situ facilities, respectively. Annual CO2 emissions from SML and AUR were calculated separately and summed for comparison with coreported values in the GHGRP. The four top–down estimates are found to agree within uncertainties (Fig. S12) for each surface mining facility, with the exception of HORZ due to the discrepancy between its ERC and ERT. For all surface mining facilities, differences among the four estimates are driven by year-to-year differences in TROPOMI-derived vs. inventory NOx as well as the facility ERC vs. ERT (Fig. S12).
The mean 2018 top–down CO2 emissions are 24.3 ± 3.1 Mt for the SML + AUR facilities, 9.5 ± 1.8 Mt for SUN, 6.5 ± 1.1 Mt for HORZ, and 10.1 ± 1.2 Mt for the “OTHER” (FHS + MKJ + IKL) which are 116, 27, 34, and 31% greater than those reported to the GHGRP, respectively (Fig. 3A). The mean top–down estimates for FHS, MKJ, and IKL are separately 7, 44, and 43% greater, respectively. Using the same ERs, similar discrepancies were found for 2019 and 2020. These observed discrepancies are similar to those found for 2013 (123, 13, and 36% higher emissions than GHGRP-reported values for SML + AUR, SUN, and HORZ, respectively) based on scaling hourly CO2 emissions rates to annual totals using activity and production data (17). Applying the methodology used in the previous work to the present study, the 2018 CO2 emissions are estimated to be 10 ± 3 Mt (SUN) and 7 ± 1 Mt (HORZ) (Supplementary Section 6 and Table S11), agreeing within uncertainties to the mean top–down estimates using the ER method. It was not possible to reliably calculate CO2 emissions for SML using this method due to insufficient data and uneven production throughout the study month and throughout the year (see Supplementary Section 6). Using the separate ERstack and ERmine values, the percentage of the 2018 EINV,C emissions attributed to stacks was found to vary from ∼60 to 90% for all surface mining facilities (Table S9). As a result, the stack component of EINV,C alone exceeded the GHGRP-reported total facility emissions, for the two largest upgrading facilities (SML + AUR and SUN), whose annual CO2 emissions rank first and third among all facilities reporting to the GHGRP.

Annual 2018 GHGRP-reported (hatched bars) and top–down (solid bars) CO2 emissions estimates for A) surface mining and B) in situ facilities. A) Top–down CO2 emissions estimates for SML + AUR (left axis) and other surface mining facilities (right axis) calculated as the mean of the four separate top–down estimates (EINV,T, EINV,C, ETROP,T, and ETROP,C). Annual 2013 GHGRP-reported (open squares) and top–down (solid squares) CO2 emissions estimates for 2013 from Liggio et al., derived from aircraft measurements of hourly CO2 emission rates. B) Top–down CO2 emissions estimates for studied and nonstudied (OTHER) in situ facilities, calculated as the mean of the two top–down estimates using inventory NOx only (EINV,T and EINV,C). Error bars in A) and B) represent the propagated uncertainty of the mean value.
The mean top–down CO2 emissions for in situ SAGD facilities were also found to be higher than GHGRP-reported emissions for the majority of studied facilities in 2018 (Fig. 3B), 2019, and 2020 (Table S10). In 2018, the discrepancies ranged from 7 to 88% for studied facilities, with the two largest facilities (by GHGRP-reported emissions) exhibiting the largest discrepancies (74% greater on average for FBG and 88% greater on average for CFC). It is possible that the facility-wide ERs obtained in this study (ERC or ERT) and used to estimate emissions are not representative of all operating conditions, which could potentially account for some of the observed discrepancies, as discussed in Supplementary Section 5.
Historical top–down CO2 emissions for the surface mining region (OMI spatial domain)
Historical top–down CO2 emissions estimates (2005–2020) were made for the OMI spatial domain (Table S12). Two top–down totals (ΣEINV,T and ΣEINV,C) were calculated by summing the individual facility estimates (EINV,T and EINV,C from above) made using annual inventory NOx emissions, and two top–down totals (ΣEOMI,T and ΣEOMI,C) were calculated using OMI-derived NOx emissions, and they are shown along with GHGRP-reported CO2 emissions in Fig. 4A (Materials and methods and Supplementary Section 7).

Total OS emissions. A) Annual CO2 emissions for the OMI spatial domain from 2005 to 2020. GHGRP-reported CO2 emissions for the surface mining facilities (colored fill) and in situ facilities (gray fill). Top–down estimates (ΣEINV,T, ΣEINV,C, ΣEOMI,T, and ΣEOMI,C) calculated using measured ERs (ERT and ERC) and scaling with OMI-derived NOx (solid lines) and inventory NOx (dashed lines). The shaded regions represent the propagated uncertainty. The solid orange square is the 2013 top–down estimate for surface mining facilities from Liggio et al.. The open orange square adds the reported CO2 emissions from FBG + HTS + MACK to that estimate, and the upper error bar adds the mean top–down CO2 emissions calculated in this study. B) Mean total top–down (solid bars) and GHGRP-reported (hatched bars) CO2 emissions for the AOSR surface mining (teal) and in situ (peach) facilities with GHGRP-reported CH4 emissions from all AOSR facilities (dark pink). The 2018–2020 means are the means across the four separate totals calculated using measured ERs (ERT and ERC) and scaling with TROPOMI-derived and inventory NOx. The 2021 mean is the mean of the top–down estimates using TROPOMI-derived NOx only. The error bars represent the propagated uncertainty.
The four top–down CO2 emissions estimates (ΣEINV,T, ΣEINV,C, ΣEOMI,T, and ΣEOMI,C) are in good agreement, and the discrepancies are primarily driven by differences in the OMI-derived and inventory NOx emissions (i.e. Fig. 2B) and in the applied ERC and ERT (arising from the use of mean vs. facility-specific values and the different ERC and ERT values). The highest top–down estimates are found for the ΣEOMI,T case due to the higher OMI-derived NOx emissions (Fig. 2B) and the higher mean ERT applied to the surface mining portion of the emissions (0.8 ± 0.1 ppm ppb−1). The higher ERT results from the weighted mean of the individual upgrading ERT which may be skewed by the exclusion of mining-only facilities and the higher measured ERT for HORZ. The three other estimates are in much closer agreement. In particular, the ΣEINV,T and ΣEINV,C estimates are almost identical, due to the use of facility-specific ERT and ERC which are similar in most cases as discussed above, whereas for ΣEOMI,T, the weighted mean was used resulting in a larger gap between ΣEOMI,T and ΣEOMI,C. The impact of using facility-specific vs. mean ERs and time-invariant ERs is further addressed in Supplementary Section 7. Overall, all four estimates are in agreement which shows that the mean ERs can be used to adequately estimate emissions from all surface mining facilities. The spread in emissions estimates from all four methods (together with their uncertainties) shown in Fig. 4A capture the range of the CO2 emissions.
As illustrated by Fig. 4A, for all cases and across all years, the top–down CO2 emissions estimates are larger than the reported CO2 emissions, with the average top–down estimate ranging from 65% to over 100% larger. In 2018, the average among the four top–down estimates (67 ± 5 Mt) is 65% larger than the GHGRP-reported emissions for facilities within the OMI spatial domain (39.7 Mt). Application of a modeled, time-variant ERmine (see Supplementary Section 7 and Fig. S13) lowers 2005 and 2010 CO2 emissions by 3.3 Mt and 2.8 Mt, respectively, representing 9 and 5% of the corresponding mean top–down estimate for the OMI domain (Table S12). The magnitude of these decreases is smaller than the uncertainty accompanying our top–down estimates and is expected be a lower bound. Therefore, the impact of our assumption (constant ERmine) is small. The 2013 top–down CO2 emissions estimate (∼43 Mt CO2) made by Liggio et al. (17) is also shown in Fig. 4A but only considered surface mining facilities. Including emissions from in situ facilities within the OMI spatial domain (i.e. FBG, HTS, and MACK) would add an additional 8.4 Mt if GHGRP-reported CO2 emissions are used (open orange square, Fig. 4A) or ∼12 Mt if top–down emissions estimate from this study (EINV,C and EINV,T from Table S10) are used (upper error bar, Fig. 4A). This brings the 2013 top–down estimate into good agreement with the top–down estimates from this study (average across the four estimates of 59 ± 5 Mt), which use ERs rather than absolute emission rates to scale up CO2 emissions.
Total Athabasca OS estimates
Finally, the mean top–down CO2 emissions estimates for the entire AOSR (ΣEAOSR,tot), separated into surface mining (ΣEAOSR,SURF) and in situ (ΣEAOSR,INSITU) facility emissions (Fig. 4B), were calculated as the average of the four top–down totals, which are the sum of the individual facility estimates (EINV,C, EINV,T, ETROP,C, and ETROP,T) from above. For the in situ facilities, the sum of the TROPOMI-derived AOSR in situ NOx emissions was used to estimate total AOSR in situ CO2 emissions using the mean SAGD in situ ERT and ERC (Table S9). These ΣEAOSR,tot emissions estimates were only made for 2018 onward as TROPOMI-derived NOx emissions are only available since 2018.
The mean ΣEAOSR,SURF in 2018 is 50 ± 4 Mt, which is 19 Mt greater than the total GHGRP-reported value of 31 Mt (or 61% higher), with SML + AUR accounting for 68% of this discrepancy. The four totals used to derive the mean top–down estimate are almost identical (spread < 2 Mt) due to the similarity in the total inventory and TROPOMI-derived NOx emissions and the similarity in the ERC and ERT. The mean ΣEAOSR,INSITU in 2018 is 45 ± 8 Mt (2018) which is 64% higher than the GHGRP-reported total of 27 Mt. Emissions from the two largest AOSR in situ facilities (FBG and CFC) account for a significant fraction of the reporting gap (e.g. in 2018, they accounted for nearly 60% of the ∼12 Mt gap between reported emissions and the mean top–down estimate made using inventory NOx only).
The mean ΣEAOSR,tot (surface mining and in situ) CO2 emissions estimate for 2018 is 95 ± 9 Mt (a difference of 37 Mt or 62% higher than reported). As demonstrated by Fig. 4B, the measured reporting gap is similar in 2019 and 2020. Together, Fig. 4A and B illustrate two key results. The first being that the overall CO2 emissions estimated from the top–down method here are consistently higher than those reported to the GHGRP over a 15-year period and the second being that there appears to be a consistent downward trend in the total CO2 emissions in the 4 years between 2018 and 2021 (Fig. 4B), with the possibility of an accelerating decrease. Furthermore, Fig. 4A suggests that emissions may have peaked sometime between 2016 and 2017. Additional data in the coming years, based on TROPOMI, will help confirm this downward trend. Annual CH4 emissions reported to the GHGRP are also shown in Fig. 4B (in CO2eq) for comparison purposes and are considerably smaller than CO2 emissions, although previous work suggests that facility-reported CH4 emissions may also be underestimated (20).
Conclusions
The described multifaceted methods and the resulting CO2 emissions for the OS have several noteworthy implications. The good agreement between the top–down CO2 emissions estimates using the two different ER approaches, together with the agreement between 2013 and 2018 estimates (i.e. involving aircraft data from two different campaigns), adds confidence to the estimates and suggests that the current and previous (17) upscaling approaches both reliably estimate emissions. Additional measurements to assess the seasonal variability of the facility-wide emission rates and ERs, and to ensure that they are representative of the full range of operating conditions, would be beneficial and help reduce uncertainties. Overall, the ER method presented in this study represents a powerful means of upscaling measurements of fossil fuel CO2 from complex industrial facilities or other area emission sources. The CO2/NOx ER and its uncertainty needs to be quantitatively determined for each source type, as the quality of the estimated CO2 emissions will be limited by the uncertainty in the CO2/NOx ER. The ER method represents an improvement upon the method of Liggio et al. (17), as it is less sensitive to fluctuations in absolute emissions, and it also has the advantage of exploiting satellite data such that long-term historical trends over larger geographical regions can be explored. However, we do note that the accuracy of the CO2 emissions estimates will be limited by the accuracy and uncertainty of the satellite-derived NOx emissions estimates. Another advantage is that satellites provide timely data availability, compared with inventory data, which can be a couple of years in arrears. This suggests that the ER method can be used to assess recent progress on GHG reductions and to determine whether mitigation efforts are having their intended effects (27). We demonstrate that facility-level NOx emissions for locations with annual emissions as low as 1 kt are possible with TROPOMI. The high-resolution TROPOMI NO2 product also offers the potential to explore emissions trends on a subannual basis, compared with annually reported emissions. This approach also circumvents the current limitations of direct satellite observations of CO2 emissions which are hampered by sparse data coverage (both in time and space) and the high regional CO2 background concentration (26, 42, 43), with the potential to be improved further as additional satellites are launched with even higher spatial resolution. The resulting top–down emissions estimates have several uses, such as reducing uncertainties in emissions inventories, identifying opportunities for emissions reduction, and revealing potential discrepancies not captured by the usual processes. Additionally, top–down emissions estimates from ground-based, airborne, and space-based atmospheric CO2 measurements are anticipated to play an increasingly important role in complementing bottom–up emissions and in developing top–down global atmospheric inventories to support the global stock-take.
GHG emissions reporting programs and inventories play an important role in assessing national-level mitigation efforts and commitments (i.e. progress toward targets) and supporting climate policy development. The accuracy of the facility-reported GHGRP emissions is important because they, along with supporting data sets, help reconcile the emissions reported in the National Inventory Report (NIR), submitted annually to the UNFCCC (21). The results from this study suggest that facility-reported, bottom–up emissions may be underestimated for the OS. Although measured discrepancies are small at the facility level (both in absolute and relative terms) for a majority of OS facilities, they are large for a few OS facilities, and they compound such that the overall reporting gap for total AOSR emissions is ≥(31 ± 8) Mt for each of the last 3 years (2018–2020). Put into context, the observed emissions gap is equivalent in magnitude to annual CO2 emissions from a small European country such as Switzerland (34 Mt CO2eq in 2020) (44). Importantly, the reconstruction of historical CO2 emissions suggests that a reporting gap may have persisted for at least the last 15 years and indicates that the methods and/or the underlying data used to produce bottom–up inventories may require further research.
More broadly, these results suggest that material balance-based, bottom–up emissions estimates, which are relied upon to track O&G emissions globally, would benefit from periodic evaluation with independent atmospheric measurements such as the remote sensing and in situ methods described here. With respect to the OS specifically, CEMS for CO2 stack emissions (which are regulated for US power plants and provide a check against fuel-based emissions estimates (45, 46)) could be an important tool for constraining estimates of CO2 emissions. Efforts to understand discrepancies between inventory and measured O&G CH4 emissions in Canada (21) are currently underway, and a similar, continuous improvement approach may also benefit CO2 emissions reporting. Consequently, future work should focus on reconciling the observed discrepancy and consider the best ways to integrate atmospheric-based measurements with conventional, bottom–up inventories.
Materials and methods
Reported NOx and CO2 emissions
Annual stationary-source NOx releases to air (NOx,stack) are reported by industry to the NPRI (31) based on CEMS, emission factors, and/or engineering estimates, and annual mine fleet NOx emissions (NOx,fleet) are reported by industry to Alberta's AEIR program (32). In the following, inventory NOx emissions refer to these emissions which are expressed in kt NO2. For surface mining facilities, the total inventory NOx emissions are
For in situ facilities, the total inventory NOx emissions are solely the NPRI values. The NPRI and AEIR NOx emissions do not have an uncertainty assigned to them; however, in this study, an uncertainty of 10% is assigned to the inventory NOx emissions based on the reported accuracy of CEMS NOx emissions (47). Several facilities report NOx emissions to NPRI based on CEMS measurements (the others are based on published emission factors or engineering estimates). Annual CO2 emissions in Mt CO2 are reported by industry to the GHGRP. The GHGRP-reported emissions include emissions from stationary fuel combustion, onsite transportation, flaring, and fugitive emissions (venting, leakage, waste, and wastewater) (48). The inventory NOx and reported CO2 emissions are available up to 2020 (24).
Satellite-derived NOx emissions
The OMI (49, 50) aboard NASA's Aura satellite and TROPOMI (51, 52) aboard the Copernicus Sentinel-5 Precursor satellite sensors were used to independently estimate OS NOx emissions (see additional details in Supplementary Section 2, Table S1, and Figs. S2–S4). Briefly, these down-looking (nadir-viewing) instruments capture upwelling sunlight, either reflected by the Earth's surface or scattered by the atmosphere, out into space. Spectra in the blue (∼430 nm) were used to derive tropospheric vertical column densities (VCDs, the vertically integrated number density from the surface to tropopause) of NO2, which in conjunction with wind information from a meteorological reanalysis (ERA (53, 54)) were then used to obtain top–down estimates of NOx emissions (55–57). For both OMI and TROPOMI, VCDs were recalculated using air mass factors based on higher-resolution input information in order to minimize biases (35, 36). The NO2 VCDs were scaled to effective NOx VCDs using a simulated NO2/NOx ratio from the Environment and Climate Change Canada (ECCC) air quality model GEM-MACH sampled at the day, time, and location of each TROPOMI pixel. This approach accounts for the variation of the ratio downwind. For TROPOMI, the effective NO2/NOx ratio, or the ratio of emissions derived with and without this scaling, varied from 0.60 to 0.63 with the variation reflecting the actual sampling of that year. GEM-MACH ratios are consistent with the aircraft observations (see Fig. S4). For OMI, a single value of 0.61 was used for all years as analogous GEM-MACH output is not available for all years back to 2005 (33). An uncertainty of 10% is assigned to this ratio when calculating the satellite emissions error budgets. NOx emissions from OMI were taken directly from McLinden et al. (33) and extended to TROPOMI in this work using the same methodology, as described in Supplementary Section 2. OMI provides a long data record, 2005–present, but has a low spatial resolution (pixel size of 13 × 24 km2) and in this study is limited to total emissions from the AOSR mining operations (the “OMI spatial domain,” Fig. 1B). The OMI spatial domain (approx. area 5.8 × 103 km2) includes all seven mining facilities as well as three in situ facilities (FBG, HTS, and MACK). The OMI domain also contains five non-OS facilities which report NOx emission to the NPRI, but they account for <1% of the total reported NOx most years (Fig. 2A). TROPOMI, on the other hand, is a newer instrument and only reports data beginning in 2018, but owing to its superior spatial resolution (pixel size of 6 × 7 km2) and data density is able to give emissions estimates for individual facilities (blue polygons in Fig. 1A and B), although some fall below the detection limit. Derived emissions are annual totals; however, for OMI, these are based on running 3-year groupings of observations as data density is roughly 30 times lower than TROPOMI and thus only extend to 2020 (i.e. using 2019–2021 data).
Aircraft measurements
Aircraft measurements over the AOSR of northern Alberta were performed from 2018 April 5 to 15 and from 2018 May 30 to July 5, as part of an intensive field campaign. Details regarding the aircraft implementation and related technical aspects have been previously described (58). Over the course of the study, 30 flights were carried out over the OS for a total of ∼120 h. To quantify total primary emissions, flights were conducted by flying in “box” formations (i.e. four- or five-sided polygons, at multiple altitudes) around individual facilities, resulting in 24 separate virtual boxes around 14 OS facilities. Additional flights were performed in “screen” formation to study the photochemical transformation of pollutants downwind of the OS.
Gas, particle, meteorological, and aircraft state parameter measurements were conducted aboard the National Research Council of Canada Flight Research Laboratory (NRC-FRL) Convair 580 research aircraft, as described elsewhere (58) and published on the ECCC Data Catalogue (59). Trace gases were sampled through a 6.35-mm-diameter, rear-facing, perfluoroalkoxy sampling line, with a residence time of less than 1 s. CO2 measurements were made using a Picarro model G2401-M instrument at a time resolution of 2 s. The CO2 instrument was calibrated using gas cylinders during and after the field campaign (350–450 ppm). These cylinders are transfer traceable to standards used by the Climate Research Division at ECCC for the Greenhouse Gas Observational Program. The standards used for the Greenhouse Gas Observational Program are traceable to the Global Atmosphere Watch (GAW) Program of the World Meteorological Organization (WMO-traceable). NO was measured using a Thermo Scientific 42iTL analyzer, and NO2 was measured using a second 42iTL instrument following photolytic conversion of NO2 to NO. NOy (= NOx + NOz; NOx = NO + NO2; NOz = the sum of “other” oxidized nitrogen species) was measured by placing an external molybdenum converter heated to 325°C as close as possible to the sampling point and interfacing to a third 42iTL instrument. NO, NO2, and NOy measurements were obtained at 1 Hz, with a sample residence time ∼3–4 s. SO2 was measured using a Thermo Scientific 43iTL analyzer at 1 Hz, with a sample residence time ∼7 s. The NO/NO2/NOy and SO2 analyzers were calibrated over a 0–100-ppb range with the National Institute of Standards and Technology (NIST) certified standard cylinders (diluted in zero air). BC was sampled via a forward-facing shrouded isokinetic inlet (Droplet Measurement Technologies) located at the top of the fuselage, forward of the engines, and measured at 1 s resolution using a Single-Particle Soot Photometer (SP2) manufactured by Droplet Measurement Technologies. The BC instrument was calibrated with size-selected Fullerene Soot (Alfa Aesar) particles (range 0.2–48 fg) obtained by nebulizing a water suspension and passing it through an aerosol particle mass analyzer (Kanomax APM3600).
Since industrial CO2 emissions occur on top of a large regional and continental background, an algorithm was used to determine background CO2 for each flight, calculated as the linearly interpolated low-percentile mixing ratio (typically the fifth) over a rolling 5–15-min window, which was subsequently boxcar smoothed (over a rolling ∼5-min window). The background was then subtracted prior to further analysis (17). Typical CO2 enhancements were ∼10–60 ppm above background, and the background during the ∼1–2-h flights was generally stable (varying < 2 ppm over the course of the flight, with the SD in the upwind mixing ratios typically ≪1 ppm). NOx measurements were averaged to 2 s to match CO2, and the NOx background, which was small by comparison, was similarly subtracted.
TERRA and TERRA-derived ERT
Hourly CO2, NO, and NO2 emissions rates for OS surface mining and in situ facilities were derived using a TERRA (17, 37) which has been previously used to estimate emissions for a number of pollutants (15, 17, 18, 37). Emission rates retrieved via TERRA have been evaluated against industry-reported CEMS measurements and found to agree to within <5% for SO2 plumes (17, 37). CO2 and CH4 emission rates derived by TERRA have also been found to agree with emission rates calculated using a different mass balance algorithm, when method assumptions are met (60). Briefly, the TERRA algorithm maps the high-resolution pollutant, wind (speed and direction), and aircraft location data to the 2D screens which comprise the lateral walls of virtual boxes via simple kriging to a resolution of Δs = 40 m (width) and Δz = 30 m (altitude). An example virtual box for the 2018 June 2 flight around the Syncrude (SML) facility is shown in Fig. S6, and the downwind screens clearly show above-background CO2 enhancements from elevated (i.e. stack) and ground (i.e. mine fleet and other mining-related equipment) sources. The algorithm then estimates the emission rate within the enclosed box via the divergence theorem, which equates the change of mass within the volume to the integrated mass flux through the box walls. The algorithm takes into account the advective flux through the box walls (as well as the box top) as well as any increase in mass within the volume due to a change in air density. Terms accounting for turbulent flux, a decrease in mass due to deposition to the surface and an increase in mass due to chemical reaction, were previously found (37) to have a negligible impact on the final emission rate (for the pollutants and conditions relevant to this study) and hence were not included here. Uncertainties associated with extrapolation of concentrations from the lowest flight path to the surface (δr) and associated with the background determination (δb) were used to determine overall emission rate uncertainties as described in Supplementary Section 3.
TERRA-derived CO2/NOx ERs (ERT) were calculated from the ratio of the hourly CO2 and NOx emissions rates ( and ).
Mass-based ERs are related to ERs obtained on a per mole basis according to
and are nearly identical given the similarity in the molecular weights (MW) of CO2 and NO2.
Correlation CO2/NOx ERs (ERC) from time series data
Distinct CO2/NOx ERs were isolated from the background-corrected CO2 and NOx time series data for individual flights around or near the mining and in situ facilities. For each flight, regions with distinct but recurring (i.e. over consecutive altitude laps) CO2/NOx ERs were isolated (additional details in Supplementary Section 4.1). The CO2/NOx ER for a given region was obtained from the slope of the linear CO2/NOx correlation (see examples for mining and in situ facilities in Figs. S7 and S10). For surface mining facilities (with and without upgrading), isolated CO2/NOx ERs attributed to stack (ERstack) and mine (ERmine) emissions were weighted to yield an overall “correlation ER” (ERC):
where the fleet fraction (Ff) is calculated from the reported NOx,fleet (AEIR (32)) and NOx,stack (NPRI (31)):
One or two clearly defined CO2 plumes were intercepted by the aircraft for each in situ facility, and the slope of the CO2/NOx correlation in the plume was taken as ERC (or the mean slope in the case of two plumes). Aircraft measurements over the AOSR made from 2013 August 13 to September 7 as part of an earlier field campaign (described previously (15, 17, 37)) were used to further assess the robustness of the ERmine and ERstack obtained in this study.
Top–down CO2 emissions estimates
Top–down, facility-specific, annual CO2 emissions estimates (Ei,j) were calculated according to Eq. 1. Additional details regarding the application of Eq. 1 to estimate emissions and their uncertainties can be found in Supplementary Sections 5 and 7 for individual facilities and for the OMI spatial domain, respectively. Briefly, for individual facility estimates, facility-specific ERT and ERC were used where available, and the appropriate mean ERT and ERC were used where they were not (e.g. the mean SAGD ERT and ERC were applied to the seven nonstudied AOSR in situ facilities). To obtain ΣEOMI,T and ΣEOMI,C, the total OMI-derived NOx emissions were separated into annual surface mining and in situ emissions which were then multiplied by the relevant (i.e. mean) ERC or ERT and then summed to yield total annual CO2 emissions for the OMI spatial domain. The ΣEINV,T and ΣEINV,C are calculated as the sum of individual facility estimates for facilities within the OMI spatial domain.
Acknowledgments
We acknowledge the strong support of the National Research Council of Canada Convair-580 flight crew, along with the technical and Information Management and Information Technology staff in the Air Quality Research Division. We thank Bernard Firanski and Julie Narayan for help setting up TERRA. We thank Junhua Zhang for guidance and support accessing emissions inventory and CEMS data. We thank Richard Melick of Alberta Environment and Parks for providing us with early access to 2019 and 2020 AEIR NOx emissions data. We thank Marilyn Albert, Ewa Przybylo-Komar, Katelyn Mackay, and Tara-Lynn Carmody of Data Management and Stewardship, Corporate Services Division, Alberta Environment and Parks, for providing the hourly CEMS data for SO2 and NOx. We thank colleagues in the Science and Risk Assessment Division (Steve Smyth, Chia Ha, and Owen Barrigar) and the Climate Research Division (Felix Vogel) at ECCC for their insight and helpful feedback. Thanks to Sandro Leonardelli and Stoyka Netcheva for guidance and support with the Oil Sands Monitoring (OSM) program processes.
Supplementary material
Supplementary material is available at PNAS Nexus online.
Funding
The project was partially funded by the Environment and Climate Change Canada's Climate Change and Air Pollutants Program. This work was also partially funded under the OSM program, and the results are independent of any position of the program.
Author contributions
J.L., S.N.W., and S.-M.L. designed the research/study; S.N.W., Ch.M., D.G., S.-M.L., S.G.C., A.D., K.H., Cr.M., R.L.M., Mi.W., Me.W., and J.L. performed research; S.N.W., Ch.M., D.G., A.D., and J.L. analyzed data; and S.N.W. wrote the paper with input from all coauthors.
Data availability
The 2013 and 2018 aircraft measurements have been published in the Canada Open Data Portal: https://donnees.ec.gc.ca/data/air/monitor/ambient-air-quality-oil-sands-region/pollutant-transformation-aircraft-based-multi-parameters-oil-sands-region/?lang=en. OMI products and documentation are publicly available from the NASA Goddard Earth Science Data and Information System: https://disc.gsfc.nasa.gov/datasets/OMNO2_003/summary. TROPOMI Level 2 data products and related documentation are publicly available on the TROPOMI website: http://www.tropomi.eu/data-products/nitrogen-dioxide.
References
Canada's 2021 nationally determined contribution under the Paris Agreement. [accessed 2002 February 17]. https://www4.unfccc.int/sites/NDCStaging/Pages/All.aspx.
Author notes
Competing Interest: The authors declare no competing interest.