Articles | Volume 13, issue 9
Research article
28 Sep 2020
Research article |  | 28 Sep 2020

Quality controls, bias, and seasonality of CO2 columns in the boreal forest with Orbiting Carbon Observatory-2, Total Carbon Column Observing Network, and EM27/SUN measurements

Nicole Jacobs, William R. Simpson, Debra Wunch, Christopher W. O'Dell, Gregory B. Osterman, Frank Hase, Thomas Blumenstock, Qiansi Tu, Matthias Frey, Manvendra K. Dubey, Harrison A. Parker, Rigel Kivi, and Pauli Heikkinen

Seasonal CO2 exchange in the boreal forest plays an important role in the global carbon budget and in driving interannual variability in seasonal cycles of atmospheric CO2. Satellite-based observations from polar orbiting satellites like the Orbiting Carbon Observatory-2 (OCO-2) offer an opportunity to characterize boreal forest seasonal cycles across longitudes with a spatially and temporally rich data set, but data quality controls and biases still require vetting at high latitudes. With the objective of improving data availability at northern, terrestrial high latitudes, this study evaluates quality control methods and biases of OCO-2 retrievals of atmospheric column-averaged dry air mole fractions of CO2 (XCO2) in boreal forest regions. In addition to the standard quality control (QC) filters recommended for the Atmospheric Carbon Observations from Space (ACOS) B8 (B8 QC) and ACOS B9 (B9 QC) OCO-2 retrievals, a third set of quality control filters were specifically tailored to boreal forest observations (boreal QC) with the goal of increasing data availability at high latitudes without sacrificing data quality. Ground-based reference measurements of XCO2 include observations from two sites in the Total Carbon Column Observing Network (TCCON) at East Trout Lake, Saskatchewan, Canada, and Sodankylä, Finland. OCO-2 retrievals were also compared to ground-based observations from two Bruker EM27/SUN Fourier transform infrared spectrometers (FTSs) at Fairbanks, Alaska, USA. The EM27/SUN spectrometers that were deployed in Fairbanks were carefully monitored for instrument performance and were bias corrected to TCCON using observations at the Caltech TCCON site. The B9 QC were found to pass approximately twice as many OCO-2 retrievals over land north of 50 N than the B8 QC, and the boreal QC were found to pass approximately twice as many retrievals in May, August, and September as the B9 QC. While boreal QC results in a substantial increase in passable retrievals, this is accompanied by increases in the standard deviations in biases at boreal forest sites from ∼1.4 parts per million (ppm) with B9 QC to ∼1.6 ppm with boreal QC. Total average biases for coincident OCO-2 retrievals at the three sites considered did not consistently increase or decrease with different QC methods, and instead, responses to changes in QC varied according to site and satellite viewing geometries. Regardless of the quality control method used, seasonal variability in biases was observed, and this variability was more pronounced at Sodankylä and East Trout Lake than at Fairbanks. Long-term coincident observations from TCCON, EM27/SUN, and satellites from multiple locations would be necessary to determine whether the reduced seasonal variability in bias at Fairbanks is due to geography or instrumentation. Monthly average biases generally varied between −1 and +1 ppm at the three sites considered, with more negative biases in spring (March, April, and May – MAM) and autumn (September and October – SO) but more positive biases in the summer months (June, July, and August – JJA). Monthly standard deviations in biases ranged from approximately 1.0 to 2.0 ppm and did not exhibit strong seasonal dependence, apart from exceptionally high standard deviation observed with all three QC methods at Sodankylä in June. There was no evidence found to suggest that seasonal variability in bias is a direct result of air mass dependence in ground-based retrievals or of proximity bias from coincidence criteria, but there were a number of retrieval parameters used as quality control filters that exhibit seasonality and could contribute to seasonal dependence in OCO-2 bias. Furthermore, it was found that OCO-2 retrievals of XCO2 without the standard OCO-2 bias correction exhibit almost no perceptible seasonal dependence in average monthly bias at these boreal forest sites, suggesting that seasonal variability in bias is introduced by the bias correction. Overall, we found that modified quality controls can allow for significant increases in passable OCO-2 retrievals with only marginal compromises in data quality, but seasonal dependence in biases still warrants further exploration.

1 Introduction

The boreal forest or taiga biome is the largest terrestrial biome on Earth; it includes the sub-Arctic regions of Europe, Asia, and North America between 50 and 70 N latitude, it represents an important and influential component of the global carbon cycle, and it is a principle driver of the atmospheric carbon dioxide (CO2) seasonal cycle. Accurate accounting for seasonal CO2 exchange in boreal forest regions is an essential component in quantifying the global carbon budget and predicting future climate scenarios (Tans et al.1990; Pan et al.2011; Graven et al.2013; Barlow et al.2015; Bradshaw and Warkentin2015; Gauthier et al.2015; Yin et al.2018). Studies by Graven et al. (2013) and Barlow et al. (2015) used a combination of atmospheric modeling, aircraft observations, and a network of ground-based in situ observations to investigate seasonal carbon exchange in the boreal forest. Both studies found that the boreal forest plays an important role in global atmospheric CO2 concentrations, significantly influencing in situ observations of CO2 in the tropics (Mauna Loa). Multiple studies have shown that there is a latitude-dependent trend in the seasonal amplitude of atmospheric CO2, with increased seasonal uptake of CO2 in boreal forest regions (Graven et al.2013; Wunch et al.2013; Barlow et al.2015; Lindqvist et al.2015). Furthermore, the studies by Graven et al. (2013) and Barlow et al. (2015) found that the trend in seasonal cycle amplitudes of CO2, with respect to latitude, nearly doubled between 1960 and 2011, suggesting that seasonal changes in the boreal forest are growing at an accelerated rate relative to lower latitude regions. While some studies have reported rapid changes in seasonal carbon exchange in the boreal forest and proposed that this is a dominant driver in the global carbon budget, another study by Barnes et al. (2016) suggests that it is actually the temperate forest between 30 and 50 N that is the dominant driver in the global carbon budget. Another study by Yin et al. (2018) found evidence of limits on previous assumptions about the relationship between seasonal cycle amplitude and spring and autumn temperatures in northern high latitudes, emphasizing the need for continued data-driven model validation for these regions. It remains difficult to reconcile conflicting claims about contributions to the global carbon budget without a spatially and temporally rich set of measurements for high-latitude regions, and data availability in the boreal forest remains a major impediment to accurately quantifying uptake in the world's largest terrestrial biome (Pan et al.2011; Barlow et al.2015; Euskirchen et al.2017). Therefore, methods of expanding observational coverage through improved satellite observations at high latitudes are essential for clarifying our understanding of global CO2 exchange.

Satellite-based observations of atmospheric CO2 columns offer a more holistic view of global CO2 dynamics by expanding spatial coverage. NASA's Orbiting Carbon Observatory 2 (OCO-2) was launched in July 2014, with CO2 column retrievals available from September 2014 to the present (OCO-2 Science Team/Michael Gunson, Annmarie Eldering2018). Satellite-based observations from OCO-2 consist of solar reflectance in three spectral windows centered at 0.76, 1.61, and 2.06 µm and referred to as the O2 A band, weak CO2 band, and strong CO2 band, respectively. The Atmospheric Carbon Observations from Space (ACOS) full physics retrieval algorithm (currently on version 9 or “ACOS B9”) described by O'Dell et al. (2012, 2018) and Connor et al. (2008) fits absorption features in these windows and incorporates additional meteorology and model data to retrieve column-averaged dry air mole fractions of atmospheric CO2 (XCO2) along with a variety of other parameters, such as aerosol optical depth, surface albedo, surface pressure, and total column water vapor. A number of parameters in the full physics retrievals are used to designate thresholds for postprocessing quality control filters. OCO-2 is polar orbiting, so overpasses are more frequent at high latitudes than midlatitudes, presenting a valuable opportunity to amass an extensive archive of CO2 observations over the boreal forest. However, before OCO-2 can be used to evaluate CO2 seasonality for the boreal forest, these data need to be validated in high-latitude regions. Quality control filters implemented in previous versions of the ACOS algorithm, like version 8 (ACOS B8) discussed by O'Dell et al. (2018), removed the majority of high-latitude observations, and as a result, OCO-2 high-latitude data have been underutilized. Validation of OCO-2 satellite-based retrievals at high latitudes has also been limited by the relatively few dedicated ground-based monitoring sites at high latitudes (Wunch et al.2017b).

In addition to the limited availability of ground-based validation data, there are a number of other challenges to passive satellite measurements at high latitudes. The Sun stays low in the sky at high latitudes; even in summer, when the Sun travels a long azimuthal path, it does not reach the same solar elevations as at lower latitudes. Low solar elevation corresponds to a high solar zenith angle (sza) and high air mass, meaning that sunlight travels a greater distance through the atmosphere before reaching the instrument. High air masses can cause absorption spectra to become saturated at line center, making column retrievals more sensitive to the line wings and, thus, the line shape of the absorption line. Spectroscopic uncertainties tend to be exacerbated at higher air masses, and the relative impacts of radiative transfer effects from atmospheric aerosols on satellite retrievals are also increased at higher air masses. In particular, slant-path aerosol optical depths (aod's) are larger and scattering angles are smaller, which increases the fractional contribution of aerosol scatter to the total radiance detected by the satellite. Air mass dependence in passive column measurements continues to be an area of ongoing research in retrieval algorithms (Wunch et al.2015), and high air mass in winter is one of the primary reasons for halting observations at high-latitude sites in November through February. Aside from the sunlight, climates and ecosystems at high latitudes are highly seasonally dependent, so there are a number of seasonal parameters that may produce time-dependent biases at high latitudes if they are not handled properly in retrieval algorithms. In particular, Wu et al. (2018) noted time-dependent biases at Sodankylä with the RemoTeC–OCO-2 retrieval algorithm. Wunch et al. (2017b) suggests that there are not enough passable retrievals from ACOS B7 to identify seasonal bias at high latitudes. Snow- and ice-covered surfaces are known to introduce extensive challenges in passive retrievals of XCO2 due to low surface albedo in the weak (1.61 µm) and strong (2.06 µm) CO2 bands used by OCO-2 (Wiscombe and Warren1980), and reflection anisotropy effects can further complicate retrievals over snow (Boesch et al.2011; Crisp et al.2012). Because snow cover is also seasonal and follows the solar cycle, it may be difficult to isolate the causes of seasonal bias at high latitudes. As a result, a certain amount of seasonal dependence may be inevitable, but we still endeavor to minimize it with careful attention to quality control methods.

Ground-based column measurements from solar-viewing spectrometers complement passive satellite observations because both use infrared absorption spectroscopy, with the Sun as a radiation source, and observe a full atmospheric column abundance. The Total Carbon Column Observing Network (TCCON) is a ground-based network that uses solar-viewing, high-resolution infrared spectrometers to retrieve XCO2 (Wunch et al.2011a). TCCON is the reference measurement for OCO-2 and is the primary source of validation data. In addition to comparing OCO-2 to TCCON, this paper compares OCO-2 observations to ground-based observations from an EM27/SUN Fourier transform infrared spectrometer (EM27/SUN FTS) operated in Fairbanks, Alaska. The EM27/SUN FTS was developed by the Karlsruhe Institute of Technology (KIT) in collaboration with Bruker Optics (Gisi et al.2012; Hase et al.2016) to be a compact, mobile solar-viewing FTS designed for field deployment. The EM27/SUN spectrometers have a resolution of 0.5 cm−1, similar to that of OCO-2 with a ∼0.3 cm−1 resolution, while the Bruker IFS 125HR used by TCCON has a much higher resolution of ∼0.02 cm−1. All three instruments record a solar infrared spectrum that can be used to retrieve XCO2. Several recent studies have compared EM27/SUN observations to TCCON (Hedelius et al.2016, 2017; Velazco et al.2018; Frey et al.2019). This paper uses similar retrieval methods for EM27/SUN retrievals of XCO2 to those of Hedelius et al. (2016), Hedelius et al. (2017), and Velazco et al. (2018) by implementing the GGG2014 retrieval algorithm coupled with the EM27/SUN GGG interferogram-processing suite (EGI; Hedelius and Wennberg2017). Hedelius et al. (2016) observed a 0.03 %±0.08 % (0.12±0.32 ppm) offset when comparing four EM27/SUN spectrometers to co-located observations at the Caltech TCCON site. Hedelius et al. (2017) found some EM27/SUN biases to TCCON as large as 0.14 % (∼0.56 ppm) but also found statistically significant variability amongst TCCON sites up to 0.3 ppm, suggesting that the site-to-site biases amongst TCCON sites may be of a similar size to biases between EM27/SUN FTS and TCCON as observed by Hedelius et al. (2016). Velazco et al. (2018) found an average offset to TCCON of approximately 0.46 % (∼1.84 ppm) when comparing 2 years of co-located observations between an EM27/SUN FTS and the TCCON site at the University of Wollongong, Australia. While some of these biases are large enough to produce significantly different results when choosing the EM27/SUN FTS or TCCON for the ground-based validation of satellite-based XCO2 retrievals, these biases tend to be systematic in nature and can be corrected to achieve acceptable agreement with TCCON through regular calibration measurements. The two EM27/SUN FTS used in Fairbanks were calibrated against the Caltech TCCON, and bias corrections were implemented to ensure that both EM27/SUN and TCCON observations are comparable sources of validation data for OCO-2 (see Sect. S1 in the Supplement for details on EM27/SUN instrument comparisons). The EM27/SUNs were also aligned and serviced at KIT, during which time they were compared to measurements from the Karlsruhe TCCON.

The objective of this study is to explore ways of defining quality control criteria for OCO-2 in high-latitude regions so as to maximize spatial and temporal coverage over the boreal forest while maintaining acceptable agreement with ground sites. It is also essential that biases in OCO-2-retrieved XCO2 be carefully evaluated under different quality control filtering regimes and in the context of high-latitude seasonality studies. To this end, we first define retrieval, quality control, and aggregation methods for ground-based measurements that are reasonably equivalent for EM27/SUN or TCCON observations. We verify that EM27/SUN retrievals of XCO2 in Fairbanks are interchangeable with TCCON through comparisons with the Caltech TCCON, which are used to rescale EM27/SUN observations to the TCCON trace-gas scale. Then, we turn to the problem of data paucity in OCO-2 at high northern latitudes during spring and autumn and investigate the quality control filters applied to those data. We subsequently suggest new quality control filters, specifically validated in boreal forest regions, that substantially increase OCO-2 high-latitude data throughput, and we evaluate the consequences of applying different sets of quality control filters to coincident OCO-2 retrievals at three boreal forest sites. Finally, we discuss observed seasonality in OCO-2 biases in the boreal forest and explore some retrieval parameters that may contribute to seasonality in bias.

2 Sites and data sources

Ground-based column measurements were collected at three sites in the boreal forest, including two TCCON sites at East Trout Lake (ETL), Saskatchewan, Canada (54.354 N, 104.987 W; Wunch et al.2017a) and Sodankylä (SOD), Finland (67.367 N, 26.631 E; Kivi et al.2017), as well as long-term measurements in Fairbanks (UAF), Alaska, USA (65.859 N, 147.85 W), using two Bruker EM27/SUN spectrometers (see Fig. 1). Observations at the Sodankylä TCCON site began in 2009 and span the full period of OCO-2 observations considered in this analysis, from September 2014 to November 2018, with the exception of the winter months (Kivi and Heikkinen2016; Kivi et al.2017). At East Trout Lake, observations began in October 2016, and because this site is further south, these measurements are nearly year round (Wunch et al.2017a). In Fairbanks, the Los Alamos National Laboratory (LANL) EM27 (owned by LANL) was operated during August–October 2016 and March–October 2017, while the Karlsruhe Institute of Technology (KIT) EM27 (owned by KIT ) was operated in April–October 2018. Regular characterization of the instrument line shape (ILS) for each EM27/SUN spectrometer was used to monitor instrument performance over time (see Sect. S1). The LANL EM27 was regularly compared to the Caltech TCCON spectrometer in side-by-side observations and was used as a transfer standard to rescale EM27/SUN retrievals in Fairbanks to the TCCON trace-gas scale (see Sect. S1). Tight correlations between the LANL and KIT EM27/SUN spectrometers, and between the LANL EM27 and Caltech TCCON instrument, suggest that, given the appropriate bias correction, observations from either of the two EM27/SUN spectrometers are relatively interchangeable with TCCON observations.

Figure 1Circumpolar map showing locations of boreal forest sites included in this study.

Retrievals from TCCON sites are vetted with careful quality control standards before being archived publicly (Kivi et al.2017; Wunch et al.2017a). The GGG2014 retrieval algorithm is used to retrieve XCO2 from TCCON and EM27/SUN observations (Wunch et al.2015) with some input modifications introduced in EM27/SUN retrievals by EGI, as discussed by Hedelius et al. (2016) and sourced from Hedelius and Wennberg (2017). Quality controls applied to EM27/SUN retrievals follow those outlined by Hedelius et al. (2016), including an upper bound on sza at 82 and an upper bound on XCO2 retrieval error at 5 ppm. In addition to the quality controls suggested by Hedelius et al. (2016), a lower bound is set on the average solar intensity (SIA) in EM27/SUN retrievals at 90 AU. After quality control filtering, EM27/SUN retrievals are smoothed by eliminating retrieved XCO2 that deviates by more than 1 ppm from a five-point moving average (spectra are collected approximately every 10–15 s). Throughout this paper, all time aggregations of retrieved XCO2 from ground-based observations were weighted by the inverse of the retrieval error using the following:

(1) x = i x i x err , i - 2 i x err , i - 2 ,

where xi is the retrieved XCO2 of the ith retrieval in the aggregation interval, and xerr,i is the corresponding retrieval error.

OCO-2 observations were retrieved with the ACOS B9 retrieval algorithm and retrievals for this study were obtained from OCO-2 B9 Lite files (OCO-2 Science Team/Michael Gunson, Annmarie Eldering2018), which have been initially screened for cloud cover as described by Taylor et al. (2016) and bias corrected as described by Osterman et al. (2018). Only OCO-2 soundings over land are included in this analysis (with land_fraction = 100), and the standard bias correction to TCCON is applied to all OCO-2 retrievals of XCO2, unless otherwise stated. Following the coincidence criteria defined by Wunch et al. (2017b), OCO-2 soundings were considered coincident to ground sites if they fell within a 5 latitude by 10 longitude box centered on the ground site and occurred on the same day as the corresponding ground measurements. At Fairbanks and Sodankylä, OCO-2 observations consistently occur within approximately 30 min of local solar noon, and at East Trout Lake they occur within approximately 1 h of local solar noon. Therefore, we define a daily ground-based reference value for XCO2 (referred to as the near-noon ground measurement or NNG), which is the error-weighted average (see Eq. 1) of ground-based XCO2 collected within 2 h of local solar noon.

3 Results

3.1 OCO-2 quality control filtering

Three different sets of quality control filters were applied to OCO-2 high-latitude retrievals in this study and are defined in Table 1 (see the Supplement or Osterman et al.2018 for definitions of quality control parameters). Two of these three sets of quality control (QC) filters are recommended by the OCO science team for ACOS B8 retrievals (B8 QC) and ACOS B9 retrievals (B9 QC) and are summarized by the binary variables xco2_quality_flag_b8 and xco2_quality_flag in the OCO-2 B9 Lite files (Osterman et al.2018; O'Dell et al.2018). Methods for selecting quality thresholds and details on the B8 QC filters are discussed by O'Dell et al. (2018). Improvements in pointing accuracy in ACOS B9 (Kiel et al.2019), and a careful reevaluation of quality control parameters, allowed for intentionally more permissive quality thresholds in B9 QC than in B8 QC; this resulted in a substantial increase in data throughput over regions, such as the boreal forest and high latitudes in general, that have been sparsely represented under past OCO-2 ACOS spectral fitting and quality control regimes. The third set of quality control filters (boreal QC) were determined by evaluating quality control histograms like those presented by O'Dell et al. (2018) with “truth” as the NNG observations from the three boreal forest sites (see Appendix Fig. A1). Scatterplots of bias in XCO2 (ΔXCO2 OCO-2 – NNG) against various retrieval parameters were also considered as a way to search for groupings of bias outliers that could be eliminated with small changes in quality control thresholds. The boreal QC were set with the goal of maximizing data throughput for high-latitude boreal forest sites in spring and autumn while maintaining acceptable ranges of bias at boreal forest sites. Additional retrieval parameters, not used in B8 or B9 QC, were also considered for the boreal QC that relates to challenges in high-latitude observations, including the difference between retrieved and a priori temperature (deltaT), solar zenith angle (sza), XCO2 retrieval uncertainty (xco2_uncertainty), and total column water vapor (tcwv).

Table 1Quality control thresholds currently used to determine the B8 QC (xco2_quality_flag_b8=0 in OCO-2 Lite files), B9 QC (xco2_quality_flag=0 in OCO-2 Lite files), and boreal QC (new proposed thresholds for terrestrial high-latitude sites). Descriptions of QC parameters in this table can be found in Table S2 or Osterman et al. (2018).

Download Print Version | Download XLSX

Changes to thresholds for albedo in the strong CO2 band (albedo_sco2), the quality of the spectral fit in the weak CO2 band (rms_rel_wco2), and the standard deviation in surface elevation in the satellite field of view (altitude_stddev) were major contributors to the increase in passable high-latitude retrievals with B9 QC relative to B8 QC (compare Figs. 2 and 3). In the boreal QC, ranges of acceptable values are expanded from those in the B9 QC for the ratios of single-band retrievals of CO2 (co2_ratio) and H2O (h2o_ratio) and the quality of the spectral fit in the weak CO2 band (rms_rel_wco2). Albedo in the strong CO2 band (albedo_sco2) is not used as a QC filter in the boreal QC because it seemed that problematic data with low albedo_sco2 could be screened by other QC filters, and there was no evidence that low albedo_sco2 was explicitly correlated to larger OCO-2 biases at boreal forest sites (see quality control plots in Appendix A). In fact, increases in bias and retrieval standard deviation were more often associated with high albedo in the strong CO2 band rather than low values. More conservative thresholds were placed on the slope of albedo in the strong CO2 band given by the continuum fit (albedo_slope_sco2) than were previously used in the B8 QC or the B9 QC due to observed increases in the standard deviation of retrievals and larger negative biases specifically associated with more negative albedo slopes. One possible explanation for this observation is that certain surface types that are more prevalent in the boreal forest are not correctly modeled by the ACOS B9 algorithm, and this could be related to snow-covered surfaces. We expect that introducing a polynomial fit to the albedo in each band, rather than a linear fit, could improve the accuracy of modeled surface albedo in future ACOS versions and potentially result in reduced high-latitude biases. Thresholds for the difference between retrieved surface pressure and a priori surface pressure at the pointing location of the O2A band (dp_o2a) remained the same in boreal QC as in B9 QC, while thresholds for the difference between retrieved surface pressure and a priori surface pressure at the pointing location of the strong CO2 band (dp_sco2) were made marginally more conservative. Kiel et al. (2019) discuss the pointing errors and other long-term challenges with surface pressure bias in OCO-2 retrievals that lead to the addition of the dp_o2a and dp_sco2 parameters in which there is one retrieved surface pressure and a separate a priori surface pressure defined for each band. The aerosol optical depth (aod) parameters are mostly the same in the boreal QC as in the B9 QC, with the exceptions that total aod (aod_total) and the combined dust, water, and sea salt aod (dws) were removed in the boreal QC because these seemed superfluous after applying other aod filters. While the range of acceptable values for the difference between retrieved and a priori vertical CO2 gradient (co2_grad_del) is nearly the same in the B9 QC as in the boreal QC, the range of values is shifted up. This choice was made based on the distribution for co2_grad_del for the boreal forest sites, and the difference may be attributed to the use of a regional data set for boreal QC rather than a global data set for B9 QC. As previously mentioned, several parameters were used to define quality control filters in the boreal QC that were not included in the parameters for B8 QC and B9 QC. A threshold for sza was introduced in the boreal QC and was chosen to restrict data furthest north to the months of March through November. Potential challenges with data at high sza are discussed in the introduction of this paper, and high sza was found to be correlated with larger negative OCO-2 biases at boreal forest sites. The sza threshold in the boreal QC only screens approximately 0.5 % of retrievals that manage to move through the other boreal QC filters. The thresholds placed on the difference between retrieved and a priori temperature (deltaT) and total column water vapor (tcwv) were chosen because very low atmospheric water vapor or large differences between retrieved and modeled temperatures are likely to correspond with cold weather and snow cover. In particular, before the application of quality control filters, large negative biases in OCO-2 retrievals were found to be associated with low values of tcwv (discussed in more detail in Sect. 4.3). Although the majority of biased retrievals with low tcwv are screened out by other quality filters, this filter helped to remove a small number of outliers that pass the other QC filters (see Fig. 21). Finally, the uncertainty in retrieved XCO2 (xco2_uncertainty) was included arbitrarily in the analysis and found to be effective in eliminating a small number of outliers that made up less than 0.05 % of retrievals not screened by other filters.

Figure 2Total number of land soundings north of 50 N flagged bad by B8-quality filters in each month.


Figure 3Total number of land soundings north of 50 N flagged bad by B9-quality filters in each month.


Figure 4Total number of land soundings north of 50 N flagged bad by boreal-quality filters in each month.


3.2 Effect of QC on data throughput north of 50 N

For each set of QC filters, all retrievals over land north of 50 N latitude in OCO-2 Lite files were evaluated to determine how many failed to meet the quality thresholds for each parameter in each month. The results in Figs. 2, 3, and 4 show the number of soundings flagged as bad for each individual QC parameter, so larger values suggest that more data are being removed during quality filtering. One caveat is that for a sounding removed in QC filtering there could be multiple parameters for which the retrieval was flagged as bad, or there could be as few as one parameter for which the retrieval is flagged as bad. Figures 2 and 3 show there is a clear seasonality to triggered quality filters, with the majority of soundings flagged as bad occurring in spring and early summer. This seasonality is slightly diminished with the B9 QC (Fig. 3) relative to the more conservative B8 QC (Fig. 2) and is only marginally manifested in the boreal QC (Fig. 4). The reduction in the number of filtered soundings in spring with boreal QC is largely attributable to less conservative bounds on the spectral fit quality in the weak and strong CO2 bands (rms_rel_wco2, rms_rel_sco2) and the ratios of single-band retrievals of CO2 (co2_ratio) and H2O (h2o_ratio). In all three sets of quality control filters, the parameters which most often cause soundings to be flagged as bad, resulting in the removal of data points, are the spectral fit quality (rms_rel_wco2, rms_rel_sco2), the ratios of single-band retrievals of CO2 (co2_ratio) and H2O (h2o_ratio), and differences between the retrieved and various a priori surface pressures (dp_sco2, dp_o2a, dp). The fact that these parameters account for a greater abundance of retrievals being flagged as bad in spring and autumn suggests that there could be seasonal effects related to these retrieval parameters that need to be accounted for in high-latitude measurements. In particular, there has been speculation that spring snow cover would result in low surface albedo in the 1.61 and 2.06 µm bands, and patchy snow cover or snow-free vegetation protruding from the snow pack could cause variability in albedo within the satellite's field of view (Wiscombe and Warren1980; Boesch et al.2011). However, after matching Moderate Resolution Imaging Spectroradiometer (MODIS) snow cover data to coincident OCO-2 retrievals at our boreal forest sites, there was no clear connection found between snow cover and increased magnitudes of OCO-2 bias with or without QC filtering. It may still be the case that an incongruous spatial resolution between MODIS and OCO-2 is masking the effects of snow cover on bias or that OCO-2 is only biased by snow in combination with certain other effects of cold weather conditions that are more frequently occurring in spring.

Figure 5Maps of differences between the number of soundings passed by boreal QC and the number of soundings passed by B9 QC in each 1 latitude by 1 longitude grid cell in each month (January to June; summing all years from 2014 to 2018).

Figure 6Maps of differences between the number of soundings passed by boreal QC and the number of soundings passed by B9 QC in each 1 latitude by 1 longitude grid cell in each month (July to December; summing all years from 2014 to 2018).

Figure 7The fraction of soundings that pass each set of QC filters for each month out of all land soundings north of 50 N in the OCO-2 Lite files (note that OCO-2 observations are filtered by a preliminary cloud screening before being compiled in the Lite files). Also plotted are the average solar zenith angles reported in OCO-2 retrievals that pass in each month and the monthly snow extent in the Northern Hemisphere, averaged over 2014–2018, as reported in the NOAA database (Robinson et al.2012).


Figure 8Time series of ground-based and satellite-based data at each boreal forest site. These consist of daily averages of OCO-2 coincident soundings filtered with boreal QC alongside corresponding daily averages of NNG with averaging kernel corrections to OCO-2 applied, as described in Sect. 3.3.1 (X̃NNG).


Additional data gained from applying boreal QC rather than B9 QC can be visualized as an increase in spatial coverage of terrestrial high-latitude regions. Figures 5 and 6 show the difference in the number of soundings passed as “good” by boreal QC relative to B9QC in each 1 latitude by 1 longitude geographic grid cell. These maps point to substantial increases in spatial coverage of the boreal forest (∼50–70 N latitude band) in the spring and autumn months with boreal QC. This improvement in coverage is an important advantage of the boreal QC for selecting OCO-2 retrievals with the goal of evaluating longitudinal trends in seasonal cycles for the boreal forest. The maps in Figs. 5 and 6 also show some areas where boreal QC provides less OCO-2 data throughput than B9 QC. First, the sza threshold on the boreal QC removes most soundings that manage to pass the other quality filters in winter (November through February), but winter observations are nearly absent from the OCO-2 Lite files before any QC filtering is applied, either because they are removed by prescreening or the data are not collected in the first place. It, therefore, seems reasonable to exclude these data points to the greatest extent possible, and, in general, analyses on satellite-based high-latitude XCO2 continue to be confined to the period between March and October. Second, Figs. 5 and 6 show that there are some regions where boreal QC yields noticeably less throughput than B9 QC in June, July, and August; however, these months are when there is already an abundance of data and these grid cells with decreased boreal QC throughput are often adjacent to grid cells with an increase in throughput. Overall, the boreal QC provides increased spatial coverage, and this is complemented by an increase in total throughput for soundings over land north of 50 N (see Fig. 7).

Figure 9Average bias, standard deviation in bias, and number of passed soundings sorted by viewing geometry and quality control method and considering all coincident soundings at each of the three boreal forest sites. Note that there are no coincident nadir soundings for Fairbanks due to the satellite's operational design which favors glint observations in orbits primarily over oceans. Note that bias is defined as (ΔXCO2 (OCO-2 retrieval)i -(X̃NNG)i) for each coincident OCO-2 sounding.


The B9 QC filters succeed in tripling the number of passed retrievals over land at high latitudes, relative to the B8 QC, and the boreal QC allow nearly double the number of retrievals allowed by B9 QC (see Fig. 7 and the right column of Fig. 9). An important result of the boreal QC is the increase in passed retrievals in May, August, and September relative to the B9 QC. While the more relaxed B9 QC allows more high-latitude retrievals than B8 QC, the relative throughput between any two months remains roughly unchanged, and the shapes of the histograms in Fig. 7a and b are very similar despite scaling. By plotting monthly snow extent in the Northern Hemisphere, as reported by NOAA (Robinson et al.2012), alongside monthly average sza and monthly fraction of soundings north of 50 N passed by QC filters, Fig. 7 provides further evidence that some combination of sza and snow cover could be playing a role in high-latitude data removal. If solar zenith angles (sza) were the primary driver of seasonality in high-latitude data throughput, one would expect to obtain approximately the same quantity of passed retrievals in May as in July, but Fig. 7 indicates that nearly twice as many high-latitude retrievals pass QC filters in July. As mentioned previously in this section, additional analysis did not lead us to the conclusion that snow is the culprit in itself, but some effects from snow or differences between fresh and melting snow cannot be entirely excluded either. It remains unclear how combinations of radiative transfer effects may be contributing to increased data removal at high latitudes in spring.

Figure 10Monthly average bias (a, c, and e) and standard deviation in biases (b, d, and f) of coincident OCO-2 soundings at each of the three boreal forest sites and with each of the QC methods. Note that bias is defined as (ΔXCO2 (OCO-2 retrieval)i-(X̃NNG)i) for each coincident OCO-2 sounding.


3.3 Comparing OCO-2 and ground-based observations

3.3.1 Averaging kernel corrections

The retrieval averaging kernel represents the sensitivity of retrieved XCO2 to enhancements at different altitudes in the atmospheric column. When comparing retrievals of XCO2 from two different spectrometers, averaging kernels can be used to mathematically correct for systematic sources of disagreement that result from instrumental differences. In this paper, averaging kernel corrections were applied to simulate the OCO-2 retrieval that would result by assuming the ground-based retrieval to be truth and scaling by the OCO-2 averaging kernel with an averaging kernel-correction factor, dNNG (see Sect. S2). The result of averaging kernel corrections is a set of modified ground-based measurements (X̃NNG) that are the sum of the NNG XCO2 aggregate (XNNG) and a dNNG value that is uniquely calculated for each coincident OCO-2 retrieval such that, in the following:

(2) X ̃ NNG = X NNG + d NNG ,


(3) d NNG = ( 1 - γ ) h T A 0 x a ,

where γ is the scaling ratio of retrieved to a priori near-noon ground-based XCO2, h is the pressure-weighting function, A0 is the OCO-2 averaging kernel, and xa is the a priori CO2 profile (see Sect. S2 for details). Averaging kernel-correction factors (dNNG) display some seasonal variability; the ratio of retrieved to a priori ground-based XCO2 was found to be the dominant term causing this seasonality (see the Supplement and compare dNNG in the top row of Fig. S2 to (1−γ) in the third row of Fig. S2). Any seasonality introduced by averaging kernel corrections appears to be on too small a scale to explain seasonal variability observed in the following sections. Note that in the remainder of this paper East Trout Lake, Sodankylä, and Fairbanks are abbreviated in figures to ETL, SOD, and UAF, respectively.

Figure 11Box plots of distributions of monthly biases for each boreal forest site and for each QC, showing results with and without the standard B9 bias correction (bc). Box plots show the median in the center of the box, the first and third quartile as the bottom and top of the box, and the full range of data values as the bars extending above and below the box. Note that bias is defined as (ΔXCO2 (OCO-2 retrieval)i-(X̃NNG)i) for each coincident OCO-2 sounding.


3.3.2 Biases by site, viewing mode, and QC method

To evaluate the effects of these three quality control methods on observed biases and data throughput, NNG observations with averaging kernel corrections applied (X̃NNG) were compared against coincident OCO-2 retrievals at three sites in the boreal forest (East Trout Lake, Fairbanks, and Sodankylä; see Sect. 2 for definitions of NNG and coincidence criteria). Daily averages for the complete set of coincident OCO-2 retrievals obtained with boreal QC and corresponding X̃NNG are shown in Fig. 8. While the OCO-2 and NNG observations in Fig. 8 appear to be in close agreement on most days, there are a few outliers in the OCO-2 daily averages in spring and autumn that may contribute to a potential seasonal dependence in bias. Figure 9 provides an overview of the full data sets for each site, including total average bias, standard deviation in bias, and data throughput, sorted by the satellite-observing mode and quality control method. Note that bias is defined as ΔXCO2 (OCO-2 retrieval)i -(X̃NNG)i for each coincident OCO-2 sounding, so that a negative bias indicates that OCO-2 retrievals are lower than NNG, and a positive bias indicates that OCO-2 retrievals are higher than NNG. At all three sites, target mode retrievals had 0.1 to 0.5 ppm lower standard deviation than glint or nadir retrievals, which may indicate the introduction of proximity bias (i.e., soundings further from the ground site contributing larger bias). If proximity bias is an important source of bias, one may expect that target mode retrievals would also have lower average biases than glint and nadir retrievals. Results from East Trout Lake (ETL) and Fairbanks (UAF) meet this expectation, with the exception of the results from the B8 QC at East Trout Lake. Only at Sodankylä (SOD) do the average biases in target mode retrievals substantially exceed the average biases in glint or nadir retrievals, warranting further investigation of target observations at Sodankylä. In particular, the B8 QC results in an average bias in target mode soundings at Sodankylä that is at least twice that observed at East Trout Lake or Fairbanks, and because this increase is accompanied by increased standard deviation in target mode biases at Sodankylä, it could indicate influence from outliers. While the allowance of additional data switching from B8 QC to B9 QC or from B9 QC to boreal QC tends to be accompanied by an approximate increase in the standard deviation in biases of 0.1 to 0.3 ppm, average biases at the three sites are not consistently larger with the boreal QC (see left and center columns of Fig. 9). In considering the use of boreal QC for certain science applications at high latitudes, the introduction of additional scatter should be weighed against the large increase in usable retrievals, as shown in Fig. 9c, f, and i and in Fig. 7.

Figure 12Correlations and linear regressions for dpfrac (defined in the Supplement or by Kiel et al.2019) as a function of temperature at 700 hPa (T700) for all retrievals over land north of 50 N that pass boreal QC and separated here by viewing geometry.


Figure 13Correlations and linear regressions for the difference between retrieved and a priori surface pressure from GEOS5 forward processing for instrument teams (GEOS5-FP-IT; dp) as a function of temperature at 700 hPa (T700) for all retrievals over land north of 50 N that pass boreal QC and separated here by viewing geometry.


3.4 Seasonal variability in bias

High-latitude regions experience a higher degree of seasonality in many climate and environment variables than midlatitude regions, and one of our primary motivations in this study is expanding our ability to evaluate CO2 seasonality in the boreal forest. Considering the total average and standard deviation in biases for all coincident soundings, as in Fig. 9, can obscure seasonal variability in biases that contribute to uncertainty in characterizing seasonal cycles of CO2 obtained from satellite observations. Figure 10 shows the monthly average biases and monthly standard deviation in biases considered for each site and each set of QC filters. Under all three sets of QC filters we observe seasonal trends in biases, which are more pronounced at East Trout Lake and Sodankylä than at Fairbanks. The observed seasonal variability is characterized by more positive biases in mid to late summer and more negative biases in spring and autumn, which may cause satellite-based estimates of seasonal amplitude and timing to differ from ground-based estimates of these seasonal parameters. For most months the boreal QC does not result in substantial increases in the absolute value of the average monthly biases relative to B9 QC, but in April the boreal QC yields larger negative biases than B9 QC at Fairbanks, and at Sodankylä large negative biases are observed in the boreal QC without any counterpart in the B8 or B9 QC to compare them to. In Fairbanks the average April bias drops from −0.59 ppm, with the B9 QC, to −1.15 ppm, with the boreal QC. Figure 17a, shows that the difference in the absolute values of the average April bias in Fairbanks is the largest difference in absolute monthly biases when comparing the boreal and B9 QC methods. At Sodankylä April data are only allowed by the boreal QC, while B8 QC and B9 QC filter out all coincident OCO-2 retrievals, but the negative average bias obtained with boreal QC at Sodankylä in April represents the maximum absolute monthly bias of any month, site, or QC method. Figure 10 also shows larger negative biases with the boreal QC at Fairbanks in August, September, and October in which the boreal QC yields average monthly biases descending from −0.46 to −0.72 ppm in consecutive months, while the B9 QC yields average biases descending from −0.08 to −0.43 ppm. At East Trout Lake and Sodankylä, the monthly biases are only marginally different between the boreal QC and B9 QC, and the boreal QC resulted in slightly smaller monthly biases in March, June, and November at East Trout Lake and in August at Sodankylä (see Figs. 10 and 17a).

Figure 14Box plots of distributions of monthly biases at each boreal forest site and filtered by boreal QC, given the standard B9 bc (Eq. 4), given the B9 bc without the dpfrac term, given the B9 abc (Eq. 6), or given the B8 abc (Eq. 7). Note that bias is defined as (ΔXCO2 (OCO-2 retrieval)i-(X̃NNG)i) for each coincident OCO-2 sounding.


Monthly bias distributions are visualized with box plots for each site and set of QC filters in Fig. 11 to further elucidate potential seasonal trends in OCO-2 biases. Figure 11a, b, and c show that East Trout Lake has the most pronounced seasonal variability in biases and the trends observed are similar for all three sets of QC filters. Figure 11f suggests there is a slight seasonal trend at Sodankylä with the boreal QC that appears when March and April soundings are included. Overall, the monthly bias distributions also serve to emphasize the similarity in results from the different QC methods.

Figure 15Monthly average bias (a, c, and e) and standard deviation in biases (b, d, and f) of coincident OCO-2 soundings at each of the three boreal forest sites with boreal QC filtering and given each of the following bias correction modifications: the standard B9 bc (Eq. 4), the B9 bc without the dpfrac term, B9 abc with a term for temperature at 700 hPa (T700; Eq. 6), and B8 bc with a term for T700 (Eq. 7).


Figure 16Average bias and standard deviation in bias sorted by viewing geometry and bias correction modification for all coincident soundings at each of the three boreal forest sites with boreal QC. Note that there are no coincident nadir soundings for Fairbanks due to the satellite's operational design which favors glint observations in orbits primarily over oceans.


3.5 A modified OCO-2 bias correction with temperature at 700 hPa (T700)

The seasonal dependence of ΔXCO2 described in the previous section was found to be largely induced by the OCO-2 bias correction and is not apparent in ΔXCO2 calculated with un-bias-corrected OCO-2 retrievals (see Fig. 11). In OCO-2 B9 retrievals, the B9 bias correction (B9 bc) for soundings over land is defined by Osterman et al. (2018) as follows:

(4) X CO 2 , corrected = X CO 2 , raw - foot + 0.9 ( dpfrac ) + 9.0 ( dws ) + 0.029 ( co2_grad_del - 15 ) 0.9954 ,

with a footprint bias-correction term, foot, an overall divisor to agree with TCCON, and parameter-dependent terms adjusting based on a modified parameterization of the retrieved surface pressure bias defined by Kiel et al. (2019, dpfrac), the sum of dust, water, and sea salt aod's (dws), and the difference between retrieved and a priori vertical gradients in the CO2 profile (co2_grad_del). Of the terms in the B9 bc, dpfrac was the only one found to have seasonal variability at boreal forest sites that was similar to that observed in ΔXCO2 with the OCO-2 bias correction (see Figs. 11 and 25). As will be discussed in Sect. 4.4, all versions of the residual in retrieved surface pressure relative to a priori surface pressure (dpfrac, dp, dp_o2a, and dp_sco2) have seasonal variability that can be at least partially attributed to temperature dependence, so we propose new OCO-2 bias corrections with a term for temperature at 700 hPa (T700) to correct for the temperature dependence in dpfrac and dp. To calculate the temperature-dependent modification to the B9 bc, we consider the linear regressions for dpfrac as a function of T700 in each of the satellite-viewing modes for soundings over land north of 50 N that pass boreal QC (see Fig. 12). Then the regression coefficients for the different viewing modes are combined into average slope and average y intercept with weighting by the fractional abundance of retrievals in that mode to obtain an alternative B9 bias correction (B9 abc), as follows:


Figure 17Differences between average monthly biases (a) or standard deviation in monthly biases (b) when comparing boreal QC to B9 QC. Note that Fig. 10 shows typical average monthly bias ranges from −1 to 1 parts per million (ppm).


Figure 18Monthly histograms of the ratio in single-band retrievals of CO2 (co2_ratio) for all unfiltered OCO-2 retrievals over land (land_fraction = 100) split into two latitude bands, namely 10 to 50 N and 50 to 90 N.


An alternative B8 bias correction (B8 abc) was also constructed using linear regression terms for the difference between the retrieved and a priori surface pressure from GEOS5 forward processing for instrument teams (GEOS5-FP-IT; dp) as a function of T700 in Fig. 13 as follows:


Applying a modification of the B8 bias correction is consistent with the fact that spectroscopy and most aspects of the radiative transfer model remained the same when the ACOS version was updated from B8 to B9. Ideally, a global bias correction would be constructed to include temperature as a component of a broader analysis that considers contributions to the OCO-2 bias in a more holistic context. In this way, the bias correction would be more uniform and widely applicable, while the effects of potential parameter covariance or other influences that we are unable to control for in this regional analysis can be mitigated. That being said, the results that follow suggest that correcting for temperature dependence could be effective in reducing seasonality in OCO-2 bias over the boreal forest.

The second column of Fig. 14b, f, and j, in addition to results in Figs. 1516, show that seasonally dependent variability in biases is reduced when the dpfrac term is removed from the B9 bc, but both monthly and overall standard deviations in biases are increased. Without the dpfrac term in the B9 bc, monthly biases in March and April at Sodankylä and in April at Fairbanks are substantially reduced, and month-to-month variability at East Trout Lake is also reduced. Replacing the dpfrac or dp term with a T700 modification, as in B9 abc and B8 abc (Eqs. 6 and 7), results in lower monthly standard deviations in biases than those obtained in the B9 bc with the dpfrac term removed and that are nearly equivalent to those obtained with the standard B9 bc (Eq. 4). While some of the seasonal shape is reintroduced with the B9 abc and the B8 abc, biases are still reduced in spring and autumn relative to the B9 bc (see Fig. 15). The combined results of Fig. 15, with the total average biases and total standard deviations in biases shown in Fig. 16, suggest that the B8 abc is slightly more effective than B9 abc in reducing seasonal variability in bias, reducing total average bias, and reducing total standard deviations in biases. Figure 16 demonstrates that, for all sites and viewing modes, most of the total average biases with the B8 abc are within ±0.5 ppm. In particular, the B8 abc results in reduced average bias in target mode soundings at all three sites, including nadir soundings at Sodankylä and glint soundings at East Trout Lake and Fairbanks. The B8 abc did result in slight increases in total average biases in nadir soundings at East Trout Lake and in glint soundings at Sodankylä (see Fig. 16). However, with the B9 abc, average biases in all modes at Sodankylä, and in nadir and target retrievals at East Trout Lake, are nearly doubled relative to the standard B9 bc.

Figure 19Seasonal box plots of retrieved tcwv (total column water vapor) at (a, b, and c) East Trout Lake, (d, e, and f) Sodankylä, and (g, h, and i) Fairbanks, given each of the three quality control methods.


4 Discussion

Results from this analysis have revealed that modified QC filters for OCO-2 retrievals have the potential to recover large quantities of previously screened terrestrial high-latitude observations and provide double or triple the number of retrievals for use in scientific studies of high-latitude regions. This increase in data throughput is accompanied by only minor changes in average bias and increases in standard deviations in bias of approximately 0.3 ppm or less (see Figs. 9 and 17). Through monthly comparisons between coincident OCO-2 retrievals and ground-based measurements at three boreal forest sites, biases (ΔXCO2) were found to exhibit some seasonal variability that is mostly independent of the QC method applied (see Fig. 10 and Fig. 11). Observations from Fairbanks exhibit less seasonal variability in OCO-2 bias than East Trout Lake or Sodankylä, and it is unclear whether this is due to geography or the difference in ground-based instrumentation. The EM27/SUN has a similar spectral resolution to OCO-2 (∼0.5 and ∼0.3 cm−1, respectively), both of which differ from that of TCCON (∼0.02 cm−1) by an order of magnitude. The more similar spectral resolution of the EM27/SUN to OCO-2 could be a reason for smaller biases between these instruments. A recent study by Sha et al. (2019) compared long-term measurements of XCO2 using multiple ground-based infrared spectrometers at Sodankylä, including the EM27/SUN and TCCON. Their results suggest that the EM27/SUN at Sodankylä may retrieve higher XCO2 than TCCON in spring, and factors such as temperature and water vapor may influence differences between EM27/SUN and TCCON retrievals of XCO2. A more direct comparison between the EM27/SUN observations and satellite measurements from Sodankylä, and other locations, are needed to further investigate these relationships. Seasonally dependent biases can be challenging to correct and can ultimately result in biases between satellite-based and ground-based estimates of seasonal cycle parameters such as amplitude and timing. Therefore, it is essential that any seasonality in biases be characterized and potential sources be identified. Both the B9 abc in Eq. (6) and the B8 abc in Eq. (7) result in reduced monthly average biases in spring and autumn, and the standard deviations in biases are nearly equivalent to those obtained with the standard B9 bc in Eq. (4). The B8 abc was found to be slightly more effective than the B9 abc in reducing the seasonal curvature in monthly biases at East Trout Lake and Sodankylä, which could allow for lower uncertainty in seasonal cycle parameters estimated using OCO-2 retrievals over the boreal forest. However, some month-to-month variability persists with any of the bias corrections applied in this paper, and it is still important to continue to explore other contributions to seasonal variability in OCO-2 bias, such as the choice of QC, coincidence criteria, or processing of ground-based data. In this vein, the following sections consider differences in monthly average bias and standard deviation in bias between boreal QC and B9 QC. Then we consider a number of QC parameters that exhibit seasonal behavior and their potential role in seasonally dependent biases at boreal forest sites. The effects of limiting coincidence by midtropospheric temperature or changing the ground-based reference from NNG to an average of ground-based retrievals at a restricted range of solar zenith angles were also explored but found to not significantly affect seasonal dependence in bias (see Appendices B and C).

4.1 Potential contribution to seasonal bias from QC method

Overall, the B8 QC is the most conservative set of QC filters, the B9 QC allows for more relaxed thresholds in the QC parameters, and the boreal QC is the most permissive set of QC filters. It was observed in Fig. 9 that the total standard deviations in biases for all coincident soundings at East Trout Lake and Sodankylä gradually increase from ∼1.3 and ∼1.4 to ∼1.6 ppm ascending, with the increase in throughput obtained from relaxing QC filters (B8 QC < B9 QC < boreal QC). A similar trend at Fairbanks is reflected by an increase from ∼1.1 and ∼1.3 to ∼1.4 ppm. Figure 10 also demonstrates this increase in standard deviation with different QC filters, but there does not appear to be a seasonal trend in the monthly standard deviation in biases at East Trout Lake and Fairbanks. The anomalously high standard deviation in biases in June at Sodankylä remains to be reconciled, and it represents a potential complication that would perpetuate midsummer uncertainty even if some method of correcting seasonal trends in monthly bias were devised and implemented. Additionally, there is a substantial increase in standard deviation in biases at East Trout Lake in March with boreal QC compared to the B9 QC. While this increase in standard deviation is concerning, the availability of OCO-2 retrievals in the boreal forest in March remains insufficient for a representative sample of northern regions and is not likely to be included in seasonal studies of the boreal forest at this time.

The largest difference in the absolute values of monthly bias between boreal QC and B9 QC is 0.56 ppm in April at Fairbanks (see Fig. 17a). Boreal QC also results in a 0.18 ppm larger absolute bias than B9 QC in April at East Trout Lake. In July through October, boreal QC results in monthly biases at Fairbanks that are 0.1 to 0.4 ppm larger than with B9 QC, while in May and June there is no change in average monthly biases between the two QC methods. At East Trout Lake and Sodankylä boreal QC produces some monthly biases that are smaller, by up to 0.4 ppm, than B9 QC. Despite some increases in monthly biases with boreal QC relative to B9 QC, it is clear from Fig. 17 that the modifications in QC filters do not always result in larger monthly biases, and the effects should be weighed against the potential advantages of increasing passable retrievals and spatial coverage. We conclude that the differences between boreal QC and B9 QC are not likely to be a major source of seasonal variability in bias because seasonal dependence is observed with both QC methods in Figs. 10 and 11.

4.2 Seasonal variability in co2_ratio (XCO2 [2.06 µm]:XCO2 [1.61 µm])

The co2_ratio refers to the ratio of XCO2 retrieved by the 2.06 µm band to that retrieved by the 1.61 µm band. Recall that Wiscombe and Warren (1980) measured low and differing reflectance for snow in the 1.61 and 2.06 µm bands. Systematic departure from unity in the co2_ratio could result from spectroscopic inaccuracies in either band that are characteristic of the instrument or the line list used in the retrieval algorithm. Anomalous departures from unity in the co2_ratio can arise from low signal-to-noise ratio in either or both CO2 bands, which can be due to cloud and aerosol interference or the low reflectivity of snow- and ice-covered surfaces (Crisp et al.2012). Patchy snow cover or vegetation protruding through the snow may also cause discrepancies in signal intensity between the weak and strong CO2 bands as a result of variable surface reflectivity in the satellite field of view. In all months at midlatitudes and in May through October at high latitudes, terrestrial retrievals have a systematic departure from unity in the median co2_ratio, with the data approximately normally distributed around 1.012 (see Fig. 18). There is an even greater departure from unity in the co2_ratio for high-latitude retrievals in the winter months, namely November through April, with the data approximately normally distributed around 1.020. Figure 18 demonstrates that there is seasonal variation in the median and distribution of co2_ratio at latitudes north of 50 N that is not observed at latitudes from 10 to 50 N. This monthly difference in the distribution of retrieved co2_ratio at high latitudes may be a symptom of the effects of snow albedo, or it may be attributable to some other factor, but it warrants some attention because it may be associated with radiative transfer effects that contribute to negative biases in spring at the boreal forest sites.

4.3 Total column water vapor (tcwv), bias, and temperature dependence

The parameter tcwv refers to total column water vapor, which is calculated as the product of a scaling factor determined by the full physics retrieval and the a priori tcwv from the European Centre for Medium-Range Weather Forecasts (ECMWF). Atmospheric water vapor is expected to be seasonal, and the seasonality of tcwv at the three boreal forest sites is illustrated in the box plots in Fig. 19. Large amounts of atmospheric water vapor can suggest that there may be more cloud cover degrading the quality of both satellite-based and ground-based measurements. Even in the absence of clouds, water vapor is a strong infrared absorber in all three bands used by OCO-2, and water vapor is identified in Boesch et al. (2019) as the most important absorbing gas interfering with line fitting in OCO-2 retrievals. In selecting QC filters for the boreal QC, large negative biases (OCO-2 retrievals reporting lower values of XCO2 than NNG) were correlated to low tcwv, prompting the introduction of quality thresholds for tcwv in the boreal QC (see Fig. 20 and Table 1). Figure 21 shows the additional retrievals cut by the lower bound on tcwv at 3 kg m−2 (data left of the black dashed line) in the boreal QC which are not cut by other QC filters, and an overall downward trend persists in these removed data. One possible explanation is path shortening resulting from atmospheric scattering, which could result in retrieved spectral radiance that has failed to penetrate atmospheric layers near the surface. This would cause all retrieved gases to be underestimated so that total column water vapor and the total CO2 column are both erroneously low. However, Fig. 22 shows that while path shortening may explain some instances of negative biases and low tcwv, the relationship persists between a priori tcwv from ECMWF reanalysis and negative XCO2 biases. Because water vapor is a strong infrared absorber, it would be reasonable to expect retrieval errors when tcwv is high, but low atmospheric water vapor is also associated with cold fronts and snow cover. Figure 23 illustrates the relationship between tcwv and midtropospheric temperature (T700), at 700 hPa, in boreal forest coincident OCO-2 retrievals. There is a distinct maximum for tcwv at a given atmospheric temperature that is defined by the condensation temperature of water, and Fig. 23 shows that most of the retrievals with tcwv below 3 kg m−2 are also those with midtropospheric temperature (T700) below approximately 250 K. Therefore, it is reasonable to conclude that negative OCO-2 biases are also occurring at low temperatures, which is demonstrated by the correlations between ΔXCO2 and T700 in Fig. 24.

Figure 20Bias in coincident retrievals of XCO2 (ΔXCO2 OCO-2 – NNG), with no QC filtering against retrieved tcwv (total column water vapor), plotted as a density map. The lower bound placed on tcwv in the boreal QC (3 kg m−2) is shown as a black dashed vertical line.


Figure 21Bias in coincident retrievals of XCO2 (ΔXCO2 OCO-2 – NNG), with all boreal QC filtering except the bound on total column water vapor (tcwv) against retrieved tcwv, plotted as a density map. The lower bound placed on tcwv in the boreal QC (3 kg m−2) is shown as a black dashed vertical line.


Figure 22Bias in coincident retrievals of XCO2 (ΔXCO2 OCO-2 – NNG), with no QC filtering against a priori tcwv (total column water vapor), plotted as a density map. A priori tcwv is defined by ECMWF reanalysis data. The lower bound placed on tcwv in the boreal QC (3 kg m−2) is shown as a black dashed vertical line.


Figure 23Midtropospheric temperature (T700  temperature at 700 hPa) as a function of total column water vapor (tcwv) for all coincident retrievals without QC filters.


Figure 24Bias in coincident retrievals of XCO2 (ΔXCO2 OCO-2 – NNG) as a function of midtropospheric temperature (T700 temperature at 700 hPa) for all data without QC filters and each of the three QC methods presented in this paper.


4.4 Seasonal variability and temperature dependence in retrieved surface pressure bias

The dp_o2a and dp_sco2 variables are the residuals of retrieved and a priori surface pressure at the pointing locations of the O2A and strong CO2 bands, respectively. These two retrieval parameters were first included in B9 following the discovery of a pointing error that caused systematic inaccuracies in retrieved surface pressure (Kiel et al.2019). Before the release of ACOS B9, only a single dp variable (the difference between retrieved and a priori surface pressure from GEOS5-FP-IT) was used as a quality control and bias-correction parameter. In the analysis by Kiel et al. (2019), an additional parameterization of surface pressure residuals (dpfrac) was introduced for use in the OCO-2 B9 bc. The inclusion of dpfrac and dp in the OCO-2 bias correction is not the only reason that surface pressure residuals are important; accurate surface pressure measurements are essential for calculating XCO2, which is defined as the ratio of the total CO2 column to the total column of dry air. These terms are essential components of quality control and bias-correction methods because even small inaccuracies can translate to unacceptable errors in XCO2. While the effects of removing the dp term from the bias correction are considered in Sect. 3.5, it is probably inadvisable to remove this term entirely from bias correction or to loosen quality thresholds on dp variables without careful consideration of the impacts on XCO2. Furthermore, in attributing the causes and effects of trends in surface pressure residuals, there may be many competing factors. The seasonal box plots in Fig. 25 show that there is a seasonality in all four of the variants on surface pressure residuals (dpfrac, dp_o2a, dp_sco2, and dp) at the three boreal forest sites that are similar to the seasonality in bias-corrected ΔXCO2 (compare to Fig. 11). Similar seasonality may be a result of multiple seasonal parameters that equally effect dp and XCO2. Figures 12 and 13 show that both dpfrac and dp also exhibit linear dependence on T700, with greater linearity than the correlations between ΔXCO2 and T700, given either B9 QC or boreal QC in Fig. 24c and d. Not only is temperature clearly seasonal and correlated to other seasonal parameters, but rates and directions of atmospheric transport are also seasonal, and T700 has been found to link plumes in the free troposphere (Keppel-Aleks et al.2011). Kiel et al. (2019) show that systematic biases in dp are characterized by a positive trend close to the Equator and a negative trend at higher southern and northern latitudes, and we claim that this could also be a manifestation of temperature dependence.

Figure 25Seasonal box plots of dpfrac, dp, dp_o2a, and dp_sco2 at (a, b, c, and d) East Trout Lake, (e, f, g, and h) Sodankylä, and (i, j, k, and l) Fairbanks, given boreal QC filters.


5 Conclusions

Through ILS testing of EM27/SUN FTS used in Alaska, regular comparisons between multiple EM27/SUN FTS, and comparisons of EM27/SUN FTS with TCCON, we established the relative equivalence of EM27/SUN and TCCON observations as ground-based references for OCO-2 validation (see Sect. S1). With the application of multiplicative corrections, EM27/SUN FTS measurements in Fairbanks were compared to OCO-2 and yield similar magnitudes in OCO-2 biases as the TCCON sites considered in this study, with less seasonal variability in biases. More long-term observations with EM27/SUN FTS at different locations will be necessary to determine if the reduced seasonality in bias at Fairbanks is due to geography or instrumentation. While there were many challenges with data availability at high latitudes under the B8 QC, both the B9 QC and the boreal QC offer a two- to three-fold increase in passable retrievals from OCO-2 Lite files (OCO-2 Science Team/Michael Gunson, Annmarie Eldering2018) without major sacrifices in data quality. Total average biases for all sites, viewing modes, and quality control methods were within ±1 ppm (see Fig. 9). In particular, the boreal QC allows for nearly twice as many terrestrial OCO-2 retrievals north of 50 N latitude in the months of May, August, and September relative to the B9 QC, while resulting in no distinct increase in the total average bias and less than 0.3 ppm increase in the total standard deviation of biases for coincident retrievals at boreal forest sites (see Sect. 3.3.2 and Fig. 9). With the exception of an anomalously large negative April bias at Sodankylä, seasonal variability in monthly biases at these three boreal forest sites is mostly confined between −1 and +1 ppm, which equates to the 2 ppm maximum monthly standard deviation of biases in June at Sodankylä (when data availability is most abundant; see Fig. 10). Even a slight seasonal trend can significantly impact the analysis of seasonal cycle parameters, so it is important that valid methods for reducing the seasonal dependence in OCO-2 biases are identified. The OCO-2 bias correction seems to introduce some seasonality in OCO-2 bias through the inclusion of a dp (the difference between retrieved and a priori surface pressures) bias-correction term. We propose two alternative OCO-2 bias corrections in Eqs. (6) and (7) that correct for temperature dependence in dpfrac and dp, respectively, based on linear regressions shown in Figs. 12 and 13. It may be important to note that these alternative bias corrections are specifically tailored to high-latitude OCO-2 B9 retrievals over land with boreal QC. Of these two alternative bias corrections, the B8 abc in Eq. (7) appears to be more effective in reducing seasonal variability without substantial increases in average biases in any viewing modes or increases in monthly standard deviations in biases at boreal forest sites. The choice of B9 QC or boreal QC was not found to be a clear source of seasonal dependence in monthly OCO-2 bias in the boreal forest (see Sect. 4.1) nor were the effects of proximity bias or air mass dependence found to be important contributors to seasonal variability in biases (see Appendices B and C). Several sounding-retrieval parameters that have been used as QC filters were found to exhibit seasonal variability at these boreal forest sites, including the ratio of single-band retrievals of CO2 (co2_ratio), total column water vapor (tcwv), and the differences between retrieved and a priori surface pressures (dp, dp_o2a, and dp_sco2). These parameters may contribute to seasonal variability in biases by impacting data selection in the quality-filtering process, or they may be indicative of seasonal behavior at high latitudes that is not fully addressed in the retrieval algorithm. In particular, low tropospheric temperatures, or some other parameter that may be correlated to temperature, appears to be one of the primary contributors to seasonal dependence in OCO-2 bias at high latitudes. While the specific choices for QC parameters in the boreal QC method, proposed here, may still be a subject for consideration and debate, this analysis has shown that it is possible to modify quality controls tailored to a specific region and substantially increase the quantity of usable OCO-2 retrievals with only minor sacrifices in data quality. Furthermore, boreal QC coupled with an alternative bias correction that accounts for temperature dependence (Eq. 7) may yield sufficiently stable results for application in preliminary studies of boreal forest seasonal cycles of XCO2 across longitudes. It is also possible that improvements in spectroscopic modeling in future versions of the ACOS-retrieval algorithm would reduce or remove temperature dependence in surface pressure bias.

Appendix A: Quality control histograms

Following methods used for determining B8 QC and B9 QC parameters and thresholds, described in more detail by O'Dell et al. (2018), the filters used in the boreal QC are determined by qualitative assessment of the plots in Fig. A1. For these plots, only OCO-2 retrievals coincident to one of the three boreal forest ground sites, namely East Trout Lake, Sodankylä, and Fairbanks, are considered with the corresponding NNG measurements representing truth (see Sect. 2 for coincidence criteria and site locations). The plots include a histogram that shows the number of retrievals within a certain range of values (bin) for the QC filtering parameter in question (gray bars), the average ΔXCO2 in each bin with (black circles) and without (cyan circles) the standard bias correction, and the standard deviation in retrieved XCO2 in each bin (blue diamonds). We began by assessing the parameters that are used in B9 but with thresholds extensively loosened or removed wherever reasonably possible. Next, a number of additional retrieval parameters were assessed to see if their inclusion could reduce average bias and standard deviation in XCO2 by eliminating small numbers of outliers. After adding these parameters (deltaT, solar_zenith, xco2_uncertainty, and tcwv), the loose bounds on the remaining QC filtering parameters brought from B9 were reassessed and incrementally tightened.

Figure A1Histograms of retrieval parameters used in the boreal QC and considering only retrievals coincident to the three boreal forest sites in this paper. These plots are in the same style as those presented by O'Dell et al. (2018) but with NNG measurements as truth. Biases in XCO2 with and without the standard B9 bias correction, and the standard deviation in OCO-2-retrieved XCO2, are shown for each bin (in units of ppm). Implemented boreal QC thresholds for each parameter are represented by the black dashed vertical lines.


Appendix B: Potential contribution to seasonal bias from proximity bias

Another suspected source of bias, which may or may not have a seasonal component, is proximity bias. Assuming that the ground-based reference measurements are representative of the full coincidence region depends on a certain amount of regional homogeneity in CO2 columns, and spatial CO2 fields may not meet this criteria during seasonal transitions in spring and autumn. One method applied to the Greenhouse gases Observing Satellite (GOSAT) data by Wunch et al. (2011b), and based on modeling results by Keppel-Aleks et al. (2011), will isolate satellite soundings in the coincidence region that are likely to represent the same atmospheric plume observed at the ground site by choosing soundings with a midtropospheric temperature, at 700 hPa (T700), close to that above the ground site. Figure B1 shows monthly bias distributions for the full set of coincident retrievals alongside the monthly bias distributions of a subset of coincident retrievals that have retrieved T700 within ±1 K of T700 in the daily NCEP reanalysis results for the ground site. Results in Fig. B1 suggest that using T700 to screen coincident retrievals yields little to no observable improvements in seasonality of biases overall. At East Trout Lake the T700 coincidence criteria do reduce the absolute monthly bias in March and April with boreal QC, but in a number of other cases T700 screening results in a slight increase in absolute monthly biases for the spring and autumn months.

Figure B1Monthly bias distributions for coincident OCO-2 retrievals filtered with boreal QC by comparing the full set of coincident retrievals (black) alongside a subset with the temperature at 700 hPa (T700) in the OCO-2 retrieval equal to ±1 K of T700 from NCEP reanalysis at the ground site (red).


Appendix C: Potential contribution to seasonal bias from ground-based instrument air mass dependence

Seasonal biases in NNG data may arise from the air mass dependence of ground-based retrievals, particularly in high-latitude regions (Wunch et al.2011a). However, in Fig. C1 we define an alternative ground-based reference, named SZAG, that is the daily average of retrieved XCO2, with an apparent sza between 65 and 70, and at the two TCCON sites there is no observable difference in the monthly bias distributions, regardless of whether SZAG or NNG is used. At Fairbanks, the change in ground-based reference to SZAG results in a positive shift in almost all ΔXCO2 values, which for some months corresponds to a reduction in the size of biases and for other months corresponds to an increase in the size of biases. The results at Fairbanks may suggest that sza dependence in EM27/SUN observations requires further study. Seasonal variability in biases persists at all three sites and is largely the same regardless of whether the ground-based reference is NNG or SZAG. These results suggest that the air mass dependence of the ground-based instrument is not likely to be a dominant source of seasonal variability in OCO-2 biases.

Figure C1Monthly bias distributions for coincident OCO-2 retrievals filtered with boreal QC and referenced to NNG (black) alongside monthly bias distributions for coincident OCO-2 retrievals with boreal QC and referenced to SZAG (red).


Data availability

OCO-2 data and quality control parameters used here are taken from OCO-2 Lite files (version 9, “B9”). OCO-2 Lite files are produced by the NASA OCO-2 project at the Jet Propulsion Laboratory, California Institute of Technology, and obtained from the NASA Goddard Earth Science Data and Information Services Center (GES DISC;, last access: 17 March 2019). TCCON data are available from the TCCON data archive, hosted by CaltechDATA, at: (last access: 2 January 2019, CaltechDATA2020). Relevant EM27/SUN retrievals of XCO2 from Fairbanks, Alaska, including essential components for applying averaging kernel corrections, are available in the Supplement. Other data are available on request and are in the process of being placed in a long-term repository. All data sets are also cited individually in Sect. 2, under “Sites and data sources”.


The supplement related to this article is available online at:

Author contributions

NJ composed this paper and conducted the analysis under the supervision of WRS. DW contributed data from the East Trout Lake TCCON site and provided guidance on and thorough evaluations of the methods and interpretations in the paper. CWO offered guidance and instructions on methods of evaluating OCO-2 satellite retrieval parameters, bias corrections, and quality controls. GBO organized targeted satellite overpasses with OCO-2 over Fairbanks and generally oversaw the satellite-validation efforts for OCO-2. FH, TB, QT, MF, MKD, and HAP all contributed to data collection with the EM27/SUNs in Fairbanks, including instrument evaluations, maintenance, and establishing long-term operations in Fairbanks. HAP also acted as a local host during calibration to the Caltech TCCON. RK and PH operated the TCCON station at Sodankylä, Finland, and provided data and insights into the unique aspects of high-latitude ground-based measurements of XCO2.

Competing interests

The authors declare that they have no conflict of interest.


Thanks to Paul Wennberg, Coleen Roehl, and their colleagues at Caltech for the operation of the Caltech TCCON and support during side-by-side observations with the Los Alamos National Laboratory (LANL) EM27/SUN. The Simpson Lab at UAF acknowledges the Alaska Space Grant Graduate Fellowship and OCO Science Team Grant. The Karlsruhe Institute of Technology (KIT) acknowledges support from the ACROSS research infrastructure of the Helmholtz Association of German Research Centres (HGF) and from the Helmholtz Association in the framework of the Modular Observation Solutions for Earth Systems (MOSES) system. LANL acknowledges the NASA CMS and LDRD programs for their support.

Financial support

This research has been supported by NASA (grant no. NNH17ZDA001N-OCO2).

Review statement

This paper was edited by Joanna Joiner and reviewed by two anonymous referees.


Barlow, J. M., Palmer, P. I., Bruhwiler, L. M., and Tans, P.: Analysis of CO2 mole fraction data: first evidence of large-scale changes in CO2 uptake at high northern latitudes, Atmos. Chem. Phys., 15, 13739–13758,, 2015. a, b, c, d, e

Barnes, E. A., Parazoo, N., Orbe, C., and Denning, A. S.: Isentropic transport and the seasonal cycle amplitude of CO2, J. Geophys. Res.-Atmos., 121, 8106–8124,, 2016. a

Boesch, H., Baker, D., Connor, B., Crisp, D., and Miller, C.: Global Characterization of CO2 Column Retrievals from Shortwave-Infrared Satellite Observations of the Orbiting Carbon Observatory-2 Mission, Remote Sens.-Basel, 3, 270–304,, 2011. a, b

Boesch, H., Brown, L., Castano, R., Christi, M., Bonnor, C., Crisp, D., Eldering, A., Fisher, B., Frankenberg, C., Gunson, M., Granat, R., McDuffie, J., Miller, C., Natraj, V., O'Brien, D., O'Dell, C., Osterman, G., Oyafuso, F., Payne, V., Polonski, I., Smyth, M., Spurr, R., Thompson, D., and Toon, G.: Orbiting Carbon Observatory-2 & 3 (OCO-2 & OCO-3) Level 2 Full Physics Retrieval Algorithm Theoretical Basis, available at: (last access: 17 October 2019), 2019. a

Bradshaw, C. J. A. and Warkentin, I. G.: Global estimates of boreal forest carbon stocks and flux, Global Planet. Change, 128, 24–30,, 2015. a

CaltechDATA: Total Carbon Column Observing Network, available at:, (last access: 2 January 2019), 2020. a

Connor, B. J., Boesch, H., Toon, G. C., Sen, B., Miller, C., and Crisp, D.: Orbiting Carbon Observatory: Inverse method and prospective error analysis, J. Geophys. Res., 113, D05305,, 2008. a

Crisp, D., Fisher, B. M., O'Dell, C., Frankenberg, C., Basilio, R., Bösch, H., Brown, L. R., Castano, R., Connor, B., Deutscher, N. M., Eldering, A., Griffith, D., Gunson, M., Kuze, A., Mandrake, L., McDuffie, J., Messerschmidt, J., Miller, C. E., Morino, I., Natraj, V., Notholt, J., O'Brien, D. M., Oyafuso, F., Polonsky, I., Robinson, J., Salawitch, R., Sherlock, V., Smyth, M., Suto, H., Taylor, T. E., Thompson, D. R., Wennberg, P. O., Wunch, D., and Yung, Y. L.: The ACOS CO2 retrieval algorithm – Part II: Global XCO2 data characterization, Atmos. Meas. Tech., 5, 687–707,, 2012. a, b

Euskirchen, E. S., Edgar, C. W., Bret-Harte, M. S., Kade, A., Zimov, N., and Zimov, S.: Interannual and Seasonal Patterns of Carbon Dioxide, Water, and Energy Fluxes From Ecotonal and Thermokarst-Impacted Ecosystems on Carbon-Rich Permafrost Soils in Northeastern Siberia, J. Geophys. Res.-Biogeosci., 122, 2651–2668,, 2017. a

Frey, M., Sha, M. K., Hase, F., Kiel, M., Blumenstock, T., Harig, R., Surawicz, G., Deutscher, N. M., Shiomi, K., Franklin, J. E., Bösch, H., Chen, J., Grutter, M., Ohyama, H., Sun, Y., Butz, A., Mengistu Tsidu, G., Ene, D., Wunch, D., Cao, Z., Garcia, O., Ramonet, M., Vogel, F., and Orphal, J.: Building the COllaborative Carbon Column Observing Network (COCCON): long-term stability and ensemble performance of the EM27/SUN Fourier transform spectrometer, Atmos. Meas. Tech., 12, 1513–1530,, 2019. a

Gauthier, S., Bernier, P., Kuuluvainen, T., Shvidenko, A. Z., and Schepaschenko, D. G.: Boreal forest health and global change, Science, 349, 819–822,, 2015. a

Gisi, M., Hase, F., Dohe, S., Blumenstock, T., Simon, A., and Keens, A.: XCO2-measurements with a tabletop FTS using solar absorption spectroscopy, Atmos. Meas. Tech., 5, 2969–2980,, 2012. a

Graven, H. D., Keeling, R. F., Piper, S. C., Patra, P. K., Stephens, B. B., Wofsy, S. C., Welp, L. R., Sweeney, C., Tans, P. P., Kelley, J. J., Daube, B. C., Kort, E. A., Santoni, G. W., and Bent, J. D.: Enhanced Seasonal Exchange of CO2 by Northern Ecosystems Since 1960, Science, 341, 1085–1089,, 2013. a, b, c, d

Hase, F., Frey, M., Kiel, M., Blumenstock, T., Harig, R., Keens, A., and Orphal, J.: Addition of a channel for XCO observations to a portable FTIR spectrometer for greenhouse gas measurements, Atmos. Meas. Tech., 9, 2303–2313,, 2016. a

Hedelius, J. and Wennberg, P.: EM27/SUN GGG interferogram processing suite Version 2014, Hosted on CaltechDATA data archive, California Institute of Technology, Pasadena, California, USA,, 2017. a, b

Hedelius, J. K., Viatte, C., Wunch, D., Roehl, C. M., Toon, G. C., Chen, J., Jones, T., Wofsy, S. C., Franklin, J. E., Parker, H., Dubey, M. K., and Wennberg, P. O.: Assessment of errors and biases in retrievals of XCO2, XCH4, XCO, and XN2O from a 0.5 cm−1 resolution solar-viewing spectrometer, Atmos. Meas. Tech., 9, 3527–3546,, 2016. a, b, c, d, e, f, g

Hedelius, J. K., Parker, H., Wunch, D., Roehl, C. M., Viatte, C., Newman, S., Toon, G. C., Podolske, J. R., Hillyard, P. W., Iraci, L. T., Dubey, M. K., and Wennberg, P. O.: Intercomparability of XCO2 and XCH4 from the United States TCCON sites, Atmos. Meas. Tech., 10, 1481–1493,, 2017. a, b, c

Keppel-Aleks, G., Wennberg, P. O., and Schneider, T.: Sources of variations in total column carbon dioxide, Atmos. Chem. Phys., 11, 3581–3593,, 2011. a, b

Kiel, M., O'Dell, C. W., Fisher, B., Eldering, A., Nassar, R., MacDonald, C. G., and Wennberg, P. O.: How bias correction goes wrong: measurement of XCO2 affected by erroneous surface pressure estimates, Atmos. Meas. Tech., 12, 2241–2259,, 2019 a, b, c, d, e, f, g

Kivi, R. and Heikkinen, P.: Fourier transform spectrometer measurements of column CO2 at Sodankylä, Finland, Geosci. Instrum. Method. Data Syst., 5, 271–279,, 2016. a

Kivi, R., Heikkinen, P., and Kyro, E.: TCCON data from Sodankyla, Finland, Release GGG2014R0. TCCON data archive, hosted by CaltechDATA, California Institute of Technology, Pasadena, CA, U.S.A.,, 2017. a, b, c

Lindqvist, H., O'Dell, C. W., Basu, S., Boesch, H., Chevallier, F., Deutscher, N., Feng, L., Fisher, B., Hase, F., Inoue, M., Kivi, R., Morino, I., Palmer, P. I., Parker, R., Schneider, M., Sussmann, R., and Yoshida, Y.: Does GOSAT capture the true seasonal cycle of carbon dioxide?, Atmos. Chem. Phys., 15, 13023–13040,, 2015. a

OCO-2 Science Team/Michael Gunson, Annmarie Eldering: OCO-2 Level 2 bias-corrected XCO2 and other select fields from the full-physics retrieval aggregated as daily files, Retrospective processing V9r, Greenbelt, MD, USA, Goddard Earth Sciences Data and Information Services Center (GES DISC),, 2018. a, b, c

O'Dell, C. W., Connor, B., Bösch, H., O'Brien, D., Frankenberg, C., Castano, R., Christi, M., Eldering, D., Fisher, B., Gunson, M., McDuffie, J., Miller, C. E., Natraj, V., Oyafuso, F., Polonsky, I., Smyth, M., Taylor, T., Toon, G. C., Wennberg, P. O., and Wunch, D.: The ACOS CO2 retrieval algorithm – Part 1: Description and validation against synthetic observations, Atmos. Meas. Tech., 5, 99–121,, 2012. a

O'Dell, C. W., Eldering, A., Wennberg, P. O., Crisp, D., Gunson, M. R., Fisher, B., Frankenberg, C., Kiel, M., Lindqvist, H., Mandrake, L., Merrelli, A., Natraj, V., Nelson, R. R., Osterman, G. B., Payne, V. H., Taylor, T. E., Wunch, D., Drouin, B. J., Oyafuso, F., Chang, A., McDuffie, J., Smyth, M., Baker, D. F., Basu, S., Chevallier, F., Crowell, S. M. R., Feng, L., Palmer, P. I., Dubey, M., García, O. E., Griffith, D. W. T., Hase, F., Iraci, L. T., Kivi, R., Morino, I., Notholt, J., Ohyama, H., Petri, C., Roehl, C. M., Sha, M. K., Strong, K., Sussmann, R., Te, Y., Uchino, O., and Velazco, V. A.: Improved retrievals of carbon dioxide from Orbiting Carbon Observatory-2 with the version 8 ACOS algorithm, Atmos. Meas. Tech., 11, 6539–6576,, 2018. a, b, c, d, e, f, g

Osterman, G. B., Eldering, A., Avis, C., Chafin, B., O'Dell, C., Frankenberg, C., Fisher, B., Mandrake, L., Wunch, D., Granat, R., and Crisp, D.: Data Product User's Guide, Operational L1 and L2 Data Versions 8 and Lite File Version 9, available at: (last access: 2 October 2019), 2018. a, b, c, d, e

Pan, Y., Birdsey, R. A., Fang, J., Houghton, R., Kauppi, P. E., Kurz, W. A., Phillips, O. L., Shvidenko, A., Lewis, S. L., Canadell, J. G., Ciais, P., Jackson, R. B., Pacala, S. W., McGuire, A. D., Piao, S., Rautiainen, A., Sitch, S., and Hayes, D.: A Large and Persistent Carbon Sink in the World's Forests, Science, 333, 988–993,, 2011. a, b

Robinson, D. A., Estilow, T. W., and NOAA CDR Program: NOAA Climate Data Record (CDR) of Northern Hemisphere (NH) Snow Cover Extent (SCE), Version 1. NOAA National Centers for Environmental Information,, 2012. a, b

Sha, M. K., De Mazière, M., Notholt, J., Blumenstock, T., Chen, H., Dehn, A., Griffith, D. W. T., Hase, F., Heikkinen, P., Hermans, C., Hoffmann, A., Huebner, M., Jones, N., Kivi, R., Langerock, B., Petri, C., Scolas, F., Tu, Q., and Weidmann, D.: Intercomparison of low and high resolution infrared spectrometers for ground-based solar remote sensing measurements of total column concentrations of CO2, CH4 and CO, Atmos. Meas. Tech. Discuss.,, in review, 2019. a

Tans, P. P., Fung, I. Y., and Takahashi, T.: Observational Constraints on the Global Atmospheric CO2 Budget, Science, New Series, 247, 1431–1438, 1990. a

Taylor, T. E., O'Dell, C. W., Frankenberg, C., Partain, P. T., Cronk, H. Q., Savtchenko, A., Nelson, R. R., Rosenthal, E. J., Chang, A. Y., Fisher, B., Osterman, G. B., Pollock, R. H., Crisp, D., Eldering, A., and Gunson, M. R.: Orbiting Carbon Observatory-2 (OCO-2) cloud screening algorithms: validation against collocated MODIS and CALIOP data, Atmos. Meas. Tech., 9, 973–989,, 2016. a

Velazco, V. A., Deutscher, N. M., Morino, I., Uchino, O., Bukosa, B., Ajiro, M., Kamei, A., Jones, N. B., Paton-Walsh, C., and Griffith, D. W. T.: Satellite and ground-based measurements of XCO2 in a remote semiarid region of Australia, Earth Syst. Sci. Data, 11, 935–946,, 2019. a, b, c

Wiscombe, W. J. and Warren, S. G.: A model for the spectral albedo of snow. I: Pure snow, J. Atmos. Sci., 37, 2712–2733, 1980. a, b, c

Wu, L., Hasekamp, O., Hu, H., Landgraf, J., Butz, A., aan de Brugh, J., Aben, I., Pollard, D. F., Griffith, D. W. T., Feist, D. G., Koshelev, D., Hase, F., Toon, G. C., Ohyama, H., Morino, I., Notholt, J., Shiomi, K., Iraci, L., Schneider, M., de Mazière, M., Sussmann, R., Kivi, R., Warneke, T., Goo, T.-Y., and Té, Y.: Carbon dioxide retrieval from OCO-2 satellite observations using the RemoTeC algorithm and validation with TCCON measurements, Atmos. Meas. Tech., 11, 3111–3130,, 2018. a

Wunch, D., Toon, G. C., Blavier, J. F. L., Washenfelder, R. A., Notholt, J., Connor, B. J., Griffith, D. W. T., Sherlock, V., and Wennberg, P. O.: The Total Carbon Column Observing Network, Philos. T. R. Soc. A., 369, 2087–2112,, 2011a. a, b

Wunch, D., Wennberg, P. O., Toon, G. C., Connor, B. J., Fisher, B., Osterman, G. B., Frankenberg, C., Mandrake, L., O'Dell, C., Ahonen, P., Biraud, S. C., Castano, R., Cressie, N., Crisp, D., Deutscher, N. M., Eldering, A., Fisher, M. L., Griffith, D. W. T., Gunson, M., Heikkinen, P., Keppel-Aleks, G., Kyro, E., Lindenmaier, R., Macatangay, R., Mendonca, J., Messerschmidt, J., Miller, C. E., Morino, I., Notholt, J., A, F., Oyafuso, Rettinger, M., Robinson, J., Roehl, C. M., Salawitch, R. J., Sherlock, V., Strong, K., Sussmann, R., Tanaka, T., Thompson, D. R., Uchino, O., Warneke, T., and Wofsy, S. C.: A method for evaluating bias in global measurements of CO2 total columns from space, Atmos. Chem. Phys., 11, 12317–12337,, 2011b.  a

Wunch, D., Wennberg, P. O., Messerschmidt, J., Parazoo, N. C., Toon, G. C., Deutscher, N. M., Keppel-Aleks, G., Roehl, C. M., Randerson, J. T., Warneke, T., and Notholt, J.: The covariation of Northern Hemisphere summertime CO2 with surface temperature in boreal regions, Atmos. Chem. Phys., 13, 9447–9459,, 2013. a

Wunch, D., Toon, G. C., Sherlock, V., Deutscher, N. M., Liu, C., Feist, D. G., and Wennberg, P. O.: The Total Carbon Column Observing Network's GGG2014 Data Version, 43,, 2015. a, b

Wunch, D., Mendonca, J., Colebatch, O., Allen, N., Blavier, J.-F. L., Springett, S., Worthy, D., Kessler, R., and Strong, K.: TCCON data from East Trout Lake, Canada, Release GGG2014R0. TCCON data archive, hosted by CaltechDATA, California Institute of Technology, Pasadena, CA, USA,, 2017a. a, b, c

Wunch, D., Wennberg, P. O., Osterman, G., Fisher, B., Naylor, B., Roehl, C. M., O'Dell, C., Mandrake, L., Viatte, C., Kiel, M., Griffith, D. W. T., Deutscher, N. M., Velazco, V. A., Notholt, J., Warneke, T., Petri, C., De Maziere, M., Sha, V., Sussmann, R., Rettinger, M., Pollard, D., Robinson, J., Morino, I., Uchino, O., Hase, F., Blumenstock, T., Feist, D. G., Arnold, S. G., Strong, K., Mendonca, J., Kivi, R., Heikkinen, P., Iraci, L., Podolske, J., Hillyard, P. W., Kawakami, S., Dubey, M. K., Parker, H. A., Sepulveda, E., García, O. E., Te, Y., Jeseck, P., Gunson, M. R., Crisp, D., and Eldering, A.: Comparisons of the Orbiting Carbon Observatory-2 (OCO-2) XCO2 measurements with TCCON, Atmos. Meas. Tech., 10, 2209–2238,, 2017b. a, b, c

Yin, Y., Ciais, P., Chevallier, F., Li, W., Bastos, A., Piao, S., Wang, T., and Liu, H.: Changes in the Response of the Northern Hemisphere Carbon Uptake to Temperature Over the Last Three Decades, Geophys. Res. Lett., 45, 4371–4380,, 2018. a, b

Short summary
The boreal forest is the largest seasonally varying biospheric CO2-exchange region on Earth. This region is also undergoing amplified climate warming, leading to concerns about the potential for altered regional carbon exchange. Satellite missions, such as the Orbiting Carbon Observatory-2 (OCO-2) project, can measure CO2 abundance over the boreal forest but need validation for the assurance of accuracy. Therefore, we carried out a ground-based validation of OCO-2 CO2 data at three locations.