Quality controls, bias, and seasonality of CO2 columns in the boreal forest with Orbiting Carbon Observatory-2, Total Carbon Column Observing Network, and EM27/SUN measurements

Seasonal CO2 exchange in the boreal forest plays an important role in the global carbon budget and in driving interannual variability in seasonal cycles of atmospheric CO2. Satellite-based observations from polar-orbiting satellites like the Orbiting Carbon Observatory-2 (OCO-2) offer an opportunity to characterize boreal forest seasonal cycles across longitudes with a spatially and temporally rich data set, but data quality controls and biases still require vetting at high latitudes. With the


Introduction
The boreal forest, or taiga, is the largest terrestrial biome on Earth. It spans the sub-Arctic regions of Europe, Asia, and North America between 50 and 70° N latitude, represents an important and influential component of the global carbon cycle, and is a principal driver of the atmospheric carbon dioxide (CO2) seasonal cycle. Accurate accounting for seasonal CO2 exchange in boreal forest regions is an essential component in quantifying the global carbon budget and predicting future climate scenarios (Tans et al., 1990; Pan et al., 2011; Graven et al., 2013; Barlow et al., 2015; Bradshaw and Warkentin, 2015; Gauthier et al., 2015; Yin et al., 2018). Studies by Graven et al. (2013) and Barlow et al. (2015) used a combination of atmospheric modeling, aircraft observations, and a network of ground-based in situ observations to investigate seasonal carbon exchange in the boreal forest. Both studies found that the boreal forest plays an important role in global atmospheric CO2 concentrations, significantly influencing in situ observations of CO2 in the tropics (Mauna Loa). Multiple studies have shown that there is a latitude-dependent trend in the seasonal amplitude of atmospheric CO2, with increased seasonal uptake of CO2 in boreal forest regions (Graven et al., 2013; Wunch et al., 2013; Barlow et al., 2015; Lindqvist et al., 2015). Furthermore, the studies by Graven et al. (2013) and Barlow et al. (2015) found that the trend in seasonal cycle amplitudes of CO2 with respect to latitude nearly doubled between 1960 and 2011, suggesting that seasonal changes in the boreal forest are growing at an accelerated rate relative to lower-latitude regions. While some studies have reported rapid changes in seasonal carbon exchange in the boreal forest and proposed that this is a dominant driver in the global carbon budget, Barnes et al. (2016) suggest that it is actually the temperate forest between 30 and 50° N that is the dominant driver. Yin et al. (2018) found evidence of limits on previous assumptions about the relationship between seasonal cycle amplitude and spring and autumn temperatures in northern high latitudes, emphasizing the need for continued data-driven model validation for these regions. It remains difficult to reconcile conflicting claims about contributions to the global carbon budget without a spatially and temporally rich set of measurements for high-latitude regions, and data availability in the boreal forest remains a major impediment to accurately quantifying uptake in the world's largest terrestrial biome (Pan et al., 2011; Barlow et al., 2015; Euskirchen et al., 2017). Therefore, methods of expanding observational coverage through improved satellite observations at high latitudes are essential for clarifying our understanding of global CO2 exchange.
Satellite-based observations of atmospheric CO2 columns offer a more holistic view of global CO2 dynamics by expanding spatial coverage. NASA's Orbiting Carbon Observatory-2 (OCO-2) was launched in July 2014, with CO2 column retrievals available from September 2014 to the present (OCO-2 Science Team/Michael Gunson, Annmarie Eldering, 2018). Satellite-based observations from OCO-2 consist of solar reflectance in three spectral windows centered at 0.76, 1.61, and 2.06 µm, referred to as the O2 A band, weak CO2 band, and strong CO2 band, respectively. The Atmospheric Carbon Observations from Space (ACOS) full physics retrieval algorithm (currently on version 9 or "ACOS B9"), described by O'Dell et al. (2012, 2018) and Connor et al. (2008), fits absorption features in these windows and incorporates additional meteorology and model data to retrieve column-averaged dry air mole fractions of atmospheric CO2 (XCO2) along with a variety of other parameters, such as aerosol optical depth, surface albedo, surface pressure, and total column water vapor. A number of parameters in the full physics retrievals are used to designate thresholds for postprocessing quality control filters. OCO-2 is polar orbiting, so overpasses are more frequent at high latitudes than at midlatitudes, presenting a valuable opportunity to amass an extensive archive of CO2 observations over the boreal forest. However, before OCO-2 can be used to evaluate CO2 seasonality for the boreal forest, these data need to be validated in high-latitude regions. Quality control filters implemented in previous versions of the ACOS algorithm, like version 8 (ACOS B8) discussed by O'Dell et al. (2018), removed the majority of high-latitude observations, and as a result, OCO-2 high-latitude data have been underutilized. Validation of OCO-2 satellite-based retrievals at high latitudes has also been limited by the relatively few dedicated ground-based monitoring sites at high latitudes (Wunch et al., 2017b).
In addition to the limited availability of ground-based validation data, there are a number of other challenges to passive satellite measurements at high latitudes. The Sun stays low in the sky at high latitudes; even in summer, when the Sun travels a long azimuthal path, it does not reach the same solar elevations as at lower latitudes. Low solar elevation corresponds to a high solar zenith angle (sza) and high air mass, meaning that sunlight travels a greater distance through the atmosphere before reaching the instrument. High air masses can cause absorption spectra to become saturated at line center, making column retrievals more sensitive to the line wings and, thus, to the shape of the absorption line. Spectroscopic uncertainties tend to be exacerbated at higher air masses, and the relative impacts of radiative transfer effects from atmospheric aerosols on satellite retrievals are also increased at higher air masses. In particular, slant-path aerosol optical depths (aod) are larger and scattering angles are smaller, which increases the fractional contribution of aerosol scatter to the total radiance detected by the satellite. Air mass dependence in passive column measurements continues to be an area of ongoing research in retrieval algorithms (Wunch et al., 2015), and high air mass in winter is one of the primary reasons for halting observations at high-latitude sites in November through February. Aside from sunlight, climates and ecosystems at high latitudes are highly seasonally dependent, so there are a number of seasonal parameters that may produce time-dependent biases at high latitudes if they are not handled properly in retrieval algorithms. In particular, Wu et al. (2018) noted time-dependent biases at Sodankylä with the RemoTeC-OCO-2 retrieval algorithm. Wunch et al. (2017b) suggest that there are not enough passable retrievals from ACOS B7 to identify seasonal bias at high latitudes. Snow- and ice-covered surfaces are known to introduce extensive challenges in passive retrievals of XCO2 due to low surface albedo in the weak (1.61 µm) and strong (2.06 µm) CO2 bands used by OCO-2 (Wiscombe and Warren, 1980), and reflection anisotropy effects can further complicate retrievals over snow (Boesch et al., 2011; Crisp et al., 2012). Because snow cover is also seasonal and follows the solar cycle, it may be difficult to isolate the causes of seasonal bias at high latitudes. As a result, a certain amount of seasonal dependence may be inevitable, but we still endeavor to minimize it with careful attention to quality control methods.
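The relationship between solar zenith angle and air mass noted above can be illustrated with a minimal sketch. It assumes the plane-parallel approximation, air mass ≈ 1/cos(sza), which is not taken from this paper and which overestimates the true air mass at very large sza because it ignores Earth's curvature and refraction. The function name is hypothetical.

```python
import math

def plane_parallel_air_mass(sza_deg):
    """Relative air mass under the plane-parallel approximation, 1/cos(sza).

    Assumption for illustration only: a flat, non-refracting atmosphere.
    The approximation degrades as sza approaches 90 degrees.
    """
    if not 0.0 <= sza_deg < 90.0:
        raise ValueError("sza must be in [0, 90) degrees")
    return 1.0 / math.cos(math.radians(sza_deg))

# Air mass grows slowly at first and then steeply as the Sun nears the horizon.
for sza in (30.0, 60.0, 75.0, 82.0):
    print(f"sza = {sza:4.1f} deg -> air mass ~ {plane_parallel_air_mass(sza):.2f}")
```

Under this approximation the optical path roughly doubles at sza = 60° and exceeds seven times the vertical path by sza = 82°, which gives a sense of why line saturation and aerosol effects become more important for high-latitude soundings.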
Ground-based column measurements from solar-viewing spectrometers complement passive satellite observations because both use infrared absorption spectroscopy, with the Sun as a radiation source, and observe a full atmospheric column abundance. The Total Carbon Column Observing Network (TCCON) is a ground-based network that uses solar-viewing, high-resolution infrared spectrometers to retrieve XCO2 (Wunch et al., 2011a). TCCON is the reference measurement for OCO-2 and is the primary source of validation data. In addition to comparing OCO-2 to TCCON, this paper compares OCO-2 observations to ground-based observations from an EM27/SUN Fourier transform infrared spectrometer (EM27/SUN FTS) operated in Fairbanks, Alaska. The EM27/SUN FTS was developed by the Karlsruhe Institute of Technology (KIT) in collaboration with Bruker Optics (Gisi et al., 2012; Hase et al., 2016) to be a compact, mobile solar-viewing FTS designed for field deployment. The EM27/SUN spectrometers have a resolution of 0.5 cm⁻¹, similar to that of OCO-2 (~0.3 cm⁻¹), while the Bruker IFS 125HR used by TCCON has a much higher resolution of ~0.02 cm⁻¹. All three instruments record a solar infrared spectrum that can be used to retrieve XCO2. Several recent studies have compared EM27/SUN observations to TCCON (Hedelius et al., 2016, 2017; Velazco et al., 2018; Frey et al., 2019). This paper uses retrieval methods for EM27/SUN XCO2 similar to those of Hedelius et al. (2016), Hedelius et al. (2017), and Velazco et al. (2018) by implementing the GGG2014 retrieval algorithm coupled with the EM27/SUN GGG interferogram-processing suite (EGI; Hedelius and Wennberg, 2017). Hedelius et al. (2016) observed a 0.03 % ± 0.08 % (~0.12 ± 0.32 ppm) offset when comparing four EM27/SUN spectrometers to co-located observations at the Caltech TCCON site. Hedelius et al. (2017) found some EM27/SUN biases to TCCON as large as 0.14 % (~0.56 ppm) but also found statistically significant variability amongst TCCON sites of up to 0.3 ppm, suggesting that the site-to-site biases amongst TCCON sites may be of a similar size to the biases between EM27/SUN FTS and TCCON observed by Hedelius et al. (2016). Velazco et al. (2018) found an average offset to TCCON of approximately 0.46 % (~1.84 ppm) when comparing 2 years of co-located observations between an EM27/SUN FTS and the TCCON site at the University of Wollongong, Australia. While some of these biases are large enough to produce significantly different results depending on whether the EM27/SUN FTS or TCCON is chosen for the ground-based validation of satellite-based XCO2 retrievals, they tend to be systematic in nature and can be corrected to achieve acceptable agreement with TCCON through regular calibration measurements. The two EM27/SUN FTS used in Fairbanks were calibrated against the Caltech TCCON, and bias corrections were implemented to ensure that both EM27/SUN and TCCON observations are comparable sources of validation data for OCO-2 (see Sect. S1 in the Supplement for details on EM27/SUN instrument comparisons). The EM27/SUNs were also aligned and serviced at KIT, during which time they were compared to measurements from the Karlsruhe TCCON.
https://doi.org/10.5194/amt-13-5033-2020 Atmos. Meas. Tech., 13, 5033-5063, 2020

The objective of this study is to explore ways of defining quality control criteria for OCO-2 in high-latitude regions so as to maximize spatial and temporal coverage over the boreal forest while maintaining acceptable agreement with ground sites. It is also essential that biases in OCO-2-retrieved XCO2 be carefully evaluated under different quality control filtering regimes and in the context of high-latitude seasonality studies. To this end, we first define retrieval, quality control, and aggregation methods for ground-based measurements that are reasonably equivalent for EM27/SUN or TCCON observations. We verify that EM27/SUN retrievals of XCO2 in Fairbanks are interchangeable with TCCON through comparisons with the Caltech TCCON, which are used to rescale EM27/SUN observations to the TCCON trace-gas scale. Then, we turn to the problem of data paucity in OCO-2 at high northern latitudes during spring and autumn and investigate the quality control filters applied to those data. We subsequently suggest new quality control filters, specifically validated in boreal forest regions, that substantially increase OCO-2 high-latitude data throughput, and we evaluate the consequences of applying different sets of quality control filters to coincident OCO-2 retrievals at three boreal forest sites. Finally, we discuss observed seasonality in OCO-2 biases in the boreal forest and explore some retrieval parameters that may contribute to seasonality in bias.

Sites and data sources
Ground-based column measurements were collected at three sites in the boreal forest, including two TCCON sites at East Trout Lake (ETL), Saskatchewan, Canada (54.354° N, 104.987° W; Wunch et al., 2017a) and Sodankylä (SOD), Finland (67.367° N, 26.631° E; Kivi et al., 2017), as well as long-term measurements in Fairbanks (UAF), Alaska, USA (65.859° N, 147.85° W), using two Bruker EM27/SUN spectrometers (see Fig. 1). Observations at the Sodankylä TCCON site began in 2009 and span the full period of OCO-2 observations considered in this analysis, from September 2014 to November 2018, with the exception of the winter months (Kivi and Heikkinen, 2016; Kivi et al., 2017). At East Trout Lake, observations began in October 2016, and because this site is farther south, these measurements are nearly year round (Wunch et al., 2017a). In Fairbanks, the Los Alamos National Laboratory (LANL) EM27 (owned by LANL) was operated during August-October 2016 and March-October 2017, while the Karlsruhe Institute of Technology (KIT) EM27 (owned by KIT) was operated in April-October 2018. Regular characterization of the instrument line shape (ILS) for each EM27/SUN spectrometer was used to monitor instrument performance over time (see Sect. S1). The LANL EM27 was regularly compared to the Caltech TCCON spectrometer in side-by-side observations and was used as a transfer standard to rescale EM27/SUN retrievals in Fairbanks to the TCCON trace-gas scale (see Sect. S1). Tight correlations between the LANL and KIT EM27/SUN spectrometers, and between the LANL EM27 and Caltech TCCON instrument, suggest that, given the appropriate bias correction, observations from either of the two EM27/SUN spectrometers are relatively interchangeable with TCCON observations.
Retrievals from TCCON sites are vetted with careful quality control standards before being archived publicly (Kivi et al., 2017; Wunch et al., 2017a). The GGG2014 retrieval algorithm is used to retrieve XCO2 from TCCON and EM27/SUN observations (Wunch et al., 2015), with some input modifications introduced in EM27/SUN retrievals by EGI, as discussed by Hedelius et al. (2016) and sourced from Hedelius and Wennberg (2017). Quality controls applied to EM27/SUN retrievals follow those outlined by Hedelius et al. (2016), including an upper bound on sza at 82° and an upper bound on XCO2 retrieval error at 5 ppm. In addition to the quality controls suggested by Hedelius et al. (2016), a lower bound is set on the average solar intensity (SIA) in EM27/SUN retrievals at 90 AU. After quality control filtering, EM27/SUN retrievals are smoothed by eliminating retrieved XCO2 that deviates by more than 1 ppm from a five-point moving average (spectra are collected approximately every 10-15 s). Throughout this paper, all time aggregations of retrieved XCO2 from ground-based observations were weighted by the inverse of the retrieval error using the following:

$$\overline{X}_{\mathrm{CO_2}} = \frac{\sum_i x_i \, x_{\mathrm{err},i}^{-1}}{\sum_i x_{\mathrm{err},i}^{-1}}, \qquad (1)$$

where $x_i$ is the retrieved XCO2 of the ith retrieval in the aggregation interval, and $x_{\mathrm{err},i}$ is the corresponding retrieval error.
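The EM27/SUN quality controls and the error-weighted aggregation described above can be sketched as follows. This is a minimal illustration rather than the processing code used in the study: the class and function names are hypothetical, the bounds are assumed to be inclusive, and the five-point moving average is assumed to be centered, to include the point itself, and to shrink at the ends of the series.

```python
from dataclasses import dataclass

@dataclass
class Retrieval:
    xco2: float      # retrieved XCO2 (ppm)
    xco2_err: float  # XCO2 retrieval error (ppm)
    sza: float       # solar zenith angle (degrees)
    sia: float       # average solar intensity (AU)

def passes_qc(r, max_sza=82.0, max_err=5.0, min_sia=90.0):
    """Thresholds quoted in the text; inclusive bounds are an assumption."""
    return r.sza <= max_sza and r.xco2_err <= max_err and r.sia >= min_sia

def within_moving_average(values, window=5, max_dev=1.0):
    """Indices of values within max_dev ppm of a centered moving average.

    Assumption: the window is truncated at the ends of the series.
    """
    half = window // 2
    keep = []
    for i, v in enumerate(values):
        lo, hi = max(0, i - half), min(len(values), i + half + 1)
        mean = sum(values[lo:hi]) / (hi - lo)
        if abs(v - mean) <= max_dev:
            keep.append(i)
    return keep

def error_weighted_mean(xs, errs):
    """Average weighted by the inverse of the retrieval error."""
    weights = [1.0 / e for e in errs]
    return sum(w * x for w, x in zip(weights, xs)) / sum(weights)

# Example: a 3 ppm spike in an otherwise flat series is removed.
series = [400.0] * 4 + [403.0] + [400.0] * 4
print(within_moving_average(series))
print(error_weighted_mean([400.0, 404.0], [1.0, 3.0]))
```

Note that the weights here are the inverse of the error, as stated in the text, rather than the inverse variance that is common elsewhere; retrievals with larger reported errors are therefore down-weighted less strongly than they would be under inverse-variance weighting.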
OCO-2 observations were retrieved with the ACOS B9 retrieval algorithm, and retrievals for this study were obtained from OCO-2 B9 Lite files (OCO-2 Science Team/Michael Gunson, Annmarie Eldering, 2018), which have been initially screened for cloud cover as described by Taylor et al. (2016) and bias corrected as described by Osterman et al. (2018). Only OCO-2 soundings over land are included in this analysis (with land_fraction = 100), and the standard bias correction to TCCON is applied to all OCO-2 retrievals of XCO2, unless otherwise stated. Following the coincidence criteria defined by Wunch et al. (2017b), OCO-2 soundings were considered coincident to ground sites if they fell within a 5° latitude by 10° longitude box centered on the ground site and occurred on the same day as the corresponding ground measurements. At Fairbanks and Sodankylä, OCO-2 observations consistently occur within approximately 30 min of local solar noon, and at East Trout Lake they occur within approximately 1 h of local solar noon. Therefore, we define a daily ground-based reference value for XCO2 (referred to as the near-noon ground measurement or NNG), which is the error-weighted average (see Eq. 1) of ground-based XCO2 collected within 2 h of local solar noon.
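The coincidence criteria and the near-noon ground (NNG) reference described above can be sketched as below. Several details are assumptions made for illustration: the 5° by 10° box is interpreted as full widths (±2.5° latitude, ±5° longitude), local solar noon is approximated from longitude alone (ignoring the equation of time), and all function names are hypothetical.

```python
from datetime import date

def is_coincident(snd_lat, snd_lon, snd_date, site_lat, site_lon,
                  ground_dates, lat_width=5.0, lon_width=10.0):
    """True if a sounding lies in the box centered on the site and falls on a
    day with ground measurements. Box widths are assumed to be full widths."""
    dlat = abs(snd_lat - site_lat)
    # Wrap the longitude difference into [-180, 180) before taking its size.
    dlon = abs((snd_lon - site_lon + 180.0) % 360.0 - 180.0)
    return (dlat <= lat_width / 2.0 and dlon <= lon_width / 2.0
            and snd_date in ground_dates)

def approx_solar_noon_utc(site_lon):
    """Approximate local solar noon in UTC hours (equation of time ignored)."""
    return (12.0 - site_lon / 15.0) % 24.0

def near_noon_ground(obs, site_lon, window_h=2.0):
    """Inverse-error-weighted mean of (utc_hour, xco2, err) tuples taken
    within window_h hours of approximate local solar noon."""
    noon = approx_solar_noon_utc(site_lon)
    num = den = 0.0
    for utc_hour, xco2, err in obs:
        dt = abs((utc_hour - noon + 12.0) % 24.0 - 12.0)
        if dt <= window_h:
            num += xco2 / err
            den += 1.0 / err
    return num / den if den > 0.0 else None

# Example with the Fairbanks site coordinates given in the text.
uaf_lat, uaf_lon = 65.859, -147.85
days = {date(2017, 5, 1)}
print(is_coincident(66.0, -150.0, date(2017, 5, 1), uaf_lat, uaf_lon, days))
print(near_noon_ground([(21.0, 400.0, 1.0), (22.0, 402.0, 1.0),
                        (15.0, 410.0, 1.0)], uaf_lon))
```

A production implementation would more likely compute solar noon from an ephemeris; the longitude-based estimate is used here only to keep the sketch self-contained.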

OCO-2 quality control filtering
Three different sets of quality control filters were applied to OCO-2 high-latitude retrievals in this study and are defined in Table 1 (see the Supplement or Osterman et al., 2018, for definitions of quality control parameters). Two of these three sets of quality control (QC) filters are recommended by the OCO science team for ACOS B8 retrievals (B8 QC) and ACOS B9 retrievals (B9 QC) and are summarized by the binary variables xco2_quality_flag_b8 and xco2_quality_flag in the OCO-2 B9 Lite files (Osterman et al., 2018; O'Dell et al., 2018). Methods for selecting quality thresholds and details on the B8 QC filters are discussed by O'Dell et al. (2018). Improvements in pointing accuracy in ACOS B9 (Kiel et al., 2019), and a careful reevaluation of quality control parameters, allowed for intentionally more permissive quality thresholds in B9 QC than in B8 QC; this resulted in a substantial increase in data throughput over regions, such as the boreal forest and high latitudes in general, that have been sparsely represented under past OCO-2 ACOS spectral fitting and quality control regimes. The third set of quality control filters (boreal QC) was determined by evaluating quality control histograms like those presented by O'Dell et al. (2018), with "truth" taken as the NNG observations from the three boreal forest sites (see Appendix Fig. A1). Scatterplots of bias in XCO2 (ΔXCO2 ≡ OCO-2 − NNG) against various retrieval parameters were also considered as a way to search for groupings of bias outliers that could be eliminated with small changes in quality control thresholds. The boreal QC were set with the goal of maximizing data throughput for high-latitude boreal forest sites in spring and autumn while maintaining acceptable ranges of bias at boreal forest sites. Additional retrieval parameters, not used in B8 or B9 QC, that relate to challenges in high-latitude observations were also considered for the boreal QC, including the difference between retrieved and a priori temperature (deltaT), solar zenith angle (sza), XCO2 retrieval uncertainty (xco2_uncertainty), and total column water vapor (tcwv).
Changes to thresholds for albedo in the strong CO2 band (albedo_sco2), the quality of the spectral fit in the weak CO2 band (rms_rel_wco2), and the standard deviation in surface elevation in the satellite field of view (altitude_stddev) were major contributors to the increase in passable high-latitude retrievals with B9 QC relative to B8 QC (compare Figs. 2 and 3). In the boreal QC, ranges of acceptable values are expanded from those in the B9 QC for the ratios of single-band retrievals of CO2 (co2_ratio) and H2O (h2o_ratio) and the quality of the spectral fit in the weak CO2 band (rms_rel_wco2). Albedo in the strong CO2 band (albedo_sco2) is not used as a QC filter in the boreal QC because it seemed that problematic data with low albedo_sco2 could be screened by other QC filters, and there was no evidence that low albedo_sco2 was explicitly correlated to larger OCO-2 biases at boreal forest sites (see quality control plots in Appendix A). In fact, increases in bias and retrieval standard deviation were more often associated with high albedo in the strong CO2 band than with low values. More conservative thresholds were placed on the slope of albedo in the strong CO2 band given by the continuum fit (albedo_slope_sco2) than were previously used in the B8 QC or the B9 QC due to observed increases in the standard deviation of retrievals and larger negative biases specifically associated with more negative albedo slopes. One possible explanation for this observation is that certain surface types that are more prevalent in the boreal forest are not correctly modeled by the ACOS B9 algorithm, and this could be related to snow-covered surfaces. We expect that introducing a polynomial fit to the albedo in each band, rather than a linear fit, could improve the accuracy of modeled surface albedo in future ACOS versions and potentially result in reduced high-latitude biases. Thresholds for the difference between retrieved surface pressure and a priori surface pressure at the pointing location of the O2 A band (dp_o2a) remained the same in boreal QC as in B9 QC, while thresholds for the difference between retrieved surface pressure and a priori surface pressure at the pointing location of the strong CO2 band (dp_sco2) were made marginally more conservative. Kiel et al. (2019) discuss the pointing errors and other long-term challenges with surface pressure bias in OCO-2 retrievals that led to the addition of the dp_o2a and dp_sco2 parameters, in which there is one retrieved surface pressure and a separate a priori surface pressure defined for each band. The aerosol optical depth (aod) parameters are mostly the same in the boreal QC as in the B9 QC, with the exceptions that total aod (aod_total) and the combined dust, water, and sea salt aod (dws) were removed in the boreal QC because these seemed superfluous after applying other aod filters. While the range of acceptable values for the difference between retrieved and a priori vertical CO2 gradient (co2_grad_del) is nearly the same in the B9 QC as in the boreal QC, the range of values is shifted up. This choice was made based on the distribution of co2_grad_del for the boreal forest sites, and the difference may be attributed to the use of a regional data set for boreal QC rather than a global data set for B9 QC. As previously mentioned, several parameters were used to define quality control filters in the boreal QC that were not included in the parameters for B8 QC and B9 QC. A threshold for sza was introduced in the boreal QC and was chosen to restrict data furthest north to the months of March through November. Potential challenges with data at high sza are discussed in the introduction of this paper, and high sza was found to be correlated with larger negative OCO-2 biases at boreal forest sites. The sza threshold in the boreal QC only screens approximately 0.5 % of retrievals that manage to move through the other boreal QC filters. The thresholds placed on the difference between retrieved and a priori temperature (deltaT) and total column water vapor (tcwv) were chosen because very low atmospheric water vapor or large differences between retrieved and modeled temperatures are likely to correspond with cold weather and snow cover. In particular, before the application of quality control filters, large negative biases in OCO-2 retrievals were found to be associated with low values of tcwv (discussed in more detail in Sect. 4.3). Although the majority of biased retrievals with low tcwv are screened out by other quality filters, this filter helped to remove a small number of outliers that pass the other QC filters (see Fig. 21). Finally, the uncertainty in retrieved XCO2 (xco2_uncertainty) was included arbitrarily in the analysis and found to be effective in eliminating a small number of outliers that made up less than 0.05 % of retrievals not screened by other filters.
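Threshold-based filtering of the kind described in this section can be sketched as a table of acceptable ranges applied to each sounding. The parameter names below are taken from the text, but the numerical ranges are placeholders invented for illustration and are not the Table 1 thresholds; the function name and the dictionary layout are likewise hypothetical.

```python
# Placeholder ranges for illustration only; these are NOT the Table 1 values.
# Each entry maps a retrieval parameter to (lower bound, upper bound), with
# None meaning that the parameter is unbounded on that side.
EXAMPLE_QC = {
    "co2_ratio": (0.99, 1.04),
    "sza": (None, 70.0),
    "tcwv": (2.0, None),
}

def failed_filters(sounding, qc):
    """Return the names of all QC parameters whose value falls out of range."""
    failed = []
    for name, (lo, hi) in qc.items():
        value = sounding[name]
        if (lo is not None and value < lo) or (hi is not None and value > hi):
            failed.append(name)
    return failed

def passes(sounding, qc):
    """A sounding is kept only if no individual filter is triggered."""
    return not failed_filters(sounding, qc)

sounding = {"co2_ratio": 1.00, "sza": 75.0, "tcwv": 1.0}
print(failed_filters(sounding, EXAMPLE_QC))
print(passes(sounding, EXAMPLE_QC))
```

Keeping the thresholds in a single table makes it straightforward to swap one QC set for another and to compare how many soundings each set retains.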

Effect of QC on data throughput north of 50° N
For each set of QC filters, all retrievals over land north of 50° N latitude in OCO-2 Lite files were evaluated to determine how many failed to meet the quality thresholds for each parameter in each month. The results in Figs. 2, 3, and 4 show the number of soundings flagged as bad for each individual QC parameter, so larger values suggest that more data are being removed during quality filtering. One caveat is that a sounding removed in QC filtering may have been flagged as bad for multiple parameters or for as few as one. Figures 2 and 3 show there is a clear seasonality to triggered quality filters, with the majority of soundings flagged as bad occurring in spring and early summer. This seasonality is slightly diminished with the B9 QC (Fig. 3) relative to the more conservative B8 QC (Fig. 2) and is only marginally manifested in the boreal QC (Fig. 4). The reduction in the number of filtered soundings in spring with boreal QC is largely attributable to less conservative bounds on the spectral fit quality in the weak and strong CO2 bands (rms_rel_wco2, rms_rel_sco2) and the ratios of single-band retrievals of CO2 (co2_ratio) and H2O (h2o_ratio). In all three sets of quality control filters, the parameters which most often cause soundings to be flagged as bad, resulting in the removal of data points, are the spectral fit quality (rms_rel_wco2, rms_rel_sco2), the ratios of single-band retrievals of CO2 (co2_ratio) and H2O (h2o_ratio), and differences between the retrieved and various a priori surface pressures (dp_sco2, dp_o2a, dp). The fact that these parameters account for a greater abundance of retrievals being flagged as bad in spring and autumn suggests that there could be seasonal effects related to these retrieval parameters that need to be accounted for in high-latitude measurements. In particular, there has been speculation that spring snow cover would result in low surface albedo in the 1.61 and 2.06 µm bands, and patchy snow cover or snow-free vegetation protruding from the snow pack could cause variability in albedo within the satellite's field of view (Wiscombe and Warren, 1980; Boesch et al., 2011). However, after matching Moderate Resolution Imaging Spectroradiometer (MODIS) snow cover data to coincident OCO-2 retrievals at our boreal forest sites, there was no clear connection found between snow cover and increased magnitudes of OCO-2 bias with or without QC filtering. It may still be the case that an incongruous spatial resolution between MODIS and OCO-2 is masking the effects of snow cover on bias or that OCO-2 is only biased by snow in combination with certain other effects of cold weather conditions that are more frequently occurring in spring.
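The per-parameter, per-month tally described in this section, along with its caveat about soundings that trigger more than one filter, can be sketched as follows. The thresholds and sounding values are placeholders invented for illustration, and the function name is hypothetical.

```python
from collections import Counter

# Placeholder ranges for illustration only; not the thresholds from Table 1.
EXAMPLE_QC = {"co2_ratio": (0.99, 1.04), "dp_o2a": (-5.0, 5.0)}

def count_flags(soundings, qc):
    """Count flagged soundings per (month, parameter) and the total removed.

    A sounding that fails several filters is counted once under each failed
    parameter but only once in the removed total.
    """
    counts = Counter()
    removed = 0
    for s in soundings:
        failed = [name for name, (lo, hi) in qc.items()
                  if not lo <= s[name] <= hi]
        for name in failed:
            counts[(s["month"], name)] += 1
        if failed:
            removed += 1
    return counts, removed

soundings = [
    {"month": 4, "co2_ratio": 0.95, "dp_o2a": 8.0},   # fails both filters
    {"month": 4, "co2_ratio": 1.00, "dp_o2a": -7.0},  # fails dp_o2a only
    {"month": 7, "co2_ratio": 1.01, "dp_o2a": 0.5},   # passes
]
counts, removed = count_flags(soundings, EXAMPLE_QC)
print(dict(counts), removed)
```

Because of the double counting, the per-parameter tallies can sum to more than the number of soundings actually removed, which is why the flagged counts indicate which filters are most active rather than how much data each one removes on its own.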
Additional data gained from applying boreal QC rather than B9 QC can be visualized as an increase in spatial coverage of terrestrial high-latitude regions. Figures 5 and 6 show the difference in the number of soundings passed as "good" by boreal QC relative to B9 QC in each 1° latitude by 1° longitude geographic grid cell. These maps point to substantial increases in spatial coverage of the boreal forest (~50-70° N latitude band) in the spring and autumn months with boreal QC. This improvement in coverage is an important advantage of the boreal QC for selecting OCO-2 retrievals with the goal of evaluating longitudinal trends in seasonal cycles for the boreal forest. The maps in Figs. 5 and 6 also show some areas where boreal QC provides less OCO-2 data throughput than B9 QC. First, the sza threshold on the boreal QC removes most soundings that manage to pass the other quality filters in winter (November through February), but winter observations are nearly absent from the OCO-2 Lite files before any QC filtering is applied, either because they are removed by prescreening or because the data are not collected in the first place. It therefore seems reasonable to exclude these data points to the greatest extent possible, and, in general, analyses of satellite-based high-latitude XCO2 continue to be confined to the period between March and October. Second, Figs. 5 and 6 show that there are some regions where boreal QC yields noticeably less throughput than B9 QC in June, July, and August; however, these are months when there is already an abundance of data, and the grid cells with decreased boreal QC throughput are often adjacent to grid cells with an increase in throughput. Overall, the boreal QC provides increased spatial coverage, and this is complemented by an increase in total throughput for soundings over land north of 50° N (see Fig. 7).
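The gridded throughput difference mapped in this section can be sketched by binning passed soundings into 1° by 1° cells and differencing the counts. This is an illustrative outline with hypothetical function names and made-up coordinates; cells are labeled by the floor of latitude and longitude, which is an assumption about the gridding convention.

```python
import math
from collections import Counter

def grid_counts(soundings):
    """Count soundings, given as (lat, lon) pairs, per 1 x 1 degree cell.

    Cells are keyed by their south-west corner (floor of lat and lon).
    """
    return Counter((math.floor(lat), math.floor(lon)) for lat, lon in soundings)

def throughput_difference(passed_a, passed_b):
    """Per-cell difference in passed soundings, QC set A minus QC set B."""
    a, b = grid_counts(passed_a), grid_counts(passed_b)
    return {cell: a[cell] - b[cell] for cell in set(a) | set(b)}

# Made-up sounding locations passed by two different QC sets.
passed_boreal = [(65.2, -147.9), (65.7, -147.1), (54.3, -104.9)]
passed_b9 = [(65.5, -147.5), (60.1, 10.2)]
print(throughput_difference(passed_boreal, passed_b9))
```

Positive values mark cells where the first QC set passes more soundings, and negative values mark cells where it passes fewer, mirroring the gains and losses discussed for the two filter sets.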
The B9 QC filters succeed in tripling the number of passed retrievals over land at high latitudes, relative to the B8 QC, and the boreal QC allow nearly double the number of retrievals allowed by B9 QC (see Fig. 7 and the right column of Fig. 9). An important result of the boreal QC is the increase in passed retrievals in May, August, and September relative to the B9 QC. While the more relaxed B9 QC allows more high-latitude retrievals than B8 QC, the relative throughput between any two months remains roughly unchanged, and the shapes of the histograms in Fig. 7a and b are very similar despite scaling. By plotting monthly snow extent in the Northern Hemisphere, as reported by NOAA (Robinson et al., 2012), alongside monthly average sza and the monthly fraction of soundings north of 50° N passed by QC filters, Fig. 7 provides further evidence that some combination of sza and snow cover could be playing a role in high-latitude data removal. If solar zenith angle (sza) were the primary driver of seasonality in high-latitude data throughput, one would expect to obtain approximately the same quantity of passed retrievals in May as in July, but Fig. 7 indicates that nearly twice as many high-latitude retrievals pass QC filters in July. As mentioned previously in this section, additional analysis did not lead us to the conclusion that snow is the culprit in itself, but some effects from snow or differences between fresh and melting snow cannot be entirely excluded either. It remains unclear how combinations of radiative transfer effects may be contributing to increased data removal at high latitudes in spring.

Averaging kernel corrections
The retrieval averaging kernel represents the sensitivity of retrieved XCO2 to enhancements at different altitudes in the atmospheric column. When comparing retrievals of XCO2 from two different spectrometers, averaging kernels can be used to mathematically correct for systematic sources of disagreement that result from instrumental differences. In this paper, averaging kernel corrections were applied to simulate the OCO-2 retrieval that would result by assuming the ground-based retrieval to be truth and scaling by the OCO-2 averaging kernel with an averaging kernel-correction factor, dNNG (see Sect. S2). The result of averaging kernel corrections is a set of modified ground-based measurements ($\hat{X}_{\mathrm{NNG}}$) that are the sum of the NNG XCO2 aggregate ($X_{\mathrm{NNG}}$) and a dNNG value that is uniquely calculated for each coincident OCO-2 retrieval, such that

$$\hat{X}_{\mathrm{NNG}} = X_{\mathrm{NNG}} + \mathrm{dNNG},$$

where dNNG is computed from γ, the scaling ratio of retrieved to a priori near-noon ground-based XCO2; h, the pressure-weighting function; $A_0$, the OCO-2 averaging kernel; and $x_a$, the a priori CO2 profile (see Sect. S2 for details). Averaging kernel-correction factors (dNNG) display some seasonal variability; the ratio of retrieved to a priori ground-based XCO2 was found to be the dominant term causing this seasonality (see the Supplement and compare dNNG in the top row of Fig. S2 to (1−γ) in the third row of Fig. S2). Any seasonality introduced by averaging kernel corrections appears to be on too small a scale to explain the seasonal variability observed in the following sections. Note that in the remainder of this paper East Trout Lake, Sodankylä, and Fairbanks are abbreviated in figures to ETL, SOD, and UAF, respectively.
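One common way to write an averaging kernel correction of this kind can be sketched as follows. This is not necessarily the expression used in Sect. S2. It assumes that the ground-based result is a scaled a priori profile, X_NNG = γ · Σ_j h_j x_a,j, and that the ground-based and satellite retrievals share the same a priori profile, in which case the simulated satellite value differs from X_NNG by dNNG = (1 − γ) · Σ_j h_j (1 − A_0,j) x_a,j. The function name and the two-layer example values are hypothetical.

```python
def ak_correction(x_nng, h, ak, x_a):
    """Sketch of an averaging kernel correction under the stated assumptions.

    x_nng : near-noon ground-based XCO2 (ppm)
    h     : pressure-weighting function per layer (sums to 1)
    ak    : satellite column averaging kernel per layer
    x_a   : a priori CO2 profile per layer (ppm)

    Returns (gamma, d_nng, x_nng_hat).
    """
    x_prior = sum(hj * xj for hj, xj in zip(h, x_a))  # a priori column XCO2
    gamma = x_nng / x_prior                           # retrieved / a priori
    d_nng = (1.0 - gamma) * sum(
        hj * (1.0 - aj) * xj for hj, aj, xj in zip(h, ak, x_a)
    )
    return gamma, d_nng, x_nng + d_nng

# Two-layer toy example: the kernel is below 1 in the first layer only.
gamma, d_nng, x_hat = ak_correction(
    x_nng=404.0, h=[0.5, 0.5], ak=[0.8, 1.0], x_a=[400.0, 400.0]
)
print(gamma, d_nng, x_hat)
```

In this form the correction vanishes when the averaging kernel equals 1 in every layer or when the ground-based retrieval matches the a priori column (γ = 1), which is consistent with the text's observation that the ratio of retrieved to a priori XCO2 dominates the seasonality of dNNG.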

Biases by site, viewing mode, and QC method
To evaluate the effects of these three quality control methods on observed biases and data throughput, NNG observations with averaging kernel corrections applied (X̂NNG) were compared against coincident OCO-2 retrievals at three sites in the boreal forest (East Trout Lake, Fairbanks, and Sodankylä; see Sect. 2 for definitions of NNG and coincidence criteria).
Daily averages for the complete set of coincident OCO-2 retrievals obtained with boreal QC and corresponding X̂NNG are shown in Fig. 8. While the OCO-2 and NNG observations in Fig. 8 appear to be in close agreement on most days, there are a few outliers in the OCO-2 daily averages in spring and autumn that may contribute to a potential seasonal dependence in bias. Figure 9 provides an overview of the full data sets for each site, including total average bias, standard deviation in bias, and data throughput, sorted by the satellite-observing mode and quality control method. Note that bias is defined as ΔXCO2 ≡ (OCO-2 retrieval)_i − (X̂NNG)_i for each coincident OCO-2 sounding, so that a negative bias indicates that OCO-2 retrievals are lower than NNG, and a positive bias indicates that OCO-2 retrievals are higher than NNG. At all three sites, target mode retrievals had 0.1 to 0.5 ppm lower standard deviation than glint or nadir retrievals, which may indicate the introduction of proximity bias (i.e., soundings further from the ground site contributing larger bias). If proximity bias is an important source of bias, one may expect that target mode retrievals would also have lower average biases than glint and nadir retrievals. Results from East Trout Lake (ETL) and Fairbanks (UAF) meet this expectation, with the exception of the results from the B8 QC at East Trout Lake. Only at Sodankylä (SOD) do the average biases in target mode retrievals substantially exceed the average biases in glint or nadir retrievals, warranting further investigation of target observations at Sodankylä. In particular, the B8 QC results in an average bias in target mode soundings at Sodankylä that is at least twice that observed at East Trout Lake or Fairbanks, and because this increase is accompanied by increased standard deviation in target mode biases at Sodankylä, it could indicate influence from outliers. While admitting additional data by switching from B8 QC to B9 QC or from B9 QC to boreal QC tends to be accompanied by an increase of approximately 0.1 to 0.3 ppm in the standard deviation in biases, average biases at the three sites are not consistently larger with the boreal QC (see left and center columns of Fig. 9). In considering the use of boreal QC for certain science applications at high latitudes, the introduction of additional scatter should be weighed against the large increase in usable retrievals, as shown in Fig. 9c, f, and i and in Fig. 7.
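The per-mode statistics discussed above (average bias, standard deviation in bias, and number of soundings) can be illustrated with a short sketch. It assumes each coincident sounding is available as a (mode, OCO-2 XCO2, corrected NNG XCO2) tuple in ppm; this layout and the function name are illustrative assumptions, not the processing code used for Fig. 9.

```python
from collections import defaultdict
from statistics import mean, stdev


def bias_summary(records):
    """Summarize bias per viewing mode.

    `records` is an iterable of (mode, oco2_xco2, xnng_hat) tuples in ppm
    (a hypothetical layout). Bias follows the sign convention in the text:
    OCO-2 retrieval minus the averaging-kernel-corrected NNG value, so a
    negative bias means OCO-2 is lower than NNG.

    Returns {mode: (mean_bias, std_bias, n_soundings)}.
    """
    biases = defaultdict(list)
    for mode, oco2, reference in records:
        biases[mode].append(oco2 - reference)
    summary = {}
    for mode, values in biases.items():
        # The sample standard deviation needs at least two soundings.
        spread = stdev(values) if len(values) > 1 else 0.0
        summary[mode] = (mean(values), spread, len(values))
    return summary
```

Running the same summary on the soundings passed by each QC method gives one row of statistics per site, mode, and QC combination.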

Seasonal variability in bias
High-latitude regions experience a higher degree of seasonality in many climate and environment variables than midlatitude regions, and one of our primary motivations in this study is expanding our ability to evaluate CO2 seasonality in the boreal forest. Considering the total average and standard deviation in biases for all coincident soundings, as in Fig. 9, can obscure seasonal variability in biases that contributes to uncertainty in characterizing seasonal cycles of CO2 obtained from satellite observations. Figure 10 shows the monthly average biases and monthly standard deviation in biases for each site and each set of QC filters. Under all three sets of QC filters we observe seasonal trends in biases, which are more pronounced at East Trout Lake and Sodankylä than at Fairbanks. The observed seasonal variability is characterized by more positive biases in mid to late summer and more negative biases in spring and autumn, which may cause satellite-based estimates of seasonal amplitude and timing to differ from ground-based estimates of these seasonal parameters. Monthly bias distributions are visualized with box plots for each site and set of QC filters in Fig. 11 to further elucidate potential seasonal trends in OCO-2 biases. Figure 11a, b, and c show that East Trout Lake has the most pronounced seasonal variability in biases and that the trends observed are similar for all three sets of QC filters. Figure 11f suggests there is a slight seasonal trend at Sodankylä with the boreal QC that appears when March and April soundings are included. Overall, the monthly bias distributions also serve to emphasize the similarity in results from the different QC methods.

Figure 9. Average bias, standard deviation in bias, and number of passed soundings sorted by viewing geometry and quality control method and considering all coincident soundings at each of the three boreal forest sites. Note that there are no coincident nadir soundings for Fairbanks due to the satellite's operational design, which favors glint observations in orbits primarily over oceans. Note that bias is defined as ΔXCO2 ≡ (OCO-2 retrieval)_i − (X̂NNG)_i for each coincident OCO-2 sounding.

A modified OCO-2 bias correction with temperature at 700 hPa (T700)
The seasonal dependence of ΔXCO2 described in the previous section was found to be largely induced by the OCO-2 bias correction and is not apparent in ΔXCO2 calculated with un-bias-corrected OCO-2 retrievals (see Fig. 11). In OCO-2 B9 retrievals, the B9 bias correction (B9 bc; Eq. 4) for soundings over land is defined by Osterman et al. (2018). It comprises a footprint bias-correction term, foot, an overall divisor to agree with TCCON, and parameter-dependent terms adjusting based on a modified parameterization of the retrieved surface pressure bias defined by Kiel et al. (2019, dpfrac), the sum of dust, water, and sea salt aerosol optical depths (dws), and the difference between retrieved and a priori vertical gradients in the CO2 profile (co2_grad_del). Of the terms in the B9 bc, dpfrac was the only one found to have seasonal variability at boreal forest sites that was similar to that observed in ΔXCO2 with the OCO-2 bias correction (see Figs. 11 and 25). As will be discussed in Sect. 4.4, all versions of the residual in retrieved surface pressure relative to a priori surface pressure (dpfrac, dp, dp_o2a, and dp_sco2) have seasonal variability that can be at least partially attributed to temperature dependence, so we propose new OCO-2 bias corrections with a term for temperature at 700 hPa (T700) to correct for the temperature dependence in dpfrac and dp. To calculate the temperature-dependent modification to the B9 bc, we consider the linear regressions for dpfrac as a function of T700 in each of the satellite-viewing modes for soundings over land north of 50° N that pass boreal QC (see Fig. 12). Then the regression coefficients for the different viewing modes are combined into an average slope and an average y intercept, weighted by the fractional abundance of retrievals in each mode, to obtain an alternative B9 bias correction (B9 abc; Eq. 6). An alternative B8 bias correction (B8 abc; Eq. 7) was also constructed using linear regression terms for the difference between the retrieved and a priori surface pressure from GEOS5 forward processing for instrument teams (GEOS5-FP-IT; dp) as a function of T700 in Fig. 13. Applying a modification of the B8 bias correction is consistent with the fact that spectroscopy and most aspects of the radiative transfer model remained the same when the ACOS version was updated from B8 to B9. Ideally, a global bias correction would be constructed to include temperature as a component of a broader analysis that considers contributions to the OCO-2 bias in a more holistic context. In this way, the bias correction would be more uniform and widely applicable, while the effects of potential parameter covariance or other influences that we are unable to control for in this regional analysis can be mitigated. That being said, the results that follow suggest that correcting for temperature dependence could be effective in reducing seasonality in OCO-2 bias over the boreal forest.
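The combination of per-mode regression coefficients described above is an abundance-weighted average. The sketch below illustrates only that weighting step; the function name and input layout are assumptions, and the numbers in the usage lines are placeholders rather than the regression coefficients from Figs. 12 and 13.

```python
def combine_regressions(fits):
    """Abundance-weighted average of per-mode linear fits.

    `fits` maps a viewing mode to (slope, intercept, n_retrievals).
    Each mode is weighted by its share of the total retrievals, i.e.
    n_retrievals / sum of n_retrievals over all modes.

    Returns (average_slope, average_intercept).
    """
    total = sum(n for _, _, n in fits.values())
    if total == 0:
        raise ValueError("no retrievals to weight by")
    slope = sum(s * n for s, _, n in fits.values()) / total
    intercept = sum(b * n for _, b, n in fits.values()) / total
    return slope, intercept


# Placeholder coefficients for illustration only (not values from the paper).
example_fits = {"nadir": (1.0, 10.0, 1), "glint": (3.0, 30.0, 3)}
avg_slope, avg_intercept = combine_regressions(example_fits)
```

Weighting by retrieval count means that the mode contributing the most soundings north of 50° N dominates the combined temperature term.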
The second column of Fig. 14 (panels b, f, and j), in addition to results in Figs. 15-16, shows that seasonally dependent variability in biases is reduced when the dpfrac term is removed from the B9 bc, but both monthly and overall standard deviations in biases are increased. Without the dpfrac term in the B9 bc, monthly biases in March and April at Sodankylä and in April at Fairbanks are substantially reduced, and month-to-month variability at East Trout Lake is also reduced. Replacing the dpfrac or dp term with a T700 modification, as in B9 abc and B8 abc (Eqs. 6 and 7), results in monthly standard deviations in biases that are lower than those obtained with the dpfrac term removed from the B9 bc and nearly equivalent to those obtained with the standard B9 bc (Eq. 4). While some of the seasonal shape is reintroduced with the B9 abc and the B8 abc, biases are still reduced in spring and autumn relative to the B9 bc (see Fig. 15). The combined results of Fig. 15, with the total average biases and total standard deviations in biases shown in Fig. 16, suggest that the B8 abc is slightly more effective than B9 abc in reducing seasonal variability in bias, reducing total average bias, and reducing total standard deviations in biases. Figure 16 demonstrates that, for all sites and viewing modes, most of the total average biases with the B8 abc are within ±0.5 ppm. In particular, the B8 abc results in reduced average bias in target mode soundings at all three sites, as well as in nadir soundings at Sodankylä and glint soundings at East Trout Lake and Fairbanks. The B8 abc did result in slight increases in total average biases in nadir soundings at East Trout Lake and in glint soundings at Sodankylä (see Fig. 16). However, with the B9 abc, average biases in all modes at Sodankylä, and in nadir and target retrievals at East Trout Lake, are nearly doubled relative to the standard B9 bc.

Figure 13. Correlations and linear regressions for the difference between retrieved and a priori surface pressure from GEOS5 forward processing for instrument teams (GEOS5-FP-IT; dp) as a function of temperature at 700 hPa (T700) for all retrievals over land north of 50° N that pass boreal QC and separated here by viewing geometry.

Figure 14. Box plots of distributions of monthly biases at each boreal forest site and filtered by boreal QC, given the standard B9 bc (Eq. 4), given the B9 bc without the dpfrac term, given the B9 abc (Eq. 6), or given the B8 abc (Eq. 7). Note that bias is defined as ΔXCO2 ≡ (OCO-2 retrieval)_i − (X̂NNG)_i for each coincident OCO-2 sounding.

Figure 15. Monthly average bias (a, c, and e) and standard deviation in biases (b, d, and f) of coincident OCO-2 soundings at each of the three boreal forest sites with boreal QC filtering and given each of the following bias correction modifications: the standard B9 bc (Eq. 4), the B9 bc without the dpfrac term, B9 abc with a term for temperature at 700 hPa (T700; Eq. 6), and B8 bc with a term for T700 (Eq. 7).

Discussion
Results from this analysis have revealed that modified QC filters for OCO-2 retrievals have the potential to recover large quantities of previously screened terrestrial high-latitude observations and provide double or triple the number of retrievals for use in scientific studies of high-latitude regions. This increase in data throughput is accompanied by only minor changes in average bias and increases in standard deviations in bias of approximately 0.3 ppm or less (see Figs. 9 and 17). Through monthly comparisons between coincident OCO-2 retrievals and ground-based measurements at three boreal forest sites, biases (ΔXCO2) were found to exhibit some seasonal variability that is mostly independent of the QC method applied (see Fig. 10 and Fig. 11). Observations from Fairbanks exhibit less seasonal variability in OCO-2 bias than East Trout Lake or Sodankylä, and it is unclear whether this is due to geography or the difference in ground-based instrumentation. The EM27/SUN has a similar spectral resolution to OCO-2 (∼0.5 and ∼0.3 cm−1, respectively), both of which differ from that of TCCON (∼0.02 cm−1) by an order of magnitude. The more similar spectral resolution of the EM27/SUN to OCO-2 could be a reason for smaller biases between these instruments. A recent study by Sha et al. (2019) compared long-term measurements of XCO2 using multiple ground-based infrared spectrometers at Sodankylä, including the EM27/SUN and TCCON. Their results suggest that the EM27/SUN at Sodankylä may retrieve higher XCO2 than TCCON in spring, and factors such as temperature and water vapor may influence differences between EM27/SUN and TCCON retrievals of XCO2. More direct comparisons between EM27/SUN observations and satellite measurements from Sodankylä and other locations are needed to further investigate these relationships. Seasonally dependent biases can be challenging to correct and can ultimately result in biases between satellite-based and ground-based estimates of seasonal cycle parameters such as amplitude and timing. Therefore, it is essential that any seasonality in biases be characterized and potential sources be identified. Both the B9 abc in Eq. (6) and the B8 abc in Eq. (7) result in reduced monthly average biases in spring and autumn, and the standard deviations in biases are nearly equivalent to those obtained with the standard B9 bc in Eq. (4). The B8 abc was found to be slightly more effective than the B9 abc in reducing the seasonal curvature in monthly biases at East Trout Lake and Sodankylä, which could allow for lower uncertainty in seasonal cycle parameters estimated using OCO-2 retrievals over the boreal forest. However, some month-to-month variability persists with any of the bias corrections applied in this paper, and it is still important to continue to explore other contributions to seasonal variability in OCO-2 bias, such as the choice of QC, coincidence criteria, or processing of ground-based data. In this vein, the following sections consider differences in monthly average bias and standard deviation in bias between boreal QC and B9 QC. Then we consider a number of QC parameters that exhibit seasonal behavior and their potential role in seasonally dependent biases at boreal forest sites. The effects of limiting coincidence by midtropospheric temperature or changing the ground-based reference from NNG to an average of ground-based retrievals at a restricted range of solar zenith angles were also explored but were found not to significantly affect seasonal dependence in bias (see Appendices B and C).

Figure 16. Average bias and standard deviation in bias sorted by viewing geometry and bias correction modification for all coincident soundings at each of the three boreal forest sites with boreal QC. Note that there are no coincident nadir soundings for Fairbanks due to the satellite's operational design, which favors glint observations in orbits primarily over oceans.

Potential contribution to seasonal bias from QC method
Overall, the B8 QC is the most conservative set of QC filters, the B9 QC allows for more relaxed thresholds in the QC parameters, and the boreal QC is the most permissive set of QC filters. It was observed in Fig. 9 that the total standard deviations in biases for all coincident soundings at East Trout Lake and Sodankylä increase gradually from ∼1.3 to ∼1.4 to ∼1.6 ppm as throughput increases with more relaxed QC filters (B8 QC < B9 QC < boreal QC). A similar trend at Fairbanks is reflected by an increase from ∼1.1 to ∼1.3 to ∼1.4 ppm. Figure 10 also demonstrates this increase in standard deviation with different QC filters, but there does not appear to be a seasonal trend in the monthly standard deviation in biases at East Trout Lake and Fairbanks. The anomalously high standard deviation in biases in June at Sodankylä remains to be reconciled, and it represents a potential complication that would perpetuate midsummer uncertainty even if some method of correcting seasonal trends in monthly bias were devised and implemented. Additionally, there is a substantial increase in standard deviation in biases at East Trout Lake in March with boreal QC compared to the B9 QC. While this increase in standard deviation is concerning, the availability of OCO-2 retrievals in the boreal forest in March remains insufficient for a representative sample of northern regions, so March data are not likely to be included in seasonal studies of the boreal forest at this time.
The largest difference in the absolute values of monthly bias between boreal QC and B9 QC is 0.56 ppm in April at Fairbanks (see Fig. 17a). Boreal QC also results in a 0.18 ppm larger absolute bias than B9 QC in April at East Trout Lake. In July through October, boreal QC results in monthly biases at Fairbanks that are 0.1 to 0.4 ppm larger than with B9 QC, while in May and June there is no change in average monthly biases between the two QC methods. At East Trout Lake and Sodankylä, boreal QC produces some monthly biases that are smaller, by up to 0.4 ppm, than B9 QC. Despite some increases in monthly biases with boreal QC relative to B9 QC, it is clear from Fig. 17 that the modifications in QC filters do not always result in larger monthly biases, and the effects should be weighed against the potential advantages of increasing passable retrievals and spatial coverage. We conclude that the differences between boreal QC and B9 QC are not likely to be a major source of seasonal variability in bias because seasonal dependence is observed with both QC methods in Figs. 10 and 11.

The co2_ratio refers to the ratio of XCO2 retrieved by the 2.06 µm band to that retrieved by the 1.61 µm band. Recall that Wiscombe and Warren (1980) measured low and differing reflectance for snow in the 1.61 and 2.06 µm bands. Systematic departure from unity in the co2_ratio could result from spectroscopic inaccuracies in either band that are characteristic of the instrument or the line list used in the retrieval algorithm. Anomalous departures from unity in the co2_ratio can arise from low signal-to-noise ratio in either or both CO2 bands, which can be due to cloud and aerosol interference or the low reflectivity of snow- and ice-covered surfaces (Crisp et al., 2012). Patchy snow cover or vegetation protruding through the snow may also cause discrepancies in signal intensity between the weak and strong CO2 bands as a result of variable surface reflectivity in the satellite field of view. In all months at midlatitudes and in May through October at high latitudes, terrestrial retrievals have a systematic departure from unity in the median co2_ratio, with the data approximately normally distributed around 1.012 (see Fig. 18). There is an even greater departure from unity in the co2_ratio for high-latitude retrievals in the winter months, namely November through April, with the data approximately normally distributed around 1.020. Figure 18 demonstrates that there is seasonal variation in the median and distribution of co2_ratio at latitudes north of 50° N that is not observed at latitudes from 10 to 50° N. This monthly difference in the distribution of retrieved co2_ratio at high latitudes may be a symptom of the effects of snow albedo, or it may be attributable to some other factor, but it warrants some attention because it may be associated with radiative transfer effects that contribute to negative biases in spring at the boreal forest sites.

Total column water vapor (tcwv), bias, and temperature dependence
The parameter tcwv refers to total column water vapor, which is calculated as the product of a scaling factor determined by the full physics retrieval and the a priori tcwv from the European Centre for Medium-Range Weather Forecasts (ECMWF). Atmospheric water vapor is expected to be seasonal, and the seasonality of tcwv at the three boreal forest sites is illustrated in the box plots in Fig. 19. Large amounts of atmospheric water vapor can suggest that there may be more cloud cover degrading the quality of both satellite-based and ground-based measurements. Even in the absence of clouds, water vapor is a strong infrared absorber in all three bands used by OCO-2, and water vapor is identified in Boesch et al. (2019) as the most important absorbing gas interfering with line fitting in OCO-2 retrievals. In selecting QC filters for the boreal QC, large negative biases (OCO-2 retrievals reporting lower values of XCO2 than NNG) were correlated to low tcwv, prompting the introduction of quality thresholds for tcwv in the boreal QC (see Fig. 20 and Table 1). Figure 21 shows the additional retrievals cut by the lower bound on tcwv at 3 kg m−2 (data left of the black dashed line) in the boreal QC which are not cut by other QC filters, and an overall downward trend persists in these removed data. One possible explanation is path shortening resulting from atmospheric scattering, which could result in retrieved spectral radiance that has failed to penetrate atmospheric layers near the surface. This would cause all retrieved gases to be underestimated so that total column water vapor and the total CO2 column are both erroneously low. However, Fig. 22 shows that while path shortening may explain some instances of negative biases and low tcwv, the relationship persists between a priori tcwv from ECMWF reanalysis and negative ΔXCO2 biases. Because water vapor is a strong infrared absorber, it would be reasonable to expect retrieval errors when tcwv is high, but low atmospheric water vapor is also associated with cold fronts and snow cover. Figure 23 illustrates the relationship between tcwv and midtropospheric temperature at 700 hPa (T700) in boreal forest coincident OCO-2 retrievals. There is a distinct maximum for tcwv at a given atmospheric temperature that is defined by the condensation temperature of water, and Fig. 23 shows that most of the retrievals with tcwv below 3 kg m−2 are also those with T700 below approximately 250 K. Therefore, it is reasonable to conclude that negative OCO-2 biases are also occurring at low temperatures, which is demonstrated by the correlations between ΔXCO2 and T700 in Fig. 24.
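The effect of the lower tcwv bound can be examined by splitting soundings at the threshold and comparing the biases on each side. The following is a minimal sketch under stated assumptions: the 3 kg m−2 bound is taken from the text, while the function names, the (tcwv, bias) input layout, and the choice to retain soundings exactly at the bound are illustrative assumptions.

```python
from statistics import mean

TCWV_MIN = 3.0  # kg m^-2, lower bound on tcwv quoted in the text


def split_by_tcwv(soundings, tcwv_min=TCWV_MIN):
    """Split (tcwv, bias_ppm) pairs into (kept, removed) lists.

    Soundings with tcwv >= tcwv_min are kept; whether the bound is
    inclusive is an assumption made here for illustration.
    """
    kept, removed = [], []
    for tcwv, bias in soundings:
        target = kept if tcwv >= tcwv_min else removed
        target.append((tcwv, bias))
    return kept, removed


def mean_bias(soundings):
    """Mean bias in ppm, or None when the list is empty."""
    return mean(bias for _, bias in soundings) if soundings else None
```

If the removed group shows a more negative mean bias than the retained group, that is consistent with the association between low tcwv and negative bias described above.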
4.4 Seasonal variability and temperature dependence in retrieved surface pressure bias
The dp_o2a and dp_sco2 variables are the residuals of retrieved and a priori surface pressure at the pointing locations of the O2 A and strong CO2 bands, respectively. These two retrieval parameters were first included in B9 following the discovery of a pointing error that caused systematic inaccuracies in retrieved surface pressure (Kiel et al., 2019). Before the release of ACOS B9, only a single dp variable (the difference between retrieved and a priori surface pressure from GEOS5-FP-IT) was used as a quality control and bias-correction parameter. In the analysis by Kiel et al. (2019), an additional parameterization of surface pressure residuals (dpfrac) was introduced for use in the OCO-2 B9 bc. The inclusion of dpfrac and dp in the OCO-2 bias correction is not the only reason that surface pressure residuals are important; accurate surface pressure measurements are essential for calculating XCO2, which is defined as the ratio of the total CO2 column to the total column of dry air. These terms are essential components of quality control and bias-correction methods because even small inaccuracies can translate to unacceptable errors in XCO2. While the effects of removing the dp term from the bias correction are considered in Sect. 3.5, it is probably inadvisable to remove this term entirely from bias correction or to loosen quality thresholds on dp variables without careful consideration of the impacts on XCO2. Furthermore, in attributing the causes and effects of trends in surface pressure residuals, there may be many competing factors. The seasonal box plots in Fig. 25 show that there is a seasonality in all four of the variants on surface pressure residuals (dpfrac, dp_o2a, dp_sco2, and dp) at the three boreal forest sites that is similar to the seasonality in bias-corrected ΔXCO2 (compare to Fig. 11). Similar seasonality may be a result of multiple seasonal parameters that affect dp and ΔXCO2 equally. Figures 12 and 13 show that both dpfrac and dp also exhibit linear dependence on T700, with greater linearity than the correlations between ΔXCO2 and T700, given either B9 QC or boreal QC in Fig. 24c and d. Not only is temperature clearly seasonal and correlated to other seasonal parameters, but rates and directions of atmospheric transport are also seasonal, and T700 has been found to link plumes in the free troposphere (Keppel-Aleks et al., 2011). Kiel et al. (2019) show that systematic biases in dp are characterized by a positive trend close to the Equator and a negative trend at higher southern and northern latitudes, and we claim that this could also be a manifestation of temperature dependence.

Figure 24. Bias in coincident retrievals of XCO2 (ΔXCO2 ≡ OCO-2 − NNG) as a function of midtropospheric temperature (T700 ≡ temperature at 700 hPa) for all data without QC filters and each of the three QC methods presented in this paper.

Conclusions
Through ILS testing of EM27/SUN FTS used in Alaska, regular comparisons between multiple EM27/SUN FTS, and comparisons of EM27/SUN FTS with TCCON, we established the relative equivalence of EM27/SUN and TCCON observations as ground-based references for OCO-2 validation (see Sect. S1). With the application of multiplicative corrections, EM27/SUN FTS measurements in Fairbanks were compared to OCO-2 and yield similar magnitudes in OCO-2 biases as the TCCON sites considered in this study, with less seasonal variability in biases. More long-term observations with EM27/SUN FTS at different locations will be necessary to determine if the reduced seasonality in bias at Fairbanks is due to geography or instrumentation. While there were many challenges with data availability at high latitudes under the B8 QC, both the B9 QC and the boreal QC offer a two- to three-fold increase in passable retrievals from OCO-2 Lite files (OCO-2 Science Team/Michael Gunson, Annmarie Eldering, 2018) without major sacrifices in data quality. Total average biases for all sites, viewing modes, and quality control methods were within ±1 ppm (see Fig. 9). In particular, the boreal QC allows for nearly twice as many terrestrial OCO-2 retrievals north of 50° N latitude in the months of May, August, and September relative to the B9 QC, while resulting in no distinct increase in the total average bias and less than 0.3 ppm increase in the total standard deviation of biases for coincident retrievals at boreal forest sites (see Sect. 3.3.2 and Fig. 9). With the exception of an anomalously large negative April bias at Sodankylä, seasonal variability in monthly biases at these three boreal forest sites is mostly confined between −1 and +1 ppm, which equates to the 2 ppm maximum monthly standard deviation of biases in June at Sodankylä (when data availability is most abundant; see Fig. 10). Even a slight seasonal trend can significantly impact the analysis of seasonal cycle parameters, so it is important that valid methods for reducing the seasonal dependence in OCO-2 biases are identified. The OCO-2 bias correction seems to introduce some seasonality in OCO-2 bias through the inclusion of a dp (the difference between retrieved and a priori surface pressures) bias-correction term. We propose two alternative OCO-2 bias corrections in Eqs. (6) and (7) that correct for temperature dependence in dpfrac and dp, respectively, based on linear regressions shown in Figs. 12 and 13. It may be important to note that these alternative bias corrections are specifically tailored to high-latitude OCO-2 B9 retrievals over land with boreal QC. Of these two alternative bias corrections, the B8 abc in Eq. (7) appears to be more effective in reducing seasonal variability without substantial increases in average biases in any viewing modes or increases in monthly standard deviations in biases at boreal forest sites. The choice of B9 QC or boreal QC was not found to be a clear source of seasonal dependence in monthly OCO-2 bias in the boreal forest (see Sect. 4.1), nor were the effects of proximity bias or air mass dependence found to be important contributors to seasonal variability in biases (see Appendices B and C). Several sounding-retrieval parameters that have been used as QC filters were found to exhibit seasonal variability at these boreal forest sites, including the ratio of single-band retrievals of CO2 (co2_ratio), total column water vapor (tcwv), and the differences between retrieved and a priori surface pressures (dp, dp_o2a, and dp_sco2). These parameters may contribute to seasonal variability in biases by impacting data selection in the quality-filtering process, or they may be indicative of seasonal behavior at high latitudes that is not fully addressed in the retrieval algorithm. In particular, low tropospheric temperatures, or some other parameter that may be correlated to temperature, appear to be one of the primary contributors to seasonal dependence in OCO-2 bias at high latitudes.
While the specific choices for QC parameters in the boreal QC method proposed here may still be a subject for consideration and debate, this analysis has shown that it is possible to modify quality controls tailored to a specific region and substantially increase the quantity of usable OCO-2 retrievals with only minor sacrifices in data quality. Furthermore, boreal QC coupled with an alternative bias correction that accounts for temperature dependence (Eq. 7) may yield sufficiently stable results for application in preliminary studies of boreal forest seasonal cycles of XCO2 across longitudes.
It is also possible that improvements in spectroscopic modeling in future versions of the ACOS-retrieval algorithm would reduce or remove temperature dependence in surface pressure bias.
Following methods used for determining B8 QC and B9 QC parameters and thresholds, described in more detail by O'Dell et al. (2018), the filters used in the boreal QC are determined by qualitative assessment of the plots in Fig. A1. For these plots, only OCO-2 retrievals coincident to one of the three boreal forest ground sites, namely East Trout Lake, Sodankylä, and Fairbanks, are considered, with the corresponding NNG measurements representing truth (see Sect. 2 for coincidence criteria and site locations). The plots include a histogram that shows the number of retrievals within a certain range of values (bin) for the QC filtering parameter in question (gray bars), the average ΔXCO2 in each bin with (black circles) and without (cyan circles) the standard bias correction, and the standard deviation in retrieved ΔXCO2 in each bin (blue diamonds). We began by assessing the parameters that are used in B9 but with thresholds extensively loosened or removed wherever reasonably possible. Next, a number of additional retrieval parameters were assessed to see if their inclusion could reduce average bias and standard deviation in ΔXCO2 by eliminating small numbers of outliers. After adding these parameters (deltaT, solar_zenith, xco2_uncertainty, and tcwv), the loose bounds on the remaining QC filtering parameters brought from B9 were reassessed and incrementally tightened.
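The per-bin quantities described above (count, average bias, and standard deviation in each bin of a QC parameter) can be sketched as follows. The function name, the half-open bin convention, and the use of the population standard deviation are assumptions made for illustration, not the procedure used to produce Fig. A1.

```python
from statistics import mean, pstdev


def binned_bias(values, biases, edges):
    """Per-bin count, mean bias, and standard deviation of bias.

    `values` are the QC parameter values, `biases` the matching biases
    in ppm, and `edges` the ascending bin edges. Bins are half-open,
    [lo, hi), which is a convention chosen here for illustration.

    Returns a list of (lo, hi, count, mean_bias, std_bias); the last two
    entries are None for empty bins.
    """
    rows = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        in_bin = [b for v, b in zip(values, biases) if lo <= v < hi]
        if in_bin:
            rows.append((lo, hi, len(in_bin), mean(in_bin), pstdev(in_bin)))
        else:
            rows.append((lo, hi, 0, None, None))
    return rows
```

Bins in which the mean bias or its spread departs sharply from the bulk of the data, while holding few soundings, are natural candidates for a threshold on that parameter.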
Author contributions.NJ composed this paper and conducted the analysis under the supervision of WRS.DW contributed data from the East Trout Lake TCCON site and provided guidance on and thorough evaluations of the methods and interpretations in the paper.CWO offered guidance and instructions on methods of evaluating OCO-2 satellite retrieval parameters, bias corrections, and quality controls.GBO organized targeted satellite overpasses with OCO-2 over Fairbanks and generally oversaw the satellitevalidation efforts for OCO-2.FH, TB, QT, MF, MKD, and HAP all contributed to data collection with the EM27/SUNs in Fairbanks, including instrument evaluations, maintenance, and establishing longterm operations in Fairbanks.HAP also acted as a local host during calibration to the Caltech TCCON.RK and PH operated the TC-CON station at Sodankylä, Finland, and provided data and insights into the unique aspects of high-latitude ground-based measurements of X CO 2 .
Competing interests. The authors declare that they have no conflict of interest.

Figure 1. Circumpolar map showing locations of boreal forest sites included in this study.

Figure 2. Total number of land soundings north of 50° N flagged bad by B8-quality filters in each month.

Figure 3. Total number of land soundings north of 50° N flagged bad by B9-quality filters in each month.

Figure 4. Total number of land soundings north of 50° N flagged bad by boreal-quality filters in each month.

Figure 5. Maps of differences between the number of soundings passed by boreal QC and the number of soundings passed by B9 QC in each 1° latitude by 1° longitude grid cell in each month (January to June; summing all years from 2014 to 2018).

Figure 6. Maps of differences between the number of soundings passed by boreal QC and the number of soundings passed by B9 QC in each 1° latitude by 1° longitude grid cell in each month (July to December; summing all years from 2014 to 2018).

Figure 7. The fraction of soundings that pass each set of QC filters for each month out of all land soundings north of 50° N in the OCO-2 Lite files (note that OCO-2 observations are filtered by a preliminary cloud screening before being compiled in the Lite files). Also plotted are the average solar zenith angles reported in OCO-2 retrievals that pass in each month and the monthly snow extent in the Northern Hemisphere, averaged over 2014-2018, as reported in the NOAA database (Robinson et al., 2012).

Figure 8. Time series of ground-based and satellite-based data at each boreal forest site. These consist of daily averages of OCO-2 coincident soundings filtered with boreal QC alongside corresponding daily averages of NNG with averaging kernel corrections to OCO-2 applied, as described in Sect. 3.3.1 (XNNG).

Figure 10. Monthly average bias (a, c, and e) and standard deviation in biases (b, d, and f) of coincident OCO-2 soundings at each of the three boreal forest sites and with each of the QC methods. Note that bias is defined as (ΔX CO 2 ≡ (OCO-2 retrieval)_i − (XNNG)_i) for each coincident OCO-2 sounding.

Figure 11. Box plots of distributions of monthly biases for each boreal forest site and for each QC, showing results with and without the standard B9 bias correction (bc). Box plots show the median in the center of the box, the first and third quartile as the bottom and top of the box, and the full range of data values as the bars extending above and below the box. Note that bias is defined as (ΔX CO 2 ≡ (OCO-2 retrieval)_i − (XNNG)_i) for each coincident OCO-2 sounding.

Figure 12. Correlations and linear regressions for dpfrac (defined in the Supplement or by Kiel et al., 2019) as a function of temperature at 700 hPa (T700) for all retrievals over land north of 50° N that pass boreal QC and separated here by viewing geometry.

Figure 15. Monthly average bias (a, c, and e) and standard deviation in biases (b, d, and f) of coincident OCO-2 soundings at each of the three boreal forest sites with boreal QC filtering and given each of the following bias correction modifications: the standard B9 bc (Eq. 4), the B9 bc without the dpfrac term, B9 bc with a term for temperature at 700 hPa (T700; Eq. 6), and B8 bc with a term for T700 (Eq. 7).

Figure 17. Differences between average monthly biases (a) or standard deviation in monthly biases (b) when comparing boreal QC to B9 QC. Note that Fig. 10 shows typical average monthly bias ranges from −1 to 1 parts per million (ppm).

Figure 18. Monthly histograms of the ratio in single-band retrievals of CO 2 (co2_ratio) for all unfiltered OCO-2 retrievals over land (land_fraction = 100) split into two latitude bands, namely 10 to 50° N and 50 to 90° N.

Figure 19.Seasonal box plots of retrieved tcwv (total column water vapor) at (a, b, and c) East Trout Lake, (d, e, and f) Sodankylä, and (g, h, and i) Fairbanks, given each of the three quality control methods.

Figure 20. Bias in coincident retrievals of X CO 2 (ΔX CO 2 ≡ OCO-2 − NNG), with no QC filtering, against retrieved tcwv (total column water vapor), plotted as a density map. The lower bound placed on tcwv in the boreal QC (3 kg m−2) is shown as a black dashed vertical line.

Figure 21. Bias in coincident retrievals of X CO 2 (ΔX CO 2 ≡ OCO-2 − NNG), with all boreal QC filtering except the bound on total column water vapor (tcwv), against retrieved tcwv, plotted as a density map. The lower bound placed on tcwv in the boreal QC (3 kg m−2) is shown as a black dashed vertical line.

Figure 22. Bias in coincident retrievals of X CO 2 (ΔX CO 2 ≡ OCO-2 − NNG), with no QC filtering, against a priori tcwv (total column water vapor), plotted as a density map. A priori tcwv is defined by ECMWF reanalysis data. The lower bound placed on tcwv in the boreal QC (3 kg m−2) is shown as a black dashed vertical line.

Figure 23.Midtropospheric temperature (T700 ≡ temperature at 700 hPa) as a function of total column water vapor (tcwv) for all coincident retrievals without QC filters.

Figure A1.Histograms of retrieval parameters used in the boreal QC and considering only retrievals coincident to the three boreal forest sites in this paper.These plots are in the same style as those presented by O'Dell et al. (2018) but with NNG measurements as truth.Biases in X CO 2 with and without the standard B9 bias correction, and the standard deviation in OCO-2-retrieved X CO 2 , are shown for each bin (in units of ppm).Implemented boreal QC thresholds for each parameter are represented by the black dashed vertical lines.
Data availability. OCO-2 data and quality control parameters used here are taken from OCO-2 Lite files (version 9, "B9"). OCO-2 Lite files are produced by the NASA OCO-2 project at the Jet Propulsion Laboratory, California Institute of Technology, and obtained from the NASA Goddard Earth Science Data and Information Services Center (GES DISC; https://daac.gsfc.nasa.gov/, last access: 17 March 2019). TCCON data are available from the TCCON data archive, hosted by CaltechDATA, at: https://tccondata.org/ (last access: 2 January 2019, CaltechDATA, 2020). Relevant EM27/SUN retrievals of X CO 2 from Fairbanks, Alaska, including essential components for applying averaging kernel corrections, are available in the Supplement. Other data are available on request and are in the process of being placed in a long-term repository. All data sets are also cited individually in Sect. 2, under "Sites and data sources".