Research Online Research Online Ensemble-based satellite-derived carbon dioxide and methane Ensemble-based satellite-derived carbon dioxide and methane column-averaged dry-air mole fraction data sets (2003-2018) for column-averaged dry-air mole fraction data sets (2003-2018) for carbon and climate applications carbon and climate applications

Abstract Satellite retrievals of column-averaged dry-air mole fractions of carbon dioxide (CO2) and methane (CH4), denoted XCO2 and XCH4, respectively, have been used in recent years to obtain information on natural and anthropogenic sources and sinks and for other applications such as comparisons with climate models. Here we present new data sets based on merging several individual satellite data products in order to generate consistent long-term climate data records (CDRs) of these two Essential Climate Variables (ECVs). These ECV CDRs, which cover the time period 2003-2018, have been generated using an ensemble of data products from the satellite sensors SCIAMACHY/ENVISAT and TANSO-FTS/GOSAT and (for XCO2) for the first time also including data from the Orbiting Carbon Observatory 2 (OCO-2) satellite. Two types of products have been generated: (i) Level 2 (L2) products generated with the latest version of the ensemble median algorithm (EMMA) and (ii) Level 3 (L3) products obtained by gridding the corresponding L2 EMMA products to obtain a monthly 5 ∘ x5 ∘ data product in Obs4MIPs (Observations for Model Intercomparisons Project) format. The L2 products consist of daily NetCDF (Network Common Data Form) files, which contain in addition to the main parameters, i.e., XCO2 or XCH4, corresponding uncertainty estimates for random and potential systematic uncertainties and the averaging kernel for each single (quality-filtered) satellite observation. We describe the algorithms used to generate these data products and present quality assessment results based on comparisons with Total Carbon Column Observing Network (TCCON) ground-based retrievals. We found that the XCO2 Level 2 data set at the TCCON validation sites can be characterized by the following figures of merit (the corresponding values for the Level 3 product are listed in brackets) - single-observation random error (1 σ ): 1.29 ppm (monthly: 1.18 ppm); global bias: 0.20 ppm (0.18 ppm); and spatiotemporal bias or relative accuracy (1 σ ): 0.66 ppm (0.70 ppm). The corresponding values for the XCH4 products are single-observation random error (1 σ ): 17.4 ppb (monthly: 8.7 ppb); global bias: −2.0 ppb (−2.9 ppb); and spatiotemporal bias (1 σ ): 5.0 ppb (4.9 ppb). It has also been found that the data products exhibit very good long-term stability as no significant long-term bias trend has been identified. The new data sets have also been used to derive annual XCO2 and XCH4 growth rates, which are in reasonable to good agreement with growth rates from the National Oceanic and Atmospheric Administration (NOAA) based on marine surface observations. The presented ECV data sets are available (from early 2020 onwards) via the Climate Data Store (CDS, https://cds.climate.copernicus.eu/, last access: 10 January 2020) of the Copernicus Climate Change Service (C3S, https://climate.copernicus.eu/, last access: 10 January 2020). Abstract. Satellite retrievals of column-averaged dry-air mole fractions of carbon dioxide (CO 2 ) and methane (CH 4 ), denoted XCO 2 and XCH 4 , respectively, have been used in recent years to obtain information on natural and anthropogenic sources and sinks and for other applications such as comparisons with climate models. Here we present new data sets based on merging several individual satellite data products in order to generate consistent long-term climate data records (CDRs) of these two Essential Climate Variables (ECVs). These ECV CDRs, which cover the time period 2003–2018, have been generated using an ensemble of data products from the satellite sensors SCIAMACHY/ENVISAT and TANSO-FTS/GOSAT and (for XCO 2 ) for the ﬁrst time also including data from the Orbiting Carbon Observatory 2 (OCO-2) satellite. Two types of products have been generated: (i) Level 2 (L2) products generated with the latest version of the ensemble median algorithm (EMMA) and (ii) Level 3 (L3) products obtained by gridding the corresponding L2 EMMA products to obtain a monthly 5 ◦ × 5 ◦ data product in Obs4MIPs (Observations for Model Intercomparisons Project) format. The L2 products consist of daily NetCDF (Network Common Data Form) ﬁles, which contain in addition to the main parameters, i.e., XCO 2 or XCH 4 , corresponding uncertainty estimates for random and potential systematic uncertainties and the averaging kernel for each single (quality-ﬁltered) satellite observation. We describe the algorithms used to generate these data products and present quality assessment results based on comparisons with Total Carbon Column Observing Network (TC-CON) ground-based retrievals. We found that the XCO 2 Level 2 data set at the TCCON validation sites can be characterized by the following ﬁgures of merit (the corresponding values for the Level 3 product are listed in brackets) – single-observation random error (1 σ ): 1.29 ppm (monthly: 1.18 ppm); global bias: 0.20 ppm (0.18 ppm); and spatiotemporal bias or relative accuracy (1 σ ): 0.66 ppm (0.70 ppm). The corresponding values for the XCH 4 products are single-observation random error (1 σ ): 17.4 ppb (monthly: 8.7 ppb); global bias: − 2 . 0 ppb ( − 2 . 9 ppb); and spatiotemporal bias (1 σ ): 5.0 ppb (4.9 ppb). It has also been found that the data products exhibit very good long-term stability as no signiﬁcant long-term bias trend has been identiﬁed. The new data sets have also been used to derive annual XCO 2 and XCH 4 growth rates, which are in reasonable to good agreement with growth rates from the National Oceanic and Atmospheric Administration (NOAA) based on marine surface observations. The presented ECV data sets are available (from early 2020 onwards) via the Climate Data Store (CDS, https://cds.climate.copernicus.eu/, last access: 10 January 2020) of the Copernicus Climate Change Service (C3S, https://climate.copernicus.eu/, last access: 10 January 2020).

Abstract. Satellite retrievals of column-averaged dry-air mole fractions of carbon dioxide (CO 2 ) and methane (CH 4 ), denoted XCO 2 and XCH 4 , respectively, have been used in recent years to obtain information on natural and anthropogenic sources and sinks and for other applications such as comparisons with climate models. Here we present new data sets based on merging several individual satellite data products in order to generate consistent long-term climate data records (CDRs) of these two Essential Climate Variables (ECVs). These ECV CDRs, which cover the time period 2003-2018, have been generated using an ensemble of data products from the satellite sensors SCIAMACHY/ENVISAT and TANSO-FTS/GOSAT and (for XCO 2 ) for the first time also including data from the Orbiting Carbon Observatory 2 (OCO-2) satellite. Two types of products have been generated: (i) Level 2 (L2) products generated with the latest version of the ensemble median algorithm (EMMA) and (ii) Level 3 (L3) products obtained by gridding the corresponding L2 EMMA products to obtain a monthly 5 • × 5 • data product in Obs4MIPs (Observations for Model Intercomparisons Project) format. The L2 products consist of daily NetCDF (Network Common Data Form) files, which contain in addition to the main parameters, i.e., XCO 2 or XCH 4 , corresponding uncertainty estimates for random and potential systematic uncertainties and the averaging kernel for each single (quality-filtered) satellite observation. We describe the algorithms used to generate these data products and present quality assessment results based on comparisons with Total Carbon Column Observing Network (TC-CON) ground-based retrievals. We found that the XCO 2 Level 2 data set at the TCCON validation sites can be characterized by the following figures of merit (the corresponding values for the Level 3 product are listed in brackets)single-observation random error (1σ ): 1.29 ppm (monthly: 1.18 ppm); global bias: 0.20 ppm (0.18 ppm); and spatiotemporal bias or relative accuracy (1σ ): 0.66 ppm (0.70 ppm). The corresponding values for the XCH 4 products are singleobservation random error (1σ ): 17.4 ppb (monthly: 8.7 ppb); global bias: −2.0 ppb (−2.9 ppb); and spatiotemporal bias (1σ ): 5.0 ppb (4.9 ppb). It has also been found that the data products exhibit very good long-term stability as no significant long-term bias trend has been identified. The new data sets have also been used to derive annual XCO 2 and XCH 4 growth rates, which are in reasonable to good agreement with growth rates from the National Oceanic and Atmospheric Administration (NOAA) based on marine surface observations. The presented ECV data sets are available (from early 2020 onwards) via the Climate Data Store (CDS, https://cds.climate.copernicus.eu/, last access: 10 January 2020) of the Copernicus Climate Change Service (C3S, https://climate.copernicus.eu/, last access: 10 January 2020).
The C3S satellite greenhouse gas (GHG) data set consists of single-sensor satellite data products and of merged (i.e., combined multi-sensor, multi-algorithm) data products. Here we present the latest version, version 4.1, of the merged Level 2 (L2) and merged Level 3 (L3) XCO 2 and XCH 4 data products, which cover the time period 2003-2018. The L2 products (XCO2_EMMA and XCH4_EMMA) have been compiled with the ensemble median algorithm (EMMA) originally proposed by Reuter et al. (2013) and recent modifications, which are described in Sect. 3.1. These products contain detailed information for each single observation (i.e., footprint or ground pixel) including time, latitude and longitude, the main parameter (i.e., XCO 2 or XCH 4 ), its stochastic uncertainty (e.g., due to instrument noise), an estimate of potential systematic uncertainties (e.g., due to spatial or temporal bias patterns), and its averaging kernel and corresponding a priori profile. The L3 products (XCO2_OBS4MIPS and XCH4_OBS4MIPS) are gridded products at monthly time and 5 • × 5 • spatial resolution in Obs4MIPs (Obser- Figure 1. Overview of the presented XCO 2 data set. Shown are time series over land for three latitude bands (global, black line; Northern Hemisphere, red; Southern Hemisphere, green) and global maps (half-yearly averages at 1 • ×1 • obtained by gridding (averaging) the merged Level 2, i.e., EMMA, product). See Sect. 4 for a detailed discussion. vations for Model Intercomparisons Project, https://www. earthsystemcog.org/projects/obs4mips/, last access: 10 January 2020) format. Figure 1 provides an overview of the resulting merged XCO 2 data product in terms of time series for three latitude bands and global maps and the similarly structured Fig. 2 shows the XCH 4 product. As can be seen, XCO 2 and XCH 4 are both increasing with time and exhibit seasonal fluctuations and spatial variations. The spatiotemporal characteristics of the merged data -e.g., the spatial sampling -reflect the characteristics of the underlying individual sensor satellite data (described in the data section, Sect. 2). Figures 1 and 2 are discussed in detail in the results section, Sect. 4. How these data products have been generated is described in the methods section, Sect. 3. A summary and conclusions are given in Sect. 5.

Data
In this section, we present an overview about the input data used to generate and validate the new XCO 2 and XCH 4 data products.

Satellite data
The input satellite data used to generate the merged satellite data products are individual satellite sensor Level 2 (L2) data products. Table 1 provides an overview about the satellite XCO 2 input data sets. As can be seen, in total eight XCO 2 L2 data products have been used to generate the merged L2 and Level 3 (L3) XCO 2 data products, each corresponding to a different combination of satellite sensor and retrieval algorithm. An overview about the time coverage of these input data products is presented in Fig. 3. As can be seen, the time period 2003 to March 2009 is only covered by one XCO 2 product, namely XCO 2 retrieved with the Bremen Optimal Estimation DOAS (BESD) algorithm (Reuter et al., 2010 from the SCIAMACHY/ENVISAT (Bovensmann et al., 1999) instrument. A second SCIAMACHY XCO 2 data product is available, which has been retrieved with the Weighting Function Modified Differential Optical Absorption Spectroscopy (WFM-DOAS or WFMD) algorithm , but this product is not used because the merging algorithm EMMA (Reuter et al., 2013, described in Sect. 3.1) requires one or more than two input data products (because the median of a set of elements is, according to our definition which avoids averaging, not defined for two elements). Therefore, one of the two products had to be selected, and the choice was the BESD product for XCO 2 because of somewhat higher data quality compared to the WFMD product  (note however that the WFMD product has the advantage of containing a larger number of observations). As can be seen from Table 1 and Fig. 3, several GOSAT input products have been used from April 2009 onwards and two OCO-2 XCO 2 products from September 2014 and May 2015 onwards. Note that additional algorithms/data products are available but have not been used as input, for example the GOSAT BESD XCO 2 product  and the OCO-2 RemoTeC XCO 2 product . These or other additional products may be added in future versions of the merged XCO 2 products. Note also that we always use the bias-corrected version of a data product, if available (some product files contain bias-corrected and uncorrected values). . Individual satellite sensor XCO 2 data products contributing to the merged XCO 2 data products (see Table 1 for details). The required minimum number of contributing products is shown by the grey area.
All listed satellites perform nadir (down-looking) and glint observations and provide radiance spectra covering the relevant CO 2 and CH 4 absorption bands located in the shortwave infrared (SWIR) part of the electromagnetic spectrum (around 1.6 and 2 µm) and also cover the O 2 A-band spectral region in the near-infrared (NIR, around 0.76 µm). All individual sensor input L2 data products have been generated using retrieval algorithms based on minimizing the difference between a modeled radiance spectrum and the observed spectrum by modifying so-called state vector elements (for details we refer to the references listed in Table 1; for additional information see also the Algorithm Theoretical Basis Documents (ATBDs); Buchwitz et al., 2019b, and. The exact definition of the state vector depends on the algorithm, but the general approach is based on the optimal estimation (Rodgers, 2000) formalism or similar approaches (see references in Table 1). Among the state vector elements is a representation of the CO 2 vertical profile but also other parameters to consider interfering gases (e.g., water vapor), surface reflection, atmospheric scattering, and other effects and parameters, which have an impact on the (interpretation of the) measured radiance spectrum. Table 2 and Fig. 4 provide an overview about the satellite XCH 4 L2 input data sets. As for XCO 2 , the time period 2003 to March 2009 is only covered by one SCIAMACHY data product. From April 2009 onwards several GOSAT XCH 4 products are available (see Table 2) and have been used to generate the merged XCH 4 data L2 and L3 data products. For future updates it is also planned to include XCH 4 from the Sentinel-5 Precursor (S5P) satellite (Veefkind et al., 2012), but S5P XCH 4 Schneising et al., 2019) has not yet been included as the time period covered by these products is currently quite short (less than 2 years). However, we aim to include S5P XCH 4 for one of the next updates of the merged methane products. Table 1. Satellite XCO 2 Level 2 (L2) data products used as input for the generation of the merged L2 and L3 XCO 2 version 4.1 data products. For products which have been generated in the framework of the CCI and C3S projects the corresponding product ID is listed (the other products are external products, which have been obtained from the corresponding websites; see Acknowledgements). Temporal coverage indicates the time coverage of the input data sets.

Ground-based data
The satellite data products have been validated by comparison with the XCO 2 and XCH 4 data products of the TCCON . TCCON is a network of ground-based Fourier transform spectrometers (FTSs) recording direct solar spectra in the NIR/SWIR spectral region. From these spectra, accurate and precise column-averaged abundances of CO 2 , CH 4 , and a number of other species are retrieved. The TCCON data products (version GGG2014) have been obtained via the TCCON data archive (https://tccondata.org/, last access: 15 July 2019). An overview about the used TC-CON sites is presented in Table 3.
In Sect. 4.3, we present annual XCO 2 and XCH 4 growth rates, which have been derived from the new XCO 2 and XCH 4 OBS4MIPS data products using the method described in . These growth rates are compared with growth rates derived from marine surface CO 2 and CH 4 observations, which have been obtained from the National Oceanic and Atmospheric Administration (NOAA) (for details including links and last access see Acknowledgements).

Merging algorithm EMMA
In order to generate the merged L2 products, the ensemble median algorithm is used, which is described in detail in Reuter et al. (2013). Therefore, we limit the description given here to a short overview of the latest version of the EMMA algorithm. To be specific, we initially describe the EMMA XCO 2 algorithm and explain differences relevant for XCH 4 at the end of this subsection. The EMMA XCO 2 data product consists of selected individual L2 soundings from the available individual sensor L2 input products (listed in Table 1). The EMMA L2 product is based on selecting the best soundings (i.e., single ground pixel observations) from the ensemble of individual sensor L2 products. Sounding selection is based on monthly time and 10 • × 10 • spatial intervals. To decide which individual product is selected for a given month and given grid cell, all input products are first gridded (monthly, 10 • × 10 • ) to consider the fact that the spatiotemporal sampling is different for each individual product (due to different satellite sensors and algorithm-dependent quality-filtering strategies). The selected product is the median in terms of average XCO 2 per month and grid cell (note that in case of an even number of products the product which is closest to the mean is selected). The median is used primarily to remove potential outliers. The advantage of the median is also (in contrast to, for example, the arithmetic mean) that no averaging or other modifications to the input data are required. In order for a grid cell to be assigned a valid value, the following criterion has to be fulfilled: a minimum number of data products having a standard error of the mean (SEOM) of less than 1 ppm has to be available (see grey area in Fig. 3). SEOM is defined by 1 n i σ 2 i , with σ i being the (scaled; see below) XCO 2 uncertainty of the ith out of n soundings.
This means that EMMA selects for each month and each 10 • × 10 • grid cell exactly one product of the available individual L2 input products and then transfers all relevant information (i.e., XCO 2 and its uncertainty, related averaging kernels and a priori profile, etc.) from the selected original L2 file into the corresponding daily EMMA L2 product file. This ensures that most of the original information from the selected individual product is also contained in the merged product.
However, some modifications are applied. In order to remove (or at least to minimize) the impact of different a priori assumptions, all products are converted to common a priori CO 2 vertical profiles (see Reuter et al., 2013, for details). The new a priori profiles are obtained from the simple empirical CO 2 model (SECM, Reuter et al., 2012). SECM is essentially an empirically found function with parameters optimized using a CO 2 model (CT2017; see below). The SECM model used here is referred to as SECM2018 and is an update of the SECM model described in Reuter et al. (2012). The main difference is that SECM2018 is using a recent version of NOAA's assimilation system Carbon-Tracker (Peters et al., 2007, with updates documented at: http://carbontracker.noaa.gov/, last access: 10 January 2020), namely CT2017.
SECM2018 is also used to correct for potential offsets between the individual data products by adding or subtracting a global offset (i.e., by using one constant offset value for each individual product applied globally and for the full time series). Time series of the individual data products before and after offset correction are shown in Fig. 5. Note that in Fig. 5 all data are relative to SECM2018, which is a very simple CO 2 model, and therefore all variations and trends seen in Fig. 5 are at least to some extent model errors. As can be seen from Fig. 5, the correction brings the individual data sets typically closer together without changing any of their other characteristics (e.g., their time dependence). But as can also be seen from Fig. 5, better agreement is only achieved on average, not necessarily for all products during the entire time period. For example, the GOSAT RemoTeC product (blue curve) during 2009-2012 exhibits a somewhat larger difference after the offset correction. The approximately 2 ppm (0.5 %) spike at the beginning of the time series is likely due to a positive bias of the underlying BESD data product, which has not been corrected due to the lack of reference data in this time period (see also the discussion of this aspect in . An obvious issue is also the approximately 1.5 ppm (0.4 %) discontinuity in the first half of 2014 of the PPDF-S (photon path length probability density function/simultaneous) product (light-green curve). Depending on application, this may be an issue when this product is used stand-alone, but this is not a problem for EMMA as EMMA identifies and ignores outliers.
Another modification applied to the individual L2 input products is a potential scaling of their reported uncertainty for the individual L2 soundings. The scaling factor has been chosen such that on average the uncertainty of the reported error is consistent with the standard deviation of satellite minus ground-based validation data differences (see Sect. 4.1 for the validation of the reported uncertainties via the uncertainty ratio).
In order to avoid that an individual input product, which has much more observations than the other products (such as OCO-2 compared to GOSAT), entirely dominates the EMMA product, a method has been implemented to prevent overweighting the contributions from individual L2 input data products. The method is based on limiting the number of L2 data points. For each grid cell and month, we perform the following steps: first, we compute SEOM for each algorithm. From these values, we compute the 25th percentile and divide it by The result is used as the minimum SEOM threshold. If SEOM of an individual algorithm is smaller than this threshold, a subset of soundings is randomly chosen such that SEOM becomes just larger than the threshold. If, for example, all σ i are 1 ppm, then SEOM simply becomes 1/ √ n. If in this case, for example, data from four algorithms were available with n 1 = 60, n 2 = 80, n 3 = 100, and n 4 = 1000, the SEOM threshold would become 1/ √ 2 n 3 , which would effectively limit the number of soundings of the fourth algorithm to 200 (chosen randomly).
In addition to the L2 information of the selected data products, EMMA stores the following diagnostic information for each selected sounding: identifier for the selected L2 algorithm and inter-algorithm spread (IAS) within the grid box of the sounding. Within each grid box, IAS is defined as the  algorithm-to-algorithm standard deviation of the grid box averages.
By how much each individual satellite XCO 2 data product contributes to the EMMA XCO 2 product is shown in Fig. 6. Figure 6a shows the relative data weight (RDW), and Fig. 6b shows the number of soundings per month. How the RDW is defined is explained in detail in Reuter et al. (2013). In short, the RDW is defined as the relative number of soundings weighted with the corresponding (square of the inverse) uncertainty. RDW is high if a (relatively) large number of soundings contribute to the EMMA product and if these soundings have (relatively) low uncertainty compared to the other contributing products. The RDW of a product is a measure of how much information on XCO 2 this product con-tributes to the EMMA product relative to the other contributing products. As can be seen from Fig. 6, the SCIAMACHY BESD product is the only product until early 2009, when the GOSAT time series starts. As can also be seen, OCO-2 dominates in terms of RDW and number of soundings from 2015 onwards. This is because OCO-2 provides much more data with typically better uncertainty compared to the other (GOSAT) product.
The EMMA L2 XCH 4 product has been generated similarly to the EMMA L2 XCO 2 product, i.e., using essentially the same method as described above. A difference is that the offset correction has been done with a CH 4 model instead of SECM2018. This model is the simple CH 4 climatological model (SC4C), and we use the year 2018 update referred to as SC4C2018 in the following. The SC4C2018 model is similar to SECM2018 but for XCH 4 . It is a model-based CH 4 climatology adjusted for the annual growth rate (note that this model has also been used as the climatological training and a calibration data set as described in Schneising et al., 2019). The EMMA algorithm SEOM limit controlling the minimum number of data points per grid box, month, and algorithm has been set to 12 ppb for XCH 4 . The impact of the offset correction for merging the XCH 4 products is shown in Fig. 7. Note that in Fig. 7 all data are relative to SC4C2018, which is a very simple CH 4 model, and therefore all variations and trends seen in Fig. 7 are at least to some extent model errors. As for CO 2 (Fig. 5) the offset correction typically brings the various XCH 4 products closer together but does not change any of their other characteristics. The PPDF-S product suffers from a discontinuity (of 8 ppb or 0.4 %) in the first half of 2014 (see above for a similar problem for PPDF-S XCO 2 ). Figure 8a shows the RDW, and Fig. 8b shows the number of soundings per month for all individual sensor XCH 4 products contributing to the XCH 4 EMMA product. Until early 2009, the SCIAMACHY WFMD product is the only product contributing to the EMMA product. Note that the RDW of the SCIAMACHY products drops at the end of 2005 (in contrast to the absolute number of soundings per month). The reason is the increase in the uncertainty of this product due to detector degradation (see, e.g., Schneising et al., 2011, for details). As can also be seen, the two GOSAT proxy (PR) products (i.e., CH4_GOS_OCPR and CH4_GOS_SRPR) dominate the XCH 4 EMMA product because they contain more soundings compared to the other (GOSAT) data products.

Algorithm to generate the Level 3 OBS4MIPS products
The version 4.1 L3 XCO2_OBS4MIPS and XCH4_OBS4MIPS data products have been obtained by gridding (averaging) the version 4.1 L2, i.e., XCO2_EMMA and XCH4_EMMA, products using monthly time and 5 • ×5 • spatial resolution. The algorithm for the generation of the OBS4MIPS products is described in Reuter et al. (2019b). Therefore, we here provide only a short overview.
For each individual product, the gridding is based on computing an arithmetic, unweighted average of all soundings falling in a grid box.For each grid box, the standard error of the mean is computed using the uncertainties contained in the corresponding EMMA product files. In order to reduce noise at least two individual observations must be present and the resulting standard error of the mean must be less than 1.6 ppm for XCO 2 and less than 12 ppb for XCH 4 .
Besides XCO 2 or XCH 4 , the final L3 product also includes (per grid box and month) the number of soundings used for averaging; the average column-averaging kernel; the average a priori profile; the standard deviation of the averaged XCO 2 or XCH 4 values; and an estimate for the total uncertainty computed as the root sum square of two values, where one value is SEOM and the other value is IAS as computed by EMMA. For cases including only one algorithm, the second value is replaced by quadratically adding spatial and seasonal accuracy determined from the TCCON validation.

Validation method
The validation of the merged satellite-derived XCO 2 and XCH 4 data products is based on comparisons with groundbased XCO 2 and XCH 4 TCCON observations (using version GGG2014). We present results from two somewhat different validation methods (the EMMA method, Reuter et al., 2013; and the QA/QC method, Buchwitz et al., 2017b; see below), which are similar to other validation methods used in recent years (e.g., Butz et al., 2010;Cogan et al., 2012;Dils et al., 2014;O'Dell et al., 2018;Parker et al., 2011). These methods differ with respect to details such as the chosen colocation criterion, whether the data are brought to a common a priori or not, and if yes which a priori has been used. In the following, we will highlight some of these details as relevant for the two validation methods used for this paper.
Both methods used for the validation of the L2 EMMA products are based on colocating each individual satellite XCO 2 (or XCH 4 ) observation with a corresponding value obtained from TCCON using predefined spatial and temporal colocation criteria (see below). The comparisons take into account different a priori assumptions regarding the vertical profiles of CO 2 (or CH 4 ) as used for the generation of the L2 input products by converting either the satellite data (QA/QC method) or the TCCON data (EMMA method) to a common a priori. This a priori correction is based on using the satellite averaging kernels and a priori profiles, which are contained (for each single observation) in the EMMA product files. The magnitude of the a priori correction (the explicit formula is shown as Eq. 3 in Dils et al., 2014) depends on the deviation (difference) of the averaging kernel from unity and on the difference of the a priori profiles. Because the averaging kernel profiles are typically close to unity (note that both satellite and the TCCON retrievals correspond to cloud-free  conditions) and because the a priori profiles are not totally unrealistic, the a priori correction is typically very small (approximately 0.1 ppm for XCO 2 and 1 ppb for XCH 4 ).
The first validation method is the EMMA quality assessment method, which is described in Reuter et al. (2013). Note that EMMA is not only a merging method but also a data quality assessment method, as the assessment of the quality of all satellite input data (listed in Tables 1 and 2) is a key aspect of EMMA. The second method is the quality as-sessment/quality control (QA/QC) method , which is applied to all satellite XCO 2 and XCH 4 data products generated for the Copernicus Climate Change Service (C3S), i.e., to the merged products but also to all the individual sensor CCI/C3S L2 input products, which are also available via the Copernicus Climate Data Store (CDS) (see products with CCI/C3S product ID listed in Tables 1 and 2).
Key differences between the QA/QC method and the EMMA method are listed as follows.  Table 1 for details), (ii) EMMA XCO 2 (bottom left), and (iii) the inter-algorithm spread (IAS, 1σ ) as computed by EMMA (bottom right; see main text for details). Also shown in the bottom-right panel are the locations of the TCCON sites (pink triangles) and the range of IAS values covered by them (see color bar). Note that the OCO-2 maps (row 4) are empty because this satellite was launched after April 2011 (see Fig. 10 for OCO-2 XCO 2 ).
-Colocation criteria: QA/QC used ±2 • latitude and ±4 • longitude as the spatial colocation criterion, but EMMA used 500 km (both methods use the same temporal colocation criterion of 2 h).
-Filtering criterion surface elevation: EMMA requires a surface elevation difference of less than 250 m between a TCCON site and satellite footprints, whereas the QA/QC does not use this filtering criterion.
-A priori correction: both methods correct for the use of different a priori CO 2 vertical profiles in the various retrieval algorithms, but QA/QC uses the TCCON a priori as common a priori, whereas EMMA uses the SECM2018 model for CO 2 and the SC4C2018 model for CH 4 (see Sect. 3.1).
-Approach to quantify seasonal bias and linear bias trend: the EMMA method is based on fitting a trend model, which includes an offset term, a slope term, and a sine term for seasonal fluctuations (see Reuter et al., 2019c) and computes the seasonal bias from the standard deviation of the fitted seasonal fluctuation term and obtains the bias trend and its uncertainty from the fitted slope term. The QA/QC method (Buchwitz et al., 2019a) uses (only) a linear fit to obtain the bias trend Figure 10. As Fig. 9 but for April 2015. Note that the SCIAMACHY/BESD map (top left) is empty because this product ended in April 2012 (see Fig. 9 for SCIAMACHY/BESD XCO 2 ). and its uncertainty and computes the seasonal bias from the standard deviation of the seasonal biases (as also done by Dils et al., 2014, for their quantity seasonality).
-Criteria for enough data: both algorithms use several different thresholds for the required minimum number of colocations per TCCON site and minimum length of overlapping TCCON time series.
Despite all these differences, quite similar overall figures of merit have been obtained with both methods (see results section, Sect. 4). This indicates that the overall data quality results do not critically depend on the details of the assessment method (the same conclusion has also been reported for earlier comparisons of results from different assessment methods, e.g., Buchwitz et al., 2015Buchwitz et al., , 2017b.  (Fig. 9) and April 2015 (Fig. 10) (note that 2011 is the last full year with data from SCIAMACHY and that 2015 is the first full year with OCO-2 data). The maps in the first four rows of Figs. 9 and 10 show the individual sensor/algorithm L2 input data. As can be seen, the spatial XCO 2 patterns are quite similar (e.g., north-south gradient),  Figure 11. Average XCO 2 inter-algorithm spread (1σ ) during 2003-2018. As can be seen, the scatter is typically around 1 ppm except over parts of the tropics (in particular central Africa), the Himalayas, and at high latitudes, where the scatter can be larger.
but there are also significant differences, especially with respect to the spatial coverage. The spatial coverage depends on time and is related to the different satellite instruments but also due to algorithm-dependent quality filtering. The largest differences are between the SCIAMACHY BESD product (top left in Fig. 9) compared to the other products, as the SCIAMACHY product is limited to observations over land, whereas the GOSAT and OCO-2 products also have some ocean coverage due to the ocean-glint mode, which permits the acquisition of an adequate signal (and therefore also signal-to-noise ratio) also over the ocean (note that the reflectivity of water is poor outside of sunglint conditions in the used SWIR spectral regions around 1.6 and 2 µm). The EMMA product is shown in the bottom-left panels of Figs. 9 and 10, and in the bottom-right panel IAS is shown, which quantifies the level of agreement (or disagreement) among the various satellite input data sets. The IAS maps also show the location of the TCCON sites (pink triangles) and the IAS values at the TCCON sites (see pink triangles above the color bar). As can be seen, the TCCON sites are typically located outside of regions where the IAS is highest. The average IAS for the entire time period 2003-2018 is shown in Fig. 11. As can be seen, the scatter is typically in the range 0.6-1.1 ppm with the exception of parts of the tropics, in particular central Africa, the Himalayas, parts of southeast Asia, and high latitudes. High latitudes typically correspond to large solar zenith angles, which is a challenge for accurate satellite XCO 2 retrievals, as this typically corresponds to low signal and therefore low signal-to-noise ratio resulting in enhanced scatter of the retrieved XCO 2 . In areas with frequent cloud coverage, such as parts of the tropics, sampling is sparse and this may also contribute to a larger scatter.
Detailed validation results for all individual sensors and the EMMA XCO 2 Level 2 data products are shown in Appendix A (Fig. A1) for all TCCON sites. The validation results are summarized in Table 4 (per site) and Table 5 (overall) together with the corresponding results of the QA/QC assessment method. Table 4 lists all TCCON sites, which fulfill either the EMMA method or the QA/QC method criteria with respect to a minimum number of colocations and length of time series. Listed are the numerical values (in ppm), which have been computed for several figures of merit. This includes (i) the overall estimation of the single-observation random error computed as the standard deviation of the satellite minus TCCON differences; (ii) the uncertainty ratio, which is the ratio of the mean value of the reported (1σ ) uncertainty to the standard deviation of the satellite-TCCON difference (computed to validate the reported uncertainties); (iii) the overall bias computed as the mean value of the satellite-TCCON differences; and (iv) the seasonal bias, computed as the standard deviation of the biases determined for the four seasons. Also shown in the last two rows are the mean value and the standard deviation of the values listed per TCCON site in the rows above. Several of these values have been used to compute the values listed in Table 5, which shows the overall summary of the quality assessment. Table 5 lists (i) the mean value of the single-observation random error, (ii) the global bias computed as the mean value of the biases at the various TCCON sites, (iii) the regional bias computed as the standard deviation of the biases at the various TCCON sites, (iv) the mean seasonal bias, and (v) the spatiotemporal bias computed as the root sum square of the regional and of the seasonal bias. The spatiotemporal bias is used to quantify the achieved performance for relative accuracy, which characterizes the spatially and temporally vary- ing component of the bias (i.e., neglects a possible global bias (global offset), which is reported separately).
The linear bias trend has also been computed by fitting a line to the satellite-TCCON differences (not shown here). The mean value of the linear trend (slope) and its uncertainty (1σ , obtained from the standard deviation of the slope at the various TCCON sites) are −0.05 ± 0.06 ppm yr −1 for the EMMA method and −0.06 ± 0.09 ppm yr −1 for the QA/QC method. This means that no significant long-term bias trend has been detected; i.e., the satellite product is stable.
As can be seen from Table 5, the values computed independently using the EMMA and the QA/QC assessment methods are quite similar, which gives not only confidence in the overall quality assessment summary documented in Table 5 but also in the products and the used validation methods.
Note however that the quality of the satellite data (at least at TCCON sites) is very likely better than Table 5 suggests (i) because the TCCON retrievals are not free of errors (the 1σ XCO 2 uncertainty is about 0.4 ppm; Wunch et al., 2010) and (ii) because of the representation error originating from the (real) spatiotemporal variability of XCO 2 around the TC-CON sites. The overall error related to this is difficult to quantify, but some indication can potentially be obtained by additional assessment results such as the one shown in Fig. 12. Figure 12 shows the biases as obtained with the EMMA method at the various TCCON sites used for the EMMA method comparisons. Shown are not only the mean satellite-TCCON differences as obtained for the EMMA product but also for all the individual sensor/algorithm input products. The differences are shown as anomalies with respect to the mean; i.e., the sum of the differences in each row is zero. This is equivalent to assuming that for a given satellite product the mean value over all TCCON sites is zero. As can be seen from Fig. 12, the satellite-TCCON differences are dominantly positive (orange and red colors) for higherlatitude TCCON sites and mostly negative (blue colors) for lower-latitude TCCON sites. In order to rule out that this is an artifact of the EMMA assessment method, the overall biases computed with the QA/QC method and biases computed Table 6. TCCON XCO 2 bias in parts per million (ppm; satellite -TCCON). "-" means that the number of available colocations is less than the threshold required by the corresponding assessment method. Note that this table includes only a subset of the 10 sites shown in Fig. 12, namely only those sites with a mean bias being considerably (more than 1.5 times) larger than the standard deviation of the biases. by the individual product data providers (DPs) have also been derived. These biases have been used to compute -for each of the 10 TCCON sites shown in Fig. 12 -the mean bias and the standard deviation of these biases. For 4 of these 10 sites the mean bias is considerably (more than 1.5 times) larger than the standard deviation of the biases, and the corresponding results for these four sites are shown in Table 6. This does not necessarily mean that these sites have the largest biases. This only means that the derived biases at these sites are (independent of their magnitude) the most consistent across all satellite products used for comparison. As can be seen from Table 6, the biases are always positive at Sodankylä, Karlsruhe, and Orléans and always negative at Lamont. Note that this does not imply that all derived biases are significant as some biases are very small, e.g., the FOCAL bias at Sodankylä, which is only 0.02 ppm. Because it is unlikely that all three satellites and several retrieval algorithms produce XCO 2 products with similar biases at a given TCCON site, this provides an indication of biases either due to representation errors or due to biases within the TCCON data ( Table 6). Note that these biases are within the accuracy stated by TC-CON, which is 0.8 ppm (2σ ) (Wunch et al., 2010, Hedelius et al., 2017. The accuracy of the TCCON data will be improved for the next data release (planned for 2020). This new TCCON data set will allow for better identification of the causes for the observed biases.
The XCO2_OBS4MIPS product has also been directly compared with TCCON using a comparison method based on the comparison of the monthly satellite product with TC-CON monthly mean values. The results are shown in Fig. 13. As can be seen, the mean difference (satellite -TCCON) is 0.18 ppm (which is close to the mean value of the global bias of 0.20 ppm listed in Table 5), the standard deviation is 1.18 ppm (as expected, because of the spatiotemporal averaging, which is somewhat smaller than the value of 1.29 ppm obtained for the XCO2_EMMA product listed in Table 5), and the linear correlation coefficient is 0.99. The spatiotemporal bias, computed as the standard deviation of 3-monthly averages at the TCCON sites listed in Fig. 13, is 0.7 ppm. Figure 1 presents an overview of the XCO 2 data product in terms of time series for three latitude bands and global maps. XCO 2 is increasing almost linearly during the 16-year time period (for a discussion of the derived annual growth rates see Sect. 4.3). The main reason for this increase is CO 2 emission due to burning of fossil fuels (Le Quéré et al., 2018). The seasonal cycle, which is caused primarily by quasi-regular uptake and release of atmospheric CO 2 by the terrestrial vegetation due to photosynthesis and respiration (e.g., Kaminski et al., 2017, Yin et al., 2018, is most pronounced over the Northern Hemisphere. The half-yearly maps for 2003 are based on SCIAMACHY on board ENVISAT (Burrows et al., 1995;Bovensmann et al., 1999) satellite data, and the maps for 2018 contain data from the GOSAT (since 2009) (Kuze et  al., 2016) and OCO-2 (since 2014) (Crisp et al., 2004) satellites. GOSAT and OCO-2 also provide good-quality XCO 2 retrievals over the oceans due to their sunglint observation mode.
4.2 Products XCH4_EMMA and XCH4_OBS4MIPS (v4.1) As for XCO 2 , monthly maps have also been generated for the EMMA XCH 4 data product. Two examples are shown in Fig. 14 for September 2010 and in Fig. 15 for September 2018. The individual sensor XCH 4 input data are shown in the first four rows, and the EMMA XCH 4 product is shown in the bottom-left panel. The bottom-right panel shows the IAS. As can be seen, the spatial patterns of the XCH 4 maps are similar but not identical. The IAS shows a quite large variability. The scatter is larger compared to the corresponding XCO 2 IAS (Figs. 9 and 10, bottom-right panels), and spatially the grid cells with larger spread are more equally distributed over the globe but with largest differences over the southern part of Asia.
Detailed validation results are shown in Appendix A (Fig. A2), and the validation results are summarized in Tables 7 and 8, which have the same structure as the corresponding XCO 2 tables (Tables 4 and 5). These tables also list the results of the QA/QC assessment method, which results in quite similar (within a few ppb) overall quality assessment results (Table 8) as obtained with the EMMA method. The linear bias trend has also been computed by fitting a line to the satellite-TCCON differences (not shown here). The mean value of the linear trend (slope) and its uncertainty (1σ , obtained from the standard deviation of the slope at the various TCCON sites) are −0.1 ± 0.4 ppb yr −1 for the EMMA method and 0.5±0.8 ppb yr −1 for the QA/QC method. As for XCO 2 , this means that no significant long-term bias trend has been detected; i.e., the satellite product is stable. Figure 16 shows the TCCON station XCH 4 bias anomaly as also shown for XCO 2 in Fig. 12; i.e., Fig. 16 shows the biases as obtained with the EMMA method at the various TCCON sites used for the EMMA method comparisons. As for XCO 2 not only the mean satellite-TCCON differences as obtained for the EMMA product are shown but also the differences for all the individual sensor/algorithm input products. The differences are shown as anomalies with respect to the mean; i.e., the sum of the differences in each row is zero. As can be seen from Fig. 16, the pattern of satellite-TCCON XCH 4 differences has some similarity with the XCO 2 difference pattern shown in Fig. 12. For example, the differences are mostly positive at Sodankylä and Garmisch-Partenkirchen and mostly negative at Darwin and Wollongong. But there are also significant differences, for example, Figure 12. Average XCO 2 differences (satellite -TCCON) for the different satellite XCO 2 products at 10 TCCON sites as used by the EMMA assessment method. The differences are shown as anomalies; i.e., the sum of the values corresponding to a given row is zero. Note that here "ACOS" refers to NASA's ACOS algorithm as applied to GOSAT and that "NASA" refers to NASA's ACOS algorithm as applied to OCO-2. Figure 13. Summary of the comparison of product XCO2_OBS4MIPS with TCCON monthly mean XCO 2 (each symbol corresponds to one month and to one TCCON site; each color corresponds to a different TCCON site; TCCON site colors and site IDs (see Table 3) are shown on the right). The comparison is based on 1446 monthly values. The mean difference (satellite -TCCON) is 0.18 ppm and the standard deviation of the difference is 1.18 ppm. The linear correlation coefficient R is 0.99. with respect to the sign of the bias (e.g., Park Falls, Bremen, Karlsruhe).
The XCH4_OBS4MIPS product has also been directly compared with TCCON (Fig. 17) using the same method as also used for product XCO2_OBS4MIPS (Fig. 13). As can be seen from Fig. 17, the mean difference (satellite -TCCON) is −2.88 ppb (which is close to the mean value of the global bias of −2.0 ppb of product XCH4_EMMA listed in Table 8), the standard deviation is 8.65 ppb (as expected, because of the averaging, which is somewhat smaller than the value of 17.4 ppb obtained for the XCH4_EMMA product listed in Table 8), and the linear correlation coefficient is 0.97. Figure 2 presents an overview of the XCH 4 data product in terms of time series for three latitude bands and global maps. As can be seen, XCH 4 was nearly constant during 2003-2006 (apart from seasonal fluctuations) but has been increasing since 2007 (for a discussion of the trend and annual growth rates see Sect. 4.3). The reason for this is likely a combination of increasing natural (e.g., wetlands) and anthropogenic (e.g., fossil fuel related) emissions and possibly decreasing sinks (hydroxyl, OH, radical), but it does not seem currently possible to be more definitive (e.g., Worden et al., 2017;Nisbet et al., 2019;Turner et al., 2019;Howarth, 2019;Schaefer, 2019).

Annual growth rates
Finally, we present an update and extension of the year 2003-2016 annual XCO 2 growth rates shown in , using the new OBS4MIPS v4.1 XCO 2 data set covering the time period 2003-2018 (Fig. 18). Figure 18a shows the time series of the globally averaged OBS4MIPS version 4.1 XCO 2 data product over land. In contrast to , the analysis presented here is based on data over land only as this permits the generation of a time series with better internal consistency (note that the XCO 2 OBS4MIPS product is land only for [2003][2004][2005][2006][2007][2008]. The average growth rate during 2010-2018, i.e., for the time period where an ensemble of GOSAT and OCO-2 data has been used, is 2.28 ± 0.04 ppm yr −1 . As can be seen from Fig. 18b, the year 2017 and 2018 growth rates are less than the growth rates of the years 2015 and 2016, which were years with a strong El Niño. The XCO 2 growth rates are in reasonable agreement with the global CO 2 growth rates published by National Oceanic and Atmospheric Ad- Figure 14. September 2010 XCH 4 at 10 • × 10 • spatial resolution showing (i) the individual sensor/algorithm input data sets (panels in rows 1-4; see Table 2 for details), (ii) EMMA XCH 4 (bottom left), and (iii) the inter-algorithm spread (IAS, 1σ ) as computed by EMMA (bottom right; see main text for details). Also shown in the bottom-right panel are the locations of the TCCON sites (pink triangles) and the range of IAS values covered by them (see color bar). ministration (NOAA) (shown in blue color in Fig. 18b), which are based on marine surface CO 2 observations (ftp: //aftp.cmdl.noaa.gov/products/trends/co2/co2_gr_gl.txt; last access: 30 July 2019). As can be seen from Fig. 18b, the agreement of the satellite-derived XCO 2 growth rates with the NOAA surface-CO 2 -based growth rates is better from year 2010 onwards compared to the time period before when the EMMA data set consists only of one SCIAMACHY data set instead of the full ensemble. For 2018, the XCO 2 growth rate is 2.1 ± 0.5 ppm yr −1 , which is lower than the NOAA surface CO 2 growth rate of 2.43 ± 0.08 ppm yr −1 . Note that the 1σ uncertainty ranges of the two growth rate estimates overlap, which indicates that the two growth rate estimates are consistent.
The growth rate of atmospheric methane is also an important quantity (e.g., Nisbet et al., 2019). The method of  has now also been used to compute annual XCH 4 growth rates from satellite XCH 4 retrievals. Figure 19a shows the time series of the globally averaged OBS4MIPS version 4.1 XCH 4 data product over land. As shown by the linear fit, the average growth rate is 7.9 ± 0.2 ppb yr −1 during 2010-2018, i.e., for the time period where an ensemble of GOSAT data has been used. The annual growth rates are shown in Fig. 19b for the satellitederived XCH 4 (red) and for the NOAA growth rates (ftp: //aftp.cmdl.noaa.gov/products/trends/ch4/ch4_gr_gl.txt; last access: 30 July 2019) derived from marine surface CH 4 observations. For 2018, the XCH 4 growth rate is 10 ± 6 ppb yr −1 , which is close to the NOAA surface CH 4 growth rate of 9.46 ± 0.56 ppb yr −1 .

Summary and conclusions
Satellite-derived ensemble XCO 2 and XCH 4 data products have been generated and validated. These data products are the version 4.1 Level 2 (L2) products XCO2_EMMA and XCH4_EMMA and the Level 3 (L3) products XCO2_OBS4MIPS and XCH4_OBS4MIPS and cover the time period 2003-2018. The data products are freely avail-able for interested users via the Copernicus Climate Data Store (CDS, https://cds.climate.copernicus.eu/, last access: 10 January 2020), where also earlier versions of these data products are accessible. The L2 products have been generated with an adapted version of the EMMA algorithm (Reuter et al., 2013), and the L3 products have been generated by gridding (averaging) the EMMA L2 product to obtain products at monthly time and 5 • × 5 • spatial resolution in Obs4MIPS format. The products have been validated by comparisons with TCCON ground-based XCO 2 and XCH 4 retrievals using TCCON version GGG2014.
From January 2003 to March 2009 the products are based on SCIAMACHY/ENVISAT, and from April 2009 onwards the products use an ensemble of one SCIAMACHY (until Figure 16. As Fig. 12 but for XCH 4 , i.e., average XCH 4 differences (satellite -TCCON) for the different satellite XCH 4 products at 10 TCCON sites as used by the EMMA assessment method. The differences are shown as anomalies; i.e., the sum of the values corresponding to a given row is zero. Figure 17. Summary of the comparison of product XCH4_OBS4MIPS with TCCON monthly mean XCH 4 . The comparison is based on 1518 monthly values. The mean difference (satellite -TCCON) is −2.88 ppb and the standard deviation of the difference is 8.65 ppb. The linear correlation coefficient R is 0.97. early 2012) and several GOSAT products. The XCO 2 products contain in addition L2 products from NASA's OCO-2 mission from September 2014 onwards.
The EMMA algorithm selects for each month and each 10 • × 10 • grid cell one of the available products, i.e., one from the existing ensemble of L2 input products, and transfers all relevant information (including averaging kernel etc.) from the selected L2 input product into the merged EMMA L2 product. The selected product is the median product. The main purpose of EMMA is to generate a Level 2 product, which covers an as-long-as-possible time series (longer than any of the individual sensor input data sets) with as-high-aspossible accuracy including all information needed, e.g., for surface flux inverse modeling. The median approach helps to reduce the occurrence of potential outliers and thus reduces spatial and temporal biases in the generated data products.
Detailed quality assessment results based on comparisons with TCCON ground-based retrievals have been presented. We found that the XCO 2 Level 2 data set at the TC-CON validation sites can be characterized by the following figures of merit (the corresponding values for the Level 3 product are listed in brackets) -single-observation random error (1σ ): 1.29 ppm (monthly: 1.18 ppm); global bias: 0.20 ppm (0.18 ppm); and spatiotemporal bias or relative accuracy (1σ ): 0.66 ppm (0.70 ppm). The corresponding values for the XCH 4 products are single-observation random error (1σ ): 17.4 ppb (monthly: 8.7 ppb); global bias: −2.0 ppb (−2.9 ppb), spatiotemporal bias (1σ ): 5.0 ppb (4.9 ppb). It has also been found that the data products exhibit very good long-term stability as no significant linear bias trends have been identified.
The new data sets have also been used to derive annual XCO 2 and XCH 4 growth rates, which are in reasonable to good agreement with growth rates from the National Oceanic and Atmospheric Administration (NOAA) based on marine surface observations. An important application for the EMMA products is to use them together with inverse modeling to obtain improved information on regional-scale CO 2 (e.g., Houweling et al., 2015) and CH 4 (e.g., Alexe et al., 2015) surface fluxes. Applications for the corresponding OBS4MIPS products are, for example, climate model comparisons (e.g., Lauer et al., 2017) and studies related to annual growth rates (e.g., . It is however important to note that these merged products are not necessarily the most optimal products for all applications as they do not contain all data from a given satellite sensor. For example, users interested primarily in emissions from power plants or other localized CO 2 sources will prefer the original OCO-2 Level 2 data product (e.g., Nassar et al., 2017;Reuter et al., 2019a). Especially for users interested in only parts of the time series it is recommended to use the individual sensor products in addition to the merged product as this may significantly increase the robustness, reliability, and uncertainty characterization of key findings.  The corresponding annual mean XCH 4 values are also listed. The increase during 2010-2018 is 7.9 ± 0.2 ppb yr −1 as obtained via a linear fit. (b) Annual XCH 4 growth rates (red, with 1σ uncertainties; the corresponding numerical values are also listed with 1σ uncertainty in brackets) and CH 4 growth rates from NOAA (shown in blue) obtained from marine surface CH 4 observations.

Appendix A
In this appendix, detailed validation results are shown for the individual sensor and EMMA XCO 2 and XCH 4 Level 2 data products.
The comparison of the various XCO 2 data products with TCCON XCO 2 at 10 TCCON sites is shown in Fig. A1. These 10 TCCON sites fulfill the EMMA criteria in terms of a sufficiently large number of colocations as defined to obtain robust conclusions per site. The individual soundings of the EMMA XCO 2 product are shown as white circles with a black border. As can be seen, they are located within (mostly close to the center of) the range of values of the individual sensor/algorithm XCO 2 values, which is expected. Figure A2 shows the comparison of the EMMA XCH 4 product (white circles with a black border) and of the individual sensor XCH 4 input products with TCCON XCH 4 originating from the EMMA assessment method. As for the EMMA XCO 2 product (Fig. A1), the EMMA XCH 4 is located near the center of the clouds of XCH 4 values, as expected. Figure A1. XCO 2 time series at 10 TCCON sites during January 2009-December 2018 as obtained using the EMMA quality assessment method. TCCON GGG2014 XCO 2 is shown as thick black dots, the individual satellite L2 input products are shown as colored dots, and the EMMA product is shown as white circles with black borders. The derived numerical values are listed in Table 4. Figure A2. XCH 4 time series at 10 TCCON sites during April 2010-December 2018 as obtained using the EMMA quality assessment method. TCCON GGG2014 XCH 4 is shown as thick black dots, the individual satellite L2 input products are shown as colored dots, and the EMMA product is shown as a white circles with black borders. The derived numerical values are listed in Table 7.
Acknowledgements. The generation of the EMMA Level 2 and OBS4MIPS Level 3 data sets and the corresponding data analysis has been funded primarily by the European Union (EU) via the Copernicus Climate Change Service (C3S, https://climate. copernicus.eu/, last access: 10 January 2020) managed by the European Centre for Medium-Range Weather Forecasts (ECMWF).
The further development of the FOCAL retrieval algorithm used to generate the OCO-2/FOCAL XCO 2 input data product would not have been possible without cofunding from the European Union (EU) Horizon 2020 (H2020) research and innovation program projects CHE (grant agreement no. 776186) and VERIFY (grant agreement no. 776810). The generation of the XCO2_OBS4MIPS product also benefited from cofunding from EU H2020 project CCiCC (grant agreement no. 821003).
We thank several space agencies for making available satellite Level 1 (L1) input data: ESA/DLR for SCIAMACHY L1 data, JAXA for GOSAT Level 1B data, and NASA for the OCO-2 L1 data product. We also thank ESA for making the GOSAT L1 product available via the ESA Third Party Mission (TPM) archive.
The TCCON stations Ascension Island, Bremen, Garmisch, Karlsruhe, and Ny-Ålesund have been supported by the German Bundesministerium für Wirtschaft und Energie (BMWi) via the DLR under grants 50EE1711A-E. We thank the ESA Ariane tracking station at North East Bay, Ascension Island, for hosting and local support. Nicholas M. Deutscher is supported by an ARC Future Fellowship, FT180100327. The TCCON site at Réunion island is operated by the Royal Belgian Institute for Space Aeronomy with financial support in 2014,2015,2016,2017,2018, and 2019 under the EU project ICOS-Inwire and the ministerial decree for ICOS (FR/35/IC2) and local activities supported by LACy/UMR8105 -Université de La Réunion. The TCCON stations at Tsukuba and Burgos are supported in part by the GOSAT series project. Local support for Burgos is provided by the Energy Development Corporation (EDC, Philippines). The Paris TCCON site has received funding from Sorbonne Université, the French research center CNRS, the French space agency CNES, and Région Île-de-France.
We also thank Peter Bergamaschi for providing MACC-II project inversion system CH 4 fields, which have been used as input for the SC4C2018 model. The article processing charges for this open-access publication were covered by the University of Bremen.
Review statement. This paper was edited by John Worden and reviewed by Ray Nassar and two anonymous referees.