Monitoring the Tropospheric Monitoring Instrument (TROPOMI) short-wave infrared (SWIR) module instrument stability using desert sites

Since its launch in 2017, Tropospheric Monitoring Instrument (TROPOMI) on S-5P has provided very high quality data using daily global coverage for a number of key atmospheric trace gases. Over its first 1000 d in operation, the short-wave infrared (SWIR) module has been very stable, and the continuously monitored calibration has remained of high quality. This calibration relies on a combination of extensive pre-launch and post-launch measurements, complemented by regular monitoring of internal light sources and background measurements. In this paper we present a method and results for independent validation of the SWIR module calibration and instrument stability by examining the signal stability of a sample of 23 pseudo-invariant calibration desert sites. The data covers over 2 years of operational data. With a Lambertian surface assumption, the results show that the SWIR module has little to no instrument degradation down to an accuracy of about 0.3 % yr−1, validating results obtained from the internal calibration suite. The method presented here will be used as ongoing validation of the SWIR calibration.


Introduction
The Tropospheric Monitoring Instrument (TROPOMI 1 ) is the sole instrument on the Sentinal-5 Precursor (S5-P) mission (better known as S-5P; Veefkind et al., 2012). S5-P is part of the ESA Copernicus programme and is the first mission covering the atmospheric composition. Its aim is to quantify specific trace-gas column densities. With the very wide swath opening of more than 60 • , TROPOMI can provide daily global coverage. The four channels (UV, UVIS, NIR and SWIR) give an unprecedent view of atmospheric composition and evolution on timescales of days to years. Launched in late 2017, TROPOMI started regular operations in April 2018 after in-flight commissioning and science verification. The instrument has been shown to be very stable with only limited degradation Ludewig et al., 2020). The short-wave infrared (SWIR) module covers a wavelength range between 2300 and 2380 nm and has been designed for the carbon monoxide (CO) and methane (CH 4 ) products, (e.g. Hu et al., 2018;Borsdorff et al., 2018;Pandey et al., 2019;Zhang et al., 2020).
Since the start of nominal operations, TROPOMI has been in excellent health, and the calibration has remained stable, in particular the SWIR module Ludewig et al., 2020). For its calibration, TROPOMI relies on the results from the on-ground campaign (Kleipool et al., 2018;Tol et al., 2018;van Hees et al., 2018), regular solar irradiance measurements and background measurements, and measurements using on-board lights that are part of the integrated calibration unit (ICU) (see  for an overview of on-board lights relevant to the SWIR module). The on-board lights consist of a white light source (WLS), dedicated LED (DLED), five diode lasers and a common LED (CLED). The on-board lights have various goals to maintain specific parts of the calibration; for example, the diode lasers are aimed at characterizing any ISRF (instrument spectral response function) and/or stray light changes (Tol et al., 2018;van Hees et al., 2018). In flight, the main goal is the relative calibration, although some parts of the absolute calibration can be derived. The results reveal instrument instabilities for the SWIR module to be less than 0.5 % over the first year. This stability has been improved by regular monitoring (see http://mps.tropomi.eu, last access: 18 October 2021), with the SWIR module stable down to 0.3 % or less over the first 2 or 3 years of regular operations.
TROPOMI products are provided either as L1b products, which consist of calibrated radiance and irradiance data, or L2 products, which consist of the retrieved column densities of selected gas tracers. Data quality from TROPOMI is validated using a range of space-based, airborne and groundbased networks and/or dedicated measurements. 2 These validations are done using the full range of the L2 products. However, these efforts, which are part of the Mission Performance Centre of S-5P do not include independent validation of the L1b data quality due to the complexity of the task. Various scenarios have been discussed, but in essence, L1b validation is much more complicated and cannot easily be carried out using automated ground stations.
Nevertheless, validation of the L1b product serves two important goals. First, it provides validation of the quality of the absolute and relative radiometric calibration and thus the quality of the calibration key data in use. Second, it provides an independent quantification of the instrument stability. The first goal requires independent ground-based measurements. These typically originate from co-incidental space-based measurements (so-called cross-calibration; see Chander et al., 2013;Kataoka et al., 2017) or from complementary ground measurements through vicarious calibration (e.g. Kuze et al., 2014;Bruegge et al., 2019b, a). Both have strict limitations due to instrument properties, temporal difference or spatial coverage. However, validation of instrument stability has less stringent demands and can be carried out by monitoring pseudo-invariant calibration sites, from hereon referred to as PICS. PICS are locations on Earth that are typically free of precipitation, vegetation and/or clouds. They are relatively free of annual changes (e.g. due to vegetation) and possess a homogeneous surface. Desert locations are typically well suited as PICS. PICS have been in use as calibration sites since the mid-90s (Cosnefroy et al., 1996;Lacherade et al., 2013;Bacour et al., 2019), but they are almost exclusively used by CCD (charge-coupled device) cameras in the visible part of the spectrum. Other well-known sites include RailRoad Valley playa (Nevada, US), Gobabeb (Namibia) or Boatou (China), which are all part of the RAD-CAL network .
Note that even in optimal circumstances, annual variations may exist, due to, for example, moisture content or small vegetation, both in the top soil. Non-Lambertian reflectance components of the surface is expressed through the bidirectional reflectance distribution function (BRDF), which determines into which directions sunlight is reflected.
The BRDF of desert sand, in particular its accuracy and effect at off-nadir viewing angles, has been described and characterized by Bruegge et al. (2019a). They find that desert sand has a characteristic "hot spot" in the backwards direction of the incoming solar radiation. They also show that these BRDFs are not well characterized at large off-nadir viewing angles, with differences of 5 % in a normalized solution with respect to the zenith shown as typical. The Moderate Resolution Imaging Spectroradiometer (MODIS) BRDF product (Schaaf et al., 2011) provides world-wide coverage with a 16 d interval but also suffers from inaccuracies as described in Bruegge et al. (2019a) due to saturation of the detector. These are relevant for TROPOMI, due to its very wide swath.
In this paper we present an analysis of the site stability of a sample of 23 PICS in the Saharan, Arabian and Namib deserts. 3 The sample was adopted from the analysis of Bacour et al. (2019), which itself revisited the original Saharan PICS sample from Cosnefroy et al. (1996) and Lacherade et al. (2013). Most analyses of PICS are done at visible wavelengths and/or at high resolutions of imagers. In this analysis, we will use continuum channels of the TROPOMI SWIR channel assuming a Lambertian surface. In addition, no atmospheric correction will be applied. Section 2 provides the final sample, while the data usage of TROPOMI is presented in Sect. 3. This includes data filters and restrictions as well as the assumptions in our analysis. Section 4 gives the results, while the impact is discussed in Sect. 5.

Saharan sites
Saharan PICS have been used for monitoring for a large number of sensors (e.g. Cosnefroy et al., 1996;Lacherade et al., 2013;Bacour et al., 2019). These studies have shown the Sa- hara to be reasonably stable across large spatial scales and relatively invariant in time. The most recent in-depth review of the spatial and temporal variability of Saharan PICS can be found in Bacour et al. (2019) where the original lists of Cosnefroy et al. (1996) and Lacherade et al. (2013) were inspected. An update of coordinates was given as compared to the original locations of Cosnefroy et al. (1996), as well as a quantified ordering on spatial homogeneity across 20 and 100 km as well as temporal stability. In this study we have adopted the final sample of Bacour et al. (2019) in their Tables 2 and 3. We will be using their coordinates to track the temporal variability of the SWIR channel continuum signal. As a typical TROPOMI pixel is at least 7 × 5 km 2 but can be as large as 20 × 7 km 2 (especially during the early parts of operations), the sites are inspected for their temporal stability and spatial homogeneity at 20 km (TVar(20km) and SHom(20km), as reported in Table 2 of Bacour et al., 2019).

TROPOMI data
S5-P has daily overpasses over all sites shown in Table 1. For most sites, multiple overpasses only exist every ∼ 5 d due to the proximity to the Equator. An overpass is valid if a pixel is located less than 0.2 • from the reference location given in Table 1. At the Equator this equals 22 km. At typical Saharan latitudes, the separation is less then 20 km. Note that if multiple pixels qualify, only the pixel with the smallest separation to the reference position is used. TROPOMI data were processed using the operational version of the L1b version 01.00.00. Several filters are applied to improve the overall data quality. These are listed in Table 2. Apart from the separation restriction, the viewing zenith angle must be smaller than 50 • , the cloud fraction of the pixel must be below 0.02, a successful and valid irradiance measurement should have been taken within a day of the overpass, and the solar zenith angle must be below 60 • . 4 With the constraints described above, the largest variable parameter is the size of the TROPOMI overpass pixel. The size of a TROPOMI pixel is dependent on its location in the swath. At nadir, a typical TROPOMI pixel at SWIR wavelength is 7×7 km 2 , while at the edge pixels can dramatically increase in size in the across-track direction, growing to sizes of up to 7 × 26 km 2 . Note that after August 2019, this was reduced to 5.5 × 7 km 2 due to shorter integration times. As seen from Bacour et al. (2019), spatial homogeneity at larger scales (i.e. 100 km) can be significantly worse than its counterpart at 20 km scales. As such, it is imperative to avoid "contamination" from large scales and focus only on the central regions. TROPOMI-SWIR soundings are thus restricted to viewing zenith angles of 50 • or less. Effects of larger viewing angles will be investigated in a future paper. Effectively this constrains soundings to positions away from the edges of the swath. This also automatically limits the pixel to a maximum pixel size in the across-track direction, while in the along-track dimension the resolution is set by the sampling time. The pixel size is thus dependent on the location through the viewing angle and on the date due to the change in integration time. In general, pixels are always smaller than 7 km in the along-track dimension and 8.5 km in the across-track dimension.
A choice was made to include relatively large viewing angles (up to 50 • ) but exclude more extreme angles (TROPOMI can observe at almost 65 • ). The alternatives, either limits to small viewing zenith angles or including all viewing angles, either severely constrained the available data or introduced additional scatter. Other alternatives, such as binning the data and removing angle dependencies in the scatter did not improve the data. To compare data over the full time span, data are corrected by dividing by the cosine of the solar zenith angle. An additional constraint to limit the maximum solar zenith angle was implemented to filter a few problematic cases during winter times.
Cloud fraction is defined as a number between 0 and 1 representing the area of a pixel covered by clouds. A number of 0 is completely cloud-free, while 1 represents a pixel completely covered by clouds. The cloud fraction information was extracted from the support data included in the operational data product of the CH 4 "offline" processor, version 1.2.0. Given the stray light correction as defined in Tol et al. (2018) and applied in the processor, no filtering was used for cloud presence beyond the sounding pixel.
Last but not least, soundings are required to have a valid irradiance measurement using TROPOMI-SWIR within a day. This filter is aimed at possible problematic soundings taken after an orbit manoeuver or anomaly. This affects less than 0.1 % of all TROPOMI measurements.
If an overpass is considered valid, the radiance is derived by taking the median signal for all pixels with centre calibrated wavelength between 2312.7 and 2312.9 nm. Deep absorption features of CH 4 as well as H 2 O are avoided. Uncertainty of the radiance measurements is propagated at each correction using Gaussian error propagation. Note that the uncertainty on the radiance of a sounding is small and includes only the shot noise of the detector, photon noise and uncertainties of the calibration data included during L1b processing.

Lambertian surface and atmospheric correction assumptions
For our analysis, the surface is assumed to be Lambertian in nature. It is well documented that off-nadir soundings are affected by non-Lambertian effects (e.g. Bruegge et al., 2019a). Although correction routines can be applied, these often rely on ground measurements to derive the BRDF correction factors within a modified Rahman-Pinty-Verstraete (mRPV) model (Rahman et al., 1993b, a). Even in such a well-characterized site such as the Railroad Valley playa, differences larger than 5 % are seen in normalization factor (Bruegge et al., 2019b, a). These differences appear to be dependent on measurements used to derive the free parameters in a mRPV model. Ground measurements are not available for any of the sites listed in Table 1. Measurements from space, such as MODIS or MISR (Multi-angle Imaging SpectroRadiometer), are typically of insufficient quality to reliably improve the data due to saturation effects and/or lack of wavelength coverage (Schaaf et al., 2011;Bruegge et al., 2019a). This is particularly apparent at larger viewing zenith angles. Various studies Bruegge et al. (2019b, a) and Kuze et al. (2014) indicate that this assumption will cause the largest scatter. In theory, the very large swath of TROPOMI is well suited to characterize this effect in more detail, following, for example, the results as presented in Bruegge et al. (2019b, a). However, this effect is considered to be beyond the scope of this work; exploration of BRDF effects can only be carried out at instrumented sites such as Railroad Valley due to the need for validation. Similarly, the lack of atmospheric correction applied to the data also may introduce additional scatter and or effects. The most important effect is due to varying aerosol densities. This effect is assumed to be relatively small at 2.3 µm wavelengths. Analysis did reveal that these assumptions are the most likely origin for a secondary yearly variation intro-duced due to the dependency of solar angles to the date. This is revealed as a sine wave with a 365 d period. This sine wave is fitted and removed from the data. The effects are discussed in Sect. 4.2. Future work (van Kempen et al., 2021) will explore non-Lambertian and atmospheric correction effects for TROPOMI SWIR wavelengths.

TROPOMI data quality
The TROPOMI-SWIR response has been shown to be extremely stable over the first year  with no detector degradation seen. This has been corroborated by the ongoing monitoring of the calibration carried out at SRON and KNMI. 5 Degradation of the transmission is below 0.2 % for the detector median. Regions of the detector also do not show degradation below the accuracy limits of the stability of the white light source or the dedicated LED. This allows two key assumptions to be made for this work. First, change in signal above 2 % is difficult to attribute to the instrument. Second, any degradation seen in the signal above the Sahara will be representative over a large part of the columns 6 associated with 2312.7 to 2312.9 nm, only excluding the top and bottom parts of the detector which correspond to viewing angles larger than 50 • . Solar calibration over this region shows deviations of less than 0.2 % over the time period presented in this paper. Table 3 shows the median radiance and its standard deviation (both in absolute and relative scales) of accepted soundings for all sites from a period starting 28 April 2018 to 1 October 2020. In addition to these statistical properties, the slope of a linear fit over 1000 d is given. This linear fit can be used as a quantification of potential degradation. This can be compared to the SWIR stability as seen in van  and Ludewig et al. (2020). For example, this stability should be similar to the trends seen in the DLED, WLS and transmission, as shown in Figs. 23, 24 and 25 of van . Note that the DLED has its own degradation. Figure 2 shows the complete time series of four sites of our sample. These four sites were chosen at random to represent the sample. The results of all other sites are presented in given in the Supplement. Figure 3 gives two examples of the sine wave that was fitted and subsequently corrected for. Figure 4 shows the distribution of standard deviations and slopes of the linear fits of the full sample.

Results
Interestingly, the standard deviation found for most sites is (a) much larger than the uncertainty per measurement (which is estimated to be less then 0.1 %; Kleipool et al., 2018;Ludewig et al., 2020), and (b) relatively constant for all sites in both time and magnitude. A spread of 8 to 11 × 10 −9 mol m −2 sr −1 nm −1 s −1 , equivalent to 3 %-5 %, is found for all sites.
The results of van Kempen et al. (2019) show that measurement-to-measurement instrumental variation due to detector noise is much smaller and that the uncertainty from instrumental effects is dominated by photon noise only, which is estimated to be on the order of 0.1 %). The higher measurement-to-measurement variation reported in Ludewig et al. (2020) seen in the irradiance was attributed to electronic noise. However, due to more in-depth analysis carried out, it is now attributed to surface roughness of the diffusers (Tim A. van Kempen and Antje Ludewig, private communications, 2020-2021). The distribution of viewing angles from TROPOMI SWIR over a full year is large, due to the large swath of TROPOMI that allows for daily global coverage.
From investigations of the TROPOMI-SWIR data above the RailRoad Valley site compared to on-ground measurements, we attribute the derived standard deviation to non-Lambertian reflections of the sandy surface (van Kempen et al., 2021).
Individual site heterogeneity is considered to be a small effect. Site heterogeneity would effect the results due to the varying size of the TROPOMI pixel and exhibit itself as a pattern repeated every 16 d, equivalent to the orbital cycle of S-5p. This pattern varies by site and depends on the more precise BRDF properties of the surfaces. No clear evidence for patterns on these timescales can be seen or derived from Fig. 2. Heterogeneity would also be detectable as a correlation between the values SHOM of 20 or 100 km of Table 2 of Bacour et al. (2019) and the derived standard deviation values of Table 3. Figure 5 shows the lack of correlation between the given site homogeneity at scales of 20 km and the standard deviation of the time series given in Table 3.
Linear fits found are very shallow, with typical number between 0 and 5 × 10 −9 mol m −2 sr −1 nm −1 s −1 per 1000 d. This equates to a maximum increase in transmission of 0.7 % yr −1 . The average of the slope is equal to 0.3 % yr −1 . Note that many sites are found at lower values, and a few even showing a negative slope. Given the typical standard deviation, we attribute the slopes to statistical variations. The impact on SWIR instrument degradation is discussed below.

Nadir view
Given the influence of non-Lambertian reflection on the standard deviation, the stability of the four sites with various quality (Egypt1, Algeria3, Libya3 and NamibiaPICSAND1; see Fig. 6) is shown using only viewing angles of less than 7.5 • from nadir. The results are given in Table 4 with the full results given for completeness. Differences are marginal, Figure 2. Continuum radiance of Egypt1 (a), Algeria3 (b), Libya3 (c) and NamibiaPICSAND1 (d) using a zenith angle filter of 50 • . Shown are all soundings over a period of more than 1000 d, starting 28th of April 2018. The median radiance (red) and a linear fit (green) are shown with lines. Data have been corrected for the solar zenith angle at each individual overpass and assuming the surface is Lambertian. A yearly variation has been corrected using a fitted sine wave with a period of 365 d. with the largest differences seen in the linear fit. For some, the standard deviation improved. The main difference was found in the reliability of the removal of the yearly variation through the 365 d period sine wave. Interestingly, the fit parameters were nearly identical as that derived above, with only the reliability of this fit found to be significantly worse. This is easily explained due to the significantly lower amount of data used as input for the fit. This also corroborates the conclusions on the influence of heterogeneity. Any effects of site heterogeneity would be nearly eliminated as the pixels are nearly identical in size and viewing the same area. This  would result in a severely reduced standard deviation, which is not observed in Table 4.

BRDF
A clear limitation of our analysis is the assumption of a Lambertian surface over the desert sites, as detailed in Sect. 3.1. A more thorough analysis using a correct reflectance model over each desert surface should be able to improve the model. This is typically done over instrumented sites such as Rail-Road Valley. As none of the sites of Table 1 contain ground instrumentation that covers the 2.3 µm band, several alternatives were explored to achieve this. However, these were unsuccessful. Figure 6. Continuum radiance of Egypt1 (a), Algeria3 (b), Libya3 (c) and NamibiaPICSAND1 (c) using a zenith angle filter of 7.5 • . Shown are all soundings over a period of more than 1000 d, starting 28 April 2018. The median radiance (red) and a linear fit (green) are shown with lines. Data have been corrected for the solar zenith angle at each individual overpass and assuming the surface is Lambertian. A yearly variation has been corrected using a fitted sine wave with a period of 365 d. Other sites are given in the Supplement.
-Over these desert sites, the MODIS BRDF product MCD43A1 was found to be not accurate enough. Due to saturation over the bright desert sites, in particular at longer wavelengths, the product includes significant gaps in the data coverage. In addition, any accepted product shows large uncertainties.
-The Visible Infrared Imaging Radiometer Suite (VIIRS) BRDF data product VNP43DNBA1 is not saturated but appears to lack the so-called hot spot in key directions, introducing a systematic error.
-Over instrumented sites such as RailRoad Valley, the accuracy of the correction of BRDF effects appears to be 5 % (Bruegge et al., 2019a, b;Kuze et al., 2014). A 5 % uncertainty is currently larger than the 3 % spread/variation we see.
These conclusions corroborate the conclusion that for implementation of BRDF corrections of TROPOMI measurements, instrumented sites play an important role. In addition, one needs to explore differences in site composition at 2.3 µm for it to be applicable to Saharan sites.

Origin of yearly variation
The sine wave used to correct an observed yearly variation is attributed to the BRDF effect. It is not instrumental in nature given the results of van Kempen et al.
(2019) and ongoing monitoring (see https://www.sron.nl/ tropomi-swir-monitoring/, last access: 4 October 2021) and should thus be attributed to a feature present in the surface. The amplitudes and period offsets with respect to 1 January 2018 are given in Table 5. Example fits are given in the Supplement. It appears that variation in this yearly variation is regionally distributed. Sites in the south-western variation (Mauritania, Mali, Niger, and Algeria1 and Algeria2, have a larger amplitude (6 to 11) period offset with a small negative offset in days with respect to the reference day of 1 January 2018, while eastern Sahara sites (Libya, Egypt, Sudan) tend to have smaller amplitudes (4 to 5) and show a slight positive offset. The Namib desert site is clearly different due to its location in the Southern Hemisphere. The sites in the Arabian Desert are grouped south (Arabia1 and Arabia2) and north (Arabia3 and ArabiaPICSAND1) in amplitude but show no correlation in period. Arabia1 has a relatively large offset. As to the origin, four possibilities are entertained. Several others are either disqualified or limited to relatively small contributions. The first is shadowing of dune formation, caused by site heterogeneity. Bacour et al. (2019) investigated site homogeneity for both the original positions of Cosnefroy et al. (1996) and their own more optimal locations. The latter locations, used here, are significantly more homogeneous at scales of both 20 and 100 km. Given the typical pixel size of TROPOMI, homogeneity at 20 km is relevant. Differences are seen in Google Maps (Bacour et al., 2019), but in general the sites are homogeneous down to a few percent or less. If dune formation would be a large factor, minima would be expected around mid-December due to the solar zenith angle. This minimum corresponds to an offset in days equal to approximately 80. In addition, the sole Southern Hemisphere location would show a minimum during late June and an offset of 260 d. This offset would also not be dependent on site, while amplitude can be. Both are not seen Table 3. However, dune formation and site heterogeneity in general can be more chaotic and thus need to be investigated further.
The second possibility is a dependency to rock or sand composition for reflectance and BRDF, another component in site heterogeneity. Our data are insufficient to make any conclusions on this. However, it would be related to location and size of hot spot and would be incompatible with the observation that the yearly variation is detected with similar amplitudes using only nadir overpasses. This hypothesis does warrant deeper investigation.
Third, given the ISRF width (van Hees et al., 2018), some wings of absorption features are within the range. Such line absorption could show a yearly variation. These absorptions are very small (< 0.5 %) and are not part of either CH 4 or H 2 O absorption features. Identifying any line contamination beyond CH 4 and H 2 O is considered beyond the scope of this paper. Given the expected magnitude, this possibility is considered to be only a very remote possibility.
Last but not least, the lack of atmospheric correction likely influences the results. At SWIR continuum wavelengths this correction is dominated by varying quantities of aerosol in the atmosphere as a function of atmosphere height. This effect likely can contribute to yearly variations, but due to the opacity of aerosols at these wavelengths (0.05 or less) it is expected to be smaller than the BRDF effects.
Several possibilities are ruled out. First, any instrument stability that would be yearly in scope would be detectable either with the on-board calibration and/or reflectance average that is also monitored   et al., 2020). These show limits of less than 0.3 %, which is in line with the results of the linear fits. Another possibility is the varying distance between the Sun and the Earth. Although this variation is expected to be of similar magnitude (∼ 2.25 %), it would also cause two features that are not observed. First perihelion occurs near 5 January. Sine offsets would thus be consistently fitted at 90 or +270 d in Table 5. Deviations from this fit are not expected to be larger than 10 %. However fits with a complete 180 • phase difference are found. TROPOMI radiance is corrected for the Earth-Sun distance (See TROPOMI L1b ATBD: S5P-KNMI-L01B-0009-SD).
Although the second option (BRDF) appears to be the most likely, the other three options cannot be excluded, in particular the effect of aerosol in the atmospheric correction, nor can it be excluded that all contribute to the variation and/or spreads observed. A future publication (van Kempen et al., 2021) will investigate the BRDF and the effect of the atmospheric corrections including aerosols, including its yearly variation. This future publication will carry out the analysis over a single instrumented site (RailRoad Valley) to be able to accurately provide validation against ground measurements.

Conclusions
The results above show that the sample of Saharan PICS is well suited as a validation of the instrument stability, both using viewing angles up to 50 • and to only 7.5 • . The standard deviations seen are similar from site to site. This was achieved by a strict filter on clouds (cloud coverage of < 0.02) and correcting for the solar zenith angle during any overpass. In addition, observed yearly variations were removed by fitting a sine wave with a period of a single year. The results clearly show no evidence for TROPOMI-SWIR instrument degradation or increased transmission. The results validate the conclusions shown in van Kempen et al. (2019) with a limit of 0.5 % over the first 1000 d. Note that the internal calibration suite shows no degradation or increased transmission down to levels of ∼ 0.1 % over the first year. Subsequent monitoring 7 has revealed this to hold over the first 1000 d as well.
The lack of difference using filters with both 50 and 7.5 • is somewhat surprising given the attributed influence of non-Lambertian reflectances (which peak at angles of 20-30 • from zenith) and the inclusion of larger TROPOMI pixels (due to the swath size of TROPOMI). Note however that following Bacour et al. (2019) most sites were assumed to be spatially homogeneous. This likely explains the lack of influence of larger pixels at larger viewing angles. Subsequently, we can conclude that the spatial homogeneity conclusions indeed apply to SWIR wavelengths following this study.
The variation from site to site is also relatively small but does appear to be correlated by region. The Namib desert has a significantly lower radiance, while the sites in the Arabian Desert only show a lower radiance by about 3 %-5 % as compared to the Saharan. Differences within the Sahara (i.e. east-west or per country) appear to be marginal and warrant further investigation.
The assumption of a Lambertian surface limits the accuracy of the data and provides a limit to the interpretation of the data. It very likely is the origin of the observed spread in each time series, as well as either the main contributor or a significant part of observed systematics variations such as the observed yearly variation. TROPOMI is an ideal instrument to explore BRDF effects out to very large viewing angles. However, given the limitations of space-based BRDF products as discussed in Sect. 4.2, conclusions given there, and recent conclusions of the BRDF studies in, for example, RailRoad Valley (Bruegge et al., 2019a), such studies must include validation by usage of instrumented sites.
In conclusion, future SWIR spectrometers should, if possible, prefer the usage of internal calibration sources or solar calibration over the usage of PICS as the primary means for instrument calibration. However, for so-called small-sat applications with total weight and/or life-limited instrument constraints, results presented above show that monitoring of suitable PICS offers a suitable, albeit a relatively more inaccurate, alternative. Note however that at high resolution, several limitations introduced by heterogeneity will be less relevant. Currently, the accuracy of the correction of BRDF effects is the greatest limitation, which should be explored over instrumented sites. Supplement. The supplement related to this article is available online at: https://doi.org/10.5194/amt-14-6711-2021-supplement.
Author contributions. The work was started by FO under supervision of TAvK as part of an internship. TAvK expanded conclusions and additional work, including writing the paper. The base software packages were written and provided by RMvH.
Competing interests. The authors declare that they have no conflict of interest.
Disclaimer. Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.