Validation of Sentinel-5P TROPOMI tropospheric NO<sub>2</sub> products by comparison with NO<sub>2</sub> measurements from airborne imaging DOAS, ground-based stationary DOAS, and mobile car DOAS measurements during the S5P-VAL-DE-Ruhr campaign

Lange, Kezia; Richter, Andreas; Schönhardt, Anja; Meier, Andreas C.; Bösch, Tim; Seyler, André; Krause, Kai; Behrens, Lisa K.; Wittrock, Folkard; Merlaud, Alexis; Tack, Frederik; Fayt, Caroline; Friedrich, Martina M.; Dimitropoulou, Ermioni; Van Roozendael, Michel; Kumar, Vinod; Donner, Sebastian; Dörner, Steffen; Lauster, Bianca; Razi, Maria; Borger, Christian; Uhlmannsiek, Katharina; Wagner, Thomas; Ruhtz, Thomas; Eskes, Henk; Bohn, Birger; Santana Diaz, Daniel; Abuhassan, Nader; Schüttemeyer, Dirk; Burrows, John P.

doi:https://doi.org/10.5194/amt-16-1357-2023

Articles | Volume 16, issue 5

https://doi.org/10.5194/amt-16-1357-2023

© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/amt-16-1357-2023

© Author(s) 2023. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume 16, issue 5

Research article

|

14 Mar 2023

Research article |

| 14 Mar 2023

Validation of Sentinel-5P TROPOMI tropospheric NO₂ products by comparison with NO₂ measurements from airborne imaging DOAS, ground-based stationary DOAS, and mobile car DOAS measurements during the S5P-VAL-DE-Ruhr campaign

Kezia Lange, Andreas Richter, Anja Schönhardt, Andreas C. Meier, Tim Bösch, André Seyler, Kai Krause, Lisa K. Behrens, Folkard Wittrock, Alexis Merlaud, Frederik Tack, Caroline Fayt, Martina M. Friedrich, Ermioni Dimitropoulou, Michel Van Roozendael, Vinod Kumar, Sebastian Donner, Steffen Dörner, Bianca Lauster, Maria Razi, Christian Borger, Katharina Uhlmannsiek, Thomas Wagner, Thomas Ruhtz, Henk Eskes, Birger Bohn, Daniel Santana Diaz, Nader Abuhassan, Dirk Schüttemeyer, and John P. Burrows

Download

Final revised paper (published on 14 Mar 2023)
Preprint (discussion started on 25 Oct 2022)

Interactive discussion

Status: closed

RC1:
'Comment on amt-2022-264', Anonymous Referee #2, 14 Nov 2022

The manuscript by Lange et al. discusses the S5P-VAL-DE-Ruhr validation campaign. It includes a very extensive and well-presented validation of the TROPOMI dataset with aircraft and ground-based measurements, including a good measurement campaign overview. It further includes a comparison of different TROPOMI NO2 datasets and shows the significant improvements of the latest product version PAL over OFFL. It is of interest to readers of AMT. I would recommend publication after addressing some suggestions (see supplement).

Citation: https://doi.org/10.5194/amt-2022-264-RC1
- AC1: 'Reply on RC1', Kezia Lange, 04 Jan 2023
  
  The comment was uploaded in the form of a supplement: https://amt.copernicus.org/preprints/amt-2022-264/amt-2022-264-AC1-supplement.pdf
  
  Citation: https://doi.org/10.5194/amt-2022-264-AC1
RC2:
'Comment on amt-2022-264', Anonymous Referee #3, 21 Nov 2022
Referee comments to amt-2022-264 titled, ‘Validation of Sentinel-5P TROPOMI tropospheric NO2 products by comparison with NO2 measurements from airborne imaging, ground-based stationary, and mobile car DOAS measurements during the S5P-VAL-DE-Ruhr campaign’

This manuscript uses DOAS data collected in September 2020 in a polluted region in western Germany from airborne, ground, and car-based instrumentation to validate the set of TROPOMI L2 NO2 products (both research and operational). The airborne datasets are first validated by the ground and car-based systems which then justifies the airborne use for validating TROPOMI. This paper fits the scope of AMT and will be valuable as a validation dataset for the TROPOMI NO2 product. However, before publishing, this manuscript requires some minor technical corrections/clarifications as detailed below but more reflection toward conclusions drawn about improvements in the S5P PAL product and the impact of clouds. Detailed comments below.

More significant comments:

Most of the results in this work are too heavily based on the slope of the regression, which is not representing the complete behavior of the validation activity. Table A1 has at least median difference in % which actually in some cases contradicts the results of the slope (e.g., having a 21% higher column from TROPOMI as a median from S5P PAL V02.03.01.). Consider more in-depth analysis based on statistics other than slope for all intercomparisons.

Some conclusions drawn in section 6 are either overgeneralized or not quite technically correct. These comments do not specify lines in the test but more so in general comments that need to be kept in mind when adding to and editing the analysis based on the suggestions below:

With the data presented, conclusions about the S5P PAL product are only stated as an improvement. This is an overgeneralized conclusion, and the authors should do some more detailed analysis from other statistics. Some of this is already done with discussion of the lower lobe results but it is missing discussion on the higher lobe. Additionally, with the loss in precision, some users may find this result more detrimental than having a predictable low slope and this is not commented upon in the results, abstract, or conclusions.

The main reason concluded about the improved PAL product is due to the cloud correction. It is stated that all these changes are due to more ‘realistic’ cloud pressures or more ‘realistic’ cloud corrections but this more ‘realistic’ outcome is not demonstrated in this region. Therefore, these conclusions cannot be stated unless they are proven with the data available in that specific region (e.g., could look at imagery from satellites or other creative sources and reflect on what it should be in reality). In fact, removing the cloud correction all together (Figure 9b) shows that the massive improvement in slope is something else removed from the cloud correction as this is the best result in terms of conserving precision (correlation) and a higher slope.

Conclusions drawn about the cloud pressure in some cases seems to not be interpreted correctly as written. For example, discussions from line 555-563 talk about the low lobe. (1) It is stated that cloud pressures are too low, but looking at imagery online there seems to be zero clouds seen by VIIRS on this afternoon, so cloud pressures shouldn’t be low to start with. (2) Aerosols are also pointed at as a potential cause, but the sensitivity results in Figure A2 show that the impact of aerosols would not be large enough to create this bias in this lobe.

Technical comments in relation to the AirMAP retrieval that need more justification or clarification.

It is said that the reference VCD in the troposphere for AirMAP is 1e15. One of the MAX-DOAS retrievals has a different value of 1.5E15 but they are referred to as similar. It is different by 50% rather than similar. Please clarify these difference or explain them.

Can the reference value be justified with any other data from this work? (i.e., What does the CAMS model say the tropospheric amount is?)

Could this reference assumption be the cause for a low offset between the car DOAS systems and the airborne dataset?

Can the authors justify why a 1km box profile used if CAMS analysis is available for these flights to provide a profile shape and what that assumption impact may be in the results? A 1 km box profile assumes that NO2 is well mixed through that 1km boundary layer which has been demonstrated as not the case with in situ measurements from aircraft near strong sources (which is the case here in many of these flights). (e.g., https://doi.org/10.1002/2015JD024203 and https://doi.org/10.1525/elementa.2020.00163 ). This paper also shows the impact of AMFs based on assuming a 1km box vs an urban profile atmos-meas-tech.net/3/475/2010/

Line 291: ‘Surfaces with different brightness introduce artefacts in the maps of NO2’. The impact isn’t necessarily an artifact at the SCD stage. This is caused by the brighter surface increasing sensitivity in the lower parts of the atmosphere meaning a higher slant column if NO2 is present (if there is not any or minimal NO2 then this spatial pattern will not show up in the slant column). It only becomes an artifact if the surface reflectivity assumption in the AMF calculation doesn’t account for this accurately.

Minor comments:

When referring to the spatial resolution of TROPOMI as 3.5 km x 5.5 km, please specify that this is at nadir.

Line 74. Mention what version Verhoelst et al. validated to be consistent with this analysis and the other mentioned publications.

Line 94: the conclusion of ‘low bias’ is prematurely stated (before showing any results). Recommend just removing ‘low’ from the sentence.

Figure 2 is mentioned before Figure 1. Consider reordering figures to reflect this or consider combining Figures 1 and 2 for a more helpful side-by-side comparison.

Line 159: capitalized Ozone Monitoring Instrument

Line 179-181: The sentence about V02.04.01 should either clearly state that this analysis does not include this product or should be removed.

Lines 173-177: The following sentence needs references: ‘Other factors that could contribute to the underestimation are the low spatial resolution of the used a priori NO2 profiles from the TM5-MP global chemistry transport model, the use of the OMI LER climatology given on a grid of 0.5° x 0.5° for the AMF and cloud fraction retrieval in the NO2 fit window and the GOME-2 LER climatology used for the NIR-FRESCO cloud retrieval given on a grid of 0.25° x 0.25° measured at mid-morning.’

Line 189: add the spatial resolution of the CAMS global analysis

Line 198-199: The sentence referring to 15% increases needs a reference.

Line 308: define quantitatively what polluted means for this statistic.

Equation 5 seems to be the same as equation 4. Is it needed?

Consider making a table of all the various information of the retrievals for the AirMAP, car, and stationary DOAS retrievals as the sections get repetitive and there are small differences in places that are hard to keep straight.

Are there references for all the individual car or ground-based systems? If so, please add in the sections that describe them.

The MAX-DOAS measurement truck is different from the rest in that it measures in the UV rather than the visible wavelengths of the other retrievals. Is it realistic for their AMFs to be the same as the other systems?

Line 415-416. The SCD of the reference for this DOAS instrument seems quite large considering the statements that the AMFs for a zenith DOAS retrieval are about 1.3. Is this off by an order of magnitude or are the measurements just in a densely polluted area for the reference?

Line 449-451. Is there a reference for the tropospheric NO2 product from Pandora that can be added to this section? This is the first publication I have seen use that product.

Line 501: Is it +/- 1 hour or 30 minutes? The rest of the paper seems to reflect 30 minutes.

Line 576: Before this line, it says that the criterion for comparison is the same as Judd et al. 2020 but at this location the authors should specify that this criterion (filtering for delta CS less/greater than 50 hpa) is the opposite of the filter applied by Judd et al. to avoid confusion. Bonus suggestion: it could be nice to have a comparison of what the results look like for the points with delta CS less than 50 hPa?

Line 604-606: ‘This behavior is different from the small impact that we observed for changing the a priori NO2 profile information from TM5 to CAMS for the OFFL V01.03.02 dataset’. The change seems to be on the same order of magnitude rather than different.

Line 660. Saying cloud fractions are always lower than 0.14 contradicts from other examples in the text. (e.g., saying it was on average 0.21 in line 128).

Line 667: it is stated that on average TROPOMI is lower than air map but there are no averages reported in the manuscript.
Citation: https://doi.org/10.5194/amt-2022-264-RC2
- AC2: 'Reply on RC2', Kezia Lange, 04 Jan 2023
  
  The comment was uploaded in the form of a supplement: https://amt.copernicus.org/preprints/amt-2022-264/amt-2022-264-AC2-supplement.pdf
  
  Citation: https://doi.org/10.5194/amt-2022-264-AC2
RC3:
'Comment on amt-2022-264', Anonymous Referee #1, 28 Nov 2022
This manuscript provides an evaluation of TROPOMI NO2 vertical tropospheric columns (VTCs) against airborne NO2 observations from the AirMAP imaging spectrometer, which itself is compared with ground-based stationary and mobile DOAS instruments.

The study is excellent. The manuscript is very well written and clear and the topic fits well within the scope of AMT. I can only commend the authors for this thorough study and strongly recommend publication.

Excellence:

The study goes well beyond previous evaluation studies in several respects. The validation experiment combining ground-based and airborne remote sensing is very well thought-out: The ground-based measurements provide a high-quality reference and the airborne observations, which effectively sampled the area covered by individual TROPOMI pixels over regions with strong NO2 gradients, provide the link to the satellite data. This setup allows for a quantitative (almost 1:1) comparison in contrast to more indirect/qualitative comparisons previous studies. Furthermore, the extensive aircraft observations over three different regions with varying NO2 levels allowed covering many individual TROPOMI pixels necessary, which is necessary to compute robust statistics. Finally, the study does not stop at presenting the comparisons but goes a long way towards explaining the reasons for errors and biases in different versions of TROPOMI data. By replicating the TROPOMI retrieval algorithm, the authors were able to analyze the influence of key input parameters such as a priori NO2 profiles and surface reflectance on the data. The main source of error was found to be the FRESCO cloud retrieval, which tended to place the cloud tops at too low elevation probably due the inability of the algorithm to properly account for the effect of aerosols. This finding is essential to guide further developments of the retrieval algorithms and improvements of the operational TROPOMI NO2 product in the future.

The study is very well written with almost no typos or grammatical errors, logically organized, well balanced in terms of conciseness and detail, the figures and tables are of high quality, and the Appendices add valuable information.

I have only two small points to consider:

The differences in the NO2 VTCs between the former operational offline algorithm (OFFL V01.03.02) and the improved new algorithm PAL V02.03.01 are very large. The authors mention that the main change was a switch from the FRESCO-S to the FRESCO-wide cloud retrieval algorithm, but there is little information on what else changed what the (potential) influence of these changes were. It would be useful to get some more insight into the changes.

The comparisons between AirMAP and the ground-based mobile and stationary suggest that the ground-based measurements (separately analyzed by the different groups) provide a consistent set of reference measurements. Nevertheless the question arises whether there has been no direct comparison between the ground-based instruments, e.g. when a car DOAS passed by a the location of a stationary instrument or when several car instrument were placed at the same location. If such intercomparisons have been made, it would be good to learn about them and add the results e.g. in an Appendix.

Small corrections:

Page 7, Line 148: Change "are retrieved" to "were retrieved"

Page 8, Line 179: I think the acronym DLER has not been introduced before.

Page 12, Equation 12: Why is the VDC_ref,trop not simply added to dSCD/AMF_trop? Why do we need to multiply VCD_ref,trop with AMF_ref,trop / AMF_trop? Please explain.

Page 29, line 661: I think the recommended filter criterion of 0.5 applies to the cloud fraction, not to cloud fraction.
Citation: https://doi.org/10.5194/amt-2022-264-RC3
- AC3: 'Reply on RC3', Kezia Lange, 04 Jan 2023
  
  The comment was uploaded in the form of a supplement: https://amt.copernicus.org/preprints/amt-2022-264/amt-2022-264-AC3-supplement.pdf
  
  Citation: https://doi.org/10.5194/amt-2022-264-AC3

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload

AR by Kezia Lange on behalf of the Authors (09 Jan 2023) Author's response Author's tracked changes Manuscript

ED: Publish as is (12 Jan 2023) by Robyn Schofield

AR by Kezia Lange on behalf of the Authors (12 Jan 2023)

Post-review adjustments

AA: Author's adjustment | EA: Editor approval

AA by Kezia Lange on behalf of the Authors (10 Feb 2023) Author's adjustment Manuscript

EA: Adjustments approved (14 Mar 2023) by Robyn Schofield

Short summary

We present airborne imaging DOAS and ground-based stationary and car DOAS measurements conducted during the S5P-VAL-DE-Ruhr campaign in the Rhine-Ruhr region. The measurements are used to validate spaceborne NO₂ data products from the Sentinel-5 Precursor TROPOspheric Monitoring Instrument (TROPOMI). Auxiliary data of the TROPOMI NO₂ retrieval, such as spatially higher resolved a priori NO₂ vertical profiles, surface reflectivity, and cloud treatment are investigated to evaluate their impact.

Validation of Sentinel-5P TROPOMI tropospheric NO2 products by comparison with NO2 measurements from airborne imaging DOAS, ground-based stationary DOAS, and mobile car DOAS measurements during the S5P-VAL-DE-Ruhr campaign

Download

Interactive discussion

Peer review completion

Post-review adjustments

Validation of Sentinel-5P TROPOMI tropospheric NO₂ products by comparison with NO₂ measurements from airborne imaging DOAS, ground-based stationary DOAS, and mobile car DOAS measurements during the S5P-VAL-DE-Ruhr campaign