The SPARC water vapour assessment II: biases and drifts of water vapour satellite data records with respect to frost point hygrometer records

Kiefer, Michael; Hurst, Dale F.; Stiller, Gabriele P.; Lossow, Stefan; Vömel, Holger; Anderson, John; Azam, Faiza; Bertaux, Jean-Loup; Blanot, Laurent; Bramstedt, Klaus; Burrows, John P.; Damadeo, Robert; Dinelli, Bianca Maria; Eriksson, Patrick; García-Comas, Maya; Gille, John C.; Hervig, Mark; Kasai, Yasuko; Khosrawi, Farahnaz; Murtagh, Donal; Nedoluha, Gerald E.; Noël, Stefan; Raspollini, Piera; Read, William G.; Rosenlof, Karen H.; Rozanov, Alexei; Sioris, Christopher E.; Sugita, Takafumi; von Clarmann, Thomas; Walker, Kaley A.; Weigel, Katja

doi:https://doi.org/10.5194/amt-16-4589-2023

Articles | Volume 16, issue 19

https://doi.org/10.5194/amt-16-4589-2023

Special issue:

Water vapour in the upper troposphere and middle atmosphere:...

https://doi.org/10.5194/amt-16-4589-2023

Articles | Volume 16, issue 19

Research article

12 Oct 2023

Research article |

| 12 Oct 2023

The SPARC water vapour assessment II: biases and drifts of water vapour satellite data records with respect to frost point hygrometer records

Michael Kiefer, Dale F. Hurst, Gabriele P. Stiller, Stefan Lossow, Holger Vömel, John Anderson, Faiza Azam, Jean-Loup Bertaux, Laurent Blanot, Klaus Bramstedt, John P. Burrows, Robert Damadeo, Bianca Maria Dinelli, Patrick Eriksson, Maya García-Comas, John C. Gille, Mark Hervig, Yasuko Kasai, Farahnaz Khosrawi, Donal Murtagh, Gerald E. Nedoluha, Stefan Noël, Piera Raspollini, William G. Read, Karen H. Rosenlof, Alexei Rozanov, Christopher E. Sioris, Takafumi Sugita, Thomas von Clarmann, Kaley A. Walker, and Katja Weigel

Abstract

Satellite data records of stratospheric water vapour have been compared to balloon-borne frost point hygrometer (FP) profiles that are coincident in space and time. The satellite data records of 15 different instruments cover water vapour data available from January 2000 through December 2016. The hygrometer data are from 27 stations all over the world in the same period. For the comparison, real or constructed averaging kernels have been applied to the hygrometer profiles to adjust them to the measurement characteristics of the satellite instruments. For bias evaluation, we have compared satellite profiles averaged over the available temporal coverage to the means of coincident FP profiles for individual stations. For drift determinations, we analysed time series of relative differences between spatiotemporally coincident satellite and hygrometer profiles at individual stations. In a synopsis we have also calculated the mean biases and drifts (and their respective uncertainties) for each satellite record over all applicable hygrometer stations in three altitude ranges (10–30 hPa, 30–100 hPa, and 100 hPa to tropopause). Most of the satellite data have biases <10 % and average drifts <1 % yr⁻¹ in at least one of the respective altitude ranges. Virtually all biases are significant in the sense that their uncertainty range in terms of twice the standard error of the mean does not include zero. Statistically significant drifts (95 % confidence) are detected for 35 % of the ≈ 1200 time series of relative differences between satellites and hygrometers.

Download & links

Article (PDF, 31079 KB)

Supplement (3734 KB)

Download & links

Article (31079 KB)
Full-text XML
Supplement (3734 KB)
BibTeX
EndNote

How to cite.

Kiefer, M., Hurst, D. F., Stiller, G. P., Lossow, S., Vömel, H., Anderson, J., Azam, F., Bertaux, J.-L., Blanot, L., Bramstedt, K., Burrows, J. P., Damadeo, R., Dinelli, B. M., Eriksson, P., García-Comas, M., Gille, J. C., Hervig, M., Kasai, Y., Khosrawi, F., Murtagh, D., Nedoluha, G. E., Noël, S., Raspollini, P., Read, W. G., Rosenlof, K. H., Rozanov, A., Sioris, C. E., Sugita, T., von Clarmann, T., Walker, K. A., and Weigel, K.: The SPARC water vapour assessment II: biases and drifts of water vapour satellite data records with respect to frost point hygrometer records, Atmos. Meas. Tech., 16, 4589–4642, https://doi.org/10.5194/amt-16-4589-2023, 2023.

Received: 19 Apr 2023 – Discussion started: 21 Apr 2023 – Revised: 28 Aug 2023 – Accepted: 30 Aug 2023 – Published: 12 Oct 2023

1 Introduction

Water vapour is the most potent greenhouse gas in the atmosphere (Kiehl and Trenberth, 1997). Its radiative effect per unit mass change is strongest around the tropical tropopause (Riese et al., 2012; Solomon et al., 2010). Trends of stratospheric water vapour are expected to be related to the temperatures of the tropical tropopause where air transporting water vapour enters the stratosphere (e.g. Fueglistaler and Haynes, 2005; Randel and Park, 2019). Rising troposphere and tropopause temperatures due to global warming may lead to increasing stratospheric water vapour abundances, initiating a positive feedback loop where global warming will be further accelerated due to increasing water vapour abundances in the lower stratosphere (e.g. Gettelman et al., 2010; Dessler et al., 2013, 2016). In addition, the major stratospheric source of water vapour is the oxidation of methane (e.g. le Texier et al., 1988), which has more than doubled since 1800 (Blunier et al., 1993) and is expected to continue rising in future (e.g. Lelieveld et al., 1998), further increasing stratospheric water vapour.

Since 1980, despite constant or slightly decreasing tropical tropopause temperatures (Gettelman et al., 2009; Hu et al., 2015), an increase in stratospheric water vapour has been observed over Boulder, Colorado (Oltmans and Hofmann, 1995). This cannot be explained by the high positive correlation between tropical tropopause temperatures and water vapour in the lowermost tropical stratosphere (Fueglistaler and Haynes, 2005; Randel and Park, 2019). In consequence, numerous studies have been performed to better understand the stratospheric water vapour budget and trends (e.g. Oltmans and Hofmann, 1995; Oltmans et al., 2000; Rosenlof et al., 2001; Nedoluha et al., 2003; Hurst et al., 2011 b; Dessler et al., 2014; Hegglin et al., 2014; Brinkop et al., 2016). Vertically resolved profiles of atmospheric water vapour have been observed around the globe by satellite-based instruments in low Earth orbits since the mid-1970s. From the year 2000 on, 15 different satellite instruments have observed vertically resolved water vapour distributions from the middle troposphere to the mesosphere and above. More than 2 decades ago, a first assessment of the quality of water vapour observations including ground-based, balloon-borne and satellite instrumentation was published as the WCRP/SPARC (World Climate Research Programme/Stratosphere-troposphere Processes And their Role in Climate) report no. 2 (Kley et al., 2000). The many new satellite instruments in orbit since 2000 have made it of great interest to reassess the quality and consistency of water vapour observations from space. Here we concentrate only on stratospheric measurements by satellites and balloon-borne frost point hygrometers (FPs).

Many of the satellite data records included in this study are described in detail by their data providers in reports and scientific papers (a compilation of information relevant to this paper is presented in Walker et al., 2023). These reports and scientific papers also contain, in most cases, some information about validation activities. Frost point hygrometers have often been used for satellite data validation since they are considered to be most accurate and internally consistent water vapour instruments for stratospheric measurements. Comparisons of different instruments, including their calibrations and data processing routines, were the focus of several field campaigns (Vömel et al., 2007 a, b, 2016; Hurst et al., 2011 a; Rollins et al., 2014; Hall et al., 2016). Despite differences between the measurements by instruments employing different sensing techniques, consistency was found within the data from FPs.

Each comparison of satellite data records to the FP soundings, however, has been done in a slightly different way by each validation team, resulting in a wealth of validation publications that are not consistent down to the last detail. This lack of consistency hampers activities where several satellite data records need to be merged to construct a long-term time series, e.g. for trend assessments. For this reason, we decided for this WCRP/SPARC WAVAS-II (Water Vapor Assessment II) activity to perform the comparison of all available satellite data records obtained during the period of 2000 through 2016 to FP data in a fully consistent and reproducible way. We document here where the FP data came from, how we made them comparable to the satellite data, and how the comparisons were performed. Overall, all of our satellite-to-FP comparisons are done in a similar way. The result of this activity is the first fully self-consistent quality assessment of vertically resolved biases and drifts in the stratospheric water vapour measurements by numerous satellite instruments and FPs, along with the respective uncertainties. In order to be consistent with the other assessments within the WCRP/SPARC WAVAS-II activity (see ACP/AMT/ESSD special issue “Water vapour in the upper troposphere and middle atmosphere: a WCRP/SPARC satellite data quality assessment including biases, variability, and drifts”, https://amt.copernicus.org/articles/special_issue10_830.html, last access: 28 September 2023), we use the same data versions as used in other papers in the WAVAS-II special issue, even in cases where newer data versions have become available in the meantime.

The paper is structured as follows: in Sect. 2 we describe the FP data and the satellite data records, including their preparation for use within this study. Further, we explain how we made the FP data comparable in terms of their vertical resolution and how the biases and the drifts have been calculated. Section 3 presents the assessment of the biases between the satellite and FP data records, starting with each individual satellite data record versus the FP data at each site and then discussing comparisons of all satellite data versus one station, as well as one satellite data record versus all stations. We summarize these findings with a synopsis of the biases and their uncertainties for each satellite data set over all its associated FP sites, in three different altitude ranges. Section 4 presents the assessment of instrumental drifts of the satellite data records against FP records, also including a synopsis of the drifts of each satellite data set, in three altitude ranges, over all its associated FP sites. Section 5 summarizes our findings and offers recommendations for the use of the satellite data records under assessment. The individual bias and drift figures for pairs of satellite records and FP stations are presented in the Supplement and Appendix to this paper, respectively.

2 Data and data handling

In this study, we compare the satellite data records under assessment in the WCRP/SPARC WAVAS-II activity to reference-quality FP soundings at 27 stations (79^∘ N to 45^∘ S latitude) during 2000 through 2016. A total of 31 data records from 15 different satellite instruments provide a subset of measurements coincident with the FP soundings (for coincidence criteria see below) that can be evaluated against the profile data from FP balloon soundings. In the following, we briefly describe FP and satellite data, explain the adjustments of the vertical resolution of the FP data to each of the various satellite data records, and describe the methods for the bias and drift assessments.

2.1 Frost point hygrometer data

The chilled mirror technique (Brewer, 1949; Barrett et al., 1950) is based upon the well-known equilibrium thermodynamic relationship (Clausius–Clapeyron) between an ice or liquid water surface and overlying water vapour. Frost point hygrometers actively maintain the equilibrium of this two-phase system by continuously adjusting the temperature of the condensate layer such that it remains stable. Both the NOAA (National Oceanic and Atmospheric Administration) Global Monitoring Laboratory's frost point hygrometer (NOAA FPH) and the cryogenic frost point hygrometer (CFH) use optical detection of the condensate layer on a small mirror. A feedback loop actively regulates the mirror temperature to maintain a stable condensate layer, making the water vapour content of the overlying air directly calculable from the mirror temperature.

The balloon-borne NOAA FPH was first flown over Boulder, CO, in 1980 (Oltmans et al., 2000) and, to date, has produced a 43-year record of stratospheric water vapour mixing ratios (Hurst et al., 2011 b). It has also been flown routinely at Lauder, New Zealand, since 2004 and Hilo, Hawaii, since 2010 and has been part of a number of tropical, mid-latitude, and polar measurement campaigns (Kley et al., 1997). The NOAA FPH payload is configured to enable measurements not only during ascent but also during controlled (5 m s⁻¹) descent of the balloon when water vapour contamination is improbable. The FPH measurement uncertainty is largely determined by the stability of the frost layer and, under satisfactory performance, is 0.1–0.3 K in frost-point temperature in the stratosphere, leading to a measurement uncertainty of <6 % for stratospheric mixing ratios (Hall et al., 2016).

Table 1Overview of NOAA (National Oceanic and Atmospheric Administration) frost point hygrometer (NOAA FPH) and cryogenic frost point hygrometer (CFH) stations used for comparisons with satellite data.

^a Data from these sites were used for the drift analyses.

Download Print Version | Download XLSX

https://amt.copernicus.org/articles/16/4589/2023/amt-16-4589-2023-f01

Figure 1Locations of NOAA FPH and CFH stations that provided measurement data for these intercomparisons (a) and the temporal coverage of the data records at the respective stations (b). RV Mirai was a measurement campaign based on a ship cruise. This is indicated by the dotted line connecting the respective symbols. In the lower plot each symbol represents at least one balloon-borne FP sounding. Note that some of the FP data sets began before 2000, but only the data from 2000 through 2016 are used here for bias and drift evaluations. FP record start dates are presented in Table 1.

The CFH (Vömel et al., 2007 a, b, 2016) works along the same principle as the NOAA FPH but uses a proportional–integral–derivative controller with a continuously variable parameter schedule to make observations between the surface and the middle stratosphere (25 km). The uncertainty of the condensate phase in the temperature range below 0 ^∘C is largely eliminated, allowing continuous profiles over a wider range of frost-point temperatures to be measured. It suffers no artefacts in cirrus clouds and may only be limited in wet precipitating clouds with the detector lens getting wet. The measurement uncertainty of the CFH is less than 0.5 K throughout the entire profile, which translates to conservative uncertainty values of 4 % in the lower troposphere and increasing to 9 % in the stratosphere.

Neither the CFH nor NOAA FPH requires water vapour calibration standards or a water vapour calibration scale; only the mirror thermistor must be calibrated with high accuracy, and this is accomplished using traceable standards of the US National Institute of Standards and Technology (NIST).

Temperature and pressure measurements used to convert frost point hygrometer data into relative humidity values and volume mixing ratios, respectively, are from the accompanying radiosondes on each balloon. Measurements of temperature and pressure have been provided by different radiosonde models throughout the years: Vaisala models RS80, RS92, and RS41; InterMet models iMet-1-RSB and iMet-4-RSB; and Meisei models RS-06G and RS-11G.

Offsets in the pressure measurements of radiosondes may bias the calculation of the mixing ratio in the stratosphere (Stauffer et al., 2014; Inai et al., 2015). To minimize this bias, the radiosonde pressure measurements are usually corrected using the radiosonde's acquisition of the geometric altitude by Global Navigation Satellite System (GNSS). In some radiosonde systems, the pressure is not measured directly but instead derived from the GNSS altitude. Only in older systems that precede the availability of GNSS observations on radiosondes starting in the late 1990s are pressures used without any corrections except those based on a simple pre-flight comparison at the surface with ground-based sensors. For this work we used FP mixing ratio averages on a fixed 250 m altitude grid. These are typically further reduced in vertical resolution as they are convolved with real or constructed averaging kernels for the different satellite instruments (see Sect. 2.3).

Table 1 lists the stations from which NOAA FPH or CFH data have been used for comparison with satellite data, together with their period of operation, the type of instrument launched, and the geographical coordinates of the site. Each station is given a three-letter code to simplify its identification in the remainder of this paper. Figure 1 provides an overview of the geographical locations and the measurement periods of the stations, together with the symbols and colour codes that are used throughout this paper to mark the respective data of the stations.

In the remainder of this paper we do not distinguish between NOAA FPH and CFH, so we continue to use the generic term “FP” for frost point hygrometer instruments and data.

2.2 Satellite data

Satellite data from all instruments providing measurements coincident with FP balloon soundings have been selected. Data quality filter criteria according to the original data descriptions from the data providers have been applied (for a summary of these data-set-specific criteria, see Walker et al., 2023). No further bulk screening for data outliers surviving the previous data quality filtering has been applied. The 31 satellite data records that are used in this comparison are listed in Table 2 along with their three-letter codes. Figure 2 shows the symbols and colour codes for the satellite data sets used throughout this paper. The data versions we have assessed in this study are not the most recent ones to date for most of the satellite data sets. For reasons of consistency, the data versions used here are the same as those assessed by the other comparative studies of the SPARC WAVAS-II activity. It is left to future studies to evaluate if more recent data versions of water vapour satellite data are improved with respect to those assessed here. Such evaluations can also be done individually by comparing newer data versions to those assessed here to quantify any changes in the biases and drifts reported here.

https://amt.copernicus.org/articles/16/4589/2023/amt-16-4589-2023-f02

Figure 2Colours, symbols, and three-letter codes for the satellite data records used throughout the paper (upper part) and temporal distribution of available data of the respective satellite instruments on a monthly basis until the end of 2016 (lower part, not divided into measurement modes or data versions). Note that three of the SAT data sets began before 2000 (SAGE II 1984, HALOE 1991, POAM III 1998), but only the data from 2000 through 2016 are used here for bias and drift evaluations.

The SPARC water vapour assessment II: biases and drifts of water vapour satellite data records with respect to frost point hygrometer records

2.1 Frost point hygrometer data

2.2 Satellite data

2.3 Adaptation of the vertical resolution of FP profiles to the satellite data and interpolation to a common grid

3.1 Method of calculation of bias and standard error of the mean bias

3.2 Individual comparisons between satellite data records and FP stations

3.3 Mean biases of the satellite data records by FP stations

3.4 Mean biases of the satellite data by data record

ACE-FTS v3.5 (ACE)

GOMOS (GOM)

HALOE (HAL)

HIRDLS (HIR)

ILAS-II (ILA)

MAESTRO (MST)

MIPAS (MBR, MER, MIR, MOR)

MLS (MLS)

POAM-III (POM)

SAGE-II (SG2)

SAGE-III (SG3)

SCIAMACHY (SC3, SC1, SC4, SCL)

SMILES (SLA, SLB)

SMR (SM5, SM4)

SOFIE (SOF)

3.5 Synopsis of the bias assessment

4.1 Evaluation of data records for drift analysis

4.2 Methods for quantifying drifts

4.2.1 Special case of MLS drifts

4.3 Drift profiles for the unique SAT–FP pairs

4.4 Mean drifts for the unique SAT–FP pairs

4.5 Mean drifts at each FP site across all SATs

4.6 Mean drifts for each SAT across all FP sites

4.7 Synopsis of the drift assessments

ACE

GOM (GOMOS)

HAL (HALOE)

MST (MAESTRO)

MLS and MLS∗

SG2 (SAGE II)

SC3 (SCIAMACHY limb)

SC1 (SCIAMACHY solar OEM)

SC4 (SCIAMACHY solar OP)

SM5 (SMR 544 GHz)

SM4 (SMR 489 GHz)

SOF (SOFIE)

MBR and MBM (MIPAS Bologna V5R NOM and MA)

MER and MEM (MIPAS ESA V7R NOM and MA)

MIR and MIM (MIPAS IMK/IAA V5R NOM and MA)

MOR and MOM (MIPAS Oxford V5R NOM and MA)

A1 Biases per SAT data set

A2 Biases per FP data set, ordered by latitude

B1 Drift profiles for every SAT–FP pair

B2 Drifts per SAT data set

B3 Drifts per FP data set, ordered by latitude

MLS and MLS^∗