Can machine learning correct microwave humidity radiances for the influence of clouds?

Kaur, Inderpreet; Eriksson, Patrick; Pfreundschuh, Simon; Duncan, David Ian

doi:https://doi.org/10.5194/amt-14-2957-2021

Articles | Volume 14, issue 4

https://doi.org/10.5194/amt-14-2957-2021

Articles | Volume 14, issue 4

Research article

20 Apr 2021

Research article |

| 20 Apr 2021

Can machine learning correct microwave humidity radiances for the influence of clouds?

Inderpreet Kaur, Patrick Eriksson, Simon Pfreundschuh, and David Ian Duncan

Abstract

A methodology based on quantile regression neural networks (QRNNs) is presented that identifies and corrects the cloud impact on microwave humidity sounder radiances at 183 GHz. This approach estimates the posterior distributions of noise-free clear-sky (NFCS) radiances, providing nearly bias-free estimates of clear-sky radiances with a full posterior error distribution. It is first demonstrated by application to a present sensor, the MicroWave Humidity Sounder 2 (MWHS-2); then the applicability to sub-millimetre (sub-mm) sensors is also analysed. The QRNN results improve upon what operational cloud filtering techniques like a scattering index can achieve but are ultimately imperfect due to limited information content on cirrus impact from traditional microwave channels – the negative departures associated with high cloud impact are successfully corrected, but thin cirrus clouds cannot be fully corrected. In contrast, when sub-mm observations are used, QRNN successfully corrects most cases with cloud impact, with only 2 %–6 % of the cases left partially corrected. The methodology works well even if only one sub-mm channel (325 GHz) is available. When using sub-mm observations, cloud correction usually results in error distributions with a standard deviation less than typical channel noise values. Furthermore, QRNN outputs predicted quantiles for case-specific uncertainty estimates, successfully representing the uncertainty of cloud correction for each observation individually. In comparison to deterministic correction or filtering approaches, the corrected radiances and attendant uncertainty estimates have great potential to be used efficiently in assimilation systems due to being largely unbiased and adding little further uncertainty to the measurements.

Download & links

Article (PDF, 4229 KB)

Download & links

How to cite.

Received: 26 Nov 2020 – Discussion started: 23 Dec 2020 – Revised: 04 Mar 2021 – Accepted: 08 Mar 2021 – Published: 20 Apr 2021

1 Introduction

Satellite observations of humidity inside the troposphere are mainly performed by downward-looking sensors. Among this class of observations, the frequency range around 183 GHz has a special position. Water vapour has a noticeable transition at 22 GHz, but it is relatively weak and only column values can be derived (e.g. Schluessel and Emery, 1990) for the observation geometry of concern. The first transition in the microwave region that can be used to derive altitude information, i.e. “sounding”, is the one at 183 GHz (Kakar, 1983; Wang et al., 1983). On the other hand, at infrared wavelengths, a high number of water vapour transitions are found, including some of high strength. As a consequence, infrared sounders can provide humidity profiles with high precision and good vertical resolution but with strong limitations imposed by clouds. To be able to also sense humidity inside and below clouds, weather satellites have for some time been equipped with channels around 183 GHz. Today, such channels are part of several sensors, such as ATMS (Advanced Technology Microwave Sounder; Weng et al., 2012).

Although microwave channels are less affected by cloud contamination, precipitation and most dense clouds, particularly if found at a high altitude, can still affect measured radiances around 183 GHz (e.g. Bennartz and Bauer, 2003). As the impact from the hydrometeors then is dominated by scattering, the complexity of the analysis of the data increases dramatically and there exists a need to identify the problematic cases. This is normally denoted as cloud filtering, in order to obtain data of “clear-sky” character. Such filtering has been applied to derive climate records (Lang et al., 2020) and is essential in studies of the agreement between observations and simulations (Brogniez et al., 2016) as well as comparing observations of different instruments to validate their calibration (John et al., 2013; Moradi et al., 2015; Berg et al., 2016). Commonly used cloud filtering methods for these applications are based on 183 GHz data alone, involving rules on the brightness temperature differences between channels (Burns et al., 1997; Buehler et al., 2007).

Another motivation necessitating the need for cloud filtering is usage of 183 GHz channels in numerical weather prediction (NWP). Usage of passive microwave data by all-sky assimilation in global NWP is growing (Geer et al., 2017), but 183 GHz data are still mainly used in a clear-sky fashion (Geer et al., 2018). The latter is particularly true in NWP of regional scope (Gustafsson et al., 2018), with clear-sky assimilation of 183 GHz radiances still commonplace. Regardless, both clear-sky and all-sky assimilation require identification of cloud-affected observations, either to screen out these observations or to assign an appropriate observation error. The most commonly used cloud filtering techniques are the “scattering index” (Geer et al., 2014) and the “observation minus background” (O−B). The first one is based on brightness temperature differences between 89 and 150 GHz. In the second one, the forecast model is used to obtain an estimate of the expected clear-sky value and the observation is rejected if the deviation exceeds some threshold (English et al., 1999).

At 183 GHz, the impact of hydrometeors typically causes a decrease in the observed radiance due to scattering from ice hydrometeors (e.g. Barlakas and Eriksson, 2020). This implies that if any cloud contamination is missed by the filtering, a negative bias in the mean radiance, compared to the true clear-sky mean, may translate into a bias in humidity after the retrieval or assimilation. For NWP systems assimilating clear-sky observations, the effect of undetected clouds may be overcome by inflating the observational errors and diminishing the impact of observations. Furthermore, the mathematical assumptions of data assimilation (DA) are predicated on Gaussian errors with no mean bias, and residual cloud impacts that cause a net bias are not easily handled by variational bias correction. One solution is to apply a very strict filtering, but this increases the rejection of clear-sky values, i.e. an important loss of useful data. Another limitation of existing filtering approaches is their “one-for-all” approach; i.e. observations in all 183 GHz channels are either kept or rejected. This often rejects more observations than needed, as the channels differ in their altitude coverage. An observation could be cloudy in some channels and still be clear-sky in others. To allow a channel-specific filtering, data likely need to be combined in a more complex manner than simple differences, but it is unclear what type of regression would be best as the ideal solution would be scene-dependent. This points towards applying machine learning techniques (e.g. Favrichon et al., 2019). A maybe less obvious problem is the assignment of uncertainty to the filtered values. To our best knowledge, so far only estimates of mean and worst-case errors exist in the literature. Some cases with relatively high cloud impact will likely be missed, while most cases are clear-sky cases from the start. As the remaining cloudy cases can cause significant biases, the likely solution is to apply a quite conservative (high) error estimate. However, this will unnecessarily downgrade the value of the truly clear-sky cases and the observations are used in a non-optimal manner.

In this study, we approach the cloud filtering task from a new angle. The basic idea is to derive an estimate of the corresponding noise-free clear-sky (NFCS) value (i.e. the radiance that would have been measured in absence of noise and hydrometeors). This is done for each channel separately and by only using the measurements, although the scheme is demonstrated in the study by using simulated observations. Not only is a best estimate provided but also a case-specific uncertainty. This information could be used as a pure filter, by rejecting data where the correction exceeds some threshold value. However, it is even better to replace the original value with the predicted NFCS value when forming the clear-sky dataset. We denote this approach as cloud correction. It is shown below that a basically bias-free cloud correction can be obtained. This feature also removes the need for defining threshold values, as long as the retrieval or assimilation system can incorporate the uncertainty of the corrected value. As will also be shown, the uncertainty for originally clear-sky data is determined by noise, but the uncertainty increases with magnitude of correction. Accordingly, the cloud correction approach permits the full weight of clear-sky data to be preserved.

The proposed cloud correction scheme makes use of a quantile regression neural network (QRNN; Pfreundschuh et al., 2018) to obtain a probabilistic prediction of the NFCS value. Unlike traditional neural network techniques, which typically only provide a point estimate of the target variable, QRNNs are trained to predict an arbitrary set of quantiles of its Bayesian a posteriori distribution (Pfreundschuh et al., 2018). The predicted a posteriori distribution can then be used to derive an estimate of the NFCS value together with an estimate of the corresponding uncertainty.

The main focus of this study is the potential of this cloud correction method using sub-millimetre (sub-mm) observations, which will become available operationally with the launch of the Ice Cloud Imager (ICI; Eriksson et al., 2020) aboard the next generation of European Organisation for the Exploitation of Meteorological Satellites Polar System – Second Generation (EUMETSAT EPS-SG). Additionally, we demonstrate the feasibility of the approach based on 89 and 150 GHz channels (following Geer et al., 2014), which are available on several sensors extant today. The focus on sub-mm channels is motivated by several reasons. First, the higher frequencies are more sensitive to scattering effects from smaller hydrometeors and are thus expected to provide greater sensitivity to high-altitude cirrus clouds. For example, in some cloudy situations, the cloud impact at 183 GHz may be of the order of thermal noise and modelling uncertainties, while the impact at 325 GHz is significant enough to provide a sufficient signal-to-noise ratio for identifying clouds. Second, the proposed cloud correction method allows integration of ICI sub-mm observations in clear-sky DA schemes with no further modifications, thus providing a simple way to make use of this novel data source as soon as it becomes available.

A description of the data used in this study and the QRNN approach is provided in Sect. 2. In Sect. 3, we demonstrate the applicability of correction scheme to existing sensors, and later its application is extended to include sub-mm channels (Sect. 4). The results are discussed in Sect. 5, and Sect. 6 presents the conclusions from this work and the future outlook.

2 Data and methods

2.1 Satellite instruments

2.1.1 MicroWave Humidity Sounder 2

The MicroWave Humidity Sounder 2 (MWHS-2) is an instrument on two current satellites in the FengYun-3 series: FY-3C and FY-3D. MWHS-2 is a cross-track scanning microwave radiometer and measures 15 frequencies in the range 89–191 GHz. The 89 and 150 GHz frequencies are window channels, five humidity sounding channels are centred around 183 GHz, and eight temperature sounding channels are centred on the 118 GHz oxygen absorption line. The five humidity sounding channels are similar to ATMS. Observations from MWHS-2 are routinely assimilated in all-sky conditions at the European Centre for Medium-Range Weather Forecasts (ECMWF) with demonstrable positive impact on forecast performance (Duncan and Bormann, 2020). The channels relevant to this study are described in Table 1. It should be noted that the NEΔT values in the table are according to pre-launch specifications and not measured NEΔT values.

Table 1Specifications of MWHS-2 channels relevant to this study.

Download Print Version | Download XLSX

For the demonstration of the study, MWHS-2 simulations from the ECMWF model background are used. Actual measurements are not taken into account. The requisite data were obtained from ECMWF. More details are given in Sect. 2.2.

2.1.2 Ice Cloud Imager

The ICI is a new instrument aboard EPS-SG satellite MetOp-SG (Meteorological Operational satellite – Second Generation). MetOp-SG is scheduled for launch in 2024, and it will make ICI the first operational sensor observing Earth using sub-mm wavelengths. The main objective of ICI is to use high-frequency channels for measuring ice cloud properties and improve the representation of ice clouds in regional and global NWP models. ICI is a conically scanning radiometer that will measure 13 frequencies from 183 up to 664 GHz. Among all available channels, 183, 325 and 448 GHz will measure vertical polarization, while other channels around 243 and 664 GHz are “window channels” and will measure both vertical and horizontal polarization. The instrument will observe Earth from a mean altitude of 832 km with the sensor viewing angle of 44.767^∘ (measured from nadir). For all the channels, the mean footprint size is about 15 km, but the exact geolocation of samples differs. Therefore, a simultaneous utilization of data from different channels shall require remapping to a common footprint (Eriksson et al., 2020).

For this study, we conducted the forward simulations of the channels around 183, 325, 448 and 664 GHz (Table 2). For brevity, we assume that all simulations are mapped to a common footprint.

Table 2Specifications of ICI channels relevant to this study.

Download Print Version | Download XLSX

2.1.3 Small Microwave Satellite

The Small Microwave Satellite (SMS) is a hypothetical satellite which we introduce to represent the type of sensors currently being considered for future small missions carrying a single instrument. We assume it to be a single across-track scanning microwave radiometer. In this study, we assume five 183 GHz channels and four 325 GHz channels, and we just ignore whether the mission has additional channels at lower frequencies or not. A brief summary of the channel specifications assumed is provided in Table 3.

Table 3Specifications of SMS channels.

Download Print Version | Download XLSX

2.2 Simulations

MWHS-2 simulated radiances during the period June–July 2020 are sourced from ECMWF. In the current version of the ECMWF Integrated Forecasting System (IFS), cycle 47R1 (IFS, 2020), clear-sky and all-sky radiative transfer are performed simultaneously for monitoring purposes, despite all humidity sounders being assimilated via all-sky exclusively. These side-by-side radiative transfer calculations on a large variety of model scenes provide an ideal dataset for comparing radiances with and without cloud effects. Out of all the available observations during the period, we use data for the latitudinal range of 60^∘ S to 60^∘ N and satellite zenith angle of less than 7.5^∘. With this filter, we have approximately 290 000 cases. Figure 1 shows the histogram of background and bias-corrected observations for MWHS-2 channel 14. The part of the distribution matching clear-sky conditions shows good agreement between the background and the observations. The main deviations in the distributions arise from the hydrometeor scattering. With limited scope of particle size and shape variation in current NWP microphysical schemes, the true cloud variability in radiance space is likely underestimated, though this is but one factor among many when it comes to the challenge of modelling clouds.

https://amt.copernicus.org/articles/14/2957/2021/amt-14-2957-2021-f01

Figure 1Probability distribution functions (PDFs) of simulated and observed brightness temperatures for MWHS-2 channel 14. The data cover a latitude range from 60^∘ S to 60^∘ N and satellite zenith angle of less than 7.5^∘.

Can machine learning correct microwave humidity radiances for the influence of clouds?

2.1 Satellite instruments

2.1.1 MicroWave Humidity Sounder 2

2.1.2 Ice Cloud Imager

2.1.3 Small Microwave Satellite

2.2 Simulations

2.3 Quantile regression neural networks

2.3.1 QRNN model configurations

2.4 Evaluation metrics

3.1 Experiments

3.2 Prediction accuracy

3.2.1 QRNN-single applied to MWHS-2 channel 14

3.2.2 Comparison of QRNN-all and QRNN-single

3.2.3 QRNN-single applied to channels 11, 12, 13 and 15

3.3 Prediction uncertainty

4.1 Experiments

4.2 Prediction accuracy

4.2.1 ICI

4.2.2 SMS

4.3 Prediction uncertainty

5.1 Cloud correction with existing sensors

5.2 Cloud correction with sub-mm frequencies

5.3 Prediction uncertainty and implications for data assimilation