the Creative Commons Attribution 4.0 License.

the Creative Commons Attribution 4.0 License.

# A statistically optimal analysis of systematic differences between Aeolus horizontal line-of-sight winds and NOAA's Global Forecast System

### Ross N. Hoffman

### Katherine E. Lukens

The European Space Agency Aeolus mission launched a first-of-its-kind spaceborne Doppler wind lidar in August 2018. To optimize the assimilation of the Aeolus Level-2B (B10) horizontal line-of-sight (HLOS) winds, significant systematic differences between the observations and numerical weather prediction (NWP) background winds should be removed. Total least squares (TLS) regression is used to estimate speed-dependent systematic differences between the Aeolus HLOS winds and the National Oceanic and Atmospheric Administration (NOAA) Finite-Volume Cubed-Sphere Global Forecast System (FV3GFS) 6 h forecast winds. Unlike ordinary least squares regression, TLS regression optimally accounts for random errors in both predictors and predictands. Large, well-defined, speed-dependent systematic differences are found in the lower stratosphere and troposphere in the tropics and Southern Hemisphere. Correction of these systematic differences improves the forecast impact of Aeolus data assimilated into the NOAA global NWP system.

The spaceborne Doppler wind lidar onboard the European Space Agency (ESA) Aeolus mission measures both Mie (i.e., clouds and aerosols) and Rayleigh (i.e., molecular) backscatter to derive wind profiles along the sensor's horizontal line of sight (HLOS) throughout the troposphere and lower stratosphere (Straume-Lindner, 2018; Straume et al., 2020). The Aeolus HLOS Level-2B (L2B) winds have demonstrated positive impacts on global weather forecasts (Rennie et al., 2021; Cress, 2020; Garrett et al., 2020, 2022).

To optimize the positive impact of Aeolus HLOS winds on weather forecasts,
large systematic differences between Aeolus winds and numerical weather
prediction (NWP) model background winds should be corrected (Daley, 1991).
Therefore, it is important to identify potential systematic differences
between Aeolus winds and their NWP model background counterparts (Liu et
al., 2020, 2021). The systematic differences may come from both the NWP
model background and the Aeolus winds. First, current operational global NWP
background winds still have larger errors or uncertainty in regions where
conventional wind observations are sparse or absent. For example, the 6 h
forecast zonal winds from the ECMWF model (https://www.ecmwf.int/en/forecasts, last access: 19 December 2019) and the NOAA Finite-Volume Cubed-Sphere
Global Forecast System (FV3GFS) model (Kleist et al., 2021) show large
systematic differences in the upper troposphere and lower stratosphere of
the tropics, the Southern Hemisphere (SH), and poleward of 70^{∘} N,
with maxima of the order of 2.0, −0.5, and 0.5 m s^{−1}, respectively (Fig. 1). Such systematic differences in regions where conventional data are sparse may be due in part to differences in the assimilation of satellite radiances at the NWP centers. Second, although corrections to several substantial sources of systematic differences in the Aeolus HLOS winds (baseline B10) have been implemented, including corrections to the dark current signal anomalies of single pixels (so-called hot pixels) on the accumulation charge-coupled devices (ACCDs), to the linear drift in the illumination of the Mie and Rayleigh spectrometers, and to the telescope M1 mirror temperature variations (Reitebuch et al., 2020; Weiler et al., 2021), uncorrected systematic differences due to potential calibration issues might remain in Aeolus HLOS winds and may contribute to potential systematic differences between Aeolus and the NWP background HLOS winds. The residual systematic differences may lead to suboptimal assimilation of Aeolus HLOS winds in NWP systems.

For clarity in the remainder of this article, certain words and phrases are
assigned specific definitions. Thus, throughout this article, the phrase
“Aeolus winds” specifically means the observations of Aeolus Level-2B (B10) HLOS winds. Similarly, the phrase “FV3GFS winds” specifically means the numerical weather prediction (NWP) background HLOS winds evaluated from the FV3GFS 6 h forecasts at the observation location and time. (In discussions of winds that are not HLOS winds, terms like *u* wind, *v* wind, or wind vector are used.) Further, the phrase “Mie winds” specifically means Aeolus winds derived from Mie backscatter observations, and the phrase “Rayleigh winds” specifically means Aeolus winds derived from Rayleigh backscatter observations. Also, throughout this article, the word “innovations” without further qualification specifically refers to the differences between these Aeolus and FV3GFS winds, and the word “bias” (and the phrases “Mie bias” and “Rayleigh bias”) without further qualification specifically refers to the mean of these innovations, where the sample mean is over some specified space–time volume for either the Mie or Rayleigh winds.

Speed-dependent biases identified and estimated using ordinary least squares (OLS) are subject to contamination from random errors in Aeolus and/or FV3GFS winds (Frost and Thompson, 2000), since OLS assumes no errors in the predictor or independent variable, which in this case would be either the Aeolus or FV3GFS winds or a combination of the two. In contrast, total least squares (TLS) regression accounts for errors in both dependent and independent variables and generates a statistically optimal analysis of the biases (Deming, 1943; Ripley and Thompson, 1987; Markovsky and Van Huffel, 2007). For the case of Aeolus and FV3GFS winds, the use of linear TLS regression (Ripley and Thompson, 1987) finds an optimal estimate of the true (assumed linear) relationship between Aeolus and FV3GFS winds.

In this study, the TLS regression approach is used to estimate biases that depend linearly on wind speed. The suboptimality of OLS bias estimates is demonstrated by comparison to the TLS bias estimates, which are treated as the “truth” in this study. A bias correction based on the TLS bias analysis is proposed to optimize Aeolus wind assimilation by the FV3GFS model and thus improve the impact of Aeolus winds on FV3GFS forecasts. Section 2 describes the Aeolus and FV3GFS winds, the TLS bias analysis method, and the estimation of the ratio of error variances of Aeolus to FV3GFS winds, for which the ratio is used in the TLS regression. Section 3 describes the variations in the TLS bias estimates with height, latitude, and wind speed. Section 4 demonstrates the substantial differences between the TLS and OLS bias estimates. Section 5 proposes a TLS bias correction for Aeolus data assimilation. The forecast impact of the TLS bias correction is presented in Sect. 6. Section 7 presents a summary of the findings and conclusions.

## 2.1 Aeolus L2B and FV3GFS background wind data

The Aeolus L2B cloudy-sky Mie winds and clear-sky Rayleigh winds are examined for the period 1–7 September 2019. This 1-week period provides a sufficient sample to estimate the biases. The Aeolus winds were obtained from the Aeolus dataset (baseline B10) reprocessed by ESA (Rennie et al., 2021, Weiler et al., 2021). The reprocessing includes the M1 bias correction, which removes most of the globally and vertically averaged biases of both Mie and Rayleigh winds (Weiler et al., 2021). The Aeolus winds are reported at a standard set of vertical layers (de Kloe, 2020). This study examines Mie and Rayleigh winds within height ranges of 0–22 km that include nearly all Aeolus winds. The height is defined relative to the EGM96 geoid for the L2B winds (Tan et al., 2008).

The Aeolus and FV3GFS winds are obtained from a data assimilation experiment
(hereafter the BASE experiment), where the Aeolus winds are monitored, and the Aeolus wind observation operator (*H*_{i}) is applied to the
FV3GFS background (*x*^{b}) to obtain the value of FV3GFS wind (${y}_{i}^{\mathrm{b}}={H}_{i}({x}^{\mathrm{b}}$)) corresponding to each Aeolus wind $\left({y}_{i}^{\mathrm{o}}\right)$. This experiment employs the FV3GFS data assimilation system, called Global Statistical Interpolation (GSI; Kleist et al., 2009), configured for the 4DEnVar algorithm, with 64 vertical levels and horizontal resolutions of C384 (∼25 km) for the deterministic analysis and forecast and C192 (∼50 km) for the 80 ensemble members (Wang and Lei, 2014).

Similar Aeolus data quality control procedures, as recommended by ESA and
ECMWF (Rennie et al., 2021), were implemented to reject the following
observations: the HLOS L2B confidence flag “invalid”, Rayleigh winds at layers below 850 hPa, L2B uncertainties greater than 12 m s^{−1}, accumulation lengths less than 60 km, and atmospheric pressure within 20 hPa of topographic surface pressure, and Mie winds with L2B uncertainties greater than 5 m s^{−1} and accumulation lengths less than 5 km. Further, a standard outlier check rejects any Aeolus wind for which $|{y}_{i}^{\mathrm{o}}-{y}_{i}^{\mathrm{b}}|$ is greater than 4 times the estimated errors for Aeolus winds prescribed by the data assimilation system.

When examining Aeolus wind statistics, we stratify the Aeolus data by
orbital phase, either ascending when the spacecraft is moving northward or
descending when the spacecraft is moving southward. The vertical and daily
variations in Mie and Rayleigh biases for global horizontal samples are
consistent throughout the period (Fig. 2). For ascending orbits, the Mie
biases are positive above 6 km and negative below 6 km and are as large as
+1.8 and −0.5 m s^{−1}, respectively. The Mie biases are smaller and
positive at most levels in descending orbits. In descending orbits, the
Rayleigh biases are as positive as +1.2 m s^{−1} above 10 km and as negative as −1.2 m s^{−1} below 8 km. The positive biases in ascending orbits are smaller. The results indicate that the biases vary substantially with height and orbit phase for both Mie and Rayleigh winds. The Mie and Rayleigh biases also vary considerably with latitude (Fig. 3). Mie biases are as positive as +1.5 m s^{−1} in the upper troposphere, and Rayleigh biases are as positive as +2.0 m s^{−1} in the tropical upper troposphere. Both Mie and Rayleigh biases are as negative as −1.0 m s^{−1} in the lowest layers.

The statistical relationship between Aeolus and FV3GFS winds is illustrated by the density plots in Fig. 4. There is a strong correlation of 0.93 between Mie and FV3GFS winds and of 0.96 between Rayleigh and FV3GFS winds. The average and OLS regression of the innovations as a function of Aeolus wind suggest considerable speed-dependent biases with both linear and nonlinear components (Fig. 5). In this study, we focus on the estimation and correction of the linear part of the biases using the TLS linear regression.

## 2.2 TLS linear regression

In this section, we review the TLS linear regression method (Ripley and Thompson, 1987) in the context of estimating potential speed-dependent biases. The TLS estimate for each collocated pair of Aeolus and FV3GFS winds (${y}_{i}^{\mathrm{o}}$, ${y}_{i}^{\mathrm{b}})$ is defined by the following:

where ${\widehat{y}}_{i}^{\mathrm{o}}$ and ${\widehat{y}}_{i}^{\mathrm{b}}$ are the TLS estimates of the true Aeolus and FV3GFS winds, ${\mathit{\epsilon}}_{i}^{\mathrm{o}}$ and ${\mathit{\epsilon}}_{i}^{\mathrm{b}}$ are random errors, and *N* is the number of Aeolus/FV3GFS wind collocations in the sample. The sample might be defined by a vertical layer or a latitude band. In OLS regression, since it is assumed that there are no errors in the predictor, the predictor can be used directly to estimate the predictand. The situation is a little more complicated in TLS regression, where (${\widehat{y}}_{i}^{\mathrm{b}}$, ${\widehat{y}}_{i}^{\mathrm{o}})$, the most probable true state, is the point on the regression line that is closest in a statistical sense to the point $({y}_{i}^{\mathrm{b}}$, ${y}_{i}^{\mathrm{o}})$.

Here it is assumed that ${\mathit{\epsilon}}_{i}^{\mathrm{o}}$ and ${\mathit{\epsilon}}_{i}^{\mathrm{b}}$ are independent and that the random error variance ratio $\mathit{\delta}=\left({\mathit{\sigma}}^{\mathrm{o}}/{\mathit{\sigma}}^{\mathrm{b}}\right){}^{\mathrm{2}}=E\left[{\mathit{\epsilon}}_{i}^{\mathrm{o}}{\mathit{\epsilon}}_{i}^{\mathrm{o}}\right]/E\left[{\mathit{\epsilon}}_{i}^{\mathrm{b}}{\mathit{\epsilon}}_{i}^{\mathrm{b}}\right]$ is known. The error variance ratio *δ* is a crucial parameter in determining the TLS bias analysis and is estimated as described in the next section. Furthermore, the true relationship between the Aeolus and FV3GFS winds is assumed to be described by a linear function (as seen in
Fig. 5) as follows:

where *c*_{0} is an offset or constant coefficient, and *c*_{1} is a
speed-dependent coefficient.

The TLS regression finds an optimal estimate of the ${\widehat{y}}_{i}^{\mathrm{b}}$,
*c*_{0}, and *c*_{1} by minimizing the cost function *J*, as follows:

To determine the ${\widehat{y}}_{i}^{\mathrm{b}}$, the derivative of *J* with respect to ${\widehat{y}}_{i}^{\mathrm{b}}$ is set to zero, resulting in the following:

Equation (4) thereby reduces the problem to a minimization in terms of *c*_{0} and *c*_{1}. A similar equation holds even if the error variances vary with *i*, but then there is no closed form solution for *c*_{0} and *c*_{1} as there is in the current case, which is known as the Deming problem (Ripley and Thompson, 1987). When the coefficients *c*_{0} and *c*_{1} are obtained, the TLS estimate for the new or within-sample observation is given by Eq. (4). Finally, the estimate of the bias for the *k*th observation, either for a new or within-sample observation, is given by the following:

Given the form of Eq. (5), we will refer to *c*_{0} and (*c*_{1}−1) as the offset and speed-dependent bias coefficients, respectively, hereafter.

## 2.3 Estimation of the random error variance ratio

In this study, errors of Aeolus winds are estimated by the Hollingsworth–Lonnberg method (Hollingsworth and Lonnberg, 1986; Garrett et al., 2022), which include Aeolus instrument errors and forward modeling error and representativeness errors of the FV3GFS background at the specific 25 km horizontal resolution. The random error variance ratio $\mathit{\delta}=({\mathit{\sigma}}^{\mathrm{o}}/{\mathit{\sigma}}^{\mathrm{b}}{)}^{\mathrm{2}}$ in the TLS bias analysis is estimated from the innovations from the BASE experiment for 1–7 September 2019. It is assumed that there are no correlations between the random errors of the Aeolus and FV3GFS winds and no horizontal correlations between the random errors of Aeolus winds separated by more than 90 km. These assumptions are justified a posteriori by the reasonable error estimate of FV3GFS background winds (Garrett et al., 2022).

Global error estimates are calculated for all Mie and Rayleigh winds in each
layer as follows. First, the spatial covariance of the innovations is
calculated. Since these are innovations from the BASE experiment where
Aeolus data are not assimilated, it is reasonable to assume that the Aeolus
and FV3GFS wind errors are uncorrelated. Then the spatial covariance of the
innovations, (*σ*^{o−b})^{2}, at zero separation distance,
is equal to the following:

where *σ*^{o} and *σ*^{b} are the
random error standard deviations of Aeolus and FV3GFS winds, respectively.

By assumption, at separation distances greater than 90 km, the innovation
covariances are estimates of the FV3GFS wind error covariance alone and can
be extrapolated back to zero separation to obtain an estimate of the error
variance of the FV3GFS winds, (*σ*^{b})^{2}, and then,
using Eq. (6), the error variance of the Aeolus winds, (*σ*^{o})^{2}, may be determined. Note that this can only be done
using innovation covariances at separation distances large enough to have
negligible covariances between the Aeolus winds. Since the calculated
innovation covariances are globally averaged over all HLOS winds, it is not
surprising that the corresponding biases are small. The small residual
biases in the innovations may introduce small (<0.1) spurious spatial correlations. This spurious correlation, taken as the value calculated for the last bin (at 990 km), is removed from the correlation curves at all separation distances. The estimated random error variance ratio *δ* is assigned to the layer center height, defined as the global average heights of the Mie and Rayleigh wind in each vertical range bin. Figure 6 shows that the vertical profiles of the square root of *δ* vary in the range of 1.2–1.6 for Mie winds versus FV3GFS winds and 2–3 for Rayleigh winds versus FV3GFS winds, respectively.

In the future, we plan to explore the benefit of the scene-dependent L2B estimated errors on the TLS bias estimates and Aeolus wind assimilation.

In this section, variations in the TLS bias estimates with orbital phase and height are examined to motivate the use of a TLS bias correction scheme proposed in Sect. 5.

## 3.1 Variation in TLS bias estimates with height

The variation in the TLS solution with height and orbital phase is described
here. The TLS samples include winds at all latitudes in each layer. The
vertical distribution of the TLS constant and speed-dependent bias analysis
coefficients in Eq. (5) is shown in Fig. 7. The speed-dependent bias
coefficient (*c*_{1}−1) varies substantially with height and orbital phase.
For Mie winds, this coefficient is quite large at most heights, ranging from
3 % to 6 %, with maxima at 3 and 12–16 km. For Rayleigh winds, this
coefficient is smaller and ranges from 1 % to 3 % in ascending orbits
and 1 %–5 % in descending orbits, with maxima around 3.5 and 16 km.

The offset bias coefficient *c*_{0} for both Mie and Rayleigh winds also shows large variations with height and orbit, with its value as large as ±1.0 m s^{−1}. In general, the offset bias coefficient
is positive in upper layers and negative in layers close to the Earth's
surface, consistent with the patterns seen in the global horizontal average
of the innovations in Fig. 2. The vertical distribution of the average TLS
bias estimate as a function of Aeolus wind is shown in Fig. 8. The biases
vary substantially with height. Since the TLS biases are in part dependent
on speed, at most heights the biases increase substantially as the magnitude
of Aeolus wind speed increases. The biases at the extreme Aeolus wind speeds
are as large as +2.5 and −1.0 m s^{−1} for Mie winds and +1.5 and
−1.0 m s^{−1} for Rayleigh winds. There are clear speed-dependent biases in the vertical average of these biases as well (Fig. 9). The results suggest that the innovations have both vertically varying and vertically averaged speed-dependent biases.

## 3.2 Variation in biases with latitude

The variation in the TLS solution with latitude and orbital phase is
described here. For this purpose, the samples include all heights in each
10^{∘} latitude band, and the vertical average of the error ratio
*δ* is used. In general, the bias coefficients obtained are
large and vary considerably with latitude and orbital phase, with maxima
found in the tropics (Fig. 10). For example, the speed-dependent bias
coefficient (*c*_{1}−1) for Mie winds in the tropics can be quite large,
ranging up to a maximum of 11 %. This coefficient is smaller for Rayleigh
winds, ranging from −1 % to 5 %, with maxima found in the tropics. The offset bias coefficient *c*_{0} for Mie winds also varies considerably
with latitude and orbit, ranging from −1.0 to +1.6 m s^{−1}. The offset bias coefficient *c*_{0} is smaller for Rayleigh winds.

The latitudinal distribution of the average TLS bias as a function of Aeolus
wind speed is shown in Fig. 11. For both Mie and Rayleigh winds, the average
TLS biases increase considerably at most latitudes as the magnitude of
Aeolus wind speed increases, particularly in the tropics and SH, with
extreme values of about ±1.5 m s^{−1}.

## 3.3 Discussion

The results indicate that the speed-dependent bias coefficient (*c*_{1}−1)
is quite large, reaching ∼10 % and 5 % for Mie and
Rayleigh winds, respectively, particularly in the lower stratosphere and
lower troposphere of the tropics. This suggests that there exist large
speed-dependent biases in the FV3GFS and/or Aeolus winds. Given that there
exist large uncertainties in the FV3GFS (and ECMWF) background winds in the
tropics (see Fig. 1), it is likely that the FV3GFS background may be a
significant source of the biases, and this will require further investigation. In any case, these large speed-dependent biases should be
corrected to optimize Aeolus wind assimilation and the impact of Aeolus
winds on NWP forecasts. The large variations in the TLS bias estimates with
latitude and height guide the design of the proposed TLS bias correction in
Sect. 5.

Parallel OLS regressions using three different predictors of the biases are
compared with the TLS bias estimate results presented in Sect. 3. The OLS
predictors are the FV3GFS winds, the Aeolus winds, and their average. The
first two of these OLS regressions are equivalent to OLS regressing Aeolus
winds on FV3GFS winds and OLS regressing FV3GFS winds on Aeolus winds. The
regression lines of these two cases are added to Fig. 4. The TLS speed-dependent coefficient (*c*_{1}−1) (in Eq. 5) is 6 % and 4 % for
Mie and Rayleigh winds, respectively. However, the OLS regression of Aeolus
winds on FV3GFS winds produces considerably smaller bias estimates, with
(*c*_{1}−1) estimated as 1 % and 2 % for Mie and Rayleigh winds, respectively. On the other hand, the OLS regression of the FV3GFS winds on Aeolus winds exhibits much larger bias estimates relative to the TLS bias analysis, with (*c*_{1}−1) estimated as 18 % and 15 % for Mie and Rayleigh winds, respectively.

The vertical distributions of the average biases as a function of Aeolus
winds are shown in Fig. 12 for the descending orbits for the following three methods: (1) OLS regression using FV3GFS winds as a predictor (top row), (2) TLS regression (middle row, which repeats the bottom two panels of Fig. 8), and (3) OLS regression using the average of FV3GFS and Aeolus as a predictor (bottom row). The average bias estimates in the top panels are about 0.5 m s^{−1} smaller in magnitude in most layers compared to the middle panels. The average biases in the bottom panels are about 0.5–1.0 m s^{−1} in magnitude larger than the middle panels in most layers, particularly for Rayleigh winds. The bias estimates of OLS regression using Aeolus winds only as a predictor (not shown) are even larger than what is shown in the bottom panels. The large differences in the bias estimates using the TLS and OLS regression are due to the fact that both Aeolus and FV3GFS winds have large errors. If the predictor (either Aeolus or FV3GFS winds) has very small errors, then the OLS regressions would be close to perfect, and the OLS and TLS regressions would give very similar results. In such situation, the random error ratio would be either infinity small (≪1) or infinity large (≫1). However, the Aeolus and FV3GFS winds have considerable errors, and the actual random error ratio is about 2–3 for the Rayleigh winds versus FV3GFS winds and about 1.2–1.5 for the Mie winds versus FV3GFS winds (Fig. 6). This leads to the large differences in the OLS and TLS bias estimates. Specifically, the OLS bias estimates using Aeolus winds as a predictor have larger differences from the TLS estimates than the OLS estimates using FV3GFS winds as a predictor.

In this section, a TLS bias correction is proposed to optimize Aeolus wind
data assimilation. Because the findings in Sect. 3 show substantial variation in the bias coefficients with latitude, vertical layer, and orbital phase, the TLS bias coefficients are calculated from the winds in 19 discrete bins of latitude (centered every 10^{∘} between 90^{∘} S to 90^{∘} N) for each vertical range/layer and for ascending and descending orbits separately. The error ratio *δ* shown in Fig. 6 is used in all latitude bands for each layer. For each assimilation cycle, the bias coefficients are computed by TLS regression for the innovations in the week before the cycle (i.e., for the previous 28 cycles). The period of 1 week provides a large enough sample for the regression. As shown by Ripley and Thompson (1987), the TLS solution only involves solving a quadratic equation with coefficients given by sample sums. Therefore, an efficient approach is to calculate and save these sums for every cycle and accumulate them over the 28 cycles. For each of the innovations in the assimilation cycle, values of the TLS regression coefficients *c*_{0} and *c*_{1} are linearly interpolated to the latitude of the Aeolus observation. Subsequently, the TLS estimated bias, calculated using Eq. (5), is subtracted from the innovation. Note that the bias correction is determined by the TLS analysis solution for ${\widehat{y}}_{k}^{\mathrm{b}}$ that, in turn, is determined from the observation and background wind, ${y}_{k}^{\mathrm{o}}$ and ${y}_{k}^{\mathrm{b}}$, following Eq. (4).

The proposed scheme is applied to the Aeolus and FV3GFS winds of the BASE
experiment. As expected, the corresponding TLS bias estimates show
considerable speed-dependent biases. For example, in the bins centered at
the Equator and 80^{∘} S, where the speed-dependent biases are
expected to be largest based on Fig. 9, the TLS bias estimates vary
considerably with speed and are in some cases larger in magnitude than 1.5 m s^{−1} at higher Aeolus wind magnitudes (Fig. 13).

The vertical distribution of the global average of the remaining biases (i.e., after TLS bias correction) as a function of Aeolus wind is shown in Fig. 14, which is in the same format and for the same sample of observations as Fig. 8. A comparison of these two figures reveals that most of the biases are removed by the proposed TLS bias correction. The latitudinal variations in the biases are also corrected (Fig. 15). In addition, the biases in the vertical average are also mostly removed, as shown in Fig. 9.

Several observing system experiments (OSEs) using the NOAA global data assimilation system are performed using the Aeolus winds with and without the TLS bias correction. For the period of 2 August–16 September 2019, Garrett et al. (2022) demonstrate the positive impact of Aeolus winds on the NOAA global forecast. The largest impact is seen in the tropical upper troposphere and lower stratosphere where the day 1–3 wind vector forecast root mean square error (RMSE) is reduced by up to 4 %. Specifically, the assimilation of Aeolus impacts the steering currents ambient to tropical cyclones, resulting in up to a 20 % reduction in track forecast error in the eastern Pacific and Atlantic basins. The application of TLS bias correction increases the positive impact of Aeolus data assimilation on the forecasts.

OSE results for a 2019 record-breaking winter storm case over the USA are
reported here. On 26 November 2019, one major storm approached the West
Coast of the USA from the eastern Pacific and produced a record-breaking low
pressure of 973 hPa and wind gust of 171 km h^{−1} near the Oregon/California border. Over the next few days, the low merged with the subtropical jet as it tracked eastward across the USA. The combination of cold air, moisture, and high winds produced snow blizzard conditions across the USA.

As in Garrett et al. (2022), the OSEs include the baseline experiment (BASE) without the assimilation of Aeolus winds, the experiment AEOM that is identical to BASE except that Aeolus winds are assimilated, and the experiment AEOT hat is identical to AEOM, except that it also includes the TLS bias correction. A difference summary assessment metric (SAM; Hoffman et al., 2018) is computed for day 1–7 forecasts in the North American (NA) region of the experiments validated at 00:00 UTC on 22–28 November 2019. The SAM illustrates the overall forecast skill by normalizing the AC and RMSE values for each parameter (temperature, geopotential height, wind, and relative humidity) and each lead time. Figure 16 shows that the TLS bias correction improves the impact of Aeolus winds on the forecasts of wind, temperature, and geopotential height for day 3–7 and especially for day 5–7 lead times. The overall improvement of Aeolus winds for AEOM and AEOT is about 4 % and 10 %, respectively (above the 95 % significance level, Fig. 16c), illustrating the usefulness of the TLS bias correction.

The vertically integrated water vapor transport (IVT) is a useful metric in forecasting precipitation associated with winter storms (e.g., Lavers et al., 2017). The IVTs of the day 7 forecast for the experiments validated for 00:00 UTC on 27 and 28 November are shown in Fig. 17. Aeolus winds have a strong impact on the locations and intensities of the IVT maxima near the USA West Coast and in the Midwest. In general, the IVTs are closer to the ECMWF analyses in AEOT than in AEOM. As a result, Aeolus winds show strong impact on the locations and corresponding amounts of precipitation, as seen in Fig. 18 and quantified by the equitable threat and BIAS skill scores (https://www.wpc.ncep.noaa.gov/rgnscr/verify.html, last access: 15 January 2022, Wang, 2014), respectively (Fig. 19). Specifically, the precipitation amounts near the West Coast and the Midwest are much less in AEOT than in BASE and AEOM. The precipitation in the Midwest also shifts eastward in AEOT, compared to BASE and AEOM (Fig. 18). The precipitation forecast skills (verified against NCEP precipitation rain gauge data analyses) over the contiguous United States (CONUS) region, that is, the equitable threat (location) and BIAS (amount) scores are shown in Fig. 19. The precipitation amount is overpredicted (BIAS score >1.0) in both BASE and AEOM but is closer to the analysis (BIAS score closer to 1.0) in AEOT. The equitable threat is larger (with marginal significance level; Fig. 19c) in AEOT than in BASE and AEOM, indicating that the location of precipitation in the forecast is improved in AEOT. These results suggest the potential benefit of the TLS bias correction to precipitation forecasts.

In this study, a TLS linear regression is used to optimally estimate
speed-dependent linear biases in the Aeolus innovations. The Aeolus and
FV3GFS winds for 1–7 September 2019 are analyzed. Clear speed-dependent
linear biases for both Mie and Rayleigh winds are found, particularly in the
lower troposphere and stratosphere of the tropics and Southern Hemisphere.
The largest biases are about 10 % and 5 % of FV3GFS wind speed and are as large as ±2.5 and ±1.5 m s^{−1} at high Aeolus wind magnitudes for Mie and Rayleigh winds, respectively.

It is found that the TLS linear bias estimates are considerably larger than the OLS regression of Aeolus innovations on FV3GFS winds. However, they are much smaller than the OLS regression on both Aeolus winds only and on the average of Aeolus and FV3GFS winds. This is more evident for the Rayleigh winds.

The proposed TLS bias correction removes much of the biases in the innovations before Aeolus wind assimilation. In a companion paper, Garrett et al. (2022) demonstrate that the application of this TLS bias correction considerably enhances the positive impact of Aeolus winds on NOAA FV3GFS global and tropical cyclone forecasts for the period of 2 August to 15 September 2019. In this study, it is also demonstrated that the application of the TLS bias correction improves the impact of Aeolus winds on the forecast of a record-breaking 2019 winter storm, including the associated precipitation over the USA. It is expected that the application of the TLS bias correction can improve and enhance Aeolus data impacts on the analysis and forecast skill of other NWP systems. It should be noted that the proposed TLS approach presented here might be applied to other types of observations that have errors typically characterized as a percentage of the observed value, including quantities related to the concentrations or mass fractions of chemical species or hydrometeors or quantities like radio occultation refractivity and bending angle.

The Aeolus data assimilation code with NOAA GSI data assimilation system is publicly available from https://essic.umd.edu/joom2/index.php/faculty-and-staff?layout=user&user_id=1020&dir=JSROOT%2Fhliu6/CODE (Liu et al., 2022).

The Aeolus L2B Earth Explorer data used in this study are publicly available and can be accessed via the ESA Aeolus Online Dissemination System (https://aeolus-ds.eo.esa.int/oads/access/; European Space Agency, 2020). The data related to the Aeolus assimilation experiment outputs are not publicly available due to the huge volume of data. We will try to provide access to the data upon request.

KG and KI proposed the project as co-investigators thereby acquiring funding for the project. KG and KI provided the expertise and project management that guided this work. KG and HL designed and interpreted the Aeolus assimilation experiments. HL and RNH developed the TLS bias correction. HL wrote the initial manuscript, RNH and KEL helped him to revise and edit the manuscript, and all the authors reviewed the manuscript.

The contact author has declared that none of the authors has any competing interests.

The scientific results and conclusions, as well as any views or opinions expressed herein, are those of the author(s) and do not necessarily reflect those of NOAA or the U.S. Department of Commerce.

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the special issue “Aeolus data and their application (AMT/ACP/WCD inter-journal SI)”. It is not associated with a conference.

The authors thank the two anonymous reviewers, for their careful and helpful reviews. This work has been supported by the NOAA/NESDIS Office of Projects, Planning, and Acquisition (OPPA) Technology Maturation Program (TMP), managed by Patricia Weir and Nai-Yu Wang. The authors would like to acknowledge Michael Rennie and Lars Isaksen (ECMWF), for their comments and suggestions on the assimilation of Aeolus observations, and William McCarty with NASA/GMAO, for providing earlier versions of the GSI with Aeolus ingest and observation operator capability. The Aeolus L2B BUFR data were provided by ECMWF.

This research has been supported by the National Oceanic and Atmospheric Administration, National Environmental Satellite, Data, and Information Service (grant nos. NA14NES4320003 and NA19NES4320002).

This paper was edited by Ad Stoffelen and reviewed by two anonymous referees.

Cress, A.: Validation and impact assessment of Aeolus observations in the DWD modeling system, Status report, Aeolus NWP Impacts Working Meeting, June 2020, https://www.aeolus.esa.int/confluence/display/CALVAL/Aeolus+NWP+impact+working+meeting+2?preview=/12354328/12354463/5_DWD_acress_aeolus_20200617.pdf. (last access: 2 January 2022), 2020.

Daley, R.: Atmospheric data analysis, Cambridge Atmospheric and Space Science series, Cambridge University Press, Cambridge, 457 pp., ISBN-13: 978-0521458252, 1991.

de Kloe, J., Rennie, M., Stoffelen, A., Tan, D., Anderson, E., Dabas, A., Poli, P., and Huber, D.: Aeolus Data Innovation Science Cluster DISC ADM-Aeolus Level-2B/2C Processor Input/Output Data Definitions Interface Control Document, KNMI, Aeolus, DISC, Tech. rep., REF: AED-SD-ECMWF-L2B-037, https://earth.esa.int/eogateway/documents/20142/37627/Aeolus-L2B-2C-Input-Output-DD-ICD.pdf (last access: 2 January 2022), 2020.

Deming, W. E.: Statistical adjustment of data, Wiley, NY, 288 pp., ISBN: 0-486-64685-8, 1943.

European Space Agency (ESA): Aeolus L2B Earth Explorer data set, ESA [data set], https://aeolus-ds.eo.esa.int/oads/access/ (last access: 25 February 2021), 2020.

Frost, C. and Thompson S.: Correcting for regression dilution bias: comparison of methods for a single predictor variable, J. R. Stat. Soc. A, 163, 173–190, https://doi.org/10.1111/1467-985X.00164, 2000.

Garrett, K., Liu, H., Ide, K., Lukens, K., and Cucurull, L.: Updates to Aeolus Impact Assessment on NOAA global NWP, 2nd ESA Aeolus Cal/Val and Science Workshop, Virtual, 2–6 November 2020, https://www.dropbox.com/s/cd0r1gz7t77gq0g/Kevin_Garrett_Oral_Evaluation_of_Aeolus.pptx (last access: 2 January 2022), 2020.

Garrett, K., Liu, H., Ide, K., Hoffman, R. N., and Lukens, K. E.: Optimization and Impact Assessment of Aeolus HLOS Wind Data Assimilation in NOAA's Global Forecast System, Q. J. R. Meteorol. Soc., https://doi.org/10.1002/qj.4331, 2022.

Hoffman, R. N., Kumar, K., Boukabara, S., Yang, F., and Atlas, R.: Progress in Forecast Skill at Three Leading Global Operational NWP Centers during 2015–17 as Seen in Summary Assessment Metrics (SAMs), Wea. Forecasting, 33, 1661–1679, https://doi.org/10.1175/WAF-D-18-0117.1, 2018.

Hollingsworth, A. and Lonnberg, P.: The statistical structure of short-range forecast errors as determined from radiosonde data – Part I: The wind field, Tellus, 38A, 111–136, https://doi.org/10.3402/tellusa.v38i2.11707, 1986

Kleist, D. T., Parrish, D.-F., Derber, J.-C., Treadon, R., Wu, W.-S., and Lord, S.: Introduction of the GSI into the NCEP Global Data Assimilation System, Wea. Forecasting, 24, 1691–1705, https://doi.org/10.1175/2009WAF2222201.1, 2009.

Kleist, D. T., Treadon, R., Thomas, C., Liu, H., Bathmann, K., Merkova, D., Martin, C.-R., Shao, H., Cucurul, L., Yoe, J., and Tallapragada, V.: NCEP Operational Global Data Assimilation Upgrades: From Versions 15 through 16, Special Symposium on Global and Mesoscale Models, 14 January 2021, Amer. Meteor. Soc., 12.3, https://ams.confex.com/ams/101ANNUAL/meetingapp.cgi/Paper/378554, (last access: 2 January 2022), 2021.

Lavers, D. A., Zsoter E., Richardson D. S., and Pappenberger L.: An Assessment of the ECMWF Extreme Forecast Index for Water Vapor Transport during Boreal Winter, Weather Forecast, 32, 1667–1674, https://doi.org/10.1175/WAF-D-17-0073.1, 2017.

Liu, H., Garrett, K. Ide, K., Hoffman, R. N., and Lukens, K. E.: Bias correction and Error Specification of Aeolus Winds for NOAA Global Data Assimilation System, 2nd ESA Aeolus CAL/VAL and Science Workshop, Virtual, 2–6 November 2020, https://www.dropbox.com/s/f518n7n8ouhgwhy/Hui_LIU_Flash_Evaluation_update.pdf? (last access: 2 January 2022), 2020.

Liu, H., Garrett, K., Ide, K., Hoffman, R. N., and Lukens, K. E.: Impact Assessment of Aeolus Winds on NOAA Global Forecast, European Geophysical Union general assembly, Virtual, 19–30 April 2021, https://meetingorganizer.copernicus.org/EGU21/session/40837 (last access: 2 January 2022), 2021.

Liu, H., Garrett, K., Ide, K., and Hoffman, R.-N.: Preliminary code of Aeolus wind assimilation with NOAA GSI, Earth Sustem Science Interdisciplinary Center [code], https://essic.umd.edu/joom2/index.php/faculty-and-staff?layout=user&user_id=1020&dir=JSROOT%2Fhliu6/CODE, last access: 30 June 2022.

Markovsky, I. and Van Huffel, S.: Overview of total least squares methods, Signal Process., 87, 2283–2302, https://doi.org/10.1016/j.sigpro.2007.04.004, 2007.

Reitebuch, O., Bracci, F., and Lux, O.: Assessment of the Aeolus performance and bias correction – Results from the Aeolus DISC, 2nd Aeolus Cal/Val Workshop, Virtual, November 2020, https://www.dropbox.com/s/m3kjp540otwm17l/Oliver_Reitebuch_Oral_Assessment-Aeolus-DISC.pdf (last access: 2 January 2022), 2020.

Rennie, M. P., Isaksen, L., Weiler, F., de Kloe, J., Kanitz, T., and Reitebuch, O.: The impact of Aeolus wind retrievals on ECMWF global weather forecasts, Q. J. R. Meteorol. Soc., 147, 3555–3586, https://doi.org/10.1002/qj.4142, 2021.

Ripley, B. D. and Thompson, M.: Regression techniques for the detection of analytical bias, Analyst, 112, 377–383, https://doi.org/10.1039/AN9871200377, 1987.

Straume, A. G., Rennie, M., Isaksen, L., de Kloe, J., Marseille, G.-J., Stoffelen, A., Flament, T., Stieglitz, H., Dabas, A., Huber, D., Reitebuch, O., Lemmerz, C., Lux, O., Marksteiner, U., Weiler, F., Witschas, B., Meringer, M., Schmidt, K., Nikolaus, I., Geiss, A., Flamant, P., Kanitz, T., Wernham, D., von Bismarck, J., Bley, S., Fehr, T., Floberghagen, R., and Parinello, T. : ESA's Space-Based Doppler Wind Lidar Mission Aeolus First Wind and Aerosol Product Assessment Results, edited by: Liu, D., Wang, Y., Wu, Y., Gross, B., and Moshary, F., EPJ Web of Conferences, 7 July 2020, 237, 01007, https://doi.org/10.1051/epjconf/202023701007, 2020.

Straume-Lindner, A. G.: Aeolus Sensor and Product Description, European Space Agency – European Space Research and Technology Centre, The Netherlands, Tech. rep., REF: AE-SU-ESA-GS-000, https://earth.esa.int/eogateway/documents/20142/37627/Aeolus-Sensor-and-Product-Description.pdf (last access: 2 January 2022), 2018.

Tan, D. G. H., Adnderson, E., de Kloe, J., Marseille, G.-J., Stoffelen, A., Poli, P., Denneulin, M.-D., Dabas, A., Huber, D., Reitebuch, O., Flamant, M., Rille, O.-R., and Nett, H.: The ADM-Aeolus wind retrieval algorithms, Tellus A, 60, 191–205, https://doi.org/10.1111/j.1600-0870.2007.00285.x, 2008.

Wang, C.-C.: On the Calculation and Correction of Equitable Threat Score for Model Quantitative Precipitation Forecasts for Small Verification Areas: The Example of Taiwan, Weather Forecast., 29, 788–798, https://doi.org/10.1175/WAF-D-13-00087.1, 2014.

Wang, X. and Lei, T.: GSI-Based Four-Dimensional Ensemble–Variational (4DEnsVar) Data Assimilation: Formulation and Single-Resolution Experiments with Real Data for NCEP Global Forecast System, Mon. Wea. Rev., 142, 3303–3325, https://doi.org/10.1175/MWR-D-13-00303.1, 2014.

Weiler, F., Rennie, M., Kanitz, T., Isaksen, L., Checa, E., de Kloe, J., Okunde, N., and Reitebuch, O.: Correction of wind bias for the lidar on board Aeolus using telescope temperatures, Atmos. Meas. Tech., 14, 7167–7185, https://doi.org/10.5194/amt-14-7167-2021, 2021.

- Abstract
- Introduction
- Data and methodology
- The TLS bias estimates
- Comparison to OLS regressions
- A TLS bias correction
- Impact of the TLS bias correction on forecast skill
- Summary and conclusions
- Code availability
- Data availability
- Author contributions
- Competing interests
- Disclaimer
- Special issue statement
- Acknowledgements
- Financial support
- Review statement
- References

- Abstract
- Introduction
- Data and methodology
- The TLS bias estimates
- Comparison to OLS regressions
- A TLS bias correction
- Impact of the TLS bias correction on forecast skill
- Summary and conclusions
- Code availability
- Data availability
- Author contributions
- Competing interests
- Disclaimer
- Special issue statement
- Acknowledgements
- Financial support
- Review statement
- References