the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Retrieval of temperature and humidity profiles from groundbased highresolution infrared observations using an adaptive fast iterative algorithm
Wei Huang
Bin Yang
Shuai Hu
Wanying Yang
Zhenfeng Li
Wantong Li
Xiaofan Yang
Various retrieval algorithms have been developed for retrieving temperature and water vapor profiles from Atmospheric Emitted Radiance Interferometer (AERI) observations. The physical retrieval algorithm, named AERI Optimal Estimation (AERIoe), outperforms other retrieval algorithms in many aspects except the retrieval time, which is significantly increased due to the complex radiative transfer process. The calculation of the Jacobian matrix is the most computationally intensive step of the physical retrieval algorithm. Interestingly, an analysis of the change in AERI observations' information content with respect to Jacobians revealed that the AERIoe algorithm's performance presents negligible dependence on these metrics. Thus, the Jacobian matrix could remain unchanged when the variation in the atmospheric state is small in the retrieval process to reduce the most timeconsuming computation. On the basis of the above findings, a fast physical–iterative retrieval algorithm was proposed by adaptively recalculating Jacobians in keeping with the changes in the atmospheric state. Experiments with synthetic observations demonstrate that the proposed method experiences an average reduction in retrieval time by an impressive 59 % compared to the original AERIoe algorithm while achieving maximum rootmeansquare errors of less than 0.95 K and 0.22 log(ppmv) for heights below 3 km for the temperature and water vapor profile, respectively. Further analyses revealed that the fastretrieval algorithm reached an acceptable convergence rate of 98.7 %, marginally lower than AERIoe's 99.9 % convergence rate for the 826 cases used in this study.
 Article
(1643 KB)  Fulltext XML
 BibTeX
 EndNote
Highquality atmospheric profiles are required for many endeavors, including radiative transfer, cloud process research, and assimilation into mesoscale models to improve forecasts (Turner et al., 2000). The accuracy of the initial field provided by observation networks is becoming a key factor restricting the skill of numerical weather prediction (NWP) models (Romine et al., 2013; Li et al., 2016). The existing observation networks are insufficient to meet the needs of convectivescale numerical weather prediction systems, especially in the prediction of convection initiation convective processes (Kain et al., 2013; Wagner et al., 2019; Geerts et al., 2018). As the spatiotemporal resolution is too coarse, radiosonde profiles cannot capture the atmospheric phenomena in detail. Satelliteborne instruments are able to provide wider geographical coverage and higher horizontal resolution than groundbased balloon radiosonde observations. However, satellite retrievals remain insufficient to resolve the structure within the planetary boundary layer (PBL), as the weighting functions of satelliteborne observations peak above the PBL. A promising solution is groundbased thermal infrared spectrometers that measure downwelling spectral infrared radiance, which show good skill at retrieving the temperature and humidity profiles of the PBL.
The commonly used groundbased infrared hyperspectral equipment mainly includes Fourier transform infrared (FTIR) instruments of the Karlsruhe Institute of Technology deployed in the Detection of Atmospheric Composition Change (NDACC) (De Mazière et al., 2018) and the Atmospheric Emitted Radiance Interferometer (AERI) developed by the University of Wisconsin Space Science and Engineering Center (UWSSEC) deployed in the Atmospheric Radiation Measurement (ARM) program (Knuteson et al., 2004). The FTIR instrument observes nearinfrared and midinfrared highresolution solar spectra, which are mainly used to retrieve water vapor (Schneider et al., 2006b, a; Schneider and Hase, 2009), water isotopologs (Schneider et al., 2006b; Barthlott et al., 2017), and various trace gas (Gardiner et al., 2008; Kiel et al., 2016; Zhou et al., 2018; Yin et al., 2020, 2021a, b; Viatte et al., 2014) profiles or total columns. The spectral region of AERI covers the range of 520–3000 cm^{−1} containing a 15 µm absorption band of CO_{2} commonly used for the retrieval of temperature profiles, which makes it more advantageous in detecting thermodynamic profiles (Rowe et al., 2006). Specific retrieval algorithms, capable of being divided into statistical retrieval algorithms and physical retrieval algorithms as per different principles, are required to extract large amounts of information on the retrieved atmospheric profiles from infrared hyperspectral radiance data. Physical retrieval algorithms utilize the radiative transfer simulation and the iterative optimization strategy, which exhibit higher retrieval accuracy compared to the statistical retrieval algorithms (Yang and Min, 2018; Cimini et al., 2010). Two physical retrieval algorithms, named AERI Profiles of Water Vapor and Tempearture (AERIprof) (Smith et al., 1999; Feltz et al., 1998) and AERI Optimal Estimation (AERIoe) (Turner and Löhnert, 2014; Turner and Blumberg, 2019; Turner and Löhnert, 2021), have been successively adopted in AERI equipment to derive thermodynamic profiles. Based on the “onion peeling” technique, AERIprof adjusts the firstguess profile from bottom to top with the iterative algorithm to minimize the difference between the calculated and observed radiation. Given that AERIprof only needs to calculate the diagonal elements in the Jacobian matrix, its retrieval speed is faster than that of the optimalestimation method (Rodgers, 2000).
However, the AERIprof algorithm has several significant drawbacks, such as its high dependence on the firstguess profile and an inability to provide uncertainty estimates for retrieval results (Turner and Löhnert, 2014; Blumberg et al., 2017, 2015). The limitations of AERIprof could be overcome by the AERIoe optimalestimation retrieval algorithm, which was designed as an alternative to the previous physical algorithm. One of the important improvements remains to reduce the dependence on the firstguess profile by introducing regularization parameters into the AERIoe algorithm to balance the observation and the prior information. To achieve good stability and accuracy, the regularization parameters in the AERIoe algorithm are defined as a monotonic sequence that contains at least seven values, leading to a minimum of seven iterations for convergence because the regularization parameters are iterationdependent (Bakushinskii, 1992; Xu et al., 2016). Additionally, the Jacobian matrix is recalculated for each iteration due to the dependence on the current state vector, which significantly increases the amount of calculation and results in a high retrieval time.
The aim of this study is to accelerate the retrieval speed of AERIoe. In Sect. 3, an investigation into the information content of AERI observations concerning Jacobians revealed that the performance of the AERIoe algorithm exhibits marginal dependence on these matrices. Motivated by this finding, a fast physical–iterative retrieval method, henceforth called Fast AERIoe, is proposed to address the limitation of the long retrieval time of AERIoe by adaptively recalculating Jacobians without manual intervention. By this means, the retrieval speed of AERIoe can be improved due to the reduction in the computation amount. In this study, only temperature and water vapor profiles are retrieved from Fast AERIoe, and cases of cloudy situations will be addressed in future work. Finally, the retrieval time, convergence characteristics, and accuracy of the new algorithm are presented in Sect. 4.
The data used in the study are from the ARM program supported by the U.S. Department of Energy, which aims to quantitatively study the atmospheric radiation budget and develop and verify the parameterization scheme of the numerical model (Revercomb et al., 2003; Ellingson et al., 2016). This program mainly focuses on the longterm observation of atmospheric states and radiative fluxes, providing information to researchers around the world to inform and validate predictive models of climate and weather. We use data collected at the Southern Great Plains (SGP) site, which is located at 36.61^{∘} N and 149.88^{∘} W near Lamont, Oklahoma, USA (Sisterson et al., 2016). These data mainly include groundbased infrared spectra obtained by AERI and radiosonde profiles, with the former used to retrieve the temperature and water vapor profiles and the latter mainly used to evaluate the accuracy of the retrieval results.
2.1 AERI
AERI can continuously receive downwelling atmospheric infrared radiance from 3.3 to 19.2 µm (520–3000 cm^{−1}) with a spectral resolution better than 1 cm^{−1}, among which the infrared radiation of the 520–1800 cm^{−1} band is obtained by the mercury cadmium telluride (HgCdTe) detector, and the 1800–3020 cm^{−1} band is obtained by the indium antimonide (InSb) detector. The AERI frontend optics include a scene mirror and two calibrated blackbodies, one of which changes with the temperature of the surrounding environment, while the other is maintained at a fixed temperature (60 ^{∘}C). AERI achieved a calibration accuracy of better than 1 % by viewing two highprecision blackbodies and a nonlinearity correction for the detectors (Knuteson et al., 2004). The temporal resolution of the AERI standard remains approximately 8 min, including a 3 min skydwell period and the subsequent observation of the two blackbodies.
AERI has many observation channels, including not only temperature and humidity profile information, but also trace gas information such as ozone, methane, and random error. To avoid contributions to the downwelling radiance by other gases, appropriate channels that are primarily sensitive to the retrieved profiles must be selected. The spectral regions used in the study are consistent with AERIoe v1.2, which used only the 538–588 cm^{−1} band for water vapor profiling to exclude scattering effects from clouds (Turner and Blumberg, 2019). The specific wavenumbers used to perform the retrieval are shown in Table 1, among which the spectral region used for temperature retrieval includes 167 channels and the water vapor includes 104 channels.
2.2 Radiosonde data
Radiosondes have been used for decades to provide humidity, temperature, and wind profiles throughout the troposphere and are considered to be the most accurate means of detecting the vertical structure of the atmosphere. They are often used to evaluate the accuracy of other groundbased profilers. Located 150 m to the north of the AERI equipment, the closer radiosonde release point can ensure the comparability of radiosonde profiles and AERI retrieval results (Wakefield et al., 2021). The radiosonde data at the SGP site have been obtained by Vaisala RS92 since 2002 (Turner et al., 2016), including temperature, humidity, pressure, wind direction, and wind speed. It is regularly launched four times a day at 05:30, 11:30, 17:30, and 23:30 UTC.
We collected radiosonde profiles and AERI radiation data from 2012, screening 826 groups of qualified data samples through quality control, spatiotemporal matching, and clearsky recognition. A synthetic dataset of simulated AERI spectra corresponding to 826 sets of radiosonde profiles was produced using the linebyline radiative transfer model (LBLRTM), with parameter settings consistent with Sect. 3.1.
3.1 Retrieval configuration
The AERIoe algorithm, based upon the optimalestimation method, iteratively searches for the atmospheric state that most conforms to the observation and prior constraints.
Here, X is the profile of the atmospheric state to be retrieved, X_{a} is the prior profile of the atmosphere, S_{a} is the a priori covariance matrix, Y^{m} is the observed radiance vector, F(X) is the computed radiance for X, S_{e} is the observation error covariance matrix, and n denotes the iteration number. The superscripts T and −1 indicate the matrix transpose and inverse, respectively.
To improve the stability of the retrieval algorithm, the regularization parameter γ was introduced in Eq. (1), which is set as fixed values from large to small ([1000, 300, 100, 30, 10, 3, 10, 1]). As γ decreases with iterations, more observation information is introduced to improve the retrieval accuracy. Iterations are continued until γ decreases to 1 and the following convergence criterion is satisfied.
N represents the dimension of the retrieved atmospheric state vector.
Note that K depends on X used for estimating the Jacobian, which means that K must be recomputed for every iteration step. The updating of the Jacobians in the above retrieval process requires the calculation of the optical thickness or radiance (intensity) with respect to the elements of X at each height, which might be computationally expensive depending on the lengths of X and Y^{m} (Maahn et al., 2020). Owing to the constraints of γ, the decrease in the difference between the simulated and observed radiation is not very much in the adjustment of individual iterations to the retrieval profile. At this time, the change in the Jacobian calculated as per the iteration profile is negligible. Backed by the above analysis, a fast iterative algorithm called Fast AERIoe is proposed on the basis of the AERIoe algorithm. The flowchart of Fast AERIoe is shown in Fig. 1. Most of the configurations are consistent with AERIoe described by Turner and Löhnert (2014), except for some modifications highlighted as follows.
 a.
Atmospheric configurations. The height grid of X is consistent with AERIoe, but the maximum retrieval height is limited to 3 km. This is done because the variations in K above 3 km are negligible because most of the information in the AERI spectrum lies in the lowest 2 km of the atmosphere for temperature and water vapor profiles (Turner and Löhnert, 2014). The cloud properties were excluded from the state vector X, which is beyond the scope of this study. The corresponding prior profile X_{a} and the prior covariance matrices represented by S_{a} are modified to be consistent with X.
 b.
Observational vector Y. Spectral regions that are sensitive to cloud properties were removed from the observational vector Y to be consistent with the state vector X. Furthermore, additional observations, including surface temperature and water vapor, were incorporated into the observation vector; details are described by Turner and Blumberg (2019).
 c.
Jacobian matrix K. K is derived from LBLRTM, which is the same as AERIoe except for the version (12.8 instead of 12.1). Another modification is that K is not recomputed to improve the retrieval speed of the algorithm when the variations in the iterative profile X_{n} are small.
3.2 Adaptive recalculation of the Jacobian
The method to reduce the calculation of K is the key to speeding up the AERIoe algorithm. The Jacobians are dependent on the atmospheric state, which means that K must be recalculated for every iteration step. The question arises as to under what circumstances K does not need to be recalculated. Therefore, the dependence of the retrieval capability on Jacobians must be analyzed, and indicators that reflect the changes in Jacobians should be determined to determine whether K is recalculated or not.
3.2.1 Quantification of algorithm retrieval capability
The retrieval accuracy of the atmospheric profile depends on the amount of atmospheric information in the hyperspectral data. Shannon information content (SIC) and degrees of freedom for signal (DFS), as important indicators to describe the effective information contained in hyperspectral data (Rodgers, 1998), can quantitatively describe the detector's retrieval ability for a specific atmospheric profile. SIC represents the reduction in uncertainty in the retrieved profiles contributed by the observation, with the calculation formula shown in Eq. (3). DFS provides the number of independent pieces of information contained in the measured radiation, with the calculation formula shown in Eq. (4).
Here, $\widehat{\mathbf{S}}$ is the posterior error covariance matrix, also known as the analysis error covariance matrix. Its diagonal element is the standard deviation of the retrieval error, with the calculation formula $\widehat{\mathbf{S}}$ as follows:
of which
3.2.2 Analysis of the dependence of AERIoe on Jacobians
It can be seen from Eqs. (3) and (4) that SIC and DFS are determined by S_{e}, S_{a}, K, and γ. However, S_{a} and S_{e} remain unchanged during retrieval, which makes SIC and DFS change with iteration due to variations in γ and K. As γ drops to 1 at the final iteration, the values of SIC and DFS are only dependent on K. Owing to the difficulty in quantifying the change in the twodimensional Jacobian caused by the iteration profiles, a monitoring index, henceforth called K_Index, is designed and used to characterize the change in the profiles at various iterations. The calculation of K_Index comes from the convergence criterion convergence_index, which contains not only the difference between the iteration profiles, but also the posterior dominated by the Jacobian. The introduced K_Index should reflect the changes in the temperature and humidity profiles, which means that the influence of the Jacobian should be excluded. Then, the convergence_index was degenerated into the K_Index as follows.
The values of K_Index in Fig. 2, which cover most of the K_Index during the AERIoe retrieval process (ranging from 0 to 260; see Fig. 3), were obtained by multiplying the prior profile by different scale factors. The atmospheredependent K values were computed by the LBLRTM with the prior profiles above, and SIC and DFS were calculated using Eqs. (3) and (4), respectively, with different Jacobians. Both SIC and DFS change slowly with K_Index as shown in Fig. 2, with the variation of SIC within 13 % (from 13.9 to 16.1) and that of DFS within 4 % (from 3.7 to 3.9) for temperature and 13 % (from 1.4 to 1.7) for water vapor, which demonstrates that SIC and DFS remain almost unchanged on the condition that the value of K_Index is small. This provides an effective means of improving the retrieval speed of AERIoe by recalculating K selectively when X is not changing much or K_Index is small. This could be achieved by comparing the value of K_Index with its threshold at each iteration to determine whether K is recalculated or not.
3.2.3 Determination of the K_Index threshold
The selection of the threshold for K_Index is very important for the Fast AERIoe algorithm. If the threshold remains too large, too many Jacobians will stop updating, resulting in a decline in retrieval accuracy or even nonconvergence of the retrieval process; when the threshold value remains too small, most Jacobians need to be recalculated, which cannot effectively shorten the retrieval time.
Figure 3 shows the histogram of the K_Index distribution for each iteration in the retrieval process, with the K_Index values at each iteration calculated using the clearsky data for 2012. Since the climatological mean profile was used as the first guess, which has a large deviation from the real atmospheric state, a larger value of K_Index was demonstrated in the first step of the retrieval. The K_Index value decreases significantly from the second iteration (see Fig. 3a), indicating that the adjustment of the iterative profile remains very small and that the retrieval process tends to be stable relative to the first iteration. As the retrieval proceeds, the iteration profile gradually approaches the truth, and the K_Index box gradually shortens to below 0.5 (see Fig. 3b). Using this value as the threshold for K_Index, most of the Jacobian after the second iteration does not need to be recalculated, and the retrieval time could be effectively reduced. However, the K_Index in iteration 7 shows larger outliers, indicating that the instability of the retrieval algorithm increases when the γ factor decreases to 1. To reduce the impact of the Jacobian on the convergence of the algorithm, the threshold for the K_Index after iteration 6 is set to 0.1 according to Fig. 3c, of which the K_Index box at iteration 7 is within 0.1. It should be noted that the threshold of K_Index used in the Fast AERIoe algorithm is dependent on the datasets used in the retrieval. They are presented “as is” and are not intended to be directly applied by the reader. We encourage readers to develop their own indicator to reduce the recalculation of Jacobians based on the atmospheric profiles they intend to retrieve.
The simulated AERI radiation is used for retrieval to better analyze the performance of Fast AERIoe and to eliminate the interference of other factors. An advantage of using synthetic observations is that the true atmospheric state is known, which can be used to evaluate the retrieval accuracy. Second, the errors caused by parameters in the forward model, such as the deviation of trace gas content, the strength and temperature dependence of the water vapor continuum absorption, and the halfwidths of absorption lines, could be eliminated (Maahn et al., 2020). Third, we can control the noise level in the synthetic measurement.
4.1 Retrieval process
Examples of the Fast AERIoe retrieval using the simulated spectra at various iterations are shown in Fig. 4. These profiles represent the typical performance of each retrieval configuration at the SGP site. The entire retrieval process took 3.59 min with seven iterations, in which only Jacobians of the first and second iterations were updated. The retrieved profiles converged quickly below 1 km, with little adjustment of the temperature and humidity profiles following the first iteration. For the upper atmosphere above 1.5 km, the temperature and humidity profiles have a relatively large adjustment and gradually approach the radiosonde profile with the iterations. This feature of the Fast AERIoe retrieval process is very similar to AERIoe, which is determined by the information content of the AERI spectra. The information content is concentrated near the surface, which leads to a more rapid convergence in the lowest portions of the profile. The information content of the upper layer is lower, and as such it is necessary to reduce the value of γ to introduce more observation information so that the retrieved profiles are refined to approach the radiosonde profile as the iterations are continued.
One advantage of the optimalestimation method remains that the posterior error covariance matrix of the solution $\widehat{\mathbf{S}}$ can be obtained to estimate the uncertainty of the retrieval results of each sample. The temperature and water vapor profiles show a strong correlation for the correlation coefficient matrix of S_{a} (see Fig. 5a and c), especially the temperature profile, which has a high correlation coefficient above 0.6 between any two layers because of the relatively stable vertical gradient of the temperature profile. The nondiagonal elements below 1 km in the correlation coefficient matrix of the $\widehat{\mathbf{S}}$ results from Fast AERIoe show a much lower correlation than those of S_{a} (see Fig. 5b and d), which means that the retrieved profiles in the lower atmosphere are dominated by AERI observations. However, with increasing height, the correlation of the area near the diagonal increases significantly. Therefore, the retrieval algorithm will rely more on the constraint of prior information at the upper layer of the PBL. The 1σ uncertainty lines, which are the square root of the diagonal of the covariance matrices for the prior (blueshaded area) and the posterior (black horizontal line) in Fig. 4, demonstrate that the retrieved profile has a much smaller uncertainty than the prior. Therefore, the Fast AERIoe algorithm can effectively reduce the impact of uncertainties in the firstguess profile on the retrieval results. As the height increases, the black horizontal line segment becomes longer for either the temperature profile or the water vapor profile, indicating that the uncertainty in the retrieved profiles increases in the upper PBL.
4.2 Performance
4.2.1 Retrieval time
Both the AERIoe and Fast AERIoe algorithms were used to retrieve 826 groups of simulated AERI radiation data at SGP stations in 2012 to evaluate the retrieval performance of Fast AERIoe. The codes for both of the retrieval algorithms are written in the MATLAB language and run on a Lenovo Aircross 510P computer, of which the CPU is Intel Core i77700 and the operating system is Ubuntu 14.04. To analyze the code timing of the retrieval algorithm, the code was divided into the following sections: preparation, iteration 1, iteration 2, and iteration 3, etc., until the final iteration. The preparation section mainly consists of atmosphere construction, observation vector construction, and precalculated variable importation. The iteration sections include the recalculation of K and F(X) and the inversion using Eq. (1). Note that iteration 1 does not need to calculate K and F(X) because the prior profile X_{a} is fixed (mean value of the atmosphere), and the K and F(X) associated with it are precalculated. The time consumed by each section was analyzed for both AERIoe and Fast AERIoe, and the results for an arbitrarily selected case are provided in Table 2. The recalculation of F(X) and K consumed an immense amount of time in the retrieval process of AERIoe, and the latter is the most timeconsuming section. Therefore, by reducing the recalculation of K, the retrieval time of Fast AERIoe is greatly reduced compared to AERIoe.
The average retrieval time of Fast AERIoe for the 826 cases used in the study is 3.7 min, which is more than 50 % shorter than that of AERIoe, with an average retrieval time of 9.0 min, which is beyond the temporal resolution (approximately 8 min) of AERI observations. All of the AERIoe samples consumed more than 8 min, while only 10 cases exceeded the temporal resolution of AERI for the Fast AERIoe algorithm. Note that the retrieval time is dependent on the computing platform and the method used to compute Jacobians and is not intended to be directly applied by the reader.
In addition to the recalculation of K, the retrieval time is also affected by the total iteration steps. Therefore, statistics of the average retrieval time difference (Time_diff for short) caused by the K recalculation step difference (K_diff for short) and the average total iteration step difference (Total_diff for short) are provided in this study. The retrieval samples are divided into seven categories (shown in Table 3), in keeping with K_diff between AERIoe and Fast AERIoe. On this basis, Time_diff and Total_diff between the two retrieval algorithms for various samples are calculated. As shown in Fig. 6, with an increase in K_diff, Time_diff also increased gradually, showing a strong positive correlation. Compared with K_diff, the value of Total_diff is very small, and its impact on the retrieval time is also minimal, only having slight negative and positive effects on the Time_diff of Class3 and Class6. Therefore, the improvement in the retrieval speed of Fast AERIoe is mainly due to the recalculation of Jacobians.
4.2.2 Convergence characteristics
A total of 825 samples of the 826 datasets using the AERIoe algorithm achieved convergence, with the convergence rate reaching 99.9 %. The Fast AERIoe algorithm has 815 groups of samples to achieve convergence, with the convergence rate reaching 98.7 %, which is lower than that of AERIoe. Among the 11 sets of retrieval samples that did not achieve convergence, the K_Index of most of them did not change much after the γ was dropped to 1, indicating that the subsequent iterations had little effect on the adjustment of the profiles, so the iterative profile corresponding to the minimum convergence_index could be taken as the retrieval results instead of criterion (2). Figure 7a shows the comparison between the retrieved profiles from AERIoe using criterion (2) and Fast AERIoe using the new convergence criteria with 11 sets of nonconverged samples. The temperature profiles obtained by the two algorithms are virtually identical, with an R^{2} of 0.99. For the water vapor mixing ratio (WVMR), the introduction of the new convergence criteria reduces the value of R^{2}, but this still reaches 0.84, indicating that the two datasets still have a strong correlation. The above results indicate that the method of using the minimum convergence_index to obtain the retrieval profiles is a reasonable and feasible method, as the Fast AERIoe algorithm cannot achieve convergence.
4.2.3 Accuracy
Traditional methods used to evaluate the accuracy of retrieved profiles against radiosondes compute the BIAS and rootmeansquare error (RMSE), with the calculation formula as follows:
where i and j represent the serial numbers of vertical stratification and samples, respectively, with M being the number of samples. X_{retrieval} is defined as retrieved profiles, and ${\mathit{X}}_{\mathrm{sonde}}^{\mathrm{smooth}}$ is radiosonde observations that are smoothed with the averaging kernel A by the following multiplication to reduce the vertical representativeness errors:
The BIAS and RMSE of AERIoe and Fast AERIoe are calculated for 826 sets of samples using the above equations within the altitude range of 0–3 km, and the results are shown in Fig. 8. The temperature profile below 1.0 km and the water vapor profile below 1.5 km have obvious positive deviations, with the maximum deviations reaching 1.0 K and 0.2 log(ppmv), respectively. However, the BIAS and RMSE at the bottom are significantly reduced due to the constraint of the surface observations, indicating that the introduction of surface meteorological observation data into the observation vector has an obvious positive effect. The temperature profiles retrieved by Fast AERIoe show a negative deviation of 0.05 K between 1.0 and 1.5 km and a maximum increase in RMSE within 0.08 K above 1.0 km when compared with AERIoe. For the water vapor profile, the BIAS and RMSE profiles of Fast AERIoe are in good agreement with AERIoe, except for a maximum increase in BIAS within 0.03 log(ppmv) below 1.0 km. When considering the magnitude of the temperature (roughly on the order of 300 K) and water vapor (roughly on the order of 5–10 log(ppmv)) profiles, the differences between the retrieved profiles are negligible, indicating that the retrieval results of Fast AERIoe are comparable to those of AERIoe.
The comparison of the profiles retrieved by the two algorithms can be demonstrated more clearly by the modified Taylor plots (Turner and Löhnert, 2014), which are used to evaluate how well each retrieved profile can capture the vertical shapes of its true profile, as BIAS and RMSE can only describe the average accuracy of the whole dataset at each height. These Taylor diagrams show Pearson's correlation coefficient between two datasets on the y axis and the ratio of the standard deviation on the x axis. Each retrieval–sonde pair is used to derive the correlation coefficient (r) from Eq. (11) and the ratio of the standard deviations from Eq. (12); both are used by Turner and Löhnert (2014).
Within the equations, s(z) and a(z) are defined as the radiosonde observations and retrieved profiles between 0 and 3 km, and ($\stackrel{\mathrm{\u203e}}{s},\stackrel{\mathrm{\u203e}}{a}$t) and (σ_{s},σ_{a}) are the mean values and standard deviations in the same height range.
Retrievals that have a correlation coefficient of 1 and a standard deviation ratio (SDR) of 1 mean that the two datasets match perfectly. Figure 9a and b show these plots for the clearsky AERIoe and Fast AERIoe retrievals. For the temperature retrievals, both Fast AERIoe and AERIoe perform well, with 90 % of the correlation coefficients above 0.9 and the intersection of the arms close to 1. Figure 9b shows that retrieving the water vapor structure is much more difficult with both algorithms; the spreads in the correlation coefficient and SDR are much larger for water vapor than for temperature. Most of the blue and red symbols “×” in Fig. 9, which indicate the scores for the individual profiles of the two algorithms, are close to each other for both the temperature and water vapor profiles. Therefore, the modified Taylor plots also confirm the conclusion that the retrieval results of the AERIoe and Fast AERIoe algorithms are comparable.
4.3 Real observations
Since the clouds overhead have a significant influence on the infrared spectra, the primary problem is how to screen clearsky samples when using the measured AERI data to retrieve the temperature and humidity profiles. The contribution of clouds to infrared radiation not only interferes with the inversion of temperature and humidity profiles, but also provides technical means for obtaining cloud macrophysical properties (Liu et al., 2022). Figure 10 shows the AERIobserved spectrum under cloudy and clearsky conditions. The AERI observations under the two conditions remain highly different, indicating that the AERIobserved spectrum can be adopted directly to determine whether clouds or clear skies are present. To establish an accurate cloud recognition model, we adopted the cloud fraction data obtained from the allsky image at the same site as the label for training, where the sample with a cloud fraction of less than 30 % is marked as 0, indicating clear sky, while the sample with a cloud fraction greater than 30 % is marked as 1, indicating that there is cloud overhead. Using the abovementioned method, the cloud fraction of the allsky image from March to May 2010 was labeled and temporally matched with the AERIobserved radiance to form a training dataset, based on which a cloud recognition model was established by training the backpropagation (BP) neural network, with the final crossvalidation accuracy reaching 94.3 %. Compared with the recognition method by radiosonde, the BP cloud recognition model greatly improves the discrimination accuracy without requiring additional detection equipment. The BP cloud recognition model was applied to the 178 groups of AERI observations collected on 21 October 2012, with 168 groups of clearsky samples screened in total.
Benefitting from good retrieval accuracy and high temporal resolution, AERI instruments can be used to monitor thermodynamic temporal structures that may not be resolved by infrequent radiosonde launches. Figure 11 shows the time–height cross sections of the temperature and WVMR profiles derived from the Fast AERIoe retrievals. Figure 11 shows that AERI resolved the temperature inversion prior to approximately 15:00 UTC, and the height of the inversion layer gradually increased over time. After 15:00 UTC, the temperature near the surface increases significantly, accompanied by the disappearance of the inversion layer. From the comparisons with radiosonde profiles shown in Fig. 12, the retrieval results of Fast AERIoe are well matched with radiosonde profiles, especially the temperature profiles, which demonstrates the ability of the algorithm to resolve the inversion layer.
The AERIoe algorithm retrieves atmospheric temperature and humidity profiles using the optimalestimation framework, which can make full use of information in the infrared spectrum and give the uncertainty of each retrieval result. AERIoe reduces the dependence on the firstguess profile by introducing regularization parameters, but it also requires more iterative steps, which increases the calculation amount and retrieval time of the algorithm. In this paper, a fastretrieval method called Fast AERIoe was established by adaptively recalculating the Jacobians in the retrieval process of AERIoe. Based on the statistical comparison of the two methods (AERIoe and Fast AERIoe) with radiosonde observations, the retrieval performance of Fast AERIoe is summarized as follows.

The retrieval speed of Fast AERIoe is significantly improved compared with AERIoe while keeping the parameters of the computing platform unchanged, with the average retrieval time reduced by more than 50 %. The temperature and water vapor profiles derived from Fast AERIoe are almost unchanged compared with AERIoe, illustrating that the retrieval results of Fast AERIoe are comparable to those of AERIoe. The deep retrieval architecture is able to extract highly nonlinear features from the AERI observations and shows a better inversion capability than the existing statistical methods (Yang et al., 2023), albeit with the lack of the radiative transfer process. Therefore, the combination of AERIoe and deep learning can improve the accuracy of AERI for retrieving temperature and humidity profiles.

For the convergence characteristics, 825 out of 826 samples adopting AERIoe meet the convergence criterion, while the sample adopting Fast AERIoe converges over 98 % of the time. The method of recalculating Jacobians in Fast AERIoe slightly reduces the convergence of the retrieval algorithm. Despite this, the Fast AERIoe algorithm has demonstrated the ability to retrieve reliable temperature and water vapor profiles more quickly, which is fast enough for realtime processing.

When Fast AERIoe is adopted to measure the AERI spectrum, a cloud recognition model without additional detection equipment is established based on the BP neural network algorithm to remove cloudysky cases. Compared with the commonly used cloud recognition method by radiosonde observations, the BP cloud recognition model has greatly improved the discrimination accuracy. It should be noted that the hyperspectra under the two weather conditions of clear sky with high humidity and few clouds are relatively close, while the above two weather conditions are far from further distinguished when building the BP cloud recognition model, which may reduce the discriminative accuracy of the model. Furthermore, the cloud base height (CBH) also contributes to AERI radiation (Ye et al., 2022), and adding CBH to the recognition model helps to improve the accuracy of the model.
A single instrument always has some defects in vertical coverage, vertical resolution, temporal resolution, and accuracy in obtaining the vertical distribution of atmospheric continents (BarreraVerdejo et al., 2016). The combination of multiple remotesensing devices in an optimal retrieval algorithm can overcome the shortcomings of a single device, making full use of each measurement to achieve the purpose of enhancing their benefits. However, the increase in observation equipment will inevitably lead to more complex calculations of the forward model and Jacobian, which will lead to a significant increase in the amount of calculation and retrieval time. Therefore, it is particularly necessary to carry out research on fast retrieval in the case of joint retrieval. Apart from the influence of the Jacobian on the retrieval time, so is the number of iterations required by the retrieval algorithm, which is dominated by the regularization parameter. Future work will focus on the application of Fast AERIoe in the combination of different observations and the selection of regularization parameters to permit the retrieval algorithm to converge more efficiently.
The data used in the paper (including AERI and radiosonde) are available from the ARM Data Archive (https://adc.arm.gov/discovery/#/, last access: 19 January 2022). The code for recalculating Jacobians is not publicly available at this time but may be obtained from the authors upon reasonable request.
LL, BY, and WH determined the main goal of this study. WH developed the approach, analyzed the data, and visualized the results of the experiments. LL prepared the paper with contributions from all the coauthors. SH acquired funding and edited the paper. WL and WY prepared the various datasets. ZL and XY provided guidance on algorithmic procedures. All the coauthors reviewed the paper.
The contact author has declared that none of the authors has any competing interests.
Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The authors thank the U.S. Department of Energy (DOE) Atmospheric Radiation Measurement (ARM) program for providing meteorological data online for free. The authors are deeply grateful to Atmospheric and Environmental Research (AER) Inc. for providing the LBLRTM codes online for free.
This research has been supported by the National Natural Science Foundation of China (grant nos. 42175154 and 62105367) and the Natural Science Foundation of Hunan Province of China (grant no. 2020JJ4662).
This paper was edited by Jian Xu and reviewed by two anonymous referees.
Bakushinskii, A. B.: The problem of the convergence of the iteratively regularized Gauss–Newton method, Comp. Math. Math. Phys.+, 32, 1503–1509, 1992.
BarreraVerdejo, M., Crewell, S., Löhnert, U., Orlandi, E., and Di Girolamo, P.: Groundbased lidar and microwave radiometry synergy for high vertical resolution absolute humidity profiling, Atmos. Meas. Tech., 9, 4013–4028, https://doi.org/10.5194/amt940132016, 2016.
Barthlott, S., Schneider, M., Hase, F., Blumenstock, T., Kiel, M., Dubravica, D., García, O. E., Sepúlveda, E., Mengistu Tsidu, G., Takele Kenea, S., Grutter, M., PlazaMedina, E. F., Stremme, W., Strong, K., Weaver, D., Palm, M., Warneke, T., Notholt, J., Mahieu, E., Servais, C., Jones, N., Griffith, D. W. T., Smale, D., and Robinson, J.: Tropospheric water vapour isotopologue data (H_{2}^{16}O, H_{2}^{18}O, and HD^{16}O) as obtained from NDACC/FTIR solar absorption spectra, Earth Syst. Sci. Data, 9, 15–29, https://doi.org/10.5194/essd9152017, 2017.
Blumberg, W., Wagner, T., Turner, D., and Correia Jr., J.: Quantifying the accuracy and uncertainty of diurnal thermodynamic profiles and convection indices derived from the Atmospheric Emitted Radiance Interferometer, J. Appl. Meteorol. Clim., 56, 2747–2766, https://doi.org/10.1175/JAMCD170036.1, 2017.
Blumberg, W. G., Turner, D. D., Löhnert, U., and Castleberry, S.: GroundBased Temperature and Humidity Profiling Using Spectral Infrared and Microwave Observations. Part II: Actual Retrieval Performance in ClearSky and Cloudy Conditions, J. Appl. Meteorol. Clim., 54, 2305–2319, https://doi.org/10.1175/jamcd150005.1, 2015.
Cimini, D., Westwater, E. R., and Gasiewski, A. J.: Temperature and Humidity Profiling in the Arctic Using GroundBased MillimeterWave Radiometry and 1DVAR, IEEE T. Geosci. Remote, 48, 1381–1388, https://doi.org/10.1109/TGRS.2009.2030500, 2010.
De Mazière, M., Thompson, A. M., Kurylo, M. J., Wild, J. D., Bernhard, G., Blumenstock, T., Braathen, G. O., Hannigan, J. W., Lambert, J.C., Leblanc, T., McGee, T. J., Nedoluha, G., Petropavlovskikh, I., Seckmeyer, G., Simon, P. C., Steinbrecht, W., and Strahan, S. E.: The Network for the Detection of Atmospheric Composition Change (NDACC): history, status and perspectives, Atmos. Chem. Phys., 18, 4935–4964, https://doi.org/10.5194/acp1849352018, 2018.
Ellingson, R. G., Cess, R. D., and Potter, G. L.: The atmospheric radiation measurement program: Prelude, Meteor. Mon., 57, 1.1–1.9, https://doi.org/10.1175/AMSMONOGRAPHSD150029.1, 2016.
Feltz, W. F., Smith, W. L., Knuteson, R. O., Revercomb, H. E., Woolf, H. M., and Howell, H. B.: Meteorological applications of temperature and water vapor retrievals from the groundbased Atmospheric Emitted Radiance Interferometer (AERI), J. Appl. Meteorol., 37, 857–875, https://doi.org/10.1175/15200450(1998)037<0857:MAOTAW>2.0.CO;2, 1998.
Gardiner, T., Forbes, A., de Mazière, M., Vigouroux, C., Mahieu, E., Demoulin, P., Velazco, V., Notholt, J., Blumenstock, T., Hase, F., Kramer, I., Sussmann, R., Stremme, W., Mellqvist, J., Strandberg, A., Ellingsen, K., and Gauss, M.: Trend analysis of greenhouse gases over Europe measured by a network of groundbased remote FTIR instruments, Atmos. Chem. Phys., 8, 6719–6727, https://doi.org/10.5194/acp867192008, 2008.
Geerts, B., Raymond, D. J., Grubišić, V., Davis, C. A., Barth, M. C., Detwiler, A., Klein, P. M., Lee, W.C., Markowski, P. M., Mullendore, G. L., and Moore, J. A.: Recommendations for In Situ and Remote Sensing Capabilities in Atmospheric Convection and Turbulence, B. Am. Meteorol. Soc., 99, 2463–2470, https://doi.org/10.1175/bamsd170310.1, 2018.
Kain, J. S., Coniglio, M. C., Correia, J., Clark, A. J., Marsh, P. T., Ziegler, C. L., Lakshmanan, V., Miller, S. D., Dembek, S. R., and Weiss, S. J.: A feasibility study for probabilistic convection initiation forecasts based on explicit numerical guidance, B. Am. Meteorol. Soc., 94, 1213–1225, https://doi.org/10.1175/BAMSD1100264.1, 2013.
Kiel, M., Wunch, D., Wennberg, P. O., Toon, G. C., Hase, F., and Blumenstock, T.: Improved retrieval of gas abundances from nearinfrared solar FTIR spectra measured at the Karlsruhe TCCON station, Atmos. Meas. Tech., 9, 669–682, https://doi.org/10.5194/amt96692016, 2016.
Knuteson, R., Revercomb, H., Best, F., Ciganovich, N., Dedecker, R., Dirkx, T., Ellington, S., Feltz, W., Garcia, R., and Howell, H.: Atmospheric emitted radiance interferometer. Part I: Instrument design, J. Atmos. Ocean. Tech., 21, 1763–1776, https://doi.org/10.1175/JTECH1662.1, 2004.
Li, J., Wang, P., Han, H., Li, J., and Zheng, J.: On the assimilation of satellite sounder data in cloudy skies in numerical weather prediction models, J. Meteorol. Res., 30, 169–182, https://doi.org/10.1007/s1335101651142, 2016.
Liu, L., Ye, J., Li, S., Hu, S., and Wang, Q.: A novel machine learning algorithm for cloud detection using aeri measurement data, Remote Sens., 14, 2589, https://doi.org/10.3390/rs14112589, 2022.
Maahn, M., Turner, D. D., Löhnert, U., Posselt, D. J., Ebell, K., Mace, G. G., and Comstock, J. M.: Optimal Estimation Retrievals and Their Uncertainties: What Every Atmospheric Scientist Should Know, B. Am. Meteorol. Soc., 101, E1512–E1523, https://doi.org/10.1175/bamsd190027.1, 2020.
Revercomb, H. E., Turner, D. D., Tobin, D. C., Knuteson, R. O., Feltz, W. F., Barnard, J., Bösenberg, J., Clough, S., Cook, D., Ferrare, R., Goldsmith, J., Gutman, S., Halthore, R., Lesht, B., Liljegren, J., Linné, H., Michalsky, J., Morris, V., Porch, W., Richardson, S., Schmid, B., Splitt, M., Van Hove, T., Westwater, E., and Whiteman, D.: The Arm Program's Water Vapor Intensive Observation Periods: Overview, Initial Accomplishments, and Future Challenges, B. Am. Meteorol. Soc., 84, 217–236, https://doi.org/10.1175/bams842217, 2003.
Rodgers, C. D.: Information content and optimisation of high spectral resolution remote measurements, Adv. Space Res., 21, 361–367, https://doi.org/10.1016/S02731177(97)009150, 1998.
Rodgers, C. D.: Inverse methods for atmospheric sounding: theory and practice, World scientific, 119–120, ISBN 9814498688, 2000.
Romine, G. S., Schwartz, C. S., Snyder, C., Anderson, J. L., and Weisman, M. L.: Model bias in a continuously cycled assimilation system and its influence on convectionpermitting forecasts, Mon. Weather Rev., 141, 1263–1284, https://doi.org/10.1175/MWRD1200112.1, 2013.
Rowe, P. M., Walden, V. P., and Warren, S. G.: Measurements of the foreignbroadened continuum of water vapor in the 6.3 µm band at 30 ^{∘}C, Appl. Optics, 45, 4366–4382, https://doi.org/10.1364/AO.45.004366, 2006.
Schneider, M. and Hase, F.: Groundbased FTIR water vapour profile analyses, Atmos. Meas. Tech., 2, 609–619, https://doi.org/10.5194/amt26092009, 2009.
Schneider, M., Hase, F., and Blumenstock, T.: Water vapour profiles by groundbased FTIR spectroscopy: study for an optimised retrieval and its validation, Atmos. Chem. Phys., 6, 811–830, https://doi.org/10.5194/acp68112006, 2006a.
Schneider, M., Hase, F., and Blumenstock, T.: Groundbased remote sensing of HDO $/$ H_{2}O ratio profiles: introduction and validation of an innovative retrieval approach, Atmos. Chem. Phys., 6, 4705–4722, https://doi.org/10.5194/acp647052006, 2006b.
Sisterson, D., Peppler, R., Cress, T., Lamb, P., and Turner, D.: The ARM southern great plains (SGP) site, Meteor. Mon., 57, 6.1–6.14, https://doi.org/10.1175/AMSMONOGRAPHSD160004.1, 2016.
Smith, W. L., Feltz, W. F., Knuteson, R. O., Revercomb, H. E., Woolf, H. M., and Howell, H. B.: The retrieval of planetary boundary layer structure using groundbased infrared spectral radiance measurements, J. Atmos. Ocean. Tech., 16, 323–333, https://doi.org/10.1175/15200426(1999)016<0323:TROPBL>2.0.CO;2, 1999.
Turner, D. D. and Blumberg, W. G.: Improvements to the AERIoe Thermodynamic Profile Retrieval Algorithm, IEEE J. Sel. Top. Appl., 12, 1339–1354, https://doi.org/10.1109/JSTARS.2018.2874968, 2019.
Turner, D. D. and Löhnert, U.: Information Content and Uncertainties in Thermodynamic Profiles and Liquid Cloud Properties Retrieved from the GroundBased Atmospheric Emitted Radiance Interferometer (AERI), J. Appl. Meteorol. Clim., 53, 752–771, https://doi.org/10.1175/jamcd130126.1, 2014.
Turner, D. D. and Löhnert, U.: Groundbased temperature and humidity profiling: combining active and passive remote sensors, Atmos. Meas. Tech., 14, 3033–3048, https://doi.org/10.5194/amt1430332021, 2021.
Turner, D. D., Feltz, W. F., and Ferrare, R. A.: Continuous water vapor profiles from operational groundbased active and passive remote sensors, B. Am. Meteorol. Soc., 81, 1301–1318, https://doi.org/10.1175/15200477(2000)081<1301:CWBPFO>2.3.CO;2, 2000.
Turner, D. D., Mlawer, E. J., and Revercomb, H. E.: Water Vapor Observations in the ARM Program, Meteor. Mon., 57, 13.11–13.18, https://doi.org/10.1175/amsmonographsd150025.1, 2016.
Viatte, C., Strong, K., Walker, K. A., and Drummond, J. R.: Five years of CO, HCN, C_{2}H_{6}, C_{2}H_{2}, CH_{3}OH, HCOOH and H_{2}CO total columns measured in the Canadian high Arctic, Atmos. Meas. Tech., 7, 1547–1570, https://doi.org/10.5194/amt715472014, 2014.
Wagner, T. J., Klein, P. M., and Turner, D. D.: A new generation of groundbased mobile platforms for active and passive profiling of the boundary layer, B. Am. Meteorol. Soc., 100, 137–153, https://doi.org/10.1175/BAMSD170165.1, 2019.
Wakefield, R. A., Turner, D. D., and Basara, J. B.: Evaluation of a Land–Atmosphere Coupling Metric Computed from a GroundBased Infrared Interferometer, J. Hydrometeorol., 22, 2073–2087, https://doi.org/10.1175/jhmd200303.1, 2021.
Xu, J., Schreier, F., Doicu, A., and Trautmann, T.: Assessment of Tikhonovtype regularization methods for solving atmospheric inverse problems, J. Quant. Spectrosc. Ra., 184, 274–286, https://doi.org/10.1016/j.jqsrt.2016.08.003, 2016.
Yang, J. and Min, Q.: Retrieval of atmospheric profiles in the New York State Mesonet using onedimensional variational algorithm, J. Geophys. Res.Atmos., 123, 7563–7575, https://doi.org/10.1029/2018JD028272, 2018.
Yang, W., Liu, L., Deng, W., Huang, W., Ye, J., and Hu, S.: Deep Retrieval Architecture of Temperature and Humidity Profiles from GroundBased Infrared Hyperspectral Spectrometer, Remote Sens., 15, 2320, https://doi.org/10.3390/rs15092320, 2023.
Ye, J., Liu, L., Yang, W., and Ren, H.: Using Artificial Neural Networks to Estimate CloudBase Height From AERI Measurement Data, IEEE Geosci. Remote Sens. Lett., 19, 1–5, https://doi.org/10.1109/LGRS.2022.3182473, 2022.
Yin, H., Sun, Y., Liu, C., Lu, X., Smale, D., Blumenstock, T., Nagahama, T., Wang, W., Tian, Y., Hu, Q., Shan, C., Zhang, H., and Liu, J.: Groundbased FTIR observation of hydrogen chloride (HCl) over Hefei, China, and comparisons with GEOSChem model data and other groundbased FTIR stations data, Opt. Express, 28, 8041–8055, https://doi.org/10.1364/OE.384377, 2020.
Yin, H., Sun, Y., Liu, C., Wang, W., Shan, C., and Zha, L.: Remote Sensing of Atmospheric Hydrogen Fluoride (HF) over Hefei, China with GroundBased HighResolution Fourier Transform Infrared (FTIR) Spectrometry, Remote Sens., 13, 791, https://doi.org/10.3390/rs13040791, 2021a.
Yin, H., Sun, Y., Wang, W., Shan, C., Tian, Y., and Liu, C.: Groundbased highresolution remote sensing of sulphur hexafluoride (SF6) over Hefei, China: characterization, optical misalignment, influence, and variability, Opt. Express, 29, 34051–34065, https://doi.org/10.1364/OE.440193, 2021b.
Zhou, M., Langerock, B., Vigouroux, C., Sha, M. K., Ramonet, M., Delmotte, M., Mahieu, E., Bader, W., Hermans, C., Kumps, N., Metzger, J.M., Duflot, V., Wang, Z., Palm, M., and De Mazière, M.: Atmospheric CO and CH_{4} time series and seasonal variations on Reunion Island from groundbased in situ and FTIR (NDACC and TCCON) measurements, Atmos. Chem. Phys., 18, 13881–13901, https://doi.org/10.5194/acp18138812018, 2018.