Methods for validation of random uncertainty estimates and their applications to ozone profiles from limb-viewing satellite instruments

Sofieva, Viktoria F.; Laeng, Alexandra; von Clarmann, Thomas; Stiller, Gabriele; Kiefer, Michael; Tamminen, Johanna; Rozanov, Alexey; Arosio, Carlo; Livesey, Nathaniel; Damadeo, Robert; Sheese, Patrick; Walker, Kaley A.; Degenstein, Doug; Zawada, Daniel; Kramarova, Natalya A.; Keppens, Arno

doi:10.5194/amt-19-1837-2026

Articles | Volume 19, issue 5

https://doi.org/10.5194/amt-19-1837-2026

Articles | Volume 19, issue 5

Research article

16 Mar 2026

Research article |

| 16 Mar 2026

Methods for validation of random uncertainty estimates and their applications to ozone profiles from limb-viewing satellite instruments

Viktoria F. Sofieva, Alexandra Laeng, Thomas von Clarmann, Gabriele Stiller, Michael Kiefer, Johanna Tamminen, Alexey Rozanov, Carlo Arosio, Nathaniel Livesey, Robert Damadeo, Patrick Sheese, Kaley A. Walker, Doug Degenstein, Daniel Zawada, Natalya A. Kramarova, and Arno Keppens

Abstract

For satellite measurements of atmospheric composition, the random uncertainty estimates provided by retrieval algorithms might be imperfect due to various approximations used in the retrievals or the presence of unknown error sources. This paper presents an overview of the methods used for the validation of random uncertainty estimates. All methods discussed in this study are categorized, and assumptions and limitations of each method are discussed. This overview evaluates these methods in application to ozone profile measurements from limb and occultation satellite instruments and provides practical illustrations of random uncertainty validation.

Download & links

Article (PDF, 3344 KB)

Download & links

How to cite.

Sofieva, V. F., Laeng, A., von Clarmann, T., Stiller, G., Kiefer, M., Tamminen, J., Rozanov, A., Arosio, C., Livesey, N., Damadeo, R., Sheese, P., Walker, K. A., Degenstein, D., Zawada, D., Kramarova, N. A., and Keppens, A.: Methods for validation of random uncertainty estimates and their applications to ozone profiles from limb-viewing satellite instruments, Atmos. Meas. Tech., 19, 1837–1852, https://doi.org/10.5194/amt-19-1837-2026, 2026.

Received: 13 Jun 2025 – Discussion started: 19 Sep 2025 – Revised: 15 Jan 2026 – Accepted: 09 Feb 2026 – Published: 16 Mar 2026

1 Introduction

In nearly all data analyses, such as data comparisons, aggregating/combining/merging data, data assimilation etc., information about data uncertainty is needed. Such characterization of uncertainty would ideally include both systematic and random components, as well as spatio-temporal resolution of the data, as discussed in von Clarmann et al. (2020). Validation of uncertainty estimates is needed, especially if the measurement uncertainty cannot be fully characterized or is based on assumptions. This is typical for remote-sensing measurements, which use retrievals of atmospheric parameters that solve inverse problems. The random uncertainty (i.e. the component of uncertainty that varies randomly and independently between repeated measurements) of the remote sensing measurements is usually estimated via propagation of instrumental noise and other random uncertainties through the inversion algorithm. These estimates, which are sometimes referred to as “ex-ante” errors (von Clarmann, 2006) (other terms are “prognostic”, “predicted”, “inductive”, or “bottom-up”), can be imperfect due to various approximations used in retrievals, or due to the presence of random components in parameter uncertainties.

The aim of this paper is to provide an overview of the methods for validation of random error component. The terms “error” and “uncertainty” are used interchangeably in our paper. We extend the overview of such methods presented in the introduction of Sofieva et al. (2014) and illustrate them using ozone profile retrievals from Earth-orbiting limb and occultation instruments. In our paper, we discuss the applicability and limitations of each method, with the focus on ozone profile retrievals from satellite measurements.

This paper contributes to the APARC (Atmospheric Processes and their Role in Climate) activity TUNER (Towards UNified Error Reporting) https://aparc-climate.org/activities/tuner/ (last access: 13 February 2026).

2 Data

To illustrate the methods employed for validation of random uncertainties, we use ozone profiles retrieved from several limb and occultation measurements. The summary of the datasets is collected in Table 1, and the principles of uncertainty estimates are described below.

Table 1Information about the ozone profile datasets used in the paper.

Download Print Version | Download XLSX

2.1 Atmospheric Chemistry Experiment -Fourier Transform Spectrometer (ACE-FTS)

ACE-FTS is a solar occultation instrument operating on board SCISAT satellite from February 2004 to present. ACE-FTS is a high-spectral resolution Fourier-transform spectrometer in the infrared (2.2–13 µm) wavelength regions. Ozone profile retrievals are performed via the global non-linear fitting of the observed spectra. In retrievals from ACE-FTS, estimated random uncertainties of ozone profiles are the fitting errors from the least-squares inversion process (Boone et al., 2005; Sheese et al., 2022). The mean relative random uncertainties are estimated to be lower than 3 % between 12 and 62 km and typically less than 2 % around 30–35 km. Relative uncertainties are slightly higher in polar regions. The ACE-FTS ozone uncertainty estimates slightly grow with time, going from ∼ 1.7 % in the middle stratosphere in the beginning of the mission to ∼ 2.0 % in the recent period. The vertical resolution of ACE-FTS ozone profiles is estimated to be ∼ 3 km.

2.2 Global Ozone Monitoring by Occultation of Stars (GOMOS)

GOMOS was a stellar occultation instrument operated on board Envisat in 2002–2012 (Bertaux et al., 2010). Ozone profiles are retrieved from UV-VIS spectrometer measurements using two-step inversion (Kyrölä et al., 2010): the spectral inversion followed by the vertical inversion. In the vertical inversion, a Tikhonov-type target-resolution regularization is applied (Kyrölä et al., 2010; Sofieva et al., 2004), so that the vertical resolution of retrieved profiles is the same for all occultations. In this paper we use the GOMOS ozone profiles processed with ALGOM2s v.1 Scientific Processor (Sofieva et al., 2017). The error propagation scheme is similar to that used in GOMOS IPF v.6 processor (Kyrölä et al., 2010; Tamminen et al., 2010), as the ALGOM2s ozone profiles are identical to those of IPF v.6 in the stratosphere and differ only in the UTLS. The error estimates (square roots of the diagonal elements of the covariance matrix) are provided in the Level 2 data. The covariance matrix of retrieved profile uncertainties is obtained via Gaussian error propagation through the GOMOS inversion, see Tamminen et al. (2010) for details. Both noise and the dominating random modelling error (due to scintillations) are taken into account in GOMOS inversion. Thus, error estimates provided in Level 2 files represent the total random uncertainty estimates (if neglecting the random part of the parameter errors, which are expected to be minor compared to the abovementioned random error sources).

The random uncertainties of GOMOS ozone profiles depend on stellar brightness, spectral class and obliquity of occultation. They are typically in the range from 0.5 % to 5 % in the stratosphere. Examples of typical uncertainties of GOMOS ozone profiles can be found in Tamminen et al. (2010). An extensive validation of GOMOS random uncertainty estimates is performed and reported in Sofieva et al. (2014). It was shown that GOMOS random uncertainty estimates are realistic for not-dim stars. Due to instrument ageing, GOMOS random uncertainty estimate grow with time, especially for dim stars (Tamminen et al., 2010).

2.3 Michelson Interferometer for Passive Atmospheric Sounding (MIPAS)

MIPAS was an infrared limb emission spectrometer operated on board Envisat in 2002–2012. In this work, we use MIPAS v8 data retrieved with the IMK/IAA processor. Ozone profile retrievals is performed via constrained inverse modelling of limb radiances (von Clarmann et al., 2009). The random errors provided with version 8 IMK MIPAS data consist of the propagated covariance of the spectra, and further uncertainties of parameters used in the retrievals (like uncertainty of the temperature or the line-of-sight pointing) which are of random nature. A detailed description of how these random errors were calculated is provided by von Clarmann et al. (2022) and Kiefer et al. (2023). According to Kiefer et al. (2023, Supplement), the total random error is up to about a factor of 2 to 3 (even >3 for unfavorable conditions like polar winter) higher than the pure propagated measurement noise error in the lower part of the stratosphere (up to about 30 km), and by a factor of 1.1 to 1.4 higher in the upper part of the stratosphere. For illustrations in our paper, if not specified explicitly, we use the total random uncertainty.

2.4 Microwave Limb Sounder (MLS)

MLS is a microwave limb emission spectrometer operating on board the Aura satellite since 2004. This paper uses the Aura MLS “Version 5” dataset (Livesey et al., 2022), retrieved using the same tomographic retrieval algorithm employed for all previous MLS data versions (Livesey et al., 2006; Read et al., 2006; Schwartz et al., 2006). The Level 2 data products consist of vertical profiles spaced 1.5° along the orbital track, with pressure as the vertical coordinate. Each profile is accompanied by a separate profile reporting the estimated precision uncertainty (i.e., random uncertainty) in the profile. This is based on the square root of the diagonal of the solution covariance matrix from the optimal estimation-based retrieval and thus overestimates the scatter in the geophysical products in cases where the a priori information and other regularization constraints contribute significantly to the results (typically at the upper end of the useful vertical range of each product). Systematic errors due to uncertainties in instrument calibration, spectroscopy, and other parameters have been quantified through multiple perturbation studies detailed in Read et al. (2006), and are documented for each version, along with averaging kernels and rules for data use and screening, in a data quality document (Livesey et al., 2022 for version 5).

2.5 Ozone Mapper and Profile Suite – Limb Profiler (OMPS-LP)

OMPS-LP has been operating on board the Suomi- National Polar-orbiting Partnership satellite since 2012. It measures scattered solar light in the limb-viewing geometry. Ozone profiles are retrieved using measurements at UV and Visible wavelengths. For illustrations in this paper, we use three ozone profile datasets from the OMPS-LP instrument: OMPS-LP USask 2D v1.3.0 processed at the University of Saskatchewan, OMPS-LP UBr v4.1 processed at the University Bremen, and the NASA processor v2.6.

The OMPS-LP USask two-dimensional retrieval process uses Gaussian error propagation to estimate the covariance of the retrieved solution due to measurement noise (ignoring the smoothing error). The reported precision is the square root of the diagonal elements of the converged solution covariance matrix. The measurement noise is assumed to be a constant 1 % at all altitudes and wavelengths.

OMPS-LP UBr v.4.1 Level 2 data provide covariance matrices for the retrieved profiles which are obtained by the propagation of the measurement noise errors. The latter are estimated from spectral fit residuals obtained during the pre-processing step. Contributions of the parameter errors are estimated using the Monte-Carlo approach for a set of representative observations and reported by Arosio et al. (2022) along with total error budget estimations.

The NASA OMPS-LP v2.6 algorithm (Kramarova et al., 2024; Rault and Loughman, 2013) retrieves ozone profiles by employing the second order Tikhonov regularization method. The estimated precision for each profile retrieval is calculated using the square roots of diagonal elements of the solution covariance matrix. Systematic errors related to uncertainties in altitude registration, some algorithmic parameters, and a priori profiles have been evaluated and reported by Kramarova et al. (2024); Moy et al. (2017).

2.6 Optical and Spectroscopic Remote Imaging System (OSIRIS)

OSIRIS has been operating on board the Odin satellite since 2001. It uses measurements of scattered solar light for retrievals of ozone and other constituents' profiles. OSIRIS V7.2 data uses standard Gaussian error propagation to estimate the uncertainties in the retrieved ozone profiles. The covariance matrix is calculated through propagation of the measurement noise that is estimated through counting statistics. The reported precision is the square root of the diagonal elements of the covariance matrix.

2.7 Stratospheric Aerosol and Gas Experiment (SAGE)

SAGE II and SAGE III/ISS are solar occultation instruments. The retrieval algorithms for both SAGE II v7.0 (Damadeo et al., 2013) and SAGE III/ISS v5.3 (see SAGE III ATBD, 2002 and Wang et al., 2020) are similar. Ozone profile retrievals are based on the global fit using visible wavelengths (Chappuis band). The uncertainties are computed from the statistical distribution of observations in the L1 transmission data and then propagated through the retrieval algorithm via Gaussian error propagation into the uncertainties for the L2 profiles of ozone and other species. Typical random retrieval uncertainties are within 1 % between 18 and 52 km.

2.8 Scanning Imaging Absorption Spectrometer for Atmospheric Cartography (SCIAMACHY)

SCIAMACHY was a passive remote sensing spectrometer operated on Envisat over 2002–2012. It measured scattered solar light in the ultraviolet, visible and near infrared wavelength region (240–2380 nm). Ozone profile retrievals are performed via constrained inverse modelling of limb radiances using UV and visible spectra. SCIAMACHY V3.5 Level 2 data, which are used in this work, provide covariance matrices for the retrieved profiles which are obtained by the propagation of the measurement noise errors. The latter are estimated from spectral fit residuals obtained during the pre-processing step. Contributions of the parameter errors are estimated using the Monte-Carlo approach for a set of representative observations. Rahpoe et al. (2013) described the method to estimate the parameter errors and reported resulting values along with the total error budget estimation for the precursor retrieval version (V2.5). The results for V3.5 are expected to be similar.

3 Methods for validation of random uncertainty estimates

3.1 Specifics of satellite measurements of atmospheric composition

In remote sensing, retrieved parameters result from solving the inverse problem. The reported random uncertainties are usually estimated via propagation of instrumental noise and other random errors through the inversion algorithm. Error estimates might be wrong if the source uncertainties used for propagation are not known well enough. If some of the error sources are not characterized and the corresponding uncertainties are not considered, the reported uncertainty is underestimated.

The normalized χ² statistics, $χ_{norm}^{2}$ , is commonly used for assessing the adequacy of the theoretical description of measurements (forward model) and as an indication of the correctness of random uncertainty estimates. $χ_{norm}^{2}$ is usually evaluated as:

\begin{matrix} (1) & χ_{norm}^{2} = \frac{1}{N - p} {(y - y_{\mod})}^{T} S_{y, random}^{- 1} (y - y_{\mod}), \end{matrix}

where y is the vector of observed parameters (e.g., spectrally resolved radiance values, transmittances), y_mod is the vector of modelled (theoretical) measurements, S_y,random is the covariance matrix of random measurement errors, N is the number of measurements and p is the number of retrieved parameters (e.g., Bevington and Robinson, 2003; Taylor, 1997). $χ_{norm}^{2}$ is also called “ χ² per degree of freedom”. If the theoretical model describes the experimental data correctly and the measurement errors are properly defined, $χ_{norm}^{2} \approx 1$ , ideally. Very large $χ_{norm}^{2}$ values indicate underestimated random uncertainties, while $χ_{norm}^{2}$ smaller than 1 imply that they are overestimated (Bevington and Robinson, 2003). This simple analysis of $χ_{norm}^{2}$ has helped to discover a missing random error component in early GOMOS retrievals, which was due to uncorrected residual scintillation, and to parameterize it in later processing versions (Sofieva et al., 2010; Tamminen et al., 2010). In the first step of the GOMOS retrievals – the spectral inversion (Kyrölä et al., 2010) – $χ_{norm}^{2}$ is evaluated at each tangent altitude independently; y and y_mod in the GOMOS case are measured and modelled transmittance spectra, N is the number of spectral pixels (maximum 1416) and p is the number of fitted parameters (slant column densities for ozone, NO₂, NO₃, and aerosol parameters). Figure 1 compares $χ_{norm}^{2}$ in the GOMOS retrievals in the set of oblique occultations from the brightest star Sirius when residual scintillation errors are ignored (blue) or considered (red).

https://amt.copernicus.org/articles/19/1837/2026/amt-19-1837-2026-f01

Figure 1Adapted from (Sofieva et al., 2010): $χ_{norm}^{2}$ in GOMOS retrievals from a set of oblique occultations of Sirius (August 2003, 66° S, obliquity angle is ∼ 25°). Blue: random error due to residual scintillation is ignored, red: modelling errors are taken into account in the retrievals. Dots: values in individual occultations, bold lines indicate median values for the sets. The black dashed line indicates $χ_{norm}^{2} = 1$ .

A similar analyses of $χ_{norm}^{2}$ applied to GOMOS IPF v6 data has spotted $χ_{norm}^{2} < 1$ at upper altitudes in case of dim stars, and a more detailed analysis identified the reason and the influence on uncertainty estimates (Sofieva et al., 2014).

In some retrievals, the uncertainty estimate is derived from the fit residuals, as it is done for SCIAMACHY and OMPS-LP UBr ozone retrievals (e.g., Arosio et al., 2022). Such an approach forces $χ_{norm}^{2}$ to be close to 1 and can provide an estimate of the uncertainties of measurements in the case one does not trust the information on measurement uncertainty contained in the Level-1 data.

$χ_{norm}^{2}$ is a statistical and integral characteristic that indicates not only correctness of the random uncertainty estimates but also consistency of measurements with the forward model used for inversion of atmospheric parameters, which is itself often incomplete, or reflects inaccurate knowledge of instrument calibration and/or spectroscopic parameters. In some cases, a priori/regularization terms are included into $χ_{norm}^{2}$ too. Sometimes, this metric is used to determine how strong the regularization should be. Therefore, analyses of $χ_{norm}^{2}$ at the measurement level are useful, but they cannot substitute validation of uncertainty estimates of the retrieved ozone profiles.

The retrieved ozone profiles are characterized not only by uncertainty estimates but also by vertical resolution and, in the case of tomographic retrievals, by horizontal resolution. This means that, for such retrievals, the impacts of radiance noise on adjacent data points are not independent, and the proper characterization of associated uncertainties is obtained using their covariance matrix.

If the retrieval is performed with the Bayesian maximum a posteriori estimates (Rodgers, 2000; von Clarmann, 2006), a data correlation can also arise due to the usage of a priori information. These aspects should be taken into account when validating uncertainties.

3.2 General strategy

In the laboratory, experimental precision estimates can be obtained using repeated measurements under the same conditions: the sample variance $s^{2} = var (x) = 〈(x - 〈x〉)^{2}〉$ (angular brackets denote the mean hereafter) approaches the variance of random error distribution σ² (i.e., squared precision) when the size of sample N tends to infinity. For different samples of size N, the values of sample variance will vary due to different random error realization. An ensemble of sample variances is a random variable with a distribution depending on noise variance σ² and N. The quantity $\frac{(N - 1) s^{2}}{σ^{2}}$ has a χ² distribution with N−1 degrees of freedom, $χ_{N - 1}^{2}$ (e.g., Bevington and Robinson, 2003; Taylor, 1997). For large N, $χ_{N - 1}^{2}$ distribution can be approximated by a Gaussian distribution with variance 2N thus var $(\frac{(N - 1) s^{2}}{σ^{2}}) = 2 N$ . This gives the uncertainty of the experimentally estimated random error

\begin{matrix} (2) & var (s^{2}) \approx σ^{4} \frac{2}{N} . \end{matrix}

In contrast with many laboratory experiments, geophysical observation conditions cannot be kept exactly constant for atmospheric measurements. Therefore, the sample variance contains a contribution from the natural variability $σ_{nat}^{2}$ :

\begin{matrix} (3) & s^{2} \approx σ^{2} + σ_{nat}^{2} . \end{matrix}

For validation of uncertainty estimates, $σ_{nat}^{2}$ should be minimized by selecting collocated measurements or it should be estimated from independent sources (for example, from a chemistry-transport model, CTM).

Approaches for validation of error estimates usually rely on the variance of the difference, $s_{12}^{2} = var (x_{1} - x_{2})$ , in a set of collocated measurements x₁ and x₂:

\begin{matrix} (4) & s_{12}^{2} = σ_{0, nat}^{2} + σ_{1}^{2} + σ_{2}^{2} . \end{matrix}

In Eq. (4), $σ_{0, nat}^{2}$ stands for the natural variability within a space-time collocation window (note that $σ_{0, nat}^{2}$ , which represents the mismatch uncertainty, is different from $σ_{nat}^{2}$ in Eq. (3) that represent natural variability in a certain location).

It is important to note that for the vertically resolved ozone profile data involved, calculating differences and combining data require harmonization of data representations in terms of physical quantities and vertical sampling at least. As the satellite data result from a retrieval process, knowledge of prior information and averaging kernel matrices in principle allows retrieval differences to be accounted for as well. Keppens et al. (2019) provide an overview of harmonization operations for atmospheric profile observations, covering vertical representation matching, vertical smoothing matching, and retrieval matching (essentially the prior information contributions). The effect of these manipulations on the information content and uncertainty budget of the original data is extensively discussed in that work and will not be repeated here. In the following, we assume that all profiles are presented in a similar vertical resolution, so vertical smoothing difference errors between profiles can be neglected. For the illustrations in this paper, the harmonization of the vertical resolution is not needed, as the satellite limb profiles considered in this study have similar vertical resolution, see Table 1.

We divide the methods for random uncertainty validation into two groups depending on what kind of data are used: (1) from the same instrument and (2) from different instruments.

3.3 Using collocated measurements from the same instrument

For perfectly collocated measurements ( $σ_{0, nat}^{2} \approx 0$ ) from the same instrument with the same precisions $σ_{1} = σ_{2} = σ$ , Eq. (4) is reduced to $s_{12}^{2} \approx 2 σ^{2}$ , thus allowing validation of the uncertainty estimate ${\hat{σ}}^{2} = s_{12}^{2} / 2$ . In this estimate, random errors in x₁ and x₂ are assumed to be uncorrelated. This uncertainty validation method was realized, for example, for closely collocated MIPAS ozone profiles (Piccolo and Dudhia, 2007) and OSIRIS ozone measurements (Bourassa et al., 2012). The uncertainty of this experimental precision estimate is defined by the uncertainty of sample variance $s_{12}^{2}$ .

There are several limitations associated with this method. First, natural variability $σ_{0, nat}^{2}$ is not necessarily small, even with a tight spatio-temporal window. In such cases $s_{12}^{2}$ will be larger than a combined uncertainty 2σ², thus the estimate ${\hat{σ}}^{2} = s_{12}^{2} / 2$ will be biased high. Second, the number of self-collocated measurements for limb satellites is limited. Self-collocated measurements are usually found around the Poles, while at other latitudes a larger temporal separation (which can involve measurements in different day/night conditions) has to be accepted. In addition, the number of self-collocated measurements for instruments with coarse sampling (stellar and solar occultation) is relatively low. For example, ∼ 200 collocated occultations (with spatial separation Δr less than 300 km and temporal separation Δt less than 3h) per year of the star S30 (notation in the GOMOS catalogue) can be found for GOMOS; all located near the North Pole in winter. For ACE-FTS, fewer than 100 self-collocations per year with the criteria $Δ t = 3 h, Δ r = 300 km$ are found, and ∼ 400 self-collocated measurements per year can be found with the collocation criteria $Δ t = 5 h, Δ r = 500 km$ . For SAGE II and SAGE III/ISS, there are no self-collocated measurements with the abovementioned collocation criteria.

Provided many collocated measurements from the same instrument are available (self-collocations), the precision of the dataset can also be estimated by computing a so-called structure function D(ρ) (e.g., Tatarskii, 1961), or the RMS difference of the field as a function of increasing separation in time and in space:

\begin{matrix} (5) & D (ρ) = D (r_{1} - r_{2}) = \frac{1}{2} 〈{[f (r_{1}) - f (r_{2})]}^{2}〉 \end{matrix}

where r₁ and r₂ are two locations and a vector $ρ = r_{1} - r_{2}$ is their spatio-temporal separation. In geostatistics, D is called the variogram (Cressie, 1993; Matheron, 1963; Wackernagel, 2003). When using experimental (noisy) data for evaluation of the variogram/structure function, the difference of an atmospheric parameter in two locations is defined not only by the natural variability of this atmospheric parameter, but also by uncertainty of the measurements. Therefore, with the spatio-temporal separation ρ→0, D(ρ) tends toward the random uncertainty variance $σ_{noise}^{2}$ (the offset at zero is called “nugget” in geostatistics). Since self-collocated measurements are from the same instrument, no biases between them are expected. Figure 2 illustrates the structure function method, which is discussed in details in Sofieva et al. (2021) and applied to TROPOMI total ozone measurements.

https://amt.copernicus.org/articles/19/1837/2026/amt-19-1837-2026-f02

Figure 2Reproduced from (Sofieva et al., 2021): The schematic representation of the structure function estimated from noisy measurements, ρ denotes spatio-temporal separation. Blue line: the structure function for noise-free data, red line: the structure function for experimental (noisy) data.

https://amt.copernicus.org/articles/19/1837/2026/amt-19-1837-2026-f03

Figure 3Colored thin curves: experimental precision estimates $S_{12} / \sqrt{2}$ for different separation distances; thick curves with errorbars: the mean ex-ante uncertainty estimate and its standard deviation for propagated instrumental noise (magenta) and full random uncertainty (red); MIPAS self-collocations close to the North and South Poles in 2005–2011 during local summer are used.

Download

For ozone profiles from limb instruments, the structure function method is difficult to apply, as it requires a substantial number of measurements with close separation. An analogous method – evaluation of the one-dimensional structure function in polar regions (with transformation of temporal mismatch to spatial separation using the ECMWF wind field) – has been applied for validation of random uncertainty estimates of the MIPAS and GOMOS ozone profiles (Laeng et al., 2015; Laeng and Von Clarmann, 2021; Sofieva et al., 2014).

Figure 3 illustrates the application of the structure function method to MIPAS v8 ozone profiles, which have a detailed error characterization. With decreasing separation distance between measurements, ex-post uncertainties $s_{12} / \sqrt{2}$ approach to a curve, which is between ex-ante estimates for total random error (thick red curves) and for propagated instrumental noise (thick magenta curves).

3.4 Using measurements from different instruments

3.4.1 Method of Fioletov et al. (2006)

Fioletov et al. (2006) have proposed estimating simultaneously the random data uncertainties and natural variability from sample variances of two perfectly collocated datasets and the variance of their difference. We reproduce the formulae here, as we assess the application of this method. The Fioletov method relies on sample variances $s_{i}^{2}$ of the collocated data:

\begin{matrix} (6) & s_{i}^{2} = σ_{nat}^{2} + σ_{i}^{2}, i = 1, 2 \end{matrix}

and the variance of their difference (Eq. 4), which is reduced to:

\begin{matrix} (7) & s_{12}^{2} = σ_{1}^{2} + σ_{2}^{2}, \end{matrix}

by assuming $σ_{0, nat}^{2} \approx 0$ . It is also assumed that the bias between datasets is the same for the selected sample.

In Eqs. (6) and (7), $σ_{nat}^{2}$ is natural variability and $σ_{i}^{2}$ are measurement precisions. Solving Eqs. (6) and (7) for $σ_{nat}^{2}$ , $σ_{1}^{2}$ and $σ_{2}^{2}$ , we get their experimental estimates based on sample variance:

\begin{matrix} (8) & \begin{array}{l} {\hat{σ}}_{nat}^{2} = 0.5 (s_{1}^{2} + s_{2}^{2} - s_{12}^{2}) \\ {\hat{σ}}_{1}^{2} = 0.5 (s_{1}^{2} - s_{2}^{2} + s_{12}^{2}) \\ {\hat{σ}}_{2}^{2} = 0.5 (s_{2}^{2} - s_{1}^{2} + s_{12}^{2}) \end{array} \end{matrix}

The uncertainty of the natural variability and precision estimates given by Eq. (8) depend on uncertainty of sample variances, which depend, in turn, on sample variances themselves and the number of measurements. The estimates are thus only as accurate as the least accurate of these parameters. In approximation of large samples (when uncertainty of the sample variance can be approximated by Eq. 2), the variance of the estimates (8) can be expressed in terms of “true” natural variability and precision variances $σ_{nat}^{2}$ , $σ_{1}^{2}$ and $σ_{2}^{2}$ as (using Eqs. 2, 6–8):

\begin{matrix} (9) & \begin{aligned} var ({\hat{σ}}_{1}^{2}) & = var ({\hat{σ}}_{2}^{2}) = var ({\hat{σ}}_{nat}^{2}) = \frac{1}{2 N} ({(σ_{nat}^{2} + σ_{1}^{2})}^{2} \\ + {(σ_{nat}^{2} + σ_{2}^{2})}^{2} + {(σ_{1}^{2} + σ_{2}^{2})}^{2}) \end{aligned} \end{matrix}

with the following simple estimates for upper and lower limits (after opening brackets in Eq. 9):

\begin{matrix} (10) & \frac{1}{N} (σ_{nat}^{4} + σ_{1}^{4} + σ_{2}^{4}) < var ({\hat{σ}}_{1, 2, nat}^{2}) < \frac{1}{N} {(σ_{nat}^{2} + σ_{1}^{2} + σ_{2}^{2})}^{2} . \end{matrix}

Since the precision estimates by the Fioletov method are linear combinations of three sample variances, they can have large uncertainty if one of the sample variances is large and/or the number of collocated measurements is limited. Not for all combinations of limb instruments perfectly collocated measurements can be found (especially for instruments with sparse sampling). In practice, satellite measurements separated by a few hundreds of kilometers and a few hours are considered collocated. The natural variability within the space-time collocation window is small but not zero, thus resulting in additional difficulties in the application of this method. Note that the estimates from Eq. (8) do not ensure positivity of $σ_{1}^{2}$ and $σ_{2}^{2}$ . Negative solutions can be within uncertainty intervals; their appearance can be caused either by insufficient amount of data or by the unaccounted natural variability within the collocation window.

https://amt.copernicus.org/articles/19/1837/2026/amt-19-1837-2026-f04

Figure 4Application of Fioletov's method to MIPAS and SCIAMACHY ozone datasets in 2007. Left: sample standard deviations s₁ and s₂ in collocated pairs, and the standard deviation of differences s₁₂. Right: ex-post uncertainty estimates ${\hat{σ}}_{1}$ and ${\hat{σ}}_{2}$ , and the estimate of natural variability ${\hat{σ}}_{nat}$ with 1σ uncertainties (solid lines). Ex-ante uncertainty estimates are shown with dashed lines. Error bars are 1σ uncertainties.

Download

For illustration, we applied this method to MIPAS and SCIAMACHY measurements in 2007. Collocated profiles with time separation less than 5 h, spatial distance less than 400 km and latitude difference less than 2° were selected in the tropics (20° S–20° N), with 13 785 such profile pairs found. The left panel of Fig. 4 shows sample standard deviations s₁ and s₂ of MIPAS and SCIAMACHY profiles, respectively, and the standard deviation of differences s₁₂. The right panel shows the a posteriori (“ex-post” in terminology of von Clarmann et al., 2020) estimates of random uncertainties and natural variability from Eq. (8) with uncertainties therein given by Eq. (9). The estimates of random errors reported by the retrieval algorithms (“ex-ante” in terminology of von Clarmann et al., 2020) are also shown in right panels of Fig. 4 by dashed lines. Negative estimates of $σ_{1}^{2}$ and $σ_{2}^{2}$ are ignored. All computations are performed in absolute units, but the estimates are plotted as a percentage for clarity. We observe that the ex-ante and ex-post uncertainties of MIPAS ozone profiles are very close to each other. For SCIAMACHY, Fioletov's method suggests a larger uncertainty estimates at altitudes 25–37 km than reported in the retrievals.

https://amt.copernicus.org/articles/19/1837/2026/amt-19-1837-2026-f05

Figure 5Ex-ante and ex-post uncertainty estimates from Fioletov's method for MLS and ACE-FTS in years 2018–2019 (a) and 2018–2022 (b), and SAGE III/ISS and ACE-FTS in years 2018–2023. The number of collocated profiles is indicated in the panel titles.

Download

As a general note for this and subsequent illustrations, the ex-ante random uncertainties for some instruments (see Table 1 for details) are due to measurement noise. This is a dominating source of random error, however not the only one (von Clarmann et al., 2020). As a result, ex-post random uncertainty estimates are expected to be slightly larger.

The best performance of the Fioletov's method is expected for datasets with dense sampling and similar random uncertainty estimates. The data should be selected in the regions of low variability. As mentioned above, Fioletov's method requires a large number of collocated profiles to yield reliable estimates of ex-post uncertainties. The application of this method to solar occultation data by ACE-FTS and SAGE III/ISS is illustrated in Fig. 5. The same collocation criteria are used, but the number of collocated profiles is significantly smaller than for MIPAS and SCIAMACHY, even though more years of data are used. For MLS and ACE-FTS, the number of collocations is 741 for years 2018–2019 and 1471 for years 2018–2022. The ex-post uncertainty estimates from Fioletov's method have substantial error bars (Fig. 5a, b). In application to ACE-FTS and SAGE III/ISS, only 19 collocated profiles are found, so the resulting uncertainty estimates have huge error bars (Fig. 5c).

The Fioletov's method is applicable to both individual-profile and tomographic retrievals, as the data in collocated pair can be considered as uncorrelated. As a general note, the fully rigorous approach of defining co-locations for tomographic retrievals should be different from that used for 1D individual-profile retrievals. Since the along-track dimension is part of the retrieval in the tomographic technique, interpolation should be performed along this dimension to the co-location point.

3.4.2 A differential method: comparisons of natural variability patterns

Sofieva et al. (2014) proposed a simple method for detecting flaws and/or checking of consistency of random uncertainty estimates. The authors called it a “differential method”. Let us consider, for example, two datasets selected in a region of small and slowly changing natural variability. A large sample size is assumed. If the random uncertainty estimates for both datasets are correct, then the difference in sample variance $s_{1}^{2} - s_{2}^{2}$ will be equal to the difference in precision estimates $σ_{1}^{2} - σ_{2}^{2}$ . The term $σ_{nat}^{2}$ from Eq. (3) cancels out because it is assumed to be the same for both samples. The estimates of the sample variance, $s_{i}^{2}$ , provide the upper limit for experimental estimates of measurement precision, as $s_{i}^{2} > σ_{i}^{2}$ .

A simple comparison of sample variance $s_{i}^{2}$ with the random uncertainty estimate $σ_{i}^{2}$ enables the detection of overestimated random uncertainties, if the relation $s_{i}^{2} > σ_{i}^{2}$ is violated. Through such a comparison, Sofieva et al. (2014) found overestimated random uncertainties for the GOMOS ozone profiles using very dim stars, and further investigation by the instrument experts identified a flaw in accounting for instrumental dark charge noise.

If one of the datasets has realistic precision estimates, for example from well-calibrated instruments (so-called Fiducial Reference Measurements), or those estimates are validated by other methods, then application of the differential method is straightforward.

If there are several datasets with unvalidated (or not completely validated) uncertainty estimates, one can consider confronting natural variability estimates ${\hat{σ}}_{nat}^{2} = s_{i}^{2} - σ_{i}^{2}$ . Since the natural variability estimates from various datasets should agree within uncertainty intervals, strong deviations from the majority estimates can indicate potential flaws in the random error estimation.

For example, Sofieva et al. (2014) compared estimates of natural variability in the tropics from GOMOS data using different stars and found consistent positive values of ${\hat{σ}}_{nat}^{2}$ for bright stars ( $χ_{norm}^{2} \approx 1$ and application of the structure function method also suggested that the random uncertainties are realistic for bright stars). However, for very dim stars, negative values of ${\hat{σ}}_{nat}^{2}$ have been detected, which, together with $χ_{norm}^{2} < 1$ , pointed to overestimated random uncertainties.

https://amt.copernicus.org/articles/19/1837/2026/amt-19-1837-2026-f06

Figure 6Top panels: Sample standard deviation s (solid lines) and the mean uncertainty estimates σ (dashed lines) in the tropical stratosphere (20° S–20° N) in 2002–2004 (1st column), 2006–2008 (2nd column) and 2018–2020 (3rd and 4th column). Bottom panels: the estimates of the natural variability ${\hat{σ}}_{nat} = \sqrt{s^{2} - σ^{2}}$ with its uncertainty (1 SD) in the tropics for the same periods. Colors are specified in the bottom panels. For GOMOS, occultations of 30 brightest stars are selected for the analysis.

Download

In this paper, we illustrate this differential method by considering sample variance and uncertainty estimates from several limb and occultation instruments. The measurements are selected in the tropics, 20° S–20° N, in three periods, 2002–2004 (first column of Fig. 6), 2006–2008 (2nd column) and 2018–2020 (3rd and 4th column of Fig. 6), for the limb instruments operating in these periods. The upper panels of Fig. 6 show the sample standard deviation (solid lines) and the mean random uncertainty estimates (dashed lines). The lower panels show the estimates of natural variability ${\hat{σ}}_{nat}$ with associated uncertainties indicated by error bars. For GOMOS, occultations of the 30 brightest stars are used for the analysis, in order to make the GOMOS dataset more homogeneous and to avoid data with overestimated uncertainties. SAGE II and SAGE III/ISS ozone profiles were smoothed down to 2 km vertical resolution, for compatibility with other datasets.

As observed in Fig. 6, for all datasets except for OMPS USask, s<σ as expected. In case of OMPS USask, the mean uncertainty σ exceeds the sample standard deviation s at several altitudes, which indicates an overestimation of the random uncertainty component. The overestimation is caused by a bug in the V1.3.0 product and will be fixed in a future version. The profiles of natural variability ${\hat{σ}}_{nat}$ obtained from GOMOS, MIPAS, OSIRIS, ACE-FTS, SAGE II, SAGE III/ISS, OMPS UBr and OMPS NASA are very close to each other, and their uncertainty intervals overlap. For SCIAMACHY, the pattern of natural variability is also similar, but the increased sample variance at 25–35 km is not explained by its random uncertainty estimates; this suggests a slight underestimation of random error component at these altitudes. For MLS and SAGE III/ISS, there is a very good agreement with other datasets below 40–45 km; above 45 km, the random uncertainty estimates grow fast with altitude, which results in somewhat smaller estimates of ${\hat{σ}}_{nat}$ compared to other datasets. This indicates an overestimation of MLS and SAGE III/ISS random uncertainties above 40–45 km (see also the explanation in Sect. 2.4).

Successful application of this method implies the following conditions:

a.
Natural variability should be the same for both samples.
b.
Natural variability should not be too large compared to the precision estimates, otherwise the sample variance estimates will have large uncertainty. This condition of small natural variability is satisfied for ozone in the tropical stratosphere and in the summer stratosphere at other latitudes. (However, the random uncertainty estimates for limb-instruments are usually smaller than the natural variability.)
c.
Measurements within each sample should have similar precision.

The method can also be applied to the data retrieved with the tomographic approach, if the selected region is sufficiently large (exceeding the horizontal correlation length of tomographic retrievals).

If the natural variability is known from an external source (for example, estimated from the measurements with realistic uncertainty estimates or from a model that correctly reproduces natural variability), ex-post uncertainties can be estimated as ${\hat{σ}}_{ex-post}^{2} = s^{2} - σ_{nat}^{2}$ , where s² is the sample variance of a set of measurements and $σ_{nat}^{2}$ is the estimated of the natural variability. The use of the modelled data in validation of random uncertainty estimates is also discussed in Sect. 3.5 of this paper.

3.4.3 Triple collocation methods

Stoffelen's method

The idea of using the collocated measurements from three (or more) systems for data calibration and validation of uncertainties was proposed by Stoffelen (1998). In his formulation, it is supposed that three measurement systems X, Y, Z provide collocated measurements of the same quantity t. Let system X be the reference system with respect to which systems Y and Z are to be calibrated. Suppose also that linear calibration (simple scaling) is sufficient for the whole range of values under consideration, and that the reference system X is free of bias. Then the measurements can be written as

\begin{matrix} (11) & \begin{array}{l} x = t + ε_{x} \\ y = c_{y} (t + ε_{y}) \\ z = c_{z} (t + ε_{z}), \end{array} \end{matrix}

where c_y and c_z are scaling factors and $ε_{x}, ε_{y}, ε_{z}$ are random errors in each measurement sample. The random error components are assumed to be unbiased and not correlated with each other and with the parameter t. The calibration coefficients can be derived from covariances

\begin{matrix} (12) & \begin{array}{l} c_{y} = cov (y, z) / cov (x, z) \\ c_{z} = cov (y, z) / cov (x, y), \end{array} \end{matrix}

where cov $(\cdot, \cdot)$ denotes covariance. These coefficients allow creating the calibrated data $y^{*} = c_{y}^{- 1} y, z^{*} = c_{z}^{- 1} z$ . Then the natural variability of the parameter t can be estimated as e.g. $σ_{t}^{2} = cov (x, y^{*})$ and uncertainty variances as

\begin{matrix} (13) & \begin{array}{l} σ_{x}^{2} = var (x) - σ_{t}^{2} \\ σ_{y}^{2} = var (y^{*}) - σ_{t}^{2} \\ σ_{z}^{2} = var (z^{*}) - σ_{t}^{2} . \end{array} \end{matrix}

The main assumptions of the method are: (a) the measurements are a linear function of the true signal with additive zero-mean random measurement noise; (b) measurement errors and true signal are stationary, and they are independent; (c) measurement errors are independent, and (d) the measurements are perfectly collocated, i.e. mismatch uncertainty is zero. The assumptions (b–d) are similar to other methods described above. The application of this method to validation of random uncertainties of tropospheric ozone from nadir instruments can be found in Hubert et al. (2021). In case of limb satellite observations, the requirement of triple collocation dramatically reduces the sample size (by approximately an order of magnitude); this results in larger uncertainties of the estimated parameters.

Another variant of the triple collocation method is described in the following subsection.

Von Clarmann's method

Von Clarmann proposed a method for random uncertainty validation, which uses 3 sets of measurements with pairwise collocations (Laeng and von Clarmann, 2021). This method takes into account the small-scale natural variability, which is estimated using a high-resolution chemistry-transport model data. Let us denote by $s_{i j}^{2}$ the sample variance of differences in collocated datasets i and j, by $ν_{i j}^{2}$ – natural variability (mismatch) variance of the collocated datasets i and j, and by σ_i- ex-ante uncertainty estimates. Let us assume that the true random uncertainties are $σ_{true, i}^{2} = c_{i} σ_{i}^{2}, i = 1, 2, 3$ . Then the expressions for the sample variance in the collocated pairs results in the following system for determination of correction factors c_i:

\begin{matrix} (14) & \begin{array}{c} c_{1} σ_{1}^{2} + c_{2} σ_{2}^{2} + ν_{12}^{2} = s_{12}^{2} \\ c_{1} σ_{1}^{2} + c_{3} σ_{3}^{2} + ν_{13}^{2} = s_{13}^{2} \\ c_{2} σ_{2}^{2} + c_{3} σ_{3}^{2} + ν_{23}^{2} = s_{23}^{2} \end{array} \end{matrix}

If $ν_{i j}^{2}$ are known, the solution of the linear system (14) is

\begin{matrix} (15) & \begin{array}{c} c_{1} = \frac{1}{2 σ_{1}^{2}} [(s_{12}^{2} - ν_{12}^{2}) + (s_{13}^{2} - ν_{13}^{2}) - (s_{23}^{2} - ν_{23}^{2})] \\ c_{2} = \frac{1}{2 σ_{2}^{2}} [(s_{12}^{2} - ν_{12}^{2}) + (s_{23}^{2} - ν_{23}^{2}) - (s_{13}^{2} - ν_{13}^{2})] \\ c_{3} = \frac{1}{2 σ_{3}^{2}} [(s_{13}^{2} - ν_{13}^{2}) + (s_{23}^{2} - ν_{23}^{2}) - (s_{12}^{2} - ν_{12}^{2})] \end{array} \end{matrix}

The Eqs. (14) and (15) are written in terms of correction factors, as it is presented in the original report; however, they can be also presented in terms of ex-ante and ex-post uncertainties. Similarly to Fioletov's method, Eq. (15) does not guarantee positivity of c_i (solution of the linear system), which may result in unphysical negative estimates of random error variance. Since the estimates by Eq. (15) are the linear combinations of sample variances, the accurate estimate require large samples of collocated data, similarly to the Fioletov method.

For illustration, the method was applied to MIPAS, MLS, and SCIAMACHY data in 2007 at 20° S–20° N, where 3236 triple collocations (with time difference <4 h and spatial separation <300 km) are found. The small-scale variability estimates were obtained from BASCOE model data field down-sampled to typical horizontal resolution along line of sight of limb instrument data (Laeng et al., 2022).

The estimation of the small-scale variability using the ozone fields generated by the advanced chemistry-transport model with a high horizontal resolution seems to be correct, as this mismatched variability is caused by dynamics, and it is characterized by the statistical characteristic (variance). Although some small-scale processes may not be fully resolved in the model, this should not influence the ν_ij estimates, as the effective horizontal resolution of the limb measurements along the line of sight is ∼ 300 km.

The results of application of the von Clarmann's method (Fig. 7) agree very well with other methods presented in our paper (see also below).

https://amt.copernicus.org/articles/19/1837/2026/amt-19-1837-2026-f07

Figure 7Dashed lines: ex-ante uncertainties; solid lines with error bars: ex-post random uncertainties estimated by the von Clarmann method. Error bars are 1σ uncertainties.

Download

3.5 Using CTM simulation in validation of uncertainties

Modern chemistry-transport models have high horizontal and vertical resolution, and a majority of them use meteorological reanalyses in the advection schemes. They show good agreement with the observational data, therefore it is attractive to use the information the models provide in validation of uncertainties. For example, the modelled field can be used for characterization of differences due to the co-location mismatch, i.e. differences in spatio-temporal sampling and smoothing of the variable and inhomogeneous ozone field. Such an approach has been applied in several studies (e.g., Sheese et al., 2021; Verhoelst et al., 2015).

The model estimates of small-scale natural variability are used also in von Clarmann's method. Potentially, analogous characterization would also improve the Fioletov's method.

Sofieva et al. (2022) used ozone data, which are simulated with the chemistry-transport model SILAM adjusted to MLS for ex-post random uncertainty estimates by the differential method (this method is referred to as the SUNLIT method). For each instrument and each month, the authors evaluated sample variance in 10° latitude zones from experimental data and the SILAM-adjusted field, which is sub-sampled at measurements locations. The sample variance of the model dataprovides the estimates of natural variability. Then ex-post uncertainties are estimated as ${\hat{σ}}_{ex-post}^{2} = s^{2} - σ_{nat}^{2}$ , where s² is the sample variance in a set of measurements and $σ_{nat}^{2}$ is the estimate of the natural variability. Figure 8 illustrates ex-ante and ex-post uncertainties for GOMOS, MIPAS, SCIAMACHY, OSIRIS and MLS using the data in September 2007. These estimates agree well with those obtained by the Fioletov's and von Clarmann's method (Fig. 4, right, Figs. 7, and 9). The approach of Sofieva et al. (2022) allows selecting sufficiently large data samples in a relatively short time period; the authors applied their method to the adjustment of random uncertainties for each month and for each instrument.

https://amt.copernicus.org/articles/19/1837/2026/amt-19-1837-2026-f08

Figure 8Ex-ante (dashed lines) and ex-post (solid lines) random uncertainty estimates for GOMOS, MIPAS, SCIAMACHY, OSIRIS and MLS in September 2007, based on the method described in Sofieva et al. (2022).

Download

https://amt.copernicus.org/articles/19/1837/2026/amt-19-1837-2026-f09

Figure 9Uncertainty estimates for MIPAS at 20° S–20° N in 2007 by different methods. Dashed lines: ex-ante uncertainties (coinciding for Fioletov's and von Clarmann's methods), solid lines: ex-post uncertainties (%). For the SUNLIT method, data in September 2007 are used. Error bars are 1σ uncertainties.

Download

Figure 9 compares the ex-post uncertainties for MIPAS ozone retrievals in 2007 at latitudes 20° S–20° N evaluated by different methods. For Fioletov's and von Clarmann's methods, the data from the entire year 2007 are used, while for SUNLIT only data from September 2007 are used. The agreement between ex-post uncertainty estimates by different methods is good: ex-post uncertainties are within the corresponding error bars. In the upper stratosphere, the SUNLIT method yields slightly larger ex-post uncertainties, probably due to underestimated natural variability in the model data (in SUNLIT, daily mean ozone profiles are used, thus missing diurnal variability).

3.6 Notes on validation with fiducial reference measurements

If a dataset with well known (or validated and concluded to be realistic) uncertainty estimates is available, then the validation of uncertainties of a second dataset using either Eq. (4) or the differential method is straightforward. Such an approach is usually used in validation of satellite measurements with ground-based data. When exploiting Eq. (4), it is advantageous to characterize/simulate $σ_{0, nat}^{2}$ . Such approach was explored for the validation of satellite total ozone column data by ground-based measurements (Verhoelst et al., 2015).

For ozone profiles, ozonesondes are usually used for validation of satellite data (evaluation of biases and drifts). However, according to Tarasick et al. (2021) the characterization of ozonesonde uncertainties is even more complicated than for satellite data, and the uncertainties are not constant but varying in the range 5 %–10 % (sometimes even up to 20 %). Together with a limited number of tight collocations, these features impose limitations for validation of satellite random uncertainties using ozonesonde data.

3.7 On using the Markov Chain Monte Carlo method

The random uncertainties reported by retrieval algorithms are usually estimated via propagation of instrumental noise and other random uncertainties through the inversion algorithm. Markov chain Monte Carlo (MCMC) method can be used to produce a robust estimate of the probability distribution of a retrieved quantity that is nonlinearly related to the measurements and that has non-Gaussian error statistics. A methodology for validating the traditional error characterization by applying the MCMC technique can be found in e.g. Tamminen (2004). This paper shows the application of MCMC method to GOMOS data. The MCMC technique is suitable for studying uncertainties of retrieved parameters and it enables analyzing the error structure also in a nonlinear case (and thus validating the standard Gaussian characterization). The advantage of the sampling based MCMC method is also that it allows implementing non-Gaussian measurement and modelling error characterization as well as using non-Gaussian prior information. While the MCMC method cannot provide information about missing or overestimated uncertainties directly, the method is often implemented so that unknown uncertainties are parametrized and included, e.g., via hierarchical formulation, allowing these uncertainties to be taken into account. It would be very useful to compare such approaches with the ones presented in this overview paper, in the future.

4 Summary and discussion

In this paper, we presented methods for random uncertainty validation and illustrated their application using ozone profiles retrieved from measurements by satellite instruments in the limb-viewing geometry. These methods considered in this study rely on deriving a posteriori (ex-post) random uncertainties using statistical analyses of collocated data samples. Advantages and limitations of each method are discussed, as well as accuracy of ex-post random uncertainty estimates.

As a general requirement for all methods, the data samples should be selected in regions of small and slowly changing natural variability. Otherwise, if the natural variability exceeds significantly random uncertainties, this prevents the computation of reliable ex-post estimates of random uncertainties. The methods for random uncertainty validation are divided into two groups depending on what kind of data are used: (1) from the same instrument and (2) from different instruments.

Practical examples of validation of random uncertainty with the discussion of advantages and limitations of each method are provided in this study. It is shown that, for instruments with dense sampling, such as MIPAS and MLS, several methods can be applied, for example those based on self-collocations or collocations with other datasets. For datasets that are obtained with tomographic retrievals, the Fioletov's, von Clarmann's and differential methods can be applied. For instruments with coarse sampling, such as GOMOS, ACE-FTS or SAGE II-III, the differential method is the most appropriate. It has been shown previously and also confirmed in this study that simulations with high-quality and high-resolution chemistry-transport models are useful in validation of reported random uncertainties: the model simulations can be used for estimation of small-scale natural variability.

The methods presented in this overview can be also applied to other measurements. In particular, the structure function method has been already successfully applied to total column measurements by TROPOMI in Sofieva et al. (2021). All methods can be applied also to data with coarse vertical resolution, such as profiles retrieved from nadir-looking instruments. For the application of the methods based on the statistics of differences, the profiles should have a compatible vertical resolution. This might require prior application of harmonization (see Keppens et al., 2019 for details). Then the validation of random uncertainties can be performed at the vertical scales corresponding to harmonized profiles.

Data availability

The ozone data from limb instruments used in this paper are available in HARMOZ_ALT format (Sofieva et al., 2013) at https://climate.esa.int/en/projects/ozone/data, last access: 27 February 2026.

Author contributions

This review paper is the result of numerous discussions during the APARC and ISSI TUNER activities. VFS wrote the majority of the paper and prepared the majority of the illustrations. AL contributed significantly to the manuscript writing and preparation of illustrations. TvC, GS, MK, JT, AR, CA, NL, RD, PS, KAW, DD, DZ, NAK and AK provided the data and contributed to the analyses and writing of the paper.

Competing interests

At least one of the (co-)authors is a member of the editorial board of Atmospheric Measurement Techniques. The peer-review process was guided by an independent editor, and the authors also have no other competing interests to declare.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

We acknowledge the scientific guidance (and sponsorship) of the World Climate Research Programme to motivate this work, coordinated in the framework of APARC TUNER activity. The International Space Science Institute (ISSI) has funded two International Team meetings in Berne at their venue. The authors thank the Canadian Space Agency and the Swedish National Space Board for their long-term support of the Odin mission and the OSIRIS instrument. The work at the University of Bremen was funded in parts by ESA and University and State of Bremen. We gratefully acknowledge the computing time granted by the Resource Allocation Board and provided for the supercomputers Lise and Emmy at NHR@ZIB and NHR@Göttingen as part of the NHR infrastructure. Work at the Jet Propulsion Laboratory, California Institute of Technology, was performed under contract with the National Aeronautics and Space Administration (80NM0018D0004).

Financial support

This research has been supported by European Space Agency (projects Ozone_cci and VACUUM), the Research Council of Finland (Flagship of Advanced Mathematics for Sensing Imaging and Modelling grant 359196), and International Space Science Institute (ISSI).

Review statement

This paper was edited by Diego Loyola and reviewed by two anonymous referees.

References

Arosio, C., Rozanov, A., Gorshelev, V., Laeng, A., and Burrows, J. P.: Assessment of the error budget for stratospheric ozone profiles retrieved from OMPS limb scatter measurements, Atmos. Meas. Tech., 15, 5949–5967, https://doi.org/10.5194/amt-15-5949-2022, 2022.

Bertaux, J. L., Kyrölä, E., Fussen, D., Hauchecorne, A., Dalaudier, F., Sofieva, V., Tamminen, J., Vanhellemont, F., Fanton d'Andon, O., Barrot, G., Mangin, A., Blanot, L., Lebrun, J. C., Pérot, K., Fehr, T., Saavedra, L., Leppelmeier, G. W., and Fraisse, R.: Global ozone monitoring by occultation of stars: an overview of GOMOS measurements on ENVISAT, Atmos. Chem. Phys., 10, 12091–12148, https://doi.org/10.5194/acp-10-12091-2010, 2010.

Bevington, P. R. and Robinson, D. K.: Data reduction and error analysis for the physical sciences, 3. ed., McGraw-Hill, Boston, Mass., 320 pp., 2003.

Boone, C. D., Nassar, R., Walker, K. A., Rochon, Y., McLeod, S. D., Rinsland, C. P., and Bernath, P. F.: Retrievals for the atmospheric chemistry experiment Fourier-transform spectrometer, Appl. Opt., 44, 7218–7231, 2005.

Bourassa, A. E., McLinden, C. A., Bathgate, A. F., Elash, B. J., and Degenstein, D. A.: Precision estimate for Odin-OSIRIS limb scatter retrievals, J. Geophys. Res., 117, D04303, https://doi.org/10.1029/2011JD016976, 2012.

Bourassa, A. E., Roth, C. Z., Zawada, D. J., Rieger, L. A., McLinden, C. A., and Degenstein, D. A.: Drift-corrected Odin-OSIRIS ozone product: algorithm and updated stratospheric ozone trends, Atmos. Meas. Tech., 11, 489–498, https://doi.org/10.5194/amt-11-489-2018, 2018.

Cressie, N. C. A.: Statistics for Spatial Data, Wiley Series in Probability and Statistics, https://doi.org/10.1002/9781119115151, 1993.

Damadeo, R. P., Zawodny, J. M., Thomason, L. W., and Iyer, N.: SAGE version 7.0 algorithm: application to SAGE II, Atmos. Meas. Tech., 6, 3539–3561, https://doi.org/10.5194/amt-6-3539-2013, 2013.

Fioletov, V. E., Tarasick, D. W., and Petropavlovskikh, I.: Estimating ozone variability and instrument uncertainties from SBUV(/2), ozonesonde, Umkehr, and SAGE II measurements: Short-term variations, J. Geophys. Res., 111, D02305, https://doi.org/10.1029/2005JD006340, 2006.

Hubert, D., Heue, K.-P., Lambert, J.-C., Verhoelst, T., Allaart, M., Compernolle, S., Cullis, P. D., Dehn, A., Félix, C., Johnson, B. J., Keppens, A., Kollonige, D. E., Lerot, C., Loyola, D., Maata, M., Mitro, S., Mohamad, M., Piters, A., Romahn, F., Selkirk, H. B., da Silva, F. R., Stauffer, R. M., Thompson, A. M., Veefkind, J. P., Vömel, H., Witte, J. C., and Zehner, C.: TROPOMI tropospheric ozone column data: geophysical assessment and comparison to ozonesondes, GOME-2B and OMI, Atmos. Meas. Tech., 14, 7405–7433, https://doi.org/10.5194/amt-14-7405-2021, 2021.

Jia, J., Rozanov, A., Ladstätter-Weißenmayer, A., and Burrows, J. P.: Global validation of SCIAMACHY limb ozone data (versions 2.9 and 3.0, IUP Bremen) using ozonesonde measurements, Atmos. Meas. Tech., 8, 3369–3383, https://doi.org/10.5194/amt-8-3369-2015, 2015.

Keppens, A., Compernolle, S., Verhoelst, T., Hubert, D., and Lambert, J.-C.: Harmonization and comparison of vertically resolved atmospheric state observations: methods, effects, and uncertainty budget, Atmos. Meas. Tech., 12, 4379–4391, https://doi.org/10.5194/amt-12-4379-2019, 2019.

Kiefer, M., von Clarmann, T., Funke, B., García-Comas, M., Glatthor, N., Grabowski, U., Höpfner, M., Kellmann, S., Laeng, A., Linden, A., López-Puertas, M., and Stiller, G. P.: Version 8 IMK–IAA MIPAS ozone profiles: nominal observation mode, Atmos. Meas. Tech., 16, 1443–1460, https://doi.org/10.5194/amt-16-1443-2023, 2023.

Kramarova, N. A., Xu, P., Mok, J., Bhartia, P., Jaross, G., Moy, L., Chen, Z., Frith, S. M., DeLand, M. T., Kahn, D., Labow, G. J., Li, J., Nyaku, E., Weaver, C., Ziemke, J. R., Davis, S. M., and Jia, Y.: Decade-long ozone profile record from Suomi NPP OMPS Limb Profiler: Assessment of version 2.6 data. Earth and Space Science, 11, e2024EA003707, https://doi.org/10.1029/2024EA003707, 2024.

Kyrölä, E., Tamminen, J., Sofieva, V., Bertaux, J. L., Hauchecorne, A., Dalaudier, F., Fussen, D., Vanhellemont, F., Fanton d'Andon, O., Barrot, G., Guirlet, M., Mangin, A., Blanot, L., Fehr, T., Saavedra de Miguel, L., and Fraisse, R.: Retrieval of atmospheric parameters from GOMOS data, Atmos. Chem. Phys., 10, 11881–11903, https://doi.org/10.5194/acp-10-11881-2010, 2010.

Laeng, A. and Von Clarmann, T.: VACUUM'R – Final Report, Karlsruher Institut für Technologie (KIT), https://doi.org/10.5445/IR/1000177659, 2021.

Laeng, A., Hubert, D., Verhoelst, T., Von Clarmann, T., Dinelli, B. M., Dudhia, A., Raspollini, P., Stiller, G., Grabowski, U., Keppens, A., Kiefer, M., Sofieva, V., Froidevaux, L., Walker, K. A., Lambert, J.-C., and Zehner, C.: The ozone climate change initiative: Comparison of four Level-2 processors for the Michelson Interferometer for Passive Atmospheric Sounding (MIPAS), Remote Sensing of Environment, 162, 316–343, https://doi.org/10.1016/j.rse.2014.12.013, 2015.

Laeng, A., von Clarmann, T., Errera, Q., Grabowski, U., and Honomichl, S.: Satellite data validation: a parametrization of the natural variability of atmospheric mixing ratios, Atmos. Meas. Tech., 15, 2407–2416, https://doi.org/10.5194/amt-15-2407-2022, 2022.

Livesey, N. J., Van Snyder, W., Read, W. G., and Wagner, P. A.: Retrieval algorithms for the EOS Microwave limb sounder (MLS), IEEE Trans. Geosci. Remote Sensing, 44, 1144–1155, https://doi.org/10.1109/TGRS.2006.872327, 2006.

Livesey, N. J., Read, W. G., Wagner, P. A., Froidevaux, L., Santee, M. L., Schwartz, M. J., Lambert, A., Millan Valle, L. F., Pumphrey, H. C., Manney, G. L., Fuller, R. A., Jarnot, R. F., Knosp, B. W., and Lay, R. R.: Earth Observing System (EOS) Microwave Limb Sounder (MLS) Version 5.0x Level 2 and 3 data quality and description document (5.0-1.1a), NASA Goddard Earth Sciences Data and Information Services Center, https://doi.org/10.5067/AURA/MLS/DOC/V5_DATAQUALITYDOCUMENT, 2022.

Matheron, G.: Principles of geostatistics, Economic Geology, 58, 1246–1266, https://doi.org/10.2113/gsecongeo.58.8.1246, 1963.

Moy, L., Bhartia, P. K., Jaross, G., Loughman, R., Kramarova, N., Chen, Z., Taha, G., Chen, G., and Xu, P.: Altitude registration of limb-scattered radiation, Atmos. Meas. Tech., 10, 167–178, https://doi.org/10.5194/amt-10-167-2017, 2017.

Piccolo, C. and Dudhia, A.: Precision validation of MIPAS-Envisat products, Atmos. Chem. Phys., 7, 1915–1923, https://doi.org/10.5194/acp-7-1915-2007, 2007.

Rahpoe, N., von Savigny, C., Weber, M., Rozanov, A. V., Bovensmann, H., and Burrows, J. P.: Error budget analysis of SCIAMACHY limb ozone profile retrievals using the SCIATRAN model, Atmos. Meas. Tech., 6, 2825–2837, https://doi.org/10.5194/amt-6-2825-2013, 2013.

Rault, D. F. and Loughman, R. P.: The OMPS Limb Profiler Environmental Data Record Algorithm Theoretical Basis Document and Expected Performance, IEEE Trans. Geosci. Remote Sensing, 51, 2505–2527, https://doi.org/10.1109/TGRS.2012.2213093, 2013.

Read, W. G., Shippony, Z., Schwartz, M. J., Livesey, N. J., and Van Snyder, W.: The clear-sky unpolarized forward model for the EOS aura microwave limb sounder (MLS), IEEE Trans. Geosci. Remote Sensing, 44, 1367–1379, https://doi.org/10.1109/TGRS.2006.873233, 2006.

Rodgers, C. D.: Inverse Methods for Atmospheric sounding: Theory and Practice, World Scientific, 2000.

SAGE III Algorithm Theoretical Basis Document (ATBD): Solar and Lunar Algorithm, NASA, https://asdc.larc.nasa.gov/documents/sage3/ATBDs/Solar_and_Lunar_Algorithm.pdf (last access: 2 March 2026), 2002.

Schwartz, M. J., Read, W. G., and Van Snyder, W.: EOS MLS forward model polarized radiative transfer for Zeeman-split oxygen lines, IEEE Trans. Geosci. Remote Sensing, 44, 1182–1191, https://doi.org/10.1109/TGRS.2005.862267, 2006.

Sheese, P. E., Walker, K. A., Boone, C. D., Degenstein, D. A., Kolonjari, F., Plummer, D., Kinnison, D. E., Jöckel, P., and von Clarmann, T.: Model estimations of geophysical variability between satellite measurements of ozone profiles, Atmos. Meas. Tech., 14, 1425–1438, https://doi.org/10.5194/amt-14-1425-2021, 2021.

Sheese, P. E., Walker, K. A., Boone, C. D., Bourassa, A. E., Degenstein, D. A., Froidevaux, L., McElroy, C. T., Murtagh, D., Russell III, J. M., and Zou, J.: Assessment of the quality of ACE-FTS stratospheric ozone data, Atmos. Meas. Tech., 15, 1233–1249, https://doi.org/10.5194/amt-15-1233-2022, 2022.

Sofieva, V. F., Tamminen, J., Haario, H., Kyrölä, E., and Lehtinen, M.: Ozone profile smoothness as a priori information in the inversion of limb measurements, Ann. Geophys., 22, 3411–3420, https://doi.org/10.5194/angeo-22-3411-2004, 2004.

Sofieva, V. F., Vira, J., Kyrölä, E., Tamminen, J., Kan, V., Dalaudier, F., Hauchecorne, A., Bertaux, J.-L., Fussen, D., Vanhellemont, F., Barrot, G., and Fanton d'Andon, O.: Retrievals from GOMOS stellar occultation measurements using characterization of modeling errors, Atmos. Meas. Tech., 3, 1019–1027, https://doi.org/10.5194/amt-3-1019-2010, 2010.

Sofieva, V. F., Rahpoe, N., Tamminen, J., Kyrölä, E., Kalakoski, N., Weber, M., Rozanov, A., von Savigny, C., Laeng, A., von Clarmann, T., Stiller, G., Lossow, S., Degenstein, D., Bourassa, A., Adams, C., Roth, C., Lloyd, N., Bernath, P., Hargreaves, R. J., Urban, J., Murtagh, D., Hauchecorne, A., Dalaudier, F., van Roozendael, M., Kalb, N., and Zehner, C.: Harmonized dataset of ozone profiles from satellite limb and occultation measurements, Earth Syst. Sci. Data, 5, 349–363, https://doi.org/10.5194/essd-5-349-2013, 2013 (data available at: https://climate.esa.int/en/projects/ozone/data last access: 27 February 2026.).

Sofieva, V. F., Tamminen, J., Kyrölä, E., Laeng, A., von Clarmann, T., Dalaudier, F., Hauchecorne, A., Bertaux, J.-L., Barrot, G., Blanot, L., Fussen, D., and Vanhellemont, F.: Validation of GOMOS ozone precision estimates in the stratosphere, Atmos. Meas. Tech., 7, 2147–2158, https://doi.org/10.5194/amt-7-2147-2014, 2014.

Sofieva, V. F., Ialongo, I., Hakkarainen, J., Kyrölä, E., Tamminen, J., Laine, M., Hubert, D., Hauchecorne, A., Dalaudier, F., Bertaux, J.-L., Fussen, D., Blanot, L., Barrot, G., and Dehn, A.: Improved GOMOS/Envisat ozone retrievals in the upper troposphere and the lower stratosphere, Atmos. Meas. Tech., 10, 231–246, https://doi.org/10.5194/amt-10-231-2017, 2017.

Sofieva, V. F., Lee, H. S., Tamminen, J., Lerot, C., Romahn, F., and Loyola, D. G.: A method for random uncertainties validation and probing the natural variability with application to TROPOMI on board Sentinel-5P total ozone measurements, Atmos. Meas. Tech., 14, 2993–3002, https://doi.org/10.5194/amt-14-2993-2021, 2021.

Sofieva, V. F., Hänninen, R., Sofiev, M., Szeląg, M., Lee, H. S., Tamminen, J., and Retscher, C.: Synergy of Using Nadir and Limb Instruments for Tropospheric Ozone Monitoring (SUNLIT), Atmos. Meas. Tech., 15, 3193–3212, https://doi.org/10.5194/amt-15-3193-2022, 2022.

Stoffelen, A.: Toward the true near-surface wind speed: Error modeling and calibration using triple collocation, J. Geophys. Res., 103, 7755–7766, https://doi.org/10.1029/97JC03180, 1998.

Tamminen, J.: Validation of nonlinear inverse algorithms with Markov chain Monte Carlo method, J. Geophys. Res., 109, 2004JD004927, https://doi.org/10.1029/2004JD004927, 2004.

Tamminen, J., Kyrölä, E., Sofieva, V. F., Laine, M., Bertaux, J.-L., Hauchecorne, A., Dalaudier, F., Fussen, D., Vanhellemont, F., Fanton-d'Andon, O., Barrot, G., Mangin, A., Guirlet, M., Blanot, L., Fehr, T., Saavedra de Miguel, L., and Fraisse, R.: GOMOS data characterisation and error estimation, Atmos. Chem. Phys., 10, 9505–9519, https://doi.org/10.5194/acp-10-9505-2010, 2010.

Tarasick, D. W., Smit, H. G. J., Thompson, A. M., Morris, G. A., Witte, J. C., Davies, J., Nakano, T., Van Malderen, R., Stauffer, R. M., Johnson, B. J., Stübi, R., Oltmans, S. J., and Vömel, H.: Improving ECC Ozonesonde Data Quality: Assessment of Current Methods and Outstanding Issues, Earth and Space Science, 8, e2019EA000914, https://doi.org/10.1029/2019EA000914, 2021.

Tatarskii, V. I.: Wave Propagation in a Turbulent Medium, edited by: Silverman, R. A., McGraw-Hill, 285 pp., 1961.

Taylor, J. R.: An introduction to error analysis: the study of uncertainties in physical measurements, 2. ed., University Science Books, Sausalito, Calif, 327 pp., 1997.

Verhoelst, T., Granville, J., Hendrick, F., Köhler, U., Lerot, C., Pommereau, J.-P., Redondas, A., Van Roozendael, M., and Lambert, J.-C.: Metrology of ground-based satellite validation: co-location mismatch and smoothing issues of total ozone comparisons, Atmos. Meas. Tech., 8, 5039–5062, https://doi.org/10.5194/amt-8-5039-2015, 2015.

von Clarmann, T.: Validation of remotely sensed profiles of atmospheric state variables: strategies and terminology, Atmos. Chem. Phys., 6, 4311–4320, https://doi.org/10.5194/acp-6-4311-2006, 2006.

von Clarmann, T., Höpfner, M., Kellmann, S., Linden, A., Chauhan, S., Funke, B., Grabowski, U., Glatthor, N., Kiefer, M., Schieferdecker, T., Stiller, G. P., and Versick, S.: Retrieval of temperature, H₂O, O₃, HNO₃, CH₄, N₂O, ClONO₂ and ClO from MIPAS reduced resolution nominal mode limb emission measurements, Atmos. Meas. Tech., 2, 159–175, https://doi.org/10.5194/amt-2-159-2009, 2009.

von Clarmann, T., Degenstein, D. A., Livesey, N. J., Bender, S., Braverman, A., Butz, A., Compernolle, S., Damadeo, R., Dueck, S., Eriksson, P., Funke, B., Johnson, M. C., Kasai, Y., Keppens, A., Kleinert, A., Kramarova, N. A., Laeng, A., Langerock, B., Payne, V. H., Rozanov, A., Sato, T. O., Schneider, M., Sheese, P., Sofieva, V., Stiller, G. P., von Savigny, C., and Zawada, D.: Overview: Estimating and reporting uncertainties in remotely sensed atmospheric composition and temperature, Atmos. Meas. Tech., 13, 4393–4436, https://doi.org/10.5194/amt-13-4393-2020, 2020.

von Clarmann, T., Glatthor, N., Grabowski, U., Funke, B., Kiefer, M., Kleinert, A., Stiller, G. P., Linden, A., and Kellmann, S.: TUNER-compliant error estimation for MIPAS: methodology, Atmos. Meas. Tech., 15, 6991–7018, https://doi.org/10.5194/amt-15-6991-2022, 2022.

Wackernagel, H.: Multivariate Geostatistics, Springer, 2003.

Wang, H. J. R., Damadeo, R., Flittner, D., Kramarova, N., Taha, G., Davis, S., Thompson, A. M., Strahan, S., Wang, Y., Froidevaux, L., Degenstein, D., Bourassa, A., Steinbrecht, W., Walker, K. A., Querel, R., Leblanc, T., Godin-Beekmann, S., Hurst, D., and Hall, E.: Validation of SAGE III/ISS Solar Occultation Ozone Products With Correlative Satellite and Ground-Based Measurements, Journal of Geophysical Research: Atmospheres, 125, e2020JD032430, https://doi.org/10.1029/2020JD032430, 2020.

Zawada, D. J., Rieger, L. A., Bourassa, A. E., and Degenstein, D. A.: Tomographic retrievals of ozone with the OMPS Limb Profiler: algorithm description and preliminary results, Atmos. Meas. Tech., 11, 2375–2393, https://doi.org/10.5194/amt-11-2375-2018, 2018.

Articles

Short summary