the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.

In-flight estimation of instrument spectral response functions using sparse representations
Jihanne El Haouari
Jean-Michel Gaucel
Christelle Pittet
Jean-Yves Tourneret
Herwig Wendt
High-resolution spectrometers are composed of different optical elements and detectors that must be modeled as accurately as possible. Specifically, accurate estimates of instrument spectral response functions (ISRFs) are critical in order not to compromise the retrieval of trace gas concentrations from spectral measurements. Currently, parametric models are used to estimate these response functions. However, these models cannot always take into account the diversity of ISRF shapes that are encountered in practical applications. This paper studies a new ISRF estimation method based on a sparse representation of the ISRF in a dictionary. The proposed method is shown to be very competitive when compared to parametric models, yielding up to 1 order of magnitude smaller normalized ISRF estimation errors. The method is applied to different high-resolution spectrometers, demonstrating its reproducibility for multiple remote sensing missions.
- Article
(7261 KB) - Full-text XML
-
Supplement
(769 KB) - BibTeX
- EndNote
Space remote sensing makes it possible to remotely measure the composition of the atmosphere or the troposphere and to retrieve trace gas concentrations. It can also be used to monitor molecule fluxes at the Earth's surface, as is the case for the MicroCarb mission that is designed to monitor CO2 fluxes (Cansot et al., 2022) in order to provide a better understanding of the carbon cycle, which is important in the context of climate change. This can be done by analyzing the interaction of the atmosphere with natural radiation, such as sunlight, or artificial radiation, generated for example by a laser. Indeed, the presence of some molecules in the path of radiation modifies its spectral content at the characteristic wavelengths of the different elements. The information directly obtained from satellites is the atmospheric spectrum. By considering some specific wavelengths of interest, it is possible to determine the concentration of the desired trace gases in a column of atmosphere by comparing these measured spectra with a reference spectrum obtained using a radiative transfer model.
The instruments used for gas concentration estimation are high-resolution spectrometers. Spectrometers consist mainly of an optical part (for example composed of a slit, a telescope and dispersive grating) and a detector. In this configuration, the telescope projects the image of the Earth onto the spectrometer slit and then onto the detector. Each pixel of the detector is associated with a spatial direction (called ACT for ACross Track) and a specific wavelength. A binning and an averaging along the ACT axis are performed in order to improve the signal to noise ratio (SNR). For each of the two parts (optical part and detector), a response function is defined, which leads to a continuous optical function and another function associated with each pixel of the detector. This results in a global response function associated with each pixel along the spectral axis, known as the instrument spectral response function (ISRF), associated with a specific wavelength. The ISRFs can vary significantly depending on the instrument considered, and their shapes depend on the central wavelength, among other factors. The estimation of trace gas concentrations is an inversion process that is performed on the ground from spectrometer measurements and the instrument ISRFs. The accuracy of this estimation highly depends on the knowledge of these ISRFs for all pixels. For some missions, ISRFs are expected to be known with a normalized error of less than 1 %, which represents a significant challenge given that the variations in ISRF shape across the entire band frequently exceed this threshold.
Spectrometers are first calibrated on the ground where their associated ISRFs are estimated experimentally. However, the ISRFs are subject to in-flight changes due to mechanical movements associated with the launch of the instruments, thermal changes in orbit, or certain sensitivities linked to the instrument itself (such as the MicroCarb's sensitivity to the scene). As a consequence, these ISRFs need to be re-estimated regularly in flight throughout the mission. The principle of the estimation is to take a measurement of a spectrally known scene and to compare it with a spectral model of the scene convolved with the ISRFs at different wavelengths. Parametric models have been widely used in the literature to estimate ISRFs. Gaussian and generalized Gaussian parametric models (referred to as “Gauss” and “super-Gauss”) were proposed in Beirle et al. (2017). Parametric models are attractive for their simplicity and small number of parameters. However, they are not flexible enough to represent the diversity of ISRF shapes adequately. The ISRF estimation problem and the most important parametric models that have been considered in the literature are detailed in Sect. 2.
The objective of this work is to overcome the limitations of the existing parametric ISRF estimation methods caused by their insufficient accuracy. To this end, we propose as a first major contribution a new estimation strategy based on sparse representations of the ISRFs in a dictionary of well-chosen atoms. More precisely, the ISRFs are decomposed in a dictionary that is constructed using several ISRFs that are available from ground characterization for each instrument. The dictionary can also be updated iteratively online. For each instrument, each ISRF is then approximated by a linear combination of a small number of atoms of the dictionary associated with the instrument. The proposed approach is detailed in Sect. 3. We investigate and compare two different methods for obtaining the sparse representations of ISRFs.
As a second contribution, we conduct an extensive numerical study of the proposed ISRF estimation approach and compare it to parametric methods for datasets from several different spectrometers used in space missions, whose characteristics are detailed in Sect. 4. The main focus is on the MicroCarb instrument (Cansot et al., 2022), which is dedicated to studying atmospheric carbon dioxide and oxygen, with the objective of determining their concentrations at the Earth's surface. Additional results showing the applicability of the proposed methodology to other spectrometers are reported for the Orbiting Carbon Observatory 2 (OCO-2) spectrometer (Lee et al., 2017) and complemented by results for several other spectrometers that are reported in the Supplement.
Numerical results are reported in Sect. 5 and lead to the conclusion that the proposed method yields significantly improved flexibility and accuracy for ISRF estimation when compared to previous state-of-the-art parametric methods, consistently through the different datasets and scenarios, with a small number of parameters that can easily and efficiently be estimated in real time. Moreover, the method is shown to be robust with respect to design choices, the noise corrupting the observed measurements, the ISRF changes depending on the scene, or the possible mismatches on the prior knowledge on the ISRFs or reference spectra.
2.1 ISRF estimation model
The ISRF, which is sometimes referred to as the instrument line shape (ILS) (Sun et al., 2017b) or slit function (Sun et al., 2017a), is a function that describes the response of an instrument to a given wavelength. In this work, we only consider the spectral information, and thus each “pixel” l is associated with a specific wavelength λl, yielding an ISRF at this wavelength.1 The in-flight identification of ISRFs is obtained from scenes that are assumed to be perfectly known radiometrically and spectrally (such as the Sun, the Moon, uniform scenes such as a desert, etc.), which are referred to as reference spectra. The principle of ISRF estimation is to determine the in-flight ISRFs for each wavelength λl that minimize some similarity measure between the measured spectrum s(λl) and the reference spectrum r(λ) convolved with the ISRF denoted as Il(λl):
where * denotes convolution, and Nλ is the number of central wavelengths λl, each associated with one ISRF Il. For practical purposes, this equation can be discretized, leading to
where Δ is the sampling period between two consecutive points of the ISRF, which is assumed to be regularly sampled. In other words, a vector needs to be estimated for each ISRF, corresponding to the values that it takes on the wavelength grid at which the ISRFs are sampled. . The objective of the ISRF estimation problem is to solve the inverse problem (Eq. 2) assuming knowledge of both the reference spectrum r(λ) and the measurements s(λl).
A major difficulty with the inverse problem (Eq. 2) is that there is only one measurement per fixed wavelength λl, which makes it impossible to estimate the vector Il without further assumptions. Two approaches can be used to make this estimation problem identifiable.2 The first idea is to consider knowledge of several reference spectra for every wavelength. The problem is that this would require not only a sufficient number of calibration scenes to be available, but also that they substantially differ for each wavelength in order to provide complementary information on the shapes of the ISRFs. The second method, which is considered in this paper, has the advantage that it makes use of only one reference spectrum and is based on the assumption that the ISRFs for adjacent wavelengths λl are similar; i.e., they exhibit slight variations along the spectral axis between λl and λl+1. It is expected that the average of the normalized absolute error between the ISRFs in a window of Nobs+1 observations and the central ISRF at wavelength λl is below a given criterion for the ISRF estimation error. Note that the larger this variation, the more important the discrepancies in ISRF shapes. The small-variation assumption is not valid for the whole set of wavelengths, and the size of the sliding window must be adjusted in order to solve the ISRF estimation problem. This is a reasonable assumption for the ISRFs of real-world spectrometers. To estimate the ISRF at wavelength λl, we propose considering a vector of Nobs+1 observations, including those from the neighboring ISRFs as well. Rewritten in matrix form, Eq. 2 simplifies to
where contains the values of the reference spectrum covered by the different ISRFs in the neighborhood (see algorithm in Appendix A1). Given a model for the ISRF, estimating Il can then be conducted for each wavelength λl by minimizing the residual error .
2.2 Parametric models
It is difficult to analytically construct accurate forward models with a small number of parameters for ISRFs because they would need to incorporate a significant number of “contributors” associated with the instrument optics (slit, mirror, lens, separator, dispersing element), the detector or the acquisition mode. The state of the art therefore considers simple parametric models. A classical way to model and estimate the ISRF at wavelength λl is to use a parametric Gaussian model defined by
where is the unknown vector of parameters to be estimated.
An alternative ISRF model was studied (Beirle et al., 2017) using a generalized Gaussian distribution referred to as super-Gaussian in order to better fit the ISRF shapes:
where is the unknown parameter vector to be estimated. This model reduces to the Gaussian model when and kSG=2. The parameters wSG and kSG are the scale and shape parameters of the distribution, allowing more or less flat shapes to be modeled.
When using the parametric models (Eq. 3) and (Eq. 4), the ISRF estimation problem consists of estimating the unknown model parameters for each sliding window. This estimation can be performed using the least squares method, which minimizes the following cost function:
where is the unknown parameter vector and .
Simple parametric models, such as Gaussian or generalized Gaussian models, are attractive for their simplicity and small number of parameters, yet can struggle to take into account the variety of different ISRF shapes that can be observed in practice. An illustration is provided in Fig. 1, which shows examples of ISRFs for the MicroCarb mission. Clearly, these ISRFs cannot be accurately modeled by bell-shaped Gaussian distributions or by generalized Gaussians (because of the dip at the center, for example). This motivates the study of a new estimation method for ISRFs.
This paper investigates the use of sparse representations for ISRFs in a dictionary of well-chosen atoms. Models based on sparse approximations and on dictionary learning have been widely and successfully used for different signal and image processing applications (Zhang et al., 2015). These applications include image denoising, image classification, image reconstruction, compressed sensing or dimensionality reduction and involve large varieties of signals and images (Figueiredo et al., 2007; Tošić and Frossard, 2011). However, sparse representations have never been investigated for ISRF estimation, which is precisely the objective of this work.
3.1 Construction of the dictionary
Sparse representations express a given signal as a linear combination of a small number of signals that belong to a collection of reference patterns, or atoms, which is called a dictionary. This paper proposes decomposing the ISRF in a dictionary of atoms :
where is the γkth selected atom, i.e., the γkth column of the dictionary Φ, and αl,k is the corresponding non-zero coefficient of the sparse vector . The dictionary is built in such a way that linear combinations of a small number of its atoms (i.e., its columns) provide an efficient representation of the ISRF. Different methods allowing the dictionary to be built have been proposed in the literature. These methods are based on probabilistic learning, clustering, vector quantization or Bayesian inference (Tošić and Frossard, 2011). Dictionary learning usually involves a two-stage optimization structure, consisting first of a sparse-coding step to find the sparse vector αl, which minimizes the objective function for a fixed dictionary Φ, and then a dictionary update step, where the dictionary is estimated given a fixed sparse vector αl. Depending on the application, the dictionary can be updated using a closed-form solution, gradient descent or ground truth data. In this work we investigate two different ways of building the dictionary Φ. The first method constructs Φ by using the ND singular vectors associated with the largest singular values of the singular value decomposition (SVD) of a matrix composed of representative ISRF examples, as described in the algorithm of Appendix A2. The second method uses the K-SVD algorithm of Aharon et al. (2006), which belongs to the state of the art and is recalled in the algorithm of Appendix A5. The K-SVD algorithm is a generalization of the K-means algorithm in which the dictionary is updated by changing its columns separately and sequentially and applying K SVDs on an appropriate error matrix. Figure 2 displays the first atoms of dictionaries constructed using these two methods for band B1 of MicroCarb. These dictionaries are found to be similar, especially the first two atoms that correspond to the most energetic singular values. The first two atoms can be interpreted as the approximate average of all ISRFs used to build the dictionary (first atom) and a correction for adjusting the different widths of the ISRFs for different wavelengths (second atom), as seen in Fig. 1. The higher-order atoms obtained with SVD and K-SVD are slightly different but have similar shapes overall.
3.2 Inverse problem
Assuming that the ISRF can be decomposed in the dictionary Φ as in Eq. (6), the measured spectrum can be written as follows:
Thus, the ISRF estimation problem reduces to finding the sparse vector αl that minimizes the residual . This sparse-coding problem has been mathematically formulated in different ways (Zhang et al., 2015). One can use the l0 pseudo-norm regularization with a penalty parameter μ, leading to the following problem:
This problem is non-convex and NP hard, and many approximations and heuristics have been proposed in the literature to find an approximate solution. A standard method consists of using greedy algorithms such as the orthogonal matching pursuit (OMP). OMP is a modification of the matching pursuit (MP) algorithm, which improves convergence by adding an orthogonalization step (Mallat and Zhang, 1993; Pati et al., 1993). The atoms of the dictionary that minimize the data fidelity term are iteratively determined by minimizing the remaining residual error. The OMP algorithm is summarized in the algorithm of Appendix A3. Another method replaces the pseudo-norm l0 in Eq. (7) by the l1 norm, which leads to a convex problem known as the LASSO problem (Tan et al., 2015):
and the related algorithms studied in, e.g., Figueiredo et al. (2007) and Kim et al. (2007).
The OMP and LASSO algorithms provide a highly flexible decomposition of the ISRF, as the choice of the dictionary is not constrained to a specific form. Indeed, the basis functions can be learned, for example by using the K-SVD algorithm in conjunction with various matching pursuit algorithms. Another advantage of these methods is that they do not necessitate any prior assumption on the shape of the ISRFs (such as Gaussian ISRFs) and estimate them in a non-parametric way.
In the following, this paper compares the use of fixed dictionaries obtained by a single SVD and dictionaries estimated by K-SVD (alternation between SVD to update the dictionary and OMP to update the sparse code). The proposed approach using OMP and LASSO (or other sparse formulations) and either fixed or re-estimated dictionaries is referred to as SPIRIT, meaning SParse representation of Instrument spectral Response Functions using a dIcTionary.
The spectrometers used in this study are passive push-broom spectrometers, mainly hyperspectral dispersive spectrometers, such as the MicroCarb high-resolution spectrometer and the OCO-2 instrument. 3
4.1 Synthetic data generation
Reference spectra used in this study were generated using Automatized Atmospheric Absorption Atlas OPerational (4A/OP) software (NOVELTIS et al., 2012). This software is based on a fast and accurate line-by-line transfer model that can be integrated in operational processing chains including inverse problem processing (Armante et al., 2013). It was selected as the official radiative model and reference code by the Centre National d'Etudes Spatiales (CNES) for the MicroCarb mission. The profiles originate from the Thermodynamic Initial Guess Retrieval (TIGR) database, which is hosted by Aeris data. Data are available at https://www.aeris-data.fr/en/projects/thermodynamical-initial-guess-retrieval-tigr/ (last access: 31 March 2025). An example of a profile was selected from this database for the generation of a reference spectrum. The measured spectra were then obtained by convolving the reference spectrum with the ISRFs (normalized to area 1 for each instrument; see details in the next paragraphs) and embedded in additive Gaussian noise to generate representative measurements. The advantage of this data generation method is to provide ground truth ISRFs, which can be used to assess the performance of the different methods in a controlled scenario.
4.2 MicroCarb mission
MicroCarb is a mission developed by the CNES, whose aim is to ensure continuity with other carbon measuring missions such as OCO-2 and GOSAT, in order to monitor CO2 fluxes at the Earth surface and determine CO2 atmospheric concentrations. The MicroCarb mission uses a compact and low-cost space instrument that is smaller than current spectrometers. The instrument is capable of acquiring four spectral bands with a single detector. The first band, B1 (758.3–768.3 nm), is an O2 band with a spectral resolution of about 0.01 nm. Bands B2 (1596.7–1618.9 nm) and B3 (2023–2051 nm), with respective spectral resolutions of about 0.02 and 0.03 nm, are sensitive to the concentration of CO2 and have CO2 absorption lines. The last band, B4 (1264–1282.2 nm), is a second O2 band with a spectral resolution of about 0.02 nm. The wavelengths associated with this last band are closer to the CO2 wavelength and can be used for validation of space-based greenhouse gas observation (Bertaux et al., 2020). The whole dataset has been delivered by the French Space Agency (CNES, Toulouse) containing 1024 ISRFs associated with 1024 spectral measurements for the different bands. The data used for this experiment are the first band of MicroCarb with Nλ=1024 ISRFs and a sample size of N=895. The design of the MicroCarb instrument, obtained from Castelnau et al. (2019), is displayed in Fig. 3. More details about MicroCarb can be found on the CNES website (see https://microcarb.cnes.fr/en, last access: 11 December 2024). A particularity of this mission is that the shapes of the ISRFs are strongly dependent on the scene observed by the instrument, which is discussed in Sect. 5.3.3.

Figure 3Principle design of the MicroCarb instrument reproduced from Castelnau et al. (2019).
5.1 Numerical experiments and performance evaluation
The performance of the different ISRF estimation methods is evaluated in terms of ISRF estimation quality and the residual between the spectral measurements and their estimates. The quality of ISRF estimation can be quantified by the normalized absolute error between the ISRF and its estimate:
Note that for the instruments studied here, the ISRFs are assumed to be normalized to the unit area. The residual between the spectral measurements and their estimates is defined for each λl by
and summarized for an entire band in terms of the average residual:
In the MicroCarb mission, the ISRFs are considered to be well estimated when their normalized errors satisfy El<1 % for each wavelength. The performance of 1 % on the ISRF knowledge is an objective of the MicroCarb mission in order to provide an accurate determination of CO2 concentrations. The 1 % requirement accounts for uncertainty, acquisition noise of ISRFs and interpolation and is used as a target in this work. The proposed SPIRIT method is compared to the parametric methods based on Gaussian and super-Gaussian models. The parameters of these models are estimated using the non-linear least squares algorithm based on the Nelder–Mead optimization algorithm (Lagarias et al., 1998) (MATLAB function fminsearch). This iterative algorithm requires an initialisation and a stopping criterion. For the initialization of the Gaussian model, the mean was set to the sample mean of the ISRFs, the full width at half maximum (FWHM) was used for the standard deviation and the amplitude was initialized as . For the super-Gaussian model, the initialization was defined as , , and , where Γ is the gamma function. The algorithm was stopped after a maximum number of iterations equal to 20 000. The dictionary used by SPIRIT was constructed using an SVD of a collection of approximately 10 % of the total number of ISRFs within the band of interest or estimated using the K-SVD algorithm initialized with this collection. In our experiments, we used ND=25. Two different sparse-coding methods based on LASSO (Tibshirani, 1996) and OMP are investigated after dictionary construction. The first method uses a MATLAB implementation of LASSO with a parameter μ>0 adjusted to obtain a desired number of atoms. The non-zero coefficients obtained with LASSO were re-estimated in order to reduce the shrinking bias inherent to this method (Zhang and Huang, 2008). The implementations of the OMP and LASSO algorithms are summarized in Appendixes A3 and A4.
5.2 ISRF estimation performance
5.2.1 ISRF estimation for the MicroCarb mission
An example of an ISRF simulated for the MicroCarb mission and the estimates obtained with the different methods is displayed in Fig. 4. The results clearly illustrate the advantage of using SPIRIT for ISRF estimation, which leads to normalized estimation errors of less than 1 %, significantly below those obtained using the parametric estimation methods. A comparison between the different sparse approximations (OMP, LASSO) and dictionaries (SVD, K-SVD) that can be used by SPIRIT shows that OMP works better than LASSO for this example. Moreover, using the K-SVD algorithm does not significantly improve the results with respect to SVD, although it has significantly higher computational complexity.

Figure 4Example of a simulated ISRF for the MicroCarb mission and its estimates using parametric methods and SPIRIT.
The spectral measurements displayed in Fig. 5 were simulated by the CNES for the B1 wavelength range (758.4–768.9 nm). The results show that, for the MicroCarb spectrometer, the use of the super-Gauss parameterization reduces the residual error and ISRF approximation errors compared with the Gaussian model. SPIRIT yields significantly better results, with ISRF approximation errors below 1 % and of the order of 0.1 % for certain wavelengths. LASSO leads to overall less accurate approximations of the ISRFs than OMP, at a significantly higher computational cost, and the use of OMP is overall and consistently beneficial.

Figure 5Illustrations of (a) the measured spectrum reconstruction, (b) the difference between the measured spectrum and the reconstructed spectra, (c) the residuals ρl for each wavelengths, (d) the ISRF approximation error versus the wavelength, and (e) the mean ISRF approximation error versus the number of selected atoms for different methods (Gauss, super-Gauss, OMP, LASSO, SVD and K-SVD) and for band B1 of the MicroCarb instrument.
Sum of two generalized Gaussians
ISRFs can also be modeled using other parametric models, such as the sum of two generalized Gaussians with different shifted center wavelengths, although this has not yet been reported in the literature. As displayed in Fig. 6, this novel parametric approach yields enhanced outcomes as compared to the use of Gaussian and super-Gaussian models. However, the performance is still not competitive with respect to sparse-representation-based methods and necessitates more parameters to estimate. A more detailed study of such more complex parametric models is left for future work.
5.2.2 ISRF estimation for the Orbiting Carbon Observatory 2 (OCO-2) spectrometer
This section studies the applicability of the proposed method to the Orbiting Carbon Observatory 2 (OCO-2) spectrometer. The OCO-2 spectrometer is used in a NASA Earth observing satellite mission that was launched in July 2014. This mission is dedicated to the study of atmospheric carbon dioxide and oxygen and aims at characterizing the global CO2 seasonal cycles and to quantify the sources and sinks of carbon. OCO-2 is composed of three high-spectral-resolution imaging spectrometers for narrow spectral ranges. The characterization of ISRFs for this spectrometer is highly challenging and crucial due to this high spectral resolution. The ISRFs are measured for each pixel using a tunable diode laser during pre-flight calibration (Lee et al., 2017), and the results are stored in a look-up table. The data used in this article can be downloaded on the NASA data website EarthDATA (OCO-2 Science Team/Gunson and Eldering, 2019) (data available at https://disc.gsfc.nasa.gov/datacollection/OCO2_L1B_Calibration_11r.html, last access: 15 November 2023). The product considered in this study is the OCO-2 Level 1B version 11r for science acquired in March 2023, and the fourth footprint is used. Specification on the data product can be found in Crisp et al. (2021). Some of the ISRFs are declared as invalid due to radiometric, spatial, spectral or polarization problems (and are thus not considered for ISRF estimation). The ISRFs associated with bad pixels have not been considered in our experiments, resulting in a number of ISRFs lower than the number of pixels. To identify the ISRFs at the missing nominal wavelengths λl, a linear interpolation between two specified nominal wavelengths λa and λb with known ISRFs was employed. The resulting interpolated ISRF is defined by
Note that the number Nλ of wavelengths after interpolation may differ from the number of pixels of the instrument, which occurs if the ISRFs associated with the first and/or last pixels are missing. The ISRFs used for the experiments come from the O2 A band of OCO-2 with Nλ=859 ISRFs and a sample size N=895. Figure 7 displays an example of an ISRF from the OCO-2 dataset. A visual comparison with Fig. 5 shows that the ISRF shapes can differ significantly depending on the considered wavelength and the instrument. This observation suggests that the dictionary must be adapted to the spectrometer. Another interesting observation is that although the super-Gaussian distribution should theoretically always provide a better fit than the Gaussian distribution, it is not systematically the case in practice because of convergence issues for the iterative methods used to solve the non-linear least squares problem for parameter estimation. Specifically, the model parameters are estimated using a simplex-based optimization method (MATLAB function fminsearch) that aims at minimizing the residuals between the measured and estimated spectra, which does not always converge to a better solution for the super-Gaussian model than for the Gaussian model.

Figure 7Example of an ISRF retrieved for the OCO-2 mission and its estimates using parametric methods and SPIRIT.
Figure 8 displays performance results for the OCO-2 measurements obtained using the data for the O2 band (757–772 nm). The measured spectrum is reconstructed with the proposed sparse-representation methods for K=5 atoms chosen using a dictionary constructed using SVD or K-SVD. The results indicate that the super-Gaussian model delivers slightly better results than the Gaussian model in terms of residual error and mean ISRF approximation error. However, for the smaller wavelengths of the band, the ISRF approximation errors are slightly larger with the super-Gaussian model, as already observed in Fig. 7 for a single ISRF. Both parametric models yield close to 10 % ISRF approximation errors. The proposed sparse-representation approach again yields far better ISRF approximations and measurement fits, with the best results obtained using OMP and SVD.

Figure 8Illustrations of (a) the measured spectrum reconstruction, (b) the difference between the measured spectrum and the reconstructed spectra, (c) the residuals ρl for each wavelength, (d) the ISRF approximation error versus the wavelength, and (e) the mean ISRF approximation error versus the number of selected atoms using different methods (Gauss, super-Gauss, OMP and LASSO with SVD or K-SVD) for the O2 A band of the OCO-2 instrument.
5.2.3 Conclusions
Overall, the conclusions from these experiments are as follows. First, the super-Gaussian parameterization often yields better performance than the Gaussian one, corroborating the results reported in Beirle et al. (2017). However, the normalized ISRF approximation errors obtained with these parametric methods are consistently larger than 1 %, for both instruments and for all wavelengths. In contrast, the proposed SPIRIT approach based on sparse approximations of ISRFs in a suitable dictionary yields significantly better results. This result is due to the fact that the ISRF shapes depend strongly on the spectrometer and can vary across wavelengths, which cannot be accommodated easily with a simple parametric model. On the contrary, decompositions in appropriate dictionaries that depend on the spectrometer and the chosen wavelength offer sufficient flexibility for all use cases considered in this paper. Regarding the estimation algorithms, SVD overall provides an estimation performance close to K-SVD, and OMP leads to better estimation than LASSO. There is no theoretical reason for OMP to provide better performance than LASSO. However, it is important to note that the OMP and LASSO algorithms address two distinct problems: the OMP algorithm provides an approximate solution to the problem with an ℓ0 penalty, and the LASSO algorithm solves the relaxed problem using an ℓ1 regularization. Certain limitations of the LASSO algorithm have been highlighted in numerous publications, including Tibshirani (1996), and may also be at the origin of our observation. The results overall suggest the use of SVD for building the dictionary and OMP for ISRF estimation.
The proposed methods can also be applied to other instruments, such as Avantes, GOME-2, OMI and TROPOMI used in Beirle et al. (2017). Results obtained with these instruments are available in the Supplement and lead to similar conclusions.
5.3 Robustness analysis and ablation study
5.3.1 Robustness to additive noise
Monte Carlo simulations were conducted to study the robustness of the different ISRF estimation methods to the presence of measurement noise. Independent white Gaussian noise was added to the spectral measurements with several signal to noise ratio (SNR) levels to take into account thermal noise and spatial binning: spatial binning involves the arbitrary division of the imaged area on Earth into distinct fields of view (FOVs) (e.g., three FOVs for MicroCarb). The measured spectrum for each FOV is obtained as an average of the measured spectra within that FOV. 4 Table 1 reports the obtained residual approximation errors and the normalized average ISRF approximation errors for the two instruments MicroCarb and OCO-2. Approximation errors less than <1 % are highlighted in italic. These results show that the proposed sparse representations meet this target for SNRs larger than 20 dB. Moreover, OMP is found to be more robust to noise than LASSO and yields the best results overall. The parametric models again lead to large errors. It is interesting to note that these errors do not vary significantly with the noise level. This indicates that errors due to model misfit are larger than those induced by the noise degradations. To conclude, OMP combined with SVD provides the best results overall for ISRF estimation, also in the presence of additive noise.
5.3.2 Sensitivity to parameter tuning for SPIRIT
The proposed approach requires the choice of a small number of parameters, namely the size of the sliding window Nobs, the size of the dictionary ND and the number of atoms K. The choice of K has been studied above, and the best results were obtained for for both instruments; see Figs. 5 and 8 and the corresponding discussions in Sect. 5.2.2. Here, we further study the impact of Nobs and ND on the ISRF approximation errors. To this end, Figs. 9 and 10 show the approximation errors (in log 10 scale) as a function of Nobs for the Gaussian and super-Gaussian parameterizations and as functions of Nobs and ND for SPIRIT. Results are reported for the two instruments OCO-2 and MicroCarb and averaged for all ISRFs. The ISRF estimation errors decrease as Nobs increases, as expected. However, this decrease is more important for SPIRIT (e.g., for Nobs=80, the mean ISRF errors for Gauss and super-Gauss are equal to 16.27 % and 2.04 %, whereas they are equal to 0.29 % for OMP/SVD, 0.33 % for OMP/K-SVD, 1.23 % for LASSO/SVD and 1.40 % for LASSO/K-SVD), showing the interest of exploiting sparsity for ISRF estimation. The results in Figs. 9 and 10 also indicate that it is beneficial to use dictionaries of modest size, since the ISRF estimation errors increase for large dictionaries (ND≤100 for OMP and ND≤25 for LASSO). Based on this observation, ND=25 was used in all the experiments.

Figure 9Mean approximation errors for OCO-2 and the different estimation methods (Gauss, super-Gauss, OMP and LASSO with SVD or K-SVD) versus the number of observations Nobs and the dictionary size ND for K=5.
5.3.3 Robustness to ISRF changes
The ISRFs considered in the previous sections were obtained from uniform scenes referred to as “ISRF IN” for the MicroCarb mission. However these ISRFs can change depending on the scene observed by the instrument.
ISRFs for non-uniform scenes
The design of the MicroCarb instrument makes the ISRF sensitive to the slit illumination during the integration time. Such dependence on the scene can impact a multitude of instruments. 5 Eight different scenes of the Earth’s surface that are directly observed by the spectrometer's slit and subsequently recorded by the instrument's detector during the integration period are considered and are displayed in Fig. 12. These images were obtained in the ACT direction, and each image was divided along the ACT direction into three equal parts, resulting in three defined FOVs, labeled FOV 1, FOV 2 and FOV 3. The spatial pixels in each FOV are averaged to increase the spectral SNR. This binning and averaging step allows three measured spectra per imaged area to be determined, whose ISRFs have to be estimated. Figure 11 shows ISRFs from uniform scenes (left), randomly selected out of the 1024 ISRFs, and ISRFs from non-uniform scenes (right), randomly selected from the total set of eight scenes and three FOVs, highlighting the differences in ISRF shapes depending on the scene: the ISRFs can be more asymmetric for non-uniform scenes and are thus harder to estimate.6 It is interesting to note that the ISRF of a desert scene is very similar to the ISRF of a uniform scene, contrary to the ISRF of a horizontal coast profile, which makes the slit blinded during one-third of the integration time and leads to an asymmetric left-distorted ISRF, which is harder to estimate.

Figure 11Examples of ISRFs from uniform scenes (ISRF IN – a) and from different non-uniform scenes displayed in Fig. 12 and FOVs (ISRF scene – b) (MicroCarb band B1).
Estimation performance
This section studies the performance of SPIRIT in estimating non-uniform scene ISRFs for the first band (band B1) of the MicroCarb spectrometer. Two cases are considered: estimation using the original dictionary learned by examples of uniform ISRFs (ISRF IN) and estimation after modification of this dictionary to account for the diversity of ISRFs. Specifically, the second dictionary is constructed from a set of 103 ISRF INs (1 out of 10) and 3 ISRF scenes (out of 24). The second dictionary is then composed of ND=25 new atoms obtained by SVD from this collection of representative ISRFs. Results obtained using SPIRIT with OMP are displayed in Fig. 13. In the first case (dictionary learned by uniform ISRFs, Fig. 13 top row), the resulting normalized ISRF errors exceed 1 % for several scenes and FOVs, pointing to the fact that the dictionary is not well adapted for representing ISRFs for non-uniform scenes. The results obtained using the second dictionary are presented in the bottom part of Fig. 13. Using only three additional examples of ISRF scenes in the dictionary again allows ISRF estimation errors to be smaller than 1 %. Note that the lowest approximation errors are obtained in most cases using K=3 to K=6 atoms from the dictionary, as before. To conclude, these results show that the proposed method can easily adapt to more complex ISRF shapes by considering more diverse ISRF examples in the dictionary estimation step.
5.3.4 Robustness to pixel errors
Instrumental errors within a single pixel l can distort the shape of the ISRF of this pixel, leading to the creation of an outlier. This section investigates the impact of such outliers on ISRF estimation. To simulate this scenario, an ISRF from the OCO-2 instrument was inserted in pixel l=500 of band B1 of MicroCarb data, simulating an outlier in this pixel. The initial ISRF of the 500th band of MicroCarb and its new version are displayed in Fig. 14 (see black and red curves respectively). The corresponding estimation results, compared to those from the previous study without outliers, are displayed in Fig. 15. These results demonstrate that the presence of an erroneous ISRF in the sliding window leads to an increase in estimation errors for the windows containing the outlier since the ISRF estimation becomes more challenging. However, the results also indicate that the outlier ISRF could first be identified by inspecting the residuals between the measured spectrum and the ISRF reconstructions and then not be considered for ISRF estimation.

Figure 14Illustration of the generated ISRF (red) at pixel l=500 in the presence of pixel errors as compared to the original ISRF (black) for band B1 of MicroCarb.
5.3.5 Impact of uncertainties about the reference spectra and reference ISRFs
This section analyzes the impact of uncertainties about the ISRFs used to build the dictionary and the reference spectrum on the ISRF estimation performance.
Uncertainties about the ISRFs
To evaluate the impact of uncertainties affecting the ISRFs, Gaussian noise is added to one-third of the ISRFs used to construct the dictionary, with SNR=40 and SNR=60 dB. The noisy ISRFs are then made positive by taking their absolute values and normalized to have a unit area. The results, displayed in the left part of Fig. 16 (using K=4 atoms for the plot in the top row), show that as noise increases, better results are achieved with smaller values of K in the presence of noise with an increase in ISRF estimation errors. However, the estimation is relatively robust to the presence of noise affecting ISRFs used to build the dictionary since approximation errors remain below 1 % on average for both noise levels.
Uncertainties about the reference spectrum
In a second experiment, Gaussian noise is added to the reference spectrum, with SNR=20, SNR=40 and SNR=60 dB. The results are shown in the right part of Fig. 16 (using K=4 atoms for the plot in the top row). Using a reference spectrum corrupted by additive noise clearly has a smaller impact on estimation performance when compared to degradations affecting ISRFs used to build the dictionary. Note that high noise levels (SNR =20 dB) are necessary to significantly increase ISRF estimation errors, probably because of an averaging effect when computing spectral measurement by convolution of the reference spectrum with the ISRF.
Overall, these results indicate that the proposed method is robust to uncertainties in both the ISRFs and the reference spectrum, with ISRF approximation errors remaining below 1 % for realistic SNR levels.
This paper studied a new method for estimating the instrument spectral response functions (ISRFs) of spectrometers. This method is based on a sparse decomposition of the ISRFs into a dictionary of basis functions called atoms. The proposed method can be applied to a large variety of instruments as long as the ISRF estimation problem can be formulated as a linear inverse problem with a sufficient number of measurements (either because the ISRFs do not vary much in a small observation window, in the spectral or spatial domains, or because observations from several reference spectra can be obtained for the same ISRF). The method also requires that a sufficient number and variety of reference ISRFs have been identified and characterized on the ground to construct the dictionary. We recommend using the SVD algorithm to build the dictionary using representative ISRFs and the orthogonal matching pursuit (OMP) algorithm to decompose the ISRFs into this dictionary. The performance of these algorithms is excellent at the price of a very modest computational cost, which suggests its practicality for in-flight scenarios. Another interesting property of the proposed estimation method is that it is not impacted significantly by the shapes of the ISRFs to be estimated, allowing accurate estimations for different types of scenes. Numerical experiments presented in this paper also showed that the ISRFs of the MicroCarb and OCO-2 spectrometers can be estimated with approximation errors smaller than 1 %, which is very promising. Other results available in the Supplement confirm this conclusion for other spectrometers such as Avantes, GOME-2, OMI and TROPOMI.
Future work includes the consideration of radiometric and spectral errors (such as stray light, residual errors of calibration, temporal drifts or spectral shifts) that can degrade the performance of ISRF estimation. These errors are expected to affect some specific wavelengths, which suggests the importance of investigating specific algorithms, jointly correcting errors and estimating the ISRFs. The resulting problem is more challenging since there are non-linear relationships between the spectrometer measurements and these radiometric and spectral errors. Another interesting prospect is to analyze the potential of other methods, e.g., based on Gaussian mixtures or machine learning algorithms, for error correction and ISRF estimation. Finally, it would be interesting to assess the impact of potential uncertainties about the reference spectra and the ISRFs used to build the dictionary more extensively.
Appendix A describes the algorithm used to create the matrix of reference spectra, the OMP algorithm and the K-SVD algorithm. The LASSO algorithm was implemented using the MATLAB function lasso.m. The method used to select the hyperparameter μ is also presented.
A1 Reference spectrum matrix
The algorithm takes as an input the reference spectrum as a vector, the corresponding wavelengths λr, the wavelengths associated with the measured spectrum λ and the wavelengths associated with the ISRF Δ introduced in Sect. 2.
Algorithm A1Generation of the reference spectrum matrix.
A2 Construction of the dictionary
This appendix describes the construction of the dictionary Φ that is used in the sparse-representation-based algorithms K-SVD, LASSO and OMP.
Algorithm A2Construction of the dictionary.
A3 OMP algorithm
Appendix A3 describes the OMP algorithm used to find the sparse representation of the ISRF Il of interest using K non-zero coefficients in the dictionary Φ from the measured spectrum sl and the reference spectrum matrix Rl contained in the sliding window.
Algorithm A3Orthogonal matching pursuit (OMP) algorithm.
A4 LASSO algorithm
The MATLAB function lasso.m is used to find the sparse representation of the ISRF Il in the dictionary Φ using K non-zero coefficients, from the measured spectrum sl and the reference spectrum matrix Rl associated with the sliding window. A dichotomic search is used to obtain the sparsity parameter μ that leads to a given number non-zero coefficient K. The associated algorithm is described in Algorithm A4.
Algorithm A4LASSO algorithm.
A5 K-SVD algorithm
The K-SVD algorithm of Aharon et al. (2006) is described in Algorithm A5. At each step, the dictionary is updated by changing its columns separately and sequentially and applying K singular value decompositions (SVDs) on the appropriate error matrix Ej.
Algorithm A5Construction of the dictionary using the K-SVD algorithm.
The results obtained with the proposed method for the instruments Avantes, GOME-2, OMI and TROPOMI are provided in the Supplement. More details on the data and code used in this study are available upon request from the corresponding author.
The supplement related to this article is available online at https://doi.org/10.5194/amt-18-2573-2025-supplement.
JEH gathered the data for different spectrometers and CP for the MicroCarb spectrometer. JMG and CP contributed to a first formalization of the problem. Mathematical formulation, implementation and formal analysis were conducted by JEH, JYT and HW. All authors contributed to the writing process through discussion and feedback.
The contact author has declared that none of the authors has any competing interests.
Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.
We would like to thank Denis Jouglet from the Department of Atmospheric Sounding at CNES for providing the reference spectra at the different wavelengths. Moreover, we express our gratitude to Steffen Beirle from the Max Planck Institute for Chemistry (MPI-C) for helpful discussions and for providing some ISRF data.
This study was supported by the French Space Agency (CNES), France, and Thales Alenia Space, Cannes, France.
This paper was edited by Mark Weber and reviewed by two anonymous referees.
Aharon, M., Elad, M., and Bruckstein, A.: K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation, IEEE T. Signal Proces., 54, 4311–4322, https://doi.org/10.1109/TSP.2006.881199, 2006. a, b
Armante, R., Scott, N. A., Capelle, V., Chédin, A., Bernard, E., Standfuss, C., Tournier, B., and Pierangelo, C.: IASI conference Presentation: 4AOP: A fast and accurate operational forward radiative transfer model, https://cnes.fr/sites/default/files/migration/smsc/iasi/PDF/conf3/posters/84_Armante_R.pdf (last access: 31 March 2025), 2013. a
Beirle, S., Lampel, J., Lerot, C., Sihler, H., and Wagner, T.: Parameterizing the instrumental spectral response function and its changes by a super-Gaussian and its derivatives, Atmos. Meas. Tech., 10, 581–598, https://doi.org/10.5194/amt-10-581-2017, 2017. a, b, c, d
Bertaux, J.-L., Hauchecorne, A., Lefèvre, F., Bréon, F.-M., Blanot, L., Jouglet, D., Lafrique, P., and Akaev, P.: The use of the 1.27 µm O2 absorption band for greenhouse gas monitoring from space and application to MicroCarb, Atmos. Meas. Tech., 13, 3329–3374, https://doi.org/10.5194/amt-13-3329-2020, 2020. a
Cansot, E., Pistre, L., Castelnau, M., Landiech, P., Georges, L., Gaeremynck, Y., and Bernard, P.: MicroCarb instrument, overview and first results, Proc. SPIE 12777, Int. Conf. Space Optics, 12777, 1–13, https://doi.org/10.1117/12.2690330, 2022. a, b
Castelnau, M., Cansot, E., Buil, C., Pascal, V., Crombez, V., Lopez, S., Georges, L., and Dubreuil, M.: Modelization and validation of the diffraction effects in the MicroCarb instrument for accurately computing the instrumental spectral response function, in: Int. Conf. Space Optics-ICSO 2018, Chania, Greece, 12 July 2019, 11180, 1054–1068, SPIE, https://doi.org/10.1117/12.2536019, 2019. a, b
Crisp, D., Rosenberg, R., Chapsky, L., Keller Rodrigues, G. R., Lee, R., Merrelli, A., Osterman, G., Oyafuso, F., Pollock, R., Spiers, G., Yu, S., Zong, J., and Eldering, A.: Orbiting Carbon Observatory – 2 & 3 (OCO-2 & OCO-3)- Level 1B! Algorithm Theoretical Basis Document, https://sentinels.copernicus.eu/documents/247904/2476257/Sentinel-5P-TROPOMI-Level-1B-ATBD (last access: 15 November 2023), 2021. a
Figueiredo, M. A. T., Nowak, R. D., and Wright, S. J.: Gradient Projection for Sparse Reconstruction: Application to Compressed Sensing and Other Inverse Problems, IEEE J. Sel. Top. Signa., 1, 586–597, https://doi.org/10.1109/JSTSP.2007.910281, 2007. a, b
Kim, S.-J., Koh, K., Lustig, M., Boyd, S., and Gorinevsky, D.: An Interior-Point Method for Large-Scale ℓ1-Regularized Least Squares, IEEE J. Sel. Top. Signa., 1, 606–617, https://doi.org/10.1109/JSTSP.2007.910971, 2007. a
Lagarias, J. C., Reeds, J. A., Wright, M. H., and Wright, P. E.: Convergence Properties of the Nelder-Mead Simplex Method in Low Dimensions, SIAM J. Optimiz., 9, 112–147, 1998. a
Lee, R. A. M., O’Dell, C. W., Wunch, D., Roehl, C. M., Osterman, G. B., Blavier, J.-F., Rosenberg, R., Chapsky, L., Frankenberg, C., Hunyadi-Lay, S. L., Fisher, B. M., Rider, D. M., Crisp, D., and Pollock, R.: Preflight Spectral Calibration of the Orbiting Carbon Observatory 2, IEEE T. Geosci. Remote, 55, 2499–2508, https://doi.org/10.1109/TGRS.2016.2645614, 2017. a, b
Mallat, S. G. and Zhang, Z.: Matching pursuits with time-frequency dictionaries, IEEE T. Signal Proces., 41, 3397–3415, https://doi.org/10.1109/78.258082, 1993. a
NOVELTIS, CNES, and LMD: 4A/OP – Operational release for 4A – Automatized Atmospheric Absorption Atlas, https://4aop.noveltis.fr/references-and-publications (last access: 31 March 2025), 2012. a
OCO-2 Science Team/Gunson, M. and Eldering, A.: OCO-2 Level 1B calibrated, geolocated calibration spectra, Retrospective Processing V11r, Greenbelt, MD, USA, Goddard Earth Sciences Data and Information Services Center (GES DISC), https://disc.gsfc.nasa.gov/datacollection/OCO2_L1B_Calibration_11r.html (last access: 15 November 2023), 2019. a
Pati, Y. C., Rezaiifar, R., and Krishnaprasad, P. S.: Orthogonal matching pursuit: recursive function approximation with applications to wavelet decomposition, in: Proc. Asilomar Conf. Signals, Systems and Computers, Pacific Grove, Pacific Grove, CA, USA, 1–3 November 1993, 40–44, https://doi.org/10.1109/ACSSC.1993.342465, 1993. a
Pittet, C., Crombez, V., Jouglet, D., Georges, L., Cansot, E., and Albert-Aguilar, A.: In-flight estimation of the MicroCarb Instrument Spectral Response Functions, Geophysical Research Abstracts, 21, EGU2019-6889, 2019. a
Sun, K., Liu, X., Huang, G., González Abad, G., Cai, Z., Chance, K., and Yang, K.: Deriving the slit functions from OMI solar observations and its implications for ozone-profile retrieval, Atmos. Meas. Tech., 10, 3677–3695, https://doi.org/10.5194/amt-10-3677-2017, 2017a. a
Sun, K., Liu, X., Nowlan, C. R., Cai, Z., Chance, K., Frankenberg, C., Lee, R. A. M., Pollock, R., Rosenberg, R., and Crisp, D.: Characterization of the OCO-2 instrument line shape functions using on-orbit solar measurements, Atmos. Meas. Tech., 10, 939–953, https://doi.org/10.5194/amt-10-939-2017, 2017b. a
Tan, M., Tsang, I. W., and Wang, L.: Matching Pursuit LASSO Part I: Sparse Recovery Over Big Dictionary, IEEE T. Signal Proces., 63, 727–741, https://doi.org/10.1109/TSP.2014.2385036, 2015. a
Tibshirani, R.: Regression Shrinkage and Selection via the Lasso, J. R. Stat. Soc. B, 58, 267–288, 1996. a, b
Tošić, I. and Frossard, P.: Dictionary Learning, IEEE Signal Process. Mag., 28, 27–38, https://doi.org/10.1109/MSP.2010.939537, 2011. a, b
Zhang, C.-H. and Huang, J.: The sparsity and bias of the Lasso selection in high-dimensional linear regression, Ann. Stat., 36, 1567–1594, https://doi.org/10.1214/07-AOS520, 2008. a
Zhang, Z., Xu, Y., Yang, J., Li, X., and Zhang, D.: A Survey of Sparse Representation: Algorithms and Applications, IEEE Access, 3, 490–530, https://doi.org/10.1109/ACCESS.2015.2430359, 2015. a, b
In practice, the wavelength associated with the pixel is obtained as the center (maximum, median or barycenter) of the measured ISRF at the given pixel. However, there are some effects, such as the smile effect (in ACT), or some gaps in our knowledge about the wavelengths (in along track) that can result in spectral shifts, which can degrade the estimation of ISRFs. These aspects are not considered in the present work. Thus, it is assumed that each pixel is associated with one wavelength which is known, and we address the ISRF estimation problem by solving an inverse problem.
Additional measurements could in principle be obtained experimentally using, e.g., a spectrally tunable onboard calibration source, albeit at an extra cost.
Alternative designs, such as Fourier transform infrared spectroscopy (FTIR), are also employed in practice, and the associated ISRFs can be obtained through the inverse Fourier transform. However, in certain applications, applying the Fourier transform can become more challenging (i.e., when undersampling is necessary or when the optical path difference varies depending on the position). If the problem can be modeled as a linear inverse problem, sparse-representation-based methods can be used with these spectrometers to estimate ISRFs. The proposed method is not specific to any instrument and can be applied to any instrument for which the problem can be formulated as a linear inverse problem.
In the case of the MicroCarb mission, the binning represents a compromise between the objective of achieving a good signal to noise ratio (SNR) and maintaining a suitable ground grid, which has a resolution of 13.5 km in ACT and 9 km along the track.
It can be possible to defocus the instrument in order to avoid this dependence on the slit illumination. However, the introduction of a defocus can potentially compromise the precision of the instrument, and thus it was ultimately decided to exclude this option from the MicroCarb instrument.
In practice, there is no information available regarding the non-uniformity of a given scene from the measured spectra. It is only during the inversion process, when estimating the ISRFs, that it becomes apparent (by looking at the measured spectra and the associated residuals) that the ISRFs have been modified. For a given reference spectrum, non-uniform scenes are generated using asymmetric ISRFs; see Pittet et al. (2019) for more details.
- Abstract
- Introduction
- Existing models and estimation methods
- Sparse approximations of ISRFs
- Instruments, datasets and preprocessing
- Results and discussion
- Conclusions
- Appendix A: Algorithms
- Code and data availability
- Author contributions
- Competing interests
- Disclaimer
- Acknowledgements
- Financial support
- Review statement
- References
- Supplement
- Abstract
- Introduction
- Existing models and estimation methods
- Sparse approximations of ISRFs
- Instruments, datasets and preprocessing
- Results and discussion
- Conclusions
- Appendix A: Algorithms
- Code and data availability
- Author contributions
- Competing interests
- Disclaimer
- Acknowledgements
- Financial support
- Review statement
- References
- Supplement