In-flight estimation of instrument spectral response functions using sparse representations

El Haouari, Jihanne; Gaucel, Jean-Michel; Pittet, Christelle; Tourneret, Jean-Yves; Wendt, Herwig

doi:https://doi.org/10.5194/amt-18-2573-2025

Articles | Volume 18, issue 12

https://doi.org/10.5194/amt-18-2573-2025

Articles | Volume 18, issue 12

Research article

18 Jun 2025

Research article |

| 18 Jun 2025

In-flight estimation of instrument spectral response functions using sparse representations

Jihanne El Haouari, Jean-Michel Gaucel, Christelle Pittet, Jean-Yves Tourneret, and Herwig Wendt

Abstract

High-resolution spectrometers are composed of different optical elements and detectors that must be modeled as accurately as possible. Specifically, accurate estimates of instrument spectral response functions (ISRFs) are critical in order not to compromise the retrieval of trace gas concentrations from spectral measurements. Currently, parametric models are used to estimate these response functions. However, these models cannot always take into account the diversity of ISRF shapes that are encountered in practical applications. This paper studies a new ISRF estimation method based on a sparse representation of the ISRF in a dictionary. The proposed method is shown to be very competitive when compared to parametric models, yielding up to 1 order of magnitude smaller normalized ISRF estimation errors. The method is applied to different high-resolution spectrometers, demonstrating its reproducibility for multiple remote sensing missions.

Download & links

Article (PDF, 7261 KB)

Supplement (769 KB)

Download & links

How to cite.

Received: 12 Apr 2024 – Discussion started: 17 Apr 2024 – Revised: 12 Dec 2024 – Accepted: 17 Jan 2025 – Published: 18 Jun 2025

1 Introduction

Space remote sensing makes it possible to remotely measure the composition of the atmosphere or the troposphere and to retrieve trace gas concentrations. It can also be used to monitor molecule fluxes at the Earth's surface, as is the case for the MicroCarb mission that is designed to monitor CO₂ fluxes (Cansot et al., 2022) in order to provide a better understanding of the carbon cycle, which is important in the context of climate change. This can be done by analyzing the interaction of the atmosphere with natural radiation, such as sunlight, or artificial radiation, generated for example by a laser. Indeed, the presence of some molecules in the path of radiation modifies its spectral content at the characteristic wavelengths of the different elements. The information directly obtained from satellites is the atmospheric spectrum. By considering some specific wavelengths of interest, it is possible to determine the concentration of the desired trace gases in a column of atmosphere by comparing these measured spectra with a reference spectrum obtained using a radiative transfer model.

The instruments used for gas concentration estimation are high-resolution spectrometers. Spectrometers consist mainly of an optical part (for example composed of a slit, a telescope and dispersive grating) and a detector. In this configuration, the telescope projects the image of the Earth onto the spectrometer slit and then onto the detector. Each pixel of the detector is associated with a spatial direction (called ACT for ACross Track) and a specific wavelength. A binning and an averaging along the ACT axis are performed in order to improve the signal to noise ratio (SNR). For each of the two parts (optical part and detector), a response function is defined, which leads to a continuous optical function and another function associated with each pixel of the detector. This results in a global response function associated with each pixel along the spectral axis, known as the instrument spectral response function (ISRF), associated with a specific wavelength. The ISRFs can vary significantly depending on the instrument considered, and their shapes depend on the central wavelength, among other factors. The estimation of trace gas concentrations is an inversion process that is performed on the ground from spectrometer measurements and the instrument ISRFs. The accuracy of this estimation highly depends on the knowledge of these ISRFs for all pixels. For some missions, ISRFs are expected to be known with a normalized error of less than 1 %, which represents a significant challenge given that the variations in ISRF shape across the entire band frequently exceed this threshold.

Spectrometers are first calibrated on the ground where their associated ISRFs are estimated experimentally. However, the ISRFs are subject to in-flight changes due to mechanical movements associated with the launch of the instruments, thermal changes in orbit, or certain sensitivities linked to the instrument itself (such as the MicroCarb's sensitivity to the scene). As a consequence, these ISRFs need to be re-estimated regularly in flight throughout the mission. The principle of the estimation is to take a measurement of a spectrally known scene and to compare it with a spectral model of the scene convolved with the ISRFs at different wavelengths. Parametric models have been widely used in the literature to estimate ISRFs. Gaussian and generalized Gaussian parametric models (referred to as “Gauss” and “super-Gauss”) were proposed in Beirle et al. (2017). Parametric models are attractive for their simplicity and small number of parameters. However, they are not flexible enough to represent the diversity of ISRF shapes adequately. The ISRF estimation problem and the most important parametric models that have been considered in the literature are detailed in Sect. 2.

The objective of this work is to overcome the limitations of the existing parametric ISRF estimation methods caused by their insufficient accuracy. To this end, we propose as a first major contribution a new estimation strategy based on sparse representations of the ISRFs in a dictionary of well-chosen atoms. More precisely, the ISRFs are decomposed in a dictionary that is constructed using several ISRFs that are available from ground characterization for each instrument. The dictionary can also be updated iteratively online. For each instrument, each ISRF is then approximated by a linear combination of a small number of atoms of the dictionary associated with the instrument. The proposed approach is detailed in Sect. 3. We investigate and compare two different methods for obtaining the sparse representations of ISRFs.

As a second contribution, we conduct an extensive numerical study of the proposed ISRF estimation approach and compare it to parametric methods for datasets from several different spectrometers used in space missions, whose characteristics are detailed in Sect. 4. The main focus is on the MicroCarb instrument (Cansot et al., 2022), which is dedicated to studying atmospheric carbon dioxide and oxygen, with the objective of determining their concentrations at the Earth's surface. Additional results showing the applicability of the proposed methodology to other spectrometers are reported for the Orbiting Carbon Observatory 2 (OCO-2) spectrometer (Lee et al., 2017) and complemented by results for several other spectrometers that are reported in the Supplement.

Numerical results are reported in Sect. 5 and lead to the conclusion that the proposed method yields significantly improved flexibility and accuracy for ISRF estimation when compared to previous state-of-the-art parametric methods, consistently through the different datasets and scenarios, with a small number of parameters that can easily and efficiently be estimated in real time. Moreover, the method is shown to be robust with respect to design choices, the noise corrupting the observed measurements, the ISRF changes depending on the scene, or the possible mismatches on the prior knowledge on the ISRFs or reference spectra.

2 Existing models and estimation methods

2.1 ISRF estimation model

The ISRF, which is sometimes referred to as the instrument line shape (ILS) (Sun et al., 2017 b) or slit function (Sun et al., 2017 a), is a function that describes the response of an instrument to a given wavelength. In this work, we only consider the spectral information, and thus each “pixel” l is associated with a specific wavelength λ_l, yielding an ISRF at this wavelength.¹ The in-flight identification of ISRFs is obtained from scenes that are assumed to be perfectly known radiometrically and spectrally (such as the Sun, the Moon, uniform scenes such as a desert, etc.), which are referred to as reference spectra. The principle of ISRF estimation is to determine the in-flight ISRFs for each wavelength λ_l that minimize some similarity measure between the measured spectrum s(λ_l) and the reference spectrum r(λ) convolved with the ISRF denoted as I_l(λ_l):

\begin{matrix} (1) & s (λ_{l}) = (r * I_{l}) (λ_{l}) = \int_{R} r (λ_{l} - u) I_{l} (u) d u, l = 1, \dots, N_{λ}, \end{matrix}

where * denotes convolution, and N_λ is the number of central wavelengths λ_l, each associated with one ISRF I_l. For practical purposes, this equation can be discretized, leading to

\begin{matrix} (2) & s (λ_{l}) \approx \sum_{n = - N / 2}^{N / 2} r (λ_{l} - n Δ) I_{l} (n Δ), l = 1, \dots, N_{λ}, \end{matrix}

where Δ is the sampling period between two consecutive points of the ISRF, which is assumed to be regularly sampled. In other words, a vector $I_{l} = [I_{l} (- \frac{N}{2} Δ), \dots, I_{l} (\frac{N}{2} Δ)]^{T} \in R^{N + 1}$ needs to be estimated for each ISRF, corresponding to the values that it takes on the wavelength grid at which the ISRFs are sampled. $Δ = {- \frac{N}{2} Δ, \dots, \frac{N}{2} Δ} \in R^{N + 1}$ . The objective of the ISRF estimation problem is to solve the inverse problem (Eq. 2) assuming knowledge of both the reference spectrum r(λ) and the measurements s(λ_l).

A major difficulty with the inverse problem (Eq. 2) is that there is only one measurement per fixed wavelength λ_l, which makes it impossible to estimate the vector I_l without further assumptions. Two approaches can be used to make this estimation problem identifiable.² The first idea is to consider knowledge of several reference spectra for every wavelength. The problem is that this would require not only a sufficient number of calibration scenes to be available, but also that they substantially differ for each wavelength in order to provide complementary information on the shapes of the ISRFs. The second method, which is considered in this paper, has the advantage that it makes use of only one reference spectrum and is based on the assumption that the ISRFs for adjacent wavelengths λ_l are similar; i.e., they exhibit slight variations along the spectral axis between λ_l and λ_l+1. It is expected that the average of the normalized absolute error between the ISRFs in a window of N_obs+1 observations and the central ISRF at wavelength λ_l is below a given criterion for the ISRF estimation error. Note that the larger this variation, the more important the discrepancies in ISRF shapes. The small-variation assumption is not valid for the whole set of wavelengths, and the size of the sliding window must be adjusted in order to solve the ISRF estimation problem. This is a reasonable assumption for the ISRFs of real-world spectrometers. To estimate the ISRF at wavelength λ_l, we propose considering a vector $s_{l} = {[s (λ_{l - \frac{N_{obs}}{2}}), \dots, s (λ_{l + \frac{N_{obs}}{2}})]}^{T} \in R^{N_{obs} + 1}$ of N_obs+1 observations, including those from the neighboring ISRFs as well. Rewritten in matrix form, Eq. 2 simplifies to

s_{l} = R_{l} I_{l},

where $R_{l} = {[r_{l - \frac{N_{obs}}{2}}, \dots, r_{l + \frac{N_{obs}}{2}}]}^{T} \in R^{(N_{obs} + 1) \times (N + 1)}$ contains the values $r_{l} = [r (λ_{l} - \frac{N}{2} Δ), \dots, r (λ_{l} + \frac{N}{2} Δ)] \in R^{N + 1}$ of the reference spectrum covered by the different ISRFs in the neighborhood (see algorithm in Appendix A1). Given a model for the ISRF, estimating I_l can then be conducted for each wavelength λ_l by minimizing the residual error $| | s_{l} - R_{l} I_{l} | |_{2}^{2}$ .

2.2 Parametric models

It is difficult to analytically construct accurate forward models with a small number of parameters for ISRFs because they would need to incorporate a significant number of “contributors” associated with the instrument optics (slit, mirror, lens, separator, dispersing element), the detector or the acquisition mode. The state of the art therefore considers simple parametric models. A classical way to model and estimate the ISRF at wavelength λ_l is to use a parametric Gaussian model defined by

\begin{matrix} (3) & \begin{aligned} I_{l, β_{G}} (x) & = A_{G} \exp [- \frac{(λ_{l} - x - μ_{G})^{2}}{2 σ_{G}^{2}}], \\ l = 1, \dots, N_{λ}, x \in Δ, \end{aligned} \end{matrix}

where $β_{G} = [A_{G}, μ_{G}, σ_{G}^{2}]^{T}$ is the unknown vector of parameters to be estimated.

An alternative ISRF model was studied (Beirle et al., 2017) using a generalized Gaussian distribution referred to as super-Gaussian in order to better fit the ISRF shapes:

\begin{matrix} (4) & \begin{aligned} I_{l, β_{SG}} (x) & = A_{SG} \exp [- {|\frac{λ_{l} - x - μ_{SG}}{w_{SG}}|}^{k_{SG}}], \\ l = 1, \dots, N_{λ}, x \in Δ, \end{aligned} \end{matrix}

where $β_{SG} = [A_{SG}, μ_{SG}, w_{SG}, k_{SG}]^{T}$ is the unknown parameter vector to be estimated. This model reduces to the Gaussian model when $w_{SG} = 2 σ_{G}^{2}$ and k_SG=2. The parameters w_SG and k_SG are the scale and shape parameters of the distribution, allowing more or less flat shapes to be modeled.

When using the parametric models (Eq. 3) and (Eq. 4), the ISRF estimation problem consists of estimating the unknown model parameters for each sliding window. This estimation can be performed using the least squares method, which minimizes the following cost function:

\begin{matrix} (5) & C_{l} (β) = \sum_{n = 1}^{N + 1} | | s_{l} - R_{l} I_{l, β} | |_{2}^{2}, l = 1, \dots, N_{λ}, \end{matrix}

where $β \in {β_{G}, β_{SG}}$ is the unknown parameter vector and $I_{l, β} = [I_{l, β} (δ_{1}), \dots, I_{l, β} (δ_{N + 1})]^{T}$ .

Simple parametric models, such as Gaussian or generalized Gaussian models, are attractive for their simplicity and small number of parameters, yet can struggle to take into account the variety of different ISRF shapes that can be observed in practice. An illustration is provided in Fig. 1, which shows examples of ISRFs for the MicroCarb mission. Clearly, these ISRFs cannot be accurately modeled by bell-shaped Gaussian distributions or by generalized Gaussians (because of the dip at the center, for example). This motivates the study of a new estimation method for ISRFs.

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f01

Figure 1Illustration of a superposition of 1024 ISRFs with centered wavelengths $λ_{l} = 758.3, \dots, 768.3$ nm around their central wavelengths. The ISRFs were simulated for band B1 of the MicroCarb instrument using uniform scenes.

Download

3 Sparse approximations of ISRFs

This paper investigates the use of sparse representations for ISRFs in a dictionary of well-chosen atoms. Models based on sparse approximations and on dictionary learning have been widely and successfully used for different signal and image processing applications (Zhang et al., 2015). These applications include image denoising, image classification, image reconstruction, compressed sensing or dimensionality reduction and involve large varieties of signals and images (Figueiredo et al., 2007; Tošić and Frossard, 2011). However, sparse representations have never been investigated for ISRF estimation, which is precisely the objective of this work.

3.1 Construction of the dictionary

Sparse representations express a given signal as a linear combination of a small number of signals that belong to a collection of reference patterns, or atoms, which is called a dictionary. This paper proposes decomposing the ISRF in a dictionary of atoms $Φ \in R^{(N + 1) \times N_{D}}$ :

\begin{matrix} (6) & I_{l} \approx I_{l}^{K} = Φ α_{l} = \sum_{k = 1}^{K} Φ_{γ_{k}} α_{l, k}, l = 1, \dots, N_{λ}, \end{matrix}

where $Φ_{γ_{k}}$ is the γ_kth selected atom, i.e., the γ_kth column of the dictionary Φ, and α_l,k is the corresponding non-zero coefficient of the sparse vector $α_{l} = [α_{l, 1}, \dots, α_{l, K}]^{T} \in R^{N_{D}}$ . The dictionary is built in such a way that linear combinations of a small number of its atoms (i.e., its columns) provide an efficient representation of the ISRF. Different methods allowing the dictionary to be built have been proposed in the literature. These methods are based on probabilistic learning, clustering, vector quantization or Bayesian inference (Tošić and Frossard, 2011). Dictionary learning usually involves a two-stage optimization structure, consisting first of a sparse-coding step to find the sparse vector α_l, which minimizes the objective function $| | I_{l} - Φ α_{l} | |_{2}^{2}$ for a fixed dictionary Φ, and then a dictionary update step, where the dictionary is estimated given a fixed sparse vector α_l. Depending on the application, the dictionary can be updated using a closed-form solution, gradient descent or ground truth data. In this work we investigate two different ways of building the dictionary Φ. The first method constructs Φ by using the N_D singular vectors associated with the largest singular values of the singular value decomposition (SVD) of a matrix composed of representative ISRF examples, as described in the algorithm of Appendix A2. The second method uses the K-SVD algorithm of Aharon et al. (2006), which belongs to the state of the art and is recalled in the algorithm of Appendix A5. The K-SVD algorithm is a generalization of the K-means algorithm in which the dictionary is updated by changing its columns separately and sequentially and applying K SVDs on an appropriate error matrix. Figure 2 displays the first atoms of dictionaries constructed using these two methods for band B1 of MicroCarb. These dictionaries are found to be similar, especially the first two atoms that correspond to the most energetic singular values. The first two atoms can be interpreted as the approximate average of all ISRFs used to build the dictionary (first atom) and a correction for adjusting the different widths of the ISRFs for different wavelengths (second atom), as seen in Fig. 1. The higher-order atoms obtained with SVD and K-SVD are slightly different but have similar shapes overall.

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f02

Figure 2Representation of the first four atoms of the dictionary of ISRFs, Φ, constructed using an SVD on the matrix of representative ISRFs (a) or using the K-SVD algorithm using the same matrix of representative ISRFs (b) for the MicroCarb spectrometer (band B1).

Download

3.2 Inverse problem

Assuming that the ISRF can be decomposed in the dictionary Φ as in Eq. (6), the measured spectrum can be written as follows:

\begin{aligned} s_{l} & \approx R_{l} I_{l} \approx R_{l} Φ α_{l} = Ψ_{l} α_{l}, \\ l = 1, \dots, N_{λ} . \end{aligned}

Thus, the ISRF estimation problem reduces to finding the sparse vector α_l that minimizes the residual $| | s_{l} - Ψ_{l} α_{l} | |_{2}^{2}$ . This sparse-coding problem has been mathematically formulated in different ways (Zhang et al., 2015). One can use the l₀ pseudo-norm regularization $| | \cdot | |_{0}$ with a penalty parameter μ, leading to the following problem:

\begin{matrix} (7) & \begin{aligned} \arg min_{α_{l}} L (α_{l}, μ) & = \arg min_{α_{l}} | | s_{l} - Ψ_{l} α_{l} | |_{2}^{2} + μ | | α_{l} | |_{0}, \\ l = 1, \dots, N_{λ} . \end{aligned} \end{matrix}

This problem is non-convex and NP hard, and many approximations and heuristics have been proposed in the literature to find an approximate solution. A standard method consists of using greedy algorithms such as the orthogonal matching pursuit (OMP). OMP is a modification of the matching pursuit (MP) algorithm, which improves convergence by adding an orthogonalization step (Mallat and Zhang, 1993; Pati et al., 1993). The atoms of the dictionary that minimize the data fidelity term $| | s_{l} - Ψ_{l} α_{l} | |_{2}^{2}$ are iteratively determined by minimizing the remaining residual error. The OMP algorithm is summarized in the algorithm of Appendix A3. Another method replaces the pseudo-norm l₀ in Eq. (7) by the l₁ norm, which leads to a convex problem known as the LASSO problem (Tan et al., 2015):

\begin{matrix} (8) & \begin{aligned} \arg min_{α_{l}} & L (α_{l}, μ) = \arg min_{α_{l}} | | s_{l} - Ψ_{l} α_{l} | |_{2}^{2} + γ | | α_{l} | |_{1}, \\ l = 1, \dots, N_{λ}, \end{aligned} \end{matrix}

and the related algorithms studied in, e.g., Figueiredo et al. (2007) and Kim et al. (2007).

The OMP and LASSO algorithms provide a highly flexible decomposition of the ISRF, as the choice of the dictionary is not constrained to a specific form. Indeed, the basis functions can be learned, for example by using the K-SVD algorithm in conjunction with various matching pursuit algorithms. Another advantage of these methods is that they do not necessitate any prior assumption on the shape of the ISRFs (such as Gaussian ISRFs) and estimate them in a non-parametric way.

In the following, this paper compares the use of fixed dictionaries obtained by a single SVD and dictionaries estimated by K-SVD (alternation between SVD to update the dictionary and OMP to update the sparse code). The proposed approach using OMP and LASSO (or other sparse formulations) and either fixed or re-estimated dictionaries is referred to as SPIRIT, meaning SParse representation of Instrument spectral Response Functions using a dIcTionary.

4 Instruments, datasets and preprocessing

The spectrometers used in this study are passive push-broom spectrometers, mainly hyperspectral dispersive spectrometers, such as the MicroCarb high-resolution spectrometer and the OCO-2 instrument. ³

4.1 Synthetic data generation

Reference spectra used in this study were generated using Automatized Atmospheric Absorption Atlas OPerational (4A/OP) software (NOVELTIS et al., 2012). This software is based on a fast and accurate line-by-line transfer model that can be integrated in operational processing chains including inverse problem processing (Armante et al., 2013). It was selected as the official radiative model and reference code by the Centre National d'Etudes Spatiales (CNES) for the MicroCarb mission. The profiles originate from the Thermodynamic Initial Guess Retrieval (TIGR) database, which is hosted by Aeris data. Data are available at https://www.aeris-data.fr/en/projects/thermodynamical-initial-guess-retrieval-tigr/ (last access: 31 March 2025). An example of a profile was selected from this database for the generation of a reference spectrum. The measured spectra were then obtained by convolving the reference spectrum with the ISRFs (normalized to area 1 for each instrument; see details in the next paragraphs) and embedded in additive Gaussian noise to generate representative measurements. The advantage of this data generation method is to provide ground truth ISRFs, which can be used to assess the performance of the different methods in a controlled scenario.

4.2 MicroCarb mission

MicroCarb is a mission developed by the CNES, whose aim is to ensure continuity with other carbon measuring missions such as OCO-2 and GOSAT, in order to monitor CO₂ fluxes at the Earth surface and determine CO₂ atmospheric concentrations. The MicroCarb mission uses a compact and low-cost space instrument that is smaller than current spectrometers. The instrument is capable of acquiring four spectral bands with a single detector. The first band, B1 (758.3–768.3 nm), is an O₂ band with a spectral resolution of about 0.01 nm. Bands B2 (1596.7–1618.9 nm) and B3 (2023–2051 nm), with respective spectral resolutions of about 0.02 and 0.03 nm, are sensitive to the concentration of CO₂ and have CO₂ absorption lines. The last band, B4 (1264–1282.2 nm), is a second O₂ band with a spectral resolution of about 0.02 nm. The wavelengths associated with this last band are closer to the CO₂ wavelength and can be used for validation of space-based greenhouse gas observation (Bertaux et al., 2020). The whole dataset has been delivered by the French Space Agency (CNES, Toulouse) containing 1024 ISRFs associated with 1024 spectral measurements for the different bands. The data used for this experiment are the first band of MicroCarb with N_λ=1024 ISRFs and a sample size of N=895. The design of the MicroCarb instrument, obtained from Castelnau et al. (2019), is displayed in Fig. 3. More details about MicroCarb can be found on the CNES website (see https://microcarb.cnes.fr/en, last access: 11 December 2024). A particularity of this mission is that the shapes of the ISRFs are strongly dependent on the scene observed by the instrument, which is discussed in Sect. 5.3.3.

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f03

Figure 3Principle design of the MicroCarb instrument reproduced from Castelnau et al. (2019).

5 Results and discussion

5.1 Numerical experiments and performance evaluation

The performance of the different ISRF estimation methods is evaluated in terms of ISRF estimation quality and the residual between the spectral measurements and their estimates. The quality of ISRF estimation can be quantified by the normalized absolute error between the ISRF and its estimate:

E_{l} = \sum_{n = - N / 2}^{N / 2} | I_{l} (n Δ) - \hat{I_{l}} (n Δ) | .

Note that for the instruments studied here, the ISRFs are assumed to be normalized to the unit area. The residual between the spectral measurements and their estimates is defined for each λ_l by

ρ_{l} = | | s_{l} - r_{l} {\hat{I}}_{l} | |_{2}^{2}

and summarized for an entire band in terms of the average residual:

ρ = \frac{1}{N_{λ}} \sum_{l = 1}^{N_{λ}} ρ_{l} .

In the MicroCarb mission, the ISRFs are considered to be well estimated when their normalized errors satisfy E_l<1 % for each wavelength. The performance of 1 % on the ISRF knowledge is an objective of the MicroCarb mission in order to provide an accurate determination of CO₂ concentrations. The 1 % requirement accounts for uncertainty, acquisition noise of ISRFs and interpolation and is used as a target in this work. The proposed SPIRIT method is compared to the parametric methods based on Gaussian and super-Gaussian models. The parameters of these models are estimated using the non-linear least squares algorithm based on the Nelder–Mead optimization algorithm (Lagarias et al., 1998) (MATLAB function fminsearch). This iterative algorithm requires an initialisation and a stopping criterion. For the initialization of the Gaussian model, the mean $μ_{G_{0}}$ was set to the sample mean of the ISRFs, the full width at half maximum (FWHM) was used for the standard deviation $σ_{G_{0}}$ and the amplitude was initialized as $A_{G_{0}} = (2 π σ_{G_{0}})^{- 1 / 2}$ . For the super-Gaussian model, the initialization was defined as $μ_{{SG}_{0}} = μ_{G_{0}}$ , $k_{{SG}_{0}} = 2$ , $w_{{SG}_{0}} = \sqrt{2} σ_{G_{0}}$ and $A_{{SG}_{0}} = \frac{k_{{SG}_{0}}}{2 w_{{SG}_{0}}} Γ (1 / k_{{SG}_{0}})$ , where Γ is the gamma function. The algorithm was stopped after a maximum number of iterations equal to 20 000. The dictionary used by SPIRIT was constructed using an SVD of a collection of approximately 10 % of the total number of ISRFs within the band of interest or estimated using the K-SVD algorithm initialized with this collection. In our experiments, we used N_D=25. Two different sparse-coding methods based on LASSO (Tibshirani, 1996) and OMP are investigated after dictionary construction. The first method uses a MATLAB implementation of LASSO with a parameter μ>0 adjusted to obtain a desired number of atoms. The non-zero coefficients obtained with LASSO were re-estimated in order to reduce the shrinking bias inherent to this method (Zhang and Huang, 2008). The implementations of the OMP and LASSO algorithms are summarized in Appendixes A3 and A4.

5.2 ISRF estimation performance

5.2.1 ISRF estimation for the MicroCarb mission

An example of an ISRF simulated for the MicroCarb mission and the estimates obtained with the different methods is displayed in Fig. 4. The results clearly illustrate the advantage of using SPIRIT for ISRF estimation, which leads to normalized estimation errors of less than 1 %, significantly below those obtained using the parametric estimation methods. A comparison between the different sparse approximations (OMP, LASSO) and dictionaries (SVD, K-SVD) that can be used by SPIRIT shows that OMP works better than LASSO for this example. Moreover, using the K-SVD algorithm does not significantly improve the results with respect to SVD, although it has significantly higher computational complexity.

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f04

Figure 4Example of a simulated ISRF for the MicroCarb mission and its estimates using parametric methods and SPIRIT.

Download

The spectral measurements displayed in Fig. 5 were simulated by the CNES for the B1 wavelength range (758.4–768.9 nm). The results show that, for the MicroCarb spectrometer, the use of the super-Gauss parameterization reduces the residual error and ISRF approximation errors compared with the Gaussian model. SPIRIT yields significantly better results, with ISRF approximation errors below 1 % and of the order of 0.1 % for certain wavelengths. LASSO leads to overall less accurate approximations of the ISRFs than OMP, at a significantly higher computational cost, and the use of OMP is overall and consistently beneficial.

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f05

Figure 5Illustrations of (a) the measured spectrum reconstruction, (b) the difference between the measured spectrum and the reconstructed spectra, (c) the residuals ρ_l for each wavelengths, (d) the ISRF approximation error versus the wavelength, and (e) the mean ISRF approximation error versus the number of selected atoms for different methods (Gauss, super-Gauss, OMP, LASSO, SVD and K-SVD) and for band B1 of the MicroCarb instrument.

Download

Sum of two generalized Gaussians

ISRFs can also be modeled using other parametric models, such as the sum of two generalized Gaussians with different shifted center wavelengths, although this has not yet been reported in the literature. As displayed in Fig. 6, this novel parametric approach yields enhanced outcomes as compared to the use of Gaussian and super-Gaussian models. However, the performance is still not competitive with respect to sparse-representation-based methods and necessitates more parameters to estimate. A more detailed study of such more complex parametric models is left for future work.

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f06

Figure 6Results obtained using the different methods with a dictionary constructed using 103 ISRFs from band B1 for the sparse-representation-based methods.

Download

5.2.2 ISRF estimation for the Orbiting Carbon Observatory 2 (OCO-2) spectrometer

This section studies the applicability of the proposed method to the Orbiting Carbon Observatory 2 (OCO-2) spectrometer. The OCO-2 spectrometer is used in a NASA Earth observing satellite mission that was launched in July 2014. This mission is dedicated to the study of atmospheric carbon dioxide and oxygen and aims at characterizing the global CO₂ seasonal cycles and to quantify the sources and sinks of carbon. OCO-2 is composed of three high-spectral-resolution imaging spectrometers for narrow spectral ranges. The characterization of ISRFs for this spectrometer is highly challenging and crucial due to this high spectral resolution. The ISRFs are measured for each pixel using a tunable diode laser during pre-flight calibration (Lee et al., 2017), and the results are stored in a look-up table. The data used in this article can be downloaded on the NASA data website EarthDATA (OCO-2 Science Team/Gunson and Eldering, 2019) (data available at https://disc.gsfc.nasa.gov/datacollection/OCO2_L1B_Calibration_11r.html, last access: 15 November 2023). The product considered in this study is the OCO-2 Level 1B version 11r for science acquired in March 2023, and the fourth footprint is used. Specification on the data product can be found in Crisp et al. (2021). Some of the ISRFs are declared as invalid due to radiometric, spatial, spectral or polarization problems (and are thus not considered for ISRF estimation). The ISRFs associated with bad pixels have not been considered in our experiments, resulting in a number of ISRFs lower than the number of pixels. To identify the ISRFs at the missing nominal wavelengths λ_l, a linear interpolation between two specified nominal wavelengths λ_a and λ_b with known ISRFs was employed. The resulting interpolated ISRF is defined by

\begin{matrix} (9) & I_{l} = \frac{λ_{l} - λ_{a}}{λ_{b} - λ_{a}} I_{b} + \frac{λ_{b} - λ_{l}}{λ_{b} - λ_{a}} I_{a} . \end{matrix}

Note that the number N_λ of wavelengths after interpolation may differ from the number of pixels of the instrument, which occurs if the ISRFs associated with the first and/or last pixels are missing. The ISRFs used for the experiments come from the O₂ A band of OCO-2 with N_λ=859 ISRFs and a sample size N=895. Figure 7 displays an example of an ISRF from the OCO-2 dataset. A visual comparison with Fig. 5 shows that the ISRF shapes can differ significantly depending on the considered wavelength and the instrument. This observation suggests that the dictionary must be adapted to the spectrometer. Another interesting observation is that although the super-Gaussian distribution should theoretically always provide a better fit than the Gaussian distribution, it is not systematically the case in practice because of convergence issues for the iterative methods used to solve the non-linear least squares problem for parameter estimation. Specifically, the model parameters are estimated using a simplex-based optimization method (MATLAB function fminsearch) that aims at minimizing the residuals between the measured and estimated spectra, which does not always converge to a better solution for the super-Gaussian model than for the Gaussian model.

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f07

Figure 7Example of an ISRF retrieved for the OCO-2 mission and its estimates using parametric methods and SPIRIT.

Download

Figure 8 displays performance results for the OCO-2 measurements obtained using the data for the O₂ band (757–772 nm). The measured spectrum is reconstructed with the proposed sparse-representation methods for K=5 atoms chosen using a dictionary constructed using SVD or K-SVD. The results indicate that the super-Gaussian model delivers slightly better results than the Gaussian model in terms of residual error and mean ISRF approximation error. However, for the smaller wavelengths of the band, the ISRF approximation errors are slightly larger with the super-Gaussian model, as already observed in Fig. 7 for a single ISRF. Both parametric models yield close to 10 % ISRF approximation errors. The proposed sparse-representation approach again yields far better ISRF approximations and measurement fits, with the best results obtained using OMP and SVD.

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f08

Figure 8Illustrations of (a) the measured spectrum reconstruction, (b) the difference between the measured spectrum and the reconstructed spectra, (c) the residuals ρ_l for each wavelength, (d) the ISRF approximation error versus the wavelength, and (e) the mean ISRF approximation error versus the number of selected atoms using different methods (Gauss, super-Gauss, OMP and LASSO with SVD or K-SVD) for the O₂ A band of the OCO-2 instrument.

Download

5.2.3 Conclusions

Overall, the conclusions from these experiments are as follows. First, the super-Gaussian parameterization often yields better performance than the Gaussian one, corroborating the results reported in Beirle et al. (2017). However, the normalized ISRF approximation errors obtained with these parametric methods are consistently larger than 1 %, for both instruments and for all wavelengths. In contrast, the proposed SPIRIT approach based on sparse approximations of ISRFs in a suitable dictionary yields significantly better results. This result is due to the fact that the ISRF shapes depend strongly on the spectrometer and can vary across wavelengths, which cannot be accommodated easily with a simple parametric model. On the contrary, decompositions in appropriate dictionaries that depend on the spectrometer and the chosen wavelength offer sufficient flexibility for all use cases considered in this paper. Regarding the estimation algorithms, SVD overall provides an estimation performance close to K-SVD, and OMP leads to better estimation than LASSO. There is no theoretical reason for OMP to provide better performance than LASSO. However, it is important to note that the OMP and LASSO algorithms address two distinct problems: the OMP algorithm provides an approximate solution to the problem with an ℓ₀ penalty, and the LASSO algorithm solves the relaxed problem using an ℓ₁ regularization. Certain limitations of the LASSO algorithm have been highlighted in numerous publications, including Tibshirani (1996), and may also be at the origin of our observation. The results overall suggest the use of SVD for building the dictionary and OMP for ISRF estimation.

The proposed methods can also be applied to other instruments, such as Avantes, GOME-2, OMI and TROPOMI used in Beirle et al. (2017). Results obtained with these instruments are available in the Supplement and lead to similar conclusions.

5.3 Robustness analysis and ablation study

5.3.1 Robustness to additive noise

Monte Carlo simulations were conducted to study the robustness of the different ISRF estimation methods to the presence of measurement noise. Independent white Gaussian noise was added to the spectral measurements with several signal to noise ratio (SNR) levels to take into account thermal noise and spatial binning: spatial binning involves the arbitrary division of the imaged area on Earth into distinct fields of view (FOVs) (e.g., three FOVs for MicroCarb). The measured spectrum for each FOV is obtained as an average of the measured spectra within that FOV. ⁴ Table 1 reports the obtained residual approximation errors and the normalized average ISRF approximation errors for the two instruments MicroCarb and OCO-2. Approximation errors less than <1 % are highlighted in italic. These results show that the proposed sparse representations meet this target for SNRs larger than 20 dB. Moreover, OMP is found to be more robust to noise than LASSO and yields the best results overall. The parametric models again lead to large errors. It is interesting to note that these errors do not vary significantly with the noise level. This indicates that errors due to model misfit are larger than those induced by the noise degradations. To conclude, OMP combined with SVD provides the best results overall for ISRF estimation, also in the presence of additive noise.

Table 1Mean residual and approximation errors for different SNRs and different methods (Gauss (G), super-Gauss (SG) and OMP and LASSO, SVD and K-SVD). The best results are highlighted in bold, while ISRF approximation errors less than 1 % are shown in italics.

Download Print Version | Download XLSX

5.3.2 Sensitivity to parameter tuning for SPIRIT

The proposed approach requires the choice of a small number of parameters, namely the size of the sliding window N_obs, the size of the dictionary N_D and the number of atoms K. The choice of K has been studied above, and the best results were obtained for $K \approx 4 - 5$ for both instruments; see Figs. 5 and 8 and the corresponding discussions in Sect. 5.2.2. Here, we further study the impact of N_obs and N_D on the ISRF approximation errors. To this end, Figs. 9 and 10 show the approximation errors (in log ₁₀ scale) as a function of N_obs for the Gaussian and super-Gaussian parameterizations and as functions of N_obs and N_D for SPIRIT. Results are reported for the two instruments OCO-2 and MicroCarb and averaged for all ISRFs. The ISRF estimation errors decrease as N_obs increases, as expected. However, this decrease is more important for SPIRIT (e.g., for N_obs=80, the mean ISRF errors for Gauss and super-Gauss are equal to 16.27 % and 2.04 %, whereas they are equal to 0.29 % for OMP/SVD, 0.33 % for OMP/K-SVD, 1.23 % for LASSO/SVD and 1.40 % for LASSO/K-SVD), showing the interest of exploiting sparsity for ISRF estimation. The results in Figs. 9 and 10 also indicate that it is beneficial to use dictionaries of modest size, since the ISRF estimation errors increase for large dictionaries (N_D≤100 for OMP and N_D≤25 for LASSO). Based on this observation, N_D=25 was used in all the experiments.

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f09

Figure 9Mean approximation errors for OCO-2 and the different estimation methods (Gauss, super-Gauss, OMP and LASSO with SVD or K-SVD) versus the number of observations N_obs and the dictionary size N_D for K=5.

Download

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f10

Figure 10Mean approximation errors for MicroCarb and the different estimation methods (Gauss, super-Gauss, OMP and LASSO with SVD or K-SVD) versus the number of observations N_obs and the dictionary size N_D for K=4.

Download

5.3.3 Robustness to ISRF changes

The ISRFs considered in the previous sections were obtained from uniform scenes referred to as “ISRF IN” for the MicroCarb mission. However these ISRFs can change depending on the scene observed by the instrument.

ISRFs for non-uniform scenes

The design of the MicroCarb instrument makes the ISRF sensitive to the slit illumination during the integration time. Such dependence on the scene can impact a multitude of instruments. ⁵ Eight different scenes of the Earth’s surface that are directly observed by the spectrometer's slit and subsequently recorded by the instrument's detector during the integration period are considered and are displayed in Fig. 12. These images were obtained in the ACT direction, and each image was divided along the ACT direction into three equal parts, resulting in three defined FOVs, labeled FOV 1, FOV 2 and FOV 3. The spatial pixels in each FOV are averaged to increase the spectral SNR. This binning and averaging step allows three measured spectra per imaged area to be determined, whose ISRFs have to be estimated. Figure 11 shows ISRFs from uniform scenes (left), randomly selected out of the 1024 ISRFs, and ISRFs from non-uniform scenes (right), randomly selected from the total set of eight scenes and three FOVs, highlighting the differences in ISRF shapes depending on the scene: the ISRFs can be more asymmetric for non-uniform scenes and are thus harder to estimate.⁶ It is interesting to note that the ISRF of a desert scene is very similar to the ISRF of a uniform scene, contrary to the ISRF of a horizontal coast profile, which makes the slit blinded during one-third of the integration time and leads to an asymmetric left-distorted ISRF, which is harder to estimate.

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f11

Figure 11Examples of ISRFs from uniform scenes (ISRF IN – a) and from different non-uniform scenes displayed in Fig. 12 and FOVs (ISRF scene – b) (MicroCarb band B1).

Download

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f12

Figure 12Eight types of scenes (a) with the corresponding ISRFs (FOV 2) (b) for the MicroCarb instrument.

Download

Estimation performance

This section studies the performance of SPIRIT in estimating non-uniform scene ISRFs for the first band (band B1) of the MicroCarb spectrometer. Two cases are considered: estimation using the original dictionary learned by examples of uniform ISRFs (ISRF IN) and estimation after modification of this dictionary to account for the diversity of ISRFs. Specifically, the second dictionary is constructed from a set of 103 ISRF INs (1 out of 10) and 3 ISRF scenes (out of 24). The second dictionary is then composed of N_D=25 new atoms obtained by SVD from this collection of representative ISRFs. Results obtained using SPIRIT with OMP are displayed in Fig. 13. In the first case (dictionary learned by uniform ISRFs, Fig. 13 top row), the resulting normalized ISRF errors exceed 1 % for several scenes and FOVs, pointing to the fact that the dictionary is not well adapted for representing ISRFs for non-uniform scenes. The results obtained using the second dictionary are presented in the bottom part of Fig. 13. Using only three additional examples of ISRF scenes in the dictionary again allows ISRF estimation errors to be smaller than 1 %. Note that the lowest approximation errors are obtained in most cases using K=3 to K=6 atoms from the dictionary, as before. To conclude, these results show that the proposed method can easily adapt to more complex ISRF shapes by considering more diverse ISRF examples in the dictionary estimation step.

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f13

Figure 13ISRF estimation errors for ISRF scenes obtained using a dictionary of uniform ISRFs (a, b, c) and mixed ISRFs (d, e, f).

Download

5.3.4 Robustness to pixel errors

Instrumental errors within a single pixel l can distort the shape of the ISRF of this pixel, leading to the creation of an outlier. This section investigates the impact of such outliers on ISRF estimation. To simulate this scenario, an ISRF from the OCO-2 instrument was inserted in pixel l=500 of band B1 of MicroCarb data, simulating an outlier in this pixel. The initial ISRF of the 500th band of MicroCarb and its new version are displayed in Fig. 14 (see black and red curves respectively). The corresponding estimation results, compared to those from the previous study without outliers, are displayed in Fig. 15. These results demonstrate that the presence of an erroneous ISRF in the sliding window leads to an increase in estimation errors for the windows containing the outlier since the ISRF estimation becomes more challenging. However, the results also indicate that the outlier ISRF could first be identified by inspecting the residuals between the measured spectrum and the ISRF reconstructions and then not be considered for ISRF estimation.

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f14

Figure 14Illustration of the generated ISRF (red) at pixel l=500 in the presence of pixel errors as compared to the original ISRF (black) for band B1 of MicroCarb.

Download

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f15

Figure 15Residuals (a) and ISRF estimation errors (b) obtained in the presence of pixel errors for the MicroCarb spectrometer using the different methods (Gauss, super-Gauss, SVD/KSVD and OMP/LASSO).

Download

5.3.5 Impact of uncertainties about the reference spectra and reference ISRFs

This section analyzes the impact of uncertainties about the ISRFs used to build the dictionary and the reference spectrum on the ISRF estimation performance.

Uncertainties about the ISRFs

To evaluate the impact of uncertainties affecting the ISRFs, Gaussian noise is added to one-third of the ISRFs used to construct the dictionary, with SNR=40 and SNR=60 dB. The noisy ISRFs are then made positive by taking their absolute values and normalized to have a unit area. The results, displayed in the left part of Fig. 16 (using K=4 atoms for the plot in the top row), show that as noise increases, better results are achieved with smaller values of K in the presence of noise with an increase in ISRF estimation errors. However, the estimation is relatively robust to the presence of noise affecting ISRFs used to build the dictionary since approximation errors remain below 1 % on average for both noise levels.

https://amt.copernicus.org/articles/18/2573/2025/amt-18-2573-2025-f16

Figure 16Results obtained using SVD and OMP for the different scenarios of noisy ISRFs in the construction of the dictionary (a) and noisy reference spectra (b) for band B1 of MicroCarb.

Download

Uncertainties about the reference spectrum

In a second experiment, Gaussian noise is added to the reference spectrum, with SNR=20, SNR=40 and SNR=60 dB. The results are shown in the right part of Fig. 16 (using K=4 atoms for the plot in the top row). Using a reference spectrum corrupted by additive noise clearly has a smaller impact on estimation performance when compared to degradations affecting ISRFs used to build the dictionary. Note that high noise levels (SNR =20 dB) are necessary to significantly increase ISRF estimation errors, probably because of an averaging effect when computing spectral measurement by convolution of the reference spectrum with the ISRF.

Overall, these results indicate that the proposed method is robust to uncertainties in both the ISRFs and the reference spectrum, with ISRF approximation errors remaining below 1 % for realistic SNR levels.

6 Conclusions

This paper studied a new method for estimating the instrument spectral response functions (ISRFs) of spectrometers. This method is based on a sparse decomposition of the ISRFs into a dictionary of basis functions called atoms. The proposed method can be applied to a large variety of instruments as long as the ISRF estimation problem can be formulated as a linear inverse problem with a sufficient number of measurements (either because the ISRFs do not vary much in a small observation window, in the spectral or spatial domains, or because observations from several reference spectra can be obtained for the same ISRF). The method also requires that a sufficient number and variety of reference ISRFs have been identified and characterized on the ground to construct the dictionary. We recommend using the SVD algorithm to build the dictionary using representative ISRFs and the orthogonal matching pursuit (OMP) algorithm to decompose the ISRFs into this dictionary. The performance of these algorithms is excellent at the price of a very modest computational cost, which suggests its practicality for in-flight scenarios. Another interesting property of the proposed estimation method is that it is not impacted significantly by the shapes of the ISRFs to be estimated, allowing accurate estimations for different types of scenes. Numerical experiments presented in this paper also showed that the ISRFs of the MicroCarb and OCO-2 spectrometers can be estimated with approximation errors smaller than 1 %, which is very promising. Other results available in the Supplement confirm this conclusion for other spectrometers such as Avantes, GOME-2, OMI and TROPOMI.

Future work includes the consideration of radiometric and spectral errors (such as stray light, residual errors of calibration, temporal drifts or spectral shifts) that can degrade the performance of ISRF estimation. These errors are expected to affect some specific wavelengths, which suggests the importance of investigating specific algorithms, jointly correcting errors and estimating the ISRFs. The resulting problem is more challenging since there are non-linear relationships between the spectrometer measurements and these radiometric and spectral errors. Another interesting prospect is to analyze the potential of other methods, e.g., based on Gaussian mixtures or machine learning algorithms, for error correction and ISRF estimation. Finally, it would be interesting to assess the impact of potential uncertainties about the reference spectra and the ISRFs used to build the dictionary more extensively.

Appendix A: Algorithms

Appendix A describes the algorithm used to create the matrix of reference spectra, the OMP algorithm and the K-SVD algorithm. The LASSO algorithm was implemented using the MATLAB function lasso.m. The method used to select the hyperparameter μ is also presented.

A1 Reference spectrum matrix

The algorithm takes as an input the reference spectrum as a vector, the corresponding wavelengths λ_r, the wavelengths associated with the measured spectrum λ and the wavelengths associated with the ISRF Δ introduced in Sect. 2.

Algorithm A1Generation of the reference spectrum matrix.

Input: reference spectrum r, wavelengths of r denoted as λ_r, wavelengths of the measured spectrum λ, and ISRF wavelength Δ

Output: reference spectrum matrix for all wavelengths R.

for

l = 1, \dots, N_{λ}

λ_l=λ(l)

λ_{resp} = λ_{l} + Δ

R (l, :) = interp (λ_{r}, r, λ_{resp})

end for

return R

A2 Construction of the dictionary

This appendix describes the construction of the dictionary Φ that is used in the sparse-representation-based algorithms K-SVD, LASSO and OMP.

Algorithm A2Construction of the dictionary.

Input: matrix of selected ISRFs I, size of the dictionary N_obs

Output: dictionary of ISRFs Φ.

[U, Γ, V^{*}] = SVD (I)

Φ = V (:, 1 : N_{obs})

return Φ

A3 OMP algorithm

Appendix A3 describes the OMP algorithm used to find the sparse representation of the ISRF I_l of interest using K non-zero coefficients in the dictionary Φ from the measured spectrum s_l and the reference spectrum matrix R_l contained in the sliding window.

Algorithm A3Orthogonal matching pursuit (OMP) algorithm.

Input: measured spectrum s_l, reference spectrum matrix R_l, dictionary of ISRFs Φ and sparsity parameter K

Output: sparse vector α_l.

Ψ_l=R_lΦ

U₁=s_l

for

k = 1, \dots, K

Find

Ψ_{γ_{k}} \in Ψ_{l}

that maximizes the scalar product

| 〈 U_{k}, Ψ_{γ_{k}} / | | Ψ_{γ_{k}} | | 〉 |

Find

[α_{γ_{1}}, \dots, α_{γ_{k}}] \in α_{l}

that solves

\arg {min}_{α} | | U_{k} - \sum_{k^{'} = 1}^{k} α_{γ_{k^{'}}} Ψ_{γ_{k^{'}}} | |_{2}^{2}

U_{k + 1} = s_{l} - \sum_{k^{'} = 1}^{k} α_{γ_{k^{'}}} Ψ_{γ_{k^{'}}}

end for

return α_l

A4 LASSO algorithm

The MATLAB function lasso.m is used to find the sparse representation of the ISRF I_l in the dictionary Φ using K non-zero coefficients, from the measured spectrum s_l and the reference spectrum matrix R_l associated with the sliding window. A dichotomic search is used to obtain the sparsity parameter μ that leads to a given number non-zero coefficient K. The associated algorithm is described in Algorithm A4.

Algorithm A4LASSO algorithm.

Input: measured spectrum s_l, reference spectrum matrix R_l, dictionary of ISRFs Φ, sparsity parameter K, minimum value of the LASSO sparsity parameter μ_min and maximum value of the LASSO sparsity parameter μ_max

Output: sparse vector α_l.

Ψ_l=R_lΦ

α_{resp} = lasso (Ψ_{l}, s_{l}, “lambda”

while sparsity( α_resp) ≠K do

μ = \frac{μ_{min} + μ_{max}}{2}

α_{resp} = lasso (Ψ_{l}, s_{l}, “lambda”, μ, “Alpha”, 1)

if sparsity(α_resp) <K then

μ_max=μ

else

μ_min=μ

10:

end if

11:

end while

12:

Find the non-zero components in α_resp to form the vector

[γ_{1}, \dots, γ_{K}]

13:

Re-estimate the non-zero sparse coefficients: find

[α_{γ_{1}}, \dots, α_{γ_{k}}] \in α_{l}

that solves

\arg {min}_{α} | | s_{l} - \sum_{k^{'} = 1}^{k} α_{γ_{k^{'}}} Ψ_{γ_{k^{'}}} | |_{2}^{2}

14:

return α_l

A5 K-SVD algorithm

The K-SVD algorithm of Aharon et al. (2006) is described in Algorithm A5. At each step, the dictionary is updated by changing its columns separately and sequentially and applying K singular value decompositions (SVDs) on the appropriate error matrix E_j.

Algorithm A5Construction of the dictionary using the K-SVD algorithm.

Input: matrix of selected ISRFs I, number of selected ISRFs L, size of the dictionary N_obs, dictionary Φ obtained using SVD in Algorithm (2) and sparsity parameter K

Output: new dictionary of ISRFs Φ.

while not converging do

Sparse-coding step:

x_{l} = OMP (I_{l}, Φ, K) \forall l = 1, \dots, L

Dictionary update:

for

j = 1, \dots, N_{obs}

Define the group of examples that uses the jth column of the dictionary j,

w_{j} = {l | 1 \leq l \leq N, x_{T}^{j} (l) \neq 0}

Compute the overall representation error matrix,

E_{j} = I - \sum_{i \neq j} ϕ_{i} x_{T}^{i}

Build

E_{j}^{R}

from E_j using the columns corresponding to w_j

SVD

[U, Γ, V^{*}] = SVD (E_{j}^{R})

Update the dictionary column ϕ_j as the first column of U and the vector

x_{R}^{j}

as the first column of VΓ(1,1).

10:

end for

11:

end while

12:

return Φ

Code and data availability

The results obtained with the proposed method for the instruments Avantes, GOME-2, OMI and TROPOMI are provided in the Supplement. More details on the data and code used in this study are available upon request from the corresponding author.

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/amt-18-2573-2025-supplement.

Author contributions

JEH gathered the data for different spectrometers and CP for the MicroCarb spectrometer. JMG and CP contributed to a first formalization of the problem. Mathematical formulation, implementation and formal analysis were conducted by JEH, JYT and HW. All authors contributed to the writing process through discussion and feedback.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

We would like to thank Denis Jouglet from the Department of Atmospheric Sounding at CNES for providing the reference spectra at the different wavelengths. Moreover, we express our gratitude to Steffen Beirle from the Max Planck Institute for Chemistry (MPI-C) for helpful discussions and for providing some ISRF data.

Financial support

This study was supported by the French Space Agency (CNES), France, and Thales Alenia Space, Cannes, France.

Review statement

This paper was edited by Mark Weber and reviewed by two anonymous referees.

References

Aharon, M., Elad, M., and Bruckstein, A.: K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation, IEEE T. Signal Proces., 54, 4311–4322, https://doi.org/10.1109/TSP.2006.881199, 2006. a, b

Armante, R., Scott, N. A., Capelle, V., Chédin, A., Bernard, E., Standfuss, C., Tournier, B., and Pierangelo, C.: IASI conference Presentation: 4AOP: A fast and accurate operational forward radiative transfer model, https://cnes.fr/sites/default/files/migration/smsc/iasi/PDF/conf3/posters/84_Armante_R.pdf (last access: 31 March 2025), 2013. a

Beirle, S., Lampel, J., Lerot, C., Sihler, H., and Wagner, T.: Parameterizing the instrumental spectral response function and its changes by a super-Gaussian and its derivatives, Atmos. Meas. Tech., 10, 581–598, https://doi.org/10.5194/amt-10-581-2017, 2017. a, b, c, d

Bertaux, J.-L., Hauchecorne, A., Lefèvre, F., Bréon, F.-M., Blanot, L., Jouglet, D., Lafrique, P., and Akaev, P.: The use of the 1.27 µm O2 absorption band for greenhouse gas monitoring from space and application to MicroCarb, Atmos. Meas. Tech., 13, 3329–3374, https://doi.org/10.5194/amt-13-3329-2020, 2020. a

Cansot, E., Pistre, L., Castelnau, M., Landiech, P., Georges, L., Gaeremynck, Y., and Bernard, P.: MicroCarb instrument, overview and first results, Proc. SPIE 12777, Int. Conf. Space Optics, 12777, 1–13, https://doi.org/10.1117/12.2690330, 2022. a, b

Castelnau, M., Cansot, E., Buil, C., Pascal, V., Crombez, V., Lopez, S., Georges, L., and Dubreuil, M.: Modelization and validation of the diffraction effects in the MicroCarb instrument for accurately computing the instrumental spectral response function, in: Int. Conf. Space Optics-ICSO 2018, Chania, Greece, 12 July 2019, 11180, 1054–1068, SPIE, https://doi.org/10.1117/12.2536019, 2019. a, b

Crisp, D., Rosenberg, R., Chapsky, L., Keller Rodrigues, G. R., Lee, R., Merrelli, A., Osterman, G., Oyafuso, F., Pollock, R., Spiers, G., Yu, S., Zong, J., and Eldering, A.: Orbiting Carbon Observatory – 2 & 3 (OCO-2 & OCO-3)- Level 1B! Algorithm Theoretical Basis Document, https://sentinels.copernicus.eu/documents/247904/2476257/Sentinel-5P-TROPOMI-Level-1B-ATBD (last access: 15 November 2023), 2021. a

Figueiredo, M. A. T., Nowak, R. D., and Wright, S. J.: Gradient Projection for Sparse Reconstruction: Application to Compressed Sensing and Other Inverse Problems, IEEE J. Sel. Top. Signa., 1, 586–597, https://doi.org/10.1109/JSTSP.2007.910281, 2007. a, b

Kim, S.-J., Koh, K., Lustig, M., Boyd, S., and Gorinevsky, D.: An Interior-Point Method for Large-Scale ℓ₁-Regularized Least Squares, IEEE J. Sel. Top. Signa., 1, 606–617, https://doi.org/10.1109/JSTSP.2007.910971, 2007. a

Lagarias, J. C., Reeds, J. A., Wright, M. H., and Wright, P. E.: Convergence Properties of the Nelder-Mead Simplex Method in Low Dimensions, SIAM J. Optimiz., 9, 112–147, 1998. a

Lee, R. A. M., O’Dell, C. W., Wunch, D., Roehl, C. M., Osterman, G. B., Blavier, J.-F., Rosenberg, R., Chapsky, L., Frankenberg, C., Hunyadi-Lay, S. L., Fisher, B. M., Rider, D. M., Crisp, D., and Pollock, R.: Preflight Spectral Calibration of the Orbiting Carbon Observatory 2, IEEE T. Geosci. Remote, 55, 2499–2508, https://doi.org/10.1109/TGRS.2016.2645614, 2017. a, b

Mallat, S. G. and Zhang, Z.: Matching pursuits with time-frequency dictionaries, IEEE T. Signal Proces., 41, 3397–3415, https://doi.org/10.1109/78.258082, 1993. a

NOVELTIS, CNES, and LMD: 4A/OP – Operational release for 4A – Automatized Atmospheric Absorption Atlas, https://4aop.noveltis.fr/references-and-publications (last access: 31 March 2025), 2012. a

OCO-2 Science Team/Gunson, M. and Eldering, A.: OCO-2 Level 1B calibrated, geolocated calibration spectra, Retrospective Processing V11r, Greenbelt, MD, USA, Goddard Earth Sciences Data and Information Services Center (GES DISC), https://disc.gsfc.nasa.gov/datacollection/OCO2_L1B_Calibration_11r.html (last access: 15 November 2023), 2019. a

Pati, Y. C., Rezaiifar, R., and Krishnaprasad, P. S.: Orthogonal matching pursuit: recursive function approximation with applications to wavelet decomposition, in: Proc. Asilomar Conf. Signals, Systems and Computers, Pacific Grove, Pacific Grove, CA, USA, 1–3 November 1993, 40–44, https://doi.org/10.1109/ACSSC.1993.342465, 1993. a

Pittet, C., Crombez, V., Jouglet, D., Georges, L., Cansot, E., and Albert-Aguilar, A.: In-flight estimation of the MicroCarb Instrument Spectral Response Functions, Geophysical Research Abstracts, 21, EGU2019-6889, 2019. a

Sun, K., Liu, X., Huang, G., González Abad, G., Cai, Z., Chance, K., and Yang, K.: Deriving the slit functions from OMI solar observations and its implications for ozone-profile retrieval, Atmos. Meas. Tech., 10, 3677–3695, https://doi.org/10.5194/amt-10-3677-2017, 2017a. a

Sun, K., Liu, X., Nowlan, C. R., Cai, Z., Chance, K., Frankenberg, C., Lee, R. A. M., Pollock, R., Rosenberg, R., and Crisp, D.: Characterization of the OCO-2 instrument line shape functions using on-orbit solar measurements, Atmos. Meas. Tech., 10, 939–953, https://doi.org/10.5194/amt-10-939-2017, 2017b. a

Tan, M., Tsang, I. W., and Wang, L.: Matching Pursuit LASSO Part I: Sparse Recovery Over Big Dictionary, IEEE T. Signal Proces., 63, 727–741, https://doi.org/10.1109/TSP.2014.2385036, 2015. a

Tibshirani, R.: Regression Shrinkage and Selection via the Lasso, J. R. Stat. Soc. B, 58, 267–288, 1996. a, b

Tošić, I. and Frossard, P.: Dictionary Learning, IEEE Signal Process. Mag., 28, 27–38, https://doi.org/10.1109/MSP.2010.939537, 2011. a, b

Zhang, C.-H. and Huang, J.: The sparsity and bias of the Lasso selection in high-dimensional linear regression, Ann. Stat., 36, 1567–1594, https://doi.org/10.1214/07-AOS520, 2008. a

Zhang, Z., Xu, Y., Yang, J., Li, X., and Zhang, D.: A Survey of Sparse Representation: Algorithms and Applications, IEEE Access, 3, 490–530, https://doi.org/10.1109/ACCESS.2015.2430359, 2015. a, b

In practice, the wavelength associated with the pixel is obtained as the center (maximum, median or barycenter) of the measured ISRF at the given pixel. However, there are some effects, such as the smile effect (in ACT), or some gaps in our knowledge about the wavelengths (in along track) that can result in spectral shifts, which can degrade the estimation of ISRFs. These aspects are not considered in the present work. Thus, it is assumed that each pixel is associated with one wavelength which is known, and we address the ISRF estimation problem by solving an inverse problem.

Additional measurements could in principle be obtained experimentally using, e.g., a spectrally tunable onboard calibration source, albeit at an extra cost.

Alternative designs, such as Fourier transform infrared spectroscopy (FTIR), are also employed in practice, and the associated ISRFs can be obtained through the inverse Fourier transform. However, in certain applications, applying the Fourier transform can become more challenging (i.e., when undersampling is necessary or when the optical path difference varies depending on the position). If the problem can be modeled as a linear inverse problem, sparse-representation-based methods can be used with these spectrometers to estimate ISRFs. The proposed method is not specific to any instrument and can be applied to any instrument for which the problem can be formulated as a linear inverse problem.

⁴

In the case of the MicroCarb mission, the binning represents a compromise between the objective of achieving a good signal to noise ratio (SNR) and maintaining a suitable ground grid, which has a resolution of 13.5 km in ACT and 9 km along the track.

⁵

It can be possible to defocus the instrument in order to avoid this dependence on the slit illumination. However, the introduction of a defocus can potentially compromise the precision of the instrument, and thus it was ultimately decided to exclude this option from the MicroCarb instrument.

⁶

In practice, there is no information available regarding the non-uniformity of a given scene from the measured spectra. It is only during the inversion process, when estimating the ISRFs, that it becomes apparent (by looking at the measured spectra and the associated residuals) that the ISRFs have been modified. For a given reference spectrum, non-uniform scenes are generated using asymmetric ISRFs; see Pittet et al. (2019) for more details.

Articles

Short summary

This paper explores new techniques based on sparse representations for estimating the spectral response functions of high-resolution spectrometers. The method is highly competitive, with commonly used parametric models yielding more accurate estimates while accounting for wavelength dependence. The resulting normalized estimation errors of the spectrometer spectral responses are less than 1 %, which will allow for better quantification of trace gas concentrations at the Earth surface.