<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">AMT</journal-id><journal-title-group>
    <journal-title>Atmospheric Measurement Techniques</journal-title>
    <abbrev-journal-title abbrev-type="publisher">AMT</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Atmos. Meas. Tech.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1867-8548</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/amt-19-3095-2026</article-id><title-group><article-title>A hybrid optimal estimation and machine learning approach to predict atmospheric composition</article-title><alt-title>OE–ML Fusion for TROPESS CO</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Werner</surname><given-names>Frank</given-names></name>
          <email>frank.werner@jpl.nasa.gov</email>
        <ext-link>https://orcid.org/0000-0002-7141-0934</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Bowman</surname><given-names>Kevin W.</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-8659-1117</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Lee</surname><given-names>Seungwon</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-0280-5713</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Laughner</surname><given-names>Joshua L.</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-8599-4555</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Payne</surname><given-names>Vivienne H.</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>McDuffie</surname><given-names>James L.</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-9408-5695</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>Jet Propulsion Laboratory, California Institute of Technology, 4800 Oak Grove Drive, Pasadena, CA 91109, USA</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Frank Werner (frank.werner@jpl.nasa.gov)</corresp></author-notes><pub-date><day>11</day><month>May</month><year>2026</year></pub-date>
      
      <volume>19</volume>
      <issue>9</issue>
      <fpage>3095</fpage><lpage>3109</lpage>
      <history>
        <date date-type="received"><day>1</day><month>October</month><year>2025</year></date>
           <date date-type="rev-request"><day>7</day><month>October</month><year>2025</year></date>
           <date date-type="rev-recd"><day>26</day><month>March</month><year>2026</year></date>
           <date date-type="accepted"><day>20</day><month>April</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Frank Werner et al.</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://amt.copernicus.org/articles/19/3095/2026/amt-19-3095-2026.html">This article is available from https://amt.copernicus.org/articles/19/3095/2026/amt-19-3095-2026.html</self-uri><self-uri xlink:href="https://amt.copernicus.org/articles/19/3095/2026/amt-19-3095-2026.pdf">The full text article is available as a PDF file from https://amt.copernicus.org/articles/19/3095/2026/amt-19-3095-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e124">We present a HYbrid REtrieval Framework (HYREF) that predicts subcolumn carbon monoxide (CO) concentrations from Cross-track Infrared Sounder (CrIS) observations, trained to replicate the TRopospheric Ozone and its Precursors from Earth System Sounding (TROPESS) retrievals based on optimal estimation (OE). Unlike the OE algorithm, which produces retrievals for only a small fraction of available CrIS observations due to computationally expensive but physically accurate radiative transfer, the addition of machine learning (ML) techniques enables full coverage by providing high-resolution predictions for every valid CrIS sample. Importantly, in addition to CO concentrations, TROPESS-HYREF also predicts key retrieval diagnostics, namely column averaging kernels, degrees of freedom, and retrieval errors, that are essential for meaningful comparison with other observations, models, and ingestion into data assimilation. The framework is designed to emulate and extend the OE retrieval, rather than replace it, by providing full spatial coverage and enhanced resolution consistent with the underlying physical solution.</p>

      <p id="d2e127">The new framework achieves excellent performance with correlation coefficients <inline-formula><mml:math id="M1" display="inline"><mml:mrow><mml:mi>r</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0.99</mml:mn></mml:mrow></mml:math></inline-formula> and a bias <inline-formula><mml:math id="M2" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.1</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> when benchmarked against an independent test set, and reproduces fine-scale spatial patterns in CO fields observed during a major wildfire over North America. A scale analysis reveals substantial variability in CO concentrations below the nominal <inline-formula><mml:math id="M3" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.80</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula> resolution of the TROPESS OE retrieval, which TROPESS-HYREF successfully resolves. Inference is computationally efficient, with daily global predictions completed in minutes on a single compute node. By filling observational gaps while maintaining consistency with the OE retrieval, this fusion of OE-derived physical information and ML-driven efficiency provides a practical pathway to high-resolution atmospheric CO monitoring with robust diagnostics.</p>
  </abstract>
    
<funding-group>
<award-group id="gs1">
<funding-source>National Aeronautics and Space Administration</funding-source>
<award-id>80NM0020F0062</award-id>
</award-group>
</funding-group>
</article-meta>
  <notes notes-type="copyrightstatement">
  
      <p id="d2e173">© 2025 Jet Propulsion Laboratory, California Institute of Technology. Government sponsorship acknowledged.</p>
</notes></front>
<body>
      


<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e184">Carbon monoxide (CO) is a chemically reactive trace gas and key atmospheric pollutant, produced primarily through incomplete combustion of biomass and fossil fuels <xref ref-type="bibr" rid="bib1.bibx30" id="paren.1"/>, as well as through secondary production from the oxidation of methane (<inline-formula><mml:math id="M4" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CH</mml:mi><mml:mn mathvariant="normal">4</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) and non-methane hydrocarbons <xref ref-type="bibr" rid="bib1.bibx28" id="paren.2"><named-content content-type="pre">e.g.</named-content></xref>. It plays a central role in atmospheric chemistry by serving as a major sink for hydroxyl radicals (OH, <xref ref-type="bibr" rid="bib1.bibx34" id="altparen.3"/>), thereby influencing the oxidative capacity of the atmosphere and the lifetime of <inline-formula><mml:math id="M5" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CH</mml:mi><mml:mn mathvariant="normal">4</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> <xref ref-type="bibr" rid="bib1.bibx24" id="paren.4"><named-content content-type="pre">e.g.</named-content></xref>. Due to its intermediate lifetime (weeks to months), CO serves as a valuable tracer for long-range pollution transport and chemical processing in the troposphere <xref ref-type="bibr" rid="bib1.bibx12 bib1.bibx17" id="paren.5"><named-content content-type="pre">e.g.</named-content></xref>. It also contributes indirectly to radiative forcing via the formation of tropospheric ozone (<inline-formula><mml:math id="M6" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) and carbon dioxide (<inline-formula><mml:math id="M7" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CO</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>), classifying it as a short-lived climate pollutant <xref ref-type="bibr" rid="bib1.bibx4 bib1.bibx29" id="paren.6"/>.</p>
      <p id="d2e256">Satellite observations of CO, beginning with the Measurements of Air Pollution from Satellites (MAPS, <xref ref-type="bibr" rid="bib1.bibx43" id="altparen.7"/>) in the early 1980s and continuing with instruments such as Measurement of Pollution in the Troposphere (MOPITT) <xref ref-type="bibr" rid="bib1.bibx16" id="paren.8"/>, Atmospheric Infrared Sounder (AIRS) <xref ref-type="bibr" rid="bib1.bibx2" id="paren.9"/>, Tropospheric Emission Spectrometer (TES) <xref ref-type="bibr" rid="bib1.bibx3" id="paren.10"/>, Infrared Atmospheric Sounding Interferometer (IASI) <xref ref-type="bibr" rid="bib1.bibx13" id="paren.11"/>, Cross-track Infrared Sounder (CrIS) <xref ref-type="bibr" rid="bib1.bibx27" id="paren.12"/>, TROPOspheric Monitoring Instrument (TROPOMI) <xref ref-type="bibr" rid="bib1.bibx51" id="paren.13"/>, Greenhouse Gases Observing Satellite 2 (GOSAT–2) <xref ref-type="bibr" rid="bib1.bibx41" id="paren.14"/> and Geostationary Interferometric Infrared Sounder (GIIRS) <xref ref-type="bibr" rid="bib1.bibx59" id="paren.15"/>, have provided a long-term, global perspective on CO distributions, emission sources, and trends <xref ref-type="bibr" rid="bib1.bibx57 bib1.bibx8" id="paren.16"><named-content content-type="pre">e.g.</named-content></xref>. These datasets support air quality monitoring, inverse modeling of emissions, and evaluation of chemistry-climate models <xref ref-type="bibr" rid="bib1.bibx18 bib1.bibx19 bib1.bibx7" id="paren.17"><named-content content-type="pre">e.g.</named-content></xref>. While global CO concentrations have declined over the past two decades due to improved combustion efficiency and decreased biomass burning <xref ref-type="bibr" rid="bib1.bibx46 bib1.bibx60" id="paren.18"><named-content content-type="pre">e.g.</named-content></xref>, recent regional fire trends (see, e.g. <xref ref-type="bibr" rid="bib1.bibx35" id="altparen.19"/>), and evolving air quality policies continue to shape CO variability, underscoring the need for sustained satellite observations with well-characterized uncertainties <xref ref-type="bibr" rid="bib1.bibx48" id="paren.20"><named-content content-type="pre">e.g.</named-content></xref>. Nevertheless, changes in climate and extreme events can lead to substantial biomass burning events for which CO is a critical tracer to infer emissions <xref ref-type="bibr" rid="bib1.bibx9 bib1.bibx10 bib1.bibx39" id="paren.21"/>.</p>
      <p id="d2e314">The NASA TRopospheric Ozone and its Precursors from Earth System Sounding (TROPESS) project  generates consistent, long-term records of tropospheric ozone and related trace gases, including CO <xref ref-type="bibr" rid="bib1.bibx6 bib1.bibx58" id="paren.22"/>. Building on the TES legacy, TROPESS applies a unified optimal estimation (OE, see, e.g.  <xref ref-type="bibr" rid="bib1.bibx44" id="altparen.23"/>) algorithm across multiple satellite platforms, supported by a comprehensive ground data system <xref ref-type="bibr" rid="bib1.bibx5 bib1.bibx21" id="paren.24"/>. Emphasis is placed on rigorous uncertainty analysis and intercomparisons with independent observations to ensure the accuracy needed for trend detection. Figure <xref ref-type="fig" rid="F1"/>a shows the spatial distribution of operational TROPESS Level 2 (L2) CO retrievals over the western US on 10 June 2023, based on CrIS measurements. A regional zoom (red box) reveals that, due to computational constraints, only <inline-formula><mml:math id="M8" display="inline"><mml:mrow><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">1.5</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> of the available CrIS soundings are processed, leaving substantial gaps in global CO monitoring.</p>

      <fig id="F1" specific-use="star"><label>Figure 1</label><caption><p id="d2e345"><bold>(a)</bold> Geolocations of L2 CO retrievals (blue dots) and L1B CrIS  radiances (orange dots) over the western US on 10 June 2023. <bold>(b)</bold> Simplified sketch of the ML setup, where three features (<inline-formula><mml:math id="M9" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold">F</mml:mi><mml:mtext>1–3</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>; radiances at 2181.88 <inline-formula><mml:math id="M10" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, sensor viewing angle, and surface altitude) are used as input for the ML model in order to predict three labels (<inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold">L</mml:mi><mml:mtext>1–3</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>; CO concentrations, retrieval error, and an individual column averaging kernel). <bold>(c)</bold> Simplified sketch of the ML model. The variables <inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold">F</mml:mi><mml:mtext>1–3</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> are converted to a two-dimensional input matrix which connects to neurons in two hidden layers, and map to a two-dimensional output matrix, which provides <inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold">L</mml:mi><mml:mtext>1–3</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>.</p></caption>
        <graphic xlink:href="https://amt.copernicus.org/articles/19/3095/2026/amt-19-3095-2026-f01.jpg"/>

      </fig>

      <p id="d2e421">Machine learning (ML) approaches, whose use in atmospheric science has expanded in recent years <xref ref-type="bibr" rid="bib1.bibx26 bib1.bibx45 bib1.bibx53 bib1.bibx55 bib1.bibx47" id="paren.25"><named-content content-type="pre">e.g.</named-content></xref>, offer a promising path forward. ML models can efficiently learn complex, nonlinear relationships and provide rapid inference across large datasets. However, limitations in explainability and uncertainty quantification continue to hinder their broader application in remote sensing <xref ref-type="bibr" rid="bib1.bibx49" id="paren.26"/>.</p>
      <p id="d2e432">In contrast to conventional OE retrievals, which produce not only the retrieved quantities of interest but also key diagnostics such as <inline-formula><mml:math id="M14" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="italic">χ</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> statistics, degrees of freedom (DoF), retrieval precision, error covariance, and column averaging kernels (AK), ML methods lack direct analogues to these quantities. Yet such diagnostics are critical for model-observation comparisons, data assimilation, and quality control <xref ref-type="bibr" rid="bib1.bibx32 bib1.bibx38 bib1.bibx52" id="paren.27"/>.</p>
      <p id="d2e449">Here, we present a novel hybrid framework that combines the strengths of OE and ML to generate high-resolution estimates of CO column concentrations from CrIS radiances. Our approach leverages OE retrievals as both training targets and sources of physically meaningful prior information, while enabling ML-driven capabilities such as rapid upscaling and the emulation of retrieval diagnostics. This fusion fills observational gaps left by current processing limits and provides an interpretable, uncertainty-aware pathway for incorporating ML into operational remote sensing pipelines. Importantly, the ML component is designed to emulate the OE retrieval and its associated diagnostics, rather than to replace or surpass the underlying physical solution, thereby extending OE-derived information to full spatial coverage. The TROPESS-HYREF framework is therefore intended as a hybrid OE-ML system, in which the ML model operates alongside OE, for example by filling gaps in retrieval coverage, and can be periodically retrained as new OE results become available.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Data</title>
      <p id="d2e460">The CrIS instrument, onboard NOAA's Joint Polar Satellite System–1 (JPSS–1, also known as NOAA-20), is a Fourier Transform Spectrometer that captures Earth views across 30 cross-track interferograms, providing a swath width of 2200 <inline-formula><mml:math id="M15" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>. Each interferogram contains a <inline-formula><mml:math id="M16" display="inline"><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> array of fields of view (FOVs), with each circular FOV having a diameter of 14 <inline-formula><mml:math id="M17" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> at nadir. CrIS data are processed to provide calibrated Level 1B (L1B) radiances in three spectral bands: 660–1095 <inline-formula><mml:math id="M18" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (longwave), 1210–1750 <inline-formula><mml:math id="M19" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (midwave), and 2155–2550 <inline-formula><mml:math id="M20" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> (shortwave). The instrument unapodized spectral resolution is 0.625–2.5 <inline-formula><mml:math id="M21" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>. NASA's version 2 L1B radiances are available from the Goddard Earth Sciences Data and Information Services Center (GES DISC) <xref ref-type="bibr" rid="bib1.bibx50" id="altparen.28"/>.</p>
      <p id="d2e551">TROPESS trace gas retrievals are provided on a reduced horizontal grid of <inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.8</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula> by the MUSES data processing system <xref ref-type="bibr" rid="bib1.bibx21 bib1.bibx22 bib1.bibx23" id="paren.29"/>. These retrievals are based on the TES L2 processing algorithm <xref ref-type="bibr" rid="bib1.bibx5" id="paren.30"/> and utilize an OE retrieval approach <xref ref-type="bibr" rid="bib1.bibx44" id="paren.31"/>. TROPESS retrievals of carbon monoxide (CO) are processed operationally, have undergone extensive verification <xref ref-type="bibr" rid="bib1.bibx58 bib1.bibx35" id="paren.32"><named-content content-type="pre">e.g.</named-content></xref>, and are accessible via the GES DISC. In this study, single-FOV CrIS–MUSES retrievals from the TROPESS forward stream were used <xref ref-type="bibr" rid="bib1.bibx6" id="paren.33"/>.</p>
      <p id="d2e583">Data in this study are comprised of CrIS and TROPESS data over April 2023–January 2025.</p>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>ML model</title>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Setup and training</title>
      <p id="d2e601">We developed, trained, and evaluated a ML model to simultaneously predict a variety of TROPESS CO variables, primarily using observed CrIS radiances and geolocation data as inputs. This setup is illustrated in the simplified diagram in Fig. <xref ref-type="fig" rid="F1"/>b, where we drastically limit the input and output variables to aid visibility. In this example the model uses three features (<inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold">F</mml:mi><mml:mtext>1–3</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>) as input: CrIS radiances at 2181.88 <inline-formula><mml:math id="M24" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, the sensor viewing angle, and the surface altitude, respectively. These features are matrices, where each element <inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mtext>1–3</mml:mtext><mml:mi>s</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> corresponds to one of the <inline-formula><mml:math id="M26" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> samples, indexed as <inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula>. The ML model maps these features to a set of output labels (<inline-formula><mml:math id="M28" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold">L</mml:mi><mml:mtext>1–3</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>), which in this simplified example are the CO total column concentrations, the total column retrieval error, and the column AK at <inline-formula><mml:math id="M29" display="inline"><mml:mrow><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">511</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M30" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula>. Like the features, these labels are matrices that contain elements <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:msubsup><mml:mi>l</mml:mi><mml:mtext>1–3</mml:mtext><mml:mi>s</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> for each individual sample. Again, <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:mi>s</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula> denotes the individual sample (i.e. CrIS column).</p>
      <p id="d2e734">The ML model developed in this study is a feedforward artificial neural network (ANN), which maps the input to the output through several hidden layers, each consisting of a large number of interconnected neurons. A simplified schematic of an example ANN, with two hidden layers containing 7 and 5 neurons, respectively, is shown in Fig. <xref ref-type="fig" rid="F1"/>c. This diagram also illustrates how the geolocated features are transformed into two-dimensional input and output matrices and how they connect to the individual neurons.</p>
      <p id="d2e739">The exact model structure and hyperparameters (i.e. model settings) are determined through the procedures described in <xref ref-type="bibr" rid="bib1.bibx54 bib1.bibx55" id="text.34"/>. By applying <inline-formula><mml:math id="M33" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> fold cross-validation across a range of potential model setups, the ideal hyperparameters were found to be two hidden layers with 1506 neurons per layer, “Rectified Linear Unit” activation functions after each hidden layer, an L2 weight decay parameter of <inline-formula><mml:math id="M34" display="inline"><mml:mrow><mml:mn mathvariant="normal">5.00</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">34</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, and the “Adaptive Moment Estimation” optimizer with a learning rate of <inline-formula><mml:math id="M35" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>. The loss function minimized during training is the mean squared error. For each training iteration, batches of samples are passed through the model in a forward pass to compute predictions, followed by a backward pass in which model weights are updated via backpropagation. Each mini-batch contains 8192 samples. Further details on these parameters and their impact are provided in <xref ref-type="bibr" rid="bib1.bibx42" id="text.35"/>, <xref ref-type="bibr" rid="bib1.bibx25" id="text.36"/>,  and <xref ref-type="bibr" rid="bib1.bibx54" id="text.37"/>.</p>
      <p id="d2e798">Model training was carried out using the “Keras” library for Python (version 2.10.0; <xref ref-type="bibr" rid="bib1.bibx11" id="altparen.38"/>), with “TensorFlow” (version 2.10.0) as the backend <xref ref-type="bibr" rid="bib1.bibx1" id="paren.39"/>. Of the available CrIS radiances and TROPESS retrievals over April 2023–January 2025, <inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:mn mathvariant="normal">98</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> of randomly selected samples were used as training data. After each training iteration, the model's performance was evaluated for an independent validation dataset comprised of <inline-formula><mml:math id="M37" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> of the available data (approximately 185 000 samples). After several thousand iterations, the model weights corresponding to the best performance scores on the validation set were saved.</p>
      <p id="d2e832">The specific features used for the CO model include radiances from all 2224 spectral channels, the FOV index, the latitude and longitude of each sample, UTC time, a day/night flag, the sensor viewing angle, the day of the year, and the TROPESS subcolumn a priori values. This yields an input matrix containing 2235 variables. Note that the surface altitude was included for models predicting retrievals and diagnostics for other TROPESS species. We tested reduced channel sets focused on CO-sensitive regions but found that using the full spectrum provided slightly improved performance, likely due to additional information on atmospheric state variables (e.g. temperature and humidity). The predicted labels of the CO model consist of the subcolumn concentrations, column AKs, and subcolumn retrieval errors, resulting in an output matrix containing 24 variables.</p>
      <p id="d2e835">Prior to training, these inputs and outputs were filtered to remove invalid samples using a set of basic quality filters, including non-finite values, fill values, extreme outliers, failed retrieval quality flags, and target values outside the valid retrieval range. In addition, extreme outliers in the label distributions were masked using percentile-based tail cutoffs, and both input features and output labels were standardized before training.</p>
      <p id="d2e838">Model training was performed on a high-performance computing cluster and took <inline-formula><mml:math id="M38" display="inline"><mml:mrow><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">10</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M39" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">d</mml:mi></mml:mrow></mml:math></inline-formula> to converge to a solution for the <inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">12</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">000</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">000</mml:mn></mml:mrow></mml:math></inline-formula> model weights.</p>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Evaluation</title>
      <p id="d2e883">Model performance is evaluated using an independent test dataset, which consists of the remaining <inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> of randomly sampled data that were not included in the training or validation process. Ideally, (i) the model should reliably predict CO concentration retrievals and OE diagnostics for these data points, even though the ML algorithm was not trained on them, and (ii) performance metrics should be similar to those derived from the training and validation datasets. It is important to note that the objective of the ML model is not to generalize beyond the statistical characteristics of the OE retrieval, but to emulate the OE solution and provide full spatial coverage for the same observing system. As such, the random split into training, validation, and test datasets ensures that the model is evaluated on samples that were not explicitly used during training, while still reflecting the same underlying distribution of atmospheric states and observational conditions. Given the strong spatial and temporal correlations inherent in satellite observations, this approach is appropriate for assessing the model's ability to reproduce OE retrievals across the full range of conditions encountered in the dataset. In operational use, the model is continuously retrained with newly available OE retrievals, ensuring consistency with evolving atmospheric variability.</p>
      <p id="d2e898">Figure <xref ref-type="fig" rid="F2"/>a presents a joint histogram of total column CO from the ML and OE algorithms for over 180 000 samples in the independent test dataset. Yellow colors represent regions with the highest density of data points, while blue colors correspond to areas with very few samples. The good agreement between the ML and OE results is evident, as most observations are narrowly clustered around the <inline-formula><mml:math id="M42" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> line (indicated by the gray, dashed line). Five performance metrics are provided in the panel: Pearson's product-moment correlation coefficient (<inline-formula><mml:math id="M43" display="inline"><mml:mi>r</mml:mi></mml:math></inline-formula>), the root-mean-square deviation (RMSD), the median deviation between the predicted and retrieved CO (50p, i.e. the bias), and the 1st and 99th percentiles of the deviation. Notably, for the total column concentrations, we find <inline-formula><mml:math id="M44" display="inline"><mml:mrow><mml:mi>r</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0.99</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M45" display="inline"><mml:mrow><mml:mtext>RMSD</mml:mtext><mml:mo>=</mml:mo><mml:mn mathvariant="normal">3.11</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">16</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M46" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">molecules</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, a median difference of <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">7.27</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">14</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M48" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">molecules</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, and maximum absolute differences for the majority of samples of <inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">1.00</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">17</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M50" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">molecules</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>.</p>

      <fig id="F2" specific-use="star"><label>Figure 2</label><caption><p id="d2e1043"><bold>(a)</bold> Joint histogram and <bold>(b)</bold> Bland–Altman plot of predicted and retrieved total column CO concentrations from the test data set. <bold>(c)</bold>–<bold>(h)</bold> Similar to <bold>(a)</bold> and <bold>(b)</bold>, but for the column averaging kernel (AK) at 162 <inline-formula><mml:math id="M51" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula>, total column retrieval error, and degrees of freedom (DoF).</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/3095/2026/amt-19-3095-2026-f02.jpg"/>

        </fig>

      <p id="d2e1079">Similar comparisons for tropospheric column concentrations, total AK at 162 <inline-formula><mml:math id="M52" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula>, and DoF are shown in Fig. <xref ref-type="fig" rid="F2"/>c, e, and g. Again, the distributions closely follows the <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> line, with similarly high correlations (<inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:mi>r</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0.99</mml:mn></mml:mrow></mml:math></inline-formula>). The lowest correlation occurs for the column AK at the lowest atmospheric level (not shown), where <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:mi>r</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.98</mml:mn></mml:mrow></mml:math></inline-formula>. These performance metrics are almost identical to those obtained for the validation dataset, where the comparison of predicted and retrieved total column CO concentrations yields <inline-formula><mml:math id="M56" display="inline"><mml:mrow><mml:mi>r</mml:mi><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">0.99</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:mtext>RMSD</mml:mtext><mml:mo>=</mml:mo><mml:mn mathvariant="normal">3.14</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">16</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M58" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">molecules</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, a median difference of <inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:mn mathvariant="normal">6.57</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">14</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M60" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">molecules</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, and maximum absolute differences for the majority of samples of <inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">1.00</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">17</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M62" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">molecules</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e1244">Performance metrics are consistent across training, validation, and test datasets, with nearly identical correlation coefficients (<inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:mi mathvariant="normal">Δ</mml:mi><mml:mi>r</mml:mi><mml:mo>|</mml:mo><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.0006</mml:mn></mml:mrow></mml:math></inline-formula>), normalized RMSD values (<inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:mi mathvariant="normal">Δ</mml:mi><mml:mtext>RMSD</mml:mtext><mml:mo>|</mml:mo><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.21</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, apart from the last two column AK levels deep in the stratosphere where values of effectively 0), and biases (<inline-formula><mml:math id="M65" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:mi mathvariant="normal">Δ</mml:mi><mml:mtext>bias</mml:mtext><mml:mo>|</mml:mo><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.12</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>) for all predicted variables. This indicates that there is no evidence of overfitting and that the model exhibits stable behavior across the available datasets.</p>
      <p id="d2e1309">We note that the use of a random split does not enforce strict independence between training, validation, and test datasets, as atmospheric states exhibit strong spatial and temporal correlations. In this study, the purpose of the split is therefore not to assess fully independent generalization, but to evaluate how well the model reproduces OE retrieval behavior across the distribution of atmospheric states and viewing geometries sampled by the instrument.</p>
      <p id="d2e1312">To ensure that the split remains representative, we verified that the distributions of key variables are statistically consistent across training, validation, and test datasets using Kolmogorov–Smirnov tests. Combined with the consistent performance metrics reported above, this indicates that the model generalizes well within the sampled observational distribution. This behavior is consistent with the intended hybrid OE–ML application, in which the model is designed to operate on the same observational distribution as OE.</p>
      <p id="d2e1315">To complement the regression analysis, we employ Bland–Altman plots to further assess the agreement between the OE and predicted results. This approach highlights systematic differences and potential heteroscedasticity that may not be apparent in standard correlation-based evaluations, especially for non-Gaussian or magnitude-dependent variability, and has been shown to provide a more robust framework for intercomparison of geophysical datasets <xref ref-type="bibr" rid="bib1.bibx33" id="paren.40"><named-content content-type="pre">e.g.</named-content></xref>. Panels b, d, f, and h show the Bland–Altman distributions, with the paired mean of predicted and OE values on the <inline-formula><mml:math id="M66" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> axis and the difference between the two datasets on the <inline-formula><mml:math id="M67" display="inline"><mml:mi>y</mml:mi></mml:math></inline-formula> axis, along with three horizontal dashed lines. The central line denotes the mean difference (bias), while the outer lines show the <inline-formula><mml:math id="M68" display="inline"><mml:mrow><mml:mn mathvariant="normal">95</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> limits of agreement (mean <inline-formula><mml:math id="M69" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">1.96</mml:mn></mml:mrow></mml:math></inline-formula> standard deviations), indicating the interval that contains approximately <inline-formula><mml:math id="M70" display="inline"><mml:mrow><mml:mn mathvariant="normal">95</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> of the differences under the assumption of normally distributed residuals.</p>
      <p id="d2e1372">The distributions show no evidence of magnitude-dependent bias or systematic slope. The proportion of data points within the <inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:mn mathvariant="normal">95</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> limits of agreement (<inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:mn mathvariant="normal">95</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:mn mathvariant="normal">95</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:mn mathvariant="normal">96</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:mn mathvariant="normal">95</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, respectively) aligns closely with expectations for normally distributed residuals, and <inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:mn mathvariant="normal">79</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M77" display="inline"><mml:mrow><mml:mn mathvariant="normal">80</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M78" display="inline"><mml:mrow><mml:mn mathvariant="normal">86</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:mn mathvariant="normal">78</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> of points fall within <inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> standard deviation. This indicates well-behaved error distributions with no pronounced evidence of heavy tails or heteroscedastic spread, suggesting that the model adequately captures variability across the full dynamic range. This is particularly relevant for machine learning retrievals, which can otherwise struggle in the presence of heteroscedastic relationships between observables and geophysical variables <xref ref-type="bibr" rid="bib1.bibx37" id="paren.41"><named-content content-type="pre">e.g.</named-content></xref>, and indicates that such effects are not evident in our ML predictions. Minor increases in spread near the peak of the distributions reflect regions of highest data density and do not indicate systematic bias or magnitude-dependent variability.</p>
      <p id="d2e1500">The close agreement between predicted and OE-derived diagnostics indicates that the ML model not only reproduces the retrieved state, but also captures the associated sensitivity and uncertainty characteristics of the OE solution. This consistency is critical for downstream applications, such as data assimilation and model evaluation, where the proper interpretation of retrievals depends on the availability of reliable AKs and error estimates.</p>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Results</title>
<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Example maps</title>
      <p id="d2e1520">Figure <xref ref-type="fig" rid="F3"/>a presents a representative example scene of total column CO from the TROPESS OE retrieval on 10 June 2023. A large area of enhanced CO concentrations (<inline-formula><mml:math id="M81" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">3.00</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">18</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M82" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">molecules</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>) is evident over Western Canada, associated with the unprecedented wildfire season that year <xref ref-type="bibr" rid="bib1.bibx31" id="paren.42"/>. These fires produced large smoke plumes that affected portions of Canada and the US for several weeks, before spreading across the Northern Hemisphere. Notably, enhanced CO concentrations (<inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">2.50</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">18</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M84" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">molecules</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>) are also recorded over Eastern Canada, the entire Eastern US, and parts of the Atlantic Ocean.</p>

      <fig id="F3" specific-use="star"><label>Figure 3</label><caption><p id="d2e1599"><bold>(a)</bold> Example scene of OE retrievals of total column CO over North America on 10 June 2023. <bold>(b)</bold> Similar to <bold>(a)</bold>, but showing the associated OE degrees of freedom (DoF). <bold>(c, d)</bold> Similar to <bold>(a, b)</bold>. but for the ML predictions. <bold>(e, f)</bold> Differences between colocated ML predictions and OE retrievals, and their respective error estimates.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/3095/2026/amt-19-3095-2026-f03.jpg"/>

        </fig>

      <p id="d2e1625">The associated OE DoF are shown in Fig. <xref ref-type="fig" rid="F3"/>b. Areas of moderate to high CO concentrations generally coincide with regions of elevated DoF. Smaller <inline-formula><mml:math id="M85" display="inline"><mml:mrow><mml:mtext>DoF</mml:mtext><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.6</mml:mn></mml:mrow></mml:math></inline-formula> are observed over Greenland, the Atlantic Ocean, and over isolated regions over the continental US. These reduced DoF are indicative of lower retrieval sensitivity  and are likely due to the interference of clouds or poor thermal contrast.</p>
      <p id="d2e1643">Figure <xref ref-type="fig" rid="F3"/>c and d show the ML predictions for total CO concentrations and DoF, respectively. These results are derived for each CrIS L1B sample. The increased spatial resolution is particularly noticeable over the oceans, but even over land the ML results capture much finer spatial features, while faithfully reproducing the CO enhancements and DoF from the OE retrieval. Divergence maps in Fig. <xref ref-type="fig" rid="F3"/>e and f illustrate the differences between predicted and retrieved results. The median differences are <inline-formula><mml:math id="M86" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.1</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> for both variables, and for the majority of samples (i.e. within the 5th and 95th percentiles), ML predictions are within <inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">2.40</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> for total CO concentrations and within <inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">4.12</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> for DoF. Overall, the difference between ML and OE total column CO concentrations exceeds the retrieval error for only 14 of the 5308 samples in the scene (<inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.26</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>). Similarly, excellent agreement is observed for the retrieval errors (not shown), with a majority of ML predictions within <inline-formula><mml:math id="M90" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">6.11</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> and a median difference of <inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.04</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e1731">The close agreement between the ML predictions and OE retrieval shown in Fig. <xref ref-type="fig" rid="F3"/> primarily reflects interpolation within the sampled observational distribution, as this day is part of the training period, rather than fully independent spatiotemporal generalization. To further assess the performance of the ML model beyond the training period, Fig. <xref ref-type="fig" rid="F4"/> shows a global scene for 8 June 2025, which lies outside the training range (April 2023–January 2025). This experiment represents a limited temporal extrapolation test rather than a comprehensive assessment of long-term model stability. The results demonstrate that the model retains strong predictive skill under these conditions, indicating that it can generalize to unseen temporal states to a certain extent. However, we emphasize that such standalone predictive capability is not the primary objective of the framework. The model is designed to operate as part of a hybrid OE–ML system, benefiting from periodic retraining and remaining closely tied to the evolving distribution of OE retrievals.</p>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e1740"><bold>(a, c, e)</bold> Similar to Fig. 3a, c, and e, but showing global total column CO for 8 June 2025. <bold>(b, d, f)</bold> Similar to <bold>(a, c, e)</bold>, but for the column averaging kernel (AK) at 383 <inline-formula><mml:math id="M92" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula>. <bold>(g)</bold> Global mean column AK profile as a function of pressure, showing OE results (blue), associated variability (<inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula>, shaded), and ML predictions (orange). <bold>(h)</bold> Probability density functions of the difference between ML and OE total column retrieval errors (blue) and tropospheric column retrieval errors (orange). <bold>(i)</bold> Probability density function of the difference between ML and OE degrees of freedom (DoF). This scene lies outside the training period.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/3095/2026/amt-19-3095-2026-f04.jpg"/>

        </fig>

      <p id="d2e1787">Similar to Fig. 3, the ML predictions reproduce the spatial structure of total column CO with high fidelity, including regions of enhanced concentrations associated with wildfire activity over North America. Differences between ML predictions and colocated OE retrievals remain small and spatially unstructured. The median difference is <inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.13</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, and the majority of samples (again, within the 5th and 95th percentiles) has ML CO concentrations within <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">3.00</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> of the OE results.</p>
      <p id="d2e1818">In addition to column concentrations, Fig. 4 demonstrates that the ML model accurately reproduces key retrieval diagnostics. The column AK exhibit a maximum at 383 <inline-formula><mml:math id="M96" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">hPa</mml:mi></mml:mrow></mml:math></inline-formula> and is shown in panels b, d, and f, showing strong agreement in both spatial structure and magnitude. The median difference is <inline-formula><mml:math id="M97" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.67</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, and the majority of ML predictions lie within <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">8.50</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> of the OE results. The global mean AK profiles (Fig. 4g) are nearly identical between OE and ML, with differences well within the natural variability of the OE retrieval. Median differences over all vertical levels are within 0.001 <inline-formula><mml:math id="M99" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:mn mathvariant="normal">90</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> of predictions are within 0.028 of the true AK value at any level. This translates to <inline-formula><mml:math id="M101" display="inline"><mml:mrow><mml:mn mathvariant="normal">1.5</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M102" display="inline"><mml:mrow><mml:mn mathvariant="normal">10</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, respectively, at levels where the AK is noticeably different from zero, i.e. in the troposphere above the surface.</p>
      <p id="d2e1903">Likewise, differences in retrieval errors (Fig. 4h) and degrees of freedom (Fig. 4i) are centered near zero and exhibit narrow distributions, with full-width-at-half-maximum values of <inline-formula><mml:math id="M103" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.02</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">18</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M104" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">molec</mml:mi><mml:mo>.</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> and 0.032, respectively. This behavior indicates that the ML-predicted diagnostics are statistically consistent with those derived from OE, within the intrinsic variability of the retrieval.</p>
      <p id="d2e1940">Together, these results demonstrate that the ML framework not only reproduces the retrieved state, but also captures the associated sensitivity and uncertainty characteristics of the OE solution for unseen atmospheric conditions, while resolving finer spatial structures beyond the native OE sampling. These characteristics suggest that the predicted diagnostics retain the key properties required for downstream applications such as data assimilation, although a full assessment within an assimilation framework is beyond the scope of this study.</p>
</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>The added value from CO at higher spatial resolution</title>
      <p id="d2e1951">A key question is whether the ML product captures physically meaningful CO variability below the nominal <inline-formula><mml:math id="M105" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.80</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula> TROPESS retrieval resolution, or whether it primarily behaves as a spatial interpolation of the OE field. While interpolation can reconstruct smooth fields between observations, it cannot introduce new information at unresolved spatial scales.</p>
      <p id="d2e1964">To investigate this, we employ two complementary approaches: (i) comparing the ML-predicted CO fields with linearly interpolated TROPESS CO retrievals, and (ii) analyzing the spatial power spectral densities <inline-formula><mml:math id="M106" display="inline"><mml:mrow><mml:msub><mml:mi>E</mml:mi><mml:mi mathvariant="normal">I</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> to identify scale-dependent variability, particularly in the sub-<inline-formula><mml:math id="M107" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.80</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula> domain. Together, these approaches allow us to assess whether interpolation is sufficient to capture the underlying structure, or whether additional variability persists at smaller scales that requires the higher-resolution information provided by the ML model.</p>
      <p id="d2e1994">Figure <xref ref-type="fig" rid="F5"/>a shows the interpolated OE CO retrievals. This field appears significantly smoother than the ML predictions (Fig. <xref ref-type="fig" rid="F3"/>c), especially for the region of enhanced CO over Western Canada and the Northeastern US. The difference between the interpolated and predicted CO concentrations is illustrated in Fig. <xref ref-type="fig" rid="F5"/>b, where blue and red colors indicate underestimation and overestimation by the interpolation, respectively. Deviations are centered around <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.50</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> but can exceed <inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">30</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, especially in areas of enhanced CO. Maximum differences increase further, to <inline-formula><mml:math id="M110" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">48</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M111" display="inline"><mml:mrow><mml:mo>±</mml:mo><mml:mn mathvariant="normal">58</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>, when using nearest-neighbor or cubic spline interpolation, respectively.</p>

      <fig id="F5" specific-use="star"><label>Figure 5</label><caption><p id="d2e2061"><bold>(a)</bold> Interpolated TROPESS total column CO on 10 June 2023. <bold>(b)</bold> Difference between interpolated and predicted CO. <bold>(c, d)</bold> Average power spectral density <inline-formula><mml:math id="M112" display="inline"><mml:mrow><mml:msub><mml:mi>E</mml:mi><mml:mi mathvariant="normal">I</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> (black) as a function of wavenumber <inline-formula><mml:math id="M113" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula> for CO in latitudinal and longitudinal direction; <inline-formula><mml:math id="M114" display="inline"><mml:mrow><mml:msub><mml:mi>E</mml:mi><mml:mi mathvariant="normal">I</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> for radiances at 2183.125 <inline-formula><mml:math id="M115" display="inline"><mml:mrow class="unit"><mml:msup><mml:mi mathvariant="normal">cm</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula> are shown in gray. Blue and orange lines indicate linear fits through different regions of <inline-formula><mml:math id="M116" display="inline"><mml:mrow><mml:msub><mml:mi>E</mml:mi><mml:mi mathvariant="normal">I</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/3095/2026/amt-19-3095-2026-f05.jpg"/>

        </fig>

      <p id="d2e2151">While these differences highlight limitations of interpolation in regions of enhanced CO, a more general assessment of spatial variability across scales requires a spectral analysis. We therefore calculate power spectral densities <inline-formula><mml:math id="M117" display="inline"><mml:mrow><mml:msub><mml:mi>E</mml:mi><mml:mi mathvariant="normal">I</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, which describe how variance in a spatial signal is distributed across different wavenumbers (<inline-formula><mml:math id="M118" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>). Since this analysis requires data on a regular grid, the ML-predicted CO concentrations are first interpolated onto a grid with constant spacing. As in the previous section, we focus on the total column CO field over North America on 10 June 2023, gridded at a resolution of <inline-formula><mml:math id="M119" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.80</mml:mn><mml:mi mathvariant="italic">°</mml:mi><mml:mo>/</mml:mo><mml:mn mathvariant="normal">6</mml:mn><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">0.133</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula> in both latitude and longitude. Because the CrIS L1B radiances and corresponding ML predictions are provided on a similar but irregular grid, nearest-neighbor interpolation is used to retain most of the native variability; for comparison, linear and cubic spline interpolations are also evaluated.</p>
      <p id="d2e2198">Many geophysical fields exhibit scale-invariant behavior over a large range of wavenumbers, with <inline-formula><mml:math id="M120" display="inline"><mml:mrow><mml:msub><mml:mi>E</mml:mi><mml:mi mathvariant="normal">I</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> following a power law:

                <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M121" display="block"><mml:mstyle displaystyle="true" class="stylechange"/><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi>E</mml:mi><mml:mi mathvariant="normal">I</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo><mml:mo>∼</mml:mo><mml:msup><mml:mi>k</mml:mi><mml:mrow><mml:mo>-</mml:mo><mml:mi mathvariant="italic">β</mml:mi></mml:mrow></mml:msup><mml:mo>.</mml:mo></mml:mrow></mml:math></disp-formula>

          Sudden changes in the slope <inline-formula><mml:math id="M122" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula>, so-called scale breaks, indicate changes in the physical processes governing variability. Such breaks have been reported in cloud-reflected radiances <xref ref-type="bibr" rid="bib1.bibx15" id="paren.43"><named-content content-type="pre">e.g.</named-content></xref>, paleotemperature records <xref ref-type="bibr" rid="bib1.bibx40" id="paren.44"><named-content content-type="pre">e.g.</named-content></xref>, and climate variability <xref ref-type="bibr" rid="bib1.bibx20" id="paren.45"><named-content content-type="pre">e.g.</named-content></xref>. We compute <inline-formula><mml:math id="M123" display="inline"><mml:mrow><mml:msub><mml:mi>E</mml:mi><mml:mi mathvariant="normal">I</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> as the squared amplitude of the Fourier-transformed CO predictions in both latitudinal and longitudinal directions.</p>
      <p id="d2e2289">Figure <xref ref-type="fig" rid="F5"/>c and d present <inline-formula><mml:math id="M124" display="inline"><mml:mrow><mml:msub><mml:mi>E</mml:mi><mml:mi mathvariant="normal">I</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> averaged over all grid points in latitude and longitude, respectively. A scale break is observed at <inline-formula><mml:math id="M125" display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>≈</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1.70</mml:mn></mml:mrow></mml:math></inline-formula>, corresponding to spatial scales of <inline-formula><mml:math id="M126" display="inline"><mml:mrow><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">3.0</mml:mn></mml:mrow></mml:math></inline-formula>–<inline-formula><mml:math id="M127" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.5</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula>, in both directions. Linear fits before and after the break, shown in blue and orange, were computed using the octave binning method reported in <xref ref-type="bibr" rid="bib1.bibx14" id="text.46"/>, which mitigates noise and limits energy accumulation at small scales. The binned <inline-formula><mml:math id="M128" display="inline"><mml:mrow><mml:msub><mml:mi>E</mml:mi><mml:mi mathvariant="normal">I</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> values are plotted as black dots. At the scale break, the slope in latitude (longitude) flattens from <inline-formula><mml:math id="M129" display="inline"><mml:mrow><mml:mi mathvariant="italic">β</mml:mi><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">1.77</mml:mn></mml:mrow></mml:math></inline-formula> (<inline-formula><mml:math id="M130" display="inline"><mml:mn mathvariant="normal">1.77</mml:mn></mml:math></inline-formula>) to <inline-formula><mml:math id="M131" display="inline"><mml:mrow><mml:mi mathvariant="italic">β</mml:mi><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">0.49</mml:mn></mml:mrow></mml:math></inline-formula> (<inline-formula><mml:math id="M132" display="inline"><mml:mn mathvariant="normal">0.23</mml:mn></mml:math></inline-formula>), indicating a relative enhancement of small-scale CO variability. This transition is consistent with the de-correlation length scale of CO (not shown) and indicates a transition to enhanced small-scale variability. A detailed attribution of the underlying processes is beyond the scope of this study. Notably, no secondary break is observed at smaller scales, and no steeper decline in <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:msub><mml:mi>E</mml:mi><mml:mi mathvariant="normal">I</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>k</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is found below the operational TROPESS retrieval resolution, which would imply reduced variability and a smoother distribution. Instead, the CO field remains highly variable down to the Nyquist limit of <inline-formula><mml:math id="M134" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>⋅</mml:mo><mml:mn mathvariant="normal">0.80</mml:mn><mml:mi mathvariant="italic">°</mml:mi><mml:mo>/</mml:mo><mml:mn mathvariant="normal">6</mml:mn><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">0.267</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula> (derived from the effective sampling resolution).</p>
      <p id="d2e2446">Importantly, the observed scale break is neither an artifact of the retrieval nor dependent on the interpolation scheme. Applying the same analysis to CO-sensitive radiances in the spectral microwindow used in the OE retrieval (gray line in Fig. <xref ref-type="fig" rid="F5"/>c and d) yields similar results: comparable scale breaks at <inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>≈</mml:mo><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula> and consistent <inline-formula><mml:math id="M136" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula> values. Changing the interpolation scheme from nearest neighbor to linear or cubic spline has minimal effect on the location of the break, though <inline-formula><mml:math id="M137" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula> values flatten slightly, reflecting increased variability across all scales. These minimal changes are not surprising, since the ML data are available at a very high spatial resolution (albeit on an irregular spatial grid).</p>
      <p id="d2e2479">For comparison, the power spectral density of linearly interpolated OE fields exhibits a similar large-scale slope (<inline-formula><mml:math id="M138" display="inline"><mml:mrow><mml:mi mathvariant="italic">β</mml:mi><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">1.70</mml:mn></mml:mrow></mml:math></inline-formula>) but a much steeper decay of variability toward smaller spatial scales (<inline-formula><mml:math id="M139" display="inline"><mml:mrow><mml:mi mathvariant="italic">β</mml:mi><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">3.28</mml:mn></mml:mrow></mml:math></inline-formula>), reflecting the smoothing inherent in interpolation. While the apparent scale break shifts slightly, its exact location is sensitive to the fitting procedure and binning choices and is therefore not interpreted further.</p>
      <p id="d2e2507">In summary, the results in this section demonstrate that significant variability in total CO concentrations exists at scales below <inline-formula><mml:math id="M140" display="inline"><mml:mrow><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">4</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula>, and importantly, below the nominal <inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.80</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula> resolution of the TROPESS retrievals. While interpolated OE fields exhibit a strong suppression of variability at these smaller scales, the ML product retains substantial structure down to the Nyquist limit. This indicates that the ML model captures physically meaningful sub-resolution variability that is not recovered by interpolation alone.</p>
</sec>
<sec id="Ch1.S4.SS3">
  <label>4.3</label><title>Computational costs</title>
      <p id="d2e2540">A key advantage of applying machine learning models in inference mode is their computational efficiency <xref ref-type="bibr" rid="bib1.bibx55" id="paren.47"/>. As expected, the ML model is able to process a full day of CrIS radiance observations with remarkable speed. For 10 June 2023, the OE algorithm generated 44 192 CO column retrievals in <inline-formula><mml:math id="M142" display="inline"><mml:mrow><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">160</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M143" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">min</mml:mi></mml:mrow></mml:math></inline-formula>. In contrast, the ML model predicted CO concentrations and associated diagnostics for 2 916 000 columns in just <inline-formula><mml:math id="M144" display="inline"><mml:mrow><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">6</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M145" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">min</mml:mi></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e2582">This performance difference is even more striking when considering the computational resources used. The OE algorithm was run on 60 compute nodes utilizing a total of 480 CPU cores, while the ML model required only a single compute node with 8 CPU cores. Additionally, the prediction success rate was higher: <inline-formula><mml:math id="M146" display="inline"><mml:mrow><mml:mn mathvariant="normal">98.4</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> for the ML model (based on a conservative outlier flag) compared to <inline-formula><mml:math id="M147" display="inline"><mml:mrow><mml:mn mathvariant="normal">90.69</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> for the OE retrievals.</p>
      <p id="d2e2609">The superior computational performance of the ML model ensures that every individual CrIS sample can be processed efficiently, enabling predictions for any species included in the TROPESS retrieval framework. Moreover, this efficiency opens the door to near-real-time applications and provides a practical means to use the ML outputs to help constrain or enhance OE retrievals (see the discussion in Sect. 5).</p>
      <p id="d2e2612">Note, however, that the OE algorithm produces a vertical profile and an associated AK matrix whereas TROPESS-HYREF currently only predicts the derived column.</p>
</sec>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <label>5</label><title>Conclusions</title>
      <p id="d2e2624">This study presents a hybrid fusion of physics-based optimal estimation (OE) retrievals and machine learning (ML) to generate global, high-resolution carbon monoxide (CO) concentrations and associated diagnostics from CrIS observations. Our approach leverages the strengths of the TROPESS OE retrievals, namely accuracy, physical consistency, and interpretability, while using an artificial neural network to overcome their main limitation: sparse spatial sampling due to high computational costs and strict quality filtering. This enables us to increase the fraction of processed CrIS observations from <inline-formula><mml:math id="M148" display="inline"><mml:mrow><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> to <inline-formula><mml:math id="M149" display="inline"><mml:mrow><mml:mn mathvariant="normal">100</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e2653">The trained ML model within this MAchine Learning-OPtimal Estimation (TROPESS-HYREF) framework reproduces TROPESS CO column retrievals with high accuracy, achieving correlations exceeding 0.99 and low absolute biases <inline-formula><mml:math id="M150" display="inline"><mml:mrow><mml:mo>&lt;</mml:mo><mml:mn mathvariant="normal">0.1</mml:mn><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula> across both test and validation data sets. Importantly, TROPESS-HYREF predicts not only column concentrations, but also associated diagnostics, including column averaging kernels, degrees of freedom (DoF), and retrieval errors. The close agreement between predicted and OE-derived quantities demonstrates that the ML model successfully emulates both the retrieved state and its associated sensitivity and uncertainty characteristics. We emphasize that the ML component is designed to reproduce and extend the OE retrieval, rather than to surpass its physical accuracy, by providing full spatial coverage and enhanced resolution consistent with the underlying solution.</p>
      <p id="d2e2670">Using representative example scenes, we demonstrate that the TROPESS-HYREF predictions reproduce and extend fine-scale spatial structures consistent with the OE retrieval and outperform standard interpolation methods, particularly in regions with elevated CO due to wildfire emissions. A scale analysis reveals significant spatial variability in the CO fields below <inline-formula><mml:math id="M151" display="inline"><mml:mrow><mml:mn mathvariant="normal">3.5</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula> and, more importantly, below the OE retrieval's native <inline-formula><mml:math id="M152" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.80</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula> resolution, indicating that the ML predictions resolve meaningful sub-retrieval-scale features. Notably, variability persists down to the Nyquist sampling limit imposed by the CrIS observation footprint.</p>
      <p id="d2e2693">In terms of computational performance, TROPESS-HYREF processes a full day of CrIS observations more than 25 times faster than the OE algorithm, despite producing predictions for over 65 times more observations (i.e. 1625 times faster). The high success rate of the ML inference (<inline-formula><mml:math id="M153" display="inline"><mml:mrow><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">98.4</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>) compared to the OE retrieval (<inline-formula><mml:math id="M154" display="inline"><mml:mrow><mml:mo>≈</mml:mo><mml:mn mathvariant="normal">90.7</mml:mn><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:mrow></mml:math></inline-formula>) further ensures consistent, global data availability.</p>
      <p id="d2e2725">By providing retrieval-like products at full coverage and enhanced resolution, this work bridges the gap between physically constrained atmospheric retrievals and scalable machine learning predictions. The ML outputs are suitable for downstream applications, including data assimilation, model validation, and trend analysis, while maintaining consistency with OE-derived physical information. They also offer the potential to inform and constrain future retrieval efforts, for example by serving as prior states or as an additional quality flag, where large discrepancies between OE and ML results could highlight potential issues with individual samples. In the current implementation, the model is trained on approximately two years of TROPESS OE retrievals; however, ongoing work focuses on incorporating regular retraining using newly available OE data. This ensures that TROPESS-HYREF continuously adapts to evolving atmospheric conditions while maintaining consistency with the OE retrieval, reinforcing its role as a complementary extension rather than a stand-alone predictive model.</p>
      <p id="d2e2728">The methodology developed for CO can be readily extended to other trace gases retrieved by the TROPESS MUSES OE algorithm, including ammonia (<inline-formula><mml:math id="M155" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">NH</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>), ozone (<inline-formula><mml:math id="M156" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">O</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>), and methane (<inline-formula><mml:math id="M157" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">CH</mml:mi><mml:mn mathvariant="normal">4</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>), as well as to other satellite instruments (TES, AIRS, OMI, and TROPOMI) and multi-instrument configurations (CrIS<inline-formula><mml:math id="M158" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>TROPOMI or AIRS<inline-formula><mml:math id="M159" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula>OMI) <xref ref-type="bibr" rid="bib1.bibx22 bib1.bibx36" id="paren.48"/>. In parallel, ongoing work explores the use of ML predictions as a first guess in the OE retrieval algorithm, which may accelerate convergence and reduce computational costs. This hybrid framework therefore complements, rather than replaces, the OE retrieval, enabling scalable, physically consistent, high-resolution atmospheric composition products.</p>
</sec>

      
      </body>
    <back><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d2e2786">CrIS L1B radiances and the TROPESS CO product files can be downloaded from GES DISC. A Zenodo repository (<ext-link xlink:href="https://doi.org/10.5281/zenodo.16968703" ext-link-type="DOI">10.5281/zenodo.16968703</ext-link>, <xref ref-type="bibr" rid="bib1.bibx56" id="altparen.49"/>) contains the HYREF CO model and all necessary Python routines, as well as a Jupyter notebook with step-by-step instructions, so interested parties can produce their own CO predictions. This repository also includes Jupyter notebooks, Python routines, and ancillary data sets to reproduce each figure in the manuscript.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e2798">FW, KWB, SL, JLL, VHP, and JLM have shaped the concept of this study and refined the approach during extensive discussions. FW, SL, and JLM implemented the ML algorithm into the current TROPESS algorithm pipeline. FW carried out the data analysis and prepared the figures for the manuscript. FW wrote the initial draft of the manuscript, which was subsequently refined by all authors.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e2804">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e2810">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e2816">Government sponsorship acknowledged. Work at the Jet Propulsion Laboratory, California Institute of Technology, was carried out under contract with the National Aeronautics and Space Administration (80NM0018D0004).</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e2821">This research has been supported by the National Aeronautics and Space Administration, Science Mission Directorate (grant no. 80NM0020F0062).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e2827">This paper was edited by Zhao-Cheng Zeng and reviewed by Daniel Miller and one anonymous referee.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Abadi et al.(2016)</label><mixed-citation> Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G. S., Davis, A., Dean, J., Devin, M., Ghemawat, S., Goodfellow, I., Harp, A., Irving, G., Isard, M., Jia, Y., Jozefowicz, R., Kaiser, L., Kudlur, M., Levenberg, J., Mane, D., Monga, R., Moore, S., Murray, D., Olah, C., Schuster, M., Shlens, J., Steiner, B., Sutskever, I., Talwar, K., Tucker, P., Vanhoucke, V., Vasudevan, V., Viegas, F., Vinyals, O., Warden, P., Wattenberg, M., Wicke, M., Yu, Y., and Zheng, X.: TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems, arXiv [preprint], arXiv1603.04467v2, Tue, 31 May 2016, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Aumann et al.(2003)</label><mixed-citation>Aumann, H., Chahine, M., Gautier, C., Goldberg, M., Kalnay, E., McMillin, L., Revercomb, H., Rosenkranz, P., Smith, W., Staelin, D., Strow, L., and Susskind, J.: AIRS/AMSU/HSB on the Aqua mission: design, science objectives, data products, and processing systems, IEEE T. Geosci. Remote, 41, 253–264, <ext-link xlink:href="https://doi.org/10.1109/TGRS.2002.808356" ext-link-type="DOI">10.1109/TGRS.2002.808356</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Beer et al.(2001)</label><mixed-citation>Beer, R., Glavich, T. A., and Rider, D. M.: Tropospheric emission spectrometer for the Earth Observing System's Aura satellite, Appl. Optics, 40, 2356–2367, <ext-link xlink:href="https://doi.org/10.1364/AO.40.002356" ext-link-type="DOI">10.1364/AO.40.002356</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Bowman and Henze(2012)</label><mixed-citation>Bowman, K. and Henze, D. K.: Attribution of direct ozone radiative forcing to spatially resolved emissions, Geophys. Res. Lett., 39, <ext-link xlink:href="https://doi.org/10.1029/2012GL053274" ext-link-type="DOI">10.1029/2012GL053274</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Bowman et al.(2006)</label><mixed-citation>Bowman, K., Rodgers, C., Kulawik, S., Worden, J., Sarkissian, E., Osterman, G., Steck, T., Lou, M., Eldering, A., Shephard, M., Worden, H., Lampel, M., Clough, S., Brown, P., Rinsland, C., Gunson, M., and Beer, R.: Tropospheric emission spectrometer: retrieval method and error analysis, IEEE T. Geosci. Remote, 44, 1297–1307, <ext-link xlink:href="https://doi.org/10.1109/TGRS.2006.871234" ext-link-type="DOI">10.1109/TGRS.2006.871234</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Bowman(2021)</label><mixed-citation>Bowman, K. W.: TROPESS CrIS-JPSS1 L2 Carbon Monoxide for Forward Processing, Summary Product V1, NASA Goddard Earth Sciences Data and Information Services Center, 2022 [data set], <ext-link xlink:href="https://doi.org/10.5067/JL1HT3NGEAW3" ext-link-type="DOI">10.5067/JL1HT3NGEAW3</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Buchholz et al.(2018)</label><mixed-citation>Buchholz, R. R., Hammerling, D., Worden, H. M., Deeter, M. N., Emmons, L. K., Edwards, D. P., and Monks, S. A.: Links between carbon monoxide and climate indices for the Southern Hemisphere and tropical fire regions, J. Geophys. Res.-Atmos., 123, 9786–9800, <ext-link xlink:href="https://doi.org/10.1029/2018JD028438" ext-link-type="DOI">10.1029/2018JD028438</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Buchholz et al.(2021)</label><mixed-citation>Buchholz, R. R., Worden, H. M., Park, M., Francis, G., Deeter, M. N., Edwards, D. P., Emmons, L. K., Gaubert, B., Gille, J., Martínez-Alonso, S., Tang, W., Kumar, R., Drummond, J. R., Clerbaux, C., George, M., Coheur, P.-F., Hurtmans, D., Bowman, K. W., Luo, M., Payne, V. H., Worden, J. R., Chin, M., Levy, R. C., Warner, J., Wei, Z., and Kulawik, S. S.: Air pollution trends measured from Terra: CO and AOD over industrial, fire-prone, and background regions, Remote Sens. Environ., 256, 112275, <ext-link xlink:href="https://doi.org/10.1016/j.rse.2020.112275" ext-link-type="DOI">10.1016/j.rse.2020.112275</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Byrne et al.(2021)</label><mixed-citation>Byrne, B., Liu, J., Lee, M., Yin, Y., Bowman, K. W., Miyazaki, K., Norton, A. J., Joiner, J., Pollard, D. F., Griffith, D. W. T., Velazco, V. A., Deutscher, N. M., Jones, N. B., and Paton-Walsh, C.: The carbon cycle of southeast Australia during 2019–2020: drought, fires, and subsequent recovery, AGU Advances, 2, e2021AV000469, <ext-link xlink:href="https://doi.org/10.1029/2021AV000469" ext-link-type="DOI">10.1029/2021AV000469</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Byrne et al.(2024)</label><mixed-citation>Byrne, B., Liu, J., Bowman, K. W., Pascolini-Campbell, M., Chatterjee, A., Pandey, S., Miyazaki, K., van der Werf, G. R., Wunch, D., Wennberg, P. O., Roehl, C. M., and Sinha, S.: Carbon emissions from the 2023 Canadian wildfires, Nature, 633, 835–839, <ext-link xlink:href="https://doi.org/10.1038/s41586-024-07878-z" ext-link-type="DOI">10.1038/s41586-024-07878-z</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Chollet et al.(2015)</label><mixed-citation>Chollet, F. et al.: Keras, <uri>https://keras.io</uri> (last access: 29 April 2026), 2015.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Clerbaux et al.(2002)</label><mixed-citation>Clerbaux, C., Hadji-Lazaro, J., Payan, S., Camy-Peyret, C., Wang, J., Edwards, D. P., and Luo, M.: Retrieval of CO from nadir remote-sensing measurements in the infrared by use of four different inversion algorithms, Appl. Optics, 41, 7068–7078, <ext-link xlink:href="https://doi.org/10.1364/AO.41.007068" ext-link-type="DOI">10.1364/AO.41.007068</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Clerbaux et al.(2009)</label><mixed-citation>Clerbaux, C., Boynard, A., Clarisse, L., George, M., Hadji-Lazaro, J., Herbin, H., Hurtmans, D., Pommier, M., Razavi, A., Turquety, S., Wespes, C., and Coheur, P.-F.: Monitoring of atmospheric composition using the thermal infrared IASI/MetOp sounder, Atmos. Chem. Phys., 9, 6041–6054, <ext-link xlink:href="https://doi.org/10.5194/acp-9-6041-2009" ext-link-type="DOI">10.5194/acp-9-6041-2009</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Davis et al.(1996)</label><mixed-citation>Davis, A., Marshak, A., Wiscombe, W., and Cahalan, R.: Scale invariance of liquid water distributions in marine stratocumulus. Part I: Spectral properties and stationarity issues, J. Atmos. Sci., 53, 1538–1558, <ext-link xlink:href="https://doi.org/10.1175/1520-0469(1996)053&lt;1538:SIOLWD&gt;2.0.CO;2" ext-link-type="DOI">10.1175/1520-0469(1996)053&lt;1538:SIOLWD&gt;2.0.CO;2</ext-link>, 1996.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Davis et al.(1997)</label><mixed-citation> Davis, A., Marshak, A., Cahalan, R., and Wiscombe, W.: The Landsat scale break in stratocumulus as a three-dimensional radiative transfer effect: implications for cloud remote sensing, J. Atmos. Sci., 54, 241–260, 1997.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Drummond et al.(2010)</label><mixed-citation>Drummond, J. R., Zou, J., Nichitiu, F., Kar, J., Deschambaut, R., and Hackett, J.: A review of 9-year performance and operation of the MOPITT instrument, Adv. Space Res., 45, 760–774, <ext-link xlink:href="https://doi.org/10.1016/j.asr.2009.11.019" ext-link-type="DOI">10.1016/j.asr.2009.11.019</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Edwards et al.(2004)</label><mixed-citation>Edwards, D. P., Emmons, L. K., Hauglustaine, D. A., Chu, D. A., Gille, J. C., Kaufman, Y. J., Pétron, G., Yurganov, L. N., Giglio, L., Deeter, M. N., Yudin, V., Ziskin, D. C., Warner, J., Lamarque, J.-F., Francis, G. L., Ho, S. P., Mao, D., Chen, J., Grechko, E. I., and Drummond, J. R.: Observations of carbon monoxide and aerosols from the Terra satellite: Northern Hemisphere variability, J. Geophys. Res.-Atmos., 109, <ext-link xlink:href="https://doi.org/10.1029/2004JD004727" ext-link-type="DOI">10.1029/2004JD004727</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Field et al.(2015)</label><mixed-citation>Field, R. D., Luo, M., Kim, D., Del Genio, A. D., Voulgarakis, A., and Worden, J.: Sensitivity of simulated tropospheric CO to subgrid physics parameterization: a case study of Indonesian biomass burning emissions in 2006, J. Geophys. Res.-Atmos., 120, 11743–11759, <ext-link xlink:href="https://doi.org/10.1002/2015JD023402" ext-link-type="DOI">10.1002/2015JD023402</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Field et al.(2016)</label><mixed-citation>Field, R. D., Luo, M., Fromm, M., Voulgarakis, A., Mangeon, S., and Worden, J.: Simulating the Black Saturday 2009 smoke plume with an interactive composition-climate model: Sensitivity to emissions amount, timing, and injection height, J. Geophys. Res.-Atmos., 121, 4296–4316, <ext-link xlink:href="https://doi.org/10.1002/2015JD024343" ext-link-type="DOI">10.1002/2015JD024343</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Franzke et al.(2020)</label><mixed-citation>Franzke, C. L. E., Barbosa, S., Blender, R., Fredriksen, H.-B., Laepple, T., Lambert, F., Nilsen, T., Rypdal, K., Rypdal, M., Scotto, M. G., Vannitsem, S., Watkins, N. W., Yang, L., and Yuan, N.: The structure of climate variability across scales, Rev. Geophys., 58, e2019RG000657, <ext-link xlink:href="https://doi.org/10.1029/2019RG000657" ext-link-type="DOI">10.1029/2019RG000657</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Fu et al.(2016)</label><mixed-citation>Fu, D., Bowman, K. W., Worden, H. M., Natraj, V., Worden, J. R., Yu, S., Veefkind, P., Aben, I., Landgraf, J., Strow, L., and Han, Y.: High-resolution tropospheric carbon monoxide profiles retrieved from CrIS and TROPOMI, Atmos. Meas. Tech., 9, 2567–2579, <ext-link xlink:href="https://doi.org/10.5194/amt-9-2567-2016" ext-link-type="DOI">10.5194/amt-9-2567-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Fu et al.(2018)</label><mixed-citation>Fu, D., Kulawik, S. S., Miyazaki, K., Bowman, K. W., Worden, J. R., Eldering, A., Livesey, N. J., Teixeira, J., Irion, F. W., Herman, R. L., Osterman, G. B., Liu, X., Levelt, P. F., Thompson, A. M., and Luo, M.: Retrievals of tropospheric ozone profiles from the synergism of AIRS and OMI: methodology and validation, Atmos. Meas. Tech., 11, 5587–5605, <ext-link xlink:href="https://doi.org/10.5194/amt-11-5587-2018" ext-link-type="DOI">10.5194/amt-11-5587-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Fu et al.(2019)</label><mixed-citation>Fu, D., Millet, D. B., Wells, K. C., Payne, V. H., Yu, S., Guenther, A., and Eldering, A.: Direct retrieval of isoprene from satellite-based infrared measurements, Nat. Commun., 10, 3811, <ext-link xlink:href="https://doi.org/10.1038/s41467-019-11835-0" ext-link-type="DOI">10.1038/s41467-019-11835-0</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Gaubert et al.(2017)</label><mixed-citation>Gaubert, B., Worden, H. M., Arellano, A. F. J., Emmons, L. K., Tilmes, S., Barré, J., Martinez Alonso, S., Vitt, F., Anderson, J. L., Alkemade, F., Houweling, S., and Edwards, D. P.: Chemical feedback from decreasing carbon monoxide emissions, Geophys. Res. Lett., 44, 9985–9995, <ext-link xlink:href="https://doi.org/10.1002/2017GL074987" ext-link-type="DOI">10.1002/2017GL074987</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Goodfellow et al.(2016)</label><mixed-citation> Goodfellow, I., Bengio, Y., and Courville, A.: Deep Learning (Adaptive Computation and Machine Learning Series), The MIT Press, Cambridge, MA, ISBN-10 0262035618, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Grivas and Chaloulakou(2006)</label><mixed-citation>Grivas, G. and Chaloulakou, A.: Artificial neural network models for prediction of <inline-formula><mml:math id="M160" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">PM</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:math></inline-formula> hourly concentrations, in the Greater Area of Athens, Greece, Atmos. Environ., 40, 1216–1229, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2005.10.036" ext-link-type="DOI">10.1016/j.atmosenv.2005.10.036</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Han et al.(2013)</label><mixed-citation>Han, Y., Revercomb, H., Cromp, M., Gu, D., Johnson, D., Mooney, D., Scott, D., Strow, L., Bingham, G., Borg, L., Chen, Y., DeSlover, D., Esplin, M., Hagan, D., Jin, X., Knuteson, R., Motteler, H., Predina, J., Suwinski, L., Taylor, J., Tobin, D., Tremblay, D., Wang, C., Wang, L., Wang, L., and Zavyalov, V.: Suomi NPP CrIS measurements, sensor data record algorithm, calibration and validation activities, and record data quality, J. Geophys. Res.-Atmos., 118, 12734–12748, <ext-link xlink:href="https://doi.org/10.1002/2013JD020344" ext-link-type="DOI">10.1002/2013JD020344</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Holloway et al.(2000)</label><mixed-citation>Holloway, T., Levy II, H., and Kasibhatla, P.: Global distribution of carbon monoxide, J. Geophys. Res.-Atmos., 105, 12123–12147, <ext-link xlink:href="https://doi.org/10.1029/1999JD901173" ext-link-type="DOI">10.1029/1999JD901173</ext-link>, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>IPCC(2023)</label><mixed-citation>IPCC: Climate Change 2023: Synthesis Report. A Report of the Intergovernmental Panel on Climate Change, IPCC, <ext-link xlink:href="https://doi.org/10.59327/IPCC/AR6-9789291691647" ext-link-type="DOI">10.59327/IPCC/AR6-9789291691647</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Jacob(1999)</label><mixed-citation> Jacob, D.: Instroduction to Atmospheric Chemistry, Princeton University Press, ISBN 9780691001852, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Jain et al.(2024)</label><mixed-citation>Jain, P., Barber, Q. E., Taylor, S. W., Whitman, E., Castellanos Acuna, D., Boulanger, Y., Chavardès, R. D., Chen, J., Englefield, P., Flannigan, M., Girardin, M. P., Hanes, C. C., Little, J., Morrison, K., Skakun, R. S., Thompson, D. K., Wang, X., and Parisien, M.-A.: Drivers and impacts of the record-breaking 2023 wildfire season in Canada, Nat. Commun., 15, 6764, <ext-link xlink:href="https://doi.org/10.1038/s41467-024-51154-7" ext-link-type="DOI">10.1038/s41467-024-51154-7</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Jones et al.(2003)</label><mixed-citation>Jones, D. B. A., Bowman, K. W., Palmer, P. I., Worden, J. R., Jacob, D. J., Hoffman, R. N., Bey, I., and Yantosca, R. M.: Potential of observations from the Tropospheric Emission Spectrometer to constrain continental sources of carbon monoxide, J. Geophys. Res.-Atmos., 108, <ext-link xlink:href="https://doi.org/10.1029/2003JD003702" ext-link-type="DOI">10.1029/2003JD003702</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Knobelspiesse et al.(2019)</label><mixed-citation>Knobelspiesse, K., Tan, Q., Bruegge, C., Cairns, B., Chowdhary, J., van Diedenhoven, B., Diner, D., Ferrare, R., van Harten, G., Jovanovic, V., Ottaviani, M., Redemann, J., Seidel, F., and Sinclair, K.: Intercomparison of airborne multi-angle polarimeter observations from the Polarimeter Definition Experiment, Appl. Optics, 58, 650–669, <ext-link xlink:href="https://doi.org/10.1364/AO.58.000650" ext-link-type="DOI">10.1364/AO.58.000650</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Lelieveld et al.(2016)</label><mixed-citation>Lelieveld, J., Gromov, S., Pozzer, A., and Taraborrelli, D.: Global tropospheric hydroxyl distribution, budget and reactivity, Atmos. Chem. Phys., 16, 12477–12493, <ext-link xlink:href="https://doi.org/10.5194/acp-16-12477-2016" ext-link-type="DOI">10.5194/acp-16-12477-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Luo et al.(2024)</label><mixed-citation>Luo, M., Worden, H. M., Field, R. D., Tsigaridis, K., and Elsaesser, G. S.: TROPESS-CrIS CO single-pixel vertical profiles: intercomparisons with MOPITT and model simulations for 2020 western US wildfires, Atmos. Meas. Tech., 17, 2611–2624, <ext-link xlink:href="https://doi.org/10.5194/amt-17-2611-2024" ext-link-type="DOI">10.5194/amt-17-2611-2024</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Malina et al.(2024)</label><mixed-citation>Malina, E., Bowman, K. W., Kantchev, V., Kuai, L., Kurosu, T. P., Miyazaki, K., Natraj, V., Osterman, G. B., Oyafuso, F., and Thill, M. D.: Joint spectral retrievals of ozone with Suomi NPP CrIS augmented by S5P/TROPOMI, Atmos. Meas. Tech., 17, 5341–5371, <ext-link xlink:href="https://doi.org/10.5194/amt-17-5341-2024" ext-link-type="DOI">10.5194/amt-17-5341-2024</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Miller et al.(2020)</label><mixed-citation>Miller, D. J., Segal-Rozenhaimer, M., Knobelspiesse, K., Redemann, J., Cairns, B., Alexandrov, M., van Diedenhoven, B., and Wasilewski, A.: Low-level liquid cloud properties during ORACLES retrieved using airborne polarimetric measurements and a neural network algorithm, Atmos. Meas. Tech., 13, 3447–3470, <ext-link xlink:href="https://doi.org/10.5194/amt-13-3447-2020" ext-link-type="DOI">10.5194/amt-13-3447-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Miyazaki et al.(2015)</label><mixed-citation>Miyazaki, K., Eskes, H. J., and Sudo, K.: A tropospheric chemistry reanalysis for the years 2005–2012 based on an assimilation of OMI, MLS, TES, and MOPITT satellite data, Atmos. Chem. Phys., 15, 8315–8348, <ext-link xlink:href="https://doi.org/10.5194/acp-15-8315-2015" ext-link-type="DOI">10.5194/acp-15-8315-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Neyra-Nazarrett et al.(2025)</label><mixed-citation>Neyra-Nazarrett, O. A., Miyazaki, K., Bowman, K. W., and Saide, P. E.: An assessment of TROPESS CrIS and TROPOMI CO retrievals and their synergies for the 2020 Western U.S. wildfires, Remote Sens.-Basel, 17, <ext-link xlink:href="https://doi.org/10.3390/rs17111854" ext-link-type="DOI">10.3390/rs17111854</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Nilsen et al.(2016)</label><mixed-citation>Nilsen, T., Rypdal, K., and Fredriksen, H.-B.: Are there multiple scaling regimes in Holocene temperature records?, Earth Syst. Dynam., 7, 419–439, <ext-link xlink:href="https://doi.org/10.5194/esd-7-419-2016" ext-link-type="DOI">10.5194/esd-7-419-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Noël et al.(2022)</label><mixed-citation>Noël, S., Reuter, M., Buchwitz, M., Borchardt, J., Hilker, M., Schneising, O., Bovensmann, H., Burrows, J. P., Di Noia, A., Parker, R. J., Suto, H., Yoshida, Y., Buschmann, M., Deutscher, N. M., Feist, D. G., Griffith, D. W. T., Hase, F., Kivi, R., Liu, C., Morino, I., Notholt, J., Oh, Y.-S., Ohyama, H., Petri, C., Pollard, D. F., Rettinger, M., Roehl, C., Rousogenous, C., Sha, M. K., Shiomi, K., Strong, K., Sussmann, R., Té, Y., Velazco, V. A., Vrekoussis, M., and Warneke, T.: Retrieval of greenhouse gases from GOSAT and GOSAT-2 using the FOCAL algorithm, Atmos. Meas. Tech., 15, 3401–3437, <ext-link xlink:href="https://doi.org/10.5194/amt-15-3401-2022" ext-link-type="DOI">10.5194/amt-15-3401-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Reed and Marks(1999)</label><mixed-citation> Reed, R. and Marks, ll, R. J.: Neural Smithing: Supervised Learning in Feedforward Artificial Neural Networks, A Bradford Book, ISBN-10 0262181908, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>Reichle Jr. et al.(1990)</label><mixed-citation>Reichle Jr., H. G., Connors, V. S., Holland, J. A., Sherrill, R. T., Wallio, H. A., Casas, J. C., Condon, E. P., Gormsen, B. B., and Seiler, W.: The distribution of middle tropospheric carbon monoxide during early October 1984, J. Geophys. Res.-Atmos., 95, 9845–9856, <ext-link xlink:href="https://doi.org/10.1029/JD095iD07p09845" ext-link-type="DOI">10.1029/JD095iD07p09845</ext-link>, 1990.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Rodgers(2000)</label><mixed-citation> Rodgers, C.: Inverse Methods for Atmospheric Sounding, World Scientific Publishing Co., ISBN-10 981022740X, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx45"><label>Saponaro et al.(2013)</label><mixed-citation>Saponaro, G., Kolmonen, P., Karhunen, J., Tamminen, J., and de Leeuw, G.: A neural network algorithm for cloud fraction estimation using NASA-Aura OMI VIS radiance measurements, Atmos. Meas. Tech., 6, 2301–2309, <ext-link xlink:href="https://doi.org/10.5194/amt-6-2301-2013" ext-link-type="DOI">10.5194/amt-6-2301-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Schultz et al.(2015)</label><mixed-citation>Schultz, M. G., Akimoto, H., Bottenheim, J., Buchmann, B., Galbally, I. E., Gilge, S., Helmig, D., Koide, H., Lewis, A. C., Novelli, P. C., Plass-Dülmer, C., Ryerson, T. B., Steinbacher, M., Steinbrecher, R., Tarasova, O., Tørseth, K., Thouret, V., and Zellweger, C.: The Global Atmosphere Watch reactive gases measurement network, Elementa: Science of the Anthropocene, 3, 000067, <ext-link xlink:href="https://doi.org/10.12952/journal.elementa.000067" ext-link-type="DOI">10.12952/journal.elementa.000067</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>Schultz et al.(2021)</label><mixed-citation>Schultz, M. G., Betancourt, C., Gong, B., Kleinert, F., Langguth, M., Leufen, L. H., Mozaffari, A., and Stadtler, S.: Can deep learning beat numerical weather prediction?, Philos. T. Roy. Soc. A, 379, <ext-link xlink:href="https://doi.org/10.1098/rsta.2020.0097" ext-link-type="DOI">10.1098/rsta.2020.0097</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx48"><label>Strode et al.(2016)</label><mixed-citation>Strode, S. A., Worden, H. M., Damon, M., Douglass, A. R., Duncan, B. N., Emmons, L. K., Lamarque, J.-F., Manyin, M., Oman, L. D., Rodriguez, J. M., Strahan, S. E., and Tilmes, S.: Interpreting space-based trends in carbon monoxide with multiple models, Atmos. Chem. Phys., 16, 7285–7294, <ext-link xlink:href="https://doi.org/10.5194/acp-16-7285-2016" ext-link-type="DOI">10.5194/acp-16-7285-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx49"><label>Tyralis and Papacharalampous(2024)</label><mixed-citation>Tyralis, H. and Papacharalampous, G.: A review of predictive uncertainty estimation with machine learning, Artif. Intell. Rev., 57, 94, <ext-link xlink:href="https://doi.org/10.1007/s10462-023-10698-8" ext-link-type="DOI">10.1007/s10462-023-10698-8</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx50"><label>UW-Madison Space Science and Engineering Center: Hank Revercomb; UMBC Atmospheric Spectroscopy Laboratory: Larrabee Strow(2018)</label><mixed-citation>UW-Madison Space Science and Engineering Center: Hank Revercomb; UMBC Atmospheric Spectroscopy Laboratory: Larrabee Strow: JPSS-1 CrIS Level 1B Full Spectral Resolution V2, Goddard Earth Sciences Data and Information Services Center (GES DISC) [data set], <ext-link xlink:href="https://doi.org/10.5067/EETSCFBDBLX6" ext-link-type="DOI">10.5067/EETSCFBDBLX6</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx51"><label>Veefkind et al.(2012)</label><mixed-citation>Veefkind, J., Aben, I., McMullan, K., Förster, H., de Vries, J., Otter, G., Claas, J., Eskes, H., de Haan, J., Kleipool, Q., van Weele, M., Hasekamp, O., Hoogeveen, R., Landgraf, J., Snel, R., Tol, P., Ingmann, P., Voors, R., Kruizinga, B., Vink, R., Visser, H., and Levelt, P.: TROPOMI on the ESA Sentinel-5 Precursor: a GMES mission for global observations of the atmospheric composition for climate, air quality and ozone layer applications, Remote Sens. Environ., 120, 70–83, <ext-link xlink:href="https://doi.org/10.1016/j.rse.2011.09.027" ext-link-type="DOI">10.1016/j.rse.2011.09.027</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx52"><label>von Clarmann and Glatthor(2019)</label><mixed-citation>von Clarmann, T. and Glatthor, N.: The application of mean averaging kernels to mean trace gas distributions, Atmos. Meas. Tech., 12, 5155–5160, <ext-link xlink:href="https://doi.org/10.5194/amt-12-5155-2019" ext-link-type="DOI">10.5194/amt-12-5155-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx53"><label>Werner et al.(2020)</label><mixed-citation>Werner, F., Schwartz, M. J., Livesey, N. J., Read, W. G., and Santee, M. L.: Extreme outliers in lower stratospheric water vapor over North America observed by MLS: relation to overshooting convection diagnosed from colocated Aqua-MODIS data, Geophys. Res. Lett., 47, e2020GL090131, <ext-link xlink:href="https://doi.org/10.1029/2020GL090131" ext-link-type="DOI">10.1029/2020GL090131</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx54"><label>Werner et al.(2021)</label><mixed-citation>Werner, F., Livesey, N. J., Schwartz, M. J., Read, W. G., Santee, M. L., and Wind, G.: Improved cloud detection for the Aura Microwave Limb Sounder (MLS): training an artificial neural network on colocated MLS and Aqua MODIS data, Atmos. Meas. Tech., 14, 7749–7773, <ext-link xlink:href="https://doi.org/10.5194/amt-14-7749-2021" ext-link-type="DOI">10.5194/amt-14-7749-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx55"><label>Werner et al.(2023)</label><mixed-citation>Werner, F., Livesey, N. J., Millán, L. F., Read, W. G., Schwartz, M. J., Wagner, P. A., Daffer, W. H., Lambert, A., Tolstoff, S. N., and Santee, M. L.: Applying machine learning to improve the near-real-time products of the Aura Microwave Limb Sounder, Atmos. Meas. Tech., 16, 2733–2751, <ext-link xlink:href="https://doi.org/10.5194/amt-16-2733-2023" ext-link-type="DOI">10.5194/amt-16-2733-2023</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx56"><label>Werner et al.(2025)</label><mixed-citation>Werner, F., Bowman, K. W., Lee, S., Laughner, J. L., Payne, V. H., and McDuffie, J. L.: Zenodo repository for A hybrid optimal estimation and machine learning approach to predict atmospheric composition, Zenodo [code], <ext-link xlink:href="https://doi.org/10.5281/zenodo.16968703" ext-link-type="DOI">10.5281/zenodo.16968703</ext-link>, 2025. </mixed-citation></ref>
      <ref id="bib1.bibx57"><label>Worden et al.(2013)</label><mixed-citation>Worden, H. M., Deeter, M. N., Frankenberg, C., George, M., Nichitiu, F., Worden, J., Aben, I., Bowman, K. W., Clerbaux, C., Coheur, P. F., de Laat, A. T. J., Detweiler, R., Drummond, J. R., Edwards, D. P., Gille, J. C., Hurtmans, D., Luo, M., Martínez-Alonso, S., Massie, S., Pfister, G., and Warner, J. X.: Decadal record of satellite carbon monoxide observations, Atmos. Chem. Phys., 13, 837–850, <ext-link xlink:href="https://doi.org/10.5194/acp-13-837-2013" ext-link-type="DOI">10.5194/acp-13-837-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx58"><label>Worden et al.(2022)</label><mixed-citation>Worden, H. M., Francis, G. L., Kulawik, S. S., Bowman, K. W., Cady-Pereira, K., Fu, D., Hegarty, J. D., Kantchev, V., Luo, M., Payne, V. H., Worden, J. R., Commane, R., and McKain, K.: TROPESS/CrIS carbon monoxide profile validation with NOAA GML and ATom in situ aircraft observations, Atmos. Meas. Tech., 15, 5383–5398, <ext-link xlink:href="https://doi.org/10.5194/amt-15-5383-2022" ext-link-type="DOI">10.5194/amt-15-5383-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx59"><label>Zeng et al.(2023)</label><mixed-citation>Zeng, Z.-C., Lee, L., Qi, C., Clarisse, L., and Van Damme, M.: Optimal estimation retrieval of tropospheric ammonia from the Geostationary Interferometric Infrared Sounder on board FengYun-4B, Atmos. Meas. Tech., 16, 3693–3713, <ext-link xlink:href="https://doi.org/10.5194/amt-16-3693-2023" ext-link-type="DOI">10.5194/amt-16-3693-2023</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx60"><label>Zheng et al.(2019)</label><mixed-citation>Zheng, B., Chevallier, F., Yin, Y., Ciais, P., Fortems-Cheiney, A., Deeter, M. N., Parker, R. J., Wang, Y., Worden, H. M., and Zhao, Y.: Global atmospheric carbon monoxide budget 2000–2017 inferred from multi-species atmospheric inversions, Earth Syst. Sci. Data, 11, 1411–1436, <ext-link xlink:href="https://doi.org/10.5194/essd-11-1411-2019" ext-link-type="DOI">10.5194/essd-11-1411-2019</ext-link>, 2019.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>A hybrid optimal estimation and machine learning approach to predict atmospheric composition</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Abadi et al.(2016)</label><mixed-citation>
       Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G. S., Davis, A., Dean, J., Devin, M., Ghemawat, S., Goodfellow, I., Harp, A., Irving, G., Isard, M., Jia, Y., Jozefowicz, R., Kaiser, L., Kudlur, M., Levenberg, J., Mane, D., Monga, R., Moore, S., Murray, D., Olah, C., Schuster, M., Shlens, J., Steiner, B., Sutskever, I., Talwar, K., Tucker, P., Vanhoucke, V., Vasudevan, V., Viegas, F., Vinyals, O., Warden, P., Wattenberg, M., Wicke, M., Yu, Y., and Zheng, X.: TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems, arXiv [preprint], arXiv1603.04467v2, Tue, 31 May 2016, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Aumann et al.(2003)</label><mixed-citation>
       Aumann, H., Chahine, M., Gautier, C., Goldberg, M., Kalnay, E., McMillin, L., Revercomb, H., Rosenkranz, P., Smith, W., Staelin, D., Strow, L., and Susskind, J.: AIRS/AMSU/HSB on the Aqua mission: design, science objectives, data products, and processing systems, IEEE T. Geosci. Remote, 41, 253–264, <a href="https://doi.org/10.1109/TGRS.2002.808356" target="_blank">https://doi.org/10.1109/TGRS.2002.808356</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Beer et al.(2001)</label><mixed-citation>
       Beer, R., Glavich, T. A., and Rider, D. M.: Tropospheric emission spectrometer for the Earth Observing System's Aura satellite, Appl. Optics, 40, 2356–2367, <a href="https://doi.org/10.1364/AO.40.002356" target="_blank">https://doi.org/10.1364/AO.40.002356</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Bowman and Henze(2012)</label><mixed-citation>
       Bowman, K. and Henze, D. K.: Attribution of direct ozone radiative forcing to spatially resolved emissions, Geophys. Res. Lett., 39, <a href="https://doi.org/10.1029/2012GL053274" target="_blank">https://doi.org/10.1029/2012GL053274</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Bowman et al.(2006)</label><mixed-citation>
       Bowman, K., Rodgers, C., Kulawik, S., Worden, J., Sarkissian, E., Osterman, G., Steck, T., Lou, M., Eldering, A., Shephard, M., Worden, H., Lampel, M., Clough, S., Brown, P., Rinsland, C., Gunson, M., and Beer, R.: Tropospheric emission spectrometer: retrieval method and error analysis, IEEE T. Geosci. Remote, 44, 1297–1307, <a href="https://doi.org/10.1109/TGRS.2006.871234" target="_blank">https://doi.org/10.1109/TGRS.2006.871234</a>, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Bowman(2021)</label><mixed-citation>
       Bowman, K. W.: TROPESS CrIS-JPSS1 L2 Carbon Monoxide for Forward Processing, Summary Product V1, NASA Goddard Earth Sciences Data and Information Services Center, 2022 [data set], <a href="https://doi.org/10.5067/JL1HT3NGEAW3" target="_blank">https://doi.org/10.5067/JL1HT3NGEAW3</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Buchholz et al.(2018)</label><mixed-citation>
       Buchholz, R. R., Hammerling, D., Worden, H. M., Deeter, M. N., Emmons, L. K., Edwards, D. P., and Monks, S. A.: Links between carbon monoxide and climate indices for the Southern Hemisphere and tropical fire regions, J. Geophys. Res.-Atmos., 123, 9786–9800, <a href="https://doi.org/10.1029/2018JD028438" target="_blank">https://doi.org/10.1029/2018JD028438</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Buchholz et al.(2021)</label><mixed-citation>
       Buchholz, R. R., Worden, H. M., Park, M., Francis, G., Deeter, M. N., Edwards, D. P., Emmons, L. K., Gaubert, B., Gille, J., Martínez-Alonso, S., Tang, W., Kumar, R., Drummond, J. R., Clerbaux, C., George, M., Coheur, P.-F., Hurtmans, D., Bowman, K. W., Luo, M., Payne, V. H., Worden, J. R., Chin, M., Levy, R. C., Warner, J., Wei, Z., and Kulawik, S. S.: Air pollution trends measured from Terra: CO and AOD over industrial, fire-prone, and background regions, Remote Sens. Environ., 256, 112275, <a href="https://doi.org/10.1016/j.rse.2020.112275" target="_blank">https://doi.org/10.1016/j.rse.2020.112275</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Byrne et al.(2021)</label><mixed-citation>
       Byrne, B., Liu, J., Lee, M., Yin, Y., Bowman, K. W., Miyazaki, K., Norton, A. J., Joiner, J., Pollard, D. F., Griffith, D. W. T., Velazco, V. A., Deutscher, N. M., Jones, N. B., and Paton-Walsh, C.: The carbon cycle of southeast Australia during 2019–2020: drought, fires, and subsequent recovery, AGU Advances, 2, e2021AV000469, <a href="https://doi.org/10.1029/2021AV000469" target="_blank">https://doi.org/10.1029/2021AV000469</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Byrne et al.(2024)</label><mixed-citation>
       Byrne, B., Liu, J., Bowman, K. W., Pascolini-Campbell, M., Chatterjee, A., Pandey, S., Miyazaki, K., van der Werf, G. R., Wunch, D., Wennberg, P. O., Roehl, C. M., and Sinha, S.: Carbon emissions from the 2023 Canadian wildfires, Nature, 633, 835–839, <a href="https://doi.org/10.1038/s41586-024-07878-z" target="_blank">https://doi.org/10.1038/s41586-024-07878-z</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Chollet et al.(2015)</label><mixed-citation>
       Chollet, F. et al.: Keras, <a href="https://keras.io" target="_blank"/> (last access: 29 April 2026), 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Clerbaux et al.(2002)</label><mixed-citation>
       Clerbaux, C., Hadji-Lazaro, J., Payan, S., Camy-Peyret, C., Wang, J., Edwards, D. P., and Luo, M.: Retrieval of CO from nadir remote-sensing measurements in the infrared by use of four different inversion algorithms, Appl. Optics, 41, 7068–7078, <a href="https://doi.org/10.1364/AO.41.007068" target="_blank">https://doi.org/10.1364/AO.41.007068</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Clerbaux et al.(2009)</label><mixed-citation>
       Clerbaux, C., Boynard, A., Clarisse, L., George, M., Hadji-Lazaro, J., Herbin, H., Hurtmans, D., Pommier, M., Razavi, A., Turquety, S., Wespes, C., and Coheur, P.-F.: Monitoring of atmospheric composition using the thermal infrared IASI/MetOp sounder, Atmos. Chem. Phys., 9, 6041–6054, <a href="https://doi.org/10.5194/acp-9-6041-2009" target="_blank">https://doi.org/10.5194/acp-9-6041-2009</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Davis et al.(1996)</label><mixed-citation>
       Davis, A., Marshak, A., Wiscombe, W., and Cahalan, R.: Scale invariance of liquid water distributions in marine stratocumulus. Part I: Spectral properties and stationarity issues, J. Atmos. Sci., 53, 1538–1558, <a href="https://doi.org/10.1175/1520-0469(1996)053&lt;1538:SIOLWD&gt;2.0.CO;2" target="_blank">https://doi.org/10.1175/1520-0469(1996)053&lt;1538:SIOLWD&gt;2.0.CO;2</a>, 1996.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Davis et al.(1997)</label><mixed-citation>
       Davis, A., Marshak, A., Cahalan, R., and Wiscombe, W.: The Landsat scale break in stratocumulus as a three-dimensional radiative transfer effect: implications for cloud remote sensing, J. Atmos. Sci., 54, 241–260, 1997.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Drummond et al.(2010)</label><mixed-citation>
       Drummond, J. R., Zou, J., Nichitiu, F., Kar, J., Deschambaut, R., and Hackett, J.: A review of 9-year performance and operation of the MOPITT instrument, Adv. Space Res., 45, 760–774, <a href="https://doi.org/10.1016/j.asr.2009.11.019" target="_blank">https://doi.org/10.1016/j.asr.2009.11.019</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Edwards et al.(2004)</label><mixed-citation>
       Edwards, D. P., Emmons, L. K., Hauglustaine, D. A., Chu, D. A., Gille, J. C., Kaufman, Y. J., Pétron, G., Yurganov, L. N., Giglio, L., Deeter, M. N., Yudin, V., Ziskin, D. C., Warner, J., Lamarque, J.-F., Francis, G. L., Ho, S. P., Mao, D., Chen, J., Grechko, E. I., and Drummond, J. R.: Observations of carbon monoxide and aerosols from the Terra satellite: Northern Hemisphere variability, J. Geophys. Res.-Atmos., 109, <a href="https://doi.org/10.1029/2004JD004727" target="_blank">https://doi.org/10.1029/2004JD004727</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Field et al.(2015)</label><mixed-citation>
       Field, R. D., Luo, M., Kim, D., Del Genio, A. D., Voulgarakis, A., and Worden, J.: Sensitivity of simulated tropospheric CO to subgrid physics parameterization: a case study of Indonesian biomass burning emissions in 2006, J. Geophys. Res.-Atmos., 120, 11743–11759, <a href="https://doi.org/10.1002/2015JD023402" target="_blank">https://doi.org/10.1002/2015JD023402</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Field et al.(2016)</label><mixed-citation>
       Field, R. D., Luo, M., Fromm, M., Voulgarakis, A., Mangeon, S., and Worden, J.: Simulating the Black Saturday 2009 smoke plume with an interactive composition-climate model: Sensitivity to emissions amount, timing, and injection height, J. Geophys. Res.-Atmos., 121, 4296–4316, <a href="https://doi.org/10.1002/2015JD024343" target="_blank">https://doi.org/10.1002/2015JD024343</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Franzke et al.(2020)</label><mixed-citation>
       Franzke, C. L. E., Barbosa, S., Blender, R., Fredriksen, H.-B., Laepple, T., Lambert, F., Nilsen, T., Rypdal, K., Rypdal, M., Scotto, M. G., Vannitsem, S., Watkins, N. W., Yang, L., and Yuan, N.: The structure of climate variability across scales, Rev. Geophys., 58, e2019RG000657, <a href="https://doi.org/10.1029/2019RG000657" target="_blank">https://doi.org/10.1029/2019RG000657</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Fu et al.(2016)</label><mixed-citation>
       Fu, D., Bowman, K. W., Worden, H. M., Natraj, V., Worden, J. R., Yu, S., Veefkind, P., Aben, I., Landgraf, J., Strow, L., and Han, Y.: High-resolution tropospheric carbon monoxide profiles retrieved from CrIS and TROPOMI, Atmos. Meas. Tech., 9, 2567–2579, <a href="https://doi.org/10.5194/amt-9-2567-2016" target="_blank">https://doi.org/10.5194/amt-9-2567-2016</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Fu et al.(2018)</label><mixed-citation>
       Fu, D., Kulawik, S. S., Miyazaki, K., Bowman, K. W., Worden, J. R., Eldering, A., Livesey, N. J., Teixeira, J., Irion, F. W., Herman, R. L., Osterman, G. B., Liu, X., Levelt, P. F., Thompson, A. M., and Luo, M.: Retrievals of tropospheric ozone profiles from the synergism of AIRS and OMI: methodology and validation, Atmos. Meas. Tech., 11, 5587–5605, <a href="https://doi.org/10.5194/amt-11-5587-2018" target="_blank">https://doi.org/10.5194/amt-11-5587-2018</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Fu et al.(2019)</label><mixed-citation>
       Fu, D., Millet, D. B., Wells, K. C., Payne, V. H., Yu, S., Guenther, A., and Eldering, A.: Direct retrieval of isoprene from satellite-based infrared measurements, Nat. Commun., 10, 3811, <a href="https://doi.org/10.1038/s41467-019-11835-0" target="_blank">https://doi.org/10.1038/s41467-019-11835-0</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Gaubert et al.(2017)</label><mixed-citation>
       Gaubert, B., Worden, H. M., Arellano, A. F. J., Emmons, L. K., Tilmes, S., Barré, J., Martinez Alonso, S., Vitt, F., Anderson, J. L., Alkemade, F., Houweling, S., and Edwards, D. P.: Chemical feedback from decreasing carbon monoxide emissions, Geophys. Res. Lett., 44, 9985–9995, <a href="https://doi.org/10.1002/2017GL074987" target="_blank">https://doi.org/10.1002/2017GL074987</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Goodfellow et al.(2016)</label><mixed-citation>
       Goodfellow, I., Bengio, Y., and Courville, A.: Deep Learning (Adaptive Computation and Machine Learning Series), The MIT Press, Cambridge, MA, ISBN-10 0262035618, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Grivas and Chaloulakou(2006)</label><mixed-citation>
       Grivas, G. and Chaloulakou, A.: Artificial neural network models for prediction of PM<sub>1</sub>0 hourly concentrations, in the Greater Area of Athens, Greece, Atmos. Environ., 40, 1216–1229, <a href="https://doi.org/10.1016/j.atmosenv.2005.10.036" target="_blank">https://doi.org/10.1016/j.atmosenv.2005.10.036</a>, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Han et al.(2013)</label><mixed-citation>
       Han, Y., Revercomb, H., Cromp, M., Gu, D., Johnson, D., Mooney, D., Scott, D., Strow, L., Bingham, G., Borg, L., Chen, Y., DeSlover, D., Esplin, M., Hagan, D., Jin, X., Knuteson, R., Motteler, H., Predina, J., Suwinski, L., Taylor, J., Tobin, D., Tremblay, D., Wang, C., Wang, L., Wang, L., and Zavyalov, V.: Suomi NPP CrIS measurements, sensor data record algorithm, calibration and validation activities, and record data quality, J. Geophys. Res.-Atmos., 118, 12734–12748, <a href="https://doi.org/10.1002/2013JD020344" target="_blank">https://doi.org/10.1002/2013JD020344</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Holloway et al.(2000)</label><mixed-citation>
       Holloway, T., Levy II, H., and Kasibhatla, P.: Global distribution of carbon monoxide, J. Geophys. Res.-Atmos., 105, 12123–12147, <a href="https://doi.org/10.1029/1999JD901173" target="_blank">https://doi.org/10.1029/1999JD901173</a>, 2000.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>IPCC(2023)</label><mixed-citation>
       IPCC: Climate Change 2023: Synthesis Report. A Report of the Intergovernmental Panel on Climate Change, IPCC, <a href="https://doi.org/10.59327/IPCC/AR6-9789291691647" target="_blank">https://doi.org/10.59327/IPCC/AR6-9789291691647</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Jacob(1999)</label><mixed-citation>
       Jacob, D.: Instroduction to Atmospheric Chemistry, Princeton University Press, ISBN 9780691001852, 1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Jain et al.(2024)</label><mixed-citation>
       Jain, P., Barber, Q. E., Taylor, S. W., Whitman, E., Castellanos Acuna, D., Boulanger, Y., Chavardès, R. D., Chen, J., Englefield, P., Flannigan, M., Girardin, M. P., Hanes, C. C., Little, J., Morrison, K., Skakun, R. S., Thompson, D. K., Wang, X., and Parisien, M.-A.: Drivers and impacts of the record-breaking 2023 wildfire season in Canada, Nat. Commun., 15, 6764, <a href="https://doi.org/10.1038/s41467-024-51154-7" target="_blank">https://doi.org/10.1038/s41467-024-51154-7</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Jones et al.(2003)</label><mixed-citation>
       Jones, D. B. A., Bowman, K. W., Palmer, P. I., Worden, J. R., Jacob, D. J., Hoffman, R. N., Bey, I., and Yantosca, R. M.: Potential of observations from the Tropospheric Emission Spectrometer to constrain continental sources of carbon monoxide, J. Geophys. Res.-Atmos., 108, <a href="https://doi.org/10.1029/2003JD003702" target="_blank">https://doi.org/10.1029/2003JD003702</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Knobelspiesse et al.(2019)</label><mixed-citation>
       Knobelspiesse, K., Tan, Q., Bruegge, C., Cairns, B., Chowdhary, J., van Diedenhoven, B., Diner, D., Ferrare, R., van Harten, G., Jovanovic, V., Ottaviani, M., Redemann, J., Seidel, F., and Sinclair, K.: Intercomparison of airborne multi-angle polarimeter observations from the Polarimeter Definition Experiment, Appl. Optics, 58, 650–669, <a href="https://doi.org/10.1364/AO.58.000650" target="_blank">https://doi.org/10.1364/AO.58.000650</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Lelieveld et al.(2016)</label><mixed-citation>
       Lelieveld, J., Gromov, S., Pozzer, A., and Taraborrelli, D.: Global tropospheric hydroxyl distribution, budget and reactivity, Atmos. Chem. Phys., 16, 12477–12493, <a href="https://doi.org/10.5194/acp-16-12477-2016" target="_blank">https://doi.org/10.5194/acp-16-12477-2016</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Luo et al.(2024)</label><mixed-citation>
       Luo, M., Worden, H. M., Field, R. D., Tsigaridis, K., and Elsaesser, G. S.: TROPESS-CrIS CO single-pixel vertical profiles: intercomparisons with MOPITT and model simulations for 2020 western US wildfires, Atmos. Meas. Tech., 17, 2611–2624, <a href="https://doi.org/10.5194/amt-17-2611-2024" target="_blank">https://doi.org/10.5194/amt-17-2611-2024</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Malina et al.(2024)</label><mixed-citation>
       Malina, E., Bowman, K. W., Kantchev, V., Kuai, L., Kurosu, T. P., Miyazaki, K., Natraj, V., Osterman, G. B., Oyafuso, F., and Thill, M. D.: Joint spectral retrievals of ozone with Suomi NPP CrIS augmented by S5P/TROPOMI, Atmos. Meas. Tech., 17, 5341–5371, <a href="https://doi.org/10.5194/amt-17-5341-2024" target="_blank">https://doi.org/10.5194/amt-17-5341-2024</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Miller et al.(2020)</label><mixed-citation>
       Miller, D. J., Segal-Rozenhaimer, M., Knobelspiesse, K., Redemann, J., Cairns, B., Alexandrov, M., van Diedenhoven, B., and Wasilewski, A.: Low-level liquid cloud properties during ORACLES retrieved using airborne polarimetric measurements and a neural network algorithm, Atmos. Meas. Tech., 13, 3447–3470, <a href="https://doi.org/10.5194/amt-13-3447-2020" target="_blank">https://doi.org/10.5194/amt-13-3447-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Miyazaki et al.(2015)</label><mixed-citation>
       Miyazaki, K., Eskes, H. J., and Sudo, K.: A tropospheric chemistry reanalysis for the years 2005–2012 based on an assimilation of OMI, MLS, TES, and MOPITT satellite data, Atmos. Chem. Phys., 15, 8315–8348, <a href="https://doi.org/10.5194/acp-15-8315-2015" target="_blank">https://doi.org/10.5194/acp-15-8315-2015</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Neyra-Nazarrett et al.(2025)</label><mixed-citation>
       Neyra-Nazarrett, O. A., Miyazaki, K., Bowman, K. W., and Saide, P. E.: An assessment of TROPESS CrIS and TROPOMI CO retrievals and their synergies for the 2020 Western U.S. wildfires, Remote Sens.-Basel, 17, <a href="https://doi.org/10.3390/rs17111854" target="_blank">https://doi.org/10.3390/rs17111854</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Nilsen et al.(2016)</label><mixed-citation>
       Nilsen, T., Rypdal, K., and Fredriksen, H.-B.: Are there multiple scaling regimes in Holocene temperature records?, Earth Syst. Dynam., 7, 419–439, <a href="https://doi.org/10.5194/esd-7-419-2016" target="_blank">https://doi.org/10.5194/esd-7-419-2016</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Noël et al.(2022)</label><mixed-citation>
       Noël, S., Reuter, M., Buchwitz, M., Borchardt, J., Hilker, M., Schneising, O., Bovensmann, H., Burrows, J. P., Di Noia, A., Parker, R. J., Suto, H., Yoshida, Y., Buschmann, M., Deutscher, N. M., Feist, D. G., Griffith, D. W. T., Hase, F., Kivi, R., Liu, C., Morino, I., Notholt, J., Oh, Y.-S., Ohyama, H., Petri, C., Pollard, D. F., Rettinger, M., Roehl, C., Rousogenous, C., Sha, M. K., Shiomi, K., Strong, K., Sussmann, R., Té, Y., Velazco, V. A., Vrekoussis, M., and Warneke, T.: Retrieval of greenhouse gases from GOSAT and GOSAT-2 using the FOCAL algorithm, Atmos. Meas. Tech., 15, 3401–3437, <a href="https://doi.org/10.5194/amt-15-3401-2022" target="_blank">https://doi.org/10.5194/amt-15-3401-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Reed and Marks(1999)</label><mixed-citation>
       Reed, R. and Marks, ll, R. J.: Neural Smithing: Supervised Learning in Feedforward Artificial Neural Networks, A Bradford Book, ISBN-10 0262181908, 1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Reichle Jr. et al.(1990)</label><mixed-citation>
       Reichle Jr., H. G., Connors, V. S., Holland, J. A., Sherrill, R. T., Wallio, H. A., Casas, J. C., Condon, E. P., Gormsen, B. B., and Seiler, W.: The distribution of middle tropospheric carbon monoxide during early October 1984, J. Geophys. Res.-Atmos., 95, 9845–9856, <a href="https://doi.org/10.1029/JD095iD07p09845" target="_blank">https://doi.org/10.1029/JD095iD07p09845</a>, 1990.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Rodgers(2000)</label><mixed-citation>
       Rodgers, C.: Inverse Methods for Atmospheric Sounding, World Scientific Publishing Co., ISBN-10 981022740X, 2000.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Saponaro et al.(2013)</label><mixed-citation>
       Saponaro, G., Kolmonen, P., Karhunen, J., Tamminen, J., and de Leeuw, G.: A neural network algorithm for cloud fraction estimation using NASA-Aura OMI VIS radiance measurements, Atmos. Meas. Tech., 6, 2301–2309, <a href="https://doi.org/10.5194/amt-6-2301-2013" target="_blank">https://doi.org/10.5194/amt-6-2301-2013</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Schultz et al.(2015)</label><mixed-citation>
       Schultz, M. G., Akimoto, H., Bottenheim, J., Buchmann, B., Galbally, I. E., Gilge, S., Helmig, D., Koide, H., Lewis, A. C., Novelli, P. C., Plass-Dülmer, C., Ryerson, T. B., Steinbacher, M., Steinbrecher, R., Tarasova, O., Tørseth, K., Thouret, V., and Zellweger, C.: The Global Atmosphere Watch reactive gases measurement network, Elementa: Science of the Anthropocene, 3, 000067, <a href="https://doi.org/10.12952/journal.elementa.000067" target="_blank">https://doi.org/10.12952/journal.elementa.000067</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Schultz et al.(2021)</label><mixed-citation>
       Schultz, M. G., Betancourt, C., Gong, B., Kleinert, F., Langguth, M., Leufen, L. H., Mozaffari, A., and Stadtler, S.: Can deep learning beat numerical weather prediction?, Philos. T. Roy. Soc. A, 379, <a href="https://doi.org/10.1098/rsta.2020.0097" target="_blank">https://doi.org/10.1098/rsta.2020.0097</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Strode et al.(2016)</label><mixed-citation>
       Strode, S. A., Worden, H. M., Damon, M., Douglass, A. R., Duncan, B. N., Emmons, L. K., Lamarque, J.-F., Manyin, M., Oman, L. D., Rodriguez, J. M., Strahan, S. E., and Tilmes, S.: Interpreting space-based trends in carbon monoxide with multiple models, Atmos. Chem. Phys., 16, 7285–7294, <a href="https://doi.org/10.5194/acp-16-7285-2016" target="_blank">https://doi.org/10.5194/acp-16-7285-2016</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Tyralis and Papacharalampous(2024)</label><mixed-citation>
       Tyralis, H. and Papacharalampous, G.: A review of predictive uncertainty estimation with machine learning, Artif. Intell. Rev., 57, 94, <a href="https://doi.org/10.1007/s10462-023-10698-8" target="_blank">https://doi.org/10.1007/s10462-023-10698-8</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>UW-Madison Space Science and Engineering Center: Hank Revercomb; UMBC Atmospheric Spectroscopy Laboratory: Larrabee Strow(2018)</label><mixed-citation>
       UW-Madison Space Science and Engineering Center: Hank Revercomb; UMBC Atmospheric Spectroscopy Laboratory: Larrabee Strow: JPSS-1 CrIS Level 1B Full Spectral Resolution V2, Goddard Earth Sciences Data and Information Services Center (GES DISC) [data set], <a href="https://doi.org/10.5067/EETSCFBDBLX6" target="_blank">https://doi.org/10.5067/EETSCFBDBLX6</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Veefkind et al.(2012)</label><mixed-citation>
       Veefkind, J., Aben, I., McMullan, K., Förster, H., de Vries, J., Otter, G., Claas, J., Eskes, H., de Haan, J., Kleipool, Q., van Weele, M., Hasekamp, O., Hoogeveen, R., Landgraf, J., Snel, R., Tol, P., Ingmann, P., Voors, R., Kruizinga, B., Vink, R., Visser, H., and Levelt, P.: TROPOMI on the ESA Sentinel-5 Precursor: a GMES mission for global observations of the atmospheric composition for climate, air quality and ozone layer applications, Remote Sens. Environ., 120, 70–83, <a href="https://doi.org/10.1016/j.rse.2011.09.027" target="_blank">https://doi.org/10.1016/j.rse.2011.09.027</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>von Clarmann and Glatthor(2019)</label><mixed-citation>
       von Clarmann, T. and Glatthor, N.: The application of mean averaging kernels to mean trace gas distributions, Atmos. Meas. Tech., 12, 5155–5160, <a href="https://doi.org/10.5194/amt-12-5155-2019" target="_blank">https://doi.org/10.5194/amt-12-5155-2019</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>Werner et al.(2020)</label><mixed-citation>
       Werner, F., Schwartz, M. J., Livesey, N. J., Read, W. G., and Santee, M. L.: Extreme outliers in lower stratospheric water vapor over North America observed by MLS: relation to overshooting convection diagnosed from colocated Aqua-MODIS data, Geophys. Res. Lett., 47, e2020GL090131, <a href="https://doi.org/10.1029/2020GL090131" target="_blank">https://doi.org/10.1029/2020GL090131</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Werner et al.(2021)</label><mixed-citation>
       Werner, F., Livesey, N. J., Schwartz, M. J., Read, W. G., Santee, M. L., and Wind, G.: Improved cloud detection for the Aura Microwave Limb Sounder (MLS): training an artificial neural network on colocated MLS and Aqua MODIS data, Atmos. Meas. Tech., 14, 7749–7773, <a href="https://doi.org/10.5194/amt-14-7749-2021" target="_blank">https://doi.org/10.5194/amt-14-7749-2021</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>Werner et al.(2023)</label><mixed-citation>
       Werner, F., Livesey, N. J., Millán, L. F., Read, W. G., Schwartz, M. J., Wagner, P. A., Daffer, W. H., Lambert, A., Tolstoff, S. N., and Santee, M. L.: Applying machine learning to improve the near-real-time products of the Aura Microwave Limb Sounder, Atmos. Meas. Tech., 16, 2733–2751, <a href="https://doi.org/10.5194/amt-16-2733-2023" target="_blank">https://doi.org/10.5194/amt-16-2733-2023</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Werner et al.(2025)</label><mixed-citation>
       Werner, F., Bowman, K. W.,
Lee, S., Laughner, J. L., Payne, V. H., and McDuffie, J. L.: Zenodo repository
for A hybrid optimal estimation and machine learning approach to predict
atmospheric composition, Zenodo [code], <a href="https://doi.org/10.5281/zenodo.16968703" target="_blank">https://doi.org/10.5281/zenodo.16968703</a>, 2025. 

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>Worden et al.(2013)</label><mixed-citation>
       Worden, H. M., Deeter, M. N., Frankenberg, C., George, M., Nichitiu, F., Worden, J., Aben, I., Bowman, K. W., Clerbaux, C., Coheur, P. F., de Laat, A. T. J., Detweiler, R., Drummond, J. R., Edwards, D. P., Gille, J. C., Hurtmans, D., Luo, M., Martínez-Alonso, S., Massie, S., Pfister, G., and Warner, J. X.: Decadal record of satellite carbon monoxide observations, Atmos. Chem. Phys., 13, 837–850, <a href="https://doi.org/10.5194/acp-13-837-2013" target="_blank">https://doi.org/10.5194/acp-13-837-2013</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>Worden et al.(2022)</label><mixed-citation>
       Worden, H. M., Francis, G. L., Kulawik, S. S., Bowman, K. W., Cady-Pereira, K., Fu, D., Hegarty, J. D., Kantchev, V., Luo, M., Payne, V. H., Worden, J. R., Commane, R., and McKain, K.: TROPESS/CrIS carbon monoxide profile validation with NOAA GML and ATom in situ aircraft observations, Atmos. Meas. Tech., 15, 5383–5398, <a href="https://doi.org/10.5194/amt-15-5383-2022" target="_blank">https://doi.org/10.5194/amt-15-5383-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>Zeng et al.(2023)</label><mixed-citation>
       Zeng, Z.-C., Lee, L., Qi, C., Clarisse, L., and Van Damme, M.: Optimal estimation retrieval of tropospheric ammonia from the Geostationary Interferometric Infrared Sounder on board FengYun-4B, Atmos. Meas. Tech., 16, 3693–3713, <a href="https://doi.org/10.5194/amt-16-3693-2023" target="_blank">https://doi.org/10.5194/amt-16-3693-2023</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>Zheng et al.(2019)</label><mixed-citation>
       Zheng, B., Chevallier, F., Yin, Y., Ciais, P., Fortems-Cheiney, A., Deeter, M. N., Parker, R. J., Wang, Y., Worden, H. M., and Zhao, Y.: Global atmospheric carbon monoxide budget 2000–2017 inferred from multi-species atmospheric inversions, Earth Syst. Sci. Data, 11, 1411–1436, <a href="https://doi.org/10.5194/essd-11-1411-2019" target="_blank">https://doi.org/10.5194/essd-11-1411-2019</a>, 2019.

    </mixed-citation></ref-html>--></article>
