<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">AMT</journal-id><journal-title-group>
    <journal-title>Atmospheric Measurement Techniques</journal-title>
    <abbrev-journal-title abbrev-type="publisher">AMT</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Atmos. Meas. Tech.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1867-8548</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/amt-19-4415-2026</article-id><title-group><article-title>Cloud fields and aerosol classification with lidar using advanced AI approach</article-title><alt-title>Cloud fields and aerosol classification with lidar using advanced AI approach</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Peleg</surname><given-names>Yonatan</given-names></name>
          <email>yoni.peleg@post.runi.ac.il</email>
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Zeida-Cohen</surname><given-names>Lior</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Tzror</surname><given-names>Imri</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Bühl</surname><given-names>Johannes</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3">
          <name><surname>Ansmann</surname><given-names>Albert</given-names></name>
          
        <ext-link>https://orcid.org/0000-0001-5382-8440</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff4">
          <name><surname>Chudnovsky</surname><given-names>Alexandra</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-8631-4275</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Yakhini</surname><given-names>Zohar</given-names></name>
          
        </contrib>
        <aff id="aff1"><label>1</label><institution>Efi Arazi School of Computer Science, Reichman University, Herzliya, Israel</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Harz University of Applied Sciences, Wernigerode, Germany</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Leibniz Institute for Tropospheirc Research (TROPOS), Leibniz, Germany</institution>
        </aff>
        <aff id="aff4"><label>4</label><institution>Faculty of Exact Sciences, Department of Geophysics, Tel Aviv University, Tel Aviv, Israel</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Yonatan Peleg (yoni.peleg@post.runi.ac.il)</corresp></author-notes><pub-date><day>3</day><month>July</month><year>2026</year></pub-date>
      
      <volume>19</volume>
      <issue>13</issue>
      <fpage>4415</fpage><lpage>4439</lpage>
      <history>
        <date date-type="received"><day>26</day><month>October</month><year>2025</year></date>
           <date date-type="rev-request"><day>26</day><month>January</month><year>2026</year></date>
           <date date-type="rev-recd"><day>8</day><month>June</month><year>2026</year></date>
           <date date-type="accepted"><day>21</day><month>June</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Yonatan Peleg et al.</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026.html">This article is available from https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026.html</self-uri><self-uri xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026.pdf">The full text article is available as a PDF file from https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e157">Understanding the vertical distribution of aerosol and clouds i.s critical for climate modeling, weather forecasting, and air quality monitoring. Lidar observations are central to profiling atmospheric composition, yet signal attenuation in optically thick layers limits the effective retrieval of some important properties above those layers. More complex measurement approaches, using a combination of Lidar and cloud radar systems, can be taken to support more inclusive and accurate inference. In this study, we develop a deep learning framework to address this trade-off and gap in the cost of data acquisition by enabling full-column aerosol and cloud classification using only standard lidar inputs, achieving particularly high skill for aerosol typing while demonstrating robust, physically consistent classification of ice-cloud fields even under conditions of strong lidar signal attenuation, with liquid-cloud uncertainties primarily arising from closely related microphysical classes. The approach is based on a U-Net architecture trained to predict combined aerosol and cloud types from vertical profiles of backscatter and depolarization. Classification targets integrate established aerosol typing from PollyXT with cloud and precipitation categorization from Cloudnet, facilitating a unified scheme. The model achieves high precision, recall, and F1-scores above 95 <inline-formula><mml:math id="M1" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula>. By evaluating numerous complex case studies, we establish the model's ability to exploit information embedded in the lidar signal below attenuating layers, including structural and contextual features, to infer atmospheric conditions at higher altitudes, offering a robust AI-based enhancement to lidar-based atmospheric profiling and target classification. The application of AI in this context closes the gap between the need for vertical cloud maps and the sparse availability of Cloudnet.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e177">Aerosols and clouds are fundamental components of the Earth's atmosphere, exerting profound influences on the planet's climate system. They modulate radiative energy balance through scattering and absorption of solar and terrestrial radiation, significantly impacting the hydrological cycle through precipitation processes <xref ref-type="bibr" rid="bib1.bibx32 bib1.bibx1" id="paren.1"/>. They also play a role in atmospheric chemistry by interacting with gases and providing surfaces for chemical reactions to occur <xref ref-type="bibr" rid="bib1.bibx19 bib1.bibx21 bib1.bibx26" id="paren.2"/>. Clouds, in particular, have large but opposing effects on short-wave and long-wave radiation, resulting in a significant net cooling effect globally, although the magnitude remains uncertain <xref ref-type="bibr" rid="bib1.bibx17" id="paren.3"/>.  Aerosol-cloud interactions (ACI) represent one of the largest uncertainties in current climate projections <xref ref-type="bibr" rid="bib1.bibx19 bib1.bibx38" id="paren.4"/>. Understanding the vertical distribution and properties of both aerosol and clouds is therefore critical to accurately understand and quantify their climatic impacts and to improve the representation of atmospheric processes in weather prediction and climate models <xref ref-type="bibr" rid="bib1.bibx43 bib1.bibx42 bib1.bibx34" id="paren.5"/>.</p>
      <p id="d2e195">Significant progress has been made in the development of remote sensing techniques capable of profiling atmospheric constituents. Lidar has emerged as a powerful tool for providing detailed vertical profiles of aerosol particles and clouds with high spatial and temporal resolution <xref ref-type="bibr" rid="bib1.bibx9" id="paren.6"/>. Advanced lidar systems, such as multiwavelength Raman and polarization lidars, can retrieve not only the vertical distribution of aerosol backscatter but also intensive optical properties. These properties include the lidar ratio (extinction-to-backscatter ratio) and the particle linear depolarization ratio, which provide crucial information about particle size, shape, and absorption characteristics, enabling the classification of different aerosol types and their different vertical distribution (e.g. dust, smoke, marine, urban haze) <xref ref-type="bibr" rid="bib1.bibx35 bib1.bibx36 bib1.bibx3" id="paren.7"/>. Networks like PollyNET, using standardized and automated PollyXT (POrtabLe Lidar sYstem with eXTended capabilities) <xref ref-type="bibr" rid="bib1.bibx12" id="paren.8"/> lidars, demonstrate the capability for continuous, near-real-time monitoring and characterization of aerosol profiles in diverse global locations <xref ref-type="bibr" rid="bib1.bibx4 bib1.bibx3" id="paren.9"/>.</p>
      <p id="d2e210">Lidar is highly effective for detecting aerosols and thin cloud layers, yet it is constrained by signal attenuation <xref ref-type="bibr" rid="bib1.bibx43 bib1.bibx42" id="paren.10"/>. The laser beam can be strongly scattered and absorbed by dense atmospheric constituents, particularly liquid water droplets <xref ref-type="bibr" rid="bib1.bibx16" id="paren.11"/>. In optically thick clouds, especially those containing liquid water, the lidar signal is usually fully attenuated within a few hundred meters above the cloud base, typically at optical depths (<inline-formula><mml:math id="M2" display="inline"><mml:mi mathvariant="italic">τ</mml:mi></mml:math></inline-formula>) around 3–5 <xref ref-type="bibr" rid="bib1.bibx44" id="paren.12"/>. This attenuation prevents the lidar from probing the full vertical extent of the cloud, hindering the characterization of the height of the cloud top, the internal structure, and the thermodynamic phase <xref ref-type="bibr" rid="bib1.bibx23" id="paren.13"/>. This physical limitation is the primary reason why synergistic approaches like Cloudnet rely on cloud radar, which can easily penetrate multiple cloud layers, to provide information above the lidar attenuation height <xref ref-type="bibr" rid="bib1.bibx8" id="paren.14"/>.</p>
      <p id="d2e236">Cloudnet integrates measurements from a lidar ceilometer, a cloud radar (typically millimeter-wavelength), a microwave radiometer (for integrated liquid water path), and thermodynamic profiles (temperature, humidity) from numerical weather prediction models. Its synergistic infrastructure is essential for retrieving continuous cloud micro-physical properties, continuously evaluating numerical weather prediction and climate models, and advancing our understanding of aerosol-cloud interaction. It also provides a detailed target classification, distinguishing between clear sky, aerosol, various cloud phases (liquid droplets, supercooled liquid, ice), precipitation types (drizzle, rain, snow), and even non-meteorological targets like insects <xref ref-type="bibr" rid="bib1.bibx18" id="paren.15"/>. Due to the complexity and high cost of operating and maintaining full Cloudnet setups, there is a need for new scalable, data driven methods that can extract comparable cloud information from single instrument observations such as lidar. While techniques exist to determine cloud base from lidar <xref ref-type="bibr" rid="bib1.bibx31" id="paren.16"/>, accurately classifying the entire cloud column using only lidar data remains a significant challenge, particularly for multi-layer, mixed-phase, or deep convective cloud systems <xref ref-type="bibr" rid="bib1.bibx23" id="paren.17"/>.</p>
      <p id="d2e249">The rapid advancement of deep learning, particularly through architectures such as Convolutional Neural Networks (CNNs) and the U-Net, has introduced innovative approaches for analyzing complex datasets. These methods have achieved outstanding performance in pattern recognition and image segmentation across a wide range of disciplines <xref ref-type="bibr" rid="bib1.bibx24 bib1.bibx25 bib1.bibx37 bib1.bibx33" id="paren.18"/>. In atmospheric science, deep learning has already demonstrated significant potential, for example in cloud phase classification from radar Doppler spectra <xref ref-type="bibr" rid="bib1.bibx40" id="paren.19"/>, cloud detection and segmentation from satellite imagery, and the analysis of lidar point clouds <xref ref-type="bibr" rid="bib1.bibx6" id="paren.20"/>.  A particularly compelling application lies in addressing the long-standing problem of signal attenuation in lidar observations. Conventional retrieval methods are limited once the backscatter signal is extinguished, leaving the atmospheric structure above the attenuation height poorly constrained. Deep learning offers a new perspective: by leveraging the rich set of features, correlations, and contextual cues embedded in the portion of the lidar profile below the attenuation threshold, and potentially its spatial and temporal evolution <xref ref-type="bibr" rid="bib1.bibx5" id="paren.21"/>, it infers the atmospheric properties above. A U-Net model, with its strong capacity for hierarchical feature extraction and pattern recognition, is especially well suited to capture these subtle and non-intuitive relationships.</p>
      <p id="d2e264">Machine learning has been increasingly applied to the detection and classification of aerosols and clouds from lidar observations. In most approaches, lidar measurements, commonly represented as time–height or along-track–height cross-sections, are treated as two-dimensional images. This formulation allows CNNs and U-Net architectures to learn spatial textures, morphological features, and contextual patterns directly from the data. For example, a CNN was developed for cloud–aerosol discrimination using only lidar measurements from NASA's Ice, Cloud, and Land Elevation Satellite (ICESat-2) <xref ref-type="bibr" rid="bib1.bibx30" id="paren.22"/>. Similarly, a U-Net model enhanced with self-attention mechanisms was constructed to classify cloud and aerosol layers in atmospheric vertical profiles using CALIPSO L1 data <xref ref-type="bibr" rid="bib1.bibx46" id="paren.23"/>. Both studies demonstrated the capacity of deep learning to reliably separate aerosols from clouds. However, their focus remained limited to binary cloud–aerosol discrimination, without further subdivision into specific categories. Another recent study introduced a multitask machine learning framework for space-based lidar, capable of simultaneous cloud-aerosol discrimination and aerosol typing <xref ref-type="bibr" rid="bib1.bibx14" id="paren.24"/>.  While their approach successfully improves the spatial resolution of retrievals compared to standard products, their model is trained on lidar-derived optical products and is therefore strictly bound by the physical signal limitations of the lidar instrument itself. Consequently, the model cannot infer or characterize atmospheric structures in regions where the lidar signal is fully attenuated.</p>
      <p id="d2e276">Other direction of research has targeted aerosol sub-classification using lidar data. For example, one study applied traditional algorithms to first detect atmospheric layers and compute their integrated optical properties, where these derived feature vectors were subsequently classified by a standard artificial neural network (ANN) <xref ref-type="bibr" rid="bib1.bibx29" id="paren.25"/>. A more recent comparison of six machine learning models for aerosol typing identified LightGBM as the most effective <xref ref-type="bibr" rid="bib1.bibx10" id="paren.26"/>. While these efforts highlight the promise of machine learning for aerosol categorization, they do not extend to the joint classification of cloud and aerosol subtypes. Importantly, distinct aerosol and cloud categories often exhibit complex cross-category and cross-type interactions. Capturing these interactions requires integrated datasets that explicitly combine both aerosol and cloud categories as classification targets.</p>
      <p id="d2e285">This paper introduces a deep learning methodology aimed at achieving unified aerosol and cloud classification throughout the vertical atmospheric column using only standard lidar measurements as input. The approach utilizes a U-Net architecture trained end-to-end to map vertical profiles of lidar backscatter and depolarization to a combined target classification derived from PollyXT aerosol typing and Cloudnet categorization. Crucially, while elastic lidar observations are fundamentally limited by complete signal attenuation in optically thick clouds, our architecture leverages contextual learning to look beyond this physical barrier. Rather than attempting to retrieve impossible optical properties above the attenuation limit, the network generates probabilistic inferences for these upper atmospheric classes. These classifications are strictly constrained by the observed vertical structure below the cloud top and the surrounding thermodynamic context, offering a novel predictive capability where direct lidar observation fails.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Data</title>
      <p id="d2e296">The primary input to the deep learning model consists of vertically resolved profiles obtained from ground-based lidar measurements in Limassol, Cyprus, between November 2016 and April 2018. Data were formatted as two-dimensional (2D) time-height images representing a sequence of profiles. Using 2D inputs allows the CNN architecture to exploit spatio-temporal context, capturing dynamic features or advection patterns relevant to the classification task. Our data consists of a temporal resolution of 90 <inline-formula><mml:math id="M3" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">s</mml:mi></mml:mrow></mml:math></inline-formula> and a vertical resolution of 37 <inline-formula><mml:math id="M4" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula>, where each image spans over 24 <inline-formula><mml:math id="M5" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:math></inline-formula> and 22.5 <inline-formula><mml:math id="M6" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula> in height. Data were provided by the Leibniz Institute for Tropospheric Research (TROPOS). The terms “sample” and “image” will be used intermittently to describe one 2D (time-height) input, where an “image” is a multi-channel time-height dataset.</p>
      <p id="d2e331">All input features were selected based on their established relevance in lidar-based classification frameworks. Optical channels primarily drive aerosol discrimination via particle size and shape, while thermodynamic variables (temperature and pressure) constrain physically plausible cloud phases. The input features used in this study include: attenuated backscatter coefficient at 532 and 1064 <inline-formula><mml:math id="M7" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">nm</mml:mi></mml:mrow></mml:math></inline-formula>, aerosol backscatter coefficients at 532 and 1064 <inline-formula><mml:math id="M8" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">nm</mml:mi></mml:mrow></mml:math></inline-formula>, particle depolarization ratio at 532 <inline-formula><mml:math id="M9" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">nm</mml:mi></mml:mrow></mml:math></inline-formula>, volume depolarization ratio at 532 <inline-formula><mml:math id="M10" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">nm</mml:mi></mml:mrow></mml:math></inline-formula>, the backscatter-related Ångström exponent between 532 and 1064 <inline-formula><mml:math id="M11" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">nm</mml:mi></mml:mrow></mml:math></inline-formula>, model pressure and model temperature. For generation of the training dataset, all variables of the Cloudnet processing scheme were mapped to the PollyNET time-height grid. All variables from the Cloudnet processing scheme were interpolated onto the finer PollyXT time-height grid (90 <inline-formula><mml:math id="M12" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">s</mml:mi></mml:mrow></mml:math></inline-formula>, 37 <inline-formula><mml:math id="M13" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">m</mml:mi></mml:mrow></mml:math></inline-formula>). To avoid the introduction of artifacts or the blending of discrete categorical classes that would result from numerical averaging, this mapping was performed using nearest-neighbor interpolation. Consequently, the Cloudnet data was simply replicated onto the finer lidar grid, leaving the original categorical values entirely untouched.</p>
      <p id="d2e391">The target variable represents a unified classification that combines aerosol and cloud/precipitation types for each vertical bin in the profile. The construction of this unified mask follows a straightforward, rule-based merging strategy. The foundational mask is derived from the PollyXT target categorization algorithm <xref ref-type="bibr" rid="bib1.bibx3" id="paren.27"/>. To integrate comprehensive cloud and precipitation data, this base mask is subsequently overwritten by the Cloudnet target classification <xref ref-type="bibr" rid="bib1.bibx18" id="paren.28"/> in any pixel where the Cloudnet radar detects cloud or precipitation particles.</p>

      <fig id="F1" specific-use="star"><label>Figure 1</label><caption><p id="d2e403">Unified atmospheric target classification mask. A 2D time-height training label for 3 November 2016, created by integrating PollyXT aerosol typing and Cloudnet cloud/precipitation categorization. The vertical axis represents altitude (up to 22.5 <inline-formula><mml:math id="M14" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">km</mml:mi></mml:mrow></mml:math></inline-formula>), and the horizontal axis represents 24 <inline-formula><mml:math id="M15" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">h</mml:mi></mml:mrow></mml:math></inline-formula> of observations at 90-s resolution. Classes range from clear atmosphere (Class 1) to specific aerosol types (Classes 3–6) and various cloud phases including water droplets and ice crystals (Classes 8–11).</p></caption>
        <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f01.png"/>

      </fig>

      <p id="d2e428">This merging strategy leverages the complementary physical capabilities of both instruments. Cloud radar excels at detecting ice crystals and penetrating dense cloud structures, whereas lidar is highly sensitive to aerosols and optically thin clouds. Consequently, radar classification takes precedence, where applicable, while lidar classification serves as the default in regions without a radar signal. This simple but effective rule preserves the detailed PollyXT aerosol classification, leverages Cloudnet to improve the representation of precipitation and thick clouds, and retains high-altitude ice clouds that are often too weak to exceed the radar's sensitivity threshold. The label classes are numbered as detailed in Table <xref ref-type="table" rid="T1"/>, and Fig. <xref ref-type="fig" rid="F1"/> shows an example target image that the model aims to learn.</p>

<table-wrap id="T1"><label>Table 1</label><caption><p id="d2e438">Classification Labels.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="2">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Class Number</oasis:entry>
         <oasis:entry colname="col2">Class Label</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">0</oasis:entry>
         <oasis:entry colname="col2">No class</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1</oasis:entry>
         <oasis:entry colname="col2">Clean atmosphere</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2</oasis:entry>
         <oasis:entry colname="col2">None-typed particles/low concentration</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3</oasis:entry>
         <oasis:entry colname="col2">Aerosol small</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4</oasis:entry>
         <oasis:entry colname="col2">Aerosol large spherical</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5</oasis:entry>
         <oasis:entry colname="col2">Aerosol mixture partly non-spherical</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6</oasis:entry>
         <oasis:entry colname="col2">Aerosol large non-spherical</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7</oasis:entry>
         <oasis:entry colname="col2">Cloud non-typed</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8</oasis:entry>
         <oasis:entry colname="col2">Cloud water droplets</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9</oasis:entry>
         <oasis:entry colname="col2">Cloud likely water droplets</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">10</oasis:entry>
         <oasis:entry colname="col2">Cloud ice crystals</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">11</oasis:entry>
         <oasis:entry colname="col2">Cloud likely ice crystals</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

<table-wrap id="T2" specific-use="star"><label>Table 2</label><caption><p id="d2e578">Skewness of input features before and after the <inline-formula><mml:math id="M16" display="inline"><mml:mrow><mml:mi>log⁡</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> transformation across the full dataset.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Feature Name</oasis:entry>
         <oasis:entry colname="col2">Original Skewness</oasis:entry>
         <oasis:entry colname="col3">Transformed Skewness</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Aerosol backscatter (532 <inline-formula><mml:math id="M17" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">nm</mml:mi></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col2">16.6148</oasis:entry>
         <oasis:entry colname="col3">16.6140</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Aerosol backscatter (1064 <inline-formula><mml:math id="M18" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">nm</mml:mi></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col2">14.7135</oasis:entry>
         <oasis:entry colname="col3">14.7128</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Attenuated backscatter (532 <inline-formula><mml:math id="M19" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">nm</mml:mi></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col2">14.3346</oasis:entry>
         <oasis:entry colname="col3">14.3343</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Attenuated backscatter (1064 <inline-formula><mml:math id="M20" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">nm</mml:mi></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col2">13.5441</oasis:entry>
         <oasis:entry colname="col3">13.5439</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Particle depolarization ratio (532 <inline-formula><mml:math id="M21" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">nm</mml:mi></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col2">15.9032</oasis:entry>
         <oasis:entry colname="col3">6.6019</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Volume depolarization ratio (532 <inline-formula><mml:math id="M22" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">nm</mml:mi></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col2">14.6148</oasis:entry>
         <oasis:entry colname="col3">7.2943</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Ångström exponent (<inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:mn mathvariant="normal">532</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">1064</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M24" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">nm</mml:mi></mml:mrow></mml:math></inline-formula>)</oasis:entry>
         <oasis:entry colname="col2">1.5965</oasis:entry>
         <oasis:entry colname="col3">0.5856</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Model pressure</oasis:entry>
         <oasis:entry colname="col2">0.6173</oasis:entry>
         <oasis:entry colname="col3"><inline-formula><mml:math id="M25" display="inline"><mml:mo>-</mml:mo></mml:math></inline-formula>0.1458</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Model temperature</oasis:entry>
         <oasis:entry colname="col2">0.3239</oasis:entry>
         <oasis:entry colname="col3">0.2224</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Methodology</title>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Data preprocessing</title>
      <p id="d2e826">Prior to ingestion by the U-Net model, the raw lidar data were subjected to a series of preprocessing steps to ensure data quality and suitability for the network architecture. Initially, each input sample was filtered to conform to the expected input dimensions of 600 vertical bins by 960 time steps. Instances containing missing timestamps were identified; since they constituted only 3 <inline-formula><mml:math id="M26" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> of the entire dataset, these samples were excluded from the training, validation and testing sets. Subsequently, the lidar data were clipped to zero, given that lidar-derived physical quantities such as backscatter and depolarization ratios are inherently non-negative. If negative values exist in lidar data, they typically resulted from instrumental noise or background subtraction procedures, particularly in regions with low signal-to-noise ratios. Next, we addressed NaN entries, which occur in lidar profiles due to factors such as complete signal attenuation in dense clouds, low signal-to-noise ratios in pristine regions, or instrument malfunction. Because neural networks cannot mathematically process NaN values, numerical imputation was a structural necessity. These missing values were replaced using the global average of the respective feature. However, because missing data in lidar often represents a physically meaningful state rather than a mere absence of measurement, it was critical to ensure this gap-filling did not introduce bias. To prevent the model from confusing a physical measurement with an imputed value, we engineered a corresponding binary indicator feature for each input variable. This indicator took a value of 1 if the original data at that specific pixel had been a NaN (and was subsequently imputed), and 0 otherwise. This crucial step provided the model with explicit information about the original data quality at each point, allowing it to reliably learn the distinction between valid signals and physically attenuated regions <xref ref-type="bibr" rid="bib1.bibx22" id="paren.29"/>.</p>
      <p id="d2e840">To address the highly skewed distributions of lidar signals, which often span several orders of magnitude, a <inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:mi>log⁡</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> transformation was applied uniformly to the input features. This specific transformation was strictly necessary and chosen over standard logarithmic or square root alternatives. Because the lidar data arrays were clipped to zero to remove instrumental noise, they contain true zero values where <inline-formula><mml:math id="M28" display="inline"><mml:mrow><mml:mi>log⁡</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is undefined. The <inline-formula><mml:math id="M29" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> shift safely handles these zero-signal regions without generating artificial missing values, while compressing the dynamic range of high-intensity measurements sufficiently to prevent them from dominating the neural network's loss function. To quantitatively evaluate the effectiveness of this transformation over the full dataset, we calculated the Fisher–Pearson coefficient of skewness <xref ref-type="bibr" rid="bib1.bibx11" id="paren.30"/>, for each feature before and after the transformation:

                <disp-formula id="Ch1.Ex1"><mml:math id="M30" display="block"><mml:mstyle class="stylechange" displaystyle="true"/><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi>g</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>N</mml:mi></mml:mfrac></mml:mstyle><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:mrow><mml:mrow><mml:msup><mml:mfenced open="(" close=")"><mml:mrow><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>N</mml:mi></mml:mfrac></mml:mstyle><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfenced><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula>

          where <inline-formula><mml:math id="M31" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> is the total number of pixels in the dataset, <inline-formula><mml:math id="M32" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the individual pixel value, and <inline-formula><mml:math id="M33" display="inline"><mml:mover accent="true"><mml:mi>x</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover></mml:math></inline-formula> is the mean of the feature. A skewness value of 0 indicates perfect symmetry. As detailed in Table <xref ref-type="table" rid="T2"/>, the raw lidar variables exhibited extreme positive skewness. The transformation successfully reduced this skewness, resulting in more symmetrical, log-normal distributions. Notably, while the transformation heavily compressed the long right tails of certain features (such as the volume depolarization ratio and the backscatter-related Ångström exponent), the backscatter coefficients showed minimal changes in their global skewness metrics. This is an expected mathematical behavior: because backscatter coefficients consist of exceptionally small magnitude values (often on the order of <inline-formula><mml:math id="M34" display="inline"><mml:mrow><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>), <inline-formula><mml:math id="M35" display="inline"><mml:mi>x</mml:mi></mml:math></inline-formula> is close to 0, making <inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:mi>log⁡</mml:mi><mml:mo>(</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>+</mml:mo><mml:mi>x</mml:mi><mml:mo>)</mml:mo><mml:mo>≈</mml:mo><mml:mi>x</mml:mi></mml:mrow></mml:math></inline-formula>. Thus, the transformation acts as a safe, near-linear pass-through for these specific channels, preserving their underlying structural variance while stabilizing the broader feature space.</p>
      <p id="d2e1066">Following the logarithmic transformation, the features were standardized according to:

                <disp-formula id="Ch1.Ex2"><mml:math id="M37" display="block"><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>z</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mi mathvariant="italic">μ</mml:mi></mml:mrow><mml:mi mathvariant="italic">σ</mml:mi></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula>

          where <inline-formula><mml:math id="M38" display="inline"><mml:mi mathvariant="italic">μ</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M39" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula> were calculated from the log-transformed per-feature training data, and then applied to the validation and test datasets. Standardization transforms the features to have a mean of approximately zero and a standard deviation of approximately one, ensuring that all features contribute more equally to the learning process, preventing domination by features with larger numerical ranges, and promoting faster and more stable training <xref ref-type="bibr" rid="bib1.bibx20" id="paren.31"/>.</p>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Learning</title>
      <p id="d2e1122">The dataset was partitioned into training, validation, and testing subsets in a 70:10:20 ratio, resulting in 284 samples allocated for training, 40 for validation, and 80 for testing. Care was taken to ensure that each subset maintained a similar variance in the distribution of the total distinct number of atmospheric classes present within each sample. For each subset, let <inline-formula><mml:math id="M40" display="inline"><mml:mi>I</mml:mi></mml:math></inline-formula> be the set of all samples and <inline-formula><mml:math id="M41" display="inline"><mml:mi>c</mml:mi></mml:math></inline-formula> be a specific class. Thus:

                <disp-formula specific-use="align" content-type="numbered"><mml:math id="M42" display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>I</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mi>i</mml:mi><mml:mo>∈</mml:mo><mml:mi>I</mml:mi><mml:mo>|</mml:mo><mml:mtext> sample </mml:mtext><mml:mi>i</mml:mi><mml:mtext> contains class </mml:mtext><mml:mi>c</mml:mi><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E1"><mml:mtd><mml:mtext>1</mml:mtext></mml:mtd><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mtext>Count</mml:mtext><mml:mi>c</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mo>|</mml:mo><mml:msub><mml:mi>I</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>|</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          We then plot the kernel density estimate (KDE) of each subset and compare the curves to make sure that the distributions behave similarly (Fig. <xref ref-type="fig" rid="F2"/>). This stratified approach helped ensure that each dataset split was representative of the overall class diversity, preventing potential biases, and ensuring robust model training and evaluation.</p>

      <fig id="F2" specific-use="star"><label>Figure 2</label><caption><p id="d2e1208">Dataset stratification analysis. Kernel Density Estimate (KDE) plot showing the frequency of the 12 atmospheric classes across the Training (70 <inline-formula><mml:math id="M43" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula>), Validation (10 <inline-formula><mml:math id="M44" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula>), and Test (20 <inline-formula><mml:math id="M45" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula>) subsets. The alignment of the curves ensures that each split is representative of the overall atmospheric diversity, preventing class frequency bias during model evaluation.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f02.png"/>

        </fig>

      <p id="d2e1241">A U-Net architecture (Fig. <xref ref-type="fig" rid="F3"/>) was selected for this task due to its demonstrated success in image segmentation, including applications in earth and atmospheric sciences <xref ref-type="bibr" rid="bib1.bibx13" id="paren.32"/> <xref ref-type="bibr" rid="bib1.bibx39" id="paren.33"/> <xref ref-type="bibr" rid="bib1.bibx15" id="paren.34"/> and other applications <xref ref-type="bibr" rid="bib1.bibx27" id="paren.35"/>. Its structure is particularly well-suited for tasks requiring both contextual understanding and precise localization. The architecture follows a symmetric encoder-decoder structure with skip connections, tailored to capture both hierarchical features and fine-grained spatial details. The encoder decreases the dimensions of the input and increases the number of feature channels, and the decoder increases the dimensions of the bridge data and decreases the number of feature channels.</p>

      <fig id="F3" specific-use="star"><label>Figure 3</label><caption><p id="d2e1261">U-Net architecture for pixel-wise atmospheric segmentation. This U-Net-style architecture features an encoder, a bottleneck, and a decoder. The encoder captures context through four downsampling blocks, where each block applies two sequential <inline-formula><mml:math id="M46" display="inline"><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> convolution layers (followed by batch normalization and ReLU) with an increasing number of filters (64, 128, 256, 512), a dropout layer with a rate of 0.1, and a final <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula> max-pooling operation. Feature maps from the encoder are passed to the decoder via skip connections. The central bottleneck layer consists of two <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> convolutions with 1024 filters, batch norm, ReLU, and a dropout layer with a rate of 0.2. The decoder path enables exact localization by mirroring the encoder. Each of its four upsampling blocks uses a <inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula> transposed convolution, concatenates its output with the corresponding feature map from the skip connection, and applies two more <inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> convolutions. The network terminates with a <inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> convolution and a softmax activation function to generate the final pixel-wise segmentation mask.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f03.png"/>

        </fig>

      <p id="d2e1343">To effectively train the U-Net model for our multi-class segmentation task, a composite loss function was employed. This tailored loss function was designed to address two key challenges: achieving accurate segmentation for each individual atmospheric class, many of which are imbalanced, and specifically penalizing confusion between aerosol and clouds. The total loss, <inline-formula><mml:math id="M52" display="inline"><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mtext>total</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>, is defined as:

                <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M53" display="block"><mml:mstyle displaystyle="true" class="stylechange"/><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>L</mml:mi><mml:mtext>total</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>L</mml:mi><mml:mtext>MWSD</mml:mtext></mml:msub><mml:mo>+</mml:mo><mml:mi mathvariant="italic">λ</mml:mi><mml:mo>⋅</mml:mo><mml:msub><mml:mi>L</mml:mi><mml:mtext>GC</mml:mtext></mml:msub></mml:mrow></mml:math></disp-formula>

          where <inline-formula><mml:math id="M54" display="inline"><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mtext>MWSD</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is the multiclass weighted squared dice loss, <inline-formula><mml:math id="M55" display="inline"><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mtext>GC</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> (described below) is the group confusion loss (Eqs. <xref ref-type="disp-formula" rid="Ch1.E3"/> and <xref ref-type="disp-formula" rid="Ch1.E5"/> respectively). The penalty <inline-formula><mml:math id="M56" display="inline"><mml:mi mathvariant="italic">λ</mml:mi></mml:math></inline-formula> is a penalty factor that balances the contribution of the group confusion term (<inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mtext>GC</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>  –  described below). This penalty was a hyperparameter and was optimized based on the best results of the validation set.</p>
      <p id="d2e1433">Dice coefficient is a common metric for evaluating overlap in segmentation tasks, and its loss variant has proven to be effective, particularly for imbalanced classes <xref ref-type="bibr" rid="bib1.bibx41" id="paren.36"/> <xref ref-type="bibr" rid="bib1.bibx28" id="paren.37"/>. To address the inherent imbalance in the frequency of different atmospheric aerosol and cloud types, class weights <inline-formula><mml:math id="M58" display="inline"><mml:mrow><mml:msub><mml:mi>w</mml:mi><mml:mtext>class</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> are introduced <xref ref-type="bibr" rid="bib1.bibx7" id="paren.38"/>. These weights were chosen to be inversely proportional to the frequency or volume of each class in the training dataset, thereby giving more importance to underrepresented classes. A multiplicative factor was applied for training stability. A weight of 0 was applied to the first class which is defined as “no-class” (Class 0). This was done to ensure that the model only learns physical features and assigns anything that doesn't fit a physical phenomenon to “no class” (Class 0). The dice coefficient was calculated as follows:

                <disp-formula specific-use="align" content-type="numbered"><mml:math id="M59" display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>L</mml:mi><mml:mtext>MWSD</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E3"><mml:mtd><mml:mtext>3</mml:mtext></mml:mtd><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mo>-</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>h</mml:mi></mml:msubsup><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>t</mml:mi></mml:msubsup><mml:msub><mml:mi>y</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>p</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:msub><mml:mi>w</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>h</mml:mi></mml:msubsup><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>t</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:msubsup><mml:mi>y</mml:mi><mml:mi>t</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msubsup><mml:mi>y</mml:mi><mml:mi>p</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:msub><mml:mi>w</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:mfenced></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E4"><mml:mtd><mml:mtext>4</mml:mtext></mml:mtd><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi>w</mml:mi><mml:mi>c</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">200</mml:mn><mml:mo>⋅</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math id="M60" display="inline"><mml:mi>h</mml:mi></mml:math></inline-formula> and <inline-formula><mml:math id="M61" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula> are the height and time indices, <inline-formula><mml:math id="M62" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are the ground truth labels, <inline-formula><mml:math id="M63" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>p</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are the prediction probabilities, <inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:msub><mml:mi>w</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the scaled inverse frequency of the true training class and <inline-formula><mml:math id="M65" display="inline"><mml:mrow><mml:msub><mml:mi>n</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the number of times a specific true class appears in the training dataset. Note that <inline-formula><mml:math id="M66" display="inline"><mml:mrow><mml:msub><mml:mi>w</mml:mi><mml:mi>c</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the weight associated to each class and is a function of <inline-formula><mml:math id="M67" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e1793">Although <inline-formula><mml:math id="M68" display="inline"><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mtext>MWSD</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> focuses on individual class performance, a critical requirement for this application is to strongly discourage misclassifications between fundamentally different atmospheric categories, specifically between aerosol and clouds. To address this, a group confusion loss term, <inline-formula><mml:math id="M69" display="inline"><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mtext>GC</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> is introduced:

                <disp-formula id="Ch1.E5" content-type="numbered"><label>5</label><mml:math id="M70" display="block"><mml:mstyle displaystyle="true" class="stylechange"/><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>L</mml:mi><mml:mtext>GC</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>h</mml:mi></mml:munderover><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>t</mml:mi></mml:munderover><mml:msub><mml:mi>y</mml:mi><mml:mtext>ta</mml:mtext></mml:msub><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mtext>pc</mml:mtext></mml:msub><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mtext>tc</mml:mtext></mml:msub><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mtext>pa</mml:mtext></mml:msub><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mi>j</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></disp-formula>

          where <inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mtext>ta</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> are the ground truth aerosol labels, <inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mtext>pc</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> are the predicted cloud probabilities, <inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mtext>tc</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> are the ground truth cloud labels and <inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mtext>pa</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula> are the predicted aerosol probabilities. The composite loss function (Eq. <xref ref-type="disp-formula" rid="Ch1.E5"/>) is designed to guide the U-Net model not only to accurately segment individual aerosol and cloud types (even rare ones, due to weighting in <inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mtext>MWSD</mml:mtext></mml:msub></mml:mrow></mml:math></inline-formula>), but also to maintain a clear distinction between the broader aerosol and cloud categories.</p>
      <p id="d2e1983">The U-Net model was developed and trained using the TensorFlow and Keras libraries. Adam optimization algorithm, with an initial learning rate of <inline-formula><mml:math id="M76" display="inline"><mml:mrow><mml:mn mathvariant="normal">5</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>, was employed to minimize the loss function (Eq. <xref ref-type="disp-formula" rid="Ch1.E2"/>). To prevent overfitting and reduce unnecessary training time, early stopping was implemented as follows. Validation loss was monitored and training was halted if no improvement was observed for 20 consecutive epochs, starting from epoch 50. The best weights achieved during training (post epoch 50) were restored upon stopping. The learning rate was adaptively adjusted during training. If the validation loss did not improve for 10 epochs, the learning rate was reduced by a factor of 0.2, down to a minimum learning rate of <inline-formula><mml:math id="M77" display="inline"><mml:mrow><mml:mn mathvariant="normal">5</mml:mn><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">7</mml:mn></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>. This allows for finer adjustments as the model approaches convergence. Training was implemented using Tensorflow on Google Cloud Vertex AI, Colab Enterprise notebook, using NVIDIA TESLA A100x4 GPUs.</p>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Results and discussion</title>
      <p id="d2e2033">In this section, we present the quantitative and qualitative evaluation of the trained U-Net model. We first assess its overall classification accuracy and class-specific performance across the test dataset, followed by detailed case studies demonstrating its behavior under complex atmospheric conditions.</p>
      <p id="d2e2036">Model training stopped after 167 steps due to the early stopping mechanism, where epoch 148 was chosen as the best epoch with a training loss value of 0.1198 and a validation loss value of 0.1288. Jaccard index at the chosen epoch was 0.9009 for the training set and 0.5955 for the validation set, and cloud-aerosol confusion loss was 0.0023. Both loss values and Jaccard indices for the training and validation sets plateaued at around step 120, where the most significant learning was done between steps 0 and 60. Results henceforth will be discussed solely regarding the test dataset, which comprises 80 samples that the model has not seen. The model's performance was quantitatively assessed using metrics such as precision, recall, and F1-score for each atmospheric class:

              <disp-formula specific-use="align" content-type="numbered"><mml:math id="M78" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E6"><mml:mtd><mml:mtext>6</mml:mtext></mml:mtd><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mtext>TP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mtext>TP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mtext>FP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E7"><mml:mtd><mml:mtext>7</mml:mtext></mml:mtd><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mtext>TP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mtext>TP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mtext>FN</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtext>F1</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>=</mml:mo><mml:mn mathvariant="normal">2</mml:mn><mml:mo>⋅</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>⋅</mml:mo><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E8"><mml:mtd><mml:mtext>8</mml:mtext></mml:mtd><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:mtext>TP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:mtext>TP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:mo>(</mml:mo><mml:mtext>FP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mtext>FN</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

        where TP, FP, and FN are the true positive, false positive and false negative results, and where <inline-formula><mml:math id="M79" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> represents the class. To determine the overall performance of the model, the macro-averaged F1-scores, weighted macro-averaged F1-scores, and micro F1-scores were calculated: 

              <disp-formula specific-use="align" content-type="numbered"><mml:math id="M80" display="block"><mml:mtable displaystyle="true"><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mtext>F1</mml:mtext><mml:mtext>macro</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:msubsup><mml:mtext>F1</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>P</mml:mi><mml:mtext>macro</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:msubsup><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E9"><mml:mtd><mml:mtext>9</mml:mtext></mml:mtd><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>R</mml:mi><mml:mtext>macro</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:msubsup><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mtext>F1</mml:mtext><mml:mtext>weighted</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:munderover><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mtext>F1</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>P</mml:mi><mml:mtext>weighted</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:munderover><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mi>P</mml:mi><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E10"><mml:mtd><mml:mtext>10</mml:mtext></mml:mtd><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi>R</mml:mi><mml:mtext>weighted</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:munderover><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mi>R</mml:mi><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mtr><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mtext>F1</mml:mtext><mml:mtext>micro</mml:mtext></mml:msub><mml:mo>=</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:msubsup><mml:mtext>TP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:msubsup><mml:mtext>TP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle><mml:mo>(</mml:mo><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:msubsup><mml:mtext>FP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:msubsup><mml:mtext>FN</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi>P</mml:mi><mml:mtext>micro</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:msubsup><mml:mtext>TP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:msubsup><mml:mtext>TP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:msubsup><mml:mtext>FP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mtr><mml:mlabeledtr id="Ch1.E11"><mml:mtd><mml:mtext>11</mml:mtext></mml:mtd><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>R</mml:mi><mml:mtext>micro</mml:mtext></mml:msub><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:msubsup><mml:mtext>TP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:msubsup><mml:mtext>TP</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mn mathvariant="normal">12</mml:mn></mml:msubsup><mml:mtext>FN</mml:mtext><mml:mo>(</mml:mo><mml:mi>i</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

        where <inline-formula><mml:math id="M81" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula> runs on all classes, and <inline-formula><mml:math id="M82" display="inline"><mml:mrow><mml:msub><mml:mi>w</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the fraction of that class within the entire test dataset, and TP, FP, FN are the sum of the true positive, false positive and false negative results. The evaluation is based on the test dataset, which comprises 46 080 000 individual pixel classifications in all test samples, and is detailed in Table <xref ref-type="table" rid="T3"/>.</p>

<table-wrap id="T3" specific-use="star"><label>Table 3</label><caption><p id="d2e2828">Classification Performance Metrics.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Class description</oasis:entry>
         <oasis:entry colname="col2">Precision</oasis:entry>
         <oasis:entry colname="col3">Recall</oasis:entry>
         <oasis:entry colname="col4">F1-score</oasis:entry>
         <oasis:entry colname="col5">Support</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">0 – No Class</oasis:entry>
         <oasis:entry colname="col2">0.99</oasis:entry>
         <oasis:entry colname="col3">0.99</oasis:entry>
         <oasis:entry colname="col4">0.99</oasis:entry>
         <oasis:entry colname="col5">33 197 713</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1 – Clean atmosphere</oasis:entry>
         <oasis:entry colname="col2">0.88</oasis:entry>
         <oasis:entry colname="col3">0.78</oasis:entry>
         <oasis:entry colname="col4">0.83</oasis:entry>
         <oasis:entry colname="col5">2 840 232</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2 – Non-typed particles/ low concentration</oasis:entry>
         <oasis:entry colname="col2">0.84</oasis:entry>
         <oasis:entry colname="col3">0.89</oasis:entry>
         <oasis:entry colname="col4">0.87</oasis:entry>
         <oasis:entry colname="col5">4 274 238</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3 – Aerosol: small</oasis:entry>
         <oasis:entry colname="col2">0.89</oasis:entry>
         <oasis:entry colname="col3">0.96</oasis:entry>
         <oasis:entry colname="col4">0.92</oasis:entry>
         <oasis:entry colname="col5">751 168</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4 – Aerosol: large, spherical</oasis:entry>
         <oasis:entry colname="col2">0.95</oasis:entry>
         <oasis:entry colname="col3">0.96</oasis:entry>
         <oasis:entry colname="col4">0.96</oasis:entry>
         <oasis:entry colname="col5">1 323 590</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5 – Aerosol: mixture, partly non-spherical</oasis:entry>
         <oasis:entry colname="col2">0.94</oasis:entry>
         <oasis:entry colname="col3">0.97</oasis:entry>
         <oasis:entry colname="col4">0.95</oasis:entry>
         <oasis:entry colname="col5">1 301 967</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6 – Aerosol: large, non-spherical</oasis:entry>
         <oasis:entry colname="col2">0.93</oasis:entry>
         <oasis:entry colname="col3">0.95</oasis:entry>
         <oasis:entry colname="col4">0.94</oasis:entry>
         <oasis:entry colname="col5">499 600</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7 – Cloud: non-typed</oasis:entry>
         <oasis:entry colname="col2">0.80</oasis:entry>
         <oasis:entry colname="col3">0.73</oasis:entry>
         <oasis:entry colname="col4">0.76</oasis:entry>
         <oasis:entry colname="col5">178 227</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8 – Cloud: water droplets</oasis:entry>
         <oasis:entry colname="col2">0.54</oasis:entry>
         <oasis:entry colname="col3">0.49</oasis:entry>
         <oasis:entry colname="col4">0.52</oasis:entry>
         <oasis:entry colname="col5">7061</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9 – Cloud: likely water droplets</oasis:entry>
         <oasis:entry colname="col2">0.60</oasis:entry>
         <oasis:entry colname="col3">0.58</oasis:entry>
         <oasis:entry colname="col4">0.59</oasis:entry>
         <oasis:entry colname="col5">113 802</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">10 – Cloud: ice crystals</oasis:entry>
         <oasis:entry colname="col2">0.83</oasis:entry>
         <oasis:entry colname="col3">0.79</oasis:entry>
         <oasis:entry colname="col4">0.81</oasis:entry>
         <oasis:entry colname="col5">1 486 952</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">11 – Cloud: likely ice crystals</oasis:entry>
         <oasis:entry colname="col2">0.59</oasis:entry>
         <oasis:entry colname="col3">0.63</oasis:entry>
         <oasis:entry colname="col4">0.61</oasis:entry>
         <oasis:entry colname="col5">105 450</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col5" align="center">Averages </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">micro</oasis:entry>
         <oasis:entry colname="col2">0.96</oasis:entry>
         <oasis:entry colname="col3">0.96</oasis:entry>
         <oasis:entry colname="col4">0.96</oasis:entry>
         <oasis:entry colname="col5">46 080 000</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">macro</oasis:entry>
         <oasis:entry colname="col2">0.82</oasis:entry>
         <oasis:entry colname="col3">0.81</oasis:entry>
         <oasis:entry colname="col4">0.81</oasis:entry>
         <oasis:entry colname="col5">46 080 000</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">weighted</oasis:entry>
         <oasis:entry colname="col2">0.96</oasis:entry>
         <oasis:entry colname="col3">0.96</oasis:entry>
         <oasis:entry colname="col4">0.96</oasis:entry>
         <oasis:entry colname="col5">46 080 000</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><table-wrap-foot><p id="d2e2831">Support <inline-formula><mml:math id="M83" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> number of pixels pertaining to the corresponding truth class.</p></table-wrap-foot></table-wrap>

<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Analysis of Model Performance</title>
      <p id="d2e3167">The model achieved consistent performance across classes, with a micro-average (Eq. <xref ref-type="disp-formula" rid="Ch1.E11"/>) as well as weighted averages (Eq. <xref ref-type="disp-formula" rid="Ch1.E10"/>) for accuracy, precision, recall, and F1-score of 0.96. These results indicate that, when class imbalances are taken into account, the classification is reliable and unbiased toward specific categories. However, the macro average (Eq. <xref ref-type="disp-formula" rid="Ch1.E9"/>), which calculates the metric independently for each class and then averages them assuming equal weights, shows more moderate results: a precision of 0.82, recall of 0.81, and an F1-score of 0.81. The difference between the weighted and macro averages points to class imbalance, where the model performs well on the aerosol classes which are more common, but struggles more with cloud classes which are more rare.</p>
      <p id="d2e3176">The confusion matrix (Fig. <xref ref-type="fig" rid="F4"/>) provides a granular view of the model's classification accuracy and error patterns. The diagonal elements represent the percentage of correctly classified pixels (recall) for each class. Key aerosol types are classified with high recall rates: “Aerosol: small” (96 <inline-formula><mml:math id="M84" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> recall), “Aerosol: large, spherical” (96 <inline-formula><mml:math id="M85" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> recall), “Aerosol: mixture, partly non-spherical” (97 <inline-formula><mml:math id="M86" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> recall), and “Aerosol: large, non-spherical” (95 <inline-formula><mml:math id="M87" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> recall). This indicates that the model successfully learned to distinguish the nuanced lidar signatures corresponding to different aerosol properties, like size and shape. There is some minor confusion between aerosol types, such as 1 <inline-formula><mml:math id="M88" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> of “Aerosol: large, spherical” being misclassified as “Non-typed particles,” and 2.9 <inline-formula><mml:math id="M89" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> of “Aerosol: large, non-spherical” being misclassified as “Aerosol: mixture.” This is expected, as atmospheric aerosol populations are often complex mixtures rather than discrete types.</p>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e3232">Comprehensive model performance matrix. Normalized confusion matrix for the test dataset. Diagonal elements represent the recall for each class, showing high accuracy for aerosol types (89 <inline-formula><mml:math id="M90" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula>–95 <inline-formula><mml:math id="M91" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula>). The matrix highlights physically plausible confusion between liquid cloud categories (Classes 8 and 9) and the model's high precision in identifying the “No Class” background.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f04.png"/>

        </fig>

      <p id="d2e3258">The classification of cloud types reveals a more complex challenge. While the model correctly identifies “Cloud: ice crystals” (Class 10) in 79 <inline-formula><mml:math id="M92" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> of cases, its performance on liquid water clouds is notably lower. Only 49 <inline-formula><mml:math id="M93" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> of “Cloud: water droplets” (Class 8) are correctly identified, and 58 <inline-formula><mml:math id="M94" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> of “Cloud: likely water droplets” (Class 9) are correctly identified. An important observation is the confusion between similar and physically adjacent classes. For “Cloud: water droplets” (Class 8), while the recall is low (49 <inline-formula><mml:math id="M95" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula>), a significant portion of the misclassifications go to neighboring liquid cloud classes: 19 <inline-formula><mml:math id="M96" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> are mislabeled as “Cloud: likely water droplets” (Class 9) and 4 <inline-formula><mml:math id="M97" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> as “Cloud: non-typed” (Class 7). Similarly, for “Cloud: likely ice crystals” (Class 11), the main source of error is misclassification as “Cloud: ice crystals” (26 <inline-formula><mml:math id="M98" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula>). This pattern of confusion is physically plausible. Distinguishing between definite and “likely” water droplets, or between droplets and small ice crystals near the freezing level, can be ambiguous even for synergistic algorithms, let alone for a model relying only on lidar.</p>
      <p id="d2e3318">“No Class” (class 0) is identified with 99 <inline-formula><mml:math id="M99" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> F1-score, which is very high given that the model was trained with a corresponding weight of 0 in the loss function. This is due to the fact that the U-Net's final layer is a convolution layer with 12 classes and a softmax activation function. The softmax function forces the model to output a probability distribution across all 12 classes for every pixel, and these probabilities must sum to 1. Thus, even if class 0 has zero weight in the loss, the model must still assign some probability to the channel corresponding to class 0 for every pixel. As the model improves in identifying and segmenting classes 1–11, it learns the features and contexts associated with them. For pixels that do not exhibit strong features of any of classes 1–11, the probabilities assigned by the softmax to these classes will naturally be low. Thus, the “leftover” probability mass is often assigned to the remaining classes, which in this case includes class 0. If class 0 is a general “background” or “none of the above specific particle types,” this can lead to it being correctly predicted for those pixels. In addition, class 0 is much more prevalent than other classes (Table <xref ref-type="table" rid="T3"/>).</p>
      <p id="d2e3331">To further investigate the model's capabilities and limitations, Jaccard index was calculated for each height index across the test dataset, excluding the dominant “No Class” category (Eq. <xref ref-type="disp-formula" rid="Ch1.E12"/>):

                <disp-formula id="Ch1.E12" content-type="numbered"><label>12</label><mml:math id="M100" display="block"><mml:mstyle displaystyle="true" class="stylechange"/><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mi>J</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mtext>TP</mml:mtext><mml:mrow><mml:mtext>TP</mml:mtext><mml:mo>+</mml:mo><mml:mtext>FP</mml:mtext><mml:mo>+</mml:mo><mml:mtext>FN</mml:mtext></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula>

          Figure <xref ref-type="fig" rid="F5"/> shows the resulting relationship between classification performance and altitude. The general trend aligns with the physical expectations of lidar performance: the highest Jaccard values (0.4–0.6) are concentrated in the lowest height indices (approximately 0–100). Performance gradually degrades through indices 100–300 and becomes poorest at high altitudes (indices <inline-formula><mml:math id="M101" display="inline"><mml:mrow><mml:mo>&gt;</mml:mo><mml:mn mathvariant="normal">300</mml:mn></mml:mrow></mml:math></inline-formula>), where Jaccard values drop below 0.2. This vertical decline in performance is primarily attributable to the smaller number of classes present at higher altitudes (portrayed by the darker colors). Given that the model trained on fewer examples at higher altitudes, we would expect to see lower accuracy at those heights. Furthermore, the decline in performance may also be caused by the degradation of the lidar's signal-to-noise ratio with increasing altitude and the significant signal attenuation caused by intervening clouds and dense aerosol layers. An interesting and seemingly counterintuitive trend is observed in the lowest part of the atmosphere (height indices 0–50). Within this range, where the pixel count is highest (indicated by the lighter color of the points), the mean Jaccard value shows a slight decrease with decreasing height. This is contrary to the assumption that performance should be uniformly best where the lidar signal is strongest and where there are more examples to train on.</p>

      <fig id="F5" specific-use="star"><label>Figure 5</label><caption><p id="d2e3377">Vertical profiles of classification reliability. The Jaccard Index (Intersection over Union) plotted as a function of height index, with points colored by the total pixel count (Support) at that altitude. Peak performance occurs in the free troposphere (indices 0–100), with degradation at higher altitudes caused by reduced signal-to-noise ratios and signal attenuation from intervening cloud layers.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f05.png"/>

        </fig>

</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Case studies to test model performance</title>
      <p id="d2e3394">To provide a qualitative assessment of the performance of the model, and to better understand its behavior under specific atmospheric conditions, we analyze a series of case studies from the test set. Each case compares the ground truth classification with the model's prediction and examines a corresponding confusion heatmap to identify specific areas and types of misclassification. The results of the case studies are summarized in Table <xref ref-type="table" rid="T4"/>.</p>

<table-wrap id="T4"><label>Table 4</label><caption><p id="d2e3402">F1-scores for Single Image Case Studies (1–3).</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="4">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Class Description</oasis:entry>
         <oasis:entry colname="col2">c.s. 1</oasis:entry>
         <oasis:entry colname="col3">c.s. 2</oasis:entry>
         <oasis:entry colname="col4">c.s. 3</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">0 – No Class</oasis:entry>
         <oasis:entry colname="col2">1.00</oasis:entry>
         <oasis:entry colname="col3">0.98</oasis:entry>
         <oasis:entry colname="col4">0.98</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1 – Clean atmosphere</oasis:entry>
         <oasis:entry colname="col2">0.85</oasis:entry>
         <oasis:entry colname="col3">0.83</oasis:entry>
         <oasis:entry colname="col4">0.84</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2 – Non-typed particles/low concentration</oasis:entry>
         <oasis:entry colname="col2">0.74</oasis:entry>
         <oasis:entry colname="col3">0.80</oasis:entry>
         <oasis:entry colname="col4">0.87</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3 – Aerosol: small</oasis:entry>
         <oasis:entry colname="col2">0.95</oasis:entry>
         <oasis:entry colname="col3">0.90</oasis:entry>
         <oasis:entry colname="col4">0.92</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4 – Aerosol: large, spherical</oasis:entry>
         <oasis:entry colname="col2">0.98</oasis:entry>
         <oasis:entry colname="col3">0.95</oasis:entry>
         <oasis:entry colname="col4">0.97</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5 – Aerosol: mixture, partly non-spherical</oasis:entry>
         <oasis:entry colname="col2">0.97</oasis:entry>
         <oasis:entry colname="col3">0.89</oasis:entry>
         <oasis:entry colname="col4">0.95</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6 – Aerosol: large, non-spherical</oasis:entry>
         <oasis:entry colname="col2">0.82</oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
         <oasis:entry colname="col4">0.78</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7 – Cloud: non-typed</oasis:entry>
         <oasis:entry colname="col2">NA</oasis:entry>
         <oasis:entry colname="col3">0.81</oasis:entry>
         <oasis:entry colname="col4">0.42</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8 – Cloud: water droplets</oasis:entry>
         <oasis:entry colname="col2">NA</oasis:entry>
         <oasis:entry colname="col3">0.51</oasis:entry>
         <oasis:entry colname="col4">0.68</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9 – Cloud: likely water droplets</oasis:entry>
         <oasis:entry colname="col2">NA</oasis:entry>
         <oasis:entry colname="col3">0.63</oasis:entry>
         <oasis:entry colname="col4">0.45</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">10 – Cloud: ice crystals</oasis:entry>
         <oasis:entry colname="col2">NA</oasis:entry>
         <oasis:entry colname="col3">0.80</oasis:entry>
         <oasis:entry colname="col4">0.80</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">11 – Cloud: likely ice crystals</oasis:entry>
         <oasis:entry colname="col2">NA</oasis:entry>
         <oasis:entry colname="col3">0.50</oasis:entry>
         <oasis:entry colname="col4">0.66</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col4" align="center">Averages </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">micro</oasis:entry>
         <oasis:entry colname="col2">0.96</oasis:entry>
         <oasis:entry colname="col3">0.95</oasis:entry>
         <oasis:entry colname="col4">0.93</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">macro</oasis:entry>
         <oasis:entry colname="col2">0.90</oasis:entry>
         <oasis:entry colname="col3">0.72</oasis:entry>
         <oasis:entry colname="col4">0.78</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">weighted</oasis:entry>
         <oasis:entry colname="col2">0.96</oasis:entry>
         <oasis:entry colname="col3">0.95</oasis:entry>
         <oasis:entry colname="col4">0.93</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><table-wrap-foot><p id="d2e3405">NA: not available.</p></table-wrap-foot></table-wrap>

<sec id="Ch1.S4.SS2.SSS1">
  <label>4.2.1</label><title>Case 1: Cloud-free  multiple layering conditions</title>
      <p id="d2e3683">The first case study (7 December 2016) examines a complex, cloud-free scene with several distinct aerosol layers (Fig. <xref ref-type="fig" rid="F6"/>). This scenario serves as a validation of the model's ability to classify aerosols. Given that the target classification is done using only lidar data, we expect the model to perform well. The model successfully captures the vertical extent, boundaries, and temporal evolution of the different aerosol layers.  The “Aerosol: mixture, partly non-spherical” (Class 5) layer is accurately reproduced by the model in both its location and classification. “Large, non-spherical” class is also correctly identified in the time steps between 200–450 and 800–960. Furthermore, the model also correctly identifies the overlying layer of “non-typed particles/low concentration” (Class 2) and the “clean atmosphere” (Class 1) above it. The per-image classification report confirms this strong performance with a weighted F1-score of 0.96. The most prevalent class, “Aerosol: mixture, partly non-spherical,” achieves an F1-score of 0.97. Other aerosol classes also show high F1-scores, such as “Aerosol: large, spherical” (0.98) and “Aerosol: small” (0.95).</p>

      <fig id="F6" specific-use="star"><label>Figure 6</label><caption><p id="d2e3690">Case study 1: cloud free, distinct aerosol layers qualitative comparison. Side-by-side comparison showing the ground truth labels and the U-Net model's predictions (left) alongside the corresponding attenuated backscatter (right) for 10 September 2017.</p></caption>
            <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f06.png"/>

          </fig>

      <p id="d2e3699">The confusion matrix (Fig. <xref ref-type="fig" rid="FA4"/>) shows only minor, physically reasonable errors, such as overlap between “Clean atmosphere” and “Non-typed particles” and some confusion of “Aerosol: small” with “Aerosol: mixture.” Notably, the group confusion heatmap (Fig. <xref ref-type="fig" rid="FA1"/>) indicates no aerosol–cloud misclassifications in this case. These results indicate that the composite loss function improved discrimination, with the group confusion term reducing aerosol–cloud ambiguity and enabling clear separation between the two categories in this case. Furthermore, The generalized confusion heatmap (Fig. <xref ref-type="fig" rid="FA1"/>) shows only minor ambiguity, mainly between closely related aerosol types (e.g. Class 5 within Class 4). This case study illustrates the model's ability to resolve aerosol subtypes in cloud-free conditions.</p>
</sec>
<sec id="Ch1.S4.SS2.SSS2">
  <label>4.2.2</label><title>Case 2: Aerosol-cloud interaction study</title>
      <p id="d2e3717">The second case study investigates the interaction between a near-surface aerosol layer and an overlying liquid cloud (13 November 2016). This case directly tests the model's ability to delineate the boundary between aerosol and cloud and to correctly classify both in close proximity. As in the previous case, the model's prediction shows a very good structural agreement with the ground truth (Fig. <xref ref-type="fig" rid="F7"/>). It accurately identifies the general location and extent of the low-level aerosol layer (primarily “Aerosol: large, spherical”) and the cloud system above it (a mix of “likely water droplets” and “ice crystals”). The temporal evolution of both features is also well replicated.</p>

      <fig id="F7" specific-use="star"><label>Figure 7</label><caption><p id="d2e3724">Case study 2: low level liquid cloud and aerosol layers qualitative comparison. Side-by-side comparison showing the ground truth labels and U-Net model's predictions (left) alongside the attenuated backscatter (right) for 13 November 2016.</p></caption>
            <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f07.png"/>

          </fig>

      <p id="d2e3733">The Confusion Heatmap highlights that the most significant misclassifications are concentrated precisely at this boundary. This indicates that while the model can identify the core of the aerosol and cloud layers, it struggles to precisely delineate the transition zone between them. The Aerosol-Cloud Group Confusion Density Heatmap (Fig. <xref ref-type="fig" rid="FA2"/>) reveals distinct, though localized, instances of confusion between aerosol and cloud groups. These areas are co-located with the aerosol-cloud interface shown in the main plots. This indicates that the majority of the model's errors in this scene are aerosol-cloud misclassifications. The confusion matrix (Fig. <xref ref-type="fig" rid="FA5"/>) quantifies this: for example, 7.5 <inline-formula><mml:math id="M102" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> of “Cloud: non-typed” (Class 7) is misclassified as “Aerosol: large, spherical” (Class 4). Most notably, the entire class of “Aerosol: large non-spherical” (Class 6) was misclassified and was primarily confused with Class 10 and Class 11 (a mix of “likely water droplets” and “ice crystals”). However, due to it's temporal-spacial singularity, where it only appears in one timestamp and at a specific height index, and due to its small support of only 54 <inline-formula><mml:math id="M103" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">pixels</mml:mi></mml:mrow></mml:math></inline-formula> (Table <xref ref-type="table" rid="TA2"/>), it is safe to disregard these results as anomalous.</p>
</sec>
<sec id="Ch1.S4.SS2.SSS3">
  <label>4.2.3</label><title>Case 3: Mid–High Clouds and Low-Level Dust Event</title>
      <p id="d2e3766">The last case study explores a complex multilayered cloud system that stretches from mid to high altitudes, with an air quality event (dust storm) at low altitudes, which took place on the date 15 January 2017. This case was chosen to test whether the model can predict clouds correctly at high altitudes at the very limit of lidar attenuation. The ground truth plot (Fig. <xref ref-type="fig" rid="F8"/>) shows a mid-level “Cloud: likely water droplets” (Class 9) layer at height index 200, where directly below and above it are classes 10 and 11 (ice clouds). The model's prediction correctly identifies the low-level aerosol and liquid cloud layers with good structural accuracy. Most importantly, it successfully infers the presence of the upper-level ice cloud in a position and with a structure that closely matches the ground truth. It is important to note that classifications assigned above the altitude of complete lidar signal attenuation do not represent direct observations, but probabilistic inferences based on the vertical structure below the cloud top and thermodynamic constraints learned from the training dataset. The per-image classification report shows a strong F1-score of 0.80 for “ice crystals” (Class 10), which make up the bulk of the upper cloud. This quantitatively confirms the model's successful inference. However, the confusion heatmap (Fig. <xref ref-type="fig" rid="FA3"/>) reveals errors concentrated at the top and bottom boundaries of the inferred ice cloud. The per-image confusion matrix (Fig. <xref ref-type="fig" rid="FA6"/>) shows that 23 <inline-formula><mml:math id="M104" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">%</mml:mi></mml:mrow></mml:math></inline-formula> of “likely ice crystals” are misclassified as “ice crystals,” a plausible and minor error. Critically, the “Aerosol/Cloud Group Confusion Density Heatmap” remains predominantly dark, indicating that even in this complex multi-layer, multi-phase scene, the model rarely confuses the fundamental aerosol and cloud categories.</p>

      <fig id="F8" specific-use="star"><label>Figure 8</label><caption><p id="d2e3785">Case study 3: mid–high clouds and low-Level dust event qualitative comparison. Side-by-side comparison showing the ground truth labels and the U-Net model's predictions (left) alongside the corresponding attenuated backscatter (right) for 15 January 2017.</p></caption>
            <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f08.png"/>

          </fig>

</sec>
</sec>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <label>5</label><title>Concluding Remarks</title>
      <p id="d2e3805">This study investigated a novel deep learning approach for unified aerosol and cloud classification using only ground-based lidar data. The approach is motivated by the need for comprehensive atmospheric composition profiling and the inherent limitations of lidar-only retrievals (primarily signal attenuation in clouds). The high-quality datasets generated by Cloudnet are indispensable and serve as the robust ground truth required to train our supervised machine learning framework. Our objective was not to replace Cloudnet's multi-instrument infrastructure, but to use it to infer the presence of convective clouds, which facilitates cloud screening and establishing basic cloud occurrence statistics. This approach bridges the gap between the critical need for vertical cloud maps and the sparse global availability of full Cloudnet stations, allowing sites equipped only with standard lidars to benefit from approximated Cloudnet-like classifications. A U-Net architecture was developed and trained, where the model takes standard lidar measurements (attenuated backscatter, depolarization ratio) as input and aims to predict a detailed vertical classification encompassing both aerosol types (PollyXT outputs) and cloud/precipitation categories (Cloudnet outputs).</p>
      <p id="d2e3808">The deep learning model developed in this study demonstrated significant capability in classifying atmospheric constituents from lidar data alone. The model achieves excellent overall performance (weighted F1-score of 0.96) and is particularly adept at classifying diverse aerosol types. In cloud-free conditions, it successfully distinguishes between different aerosol categories with high recall rates, validating its ability to learn the subtle signatures associated with particle size and shape based on the input lidar features. Crucially, this approach positions our contribution distinctly within the current state of the art. Recent advancements, such as the multitask machine learning framework of <xref ref-type="bibr" rid="bib1.bibx14" id="text.39"/>, have demonstrated the efficacy of deep learning for cloud-aerosol typing using space-based lidar. However, because those models train on datasets derived solely from lidar measurements, they remain physically constrained by lidar signal attenuation and cannot classify features where the signal is extinguished. Our model overcomes this limitation by utilizing ground truth aligned with Cloudnet and PollyXT standards, which incorporate cloud radar data. Since radar penetrates thick optical layers, our training data set includes atmospheric information invisible to lidar. This allows the model to learn contextual correlations and approximate a lidar-radar synergy from a single lidar input, inferring properties above the attenuation limit, which is an operational advancement beyond purely lidar-trained architectures.</p>
      <p id="d2e3815">The model exhibits strong predictive capabilities in identifying ice clouds (F1-score of 0.81) but struggles more with classifying specific liquid cloud categories, particularly the rare “water droplets” class. The observed confusion between similar classes (e.g. “water droplets” and “likely water droplets”) is physically reasonable and highlights the inherent ambiguity in defining discrete boundaries for continuous atmospheric processes. The use of a composite loss function, which features a group confusion penalty, proved effective in minimizing the most critical classification errors. Across all case studies, including complex multi-layer and multi-phase scenes, the model consistently and reliably discriminated between the broader categories of aerosol and clouds, with confusion being rare and localized to the most ambiguous interface regions. Model performance varies predictably with altitude, achieving optimal results in the free troposphere, where signals are strong and targets are well-defined. Performance degrades at high altitudes due to decreasing signal-to-noise ratio and at very low altitudes in part due to instrumental effects and the high complexity of the planetary boundary layer.</p>
      <p id="d2e3818">Despite the potential, several limitations and challenges must be acknowledged. The performance of the deep learning model is fundamentally dependent on the quality, accuracy, and representativeness of the complex training dataset. Any biases or errors inherent in the reference PollyXT and Cloudnet algorithms used to generate the target variable will likely be learned and propagated by the U-Net model. Further more, a model trained on data from one specific site or lidar instrument, as is the case in this study, may not perform equally well and may not generalize in different atmospheric regimes or with data from different lidar systems without either retraining or the application of domain adaptation techniques.</p>
      <p id="d2e3822">The findings of our study have important implications for atmospheric remote sensing, suggesting that ground-based observational systems could be simplified. By applying advanced algorithms to relatively simple and cost-effective lidar data, it may be possible to reduce the reliance on co-located, complex, and expensive instruments such as cloud radars and microwave radiometers, thereby facilitating the establishment of denser observational networks <xref ref-type="bibr" rid="bib1.bibx18" id="paren.40"/>. Such networks could provide valuable data streams for improving weather forecasts, evaluating climate models (particularly concerning cloud feedback and ACI), and supporting air quality and environment monitoring as well as aviation safety.</p>
      <p id="d2e3828">We are planning future work to focus on further exploring different deep learning architectures, including variants of the U-Net or transformer-based models. Exploring physics-informed neural networks, which incorporate physical constraints into the learning process, might also improve the physical consistency of the predictions. Validation efforts using independent datasets could further build confidence and establish generalization capabilities. This includes data from different geographical locations, seasons, and lidar instruments. Comparison with data from field campaigns involving airborne in-situ measurements or overpasses of satellites with cloud-penetrating capabilities would provide valuable independent validation as well. Although our model successfully infers structures beyond the attenuation limit, quantifying the exact signal threshold required for valid reconstruction remains an important open question for future studies. Additionally, conducting a comprehensive quantitative ablation study to measure the precise performance impact of removing individual input variables could help decrease the complexity of the model to allow faster inference and model training times.</p>
</sec>

      
      </body>
    <back><app-group>

<app id="App1.Ch1.S1">
  <label>Appendix A</label><title>Case Studies</title>

<table-wrap id="TA1"><label>Table A1</label><caption><p id="d2e3847">Classification Performance Metrics for case study 1  –  Complex Multi-Layer Aerosol.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Class Description</oasis:entry>
         <oasis:entry colname="col2">Precision</oasis:entry>
         <oasis:entry colname="col3">Recall</oasis:entry>
         <oasis:entry colname="col4">F1-score</oasis:entry>
         <oasis:entry colname="col5">Support</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">0 – No Class</oasis:entry>
         <oasis:entry colname="col2">1.00</oasis:entry>
         <oasis:entry colname="col3">1.00</oasis:entry>
         <oasis:entry colname="col4">1.00</oasis:entry>
         <oasis:entry colname="col5">392 451</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1 – Clean atmosphere</oasis:entry>
         <oasis:entry colname="col2">0.91</oasis:entry>
         <oasis:entry colname="col3">0.81</oasis:entry>
         <oasis:entry colname="col4">0.85</oasis:entry>
         <oasis:entry colname="col5">79 939</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2 – Non-typed particles/ low concentration</oasis:entry>
         <oasis:entry colname="col2">0.69</oasis:entry>
         <oasis:entry colname="col3">0.80</oasis:entry>
         <oasis:entry colname="col4">0.74</oasis:entry>
         <oasis:entry colname="col5">43 936</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3 – Aerosol: small</oasis:entry>
         <oasis:entry colname="col2">0.94</oasis:entry>
         <oasis:entry colname="col3">0.97</oasis:entry>
         <oasis:entry colname="col4">0.95</oasis:entry>
         <oasis:entry colname="col5">2965</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4 – Aerosol: large, spherical</oasis:entry>
         <oasis:entry colname="col2">0.97</oasis:entry>
         <oasis:entry colname="col3">0.99</oasis:entry>
         <oasis:entry colname="col4">0.98</oasis:entry>
         <oasis:entry colname="col5">4146</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5 – Aerosol: mixture, partly non-spherical</oasis:entry>
         <oasis:entry colname="col2">0.95</oasis:entry>
         <oasis:entry colname="col3">0.99</oasis:entry>
         <oasis:entry colname="col4">0.97</oasis:entry>
         <oasis:entry colname="col5">51 200</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6 – Aerosol: large, non-spherical</oasis:entry>
         <oasis:entry colname="col2">0.87</oasis:entry>
         <oasis:entry colname="col3">0.78</oasis:entry>
         <oasis:entry colname="col4">0.82</oasis:entry>
         <oasis:entry colname="col5">1363</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7 – Cloud: non-typed</oasis:entry>
         <oasis:entry colname="col2">NA</oasis:entry>
         <oasis:entry colname="col3">NA</oasis:entry>
         <oasis:entry colname="col4">NA</oasis:entry>
         <oasis:entry colname="col5">NA</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8 – Cloud: water droplets</oasis:entry>
         <oasis:entry colname="col2">NA</oasis:entry>
         <oasis:entry colname="col3">NA</oasis:entry>
         <oasis:entry colname="col4">NA</oasis:entry>
         <oasis:entry colname="col5">NA</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9 – Cloud: likely water droplets</oasis:entry>
         <oasis:entry colname="col2">NA</oasis:entry>
         <oasis:entry colname="col3">NA</oasis:entry>
         <oasis:entry colname="col4">NA</oasis:entry>
         <oasis:entry colname="col5">NA</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">10 – Cloud: ice crystals</oasis:entry>
         <oasis:entry colname="col2">NA</oasis:entry>
         <oasis:entry colname="col3">NA</oasis:entry>
         <oasis:entry colname="col4">NA</oasis:entry>
         <oasis:entry colname="col5">NA</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">11 – Cloud: likely ice crystals</oasis:entry>
         <oasis:entry colname="col2">NA</oasis:entry>
         <oasis:entry colname="col3">NA</oasis:entry>
         <oasis:entry colname="col4">NA</oasis:entry>
         <oasis:entry colname="col5">NA</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col5" align="center">Averages </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">micro</oasis:entry>
         <oasis:entry colname="col2">0.96</oasis:entry>
         <oasis:entry colname="col3">0.96</oasis:entry>
         <oasis:entry colname="col4">0.96</oasis:entry>
         <oasis:entry colname="col5">576 000</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">macro</oasis:entry>
         <oasis:entry colname="col2">0.90</oasis:entry>
         <oasis:entry colname="col3">0.90</oasis:entry>
         <oasis:entry colname="col4">0.90</oasis:entry>
         <oasis:entry colname="col5">576 000</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">weighted</oasis:entry>
         <oasis:entry colname="col2">0.96</oasis:entry>
         <oasis:entry colname="col3">0.96</oasis:entry>
         <oasis:entry colname="col4">0.96</oasis:entry>
         <oasis:entry colname="col5">576 000</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table><table-wrap-foot><p id="d2e3850">NA: not available.</p></table-wrap-foot></table-wrap>

<table-wrap id="TA2"><label>Table A2</label><caption><p id="d2e4175">Classification Performance Metrics for case study 2  –  Low Level Liquid Cloud and Aerosol.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Class Description</oasis:entry>
         <oasis:entry colname="col2">Precision</oasis:entry>
         <oasis:entry colname="col3">Recall</oasis:entry>
         <oasis:entry colname="col4">F1-score</oasis:entry>
         <oasis:entry colname="col5">Support</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">0 – No Class</oasis:entry>
         <oasis:entry colname="col2">0.98</oasis:entry>
         <oasis:entry colname="col3">0.99</oasis:entry>
         <oasis:entry colname="col4">0.98</oasis:entry>
         <oasis:entry colname="col5">443 893</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1 – Clean atmosphere</oasis:entry>
         <oasis:entry colname="col2">0.84</oasis:entry>
         <oasis:entry colname="col3">0.82</oasis:entry>
         <oasis:entry colname="col4">0.83</oasis:entry>
         <oasis:entry colname="col5">24 579</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2 – Non-typed particles/ low concentration</oasis:entry>
         <oasis:entry colname="col2">0.77</oasis:entry>
         <oasis:entry colname="col3">0.83</oasis:entry>
         <oasis:entry colname="col4">0.80</oasis:entry>
         <oasis:entry colname="col5">28 258</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3 – Aerosol: small</oasis:entry>
         <oasis:entry colname="col2">0.88</oasis:entry>
         <oasis:entry colname="col3">0.92</oasis:entry>
         <oasis:entry colname="col4">0.90</oasis:entry>
         <oasis:entry colname="col5">11 605</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4 – Aerosol: large, spherical</oasis:entry>
         <oasis:entry colname="col2">0.94</oasis:entry>
         <oasis:entry colname="col3">0.96</oasis:entry>
         <oasis:entry colname="col4">0.95</oasis:entry>
         <oasis:entry colname="col5">26 893</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5 – Aerosol: mixture, partly non-spherical</oasis:entry>
         <oasis:entry colname="col2">0.95</oasis:entry>
         <oasis:entry colname="col3">0.84</oasis:entry>
         <oasis:entry colname="col4">0.89</oasis:entry>
         <oasis:entry colname="col5">313</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6 – Aerosol: large, non-spherical</oasis:entry>
         <oasis:entry colname="col2">0.00</oasis:entry>
         <oasis:entry colname="col3">0.00</oasis:entry>
         <oasis:entry colname="col4">0.00</oasis:entry>
         <oasis:entry colname="col5">54</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7 – Cloud: non-typed</oasis:entry>
         <oasis:entry colname="col2">0.88</oasis:entry>
         <oasis:entry colname="col3">0.75</oasis:entry>
         <oasis:entry colname="col4">0.81</oasis:entry>
         <oasis:entry colname="col5">7530</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8 – Cloud: water droplets</oasis:entry>
         <oasis:entry colname="col2">0.48</oasis:entry>
         <oasis:entry colname="col3">0.55</oasis:entry>
         <oasis:entry colname="col4">0.51</oasis:entry>
         <oasis:entry colname="col5">173</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9 – Cloud: likely water droplets</oasis:entry>
         <oasis:entry colname="col2">0.56</oasis:entry>
         <oasis:entry colname="col3">0.71</oasis:entry>
         <oasis:entry colname="col4">0.63</oasis:entry>
         <oasis:entry colname="col5">3420</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">10 – Cloud: ice crystals</oasis:entry>
         <oasis:entry colname="col2">0.86</oasis:entry>
         <oasis:entry colname="col3">0.75</oasis:entry>
         <oasis:entry colname="col4">0.80</oasis:entry>
         <oasis:entry colname="col5">28 622</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">11 – Cloud: likely ice crystals</oasis:entry>
         <oasis:entry colname="col2">0.50</oasis:entry>
         <oasis:entry colname="col3">0.50</oasis:entry>
         <oasis:entry colname="col4">0.50</oasis:entry>
         <oasis:entry colname="col5">660</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col5" align="center">Averages </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">micro</oasis:entry>
         <oasis:entry colname="col2">0.95</oasis:entry>
         <oasis:entry colname="col3">0.95</oasis:entry>
         <oasis:entry colname="col4">0.95</oasis:entry>
         <oasis:entry colname="col5">576 000</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">macro</oasis:entry>
         <oasis:entry colname="col2">0.72</oasis:entry>
         <oasis:entry colname="col3">0.72</oasis:entry>
         <oasis:entry colname="col4">0.72</oasis:entry>
         <oasis:entry colname="col5">576 000</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">weighted</oasis:entry>
         <oasis:entry colname="col2">0.95</oasis:entry>
         <oasis:entry colname="col3">0.95</oasis:entry>
         <oasis:entry colname="col4">0.95</oasis:entry>
         <oasis:entry colname="col5">576 000</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

<table-wrap id="TA3"><label>Table A3</label><caption><p id="d2e4502">Classification Performance Metrics for case study 3  –  Lidar Signal Attenuation.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Class Description</oasis:entry>
         <oasis:entry colname="col2">Precision</oasis:entry>
         <oasis:entry colname="col3">Recall</oasis:entry>
         <oasis:entry colname="col4">F1-score</oasis:entry>
         <oasis:entry colname="col5">Support</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">0 – No Class</oasis:entry>
         <oasis:entry colname="col2">1.00</oasis:entry>
         <oasis:entry colname="col3">0.97</oasis:entry>
         <oasis:entry colname="col4">0.98</oasis:entry>
         <oasis:entry colname="col5">360 174</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">1 – Clean atmosphere</oasis:entry>
         <oasis:entry colname="col2">0.92</oasis:entry>
         <oasis:entry colname="col3">0.77</oasis:entry>
         <oasis:entry colname="col4">0.84</oasis:entry>
         <oasis:entry colname="col5">69 998</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2 – Non-typed particles/ low concentration</oasis:entry>
         <oasis:entry colname="col2">0.82</oasis:entry>
         <oasis:entry colname="col3">0.92</oasis:entry>
         <oasis:entry colname="col4">0.84</oasis:entry>
         <oasis:entry colname="col5">85 916</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3 – Aerosol: small</oasis:entry>
         <oasis:entry colname="col2">0.90</oasis:entry>
         <oasis:entry colname="col3">0.93</oasis:entry>
         <oasis:entry colname="col4">0.92</oasis:entry>
         <oasis:entry colname="col5">1266</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4 – Aerosol: large, spherical</oasis:entry>
         <oasis:entry colname="col2">0.98</oasis:entry>
         <oasis:entry colname="col3">0.97</oasis:entry>
         <oasis:entry colname="col4">0.97</oasis:entry>
         <oasis:entry colname="col5">12 059</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5 – Aerosol: mixture, partly non-spherical</oasis:entry>
         <oasis:entry colname="col2">0.92</oasis:entry>
         <oasis:entry colname="col3">0.99</oasis:entry>
         <oasis:entry colname="col4">0.95</oasis:entry>
         <oasis:entry colname="col5">6543</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6 – Aerosol: large, non-spherical</oasis:entry>
         <oasis:entry colname="col2">0.69</oasis:entry>
         <oasis:entry colname="col3">0.90</oasis:entry>
         <oasis:entry colname="col4">0.78</oasis:entry>
         <oasis:entry colname="col5">1075</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">7 – Cloud: non-typed</oasis:entry>
         <oasis:entry colname="col2">0.57</oasis:entry>
         <oasis:entry colname="col3">0.33</oasis:entry>
         <oasis:entry colname="col4">0.42</oasis:entry>
         <oasis:entry colname="col5">85</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">8 – Cloud: water droplets</oasis:entry>
         <oasis:entry colname="col2">0.70</oasis:entry>
         <oasis:entry colname="col3">0.66</oasis:entry>
         <oasis:entry colname="col4">0.68</oasis:entry>
         <oasis:entry colname="col5">139</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">9 – Cloud: likely water droplets</oasis:entry>
         <oasis:entry colname="col2">0.78</oasis:entry>
         <oasis:entry colname="col3">0.32</oasis:entry>
         <oasis:entry colname="col4">0.45</oasis:entry>
         <oasis:entry colname="col5">2889</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">10 – Cloud: ice crystals</oasis:entry>
         <oasis:entry colname="col2">0.69</oasis:entry>
         <oasis:entry colname="col3">0.94</oasis:entry>
         <oasis:entry colname="col4">0.80</oasis:entry>
         <oasis:entry colname="col5">30 870</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">11 – Cloud: likely ice crystals</oasis:entry>
         <oasis:entry colname="col2">0.58</oasis:entry>
         <oasis:entry colname="col3">0.76</oasis:entry>
         <oasis:entry colname="col4">0.66</oasis:entry>
         <oasis:entry colname="col5">4986</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry namest="col1" nameend="col5" align="center">Averages </oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">micro</oasis:entry>
         <oasis:entry colname="col2">0.93</oasis:entry>
         <oasis:entry colname="col3">0.93</oasis:entry>
         <oasis:entry colname="col4">0.93</oasis:entry>
         <oasis:entry colname="col5">576 000</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">macro</oasis:entry>
         <oasis:entry colname="col2">0.80</oasis:entry>
         <oasis:entry colname="col3">0.79</oasis:entry>
         <oasis:entry colname="col4">0.78</oasis:entry>
         <oasis:entry colname="col5">576 000</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">weighted</oasis:entry>
         <oasis:entry colname="col2">0.93</oasis:entry>
         <oasis:entry colname="col3">0.93</oasis:entry>
         <oasis:entry colname="col4">0.93</oasis:entry>
         <oasis:entry colname="col5">576 000</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <fig id="FA1"><label>Figure A1</label><caption><p id="d2e4825">Case study 1  –  spatial distribution of classification errors. (Left) Group confusion density heatmap highlighting specific regions where the model confuses the broad categories of aerosols and clouds. (Right) Generalized confusion heatmap excluding background and clean atmosphere classes (0–2) to visualize the nuances of subtype misclassification.</p></caption>
        
        <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f09.png"/>

      </fig>

<fig id="FA2"><label>Figure A2</label><caption><p id="d2e4839">Case study 2  –  spatial distribution of classification errors. (Left) Group confusion density heatmap highlighting specific regions where the model confuses the broad categories of aerosols and clouds. (Right) Generalized confusion heatmap excluding background and clean atmosphere classes (0–2) to visualize the nuances of subtype misclassification.</p></caption>
        
        <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f10.png"/>

      </fig>

      <fig id="FA3"><label>Figure A3</label><caption><p id="d2e4852">Case study 3  –  spatial distribution of classification errors. (Left) Group confusion density heatmap highlighting specific regions where the model confuses the broad categories of aerosols and clouds. (Right) Generalized confusion heatmap excluding background and clean atmosphere classes (0–2) to visualize the nuances of subtype misclassification.</p></caption>
        
        <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f11.png"/>

      </fig>

<fig id="FA4"><label>Figure A4</label><caption><p id="d2e4867">Confusion matrix for case study 1  –  Complex Multi-Layer Aerosol.</p></caption>
        
        <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f12.png"/>

      </fig>

      <fig id="FA5"><label>Figure A5</label><caption><p id="d2e4880">Confusion matrix for case study 2  –  Low Level Liquid Cloud and Aerosol.</p></caption>
        
        <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f13.png"/>

      </fig>

<fig id="FA6"><label>Figure A6</label><caption><p id="d2e4894">Confusion matrix for case study 3  –  Lidar Signal Attenuation.</p></caption>
        
        <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f14.png"/>

      </fig>

<fig id="FA7"><label>Figure A7</label><caption><p id="d2e4909">Raw signals case study 1  –  Input lidar and meteorological feature suite. Visualization of the multi-channel input data used by the U-Net, including attenuated and aerosol backscatter (<inline-formula><mml:math id="M105" display="inline"><mml:mrow><mml:mn mathvariant="normal">532</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">1064</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M106" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">nm</mml:mi></mml:mrow></mml:math></inline-formula>), depolarization ratios, the backscatter-related Ångström exponent, NWP-derived pressure/temperature profiles and liquid water path. These raw signals (excluding LWP) provide the hierarchical features necessary for the model to infer atmospheric state even in signal-attenuated regions.</p></caption>
        
        <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f15.png"/>

      </fig>

<fig id="FA8"><label>Figure A8</label><caption><p id="d2e4943">Raw signals case study 2  –  Input lidar and meteorological feature suite. Visualization of the multi-channel input data used by the U-Net, including attenuated and aerosol backscatter (<inline-formula><mml:math id="M107" display="inline"><mml:mrow><mml:mn mathvariant="normal">532</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">1064</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M108" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">nm</mml:mi></mml:mrow></mml:math></inline-formula>), depolarization ratios, the backscatter-related Ångström exponent, NWP-derived pressure/temperature profiles and liquid water path. These raw signals (excluding LWP) provide the hierarchical features necessary for the model to infer atmospheric state even in signal-attenuated regions.</p></caption>
        
        <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f16.png"/>

      </fig>

<fig id="FA9"><label>Figure A9</label><caption><p id="d2e4977">Raw signals case study 3  –  Input lidar and meteorological feature suite. Visualization of the multi-channel input data used by the U-Net, including attenuated and aerosol backscatter (<inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:mn mathvariant="normal">532</mml:mn><mml:mo>/</mml:mo><mml:mn mathvariant="normal">1064</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M110" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">nm</mml:mi></mml:mrow></mml:math></inline-formula>), depolarization ratios, the backscatter-related Ångström exponent, NWP-derived pressure/temperature profiles and liquid water path. These raw signals (excluding LWP) provide the hierarchical features necessary for the model to infer atmospheric state even in signal-attenuated regions.</p></caption>
        
        <graphic xlink:href="https://amt.copernicus.org/articles/19/4415/2026/amt-19-4415-2026-f17.png"/>

      </fig>


</app>
  </app-group><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d2e5014">The combined PollyXT and Cloudnet dataset can be accessed at  <ext-link xlink:href="https://doi.org/10.5281/zenodo.17424878" ext-link-type="DOI">10.5281/zenodo.17424878</ext-link> <xref ref-type="bibr" rid="bib1.bibx2" id="paren.41"/>. Python code files are available at  <ext-link xlink:href="https://doi.org/10.5281/zenodo.17422969" ext-link-type="DOI">10.5281/zenodo.17422969</ext-link> <xref ref-type="bibr" rid="bib1.bibx45" id="paren.42"/>.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e5032">YP: writing  –  original draft (lead); formal analysis (lead); software (equal); visualization (lead), LZC: methodology (equal); software (equal), IT: methodology (equal); software (equal), JB: data curation (equal); conceptualization (equal); writing  –  review and editing (equal), AA: data curation (equal); conceptualization (equal), AC: conceptualization (equal); writing  –  review and editing (equal), ZY: supervision (lead); writing  –  review and editing (equal).</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e5038">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e5044">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e5050">Google Gemini Deep Research 2.5 Pro and the Overleaf AI editor were used for spell-checking and proofreading purposes.</p></ack><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e5055">This paper was edited by Bernhard Mayer and reviewed by three anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Albrecht(1989)</label><mixed-citation>Albrecht, B. A.: Aerosols, cloud microphysics, and fractional cloudiness, Science, 245, <ext-link xlink:href="https://doi.org/10.1126/science.245.4923.1227" ext-link-type="DOI">10.1126/science.245.4923.1227</ext-link>, 1989.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Ansmann et al.(2025)</label><mixed-citation>Ansmann, A., Bühl, J., and Peleg, Y.: Cloud Fields Identification with Lidar using advanced AI approach – CloudNet/PollyXT, Limassol 2016–2018, Zenodo [data set], <ext-link xlink:href="https://doi.org/10.5281/zenodo.17424878" ext-link-type="DOI">10.5281/zenodo.17424878</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Baars et al.(2017)</label><mixed-citation>Baars, H., Seifert, P., Engelmann, R., and Wandinger, U.: Target categorization of aerosol and clouds by continuous multiwavelength-polarization lidar measurements, Atmos. Meas. Tech., 10, 3175–3201, <ext-link xlink:href="https://doi.org/10.5194/amt-10-3175-2017" ext-link-type="DOI">10.5194/amt-10-3175-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Baars et al.(2018)</label><mixed-citation>Baars, H., Althausen, D., Engelmann, R., Heese, B., Ansmann, A., Wandinger, U., Hofer, J., Skupin, A., Komppula, M., Giannakaki, E., Filioglou, M., Bortoli, D., Silva, A. M., Pereira, S., Stachlewska, I. S., Kumala, W., Szczepanik, D., Amiridis, V., Marinou, E., Kottas, M., Mattis, I., and Müller, G.: PollyNET – an emerging network of automated raman-polarizarion lidars for continuous aerosolprofiling, in: EPJ Web of Conferences,  176, 09013, <ext-link xlink:href="https://doi.org/10.1051/epjconf/201817609013" ext-link-type="DOI">10.1051/epjconf/201817609013</ext-link>,  2018.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Bansal et al.(2022)</label><mixed-citation>Bansal, A., Lee, Y., Hilburn, K., and Ebert-Uphoff, I.: Tools for Extracting Spatio-Temporal Patterns in Meteorological Image Sequences: From Feature Engineering to Attention-Based Neural Networks, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.2210.12310" ext-link-type="DOI">10.48550/arXiv.2210.12310</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Biasutti et al.(2019)</label><mixed-citation>Biasutti, P., Lepetit, V., Aujol, J. F., Bredif, M., and Bugeau, A.: LU-net: An efficient network for 3D LiDAR point cloud semantic segmentation based on end-to-end-learned 3D features and U-net, in: Proceedings – 2019 International Conference on Computer Vision Workshop, ICCVW 2019, 942–950, <ext-link xlink:href="https://doi.org/10.1109/ICCVW.2019.00123" ext-link-type="DOI">10.1109/ICCVW.2019.00123</ext-link>,  2019.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Bressan et al.(2022)</label><mixed-citation>Bressan, P. O., Junior, J. M., Correa Martins, J. A., de Melo, M. J., Gonçalves, D. N., Freitas, D. M., Marques Ramos, A. P., Garcia Furuya, M. T., Osco, L. P., de Andrade Silva, J., Luo, Z., Garcia, R. C., Ma, L., Li, J., and Gonçalves, W. N.: Semantic segmentation with labeling uncertainty and class imbalance applied to vegetation mapping, Int. J. Appl. Earth Obs., 108, <ext-link xlink:href="https://doi.org/10.1016/j.jag.2022.102690" ext-link-type="DOI">10.1016/j.jag.2022.102690</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Bühl et al.(2017)</label><mixed-citation>Bühl, J., Alexander, S., Crewell, S., Heymsfield, A., Kalesse, H., Khain, A., Maahn, M., Van-Tricht, K., and Wendisch, M.: Chapter 10: Remote sensing, Meteor. Mon., 58, <ext-link xlink:href="https://doi.org/10.1175/AMSMONOGRAPHS-D-16-0015.1" ext-link-type="DOI">10.1175/AMSMONOGRAPHS-D-16-0015.1</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Cairo et al.(2024)</label><mixed-citation>Cairo, F., Di Liberto, L., Dionisi, D., and Snels, M.: Understanding aerosol–cloud interactions through lidar techniques: a review, Remote Sens.-Basel, 16, <ext-link xlink:href="https://doi.org/10.3390/rs16152788" ext-link-type="DOI">10.3390/rs16152788</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>del Águila et al.(2025)</label><mixed-citation>del Águila, A., Ortiz-Amezcua, P., Tabik, S., Bravo-Aranda, J. A., Fernández-Carvelo, S., and Alados-Arboledas, L.: Aerosol type classification with machine learning techniques applied to multiwavelength lidar data from EARLINET, Atmos. Chem. Phys., 25, 12549–12567, <ext-link xlink:href="https://doi.org/10.5194/acp-25-12549-2025" ext-link-type="DOI">10.5194/acp-25-12549-2025</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Doane and Seward(2011)</label><mixed-citation>Doane, D. P. and Seward, L. E.: Measuring skewness: a forgotten statistic?, Journal of Statistics Education, 19, <ext-link xlink:href="https://doi.org/10.1080/10691898.2011.11889611" ext-link-type="DOI">10.1080/10691898.2011.11889611</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Engelmann et al.(2016)</label><mixed-citation>Engelmann, R., Kanitz, T., Baars, H., Heese, B., Althausen, D., Skupin, A., Wandinger, U., Komppula, M., Stachlewska, I. S., Amiridis, V., Marinou, E., Mattis, I., Linné, H., and Ansmann, A.: The automated multiwavelength Raman polarization and water-vapor lidar PollyXT: the neXT generation, Atmos. Meas. Tech., 9, 1767–1784, <ext-link xlink:href="https://doi.org/10.5194/amt-9-1767-2016" ext-link-type="DOI">10.5194/amt-9-1767-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Foley et al.(2024)</label><mixed-citation>Foley, S. R., Knobelspiesse, K. D., Sayer, A. M., Gao, M., Hays, J., and Hoffman, J.: 3D cloud masking across a broad swath using multi-angle polarimetry and deep learning, Atmos. Meas. Tech., 17, 7027–7047, <ext-link xlink:href="https://doi.org/10.5194/amt-17-7027-2024" ext-link-type="DOI">10.5194/amt-17-7027-2024</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Fuller et al.(2025)</label><mixed-citation>Fuller, C. A., Selmer, P. A., Gomes, J., and McGill, M. J.: Using multitask machine learning to type clouds and aerosols from space-based photon-counting lidar measurements, Remote Sens.-Basel, 17, <ext-link xlink:href="https://doi.org/10.3390/rs17162787" ext-link-type="DOI">10.3390/rs17162787</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Galea et al.(2023)</label><mixed-citation>Galea, D., Ma, H.-Y., Wu, W.-Y., and Kobayashi, D.: Deep learning image segmentation for atmospheric rivers, Artificial Intelligence for the Earth Systems, 3, <ext-link xlink:href="https://doi.org/10.1175/aies-d-23-0048.1" ext-link-type="DOI">10.1175/aies-d-23-0048.1</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Haarig et al.(2023)</label><mixed-citation>Haarig, M., Hünerbein, A., Wandinger, U., Docter, N., Bley, S., Donovan, D., and van Zadelhoff, G.-J.: Cloud top heights and aerosol columnar properties from combined EarthCARE lidar and imager observations: the AM-CTH and AM-ACD products, Atmos. Meas. Tech., 16, 5953–5975, <ext-link xlink:href="https://doi.org/10.5194/amt-16-5953-2023" ext-link-type="DOI">10.5194/amt-16-5953-2023</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Hartmann and Doelling(1991)</label><mixed-citation>Hartmann, D. L. and Doelling, D.: On the net radiative effectiveness of clouds, J. Geophys. Res., 96, <ext-link xlink:href="https://doi.org/10.1029/90JD02065" ext-link-type="DOI">10.1029/90JD02065</ext-link>, 1991.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Illingworth et al.(2007)</label><mixed-citation>Illingworth, A. J., Hogan, R. J., O'Connor, E. J., Bouniol, D., Brooks, M. E., Delanoë, J., Donovan, D. P., Eastment, J. D., Gaussiat, N., Goddard, J. W., Haeffelin, M., Klein Baltinik, H., Krasnov, O. A., Pelon, J., Piriou, J. M., Protat, A., Russchenberg, H. W., Seifert, A., Tompkins, A. M., van Zadelhoff, G. J., Vinit, F., Willen, U., Wilson, D. R., and Wrench, C. L.: Cloudnet: continuous evaluation of cloud profiles in seven operational models using ground-based observations, B. Am. Meteorol. Soc., 88, <ext-link xlink:href="https://doi.org/10.1175/BAMS-88-6-883" ext-link-type="DOI">10.1175/BAMS-88-6-883</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Intergovernmental Panel on Climate Change (IPCC)(2014)</label><mixed-citation>Intergovernmental Panel on Climate Change (IPCC): Clouds and aerosols, in: Climate Change 2013 – The Physical Science Basis: Working Group I Contribution to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change, in:  Intergovernmental Panel on Climate Change (IPCC), edited by: Stocker, T. F., Qin, D., Plattner, G.-K., Tignor, M., Allen, S. K., Boschung, J., Nauels, A., Xia, Y., Bex, V., and Midgley, P. M., Cambridge University Press, Cambridge, <ext-link xlink:href="https://doi.org/10.1017/CBO9781107415324.016" ext-link-type="DOI">10.1017/CBO9781107415324.016</ext-link>, 571–658, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Ioffe and Szegedy(2015)</label><mixed-citation>Ioffe, S. and Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift, in: 32nd International Conference on Machine Learning, ICML 2015, 1, 448–456, <ext-link xlink:href="https://doi.org/10.48550/arXiv.1502.03167" ext-link-type="DOI">10.48550/arXiv.1502.03167</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Jacob(2000)</label><mixed-citation>Jacob, D. J.: Heterogeneous chemistry and tropospheric ozone, Atmos. Environ., 34, <ext-link xlink:href="https://doi.org/10.1016/S1352-2310(99)00462-8" ext-link-type="DOI">10.1016/S1352-2310(99)00462-8</ext-link>, 2000.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Jones(1996)</label><mixed-citation>Jones, M. P.: Indicator and stratification methods for missing explanatory variables in multiple linear regression, J. Am. Stat. Assoc., 91, <ext-link xlink:href="https://doi.org/10.1080/01621459.1996.10476680" ext-link-type="DOI">10.1080/01621459.1996.10476680</ext-link>, 1996.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Kalesse-Los et al.(2022)</label><mixed-citation>Kalesse-Los, H., Schimmel, W., Luke, E., and Seifert, P.: Evaluating cloud liquid detection against Cloudnet using cloud radar Doppler spectra in a pre-trained artificial neural network, Atmos. Meas. Tech., 15, 279–295, <ext-link xlink:href="https://doi.org/10.5194/amt-15-279-2022" ext-link-type="DOI">10.5194/amt-15-279-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Krizhevsky et al.(2017)</label><mixed-citation>Krizhevsky, A., Sutskever, I., and Hinton, G. E.: ImageNet classification with deep convolutional neural networks, Commun. ACM, 60, <ext-link xlink:href="https://doi.org/10.1145/3065386" ext-link-type="DOI">10.1145/3065386</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>LeCun et al.(2015)</label><mixed-citation>LeCun, Y., Hinton, G., and Bengio, Y.: Deep learning, Nature, 521,  436–444, <ext-link xlink:href="https://doi.org/10.1038/nature14539" ext-link-type="DOI">10.1038/nature14539</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Lelieveld and Crutzen(1991)</label><mixed-citation>Lelieveld, J. and Crutzen, P. J.: The role of clouds in tropospheric photochemistry, J. Atmos. Chem., 12, <ext-link xlink:href="https://doi.org/10.1007/BF00048075" ext-link-type="DOI">10.1007/BF00048075</ext-link>, 1991.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Levy-Jurgenson et al.(2020)</label><mixed-citation>Levy-Jurgenson, A., Tekpli, X., Kristensen, V. N., and Yakhini, Z.: Spatial transcriptomics inferred from pathology whole-slide images links tumor heterogeneity to survival in breast and lung cancer, Sci. Rep.-UK, 10, <ext-link xlink:href="https://doi.org/10.1038/s41598-020-75708-z" ext-link-type="DOI">10.1038/s41598-020-75708-z</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Milletari et al.(2016)</label><mixed-citation>Milletari, F., Navab, N., and Ahmadi, S. A.: V-Net: fully convolutional neural networks for volumetric medical image segmentation, in: Proceedings – 2016 4th International Conference on 3D Vision, 3DV 2016, 565–571, <ext-link xlink:href="https://doi.org/10.1109/3DV.2016.79" ext-link-type="DOI">10.1109/3DV.2016.79</ext-link>,  2016.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Nicolae et al.(2018)</label><mixed-citation>Nicolae, D., Vasilescu, J., Talianu, C., Binietoglou, I., Nicolae, V., Andrei, S., and Antonescu, B.: A neural network aerosol-typing algorithm based on lidar data, Atmos. Chem. Phys., 18, 14511–14537, <ext-link xlink:href="https://doi.org/10.5194/acp-18-14511-2018" ext-link-type="DOI">10.5194/acp-18-14511-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Oladipo et al.(2024)</label><mixed-citation>Oladipo, B., Gomes, J., McGill, M., and Selmer, P.: Leveraging deep learning as a new approach to layer detection and cloud–aerosol classification using ICESat-2 atmospheric data, Remote Sens.-Basel, 16, 2344, <ext-link xlink:href="https://doi.org/10.3390/rs16132344" ext-link-type="DOI">10.3390/rs16132344</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Pal et al.(1992)</label><mixed-citation>Pal, S. R., Steinbrecht, W., and Carswell, A. I.: Automated method for lidar determination of cloud-base height and vertical extent, Appl. Optics, 31, <ext-link xlink:href="https://doi.org/10.1364/ao.31.001488" ext-link-type="DOI">10.1364/ao.31.001488</ext-link>, 1992.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Ramanathan et al.(2001)</label><mixed-citation>Ramanathan, V., Crutzen, P. J., Kiehl, J. T., and Rosenfeld, D.: Aerosols, climate, and the hydrological cycle, Science, 294, 2119–2124, <ext-link xlink:href="https://doi.org/10.1126/science.1064034" ext-link-type="DOI">10.1126/science.1064034</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Reichstein et al.(2019)</label><mixed-citation>Reichstein, M., Camps-Valls, G., Stevens, B., Jung, M., Denzler, J., Carvalhais, N., and Prabhat:  Deep learning and process understanding for data-driven Earth system science, Nature, 566, <ext-link xlink:href="https://doi.org/10.1038/s41586-019-0912-1" ext-link-type="DOI">10.1038/s41586-019-0912-1</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Rogozovsky et al.(2023)</label><mixed-citation>Rogozovsky, I., Ohneiser, K., Lyapustin, A., Ansmann, A., and Chudnovsky, A.: The impact of different aerosol layering conditions on the high-resolution MODIS/MAIAC AOD retrieval bias: the uncertainty analysis, Atmos. Environ., 309, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2023.119930" ext-link-type="DOI">10.1016/j.atmosenv.2023.119930</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Rogozovsky et al.(2025)</label><mixed-citation>Rogozovsky, I., Ansmann, A., Hofer, J., and Chudnovsky, A.: Unveiling atmospheric layers: vertical pollution patterns and prospects for high-resolution aerosol retrievals using the eastern Mediterranean as a case study, Environ. Sci. Technol., 59, 12181–12195, <ext-link xlink:href="https://doi.org/10.1021/acs.est.4c14556" ext-link-type="DOI">10.1021/acs.est.4c14556</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Rogozovsky et al.(2026)</label><mixed-citation>Rogozovsky, I., Ansmann, A., and Chudnovsky, A.: Vertical aerosol structure matters: improving the AOD–<inline-formula><mml:math id="M111" display="inline"><mml:mrow class="chem"><mml:msub><mml:mi mathvariant="normal">PM</mml:mi><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> link for air quality and exposure, Environ. Sci. Technol., 60, 14685–14697, <ext-link xlink:href="https://doi.org/10.1021/acs.est.6c00095" ext-link-type="DOI">10.1021/acs.est.6c00095</ext-link>, 2026.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Ronneberger et al.(2015)</label><mixed-citation>Ronneberger, O., Fischer, P., and Brox, T.: U-net: convolutional networks for biomedical image segmentation, in: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics),  9351, 234–241, <ext-link xlink:href="https://doi.org/10.1007/978-3-319-24574-4_28" ext-link-type="DOI">10.1007/978-3-319-24574-4_28</ext-link>,  2015.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Rosenfeld et al.(2014)</label><mixed-citation>Rosenfeld, D., Andreae, M. O., Asmi, A., Chin, M., De Leeuw, G., Donovan, D. P., Kahn, R., Kinne, S., Kivekäs, N., Kulmala, M., Lau, W., Schmidt, K. S., Suni, T., Wagner, T., Wild, M., and Quaas, J.: Global observations of aerosol-cloud-precipitation-climate interactions, Rev. Geophys., 52, 750–808, <ext-link xlink:href="https://doi.org/10.1002/2013RG000441" ext-link-type="DOI">10.1002/2013RG000441</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Rusyn et al.(2019)</label><mixed-citation>Rusyn, B., Korniy, V., Lutsyk, O., and Kosarevych, R.: Deep learning for atmospheric cloud image segmentation, in: 2019 11th International Scientific and Practical Conference on Electronics and Information Technologies, ELIT 2019 – Proceedings, 125–128, <ext-link xlink:href="https://doi.org/10.1109/ELIT.2019.8892285" ext-link-type="DOI">10.1109/ELIT.2019.8892285</ext-link>,  2019.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Schimmel et al.(2022)</label><mixed-citation>Schimmel, W., Kalesse-Los, H., Maahn, M., Vogl, T., Foth, A., Garfias, P. S., and Seifert, P.: Identifying cloud droplets beyond lidar attenuation from vertically pointing cloud radar observations using artificial neural networks, Atmos. Meas. Tech., 15, 5343–5366, <ext-link xlink:href="https://doi.org/10.5194/amt-15-5343-2022" ext-link-type="DOI">10.5194/amt-15-5343-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Sudre et al.(2017)</label><mixed-citation>Sudre, C. H., Li, W., Vercauteren, T., Ourselin, S., and Jorge Cardoso, M.: Generalised dice overlap as a deep learning loss function for highly unbalanced segmentations, in: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 10553 LNCS, 240–248, <ext-link xlink:href="https://doi.org/10.1007/978-3-319-67558-9_28" ext-link-type="DOI">10.1007/978-3-319-67558-9_28</ext-link>,  2017.</mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Weitkamp(2005)</label><mixed-citation>Weitkamp, C. (Ed.): Lidar: Range-Resolved Optical Remote Sensing of the Atmosphere, Springer Series in Optical Sciences, Vol. 102, Springer, New York, NY, <ext-link xlink:href="https://doi.org/10.1007/b106786" ext-link-type="DOI">10.1007/b106786</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>Winker et al.(2010)</label><mixed-citation>Winker, D. M., Pelon, J., Coakley, J. A., Ackerman, S. A., Charlson, R. J., Colarco, P. R., Flamant, P., Fu, Q., Hoff, R. M., Kittaka, C., Kubar, T. L., Le Treut, H., McCormick, M. P., Mégie, G., Poole, L., Powell, K., Trepte, K., Vaughan, M. A., and Wielicki, B. A.: The Calipso Mission: a global 3D view of aerosols and clouds, B. Am. Meteorol. Soc., 91, <ext-link xlink:href="https://doi.org/10.1175/2010BAMS3009.1" ext-link-type="DOI">10.1175/2010BAMS3009.1</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Winker et al.(2017)</label><mixed-citation>Winker, D., Chepfer, H., Noel, V., and Cai, X.: Observational constraints on cloud feedbacks: the role of active satellite sensors, Surv. Geophys., 38, <ext-link xlink:href="https://doi.org/10.1007/s10712-017-9452-0" ext-link-type="DOI">10.1007/s10712-017-9452-0</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx45"><label>yonipeleg(2025)</label><mixed-citation>yonipeleg: YakhiniGroup/cloud-fields-identification: cloud-fields-identification (publication), Zenodo [software], <ext-link xlink:href="https://doi.org/10.5281/zenodo.17422969" ext-link-type="DOI">10.5281/zenodo.17422969</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Zhou et al.(2024)</label><mixed-citation>Zhou, X., Chen, B., Ye, Q., Zhao, L., Song, Z., Wang, Y., Hu, J., and Chen, R.: Cloud–aerosol classification based on the U-net model and automatic denoising CALIOP data, Remote Sens.-Basel, 16, <ext-link xlink:href="https://doi.org/10.3390/rs16050904" ext-link-type="DOI">10.3390/rs16050904</ext-link>, 2024.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Cloud fields and aerosol classification with lidar using advanced AI approach</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Albrecht(1989)</label><mixed-citation>
       Albrecht, B. A.: Aerosols, cloud microphysics, and fractional cloudiness, Science, 245, <a href="https://doi.org/10.1126/science.245.4923.1227" target="_blank">https://doi.org/10.1126/science.245.4923.1227</a>, 1989.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Ansmann et al.(2025)</label><mixed-citation>
      
Ansmann, A., Bühl, J., and Peleg, Y.: Cloud Fields Identification with Lidar using advanced AI approach – CloudNet/PollyXT, Limassol 2016–2018, Zenodo [data set], <a href="https://doi.org/10.5281/zenodo.17424878" target="_blank">https://doi.org/10.5281/zenodo.17424878</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Baars et al.(2017)</label><mixed-citation>
       Baars, H., Seifert, P., Engelmann, R., and Wandinger, U.: Target categorization of aerosol and clouds by continuous multiwavelength-polarization lidar measurements, Atmos. Meas. Tech., 10, 3175–3201, <a href="https://doi.org/10.5194/amt-10-3175-2017" target="_blank">https://doi.org/10.5194/amt-10-3175-2017</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Baars et al.(2018)</label><mixed-citation>
       Baars, H., Althausen, D., Engelmann, R., Heese, B., Ansmann, A., Wandinger, U., Hofer, J., Skupin, A., Komppula, M., Giannakaki, E., Filioglou, M., Bortoli, D., Silva, A. M., Pereira, S., Stachlewska, I. S., Kumala, W., Szczepanik, D., Amiridis, V., Marinou, E., Kottas, M., Mattis, I., and Müller, G.: PollyNET – an emerging network of automated raman-polarizarion lidars for continuous aerosolprofiling, in: EPJ Web of Conferences,  176, 09013, <a href="https://doi.org/10.1051/epjconf/201817609013" target="_blank">https://doi.org/10.1051/epjconf/201817609013</a>,  2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Bansal et al.(2022)</label><mixed-citation>
       Bansal, A., Lee, Y., Hilburn, K., and Ebert-Uphoff, I.: Tools for Extracting Spatio-Temporal Patterns in Meteorological Image Sequences: From Feature Engineering to Attention-Based Neural Networks, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.2210.12310" target="_blank">https://doi.org/10.48550/arXiv.2210.12310</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Biasutti et al.(2019)</label><mixed-citation>
       Biasutti, P., Lepetit, V., Aujol, J. F., Bredif, M., and Bugeau, A.: LU-net: An efficient network for 3D LiDAR point cloud semantic segmentation based on end-to-end-learned 3D features and U-net, in: Proceedings – 2019 International Conference on Computer Vision Workshop, ICCVW 2019, 942–950, <a href="https://doi.org/10.1109/ICCVW.2019.00123" target="_blank">https://doi.org/10.1109/ICCVW.2019.00123</a>,  2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Bressan et al.(2022)</label><mixed-citation>
       Bressan, P. O., Junior, J. M., Correa Martins, J. A., de Melo, M. J., Gonçalves, D. N., Freitas, D. M., Marques Ramos, A. P., Garcia Furuya, M. T., Osco, L. P., de Andrade Silva, J., Luo, Z., Garcia, R. C., Ma, L., Li, J., and Gonçalves, W. N.: Semantic segmentation with labeling uncertainty and class imbalance applied to vegetation mapping, Int. J. Appl. Earth Obs., 108, <a href="https://doi.org/10.1016/j.jag.2022.102690" target="_blank">https://doi.org/10.1016/j.jag.2022.102690</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Bühl et al.(2017)</label><mixed-citation>
       Bühl, J., Alexander, S., Crewell, S., Heymsfield, A., Kalesse, H., Khain, A., Maahn, M., Van-Tricht, K., and Wendisch, M.: Chapter 10: Remote sensing, Meteor. Mon., 58, <a href="https://doi.org/10.1175/AMSMONOGRAPHS-D-16-0015.1" target="_blank">https://doi.org/10.1175/AMSMONOGRAPHS-D-16-0015.1</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Cairo et al.(2024)</label><mixed-citation>
       Cairo, F., Di Liberto, L., Dionisi, D., and Snels, M.: Understanding aerosol–cloud interactions through lidar techniques: a review, Remote Sens.-Basel, 16, <a href="https://doi.org/10.3390/rs16152788" target="_blank">https://doi.org/10.3390/rs16152788</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>del Águila et al.(2025)</label><mixed-citation>
      
del Águila, A., Ortiz-Amezcua, P., Tabik, S., Bravo-Aranda, J. A., Fernández-Carvelo, S., and Alados-Arboledas, L.: Aerosol type classification with machine learning techniques applied to multiwavelength lidar data from EARLINET, Atmos. Chem. Phys., 25, 12549–12567, <a href="https://doi.org/10.5194/acp-25-12549-2025" target="_blank">https://doi.org/10.5194/acp-25-12549-2025</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Doane and Seward(2011)</label><mixed-citation>
       Doane, D. P. and Seward, L. E.: Measuring skewness: a forgotten statistic?, Journal of Statistics Education, 19, <a href="https://doi.org/10.1080/10691898.2011.11889611" target="_blank">https://doi.org/10.1080/10691898.2011.11889611</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Engelmann et al.(2016)</label><mixed-citation>
       Engelmann, R., Kanitz, T., Baars, H., Heese, B., Althausen, D., Skupin, A., Wandinger, U., Komppula, M., Stachlewska, I. S., Amiridis, V., Marinou, E., Mattis, I., Linné, H., and Ansmann, A.: The automated multiwavelength Raman polarization and water-vapor lidar PollyXT: the neXT generation, Atmos. Meas. Tech., 9, 1767–1784, <a href="https://doi.org/10.5194/amt-9-1767-2016" target="_blank">https://doi.org/10.5194/amt-9-1767-2016</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Foley et al.(2024)</label><mixed-citation>
       Foley, S. R., Knobelspiesse, K. D., Sayer, A. M., Gao, M., Hays, J., and Hoffman, J.: 3D cloud masking across a broad swath using multi-angle polarimetry and deep learning, Atmos. Meas. Tech., 17, 7027–7047, <a href="https://doi.org/10.5194/amt-17-7027-2024" target="_blank">https://doi.org/10.5194/amt-17-7027-2024</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Fuller et al.(2025)</label><mixed-citation>
       Fuller, C. A., Selmer, P. A., Gomes, J., and McGill, M. J.: Using multitask machine learning to type clouds and aerosols from space-based photon-counting lidar measurements, Remote Sens.-Basel, 17, <a href="https://doi.org/10.3390/rs17162787" target="_blank">https://doi.org/10.3390/rs17162787</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Galea et al.(2023)</label><mixed-citation>
       Galea, D., Ma, H.-Y., Wu, W.-Y., and Kobayashi, D.: Deep learning image segmentation for atmospheric rivers, Artificial Intelligence for the Earth Systems, 3, <a href="https://doi.org/10.1175/aies-d-23-0048.1" target="_blank">https://doi.org/10.1175/aies-d-23-0048.1</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Haarig et al.(2023)</label><mixed-citation>
       Haarig, M., Hünerbein, A., Wandinger, U., Docter, N., Bley, S., Donovan, D., and van Zadelhoff, G.-J.: Cloud top heights and aerosol columnar properties from combined EarthCARE lidar and imager observations: the AM-CTH and AM-ACD products, Atmos. Meas. Tech., 16, 5953–5975, <a href="https://doi.org/10.5194/amt-16-5953-2023" target="_blank">https://doi.org/10.5194/amt-16-5953-2023</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Hartmann and Doelling(1991)</label><mixed-citation>
       Hartmann, D. L. and Doelling, D.: On the net radiative effectiveness of clouds, J. Geophys. Res., 96, <a href="https://doi.org/10.1029/90JD02065" target="_blank">https://doi.org/10.1029/90JD02065</a>, 1991.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Illingworth et al.(2007)</label><mixed-citation>
       Illingworth, A. J., Hogan, R. J., O'Connor, E. J., Bouniol, D., Brooks, M. E., Delanoë, J., Donovan, D. P., Eastment, J. D., Gaussiat, N., Goddard, J. W., Haeffelin, M., Klein Baltinik, H., Krasnov, O. A., Pelon, J., Piriou, J. M., Protat, A., Russchenberg, H. W., Seifert, A., Tompkins, A. M., van Zadelhoff, G. J., Vinit, F., Willen, U., Wilson, D. R., and Wrench, C. L.: Cloudnet: continuous evaluation of cloud profiles in seven operational models using ground-based observations, B. Am. Meteorol. Soc., 88, <a href="https://doi.org/10.1175/BAMS-88-6-883" target="_blank">https://doi.org/10.1175/BAMS-88-6-883</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Intergovernmental Panel on Climate Change
(IPCC)(2014)</label><mixed-citation>
      
Intergovernmental Panel on Climate Change (IPCC): Clouds and aerosols, in:
Climate Change 2013 – The Physical Science Basis: Working Group I
Contribution to the Fifth Assessment Report of the Intergovernmental Panel on
Climate Change, in:  Intergovernmental Panel on Climate Change
(IPCC), edited by: Stocker, T. F., Qin, D., Plattner, G.-K., Tignor, M., Allen, S. K., Boschung, J., Nauels, A., Xia, Y., Bex, V., and Midgley, P. M., Cambridge University Press, Cambridge, <a href="https://doi.org/10.1017/CBO9781107415324.016" target="_blank">https://doi.org/10.1017/CBO9781107415324.016</a>, 571–658, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Ioffe and Szegedy(2015)</label><mixed-citation>
       Ioffe, S. and Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift, in: 32nd International Conference on Machine Learning, ICML 2015, 1, 448–456, <a href="https://doi.org/10.48550/arXiv.1502.03167" target="_blank">https://doi.org/10.48550/arXiv.1502.03167</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Jacob(2000)</label><mixed-citation>
       Jacob, D. J.: Heterogeneous chemistry and tropospheric ozone, Atmos. Environ., 34, <a href="https://doi.org/10.1016/S1352-2310(99)00462-8" target="_blank">https://doi.org/10.1016/S1352-2310(99)00462-8</a>, 2000.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Jones(1996)</label><mixed-citation>
       Jones, M. P.: Indicator and stratification methods for missing explanatory variables in multiple linear regression, J. Am. Stat. Assoc., 91, <a href="https://doi.org/10.1080/01621459.1996.10476680" target="_blank">https://doi.org/10.1080/01621459.1996.10476680</a>, 1996.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Kalesse-Los et al.(2022)</label><mixed-citation>
       Kalesse-Los, H., Schimmel, W., Luke, E., and Seifert, P.: Evaluating cloud liquid detection against Cloudnet using cloud radar Doppler spectra in a pre-trained artificial neural network, Atmos. Meas. Tech., 15, 279–295, <a href="https://doi.org/10.5194/amt-15-279-2022" target="_blank">https://doi.org/10.5194/amt-15-279-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Krizhevsky et al.(2017)</label><mixed-citation>
       Krizhevsky, A., Sutskever, I., and Hinton, G. E.: ImageNet classification with deep convolutional neural networks, Commun. ACM, 60, <a href="https://doi.org/10.1145/3065386" target="_blank">https://doi.org/10.1145/3065386</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>LeCun et al.(2015)</label><mixed-citation>
       LeCun, Y., Hinton, G., and Bengio, Y.: Deep learning, Nature, 521,  436–444, <a href="https://doi.org/10.1038/nature14539" target="_blank">https://doi.org/10.1038/nature14539</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Lelieveld and Crutzen(1991)</label><mixed-citation>
       Lelieveld, J. and Crutzen, P. J.: The role of clouds in tropospheric photochemistry, J. Atmos. Chem., 12, <a href="https://doi.org/10.1007/BF00048075" target="_blank">https://doi.org/10.1007/BF00048075</a>, 1991.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Levy-Jurgenson et al.(2020)</label><mixed-citation>
       Levy-Jurgenson, A., Tekpli, X., Kristensen, V. N., and Yakhini, Z.: Spatial transcriptomics inferred from pathology whole-slide images links tumor heterogeneity to survival in breast and lung cancer, Sci. Rep.-UK, 10, <a href="https://doi.org/10.1038/s41598-020-75708-z" target="_blank">https://doi.org/10.1038/s41598-020-75708-z</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Milletari et al.(2016)</label><mixed-citation>
       Milletari, F., Navab, N., and Ahmadi, S. A.: V-Net: fully convolutional neural networks for volumetric medical image segmentation, in: Proceedings – 2016 4th International Conference on 3D Vision, 3DV 2016, 565–571, <a href="https://doi.org/10.1109/3DV.2016.79" target="_blank">https://doi.org/10.1109/3DV.2016.79</a>,  2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Nicolae et al.(2018)</label><mixed-citation>
       Nicolae, D., Vasilescu, J., Talianu, C., Binietoglou, I., Nicolae, V., Andrei, S., and Antonescu, B.: A neural network aerosol-typing algorithm based on lidar data, Atmos. Chem. Phys., 18, 14511–14537, <a href="https://doi.org/10.5194/acp-18-14511-2018" target="_blank">https://doi.org/10.5194/acp-18-14511-2018</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Oladipo et al.(2024)</label><mixed-citation>
       Oladipo, B., Gomes, J., McGill, M., and Selmer, P.: Leveraging deep learning as a new approach to layer detection and cloud–aerosol classification using ICESat-2 atmospheric data, Remote Sens.-Basel, 16, 2344, <a href="https://doi.org/10.3390/rs16132344" target="_blank">https://doi.org/10.3390/rs16132344</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Pal et al.(1992)</label><mixed-citation>
       Pal, S. R., Steinbrecht, W., and Carswell, A. I.: Automated method for lidar determination of cloud-base height and vertical extent, Appl. Optics, 31, <a href="https://doi.org/10.1364/ao.31.001488" target="_blank">https://doi.org/10.1364/ao.31.001488</a>, 1992.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Ramanathan et al.(2001)</label><mixed-citation>
      
Ramanathan, V., Crutzen, P. J., Kiehl, J. T., and Rosenfeld, D.: Aerosols, climate, and the hydrological cycle, Science, 294, 2119–2124, <a href="https://doi.org/10.1126/science.1064034" target="_blank">https://doi.org/10.1126/science.1064034</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Reichstein et al.(2019)</label><mixed-citation>
       Reichstein, M., Camps-Valls, G., Stevens, B., Jung, M., Denzler, J., Carvalhais, N., and Prabhat:  Deep learning and process understanding for data-driven Earth system science, Nature, 566, <a href="https://doi.org/10.1038/s41586-019-0912-1" target="_blank">https://doi.org/10.1038/s41586-019-0912-1</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Rogozovsky et al.(2023)</label><mixed-citation>
       Rogozovsky, I., Ohneiser, K., Lyapustin, A., Ansmann, A., and Chudnovsky, A.: The impact of different aerosol layering conditions on the high-resolution MODIS/MAIAC AOD retrieval bias: the uncertainty analysis, Atmos. Environ., 309, <a href="https://doi.org/10.1016/j.atmosenv.2023.119930" target="_blank">https://doi.org/10.1016/j.atmosenv.2023.119930</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Rogozovsky et al.(2025)</label><mixed-citation>
       Rogozovsky, I., Ansmann, A., Hofer, J., and Chudnovsky, A.: Unveiling atmospheric layers: vertical pollution patterns and prospects for high-resolution aerosol retrievals using the eastern Mediterranean as a case study, Environ. Sci. Technol., 59, 12181–12195, <a href="https://doi.org/10.1021/acs.est.4c14556" target="_blank">https://doi.org/10.1021/acs.est.4c14556</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Rogozovsky et al.(2026)</label><mixed-citation>
       Rogozovsky, I., Ansmann, A., and Chudnovsky, A.: Vertical aerosol structure matters: improving the AOD–PM<sub>2.5</sub> link for air quality and exposure, Environ. Sci. Technol., 60, 14685–14697, <a href="https://doi.org/10.1021/acs.est.6c00095" target="_blank">https://doi.org/10.1021/acs.est.6c00095</a>, 2026.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Ronneberger et al.(2015)</label><mixed-citation>
       Ronneberger, O., Fischer, P., and Brox, T.: U-net: convolutional networks for biomedical image segmentation, in: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics),  9351, 234–241, <a href="https://doi.org/10.1007/978-3-319-24574-4_28" target="_blank">https://doi.org/10.1007/978-3-319-24574-4_28</a>,  2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Rosenfeld et al.(2014)</label><mixed-citation>
       Rosenfeld, D., Andreae, M. O., Asmi, A., Chin, M., De Leeuw, G., Donovan, D. P., Kahn, R., Kinne, S., Kivekäs, N., Kulmala, M., Lau, W., Schmidt, K. S., Suni, T., Wagner, T., Wild, M., and Quaas, J.: Global observations of aerosol-cloud-precipitation-climate interactions, Rev. Geophys., 52, 750–808, <a href="https://doi.org/10.1002/2013RG000441" target="_blank">https://doi.org/10.1002/2013RG000441</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Rusyn et al.(2019)</label><mixed-citation>
       Rusyn, B., Korniy, V., Lutsyk, O., and Kosarevych, R.: Deep learning for atmospheric cloud image segmentation, in: 2019 11th International Scientific and Practical Conference on Electronics and Information Technologies, ELIT 2019 – Proceedings, 125–128, <a href="https://doi.org/10.1109/ELIT.2019.8892285" target="_blank">https://doi.org/10.1109/ELIT.2019.8892285</a>,  2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Schimmel et al.(2022)</label><mixed-citation>
       Schimmel, W., Kalesse-Los, H., Maahn, M., Vogl, T., Foth, A., Garfias, P. S., and Seifert, P.: Identifying cloud droplets beyond lidar attenuation from vertically pointing cloud radar observations using artificial neural networks, Atmos. Meas. Tech., 15, 5343–5366, <a href="https://doi.org/10.5194/amt-15-5343-2022" target="_blank">https://doi.org/10.5194/amt-15-5343-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Sudre et al.(2017)</label><mixed-citation>
       Sudre, C. H., Li, W., Vercauteren, T., Ourselin, S., and Jorge Cardoso, M.: Generalised dice overlap as a deep learning loss function for highly unbalanced segmentations, in: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Vol. 10553 LNCS, 240–248, <a href="https://doi.org/10.1007/978-3-319-67558-9_28" target="_blank">https://doi.org/10.1007/978-3-319-67558-9_28</a>,  2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Weitkamp(2005)</label><mixed-citation>
      
Weitkamp, C. (Ed.): Lidar: Range-Resolved Optical Remote Sensing of the Atmosphere, Springer Series in Optical Sciences, Vol. 102, Springer, New York, NY, <a href="https://doi.org/10.1007/b106786" target="_blank">https://doi.org/10.1007/b106786</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Winker et al.(2010)</label><mixed-citation>
       Winker, D. M., Pelon, J., Coakley, J. A., Ackerman, S. A., Charlson, R. J., Colarco, P. R., Flamant, P., Fu, Q., Hoff, R. M., Kittaka, C., Kubar, T. L., Le Treut, H., McCormick, M. P., Mégie, G., Poole, L., Powell, K., Trepte, K., Vaughan, M. A., and Wielicki, B. A.: The Calipso Mission: a global 3D view of aerosols and clouds, B. Am. Meteorol. Soc., 91, <a href="https://doi.org/10.1175/2010BAMS3009.1" target="_blank">https://doi.org/10.1175/2010BAMS3009.1</a>, 2010.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Winker et al.(2017)</label><mixed-citation>
       Winker, D., Chepfer, H., Noel, V., and Cai, X.: Observational constraints on cloud feedbacks: the role of active satellite sensors, Surv. Geophys., 38, <a href="https://doi.org/10.1007/s10712-017-9452-0" target="_blank">https://doi.org/10.1007/s10712-017-9452-0</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>yonipeleg(2025)</label><mixed-citation>
      
yonipeleg: YakhiniGroup/cloud-fields-identification: cloud-fields-identification (publication), Zenodo [software], <a href="https://doi.org/10.5281/zenodo.17422969" target="_blank">https://doi.org/10.5281/zenodo.17422969</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Zhou et al.(2024)</label><mixed-citation>
       Zhou, X., Chen, B., Ye, Q., Zhao, L., Song, Z., Wang, Y., Hu, J., and Chen, R.: Cloud–aerosol classification based on the U-net model and automatic denoising CALIOP data, Remote Sens.-Basel, 16, <a href="https://doi.org/10.3390/rs16050904" target="_blank">https://doi.org/10.3390/rs16050904</a>, 2024.

    </mixed-citation></ref-html>--></article>
