<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">AMT</journal-id><journal-title-group>
    <journal-title>Atmospheric Measurement Techniques</journal-title>
    <abbrev-journal-title abbrev-type="publisher">AMT</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Atmos. Meas. Tech.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1867-8548</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/amt-19-2225-2026</article-id><title-group><article-title>A Physics-Constrained Deep-Learning Framework based on Long-Term Remote-Sensing Data for Retrieving Vertical Distribution of PM<sub>2.5</sub> Chemical Components</article-title><alt-title>A Physics-Constrained Deep-Learning Framework</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Li</surname><given-names>Hongyi</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Yang</surname><given-names>Ting</given-names></name>
          <email>tingyang@mail.iap.ac.cn</email>
        <ext-link>https://orcid.org/0000-0001-5605-0654</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Sun</surname><given-names>Yele</given-names></name>
          
        <ext-link>https://orcid.org/0000-0003-2354-0221</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff2">
          <name><surname>Wang</surname><given-names>Zifa</given-names></name>
          
        </contrib>
        <aff id="aff1"><label>1</label><institution>State Key Laboratory of Atmospheric Environment and Extreme Meteorology, Institute of Atmospheric Physics, Chinese Academy of Sciences, Beijing 100029, China</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>College of Earth and Planetary Sciences, University of Chinese Academy of Sciences, Beijing 100049, China</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Ting Yang (tingyang@mail.iap.ac.cn)</corresp></author-notes><pub-date><day>1</day><month>April</month><year>2026</year></pub-date>
      
      <volume>19</volume>
      <issue>6</issue>
      <fpage>2225</fpage><lpage>2244</lpage>
      <history>
        <date date-type="received"><day>29</day><month>August</month><year>2025</year></date>
           <date date-type="rev-request"><day>15</day><month>September</month><year>2025</year></date>
           <date date-type="rev-recd"><day>9</day><month>January</month><year>2026</year></date>
           <date date-type="accepted"><day>21</day><month>January</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Hongyi Li et al.</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://amt.copernicus.org/articles/19/2225/2026/amt-19-2225-2026.html">This article is available from https://amt.copernicus.org/articles/19/2225/2026/amt-19-2225-2026.html</self-uri><self-uri xlink:href="https://amt.copernicus.org/articles/19/2225/2026/amt-19-2225-2026.pdf">The full text article is available as a PDF file from https://amt.copernicus.org/articles/19/2225/2026/amt-19-2225-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e124">The vertical distribution of PM<sub>2.5</sub> chemical components is crucial for identifying the causes of atmospheric pollution and its impact on climate change and extreme weather. By integrating long-term lidar measurements, deep-learning algorithms and a physics-constrained optimization method, this paper presents a novel lidar-based retrieval framework to obtain vertical mass concentration profiles of PM<sub>2.5</sub> chemical components for the first time. Identifiable components include sulfate (SO<inline-formula><mml:math id="M4" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>), nitrate (NO<inline-formula><mml:math id="M5" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>), ammonium (NH<inline-formula><mml:math id="M6" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>), organic matter (OM) and black carbon (BC), which extend beyond the component types that traditional remote-sensing retrievals can identify. A 1-year retrieved surface mass concentrations of these components closely aligned with the observations, with Pearson correlation coefficient values ranging from 0.87 to 0.97. The retrieval framework applied to varying non-training spatiotemporal scenarios showed moderate generalization capability, although a tendency toward underestimation is observed. Tower and aircraft-based field campaigns indicate that the retrieved and observed vertical profiles of these components exhibited consistent patterns in mass concentrations and proportions. Subsequently, an explainable method was incorporated into the retrieval framework to quantify the multivariate driving effects on vertical profile retrieval. Results showed that the extinction coefficient and representative indicators within physiochemical processes contributed significantly to mass concentrations of these components. Finally, a dataset of vertical mass concentration profiles of these components over six years in a Chinese megacity (Beijing) was generated by the retrieval framework, revealing the dominant roles of OM and NO<inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> in PM<sub>2.5</sub> throughout the entire boundary layer across all seasons. As a result of the continued implementation of clean air policies in China, these components exhibited significant decreases during 2021–2022 compared with 2017–2018. Our retrieval framework offers a novel approach for acquiring vertical profiles of PM<sub>2.5</sub> chemical components, thereby providing a new perspective on elucidating the vertical evolution of atmospheric pollutants.</p>
  </abstract>
    
<funding-group>
<award-group id="gs1">
<funding-source>National Natural Science Foundation of China</funding-source>
<award-id>42422506</award-id>
</award-group>
</funding-group>
</article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e224">PM<sub>2.5</sub> is a complex mixture composed of varying chemical components (Tao et al., 2017), mainly including sulfate (SO<inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>), nitrate (NO<inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>), ammonium (NH<inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>), organic matter (OM) and black carbon (BC). The diverse physiochemical properties arising from various chemical components yield distinct effects on the environment (Tan et al., 2018), climate change (Menon et al., 2002; Zhu et al., 2024) and human health (Kim et al., 2022). Vertical detection technologies have revealed that chemical components are primarily distributed at varying heights within the atmospheric boundary layer and contribute to environmental pollution through internal physiochemical processes (Morgan et al., 2009; Yang et al., 2024; Sun et al., 2015). Additionally, the proportion and vertical distribution of chemical components can regulate radiation flux at both the top of the atmosphere and at the surface by directly affecting light absorption and scattering, as well as the microphysical properties of clouds, thereby influencing climate change and extreme weather (Zhao et al., 2024). Consequently, characterizing the vertical structures of chemical components is essential for identifying the causes of PM<sub>2.5</sub> pollution and the response mechanisms related to climate change and extreme weather.</p>
      <p id="d2e284">Field campaigns are widely conducted to obtain vertical profiles of PM<sub>2.5</sub> chemical components by mounting observation instruments on meteorological towers, aircraft, tethered balloons and unmanned aerial vehicles. However, these platforms are constrained by sparse detection sites and heights, limited flight schedules, and high observation costs (Dubey et al., 2022), hindering the time-continuous acquisition of vertical profiles of PM<sub>2.5</sub> chemical components within the whole boundary layer over a long-term period. Continuous remote-sensing lidar detection technologies with high temporal and vertical resolution serve as robust pathways for the constant identification of PM<sub>2.5</sub> and its components across all altitudes (Matus et al., 2025; Toth et al., 2022; Wang et al., 2022). Additionally, both satellite-based lidar and ground-based lidar networks, such as China Lidar Joint Observation Network (LiDARNET, <uri>https://lidar.pku.edu.cn/</uri>, last access: 25 July 2025), Asian Dust Network (AD-NET) (Sugimoto et al., 2005), Micro Pulse Lidar Network (MPLNET) (Welton et al., 2001), and European Aerosol Research Lidar Network (EARLINET) (Ansmann et al., 2003), provide remote sensing capabilities with extensive spatial coverage.</p>
      <p id="d2e317">Retrieval algorithms for the lidar have been progressively developed over the past 20 years. Earlier studies utilized lidar depolarization ratios to identify dust and non-dust aerosols (Sugimoto et al., 2003; Tesche et al., 2009). Subsequently, additional lidar parameter constraints, such as multi-wavelength backscatter coefficient and lidar ratio, were incorporated to identify dust aerosol, water-soluble aerosols, black carbon, and sea salt based on the assumption of external mixing (Nishizawa et al., 2011; Nishizawa et al., 2017). Hara et al. (2018) considered the hygroscopic growth of water-soluble aerosols and their internal mixing with BC to mitigate the overestimation of BC retrieval (Hara et al., 2018). By integrating the ground-based lidar and sun-photometer, Wang et al. (2022) significantly increased the identifiable aerosol component types, including ammonium nitrate-like, water-insoluble organic matter, water-soluble organic matter, black carbon and fine-mode aerosol water content (Wang et al., 2022). However, the aerosol component type retrieved from existing lidar retrieval algorithms that utilize aerosol optical properties is not equivalent to the conventional chemical component type. Due to similar optical properties exhibited by PM<sub>2.5</sub> chemical components, the identification of chemical component types seems to be beyond the scope of remote-sensing retrieval (Wang et al., 2022). Moreover, the multiple parameterization assumptions introduced by existing lidar retrieval algorithms increase the uncertainties in component retrieval.</p>
      <p id="d2e329">Data-driven machine learning can interpret the nonlinear relationships between PM<sub>2.5</sub> chemical components and various driving factors without the constraints imposed by the inherent properties of these components (Li et al., 2025a). Meng et al. (2018) utilized a random forest algorithm to predict national mass concentrations of SO<inline-formula><mml:math id="M20" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M21" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, organic carbon (OC) and elemental carbon (EC), achieving <inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values ranging from 0.71 to 0.86 on a daily scale (Meng et al., 2018). Based on this algorithm, Lv et al. (2021) further achieved the hourly predictions of the aforementioned chemical components and NH<inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> with <inline-formula><mml:math id="M24" display="inline"><mml:mi>R</mml:mi></mml:math></inline-formula> values of 0.71–0.81 (Lv et al., 2021). Subsequently, deep learning algorithms are employed to accurately characterize complex nonlinear relationships and effectively extract data features, thereby enhancing the predictive ability of hourly mass concentrations of PM<sub>2.5</sub> chemical components (Lee et al., 2023; Liu et al., 2023; Li et al., 2025a). However, current studies primarily focus on predicting the ground-level mass concentrations of PM<sub>2.5</sub> chemical components but cannot interpret the vertical distribution of these components. Furthermore, existing prediction models are susceptible to the quantity and quality of available training data due to the absence of physical constraints, limiting their spatiotemporal generalization capabilities.</p>
      <p id="d2e418">In this study, we proposed a novel physics-constrained deep-learning framework that utilized lidar data to retrieve vertical profiles of five PM<sub>2.5</sub> chemical components (SO<inline-formula><mml:math id="M28" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M29" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NH<inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC) for the first time. Our retrieval framework effectively mitigates the limitations of remote-sensing retrieval algorithms in identifying chemical components, as well as the deficiencies and limited generalization capabilities of purely data-driven machine learning techniques in characterizing vertical profiles of these components. Detailed descriptions of the retrieval framework and the data utilized are provided in Sect. 2., while Sect. 3 discusses the validation of the retrieval framework, the assessment of feature importance, and applications of this framework. Section 4 presents the conclusion.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Data and methodology</title>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Data</title>
<sec id="Ch1.S2.SS1.SSS1">
  <label>2.1.1</label><title>Lidar measurement</title>
      <p id="d2e491">The <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mrow><mml:mi mathvariant="normal">bsc</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">532</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> data for deep learning module training and PM<sub>2.5</sub> chemical component retrieving is obtained from a ground-based dual-wavelength polarization Mie lidar at the Institute of Atmospheric Physics (IAP), Chinese Academy of Sciences (CAS), Beijing (39.98° N, 116.38° E). This Mie lidar has consistently detected optical signals since 2017, offering a temporal resolution of 15 min and a vertical resolution of 6 m. The lidar specification parameters and data preprocessing are detailed in Sect. S1 (and Table S1) and Sect. S2 in the Supplement, respectively. The <inline-formula><mml:math id="M33" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mrow><mml:mi mathvariant="normal">bsc</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mn mathvariant="normal">532</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> data from 8–15 February 2021 at 23 lidar sites in the North China Plain (NCP), provided by the China National Environmental Monitoring Center (CNEMC), were utilized to assess the spatial generalization ability. The multi-site data offers a temporal resolution of 5–20 min and a vertical resolution of 7.5 m. To generate an hourly resolution lidar dataset, minute-level data were resampled using a simple averaging method. Specifically, the arithmetic mean was calculated from all valid minute-level data points within each non-overlapping one-hour window aligned to the start of each hour (e.g., from 00:00 to 00:59).</p>
</sec>
<sec id="Ch1.S2.SS1.SSS2">
  <label>2.1.2</label><title>Auxiliary data for Retrieval</title>
      <p id="d2e545">The data of multiple meteorological parameters for deep learning module training and PM<sub>2.5</sub> chemical component retrieving can be obtained from the 5th Generation European Centre for Medium-Range Weather Forecasts (ECMWF) ReAnalysis (ERA5, <uri>https://cds.climate.copernicus.eu/datasets</uri>, last access: 25 July 2025), which provides the hourly data on pressure levels (1000–1 hPa) from 1940 to present with a spatial resolution of <inline-formula><mml:math id="M35" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.25</mml:mn><mml:mi mathvariant="italic">°</mml:mi><mml:mo>×</mml:mo><mml:mn mathvariant="normal">0.25</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula>. The data of fine soil, coarse mass and fine sea salt for physics-constrained optimization can be obtained from 4th Generation ECMWF Atmospheric Composition Reanalysis (EAC4, <uri>https://ads.atmosphere.copernicus.eu/datasets</uri>, last access: 25 July 2025), which provides the 3 h data on pressure levels (1000–1 hPa) from 2003 to 2024 with a spatial resolution of <inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:mn mathvariant="normal">0.75</mml:mn><mml:mi mathvariant="italic">°</mml:mi><mml:mo>×</mml:mo><mml:mn mathvariant="normal">0.75</mml:mn><mml:mi mathvariant="italic">°</mml:mi></mml:mrow></mml:math></inline-formula>. The mass concentration (<inline-formula><mml:math id="M37" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) of fine soil is approximately estimated by the mixing ratio (kg kg<sup>−1</sup>) of dust aerosol with a diameter of 0.03–0.9 <inline-formula><mml:math id="M40" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>m. The mass concentration (<inline-formula><mml:math id="M41" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) of coarse mass is approximately estimated by the mixing ratio (kg kg<sup>−1</sup>) of dust aerosol with a diameter of 0.9–20 <inline-formula><mml:math id="M44" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>m. The mass concentration (<inline-formula><mml:math id="M45" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) of fine sea salt is approximately estimated by the mixing ratio (kg kg<sup>−1</sup>) of sea salt aerosol with a diameter of 0.03–5 <inline-formula><mml:math id="M48" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>m. The pressure levels (hPa) of ERA5 and EAC4 are converted to geometric heights (m), and the 3 h EAC4 data is converted to hourly data through a linear interpolation method. The grid cells of EAC4 and ERA5 that contain the lidar sites were extracted using the k-nearest neighbor search method based on longitude and latitude data (Friedman et al., 1977). The lidar data and the reanalysis data were interpolated onto a preset vertical grid with a height range of 50 m to 3 km using linear interpolation. The preset height information is presented in Sect. S2.</p>
</sec>
<sec id="Ch1.S2.SS1.SSS3">
  <label>2.1.3</label><title>Surface observations</title>
      <p id="d2e725">Ground-level mass concentrations of NH<inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M50" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M51" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC at the Beijing lidar site (39.98° N, 116.38° E) were collected for training the deep learning module and validating retrievals by a high-resolution time-of-flight aerosol mass spectrometer, with a temporal resolution of 1 h, covering the periods from 1 January  2021, to 31 March  2022, and 1 June  to 31 August  2022. Ground-level mass concentrations of the five PM<sub>2.5</sub> chemical components at 23 non-training NCP sites were provided by CNEMC. Besides, ground-level PM<sub>2.5</sub> mass concentrations, approximately equal to the sum of the mass concentrations of the five chemical components, are available on CNEMC data release website (<uri>https://www.cnemc.cn/</uri>, last access: 25 July 2025).</p>
</sec>
<sec id="Ch1.S2.SS1.SSS4">
  <label>2.1.4</label><title>Aircraft-based and tower-based measurements</title>
      <p id="d2e797">The aircraft-based vertical profiles for retrieval independent verification were sampled in a flight experiment at an airport site in Shijiazhuang (37.54° N, 114.35° E). The flight time schedules (LT, local time) are detailed in Table 1. The tower-based vertical profiles at altitudes of 16, 102  and 280 m were sampled at a 325 m meteorological tower located at the IAP, CAS in Beijing (39.98° N, 116.38° E) for 10 d (27 and 30 December 2023; 2, 5, 9, 12, 15, 18, 24, and 27 January 2024). A flow sampler with a flow rate of 42.8 L min<sup>−1</sup> and the 47 mm quartz filter membranes were utilized to collect PM<sub>2.5</sub> chemical component samples in the aircraft-based and tower-based sampling experiments. Furthermore, the 325 m tower-based vertical profiles from 30 December  2018 to 2 January 2019 were also collected from Lei et al. (2021)'s study.</p>

<table-wrap id="T1" specific-use="star"><label>Table 1</label><caption><p id="d2e824">Flight time schedules (LT, local time), corresponding surface temperature and relative humidity.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="5">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Date</oasis:entry>
         <oasis:entry colname="col2">Flight time</oasis:entry>
         <oasis:entry colname="col3">Sampling height (m)</oasis:entry>
         <oasis:entry colname="col4">Surface temperature (°C)</oasis:entry>
         <oasis:entry colname="col5">Surface relative humidity (%)</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">26 September  2024</oasis:entry>
         <oasis:entry colname="col2">19:10–21:10</oasis:entry>
         <oasis:entry colname="col3">2100</oasis:entry>
         <oasis:entry colname="col4">19.2–22.9</oasis:entry>
         <oasis:entry colname="col5">87.5–95</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">10  October 2024</oasis:entry>
         <oasis:entry colname="col2">19:40–21:40</oasis:entry>
         <oasis:entry colname="col3">600</oasis:entry>
         <oasis:entry colname="col4">18.8–19.2</oasis:entry>
         <oasis:entry colname="col5">29–30</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">11  December 2024</oasis:entry>
         <oasis:entry colname="col2">15:00–16:00</oasis:entry>
         <oasis:entry colname="col3">1200</oasis:entry>
         <oasis:entry colname="col4">4.3–4.9</oasis:entry>
         <oasis:entry colname="col5">31–34</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">11  December 2024</oasis:entry>
         <oasis:entry colname="col2">16:00–17:00</oasis:entry>
         <oasis:entry colname="col3">1500</oasis:entry>
         <oasis:entry colname="col4">3.3–4.3</oasis:entry>
         <oasis:entry colname="col5">34–38</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <fig id="F1" specific-use="star"><label>Figure 1</label><caption><p id="d2e941">Remote-sensing retrieval framework for vertical distribution of five PM<sub>2.5</sub> chemical components (NH<inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M58" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M59" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC). (<inline-formula><mml:math id="M60" display="inline"><mml:mi>U</mml:mi></mml:math></inline-formula>: <inline-formula><mml:math id="M61" display="inline"><mml:mi>U</mml:mi></mml:math></inline-formula>-component wind; <inline-formula><mml:math id="M62" display="inline"><mml:mi>V</mml:mi></mml:math></inline-formula>: <inline-formula><mml:math id="M63" display="inline"><mml:mi>V</mml:mi></mml:math></inline-formula>-component wind; <inline-formula><mml:math id="M64" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>: Temperature; RH: Relative Humidity; <inline-formula><mml:math id="M65" display="inline"><mml:mi>q</mml:mi></mml:math></inline-formula>: Specific Humidity; <inline-formula><mml:math id="M66" display="inline"><mml:mi>w</mml:mi></mml:math></inline-formula>: Vertical Velocity; <inline-formula><mml:math id="M67" display="inline"><mml:mi>Z</mml:mi></mml:math></inline-formula>: Geopotential; <inline-formula><mml:math id="M68" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mrow><mml:mi mathvariant="normal">ext</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">532</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>: Aerosol Extinction Coefficient at 532 nm; CNN: Convolutional Neural Network; ReLU: Rectified Linear Unit; FC: Fully Connected; BiLSTM: Bidirectional Long Short-Term Memory; IMPROVE: Interagency Monitoring of Projected Visual Environment; NSGA-II: Non-dominated Sorting Genetic Algorithm II).</p></caption>
            <graphic xlink:href="https://amt.copernicus.org/articles/19/2225/2026/amt-19-2225-2026-f01.png"/>

          </fig>

</sec>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Methodology</title>
<sec id="Ch1.S2.SS2.SSS1">
  <label>2.2.1</label><title>Retrieval framework</title>
      <p id="d2e1089">This paper proposed a novel retrieval framework for retrieving the vertical concentration profiles of five PM<sub>2.5</sub> chemical components (NH<inline-formula><mml:math id="M70" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M71" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M72" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC) from the lidar aerosol extinction coefficient at 532 nm (<inline-formula><mml:math id="M73" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mrow><mml:mi mathvariant="normal">bsc</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mn mathvariant="normal">532</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>). As shown in Fig. 1, the retrieval framework mainly consists of a deep learning module and a physics-constrained optimization module. The input datasets of the deep learning module include the surface observation data, meteorological data and ground-based lidar data (Fig. 1a). Specifically, the aerosol extinction coefficient at 532 nm (<inline-formula><mml:math id="M74" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mrow><mml:mi mathvariant="normal">bsc</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mn mathvariant="normal">532</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>) and multiple meteorological parameters (<inline-formula><mml:math id="M75" display="inline"><mml:mi>u</mml:mi></mml:math></inline-formula>-component wind, <inline-formula><mml:math id="M76" display="inline"><mml:mi>v</mml:mi></mml:math></inline-formula>-component wind, temperature, relative humidity, specific humidity, vertical velocity and geopotential) serve as input features, while the concentrations of the five PM<sub>2.5</sub> chemical components (NH<inline-formula><mml:math id="M78" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC) serve as target features. The deep learning module (Fig. 1b), mainly consisting of the Convolutional Neural Network (CNN), Bidirectional Long Short-Term Memory (BiLSTM), attention mechanism and Bayesian optimization, is utilized to establish the nonlinear relationship between input and target features. The input datasets of the physics-constrained optimization module include the ground-based lidar data, aerosol auxiliary data and deep learning intermediate output (Fig. 1a, c), which provide fundamental input for establishing a multi-object function based on the Interagency Monitoring of Projected Visual Environment (IMPROVE) equation. The physics-constrained optimization module incorporates the multi-object loss function with the Non-dominated Sorting Genetic Algorithm II (NSGA-II) to implement external physical constraints (Fig. 1c), thus enhancing the extrapolation capability of the deep learning module and generating high-quality vertical concentration profiles of the five PM<sub>2.5</sub> chemical components. Detailed descriptions of the deep learning algorithms, hyperparameter tuning, and physics-constrained optimization used in this work will be presented in subsequent sections. The brief workflow of the retrieval framework is summarized as follows. <list list-type="bullet"><list-item>
      <p id="d2e1249">Step 1. The multi-source input datasets undergo matching across spatiotemporal and vertical dimensions. All input and output data are uniformly time-resolved to hourly intervals, while vertical data are uniformly vertically resolved into 10 layers ranging from 50 m to 3 km.</p></list-item><list-item>
      <p id="d2e1253">Step 2. The input data of the deep learning module are normalized by <inline-formula><mml:math id="M82" display="inline"><mml:mi>Z</mml:mi></mml:math></inline-formula>-score normalization to stabilize the training process, accelerate training convergence, and enhance model robustness (Al-Faiz et al., 2018; Cabello-Solorzano et al., 2023).</p></list-item><list-item>
      <p id="d2e1264">Step 3. Training deep learning module by using the normalized surface-level input data.</p></list-item><list-item>
      <p id="d2e1268">Step 4. Generating the normalized concentrations of the five PM<sub>2.5</sub> chemical components at each vertical layer by feeding the normalized height-level input data into the deep learning module.</p></list-item><list-item>
      <p id="d2e1281">Step 5. Denormalizing the deep-learning output by using the inverse <inline-formula><mml:math id="M84" display="inline"><mml:mi>Z</mml:mi></mml:math></inline-formula>-score transformation, with the mean and standard deviation statistics derived from the original training set, thereby recovering the physical mass concentration unit (<inline-formula><mml:math id="M85" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>).</p></list-item><list-item>
      <p id="d2e1312">Step 6. Optimizing the denormalized deep learning output through implementing an external physics constraint to obtain the high-quality vertical concentration profiles of the five PM<sub>2.5</sub> chemical components. Repeat steps 4–6 until the retrieval task is complete.</p></list-item></list></p>
</sec>
<sec id="Ch1.S2.SS2.SSS2">
  <label>2.2.2</label><title>Deep learning and hyperparameter tuning</title>
      <p id="d2e1332">The deep learning module is the core of the retrieval framework that generates the normalized vertical profiles of five PM<sub>2.5</sub> chemical components by feeding the lidar-based aerosol extinction coefficient at 532 nm (<inline-formula><mml:math id="M89" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mrow><mml:mi mathvariant="normal">ext</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mn mathvariant="normal">532</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>) and multiple meteorological parameters. The deep learning module is designed by numerous neural network layers (Fig. 1, red part), including the CNN layer, Average Pooling layer, Rectified Linear Unit (ReLU) layer, Fully Connected (FC) layer, Attention Mechanism layer, Sigmoid layer, Flatten layer, BiLSTM layer, Dropout layer and Regression Output layer. The CNN and BiLSTM layers, coupled with the Attention Mechanism (AM), are designed to effectively capture the multivariate and temporal characteristics in the training data, thereby establishing a robust nonlinear mapping between the input and output features. The hybrid CNN-BiLSTM-AM architecture consistently outperforms single-architecture models in predictive tasks, as evidenced by numerous studies (Kavianpour et al., 2023; Ma et al., 2022; Shan et al., 2021; Zhang et al., 2023). Other layers are responsible for data input, structural transformation, normalization, nonlinear process, pooling process, neuron removal and data output, enhancing the training performance and preventing overfitting. Here, we review the description of three key layers, and the description of other layers can be found in our previous work (Li et al., 2025a).</p>
      <p id="d2e1361">CNN is a variant of the multilayer perceptron that efficiently identifies the relevant features through local perception, sparse connections and sharing of weight and bias (Alzubaidi et al., 2021). The convolutional layer in CNN performs convolutional computation on input across spatial dimensionality using learnable kernels to extract local features and enhance training efficiency (O'Shea and Nash, 2015). Then the convolutional output is typically enhanced nonlinearly by the ReLU layer (Eq. 1) or down-sampled nonlinearly by the pooling layer in a CNN architecture.

              <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M90" display="block"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mi mathvariant="normal">max</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mn mathvariant="normal">0</mml:mn><mml:mo>,</mml:mo><mml:mi>f</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi mathvariant="bold-italic">w</mml:mi><mml:mo>×</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

            where <inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the nonlinearly enhanced convolutional output at timestep <inline-formula><mml:math id="M92" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:mi>f</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:mi>w</mml:mi><mml:mo>×</mml:mo><mml:msub><mml:mi>x</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>b</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula> is the original convolutional output at timestep <inline-formula><mml:math id="M94" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the input data at timestep <inline-formula><mml:math id="M96" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M97" display="inline"><mml:mi>w</mml:mi></mml:math></inline-formula> is the weight vector and <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:msub><mml:mi>b</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the bias term.</p>
      <p id="d2e1495">The attention mechanism layer is incorporated with CNN to amplify the weight of key information and mitigate the interference of redundant information, leading to an enhancement in the quality of the CNN output (Wang and Zhang, 2025). The attention mechanism is inspired by the ability of human vision to selectively focus on key information (Guo et al., 2022). Our retrieval framework integrates a data-driven channel attention mechanism, which rescales the original feature channels from the convolutional layers through element-wise multiplication using learned attention weights, thereby enhancing the importance of key features and reduce the interference of irrelevant features. The attention weights are generated by the FC layers with a sigmoid activation function (Eq. 2) and then performs Schur product operation with CNN multivariate output (Eq. 3).

                  <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M99" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E2"><mml:mtd><mml:mtext>2</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="normal">sigmoid</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:mi mathvariant="normal">FC</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi mathvariant="normal">Pooling</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E3"><mml:mtd><mml:mtext>3</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi mathvariant="bold-italic">W</mml:mi><mml:mo>⋅</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">y</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            where <inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is the CNN multivariate output. Pooling and FC layers are responsible for down-sampling and feature learning, respectively, thus predicting the importance of <inline-formula><mml:math id="M101" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th feature. The sigmoid activation function is utilized to calculate the attention weight (<inline-formula><mml:math id="M102" display="inline"><mml:mrow><mml:mi mathvariant="bold">W</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. <inline-formula><mml:math id="M103" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is the reweighted multivariate output.</p>
      <p id="d2e1628">BiLSTM is a variant of Recurrent Neural Networks (RNNs) that learns long-timestep information bidirectionally and avoids the gradient vanishing or explosion of traditional RNNs (Kavianpour et al., 2023). Previous studies have indicated that BiLSTM outperforms LSTM in regression tasks due to the insufficient utilization of future information in LSTM (Siami-Namini et al., 2019; Yang and Wang, 2022). Therefore, the BiLSTM layer is integrated into the deep-learning module to fully capture the temporal characteristics of the CNN attention-weighted multivariate output. The BiLSTM layer is realized by the forward LSTM and backward LSTM (Eq. 4). Both the forward and backward LSTM consist of cell states, forget gates, input gates, output gates, and activation functions, which are responsible for transmission, screening and processing of temporal information. The final LSTM output is obtained by output gates and cell states (Eq. 5). A detailed description of BiLSTM components can be found in our previous work (Li et al., 2025a).

                  <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M104" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E4"><mml:mtd><mml:mtext>4</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi mathvariant="bold-italic">H</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mfenced open="[" close="]"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">h</mml:mi><mml:mo mathvariant="normal">→</mml:mo></mml:mover><mml:mi>t</mml:mi></mml:msub><mml:mo>;</mml:mo><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">h</mml:mi><mml:mo mathvariant="normal">←</mml:mo></mml:mover><mml:mrow><mml:mi mathvariant="normal">end</mml:mi><mml:mo>-</mml:mo><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:mfenced><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E5"><mml:mtd><mml:mtext>5</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi mathvariant="bold-italic">h</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">o</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>×</mml:mo><mml:mi>tanh⁡</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi mathvariant="bold-italic">C</mml:mi><mml:mi>t</mml:mi></mml:msub><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            where <inline-formula><mml:math id="M105" display="inline"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the final output of BiLSTM at timestep <inline-formula><mml:math id="M106" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, which is obtained by concatenating the forward output <inline-formula><mml:math id="M107" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="bold-italic">h</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and backward output value <inline-formula><mml:math id="M108" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi mathvariant="bold-italic">h</mml:mi><mml:mo mathvariant="normal">←</mml:mo></mml:mover><mml:mrow><mml:mi mathvariant="normal">end</mml:mi><mml:mo>-</mml:mo><mml:mi>t</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>. <inline-formula><mml:math id="M109" display="inline"><mml:mrow><mml:msub><mml:mi>h</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the final output of LSTM at timestep <inline-formula><mml:math id="M110" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M111" display="inline"><mml:mrow><mml:msub><mml:mi>o</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the output of output gate at timestep <inline-formula><mml:math id="M112" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>, <inline-formula><mml:math id="M113" display="inline"><mml:mi mathvariant="normal">tanh</mml:mi></mml:math></inline-formula> is an activation function that regulates the values transmitted in neural networks by compressing the values to a range of from <inline-formula><mml:math id="M114" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> to 1. <inline-formula><mml:math id="M115" display="inline"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mi>t</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the output of the cell state at timestep <inline-formula><mml:math id="M116" display="inline"><mml:mi>t</mml:mi></mml:math></inline-formula>.</p>
      <p id="d2e1842">Hyperparameter tuning is crucial for improving the performance of deep neural networks. Bayesian optimization can determine global optima with higher efficiency (Shahriari et al., 2016) and has been widely employed in hyperparameter optimization of varying machine learning models (Wu et al., 2019a). The primary process of Bayesian optimization involves establishing search spaces for hyperparameters and the corresponding objective function, followed by the determination of the optimal solution by minimizing the objective function (Eq. 6). Bayesian optimization utilizes a probabilistic surrogate model to iteratively estimate the complex unknown objective function based on the current query point and then identifies the next most promising query point by an acquisition function (Shahriari et al., 2016). The probabilistic surrogate model and the acquisition function in this study are the Gaussian process regression model (Rasmussen, 2004) and the Expected-Improvement-Per-Second-Plus function (Gelbart et al., 2014), respectively. The number of optimization iteration is set to 30 and the final optimal settings of model hyperparameters are presented in Table S2.

              <disp-formula id="Ch1.E6" content-type="numbered"><label>6</label><mml:math id="M117" display="block"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>∗</mml:mo></mml:msup><mml:mo>=</mml:mo><mml:mi mathvariant="normal">argmin</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>f</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>)</mml:mo><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>∈</mml:mo><mml:mi mathvariant="bold-italic">X</mml:mi><mml:mo>⊆</mml:mo><mml:msup><mml:mi>R</mml:mi><mml:mi>d</mml:mi></mml:msup><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

            where <inline-formula><mml:math id="M118" display="inline"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>∗</mml:mo></mml:msup></mml:mrow></mml:math></inline-formula> is the optimal scheme of multiple hyperparameters, <inline-formula><mml:math id="M119" display="inline"><mml:mi mathvariant="bold-italic">x</mml:mi></mml:math></inline-formula> is the decision vector composed of <inline-formula><mml:math id="M120" display="inline"><mml:mi>d</mml:mi></mml:math></inline-formula>-dimensional hyperparameters, <inline-formula><mml:math id="M121" display="inline"><mml:mi mathvariant="bold-italic">X</mml:mi></mml:math></inline-formula> is the search space that consists of all possible decision vectors, <inline-formula><mml:math id="M122" display="inline"><mml:mrow><mml:mi>f</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">x</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the unknown objective function.</p>
</sec>
<sec id="Ch1.S2.SS2.SSS3">
  <label>2.2.3</label><title>Physics-constrained optimization scheme</title>
      <p id="d2e1943">The normalized vertical profiles of PM<sub>2.5</sub> chemical components generated by the deep learning module are denormalized by the statistical characteristics of the initial input data of the surface-level observations. To reduce the retrieval error induced by the inherent extrapolation limitations of deep learning modules, a physics-constrained optimization scheme is incorporated into the retrieval framework based on a revised Interagency Monitoring of Projected Visual Environment (IMPROVE) Equation (Pitchford et al., 2007) and Non-dominated Sorting Genetic Algorithm II (NSGA-II) (Verma et al., 2021).</p>
      <p id="d2e1955">The revised IMPROVE Equation interprets the particle extinction coefficient (<inline-formula><mml:math id="M124" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula>) through the concentrations (<inline-formula><mml:math id="M125" display="inline"><mml:mi>M</mml:mi></mml:math></inline-formula>) and the optical and microphysical characteristics of PM<sub>2.5</sub> chemical components (Eq. 7).

              <disp-formula id="Ch1.E7" content-type="numbered"><label>7</label><mml:math id="M127" display="block"><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>(</mml:mo><mml:mi>M</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi>s</mml:mi><mml:mi mathvariant="normal">SNA</mml:mi></mml:msubsup><mml:mi>f</mml:mi><mml:mfenced close=")" open="("><mml:mi mathvariant="normal">RH</mml:mi></mml:mfenced><mml:mo>(</mml:mo><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="normal">SO</mml:mi><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="normal">NO</mml:mi><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="normal">NH</mml:mi><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">OC</mml:mi></mml:msubsup><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="normal">OC</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">FS</mml:mi></mml:msubsup><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="normal">Fine</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">Soil</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">s</mml:mi><mml:mrow><mml:mi>C</mml:mi><mml:mi>M</mml:mi></mml:mrow></mml:msubsup><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="normal">Coarse</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">Mass</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">FSS</mml:mi></mml:msubsup><mml:msub><mml:mi>f</mml:mi><mml:mi mathvariant="normal">FSS</mml:mi></mml:msub><mml:mfenced open="(" close=")"><mml:mi mathvariant="normal">RH</mml:mi></mml:mfenced><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="normal">Fine</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">Sea</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">Salt</mml:mi><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">a</mml:mi><mml:mi mathvariant="normal">BC</mml:mi></mml:msubsup><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="normal">BC</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mi mathvariant="normal">Rayleigh</mml:mi><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">Scattering</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>

            where <inline-formula><mml:math id="M128" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>(</mml:mo><mml:mi>M</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the estimated particle extinction coefficient (km<sup>−1</sup>), <inline-formula><mml:math id="M130" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">s</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the scattering efficiency (m<sup>2</sup> mg<sup>−1</sup>), <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">a</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the mass absorption efficiency (m<sup>2</sup> mg<sup>−1</sup>), respectively. <inline-formula><mml:math id="M136" display="inline"><mml:mrow><mml:mi>f</mml:mi><mml:mfenced open="(" close=")"><mml:mi mathvariant="normal">RH</mml:mi></mml:mfenced></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi mathvariant="normal">FSS</mml:mi></mml:msub><mml:mfenced close=")" open="("><mml:mi mathvariant="normal">RH</mml:mi></mml:mfenced></mml:mrow></mml:math></inline-formula> account for the increase in light scattering induced by hygroscopic growth of sulfate, nitrate and ammonium (SNA), as well as fine sea salt (FSS). <inline-formula><mml:math id="M138" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">FS</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M139" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">CM</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M140" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">FSS</mml:mi></mml:msubsup><mml:msub><mml:mi>f</mml:mi><mml:mi mathvariant="normal">FSS</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">a</mml:mi><mml:mi mathvariant="normal">BC</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> are set to 0.001 m<sup>2</sup> mg<sup>−1</sup>, 0.0006, 0.0017 and 0.01 m<sup>2</sup>,mg<sup>−1</sup>, respectively. <inline-formula><mml:math id="M146" display="inline"><mml:mi>M</mml:mi></mml:math></inline-formula> are the mass concentrations (<inline-formula><mml:math id="M147" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) of the PM<sub>2.5</sub> chemical components. Rayleigh  Scattering is set to 0.01 km<sup>−1</sup>. <inline-formula><mml:math id="M151" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">SNA</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M152" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">OC</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> are determined by Eqs. (8)–(9).

                  <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M153" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E8"><mml:mtd><mml:mtext>8</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">SNA</mml:mi></mml:msubsup></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.003</mml:mn><mml:mo>×</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0.7</mml:mn><mml:mo>+</mml:mo><mml:mn mathvariant="normal">0.002</mml:mn><mml:mo>×</mml:mo><mml:mo>(</mml:mo><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="normal">SO</mml:mi><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="normal">NO</mml:mi><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="normal">NH</mml:mi><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="normal">OC</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E9"><mml:mtd><mml:mtext>9</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">θ</mml:mi><mml:mi mathvariant="normal">s</mml:mi><mml:mi mathvariant="normal">OC</mml:mi></mml:msubsup></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.00363</mml:mn><mml:mo>×</mml:mo><mml:mo>(</mml:mo><mml:mn mathvariant="normal">0.7</mml:mn><mml:mo>+</mml:mo><mml:mn mathvariant="normal">0.002</mml:mn><mml:mo>×</mml:mo><mml:mo>(</mml:mo><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="normal">SO</mml:mi><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="normal">NO</mml:mi><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:msubsup><mml:mi mathvariant="normal">NH</mml:mi><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup><mml:mo>)</mml:mo><mml:mo>+</mml:mo><mml:mi>M</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="normal">OC</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            To implement the physics-constrained optimization, we first introduce a scale factor (<inline-formula><mml:math id="M154" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>h</mml:mi></mml:mrow></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> for each chemical component at each vertical layer, which is used to correct the initial mass concentrations (Eq. 10). Then we determine the optimal scale factors through minimizing a multi-objective function (Eq. 11). The Pearson correlation coefficient (CORR) and root mean square error (RMSE) quantified by the lidar-observed and the IMPROVE-simulated extinction coefficient serve as two objective values in the multi-objective function. The NSGA-II algorithm is utilized to determine the optimal scale factors by solving the multi-objective function that simultaneously enhances the correlation and reduces the discrepancy between the IMPROVE-estimated and lidar-observed extinction coefficients.

                  <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M155" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E10"><mml:mtd><mml:mtext>10</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:msubsup><mml:mi>M</mml:mi><mml:mi mathvariant="normal">regulated</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>h</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>h</mml:mi></mml:mrow></mml:msub><mml:mo>×</mml:mo><mml:msubsup><mml:mi>M</mml:mi><mml:mi mathvariant="normal">original</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>h</mml:mi></mml:mrow></mml:msubsup><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:msubsup><mml:mi mathvariant="normal">SO</mml:mi><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup><mml:msubsup><mml:mi mathvariant="normal">NO</mml:mi><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msubsup><mml:mi mathvariant="normal">NH</mml:mi><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mi mathvariant="normal">OM</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">and</mml:mi><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi mathvariant="normal">BC</mml:mi><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E11"><mml:mtd><mml:mtext>11</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>h</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mi mathvariant="normal">min</mml:mi><mml:mo>(</mml:mo><mml:msub><mml:mi>f</mml:mi><mml:mi mathvariant="normal">RMSE</mml:mi></mml:msub><mml:mfenced open="(" close=")"><mml:mi mathvariant="italic">γ</mml:mi></mml:mfenced><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:msub><mml:mi>f</mml:mi><mml:mi mathvariant="normal">CORR</mml:mi></mml:msub><mml:mfenced open="(" close=")"><mml:mi mathvariant="italic">γ</mml:mi></mml:mfenced><mml:mo>)</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            where <inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:msubsup><mml:mi>M</mml:mi><mml:mi mathvariant="normal">regulated</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>h</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> (<inline-formula><mml:math id="M157" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) is the regulated mass concentration of the <inline-formula><mml:math id="M159" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th chemical component at an altitude of <inline-formula><mml:math id="M160" display="inline"><mml:mi>h</mml:mi></mml:math></inline-formula> (m), <inline-formula><mml:math id="M161" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">γ</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is the scale factor for the <inline-formula><mml:math id="M162" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th chemical component at an altitude of <inline-formula><mml:math id="M163" display="inline"><mml:mi>h</mml:mi></mml:math></inline-formula> (m), and <inline-formula><mml:math id="M164" display="inline"><mml:mrow><mml:msubsup><mml:mi>M</mml:mi><mml:mi mathvariant="normal">original</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>,</mml:mo><mml:mspace width="0.125em" linebreak="nobreak"/><mml:mi>h</mml:mi></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> (<inline-formula><mml:math id="M165" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<inline-formula><mml:math id="M166" display="inline"><mml:mrow><mml:msup><mml:mi/><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the original mass concentration of the <inline-formula><mml:math id="M167" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th chemical component at an altitude of <inline-formula><mml:math id="M168" display="inline"><mml:mi>h</mml:mi></mml:math></inline-formula> (m). <inline-formula><mml:math id="M169" display="inline"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi mathvariant="normal">RMSE</mml:mi></mml:msub><mml:mfenced open="(" close=")"><mml:mi mathvariant="italic">γ</mml:mi></mml:mfenced></mml:mrow></mml:math></inline-formula> is the RMSE-based objective function (Eq. 12) and <inline-formula><mml:math id="M170" display="inline"><mml:mrow><mml:msub><mml:mi>f</mml:mi><mml:mi mathvariant="normal">CORR</mml:mi></mml:msub><mml:mfenced open="(" close=")"><mml:mi mathvariant="italic">γ</mml:mi></mml:mfenced></mml:mrow></mml:math></inline-formula> is the CORR-based objective function (Eq. 13).

                  <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M171" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E12"><mml:mtd><mml:mtext>12</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>f</mml:mi><mml:mi mathvariant="normal">RMSE</mml:mi></mml:msub><mml:mfenced open="(" close=")"><mml:mi mathvariant="italic">γ</mml:mi></mml:mfenced><mml:mo>=</mml:mo><mml:msqrt><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:msubsup><mml:msup><mml:mfenced close=")" open="("><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>k</mml:mi><mml:mi mathvariant="normal">obs</mml:mi></mml:msubsup><mml:mo>-</mml:mo><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="italic">γ</mml:mi><mml:mo>×</mml:mo><mml:mi>M</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:mfenced><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mi>K</mml:mi></mml:mfrac></mml:mstyle></mml:msqrt><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E13"><mml:mtd><mml:mtext>13</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi>f</mml:mi><mml:mi mathvariant="normal">CORR</mml:mi></mml:msub><mml:mfenced close=")" open="("><mml:mi mathvariant="italic">γ</mml:mi></mml:mfenced><mml:mo>=</mml:mo><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:msubsup><mml:mfenced close=")" open="("><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="italic">γ</mml:mi><mml:mo>×</mml:mo><mml:mi>M</mml:mi><mml:mo>)</mml:mo><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mrow><mml:mi mathvariant="bold-italic">σ</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">γ</mml:mi><mml:mo>×</mml:mo><mml:mi>M</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi mathvariant="normal">SD</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">σ</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">γ</mml:mi><mml:mo>×</mml:mo><mml:mi>M</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:mfenced><mml:mfenced open="(" close=")"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>k</mml:mi><mml:mi mathvariant="normal">obs</mml:mi></mml:msubsup><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mrow><mml:msup><mml:mi mathvariant="bold-italic">σ</mml:mi><mml:mi mathvariant="normal">obs</mml:mi></mml:msup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:mrow><mml:mrow><mml:mi mathvariant="normal">SD</mml:mi><mml:mo>(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">σ</mml:mi><mml:mi mathvariant="normal">obs</mml:mi></mml:msup><mml:mo>)</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:mfenced></mml:mrow><mml:mrow><mml:mi>K</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            where <inline-formula><mml:math id="M172" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> is the total number of samples, <inline-formula><mml:math id="M173" display="inline"><mml:mrow><mml:msubsup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>k</mml:mi><mml:mi mathvariant="normal">obs</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> is the <inline-formula><mml:math id="M174" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>th observed extinction coefficient, <inline-formula><mml:math id="M175" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">σ</mml:mi><mml:mi>k</mml:mi></mml:msub><mml:mo>(</mml:mo><mml:mi mathvariant="italic">γ</mml:mi><mml:mo>×</mml:mo><mml:mi>M</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the <inline-formula><mml:math id="M176" display="inline"><mml:mi mathvariant="normal">k</mml:mi></mml:math></inline-formula>th simulated extinction coefficient, <inline-formula><mml:math id="M177" display="inline"><mml:mover accent="true"><mml:mrow><mml:mi mathvariant="bold-italic">σ</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">γ</mml:mi><mml:mo>×</mml:mo><mml:mi>M</mml:mi><mml:mo>)</mml:mo></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:math></inline-formula> is the average of simulated extinction coefficient, <inline-formula><mml:math id="M178" display="inline"><mml:mover accent="true"><mml:mrow><mml:msup><mml:mi mathvariant="italic">σ</mml:mi><mml:mi mathvariant="normal">obs</mml:mi></mml:msup></mml:mrow><mml:mo mathvariant="normal">‾</mml:mo></mml:mover></mml:math></inline-formula> is the average of observed extinction coefficient, SD<inline-formula><mml:math id="M179" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:mi mathvariant="bold-italic">σ</mml:mi><mml:mo>(</mml:mo><mml:mi mathvariant="italic">γ</mml:mi><mml:mo>×</mml:mo><mml:mi>M</mml:mi><mml:mo>)</mml:mo><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> is the standard deviation of simulated extinction coefficient, and SD<inline-formula><mml:math id="M180" display="inline"><mml:mrow><mml:mo>(</mml:mo><mml:msup><mml:mi mathvariant="bold-italic">σ</mml:mi><mml:mi mathvariant="normal">obs</mml:mi></mml:msup></mml:mrow></mml:math></inline-formula>) is the standard deviation of observed extinction coefficient.</p>
      <p id="d2e3396">NSGA is capable of simultaneously optimizing the multi-objective function by generating a Pareto front that consists of an ensemble of non-dominated solutions (Srinivas and Deb, 1994). The non-dominated solutions in a Pareto front meet the criterion that one objective cannot be further improved without compromising other objectives. However, the initial version of NSGA has several limitations. First, NSGA has a high computational complexity of <inline-formula><mml:math id="M181" display="inline"><mml:mrow><mml:mi>O</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>M</mml:mi><mml:msup><mml:mi>N</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula>, where <inline-formula><mml:math id="M182" display="inline"><mml:mi>M</mml:mi></mml:math></inline-formula> is the number of objective functions, and <inline-formula><mml:math id="M183" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> is the size of the population. Second, NSGA utilizes a sharing parameter to preserve the diversity of the population that dominates the choice of Pareto non-dominated solutions, resulting in the introduction of parameter uncertainty into the algorithm. Third, NSGA lacks an elitism mechanism, leading to the incorrect removal of advantageous solutions. NSGA-II is an improved NSGA with a lower computational complexity of <inline-formula><mml:math id="M184" display="inline"><mml:mrow><mml:mi>O</mml:mi><mml:mfenced open="(" close=")"><mml:mrow><mml:mi>M</mml:mi><mml:msup><mml:mi>N</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfenced></mml:mrow></mml:math></inline-formula> and an elitism mechanism that retains the dominant members of the parent and offspring generations during iterative evolution (Deb et al., 2002). Moreover, NSGA-II replaces the sharing parameters in NSGA with the crowding distance operator, mitigating the uncertainty of sharing parameters and the high computational complexity of sharing functions.</p>
      <p id="d2e3447">NSGA-II implements multi-objective optimization by two primary procedures, namely non-dominated sorting and crowding distance calculation. The non-dominated sorting progressively identifies the Pareto front at each rank from a population of size <inline-formula><mml:math id="M185" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>. The Pareto front at the second rank is derived from a population that excludes the Pareto front at the first rank. The crowding distance is utilized to quantify the priority of all optimal solutions within a Pareto front, defined as the normalized distance of two nearest optimal solutions on either side (Eq. 14).

              <disp-formula id="Ch1.E14" content-type="numbered"><label>14</label><mml:math id="M186" display="block"><mml:mrow><mml:msub><mml:mi>d</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>K</mml:mi></mml:msubsup><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msubsup></mml:mrow><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msubsup><mml:mo>-</mml:mo><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mi mathvariant="normal">min</mml:mi></mml:msubsup></mml:mrow></mml:mfrac></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:math></disp-formula>

            where <inline-formula><mml:math id="M187" display="inline"><mml:mrow><mml:msub><mml:mi>d</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the crowding distance of the <inline-formula><mml:math id="M188" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th intermediate Pareto optimal solution, <inline-formula><mml:math id="M189" display="inline"><mml:mi>K</mml:mi></mml:math></inline-formula> is the number of Pareto optimal solutions in a Pareto front, <inline-formula><mml:math id="M190" display="inline"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> is the <inline-formula><mml:math id="M191" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula>th objective value induced by the (<inline-formula><mml:math id="M192" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>)th Pareto optimal solution, <inline-formula><mml:math id="M193" display="inline"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> is the <inline-formula><mml:math id="M194" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula>th objective value induced by the (<inline-formula><mml:math id="M195" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>)th Pareto optimal solution, <inline-formula><mml:math id="M196" display="inline"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mi mathvariant="normal">max</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M197" display="inline"><mml:mrow><mml:msubsup><mml:mi>f</mml:mi><mml:mi>m</mml:mi><mml:mi mathvariant="normal">min</mml:mi></mml:msubsup></mml:mrow></mml:math></inline-formula> are the <inline-formula><mml:math id="M198" display="inline"><mml:mi>m</mml:mi></mml:math></inline-formula>th maximum and minimum objective values, respectively.</p>

      <fig id="F2" specific-use="star"><label>Figure 2</label><caption><p id="d2e3667">Brief workflow of NSGA-II (A: the parent population; B: the offspring population; C: the new population; P: the Pareto front).</p></caption>
            <graphic xlink:href="https://amt.copernicus.org/articles/19/2225/2026/amt-19-2225-2026-f02.png"/>

          </fig>

      <p id="d2e3676">The workflow of NSGA-II is summarized as follows (Fig. 2). <list list-type="custom"><list-item><label>a.</label>
      <p id="d2e3681">Randomly generating an initial population (<inline-formula><mml:math id="M199" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) of size <inline-formula><mml:math id="M200" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>. Performing selection, crossover and mutation operations on <inline-formula><mml:math id="M201" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> to generate an offspring population (<inline-formula><mml:math id="M202" display="inline"><mml:mrow><mml:msub><mml:mi>B</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) of size <inline-formula><mml:math id="M203" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>. The parent population (<inline-formula><mml:math id="M204" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) and the offspring population (<inline-formula><mml:math id="M205" display="inline"><mml:mrow><mml:msub><mml:mi>B</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) are combined to form a new population (<inline-formula><mml:math id="M206" display="inline"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) of size <inline-formula><mml:math id="M207" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula>.</p></list-item><list-item><label>b.</label>
      <p id="d2e3776">Performing a rapid non-dominated sorting on <inline-formula><mml:math id="M208" display="inline"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> to generate the Pareto fronts (<inline-formula><mml:math id="M209" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M210" display="inline"><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mn mathvariant="normal">2</mml:mn><mml:mo>,</mml:mo><mml:mspace linebreak="nobreak" width="0.125em"/><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mi>n</mml:mi></mml:mrow></mml:math></inline-formula>) at different ranks.</p></list-item><list-item><label>c.</label>
      <p id="d2e3828">Filling the next population (<inline-formula><mml:math id="M211" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) of size <inline-formula><mml:math id="M212" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula> with <inline-formula><mml:math id="M213" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> based on the rank order.</p></list-item><list-item><label>d.</label>
      <p id="d2e3861">When <inline-formula><mml:math id="M214" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> is filled to the point of insufficient capacity to contain the entire <inline-formula><mml:math id="M215" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula>, the optimal solutions in <inline-formula><mml:math id="M216" display="inline"><mml:mrow><mml:msub><mml:mi>P</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> are inserted into <inline-formula><mml:math id="M217" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> in a priority order identified by the non-dominated sorting and crowding distance until the size of <inline-formula><mml:math id="M218" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> reaches <inline-formula><mml:math id="M219" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>.</p></list-item><list-item><label>e.</label>
      <p id="d2e3928">Performing selection, crossover and mutation operations on <inline-formula><mml:math id="M220" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula> to generate an offspring population (<inline-formula><mml:math id="M221" display="inline"><mml:mrow><mml:msub><mml:mi>B</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) of size <inline-formula><mml:math id="M222" display="inline"><mml:mi>N</mml:mi></mml:math></inline-formula>. The parent population (<inline-formula><mml:math id="M223" display="inline"><mml:mrow><mml:msub><mml:mi>A</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) and offspring population (<inline-formula><mml:math id="M224" display="inline"><mml:mrow><mml:msub><mml:mi>B</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) combine to form a new population (<inline-formula><mml:math id="M225" display="inline"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>) of size <inline-formula><mml:math id="M226" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mi>N</mml:mi></mml:mrow></mml:math></inline-formula>.</p></list-item><list-item><label>f.</label>
      <p id="d2e4005">Iterating steps (b) to (e) until the convergence criteria are satisfied.</p></list-item></list></p>
</sec>
<sec id="Ch1.S2.SS2.SSS4">
  <label>2.2.4</label><title>Framework training and evaluation</title>
      <p id="d2e4016">An hourly multivariate dataset with extensive temporal coverage was employed to train and evaluate the deep learning module. To maintain temporal independence, the training (and validation) set was constructed from a 1-year (2021) time-series dataset obtained from a Beijing site (Fig. S1), while the testing set contains an independent 6-month (1 January–31 March and 1 June to 31 August 2022) time-series dataset obtained from the same site. A 10-fold time-series cross-validation (CV) scheme was designed for the training (and validation) set to preserve its temporal order and prevent future information leakage, which is detailed in Sect. S3 and Fig. S2 in the Supplement. The iteration number of Bayesian optimization is set to 20.</p>
      <p id="d2e4019">To fully evaluate the performance of the retrieval framework in predicting vertical profiles of NH<inline-formula><mml:math id="M227" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M228" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M229" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC, we conduct three retrieval experiments: (1) We compare the retrieved mass concentrations with the observed values at the surface level during a training year (2021) and three non-training years (2017, 2018 and 2024) to validate the temporal generalization in all seasons and under diverse meteorological conditions. (2) We assess the spatial generalization ability by applying the retrieval framework to 23 non-training lidar sites in the NCP from 8–15  February 2021 and comparing the retrieved mass concentrations with observations at the surface level. The spatial distribution of the 23 non-training lidar sites is presented in Fig. S1. (3) We validate the retrieved vertical profiles by aircraft-based and tower-based vertical observations during several non-training episodes. Subsequently, SHapley Additive exPlanations (SHAP), a local explainable technology (Lundberg et al., 2020), has been widely employed in prediction interpretation for varying machine learning models (Li et al., 2025b; Hou et al., 2022), is integrated into the deep learning module to quantify the impact of multivariate input features on the retrieval of PM<sub>2.5</sub> chemical components. Finally, we applied this retrieval framework to generate a long-term vertical profile dataset for five PM<sub>2.5</sub> chemical components in a megacity over six years of 2017–2018 and 2021–2024.</p>

      <fig id="F3" specific-use="star"><label>Figure 3</label><caption><p id="d2e4081">Scatterplots of the simulations (<inline-formula><mml:math id="M232" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) versus the observations (<inline-formula><mml:math id="M234" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) with probability density (%) for NH<inline-formula><mml:math id="M236" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M237" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M238" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC during the 10-fold cross-validation process <bold>(a1–a5)</bold> and temporally independent testing process <bold>(b1–b5)</bold>. The dotted grey lines represent the <inline-formula><mml:math id="M239" display="inline"><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, <inline-formula><mml:math id="M240" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula>, and <inline-formula><mml:math id="M241" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:mrow></mml:math></inline-formula> lines, and the solid red line represents the fitted regression line. CORR represents the correlation coefficient, and RMSE represents root mean square error.</p></caption>
            <graphic xlink:href="https://amt.copernicus.org/articles/19/2225/2026/amt-19-2225-2026-f03.png"/>

          </fig>

</sec>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Results and discussion</title>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Validation</title>
<sec id="Ch1.S3.SS1.SSS1">
  <label>3.1.1</label><title>Evaluation of the deep learning module performance</title>
      <p id="d2e4237">The 10-fold CV sets and a testing set with temporal independence are utilized to evaluate the predictive performance of the deep learning module, which is quantified by the discrepancies between simulations and observations at ground level for NH<inline-formula><mml:math id="M242" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M243" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M244" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC. Overall, the scatter distribution and fitted regression line closely align with the <inline-formula><mml:math id="M245" display="inline"><mml:mrow><mml:mn mathvariant="normal">1</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> line in both the 10-fold CV (Fig. 3a1–a5) and temporally independent testing phases (Fig. 3b1–b5). The error distributions are concentrated around 0, with mean errors between <inline-formula><mml:math id="M246" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1.78</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">8.15</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M247" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.13</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">0.94</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M248" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> during the 10-fold CV phase (Fig. S3a1–a5) and between <inline-formula><mml:math id="M250" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">1.36</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">7.40</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M251" display="inline"><mml:mrow><mml:mo>-</mml:mo><mml:mn mathvariant="normal">0.07</mml:mn><mml:mo>±</mml:mo><mml:mn mathvariant="normal">1.00</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M252" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> during the temporally independent testing phase (Fig. S3b1–b5), demonstrating strong consistency between observations and simulations. Notably, the error distributions for the validation and independent testing sets are closely aligned, indicating that the deep learning module is robust and generalizes well to unseen data. Specifically for the 10-fold CV process (Fig. 3a1–a5), the CORR values for the five PM<sub>2.5</sub> chemical components range from 0.76 to 0.86, indicating that the deep learning module accurately interprets the relationship between multivariate input features and the five PM<sub>2.5</sub> chemical components. The RMSE values range from 0.95 to 8.35 <inline-formula><mml:math id="M256" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>, indicating a low discrepancy between simulations and observations. Compared to the 10-fold CV process, the temporally independent testing yields slightly lower CORR values (0.69–0.79) and higher RMSE values (1.00–8.87 <inline-formula><mml:math id="M258" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>), showing a slight underestimation for the five PM<sub>2.5</sub> chemical components (Fig. 3b1–b5). It is expected that the statistical results from the temporally independent testing are less robust than those from the 10-fold CV, since the temporally independent testing set aggregates a broader spectrum of temporal patterns compared to the validation set at each fold. Our statistical results from the 10-fold CV exhibit similarities or even improvements compared to those reported in other studies that predicting PM<sub>2.5</sub> chemical component concentrations based on machine learning models (Lv et al., 2021; Lin et al., 2022; Araki et al., 2022; Liu et al., 2023), indicating that the deep learning module demonstrates strong prediction capabilities.</p>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e4468">Weekly-smoothed variations in the retrieved and observed concentrations (<inline-formula><mml:math id="M262" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) of NH<inline-formula><mml:math id="M264" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> <bold>(a1)</bold>, NO<inline-formula><mml:math id="M265" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>
<bold>(a2)</bold>, SO<inline-formula><mml:math id="M266" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> <bold>(a3)</bold>, OM <bold>(a4)</bold> and BC <bold>(a5)</bold> in 2021. <bold>(b)</bold> same as <bold>(a1)</bold>–<bold>(a5)</bold> but for PM<sub>2.5</sub> in 2017. <bold>(c)</bold> same as <bold>(a1)</bold>–<bold>(a5)</bold> but for PM<sub>2.5</sub> in 2018. <bold>(d)</bold> same as <bold>(a1)</bold>–<bold>(a5)</bold> but for PM<sub>2.5</sub> in 2024. CORR represents the correlation coefficient, RMSE represents root mean square error.</p></caption>
            <graphic xlink:href="https://amt.copernicus.org/articles/19/2225/2026/amt-19-2225-2026-f04.png"/>

          </fig>

</sec>
<sec id="Ch1.S3.SS1.SSS2">
  <label>3.1.2</label><title>Comparison with ground-level observations</title>
      <p id="d2e4616">The retrieval framework was applied to retrieve the vertical profiles of NH<inline-formula><mml:math id="M270" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M271" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M272" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC in a Beijing lidar site (39.98° N, 116.38° E) over a training year (2021) and three non-training years (2017, 2018 and 2024). As illustrated in Fig. 4a1–a5, the weekly-smoothed variations in the retrieved surface concentrations of the five PM<sub>2.5</sub> chemical components demonstrate strong consistency with the observed surface concentrations for the training year, indicating that the retrieval framework adequately captures the temporal characteristics of these chemical components. The CORR values between the retrieved and observed concentrations range from 0.87 to 0.97, surpassing those of the deep learning module (Figs. 4a1–a5 and 3b1–b5). In addition, the RMSE values for all five chemical components (0.57–4.98<inline-formula><mml:math id="M274" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) are consistently lower than those from the deep-learning module (Figs. 4a1–a5 and 3b1–b5). These results demonstrate that the physics-constrained optimization effectively enhances the retrieval accuracy of chemical component concentrations.</p>
      <p id="d2e4688">For the non-training years, the retrieved surface concentrations of a sum of five PM<sub>2.5</sub> chemical components are compared to the observed surface PM<sub>2.5</sub> concentrations, owing to the absence of long-term observations for individual chemical components. As shown in Fig. 4b–d, the weekly-smoothed variations in the retrieved surface PM<sub>2.5</sub> concentrations closely align with the observed values in 2017, 2018 and 2024. The high values of surface PM<sub>2.5</sub> concentration observed in March–April and November of 2018 and 2024 are effectively captured by the retrieval framework. These results indicate that the retrieval framework roughly interprets the changes in concentrations of various chemical components across different periods, exhibiting fundamental temporal generalization capabilities. However, the retrieved concentrations show some overestimation cases during autumn in 2018 and spring in 2024, potentially associated with the uncertainties induced by the training data. The training data may lack a sufficiently diverse spectrum of meteorological conditions and pollution patterns, which limits the temporal generalizability of the retrieval framework across all complex and dynamic atmospheric scenarios. Future efforts should enhance retrieval accuracy by augmenting the training data with observations spanning a wider range of temporal conditions.</p>

      <fig id="F5" specific-use="star"><label>Figure 5</label><caption><p id="d2e4729">Data distribution properties of retrieved and observed surface mass concentration (<inline-formula><mml:math id="M280" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) of NH<inline-formula><mml:math id="M282" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M283" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M284" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC at 23 non-training NCP lidar sites over a period of 8–15  February 2021, presented by a combination of boxplots and kernel density <bold>(a)</bold>. Spatial distribution of Pearson correlation coefficient (CORR) between retrieved and observed surface mass concentration of NH<inline-formula><mml:math id="M285" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> <bold>(b1)</bold>, NO<inline-formula><mml:math id="M286" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> <bold>(b2)</bold>, SO<inline-formula><mml:math id="M287" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> <bold>(b3)</bold>, OM <bold>(b4)</bold> and BC <bold>(b5)</bold>. <bold>(c1)</bold>–<bold>(c5)</bold> Same as <bold>(b1)</bold>–<bold>(b5)</bold> but for root mean square error (RMSE, <inline-formula><mml:math id="M288" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>). The geographic basemap is hosted by Esri <inline-formula><mml:math id="M290" display="inline"><mml:mo>|</mml:mo></mml:math></inline-formula> Powered by Esri (<uri>https://www.esri.com/en-us/home</uri>, last access: 8 January 2026).</p></caption>
            <graphic xlink:href="https://amt.copernicus.org/articles/19/2225/2026/amt-19-2225-2026-f05.jpg"/>

          </fig>

      <p id="d2e4900">The retrieval framework was also applied to retrieve the vertical profiles of the five PM<sub>2.5</sub> chemical components at 23 non-training NCP lidar sites over a short-term period of 8–15 February  2021, aiming to validate its spatial generalization capabilities. Compared with the observed surface concentrations at 23 non-training sites, the retrieved surface concentrations exhibit a more clustered data distribution and exhibit a tendency toward underestimation across all components (Fig. 5a). The site-averaged CORR values for the five chemical components range from 0.21 to 0.46, with RMSE values spanning 2.7 to 20.37 <inline-formula><mml:math id="M292" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> (Fig. S4). From a spatial perspective (Fig. 5b1–b5), non-training NCP sites located closer to the Beijing lidar site exhibit higher CORR values, with the highest reaching 0.71 (NH<inline-formula><mml:math id="M294" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>), 0.56 (NO<inline-formula><mml:math id="M295" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>), 0.81 (SO<inline-formula><mml:math id="M296" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>), 0.48 (OM) and 0.41 (BC). Conversely, the RMSE values are not affected by the distance from the Beijing lidar site (Fig. 5c1–c5), with the lowest reaching 2.91 <inline-formula><mml:math id="M297" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> (NH<inline-formula><mml:math id="M299" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>), 6.15 <inline-formula><mml:math id="M300" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> (NO<inline-formula><mml:math id="M302" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>), 3.05 <inline-formula><mml:math id="M303" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> (SO<inline-formula><mml:math id="M305" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>, 6.59 <inline-formula><mml:math id="M306" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> (OM) and 0.78 <inline-formula><mml:math id="M308" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> (BC). However, several sites exhibit poor retrieval performance, with CORR values ranging from <inline-formula><mml:math id="M310" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">0.20</mml:mn></mml:mrow></mml:math></inline-formula> to <inline-formula><mml:math id="M311" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">0.30</mml:mn></mml:mrow></mml:math></inline-formula> (Fig. S5), which is primarily attributed to limitations in the spatial representativeness of the training data. The deep-learning module was trained exclusively on a long-term dataset from a single site in Beijing, which is insufficient to capture the spatial variability in emission intensity, as well as local meteorological and geographical conditions across the broader NCP. As a result, the spatial extrapolation capability of the deep-learning module is constrained. Although the retrieval framework can retrieve PM<sub>2.5</sub> chemical component concentrations at spatially distributed lidar sites, future work should incorporate long-term datasets from varying locations to enhance spatial generalization and extrapolation performance.</p>

      <fig id="F6" specific-use="star"><label>Figure 6</label><caption><p id="d2e5146">Vertical profiles (<inline-formula><mml:math id="M313" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) of NH<inline-formula><mml:math id="M315" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M316" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M317" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, and OM from retrieval <bold>(a1)</bold> and tower-based observation <bold>(a2)</bold> during a period from 30  December 2018 to 2 January  2019 in Beijing. The line represents the daily average of the hourly vertical profiles, and the shaded area represents the standard deviation. Averaged proportions of NH<inline-formula><mml:math id="M318" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M319" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M320" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM, and BC from retrieval <bold>(b1)</bold> and tower-based observation <bold>(b2)</bold> for 10 d (27 and 30 December 2023; 2, 5, 9, 12, 15, 18, 24, and 27 January 2024). <bold>(c1)</bold> and <bold>(c2)</bold> Same as <bold>(b1)</bold> and <bold>(b2)</bold> but for aircraft-based verification for 3 d (26 September, 10 October, 11 December 2024).</p></caption>
            <graphic xlink:href="https://amt.copernicus.org/articles/19/2225/2026/amt-19-2225-2026-f06.png"/>

          </fig>

</sec>
<sec id="Ch1.S3.SS1.SSS3">
  <label>3.1.3</label><title>Verification of retrieved vertical profiles</title>
      <p id="d2e5287">In addition to the spatiotemporal verification of surface-level mass concentrations, tower-based and aircraft-based observational experiments were conducted to validate the retrieved vertical profiles of five PM<sub>2.5</sub> chemical components during non-training periods. From the surface to <inline-formula><mml:math id="M322" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">200</mml:mn></mml:mrow></mml:math></inline-formula> m altitude, the retrieved and observed vertical profiles exhibit similar vertical patterns during a period from 30 December 2018 to 2 January  2019 in Beijing, with higher concentrations occurring at altitudes of 50–80 m for NH<inline-formula><mml:math id="M323" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M324" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M325" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> and OM (Fig. 6a1, a2). Specifically, as presented in Table S3, the CORR values are no less than 0.66 for all four PM<sub>2.5</sub> chemical components. However, the RMSE value for OM (23.04 <inline-formula><mml:math id="M327" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) is notably higher than that for the other components (4.08–10.48 <inline-formula><mml:math id="M329" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>), indicating limitations in the retrieval framework when representing the vertical profile of OM during winter pollution episodes. This discrepancy may be associated with retrieval uncertainties arising from input data quality and imposed physical constraints. Additionally, the retrieved and observed proportions of NH<inline-formula><mml:math id="M331" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M332" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M333" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC demonstrate significant consistency (Fig. 6b1, b2). Among these chemical components, NO<inline-formula><mml:math id="M334" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> and OM contribute the largest proportions, followed by NH<inline-formula><mml:math id="M335" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> and SO<inline-formula><mml:math id="M336" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, while BC contributes the smallest fraction. This proportional characteristic is evident in both the retrieved and observed proportions at altitudes of 600  and 1200 m (Fig. 6c1, c2). Due to the lack of NH<inline-formula><mml:math id="M337" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> measurements at 1500 m and the absence of both NH<inline-formula><mml:math id="M338" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> and SO<inline-formula><mml:math id="M339" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> measurements at 2100 m, the proportions at these altitudes are statistically inferred from the remaining chemical components. The results indicate overall consistency between retrieved and observed proportions at altitudes of 1500  and 2100 m, although the proportion of NO<inline-formula><mml:math id="M340" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> is slightly overestimated at 2100 m and underestimated at 1500 m. Overall, the tower-based and aircraft-based verifications indicate that the retrieval framework achieves high accuracy in retrieving the vertical profiles of the five PM<sub>2.5</sub> chemical components during non-training period, demonstrating its robust generalization capability and reliability when applied to independent datasets.</p>

      <fig id="F7" specific-use="star"><label>Figure 7</label><caption><p id="d2e5540">Relative contribution of 8 input features on predictive NH<inline-formula><mml:math id="M342" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> <bold>(a1)</bold>, NO<inline-formula><mml:math id="M343" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> <bold>(a2)</bold>, SO<inline-formula><mml:math id="M344" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> <bold>(a3)</bold>, OM <bold>(a4)</bold> and BC <bold>(a5)</bold> at altitudes of 50, 766  and 1900 m. SHAP values with feature values of 8 input features for predictive NH<inline-formula><mml:math id="M345" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> <bold>(b1)</bold>, NO<inline-formula><mml:math id="M346" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> <bold>(b2)</bold>, SO<inline-formula><mml:math id="M347" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> <bold>(b3)</bold>, OM <bold>(b4)</bold> and BC <bold>(b5)</bold> at an altitude of 50 m. <bold>(c1)</bold>–<bold>(c5)</bold> Same as <bold>(b1)</bold>–<bold>(b5)</bold> but for an altitude of 766 m. <bold>(d1)</bold>–<bold>(d5)</bold> Same as <bold>(b1)</bold>–<bold>(b5)</bold> but for an altitude of 1900 m. F1: extinction coefficient at 532 nm, EXT; F2: Geopotential, GEOP; F3: Relative humidity, RH; F4: Specific humidity, SH; F5: Temperature, TEMP; F6: <inline-formula><mml:math id="M348" display="inline"><mml:mi>U</mml:mi></mml:math></inline-formula>-component wind, UW; F7: <inline-formula><mml:math id="M349" display="inline"><mml:mi>V</mml:mi></mml:math></inline-formula>-component wind; F8: Vertical velocity, VV.</p></caption>
            <graphic xlink:href="https://amt.copernicus.org/articles/19/2225/2026/amt-19-2225-2026-f07.png"/>

          </fig>

</sec>
</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Assessment of feature importance</title>
      <p id="d2e5708">The predictive performance of the deep learning module is intricately connected to the input features (Blum and Langley, 1997). Although the module incorporates the CNN and attention mechanism layer to mitigate issues related to feature dimension, the impact of input features on the module predictions remains ambiguous, which impedes module interpretability and restricts the capacity to enhance the module performance through effective feature selection. The SHAP method is employed to quantify the relative contributions of 8 input features to the predictions of the five PM<sub>2.5</sub> chemical components at various heights and to identify the impact of the input features on the decision-making processes of the deep learning module. The coexistence of a high feature value with a positive SHAP value in a specific feature implies an amplification of concentration prediction at elevated levels.</p>
      <p id="d2e5720">Figure 7a1–a5 depicts that the aerosol extinction coefficient at 532 nm (EXT), relative humidity (RH) and v-component wind (VW) are the significant input features for predicting the five PM<sub>2.5</sub> chemical components with an averaged relative contribution of 14.43 %, 15.84 % and 16.77 %. These features largely affect the vertical structure, chemical and physical processes, respectively. Specifically, EXT characterizes the vertical distribution of a total of the five PM<sub>2.5</sub> chemical components and plays a crucial indicative role in vertical profile predictions (Tao et al., 2016). RH is a key driving factor in aerosol hygroscopic growth, aqueous-phase chemical reactions, and heterogeneous reactions, significantly contributing to the mass concentrations of varying chemical components as reported in numerous studies (Fang et al., 2019; Wang et al., 2020; Gao et al., 2020; Liang et al., 2019). VW primarily affects latitudinal transboundary transport, which is a dynamic forcing in the southwest-northeast transport channel of the Beijing-Tianjin-Hebei (BTH) region (Yang et al., 2024). Notably, the relative contribution of EXT decreases with height from the surface (50 m) to the free atmosphere (1900 m), while the relative contribution of VW exhibits an opposite trend. The aerosol content in the upper planetary boundary layer is relatively low, and the weakened lidar aerosol signal is susceptible to interference from noise signals, restricting the indicative effect of EXT on chemical component concentrations. Conversely, pollution transport in the upper planetary boundary layer is less affected by interference from complex underlying surfaces than near-surface transport (Wu et al., 2019b), amplifying the driving effect of high-altitude VW on chemical component concentrations. Specific humidity (SH) and geopotential (GEOP) also provided important contributions (13.04 % and 12.85 %, respectively). SH is related to the vertical diffusion and wet scavenging of pollutants (Chatfield et al., 2020) and GEOP identifies the synoptic meteorological patterns that affect both horizontal process (Jia et al., 2022; Wang et al., 2021) and vertical distribution of pollutants within the boundary layer (Miao et al., 2022; Xu et al., 2019).</p>
      <p id="d2e5741">Figure 7b1–d5 further determines the impact of the input features on the decision-making processes of the deep learning module. From Fig. 7b1–b5, the elevated levels of EXT, GEOP, and VW significantly enhance the concentration predictions of the five PM<sub>2.5</sub> chemical components in the near-surface layer (50 m), while high-level RH exert either positive or negative effects on predictions. High RH not only facilitates aqueous-phase and heterogeneous chemical reactions, positively contributing to predictions, but also promotes aerosol coalescence, leading to dry and wet deposition that negatively contributes to predictions (Chen et al., 2020). The results in the middle of the boundary layer (766 m) are consistent with those observed in the near-surface layer (Fig. 7c1–c5). Particularly, the positive driving effect of lower VV values on predictions is more significant, with downward wind contributing positively to predictions, which is attributed to the fact that sinking airflows inhibit the dispersion of chemical components, thereby exacerbating aggregation and increasing concentration (Yang et al., 2022). The results in the free atmosphere (1900 m) align with those in the middle of the boundary layer (Fig. 7d1–d5). Notably, the influence of UW on predictions is more apparent, as the westerly wind positively contributes to the predictions, which is primarily due to the elevated emission sources located in the southwestern BTH region (Yang et al., 2024). Strong prevailing southwesterly winds at high altitudes enhance the regional transport of atmospheric pollutants, leading to an increase in concentration.</p>

      <fig id="F8" specific-use="star"><label>Figure 8</label><caption><p id="d2e5756">Vertical distribution of mass concentrations (<inline-formula><mml:math id="M354" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) for NH<inline-formula><mml:math id="M356" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M357" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M358" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC in spring (MAM, <bold>a1</bold>), summer (JJA, <bold>a2</bold>), autumn (SON, <bold>a3</bold>) and winter (DJF, <bold>a4</bold>) over six years (2017–2018, 2021–2024). Averaged vertical profiles of mass concentrations (<inline-formula><mml:math id="M359" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) for NH<inline-formula><mml:math id="M361" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M362" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M363" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC from 2017 to 2018 <bold>(b1)</bold>, from 2021 to 2022 <bold>(b2)</bold>, and from 2023 to 2024 <bold>(b3)</bold>. Annual change rates (<inline-formula><mml:math id="M364" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> a<sup>−1</sup>) of mass concentrations for NH<inline-formula><mml:math id="M367" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M368" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M369" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC at various altitudes from 2021 to 2024 <bold>(c)</bold>.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2225/2026/amt-19-2225-2026-f08.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Application of the retrieval framework</title>
      <p id="d2e5990">The retrieval framework was applied to generate a long-term dataset of vertical profiles for NH<inline-formula><mml:math id="M370" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M371" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M372" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC over six years (2017–2018, 2021–2024) at a Beijing lidar site. Figure 8 shows the averaged vertical profiles for the five PM<sub>2.5</sub> chemical components in spring (MAM) (Fig. 8a1), summer (JJA) (Fig. 8a2), autumn (SON) (Fig. 8a3) and winter (DJF) (Fig. 8a4) during the six years. OM mass concentrations are consistently the highest across all four seasons, followed by NO<inline-formula><mml:math id="M374" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, while the mass concentrations of NH<inline-formula><mml:math id="M375" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M376" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> and BC remain relatively low. The high proportions of OM and NO<inline-formula><mml:math id="M377" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> in Chinese PM<sub>2.5</sub> pollution were frequently reported in recent studies (Zhang et al., 2024; Liu et al., 2022). Since the implementation of the Air Pollution Prevention and Control Action Plan during 2013–2017 and the Three-year Action Plan to Win the Blue-Sky Defense War during 2018–2020 in China, effective reductions in sulfur dioxide (SO<inline-formula><mml:math id="M379" display="inline"><mml:mrow><mml:msub><mml:mi/><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> have gradually shifted the dominated chemical component of PM<sub>2.5</sub> pollution from SO<inline-formula><mml:math id="M381" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> to OM and NO<inline-formula><mml:math id="M382" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> (Niu et al., 2022). Furthermore, the decreased SO<inline-formula><mml:math id="M383" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> mass concentrations have amplified the competitive effect of NO<inline-formula><mml:math id="M384" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> on capturing NH<sub>3</sub> and NH<inline-formula><mml:math id="M386" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> in the thermodynamic equilibrium process, increasing NO<inline-formula><mml:math id="M387" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> mass concentrations (Geng et al., 2024). In comparison to the mass concentrations of the five PM<sub>2.5</sub> chemical components in MAM, SON and DJF, summertime mass concentrations are notably lower, which are attributed to reduced heating activities and enhanced wet deposition during summer periods (Liu et al., 2015; Ji et al., 2019). Moreover, the summertime vertical distributions of the five chemical components are relatively uniform, which may be attributed to the enhanced atmospheric vertical mixing effects induced by the unstable boundary layer (Roostaei et al., 2024).</p>
      <p id="d2e6221">Figure 8 also shows the averaged vertical profiles for 2017–2018 (Fig. 8b1), 2021–2022 (Fig. 8b2) and 2023–2024 (Fig. 8b3), as well as the annual change rate during 2021–2024 in Beijing (Fig. 8c). During 2017–2018, the implementation of clean air policies resulted in mass concentrations of NH<inline-formula><mml:math id="M389" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M390" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> and BC remaining below 8 <inline-formula><mml:math id="M391" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> (Fig. 8b1). However, the mass concentrations of NO<inline-formula><mml:math id="M393" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> exceeded 13 <inline-formula><mml:math id="M394" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> at altitudes below <inline-formula><mml:math id="M396" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">125</mml:mn></mml:mrow></mml:math></inline-formula> m, and those of OM exceeded 15 <inline-formula><mml:math id="M397" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> below <inline-formula><mml:math id="M399" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula> m, likely due to the nonlinear response to emission reduction (Li et al., 2021). Compared to 2017–2018, the mass concentrations of NH<inline-formula><mml:math id="M400" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M401" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M402" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC decreased significantly during 2021–2022 with average reductions of 8.36 %, 10.65 %, 6.58 %, 8.58 % and 5.85 %, respectively, from 50 to 3000 m (Fig. 8b2). These decreases are attributed to the continued implementation of clean air policies and reduced emissions associated with the COVID-19 pandemic control in China (Kang et al., 2020). During 2023–2024, the mass concentrations of NH<inline-formula><mml:math id="M403" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M404" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M405" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC increased relative to the 2021–2022 levels, with average increases of 5.49 %, 6.43 %, 4.65 %, 5.75 % and 4.40 %, respectively, from 50 to 3000 m (Fig. 8b3). This rebound over the NCP has be reported previously and is likely related to the offsetting effect of enhanced human activities following the relaxation of the COVID-19 pandemic lockdowns on the implementation of clean air policies (Song et al., 2025). From 2021 to 2024, the change rates of NH<inline-formula><mml:math id="M406" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M407" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> and SO<inline-formula><mml:math id="M408" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> at approximately 50, 310 and 770 m exhibit decreasing trends, with decrease rates ranging from 0.06 to 0.19 <inline-formula><mml:math id="M409" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> a<sup>−1</sup>. In contrast, the change rates of the five chemical components at approximately 80, 120, 1210 and 1900 m show increasing trends, with the highest increase rate of 0.83 <inline-formula><mml:math id="M412" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> a<sup>−1</sup> occurring at <inline-formula><mml:math id="M415" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">120</mml:mn></mml:mrow></mml:math></inline-formula> m for NO<inline-formula><mml:math id="M416" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> (Fig. 8c). In addition, OM exhibited a significant increase rate of 0.69 <inline-formula><mml:math id="M417" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> a<sup>−1</sup> at <inline-formula><mml:math id="M420" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">120</mml:mn></mml:mrow></mml:math></inline-formula> m, which may be related to the low sensitivity of high-altitude organic aerosols to emission controls (Zhao et al., 2017). Future clean air policies should prioritize strengthening control measures for OM and NO<inline-formula><mml:math id="M421" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> within the lower and middle parts of the atmospheric boundary layer.</p>
</sec>
<sec id="Ch1.S3.SS4">
  <label>3.4</label><title>Limitations and uncertainties</title>
      <p id="d2e6613">The deep learning module in our retrieval framework can establish a powerful mapping between optical and meteorological features and PM<sub>2.5</sub> chemical species, and physics-based explicit constraints can enhance the reliability and expandability of the mapping relationships. However, several limitations and sources of uncertainty remain and should be acknowledged when interpreting the results and extending the framework to broader applications.</p>
      <p id="d2e6625">First, the spatial scope of the training data is predominantly restricted to the NCP region. Expanding the retrieval framework with data from more diverse geographical locations is necessary to improve its global transferability. Second, the current retrieval framework primarily relies on extinction coefficients at a wavelength of 532 nm, exhibiting dependence on specific lidar instruments. Future retrieval framework should focus on integrating diverse optical features from additional wavelengths to enhancing adaptability and transferability. Third, the auxiliary input data used in both the deep learning module and the physics-constrained optimization are obtained from global reanalysis products, which may not fully capture local atmospheric conditions at specific observational sites, thereby introducing representativeness errors into the retrievals. Acquiring the vertical observational data for these auxiliary features can effectively mitigate the uncertainty induced by the input data. Fourth, the IMPROVE equation applied as an external physical constraint may introduce additional uncertainty into the retrievals due to its systematic estimation biases (Lowenthal and Kumar, 2016). Moreover, since the IMPROVE equation was applied as an external physical constraint to optimize the retrievals of PM<sub>2.5</sub> chemical components, the machine learning model itself was not intrinsically constrained by physical principles during its training. Future work could incorporate an internal physical constraint into the machine learning model to improve its physical interpretability by formulating a hybrid loss function for training that combines the traditional data-fitting term with a physical term. Finally, long-term acquisition of independent vertical profiling data from both tower-based and aircraft-based campaigns is essential for a comprehensive assessment of the robustness of the vertical retrievals with respect to varying sites, aerosol types, and seasons.</p>
</sec>
</sec>
<sec id="Ch1.S4" sec-type="conclusions">
  <label>4</label><title>Conclusions</title>
      <p id="d2e6646">This study proposes a novel lidar-based retrieval framework for obtaining the vertical profiles of five PM<sub>2.5</sub> chemical components (NH<inline-formula><mml:math id="M425" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M426" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M427" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC) for the first time. A long-term multivariate dataset was utilized to train a complex deep-learning module in the retrieval framework, thus interpreting the nonlinear relationship among lidar parameters, meteorological parameters and PM<sub>2.5</sub> chemical components. A physics-constrained optimization module was integrated into the retrieval framework, enhancing the generalization capabilities of predicting vertical profiles across diverse spatiotemporal scenarios.</p>
      <p id="d2e6706">In situ surface observations of hourly mass concentrations of PM<sub>2.5</sub> and its five chemical components over a training year and three non-training years were used to validate the accuracy of the retrieval framework in interpreting temporal variations. The results showed that the Pearson correlation coefficient values between the retrieved and observed concentrations ranged from 0.87 to 0.97 during the training year, and the variations in the retrieved surface PM<sub>2.5</sub> mass concentrations closely aligned with the observations during the non-training year, indicating the robust capabilities of temporal prediction and generalization in the retrieval framework. The retrieval framework was then applied to obtain the mass concentrations of five PM<sub>2.5</sub> chemical components at 23 non-training sites. The retrieved results exhibited patterns that are moderately consistent with the corresponding observations. However, limitations remained in accurately capturing short-term temporal variations, with a general tendency toward underestimation. Tower-based and aircraft-based field campaigns at altitudes ranging from surface to 2100 m were conducted to validate the accuracy of the retrieved vertical profiles of NH<inline-formula><mml:math id="M432" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M433" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M434" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC. The tower-based and aircraft-based verifications indicate that the retrieved and observed vertical profiles of these components exhibited consistent patterns in mass concentrations and proportions, demonstrating the robust capabilities of the retrieval framework in obtaining high-precision vertical profiles from non-training datasets.  Subsequently, SHapley Additive exPlanations (SHAP), an explainable technology, is integrated into the deep learning module to quantify the impact of multivariate input features on the retrieval of PM<sub>2.5</sub> chemical components. The results showed that the aerosol extinction coefficient at 532 nm, relative humidity and v-component wind are the dominant input features for predicting the five PM<sub>2.5</sub> chemical components with an averaged relative contribution of 14.43 %, 15.84 % and 16.77,%. The driving effect of the input features on the decision-making processes of the deep learning module was also determined by SHAP values.</p>
      <p id="d2e6796">Finally, we applied this framework to generate a long-term dataset of vertical profiles for NH<inline-formula><mml:math id="M437" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M438" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M439" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, OM and BC over six years (2017–2018, 2021–2024). From this dataset, we found that OM mass concentrations are consistently the highest across all four seasons, followed by NO<inline-formula><mml:math id="M440" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, while the mass concentrations of NH<inline-formula><mml:math id="M441" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, SO<inline-formula><mml:math id="M442" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> and BC remain relatively low. From 2021 to 2024, the change rates of NH<inline-formula><mml:math id="M443" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mo>+</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula>, NO<inline-formula><mml:math id="M444" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> and SO<inline-formula><mml:math id="M445" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">4</mml:mn><mml:mrow><mml:mn mathvariant="normal">2</mml:mn><mml:mo>-</mml:mo></mml:mrow></mml:msubsup></mml:mrow></mml:math></inline-formula> at approximately 50, 310 and 770 m exhibit decreasing trends, with decrease rates ranging from 0.06 to 0.19 <inline-formula><mml:math id="M446" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> a<sup>−1</sup>. However, OM and NO<inline-formula><mml:math id="M449" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> exhibited significant increase rates of 0.69 and 0.83 <inline-formula><mml:math id="M450" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> a<sup>−1</sup>, respectively, at an altitude of <inline-formula><mml:math id="M453" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">120</mml:mn></mml:mrow></mml:math></inline-formula> m. Future clean air policies should prioritize strengthening control measures for OM and NO<inline-formula><mml:math id="M454" display="inline"><mml:mrow><mml:msubsup><mml:mi/><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo></mml:msubsup></mml:mrow></mml:math></inline-formula> within the lower and middle parts of the atmospheric boundary layer. Our new retrieval framework offers a novel approach to acquiring vertical profiles of PM<sub>2.5</sub> chemical components. Future efforts should aim to mitigate the overestimation of carbonaceous aerosols by regulating the parameters involved in the physics-constrained optimization process.</p>
</sec>

      
      </body>
    <back><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d2e7031">The source codes and related data in this manuscript are freely available upon request through the corresponding author (tingyang@mail.iap.ac.cn).</p>
  </notes><app-group>
        <supplementary-material position="anchor"><p id="d2e7034">The supplement related to this article is available online at <inline-supplementary-material xlink:href="https://doi.org/10.5194/amt-19-2225-2026-supplement" xlink:title="pdf">https://doi.org/10.5194/amt-19-2225-2026-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e7043">HL developed the retrieval framework, carried out the analysis and verification, as well as wrote this paper. TY provided scientific guidance and wrote this paper. TY and YS provided various measurement data. ZW did overall supervision. All authors reviewed and revised this paper.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e7049">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e7055">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e7061">We thank the National Key Research and Development Program of China (grant no. 2023YFC3705801), and the National Natural Science Foundation of China (grant no. 42275122). Ting Yang would like to express gratitude towards the Program of the Youth Innovation Promotion Association (CAS). We thank for the technical support of the National large Scientific and Technological Infrastructure “Earth System Numerical Simulation Facility” (<uri>https://cstr.cn/31134.02.EL</uri>, last access: 20 December 2025), and the data support of the China National Environmental Monitoring Center.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e7069">This research has been supported by the National Natural Science Foundation of China (grant no. 42422506).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e7075">This paper was edited by Omar Torres and reviewed by two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bib1"><label>1</label><mixed-citation>Al-Faiz, M. Z., Ibrahim, A. A., and Hadi, S. M.: The effect of Z-Score standardization (normalization) on binary input due the speed of learning in back-propagation neural network, Iraqi J. Inf. Commun. Technol., 1, 42–48, <ext-link xlink:href="https://doi.org/10.31987/ijict.1.3.41" ext-link-type="DOI">10.31987/ijict.1.3.41</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib2"><label>2</label><mixed-citation>Alzubaidi, L., Zhang, J., Humaidi, A. J., Al-Dujaili, A., Duan, Y., Al-Shamma, O., Santamaría, J., Fadhel, M. A., Al-Amidie, M., and Farhan, L.: Review of deep learning: concepts, CNN architectures, challenges, applications, future directions, J. Big Data, 8, 53, <ext-link xlink:href="https://doi.org/10.1186/s40537-021-00444-8" ext-link-type="DOI">10.1186/s40537-021-00444-8</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib3"><label>3</label><mixed-citation>Ansmann, A., Bösenberg, J., Chaikovsky, A., Comerón, A., Eckhardt, S., Eixmann, R., Freudenthaler, V., Ginoux, P., Komguem, L., Linné, H., Márquez, M. Á. L., Matthias, V., Mattis, I., Mitev, V., Müller, D., Music, S., Nickovic, S., Pelon, J., Sauvage, L., Sobolewsky, P., Srivastava, M. K., Stohl, A., Torres, O., Vaughan, G., Wandinger, U., and Wiegner, M.: Long-range transport of Saharan dust to northern Europe: The 11–16 October 2001 outbreak observed with EARLINET, J. Geophys. Res.: Atmos., 108, 4783, <ext-link xlink:href="https://doi.org/10.1029/2003JD003757" ext-link-type="DOI">10.1029/2003JD003757</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bib4"><label>4</label><mixed-citation>Araki, S., Shimadera, H., and Shima, M.: Continuous estimations of daily PM<sub>2.5</sub> chemical components from temporally sparse monitoring data using a machine learning approach, Atmos. Pollut. Res., 13, 101580, <ext-link xlink:href="https://doi.org/10.1016/j.apr.2022.101580" ext-link-type="DOI">10.1016/j.apr.2022.101580</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib5"><label>5</label><mixed-citation>Blum, A. L. and Langley, P.: Selection of relevant features and examples in machine learning, Artif. Intel., 97, 245–271, <ext-link xlink:href="https://doi.org/10.1016/S0004-3702(97)00063-5" ext-link-type="DOI">10.1016/S0004-3702(97)00063-5</ext-link>, 1997.</mixed-citation></ref>
      <ref id="bib1.bib6"><label>6</label><mixed-citation>Cabello-Solorzano, K., Ortigosa de Araujo, I., Peña, M., Correia, L., and J. Tallón-Ballesteros, A.: The Impact of Data Normalization on the Accuracy of Machine Learning Algorithms: A Comparative Analysis, 18th International Conference on Soft Computing Models in Industrial and Environmental Applications (SOCO 2023), Cham, 344–353, <ext-link xlink:href="https://doi.org/10.1007/978-3-031-42536-3_33" ext-link-type="DOI">10.1007/978-3-031-42536-3_33</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib7"><label>7</label><mixed-citation>Chatfield, R. B., Sorek-Hamer, M., Esswein, R. F., and Lyapustin, A.: Satellite mapping of PM<sub>2.5</sub> episodes in the wintertime San Joaquin Valley: a “static” model using column water vapor, Atmos. Chem. Phys., 20, 4379–4397, <ext-link xlink:href="https://doi.org/10.5194/acp-20-4379-2020" ext-link-type="DOI">10.5194/acp-20-4379-2020</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib8"><label>8</label><mixed-citation>Chen, Z., Chen, D., Zhao, C., Kwan, M.-p., Cai, J., Zhuang, Y., Zhao, B., Wang, X., Chen, B., Yang, J., Li, R., He, B., Gao, B., Wang, K., and Xu, B.: Influence of meteorological conditions on PM<sub>2.5</sub> concentrations across China: A review of methodology and mechanism, Environ. Int., 139, <ext-link xlink:href="https://doi.org/10.1016/j.envint.2020.105558" ext-link-type="DOI">10.1016/j.envint.2020.105558</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib9"><label>9</label><mixed-citation>Deb, K., Pratap, A., Agarwal, S., and Meyarivan, T.: A fast and elitist multiobjective genetic algorithm: NSGA-II, IEEE Trans. Evol. Comput., 6, 182–197, <ext-link xlink:href="https://doi.org/10.1109/4235.996017" ext-link-type="DOI">10.1109/4235.996017</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bib10"><label>10</label><mixed-citation>Dubey, R., Patra, A. K., and Nazneen: Vertical profile of particulate matter: A review of techniques and methods, Air Qual. Atmos. Hlth., 15, 979–1010, <ext-link xlink:href="https://doi.org/10.1007/s11869-022-01192-1" ext-link-type="DOI">10.1007/s11869-022-01192-1</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib11"><label>11</label><mixed-citation>Fang, Y., Ye, C., Wang, J., Wu, Y., Hu, M., Lin, W., Xu, F., and Zhu, T.: Relative humidity and O3 concentration as two prerequisites for sulfate formation, Atmos. Chem. Phys., 19, 12295–12307, <ext-link xlink:href="https://doi.org/10.5194/acp-19-12295-2019" ext-link-type="DOI">10.5194/acp-19-12295-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib12"><label>12</label><mixed-citation> Friedman, J. H., Bentley, J. L., and Finkel, R. A.: An algorithm for finding best matches in logarithmic expected time, ACM T. Math. Software (TOMS), 3, 209–226, 1977.</mixed-citation></ref>
      <ref id="bib1.bib13"><label>13</label><mixed-citation>Gao, J., Wei, Y., Shi, G., Yu, H., Zhang, Z., Song, S., Wang, W., Liang, D., and Feng, Y.: Roles of RH, aerosol pH and sources in concentrations of secondary inorganic aerosols, during different pollution periods, Atmos. Environ., 241, 117770, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2020.117770" ext-link-type="DOI">10.1016/j.atmosenv.2020.117770</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib14"><label>14</label><mixed-citation>Gelbart, M. A., Snoek, J., and Adams, R. P.: Bayesian optimization with unknown constraints, arXiv [preprint], 1403, 5607, <ext-link xlink:href="https://doi.org/10.48550/arXiv.1403.5607" ext-link-type="DOI">10.48550/arXiv.1403.5607</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib15"><label>15</label><mixed-citation>Geng, G., Liu, Y., Liu, Y., Liu, S., Cheng, J., Yan, L., Wu, N., Hu, H., Tong, D., Zheng, B., Yin, Z., He, K., and Zhang, Q.: Efficacy of China's clean air actions to tackle PM<sub>2.5</sub> pollution between 2013 and 2020, Nat. Geosci., 17, 987–994, <ext-link xlink:href="https://doi.org/10.1038/s41561-024-01540-z" ext-link-type="DOI">10.1038/s41561-024-01540-z</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib16"><label>16</label><mixed-citation>Guo, M. H., Xu, T. X., Liu, J. J., Liu, Z. N., Jiang, P. T., Mu, T. J., Zhang, S. H., Martin, R. R., Cheng, M. M., and Hu, S. M.: Attention mechanisms in computer vision: A survey, Comput. Vis. Media, 8, 331–368, <ext-link xlink:href="https://doi.org/10.1007/s41095-022-0271-y" ext-link-type="DOI">10.1007/s41095-022-0271-y</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib17"><label>17</label><mixed-citation>Hara, Y., Nishizawa, T., Sugimoto, N., Osada, K., Yumimoto, K., Uno, I., Kudo, R., and Ishimoto, H.: Retrieval of Aerosol Components Using Multi-Wavelength Mie-Raman Lidar and Comparison with Ground Aerosol Sampling, Remote Sens., 10, <ext-link xlink:href="https://doi.org/10.3390/rs10060937" ext-link-type="DOI">10.3390/rs10060937</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib18"><label>18</label><mixed-citation>Hou, L., Dai, Q., Song, C., Liu, B., Guo, F., Dai, T., Li, L., Liu, B., Bi, X., Zhang, Y., and Feng, Y.: Revealing Drivers of Haze Pollution by Explainable Machine Learning, Environ. Sci. Technol. Lett., 9, 112–119, <ext-link xlink:href="https://doi.org/10.1021/acs.estlett.1c00865" ext-link-type="DOI">10.1021/acs.estlett.1c00865</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib19"><label>19</label><mixed-citation>Ji, W., Wang, Y., and Zhuang, D.: Spatial distribution differences in PM<sub>2.5</sub> concentration between heating and non-heating seasons in Beijing, China, Environ. Pollut., 248, 574–583, <ext-link xlink:href="https://doi.org/10.1016/j.envpol.2019.01.002" ext-link-type="DOI">10.1016/j.envpol.2019.01.002</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib20"><label>20</label><mixed-citation>Jia, Z., Doherty, R. M., Ordóñez, C., Li, C., Wild, O., Jain, S., and Tang, X.: The impact of large-scale circulation on daily fine particulate matter (PM<sub>2.5</sub>) over major populated regions of China in winter, Atmos. Chem. Phys., 22, 6471–6487, <ext-link xlink:href="https://doi.org/10.5194/acp-22-6471-2022" ext-link-type="DOI">10.5194/acp-22-6471-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib21"><label>21</label><mixed-citation>Kang, Y.-H., You, S., Bae, M., Kim, E., Son, K., Bae, C., Kim, Y., Kim, B.-U., Kim, H. C., and Kim, S.: The impacts of COVID-19, meteorology, and emission control policies on PM<sub>2.5</sub> drops in Northeast Asia, Sci. Rep., 10, 22112, <ext-link xlink:href="https://doi.org/10.1038/s41598-020-79088-2" ext-link-type="DOI">10.1038/s41598-020-79088-2</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib22"><label>22</label><mixed-citation>Kavianpour, P., Kavianpour, M., Jahani, E., and Ramezani, A.: A CNN-BiLSTM model with attention mechanism for earthquake prediction, J. Supercomput., 79, 19194–19226, <ext-link xlink:href="https://doi.org/10.1007/s11227-023-05369-y" ext-link-type="DOI">10.1007/s11227-023-05369-y</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib23"><label>23</label><mixed-citation>Kim, S., Yang, J., Park, J., Song, I., Kim, D.-G., Jeon, K., Kim, H., and Yi, S.-M.: Health effects of PM<sub>2.5</sub> constituents and source contributions in major metropolitan cities, South Korea, Environ. Sci. Pollut. Res., 29, 82873–82887, <ext-link xlink:href="https://doi.org/10.1007/s11356-022-21592-1" ext-link-type="DOI">10.1007/s11356-022-21592-1</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib24"><label>24</label><mixed-citation>Lee, Y. S., Choi, E., Park, M., Jo, H., Park, M., Nam, E., Kim, D. G., Yi, S.-M., and Kim, J. Y.: Feature extraction and prediction of fine particulate matter (PM<sub>2.5</sub>) chemical constituents using four machine learning models, Expert Syst. Appl., 221, 119696, <ext-link xlink:href="https://doi.org/10.1016/j.eswa.2023.119696" ext-link-type="DOI">10.1016/j.eswa.2023.119696</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib25"><label>25</label><mixed-citation>Lei, L., Sun, Y., Ouyang, B., Qiu, Y., Xie, C., Tang, G., Zhou, W., He, Y., Wang, Q., Cheng, X., Fu, P., and Wang, Z.: Vertical Distributions of Primary and Secondary Aerosols in Urban Boundary Layer: Insights into Sources, Chemistry, and Interaction with Meteorology, Environ. Sci. Technol., 55, 4542–4552, <ext-link xlink:href="https://doi.org/10.1021/acs.est.1c00479" ext-link-type="DOI">10.1021/acs.est.1c00479</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib26"><label>26</label><mixed-citation>Li, H., Yang, T., Du, Y., Tan, Y., and Wang, Z.: Interpreting hourly mass concentrations of PM<sub>2.5</sub> chemical components with an optimal deep-learning model, J. Environ. Sci., 151, 125–139, <ext-link xlink:href="https://doi.org/10.1016/j.jes.2024.03.037" ext-link-type="DOI">10.1016/j.jes.2024.03.037</ext-link>, 2025a.</mixed-citation></ref>
      <ref id="bib1.bib27"><label>27</label><mixed-citation>Li, H., Yang, T., Song, Y., Tian, P., He, J., Tan, Y., Tian, Y., Sun, Y., and Wang, Z.: Unveiling the intricate dynamics of PM<sub>2.5</sub> sulfate aerosols in the urban boundary layer: A pioneering two-year vertical profiling and machine learning-enhanced analysis in global Mega-City, Urban Clim., 61, 102424, <ext-link xlink:href="https://doi.org/10.1016/j.uclim.2025.102424" ext-link-type="DOI">10.1016/j.uclim.2025.102424</ext-link>, 2025b.</mixed-citation></ref>
      <ref id="bib1.bib28"><label>28</label><mixed-citation>Li, M., Zhang, Z., Yao, Q., Wang, T., Xie, M., Li, S., Zhuang, B., and Han, Y.: Nonlinear responses of particulate nitrate to NO<sub>x</sub> emission controls in the megalopolises of China, Atmos. Chem. Phys., 21, 15135–15152, <ext-link xlink:href="https://doi.org/10.5194/acp-21-15135-2021" ext-link-type="DOI">10.5194/acp-21-15135-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib29"><label>29</label><mixed-citation>Liang, L., Engling, G., Cheng, Y., Zhang, X., Sun, J., Xu, W., Liu, C., Zhang, G., Xu, H., Liu, X., and Ma, Q.: Influence of High Relative Humidity on Secondary Organic Carbon: Observations at a Background Site in East China, J. Meteor. Res., 33, 905–913, <ext-link xlink:href="https://doi.org/10.1007/s13351-019-8202-2" ext-link-type="DOI">10.1007/s13351-019-8202-2</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib30"><label>30</label><mixed-citation>Lin, G. Y., Chen, H. W., Chen, B. J., and Chen, S. C.: A machine learning model for predicting PM<sub>2.5</sub> and nitrate concentrations based on long-term water-soluble inorganic salts datasets at a road site station, Chemosphere, 289, <ext-link xlink:href="https://doi.org/10.1016/j.chemosphere.2021.133123" ext-link-type="DOI">10.1016/j.chemosphere.2021.133123</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib31"><label>31</label><mixed-citation>Liu, K., Zhang, Y., He, H., Xiao, H., Wang, S., Zhang, Y., Li, H., and Qian, X.: Time series prediction of the chemical components of PM<sub>2.5</sub> based on a deep learning model, Chemosphere, 342, 140153, <ext-link xlink:href="https://doi.org/10.1016/j.chemosphere.2023.140153" ext-link-type="DOI">10.1016/j.chemosphere.2023.140153</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib32"><label>32</label><mixed-citation>Liu, S., Geng, G., Xiao, Q., Zheng, Y., Liu, X., Cheng, J., and Zhang, Q.: Tracking Daily Concentrations of PM<sub>2.5</sub> Chemical Composition in China since 2000, Environ. Sci. Technol., 56, 16517–16527, <ext-link xlink:href="https://doi.org/10.1021/acs.est.2c06510" ext-link-type="DOI">10.1021/acs.est.2c06510</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib33"><label>33</label><mixed-citation>Liu, Z., Hu, B., Wang, L., Wu, F., Gao, W., and Wang, Y.: Seasonal and diurnal variation in particulate matter (PM<sub>10</sub> and PM<sub>2.5</sub>) at an urban site of Beijing: analyses from a 9-year study, Environ. Sci. Pollut. Res., 22, 627–642, <ext-link xlink:href="https://doi.org/10.1007/s11356-014-3347-0" ext-link-type="DOI">10.1007/s11356-014-3347-0</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib34"><label>34</label><mixed-citation>Lowenthal, D. H. and Kumar, N.: Evaluation of the IMPROVE Equation for estimating aerosol light extinction, J. Air Waste Manage., 66, 726–737, <ext-link xlink:href="https://doi.org/10.1080/10962247.2016.1178187" ext-link-type="DOI">10.1080/10962247.2016.1178187</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib35"><label>35</label><mixed-citation>Lundberg, S. M., Erion, G., Chen, H., DeGrave, A., Prutkin, J. M., Nair, B., Katz, R., Himmelfarb, J., Bansal, N., and Lee, S.-I.: From local explanations to global understanding with explainable AI for trees, Nat. Mach. Intell., 2, 56–67, <ext-link xlink:href="https://doi.org/10.1038/s42256-019-0138-9" ext-link-type="DOI">10.1038/s42256-019-0138-9</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib36"><label>36</label><mixed-citation>Lv, L., Wei, P., Li, J., and Hu, J.: Application of machine learning algorithms to improve numerical simulation prediction of PM<sub>2.5</sub> and chemical components, Atmos. Pollut. Res., 12, 101211, <ext-link xlink:href="https://doi.org/10.1016/j.apr.2021.101211" ext-link-type="DOI">10.1016/j.apr.2021.101211</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib37"><label>37</label><mixed-citation>Ma, T., Xiang, G., Shi, Y., and Liu, Y.: Horizontal in situ stresses prediction using a CNN-BiLSTM-attention hybrid neural network, Geomech. Geophys. Geo-energ. Geo-resour. 8, 152, <ext-link xlink:href="https://doi.org/10.1007/s40948-022-00467-2" ext-link-type="DOI">10.1007/s40948-022-00467-2</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib38"><label>38</label><mixed-citation>Matus, A. V., Nowottnick, E. P., Yorks, J. E., and da Silva, A. M.: Enhancing surface PM<sub>2.5</sub> air quality estimates in GEOS using CATS lidar data, Earth  Space Sci., 12, e2024EA004078, <ext-link xlink:href="https://doi.org/10.1029/2024EA004078" ext-link-type="DOI">10.1029/2024EA004078</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib39"><label>39</label><mixed-citation>Meng, X., Hand, J. L., Schichtel, B. A., and Liu, Y.: Space-time trends of PM<sub>2.5</sub> constituents in the conterminous United States estimated by a machine learning approach, 2005–2015, Environ. Int., 121, 1137–1147, <ext-link xlink:href="https://doi.org/10.1016/j.envint.2018.10.029" ext-link-type="DOI">10.1016/j.envint.2018.10.029</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib40"><label>40</label><mixed-citation>Menon, S., Hansen, J., Nazarenko, L., and Luo, Y. F.: Climate effects of black carbon aerosols in China and India, Science, 297, 2250–2253, <ext-link xlink:href="https://doi.org/10.1126/science.1075159" ext-link-type="DOI">10.1126/science.1075159</ext-link>, 2002.</mixed-citation></ref>
      <ref id="bib1.bib41"><label>41</label><mixed-citation>Miao, Y., Zhang, X., Che, H., and Liu, S.: Influence of Multi-Scale Meteorological Processes on PM<sub>2.5</sub> Pollution in Wuhan, Central China, Front. Environ. Sci., 10, <ext-link xlink:href="https://doi.org/10.3389/fenvs.2022.918076" ext-link-type="DOI">10.3389/fenvs.2022.918076</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib42"><label>42</label><mixed-citation>Morgan, W. T., Allan, J. D., Bower, K. N., Capes, G., Crosier, J., Williams, P. I., and Coe, H.: Vertical distribution of sub-micron aerosol chemical composition from North-Western Europe and the North-East Atlantic, Atmos. Chem. Phys., 9, 5389–5401, <ext-link xlink:href="https://doi.org/10.5194/acp-9-5389-2009" ext-link-type="DOI">10.5194/acp-9-5389-2009</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bib43"><label>43</label><mixed-citation>Nishizawa, T., Sugimoto, N., Matsui, I., Shimizu, A., and Okamoto, H.: Algorithms to retrieve optical properties of three component aerosols from two-wavelength backscatter and one-wavelength polarization lidar measurements considering nonsphericity of dust, J. Quant. Spectrosc. Radiat. Transfer, 112, 254–267, <ext-link xlink:href="https://doi.org/10.1016/j.jqsrt.2010.06.002" ext-link-type="DOI">10.1016/j.jqsrt.2010.06.002</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bib44"><label>44</label><mixed-citation>Nishizawa, T., Sugimoto, N., Matsui, I., Shimizu, A., Hara, Y., Itsushi, U., Yasunaga, K., Kudo, R., and Kim, S.-W.: Ground-based network observation using Mie–Raman lidars and multi-wavelength Raman lidars and algorithm to retrieve distributions of aerosol components, J. Quant. Spectrosc. Radiat. Transfer, 188, 79–93, <ext-link xlink:href="https://doi.org/10.1016/j.jqsrt.2016.06.031" ext-link-type="DOI">10.1016/j.jqsrt.2016.06.031</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib45"><label>45</label><mixed-citation>Niu, Y., Li, X., Qi, B., and Du, R.: Variation in the concentrations of atmospheric PM<sub>2.5</sub> and its main chemical components in an eastern China city (Hangzhou) since the release of the Air Pollution Prevention and Control Action Plan in 2013, Air Qual. Atmos. Hlth., 15, 321–337, <ext-link xlink:href="https://doi.org/10.1007/s11869-021-01107-6" ext-link-type="DOI">10.1007/s11869-021-01107-6</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib46"><label>46</label><mixed-citation>O'Shea, K. and Nash, R.: An Introduction to Convolutional Neural Networks, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.1511.08458" ext-link-type="DOI">10.48550/arXiv.1511.08458</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib47"><label>47</label><mixed-citation>Pitchford, M., Malm, W., Schichtel, B., Kumar, N., Lowenthal, D., and Hand, J.: Revised Algorithm for Estimating Light Extinction from IMPROVE Particle Speciation Data, J. Air Waste Manage., 57, 1326–1336, <ext-link xlink:href="https://doi.org/10.3155/1047-3289.57.11.1326" ext-link-type="DOI">10.3155/1047-3289.57.11.1326</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bib48"><label>48</label><mixed-citation>Rasmussen, C. E.: Gaussian Processes in Machine Learning, in: Advanced Lectures on Machine Learning: ML Summer Schools 2003, Canberra, Australia, 2–14 February  2003, Tübingen, Germany, 4–16  August  2003, Revised Lectures, edited by: Bousquet, O., von Luxburg, U., and Rätsch, G., Springer Berlin Heidelberg, Berlin, Heidelberg, 63–71, <ext-link xlink:href="https://doi.org/10.1007/978-3-540-28650-9_4" ext-link-type="DOI">10.1007/978-3-540-28650-9_4</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bib49"><label>49</label><mixed-citation>Roostaei, V., Gharibzadeh, F., Shamsipour, M., Faridi, S., and Hassanvand, M. S.: Vertical distribution of ambient air pollutants (PM<sub>2.5</sub>, PM<sub>10</sub>, NO<sub><italic>X</italic></sub>, and NO<sub>2</sub>); A systematic review, Heliyon, 10, e39726, <ext-link xlink:href="https://doi.org/10.1016/j.heliyon.2024.e39726" ext-link-type="DOI">10.1016/j.heliyon.2024.e39726</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib50"><label>50</label><mixed-citation>Shahriari, B., Swersky, K., Wang, Z., Adams, R. P., and Freitas, N. d.: Taking the Human Out of the Loop: A Review of Bayesian Optimization, Proceedings of the IEEE, 104, 148–175, <ext-link xlink:href="https://doi.org/10.1109/JPROC.2015.2494218" ext-link-type="DOI">10.1109/JPROC.2015.2494218</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib51"><label>51</label><mixed-citation>Shan, L., Liu, Y., Tang, M., Yang, M., and Bai, X.: CNN-BiLSTM hybrid neural networks with attention mechanism for well log prediction, J. Petrol. Sci. Eng., 205, 108838, <ext-link xlink:href="https://doi.org/10.1016/j.petrol.2021.108838" ext-link-type="DOI">10.1016/j.petrol.2021.108838</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib52"><label>52</label><mixed-citation>Siami-Namini, S., Tavakoli, N., and Namin, A. S.: The Performance of LSTM and BiLSTM in Forecasting Time Series, 2019 IEEE Int. Conf. on Big Data (Big Data), Los Angeles, CA, USA, 3285–3292, <ext-link xlink:href="https://doi.org/10.1109/BigData47090.2019.9005997" ext-link-type="DOI">10.1109/BigData47090.2019.9005997</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib53"><label>53</label><mixed-citation>Song, Q., Huang, L., Zhang, Y., Li, Z., Wang, S., Zhao, B., Yin, D., Ma, M., Li, S., Liu, B., Zhu, L., Chang, X., Gao, D., Jiang, Y., Dong, Z., Shi, H., and Hao, J.: Driving Factors of PM<sub>2.5</sub> Pollution Rebound in North China Plain in Early 2023, Environ. Sci. Technol. Lett., 12, 305–312, <ext-link xlink:href="https://doi.org/10.1021/acs.estlett.4c01153" ext-link-type="DOI">10.1021/acs.estlett.4c01153</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib54"><label>54</label><mixed-citation>Srinivas, N. and Deb, K.: Muiltiobjective Optimization Using Nondominated Sorting in Genetic Algorithms, Evol. Comput., 2, 221–248, <ext-link xlink:href="https://doi.org/10.1162/evco.1994.2.3.221" ext-link-type="DOI">10.1162/evco.1994.2.3.221</ext-link>, 1994.</mixed-citation></ref>
      <ref id="bib1.bib55"><label>55</label><mixed-citation>Sugimoto, N., Uno, I., Nishikawa, M., Shimizu, A., Matsui, I., Dong, X., Chen, Y., and Quan, H.: Record heavy Asian dust in Beijing in 2002: Observations and model analysis of recent events, Geophys. Res. Lett., 30, <ext-link xlink:href="https://doi.org/10.1029/2002gl016349" ext-link-type="DOI">10.1029/2002gl016349</ext-link>, 2003.</mixed-citation></ref>
      <ref id="bib1.bib56"><label>56</label><mixed-citation>Sugimoto, N., Shimizu, A., Matsui, I., Uno, I., Arao, K., Dong, X., Zhao, S., Zhou, J., and Lee, C.-H.: Study of Asian Dust Phenomena in 2001–2003 Using A Network of Continuously Operated Polarization Lidars, Water, Air, &amp; Soil Pollution: Focus, 5, 145–157, <ext-link xlink:href="https://doi.org/10.1007/s11267-005-0732-1" ext-link-type="DOI">10.1007/s11267-005-0732-1</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bib57"><label>57</label><mixed-citation>Sun, Y., Du, W., Wang, Q., Zhang, Q., Chen, C., Chen, Y., Chen, Z., Fu, P., Wang, Z., Gao, Z., and Worsnop, D. R.: Real-Time Characterization of Aerosol Particle Composition above the Urban Canopy in Beijing: Insights into the Interactions between the Atmospheric Boundary Layer and Aerosol Chemistry, Environ. Sci. Technol., 49, 11340–11347, <ext-link xlink:href="https://doi.org/10.1021/acs.est.5b02373" ext-link-type="DOI">10.1021/acs.est.5b02373</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib58"><label>58</label><mixed-citation>Tan, T., Hu, M., Li, M., Guo, Q., Wu, Y., Fang, X., Gu, F., Wang, Y., and Wu, Z.: New insight into PM<sub>2.5</sub> pollution patterns in Beijing based on one-year measurement of chemical compositions, Sci. Total Environ., 621, 734–743, <ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2017.11.208" ext-link-type="DOI">10.1016/j.scitotenv.2017.11.208</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib59"><label>59</label><mixed-citation>Tao, J., Zhang, L., Cao, J., and Zhang, R.: A review of current knowledge concerning PM<sub>2.5</sub> chemical composition, aerosol optical properties and their relationships across China, Atmos. Chem. Phys., 17, 9485–9518, <ext-link xlink:href="https://doi.org/10.5194/acp-17-9485-2017" ext-link-type="DOI">10.5194/acp-17-9485-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib60"><label>60</label><mixed-citation>Tao, Z., Wang, Z., Yang, S., Shan, H., Ma, X., Zhang, H., Zhao, S., Liu, D., Xie, C., and Wang, Y.: Profiling the PM<sub>2.5</sub> mass concentration vertical distribution in the boundary layer, Atmos. Meas. Tech., 9, 1369–1376, <ext-link xlink:href="https://doi.org/10.5194/amt-9-1369-2016" ext-link-type="DOI">10.5194/amt-9-1369-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib61"><label>61</label><mixed-citation>Tesche, M., Ansmann, A., MüLler, D., Althausen, D., Mattis, I., Heese, B., Freudenthaler, V., Wiegner, M., Esselborn, M., Pisani, G., and Knippertz, P.: Vertical profiling of Saharan dust with Raman lidars and airborne HSRL in southern Morocco during SAMUM, Tellus B: Chem. Phys. Meteor., 61, 144–164, <ext-link xlink:href="https://doi.org/10.1111/j.1600-0889.2008.00390.x" ext-link-type="DOI">10.1111/j.1600-0889.2008.00390.x</ext-link>, 2009.</mixed-citation></ref>
      <ref id="bib1.bib62"><label>62</label><mixed-citation>Toth, T. D., Zhang, J., Vaughan, M. A., Reid, J. S., and Campbell, J. R.: Retrieving particulate matter concentrations over the contiguous United States using CALIOP observations, Atmos. Environ., 274, 118979, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2022.118979" ext-link-type="DOI">10.1016/j.atmosenv.2022.118979</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib63"><label>63</label><mixed-citation>Verma, S., Pant, M., and Snasel, V.: A Comprehensive Review on NSGA-II for Multi-Objective Combinatorial Optimization Problems, IEEE Access, 9, 57757–57791, <ext-link xlink:href="https://doi.org/10.1109/ACCESS.2021.3070634" ext-link-type="DOI">10.1109/ACCESS.2021.3070634</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib64"><label>64</label><mixed-citation>Wang, F., Yang, T., Wang, Z., Wang, H., Chen, X., Sun, Y., Li, J., Tang, G., and Chai, W.: Algorithm for vertical distribution of boundary layer aerosol components in remote-sensing data, Atmos. Meas. Tech., 15, 6127–6144, <ext-link xlink:href="https://doi.org/10.5194/amt-15-6127-2022" ext-link-type="DOI">10.5194/amt-15-6127-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib65"><label>65</label><mixed-citation>Wang, J., Li, J., Ye, J., Zhao, J., Wu, Y., Hu, J., Liu, D., Nie, D., Shen, F., Huang, X., Huang, D. D., Ji, D., Sun, X., Xu, W., Guo, J., Song, S., Qin, Y., Liu, P., Turner, J. R., Lee, H. C., Hwang, S., Liao, H., Martin, S. T., Zhang, Q., Chen, M., Sun, Y., Ge, X., and Jacob, D. J.: Fast sulfate formation from oxidation of SO<sub>2</sub> by NO<sub>2</sub> and HONO observed in Beijing haze, Nat. Commun., 11, 2844, <ext-link xlink:href="https://doi.org/10.1038/s41467-020-16683-x" ext-link-type="DOI">10.1038/s41467-020-16683-x</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bib66"><label>66</label><mixed-citation>Wang, S. and Zhang, Y.: An attention-based CNN model integrating observational and simulation data for high-resolution spatial estimation of urban air quality, Atmos. Environ., 340, 120921, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2024.120921" ext-link-type="DOI">10.1016/j.atmosenv.2024.120921</ext-link>, 2025.</mixed-citation></ref>
      <ref id="bib1.bib67"><label>67</label><mixed-citation>Wang, X., Zhang, R., Tan, Y., and Yu, W.: Dominant synoptic patterns associated with the decay process of PM<sub>2.5</sub> pollution episodes around Beijing, Atmos. Chem. Phys., 21, 2491–2508, <ext-link xlink:href="https://doi.org/10.5194/acp-21-2491-2021" ext-link-type="DOI">10.5194/acp-21-2491-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bib68"><label>68</label><mixed-citation>Welton, E., Campbell, J., Spinhirne, J., and Scott, V. S.: Global monitoring of clouds and aerosols using a network of micropulse lidar systems, Second International Asia-Pacific Symposium on Remote Sensing of the Atmosphere, Environment, and Space, SPIE, <ext-link xlink:href="https://doi.org/10.1117/12.417040" ext-link-type="DOI">10.1117/12.417040</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bib69"><label>69</label><mixed-citation>Wu, J., Chen, X.-Y., Zhang, H., Xiong, L.-D., Lei, H., and Deng, S.-H.: Hyperparameter Optimization for Machine Learning Models Based on Bayesian Optimizationb, J. Electron. Sci.  Technol., 17, 26–40, <ext-link xlink:href="https://doi.org/10.11989/JEST.1674-862X.80904120" ext-link-type="DOI">10.11989/JEST.1674-862X.80904120</ext-link>, 2019a.</mixed-citation></ref>
      <ref id="bib1.bib70"><label>70</label><mixed-citation>Wu, L. B., Ren, H., Wang, P., Chen, J., Fang, Y. T., Hu, W., Ren, L. J., Deng, J. J., Song, Y., Li, J., Sun, Y. L., Wang, Z. F., Liu, C. Q., Ying, Q., and Fu, P. Q.: Aerosol Ammonium in the Urban Boundary Layer in Beijing: Insights from Nitrogen Isotope Ratios and Simulations in Summer 2015, Environ. Sci. Technol. Lett., 6, 389–395, <ext-link xlink:href="https://doi.org/10.1021/acs.estlett.9b00328" ext-link-type="DOI">10.1021/acs.estlett.9b00328</ext-link>, 2019b.</mixed-citation></ref>
      <ref id="bib1.bib71"><label>71</label><mixed-citation>Xu, Y., Zhu, B., Shi, S., and Huang Y.: Two Inversion Layers and Their Impacts on PM<sub>2.5</sub> Concentration over the Yangtze River Delta, China, J. Appl. Meteor. Climatol., 58, 2349–2362, <ext-link xlink:href="https://doi.org/10.1175/JAMC-D-19-0008.1" ext-link-type="DOI">10.1175/JAMC-D-19-0008.1</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bib72"><label>72</label><mixed-citation>Yang, M. and Wang, J.: Adaptability of Financial Time Series Prediction Based on BiLSTM, Procedia Comput. Sci., 199, 18–25, <ext-link xlink:href="https://doi.org/10.1016/j.procs.2022.01.003" ext-link-type="DOI">10.1016/j.procs.2022.01.003</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib73"><label>73</label><mixed-citation>Yang, T., Li, H., Xu, W., Song, Y., Xu, L., Wang, H., Wang, F., Sun, Y., Wang, Z., and Fu, P.: Strong Impacts of Regional Atmospheric Transport on the Vertical Distribution of Aerosol Ammonium over Beijing, Environ. Sci. Technol. Lett., 11, 29–34, <ext-link xlink:href="https://doi.org/10.1021/acs.estlett.3c00791" ext-link-type="DOI">10.1021/acs.estlett.3c00791</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib74"><label>74</label><mixed-citation>Yang, T., Wang, H., Li, H., Guo, X., Wang, D., Chen, X., Wang, F., Xin, J., Sun, Y., and Wang, Z.: Quantitative attribution of wintertime haze in coastal east China to local emission and regional intrusion under a stagnant internal boundary layer, Atmos. Environ., 276, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2022.119006" ext-link-type="DOI">10.1016/j.atmosenv.2022.119006</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bib75"><label>75</label><mixed-citation>Zhang, J., Su, Y., Chen, C., Guo, W., Tan, Q., Feng, M., Song, D., Jiang, T., Chen, Q., Li, Y., Li, W., Wang, Y., Huang, X., Han, L., Wu, W., and Wang, G.: Chemical composition, sources and formation mechanism of urban PM<sub>2.5</sub> in Southwest China: a case study at the beginning of 2023, Atmos. Chem. Phys., 24, 2803–2820, <ext-link xlink:href="https://doi.org/10.5194/acp-24-2803-2024" ext-link-type="DOI">10.5194/acp-24-2803-2024</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib76"><label>76</label><mixed-citation>Zhang, J., Ye, L., and Lai, Y.: Stock price prediction using CNN-BiLSTM-Attention model, Mathematics, 11, 1985, <ext-link xlink:href="https://doi.org/10.3390/math11091985" ext-link-type="DOI">10.3390/math11091985</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bib77"><label>77</label><mixed-citation>Zhao, C., Sun, Y., Yang, J., Li, J., Zhou, Y., Yang, Y., Fan, H., and Zhao, X.: Observational evidence and mechanisms of aerosol effects on precipitation, Sci. Bull., 69, 1569–1580, <ext-link xlink:href="https://doi.org/10.1016/j.scib.2024.03.014" ext-link-type="DOI">10.1016/j.scib.2024.03.014</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bib78"><label>78</label><mixed-citation>Zhao, J., Du, W., Zhang, Y., Wang, Q., Chen, C., Xu, W., Han, T., Wang, Y., Fu, P., Wang, Z., Li, Z., and Sun, Y.: Insights into aerosol chemistry during the 2015 China Victory Day parade: results from simultaneous measurements at ground level and 260 m in Beijing, Atmos. Chem. Phys., 17, 3215–3232, <ext-link xlink:href="https://doi.org/10.5194/acp-17-3215-2017" ext-link-type="DOI">10.5194/acp-17-3215-2017</ext-link>, 2017. </mixed-citation></ref>
      <ref id="bib1.bib79"><label>79</label><mixed-citation>Zhu, H., Yang, S., Zhao, H., Wang, Y., and Li, R.: Complex interplay of sulfate aerosols and meteorology conditions on precipitation and latent heat vertical structure, npj Clim. Atmos. Sci., 7, 191, <ext-link xlink:href="https://doi.org/10.1038/s41612-024-00743-w" ext-link-type="DOI">10.1038/s41612-024-00743-w</ext-link>, 2024.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>A Physics-Constrained Deep-Learning Framework based on Long-Term Remote-Sensing Data for Retrieving Vertical Distribution of PM<sub>2.5</sub> Chemical Components</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>1</label><mixed-citation>
      
Al-Faiz, M. Z., Ibrahim, A. A., and Hadi, S. M.: The effect of Z-Score
standardization (normalization) on binary input due the speed of learning in
back-propagation neural network, Iraqi J. Inf. Commun. Technol., 1, 42–48,
<a href="https://doi.org/10.31987/ijict.1.3.41" target="_blank">https://doi.org/10.31987/ijict.1.3.41</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>2</label><mixed-citation>
      
Alzubaidi, L., Zhang, J., Humaidi, A. J., Al-Dujaili, A., Duan, Y.,
Al-Shamma, O., Santamaría, J., Fadhel, M. A., Al-Amidie, M., and
Farhan, L.: Review of deep learning: concepts, CNN architectures,
challenges, applications, future directions, J. Big Data, 8, 53,
<a href="https://doi.org/10.1186/s40537-021-00444-8" target="_blank">https://doi.org/10.1186/s40537-021-00444-8</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>3</label><mixed-citation>
      
Ansmann, A., Bösenberg, J., Chaikovsky, A., Comerón, A., Eckhardt,
S., Eixmann, R., Freudenthaler, V., Ginoux, P., Komguem, L., Linné, H.,
Márquez, M. Á. L., Matthias, V., Mattis, I., Mitev, V., Müller,
D., Music, S., Nickovic, S., Pelon, J., Sauvage, L., Sobolewsky, P.,
Srivastava, M. K., Stohl, A., Torres, O., Vaughan, G., Wandinger, U., and
Wiegner, M.: Long-range transport of Saharan dust to northern Europe: The
11–16 October 2001 outbreak observed with EARLINET, J. Geophys. Res.:
Atmos., 108, 4783, <a href="https://doi.org/10.1029/2003JD003757" target="_blank">https://doi.org/10.1029/2003JD003757</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>4</label><mixed-citation>
      
Araki, S., Shimadera, H., and Shima, M.: Continuous estimations of daily
PM<sub>2.5</sub> chemical components from temporally sparse monitoring data using
a machine learning approach, Atmos. Pollut. Res., 13, 101580,
<a href="https://doi.org/10.1016/j.apr.2022.101580" target="_blank">https://doi.org/10.1016/j.apr.2022.101580</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>5</label><mixed-citation>
      
Blum, A. L. and Langley, P.: Selection of relevant features and examples in
machine learning, Artif. Intel., 97, 245–271,
<a href="https://doi.org/10.1016/S0004-3702(97)00063-5" target="_blank">https://doi.org/10.1016/S0004-3702(97)00063-5</a>, 1997.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>6</label><mixed-citation>
      
Cabello-Solorzano, K., Ortigosa de Araujo, I., Peña, M., Correia, L.,
and J. Tallón-Ballesteros, A.: The Impact of Data Normalization
on the Accuracy of Machine Learning Algorithms: A Comparative Analysis, 18th
International Conference on Soft Computing Models in Industrial and
Environmental Applications (SOCO 2023), Cham, 344–353,
<a href="https://doi.org/10.1007/978-3-031-42536-3_33" target="_blank">https://doi.org/10.1007/978-3-031-42536-3_33</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>7</label><mixed-citation>
      
Chatfield, R. B., Sorek-Hamer, M., Esswein, R. F., and Lyapustin, A.: Satellite mapping of PM<sub>2.5</sub> episodes in the wintertime San Joaquin Valley: a “static” model using column water vapor, Atmos. Chem. Phys., 20, 4379–4397, <a href="https://doi.org/10.5194/acp-20-4379-2020" target="_blank">https://doi.org/10.5194/acp-20-4379-2020</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>8</label><mixed-citation>
      
Chen, Z., Chen, D., Zhao, C., Kwan, M.-p., Cai, J., Zhuang, Y., Zhao, B.,
Wang, X., Chen, B., Yang, J., Li, R., He, B., Gao, B., Wang, K., and Xu, B.:
Influence of meteorological conditions on PM<sub>2.5</sub> concentrations across
China: A review of methodology and mechanism, Environ. Int., 139,
<a href="https://doi.org/10.1016/j.envint.2020.105558" target="_blank">https://doi.org/10.1016/j.envint.2020.105558</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>9</label><mixed-citation>
      
Deb, K., Pratap, A., Agarwal, S., and Meyarivan, T.: A fast and elitist
multiobjective genetic algorithm: NSGA-II, IEEE Trans. Evol. Comput., 6,
182–197, <a href="https://doi.org/10.1109/4235.996017" target="_blank">https://doi.org/10.1109/4235.996017</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>10</label><mixed-citation>
      
Dubey, R., Patra, A. K., and Nazneen: Vertical profile of particulate
matter: A review of techniques and methods, Air Qual. Atmos. Hlth., 15,
979–1010, <a href="https://doi.org/10.1007/s11869-022-01192-1" target="_blank">https://doi.org/10.1007/s11869-022-01192-1</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>11</label><mixed-citation>
      
Fang, Y., Ye, C., Wang, J., Wu, Y., Hu, M., Lin, W., Xu, F., and Zhu, T.: Relative humidity and O3 concentration as two prerequisites for sulfate formation, Atmos. Chem. Phys., 19, 12295–12307, <a href="https://doi.org/10.5194/acp-19-12295-2019" target="_blank">https://doi.org/10.5194/acp-19-12295-2019</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>12</label><mixed-citation>
      
Friedman, J. H., Bentley, J. L., and Finkel, R. A.: An algorithm for finding
best matches in logarithmic expected time, ACM T. Math. Software (TOMS), 3,
209–226, 1977.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>13</label><mixed-citation>
      
Gao, J., Wei, Y., Shi, G., Yu, H., Zhang, Z., Song, S., Wang, W., Liang, D.,
and Feng, Y.: Roles of RH, aerosol pH and sources in concentrations of
secondary inorganic aerosols, during different pollution periods, Atmos.
Environ., 241, 117770, <a href="https://doi.org/10.1016/j.atmosenv.2020.117770" target="_blank">https://doi.org/10.1016/j.atmosenv.2020.117770</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>14</label><mixed-citation>
      
Gelbart, M. A., Snoek, J., and Adams, R. P.: Bayesian optimization with
unknown constraints, arXiv [preprint], 1403, 5607,
<a href="https://doi.org/10.48550/arXiv.1403.5607" target="_blank">https://doi.org/10.48550/arXiv.1403.5607</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>15</label><mixed-citation>
      
Geng, G., Liu, Y., Liu, Y., Liu, S., Cheng, J., Yan, L., Wu, N., Hu, H.,
Tong, D., Zheng, B., Yin, Z., He, K., and Zhang, Q.: Efficacy of China's
clean air actions to tackle PM<sub>2.5</sub> pollution between 2013 and 2020, Nat.
Geosci., 17, 987–994, <a href="https://doi.org/10.1038/s41561-024-01540-z" target="_blank">https://doi.org/10.1038/s41561-024-01540-z</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>16</label><mixed-citation>
      
Guo, M. H., Xu, T. X., Liu, J. J., Liu, Z. N., Jiang, P. T., Mu, T. J.,
Zhang, S. H., Martin, R. R., Cheng, M. M., and Hu, S. M.: Attention
mechanisms in computer vision: A survey, Comput. Vis. Media, 8, 331–368,
<a href="https://doi.org/10.1007/s41095-022-0271-y" target="_blank">https://doi.org/10.1007/s41095-022-0271-y</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>17</label><mixed-citation>
      
Hara, Y., Nishizawa, T., Sugimoto, N., Osada, K., Yumimoto, K., Uno, I.,
Kudo, R., and Ishimoto, H.: Retrieval of Aerosol Components Using
Multi-Wavelength Mie-Raman Lidar and Comparison with Ground Aerosol
Sampling, Remote Sens., 10, <a href="https://doi.org/10.3390/rs10060937" target="_blank">https://doi.org/10.3390/rs10060937</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>18</label><mixed-citation>
      
Hou, L., Dai, Q., Song, C., Liu, B., Guo, F., Dai, T., Li, L., Liu, B., Bi,
X., Zhang, Y., and Feng, Y.: Revealing Drivers of Haze Pollution by
Explainable Machine Learning, Environ. Sci. Technol. Lett., 9, 112–119,
<a href="https://doi.org/10.1021/acs.estlett.1c00865" target="_blank">https://doi.org/10.1021/acs.estlett.1c00865</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>19</label><mixed-citation>
      
Ji, W., Wang, Y., and Zhuang, D.: Spatial distribution differences in
PM<sub>2.5</sub> concentration between heating and non-heating seasons in Beijing,
China, Environ. Pollut., 248, 574–583,
<a href="https://doi.org/10.1016/j.envpol.2019.01.002" target="_blank">https://doi.org/10.1016/j.envpol.2019.01.002</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>20</label><mixed-citation>
      
Jia, Z., Doherty, R. M., Ordóñez, C., Li, C., Wild, O., Jain, S., and Tang, X.: The impact of large-scale circulation on daily fine particulate matter (PM<sub>2.5</sub>) over major populated regions of China in winter, Atmos. Chem. Phys., 22, 6471–6487, <a href="https://doi.org/10.5194/acp-22-6471-2022" target="_blank">https://doi.org/10.5194/acp-22-6471-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>21</label><mixed-citation>
      
Kang, Y.-H., You, S., Bae, M., Kim, E., Son, K., Bae, C., Kim, Y., Kim,
B.-U., Kim, H. C., and Kim, S.: The impacts of COVID-19, meteorology, and
emission control policies on PM<sub>2.5</sub> drops in Northeast Asia, Sci. Rep.,
10, 22112, <a href="https://doi.org/10.1038/s41598-020-79088-2" target="_blank">https://doi.org/10.1038/s41598-020-79088-2</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>22</label><mixed-citation>
      
Kavianpour, P., Kavianpour, M., Jahani, E., and Ramezani, A.: A CNN-BiLSTM
model with attention mechanism for earthquake prediction, J. Supercomput.,
79, 19194–19226, <a href="https://doi.org/10.1007/s11227-023-05369-y" target="_blank">https://doi.org/10.1007/s11227-023-05369-y</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>23</label><mixed-citation>
      
Kim, S., Yang, J., Park, J., Song, I., Kim, D.-G., Jeon, K., Kim, H., and
Yi, S.-M.: Health effects of PM<sub>2.5</sub> constituents and source
contributions in major metropolitan cities, South Korea, Environ. Sci.
Pollut. Res., 29, 82873–82887, <a href="https://doi.org/10.1007/s11356-022-21592-1" target="_blank">https://doi.org/10.1007/s11356-022-21592-1</a>,
2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>24</label><mixed-citation>
      
Lee, Y. S., Choi, E., Park, M., Jo, H., Park, M., Nam, E., Kim, D. G., Yi,
S.-M., and Kim, J. Y.: Feature extraction and prediction of fine particulate
matter (PM<sub>2.5</sub>) chemical constituents using four machine learning
models, Expert Syst. Appl., 221, 119696,
<a href="https://doi.org/10.1016/j.eswa.2023.119696" target="_blank">https://doi.org/10.1016/j.eswa.2023.119696</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>25</label><mixed-citation>
      
Lei, L., Sun, Y., Ouyang, B., Qiu, Y., Xie, C., Tang, G., Zhou, W., He, Y.,
Wang, Q., Cheng, X., Fu, P., and Wang, Z.: Vertical Distributions of Primary
and Secondary Aerosols in Urban Boundary Layer: Insights into Sources,
Chemistry, and Interaction with Meteorology, Environ. Sci. Technol., 55,
4542–4552, <a href="https://doi.org/10.1021/acs.est.1c00479" target="_blank">https://doi.org/10.1021/acs.est.1c00479</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>26</label><mixed-citation>
      
Li, H., Yang, T., Du, Y., Tan, Y., and Wang, Z.: Interpreting hourly mass
concentrations of PM<sub>2.5</sub> chemical components with an optimal
deep-learning model, J. Environ. Sci., 151, 125–139,
<a href="https://doi.org/10.1016/j.jes.2024.03.037" target="_blank">https://doi.org/10.1016/j.jes.2024.03.037</a>, 2025a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>27</label><mixed-citation>
      
Li, H., Yang, T., Song, Y., Tian, P., He, J., Tan, Y., Tian, Y., Sun, Y.,
and Wang, Z.: Unveiling the intricate dynamics of PM<sub>2.5</sub> sulfate
aerosols in the urban boundary layer: A pioneering two-year vertical
profiling and machine learning-enhanced analysis in global Mega-City, Urban
Clim., 61, 102424, <a href="https://doi.org/10.1016/j.uclim.2025.102424" target="_blank">https://doi.org/10.1016/j.uclim.2025.102424</a>, 2025b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>28</label><mixed-citation>
      
Li, M., Zhang, Z., Yao, Q., Wang, T., Xie, M., Li, S., Zhuang, B., and Han, Y.: Nonlinear responses of particulate nitrate to NO<sub>x</sub> emission controls in the megalopolises of China, Atmos. Chem. Phys., 21, 15135–15152, <a href="https://doi.org/10.5194/acp-21-15135-2021" target="_blank">https://doi.org/10.5194/acp-21-15135-2021</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>29</label><mixed-citation>
      
Liang, L., Engling, G., Cheng, Y., Zhang, X., Sun, J., Xu, W., Liu, C.,
Zhang, G., Xu, H., Liu, X., and Ma, Q.: Influence of High Relative Humidity
on Secondary Organic Carbon: Observations at a Background Site in East
China, J. Meteor. Res., 33, 905–913,
<a href="https://doi.org/10.1007/s13351-019-8202-2" target="_blank">https://doi.org/10.1007/s13351-019-8202-2</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>30</label><mixed-citation>
      
Lin, G. Y., Chen, H. W., Chen, B. J., and Chen, S. C.: A machine learning
model for predicting PM<sub>2.5</sub> and nitrate concentrations based on
long-term water-soluble inorganic salts datasets at a road site station,
Chemosphere, 289, <a href="https://doi.org/10.1016/j.chemosphere.2021.133123" target="_blank">https://doi.org/10.1016/j.chemosphere.2021.133123</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>31</label><mixed-citation>
      
Liu, K., Zhang, Y., He, H., Xiao, H., Wang, S., Zhang, Y., Li, H., and Qian,
X.: Time series prediction of the chemical components of PM<sub>2.5</sub> based on
a deep learning model, Chemosphere, 342, 140153,
<a href="https://doi.org/10.1016/j.chemosphere.2023.140153" target="_blank">https://doi.org/10.1016/j.chemosphere.2023.140153</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>32</label><mixed-citation>
      
Liu, S., Geng, G., Xiao, Q., Zheng, Y., Liu, X., Cheng, J., and Zhang, Q.:
Tracking Daily Concentrations of PM<sub>2.5</sub> Chemical Composition in China
since 2000, Environ. Sci. Technol., 56, 16517–16527,
<a href="https://doi.org/10.1021/acs.est.2c06510" target="_blank">https://doi.org/10.1021/acs.est.2c06510</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>33</label><mixed-citation>
      
Liu, Z., Hu, B., Wang, L., Wu, F., Gao, W., and Wang, Y.: Seasonal and
diurnal variation in particulate matter (PM<sub>10</sub> and PM<sub>2.5</sub>) at an
urban site of Beijing: analyses from a 9-year study, Environ. Sci. Pollut.
Res., 22, 627–642, <a href="https://doi.org/10.1007/s11356-014-3347-0" target="_blank">https://doi.org/10.1007/s11356-014-3347-0</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>34</label><mixed-citation>
      
Lowenthal, D. H. and Kumar, N.: Evaluation of the IMPROVE Equation for
estimating aerosol light extinction, J. Air Waste Manage., 66, 726–737,
<a href="https://doi.org/10.1080/10962247.2016.1178187" target="_blank">https://doi.org/10.1080/10962247.2016.1178187</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>35</label><mixed-citation>
      
Lundberg, S. M., Erion, G., Chen, H., DeGrave, A., Prutkin, J. M., Nair, B.,
Katz, R., Himmelfarb, J., Bansal, N., and Lee, S.-I.: From local
explanations to global understanding with explainable AI for trees, Nat.
Mach. Intell., 2, 56–67, <a href="https://doi.org/10.1038/s42256-019-0138-9" target="_blank">https://doi.org/10.1038/s42256-019-0138-9</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>36</label><mixed-citation>
      
Lv, L., Wei, P., Li, J., and Hu, J.: Application of machine learning
algorithms to improve numerical simulation prediction of PM<sub>2.5</sub> and
chemical components, Atmos. Pollut. Res., 12, 101211,
<a href="https://doi.org/10.1016/j.apr.2021.101211" target="_blank">https://doi.org/10.1016/j.apr.2021.101211</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>37</label><mixed-citation>
      
Ma, T., Xiang, G., Shi, Y., and Liu, Y.: Horizontal in situ stresses prediction
using a CNN-BiLSTM-attention hybrid neural network, Geomech. Geophys.
Geo-energ. Geo-resour. 8, 152, <a href="https://doi.org/10.1007/s40948-022-00467-2" target="_blank">https://doi.org/10.1007/s40948-022-00467-2</a>,
2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>38</label><mixed-citation>
      
Matus, A. V., Nowottnick, E. P., Yorks, J. E., and da Silva, A. M.:
Enhancing surface PM<sub>2.5</sub> air quality estimates in GEOS using CATS lidar
data, Earth  Space Sci., 12, e2024EA004078,
<a href="https://doi.org/10.1029/2024EA004078" target="_blank">https://doi.org/10.1029/2024EA004078</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>39</label><mixed-citation>
      
Meng, X., Hand, J. L., Schichtel, B. A., and Liu, Y.: Space-time trends of
PM<sub>2.5</sub> constituents in the conterminous United States estimated by a
machine learning approach, 2005–2015, Environ. Int., 121, 1137–1147,
<a href="https://doi.org/10.1016/j.envint.2018.10.029" target="_blank">https://doi.org/10.1016/j.envint.2018.10.029</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>40</label><mixed-citation>
      
Menon, S., Hansen, J., Nazarenko, L., and Luo, Y. F.: Climate effects of
black carbon aerosols in China and India, Science, 297, 2250–2253,
<a href="https://doi.org/10.1126/science.1075159" target="_blank">https://doi.org/10.1126/science.1075159</a>, 2002.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>41</label><mixed-citation>
      
Miao, Y., Zhang, X., Che, H., and Liu, S.: Influence of Multi-Scale
Meteorological Processes on PM<sub>2.5</sub> Pollution in Wuhan, Central China,
Front. Environ. Sci., 10, <a href="https://doi.org/10.3389/fenvs.2022.918076" target="_blank">https://doi.org/10.3389/fenvs.2022.918076</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>42</label><mixed-citation>
      
Morgan, W. T., Allan, J. D., Bower, K. N., Capes, G., Crosier, J., Williams, P. I., and Coe, H.: Vertical distribution of sub-micron aerosol chemical composition from North-Western Europe and the North-East Atlantic, Atmos. Chem. Phys., 9, 5389–5401, <a href="https://doi.org/10.5194/acp-9-5389-2009" target="_blank">https://doi.org/10.5194/acp-9-5389-2009</a>, 2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>43</label><mixed-citation>
      
Nishizawa, T., Sugimoto, N., Matsui, I., Shimizu, A., and Okamoto, H.:
Algorithms to retrieve optical properties of three component aerosols from
two-wavelength backscatter and one-wavelength polarization lidar
measurements considering nonsphericity of dust, J. Quant. Spectrosc. Radiat.
Transfer, 112, 254–267, <a href="https://doi.org/10.1016/j.jqsrt.2010.06.002" target="_blank">https://doi.org/10.1016/j.jqsrt.2010.06.002</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>44</label><mixed-citation>
      
Nishizawa, T., Sugimoto, N., Matsui, I., Shimizu, A., Hara, Y., Itsushi, U.,
Yasunaga, K., Kudo, R., and Kim, S.-W.: Ground-based network observation
using Mie–Raman lidars and multi-wavelength Raman lidars and algorithm to
retrieve distributions of aerosol components, J. Quant. Spectrosc. Radiat.
Transfer, 188, 79–93, <a href="https://doi.org/10.1016/j.jqsrt.2016.06.031" target="_blank">https://doi.org/10.1016/j.jqsrt.2016.06.031</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>45</label><mixed-citation>
      
Niu, Y., Li, X., Qi, B., and Du, R.: Variation in the concentrations of
atmospheric PM<sub>2.5</sub> and its main chemical components in an eastern China
city (Hangzhou) since the release of the Air Pollution Prevention and
Control Action Plan in 2013, Air Qual. Atmos. Hlth., 15, 321–337,
<a href="https://doi.org/10.1007/s11869-021-01107-6" target="_blank">https://doi.org/10.1007/s11869-021-01107-6</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>46</label><mixed-citation>
      
O'Shea, K. and Nash, R.: An Introduction to Convolutional Neural
Networks, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.1511.08458" target="_blank">https://doi.org/10.48550/arXiv.1511.08458</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>47</label><mixed-citation>
      
Pitchford, M., Malm, W., Schichtel, B., Kumar, N., Lowenthal, D., and Hand,
J.: Revised Algorithm for Estimating Light Extinction from IMPROVE Particle
Speciation Data, J. Air Waste Manage., 57, 1326–1336,
<a href="https://doi.org/10.3155/1047-3289.57.11.1326" target="_blank">https://doi.org/10.3155/1047-3289.57.11.1326</a>, 2007.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>48</label><mixed-citation>
      
Rasmussen, C. E.: Gaussian Processes in Machine Learning, in: Advanced
Lectures on Machine Learning: ML Summer Schools 2003, Canberra, Australia,
2–14 February  2003, Tübingen, Germany, 4–16  August  2003, Revised
Lectures, edited by: Bousquet, O., von Luxburg, U., and Rätsch, G.,
Springer Berlin Heidelberg, Berlin, Heidelberg, 63–71,
<a href="https://doi.org/10.1007/978-3-540-28650-9_4" target="_blank">https://doi.org/10.1007/978-3-540-28650-9_4</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>49</label><mixed-citation>
      
Roostaei, V., Gharibzadeh, F., Shamsipour, M., Faridi, S., and Hassanvand,
M. S.: Vertical distribution of ambient air pollutants (PM<sub>2.5</sub>,
PM<sub>10</sub>, NO<sub><i>X</i></sub>, and NO<sub>2</sub>); A systematic review, Heliyon, 10,
e39726, <a href="https://doi.org/10.1016/j.heliyon.2024.e39726" target="_blank">https://doi.org/10.1016/j.heliyon.2024.e39726</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>50</label><mixed-citation>
      
Shahriari, B., Swersky, K., Wang, Z., Adams, R. P., and Freitas, N. d.:
Taking the Human Out of the Loop: A Review of Bayesian Optimization,
Proceedings of the IEEE, 104, 148–175,
<a href="https://doi.org/10.1109/JPROC.2015.2494218" target="_blank">https://doi.org/10.1109/JPROC.2015.2494218</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>51</label><mixed-citation>
      
Shan, L., Liu, Y., Tang, M., Yang, M., and Bai, X.: CNN-BiLSTM hybrid neural
networks with attention mechanism for well log prediction, J. Petrol. Sci.
Eng., 205, 108838, <a href="https://doi.org/10.1016/j.petrol.2021.108838" target="_blank">https://doi.org/10.1016/j.petrol.2021.108838</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>52</label><mixed-citation>
      
Siami-Namini, S., Tavakoli, N., and Namin, A. S.: The Performance of LSTM
and BiLSTM in Forecasting Time Series, 2019 IEEE Int. Conf. on Big Data (Big
Data), Los Angeles, CA, USA, 3285–3292,
<a href="https://doi.org/10.1109/BigData47090.2019.9005997" target="_blank">https://doi.org/10.1109/BigData47090.2019.9005997</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>53</label><mixed-citation>
      
Song, Q., Huang, L., Zhang, Y., Li, Z., Wang, S., Zhao, B., Yin, D., Ma, M.,
Li, S., Liu, B., Zhu, L., Chang, X., Gao, D., Jiang, Y., Dong, Z., Shi, H.,
and Hao, J.: Driving Factors of PM<sub>2.5</sub> Pollution Rebound in North China
Plain in Early 2023, Environ. Sci. Technol. Lett., 12, 305–312,
<a href="https://doi.org/10.1021/acs.estlett.4c01153" target="_blank">https://doi.org/10.1021/acs.estlett.4c01153</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>54</label><mixed-citation>
      
Srinivas, N. and Deb, K.: Muiltiobjective Optimization Using Nondominated
Sorting in Genetic Algorithms, Evol. Comput., 2, 221–248,
<a href="https://doi.org/10.1162/evco.1994.2.3.221" target="_blank">https://doi.org/10.1162/evco.1994.2.3.221</a>, 1994.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>55</label><mixed-citation>
      
Sugimoto, N., Uno, I., Nishikawa, M., Shimizu, A., Matsui, I., Dong, X.,
Chen, Y., and Quan, H.: Record heavy Asian dust in Beijing in 2002:
Observations and model analysis of recent events, Geophys. Res. Lett., 30,
<a href="https://doi.org/10.1029/2002gl016349" target="_blank">https://doi.org/10.1029/2002gl016349</a>, 2003.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>56</label><mixed-citation>
      
Sugimoto, N., Shimizu, A., Matsui, I., Uno, I., Arao, K., Dong, X., Zhao,
S., Zhou, J., and Lee, C.-H.: Study of Asian Dust Phenomena in 2001–2003
Using A Network of Continuously Operated Polarization Lidars, Water, Air,
&amp; Soil Pollution: Focus, 5, 145–157,
<a href="https://doi.org/10.1007/s11267-005-0732-1" target="_blank">https://doi.org/10.1007/s11267-005-0732-1</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>57</label><mixed-citation>
      
Sun, Y., Du, W., Wang, Q., Zhang, Q., Chen, C., Chen, Y., Chen, Z., Fu, P.,
Wang, Z., Gao, Z., and Worsnop, D. R.: Real-Time Characterization of Aerosol
Particle Composition above the Urban Canopy in Beijing: Insights into the
Interactions between the Atmospheric Boundary Layer and Aerosol Chemistry,
Environ. Sci. Technol., 49, 11340–11347,
<a href="https://doi.org/10.1021/acs.est.5b02373" target="_blank">https://doi.org/10.1021/acs.est.5b02373</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>58</label><mixed-citation>
      
Tan, T., Hu, M., Li, M., Guo, Q., Wu, Y., Fang, X., Gu, F., Wang, Y., and
Wu, Z.: New insight into PM<sub>2.5</sub> pollution patterns in Beijing based on
one-year measurement of chemical compositions, Sci. Total Environ., 621,
734–743, <a href="https://doi.org/10.1016/j.scitotenv.2017.11.208" target="_blank">https://doi.org/10.1016/j.scitotenv.2017.11.208</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>59</label><mixed-citation>
      
Tao, J., Zhang, L., Cao, J., and Zhang, R.: A review of current knowledge concerning PM<sub>2.5</sub> chemical composition, aerosol optical properties and their relationships across China, Atmos. Chem. Phys., 17, 9485–9518, <a href="https://doi.org/10.5194/acp-17-9485-2017" target="_blank">https://doi.org/10.5194/acp-17-9485-2017</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>60</label><mixed-citation>
      
Tao, Z., Wang, Z., Yang, S., Shan, H., Ma, X., Zhang, H., Zhao, S., Liu, D., Xie, C., and Wang, Y.: Profiling the PM<sub>2.5</sub> mass concentration vertical distribution in the boundary layer, Atmos. Meas. Tech., 9, 1369–1376, <a href="https://doi.org/10.5194/amt-9-1369-2016" target="_blank">https://doi.org/10.5194/amt-9-1369-2016</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>61</label><mixed-citation>
      
Tesche, M., Ansmann, A., MüLler, D., Althausen, D., Mattis, I., Heese,
B., Freudenthaler, V., Wiegner, M., Esselborn, M., Pisani, G., and
Knippertz, P.: Vertical profiling of Saharan dust with Raman lidars and
airborne HSRL in southern Morocco during SAMUM, Tellus B: Chem. Phys.
Meteor., 61, 144–164, <a href="https://doi.org/10.1111/j.1600-0889.2008.00390.x" target="_blank">https://doi.org/10.1111/j.1600-0889.2008.00390.x</a>,
2009.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>62</label><mixed-citation>
      
Toth, T. D., Zhang, J., Vaughan, M. A., Reid, J. S., and Campbell, J. R.:
Retrieving particulate matter concentrations over the contiguous United
States using CALIOP observations, Atmos. Environ., 274, 118979,
<a href="https://doi.org/10.1016/j.atmosenv.2022.118979" target="_blank">https://doi.org/10.1016/j.atmosenv.2022.118979</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>63</label><mixed-citation>
      
Verma, S., Pant, M., and Snasel, V.: A Comprehensive Review on NSGA-II for
Multi-Objective Combinatorial Optimization Problems, IEEE Access, 9,
57757–57791, <a href="https://doi.org/10.1109/ACCESS.2021.3070634" target="_blank">https://doi.org/10.1109/ACCESS.2021.3070634</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>64</label><mixed-citation>
      
Wang, F., Yang, T., Wang, Z., Wang, H., Chen, X., Sun, Y., Li, J., Tang, G., and Chai, W.: Algorithm for vertical distribution of boundary layer aerosol components in remote-sensing data, Atmos. Meas. Tech., 15, 6127–6144, <a href="https://doi.org/10.5194/amt-15-6127-2022" target="_blank">https://doi.org/10.5194/amt-15-6127-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>65</label><mixed-citation>
      
Wang, J., Li, J., Ye, J., Zhao, J., Wu, Y., Hu, J., Liu, D., Nie, D., Shen,
F., Huang, X., Huang, D. D., Ji, D., Sun, X., Xu, W., Guo, J., Song, S.,
Qin, Y., Liu, P., Turner, J. R., Lee, H. C., Hwang, S., Liao, H., Martin, S.
T., Zhang, Q., Chen, M., Sun, Y., Ge, X., and Jacob, D. J.: Fast sulfate
formation from oxidation of SO<sub>2</sub> by NO<sub>2</sub> and HONO observed in
Beijing haze, Nat. Commun., 11, 2844,
<a href="https://doi.org/10.1038/s41467-020-16683-x" target="_blank">https://doi.org/10.1038/s41467-020-16683-x</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>66</label><mixed-citation>
      
Wang, S. and Zhang, Y.: An attention-based CNN model integrating
observational and simulation data for high-resolution spatial estimation of
urban air quality, Atmos. Environ., 340, 120921,
<a href="https://doi.org/10.1016/j.atmosenv.2024.120921" target="_blank">https://doi.org/10.1016/j.atmosenv.2024.120921</a>, 2025.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib67"><label>67</label><mixed-citation>
      
Wang, X., Zhang, R., Tan, Y., and Yu, W.: Dominant synoptic patterns associated with the decay process of PM<sub>2.5</sub> pollution episodes around Beijing, Atmos. Chem. Phys., 21, 2491–2508, <a href="https://doi.org/10.5194/acp-21-2491-2021" target="_blank">https://doi.org/10.5194/acp-21-2491-2021</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib68"><label>68</label><mixed-citation>
      
Welton, E., Campbell, J., Spinhirne, J., and Scott, V. S.: Global monitoring
of clouds and aerosols using a network of micropulse lidar systems, Second
International Asia-Pacific Symposium on Remote Sensing of the Atmosphere,
Environment, and Space, SPIE, <a href="https://doi.org/10.1117/12.417040" target="_blank">https://doi.org/10.1117/12.417040</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib69"><label>69</label><mixed-citation>
      
Wu, J., Chen, X.-Y., Zhang, H., Xiong, L.-D., Lei, H., and Deng, S.-H.:
Hyperparameter Optimization for Machine Learning Models Based on Bayesian
Optimizationb, J. Electron. Sci.  Technol., 17, 26–40,
<a href="https://doi.org/10.11989/JEST.1674-862X.80904120" target="_blank">https://doi.org/10.11989/JEST.1674-862X.80904120</a>, 2019a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib70"><label>70</label><mixed-citation>
      
Wu, L. B., Ren, H., Wang, P., Chen, J., Fang, Y. T., Hu, W., Ren, L. J.,
Deng, J. J., Song, Y., Li, J., Sun, Y. L., Wang, Z. F., Liu, C. Q., Ying,
Q., and Fu, P. Q.: Aerosol Ammonium in the Urban Boundary Layer in Beijing:
Insights from Nitrogen Isotope Ratios and Simulations in Summer 2015,
Environ. Sci. Technol. Lett., 6, 389–395,
<a href="https://doi.org/10.1021/acs.estlett.9b00328" target="_blank">https://doi.org/10.1021/acs.estlett.9b00328</a>, 2019b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib71"><label>71</label><mixed-citation>
      
Xu, Y., Zhu, B., Shi, S., and Huang Y.: Two Inversion Layers and Their
Impacts on PM<sub>2.5</sub> Concentration over the Yangtze River Delta, China, J.
Appl. Meteor. Climatol., 58, 2349–2362,
<a href="https://doi.org/10.1175/JAMC-D-19-0008.1" target="_blank">https://doi.org/10.1175/JAMC-D-19-0008.1</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib72"><label>72</label><mixed-citation>
      
Yang, M. and Wang, J.: Adaptability of Financial Time Series Prediction
Based on BiLSTM, Procedia Comput. Sci., 199, 18–25,
<a href="https://doi.org/10.1016/j.procs.2022.01.003" target="_blank">https://doi.org/10.1016/j.procs.2022.01.003</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib73"><label>73</label><mixed-citation>
      
Yang, T., Li, H., Xu, W., Song, Y., Xu, L., Wang, H., Wang, F., Sun, Y.,
Wang, Z., and Fu, P.: Strong Impacts of Regional Atmospheric Transport on
the Vertical Distribution of Aerosol Ammonium over Beijing, Environ. Sci.
Technol. Lett., 11, 29–34, <a href="https://doi.org/10.1021/acs.estlett.3c00791" target="_blank">https://doi.org/10.1021/acs.estlett.3c00791</a>,
2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib74"><label>74</label><mixed-citation>
      
Yang, T., Wang, H., Li, H., Guo, X., Wang, D., Chen, X., Wang, F., Xin, J.,
Sun, Y., and Wang, Z.: Quantitative attribution of wintertime haze in
coastal east China to local emission and regional intrusion under a stagnant
internal boundary layer, Atmos. Environ., 276,
<a href="https://doi.org/10.1016/j.atmosenv.2022.119006" target="_blank">https://doi.org/10.1016/j.atmosenv.2022.119006</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib75"><label>75</label><mixed-citation>
      
Zhang, J., Su, Y., Chen, C., Guo, W., Tan, Q., Feng, M., Song, D., Jiang, T., Chen, Q., Li, Y., Li, W., Wang, Y., Huang, X., Han, L., Wu, W., and Wang, G.: Chemical composition, sources and formation mechanism of urban PM<sub>2.5</sub> in Southwest China: a case study at the beginning of 2023, Atmos. Chem. Phys., 24, 2803–2820, <a href="https://doi.org/10.5194/acp-24-2803-2024" target="_blank">https://doi.org/10.5194/acp-24-2803-2024</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib76"><label>76</label><mixed-citation>
      
Zhang, J., Ye, L., and Lai, Y.: Stock price prediction using
CNN-BiLSTM-Attention model, Mathematics, 11, 1985,
<a href="https://doi.org/10.3390/math11091985" target="_blank">https://doi.org/10.3390/math11091985</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib77"><label>77</label><mixed-citation>
      
Zhao, C., Sun, Y., Yang, J., Li, J., Zhou, Y., Yang, Y., Fan, H., and Zhao,
X.: Observational evidence and mechanisms of aerosol effects on
precipitation, Sci. Bull., 69, 1569–1580,
<a href="https://doi.org/10.1016/j.scib.2024.03.014" target="_blank">https://doi.org/10.1016/j.scib.2024.03.014</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib78"><label>78</label><mixed-citation>
      
Zhao, J., Du, W., Zhang, Y., Wang, Q., Chen, C., Xu, W., Han, T., Wang, Y., Fu, P., Wang, Z., Li, Z., and Sun, Y.: Insights into aerosol chemistry during the 2015 China Victory Day parade: results from simultaneous measurements at ground level and 260&thinsp;m in Beijing, Atmos. Chem. Phys., 17, 3215–3232, <a href="https://doi.org/10.5194/acp-17-3215-2017" target="_blank">https://doi.org/10.5194/acp-17-3215-2017</a>, 2017.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib79"><label>79</label><mixed-citation>
      
Zhu, H., Yang, S., Zhao, H., Wang, Y., and Li, R.: Complex interplay of
sulfate aerosols and meteorology conditions on precipitation and latent heat
vertical structure, npj Clim. Atmos. Sci., 7, 191,
<a href="https://doi.org/10.1038/s41612-024-00743-w" target="_blank">https://doi.org/10.1038/s41612-024-00743-w</a>, 2024.

    </mixed-citation></ref-html>--></article>
