<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">AMT</journal-id><journal-title-group>
    <journal-title>Atmospheric Measurement Techniques</journal-title>
    <abbrev-journal-title abbrev-type="publisher">AMT</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Atmos. Meas. Tech.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1867-8548</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/amt-18-4771-2025</article-id><title-group><article-title>Machine learning data fusion for high spatio-temporal resolution PM<sub>2.5</sub></article-title><alt-title>Machine learning data fusion for high spatio-temporal resolution PM<sub>2.5</sub></alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Porcheddu</surname><given-names>Andrea</given-names></name>
          <email>andrea.porcheddu@uef.fi</email>
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Kolehmainen</surname><given-names>Ville</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-5621-795X</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Lähivaara</surname><given-names>Timo</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Lipponen</surname><given-names>Antti</given-names></name>
          
        <ext-link>https://orcid.org/0000-0002-6902-9974</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>Department of Technical Physics, University of Eastern Finland, Kuopio, Finland</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>Atmospheric Research Centre of Eastern Finland, Finnish Meteorological Institute, Kuopio, Finland</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Andrea Porcheddu (andrea.porcheddu@uef.fi)</corresp></author-notes><pub-date><day>25</day><month>September</month><year>2025</year></pub-date>
      
      <volume>18</volume>
      <issue>18</issue>
      <fpage>4771</fpage><lpage>4789</lpage>
      <history>
        <date date-type="received"><day>20</day><month>December</month><year>2024</year></date>
           <date date-type="rev-request"><day>14</day><month>February</month><year>2025</year></date>
           <date date-type="rev-recd"><day>28</day><month>July</month><year>2025</year></date>
           <date date-type="accepted"><day>30</day><month>July</month><year>2025</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2025 Andrea Porcheddu et al.</copyright-statement>
        <copyright-year>2025</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://amt.copernicus.org/articles/18/4771/2025/amt-18-4771-2025.html">This article is available from https://amt.copernicus.org/articles/18/4771/2025/amt-18-4771-2025.html</self-uri><self-uri xlink:href="https://amt.copernicus.org/articles/18/4771/2025/amt-18-4771-2025.pdf">The full text article is available as a PDF file from https://amt.copernicus.org/articles/18/4771/2025/amt-18-4771-2025.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e131">Understanding PM<sub>2.5</sub> variability at fine scale is crucial to assess urban pollution impact on the population and to inform the policy-making process. PM<sub>2.5</sub> in-situ measurements at ground level cannot offer gapless spatial coverage, while current satellite retrievals generally cannot offer both high-spatial and high-temporal resolution, with night-time estimation posing further challenges. This study tackles these difficulties, introducing an innovative deep learning data fusion method to estimate hourly PM<sub>2.5</sub> maps using a grid with cell size 100 m <inline-formula><mml:math id="M6" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 100 m on urban areas. We combine low resolution geophysical model data, high resolution geographical indicators, PM<sub>2.5</sub> in-situ ground stations measurements and PM<sub>2.5</sub> retrieved at satellite overpass. To simultaneously treat spatial and temporal correlations in our data, we deploy a 3D U-Net based neural network model. To evaluate the model, we select the city of Paris, France, in the year 2019 as our study region and time. Quantitative assessment of the model is carried out using the ground station data with a leave-one-out cross-validation approach. Our method outperforms MERRA-2 PM<sub>2.5</sub> estimates, predicting PM<sub>2.5</sub> hourly (<inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M12" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.51, RMSE <inline-formula><mml:math id="M13" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 6.58 <inline-formula><mml:math id="M14" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>), daily (<inline-formula><mml:math id="M16" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M17" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.65, RMSE <inline-formula><mml:math id="M18" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 4.92 <inline-formula><mml:math id="M19" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>), and monthly (<inline-formula><mml:math id="M21" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M22" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 0.87, RMSE <inline-formula><mml:math id="M23" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 2.87 <inline-formula><mml:math id="M24" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>). The proposed approach and its possible future developments can be highly beneficial for PM<sub>2.5</sub> exposure and regulation studies at fine suburban scale.</p>
  </abstract>
    
<funding-group>
<award-group id="gs1">
<funding-source>European Space Agency</funding-source>
<award-id>4000137651/22/I-DT-lr</award-id>
</award-group>
<award-group id="gs2">
<funding-source>Research Council of Finland</funding-source>
<award-id>353084</award-id>
<award-id>358944</award-id>
<award-id>321761</award-id>
</award-group>
</funding-group>
</article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e361">One of the key indicators in air quality monitoring and regulation is PM<sub>2.5</sub> which is the concentration  of particulate matter (PM) with an aerodynamic diameter less than 2.5 <inline-formula><mml:math id="M28" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>m in cubic meter of air (<inline-formula><mml:math id="M29" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>). PM<sub>2.5</sub> has different chemical compositions and its emissions originate from different natural and anthropogenic sources such as fuel combustion, wildfires, and sea salt. From the epidemiological point of view, high PM<sub>2.5</sub> levels have been connected to many illnesses, such as stroke and cardiovascular and respiratory diseases <xref ref-type="bibr" rid="bib1.bibx26 bib1.bibx7 bib1.bibx34" id="paren.1"/>. The pathogenicity of fine particulate matter pollution makes it one of the biggest environmental health risks as over 90 % of the world's population lives in areas with annual mean PM<sub>2.5</sub> levels exceeding the new WHO 2021 air quality guideline of 5 <inline-formula><mml:math id="M34" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> <xref ref-type="bibr" rid="bib1.bibx12" id="paren.2"/>.</p>
      <p id="d2e455">PM<sub>2.5</sub> and other pollutants can be measured with high accuracy by in-situ ground station networks. However, the existing monitoring sites are sparsely located and mostly in developed countries, typically few stations in a large metropolitan area producing accurate measurements representing the conditions in the proximity of the ground stations. Despite some spatial interpolation techniques could be used <xref ref-type="bibr" rid="bib1.bibx8 bib1.bibx15" id="paren.3"/> to estimate PM<sub>2.5</sub> over larger urban areas from the point-like measurements obtained by these ground stations, they do not alone permit accurate spatially distributed estimates for epidemiological studies and regulation at a suburban level scale. Aiming at a more appropriate spatial coverage and resolution, PM<sub>2.5</sub> can also be estimated by using airborne remote-sensing techniques, in particular  satellite retrievals. In satellite remote sensing, PM<sub>2.5</sub> estimates are typically based on Aerosol Optical Depth (AOD), a quantity expressing electromagnetic radiation extinction through a column of air at a given wavelength. AOD is a columnar optical quantity while PM<sub>2.5</sub> is a concentration of particles at ground level. For AOD to PM<sub>2.5</sub>  conversion, the estimation utilizes auxiliary measurement and model data such as aerosol vertical distribution and metereological variables <xref ref-type="bibr" rid="bib1.bibx5 bib1.bibx33" id="paren.4"/>. Nowadays many AOD satellite products exist with different spatial and temporal resolution. Different studies used low orbiting satellites (e.g. MODIS product; <xref ref-type="bibr" rid="bib1.bibx17" id="altparen.5"/>) that have one or two overpasses per day or geostationary satellites (e.g. AHI product; <xref ref-type="bibr" rid="bib1.bibx2" id="altparen.6"/>) giving sub-hourly estimates. Instruments on low orbiting satellites have generally higher spatial resolution than geostationary ones. Although giving high spatial resolution PM<sub>2.5</sub> estimates, low orbiting satellites products have low temporal resolution (1–2 snapshots per day) and retrieving information on night-time aerosols is a challenging task for the development of geostationary satellites products. Reanalysis models such as MERRA-2 <xref ref-type="bibr" rid="bib1.bibx29" id="paren.7"/> and CAMS <xref ref-type="bibr" rid="bib1.bibx13" id="paren.8"/>, and forecast models such as GEOS-CF <xref ref-type="bibr" rid="bib1.bibx14" id="paren.9"/> offer hourly PM<sub>2.5</sub> available globally. However, the spatial resolution of these PM<sub>2.5</sub> maps is low (tens of kilometers) for higher resolution studies such as distribution of pollution at suburban levels.</p>
      <p id="d2e562">In recent years, numerous machine learning approaches have been investigated and shown to be effective for air quality monitoring and PM<sub>2.5</sub> forecasting. Several deep learning models leverage both spatial and temporal dependencies in meteorological and aerosol data to enhance prediction performance. For instance, a study conducted across the Greater Los Angeles area employed Graph Convolutional Networks (GCNs) and Convolutional Long Short-Term Memory (ConvLSTM) models to integrate satellite remote sensing data with ground-based monitoring, enabling accurate prediction of PM<sub>2.5</sub> concentrations <xref ref-type="bibr" rid="bib1.bibx22" id="paren.10"/>. Another study focused on the Seoul region combined air quality and meteorological data using kriging interpolation and a hybrid ConvLSTM-DNN model to generate PM<sub>2.5</sub> concentration maps <xref ref-type="bibr" rid="bib1.bibx15" id="paren.11"/>.</p>
      <p id="d2e598">To estimate PM<sub>2.5</sub>, we recently proposed a method <xref ref-type="bibr" rid="bib1.bibx27" id="paren.12"/> leveraging the Sentinel-3 POPCORN AOD product <xref ref-type="bibr" rid="bib1.bibx18" id="paren.13"/>. The POPCORN AOD is a post-process corrected version of Sentinel-3 SYNERGY land AOD, characterized by a high spatial resolution on a grid with cell size 300 m <inline-formula><mml:math id="M49" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 300 m and derived using a feed-forward neural network trained on AERONET-collocated data. This enhanced AOD product provides accurate spectral aerosol information for five regions of interest (Central Europe, Eastern USA, Western USA, Southern Africa, and India) for the year 2019, making it a valuable input for air quality estimation models. To post-process correct the MERRA-2 AOD-to-PM<sub>2.5</sub> conversion ratio, we deployed an ensemble of deep neural networks for a fusion of collocated ground station in-situ PM<sub>2.5</sub> data, MERRA-2 reanalysis model AOD and PM<sub>2.5</sub> data, spectral AERONET AOD, satellite-observed spectral top-of-atmosphere reflectances, and meteorology data. We also used various high-resolution geographical indicators representing, e.g., population density and land surface elevation. The deep learning model was used for estimation of PM<sub>2.5</sub> on a grid with cell size 100 m <inline-formula><mml:math id="M54" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 100 m from low orbiting satellite images, producing 1–2 daily per overpass snapshots of high-resolution PM<sub>2.5</sub> data where AOD data was available.</p>
      <p id="d2e677">In this study, we have two research questions. How could we obtain PM<sub>2.5</sub> maps offering large (e.g. metropolitan level) spatial coverage with both high spatial and temporal resolution? Considering satellite derived PM<sub>2.5</sub> maps where AOD data is missing, e.g. because of cloud covering, how can we estimate PM<sub>2.5</sub> at those locations? To address these questions, we propose a novel deep learning based data fusion method to produce hourly PM<sub>2.5</sub> estimates on a grid with cell size 100 m <inline-formula><mml:math id="M60" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 100 m. We use a 3D U-Net architecture <xref ref-type="bibr" rid="bib1.bibx6" id="paren.14"/> to produce 24 h sequences of hourly PM<sub>2.5</sub> maps. The model is trained to yield a small <inline-formula><mml:math id="M62" display="inline"><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub></mml:mrow></mml:math></inline-formula>-misfit with the PM<sub>2.5</sub> estimates obtained during satellite overpasses in our previous study <xref ref-type="bibr" rid="bib1.bibx27" id="paren.15"/>, as well as with available ground station data. As inputs, we utilize 24 h sequences of geophysical model data (MERRA-2) providing low-resolution maps of meteorological and aerosol-related indicators (1 h temporal resolution), and high-resolution geographical indicator maps (1-month temporal resolution). This allows the model to generate hourly PM<sub>2.5</sub> outputs for the entire 24 h period covered by the inputs. The model is trained on data for the year 2019 in the city of Paris, France, and assessed against ground station data with a leave one out cross validation approach.</p>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Data</title>
      <p id="d2e776">This section describes the data used in the proposed deep learning based data fusion for high resolution PM<sub>2.5</sub>. The proposed approach is tested using data from Paris, France, for the year 2019. NOODLESALAD PM<sub>2.5</sub> and OpenAQ PM<sub>2.5</sub> data are used as target to train and test our model. MERRA-2 data and high-resolution geographical indicators are utilized as input features. Since our satellite-based target variable (NOODLESALAD PM<sub>2.5</sub>) is represented on a grid with cell size 100 m <inline-formula><mml:math id="M69" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 100 m, we train the model on the same grid, considering that some input features are on coarser and others on finer spatial representations. With this approach the model represents spatial patterns at the target data scale when fusing information from input data with different spatial scales. All the input features are listed in Table <xref ref-type="table" rid="TA1"/> in the Appendix. It is important to notice that other similar data sources could be utilized with our methodology.</p>
<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>NOODLESALAD PM<sub>2.5</sub></title>
      <p id="d2e841">NOODLESALAD PM<sub>2.5</sub> <xref ref-type="bibr" rid="bib1.bibx27" id="paren.16"/> retrievals are obtained applying a deep learning based post-process correction approach to the MERRA-2 AOD-to-PM<sub>2.5</sub> conversion ratio. The post-process corrected AOD-to-PM<sub>2.5</sub> conversion ratio is utilized to map high resolution POPCORN SENTINEL-3 SYNERGY AOD estimate <xref ref-type="bibr" rid="bib1.bibx18" id="paren.17"/> to high resolution PM<sub>2.5</sub> estimate. The post-process correction of MERRA-2 AOD-to-PM<sub>2.5</sub> conversion ratio is carried out deploying an ensemble of fully-connected feed-forward neural networks and a fusion of surface in-situ PM<sub>2.5</sub> observations, MERRA-2 reanalysis model AOD and PM<sub>2.5</sub> data, spectral AERONET AOD, satellite-observed spectral top-of-atmosphere reflectances, meteorology data, and various high-resolution geographical indicators. The ensemble technique leads to a distribution of predictions for a single PM<sub>2.5</sub> estimate. The median of the ensemble is considered as the PM<sub>2.5</sub> estimate and the width of the distribution is regarded as an uncertainty related to the machine learning model training (model uncertainty). NOODLESALAD PM<sub>2.5</sub> offers high resolution on a grid with cell size 100 m <inline-formula><mml:math id="M81" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 100 m and is currently available for Sentinel-3A and 3B overpasses, covering Central Europe for the year 2019. The two Sentinel-3 satellites currently flying provide revisit times of less than two days for OLCI and less than one day for the SLSTR instrument at equator. Swath width of the OLCI instrument is 1270 km. SLSTR swath width is 1420 km for the nadir view and 750 km for the oblique view.</p>
      <p id="d2e949">Evaluation metrics for PM<sub>2.5</sub> at satellite overpass (<inline-formula><mml:math id="M83" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.55</mml:mn></mml:mrow></mml:math></inline-formula>, RMSE <inline-formula><mml:math id="M84" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 6.2 <inline-formula><mml:math id="M85" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) and PM<sub>2.5</sub> monthly averages (<inline-formula><mml:math id="M88" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0.72</mml:mn></mml:mrow></mml:math></inline-formula>, RMSE <inline-formula><mml:math id="M89" display="inline"><mml:mo>=</mml:mo></mml:math></inline-formula> 3.7 <inline-formula><mml:math id="M90" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>) show good agreement between NOODLESALAD PM<sub>2.5</sub> and OpenAQ ground stations data <xref ref-type="bibr" rid="bib1.bibx27" id="paren.18"/>. Given the better spatial coverage compared to ground stations and the high spatial resolution at satellite overpass, we utilize NOODLESALAD PM<sub>2.5</sub> to inform the model about PM<sub>2.5</sub> fine spatial distribution. In this work, we consider NOODLESALAD PM<sub>2.5</sub> retrievals in Paris, France, in 2019, and utilize them as part of the target data to train our model.</p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>OpenAQ</title>
      <p id="d2e1103">OpenAQ (<uri>https://openaq.org/</uri>, last access: 13 April 2023) is an open-access database for ground stations air quality data. In this study, we utilize OpenAQ as our source for surface in-situ PM<sub>2.5</sub> observations. OpenAQ offers pointwise air quality measurement data from thousands of stations. The temporal resolution of the data varies by station, with 1 h and daily observations commonly available. Figure <xref ref-type="fig" rid="F1"/> shows a map of OpenAQ stations that provide hourly data within our region of interest. We discard PM<sub>2.5</sub> observations when they are greater than the calculated upper fence <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">3</mml:mn><mml:mo>+</mml:mo><mml:mn mathvariant="normal">6</mml:mn><mml:mo>×</mml:mo><mml:mo>(</mml:mo><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">3</mml:mn><mml:mo>-</mml:mo><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">1</mml:mn><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> (where <inline-formula><mml:math id="M99" display="inline"><mml:mrow><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:mi>Q</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> are respectively the third and first quartiles of the PM<sub>2.5</sub> distribution), regarding them as outliers. This step was carried out to filter extreme outliers, which can be caused by exceptional events or ground station malfunctions.</p>

      <fig id="F1"><label>Figure 1</label><caption><p id="d2e1192">Map of OpenAQ stations in the region of interest (Paris, France).</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/18/4771/2025/amt-18-4771-2025-f01.png"/>

        </fig>


</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>MERRA-2</title>
      <p id="d2e1211">The Modern-Era Retrospective analysis for Research and Applications, Version 2 (MERRA-2), is NASA's reanalysis model <xref ref-type="bibr" rid="bib1.bibx29" id="paren.19"/>. MERRA-2 provides model data for various variables in meteorology, aerosols and air quality. MERRA-2 has a spatial resolution of 0.5° <inline-formula><mml:math id="M102" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 0.625°, which is approximately 50 km in the Central Europe region. The time-varying variables from MERRA-2 that we use have a temporal resolution of 1 h, with both instantaneous and time-averaged values available depending on the variable and data product. Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/> contains a list of all MERRA-2 variables that are used as inputs in the proposed approach.</p>
      <p id="d2e1226">In addition to the variables contained in the MERRA-2 data, we calculate certain input variables from the MERRA-2 meteorological and aerosol data. These data are defined as: <list list-type="bullet"><list-item>
      <p id="d2e1231">Relative humidity (RH) at the surface. Equation based on the Clausius-Clapeyron equation <xref ref-type="bibr" rid="bib1.bibx21" id="paren.20"><named-content content-type="pre">see e.g.</named-content></xref>:<disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M103" display="block"><mml:mtable class="split" rowspacing="0.2ex" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:mi mathvariant="normal">RH</mml:mi><mml:mo>=</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mn mathvariant="normal">0.263</mml:mn><mml:mo>⋅</mml:mo><mml:mi mathvariant="normal">PS</mml:mi><mml:mo>⋅</mml:mo><mml:mi mathvariant="normal">QLML</mml:mi><mml:mo>/</mml:mo><mml:mi>exp⁡</mml:mi><mml:mo mathsize="1.1em">(</mml:mo><mml:mo mathsize="1.1em">(</mml:mo><mml:mn mathvariant="normal">17.67</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>⋅</mml:mo><mml:mo>(</mml:mo><mml:mi mathvariant="normal">T</mml:mi><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="normal">M</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">273.15</mml:mn><mml:mo>)</mml:mo><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>/</mml:mo><mml:mo>(</mml:mo><mml:mi mathvariant="normal">T</mml:mi><mml:mn mathvariant="normal">2</mml:mn><mml:mi mathvariant="normal">M</mml:mi><mml:mo>-</mml:mo><mml:mn mathvariant="normal">29.65</mml:mn><mml:mo>)</mml:mo><mml:mo mathsize="1.1em">)</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p></list-item><list-item>
      <p id="d2e1315">Wind direction (WD10M) at 10 m:<disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M104" display="block"><mml:mrow><mml:mi mathvariant="normal">WD</mml:mi><mml:mn mathvariant="normal">10</mml:mn><mml:mi mathvariant="normal">M</mml:mi><mml:mo>=</mml:mo><mml:mi mathvariant="normal">arctan</mml:mi><mml:mfenced close=")" open="("><mml:mrow><mml:mo>-</mml:mo><mml:mi mathvariant="normal">V</mml:mi><mml:mn mathvariant="normal">10</mml:mn><mml:mi mathvariant="normal">M</mml:mi><mml:mo>/</mml:mo><mml:mi mathvariant="normal">U</mml:mi><mml:mn mathvariant="normal">10</mml:mn><mml:mi mathvariant="normal">M</mml:mi></mml:mrow></mml:mfenced></mml:mrow></mml:math></disp-formula></p></list-item><list-item>
      <p id="d2e1352">Wind speed (WS10M) at 10 m:<disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M105" display="block"><mml:mrow><mml:mi mathvariant="normal">WS</mml:mi><mml:mn mathvariant="normal">10</mml:mn><mml:mi mathvariant="normal">M</mml:mi><mml:mo>=</mml:mo><mml:msqrt><mml:mrow><mml:mi mathvariant="normal">U</mml:mi><mml:mn mathvariant="normal">10</mml:mn><mml:msup><mml:mi mathvariant="normal">M</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>+</mml:mo><mml:mi mathvariant="normal">V</mml:mi><mml:mn mathvariant="normal">10</mml:mn><mml:msup><mml:mi mathvariant="normal">M</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:msqrt></mml:mrow></mml:math></disp-formula></p></list-item><list-item>
      <p id="d2e1392">PM<sub>2.5</sub> at surface: <xref ref-type="bibr" rid="bib1.bibx4" id="paren.21"/><disp-formula id="Ch1.E4" content-type="numbered"><label>4</label><mml:math id="M107" display="block"><mml:mtable rowspacing="0.2ex" class="split" displaystyle="true" columnalign="right left"><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mi mathvariant="normal">PM</mml:mi><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub><mml:mo>=</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mo mathsize="1.1em">(</mml:mo><mml:mn mathvariant="normal">1.375</mml:mn><mml:mo>⋅</mml:mo><mml:mi mathvariant="normal">SO</mml:mi><mml:mn mathvariant="normal">4</mml:mn><mml:mi mathvariant="normal">SMASS</mml:mi><mml:mo>+</mml:mo><mml:mn mathvariant="normal">1.4</mml:mn><mml:mo>⋅</mml:mo><mml:mi mathvariant="normal">OCSMASS</mml:mi></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mi mathvariant="normal">BCSMASS</mml:mi><mml:mo>+</mml:mo><mml:mi mathvariant="normal">DUSMASS</mml:mi><mml:mn mathvariant="normal">25</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd/><mml:mtd><mml:mrow><mml:mo>+</mml:mo><mml:mi mathvariant="normal">SSSMASS</mml:mi><mml:mn mathvariant="normal">25</mml:mn><mml:mo mathsize="1.1em">)</mml:mo><mml:mo>⋅</mml:mo><mml:msup><mml:mn mathvariant="normal">10</mml:mn><mml:mn mathvariant="normal">9</mml:mn></mml:msup></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula></p></list-item><list-item>
      <p id="d2e1477">AOD-to-PM<sub>2.5</sub> ratio <inline-formula><mml:math id="M109" display="inline"><mml:mi mathvariant="italic">η</mml:mi></mml:math></inline-formula>:<disp-formula id="Ch1.E5" content-type="numbered"><label>5</label><mml:math id="M110" display="block"><mml:mrow><mml:mi mathvariant="italic">η</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msub><mml:mi mathvariant="normal">PM</mml:mi><mml:mn mathvariant="normal">2.5</mml:mn></mml:msub></mml:mrow><mml:mi mathvariant="normal">TOTEXTTAU</mml:mi></mml:mfrac></mml:mstyle></mml:mrow></mml:math></disp-formula></p></list-item></list></p>
</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>High-resolution geographical indicators</title>
      <p id="d2e1524">All geographic variables with the original resolution larger than 100 m were regridded to a common spatial grid with a resolution of 100 m using the Universal Transverse Mercator (UTM) projection. Linear interpolation method was used for continuous features and nearest neighbor interpolation for categorical variables. This preprocessing ensured that all features were spatially collocated prior to input into the deep learning model.</p>
<sec id="Ch1.S2.SS4.SSS1">
  <label>2.4.1</label><title>OpenStreetMap roads</title>
      <p id="d2e1534">OpenStreetMap is an open-source project that contains high spatial resolution map data. In our model, we utilize OpenStreetMap roads as a data source for inputs. Specifically, we calculate the distance to the nearest street or highway and use this measurement as one of our input variables. The distances are computed on a grid with cell size 100 m <inline-formula><mml:math id="M111" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 100 m. In OpenStreetMap, all paths, streets, and highways are categorized under “highways”. However, we only consider certain sub-classes that include roads and highways accessible to car traffic, as these are potential sources of PM<sub>2.5</sub> pollution (information from <xref ref-type="bibr" rid="bib1.bibx24" id="altparen.22"/>). Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/> lists all the OpenStreetMap road types used to determine the distance to the nearest road.</p>
</sec>
<sec id="Ch1.S2.SS4.SSS2">
  <label>2.4.2</label><title>NASA Black Marble Night Lights</title>
      <p id="d2e1566">NASA's Black Marble is a night light product derived from the Visible Infrared Imaging Radiometer Suite (VIIRS) day/night band (DNB) radiances captured during night-time. The DNB is extremely sensitive to light, allowing it to detect even very low-intensity lights on Earth's surface at night. Most of these night-time lights are attributed to human activities. Since the distribution of night lights closely reflects human presence, we use NASA's Black Marble Night Lights as a proxy for population density, incorporating it as an input in our models. We utilize Night Light data with a spatial resolution of 500 m, based on the annual data product VNP46A4 <xref ref-type="bibr" rid="bib1.bibx35" id="paren.23"/>.</p>
</sec>
<sec id="Ch1.S2.SS4.SSS3">
  <label>2.4.3</label><title>MODIS land cover type</title>
      <p id="d2e1580">We utilize the MODIS <monospace>MCD12Q1</monospace> land cover type data product <xref ref-type="bibr" rid="bib1.bibx32" id="paren.24"/> to generate input variables that represent the distances to the nearest International Geosphere Biosphere Programme (IGBP) land cover types <xref ref-type="bibr" rid="bib1.bibx19 bib1.bibx1" id="paren.25"/>. The MODIS <monospace>MCD12Q1</monospace> data product has a spatial resolution of 500 m. A complete list of the IGBP land cover types can be found in Appendix <xref ref-type="sec" rid="App1.Ch1.S1"/>.</p>
</sec>
<sec id="Ch1.S2.SS4.SSS4">
  <label>2.4.4</label><title>Digital elevation model</title>
      <p id="d2e1606">We utilize the Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) digital elevation model (DEM) to represent land surface elevation <xref ref-type="bibr" rid="bib1.bibx9 bib1.bibx10 bib1.bibx23" id="paren.26"/>. The ASTER DEM provides a spatial resolution of 1 arcsecond, which is approximately 30 m.</p>
</sec>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Methods</title>
      <p id="d2e1622">Our objective is to estimate time series of surface PM<sub>2.5</sub> maps (3D PM<sub>2.5</sub> arrays, two spatial dimensions and one time dimension) in the region of interest by fusion of satellite and ground station measurement data, model data and different indicators as inputs for the deep learning model. Since we are dealing with unstructured data in the form of images, a well-suited choice for the machine learning model is a Convolutional Neural Network (CNN) <xref ref-type="bibr" rid="bib1.bibx16 bib1.bibx3" id="paren.27"/>. Furthermore, since both the input maps and the output maps represent the same region of interest, a U-Net model is an appropriate choice <xref ref-type="bibr" rid="bib1.bibx30" id="paren.28"/>. As the data is in 3D, we choose a variant of U-Net called 3D U-Net <xref ref-type="bibr" rid="bib1.bibx6" id="paren.29"/>. The main difference between conventional U-Net and 3D U-Net is that the latter deploys 3D convolutions instead of 2D convolutions for processing 3D image data.</p>
      <p id="d2e1652">One must note that other network architectures could also be utilized. One possibility would be, e.g., to use a U-Net with convolutional Long Short-Term Memory (LSTM) layers <xref ref-type="bibr" rid="bib1.bibx31" id="paren.30"/>. Convolutional LSTM layers behave as LSTM layers, with the key difference of performing their internal operations as convolutions, consequently being a possible choice for processing time series of 2D images. Nevertheless, we decided to use 3D U-Net as it was found computationally feasible and less memory intensive for processing the large data sets.</p>
      <p id="d2e1658">The input data consist of 4 dimensional arrays. The first dimension represents time and has size 24 in order to contain hourly information of a single day. The second dimension contains the different channels (i.e. the different input features) and the remaining two dimensions are the spatial dimensions with image size of 960 <inline-formula><mml:math id="M115" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 960. The output is a 3 dimensional array containing 24 hourly PM<sub>2.5</sub> maps in the region of interest as 960 <inline-formula><mml:math id="M117" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 960 images with pixel size 100 m <inline-formula><mml:math id="M118" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 100 m.</p>

      <fig id="F2" specific-use="star"><label>Figure 2</label><caption><p id="d2e1694">Visualization of the applied neural network architecture.</p></caption>
        <graphic xlink:href="https://amt.copernicus.org/articles/18/4771/2025/amt-18-4771-2025-f02.png"/>

      </fig>

      <p id="d2e1703">The model architecture has been implemented using the PyTorch framework <xref ref-type="bibr" rid="bib1.bibx25" id="paren.31"/>, a widely used library known for its flexibility and efficiency in developing deep learning models. A detailed schematic of the model architecture is provided in Fig. <xref ref-type="fig" rid="F2"/> to illustrate its structure and components, whereas Fig. <xref ref-type="fig" rid="F3"/> visualizes the corresponding data flow.</p>

      <fig id="F3" specific-use="star"><label>Figure 3</label><caption><p id="d2e1716">Visualization of the data flow in our method. Low spatial resolution (MERRA-2) data and high spatial resolution geographical indicators are projected on a common grid, joined and utilized as model input. The model output consists of hourly PM<sub>2.5</sub> maps.</p></caption>
        <graphic xlink:href="https://amt.copernicus.org/articles/18/4771/2025/amt-18-4771-2025-f03.png"/>

      </fig>

      <p id="d2e1734">The model consists of a contracting path (the encoder) and an expansive path (the decoder). On each level of the contracting path, 3D convolutions combined with ReLU activations and max pooling layers help in finding relevant features from the input maps, producing a new representation of the input with lower spatial and temporal resolution with a higher number of channels. The expansive path deploys 3D nearest neighbour upsampling and 3D convolutions followed by ReLU activations in order to recover a final output with the same spatial and temporal resolution of the input. Notice the skip connections linking each contracting path level to the corresponding expansive path level: from an intuitive point of view, these are useful to exploit the fine details contained in the input when generating the output. Finally, the output layer is a 3D 1 <inline-formula><mml:math id="M120" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 1 <inline-formula><mml:math id="M121" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 1 convolution followed by SoftPlus activations in order to constrain the output to be a positive definite array. Please notice that when we talk about 3 <inline-formula><mml:math id="M122" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 3 <inline-formula><mml:math id="M123" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 3 convolution and 1 <inline-formula><mml:math id="M124" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 1 <inline-formula><mml:math id="M125" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 1 convolution, we are referring to the size of the convolution kernel/filter along the depth, height, and width of the input volume. The number of parameters involved in the convolution operation can be obtained considering the number of input and output channels at a specific convolution layer (Fig. <xref ref-type="fig" rid="F2"/>). When testing the network architecture, various kernel sizes, internal activation functions, and upsampling techniques were evaluated, but no significant differences in performance were observed.</p>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Loss function</title>
      <p id="d2e1789">We use the Mean Square Error (MSE) as the loss function in the supervised regression problem of fitting the 3D U-Net model to the training data. Ideally in clear sky conditions for a satellite overpass,  we would have a full PM<sub>2.5</sub> map. In ideal conditions for a single ground station, we would have a full time series without missing data. However, in reality satellite overpass data can lack information at some pixels, e.g. cloud covering can hinder AOD retrievals, therefore hindering PM<sub>2.5</sub> estimates, and a ground station can malfunction, leading to missing data in the time-series of the station. Furthermore, on a single day we usually have significantly more PM<sub>2.5</sub> pixel values coming from the satellite overpasses than measured values from the ground stations. The ground stations PM<sub>2.5</sub> data are usually more accurate than satellite estimate data and they are our only data source available hourly. On the other hand, NOODLESALAD PM<sub>2.5</sub> is our data source providing information far from the ground stations at the time of satellite overpasses. Therefore, to take the missing data and highly different number of satellite versus ground station data available into account, we consider masking and weighting the data in the MSE loss function.</p>
      <p id="d2e1837">Let the batch size be denoted by <inline-formula><mml:math id="M131" display="inline"><mml:mi>B</mml:mi></mml:math></inline-formula> (with <inline-formula><mml:math id="M132" display="inline"><mml:mrow><mml:mi>B</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">4</mml:mn></mml:mrow></mml:math></inline-formula> for this particular case) and define <inline-formula><mml:math id="M133" display="inline"><mml:mrow><mml:mi>N</mml:mi><mml:mo>=</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">960</mml:mn><mml:mo>×</mml:mo><mml:mn mathvariant="normal">960</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> as the set of all pixel indices. For each sample <inline-formula><mml:math id="M134" display="inline"><mml:mi>b</mml:mi></mml:math></inline-formula> in the batch, define <inline-formula><mml:math id="M135" display="inline"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>⊆</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">24</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> as the set of hours during which ground stations measurements are available, and <inline-formula><mml:math id="M136" display="inline"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>⊆</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>,</mml:mo><mml:mi mathvariant="normal">…</mml:mi><mml:mo>,</mml:mo><mml:mn mathvariant="normal">24</mml:mn><mml:mo mathvariant="italic">}</mml:mo></mml:mrow></mml:math></inline-formula> as the corresponding set of hours for satellite overpasses. Let <inline-formula><mml:math id="M137" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> represent the target measurement at sample <inline-formula><mml:math id="M138" display="inline"><mml:mi>b</mml:mi></mml:math></inline-formula>, hour <inline-formula><mml:math id="M139" display="inline"><mml:mi>h</mml:mi></mml:math></inline-formula>, and pixel index <inline-formula><mml:math id="M140" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>, with <inline-formula><mml:math id="M141" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> denoting the corresponding predicted value by the deep learning model. When a measurement is not available (due to lacking data from a ground monitoring station or failed satellite retrieval due to cloud covering) <inline-formula><mml:math id="M142" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is encoded as 0, since this value does not naturally occur in the dataset (PM<sub>2.5</sub> is not zero in a realistic setting). On the other hand, <inline-formula><mml:math id="M144" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is always a positive real value by choice, since SoftPlus was chosen as  activation function for our model output.</p>
      <p id="d2e2076">For each sample <inline-formula><mml:math id="M145" display="inline"><mml:mi>b</mml:mi></mml:math></inline-formula> and hour <inline-formula><mml:math id="M146" display="inline"><mml:mi>h</mml:mi></mml:math></inline-formula>, define:

                <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M147" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E6"><mml:mtd><mml:mtext>6</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mi>i</mml:mi><mml:mo>∈</mml:mo><mml:mi>N</mml:mi><mml:mo>∣</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mtext> and </mml:mtext><mml:mi>h</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mi>H</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="italic">}</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E7"><mml:mtd><mml:mtext>7</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:msub><mml:mover accent="true"><mml:mi>S</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo mathvariant="italic">{</mml:mo><mml:mi>i</mml:mi><mml:mo>∈</mml:mo><mml:mi>N</mml:mi><mml:mo>∣</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mtext> and </mml:mtext><mml:mi>h</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mi>H</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo mathvariant="italic">}</mml:mo><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          representing pixel indices where ground station measurements (<inline-formula><mml:math id="M148" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>) and satellite overpass measurements (<inline-formula><mml:math id="M149" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>S</mml:mi><mml:mo stretchy="false" mathvariant="normal">̃</mml:mo></mml:mover><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>) are available. The sets <inline-formula><mml:math id="M150" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M151" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> are defined separately for the ground stations and satellite overpass data. From the satellite overpass data, we create a subset <inline-formula><mml:math id="M152" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub><mml:mo>⊂</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>S</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> of data with 25 % of size of <inline-formula><mml:math id="M153" display="inline"><mml:mrow><mml:msub><mml:mover accent="true"><mml:mi>S</mml:mi><mml:mo mathvariant="normal" stretchy="false">̃</mml:mo></mml:mover><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> by uniform random sampling of the pixels to be used in the minimization. The random sampling is performed at each training step (for every update of the network parameters) and can be seen as an optimization technique analogue to batch shuffling. This undersampling is beneficial for training on overpass estimates, especially given the substantial pixel count (nearly 1 million when all pixels provide valid measurements at overpass times).</p>
      <p id="d2e2392">The average losses for the ground stations and overpass contributions are defined by summing over the respective sets of valid hours:

                <disp-formula specific-use="gather" content-type="numbered"><mml:math id="M154" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E8"><mml:mtd><mml:mtext>8</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:munder><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>h</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mi>H</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munder><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:mo>|</mml:mo><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo></mml:mrow></mml:mfrac></mml:mstyle><mml:munder><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munder><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mrow><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E9"><mml:mtd><mml:mtext>9</mml:mtext></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:munder><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>h</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mi>H</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munder><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:mo>|</mml:mo><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo></mml:mrow></mml:mfrac></mml:mstyle><mml:munder><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>∈</mml:mo><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:munder><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mrow><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mrow><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          where <inline-formula><mml:math id="M155" display="inline"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M156" display="inline"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> are factors depending on the sizes of the sets <inline-formula><mml:math id="M157" display="inline"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M158" display="inline"><mml:mrow><mml:msub><mml:mi>H</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> (defined respectively as <inline-formula><mml:math id="M159" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:msub><mml:mi>H</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M160" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:msub><mml:mi>H</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo></mml:mrow></mml:math></inline-formula>). If these sets are empty, <inline-formula><mml:math id="M161" display="inline"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M162" display="inline"><mml:mrow><mml:msub><mml:mi>C</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> are equal to 0, otherwise they correspond to <inline-formula><mml:math id="M163" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:mo>|</mml:mo><mml:msub><mml:mi>H</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:math></inline-formula> and <inline-formula><mml:math id="M164" display="inline"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mn mathvariant="normal">1</mml:mn><mml:mrow><mml:mo>|</mml:mo><mml:msub><mml:mi>H</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo></mml:mrow></mml:mfrac></mml:mstyle></mml:math></inline-formula> respectively. Analogously, <inline-formula><mml:math id="M165" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M166" display="inline"><mml:mrow><mml:mo>|</mml:mo><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub><mml:mo>|</mml:mo></mml:mrow></mml:math></inline-formula> represent the sizes of the sets <inline-formula><mml:math id="M167" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> and <inline-formula><mml:math id="M168" display="inline"><mml:mrow><mml:msub><mml:mi>S</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi><mml:mo>,</mml:mo><mml:mi>h</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula>.</p>
      <p id="d2e2942">We then define the sample-specific loss <inline-formula><mml:math id="M169" display="inline"><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mi>b</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> as:

            <disp-formula id="Ch1.E10" content-type="numbered"><label>10</label><mml:math id="M170" display="block"><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mi>b</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mfenced open="{" close=""><mml:mtable class="array" columnalign="left left"><mml:mtr><mml:mtd><mml:mrow><mml:mstyle displaystyle="false"><mml:mstyle displaystyle="false"><mml:mfrac style="text"><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub></mml:mrow><mml:mn mathvariant="normal">2</mml:mn></mml:mfrac></mml:mstyle></mml:mstyle><mml:mo>,</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mtext>if </mml:mtext><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn><mml:mtext> and </mml:mtext><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>≠</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mtext>if </mml:mtext><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi mathvariant="normal">op</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mtext>if </mml:mtext><mml:msub><mml:mi>L</mml:mi><mml:mrow><mml:mi mathvariant="normal">st</mml:mi><mml:mo>,</mml:mo><mml:mi>b</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">0</mml:mn></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mfenced></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e3103">Finally, the overall loss for the batch is defined as:

            <disp-formula id="Ch1.E11" content-type="numbered"><label>11</label><mml:math id="M171" display="block"><mml:mrow><mml:mi mathvariant="script">L</mml:mi><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>B</mml:mi></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>b</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>B</mml:mi></mml:munderover><mml:msub><mml:mi>L</mml:mi><mml:mi>b</mml:mi></mml:msub></mml:mrow></mml:math></disp-formula></p>
      <p id="d2e3138">This formalization provides the necessary structure to include both ground station and satellite overpass target data within each batch. A schematic representation of the loss function is presented in Fig. <xref ref-type="fig" rid="F4"/>.</p>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e3145">Schematic representation of the loss function. Red squares represent available data points. White squares represent missing data. Black arrows represent operations: these operations are performed using available data (red squares) and the pixels with missing data are omitted.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/18/4771/2025/amt-18-4771-2025-f04.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Model training</title>
      <p id="d2e3163">For each data sample, one or two target maps correspond to the available satellite overpasses data (i.e. NOODLESALAD PM<sub>2.5</sub>) while the others contain only ground stations data (therefore 11 pixels for each map when data is available from all the stations). In order to test the results from the network training, we used leave-one-out cross-validation (CV) (i.e. we removed one different station from each training and used the data left out of training for testing purposes). Therefore, we trained 11 different networks (one for each ground station in the region of interest). Furthermore, the dataset has been split into training set (approximately 80 % of the data samples) used for the optimization of the network weights and validation set (approximately 20 % of the data samples) used for early stopping of the optimization. The early stopping is a form of regularization useful to avoid overfitting of the network <xref ref-type="bibr" rid="bib1.bibx11 bib1.bibx3" id="paren.32"/>. It consists in keeping track of both the training error and validation error with the objective of stopping the training when the validation error starts to increase (i.e. when the network stops to learn useful patterns and noise in the training set starts to play a significant role). While early stopping is not the only regularization technique applicable to train deep learning models, it offers a good trade-off between model performance and training time. We consider a patience parameter equal to 30, meaning that the training stops when no improvement on the validation loss is recorded over 30 epochs. Since using a small batch size introduces fluctuations in the loss during training, this choice of the patience parameter is reasonable, as a lower patience value could prematurely stop training and lead to underfitting.</p>
      <p id="d2e3178">We utilized the Adam algorithm (with learning rate equal to 0.0001) and the custom loss function described in Sect. <xref ref-type="sec" rid="Ch1.S3.SS1"/> to optimize the network model parameters.</p>
</sec>
</sec>
<sec id="Ch1.S4">
  <label>4</label><title>Results</title>
<sec id="Ch1.S4.SS1">
  <label>4.1</label><title>Overall performance and evaluation metrics</title>
      <p id="d2e3199">We considered a leave-one-out CV approach, training 11 models, each time leaving one station out of the training as test station. The results of predicted PM<sub>2.5</sub> at the locations of the 11 AQ stations in Paris are shown in Fig. <xref ref-type="fig" rid="F5"/>.</p>

      <fig id="F5" specific-use="star"><label>Figure 5</label><caption><p id="d2e3215"><inline-formula><mml:math id="M174" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> <bold>(a, d, g)</bold>, RMSE <bold>(b, e, h)</bold> and MAE <bold>(c, f, i)</bold> evaluation metrics resulting from the leave-one-out cross-validation per each test station. The metrics have been calculated for hourly averages <bold>(a–c)</bold>, daily averages <bold>(d–f)</bold> and monthly averages <bold>(g–i)</bold>. Our model predictions (blue bars) and MERRA-2 estimates (orange bars) are compared to the ground truth data (OpenAQ).</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/18/4771/2025/amt-18-4771-2025-f05.png"/>

        </fig>

      <p id="d2e3253">Different fidelity metrics (per each trained model) were calculated to compare MERRA-2 estimates (orange bars) and our model predictions (blue bars) to OpenAQ measurements (ground truth). Correlation <inline-formula><mml:math id="M175" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>, Root Mean Square Error (RMSE) and Mean Absolute Error (MAE) values are shown on the left, middle and right columns. These metrics are evaluated for hourly averages, daily averages and monthly averages (estimated using the hourly averages) on the top, middle and bottom row. The fidelity metrics averages show that our model clearly outperforms MERRA-2. <inline-formula><mml:math id="M176" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> CV averages for our model are 0.51 (hourly averages), 0.65 (daily averages) and 0.87 (monthly averages). <inline-formula><mml:math id="M177" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> CV averages for MERRA-2 are respectively 0.10, 0.18, and 0.42. RMSE CV averages for our model are 6.58 <inline-formula><mml:math id="M178" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> (hourly averages), 4.92 <inline-formula><mml:math id="M180" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> (daily averages) and 2.87 <inline-formula><mml:math id="M182" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> (monthly averages). The same metrics averages for MERRA-2 are respectively 9.05, 7.04 and 4.08 <inline-formula><mml:math id="M184" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>. MAE CV averages for our model are 4.61 <inline-formula><mml:math id="M186" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> (hourly averages), 3.59 <inline-formula><mml:math id="M188" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> (daily averages) and 2.51 <inline-formula><mml:math id="M190" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup> (monthly averages). MAE CV averages for MERRA-2 are respectively 6.39, 5.14 and 3.30 <inline-formula><mml:math id="M192" display="inline"><mml:mrow class="unit"><mml:mi mathvariant="normal">µ</mml:mi></mml:mrow></mml:math></inline-formula>g m<sup>−3</sup>. We can notice from Fig. <xref ref-type="fig" rid="F5"/> that our model outperforms MERRA-2 on all hourly and daily value metrics, and in most monthly averaged with the exceptions that MERRA-2 has better monthly MAE for stations 5 and 11, better RMSE for station 11. The <inline-formula><mml:math id="M194" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values still show a clear improvement of our model. For what regards station 5, the better RMSE and worse MAE are due to the fact that RMSE highlights outliers (i.e. MERRA-2 commits less but bigger mistakes). Anyway, the important <inline-formula><mml:math id="M195" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values for both station 5 and station 11 show that our model correctly predicts the AQ trends better but is off mainly by a scaling factor. From Fig. <xref ref-type="fig" rid="F1"/>, one would expect some differences in the performances at different ground stations, since the more isolated is the station, the less information from surrounding stations is present in the training data (we have ideally full PM<sub>2.5</sub> maps only once per day). Stations 1, 2, 10, and 11 are clearly positioned outside the city center of Paris. Looking at the metrics for these stations, only the location of station 11 seems to  have somewhat different prediction accuracy by our model than the stations located in the city center. Although we considered a relatively small dataset to train and test our model, these results suggest it is not overfitting the training data.</p>

      <fig id="F6" specific-use="star"><label>Figure 6</label><caption><p id="d2e3491">Comparison between hourly PM<sub>2.5</sub> estimates from our model (blue), MERRA-2 (orange) and OpenAQ ground stations measurements (black) at station 1. The period considered runs between 4 and 13 December 2019.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/18/4771/2025/amt-18-4771-2025-f06.png"/>

        </fig>

      <fig id="F7" specific-use="star"><label>Figure 7</label><caption><p id="d2e3511">PM<sub>2.5</sub> map on 6 December 2019. The dots reprent PM<sub>2.5</sub> measurements from OpenAQ ground stations.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/18/4771/2025/amt-18-4771-2025-f07.png"/>

        </fig>

      <p id="d2e3538">Figure <xref ref-type="fig" rid="F6"/> presents an example of model performance at hourly resolution at Station 1 between 4 and 13 December 2019, compared against MERRA-2 and OpenAQ observations. The comparison illustrates that our model captures the observed variability more accurately than MERRA-2. Figure <xref ref-type="fig" rid="F7"/> shows hourly PM<sub>2.5</sub> concentration maps for the Paris region on 6 December 2019, with spatial patterns that agree well with OpenAQ station data. Zooming on these hourly maps we can notice slight block artifacts arising from MERRA-2 input low resolution. This effect could potentially be mitigated via spatial continuity constraints (i.e. adding a regularizer to the loss function) and will be considered in future work.</p>

      <fig id="F8" specific-use="star"><label>Figure 8</label><caption><p id="d2e3556">PM<sub>2.5</sub> daily cycle averages for the different seasons and monthly averages (at station 1). The black lines represent OpenAQ measurements, the orange lines represent MERRA-2 estimates and the blue lines are predicted by our model. The evaluation metrics comparing respectively MERRA-2 and our model to the ground truth (OpenAQ) are shown on the bottom-right.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/18/4771/2025/amt-18-4771-2025-f08.png"/>

        </fig>

      <fig id="F9" specific-use="star"><label>Figure 9</label><caption><p id="d2e3577">Predicted PM<sub>2.5</sub> seasonal averages maps by hour for the 2019 winter season (December, January and February) on the city of Paris. The dots represent ground stations measurements. These plots are obtained considering the model trained leaving out station 1.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/18/4771/2025/amt-18-4771-2025-f09.png"/>

        </fig>

</sec>
<sec id="Ch1.S4.SS2">
  <label>4.2</label><title>Seasonal daily trends and monthly averages</title>
      <p id="d2e3603">Figure <xref ref-type="fig" rid="F8"/> shows daily cycle averages on the different seasons (top and middle rows) and monthly averages (bottom row) of 2019 at the test station 1. Black lines represent the ground station measurements, while orange lines and blue lines represent respectively MERRA-2 estimates and our model predictions. Focusing on the daily cycle averages, qualitatively our model and MERRA-2 seem comparable for the spring seasons (March, April, May). The difference is evident on the winter (December, January, February), autumn (September, October, November) and summer (June, July, August) seasons. Especially on the summer season, our model improves notably the accuracy and correlation over MERRA-2. The improvement can be seen quantitatively looking at the metrics on the bottom-right of Fig. <xref ref-type="fig" rid="F8"/> (here the estimates for all the seasons have been taken into account in the calculation of the evaluation metrics). Notice how the metrics values for MERRA-2 and our model have been encoded with the same colors of the legend. On the bottom-left of Fig. <xref ref-type="fig" rid="F8"/> we compare monthly averages estimates. Again, the difference between MERRA-2 and our model is evident. While MERRA-2 could seem to give a good approximation to the PM<sub>2.5</sub> annual average at the station, it is not able to capture the time series trend. Our model improves notably from this point of view, showing also improvements in accuracy. This is clear from the metrics for monthly averages, where the <inline-formula><mml:math id="M204" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> is more than 3 times higher for our model, while the RMSE and MAE are about half of the respective errors in the MERRA-2 estimates.</p>

      <fig id="F10" specific-use="star"><label>Figure 10</label><caption><p id="d2e3634">Predicted PM<sub>2.5</sub> monthly averages maps for the year 2019 on the city of Paris. The dots represent ground stations measurements. These plots are obtained considering the model trained leaving out station 1.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/18/4771/2025/amt-18-4771-2025-f10.png"/>

        </fig>

      <p id="d2e3652">Figure <xref ref-type="fig" rid="F9"/> shows PM<sub>2.5</sub> seasonal averages maps by hour for the 2019 winter season (December, January, February) predicted by our model on the city of Paris. The dots represent ground stations measurements. The general time series trend reflects the PM<sub>2.5</sub> variations seen in Fig. <xref ref-type="fig" rid="F8"/> on the top-left panel. The PM<sub>2.5</sub> levels seem to decrease at day time, and raise again at night time. This behaviour can be expected and physically explained through boundary layer height variations. Spatial variations of PM<sub>2.5</sub> are reasonable and in agreement with what found predicting PM<sub>2.5</sub> levels at satellite overpass <xref ref-type="bibr" rid="bib1.bibx27" id="paren.33"/>: the city center and areas surrounding main highways are predicted as the most polluted areas. The maps shown in Fig. <xref ref-type="fig" rid="F9"/> are obtained considering station 1 as test station.</p>

      <fig id="F11" specific-use="star"><label>Figure 11</label><caption><p id="d2e3713">Feature importance calculated as sum of the normalized absolute SHAP values for predictions at station 1.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/18/4771/2025/amt-18-4771-2025-f11.png"/>

        </fig>

      <p id="d2e3722">The maps shown in Fig. <xref ref-type="fig" rid="F10"/> represent PM<sub>2.5</sub> monthly averages for the year 2019 predicted by our model on the city of Paris. Dots represent ground stations measurementes. The general time series trend reflects the content of the bottom-left panel in Fig. <xref ref-type="fig" rid="F8"/> as expected: PM<sub>2.5</sub> levels are higher in colder months, while lower in warmer months. Again, this temporal variation of PM<sub>2.5</sub> could be explained through boundary layer height variations and also residential heating plays an important role in winter. Spatial variations of PM<sub>2.5</sub> present the same structure already discussed before for Fig. <xref ref-type="fig" rid="F9"/>. Again, the maps shown in Fig. <xref ref-type="fig" rid="F10"/> are obtained considering station 1 as test station. The agreement with the test station and training stations is generally good.</p>
</sec>
<sec id="Ch1.S4.SS3">
  <label>4.3</label><title>SHAP explainability</title>
      <p id="d2e3778">We employed the SHAP DeepExplainer to compute SHAP values and assess feature importance for the model predictions at station 1 <xref ref-type="bibr" rid="bib1.bibx20" id="paren.34"/>. Due to computational constraints, the SHAP values were calculated using a smaller background dataset, with analysis conducted on a subset of 90 randomly selected days. Feature importances were determined by summing the normalized absolute SHAP values, and are shown in Fig. <xref ref-type="fig" rid="F11"/>. As expected, T2M (2 m air temperature) emerged as one of the most significant predictors, consistent with prior findings. For instance, T2M influences the temporal variability of PM<sub>2.5</sub> through boundary layer dynamics and contains information about seasonal emission changes. Specific aerosol variables such as BCCMASS (Black Carbon Column Mass Density), could give the model an idea of how much important black carbon concentration is for the final PM<sub>2.5</sub> estimate, but at the same time act as proxy for other species related to black carbon emission sources. Among the most important high resolution input features, ASTERDEM (ASTER Digital Elevation Model) and BlackMarble (NASA Black Marble Night Lights) offer information about terrain topology and human activities location. While the former could provide useful information about aerosol transport, the latter could act as a proxy for aerosol sources spatial distribution. More generally, aggregating the feature importances in Fig. <xref ref-type="fig" rid="F11"/>, one can estimate the importance of athmosperic variables (35 %), aerosol variables (25 %) and high resolution indicators (40 %).</p>
</sec>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <label>5</label><title>Conclusions</title>
      <p id="d2e3816">We developed a novel deep learning data fusion method to estimate hourly PM<sub>2.5</sub> on a grid with cell size 100 m <inline-formula><mml:math id="M218" display="inline"><mml:mo>×</mml:mo></mml:math></inline-formula> 100 m, utilizing low-resolution geophysical model data, high-resolution geographical indicators, satellite PM<sub>2.5</sub> retrievals and in-situ PM<sub>2.5</sub> ground measurements. A 3D U-Net based architecture was deployed to take into account both spatial and temporal correlations in the data at hand. The methodology was tested on data from Paris, France, for the year 2019.</p>
      <p id="d2e3853">The model outperforms MERRA-2 PM<sub>2.5</sub> estimates (our starting point, utilized as model input) on all the evaluation metrics considered. Our estimates are generally consistent with the PM<sub>2.5</sub> spatio-temporal variability assessed by ground stations measurements. Our method seems promising in answering our research questions: reliable gapless PM<sub>2.5</sub> maps at fine scale in absence of AOD data, due to absence of satellite overpass or due to failed AOD retrieval, seem possible.</p>
      <p id="d2e3883">Further improvements could be obtained by various means. The method is flexible for what concern data sources, as different data sources could be utilized as inputs, and targets in the training process. In particular, different satellite PM<sub>2.5</sub> sources could be considered in the training. So far, we considered only NOODLESALAD PM<sub>2.5</sub>. In future studies, we could take into account other satellite data at different satellite overpass times and with different spatial resolution. Considering that geostationary retrievals have high temporal resolution, we could also combine low orbiting instruments and geostationary instruments to integrate all the available information both on the spatial and temporal dimensions. Further, instead of relying solely on data, we could introduce physical constraints in our loss function, pushing the model training to the space of physical solutions assisted by differential equations. Physics Informed Neural Networks (PINNs) <xref ref-type="bibr" rid="bib1.bibx28" id="paren.35"/> have shown promising results in many areas of science and they can be a practical solution to achieve a deep learning based assimilation model at fine scale. It is also important to address the issue of data imbalance. The PM<sub>2.5</sub> distribution in the training data is inherently skewed toward lower values, posing a common challenge in training models. Additionally, satellite retrieval maps (NOODLESALAD PM<sub>2.5</sub>) are more susceptible to cloud cover during winter, causing seasonal imbalance to the training data. Expanding the dataset to include more locations and years could help mitigate these issues and improve model performance. We remark that assessment of the credibility of the spatial features in high-resolution air pollution mapping remains an important future topic. However, this is not an straightforward task due to the lack of independent ground based high-resolution data. At scales of single cities, such local data could possibly be obtained from separate measurement campaigns (e.g. using drones) for short time periods or low-cost sensor networks such as PurpleAir which, however, may have somewhat limited accuracy.</p>
      <p id="d2e3925">In conclusion, given the encouraging results and possible future developments, we believe our methodology could be relevant for PM<sub>2.5</sub> related exposure and regulation studies at finer (suburban level) scale.</p>
</sec>

      
      </body>
    <back><app-group>

<app id="App1.Ch1.S1">
  <label>Appendix A</label><title>Table with input features and lists of variables used from datasets</title>

<table-wrap id="TA1"><label>Table A1</label><caption><p id="d2e3953">Table of input features for the model.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">U10M</oasis:entry>
         <oasis:entry colname="col2">V10M</oasis:entry>
         <oasis:entry colname="col3">PS</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">T2M</oasis:entry>
         <oasis:entry colname="col2">SO4SMASS</oasis:entry>
         <oasis:entry colname="col3">OCSMASS</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">BCSMASS</oasis:entry>
         <oasis:entry colname="col2">DUSMASS25</oasis:entry>
         <oasis:entry colname="col3">SSSMASS25</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SO4CMASS</oasis:entry>
         <oasis:entry colname="col2">OCCMASS</oasis:entry>
         <oasis:entry colname="col3">BCCMASS</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">DUCMASS25</oasis:entry>
         <oasis:entry colname="col2">SSCMASS25</oasis:entry>
         <oasis:entry colname="col3">TOTEXTTAU</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">TOTANGSTR</oasis:entry>
         <oasis:entry colname="col2">TOTSCATAU</oasis:entry>
         <oasis:entry colname="col3">QLML</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">SURFACERH</oasis:entry>
         <oasis:entry colname="col2">PM25</oasis:entry>
         <oasis:entry colname="col3">surface_to_column_ratio_PM25</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">ETA</oasis:entry>
         <oasis:entry colname="col2">winddirection</oasis:entry>
         <oasis:entry colname="col3">windspeed</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">ASTERDEM</oasis:entry>
         <oasis:entry colname="col2">BlackMarble</oasis:entry>
         <oasis:entry colname="col3">distancetoroad</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">distancetoroad_upwind_1</oasis:entry>
         <oasis:entry colname="col2">distancetoroad_upwind_2</oasis:entry>
         <oasis:entry colname="col3">distancetoroad_upwind_3</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">distancetoroad_upwind_4</oasis:entry>
         <oasis:entry colname="col2">distancetoroad_upwind_5</oasis:entry>
         <oasis:entry colname="col3">distancetoroad_upwind_6</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">distancetoroad_upwind_7</oasis:entry>
         <oasis:entry colname="col2">distancetoroad_upwind_8</oasis:entry>
         <oasis:entry colname="col3">distancetoroad_upwind_9</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">distancetoroad_upwind_10</oasis:entry>
         <oasis:entry colname="col2">distancetoroad_upwind_11</oasis:entry>
         <oasis:entry colname="col3">distancetoroad_upwind_12</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">distancetoroad_upwind_13</oasis:entry>
         <oasis:entry colname="col2">distancetoroad_upwind_14</oasis:entry>
         <oasis:entry colname="col3">distancetoroad_upwind_15</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">distancetoroad_upwind_16</oasis:entry>
         <oasis:entry colname="col2">distancetoroad_upwind_17</oasis:entry>
         <oasis:entry colname="col3">distancetoroad_upwind_18</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">distancetolandclass_1</oasis:entry>
         <oasis:entry colname="col2">distancetolandclass_2</oasis:entry>
         <oasis:entry colname="col3">distancetolandclass_3</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">distancetolandclass_4</oasis:entry>
         <oasis:entry colname="col2">distancetolandclass_5</oasis:entry>
         <oasis:entry colname="col3">distancetolandclass_6</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">distancetolandclass_7</oasis:entry>
         <oasis:entry colname="col2">distancetolandclass_8</oasis:entry>
         <oasis:entry colname="col3">distancetolandclass_9</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">distancetolandclass_10</oasis:entry>
         <oasis:entry colname="col2">distancetolandclass_11</oasis:entry>
         <oasis:entry colname="col3">distancetolandclass_12</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">distancetolandclass_13</oasis:entry>
         <oasis:entry colname="col2">distancetolandclass_14</oasis:entry>
         <oasis:entry colname="col3">distancetolandclass_15</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">distancetolandclass_16</oasis:entry>
         <oasis:entry colname="col2">distancetolandclass_17</oasis:entry>
         <oasis:entry colname="col3">landclass</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

<sec id="App1.Ch1.S1.SS1">
  <label>A1</label><title>MERRA-2 variables</title>
      <p id="d2e4232">We use the following meteorology-related variables from the MERRA-2 <monospace>M2T1NXSLV</monospace> dataset: <list list-type="bullet"><list-item>
      <p id="d2e4240">PS: surface pressure (Pa)</p></list-item><list-item>
      <p id="d2e4244">T2M: 2 m air temperature (K)</p></list-item><list-item>
      <p id="d2e4248">U10M: 10 m eastward wind (m s<sup>−1</sup>)</p></list-item><list-item>
      <p id="d2e4264">V10M: 10 m northward wind (m s<sup>−1</sup>)</p></list-item></list></p>
      <p id="d2e4279">We use the following meteorology-related variables from the MERRA-2 <monospace>M2T1NXFLX</monospace> dataset: <list list-type="bullet"><list-item>
      <p id="d2e4287">QLML: surface specific humidity (1)</p></list-item></list></p>
      <p id="d2e4290">We use the following aerosol and air quality related variables from the MERRA-2 <monospace>M2T1NXAER</monospace> dataset: <list list-type="bullet"><list-item>
      <p id="d2e4298">BCCMASS: Black Carbon Column Mass Density (kg m<sup>−2</sup>)</p></list-item><list-item>
      <p id="d2e4314">BCSMASS: Black Carbon Surface Mass Concentration (kg m<sup>−3</sup>)</p></list-item><list-item>
      <p id="d2e4330">DUCMASS25: Dust Column Mass Density – PM<sub>2.5</sub> (kg m<sup>−2</sup>)</p></list-item><list-item>
      <p id="d2e4355">DUSMASS25: Dust Surface Mass Concentration – PM<sub>2.5</sub> (kg m<sup>−3</sup>) </p></list-item><list-item>
      <p id="d2e4382">OCCMASS: Organic Carbon Column Mass Density (kg m<sup>−2</sup>)</p></list-item><list-item>
      <p id="d2e4398">OCSMASS: Organic Carbon Surface Mass Concentration (kg m<sup>−3</sup>)</p></list-item><list-item>
      <p id="d2e4414">SO4CMASS: SO4 Column Mass Density (kg m<sup>−2</sup>)</p></list-item><list-item>
      <p id="d2e4430">SO4SMASS: SO4 Surface Mass Concentration (kg m<sup>−3</sup>)</p></list-item><list-item>
      <p id="d2e4446">SSCMASS25: Sea Salt Column Mass Density – PM<sub>2.5</sub> (kg m<sup>−2</sup>)</p></list-item><list-item>
      <p id="d2e4471">SSSMASS25: Sea Salt Surface Mass Concentration – PM<sub>2.5</sub> (kg m<sup>−3</sup>)</p></list-item><list-item>
      <p id="d2e4496">TOTANGSTR: Total Aerosol Angstrom parameter [470–870 nm] (1)</p></list-item><list-item>
      <p id="d2e4501">TOTEXTTAU: Total Aerosol Extinction AOT [550 nm] (1)</p></list-item><list-item>
      <p id="d2e4505">TOTSCATAU: Total Aerosol Scattering AOT [550 nm] (1)</p></list-item></list></p>
</sec>
<sec id="App1.Ch1.S1.SS2">
  <label>A2</label><title>OpenStreetMap road types used to compute the distance to the closest road</title>
      <p id="d2e4516">We use the following road types to compute the distance to the closest road. The descriptions of the road types are obtained from <xref ref-type="bibr" rid="bib1.bibx24" id="text.36"/>. <list list-type="bullet"><list-item>
      <p id="d2e4524">motorway: A restricted access major divided highway, normally with 2 or more running lanes plus emergency hard shoulder. Equivalent to the Freeway, Autobahn, etc.</p></list-item><list-item>
      <p id="d2e4528">trunk: The most important roads in a country's system that aren't motorways.</p></list-item><list-item>
      <p id="d2e4532">primary: The next most important roads in a country's system.</p></list-item><list-item>
      <p id="d2e4536">secondary: The next most important roads in a country's system.</p></list-item><list-item>
      <p id="d2e4540">tertiary: The next most important roads in a country's system.</p></list-item><list-item>
      <p id="d2e4544">motorway_link: The link roads (sliproads/ramps) leading to/from a motorway from/to a motorway or lower class highway. Normally with the same motorway restrictions.</p></list-item><list-item>
      <p id="d2e4548">trunk_link: The link roads (sliproads/ramps) leading to/from a trunk road from/to a trunk road or lower class highway.</p></list-item><list-item>
      <p id="d2e4552">primary_link: The link roads (sliproads/ramps) leading to/from a primary road from/to a primary road or lower class highway.</p></list-item><list-item>
      <p id="d2e4556">secondary_link: The link roads (sliproads/ramps) leading to/from a secondary road from/to a secondary road or lower class highway.</p></list-item><list-item>
      <p id="d2e4560">tertiary_link: The link roads (sliproads/ramps) leading to/from a tertiary road from/to a tertiary road or lower class highway.</p></list-item></list></p>
</sec>
<sec id="App1.Ch1.S1.SS3">
  <label>A3</label><title>IGBP land cover types</title>
      <p id="d2e4572">IGBP classification contains the following land cover types: <list list-type="order"><list-item>
      <p id="d2e4577">Evergreen needleleaf forests</p></list-item><list-item>
      <p id="d2e4581">Evergreen broadleaf forests</p></list-item><list-item>
      <p id="d2e4585">Deciduous needleleaf forests</p></list-item><list-item>
      <p id="d2e4589">Deciduous broadleaf forests</p></list-item><list-item>
      <p id="d2e4593">Mixed forests</p></list-item><list-item>
      <p id="d2e4597">Closed shrublands</p></list-item><list-item>
      <p id="d2e4601">Open shrublands</p></list-item><list-item>
      <p id="d2e4605">Woody savannas</p></list-item><list-item>
      <p id="d2e4609">Savannas</p></list-item><list-item>
      <p id="d2e4613">Grasslands</p></list-item><list-item>
      <p id="d2e4617">Permanent wetlands</p></list-item><list-item>
      <p id="d2e4622">Croplands</p></list-item><list-item>
      <p id="d2e4626">Urban and built-up</p></list-item><list-item>
      <p id="d2e4630">Cropland/natural</p></list-item><list-item>
      <p id="d2e4634">Snow and ice</p></list-item><list-item>
      <p id="d2e4638">Barren</p></list-item><list-item>
      <p id="d2e4642">Water bodies</p></list-item></list></p>
</sec>
</app>
  </app-group><notes notes-type="codedataavailability"><title>Code and data availability</title>

      <p id="d2e4650">The OpenAQ data is open data and available for download at <uri>https://openaq.org/</uri> (last access: 13 April 2023). The OpenStreetMap data is open data and available for download at <uri>https://www.openstreetmap.org/</uri> (last access: 13 April 2023). All the NASA data (MERRA-2, MODIS, ASTER DEM) used in this work is open data and can be found and downloaded using the NASA Earthdata Search website at <uri>https://www.earthdata.nasa.gov/</uri> (last access: 13 April 2023). The NASA Black Marble Night Lights data is available at <uri>https://blackmarble.gsfc.nasa.gov/</uri> (last access: 13 April 2023). Data (including NOODLESALAD PM<sub>2.5</sub>) and code will be available from the authors on a reasonable request.</p>
  </notes><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e4677">AP: Conceptualization, Methodology, Software, Formal analysis, Writing – Original draft, Visualization. VK: Conceptualization, Methodology, Formal analysis, Writing – Original draft, Supervision. TL: Conceptualization, Methodology, Formal analysis, Writing – Original Draft, Supervision. AL: Conceptualization, Methodology, Software, Formal analysis, Writing – Original draft, Supervision.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e4683">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e4689">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Also, please note that this paper has not received English language copy-editing.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e4695">This study was funded by the European Space Agency EO Science for Society programme via the NOODLESALAD project (contract number 4000137651/22/I-DT-lr). The research was also supported by the Research Council of Finland via the Finnish Centre of Excellence of Inverse Modelling and Imaging (project no. 353084), Flagship of Advanced Mathematics for Sensing Imaging and Modelling (grant no. 358944), and research project (grant no. 321761). The authors wish to acknowledge CSC – IT Center for Science, Finland, for computational resources.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e4700">This research has been supported by the European Space Agency (grant no. 4000137651/22/I-DT-lr) and the Research Council of Finland, Luonnontieteiden ja Tekniikan Tutkimuksen Toimikunta (grant nos. 353084, 358944, and 321761).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e4706">This paper was edited by Sandip Dhomse and reviewed by two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Belward et al.(1999)</label><mixed-citation> Belward, A. S., Estes, J. E., and Kline, K. D.: The IGBP-DIS global 1-km land-cover data set DISCover: A project overview, Photogrammetric Engineering and Remote Sensing, 65, 1013–1020, 1999.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Bessho et al.(2016)</label><mixed-citation>Bessho, K., Date, K., Hayashi, M., Ikeda, A., Imai, T., Inoue, H., Kumagai, Y., Miyakawa, T., Murata, H., Ohno, T., Okuyama, A., Oyama, R., Sasaki, Y.,  Shimazu, Y., Shimoji, K., Sumida, Y., Suzuki, M., Taniguchi, H., Tsuchiyama,  H., Uesawa, D., Yokota, H., and Yoshida, R.: An Introduction to Himawari-8/9  – Japan's New-Generation Geostationary Meteorological Satellites, Journal of  the Meteorological Society of Japan. Ser. II, 94, 151–183, <ext-link xlink:href="https://doi.org/10.2151/jmsj.2016-009" ext-link-type="DOI">10.2151/jmsj.2016-009</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Bishop and Bishop(2024)</label><mixed-citation>Bishop, C. M. and Bishop, H.: Deep Learning: Foundations and Concepts,  Springer International Publishing, Cham, ISBN 978-3-031-45467-7, <ext-link xlink:href="https://doi.org/10.1007/978-3-031-45468-4" ext-link-type="DOI">10.1007/978-3-031-45468-4</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Buchard et al.(2016)</label><mixed-citation>Buchard, V., Da Silva, A., Randles, C., Colarco, P., Ferrare, R., Hair, J., Hostetler, C., Tackett, J., and Winker, D.: Evaluation of the surface PM<sub>2.5</sub> in Version 1 of the NASA MERRA Aerosol Reanalysis over the United States, Atmospheric Environment, 125, 100–111, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Chu et al.(2016)</label><mixed-citation>Chu, Y., Liu, Y., Li, X., Liu, Z., Lu, H., Lu, Y., Mao, Z., Chen, X., Li, N., Ren, M., Liu, F., Tian, L., Zhu, Z., and Xiang, H.: A Review on Predicting Ground PM<sub>2.5</sub> Concentration Using Satellite Aerosol Optical Depth, Atmosphere, 7, <ext-link xlink:href="https://doi.org/10.3390/atmos7100129" ext-link-type="DOI">10.3390/atmos7100129</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Çiçek et al.(2016)</label><mixed-citation>Çiçek, Ö., Abdulkadir, A., Lienkamp, S. S., Brox, T., and Ronneberger, O.: 3D U-Net: Learning Dense Volumetric Segmentation from Sparse Annotation, edited by: Ourselin, S., Joskowicz, L., Sabuncu, M., Unal, G., Wells, W., Medical Image Computing and Computer-Assisted Intervention – MICCAI 2016, MICCAI 2016, Lecture Notes in Computer Science, Springer, Cham., 9901. <ext-link xlink:href="https://doi.org/10.1007/978-3-319-46723-8_49" ext-link-type="DOI">10.1007/978-3-319-46723-8_49</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Cohen et al.(2017)</label><mixed-citation> Cohen, A. J., Brauer, M., Burnett, R., Anderson, H. R., Frostad, J., Estep, K., Balakrishnan, K., Brunekreef, B., Dandona„ L., Dandona, R., Feigin, V., Freedman, G., Hubbell, B., Jobling, A., Kan, H., Knibbs, L., Liu, Y., Martin, R., Morawska, L., Pope III, C. A., Shin, H., Straif, K., Shaddick, G., Thomas, M., van Dingenen, R., van Donkelaar, A., Vos, T., Murray, C. J. L., and Forouzanfar, M. H.: Estimates and 25-year trends of the global burden of disease attributable to ambient air pollution: an analysis of data from the Global Burden of Diseases Study 2015, The Lancet, 389, 1907–1918, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Deng(2015)</label><mixed-citation>Deng, L.: Estimation of PM<sub>2.5</sub> Spatial Distribution Based on Kriging Interpolation, in: Proceedings of the First International Conference on Information Sciences, Machinery, Materials and Energy, Atlantis Press, ISBN 978-94-62520-67-7, 1791–1794, <ext-link xlink:href="https://doi.org/10.2991/icismme-15.2015.370" ext-link-type="DOI">10.2991/icismme-15.2015.370</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Fujisada et al.(2011)</label><mixed-citation> Fujisada, H., Urai, M., and Iwasaki, A.: Advanced methodology for ASTER DEM generation, IEEE Transactions on Geoscience and Remote Sensing, 49, 5080–5091, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Fujisada et al.(2012)</label><mixed-citation> Fujisada, H., Urai, M., and Iwasaki, A.: Technical methodology for ASTER global DEM, IEEE Transactions on Geoscience and Remote Sensing, 50, 3725–3736, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Goodfellow et al.(2016)</label><mixed-citation>Goodfellow, I., Bengio, Y., and Courville, A.: Deep Learning, MIT Press, <uri>http://www.deeplearningbook.org</uri> (last access: 1 December 2024), 2016.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Health Effects Institute(2019)</label><mixed-citation> Health Effects Institute: State of global air 2019, Health Effects Institute, ISSN 2578-6873, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Inness et al.(2019)</label><mixed-citation>Inness, A., Ades, M., Agustí-Panareda, A., Barré, J., Benedictow, A., Blechschmidt, A.-M., Dominguez, J. J., Engelen, R., Eskes, H., Flemming, J., Huijnen, V., Jones, L., Kipling, Z., Massart, S., Parrington, M., Peuch, V.-H., Razinger, M., Remy, S., Schulz, M., and Suttie, M.: The CAMS reanalysis of atmospheric composition, Atmos. Chem. Phys., 19, 3515–3556, <ext-link xlink:href="https://doi.org/10.5194/acp-19-3515-2019" ext-link-type="DOI">10.5194/acp-19-3515-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>Keller et al.(2021)</label><mixed-citation>Keller, C. A., Knowland, K. E., Duncan, B. N., Liu, J., Anderson, D. C., Das,  S., Lucchesi, R. A., Lundgren, E. W., Nicely, J. M., Nielsen, E., Ott, L. E., Saunders, E., Strode, S. A., Wales, P. A., Jacob, D. J., and Pawson, S.: Description of the NASA GEOS Composition Forecast Modeling System GEOS-CF v1.0, Journal of Advances in Modeling Earth Systems, 13, e2020MS002413, <ext-link xlink:href="https://doi.org/10.1029/2020MS002413" ext-link-type="DOI">10.1029/2020MS002413</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Koo et al.(2024)</label><mixed-citation>Koo, J.-S., Wang, K.-H., Yun, H.-Y., Kwon, H.-Y., and Koo, Y.-S.: Development of PM<sub>2.5</sub> Forecast Model Combining ConvLSTM and DNN in Seoul, Atmosphere, 15, 1276, <ext-link xlink:href="https://doi.org/10.3390/atmos15111276" ext-link-type="DOI">10.3390/atmos15111276</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>LeCun et al.(1989)</label><mixed-citation>LeCun, Y., Boser, B., Denker, J., Henderson, D., Howard, R., Hubbard, W., and Jackel, L.: Handwritten Digit Recognition with a Back-Propagation Network, in: Advances in Neural Information Processing Systems, edited by: Touretzky, D., vol. 2, Morgan-Kaufmann, <uri>https://proceedings.neurips.cc/paper_files/paper/1989/file/53c3bce66e43be4f209556518c2fcb54-Paper.pdf</uri> (last access: 1 December 2024), 1989.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Levy et al.(2013)</label><mixed-citation>Levy, R. C., Mattoo, S., Munchak, L. A., Remer, L. A., Sayer, A. M., Patadia, F., and Hsu, N. C.: The Collection 6 MODIS aerosol products over land and ocean, Atmos. Meas. Tech., 6, 2989–3034, <ext-link xlink:href="https://doi.org/10.5194/amt-6-2989-2013" ext-link-type="DOI">10.5194/amt-6-2989-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Lipponen et al.(2022)</label><mixed-citation>Lipponen, A., Reinvall, J., Väisänen, A., Taskinen, H., Lähivaara, T., Sogacheva, L., Kolmonen, P., Lehtinen, K., Arola, A., and Kolehmainen, V.: Deep-learning-based post-process correction of the aerosol parameters in the high-resolution Sentinel-3 Level-2 Synergy product, Atmos. Meas. Tech., 15, 895–914, <ext-link xlink:href="https://doi.org/10.5194/amt-15-895-2022" ext-link-type="DOI">10.5194/amt-15-895-2022</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Loveland and Belward(1997)</label><mixed-citation> Loveland, T. R. and Belward, A.: The international geosphere biosphere programme data and information system global land cover data set (DISCover), Acta Astronautica, 41, 681–689, 1997.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Lundberg and Lee(2017)</label><mixed-citation>Lundberg, S. M. and Lee, S.-I.: A Unified Approach to Interpreting Model Predictions, in: Advances in Neural Information Processing Systems 30, edited by: Guyon, I., Luxburg, U. V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., and Garnett, R., Curran Associates, Inc., 4765–4774, <uri>http://papers.nips.cc/paper/7062-a-unified-approach-to-interpreting-model-predictions.pdf</uri> (last access: 1 December 2024), 2017.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Michaelides et al.(2019)</label><mixed-citation>Michaelides, S., Lane, J., and Kasparis, T.: Effect of Vertical Air Motion on Disdrometer Derived Z-R Coefficients, Atmosphere, 10, 77, <ext-link xlink:href="https://doi.org/10.3390/atmos10020077" ext-link-type="DOI">10.3390/atmos10020077</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Muthukumar et al.(2022)</label><mixed-citation>Muthukumar, P., Cocom, E., Nagrecha, K., Comer, D., Burga, I., Taub, J., Calvert, C. F., Holm, J., and Pourhomayoun, M.: Predicting PM<sub>2.5</sub> atmospheric air pollution using deep learning with meteorological data and ground-based observations and remote-sensing satellite big data, Air Quality, Atmosphere, &amp; Health, 15, 1221–1234, <ext-link xlink:href="https://doi.org/10.1007/s11869-021-01126-3" ext-link-type="DOI">10.1007/s11869-021-01126-3</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>NASA/METI/AIST/Japan Spacesystems and US/Japan ASTER Science Team(2019)</label><mixed-citation>NASA/METI/AIST/Japan Spacesystems and US/Japan ASTER Science Team: ASTER Global Digital Elevation Model V003, distributed by NASA EOSDIS Land Processes DAAC, <ext-link xlink:href="https://doi.org/10.5067/ASTER/ASTGTM.003" ext-link-type="DOI">10.5067/ASTER/ASTGTM.003</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>OpenStreetMap(2023)</label><mixed-citation>OpenStreetMap: OpenStreetMap Wiki – Key:highway, <uri>https://wiki.openstreetmap.org/wiki/Key:highway</uri> (last access: 13 April 2023), 2023.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Paszke et al.(2019)</label><mixed-citation>Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., Antiga, L., Desmaison, A., Köpf, A., Yang, E., DeVito, Z., Raison, M., Tejani, A., Chilamkurthy, S., Steiner, B., Fang, L., Bai, J., and Chintala, S.: PyTorch: An Imperative Style, High-Performance Deep Learning Library, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.1912.01703" ext-link-type="DOI">10.48550/arXiv.1912.01703</ext-link>, 3 December 2019.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Pope and Dockery(2006)</label><mixed-citation>Pope, C. A. I. and Dockery, D. W.: Health Effects of Fine Particulate Air Pollution: Lines that Connect, Journal of the Air &amp; Waste Management Association, 56, 709–742, <ext-link xlink:href="https://doi.org/10.1080/10473289.2006.10464485" ext-link-type="DOI">10.1080/10473289.2006.10464485</ext-link>, 2006.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Porcheddu et al.(2024)</label><mixed-citation>Porcheddu, A., Kolehmainen, V., Lähivaara, T., and Lipponen, A.: Post-process correction improves the accuracy of satellite PM<sub>2.5</sub> retrievals, Atmos. Meas. Tech., 17, 5747–5764, <ext-link xlink:href="https://doi.org/10.5194/amt-17-5747-2024" ext-link-type="DOI">10.5194/amt-17-5747-2024</ext-link>, 2024. </mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Raissi et al.(2019)</label><mixed-citation>Raissi, M., Perdikaris, P., and Karniadakis, G.: Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations, Journal of Computational Physics, 378, 686–707, <ext-link xlink:href="https://doi.org/10.1016/j.jcp.2018.10.045" ext-link-type="DOI">10.1016/j.jcp.2018.10.045</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Randles et al.(2017)</label><mixed-citation> Randles, C. A., da Silva, A., Buchard, V., Colarco, P. R., Darmenov, A. S., Govindaraju, R. C., Smirnov, A., Ferrare, R. A., Hair, J. W., Shinozuka, Y., and Flynn C.: The MERRA-2 aerosol reanalysis, 1980 onward. Part I: System description and data assimilation evaluation, Journal of Climate, 30, 6823–6850, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Ronneberger et al.(2015)</label><mixed-citation>Ronneberger, O., Fischer, P., and Brox, T.: U-Net: Convolutional Networks for  Biomedical Image Segmentation, edited by: Navab, N., Hornegger, J., Wells, W., Frangi, A., Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, MICCAI 2015, Lecture Notes in Computer Science, Springer, Cham., 9351, <ext-link xlink:href="https://doi.org/10.1007/978-3-319-24574-4_28" ext-link-type="DOI">10.1007/978-3-319-24574-4_28</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Shi et al.(2015)</label><mixed-citation>Shi, X., Chen, Z., Wang, H., Yeung, D.-Y., kin Wong, W., and chun Woo, W.: Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting, arXiv [preprint], <ext-link xlink:href="https://doi.org/10.48550/arXiv.1506.04214" ext-link-type="DOI">10.48550/arXiv.1506.04214</ext-link>, 13 June 2015.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Sulla-Menashe and Friedl(2018)</label><mixed-citation>Sulla-Menashe, D. and Friedl, M. A.: User guide to collection 6 MODIS land cover (MCD12Q1 and MCD12C1) product, USGS, Reston, VA, USA, 1, 18, <uri>https://modis.ornl.gov/documentation/guides/MCD12_User_Guide_V6.pdf</uri> (last access: 13 April 2023), 2018.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Tang et al.(2024)</label><mixed-citation>Tang, D., Zhan, Y., and Yang, F.: A review of machine learning for modeling air quality: Overlooked but important issues, Atmospheric Research, 300, 107261, <ext-link xlink:href="https://doi.org/10.1016/j.atmosres.2024.107261" ext-link-type="DOI">10.1016/j.atmosres.2024.107261</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Thangavel et al.(2022)</label><mixed-citation>Thangavel, P., Park, D., and Lee, Y.-C.: Recent Insights into Particulate Matter (PM<sub>2.5</sub>)-Mediated Toxicity in Humans: An Overview, International Journal of Environmental Research and Public Health, 19, 7511, <ext-link xlink:href="https://doi.org/10.3390/ijerph19127511" ext-link-type="DOI">10.3390/ijerph19127511</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Wang et al.(2020)</label><mixed-citation>Wang, Z., Shrestha, R., and Román, M. O.: VIIRS/NPP Lunar BRDF-Adjusted Nighttime Lights Yearly L3 Global 15 arc second Linear Lat Lon Grid [data set], <ext-link xlink:href="https://doi.org/10.5067/VIIRS/VNP46A4.001" ext-link-type="DOI">10.5067/VIIRS/VNP46A4.001</ext-link>, 2020.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Machine learning data fusion for high spatio-temporal resolution PM<sub>2.5</sub></article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Belward et al.(1999)</label><mixed-citation>
      
Belward, A. S., Estes, J. E., and Kline, K. D.: The IGBP-DIS global 1-km land-cover data set DISCover: A project overview, Photogrammetric Engineering and Remote Sensing, 65, 1013–1020, 1999.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Bessho et al.(2016)</label><mixed-citation>
      
Bessho, K., Date, K., Hayashi, M., Ikeda, A., Imai, T., Inoue, H., Kumagai, Y., Miyakawa, T., Murata, H., Ohno, T., Okuyama, A., Oyama, R., Sasaki, Y.,  Shimazu, Y., Shimoji, K., Sumida, Y., Suzuki, M., Taniguchi, H., Tsuchiyama,  H., Uesawa, D., Yokota, H., and Yoshida, R.: An Introduction to Himawari-8/9  – Japan's New-Generation Geostationary Meteorological Satellites, Journal of  the Meteorological Society of Japan. Ser. II, 94, 151–183,
<a href="https://doi.org/10.2151/jmsj.2016-009" target="_blank">https://doi.org/10.2151/jmsj.2016-009</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Bishop and Bishop(2024)</label><mixed-citation>
      
Bishop, C. M. and Bishop, H.: Deep Learning: Foundations and Concepts,  Springer International Publishing, Cham, ISBN 978-3-031-45467-7, <a href="https://doi.org/10.1007/978-3-031-45468-4" target="_blank">https://doi.org/10.1007/978-3-031-45468-4</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Buchard et al.(2016)</label><mixed-citation>
      
Buchard, V., Da Silva, A., Randles, C., Colarco, P., Ferrare, R., Hair, J., Hostetler, C., Tackett, J., and Winker, D.: Evaluation of the surface PM<sub>2.5</sub> in Version 1 of the NASA MERRA Aerosol Reanalysis over the United States, Atmospheric Environment, 125, 100–111, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Chu et al.(2016)</label><mixed-citation>
      
Chu, Y., Liu, Y., Li, X., Liu, Z., Lu, H., Lu, Y., Mao, Z., Chen, X., Li, N., Ren, M., Liu, F., Tian, L., Zhu, Z., and Xiang, H.: A Review on Predicting Ground PM<sub>2.5</sub> Concentration Using Satellite Aerosol Optical Depth, Atmosphere, 7, <a href="https://doi.org/10.3390/atmos7100129" target="_blank">https://doi.org/10.3390/atmos7100129</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Çiçek et al.(2016)</label><mixed-citation>
      
Çiçek, Ö., Abdulkadir, A., Lienkamp, S. S., Brox, T., and Ronneberger, O.: 3D U-Net: Learning Dense Volumetric Segmentation from Sparse Annotation, edited by: Ourselin, S., Joskowicz, L., Sabuncu, M., Unal, G., Wells, W., Medical Image Computing and Computer-Assisted Intervention – MICCAI 2016, MICCAI 2016, Lecture Notes in Computer Science, Springer, Cham., 9901. <a href="https://doi.org/10.1007/978-3-319-46723-8_49" target="_blank">https://doi.org/10.1007/978-3-319-46723-8_49</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Cohen et al.(2017)</label><mixed-citation>
      
Cohen, A. J., Brauer, M., Burnett, R., Anderson, H. R., Frostad, J., Estep, K., Balakrishnan, K., Brunekreef, B., Dandona„ L., Dandona, R., Feigin, V., Freedman, G., Hubbell, B., Jobling, A., Kan, H., Knibbs, L., Liu, Y., Martin, R., Morawska, L., Pope III, C. A., Shin, H., Straif, K., Shaddick, G., Thomas, M., van Dingenen, R., van Donkelaar, A., Vos, T., Murray, C. J. L., and Forouzanfar, M. H.: Estimates and 25-year trends of the global burden of disease attributable to ambient air pollution: an analysis of data from the Global Burden of Diseases Study 2015, The Lancet, 389, 1907–1918, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Deng(2015)</label><mixed-citation>
      
Deng, L.: Estimation of PM<sub>2.5</sub> Spatial Distribution Based on Kriging Interpolation, in: Proceedings of the First International Conference on Information Sciences, Machinery, Materials and Energy, Atlantis Press, ISBN 978-94-62520-67-7, 1791–1794, <a href="https://doi.org/10.2991/icismme-15.2015.370" target="_blank">https://doi.org/10.2991/icismme-15.2015.370</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Fujisada et al.(2011)</label><mixed-citation>
      
Fujisada, H., Urai, M., and Iwasaki, A.: Advanced methodology for ASTER DEM
generation, IEEE Transactions on Geoscience and Remote Sensing, 49,
5080–5091, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Fujisada et al.(2012)</label><mixed-citation>
      
Fujisada, H., Urai, M., and Iwasaki, A.: Technical methodology for ASTER global DEM, IEEE Transactions on Geoscience and Remote Sensing, 50, 3725–3736, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Goodfellow et al.(2016)</label><mixed-citation>
      
Goodfellow, I., Bengio, Y., and Courville, A.: Deep Learning, MIT Press,
<a href="http://www.deeplearningbook.org" target="_blank"/> (last access: 1 December 2024), 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Health Effects Institute(2019)</label><mixed-citation>
      
Health Effects Institute: State of global air 2019, Health Effects Institute, ISSN 2578-6873, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Inness et al.(2019)</label><mixed-citation>
      
Inness, A., Ades, M., Agustí-Panareda, A., Barré, J., Benedictow, A., Blechschmidt, A.-M., Dominguez, J. J., Engelen, R., Eskes, H., Flemming, J., Huijnen, V., Jones, L., Kipling, Z., Massart, S., Parrington, M., Peuch, V.-H., Razinger, M., Remy, S., Schulz, M., and Suttie, M.: The CAMS reanalysis of atmospheric composition, Atmos. Chem. Phys., 19, 3515–3556, <a href="https://doi.org/10.5194/acp-19-3515-2019" target="_blank">https://doi.org/10.5194/acp-19-3515-2019</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>Keller et al.(2021)</label><mixed-citation>
      
Keller, C. A., Knowland, K. E., Duncan, B. N., Liu, J., Anderson, D. C., Das,  S., Lucchesi, R. A., Lundgren, E. W., Nicely, J. M., Nielsen, E., Ott, L. E.,
Saunders, E., Strode, S. A., Wales, P. A., Jacob, D. J., and Pawson, S.:
Description of the NASA GEOS Composition Forecast Modeling System GEOS-CF
v1.0, Journal of Advances in Modeling Earth Systems, 13, e2020MS002413,
<a href="https://doi.org/10.1029/2020MS002413" target="_blank">https://doi.org/10.1029/2020MS002413</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Koo et al.(2024)</label><mixed-citation>
      
Koo, J.-S., Wang, K.-H., Yun, H.-Y., Kwon, H.-Y., and Koo, Y.-S.: Development of PM<sub>2.5</sub> Forecast Model Combining ConvLSTM and DNN in Seoul, Atmosphere, 15, 1276, <a href="https://doi.org/10.3390/atmos15111276" target="_blank">https://doi.org/10.3390/atmos15111276</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>LeCun et al.(1989)</label><mixed-citation>
      
LeCun, Y., Boser, B., Denker, J., Henderson, D., Howard, R., Hubbard, W., and Jackel, L.: Handwritten Digit Recognition with a Back-Propagation Network, in: Advances in Neural Information Processing Systems, edited by: Touretzky, D., vol. 2, Morgan-Kaufmann, <a href="https://proceedings.neurips.cc/paper_files/paper/1989/file/53c3bce66e43be4f209556518c2fcb54-Paper.pdf" target="_blank"/> (last access: 1 December 2024), 1989.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Levy et al.(2013)</label><mixed-citation>
      
Levy, R. C., Mattoo, S., Munchak, L. A., Remer, L. A., Sayer, A. M., Patadia, F., and Hsu, N. C.: The Collection 6 MODIS aerosol products over land and ocean, Atmos. Meas. Tech., 6, 2989–3034, <a href="https://doi.org/10.5194/amt-6-2989-2013" target="_blank">https://doi.org/10.5194/amt-6-2989-2013</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Lipponen et al.(2022)</label><mixed-citation>
      
Lipponen, A., Reinvall, J., Väisänen, A., Taskinen, H., Lähivaara, T., Sogacheva, L., Kolmonen, P., Lehtinen, K., Arola, A., and Kolehmainen, V.: Deep-learning-based post-process correction of the aerosol parameters in the high-resolution Sentinel-3 Level-2 Synergy product, Atmos. Meas. Tech., 15, 895–914, <a href="https://doi.org/10.5194/amt-15-895-2022" target="_blank">https://doi.org/10.5194/amt-15-895-2022</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Loveland and Belward(1997)</label><mixed-citation>
      
Loveland, T. R. and Belward, A.: The international geosphere biosphere programme data and information system global land cover data set (DISCover),
Acta Astronautica, 41, 681–689, 1997.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Lundberg and Lee(2017)</label><mixed-citation>
      
Lundberg, S. M. and Lee, S.-I.: A Unified Approach to Interpreting Model Predictions, in: Advances in Neural Information Processing Systems 30, edited by: Guyon, I., Luxburg, U. V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., and Garnett, R., Curran Associates, Inc., 4765–4774, <a href="http://papers.nips.cc/paper/7062-a-unified-approach-to-interpreting-model-predictions.pdf" target="_blank"/> (last access: 1 December 2024), 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Michaelides et al.(2019)</label><mixed-citation>
      
Michaelides, S., Lane, J., and Kasparis, T.: Effect of Vertical Air Motion on
Disdrometer Derived Z-R Coefficients, Atmosphere, 10, 77, <a href="https://doi.org/10.3390/atmos10020077" target="_blank">https://doi.org/10.3390/atmos10020077</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Muthukumar et al.(2022)</label><mixed-citation>
      
Muthukumar, P., Cocom, E., Nagrecha, K., Comer, D., Burga, I., Taub, J., Calvert, C. F., Holm, J., and Pourhomayoun, M.: Predicting PM<sub>2.5</sub> atmospheric air pollution using deep learning with meteorological data and ground-based observations and remote-sensing satellite big data, Air Quality,
Atmosphere, &amp; Health, 15, 1221–1234, <a href="https://doi.org/10.1007/s11869-021-01126-3" target="_blank">https://doi.org/10.1007/s11869-021-01126-3</a>,
2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>NASA/METI/AIST/Japan Spacesystems and US/Japan ASTER Science
Team(2019)</label><mixed-citation>
      
NASA/METI/AIST/Japan Spacesystems and US/Japan ASTER Science Team: ASTER
Global Digital Elevation Model V003, distributed by NASA EOSDIS Land
Processes DAAC, <a href="https://doi.org/10.5067/ASTER/ASTGTM.003" target="_blank">https://doi.org/10.5067/ASTER/ASTGTM.003</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>OpenStreetMap(2023)</label><mixed-citation>
      
OpenStreetMap: OpenStreetMap Wiki – Key:highway, <a href="https://wiki.openstreetmap.org/wiki/Key:highway" target="_blank"/> (last access: 13 April 2023), 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Paszke et al.(2019)</label><mixed-citation>
      
Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., Antiga, L., Desmaison, A., Köpf, A., Yang, E., DeVito, Z., Raison, M., Tejani, A., Chilamkurthy, S., Steiner, B., Fang, L., Bai, J., and Chintala, S.: PyTorch: An Imperative Style, High-Performance Deep Learning Library, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.1912.01703" target="_blank">https://doi.org/10.48550/arXiv.1912.01703</a>, 3 December 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Pope and Dockery(2006)</label><mixed-citation>
      
Pope, C. A. I. and Dockery, D. W.: Health Effects of Fine Particulate Air
Pollution: Lines that Connect, Journal of the Air &amp; Waste Management
Association, 56, 709–742, <a href="https://doi.org/10.1080/10473289.2006.10464485" target="_blank">https://doi.org/10.1080/10473289.2006.10464485</a>, 2006.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Porcheddu et al.(2024)</label><mixed-citation>
      
Porcheddu, A., Kolehmainen, V., Lähivaara, T., and Lipponen, A.: Post-process correction improves the accuracy of satellite PM<sub>2.5</sub> retrievals, Atmos. Meas. Tech., 17, 5747–5764, <a href="https://doi.org/10.5194/amt-17-5747-2024" target="_blank">https://doi.org/10.5194/amt-17-5747-2024</a>, 2024.


    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Raissi et al.(2019)</label><mixed-citation>
      
Raissi, M., Perdikaris, P., and Karniadakis, G.: Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations, Journal of Computational
Physics, 378, 686–707, <a href="https://doi.org/10.1016/j.jcp.2018.10.045" target="_blank">https://doi.org/10.1016/j.jcp.2018.10.045</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Randles et al.(2017)</label><mixed-citation>
      
Randles, C. A., da Silva, A., Buchard, V., Colarco, P. R., Darmenov, A. S., Govindaraju, R. C., Smirnov, A., Ferrare, R. A., Hair, J. W., Shinozuka, Y., and Flynn C.: The MERRA-2
aerosol reanalysis, 1980 onward. Part I: System description and data
assimilation evaluation, Journal of Climate, 30, 6823–6850, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Ronneberger et al.(2015)</label><mixed-citation>
      
Ronneberger, O., Fischer, P., and Brox, T.: U-Net: Convolutional Networks for  Biomedical Image Segmentation, edited by: Navab, N., Hornegger, J., Wells, W., Frangi, A., Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, MICCAI 2015, Lecture Notes in Computer Science, Springer, Cham., 9351, <a href="https://doi.org/10.1007/978-3-319-24574-4_28" target="_blank">https://doi.org/10.1007/978-3-319-24574-4_28</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Shi et al.(2015)</label><mixed-citation>
      
Shi, X., Chen, Z., Wang, H., Yeung, D.-Y., kin Wong, W., and chun Woo, W.: Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting, arXiv [preprint], <a href="https://doi.org/10.48550/arXiv.1506.04214" target="_blank">https://doi.org/10.48550/arXiv.1506.04214</a>, 13 June 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Sulla-Menashe and Friedl(2018)</label><mixed-citation>
      
Sulla-Menashe, D. and Friedl, M. A.: User guide to collection 6 MODIS land cover (MCD12Q1 and MCD12C1) product, USGS, Reston, VA, USA, 1, 18, <a href="https://modis.ornl.gov/documentation/guides/MCD12_User_Guide_V6.pdf" target="_blank"/> (last access: 13 April 2023), 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Tang et al.(2024)</label><mixed-citation>
      
Tang, D., Zhan, Y., and Yang, F.: A review of machine learning for modeling air quality: Overlooked but important issues, Atmospheric Research, 300,
107261, <a href="https://doi.org/10.1016/j.atmosres.2024.107261" target="_blank">https://doi.org/10.1016/j.atmosres.2024.107261</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Thangavel et al.(2022)</label><mixed-citation>
      
Thangavel, P., Park, D., and Lee, Y.-C.: Recent Insights into Particulate Matter (PM<sub>2.5</sub>)-Mediated Toxicity in Humans: An Overview, International Journal of Environmental Research and Public Health, 19, 7511,
<a href="https://doi.org/10.3390/ijerph19127511" target="_blank">https://doi.org/10.3390/ijerph19127511</a>, 2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Wang et al.(2020)</label><mixed-citation>
      
Wang, Z., Shrestha, R., and Román, M. O.: VIIRS/NPP Lunar BRDF-Adjusted Nighttime Lights Yearly L3 Global 15 arc second Linear Lat Lon Grid [data set], <a href="https://doi.org/10.5067/VIIRS/VNP46A4.001" target="_blank">https://doi.org/10.5067/VIIRS/VNP46A4.001</a>, 2020.

    </mixed-citation></ref-html>--></article>
