Articles | Volume 18, issue 18
https://doi.org/10.5194/amt-18-4771-2025
© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.Machine learning data fusion for high spatio-temporal resolution PM2.5
Download
- Final revised paper (published on 25 Sep 2025)
- Preprint (discussion started on 14 Feb 2025)
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on egusphere-2024-4056', Anonymous Referee #1, 31 Mar 2025
- AC1: 'Reply on RC1', Andrea Porcheddu, 23 Jun 2025
-
RC2: 'Comment on egusphere-2024-4056', Anonymous Referee #2, 26 May 2025
- AC2: 'Reply on RC2', Andrea Porcheddu, 23 Jun 2025
Peer review completion
AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload
AR by Andrea Porcheddu on behalf of the Authors (04 Jul 2025)
Author's response
Author's tracked changes
Manuscript
ED: Referee Nomination & Report Request started (07 Jul 2025) by Sandip Dhomse
RR by Anonymous Referee #1 (18 Jul 2025)

ED: Publish subject to minor revisions (review by editor) (18 Jul 2025) by Sandip Dhomse

AR by Andrea Porcheddu on behalf of the Authors (28 Jul 2025)
Author's response
Author's tracked changes
Manuscript
ED: Publish as is (30 Jul 2025) by Sandip Dhomse
AR by Andrea Porcheddu on behalf of the Authors (08 Aug 2025)
Manuscript
This study integrates multi-source data, including satellite and ground-based station data, to construct a deep learning model for estimating 24-hour high-resolution PM2.5 data. High spatiotemporal resolution PM2.5 mapping is of significant importance for pollution control and decision-making, and this study represents a useful attempt in this field. However, the following issues need to be addressed:
The study aims to estimate 24-hourly PM2.5 maps at 100 m resolution in urban areas. However, as shown in Table A1, most of the input data have resolutions coarser than 100 m, except for OpenStreetMap roads and DEM data, which are not directly related to PM2.5. How do the authors justify that the estimated PM2.5 resolution truly reaches 100 m?
The paper presents a deep learning-based estimation approach, but the description of the methodology remains unclear. First, Lines 148–149 mention that "The output is a 3-dimensional array containing 24 hourly PM2.5 maps," but Lines 159–160 state that "the output layer is a 3D 1x1x1 convolution," which appears contradictory and should be clarified. Second, the construction of the loss function is confusing—it should ideally be constrained by PM2.5 measurements from ground stations and NOODLESALAD PM2.5, but its current formulation appears overly complex and difficult to understand.
The study aims to estimate 24-hour, 100 m resolution PM2.5 data, but most of the results presented are seasonal or monthly averages. We would like to see 24-hour PM2.5 mapping results. Additionally, the comparison with MERRA2 focuses mainly on accuracy. Could the authors also better illustrate PM2.5’s spatial distribution and gradient variations, or even capture specific pollution emissions?
The study applies explainable AI techniques to explore the importance of different features, showing that SHAP values identify 2-meter air temperature as the most important feature. However, this analysis could be further improved. First, the underlying reasons for why certain variables are important (or not) are not sufficiently explored. Second, a broader perspective could be considered—how much of the variability in PM2.5 can be explained by meteorological variables overall?
The description of NOODLESALAD PM2.5 and its role in this study is unclear. The authors should provide a more detailed explanation rather than merely citing previous studies.
The results and analysis section could be further improved. First, it is recommended to structure the results into separate subsections rather than mixing everything together. Second, the quality of Figures 3–6 should be improved—currently, the font size is too small, and the figure titles could be removed (since the descriptions are already included in the captions). Lastly, additional results, such as 24-hour high-resolution PM2.5 maps, could enhance the persuasiveness of the study.
The references in the paper are somewhat outdated, with few studies from the recent three years included. It is recommended to update and supplement them.
Some minor issues:
(1) Figure 1: Does the figure represent the road network? Please clarify.
(2) Line 134: "3D PM2.5 maps" could be misinterpreted as three-dimensional spatial maps (including altitude). Is this the correct terminology?
(3) Figure 2: The representation is somewhat abstract. It would be better if the inputs and outputs were explicitly illustrated.
(4) Line 279: "consistent with prior findings" should be supported with references.