the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Spectral Analysis Approach for Assessing Accuracy of a Low-Cost Air Quality Sensor Network Data
Vijay Kumar
Dinushani Senarathna
Supraja Gurajala
William Olsen
Shantanu Sur
Sumona Mondal
Suresh Dhaniyala
Abstract. Extensive monitoring of PM2.5 is critical for understanding changes in local air quality due to policy measures. With the emergence of low-cost air quality sensor networks, high spatio-temporal measurements of air quality are now possible. However, the sensitivity, noise, and accuracy of field data from such networks are not fully understood. In this study, we use frequency analysis of a two-year data record of PM2.5 from both the EPA and Purple Air (PA), a low-cost sensor network, to identify the contribution of individual periodic sources to local air quality in Chicago. We find that sources with time periods of 4, 8, 12, and 24 hours have significant but varying relative contributions to the data for both networks. Further analysis reveals that the 8- and 12-hour sources are traffic-related and photochemistry-driven, respectively, and that the contribution of both these sources is significantly lower in the PA data than in the EPA data. We also use a correction model that accounts for the contribution of relative humidity and temperature, and we observe that the PA temporal components can be made to match those of the EPA over the medium- and long-term but not over the short-term. Thus, standard approaches to improve the accuracy of low-cost sensor network data will not result in unbiased measurements. The strong source dependence of low-cost sensor network measurements demands exceptional care in the analysis of ambient data from these networks, particularly when used to evaluate and drive air quality policies.
- Preprint
(2886 KB) -
Supplement
(103 KB) - BibTeX
- EndNote
Vijay Kumar et al.
Status: open (until 06 Jul 2023)
-
RC1: 'Comment on amt-2023-62', Anonymous Referee #1, 04 May 2023
reply
Review
Spectral Analysis Approach for Assessing Accuracy of a Low-Cost Air Quality Sensor Network Data
The study aims to investigate the accuracy and reliability of data from low-cost air quality sensor networks, which have emerged as a promising tool for high spatio-temporal monitoring of air quality. The authors employed a frequency analysis to identify the contributions of individual periodic sources to local air quality in Chicago using a two-year data record of PM2.5 from both the EPA and Purple Air networks. Their findings highlight the source dependence of low-cost sensor network measurements and emphasize the need for exceptional care in the analysis of ambient data from these networks, particularly when used to evaluate and implement air quality policies.
The manuscript is well-written and clearly structured. Furthermore, the spectral analysis method used to evaluate the data is novel in the context of low-cost sensors. To criticize, the study does not yield scientifically significant new findings. The main conclusion is that the sensor response is source dependent and that without proper calibration, there is a high risk of data misinterpretation. This is the same conclusion that has been made in most, if not all, studies investigating low-cost sensors.
I recommend publication of this study because I consider the approach used to evaluate sensor data valuable. Furthermore, I encourage the authors to consider the following points to strengthen the impact of the research.
- The manuscript lacks a clear statement of the limitations of the study. This would be useful for readers to contextualize the findings.
- The authors suggest that the results of their analysis will provide guidance in devising new approaches to calibrate data from low-cost sensors, but it is unclear what specific recommendations are being made. A more explicit discussion of the implications of the study's findings for future research and policy decisions would have strengthened the overall impact of the article.
- Line 89 foe;d typo?
- 1 Consider adding a scale for the map and units for the population density.
- Local correction model; justify the use of both temperature and relative humidity in multiple linear regression. These variables are correlated with each other which can be problematic as the independent variables in MLR should be independent.
- Line 294 “Our analysis clearly demonstrates for the first time that the PA network’s very different sensitivity to different sources.” I suggest you remove the “first time” part here.
Citation: https://doi.org/10.5194/amt-2023-62-RC1
Vijay Kumar et al.
Vijay Kumar et al.
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
210 | 64 | 8 | 282 | 25 | 4 | 3 |
- HTML: 210
- PDF: 64
- XML: 8
- Total: 282
- Supplement: 25
- BibTeX: 4
- EndNote: 3
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1