Identifying optimal co-location calibration periods for low-cost sensors
Misti Levy Zamora
Colby Buehler
Abhirup Datta
Drew R. Gentner
Kirsten Koehler
- Final revised paper (published on 13 Jan 2023)
- Supplement to the final revised paper
- Preprint (discussion started on 28 Apr 2022)
- Supplement to the preprint
Interactive discussion
Status: closed
RC1: 'Comment on egusphere-2022-200', Anonymous Referee #1, 26 May 2022
General Comments
Overall, the paper is very well written and presents its conclusions clearly. I recommend it for publication following some minor additions and corrections noted below.
My biggest concern is that, using a fixed total amount of sensor data, as the calibration period is increased, the evaluation period is decreased. Comparing results across calibration periods of different lengths could potentially be misleading. Ideally, calibration periods of the same length would be used in all cases; however, this is practically difficult with limited data. A comment to this effect should be added in the paper as a caveat for the presented results.
While the use of linear regression approaches to calibration is a reasonable way to approach the analysis, it is by no means the only approach to low-cost sensor calibration. In particular, methods for accounting for the non-linear impacts of various predictors, including quadratic regressions and various machine learning approaches, may be more appropriate. While it is not necessary to exhaustively investigate these here, some mention of these alternative approaches should be made, for example as a topic of future work. Similarly, while using simple “coverage” as a metric to test the appropriateness of the calibration period to the evaluation period is a reasonable first approach, more sophisticated comparisons of the statistical distributions of predictors across these periods could also be applied in future analysis and might also be mentioned here.
I would strongly suggest that the datasets used for this analysis be made publicly available if this has not already been done, and the data repository be linked in the paper. This will facilitate other researchers investigating the dataset to determine appropriate calibration strategies for their particular needs.
Specific Comments
Line 16: “mm” should be micrometers.
Line 18: “randomly” should be “randomly selected”.
Line 80: “was” should be “were”.
Line 90: What was the increment of the calibration durations? E.g., “ranging from 1 to 180 consecutive days in X day increments”. This can be inferred from the presented results, but it is best to explicitly state it as well.
Line 115: Please elaborate on what is meant by “time”, e.g., hour of the day, day of the week, age of the sensor, etc. Based on later comments I assume it is the age of the sensor, but this should be specified.
Line 202: “2.5” should be subscripted.
Figure 4: For completeness, plots similar to these should be created for all sensors and all predictors and included in the supplemental information.
Line 269: Remove “compound”.
Table 3 Caption: The bottom of the caption may be cut off. Also, the “required conditions” should be specified here.
Line 302: “was” should be “were”.
Line 311: Remove “and”.
Line 317-319: Regarding the statement “…the co-location duration was not as predictive of data accuracy…” this might not be entirely supported by your results as you present them, since you do not explicitly perform a meta-analysis of using either duration or coverage as a predictor of performance metrics. You might consider doing such an analysis, or slightly rephrasing this statement.
Line 334: The “<link>” is missing here.
Citation: https://doi.org/10.5194/egusphere-2022-200-RC1
AC1: 'Reply on RC1', Misti Levy Zamora, 30 Nov 2022
Overall, the paper is very well written and presents its conclusions clearly. I recommend it for publication following some minor additions and corrections noted below.
The authors thank the reviewers for taking the time to provide comments. We have addressed the comments below.
My biggest concern is that, using a fixed total amount of sensor data, as the calibration period is increased, the evaluation period is decreased. Comparing results across calibration periods of different lengths could potentially be misleading. Ideally, calibration periods of the same length would be used in all cases; however, this is practically difficult with limited data. A comment to this effect should be added in the paper as a caveat for the presented results.
Response: We acknowledge that this is a constraint caused by a limited data set. Given that we consider several pollutants, no single season captures the full dynamic range of all the sensors. Also, we wanted to evaluate the calibrations consistently across sensor types in as many seasons as the available data permitted. To supplement our current analysis, we have added an additional supplemental figure (SF4; see below), referenced in Section 3.2, for an analysis of the PM data in which the 250 randomly selected calibration periods were drawn from between 02/2019 and 11/2019 and the evaluation period was held to 11/2019-02/2020 for all of the considered calibrations. The results were consistent with those from the original method.
We have also added to the methods section, “Ideally, evaluation periods of the same length would be used in all cases; however, this is challenging with a limited data set and when comparing pollutants with notably different seasonal trends.”
While the use of linear regression approaches to calibration is a reasonable way to approach the analysis, it is by no means the only approach to low-cost sensor calibration. In particular, methods for accounting for the non-linear impacts of various predictors, including quadratic regressions and various machine learning approaches, may be more appropriate. While it is not necessary to exhaustively investigate these here, some mention of these alternative approaches should be made, for example as a topic of future work. Similarly, while using simple “coverage” as a metric to test the appropriateness of the calibration period to the evaluation period is a reasonable first approach, more sophisticated comparisons of the statistical distributions of predictors across these periods could also be applied in future analysis and might also be mentioned here.
Response: We agree that linear approaches may not be the best method for all low-cost sensors, but the popularity of linear models is due to their simplicity, which makes them accessible to more users. We investigated simple, interpretable non-linear forms such as quadratic terms and splines with one breakpoint in a previous paper (Evaluating the performance of using low-cost sensors to calibrate for cross-sensitivities in a multipollutant network, https://doi.org/10.1021/acsestengg.1c00367). The calibration models proposed in that work were applied in this manuscript.
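[Editor's illustration] For readers wanting a concrete picture of the two model forms discussed here, the sketch below is a minimal example, not the authors' code; the CSV file and column names (sensor_raw, temp, rh, reference) are hypothetical.

```python
# Minimal sketch, assuming co-located sensor and reference data in a CSV
# with hypothetical columns: sensor_raw, temp, rh, reference.
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression

df = pd.read_csv("colocation.csv")
y = df["reference"].to_numpy()

# Linear calibration: reference ~ raw sensor signal + temperature + RH
X_lin = df[["sensor_raw", "temp", "rh"]].to_numpy()
linear = LinearRegression().fit(X_lin, y)

# Quadratic alternative: add a squared sensor term to capture curvature
X_quad = np.column_stack([X_lin, df["sensor_raw"].to_numpy() ** 2])
quadratic = LinearRegression().fit(X_quad, y)

def rmse(model, X):
    """Root-mean-square error of a fitted model on the calibration data."""
    return float(np.sqrt(np.mean((model.predict(X) - y) ** 2)))

print(f"linear RMSE: {rmse(linear, X_lin):.2f}")
print(f"quadratic RMSE: {rmse(quadratic, X_quad):.2f}")
```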
We have added discussion on suggested future work: “Future work should evaluate whether employing methods that account for any non-linear responses of key predictors can further optimize the calibration of low-cost sensors, as well as whether more sophisticated comparisons of the statistical distributions of predictors across calibration periods are beneficial.”
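[Editor's illustration] One possible form of that “more sophisticated comparison,” sketched here as an assumption rather than anything from the manuscript, is a two-sample Kolmogorov-Smirnov test on a predictor's values in the calibration versus evaluation periods; names are illustrative.

```python
# Sketch: compare full predictor distributions between the calibration and
# evaluation periods, rather than range coverage alone.
import numpy as np
from scipy.stats import ks_2samp

def distribution_mismatch(cal_values: np.ndarray, eval_values: np.ndarray) -> float:
    """Two-sample KS statistic: 0 = identical distributions, 1 = disjoint."""
    result = ks_2samp(cal_values, eval_values)
    return result.statistic

# e.g., flag calibration windows whose temperature distribution differs
# strongly from the planned evaluation period:
# if distribution_mismatch(cal_temps, eval_temps) > 0.3: reject the window
```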
I would strongly suggest that the datasets used for this analysis be made publicly available if this has not already been done, and the data repository be linked in the paper. This will facilitate other researchers investigating the dataset to determine appropriate calibration strategies for their particular needs.
Response: SEARCH center researchers plan to post data from the network together, including this subset of data. The authors can also share the data from this publication upon request to the corresponding author.
Specific Comments
Line 16: “mm” should be micrometers.
Response: This has been corrected.
Line 18: “randomly” should be “randomly selected”.
Response: This has been corrected.
Line 80: “was” should be “were”.
Response: This has been corrected.
Line 90: What was the increment of the calibration durations? E.g., “ranging from 1 to 180 consecutive days in X day increments”. This can be inferred from the presented results, but it is best to explicitly state it as well.
Response: This has been added. “For each hypothetical calibration co-location scenario (i.e., ranging from 1 to 180 consecutive days in 1-day increments), 250 sample calibration test periods of that duration were randomly selected.”
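[Editor's illustration] A minimal sketch of this sampling scheme, assuming an hourly time series; the names and helper structure are ours, not the authors'.

```python
# Sketch of the sampling described above: for each duration from 1 to 180
# days, draw 250 random windows of consecutive days from an hourly record.
import numpy as np
import pandas as pd

HOURS_PER_DAY = 24
rng = np.random.default_rng(0)  # seeded for reproducibility

def sample_windows(series: pd.Series, duration_days: int, n_samples: int = 250):
    """Yield (start, end) positional indices for random consecutive windows."""
    window = duration_days * HOURS_PER_DAY
    last_start = len(series) - window
    for start in rng.integers(0, last_start + 1, size=n_samples):
        yield int(start), int(start + window)

# for d in range(1, 181):                      # 1 to 180 days, 1-day steps
#     for start, end in sample_windows(data, d):
#         fit the calibration on data[start:end]; evaluate on held-out data
```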
Line 115: Please elaborate on what is meant by “time”, e.g., hour of the day, day of the week, age of the sensor, etc. Based on later comments I assume it is the age of the sensor, but this should be specified.
Response: In a previous publication, we assessed the change in baseline response over time. Time here refers to the time at which the data were collected. The fitted coefficients (betas) therefore describe how the sensor response changes per unit of time over the calibration period in a way not accounted for by the other predictors. We have clarified in the text, “The CO sensor model included temperature, RH, and time, where time refers to the current date and time that the data were collected.”
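[Editor's illustration] A hedged sketch of one way to encode such a time predictor, with assumed file and column names: time enters as elapsed days, so its coefficient reads as drift per day.

```python
# Sketch, assuming a CSV with hypothetical columns: timestamp, co_raw, temp,
# rh, co_reference. The coefficient on "time" estimates response change per
# day not explained by the other predictors.
import pandas as pd
import statsmodels.api as sm

df = pd.read_csv("co_colocation.csv", parse_dates=["timestamp"])
df["time"] = (df["timestamp"] - df["timestamp"].min()).dt.total_seconds() / 86400

X = sm.add_constant(df[["co_raw", "temp", "rh", "time"]])
model = sm.OLS(df["co_reference"], X).fit()
print(model.params["time"])  # estimated drift per day
```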
Line 202: “2.5” should be subscripted.
Response: This has been corrected.
Figure 4: For completeness, plots similar to these should be created for all sensors and all predictors and included in the supplemental information.
Response: This has been added in Supplemental Figure 5. “The O3 sensor is an example of another sensor that exhibits a cross-sensitivity to another common pollutant (NO2; not shown in the main text), which has been demonstrated in a previous work (Levy Zamora, 2022). Additional examples of coverage of key variables for all the sensors are shown in Supplemental Figure 5.”
Line 269: Remove “compound”.
Response: This has been corrected.
Table 3 Caption: The bottom of the caption may be cut off. Also, the “required conditions” should be specified here.
Response: This has been corrected. “Table 3. Comparison of the median RMSE (µg/m3) for PM2.5 from 1-week calibration periods with different coverages of temperature and RH conditions. Only calibration periods with more than 50% coverage of the PM2.5 concentration range were included in the table (>50% corresponds to 26 µg/m3 or more in this dataset). For four scenarios (e.g., PM2.5 coverage > 50%, RH coverage > 50%, T coverage > 20%), the 1st percentile RMSE, the 99th percentile RMSE, and the percentage of calibrations that exhibited all required conditions (e.g., RH > X% and T > X%) are shown (1st – 99th percentile; %). For comparison, the median (1st – 99th percentile) of the PM2.5 1-week calibration periods from the full data set (i.e., no coverage requirements) was 6.6 µg/m3 (3.1 – 18.3 µg/m3).”
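[Editor's illustration] A sketch of the coverage metric under our reading of “coverage” as the fraction of a variable's full-record range spanned by a calibration window; function and column names are hypothetical. With the dataset's ~52 µg/m3 PM2.5 range, a 50% threshold corresponds to the 26 µg/m3 quoted in the caption.

```python
# Sketch: coverage = fraction of a variable's full-dataset range that a
# candidate calibration window spans.
import pandas as pd

def coverage(window: pd.Series, full_record: pd.Series) -> float:
    """Fraction of the full-record range spanned by the calibration window."""
    full_range = full_record.max() - full_record.min()
    return (window.max() - window.min()) / full_range

# keep only windows meeting all required conditions, e.g.:
# ok = (coverage(win["pm25"], df["pm25"]) > 0.5
#       and coverage(win["rh"], df["rh"]) > 0.5
#       and coverage(win["temp"], df["temp"]) > 0.2)
```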
Line 302: “was” should be “were”.
Response: This has been corrected.
Line 311: Remove “and”.
Response: This has been corrected.
Line 317-319: Regarding the statement “…the co-location duration was not as predictive of data accuracy…” this might not be entirely supported by your results as you present them, since you do not explicitly perform a meta-analysis of using either duration or coverage as a predictor of performance metrics. You might consider doing such an analysis, or slightly rephrasing this statement.
Response: We have modified the text to state “While longer co-location periods up to several months generally improved the performance of the sensor, optimal calibration could be produced from shorter co-location lengths if the calibration period covered the span of conditions likely to be encountered during the evaluation period.”
Line 334: The “<link>” is missing here.
Response: Thank you for noting this. We have modified it to state, “This material is available free of charge via the internet at https://egusphere.copernicus.org/preprints/2022/egusphere-2022-200/egusphere-2022-200-supplement.pdf.”
Supplemental Figure 4. To supplement our current analysis method, in which the evaluation period is flexible in order to evaluate more seasons, here we show an analysis of the PM data in which the 250 randomly selected calibration periods were drawn from between 02/2019 and 11/2019 and the evaluation period was held to 11/2019-02/2020 for all of the considered calibrations. The potential range of A) RMSE and B) correlation coefficients (r) for a given co-location length. C) The starting times of each of the 250 calibrations for the one-day analysis are indicated in red, and the evaluation period is shown in gray.
Supplemental Figure 5. Additional examples of coverage of key variables for all five sensors using 1-week calibration scenarios. A-C) PM (Temperature, RH, and PM concentration range), D-F) CO (Temperature, RH, and CO concentration range), G-I) NO2 (Temperature, RH, NO2 concentration range, O3 concentration range, and NO concentration range), J-L) NO (Temperature, RH, NO concentration range, and CO concentration range), and M-O) O3 (Temperature, RH, O3 concentration range, and NO2 concentration range). The bluer squares indicate lower RMSE values (more accurate calibrations).
RC2: 'Comment on egusphere-2022-200', Sreekanth Vakacherla, 02 Nov 2022
Comments on “Optimizing co-location calibration periods for low-cost sensors” by Zamora et al.
At the outset, the manuscript was very well drafted and the analysis was thorough.
Some of my comments are below:
- The title of the paper is ‘optimizing … calibration periods …’, but I feel that this work hasn’t optimized the period; instead, it gives suggestions on how to optimize. Authors can think of tweaking the title a bit.
- Typo in line 4 of the abstract (mm)
- Page 3, line numbers 59 and 60: instead of mentioning longer and shorter, please specify the actual duration.
- Section 2.2: Line 4: what is meant by ‘total duration’? Suggest rephrasing lines 2 – 4 for clarity. Currently, the sentence is confusing.
- Page 5, line 115/116: does the variable ‘time’ refer to cumulative time or hour of the day?
- Suggest providing mean/summary of pollutant values for the study period in any one of the tables (or as a separate table)
- Suggest providing NRMSE values in addition to RMSE values, to give an idea of the error as a percentage of the mean
- The figure captions can be shortened
- Section 3.2: RMSE of 4 and r of 0.6, what are the criteria for these values?
Citation: https://doi.org/10.5194/egusphere-2022-200-RC2
AC2: 'Reply on RC2', Misti Levy Zamora, 30 Nov 2022
Comments on “Optimizing co-location calibration periods for low-cost sensors” by Zamora et al.,
At the outset, the manuscript was very well drafted, and the analysis was thorough.
The authors thank the reviewer for taking the time to provide comments during this busy season. We have addressed the comments below.
Some of my comments are below:
- The title of the paper is ‘optimizing … calibration periods …’, but I feel that this work hasn’t optimized the period; instead, it gives suggestions on how to optimize. Authors can think of tweaking the title a bit.
Response: This has been modified to “Identifying optimal co-location calibration periods for low-cost sensors”.
- Typo in line 4 of the abstract (mm)
Response: This has been corrected.
- Page 3, line numbers 59 and 60: instead of mentioning longer and shorter, please specify the actual duration.
Response: This has been added. “They reported that longer calibration periods (i.e., six weeks) produced fits with lower bias than fits from shorter calibration periods (i.e., 1 week). In that study, the one-week calibrations yielded the best R2 values.”
- Section 2.2: Line 4: what is meant by ‘total duration’? Suggest rephrasing lines 2 – 4 for clarity. Currently, the sentence is confusing.
Response: The sentence has been modified to state, “For each hypothetical calibration co-location scenario (i.e., ranging from 1 to 180 consecutive days in 1-day increments), 250 sample calibration test periods of that duration were randomly selected.”
- Page 5, line 115/116: does the variable ‘time’ refer to cumulative time or hour of the day?
Response: In a previous publication, we assessed the change in baseline response over time. Time here refers to the time at which the data were collected. The fitted coefficients (betas) therefore describe how the sensor response changes per unit of time over the calibration period in a way not accounted for by the other predictors. We have clarified in the text, “The CO sensor model included temperature, RH, and time, where time refers to the current date and time that the data were collected.”
- Suggest providing mean/summary of pollutant values for the study period in any one of the tables (or as a separate table)
Response: This has been added as Supplemental Table 1.
Supplemental Table 1. Descriptive statistics of the reference data used in the calibration models from the full year.

Pollutant        Mean    Median    Range
PM2.5 (µg/m3)    8.4     7         1-53
CO (ppb)         261     199       100-2950
NO2 (ppb)        8.5     5.5       1-58
O3 (ppb)         30.1    32        1-110
NO (ppb)         3.1     0.5       0.1-136.5
- Suggest providing NRMSE values in addition to RMSE values, to give an idea of the error as a percentage of the mean
Response: We have created an NRMSE table (Supplemental Table 2).
Supplemental Table 2. NRMSE values, shown as median (1st – 99th percentile), for each sensor and calibration duration.

        1 Day                1 Week               1 Month              6 Weeks              3 Months             6 Months
PM2.5   0.85 (0.13 – 8.11)   0.12 (0.08 – 0.42)   0.09 (0.08 – 0.25)   0.09 (0.08 – 0.18)   0.09 (0.08 – 0.14)   0.08 (0.08 – 0.09)
CO      4.21 (0.19 – 48.8)   0.24 (0.05 – 3.23)   0.09 (0.04 – 0.35)   0.07 (0.05 – 0.21)   0.06 (0.04 – 0.14)   0.06 (0.05 – 0.08)
NO2     0.4 (0.11 – 2.27)    0.15 (0.08 – 0.35)   0.11 (0.07 – 0.19)   0.12 (0.07 – 0.15)   0.11 (0.07 – 0.14)   0.10 (0.07 – 0.13)
O3      7.3 (0.14 – 119.59)  0.55 (0.11 – 2.43)   0.16 (0.08 – 0.31)   0.16 (0.08 – 0.24)   0.17 (0.08 – 0.28)   0.15 (0.12 – 0.18)
NO      0.12 (0.03 – 5.06)   0.06 (0.02 – 0.56)   0.03 (0.02 – 0.06)   0.03 (0.02 – 0.04)   0.03 (0.02 – 0.03)   0.02 (0.02 – 0.03)
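[Editor's illustration] The discussion does not state the normalization used; the sketch below assumes RMSE divided by the observed reference range, which is consistent with the tabulated values (the 6.6 µg/m3 median 1-week PM2.5 RMSE from the Table 3 caption above, over the ~52 µg/m3 PM2.5 range in Supplemental Table 1, gives ~0.13, close to the tabulated 0.12).

```python
# Sketch of one plausible NRMSE definition: RMSE normalized by the range of
# the reference observations. The normalization is our assumption, not a
# statement from the authors.
import numpy as np

def nrmse(predicted: np.ndarray, reference: np.ndarray) -> float:
    """Root-mean-square error divided by the reference range."""
    rmse = float(np.sqrt(np.mean((predicted - reference) ** 2)))
    return rmse / float(reference.max() - reference.min())
```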
- The figure captions can be shortened
Response: We would prefer to keep the figure captions thorough so that they stand alone for the reader. We will shorten them if necessary to meet journal requirements.
- Section 3.2: RMSE of 4 and r of 0.6, what are the criteria for these values?
Response: These two values were selected because most of the satisfactory models exhibited RMSE and r values within these bounds. They were not included as a recommendation for evaluating PM models, but as a way to compare the seasonal differences described in that paragraph.
Citation: https://doi.org/10.5194/egusphere-2022-200-AC2