Articles | Volume 18, issue 8
https://doi.org/10.5194/amt-18-1771-2025
https://doi.org/10.5194/amt-18-1771-2025
Research article
 | 
24 Apr 2025
Research article |  | 24 Apr 2025

Performance validation and calibration conditions for novel dynamic baseline tracking air sensors in long-term field monitoring

Han Mei, Peng Wei, Meisam Ahmadi Ghadikolaei, Nirmal Kumar Gali, Ya Wang, and Zhi Ning
Abstract

The rapid expansion of low-cost sensor networks for air quality monitoring necessitates rigorous calibration to ensure data accuracy. Despite numerous published field calibration studies, a universal and comprehensive assessment of factors affecting sensor calibration remains elusive, leading to potential discrepancies in data quality across different networks. This study deployed eight sensor-based monitors in strategically chosen locations continuously for 2 years in Hong Kong, Macau and Shanghai. These locations covered a wide range of climatic conditions: Hong Kong's subtropical climate, Macau's similar yet distinct urban environment and Shanghai's more variable climate. Each monitor employed a dynamic baseline tracking method for the gas sensors, which isolates the concentration signals from temperature and humidity effects, enhancing the sensors' accuracy and reliability. This strategic deployment ensured that the sensors' performance and calibration processes were tested across diverse atmospheric conditions. The tests, which involved evaluating the validation performance by analysing randomly selected calibration sample subsets ranging from 1–15 d, indicated that the length of the calibration period, pollutant concentration range and time-averaging period are pivotal for sensor calibration quality. We determined that a 5–7 d calibration period minimizes calibration coefficient errors, and a wider concentration range improves the validation R2 values for all sensors, suggesting the necessity of setting specific concentration range thresholds. A time-averaging period of at least 5 min for data with 1 min resolution was recommended to enable optimal calibration in field operation. This study emphasizes the need for a comprehensive calibration assessment and the importance of considering environmental variability in the sensor calibration condition. These findings offer methodological guidance for the calibration of other sensor types, providing a reference for future research in the field of sensor calibration.

Share
1 Introduction

Rapid advancements in low-cost air sensor technology have led to a significant increase in their applications across various fields. These sensors offer a promising and cost-effective solution for monitoring air pollution at finer spatial scales and in novel locations compared to traditional monitoring methodologies. This has resulted in a growing demand for high-quality sensor data. Calibration is an indispensable component of the air sensor operational paradigm, pivotal for securing accurate and dependable data. By establishing a relationship between the raw sensor output and the corresponding reference measurement, calibration enhances the accuracy and precision of sensor data.

Common calibration methods include multi-point calibration with standard gases, controlled chamber calibration (Papapostolou et al., 2017; Sousan et al., 2016), on-site probe gas calibrations and field side-by-side calibration (Bisignano et al., 2022; Holstius et al., 2014; Spinelle et al., 2015, 2017). The first three methods are laboratory-based methods or rely on standard gas, which inherently possess constraints and may not fully capture the intricate interactions of multiple pollutants and environmental factors encountered in situ. This limitation raises concerns about the applicability of calibration results obtained under controlled conditions to actual monitoring environments (Castell et al., 2017). An alternative approach is the side-by-side calibration, which involves co-locating sensor systems with reference analysers in real-world environmental settings for a designated duration. This approach leverages the natural fluctuation in pollutant concentrations and environmental factors to accurately calibrate the sensors' sensitivity and baseline response. It is advantageous due to its procedural simplicity, negligible consumable usage and cost efficiency compared to laboratory assessments (Castell et al., 2017). Consequently, it has become a preferred method for calibration in various scenarios (Spinelle et al., 2015, 2017).

Despite the widespread application of field side-by-side calibration, several critical concerns persist regarding the process. The primary issue is the selection of appropriate calibration conditions. Factors like the calibration duration (Levy Zamora et al., 2023), the pollutant concentration distribution (Levy Zamora et al., 2023), sensor ageing (Li et al., 2021), interference from non-target gases (Cross et al., 2017), the impacts of temperature and relative humidity (Ariyaratne et al., 2023), and various gas sampling methods can significantly influence the calibration results. Determining the optimal conditions is crucial for achieving accurate and reliable calibration results. Extensive research has focused on the calibration period, the most frequently reported in recent studies (Datta et al., 2020; Gao et al., 2015; Kim et al., 2018; Mukherjee et al., 2019; Pinto et al., 2014; Spinelle et al., 2015, 2017; Topalovic et al., 2019). One study by Levy Zamora et al. (2023) evaluated the impact of the calibration period on calibration quality using calibration periods of up to 6 months from 1 year of PM2.5, CO, NO, NO2 and O3 data in Maryland, USA. Their results indicated diminishing improvements in median root-mean-square error (RMSE) for calibration periods longer than 6 weeks for all sensors. Levy Zamora et al. (2023) also highlighted the importance of considering environmental conditions during the calibration period that are similar to those encountered during the evaluation period to achieve the best calibration performance. Another study by Okorn and Hannigan (2021) reported that longer calibration periods (i.e. 6 weeks) resulted in fits with a reduced bias compared to fits obtained from shorter calibration periods (1 week), while the 1-week calibrations yielded the best R2 (coefficient of determination) values. While these studies have offered valuable insights into sensor field calibration conditions, more discussion is needed on other calibration factors, particularly the range of pollutant concentrations during the calibration period and the selection of time-averaging length for raw data before calibration. These two factors are more straightforward to standardize and quantify compared to other factors, as they can be defined with specific numerical values and consistent measurement protocols, making it easier to compare results across different studies and ensure reliable calibration outcomes.

In addition to investigating calibration conditions, an equally crucial aspect to address is the development of an effective calibration model that can accommodate these optimized sensor calibration conditions. This study focuses on electrochemical sensors, which are the most common type of air quality gas sensor. Laboratory studies of commercial electrochemical sensors have shown linear correlations between current response and gas analyte concentration under stable temperature and relative humidity (RH) conditions (Collier-Oxandale et al., 2020; Mead et al., 2013; Wei et al., 2018; Zong et al., 2021). However, due to their electrochemical characteristics, these sensors often exhibit non-linear responses to variations in temperature and RH (Ariyaratne et al., 2023; Li et al., 2021; Wei et al., 2018), which can significantly impair their performance in real-world applications. In the past, most studies adopted generic multiple linear regression (MLR) or machine learning models to calibrate raw sensor data, taking into account various complex variables such as temperature, RH, their gradient and their cross-sensitivity to other pollutants (Datta et al., 2020; Han et al., 2021; Levy Zamora et al., 2023; Si et al., 2020; Topalovic et al., 2019; Wei et al., 2020; Zimmerman et al., 2018). These models, while comprehensive, often face limitations such as the risk of over-fitting, extensive training requirements, restricted applicability, and difficulties in replicating and scaling up for large sensor numbers. Furthermore, the complexity of machine learning models can pose significant barriers for everyday users.

Instead of relying solely on mathematical algorithms for sensor calibration, we assessed a novel dynamic baseline tracking technology designed to physically mitigate temperature and RH effects on sensor signals, allowing these kinds of gas sensor device, i.e. mini air stations (MASs; Sapiens), to output the sensing data most directly related to the concentration signal. By isolating the non-linear influences of temperature and RH on sensor readings, this technology allowed us to focus exclusively on optimum calibration strategy and enabled the development of a refined linear calibration model. Based on the linear calibration model, we further identify the critical factors that influence calibration quality, thus optimizing calibration conditions for NO2, NO, CO and O3 electrochemical sensors. Our research uncovers three pivotal factors that significantly impact sensor calibration and validation performance: calibration period, concentration range and time averaging. By examining these factors' effects on the variation in the sensors' calibration coefficients, we aim to deepen the understanding of sensor calibration processes and enhance the performance of low-cost electrochemical air sensors. This methodology not only simplifies the calibration process but also ensures that the calibration model remains robust and applicable in varied and long-term field conditions.

2 Material and methods

2.1 Data collection

2.1.1 Sensor devices

Eight microsensor-based mini air stations (MAS-AF300, Sapiens), hereinafter referred to as “MASs”, shown in Fig. 1, were utilized in this study for continuous measurements of the air pollutants NO2, NO, O3 and CO under field conditions. Each MAS unit included three or four gas sensors along with a combined RH and temperature sensor (SHT75, Sensirion AG). This study focuses on electrochemical gas sensors for NO2 (Alphasense NO2-B43F), NO (Alphasense NO-B4), CO (Alphasense CO-B4) and O3 (Alphasense OX-B431). Please note that the O3 concentration is determined by calculating the difference between the readings of the oxidizing gas sensor (OX-B431) and the NO2 sensor (NO2-B43F). Furthermore, the MAS system incorporates numerous sophisticated functionalities. All the gas sensors are equipped with the dynamic baseline tracking technology by the manufacturer with details in the following section. The system is also equipped with an active air sampler, ensuring a flow rate of 0.8 L min−1. The sample air undergoes filtration through a Teflon dust filter before directly entering the sensor module, without the implementation of any temperature or humidity control measures. The Teflon dust filter for each MAS will be replaced regularly every month to prevent dust from entering the gas module and causing measurement errors and shortening the sensor life. To mitigate potential drift during long-term deployment, the MAS gas module incorporates an auto-zeroing function. During the zeroing process, the gaseous pollutant measurement module receives air samples from a separate zero module, from which NO, NO2 and O3 have been significantly mitigated. The data collected during the zeroing period are subsequently analysed to rectify any drift effects during the long-term deployment phase as part of the data cleaning procedure. A comprehensive description of this technology and its functional advantages can be found in a paper by Sun et al. (2017). All these incorporated functionalities in the MAS system are aimed at optimizing sensor performance, enhancing measurement accuracy and ensuring their long-term stability.

https://amt.copernicus.org/articles/18/1771/2025/amt-18-1771-2025-f01

Figure 1Structure diagram of a MAS monitoring device (dimensions: 420×320×180mm, H×W×D; weight: 12 kg; power consumption: 15 W).

Download

2.1.2 Measurement campaign details

To assess sensor performance under varying ambient conditions, these MASs were deployed in three distinctively different urban and climatic settings: Hong Kong's humid subtropical climate, Macau's somewhat similar yet distinct urban environment and Shanghai's more variable climatic conditions. Each city featured a co-location campaign with an air quality monitoring station (AQMS), as detailed in Table 1, and the AQMSs were equipped with Federal Equivalent Method (FEM) reference analysers.

Table 1Details of MAS devices in co-location calibrations.

Download Print Version | Download XLSX

The first co-location campaign in Hong Kong involved the four MASs, each equipped with all four types of gas sensors (NO2, NO, CO and O3), which were placed at the Tseung Kwan O AQMS (114.1148° N, 22.3716° E) regulated by the Hong Kong Environmental Protection Department. This station serves as a representative urban site, providing conditions suitable for sensor evaluation in a complex urban environment. In the second co-location campaign, two MASs were located at the Taipa AQMS (113.56882° N, 22.15896° E) in Macau, focusing on NO2, NO and O3 to capture the general urban background conditions unique to the region. The third campaign took place in Shanghai, where two MASs, monitoring NO2, NO and CO, were placed separately alongside two sets of reference analysers at the Waigaoqiao Port 2 site (121.57242° N, 31.36662° E) and Port 4 site (121.65496° N, 31.33302° E). This campaign was also the longest co-location campaign, lasting 22 months, offering a prolonged observation of the diverse and more polluted air quality conditions typical of a major industrial hub. These locations were chosen to ensure a comprehensive analysis across a spectrum of urban pollution levels and environmental conditions.

All eight MAS units were designed to automatically transmit the measured raw sensor signals and concentration data of the pollutants from the MAS to a secure cloud server in real time at 1 min resolution. The reference analyser in Hong Kong provided 1 min time resolution pollutant concentration data, while those in Macau and Shanghai provided hourly-averaged data, enabling us to conduct calibration analysis at varying time resolutions.

2.2 Dynamic baseline tracking method to mitigate environmental effects on sensors

The sensor device (MAS, Sapiens) has deployed a novel gas-sensing technology that enables the isolation of the concentration signal from environmental variables of temperature and RH through a patented dynamic baseline tracking method by the manufacturer, which operates by differentiating between the varying environment and target-pollutant-induced sensor signals using a dual-sensor module. Figure 2 shows the conceptual diagram of the MAS sensor module and general working principle of the dynamic baseline tracking method. This gas sensor system comprises a primary sensor – which is directly exposed to the air, capturing the original signal (designated as ORG) influenced by varying pollutants, temperature and RH – and a proprietary pair of differential filter sensors (designated as PDF) to track the dynamic baseline signal driven only by temperature and RH. The PDF sensor is equipped with a membrane permeable to water molecules that allows water vapour to penetrate through while filtering out the target gas molecules and preventing them from entering the sensor head. The differential signal (measured in volts) between the ORG and PDF sensors decouples the temperature and humidity effects, yielding a pure signal that reflects target gas concentrations.

Each MAS sensor module produces four distinct outputs for a specific pollutant: (i) the ORG sensor signal in volts, VORG; (ii) the PDF sensor signal in volts, VPDF; (iii) the voltage output from the difference in the ORG and PDF sensor signals in volts, VDIFF; and (iv) the concentration output of target gas in parts per billion (ppb), Conc. Each MAS has an onboard algorithm capability that converts sensor signals to concentration, with the conversion automatically performed on board the MAS for real-time concentration output. Equation (1) presents the conversion equation for NO2, NO and CO, where “a” denotes the slope of the equation, which is also indicative of the sensitivity (ppb mV−1) of the electrochemical sensors, and “b” represents the intercept of the equation. For the gas sensors exhibiting cross-sensitivity with non-target gases, an interfering gas correction component can be incorporated. Equation (2) presents the equation for calculating O3 concentrations using the Alphasense OX-B431 sensor with NO2 as an interferent. The coefficient “f” accounts for the cross-interference from NO2, and our empirical data, derived from a substantial number of tests, indicate that “f” typically falls within the range of 0.8–1.2.

(1)Conc(NO2,NO,CO)=a×VDIFF+b(2)Conc(O3)=a×VDIFF+b-f×Conc(NO2)

Prior to initiating the co-location campaign, a 15 d pre-test under field conditions and a laboratory test in the environmental chamber were conducted to demonstrate the method's capability to enhance the sensor performance under varying temperature and humidity conditions.

https://amt.copernicus.org/articles/18/1771/2025/amt-18-1771-2025-f02

Figure 2A conceptual diagram of the PDF-enabled MAS sensor device. In laboratory tests, standard gas with constant concentrations is periodically injected into the PDF and ORG sensors throughout varying temperature and RH cycles to investigate their effects on the sensor performance. The PDF tracks the baseline signal driven only by temperature and RH, while the ORG sensor captures the concentration profile influenced by both the target gas module and environmental conditions. The differential signal between the ORG and PDF sensors decouples the baseline signal induced by temperature and RH, producing a pure signal that reflects the target gas concentrations. This concept is also applicable to ambient conditions, where the differential signal between the paired ORG and PDF sensors demonstrates the accuracy and robustness of PDF technology for ambient air monitoring.

Download

2.3 Impact analysis of three crucial factors on calibration conditions

This study specifically focuses on conducting field tests to identify optimal calibration conditions by examining three primary factors that influence sensor calibration performance: (a) calibration period duration, (b) concentration variation range and (c) time-averaging pre-processing.

2.3.1 Calibration period optimization

Calibration is typically conducted within a specific time frame, constrained by time and resource availability. Standard protocols involve calibrating sensors over durations ranging from a few days to several weeks prior to their utilization in field-monitoring applications. The calibration's effectiveness largely depends on this time frame, referred to as the calibration period. The calibration period test in this study uses subsets of the full co-location period to generate a range of hypothetical calibration periods. We investigated calibration period scenarios ranging from 1–15 d. In each scenario, 500 samples were randomly selected using the numpy.random.choice() function in Python, ensuring randomness and independence in the selection of co-location timing. This approach is intended to create hypothetical scenarios that reflect the diverse conditions and variability under which calibration might occur in real-world sensor calibration practices. Sample sizes of 250, 500 and 1000 were tested and results stabilized with 500 samples, indicating minimal impact from decreasing or increasing the sample size further. The 500 randomly selected calibration periods are illustrated in Fig. S1 in the Supplement, which shows the start times for these periods for NO, with the approach also applied to NO2, CO and O3 sensors.

These calibration samples were used as the training set for each hypothetical calibration period in the calibration model to evaluate the range of potential R2 and RMSE values when applied in the sensor validation periods. Firstly, these samples were standardized to hourly data to facilitate consistent comparisons across various MAS units. The calibration coefficients (slope and intercept) of these samples were calculated as per Eq. (1) or (2). Subsequently, these coefficients were validated using the following month's data by comparing the hourly-calibrated sensor data and hourly reference data. A superior validation performance, indicated by higher R2 values and lower RMSE values, suggests that the calibration period effectively captures the relationship between the calibrated sensor and the reference data, thereby indicating an optimal calibration duration. This evaluation was not limited to the calibration period's immediate outcome; it also included a comparison of R2 and RMSE metrics against the hourly data validation set from the subsequent month. This dual-phase evaluation underscores that the calibration's true merit is better judged during the post-calibration validation phase, adhering to the standard practice of a bounded calibration period followed by an extended validation phase.

2.3.2 Concentration range analysis

We propose the hypothesis that users can strategically select a co-location period to minimize the calibration duration, recognizing that the calibration period is not the sole factor to consider when optimizing instrument co-location for calibration purposes. A critical aspect is to evaluate the representativeness of environmental conditions during the calibration period in relation to those observed during the long-term evaluation periods. Since the influence of temperature and RH on sensor signals has been significantly mitigated, concentration emerges as the key factor that accurately reflects environmental conditions. To analyse how the range of pollutant concentrations during the calibration period affected the sensor validation performance, we compared the validation R2 and RMSE outcomes with the same calibration period length but varied concentration ranges.

Firstly, we segmented the samples into distinct categories based on their concentration ranges while maintaining a constant calibration period. We employed the 5th to 95th percentile of the pollutant concentration in each category to define each range. This approach mitigates the impact of sporadic peak values, ensuring they do not disproportionately affect the overall concentration range assessment. Subsequently, the effectiveness of calibration across these ranges was systematically evaluated by comparing R2 and RMSE metrics during the validation periods in the subsequent month. This strategy enabled a thorough examination of how the concentration range impacts calibration accuracy, providing insights into the optimal range needed for precise sensor calibration.

2.3.3 Time-averaging evaluation

We also evaluated the influence of time averaging on calibration efficacy to identify the optimal data resolution for the best calibration outcomes. Given that reference analysers and sensors can provide data at granular levels, down to minutes or seconds, pre-calibration data processing plays a crucial role in the accuracy of calibration.

In this time-averaging analysis, we compared the calibration performance of data averaged over different time intervals, from minutes to hours. After processing the calibration dataset with varied time-averaging intervals, the resulting calibration coefficients were evaluated against the data from the following month's validation set. For example, for a sample with a calibration period of 1 d, sensor and reference data were averaged over 1, 3, 5, 7, 9, 11, 30, 60, 120 and 180 min and used to determine the sensor coefficients for each time-averaging interval. Following that, these coefficients were independently applied to the following 1-month validation period with hourly data to determine the R2 and RMSE under each time-averaging interval. The ideal time-averaging interval was determined based on the highest R2 and lowest RMSE values obtained in this validation phase, pinpointing the most effective time resolution for calibration.

3 Results and discussion

3.1 MAS sensor performance against temperature and RH variability

Before initiating the long-term co-location campaign, the MAS units equipped with NO2, NO, CO and O3 sensors were tested in Hong Kong, demonstrating the dynamic baseline tracking method's ability to enhance electrochemical sensor performance against varying temperatures and RH. We tested four MAS units and present findings from this one MAS as an example to evaluate the robustness of the PDF technology. During the 15 d pre-test in the summer (1–15 June 2019), temperatures varied between 28 and 42 °C, with RH levels from 45 % to 87 %. The outputs from the PDF sensor, the ORG sensor and the differential output between the paired ORG–PDF sensor are illustrated separately in Fig. 3a–d. The voltage signals from the PDF and ORG sensors were converted into concentration outputs using coefficients derived from Eq. (1). As shown in the figure, even during the typical ambient concentration ranges, the accuracy of the ORG sensor outputs for gases other than CO was notably poor, primarily due to significant influences from field temperature and RH. It was observed that the PDF sensor outputs for all gas pollutants did not exhibit a linear relationship with temperature or RH profiles. Different sensor types demonstrated distinct response patterns to variations in temperature and RH, highlighting the complex non-linear characteristics of electrochemical sensors in relation to the baseline dependence on these environmental factors.

https://amt.copernicus.org/articles/18/1771/2025/amt-18-1771-2025-f03

Figure 3(a–d) Performance validation of the MAS ORG and PDF sensors for detecting NO2, NO, CO and O3 under field conditions in 2019. Panel (e) displays the temperature and RH measured inside the MAS gas sensor modules. Panels (f–i) compare the readings from the ORG sensor and the MAS PDF-enabled sensor with reference measurements.

Download

With the PDF-enabled sensors, the physical separation of the climatic-driven baseline and target-gas-driven sensitivity is demonstrated to be feasible and effective. By subtracting the output of the PDF sensor from that of the ORG sensor, the resulting ORG–PDF output reveals a clear gas concentration profile that aligns closely with reference measurements. This relationship is illustrated in the scatter plots presented in Fig. 3f–i. For NO2, the ORG–PDF sensors showed stronger performance, with a high R2 (0.99) and low RMSE (0.94), compared to the lower R2 (0.44) and higher RMSE (5.80) for the ORG sensors without the PDF module. For NO and O3, the ORG–PDF sensors also demonstrated stronger performance compared to the ORG sensors without the PDF module. Specifically, the ORG–PDF sensors had strong R2 (0.97 for both NO and O3) and low RMSEs (1.72 for NO, 1.05 for O3), while the ORG sensors without the PDF module had weaker R2 (0.73 for NO, 0.59 for O3) and higher RMSEs (5.37 for NO, 4.18 for O3). For CO, the sensors exhibited comparable performance, with R2 around 0.93–0.94 and RMSE values between 16.70–19.00, regardless of the PDF module. We tested four MASs, and the other PDF-enabled sensors are shown in Fig. S2 in the Supplement. Their data quality performance has been consistent with the findings data reported here. These significant discrepancies between the ORG sensor output and ORG–PDF sensor output, especially for NO, NO2 and O3, highlight the importance of the dynamic baseline tracking method in improving the accuracy and reliability of measurements, notably under low-concentration conditions influenced by temperature and RH.

Additionally, laboratory tests in environmental chambers assessed the MAS NO sensor (Fig. S3 in the Supplement), exposing it to broad temperature (0–30 °C) and RH (10 % to 90 %) ranges. Despite these fluctuations, MAS sensors maintained consistent and stable readings after applying the dynamic baseline tracking method, as shown in Fig. S3b, with concentration steps from 50–300 ppb. The outcomes from both field and laboratory tests confirm that the dynamic baseline tracking method effectively neutralizes temperature and RH effects, primarily for NO2, NO and O3 sensors, achieving the desired performance while focusing primarily on concentration factors for subsequent analysis. Similar pre-tests were also conducted with the MAS units in Macau and Shanghai to assess the effectiveness of the dynamic baseline tracking method.

Upon completion of the pre-tests, the long-term field co-location campaigns were initiated. The dynamic baseline tracking method was first evaluated in this study to prove its effectiveness in long-term field tests. The performance of MAS1, particularly for NO and NO2, throughout the campaign is depicted in Figs. S4 and S5 in the Supplement. It should be noted that a single fixed calibration coefficient was used throughout the entire campaign duration. This fixed coefficient enabled the calibrated sensor data to consistently perform well throughout the co-location campaign. The absolute error (sensor  reference) generally stayed within ±5 ppb, and the relative error (absolute error / reference) was primarily under 15 %, indicating effective mitigation of temperature and RH impacts on the sensor's output, even during extended field conditions over a year. Importantly, the long-term analysis in Figs. S4 and S5 showed that selecting suitable calibration coefficients can ensure the sensors' stability and accuracy over prolonged periods. However, dedicating several months or even up to a year for calibration is not feasible in standard practice. Therefore, our main goal is to determine the optimal coefficients from short-term calibration periods to enhance long-term validation performance.

3.2 Impact of the calibration period on sensor calibration

As detailed in Sect. 2.3, we used 500 randomly selected samples for each calibration period, and this process generated 500 sets of calibration slopes and R2 and RMSE values from the validation period. Figure S6 in the Supplement displays the median and the 25th to 75th percentile range of these R2 and RMSE results across all eight MAS units with NO2 and NO sensors and all six units with CO and O3 sensors. Figure 4 extracts the 25th to 75th percentiles of each MAS result and combines them into a boxplot, making the trend across the calibration period more apparent. An increase in the median of R2 (e.g. for NO, R2 improved from 0.83–0.95 as the calibration period went from 1–15 d) coupled with a reduction in the median of RMSE (e.g. for NO, RMSE decreased from 3.71 to 2.12 over the same calibration period) shown in Fig. 4 indicates improved validation performance. The narrowing of the 25th to 75th percentile range across calibration periods (e.g. for NO, R2 range tightened from 0.66–0.96 to 0.90–0.98 as the calibration period went from 1–15 d) further supports this, with a tightening of validation performance towards a steadier state and reduced chance of abnormal calibration.

https://amt.copernicus.org/articles/18/1771/2025/amt-18-1771-2025-f04

Figure 4The range of the validation R2 and RMSE for a given calibration period for all MAS units consists of (a) NO2, (b) NO, (c) CO and (d) O3 sensors. The vertical error bar is the 25 %–75 % distribution of R2 and RMSE for different calibration periods.

Download

In Fig. 4, the most notable enhancements in validation performance were observed within the initial 1–3 d. Beyond this period, the rate of improvement was found to be less clear, with the median R2 increasing by less than 0.02 and the median RMSE decreasing by less than 0.1 (but less than 1 for CO) for further increases in the calibration period. For NO2, NO and O3, the upward trend in validation R2 and the downward trend in RMSE were observed, plateauing after 5 d. CO sensors in most MAS units reach stable R2 after 7 d. This suggests that lengthening the calibration period beyond 5 d for NO2, NO and O3 or 7 d for CO does not markedly benefit sensor data performance. If the sensor users can strategically select the co-location period to minimize the calibration duration, a period of 5–7 d is identified as the most effective for minimizing errors in calibration coefficient and avoiding notably low validation R2 values.

The aforementioned results are based on an average pattern derived from the combined data of all sensors. Figure S6 presents the separate performance of all eight MAS sensors over varying calibration periods. The NO2, NO, CO and O3 sensors in MAS1–4 in Hong Kong and MAS5–6 in Macau exhibited trends consistent with those shown in Fig. 4. A noteworthy observation in Fig. S6a and b is that the NO2 and NO sensors in MAS7 and MAS8 of the Shanghai campaign showed consistent performance over all calibration periods, lacking the trends observed in Fig. 4. Considering that the NO and NO2 concentrations at the site of Shanghai are significantly higher than those in the other two cities, it is hypothesized that the elevated pollutant concentrations at the Shanghai port provided a more favourable calibration condition, thereby diminishing the contribution of the calibration period. Thus, we conclude that for calibration conditions with a narrower concentration range, a calibration period of at least 5–7 d is necessary, whereas more polluted ambient environments are more conducive to sensor concentration calibration. Despite the short calibration duration of 1–3 d, the extensive concentration range assessed contributed to more precise calibration coefficients and improved validation performance, as will be discussed in the next section.

3.3 Impact of concentration range on sensor calibration

Another critical aspect is the impact of the concentration range experienced by the sensors during calibration periods. Figure S6 shows that MAS7 and MAS8 in the Shanghai campaign could achieve accurate and reliable calibration for NO and NO2 within just a day, given their exposure to environments with significant concentration variability. Our second test examined the effect of the concentration range.

Samples in Fig. 4 are grouped based on different concentration ranges, and the results are shown in Figs. 5 and S7 in the Supplement to explore the relationship between calibration period length, concentration range and sensor validation performance, categorizing the MAS units accordingly. For NO2 and NO sensors of MAS7 and MAS8, a separate analysis was essential due to their higher pollutant concentrations compared to other units, as detailed in Table 1. MAS1–6 are evaluated together in Fig. 5 for a lower concentration range, with 90 % of NO2 and NO ranges falling below 40 and 50 ppb, respectively. MAS7 and MAS8 are assessed in Fig. S7 for higher concentration ranges, where 90 % of the readings for both gases exceed these thresholds.

Figure 5 illustrates the calibration conditions at lower concentrations typical of environments like Hong Kong and Macau. The red zone of Fig. 5, indicating higher R2 values, is primarily concentrated in areas with wider concentration ranges. Specifically, when examining the performance of NO2 sensors, the lowest R2 value of 0.55 was recorded in the 0–10 ppb range, while the highest R2 value of 0.75 was recorded in the >50 ppb range. When the calibration period is held constant, an increase in the concentration range boosts the validation R2 from 0.55 to 0.75 with a notable turning point at 40 ppb. However, extending the calibration period without increasing the concentration range does not obviously improve the validation R2. NO, CO and O3 also displayed patterns similar to NO2, with R2 improvements linked to wider concentration ranges. For all gases, the highest R2 values were predominantly observed in the broadest concentration ranges. Therefore, achieving higher validation R2 values above the median, such as R2>0.65 for NO2, R2>0.84 for NO, R2>0.75 for CO and R2>0.95 for O3, requires significant concentration ranges, notably more than 40 ppb for NO2, as well as 10 ppb for NO, 500 ppb for CO and 20 ppb for O3. Reaching these ranges allows the calibration coefficients to stabilize and align closely with those derived from year-long calibration results.

https://amt.copernicus.org/articles/18/1771/2025/amt-18-1771-2025-f05

Figure 5(a) NO2, (b) NO, (c) CO and (d) O3 bubble plots of median R2 of MAS units 1–6 (as located in the low-concentration regions (Hong Kong and Macau)) and two factors: calibration period and concentration range. The size of the bubbles represents the number of samples. The colour represents the median R2 values in corresponding categories. Red represents the higher R2 value, while blue represents the lower R2 value.

Download

The recommended concentration ranges are 40, 10, 500 and 20 ppb for NO2, NO, CO and O3, respectively. The differences in these concentration thresholds for various gas sensors may be attributed to the distribution characteristics of the gas pollutants in the surrounding environment. The NO concentration range of 10 ppb is the lowest, possibly due to the prevalence of high ambient NO concentrations frequently appearing in the form of peaks. When employing the 5th to 95th percentile as the criteria for the concentration range, the NO range is observed to be the lowest among the gases. The higher concentration range analysis in Fig. S7 shows that increasing the concentration range beyond 40 ppb for NO2 and 50 ppb for NO does not improve validation R2 values, further indicating a threshold in the concentration range beyond which no additional sensor performance benefits are observed. This underscores the inadequacy of merely extending the calibration duration, and it is crucial to ensure an adequate concentration range during the calibration period. But beyond a certain concentration range threshold, further increases in the calibration range do not lead to additional improvements in the calibration results.

It is important to acknowledge certain limitations in this section. The range of environmental concentrations tested was limited and may not encompass all possible calibration scenarios. Consequently, we lack sufficient data to support similar conclusions for environments with either significantly larger concentration ranges – such as those where NO, NO2 and O3 concentrations exceed 150 ppb – or with consistently lower concentrations, where values remain below 10 ppb for extended periods. While our findings are applicable to most similar or closely related concentration environments, further investigation is needed to validate these conclusions across a broader spectrum of calibration conditions.

3.4 Impact of time averaging on sensor calibration

Another factor influencing calibration is the time averaging of the raw data, particularly for high-frequency measurements taken at intervals of a minute or seconds. Performing temporal averaging is critical before formulating the calibration equation. As indicated in Table 1, only the reference data from Hong Kong were obtained at a 1 min temporal resolution. Thus, only the data from MAS1–4 will be used for time-averaging evaluation. The time-averaging process aims to enhance the accuracy of calibration coefficients while ensuring a substantial data volume for a reliable calibration process.

Figure 6 presents results from two different perspectives: Fig. 6a–c focus on the time-averaging analysis and the consistency of results across different sensors, while Fig. 6d–f emphasize the patterns observed for varying calibration periods. Figure 6a–c show the performance of the NO2 sensors from MAS1–MAS4 across different time intervals, ranging from 1 min to 3 h. To eliminate the influence of the calibration period and adhere to the principles of single variable analysis, we utilized only 500 calibration samples from each MAS with a fixed calibration period of 1 d. The sensor and reference data for each calibration sample underwent time averaging across intervals of 1, 3, 5, 7, 9, 11, 30, 60, 120 and 180 min. Subsequent calibration and validation led to the determination of the calibration slope, R2 of the validation set and RMSE for these time-averaged intervals. The results reveal a clear trend of improvement across all three metrics with increasing time-averaging intervals, particularly notable between the 1 and 5 min intervals. All four MAS NO2 sensors exhibit a consistent trend in this regard.

https://amt.copernicus.org/articles/18/1771/2025/amt-18-1771-2025-f06

Figure 6(a–c) The potential range of calibration slope, the R2 and the RMSE of the validation set for MAS1–4 NO2 sensors for various time-averaging periods with a calibration period of 1 d. Differently coloured lines represent the results of different MAS units. The vertical error bar is the 25 %–75 % distribution of the results for different categories. (e–f) The calibration slope median, the R2 median and the RMSE median of the validation set for MAS1 NO2 sensors across all calibration periods, with different colours denoting time averages ranging from 1 min to 3 h.

Download

These findings are based on a calibration period of 1 d, and we extended the analysis to other calibration periods. Using MAS1 as an illustrative case, Fig. 6d–f display the trends across different time-averaging periods for various calibration periods. We derived the median values under each category. Analysis of Fig. 6e's vertical axis reveals that, for a 1 d calibration period, R2 values improved after hourly (R2=0.68) and 5 min averaging (R2=0.66) compared to the baseline 1 min data (R2=0.59), with a corresponding reduction in RMSE. For periods exceeding a day, median R2 values exhibited a modest rise from 0.64–0.66 for 1 min data to 0.68–0.70 for hourly data, suggesting the shorter the calibration period, the more pronounced the benefit of longer time averaging. Hence, calibrating with minute-level data over short periods of 1–3 d may lead to sub-optimal validation performance. Similar trends were observed for NO and CO, as shown in Figs. S8 and S9 in the Supplement; however, the trend for O3 shown in Fig. S10 in the Supplement was less pronounced, with only the calibration slope exhibiting a similar pattern. This may be attributed to the unique characteristics of O3 calculations (Eq. 2), where the influence of cross-interference from NO2 affects the results, thereby masking the impact of time averaging.

The results indicate that data averaging over an hour is more suitable for calibration than minute-level data. As depicted in Figs. 6 and S8–S10, a critical juncture is identified at the 5 min mark (highlighted by a green line with a star). After this point, the improvements in validation R2 and RMSE become substantially less obvious. Thus, for data originally recorded at 1 min intervals, applying a time averaging of 5 min or longer boosts the performance of the validation set, aligning the calibration coefficient more closely with the optimal one. The enhanced performance of hourly over minute-level data across various calibration periods warrants further investigation in the next section to understand the underlying factors.

3.5 Potential causes of sensor calibration coefficient variation

We selected a sample from the MAS1 NO2 sensor with a 1 d calibration period to analyse the benefits of hourly over minute-level data averaging. Regression analysis between sensor and reference data was performed for both 1 min and 1 h averages. Initially, data fitting during the calibration period was assessed. The time series plot in Fig. 7a shows that both minute- and hourly-averaged data closely align with the reference. However, obvious differences emerged when computing the calibration equations separately for each time frame. The calibration slope for minute-level data (a=2.70) was substantially lower than that for hourly data (a=3.91), corroborating the trends noted in Fig. 6. This discrepancy is evident in Fig. 7b, where the regression curves for minute-level and hourly data diverge. The orange line for hourly data intersects more closely with the dense cluster of orange dots representing minute-level data, unlike the minute-level data's blue fitting line, which misses this dense area. In the validation phase, applying the distinct calibration coefficients derived for minute and hour averages to the next month's dataset also highlighted clear differences. Figure 7c and d illustrate that minute-level calibration coefficients (blue line) resulted in less consistent sensor data with the reference (R2=0.72, RMSE=5.60) than the hourly data (R2=0.93, RMSE=2.74), especially at lower concentrations.

https://amt.copernicus.org/articles/18/1771/2025/amt-18-1771-2025-f07

Figure 7One of the calibration samples for MAS1's NO2 sensor with a calibration period of 1 d. Panel (a) is the time series, and (b) is the scatter plot of minute-level and hourly data for the NO2 sensor and reference during the calibration period. Panel (c) is the time series, and (d) is the scatter plot of minute-level and hourly data for the NO2 sensor and reference during the validation period. The colour bars in (b) and (d) represent the sample size in each region.

Download

The discrepancy between the two sets of calibration coefficients is further illustrated in the data distribution plots in Fig. S11 in the Supplement, where sensor and reference data distributions for varying time-averaging lengths are compared. As the time-averaging interval increases, the sensor data distribution more closely mirrors the reference data distribution. This observation supports the notion that time averaging can refine the accuracy of the calibration by aligning data distributions, leading to more precise calibration outcomes. This pattern consistently appeared across various samples, MAS units and gases, as described in Sect. 3.4, demonstrating the superior calibration accuracy achieved with longer averaging periods.

Furthermore, we investigated the potential factors for the observed pattern by analysing the residual term in the sensor calibration model from the mathematical perspective. The detailed analysis is provided in Sect. S1 in the Supplement. One plausible explanation is that the predictive capability of the calibration model using minute-level data may be compromised due to data noise. This noise can introduce variability that obscures underlying trends, ultimately leading to less reliable regression results and hindering the model's ability to accurately capture the relationship between sensor and reference data. While this explanation is plausible, we currently lack specific insights into which influential factors may be affecting the regression model. This remains an area for further investigation in our future work.

4 Conclusions

This study aimed to assess the performance of a novel dynamic baseline tracking method equipped with patented PDF gas sensors under different climate conditions and to critically analyse three factors influencing the sensor calibration performance of PDF-enabled NO2, NO, CO and O3 sensors: calibration period, concentration range and time averaging. By co-locating eight MAS units with reference analysers in three cities over a period of up to 22 months, a comprehensive framework for sensor calibration was established. The study utilized a dynamic baseline tracking method, enhancing the consistency between MAS sensor data and reference measurements. This method effectively countered the impacts of temperature and RH, focusing on pollutant concentration as the primary factor for MAS performance assessment.

In the calibration period analysis, equations were derived from 500 randomly selected samples for each period ranging from 1–15 d, with subsequent evaluation against the validation data. Initial improvements in validation performance were notable within the first 1–3 d of the calibration period, stabilizing around 5–7 d. This pattern suggests that extending the calibration period beyond 7 d yields negligible benefits; hence a 5–7 d calibration period is advocated to reduce calibration coefficient errors.

The concentration range assessment indicated that broader ranges enhance the validation R2 values across all gas sensors. This finding emphasizes the necessity of establishing a concentration range threshold to facilitate effective calibration. Optimal ranges were determined as over 40 ppb for NO2, 10 ppb for NO, 500 ppb for CO and 20 ppb for O3, with these thresholds ensuring reliable calibration coefficients and minimizing uncertainty in the results.

Time averaging's impact on calibration was significant, with improved coefficients and validation performance as averaging intervals increased. The 1 d calibration period showed the most substantial improvement, with hourly and 5 min averages providing higher R2 values than 1 min intervals. A 5 min threshold emerged as critical, advocating for a minimum of 5 min averaging to enhance calibration accuracy and align coefficients with the optimal standard.

This study offers comprehensive insights into calibrating electrochemical gas sensors, highlighting the roles of calibration period, concentration range and time averaging. Recommended practices for optimal calibration include (1) a calibration period of 5–7 d using hourly data; (2) a concentration variation range (5th to 95th percentile range) exceeding 40 ppb for NO2, 10 ppb for NO, 500 ppb for CO and 20 ppb for O3; and (3) a time averaging of 5 min or longer, preferably utilizing hourly data. The findings highlight the importance of balancing these factors to achieve optimal calibration outcomes, while extending certain calibration aspects beyond recommended thresholds may not yield additional benefits.

Acknowledging the limitations of this study, which focused exclusively on our MAS sensor technology with its active flow gas sampler, it should be noted that the specific calibration protocol described may not be directly applicable to studies involving different sensor types, commercial sensor packages from various manufacturers or different air sampling methods using passive samplers. Optimal calibration conditions may vary depending on the sensor's specific features and the calibration methods employed. For instance, regarding the optimized calibration period, a duration of at least 5–7 d is necessary for conditions with a narrower concentration range. In contrast, in locations with more polluted ambient environments, a shorter calibration duration of 1–3 d may be sufficient for effective sensor concentration calibration.

Future research endeavours should aim to diversify sensor types and increase the number of test sensors, thereby enhancing the generalizability and practicality of the findings. Nonetheless, the primary objective of this study is to provide methodological insights that can serve as a valuable reference for calibrating various sensor types. The developed dynamic baseline tracking method, along with the determined optimal calibration period, concentration range thresholds and time-averaging period, can inform and guide future research and calibration efforts for a wide range of sensors used in air quality monitoring. By establishing a foundation for standardized calibration approaches, this study contributes to advancing sensor technologies and promoting the generation of reliable and comparable air quality data across diverse monitoring networks.

Code and data availability

Code and data will be made available on request.

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/amt-18-1771-2025-supplement.

Author contributions

HM: writing (original draft), visualization, methodology, data curation, conceptualization. PW: writing (review and editing), validation, methodology, conceptualization. MAG: writing (review and editing), investigation. NKG: writing (review and editing), visualization, investigation. YW: software, methodology. ZN: writing (review and editing), validation, supervision, methodology, conceptualization.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

The authors acknowledge the financial support received from the Research Grants Council of Hong Kong through the General Research Fund (16212022) and also acknowledge the support from the Environmental Protection Department, HKSAR, for providing AQMS data. The authors also appreciate the technical support offered by Sapiens Environmental Technology Co Limited, the manufacturer of the sensor devices used in the study.

Financial support

This research has been supported by the Research Grants Council of Hong Kong, University Grants Committee (grant no. 16212022).

Review statement

This paper was edited by Albert Presto and reviewed by two anonymous referees.

References

Ariyaratne, R., Elangasinghe, M. A., Zamora, M. L., Karunaratne, D. G. G. P., Manipura, A., Jinadasa, K. B. S. N., and Abayalath, K. H. N.: Understanding the effect of temperature and relative humidity on sensor sensitivities in field environments and improving the calibration models of multiple electrochemical carbon monoxide (CO) sensors in a tropical environment, Sensor. Actuat. B-Chem., 390, 133935, https://doi.org/10.1016/j.snb.2023.133935, 2023. 

Bisignano, A., Carotenuto, F., Zaldei, A., and Giovannini, L.: Field calibration of a low-cost sensors network to assess traffic-related air pollution along the Brenner highway, Atmos. Environ., 275, 119008, https://doi.org/10.1016/j.atmosenv.2022.119008, 2022. 

Castell, N., Dauge, F. R., Schneider, P., Vogt, M., Lerner, U., Fishbain, B., Broday, D., and Bartonova, A.: Can commercial low-cost sensor platforms contribute to air quality monitoring and exposure estimates?, Environ. Int., 99, 293–302, https://doi.org/10.1016/j.envint.2016.12.007, 2017. 

Collier-Oxandale, A., Feenstra, B., Papapostolou, V., Zhang, H., Kuang, M., Der Boghossian, B., and Polidori, A.: Field and laboratory performance evaluations of 28 gas-phase air quality sensors by the AQ-SPEC program, Atmos. Environ., 220, 117092, https://doi.org/10.1016/j.atmosenv.2019.117092, 2020. 

Cross, E. S., Williams, L. R., Lewis, D. K., Magoon, G. R., Onasch, T. B., Kaminsky, M. L., Worsnop, D. R., and Jayne, J. T.: Use of electrochemical sensors for measurement of air pollution: correcting interference response and validating measurements, Atmos. Meas. Tech., 10, 3575–3588, https://doi.org/10.5194/amt-10-3575-2017, 2017. 

Datta, A., Saha, A., Zamora, M., Buehler, C., Hao, L., Xiong, F., Gentner, D., and Koehler, K.: Statistical field calibration of a low-cost PM2.5 monitoring network in Baltimore, Atmos. Environ., 242, 117761, https://doi.org/10.1016/j.atmosenv.2020.117761, 2020. 

Gao, M., Cao, J., and Seto, E.: A distributed network of low-cost continuous reading sensors to measure spatiotemporal variations of PM2.5 in Xi'an, China, Environ. Pollut., 199, 56–65, https://doi.org/10.1016/j.envpol.2015.01.013, 2015. 

Han, P., Mei, H., Liu, D., Zeng, N., Tang, X., Wang, Y., and Pan, Y.: Calibrations of Low-Cost Air Pollution Monitoring Sensors for CO, NO2, O3, and SO2, Sensors, 21, 256, https://doi.org/10.3390/s21010256, 2021. 

Holstius, D. M., Pillarisetti, A., Smith, K. R., and Seto, E.: Field calibrations of a low-cost aerosol sensor at a regulatory monitoring site in California, Atmos. Meas. Tech., 7, 1121–1131, https://doi.org/10.5194/amt-7-1121-2014, 2014. 

Kim, J., Shusterman, A. A., Lieschke, K. J., Newman, C., and Cohen, R. C.: The BErkeley Atmospheric CO2 Observation Network: field calibration and evaluation of low-cost air quality sensors, Atmos. Meas. Tech., 11, 1937–1946, https://doi.org/10.5194/amt-11-1937-2018, 2018. 

Levy Zamora, M., Buehler, C., Datta, A., Gentner, D. R., and Koehler, K.: Identifying optimal co-location calibration periods for low-cost sensors, Atmos. Meas. Tech., 16, 169–179, https://doi.org/10.5194/amt-16-169-2023, 2023. 

Li, J., Hauryliuk, A., Malings, C., Eilenberg, S. R., Subramanian, R., and Presto, A. A.: Characterizing the Aging of Alphasense NO2 Sensors in Long-Term Field Deployments, ACS Sens., 6, 2952–2959, https://doi.org/10.1021/acssensors.1c00729, 2021. 

Mead, M. I., Popoola, O. A. M., Stewart, G. B., Landshoff, P., Calleja, M., Hayes, M., Baldovi, J. J., McLeod, M. W., Hodgson, T. F., Dicks, J., Lewis, A., Cohen, J., Baron, R., Saffell, J. R., and Jones, R. L.: The use of electrochemical sensors for monitoring urban air quality in low-cost, high-density networks, Atmos. Environ., 70, 186–203, https://doi.org/10.1016/j.atmosenv.2012.11.060, 2013. 

Mukherjee, A., Brown, S., McCarthy, M., Pavlovic, N., Stanton, L., Snyder, J., D'Andrea, S., and Hafner, H.: Measuring Spatial and Temporal PM2.5 Variations in Sacramento, California, Communities Using a Network of Low-Cost Sensors, Sensors, 19, 4701, https://doi.org/10.3390/s19214701, 2019. 

Okorn, K. and Hannigan, M.: Improving Air Pollutant Metal Oxide Sensor Quantification Practices through: An Exploration of Sensor Signal Normalization, Multi-Sensor and Universal Calibration Model Generation, and Physical Factors Such as Co-Location Duration and Sensor Age, Atmosphere-Basel, 12, 645, https://doi.org/10.3390/atmos12050645, 2021. 

Papapostolou, V., Zhang, H., Feenstra, B. J., and Polidori, A.: Development of an environmental chamber for evaluating the performance of low-cost air quality sensors under controlled conditions, Atmos. Environ., 171, 82–90, https://doi.org/10.1016/j.atmosenv.2017.10.003, 2017. 

Pinto, J., Dibb, J., Lee, B., Rappenglück, B., Wood, E., Levy, M., Zhang, R., Lefer, B., Ren, X., Stutz, J., Tsai, C., Ackermann, L., Golovko, J., Herndon, S., Oakes, M., Meng, Q., Munger, J., Zahniser, M., and Zheng, J.: Intercomparison of field measurements of nitrous acid (HONO) during the SHARP campaign, J. Geophys. Res.-Atmos., 119, 5583–5601, https://doi.org/10.1002/2013JD020287, 2014. 

Si, M., Xiong, Y., Du, S., and Du, K.: Evaluation and calibration of a low-cost particle sensor in ambient conditions using machine-learning methods, Atmos. Meas. Tech., 13, 1693–1707, https://doi.org/10.5194/amt-13-1693-2020, 2020. 

Sousan, S., Koehler, K., Hallett, L., and Peters, T. M.: Evaluation of the Alphasense optical particle counter (OPC-N2) and the Grimm portable aerosol spectrometer (PAS-1.108), Aerosol Sci. Tech., 50, 1352–1365, https://doi.org/10.1080/02786826.2016.1232859, 2016. 

Spinelle, L., Gerboles, M., Villani, M. G., Aleixandre, M., and Bonavitacola, F.: Field calibration of a cluster of low-cost available sensors for air quality monitoring. Part A: Ozone and nitrogen dioxide, Sensor. Actuat. B-Chem., 215, 249–257, https://doi.org/10.1016/j.snb.2015.03.031, 2015.  

Spinelle, L., Gerboles, M., Villani, M. G., Aleixandre, M., and Bonavitacola, F.: Field calibration of a cluster of low-cost commercially available sensors for air quality monitoring. Part B: NO, CO and CO2, Sensor. Actuat. B-Chem., 238, 706–715, https://doi.org/10.1016/j.snb.2016.07.036, 2017. 

Sun, L., Westerdahl, D., and Ning, Z.: Development and Evaluation of A Novel and Cost-Effective Approach for Low-Cost NO2 Sensor Drift Correction, Sensors, 17, 1916, https://doi.org/10.3390/s17081916, 2017. 

Topalovic, D., Davidovic, M., Jovanovic, M., Bartonova, A., Ristovski, Z., and Jovasevic-Stojanovic, M.: In search of an optimal in-field calibration method of low-cost gas sensors for ambient air pollutants: Comparison of linear, multilinear and artificial neural network approaches, Atmos. Environ., 213, 640–658, https://doi.org/10.1016/j.atmosenv.2019.06.028, 2019. 

Wei, P., Ning, Z., Ye, S., Sun, L., Yang, F., Wong, K. C., Westerdahl, D., and Louie, P. K. K.: Impact Analysis of Temperature and Humidity Conditions on Electrochemical Sensor Response in Ambient Air Quality Monitoring, Sensors, 18, 59, https://doi.org/10.3390/s18020059, 2018. 

Wei, P., Sun, L., Anand, A., Zhang, Q., Huixin, Z., Deng, Z., Wang, Y., and Ning, Z.: Development and evaluation of a robust temperature sensitive algorithm for long term NO2 gas sensor network data correction, Atmos. Environ., 230, 117509, https://doi.org/10.1016/j.atmosenv.2020.117509, 2020. 

Zimmerman, N., Presto, A. A., Kumar, S. P. N., Gu, J., Hauryliuk, A., Robinson, E. S., Robinson, A. L., and R. Subramanian: A machine learning calibration model using random forests to improve sensor performance for lower-cost air quality monitoring, Atmos. Meas. Tech., 11, 291–313, https://doi.org/10.5194/amt-11-291-2018, 2018. 

Zong, H., Brimblecombe, P., Sun, L., Wei, P., Ho, K. F., Zhang, Q., Cai, J., Kan, H., Chu, M., Che, W., Lau, A., and Ning, Z.: Reducing the Influence of Environmental Factors on Performance of a Diffusion-Based Personal Exposure Kit, Sensors, 21, 4637, https://doi.org/10.3390/s21144637, 2021. 

Download
Short summary
Long-term field testing across diverse climatic environments is conducted to identify the optimized calibration conditions for NO2, NO, CO and O3 electrochemical sensors. The results uncovered three factors that influence calibration performance: calibration period, concentration range and time averaging. We developed a comprehensive framework for the best sensor calibration practices, which serves as a valuable reference for calibrating various sensor types used in air quality monitoring.
Share