Articles | Volume 18, issue 18
https://doi.org/10.5194/amt-18-4871-2025
https://doi.org/10.5194/amt-18-4871-2025
Research article
 | 
29 Sep 2025
Research article |  | 29 Sep 2025

A 30-month field evaluation of low-cost CO2 sensors using a reference instrument

Qixiang Cai, Ning Zeng, Xiaoyu Yang, Chi Xu, Zhaojun Wang, and Pengfei Han
Abstract

Low-cost, medium-precision sensor (LCS) networks have emerged as a promising approach for CO2 monitoring under complex urban emission conditions. However, the field performance of these LCSs faces significant challenges from environmental factors (e.g., temperature, humidity) and long-term drifts caused by sensor degradation (e.g., light source aging). In this study, we performed 30 months of co-located observations using LCS units (named SENSE-IAP) alongside a Picarro reference analyzer to evaluate long-term field performance, which is essential for the correction and validation of mid–low-cost CO2 observation networks. The environmental correction system we developed effectively corrected the impact of daily environmental changes, which reduced the root mean square errors (RMSEs) from 5.9 ± 1.2 to 1.6 ± 0.5 ppm for SENSE-IAP. The corrections remained robust against seasonal environmental changes, maintaining daily RMSE typically within 1–3 ppm throughout the 30 months of observation. Long-term drifts, commonly occurred in LCS, produced biases of up to 27.9 ppm over 2 years. Furthermore, seasonal drift cycle contributed up to 25 ppm RMSE after 6 months of deployment. Although the environmental correction system could not address these errors, linear interpolation effectively calibrated the long-term drift, reducing the 30-month RMSE to 2.4 ± 0.2 ppm. To improve the accuracy of high-density CO2 networks using such LCSs, we recommend maintaining a calibration frequency preferably within 3 months and not exceeding 6 months, with optimal calibration performed during both winter and summer to ensure accuracy within 5 ppm. These findings indicate that SENSE-IAP instruments can operate long-term without requiring a return to the laboratory for re-calibration or frequent field standard gas calibration, thereby substantially reducing time, labor, and financial costs.

Share
1 Introduction

Urban CO2 emissions demonstrate complex spatial and temporal variability (Wada et al., 2011) influenced by diverse emission sources (Gurney et al., 2012; Kellett et al., 2013), meteorological factors (Grimmond et al., 2002; Lateb et al., 2016), and potential misinterpretation by biogenic fluxes (Miles et al., 2021). These complexities pose significant challenges in accurately capturing and interpreting the intricate changes in urban CO2 concentrations, highlighting the necessity for more advanced and comprehensive monitoring solutions.

Table 1Types of low-cost and mid-cost sensors.

Download Print Version | Download XLSX

Recent advancements in medium-precision carbon monitoring technologies have enabled the establishment of high-density sensor networks with low to mid cost and medium precision, presenting a viable and competitive monitoring strategy (Müller et al., 2020; Shusterman et al., 2018). This approach effectively addresses the challenges posed by CO2 variability in urban environments, where complex emission sources, vegetative carbon sinks, and dynamic meteorological conditions interact. Unlike high-precision instruments (0.1 ppm) such as Picarro cavity ring-down spectroscopy (CRDS) analyzers (Picarro, 2023), which cost approximately USD 100 000; low-cost (USD 30–500); and mid-cost (USD 1000–5000) sensors utilizing non-dispersive infrared (NDIR) technology (Table 1) (Han et al., 2025), offer a 1–10 ppm accuracy at a fraction of the price (USD 30–5000) – representing a cost reduction of more than an order of magnitude. This enhanced affordability facilitates large-scale deployment, making these sensors particularly attractive for comprehensive urban CO2 monitoring systems (Lopez-Coto et al., 2017; Turner et al., 2016; Wu et al., 2016; Zeng et al., 2021).

While cost-effective, NDIR sensors are sensitive to environmental changes and often exhibit long-term drifts and abrupt jumps. Noise, environmental sensitivity, and temporal drifts result in raw measurements that typically have more significant errors and uncertainties than the accuracy and resolution required for urban CO2 monitoring. The accuracy of these sensors typically depends on correction methods that account for environmental factors such as temperature, humidity, and pressure. However, the sensitivities of these variables can vary significantly, presenting major challenges in calibrating large sensor networks (Bigi et al., 2018; Delaria et al., 2021; Hagan et al., 2018; Martin et al., 2017). With careful correction – using environmental chambers and co-located high-precision instruments – these sensors can achieve short-term measurement accuracy within ± 1–3 ppm under controlled laboratory conditions (Cai et al., 2024; Grange et al., 2025; Müller et al., 2020).

Additionally, unlike high-precision instruments, NDIR sensors are more prone to temporal drift and fluctuations. To mitigate these issues, sensors are periodically re-calibrated at the laboratory or undergo in situ field calibration using traceable standard gases. While essential, these correction processes are labor-intensive and time-consuming. Some networks adopt alternative approaches. For instance, sensor-specific drift slope can be predetermined before deployment, with offsets corrected using the lower percentile (5 %–10 %) of observations across the entire network (Shusterman et al., 2016). Delaria et al. (2021) demonstrated that using the median value from at least 12 sites with minimal temperature dependence as a reference can maintain network precision at 3.6 ppm. Another method involves determining drift by comparing nearby instruments under specific weather conditions when horizontal CO2 gradients are small (Müller et al., 2020). A widely used and robust calibration method involves automatic or manual standard gas injections. In the Beijing and Jinan networks, we applied this method across more than 160 instruments, with automatic standard gas calibration performed weekly. Results showed mean biases of −1.28 to −0.64 ppm at a 1-month scale (Cai et al., 2024). Manual standard gas calibration is particularly useful in mobile observations, such as on-road vehicle measurements and vertical profile observations using tethered balloons (Liu et al., 2021; Bao et al., 2020). However, calibration costs can be significant. For automatic standard gas calibration (weekly frequency), expenses reach approximately USD 300 per station per year, consuming two 8 L, 10 MPa gas tanks (one as a working standard and the other for quality control). Consequently, maintaining 100 stations with such low-cost sensors would incur an annual cost of USD 30 000 for standard gas alone.

Currently, several locations, such as California, Paris, Switzerland, and Beijing, have established high-density CO2 monitoring networks utilizing NDIR sensors (Table 1). For instance, the Berkeley Environmental Air-quality and CO2 Network (BEACO2N) in California, USA, utilizes the Vaisala CarboCap GMP343 sensor (mid-cost), with a raw accuracy of ± 3 ppm +1 % reading (Vaisala, 2020). After correcting for bias and temporal drift using the in situ method, the observation accuracy is approximately 1–4 ppm (Shusterman et al., 2016, 2018), while the reported accuracy improved to 1.6–3.6 ppm after temperature correction (Delaria et al., 2021). The Carbosense CO2 sensor network in Switzerland (Müller et al., 2020), which uses the SenseAir LP8 sensor (low-cost), with a raw accuracy of ± 50 ppm (SenseAir, 2019), achieves an observation accuracy of 8–12 ppm through initial laboratory chamber correction and regular drift calibration via ambient co-location with nearby reference instruments.

China aims at peaking carbon emissions before 2030 and achieving carbon neutrality before 2060 (the Dual Carbon Goals, DCGs) (He et al., 2020; Huang et al., 2023; Zeng et al., 2022). To support China's DCGs and address the high spatial variability in CO2 concentration in urban areas, the Institute of Atmospheric Physics, Chinese Academy of Sciences (IAP), has established a network of 134 sites using SenseAir K30 sensor since 2017 (Han et al., 2024). This study presents the correction methods developed for the SenseAir K30 sensors and evaluates the accuracy achievable through the environmental dependence correction method based on laboratory simulation. At a field observation site in Beijing, environmentally corrected low-cost sensors (LCSs) were co-located with high-precision Picarro instruments for up to 30 months. By comparing the data with measurements from Picarro instruments, we gained more profound insights into the long-term performance and long-term drift characteristics of the LCS, as well as assessed the effectiveness of our long-term drift correction method. Our findings demonstrate that timely long-term drift correction significantly improved the accuracy of urban CO2 monitoring networks based on LCS and reduced time, labor, and money investment. This research provides valuable evidence for optimizing the deployment and maintenance of LCS-based monitoring networks in urban environments.

2 The application of SenseAir K30 sensors for urban CO2 monitoring

A multivariate linear regression analysis was used for environmental correction, which can improve the accuracy of the SenseAir K30 sensor from its initial specification of ± 30 ppm ± 3 % of reading (SenseAir: K30 products sheets, 2022) to a range of 1.7–4.3 ppm (Martin et al., 2017). The environmentally corrected K30 sensor demonstrated reliability and consistency when compared to higher-precision instruments and standard gas under a controlled indoor environment with a root mean square error (RMSE) ranging from 1 to 3 ppm on a monthly scale (Cai et al., 2024). Furthermore, the low-cost sensor exhibited highly consistent with Picarro system during on-road observations conducted using the same vehicle with an RMSE of 3.6 ppm (Liu et al., 2021). In a study by Bao et al. (2020), the sensor was utilized to measure the CO2 vertical profile in the lower troposphere in Hebei Province, China, showing good consistency with traditional gas chromatography measurements (Bao et al., 2020). Additionally, Cai et al. (2024) applied the low-cost sensor in an industrial park, revealing a CO2 concentration enhancement of 5–28 ppm within the park compared to a reference site (Cai et al., 2025a).

The Beijing–Tianjin–Hebei (or Jing–Jin–Ji, JJJ) network, deployed with low-cost sensors, has provided valuable insights into seasonal variations, urban–rural differences, and the homology of CO2 and PM2.5 (Han et al., 2024). The low-cost sensors have also been proven effective in detecting signals related to COVID-19. Continued CO2 measurements in Beijing showed a 15 ppm reduction during the 2020 lockdown period compared to the before and after periods. Similarly, regular on-road CO2 observation in Beijing before, during, and after COVID-19 lockdown showed a 40–60 ppm decrease during COVID-19 lockdown period (Liu et al., 2021). These applications demonstrate the versatility and reliability of low-cost sensors in capturing both environmental and anthropogenic influences on atmospheric CO2 concentrations.

3 Instrument design and correction methods of SENSE-IAP

The SENSE-IAP instrument integrates three K30 sensors alongside a Bosch BME680 (BME) sensor (Bao et al., 2020; Liu et al., 2021), all collected by an updated version of BeagleBone Green Wireless (BBGW). The standard version of SENSE-IAP instrument also includes a Figaro TGS 2611 sensor for CH4 detection and a Plantower PMSA003 for PM2.5 measurements. These components are compactly integrated onto a single circuit board and housed within a weatherproof enclosure, as illustrated in Fig. 1.

The BME sensor is positioned close to the K30s to simultaneously monitor the temperature (T in °), relative humidity (RH in %), and pressure (P in hPa) of the air mass inside the instrument. This design ensures real-time correction of the sensor response values, accounting for dynamically changing external environmental conditions and enhancing measurement accuracy.

https://amt.copernicus.org/articles/18/4871/2025/amt-18-4871-2025-f01

Figure 1The layout of sensors in the standard version of SENSE-IAP instrument.

Download

To improve the observing accuracy, we developed a CO2 calibration system incorporating both controlled environmental experiments and calibration software (Bao et al., 2020; Han et al., 2024; Liu et al., 2021). The system follows these main steps: (1) converting raw electrical signals to CO2 values within atmospheric observation ranges; (2) resampling data from 2 s to 1 min to reduce white noise; (3) applying environmental corrections of temperature, humidity, and air pressure during span calibration; and (4) performing calibration using standard gas traceable to the WMO X2007 scale (identical to the standard used in high-precision systems like Picarro instruments) to remove system bias. The controlled environmental experiments typically require 1 week of co-location with a reference instrument to determine environmental correction coefficients followed by another week of co-location for post-quality check. To fully characterize long-term drift patterns, we recommended maintaining observations for at least 1 year to capture seasonal variations (including both summer and winter cycles).

The raw signals from all sensors were collected at a frequency of 2 s with a standard deviation of approximately ± 4 ppm. Figure 2 shows the experimental results of continuously introducing standard gas over 25 h to evaluate the instrument's noise characteristics. As shown in Fig. 2, the Allan deviation (in ppm) decreases with increasing integration time. At a 2 s measurement interval, the noise level is 4 ppm, which decreases to approximately 0.2 ppm for integration times ranging from 2 min to 1 h. However, the Allan deviation increases after 1 h of integration time, indicating the presence of drift contributions.

https://amt.copernicus.org/articles/18/4871/2025/amt-18-4871-2025-f02

Figure 2(a) The raw signals measured continuously over 25 h from 03:00 UTC on 15 October to 03:00 UTC on 16 October 2022. (b) Allan deviation log plots.

Download

In addition to white noise, the raw signals often contain outliers that must be removed through quality control. According to Eq. (1), following the 3σ principle, the original data points xi collected during the sampling period are treated as samples. The average value x of non-missing xi is calculated, and the standard deviation (SD) of the xi is σ. If |xi-x|>4σ, the data point is identified as an outlier and removed. We adopted the 4σ threshold to strike a balance between effectively removing outliers and preserving the natural ranges of variability in the data.

(1) σ = i = 1 n x i - x 2 n - 1

Subsequently, the raw signals are averaged from a 2 s interval to a 1 min interval (resample) to reduce standard deviation. A 1 min integration time was chosen as an optimal trade-off between noise reduction and maintaining sufficient time resolution to track natural variations in CO2 concentrations accurately. As in Eq. (2), the resampled value Yi for each minute is calculated as the average of all non-outlier values xj within that minute.

(2) Y i = j = 1 m x j m
https://amt.copernicus.org/articles/18/4871/2025/amt-18-4871-2025-f03

Figure 3(a) Compared CO2 at each processing step. (b) Synchronized monitoring of T and RH. (c–f) The scatter plots of CO2 measurements form LCS and Picarro instrument at different processing steps, including (c) the raw signal in 2 s resolution, (d) the values after noise reduction at a 1 min resolution, (e) the CO2 concentrations after environment corrections, and (f) systematic bias correction.

Download

The correction system we developed substantially improves the accuracy of CO2 measurements through a comprehensive process that includes outlier removal and noise reduction. Figure 3 shows the main steps of the correction system, with the data cleaning method described earlier constituting the initial two steps. As shown in Fig. 3a, the raw signal (blue) undergoes de-specking and denoising (red). However, a noticeable deviation remains between the LCS measurements and the true value, with a correlation coefficient (r value) of approximately 0.6 (Fig. 3c–d). The differences mainly come from environmental sensitivity and baseline deviations in concentration.

Similarly to the correction standards used in high-precision systems, our LCS units are calibrated using standard gas traceable to the WMO X2007 scale. This calibration adjusts the span and calibrates system bias before deployment. For the typical CO2 concentration range (400–700 ppm), a concentration-dependent offset (ΔC) exists between the time-averaged LCS measurements and the standard gas concentration. Since this concentration dependence varies for each K30 sensor, laboratory corrections are essential for accuracy improvement. Our concentration correction process includes multiple concentration gradients, ensuring applicability to real-world monitoring scenarios. The fitting parameters of ΔC against the measured values are determined through regression analysis, enabling precise correction across the operational range.

(3) Δ C = Y m - y 0 ,

where y0 represents the concentration of standard gas or the high-precision reference instrument (Picarro G2301); Ym is the minute-averaged values from LCSs.

https://amt.copernicus.org/articles/18/4871/2025/amt-18-4871-2025-f04

Figure 4(a) Map of the location of Beijing-IAP in Beijing. (b) Map showing the location of Beijing-IAP within Beijing's main urban area. (c) The diagram of gas flow design for the synchronous observation system. (d) The photographs of the instrument installation setup. The source for the basemap used in panels (a)(b) is ESRI (https://server.arcgisonline.com/arcgis/rest/services/World_Topo_Map/MapServer, last access: 16 June 2025).

Environmental calibration was conducted in an environmental controlled chamber. The LCS environmental sensitivity corrections included T compensation (10–50 °C in five steps), RH compensation (10 %–90 % in nine steps), and P compensation. This correction is based on sensitivity testing conducted in the laboratory (Martin et al., 2017), with comparisons made against Picarro. Each sensor is assigned unique sensitivity parameters through multivariate regression and iteration analysis.

(4) Δ C = f a T , Y T + f a H , Y H + f a P , Y P + f a C , Y C + ε ,

where the baseline correction coefficient is ε; YT, YH, YP, and YC represent the compensation values for the T, RH, P, and concentration sensitivity, respectively, applied to the minute-averaged CO2 measurement; aT, aH, and aP are the regression coefficients against T, RH, and P, respectively; and aC is the span correction coefficient against concentration.

Thus, the corrected CO2 can be expressed as

(5) C = Y m - ( f a T , Y T + f a P , Y P + f a H , Y H + f a C , Y C + ε ) .

The r values between the CO2 corrected in the final two steps and the Picarro measurements are close to 1 (Fig. 3e–f). Additionally, the difference between the environmental correction (pink) and baseline calibration (green) in Fig. 3a represents the coefficient ε. After applying these correction steps, the accuracy of the LCS measurements improved to 1–4 ppm compared to Picarro (Liu et al., 2021).

Before deployment, the span and system bias of the LCSs were calibrated. However, once deployed to field stations, LCSs tended to drift on a weekly to monthly scale, necessitating time-dependent drift calibration. We defined Scor as the starting time of drift and Ecor as the time when the drift slope stabilized or when calibration was required, with Δ=Ecor-Scor. ΔCdrift represents the bias between the concentration C measured by the instrument and the standard concentration C0 at Ecor. Using the Eq. (6), the drift rate over time (ppm min−1) at Ecor is calculated (st). The bcal is a constant deviation, representing the difference between the baseline and the standard value before long-term drift occurs (at Scor). This value is generally considered to be zero since system bias has been calibrated before departure. The error at any time between Scor toEcor can be calibrated to CCordrift using the following formulas:

(6)ΔCdrift=stΔt+bcal,(7)CCordrift=C-ΔCdrift.

This integrated instrument with environmental correction and drift correction is named SENSE-IAP.

4 Co-located observation system

Our experiment has been conducted since July 2022 at IAP (Beijing-IAP site, Fig. 4a and b). Located in a central urban area with high population density, the Beijing-IAP site is significantly influenced by traffic emissions.

To evaluate the performance of the LCS, we developed a synchronous observation system that compares the LCS with high-precision instruments. This system includes two SENSE-IAP units (numbered pi688 and pi736), each equipped with three K30 sensors. A cavity ring-down spectrometer (Picarro G2301) was used as the high-precision instrument for CO2 measurements (Picarro, 2023). The precision and accuracy of the Picarro instrument are better than 0.1 ppm (Yang et al., 2021). At Beijing-IAP, the Picarro analyzer was calibrated monthly using high-pressure standard gases provided by the Meteorological Observation Center of the China Meteorological Administration (MOC/CMA), which are traceable to the World Meteorological Organization (WMO) X2007 scale.

To ensure long-term synchronous observation between the LCSs and Picarro, deployment enables two sets of instruments to measure the same gas mass. This ensures that any differences in observed values only come from the effects of T, RH, and P, as well as the concentration span, all of which can be adjusted through correction methods. The deployment setup is shown in Fig. 4b–c.

https://amt.copernicus.org/articles/18/4871/2025/amt-18-4871-2025-f05

Figure 5Comparison between hourly CO2 concentrations measured by the SENSE-IAP and Picarro systems at the Beijing site from 13 to 27 July 2022 (a–c, g) and 10 to 24 January 2023 (d–f, h). We compared both the raw CO2 data from SenseAir (blue) and the environmentally corrected data from SENSE-IAP (red) with that from Picarro (black). The T and RH were measured for the environment inside the instrument.

Download

The instruments were mounted on the edge of an open window to directly measure the outdoor air and environmental changes, which were almost the same as field-deployed conditions (Figs. 4 and S8 in the Supplement). Temperature primarily ranged from 0–40° and RH typically between 0 % and 60 %. A partition isolated the LCSs from indoor spaces, ensuring measurements primarily reflected outdoor environmental conditions. Three K30 sensors and one BME sensor were housed in a transparent cover featuring two 2 mm diameter ventilation holes in the upper-left and lower-left corners (red circles in Fig. 4c and d). Air passively diffuses into the pi688 enclosure, while a 4 mm diameter blue tube connected to the cover's left side enabled active air sampling via an air pump (GAST DOA-P504-BN). Upstream of the pump, the airflow passed through a capsule-type 10 µm filter (COBETTER 92WM-LPF1000) for particulate matter removal (Fig. 4c). Downstream of the pump, a Nafion drying tube is installed for Picarro analysis (bypassed for SENSE-IAP measurements) (Fig. 4c). The filtered air is then divided by a four-way valve, delivering 3–5 L min−1 to the pi736 and 0.3–0.4 L min−1 to the Picarro analyzer, with excess air vented through the final outlet.

https://amt.copernicus.org/articles/18/4871/2025/amt-18-4871-2025-f06

Figure 6(a) The time series of daily RMSEs for hourly CO2 concentration relative to Picarro (purple points) from June 2022 to December 2024, with a monthly rolling mean (blue line and shadow). (b) Box plot of the daily RMSEs of all six sensors across different seasons. Sensors from the same instrument are represented in the same colors (spring: dark pink/pink, summer: dark green/green, autumn: orange/yellow, and winter: dark blue/blue). Within each box, the red line indicates the mean value, while the blue dashed line represents the median values.

Download

5 Environmental corrections for field measurements

To distinguish between the effects of short-term environmental factors and long-term drifts, we present these corrections separately. To focus specifically on the environmental corrections, all results in this section were obtained after removing long-term drift. A comprehensive discussion of long-term drift is provided in Sect. 5. Figure 5 shows the results from environment-corrected SENSE-IAP at the Beijing- IAP site compared with those from the Picarro system. After approximately 2 weeks of data collection during both summer and winter, the SENSE-IAP showed highly consistent results with Picarro, with RMSEs of 1.6 ppm in summer and 1.8 ppm in winter. In contrast, the raw CO2 concentration data from the SenseAir showed a higher RMSE of 6.2 ppm in summer and 7.0 ppm in winter. Furthermore, the effectiveness of environmental correction is evident across different seasons.

The correction system effectively adjusted the CO2 concentration within the 400–700 ppm measurement range. As shown in Fig. 5b, during the period from 16 to 19 July, even when the ambient CO2 concentration experienced significant fluctuations, the instrument showed high consistency with Picarro. The T and RH detected by the BME sensor were used to monitor the instrument's internal environment. Notably, in winter mornings, sunrise caused a significant temperature increase due to the presence of metal components on the circuit board (Fig. 5f). Our environmental correction successfully corrected for the temperature dependence, as the deviation between the SenseAir and Picarro showed a strongly correlated with temperature in both seasons (Fig. S1a). Additionally, the deviation of SenseAir relative to Picarro was significantly associated with RH in summer (Fig. S1b), and our correction system incorporated humidity, which was related to ambient temperature. Compared to the raw SenseAir data, the consistency of all six sensors improved markedly, with the RMSE decreasing from 5.0 ± 1.0 to 1.3 ± 0.2 ppm in summer and from 6.8 ± 0.8 to 2.0 ± 0.4 ppm in winter (Fig. S2).

Table 2The long-term drift trend of six sensors (unit: ppm per month).

Download Print Version | Download XLSX

https://amt.copernicus.org/articles/18/4871/2025/amt-18-4871-2025-f07

Figure 7Time series of ΔCO2 for the six sensors at the Beijing-IAP from June 2022 to December 2024 (black), with the monthly rolling mean (yellow line). The gray shadow represents the 1-month time range used to evaluate the RMSE and bias for each sensor after half a year, 1 year, and 2 years of deployment.

Download

Our correction system can perform environmental sensitivity analysis and correction on individual sensors, with the correction efficacy remaining robust across seasonal environmental changes. We further analyzed the daily RMSE of the SENSE-IAP relative to Picarro during 30 months of co-located observation. As shown in Fig. 6a, the daily RMSE of one sensor (pi688-K30) ranged from 1.5 to 4.0 ppm throughout the observation period, with the light blue shadow representing the monthly mean ± standard deviation. The average and median of daily RMSEs for this sensor were less than 2.0 ppm (Fig. 6b), except for an increase to higher than 2 ppm in summer (dark green). Compared to Picarro, the consistency of SENSE-IAP was better in winter, with an RMSE below 2.0 ppm on most days. Except for a few sensors exhibiting slightly higher RMSEs (approximately 3.0 ppm) in spring and autumn (pi732-K30), the daily RMSEs of the six sensors showed no significant seasonal variation (Fig. 6b).

6 Performance of typical long-term drift and correction method

After environmental correction, the six SENSE-IAP sensors were co-located with a Picarro analyzer synchronously for over 30 months. As shown in Fig. 7, two types of long-term drift were identified: (1) a downward drift trend and (2) a seasonal drift cycle. While the environmental correction system effectively corrects the impact of diurnal environmental changes (Fig. 6), significant errors occurred in sensors due to the two types of long-term drift. Without the long-term drift calibration algorithm, the bias of SENSE-IAP could reach 27.9 ppm with an RMSE of approximately 28.1 ppm (Fig. 7).

Table 3Evaluation parameters* of the CO2 concentration measured by K30 sensors compared to those from Picarro, including SenseAir-Corrected values, Raw signal, and SENSE-IAP at the Beijing-IAP from June 2022 to December 2024 (unit: ppm).

* According to the statistical results of 24 h running means.

Download Print Version | Download XLSX

https://amt.copernicus.org/articles/18/4871/2025/amt-18-4871-2025-f08

Figure 8(a) Comparison of hourly CO2 concentrations measured by six sensors and Picarro at Beijing-IAP from June 2022 to December 2024. (b) The time series of ΔCO2. (c) Scatter plot of SENSE-IAP and Picarro. (d) Histogram plot of the ΔCO2.

Download

During the observation period, all six sensors exhibited long-term downward drifts ranging from 0.1 to 1.2 ppm per month (Fig. 7 and Table 2). Among the six sensors deployed in this study, only pi736 K30_3 showed a drift trend of less than 0.1 ppm per month (Table 2). For sensors such as pi688-K30 and pi688-K30_2, the ΔCO2 displayed a continuous downward trend over the 30 months, with slopes of 1.2 and 1.0 ppm per month, respectively (Table 2). Notably, the drift trends for these sensors did not show significant stabilization over time.

In addition to the downward drift trend, sensors like pi688-K30_3 and pi736-K30 exhibited varying seasonal cycle trends. After 6 months of deployment, these sensors showed RMSEs of 25.3 and 24.8, respectively (Fig. 7). However, after more than 1 year of observation, the impact of seasonal drift decreased, and the errors caused by long-term drift were highlighted, resulting in reduced RMSEs as 17.3 and 10 ppm, respectively. The RMSE evaluation was conducted during the identical period employed for drift correction.

From the perspective of drift magnitude, a significant bias of 5 ppm (approximately 1 % of the ambient CO2 concentration) typically occurred within 3–10 months after calibration, with most cases observed within 5 months. The seasonal drift cycle occurred on a 6-month scale, with maximum errors typically occurring in winter and summer. Therefore, we recommend that the long-term drift calibration frequency of SENSE-IAP should be no less than 3 months and no longer than 6 months. In addition, drift calibration should be performed at least once during both winter and summer seasons. If the target monitoring accuracy is within 3 ppm, the drift calibration frequency should be at least every 2 months, as a 3 ppm bias typically develops within 2–5 months.

The method for the long-term drift calibration is as follows. According to functions 6–7, we illustrate our long-term drift calibration method by focusing on the first year of observations. At the start of the observation period (June 2022), we adjusted the baseline of the six sensors. We designated the initial calibration time point as Scor for the first observation period (June 2022 to January 2023). The inflection point of the drift trend in February 2023 was identified as Ecor for the first period and as Scor for the second period of the observation (January to September 2023). The Ecor for the second period was set to September 2023 in this study. We applied the linear calibration method between these two time points for both periods of the drift trend. After calibration, the CO2 concentrations from the six sensors showed strong consistency with the Picarro, with an RMSE ranging from 2.4–3.0 ppm (Fig. 8).

It should be noted that due to the loose connection of ventilation pipeline between July 2023 and March 2024, the CO2 concentration measured by pi736 and Picarro were not strictly synchronized. This issue led to a relatively lower short-term monitoring accuracy during this period, primarily due to the lag effect caused by air diffusion. To assess the actual hourly monitoring accuracy of SENSE-IAP, we excluded data from this period. Without this exclusion, the RMSE for pi736 would have been 3.9–4.5 ppm (Fig. S3). However, to ensure the completeness and robustness of the long-term drift analysis, we retained samples from this period, allowing for a more comprehensive evaluation of drift trends over time.

7 Comparison of data quality across multiple levels over a long-term scale

As previously mentioned, the correction system effectively corrected the sensors' short-term environmental dependence. By applying a linear calibration method at least every 3 months, we can calibrate the time-dependent drift of LCSs and resolve residual seasonal cycles that the environmental correction system cannot fully resolve. This seasonal variation is prominently reflected in the original electrical signal (raw signal). As shown in Fig. S4b, we eliminated the influence of short-term environment changes through a 24 h running mean, revealing that the seasonal drift cycle in the raw signal can reach up to 100 ppm with an RMSE of 38.2 and a bias of −21.1 ppm (Table 3).

The defining characteristic of medium-precision low-cost sensors is the presence of long-term drift. This drift, which exhibits a downward trend, is observed in all sensors, with variations only in the drift rate (Fig S5). Long-term drifts resulted in an RMSE of 15.8 and a bias of −12.0 for the LCS, respectively (Table 3). Although seasonal cycle does not significantly alter the overall trend of long-term drift, seasonal environmental variations can introduce errors of up to 25 ppm if baseline calibration are performed only annually (Fig. 7). Therefore, we recommend that baseline calibration be conducted at least every 6 months, ideally during both winter and summer.

The data provided by the SenseAir manufacturer were corrected for temperature sensitivity using the default temperature parameters. The long-term drift was calibrated using a so-called ABS algorithm, which employs periodic one-point calibration in SenseAir, assuming that the minimum value of CO2 concentration is 400 ppm in fresh air (SenseAir-Corrected data). However, due to the constant assumption of fresh air concentration, coupled with the carbon absorption of vegetation in summer and the higher emissions in winter, the SenseAir-Corrected data exhibit a fluctuating trend of approximately 20 ppm, with higher values in summer and lower values in winter. Consequently, the bias of SenseAir-Corrected data can be −3.3± 1.4 ppm with an RMSE of 12.1 ± 2.0 ppm (Fig. S6, Table 3). In contrast, the drift-calibrated SENSE-IAP data demonstrate a much smaller bias (0.8 ± 0.4 ppm) and an 80 % improvement in accuracy with an RMSE of 2.4 ± 0.2 ppm (Fig S7, Table 3).

8 Seasonal drift cycle effects on SENSE-IAP

As shown in Fig. S8, the seasonal variations observed before instrument linear calibration were correlated with T, RH, and P. The ΔCO2 between pi736-K30 and Picarro significantly correlated with all three environmental factors, with r values of −0.58, −0.46, and 0.33 against T, RH, and P, respectively. In contrast, the relationship between pi688-K30_3 and environmental factors was opposite to that of pi736-K30, with r values of 0.33, 0.5, and −0.6 against T, RH, and P, respectively. Considering seasonal phase differences between CO2 concentration changes and environmental factors, this seasonal deviation was likely attributable to not only insufficient environmental compensation but also the influence of seasonal effects on the instrument's physical properties. For instance, changes in the sensor's optical cavity size caused by thermal expansion and cold contraction can change the optical path lengths, subsequently affecting the pressure within the optical cavity and the strength of infrared CO2 absorption (Yao et al., 2023). However, this hypothesis cannot fully explain why the two sensors, pi688 K30_3 and pi736 K30, exhibited opposite drift directions during the same season.

Long-term drift is typically observed in low- and mid-cost NDIR CO2 sensors. Although our findings provide valuable references data specifically for K30 sensors, the recommended calibration frequency and observed seasonal cycle characteristics may also apply to other similar NDIR sensors. The calibration method we employed is universally applicable and effective for long-term drift corrections. For other NDIR sensors, we recommend selecting several representative samples for co-location with a high-precision instrument under field conditions for a minimum of 1 year to fully characterize their performance. This extended co-location period is crucial for comprehensively for evaluating both long-term drifts trends and seasonal drift patterns. Such characterization studies provide essential guidance for implementing remote calibration in high-density sensors networks. Alternatively, standard gas calibration serves as a practical substitute method. Based on our experimental results, we recommend a calibration frequency of at least once every 1 to 3 months for this approach.

9 Conclusions

We evaluated low-cost NDIR CO2 sensors using Picarro as a reference instrument. Our environmental correction system effectively corrected the impact of short-term daily environmental changes by assigning unique environmental sensitivity parameters to each sensor. This approach reduces the short-term RMSE from 5.9 ± 1.2 ppm for SenseAir to 1.6 ± 0.5 ppm for SENSE-IAP. The correction system demonstrates robustness against seasonal environmental variations, maintaining a daily RMSE of 1–3 ppm.

Based on a 30-month observation, we recommend that the calibration frequency for long-term drifts do not exceed 6 months. For optimal performance and to ensure the target monitoring accuracy remains within 1 % of the ambient CO2 concentration, a 3-month calibration interval is recommended. If standard instruments, standard gases (which are generally easier to obtain) or other reliable concentration references such as model simulations are available, long-term drift can be linearly corrected at the seasonal scale.

Consequently, after deployment, even with significant environmental changes around the instrument, there is no need to frequently bring the instruments back to the laboratory for re-correction of environmental impacts. After the long-term drift calibration, the RMSE of SENSE-IAP remains 2.4 ± 0.2 ppm even after 30 months of operation. This performance enables long-term deployment of the instruments, significantly reducing the maintenance costs associated with LCS.

Data availability

The data used to generate the figures in this paper are available at https://doi.org/10.6084/m9.figshare.29310890.v3 (Cai et al., 2025b).

Supplement

The supplement related to this article is available online at https://doi.org/10.5194/amt-18-4871-2025-supplement.

Author contributions

NZ, PFH, and QXC conceived and designed the study. QXC and PFH collected and analyzed the datasets. XYY, CX, and ZJW discussed the sensor results. QXC led the writing of the paper with contributions from all the coauthors. All co-authors contributed to the descriptions in and discussions of the manuscript.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Special issue statement

This article is part of the special issue “Greenhouse gas monitoring in the Asia–Pacific region (ACP/AMT/GMD inter-journal SI)”. It is a result of the 4th China Greenhouse Gas Monitoring Symposium, Nanjing, China, 2–3 November 2024.

Acknowledgements

We thank Cory Martin, Di Liu, Yinan Wang, Ming Cui, Jin Guo, Zhimin Zhang, and Yang Zi for their help in the SENSE-IAP instrument development, calibration, and deployments.

Financial support

This research was supported by the National Key R&D Program of China (grant nos. 2023YFC3705500 and 2017YFB0504000), Jinan Carbon Monitoring and Evaluation Pilot Project (grant no. SDGP370100000202202001740), Qiluzhongke Institute of Carbon Neutrality Program of Jinan Dual Carbon Simulator, and CAS Proof of Concept Program: Carbon Neutrality-oriented Urban Carbon Monitoring System and Its Industrialization (grant no. CAS-GNYZ-2022).

Review statement

This paper was edited by Andre Butz and reviewed by two anonymous referees.

References

Asimow, N. G., Turner, A. J., and Cohen, R. C.: Sustained Reductions of Bay Area CO2 Emissions 2018–2022, Environ. Sci. Technol., 58, 6586–6594, https://doi.org/10.1021/acs.est.3c09642, 2024. 

Bao, Z., Han, P., Zeng, N., Liu, D., Cai, Q., Wang, Y., Tang, G., Zheng, K., and Yao, B.: Observation and modeling of vertical carbon dioxide distribution in a heavily polluted suburban environment, Atmospheric Ocean. Sci. Lett., 13, 371–379, https://doi.org/10.1080/16742834.2020.1746627, 2020. 

Bigi, A., Mueller, M., Grange, S. K., Ghermandi, G., and Hueglin, C.: Performance of NO, NO2 low cost sensors and three calibration approaches within a real world application, Atmos. Meas. Tech., 11, 3717–3735, https://doi.org/10.5194/amt-11-3717-2018, 2018. 

Cai, Q., Han, P., Pan, G., Xu, C., Yang, X., Xu, H., Ruan, D., and Zeng, N.: Evaluation of Low-Cost CO2 Sensors Using Reference Instruments and Standard Gases for Indoor Use, Sensors, 24, 2680, https://doi.org/10.3390/s24092680, 2024. 

Cai, Q., Wang, Z., Han, P., Zeng, N., Nie, X., Yang, X., Wang, Z., and Gao, S.: The Enhancement and Variation in the Carbon Dioxide Concentration in a Typical Industrial Park in the Northern China, Aerosol Sci. Eng., 2025, 1–13, https://doi.org/10.1007/s41810-025-00286-4, 2025a. 

Cai, Q., Han, P., and Zeng, N.: CO2_monitoring_data, figshare [data set], https://doi.org/10.6084/m9.figshare.29310890.v3, 2025b. 

Delaria, E. R., Kim, J., Fitzmaurice, H. L., Newman, C., Wooldridge, P. J., Worthington, K., and Cohen, R. C.: The Berkeley Environmental Air-quality and CO2 Network: field calibrations of sensor temperature dependence and assessment of network scale CO2 accuracy, Atmos. Meas. Tech., 14, 5487–5500, https://doi.org/10.5194/amt-14-5487-2021, 2021. 

Grange, S. K., Rubli, P., Fischer, A., Brunner, D., Hueglin, C., and Emmenegger, L.: The ZiCOS-M CO2 sensor network: measurement performance and CO2 variability across Zurich, Atmos. Chem. Phys., 25, 2781–2806, https://doi.org/10.5194/acp-25-2781-2025, 2025. 

Grimmond, C. S. B., King, T. S., Cropley, F. D., Nowak, D. J., and Souch, C.: Local-scale fluxes of carbon dioxide in urban environments: methodological challenges and results from Chicago, Environ. Pollut., 116, S243–S254, https://doi.org/10.1016/S0269-7491(01)00256-1, 2002. 

Gurney, K. R., Razlivanov, I., Song, Y., Zhou, Y., Benes, B., and Abdul-Massih, M.: Quantification of Fossil Fuel CO2 Emissions on the Building/Street Scale for a Large U.S. City, Environ. Sci. Technol., 46, 12194–12202, https://doi.org/10.1021/es3011282, 2012. 

Hagan, D. H., Isaacman-VanWertz, G., Franklin, J. P., Wallace, L. M. M., Kocar, B. D., Heald, C. L., and Kroll, J. H.: Calibration and assessment of electrochemical air quality sensors by co-location with regulatory-grade instruments, Atmos. Meas. Tech., 11, 315–328, https://doi.org/10.5194/amt-11-315-2018, 2018. 

Han, P., Yao, B., Cai, Q., Chen, H., Sun, W., Liang, M., Zhang, X., Zhao, M., Martin, C., Liu, Z., Ye, H., Wang, P., Li, Y., and Zeng, N.: Support Carbon Neutral Goal with a High-Resolution Carbon Monitoring System in Beijing, B. Am. Meteor. Soc., 105, E2461–E2481, https://doi.org/10.1175/BAMS-D-23-0025.1, 2024. 

Han, P., Felix, V., Dominik, B., Tyler, B., Qixiang, C., Zhiqiang, L., Ning, Z., and Olivier, L.: Chapter 18: Dense Networks Of Mid-Low Cost CO2 Sensors, in IG3IS Urban Greenhouse Gas Emission Observation and Monitoring Good Research Practice Guidelines, WMO GAW Report 2025, https://www.dropbox.com/scl/fi/vuk8ad8bdui8ibekefuob/IG3ISUrbanGuidelines_FinalDraft_May2025.docx?rlkey=zmu8nwaj6dgenssuxct9jihqv&st=t7wtafz0&dl=0 (last access: 16 June 2025), 2025. 

He, J., Li, Z., Zhang, X., Wang, H., Dong, W., Chang, S., Ou, X., Guo, S., Tian, Z., Gu, A., Teng, F., Yang, X., Chen, S., Yao, M., Yuan, Z., Zhou, L., and Zhao, X.: Comprehensive report on China's Long-Term Low-Carbon Development Strategies and Pathways, Chin. J. Popul. Resour. Environ., 18, 263–295, https://doi.org/10.1016/j.cjpre.2021.04.004, 2020. 

Huang, Y., Liu, G., Bo, Y., Wang, J., Cao, M., Lu, X., and He, K.: Beijing-Tianjin-Hebei Coordinated Development toward the Carbon Peaking and Carbon Neutrality Goals, Strateg. Study CAE, 25, 160–172, 2023. 

Kellett, R., Christen, A., Coops, N. C., van der Laan, M., Crawford, B., Tooke, T. R., and Olchovski, I.: A systems approach to carbon cycling and emissions modeling at an urban neighborhood scale, Landscape Urban Plan., 110, 48–58, https://doi.org/10.1016/j.landurbplan.2012.10.002, 2013. 

Lateb, M., Meroney, R. N., Yataghene, M., Fellouah, H., Saleh, F., and Boufadel, M. C.: On the use of numerical modelling for near-field pollutant dispersion in urban environments–A review, Environ. Pollut., 208, 271–283, 2016. 

Lian, J., Laurent, O., Chariot, M., Lienhardt, L., Ramonet, M., Utard, H., Lauvaux, T., Bréon, F.-M., Broquet, G., Cucchi, K., Millair, L., and Ciais, P.: Development and deployment of a mid-cost CO2 sensor monitoring network to support atmospheric inverse modeling for quantifying urban CO2 emissions in Paris, Atmos. Meas. Tech., 17, 5821–5839, https://doi.org/10.5194/amt-17-5821-2024, 2024. 

Liu, D., Sun, W., Zeng, N., Han, P., Yao, B., Liu, Z., Wang, P., Zheng, K., Mei, H., and Cai, Q.: Observed decreases in on-road CO2 concentrations in Beijing during COVID-19 restrictions, Atmos. Chem. Phys., 21, 4599–4614, https://doi.org/10.5194/acp-21-4599-2021, 2021. 

Lopez-Coto, I., Ghosh, S., Prasad, K. R., and Whetstone, J.: Tower-based greenhouse gas measurement network design–The National Institute of Standards and Technology North East Corridor Testbed, Adv. Atmos. Sci., 34, 1095–1105, 2017. 

Martin, C. R., Zeng, N., Karion, A., Dickerson, R. R., Ren, X., Turpie, B. N., and Weber, K. J.: Evaluation and environmental correction of ambient CO2 measurements from a low-cost NDIR sensor, Atmos. Meas. Tech., 10, 2383–2395, https://doi.org/10.5194/amt-10-2383-2017, 2017. 

Miles, N. L., Davis, K. J., Richardson, S. J., Lauvaux, T., Martins, D. K., Deng, A. J., Balashov, N., Gurney, K. R., Liang, J., Roest, G., Wang, J. A., and Turnbull, J. C.: The influence of near-field fluxes on seasonal carbon dioxide enhancements: results from the Indianapolis Flux Experiment (INFLUX), Carbon Balance Manag., 16, 4, https://doi.org/10.1186/s13021-020-00166-z, 2021. 

Müller, M., Graf, P., Meyer, J., Pentina, A., Brunner, D., Perez-Cruz, F., Hüglin, C., and Emmenegger, L.: Integration and calibration of non-dispersive infrared (NDIR) CO2 low-cost sensors and their operation in a sensor network covering Switzerland, Atmos. Meas. Tech., 13, 3815–3834, https://doi.org/10.5194/amt-13-3815-2020, 2020. 

Picarro: G2301 Analyzer Datasheet, https://www.picarro.com/sites/default/files/product_documents/Picarro_G2301 Datasheet_230306.pdf (last access: 20 May 2025), 2023. 

SenseAir: LP8 Product Sheet, https://rmtplusstoragesenseair.blob.core.windows.net/docs/Dev/publicerat/PSH1233.pdf, last access: 6 March 2019. 

SenseAir: K30 products sheets, https://rmtplusstoragesenseair.blob.core.windows.net/docs/publicerat/PSP110.pdf, last access: 6 March 2022. 

Shusterman, A. A., Teige, V. E., Turner, A. J., Newman, C., Kim, J., and Cohen, R. C.: The BErkeley Atmospheric CO2 Observation Network: initial evaluation, Atmos. Chem. Phys., 16, 13449–13463, https://doi.org/10.5194/acp-16-13449-2016, 2016.  

Shusterman, A. A., Kim, J., Lieschke, K. J., Newman, C., Wooldridge, P. J., and Cohen, R. C.: Observing local CO2 sources using low-cost, near-surface urban monitors, Atmos. Chem. Phys., 18, 13773–13785, https://doi.org/10.5194/acp-18-13773-2018, 2018. 

Turner, A. J., Shusterman, A. A., McDonald, B. C., Teige, V., Harley, R. A., and Cohen, R. C.: Network design for quantifying urban CO2 emissions: assessing trade-offs between precision and network density, Atmos. Chem. Phys., 16, 13465–13475, https://doi.org/10.5194/acp-16-13465-2016, 2016. 

Vaisala: GMP343 Carbon Dioxide Probe Datasheet, https://docs.vaisala.com/v/u/B210688EN-H/en-US, last access: 6 March 2020. 

Wada, R., Pearce, J. K., Nakayama, T., Matsumi, Y., Hiyama, T., Inoue, G., and Shibata, T.: Observation of carbon and oxygen isotopic compositions of CO2 at an urban site in Nagoya using Mid-IR laser absorption spectroscopy, Atmos. Environ., 45, 1168–1174, https://doi.org/10.1016/j.atmosenv.2010.10.015, 2011. 

Wu, L., Broquet, G., Ciais, P., Bellassen, V., Vogel, F., Chevallier, F., Xueref-Remy, I., and Wang, Y.: What would dense atmospheric observation networks bring to the quantification of city CO2 emissions?, Atmos. Chem. Phys., 16, 7743–7771, https://doi.org/10.5194/acp-16-7743-2016, 2016. 

Yang, Y., Zhou, M., Wang, T., Yao, B., Han, P., Ji, D., Zhou, W., Sun, Y., Wang, G., and Wang, P.: Spatial and temporal variations of CO2 mole fractions observed at Beijing, Xianghe, and Xinglong in North China, Atmos. Chem. Phys., 21, 11741–11757, https://doi.org/10.5194/acp-21-11741-2021, 2021. 

Yao, G., Li, Y., Shang, Q., and Fan, H.: Research on Temperature Compensation of Optical Fiber MEMS Pressure Sensor Based on Conversion Method, Photonics, 10, 1–14, https://doi.org/10.3390/photonics10010022, 2023. 

Zeng, N., Han, P., Liu, Z., Liu, D., Oda, T., Martin, C. R., Liu, Z., Yao, B., Sun, W., Wang, P. C., Cai, Q., Dickerson, R., and Maksyutov, S.: Global to local impacts on atmospheric CO2 from the COVID-19 lockdown, biosphere and weather variabilities, Environ. Res. Lett., 17, 015003, https://doi.org/10.1088/1748-9326/ac3f62, 2021. 

Zeng, N., Jiang, K., Han, P., Hausfather, Z., Cao, J., Kirk-Davidoff, D., Ali, S., and Zhou, S.: The Chinese Carbon-Neutral Goal: Challenges and Prospects, Adv. Atmos. Sci., 39, 1229–1238, 2022. 

Download
Short summary
Mid- and low-cost CO2 sensors are attractive in carbon monitoring and atmospheric inversions. They are useful in both fixed stations and mobile monitoring. Yet the performance faces great challenges due to environmental impacts and long-term drifts. Here, we conducted 30 months of co-located observations using such sensors with a reference instrument. After corrections of environmental impacts and drifts, the accuracy reached 1–3 ppm. We recommend standard gas calibration within 3–6 months.
Share