Reconstruction of high-frequency methane atmospheric concentration peaks from measurements using metal  oxide low-cost sensors

Rivera Martinez, Rodrigo Andres; Santaren, Diego; Laurent, Olivier; Broquet, Gregoire; Cropley, Ford; Mallet, Cécile; Ramonet, Michel; Shah, Adil; Rivier, Leonard; Bouchet, Caroline; Juery, Catherine; Duclaux, Olivier; Ciais, Philippe

doi:https://doi.org/10.5194/amt-16-2209-2023

Articles | Volume 16, issue 8

https://doi.org/10.5194/amt-16-2209-2023

Articles | Volume 16, issue 8

Research article

25 Apr 2023

Research article |

| 25 Apr 2023

Reconstruction of high-frequency methane atmospheric concentration peaks from measurements using metal oxide low-cost sensors

Rodrigo Andres Rivera Martinez, Diego Santaren, Olivier Laurent, Gregoire Broquet, Ford Cropley, Cécile Mallet, Michel Ramonet, Adil Shah, Leonard Rivier, Caroline Bouchet, Catherine Juery, Olivier Duclaux, and Philippe Ciais

Abstract

Detecting and quantifying CH₄ gas emissions at industrial facilities is an important goal for being able to reduce these emissions. The nature of CH₄ emissions through “leaks” is episodic and spatially variable, making their monitoring a complex task; this is partly being addressed by atmospheric surveys with various types of instruments. Continuous records are preferable to snapshot surveys for monitoring a site, and one solution would be to deploy a permanent network of sensors. Deploying such a network with research-level instruments is expensive, so low-cost and low-power sensors could be a good alternative. However, low cost usually entails lower accuracy and the existence of sensor drifts and cross-sensitivity to other gases and environmental parameters. Here we present four tests conducted with two types of Figaro^® Taguchi gas sensors (TGSs) in a laboratory experiment. The sensors were exposed to ambient air and peaks of CH₄ concentrations. We assembled four chambers, each containing one TGS sensor of each type. The first test consisted in comparing parametric and non-parametric models to reconstruct the CH₄ peak signal from observations of the voltage variations of TGS sensors. The obtained relative accuracy is better than 10 % to reconstruct the maximum amplitude of peaks (RMSE ≤2 ppm). Polynomial regression and multilayer perceptron (MLP) models gave the highest performances for one type of sensor (TGS 2611C, RMSE =0.9 ppm) and for the combination of two sensors (TGS 2611C + TGS 2611E, RMSE =0.8 ppm), with a training set size of 70 % of the total observations. In the second test, we compared the performance of the same models with a reduced training set. To reduce the size of the training set, we employed a stratification of the data into clusters of peaks that allowed us to keep the same model performances with only 25 % of the data to train the models. The third test consisted of detecting the effects of age in the sensors after 6 months of continuous measurements. We observed performance degradation through our models of between 0.6 and 0.8 ppm. In the final test, we assessed the capability of a model to be transferred between chambers in the same type of sensor and found that it is only possible to transfer models if the target range of variation of CH₄ is similar to the one on which the model was trained.

Download & links

How to cite.

Received: 29 Jun 2022 – Discussion started: 08 Jul 2022 – Revised: 08 Jan 2023 – Accepted: 14 Feb 2023 – Published: 25 Apr 2023

1 Introduction

Methane (CH₄) is a greenhouse gas 28 times more potent than carbon dioxide, considering its warming potential over 100 years (Travis et al., 2020). Anthropogenic CH₄ emissions account for 60 % of global emissions (Saunois et al., 2020). Emissions from natural-gas production account for 63 % of total emissions in the category of fossil fuel production and use (Saunois et al., 2020). Fugitive leaks of natural gas at industrial facilities also present a safety hazard. Emissions from such facilities need to be continuously monitored, due to the episodic and spatially variable nature of leaks (Coburn et al., 2018). Leaks can be detected and quantified by LDAR (Leak Detection And Repair) surveys to detect high concentrations caused by a leak. Those surveys are periodical and have limitations related to the portability of instruments or accessibility of sites. A possible solution to overcome these limitations is to deploy a network of sensors that continuously measure methane concentrations around an emitting area (Kumar et al., 2015). Deploying such a network with highly precise instruments, using techniques such as cavity ring-down spectrometry (CRDS) is, however, cost-prohibitive. Low-cost sensors such as low-power metal oxide semiconductor (MOS) sensors for methane are an alternative. Recent studies (Riddick et al., 2020; Casey et al., 2019; Collier-Oxandale et al., 2018; Jørgensen et al., 2020; Rivera Martinez et al., 2021; Eugster et al., 2020) tested the ability of MOS sensors to monitor methane concentrations in natural and controlled conditions and showed a fair agreement between the concentrations derived from the sensors and those from high-precision reference instruments. MOS sensors are composed of a semiconducting-metal-oxide-sensing element heated at a temperature between 20 and 400 ^∘C (Örnek and Karlik, 2012; Barsan et al., 2007). When the semiconducting material is in contact with an electron donor gas like CH₄, a change in the conductivity occurs, measured by an external electrical circuit (Örnek and Karlik, 2012). MOS sensors are known to be less precise than CRDS to CH₄ variations, although they can detect small variations in concentrations. Most MOS sensors have cross-sensitivities to other electron donors and to environmental variables such as absolute humidity, pressure and temperature (Popoola et al., 2018), with non-linear interactions (Rivera Martinez et al., 2021).

Biases affect CH₄ measurements derived from low-cost sensors because of cross-sensitivities to other gases, dependence on environmental factors and internal drifts, e.g., due to aging. Figaro^® Taguchi gas sensors (TGSs) are a particular series of MOS capable of measuring CH₄. In order to limit biases of these sensors, several studies proposed a calibration model against a high-precision reference instrument. Casey et al. (2019) compared different calibration approaches with inverse and direct linear models and artificial neural networks to quantify O₃ from an SGX Corporation MiCS-2611 sensor, CO from a Mocon Baseline photoionization detector (PID) sensor, CO₂ from an ELT S-100 non-dispersive infrared (NDIR) sensor and CH₄ from observations of a Figaro^® TGS 2600 sensor. Collier-Oxandale et al. (2018, 2019) applied multilinear models, including interactions from environmental variables, to predict CH₄ concentrations and to detect and quantify volatile organic compounds (VOCs) from Figaro^® TGS 2600 and TGS 2602 MOS sensors at two sites with active oil and gas operations. Eugster et al. (2020) used empirical functions and artificial neural networks (ANNs) to derive CH₄ concentrations from 6 years of data collected with Figaro^® TGS 2600 sensors at a field site in the Arctic. Riddick et al. (2020) derived nonlinear empirical relationships for Figaro^® TGS 2600 sensors from three experiments, with durations varying from 1 d to 1 month. Rivera Martinez et al. (2021) reconstructed CH₄ concentration variations in room air from Figaro^® TGS 2611-C00 sensors using ANN models and co-variations of temperature, water mole fraction and pressure. Nevertheless, those comparisons were limited by the choice of a specific reconstruction model and restricted to only one type of sensor.

There is a need for a more thorough comparison of different calibration approaches for Figaro^® MOS sensors applied to measure CH₄. In addition, there is a need to assess the performances of MOS sensors to detect and quantify CH₄ spikes typical of industrial emission. This study aims to compare several parametric (linear and polynomial) and non-parametric models (random forest, hybrid random forest and ANNs) applied to different combinations of Figaro^® TGS sensors to reconstruct the CH₄ signals of repeated atmospheric spikes, based on the observed voltage of each sensor and environmental variables such as air temperature and pressure and H₂O mole fraction. The CH₄ signal we aim to reconstruct is representative of variations observed in the atmosphere from leaks that occur within or close to an emitting industrial facility, i.e., short-duration CH₄ enhancements (spikes) lasting between 1 and 7 min and ranging from a few tenths of parts per million to a few parts per million above an atmospheric background concentration of around 2 ppm (Kumar et al., 2021). In this study, we performed a laboratory experiment where a CRDS instrument and many TGS sensors of different types were exposed to a controlled airflow with artificially created CH₄ concentration spikes (Sect. 2). The spikes were composed of pure CH₄ and did not contain any VOCs, although those species could be present in natural-gas leaks from oil and gas facilities. The main focus of this study is the behavior of TGS sensors that are exposed to enhancements of CH₄ on top of a background signal without the presence of other interfering gases. The influence of VOCs on a real deployment should be considered and included as a predictor to the reconstruction models, corrected on a preprocessing stage by determining the sensitivity of TGS to them or determining, from specific laboratory experiments, the amount of signal that models can filter out and the needs in terms of ancillary measurements. The experiment lasted 4 months and provided 838 spikes, which give us a dense and complex dataset to train and test different models for reconstructing CH₄ variations.

For low-cost sensors, a collocation is often required with a highly precise reference instrument to train an empirical calibration model. This training phase should be as effective (parsimonious) as possible. The strategy is to reduce the time and maintenance costs of having a reference instrument on site if the purpose is to bring it in the field for future studies where low-cost sensors would have to be calibrated. We investigate the problem of “parsimonious training” by testing different configurations (model and inputs) to establish the minimum amount of reference data needed to obtain good performances with low-cost sensors (Sect. 3.2 and 3.3). Secondly, since the performance of low-cost sensors may change with time, it is important to understand if their measurements could be affected by a drift of their sensitivity over time. We address this problem of “non-stationary training” by comparing different calibration models for a second spike experiment conducted 6 months after the first one (Sect. 3.4). Thirdly, sensitivities may vary from one sensor to another and may require a sensor-specific calibration model, which becomes a problem when a large number of sensors are deployed. Finding a robust calibration model that could be trained using data from one or several sensors and applied to others remains an open question. We bring some insight to this problem of “generalized calibration” by training models to reconstruct the CH₄ signal from a group of sensors located in the same chamber and applying them to other groups of sensors in a different chamber (Sect. 3.5). To assess the performance of the calibration models and particularly their capability to reconstruct spikes of several parts per million occurring upon a background CH₄ level, here we define an acceptable performance to be an error of less than the 10 % of the maximum amplitude of the peaks we aim to reconstruct. In our case, this requirement is an RMSE of 2 ppm between the reconstructed CH₄ data from low-cost sensors and the true data from a reference instrument at a time resolution of 5 s.

2 Methods

2.1 Experimental setup

2.1.1 Low-cost CH₄ sensors

For the experiment, four independent sampling chambers were assembled. Each chamber contained a Figaro^® TGS 2600 originally designed to measure VOCs but sensitive to CH₄, TGS 2611-C00 with enhanced sensitivity to CH₄ and TGS 2611-E00 that includes a carbon filter on top of the sensing material to improve the selectivity to CH₄ even further (see Table A7 for information on the differences of each TGS sensor), alongside a relative humidity and temperature sensor (DHT22 or Sensirion SHT75), and a temperature and pressure sensor (Bosch BMP280; see Table 1 for details). Issues with the logger system produced gaps in environmental variables data, thus observations information from an external chamber (E; see Fig. 1b and Table 1 for details) was used in the correction of the sensitivity across all chambers. The sensors were placed on a circuit board to minimize the direct heating influence of the TGS sensors on temperature measurements. The sampling chamber was made of acrylic/glass with a gas inlet and outlet and a port for the electrical cables (Fig. 1a). Each sensor was connected in series with a high-precision load resistor which controlled sensitivity (Figaro, 2013, 2005). The voltage across each load resistor was recorded by an AB Electronics PiPlus ADC board, mounted on a Raspberry Pi 3b+ logging computer, and sampled at a frequency of 0.5 Hz (2 s). This voltage measurement was used in our characterization algorithms, referred to hereafter as the sensor voltage. We focus on the reconstruction of CH₄ using only the TGS 2611-C00 and TGS 2611-E00 data.

https://amt.copernicus.org/articles/16/2209/2023/amt-16-2209-2023-f01

Figure 1(a) Example of a chamber with three sensors inside. (b) Scheme of the spike creation experiment.

Reconstruction of high-frequency methane atmospheric concentration peaks from measurements using metal oxide low-cost sensors

2.1 Experimental setup

2.1.1 Low-cost CH4 sensors

2.1.2 Generation of methane spikes on top of ambient air

2.2 Separating CH4 spikes from background variations in ambient air

2.3 Modeling CH4 spikes from TGS sensor voltages and environmental variables

2.3.1 Linear and multilinear regression models

2.3.2 Polynomial regression models

2.3.3 Random forest and hybrid random forest models

2.3.4 Artificial neural networks (ANNs)

2.4 Finding a parsimonious model training strategy

2.5 Assessing aging effects of the sensors

2.6 Finding generalized models that can be used for other sensors of the same type

2.7 Metrics for performance evaluation

3.1 Data preprocessing and baseline correction

3.2 Reconstruction of CH4 spikes

3.3 Results of parsimonious training tests

3.4 Results for possible aging effect on model performance

3.5 Generalized models

How do our approach and results compare with previous studies?

2.1.1 Low-cost CH₄ sensors

2.2 Separating CH₄ spikes from background variations in ambient air

2.3 Modeling CH₄ spikes from TGS sensor voltages and environmental variables

3.2 Reconstruction of CH₄ spikes