<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "https://jats.nlm.nih.gov/nlm-dtd/publishing/3.0/journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0" article-type="research-article">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">AMT</journal-id><journal-title-group>
    <journal-title>Atmospheric Measurement Techniques</journal-title>
    <abbrev-journal-title abbrev-type="publisher">AMT</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Atmos. Meas. Tech.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1867-8548</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/amt-19-2923-2026</article-id><title-group><article-title>Evaluating machine learning model performance in a two-step colocation process for TVOC and BTEX sensor calibration</article-title><alt-title>TVOC and BTEX sensor calibration</alt-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Frischmon</surname><given-names>Caroline</given-names></name>
          <email>caroline.frischmon@colorado.edu</email>
        <ext-link>https://orcid.org/0000-0002-9184-4801</ext-link></contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Porter</surname><given-names>Jack</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Balagopalan</surname><given-names>Ethan</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff2">
          <name><surname>Senga</surname><given-names>William</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff3">
          <name><surname>Johnston</surname><given-names>Jill</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="no" rid="aff1">
          <name><surname>Hannigan</surname><given-names>Michael</given-names></name>
          
        </contrib>
        <aff id="aff1"><label>1</label><institution>Department of Mechanical Engineering, University of Colorado Boulder, 1111 Engineering Drive, Boulder, CO, 80309,  USA</institution>
        </aff>
        <aff id="aff2"><label>2</label><institution>South Coast Air Quality Monitoring District, 21865 Copley Drive, Diamond Bar, CA, 91765, USA</institution>
        </aff>
        <aff id="aff3"><label>3</label><institution>Department of Environmental and Occupational Health, University of California Irvine,  856 Health Sciences Quad, Irvine, CA, 92697, USA</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">Caroline Frischmon (caroline.frischmon@colorado.edu)</corresp></author-notes><pub-date><day>4</day><month>May</month><year>2026</year></pub-date>
      
      <volume>19</volume>
      <issue>9</issue>
      <fpage>2923</fpage><lpage>2939</lpage>
      <history>
        <date date-type="received"><day>23</day><month>September</month><year>2025</year></date>
           <date date-type="rev-request"><day>29</day><month>October</month><year>2025</year></date>
           <date date-type="rev-recd"><day>27</day><month>March</month><year>2026</year></date>
           <date date-type="accepted"><day>23</day><month>April</month><year>2026</year></date>
      </history>
      <permissions>
        <copyright-statement>Copyright: © 2026 Caroline Frischmon et al.</copyright-statement>
        <copyright-year>2026</copyright-year>
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026.html">This article is available from https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026.html</self-uri><self-uri xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026.pdf">The full text article is available as a PDF file from https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026.pdf</self-uri>
      <abstract><title>Abstract</title>

      <p id="d2e141">Calibration of low-cost air quality sensors (LCSs) for total volatile organic compound (TVOC) and benzene, toluene, ethylbenzene, and xylenes (BTEX) quantification remains challenging due to the sensors' cross-sensitivity to temperature and humidity and their tendency to drift over time. In this study, we aimed to improve TVOC and BTEX metal oxide (Figaro TGS 2600, 2602, 2611) sensor calibration using a two-step colocation strategy. A two-step colocation places one LCS (the secondary standard) with a reference monitor while others operate in the field, then briefly colocates the field sensors with the secondary standard to address inter-sensor variability and drift. This strategy made it possible to develop the calibration model under environmental conditions closely matching those of the field, which is essential for model transferability from colocation to field conditions. In addition to TVOC and BTEX, we applied the two-step colocation process to NO<sub>2</sub> electrochemical (Alphasense B-4) sensors to demonstrate the broader applicability of our approach beyond TVOC and BTEX quantification.</p>

      <p id="d2e153">Next, we compared the performance of multiple machine learning models, including ridge, lasso, random forest, gradient boosting, extreme gradient boosting, support vector regression, and linear regression, to investigate the optimal model choice for calibration. We found that no single model performed best across all pollutants. For example, gradient boosting excelled at capturing peak TVOC concentrations, while linear regression performed best for BTEX. Conversely, linear regression was the worst-performing model for NO<sub>2</sub>. Overall, the models showed satisfactory RMSE around 40–50 ppb for TVOC, 1.25–1.75 ppb for BTEX, and 4–6 ppb for NO<sub>2</sub>. However, all models also overestimated baseline concentrations and underestimated peaks. The severity of this bias depended on the reference concentration distribution, with the most severe peak underestimation occurring in the more heavily skewed TVOC and BTEX data. The systematic bias at baseline and peak concentrations was not evident in the overall mean bias error, which was near zero for all pollutants. This result underscores the need to evaluate model performance across the entire concentration distribution. Finally, we found that calibration performance was sensitive to the choice of training and testing data split. Future research could seek to optimize the training and testing split to ensure robust model transferability to field data.</p>
  </abstract>
    
<funding-group>
<award-group id="gs1">
<funding-source>National Institute of Environmental Health Sciences</funding-source>
<award-id>1R01ES033478</award-id>
</award-group>
</funding-group>
</article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <label>1</label><title>Introduction</title>
      <p id="d2e183">Low-cost sensors (LCSs) are an increasingly popular tool used by community groups, scientific researchers, and local governments to document air quality impacts at spatial and temporal scales finer than are possible with regulatory or reference-grade monitoring <xref ref-type="bibr" rid="bib1.bibx16 bib1.bibx7 bib1.bibx11 bib1.bibx46" id="paren.1"/>. Despite their increasing usage, LCSs face limitations due to their reduced accuracy compared to reference-grade monitoring methods <xref ref-type="bibr" rid="bib1.bibx28 bib1.bibx5 bib1.bibx7" id="paren.2"/>. To improve accuracy, LCSs are often calibrated via colocation with a reference instrument, which corrects sensor drift and cross-sensitivity to temperature, humidity, and non-target pollutants <xref ref-type="bibr" rid="bib1.bibx36 bib1.bibx28 bib1.bibx37" id="paren.3"/>.</p>
      <p id="d2e195">In a colocation, sensors are placed next to a reference-grade monitor for a set period. The sensor signal is then calibrated to the reference-grade concentration time series via a linear regression or machine learning calibration model <xref ref-type="bibr" rid="bib1.bibx36 bib1.bibx46" id="paren.4"/>. For certain pollutants, such as ozone and PM<sub>2.5</sub>, this process can significantly improve the accuracy of LCSs <xref ref-type="bibr" rid="bib1.bibx1 bib1.bibx40" id="paren.5"/>. However, calibrating LCSs for other pollutants, especially volatile organic compounds (VOCs), has had more limited success <xref ref-type="bibr" rid="bib1.bibx7 bib1.bibx10" id="paren.6"/>, leading some users to interpret low-cost VOC data only qualitatively <xref ref-type="bibr" rid="bib1.bibx18 bib1.bibx49" id="paren.7"/>.</p>
      <p id="d2e219">VOCs are a broad class of chemicals that originate from both natural sources, such as wildfires and vegetation, and anthropogenic sources, such as industrial solvents, consumer products, and fossil fuel extraction, refining, and combustion. VOCs impact respiratory and cardiovascular health and can cause secondary health impacts by contributing to the formation of secondary organic aerosols and tropospheric ozone <xref ref-type="bibr" rid="bib1.bibx27 bib1.bibx22 bib1.bibx31" id="paren.8"/>. Benzene, toluene, ethylbenzene, and xylenes, or BTEX, are a subset of VOCs that are especially relevant to human health due to their carcinogenicity <xref ref-type="bibr" rid="bib1.bibx27" id="paren.9"/>. Total VOCs (TVOCs) and subsets like BTEX are challenging to quantify using LCSs in part because of their relatively low ambient concentrations <xref ref-type="bibr" rid="bib1.bibx58" id="paren.10"/>. Additionally, the chemical makeup of TVOCs can include many different species to which LCSs exhibit varying sensitivities. Low-cost TVOC and BTEX sensors can also show greater sensitivity to temperature and humidity than to VOC species, and they exhibit drift over long-term use <xref ref-type="bibr" rid="bib1.bibx34 bib1.bibx10" id="paren.11"/>. Given these challenges, researchers have had the greatest success in calibrating these sensors for indoor use only, where concentrations tend to be higher and temperature and humidity can be controlled <xref ref-type="bibr" rid="bib1.bibx50 bib1.bibx32" id="paren.12"/>.</p>
      <p id="d2e237">In this paper, we demonstrate a comprehensive calibration approach to better address the challenges of ambient, outdoor TVOC and BTEX quantification using LCSs. This approach involves a two-step colocation and a robust evaluation of machine learning calibration models. We include NO<sub>2</sub> quantification as a comparison to demonstrate how our approach can be used for other pollutants as well.</p>
<sec id="Ch1.S1.SS1">
  <label>1.1</label><title>Two-step colocation</title>
      <p id="d2e257">Calibration models that perform well with colocation data may still produce large error in field predictions if the model is not transferable to the environmental and sensor conditions at field sites. For instance, <xref ref-type="bibr" rid="bib1.bibx38" id="text.13"/> showed how sensor drift reduces the accuracy of calibration models when applied to field data collected long after the colocation period. Others have demonstrated how calibration models can overfit to the environmental conditions, source mixtures, and pollutant concentrations present at the colocation site, making the model less reliable when transferred to field sites <xref ref-type="bibr" rid="bib1.bibx39 bib1.bibx4 bib1.bibx61 bib1.bibx43" id="paren.14"/>. These impacts are especially important to consider for VOCs, which exhibit higher spatial variability than other pollutants and are composed of a complex mixture of chemicals, further complicating model transferability <xref ref-type="bibr" rid="bib1.bibx8" id="paren.15"/>.</p>
      <p id="d2e269">To ensure a calibration model is transferable to field data, colocations must capture the range of environmental conditions expected during field deployment, though many colocations occur only before and/or after field deployment, potentially during different seasons <xref ref-type="bibr" rid="bib1.bibx33" id="paren.16"/>. In contrast, a two-step colocation occurs simultaneously with field deployment, making it possible to capture data under the same seasonal conditions. In a two-step colocation, a single LCS system, referred to here as the secondary standard, is colocated with a reference-grade monitor while the remaining LCS systems collect data at field sites <xref ref-type="bibr" rid="bib1.bibx52 bib1.bibx45" id="paren.17"/>. The field sensors are then colocated with the secondary standard for a shorter period, referred to as the harmonization step, to address inter-sensor variability. A two-step colocation can improve calibration transferability, especially when the colocation with the reference monitor occurs close to the field deployment sites, as this increases the likelihood of capturing similar environmental conditions within the same seasonal timeframe. Two-step colocations also allow for more extensive colocation periods without decreasing the amount of field data that can be collected, which is advantageous for complex machine learning models that require extensive data to prevent overfitting. In this study, we improve upon the two-step colocation process developed by <xref ref-type="bibr" rid="bib1.bibx52" id="text.18"/> and <xref ref-type="bibr" rid="bib1.bibx45" id="text.19"/> to further address sensor drift and model transferability.</p>
</sec>
<sec id="Ch1.S1.SS2">
  <label>1.2</label><title>Evaluation of machine learning calibration models</title>
      <p id="d2e292">Machine learning algorithms can improve calibration models by accounting for the non-linear signal response of LCSs <xref ref-type="bibr" rid="bib1.bibx50 bib1.bibx36" id="paren.20"/>. Due to their complexity, machine learning models require more extensive colocation data to prevent model overfitting to the colocation data, as this can make the model less reliable under changing field conditions <xref ref-type="bibr" rid="bib1.bibx12" id="paren.21"/>. The optimal machine learning model varies by dataset and is typically selected by comparing calibration model performance metrics, such as root mean squared error (RMSE) and mean bias error (MBE), for a portion of colocation data excluded from model training <xref ref-type="bibr" rid="bib1.bibx62" id="paren.22"/>. Evaluating the excluded data, called testing data, helps prevent overfitting to the training data.</p>
      <p id="d2e304">Studies comparing the performance of various machine learning calibration models, such as random forest, artificial neural networks, support vector machine regressions, and gradient boosting, have relied on PM<sub>2.5</sub>, CO, NO<sub>2</sub>, CO<sub>2</sub>, and O<sub>3</sub> datasets <xref ref-type="bibr" rid="bib1.bibx66 bib1.bibx4 bib1.bibx26 bib1.bibx59 bib1.bibx13" id="paren.23"/>. Compared to these pollutants, TVOC and BTEX data are often much more imbalanced, with the majority of measurements clustered near low parts-per-billion baseline levels and occasional episodic spikes multiple orders of magnitude higher <xref ref-type="bibr" rid="bib1.bibx15 bib1.bibx48" id="paren.24"/>. Imbalanced datasets make it more difficult to assess calibration model performance, as models that poorly fit elevated concentrations may still achieve strong overall performance metrics if they accurately predict baseline concentrations <xref ref-type="bibr" rid="bib1.bibx44 bib1.bibx54" id="paren.25"/>. In these cases, partitioning the performance metrics by data percentile can reveal significantly underestimated peak concentrations <xref ref-type="bibr" rid="bib1.bibx66 bib1.bibx19" id="paren.26"/>. Additionally, certain machine learning models, especially gradient boosting, are generally better suited to address data imbalance <xref ref-type="bibr" rid="bib1.bibx20" id="paren.27"/>, but calibrations for low-cost TVOC and BTEX sensors have been limited to linear regression, random forest, and neural network models <xref ref-type="bibr" rid="bib1.bibx14 bib1.bibx19 bib1.bibx57 bib1.bibx64 bib1.bibx24 bib1.bibx50 bib1.bibx35 bib1.bibx47 bib1.bibx44" id="paren.28"/>. Thus, our study provides the first comparison to our knowledge of a comprehensive suite of machine learning models for TVOC and BTEX sensor calibration, including random forest, gradient boosting, support vector regression, artificial neural network, ridge, and lasso algorithms. We also evaluate model performance for NO<sub>2</sub> calibration to compare how these models perform on a more balanced dataset.</p>
</sec>
</sec>
<sec id="Ch1.S2">
  <label>2</label><title>Methods</title>
      <p id="d2e380">We deployed a network of low-cost sensor platforms, called HAQ-Pods, in South Los Angeles to study the impacts of local oil and gas development. Field data collection is ongoing as of July 2025. However, since this study is focused only on sensor calibration, rather than field data analysis, we limit the present analysis to data collected between 4 October 2023 and 28 March 2024 to use just one pre- and post-harmonization. The pre-harmonization occurred from 4 to 9 October 2023, and the post-harmonization occurred from  18  to 28 March 2024. From 10 October 2023 to 18 March 2024, the secondary standard HAQ-Pod was colocated with a reference instrument for calibration, while all other pods were deployed at field sites in South Los Angeles (Fig. <xref ref-type="fig" rid="F1"/>).</p>

      <fig id="F1"><label>Figure 1</label><caption><p id="d2e387">The TVOC and BTEX colocation site is indicated by the black icon, while the NO<sub>2</sub> colocation site is indicated by the gray icon. The harmonization site and field sites are encompassed in the blue box. ©<ext-link xlink:href="https://www.openstreetmap.org/copyright">OpenStreetMap</ext-link> contributors 2025. Distributed under the Open Data Commons Open Database License (ODbL) v1.0.</p></caption><alt-text>Map of calibration sites</alt-text>
        <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f01.png"/>

      </fig>

<sec id="Ch1.S2.SS1">
  <label>2.1</label><title>Instrumentation</title>
      <p id="d2e417">HAQ-Pods were developed in the Hannigan Air Quality (HAQ) Lab and have previously been used to capture local pollution events <xref ref-type="bibr" rid="bib1.bibx18 bib1.bibx54" id="paren.29"/>. HAQ-Pods use interchangeable, commercial sensors to quantify various pollutants. The metal oxide and electrochemical sensors used in this study are listed in Table <xref ref-type="table" rid="T1"/>. We also used a Bosch BME 680 sensor for temperature and humidity measurements. The sensors are housed in a weather-proof case (10.7<inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:msup><mml:mi/><mml:mrow><mml:mo>′</mml:mo><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">9.8</mml:mn><mml:mrow><mml:mo>′</mml:mo><mml:mo>′</mml:mo></mml:mrow></mml:msup><mml:mo>×</mml:mo><mml:msup><mml:mn mathvariant="normal">4.8</mml:mn><mml:mrow><mml:mo>′</mml:mo><mml:mo>′</mml:mo></mml:mrow></mml:msup></mml:mrow></mml:math></inline-formula>) outfitted with fans to draw air across the sensor surfaces. HAQ-Pods feature updated circuitry compared to Y-Pod air monitors, which have been described elsewhere <xref ref-type="bibr" rid="bib1.bibx9 bib1.bibx8" id="paren.30"/>.</p>

<table-wrap id="T1"><label>Table 1</label><caption><p id="d2e464">Sensors included in the HAQ-Pods.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="3">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">Measurement</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Sensor</oasis:entry>
         <oasis:entry colname="col2">Target pollutant</oasis:entry>
         <oasis:entry colname="col3">principle</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Figaro TGS 2600</oasis:entry>
         <oasis:entry colname="col2">TVOC and BTEX</oasis:entry>
         <oasis:entry colname="col3">Metal oxide</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Figaro TGS 2602</oasis:entry>
         <oasis:entry colname="col2">TVOC and BTEX</oasis:entry>
         <oasis:entry colname="col3">Metal oxide</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Figaro TGS 2611</oasis:entry>
         <oasis:entry colname="col2">TVOC and BTEX</oasis:entry>
         <oasis:entry colname="col3">Metal oxide</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Alphasense B-4 series</oasis:entry>
         <oasis:entry colname="col2">NO<sub>2</sub></oasis:entry>
         <oasis:entry colname="col3">Electrochemical</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

      <p id="d2e562">For the TVOC calibration, the secondary standard HAQ-Pod was colocated with an Extractive Fourier Transform Infrared Spectrometer operated and maintained by the South Coast Air Quality Management District (South Coast AQMD) using methods similar to those described in <xref ref-type="bibr" rid="bib1.bibx41" id="text.31"/>. Ambient air is continuously drawn through heated Teflon inlet (1-inch ID) mounted on the roof of the station into a 25-liter cell at <inline-formula><mml:math id="M14" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">25</mml:mn></mml:mrow></mml:math></inline-formula> L min<sup>−1</sup>. Infrared light is generated with a glow-bar and directed into the cell with a series of mirrors. The light beam is passed through the cell multiple times via a series of curved mirrors to achieve a total path length of <inline-formula><mml:math id="M16" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">100</mml:mn></mml:mrow></mml:math></inline-formula> m before exiting the cell and being directed into the infrared spectrometer (Bruker, Type: Matrix-M). Light is detected with an Indium/Antimonide detector in the 1800–4000 cm<sup>−1</sup> region and the spectrometer has a spectral resolution of 0.5 cm<sup>−1</sup>.</p>
      <p id="d2e626">Gas concentrations are calculated using synthetic spectra generated by fitting calibration spectra taken from the HITRAN <xref ref-type="bibr" rid="bib1.bibx53" id="paren.32"/> and PNNL <xref ref-type="bibr" rid="bib1.bibx51" id="paren.33"/> databases to the raw spectra collected by the spectrometer in a linear least squares type fitting algorithm within the spectral evaluation window of 2725 to 3007 cm<sup>−1</sup> <xref ref-type="bibr" rid="bib1.bibx21 bib1.bibx25" id="paren.34"/>. The spectral fit includes calibration spectra for H<sub>2</sub>O, CH<sub>4</sub>, HDO, Propane, Butane, and Octane with TVOC reported as the sum of Propane, Butane, and Octane.</p>
      <p id="d2e669">For the BTEX calibration, the secondary standard HAQ-Pod was collocated with an automated gas chromatograph (auto-GC) instrument (Tricorntech; MiTAP P320 Series)  operated and maintained by South Coast AQMD.  The auto-GC operates continuously with a distinct sampling and analysis phase that span 40  and 20 min, respectively.  During the sampling phase, the auto-GC continuously draws ambient air at 5 cm<sup>3</sup> min<sup>−1</sup> through a roof mounted inlet into a pre-concentrator for 40 min, totaling 200 cm<sup>3</sup> of gas.  During the analysis phase, the sample passes through a series of heated GC columns by way of pressurized inert carrier gas (N<sub>2</sub>), where compounds are separated by polarity and subsequently detected with a series of photoionization detectors.  Digital chromatograms are recorded and used to calculate ambient gas concentration via reference to a set of previously determined calibration chromatograms.  Calibration chromatograms are determined by routinely (twice weekly) directing gas from a concentration-certified compressed cylinder into the auto-GC.  The method is capable of an accuracy of 0.1 parts per billion by volume (ppbv).</p>
      <p id="d2e711">For the NO<sub>2</sub> calibration, the secondary standard HAQ-Pod was colocated with a Teledyne T200 instrument maintained by the South Coast AQMD. The reference instrument was calibrated by the South Coast AQMD every six months and underwent weekly zero and span checks, in addition to nightly PC checks to ensure correct values.</p>
</sec>
<sec id="Ch1.S2.SS2">
  <label>2.2</label><title>Two-step colocation</title>
      <p id="d2e732">The secondary standard HAQ-Pod was colocated with the South Coast AQMD instruments while all other HAQ-Pods were deployed at field sites in South Los Angeles from 10 October 2023 to 18 March 2024. The field HAQ-Pods were harmonized with the secondary standard HAQ-Pod from  4 to  9 October 2023 and from  18 to  28 March 2024. The physical set-up of the harmonization is shown in Fig. <xref ref-type="fig" rid="F2"/>. Unlike previous two-step colocations, the HAQ-Pods were harmonized with the secondary standard both before and after field deployment to better account for sensor drift across the entire study period, as suggested by <xref ref-type="bibr" rid="bib1.bibx52" id="text.35"/>.</p>

      <fig id="F2"><label>Figure 2</label><caption><p id="d2e742">Harmonization set up in Los Angeles</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f02.jpg"/>

        </fig>

      <p id="d2e751"><xref ref-type="bibr" rid="bib1.bibx52" id="text.36"/> and <xref ref-type="bibr" rid="bib1.bibx45" id="text.37"/> demonstrated different approaches for propogating the calibration model to the field LCSs in a two-step colocation (Fig. <xref ref-type="fig" rid="F3"/>). <xref ref-type="bibr" rid="bib1.bibx45" id="text.38"/> developed a primary calibration model between the secondary standard and reference instrument. In the harmonization step, the field LCS systems were then calibrated via a secondary calibration model to the concentration timeseries predicted by the calibrated secondary standard. This method is better suited for projects where the colocation and harmonization steps are similar in length, since each step requires equally complex calibration models.</p>
      <p id="d2e765">Conversely, <xref ref-type="bibr" rid="bib1.bibx52" id="text.39"/> first developed a multi-linear regression model for each sensor signal corrected to the secondary standard's corresponding sensor signal. The multi-linear regression models for the pollutant sensors included temperature and humidity in addition to the pollutant sensor signal. The primary calibration model between the secondary standard and reference monitor could then be applied directly to the corrected field LCS systems. Since this method uses only a linear regression in the harmonization step, it is well-suited for projects where the harmonization is much shorter than the colocation. However, including temperature and humidity in the pollutant sensor regression model may introduce dependencies on environmental conditions that then require the harmonization step to span the full range of conditions expected in the field – thereby reducing the method’s usefulness for short harmonizations. To avoid this, we included only the pollutant sensor signal and a time elapsed variable in our linear regressions for harmonization, so that the regression corrects only for inter-sensor linear baseline shifts and drift (Eq. <xref ref-type="disp-formula" rid="Ch1.E1"/>).

            <disp-formula id="Ch1.E1" content-type="numbered"><label>1</label><mml:math id="M27" display="block"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">1</mml:mn></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msub><mml:msub><mml:mi>T</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></disp-formula>

          Where <inline-formula><mml:math id="M28" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula>s represent the model coefficients, <inline-formula><mml:math id="M29" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the sensor signal for the <inline-formula><mml:math id="M30" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th observation, <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:msub><mml:mi>T</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the time elapsed from the start of data collection for the <inline-formula><mml:math id="M32" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th observation, and <inline-formula><mml:math id="M33" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> represents the error term for the <inline-formula><mml:math id="M34" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th observation.</p>

      <fig id="F3"><label>Figure 3</label><caption><p id="d2e886">Model propogation methods for two-step colocation. This example has two sensor signal inputs: the target pollutant signal and temperature. Solid lines indicate a concentration timeseries and dotted lines indicate raw sensor signal. Note that <xref ref-type="bibr" rid="bib1.bibx52" id="text.40"/> include temperature and humidity in all linear correction models, while we include time elapsed in all linear correction models in this paper.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f03.png"/>

        </fig>

</sec>
<sec id="Ch1.S2.SS3">
  <label>2.3</label><title>Calibration model development</title>
      <p id="d2e906">Inputs into the TVOC and BTEX colocation calibration models included the signals from the metal oxide sensors, as well as temperature, humidity, and a time elapsed feature to account for sensor drift. The NO<sub>2</sub> calibration model used inputs from the electrochemical NO<sub>2</sub> sensor along with temperature, humidity, and time-elapsed data.</p>
      <p id="d2e927">To evaluate model performance, we split the colocation data into training and testing datasets, allocating 80 % for training and 20 % for testing. While this train/test split is typically performed once, we repeated the calibration process 20 times to assess model robustness. For each run, we used a different split of the data. Specifically, for each iteration, we randomly selected a starting point in the time series and designated the next 10 % of data as testing data. A second 10 % segment was selected from a point halfway through the dataset relative to the first chunk, wrapping around if necessary. Figure <xref ref-type="fig" rid="F4"/> shows an example test/train split for each pollutant with reference and predicted concentrations (using a linear regression model) to demonstrate how this test/train approach allows the testing data to span different times of the year. This approach ensures the model is not overfit to a specific environmental state or season, while minimizing information leakage from autocorrelation between training and testing datasets <xref ref-type="bibr" rid="bib1.bibx43" id="paren.41"/>.</p>

      <fig id="F4" specific-use="star"><label>Figure 4</label><caption><p id="d2e937">Example of the test/train split for each colocation timeseries.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f04.png"/>

        </fig>

      <p id="d2e947">The sensor signals are time averaged to 60-minute intervals prior to model training to reduce noise <xref ref-type="bibr" rid="bib1.bibx3" id="paren.42"/>. Signals were then z-scored to a mean zero and standard deviation equal to one based on the training data. During model training, we use <inline-formula><mml:math id="M37" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-fold cross-validation (<inline-formula><mml:math id="M38" display="inline"><mml:mrow><mml:mi>k</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">5</mml:mn></mml:mrow></mml:math></inline-formula>) to tune hyper-parameters to values that minimize the loss function, or mean squared error (MSE), shown in Eq. (<xref ref-type="disp-formula" rid="Ch1.E2"/>). We used the Scikit-learn Python module for all models except the artificial neural network, which used Tensorflow, and extreme gradient boosting, which used the Xgboost package.

            <disp-formula id="Ch1.E2" content-type="numbered"><label>2</label><mml:math id="M39" display="block"><mml:mrow><mml:mtext>MSE</mml:mtext><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></disp-formula>

          Where <inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the reference concentration for the <inline-formula><mml:math id="M41" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th observation and <inline-formula><mml:math id="M42" display="inline"><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover></mml:math></inline-formula> is the predicted concentration for the <inline-formula><mml:math id="M43" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th observation, and <inline-formula><mml:math id="M44" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> is the number of observations.</p>
</sec>
<sec id="Ch1.S2.SS4">
  <label>2.4</label><title>Machine learning model descriptions</title>
<sec id="Ch1.S2.SS4.SSS1">
  <label>2.4.1</label><title>Linear regression (LR)</title>
      <p id="d2e1086">LR employs the following expression to predict the reference concentration using the input sensor signals:

              <disp-formula id="Ch1.E3" content-type="numbered"><label>3</label><mml:math id="M45" display="block"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mn mathvariant="normal">0</mml:mn></mml:msub><mml:mo>+</mml:mo><mml:msubsup><mml:mi mathvariant="normal">Σ</mml:mi><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>p</mml:mi></mml:msubsup><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub><mml:mo>+</mml:mo><mml:msub><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></disp-formula>

            Where <inline-formula><mml:math id="M46" display="inline"><mml:mi mathvariant="italic">β</mml:mi></mml:math></inline-formula>s represent the model coefficients, <inline-formula><mml:math id="M47" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the reference concentration for the <inline-formula><mml:math id="M48" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th observation, <inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mrow><mml:mi>i</mml:mi><mml:mi>j</mml:mi></mml:mrow></mml:msub></mml:mrow></mml:math></inline-formula> is the <inline-formula><mml:math id="M50" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>th feature or input for the <inline-formula><mml:math id="M51" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th observation, <inline-formula><mml:math id="M52" display="inline"><mml:mi>p</mml:mi></mml:math></inline-formula> is the number of features or model inputs, and <inline-formula><mml:math id="M53" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">ϵ</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> represents the error term for the <inline-formula><mml:math id="M54" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th observation.</p>
      <p id="d2e1220">The model coefficients are tuned by minimizing the model error. There are no hyper-parameters that require tuning via <inline-formula><mml:math id="M55" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-fold cross-validation in a linear regression model.</p>
</sec>
<sec id="Ch1.S2.SS4.SSS2">
  <label>2.4.2</label><title>Regularized linear regression: Lasso (L) and ridge (R)</title>
      <p id="d2e1238">Lasso (L) and ridge (R) are regularized linear regression models that add a penalty term to the loss function to reduce overfitting. Lasso uses L1 regularization, which adds the sum of the absolute values of the regression coefficients to the loss function (Eq. <xref ref-type="disp-formula" rid="Ch1.E4"/>) <xref ref-type="bibr" rid="bib1.bibx60" id="paren.43"/>. This reduces the coefficients of unimportant inputs to zero, removing them from the model. Lasso regression is useful when it is suspected that not all model inputs are important for prediction. In ridge regression (L2 regularization), the penalty term is the sum of the squared values of the model coefficients (Eq. <xref ref-type="disp-formula" rid="Ch1.E5"/>), which reduces the coefficients towards zero without setting them at zero <xref ref-type="bibr" rid="bib1.bibx23" id="paren.44"/>. Thus, ridge regression simplifies the model while retaining all inputs in the model.

                  <disp-formula specific-use="align" content-type="numbered"><mml:math id="M56" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E4"><mml:mtd><mml:mtext>4</mml:mtext></mml:mtd><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle displaystyle="true" class="stylechange"/><mml:mtext>Lasso loss Function</mml:mtext><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>+</mml:mo><mml:mi mathvariant="italic">λ</mml:mi><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>p</mml:mi></mml:munderover><mml:mo>|</mml:mo><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi></mml:msub><mml:mo>|</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E5"><mml:mtd><mml:mtext>5</mml:mtext></mml:mtd><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mtext>Ridge loss Function</mml:mtext><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>+</mml:mo><mml:mi mathvariant="italic">λ</mml:mi><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>j</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>p</mml:mi></mml:munderover><mml:msubsup><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msubsup></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

            Where <inline-formula><mml:math id="M57" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the reference concentration for the <inline-formula><mml:math id="M58" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th observation, <inline-formula><mml:math id="M59" display="inline"><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula> is the predicted concentration for the <inline-formula><mml:math id="M60" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th observation, <inline-formula><mml:math id="M61" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">β</mml:mi><mml:mi>j</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the coefficient for the <inline-formula><mml:math id="M62" display="inline"><mml:mi>j</mml:mi></mml:math></inline-formula>th model input, and <inline-formula><mml:math id="M63" display="inline"><mml:mi mathvariant="italic">λ</mml:mi></mml:math></inline-formula> is the regularization hyper-parameter, which determines the strength of the regularization.</p>
</sec>
<sec id="Ch1.S2.SS4.SSS3">
  <label>2.4.3</label><title>Ensemble decision tree-based models: Random forest (RF), gradient boosting (GB), and extreme gradient boosting (XGB)</title>
      <p id="d2e1487">Random forest (RF), gradient boosting (GB), and extreme gradient boosting (XGB) are ensemble decision tree-based learning methods. Similarly to a flow chart, a decision tree separates the data into nodes based on the characteristics of the data, eventually reaching a final node that determines the prediction of the data <xref ref-type="bibr" rid="bib1.bibx56" id="paren.45"/>. As ensemble methods, these models combine outputs from multiple trees to produce a more robust solution <xref ref-type="bibr" rid="bib1.bibx2 bib1.bibx42" id="paren.46"/>. RF uses a bootstrap aggregating approach (bagging) to combine the trees, where each tree is built independently and given equal weight in the final solution <xref ref-type="bibr" rid="bib1.bibx30 bib1.bibx2" id="paren.47"/>. In contrast, GB and XGB use a “boosting” technique, where trees are built sequentially and seek to improve upon the error of the prior tree <xref ref-type="bibr" rid="bib1.bibx30 bib1.bibx42" id="paren.48"/>. XGB is a faster, optimized version of GB that also adds regularization (lasso and ridge) to reduce overfitting <xref ref-type="bibr" rid="bib1.bibx6" id="paren.49"/>.</p>
</sec>
<sec id="Ch1.S2.SS4.SSS4">
  <label>2.4.4</label><title>Support vector machine regression (SVR)</title>
      <p id="d2e1513">Support vector machine regression (SVR) maps the data to a high-dimensional space using a kernel function and determines the best-fitting hyperplane for the transformed data <xref ref-type="bibr" rid="bib1.bibx55" id="paren.50"/>. SVR minimizes the model complexity by only fitting the hyperplane to points falling outside a margin of tolerance. This is different than other machine learning models, which seek to minimize overall error across all data points.</p>
</sec>
<sec id="Ch1.S2.SS4.SSS5">
  <label>2.4.5</label><title>Artificial neural network (ANN)</title>
      <p id="d2e1528">An artificial neural network (ANN) is a machine learning model designed to mimic the way input signals are processed and transformed into output signals in the brain <xref ref-type="bibr" rid="bib1.bibx63" id="paren.51"/>. In an ANN, the input signals pass through interconnected layers of nodes. The connections between the nodes are assigned weights, which are adjusted as the model is trained to optimize the output signal accuracy <xref ref-type="bibr" rid="bib1.bibx65 bib1.bibx29" id="paren.52"/>.</p>
</sec>
</sec>
<sec id="Ch1.S2.SS5">
  <label>2.5</label><title>Calibration model evaluation</title>
      <p id="d2e1546">We evaluated the overall fit of the calibration models using three performance metrics: root mean squared error (RMSE) (Eq. <xref ref-type="disp-formula" rid="Ch1.E6"/>), mean bias error (MBE) (Eq. <xref ref-type="disp-formula" rid="Ch1.E7"/>), and coefficient of determination (<inline-formula><mml:math id="M64" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula>) (Eq. <xref ref-type="disp-formula" rid="Ch1.E8"/>). We also evaluated RMSE and MBE by percentile groups to assess model fits across the entire range of data. For the percentile-based assessment, the data were partitioned into 0–5th percentile, 5–25th percentile, 25–75th percentile, 75–95th percentile, and 95–100th percentile groups.

                <disp-formula specific-use="align" content-type="numbered"><mml:math id="M65" display="block"><mml:mtable displaystyle="true"><mml:mlabeledtr id="Ch1.E6"><mml:mtd><mml:mtext>6</mml:mtext></mml:mtd><mml:mtd><mml:mstyle displaystyle="true" class="stylechange"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mtext>RMSE</mml:mtext><mml:mo>=</mml:mo><mml:msqrt><mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:msqrt></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E7"><mml:mtd><mml:mtext>7</mml:mtext></mml:mtd><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:mtext>MBE</mml:mtext><mml:mo>=</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mn mathvariant="normal">1</mml:mn><mml:mi>n</mml:mi></mml:mfrac></mml:mstyle><mml:munderover><mml:mo movablelimits="false">∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub><mml:mo>)</mml:mo></mml:mrow></mml:mtd></mml:mlabeledtr><mml:mlabeledtr id="Ch1.E8"><mml:mtd><mml:mtext>8</mml:mtext></mml:mtd><mml:mtd><mml:mstyle class="stylechange" displaystyle="true"/></mml:mtd><mml:mtd><mml:mrow><mml:mstyle class="stylechange" displaystyle="true"/><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac style="display"><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:msub><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo stretchy="false" mathvariant="normal">^</mml:mo></mml:mover><mml:mi>i</mml:mi></mml:msub><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:msubsup><mml:mo>(</mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mover accent="true"><mml:mi>y</mml:mi><mml:mo mathvariant="normal">¯</mml:mo></mml:mover><mml:msup><mml:mo>)</mml:mo><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:mfrac></mml:mstyle></mml:mrow></mml:mtd></mml:mlabeledtr></mml:mtable></mml:math></disp-formula>

          Where <inline-formula><mml:math id="M66" display="inline"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow></mml:math></inline-formula> is the reference concentration for the <inline-formula><mml:math id="M67" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th  observation and <inline-formula><mml:math id="M68" display="inline"><mml:mover accent="true"><mml:mrow><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:mrow><mml:mo mathvariant="normal" stretchy="false">^</mml:mo></mml:mover></mml:math></inline-formula> is the predicted concentration for the <inline-formula><mml:math id="M69" display="inline"><mml:mi>i</mml:mi></mml:math></inline-formula>th observation, and <inline-formula><mml:math id="M70" display="inline"><mml:mi>n</mml:mi></mml:math></inline-formula> is the number of observations.</p>
      <p id="d2e1813">Lastly, we applied the calibration models to sensor data collected from three field sites within the Los Angeles study area. We compare the predictions of each model to understand how they vary when applied to field conditions.</p>
</sec>
</sec>
<sec id="Ch1.S3">
  <label>3</label><title>Results and Discussion</title>
      <p id="d2e1825">The full HAQ-Pod network included 11 field sites. We show data for just three sites to maintain the focus on sensor calibration in the present study. These three HAQ-Pods, along with the secondary standard HAQ-Pod had 100 % data completeness in the harmonization step, resulting in 333 harmonization data measurements for each HAQ-Pod after hourly averaging. Data completeness for HAQ-Pods 1, 2, and 3 in the field was 89 %, 86 % and 99 %, respectively, resulting in 3413, 3289, 3789 field data points for each HAQ-Pod. The secondary standard HAQ-Pod had 90 % data completeness, resulting in 3452 data points for the colocation calibration models.</p>
<sec id="Ch1.S3.SS1">
  <label>3.1</label><title>Two-step colocation</title>
      <p id="d2e1835">As observed during harmonization, colocated HAQ-Pod raw sensor signals are generally well correlated, with correlation coefficients between the secondary standard HAQ-Pod raw signals and corresponding raw signals in the other HAQ-Pods ranging from 0.87 to 1 (Supplement). However, some shifted baselines and amplitudes are still visible in the left column of Fig. <xref ref-type="fig" rid="F5"/>, especially for the metal oxide sensors, demonstrating the importance of correcting individual sensor signals rather than relying on a universal or general calibration. After applying the linear correction developed during harmonization (coefficients listed in Tables S2–S7), the signals are visually more aligned (right column of Fig. <xref ref-type="fig" rid="F5"/> and have correlation coefficients with the colocation HAQ-Pod signal ranging from 0.96–1 (Supplement).</p>
      <p id="d2e1842">The left column of Fig. <xref ref-type="fig" rid="F5"/> also reveals why performing both pre- and post-harmonization is essential for some sensors, especially Figaro TGS 2600 and 2611 in this case. These sensors show different behavior compared to the secondary standard HAQ-Pod signal in the pre- and post- harmonization, indicating sensor drift. We addressed this drift by including a time-elapsed feature in the harmonization correction, which approximates the drift as a linear function. It is likely the sensor drift is not perfectly linear, but the approximation appears to be sufficient here based on performance at both the start and end of harmonization. Conducting both pre- and post- harmonizations was essential for checking the drift approximation, as otherwise the model may extrapolate drift beyond what is reasonably possible.</p>

      <fig id="F5"><label>Figure 5</label><caption><p id="d2e1849">Raw sensor signal collected during harmonization (left column) and corrected signals after harmonization with the colocation HAQ-Pod (right column).</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f05.png"/>

        </fig>

      <p id="d2e1859">The temperature and humidity ranges recorded during the field deployment show strong overlap with the temperature and humidity ranges recorded at the HAQ-Pod colocation site (Fig. <xref ref-type="fig" rid="F6"/>, note that measurements are uncalibrated sensor temperature and humidity). This overlap, which is important for colocation model transferability to field data, is possible because the two-step colocation allows for simultaneous colocation and field data collection. If colocation had occurred only before and after field deployment, as is typical for many low-cost sensor deployments, the overlap would look more like that of the field and harmonization phases in Fig. <xref ref-type="fig" rid="F6"/>. Conditions during the harmonization phase trended towards higher temperature and humidity because the harmonizations occurred in fall and spring, while the field deployment was mainly in winter. This seasonal difference could lead to poor transferability if temperature and humidity impacts on the sensors were corrected in the harmonization phases because the model would have to extrapolate the impacts at low temperature and humidity ranges. Instead, our two-step colocation strategy corrects for sensor cross-sensitivity to temperature and humidity in the colocation phase only, where there is sufficient overlap of environmental conditions.</p>

      <fig id="F6" specific-use="star"><label>Figure 6</label><caption><p id="d2e1868">Relative humidity (RH) and temperature (<inline-formula><mml:math id="M71" display="inline"><mml:mi>T</mml:mi></mml:math></inline-formula>) observed during field (green), colocation (orange), and harmonization (purple) phases. These plotted values are harmonized sensor signals not calibrated to a true T or RH measurement.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f06.png"/>

        </fig>

</sec>
<sec id="Ch1.S3.SS2">
  <label>3.2</label><title>Evaluation of machine learning calibration models</title>
      <p id="d2e1892">Figure <xref ref-type="fig" rid="F7"/> highlights how the distribution of TVOC and BTEX concentrations measured by the reference instruments during colocation are more skewed than NO<sub>2</sub>, as 75 % and 78 % of the total concentration range falls within the 95–100th percentile group for TVOC and BTEX, respectively, compared with just 31 % for NO<sub>2</sub>. Thus, the inclusion of NO<sub>2</sub> in our analysis provides a useful comparison of machine learning calibration performance on a more balanced dataset.</p>

      <fig id="F7"><label>Figure 7</label><caption><p id="d2e1926">Reference concentrations, divided into data percentile ranges, of each pollutant collected during colocation.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f07.png"/>

        </fig>

      <p id="d2e1935">In the following sections, we show the calibration model performance for each pollutant across all 20 calibration repetitions (Figs. <xref ref-type="fig" rid="F8"/>, <xref ref-type="fig" rid="F11"/>, <xref ref-type="fig" rid="F14"/>). <inline-formula><mml:math id="M75" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values for these models are available in Fig. S1. To better visualize how these models actually predict testing and field data concentrations, we show predicted data from one calibration run in Figs. <xref ref-type="fig" rid="F9"/>, <xref ref-type="fig" rid="F10"/>, <xref ref-type="fig" rid="F12"/>, <xref ref-type="fig" rid="F13"/>, <xref ref-type="fig" rid="F15"/>, and <xref ref-type="fig" rid="F16"/>. The calibration run we selected to visualize in these plots is one that was close to the median overall performance, based on RMSE and MBE, for each pollutant.</p>

      <fig id="F8" specific-use="star"><label>Figure 8</label><caption><p id="d2e1971">Performance statistics for TVOC calibration models, including overall RMSE <bold>(a)</bold>, RMSE by percentile group <bold>(b)</bold>, overall MBE <bold>(c)</bold>, and MBE by percentile group <bold>(d)</bold>. The calibration models are linear regression (LR), lasso (L), ridge (R), random forest (RF), gradient boosting (GB), extreme gradient boosting (XGB), artificial neural network (ANN), and support vector regression (SVR).</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f08.png"/>

        </fig>

      <fig id="F9" specific-use="star"><label>Figure 9</label><caption><p id="d2e1994">Predicted concentration versus reference concentration for TVOC calibration models: linear regression (LR), lasso (L), ridge (R), random forest (RF), gradient boosting (GB), extreme gradient boosting (XGB), artificial neural network (ANN), and support vector regression (SVR).</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f09.png"/>

        </fig>

      <fig id="F10" specific-use="star"><label>Figure 10</label><caption><p id="d2e2005">VOC concentration predictions at three field sites for each calibration model: linear regression (LR), lasso (L), ridge (R), random forest (RF), gradient boosting (GB), extreme gradient boosting (XGB), artificial neural network (ANN), and support vector regression (SVR)</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f10.png"/>

        </fig>

      <fig id="F11" specific-use="star"><label>Figure 11</label><caption><p id="d2e2016">Performance statistics for BTEX calibration models, including overall RMSE <bold>(a)</bold>, RMSE by percentile group <bold>(b)</bold>, overall MBE <bold>(c)</bold>, and MBE by percentile group <bold>(d)</bold>. The calibration models are linear regression (LR), lasso (L), ridge (R), random forest (RF), gradient boosting (GB), extreme gradient boosting (XGB), artificial neural network (ANN), and support vector regression (SVR)</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f11.png"/>

        </fig>

      <fig id="F12" specific-use="star"><label>Figure 12</label><caption><p id="d2e2040">Predicted concentration versus reference concentration for BTEX calibration models: linear regression (LR), lasso (L), ridge (R), random forest (RF), gradient boosting (GB), extreme gradient boosting (XGB), artificial neural network (ANN), and support vector regression (SVR).</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f12.png"/>

        </fig>

      <fig id="F13" specific-use="star"><label>Figure 13</label><caption><p id="d2e2051">BTEX concentration predictions at three field sites for each calibration model: linear regression (LR), lasso (L), ridge (R), random forest (RF), gradient boosting (GB), extreme gradient boosting (XGB), artificial neural network (ANN), and support vector regression (SVR)</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f13.png"/>

        </fig>

      <fig id="F14" specific-use="star"><label>Figure 14</label><caption><p id="d2e2062">Performance statistics for NO<sub>2</sub> calibration models, including overall RMSE <bold>(a)</bold>, RMSE by percentile group <bold>(b)</bold>, overall MBE <bold>(c)</bold>, and MBE by percentile group <bold>(d)</bold>. The calibration models are linear regression (LR), lasso (L), ridge (R), random forest (RF), gradient boosting (GB), extreme gradient boosting (XGB), artificial neural network (ANN), and support vector regression (SVR)</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f14.png"/>

        </fig>

      <fig id="F15" specific-use="star"><label>Figure 15</label><caption><p id="d2e2094">Predicted concentration versus reference concentration for NO<sub>2</sub> calibration models: linear regression (LR), lasso (L), ridge (R), random forest (RF), gradient boosting (GB), extreme gradient boosting (XGB), artificial neural network (ANN), and support vector regression (SVR).</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f15.png"/>

        </fig>

      <fig id="F16" specific-use="star"><label>Figure 16</label><caption><p id="d2e2115">NO<sub>2</sub> concentration predictions at three field sites for each calibration model: linear regression (LR), lasso (L), ridge (R), random forest (RF), gradient boosting (GB), extreme gradient boosting (XGB), artificial neural network (ANN), and support vector regression (SVR)</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f16.png"/>

        </fig>

<sec id="Ch1.S3.SS2.SSS1">
  <label>3.2.1</label><title>TVOC calibration</title>
      <p id="d2e2140">Depending on the test/train split of the data, the overall RMSE and MBE for each model varied by 25–35 ppb (Panels A, C in Fig. <xref ref-type="fig" rid="F8"/>). Within each run, the models showed similar overall testing RMSE and MBE, indicating the calibration performance depended more on test/train split than model choice. <inline-formula><mml:math id="M79" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> for all the models were also generally similar, with a median value of around 0.65 across all runs (Fig. S1). All the models tended to overestimate baseline concentrations in the 0–5th percentile, with the linear models showing the least overestimation bias (Fig. <xref ref-type="fig" rid="F8"/>, panel d). The models also tended to underestimate peak concentrations in the 95–100th percentile, though gradient boosting (GB) performed best for these peak values based on its median MBE.</p>
      <p id="d2e2158">Linear regression (LR) and ridge regression (R) have nearly identical performance statistics, indicating that ridge, or L2, regularization of the linear model is not necessary for this data. Lasso, or L1, regularization removed humidity and time elapsed from the model inputs in 16 of 20 runs, removed temperature in one run, temperature and time elapsed in one run, and just time elapsed in two runs. None of these combinations significantly changed the model's performance compared to LR and R.</p>
      <p id="d2e2161">Visual inspection of the scatter plots in Fig. <xref ref-type="fig" rid="F9"/> shows that all models except the two boosting models, GB and XGB, severely underpredict the five highest concentrations. Boosting models are known to better predict extreme values in a regression model. These results suggest that boosting models are a good choice for applications where accurately capturing extreme concentrations is critical, such as pollution episode detection.</p>
      <p id="d2e2166">When applied to field data, the models predict similar mean concentrations, within a few parts per billion, but vary more in their predictions of extreme concentrations (baseline and peak) (Fig. <xref ref-type="fig" rid="F10"/>). For the most part, this variation reflects the models performance on testing data. For example, LR, L, R, and SVR predict a negative baseline, which is also evident in the testing data predictions for these models, though less pronounced (Fig. <xref ref-type="fig" rid="F9"/>). The boosting models (GB, XGB) consistently predict the highest maximum concentration, which is reflective of these models' performances in the 95–100th percentile of testing data. Overall, the models are correlated in their predictions, with correlation coefficients ranging from 0.88 to 1 between each model (Supplement).</p>
</sec>
<sec id="Ch1.S3.SS2.SSS2">
  <label>3.2.2</label><title>BTEX calibration</title>
      <p id="d2e2181">The BTEX calibration models show little variability in performance across different test/train splits, except for data above the 95th percentile, where RMSE varied more than 5 ppb and MBE varied by more than 3 ppb for all models (Fig. <xref ref-type="fig" rid="F11"/>). <inline-formula><mml:math id="M80" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> was the lowest of the three pollutants, with a median value of around 0.55 across all runs and models (Fig. S2). Lasso (L) had the worst performance overall, while the other two linear models (LR, R) show slightly better performance at the highest percentile and have the best overall RMSE and MBE.</p>
      <p id="d2e2197">SVR shows less overestimation of baseline concentrations (0–25th percentile) than the other models. Comparing this model to ANN in Fig. <xref ref-type="fig" rid="F11"/> reveals an important issue regarding overall MBE as a performance statistic. Both models show similar bias above the 50th percentile. However, ANN overpredicts lower concentrations (0–50th percentile) much more than SVR. As a result, ANN has a better overall MBE compared to SVR. This is important to consider because in a practical sense, a model that shows less overall bias by equally overpredicting the baseline and underpredicting peak values may not be more useful than a model that accurately predicts baseline concentrations but underpredicts peaks.</p>
      <p id="d2e2202">Figure <xref ref-type="fig" rid="F12"/> shows how the L model predicts far less variability than other models, with few data points predicted above 5 ppb. The L1 regularization used in L removed all inputs except TGS 2602, resulting in a poor model. The L2 regularization in R did not significantly change the model compared to LR.</p>
      <p id="d2e2207">Like TVOC, the mean concentrations predicted at each field site show little variation between models, but variation is higher when comparing the entire range of data (Fig. 12). L consistently has the smallest interquartile range, reflecting the lower variability predicted in the testing data. At sites 1 and 3, all models except L predict similar maximum concentrations. However, at site 2, RF, GB, and XGB predict much lower maximums than LR, R, ANN, and SVR. It is not clear why this inconsistency is occurring; however, it may indicate issues with transferability of the models for BTEX given their changing behavior across field sites. Overall, the models are still highly correlated with each other, with correlation coefficients between models at each field site ranging from 0.89 to 1 (Supplement).</p>
</sec>
<sec id="Ch1.S3.SS2.SSS3">
  <label>3.2.3</label><title>NO<sub>2</sub> calibration</title>
      <p id="d2e2228">Overall, all the NO<sub>2</sub> calibration models tend to overpredict concentrations below the 25th percentile and underpredict concentrations above the 50th percentile (Fig. <xref ref-type="fig" rid="F14"/>, panel d). However, underprediction of data above the 95th percentile is much less severe than TVOC and BTEX predictions, likely because the distribution of reference NO<sub>2</sub> concentrations is more balanced than the BTEX and TVOC concentration distributions, resulting in a much smaller range of peak concentrations to predict. RMSE remains much more consistent across percentiles for NO<sub>2</sub> compared to TVOC and BTEX (Fig. <xref ref-type="fig" rid="F14"/>, panel b).</p>
      <p id="d2e2262">The three linear-based models (LR, L, R) perform poorly for NO<sub>2</sub>, particularly in the low percentile range of concentrations, where the models predict negative concentrations (Fig. <xref ref-type="fig" rid="F15"/>). Mean <inline-formula><mml:math id="M86" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> values were below 0.8 for these models, but mean <inline-formula><mml:math id="M87" display="inline"><mml:mrow><mml:msup><mml:mi>R</mml:mi><mml:mn mathvariant="normal">2</mml:mn></mml:msup></mml:mrow></mml:math></inline-formula> for the non-linear models were above 0.8 (Fig. S3). L1 regularization in the L model removes temperature. The poor performance of L across the high and low percentile ranges compared to LR indicates that all inputs (NO<sub>2</sub> target sensor, temperature, humidity, and time elapsed) are necessary for this calibration.</p>
      <p id="d2e2307">Once again, differences in the field predictions between each model exist mainly in the extreme values (Fig. <xref ref-type="fig" rid="F16"/>). As expected from testing data performance, the linear models predict some highly negative concentrations. RF is also consistent with its testing performance, specifically its baseline overestimation, as it shows the highest minimum concentration for all three sites. The correlation coefficients between a linear model (LR, R, L) and a non-linear model (RF, GB, XGB, ANN, SVR) are lower (0.73–0.93) than the correlation coefficients between two linear or two non-linear models (0.89–1) (Supplement). This is likely because of the unique negative concentrations predicted by the linear models.</p>
</sec>
</sec>
<sec id="Ch1.S3.SS3">
  <label>3.3</label><title>Discussion</title>
      <p id="d2e2321">Across the three pollutants, the calibration models generally underestimated peak concentrations and overestimated baseline levels, though the extent of this bias varied with the distribution of the reference data. For TVOC and BTEX, which had more imbalanced distributions, peak underestimation was more pronounced, likely due to the wider range of high concentrations. These results highlight the importance of evaluating model performance across the full concentration distribution, rather than relying solely on overall metrics. In many cases, biases at the extremes were masked in the overall statistics because the baseline overestimation and peak underestimation often offset each other around the median, resulting in overall MBE values near zero.</p>
      <p id="d2e2324">While there was broad variability in model performance across pollutants, some patterns still emerged. For example, our results also showed nearly identical performance for linear regression (LR) and ridge regression (R) across the three pollutants, suggesting that L2 regularization was not beneficial for these datasets. Lasso regression also did not lead to improved predictive performance for linear regression, especially for BTEX and NO<sub>2</sub>, where removing features through L1 regularization actually worsened model performance. To better understand the importance of each feature in response to the lasso performance, we plot the feature importances from the ensemble decision tree-based models (RF, GB, and XGB) in Fig. <xref ref-type="fig" rid="F17"/>. The feature importances were calculated using the Mean Decrease in Impurity using the Scikit-learn Python module. Based on Fig. <xref ref-type="fig" rid="F17"/>, it is clear that the target pollutant sensor features were more important than the temperature, humidity, and time elapsed features across all pollutants. However, given the reduced performance of lasso regression for BTEX and NO<sub>2</sub>, it appears that even “less important” features can still play an essential role in model training. The increased importance of TGS 2602 for BTEX compared to TVOC aligns with previous work, which found TGS 2602 to be more sensitive to BTEX than TGS 2600 <xref ref-type="bibr" rid="bib1.bibx10" id="paren.53"/>.</p>

      <fig id="F17"><label>Figure 17</label><caption><p id="d2e2354">Feature importance for ensemble decision tree-based models (RF, GB, XGB) models for TVOC, BTEX, and NO<sub>2</sub>.</p></caption>
          <graphic xlink:href="https://amt.copernicus.org/articles/19/2923/2026/amt-19-2923-2026-f17.png"/>

        </fig>

      <p id="d2e2373">Boosting methods, which are known for their improved ability to handle imbalanced data, reduced the underestimation of peak concentrations for TVOC. However, for BTEX, linear-based models (LR and R) actually outperformed boosting methods and other complex machine learning models for both peak concentrations and overall performance. For NO<sub>2</sub>, the linear models performed the worst, especially at baseline concentrations, highlighting that no single model consistently performs best across all colocation datasets. When applied to field data, the models predicted similar mean concentrations for each site but had more variability in baseline and peak concentration predictions. Generally, the variability followed patterns evident in the testing data performance.</p>
      <p id="d2e2385">We built each calibration model multiple times, with differing test/train splits, to assess the robustness of each model. The performance metrics showed some variability, especially in the 95–100th percentile for BTEX and TVOC, based on the test/train split. This was not surprising, given that the range of concentrations in the 95–100th percentile for the BTEX and TVOC testing data could vary dramatically depending on the test/train split. These results motivate further study into how data should be split into testing and training sets to ensure transferability and robustness. At a minimum, we suggest ensuring that the testing data encompass the full range of concentrations measured during colocation.</p>
</sec>
</sec>
<sec id="Ch1.S4" sec-type="conclusions">
  <label>4</label><title>Conclusions</title>
      <p id="d2e2397">Combining the two-step colocation process with a robust evaluation of multiple machine learning models led to satisfactory calibration of TVOC, BTEX, and NO<sub>2</sub> sensors in this study. Our two-step colocation strategy supported model transferability by ensuring the colocation encompassed more similar environmental conditions than was possible with only a pre- and post- calibration. We also showed why a two-step colocation is preferable to a universal calibration model, as the harmonization step was able to correct for the variability and drift unique to each individual sensor in a network. We found that calibration model performance is not one-size-fits-all and is affected by the distribution of reference concentrations and the test/train split. Even for models with little overall bias, percentile-based analysis revealed systemic overestimation of baseline concentrations and underestimation of peaks across all eight models: linear regression, lasso, ridge, random forest, gradient boosting, extreme gradient boosting, artificial neural network, and support vector regression.</p>
</sec>

      
      </body>
    <back><notes notes-type="dataavailability"><title>Data availability</title>

      <p id="d2e2415">Reference data were provided courtesy of the South Coast Air Quality Management District. These data have not passed through the normal review process and are therefore not quality-assured, and they are thus unofficial data. The HAQ-Pod data are available at <ext-link xlink:href="https://doi.org/10.17632/fs25d4b64k.1" ext-link-type="DOI">10.17632/fs25d4b64k.1</ext-link> <xref ref-type="bibr" rid="bib1.bibx17" id="paren.54"/>.</p>
  </notes><app-group>
        <supplementary-material position="anchor"><p id="d2e2424">The supplement related to this article is available online at <inline-supplementary-material xlink:href="https://doi.org/10.5194/amt-19-2923-2026-supplement" xlink:title="pdf">https://doi.org/10.5194/amt-19-2923-2026-supplement</inline-supplementary-material>.</p></supplementary-material>
        </app-group><notes notes-type="authorcontribution"><title>Author contributions</title>

      <p id="d2e2433">Conceptualization: MH, JJ, CF; methodology: CF; software: CF; formal analysis: CF; investigation: CF, JP, EB, WS; resources: JP, EB, WS, MH; writing – original draft: CF; writing – review &amp; editing: JP, EB, WS, JJ, MH; visualization: CF; supervision: JJ, MH; funding aquisition: JJ, MH.</p>
  </notes><notes notes-type="competinginterests"><title>Competing interests</title>

      <p id="d2e2439">The contact author has declared that none of the authors has any competing interests.</p>
  </notes><notes notes-type="disclaimer"><title>Disclaimer</title>

      <p id="d2e2445">Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. The authors bear the ultimate responsibility for providing appropriate place names. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.</p>
  </notes><ack><title>Acknowledgements</title><p id="d2e2451">Thank you to the South Coast Air Quality Monitoring District for providing reference instrumentation and to the Los Angeles pod hosts for allowing us to collect field data in your neighborhood. Thank you also to Venezia Ramirez, Brittany Lu-Jones, Nancy Lam, and Alexander Silverman for your support with pod calibration and maintenance.</p></ack><notes notes-type="financialsupport"><title>Financial support</title>

      <p id="d2e2456">This research has been supported by the National Institute of Environmental Health Sciences (grant no. 1R01ES033478).</p>
  </notes><notes notes-type="reviewstatement"><title>Review statement</title>

      <p id="d2e2463">This paper was edited by Albert Presto and reviewed by two anonymous referees.</p>
  </notes><ref-list>
    <title>References</title>

      <ref id="bib1.bibx1"><label>Barkjohn et al.(2021)</label><mixed-citation>Barkjohn, K. K., Gantt, B., and Clements, A. L.: Development and application of a United States-wide correction for PM2.5 data collected with the PurpleAir sensor, Atmos. Meas. Tech., 14, 4617–4637, <ext-link xlink:href="https://doi.org/10.5194/amt-14-4617-2021" ext-link-type="DOI">10.5194/amt-14-4617-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx2"><label>Breiman(2001)</label><mixed-citation>Breiman, L.: Random forests, Machine learning, 45, 5–32, <ext-link xlink:href="https://doi.org/10.1023/A:1010933404324" ext-link-type="DOI">10.1023/A:1010933404324</ext-link>, 2001.</mixed-citation></ref>
      <ref id="bib1.bibx3"><label>Cabello-Solorzano et al.(2023)</label><mixed-citation>Cabello-Solorzano, K., Ortigosa de Araujo, I., Peña, M., Correia, L., and J. Tallón-Ballesteros, A.: The impact of data normalization on the accuracy of machine learning algorithms: a comparative analysis, in: International conference on soft computing models in industrial and environmental applications, pp. 344–353, Springer, <ext-link xlink:href="https://doi.org/10.1007/978-3-031-42536-3_33" ext-link-type="DOI">10.1007/978-3-031-42536-3_33</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx4"><label>Casey and Hannigan(2018)</label><mixed-citation>Casey, J. G. and Hannigan, M. P.: Testing the performance of field calibration techniques for low-cost gas sensors in new deployment locations: across a county line and across Colorado, Atmos. Meas. Tech., 11, 6351–6378, <ext-link xlink:href="https://doi.org/10.5194/amt-11-6351-2018" ext-link-type="DOI">10.5194/amt-11-6351-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx5"><label>Castell et al.(2017)</label><mixed-citation>Castell, N., Dauge, F. R., Schneider, P., Vogt, M., Lerner, U., Fishbain, B., Broday, D., and Bartonova, A.: Can commercial low-cost sensor platforms contribute to air quality monitoring and exposure estimates?, Environ. Int., 99, 293–302, <ext-link xlink:href="https://doi.org/10.1016/j.envint.2016.12.007" ext-link-type="DOI">10.1016/j.envint.2016.12.007</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx6"><label>Chen and Guestrin(2016)</label><mixed-citation>Chen, T. and Guestrin, C.: Xgboost: A scalable tree boosting system, in: Proceedings of the 22nd international conference on knowledge discovery and data mining, pp. 785–794, <ext-link xlink:href="https://doi.org/10.1145/2939672.2939785" ext-link-type="DOI">10.1145/2939672.2939785</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx7"><label>Clements et al.(2017)</label><mixed-citation>Clements, A. L., Griswold, W. G., Rs, A., Johnston, J. E., Herting, M. M., Thorson, J., Collier-Oxandale, A., and Hannigan, M.: Low-cost air quality monitoring tools: from research to practice (a workshop summary), Sensors, 17, 2478, <ext-link xlink:href="https://doi.org/10.3390/s17112478" ext-link-type="DOI">10.3390/s17112478</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx8"><label>Collier-Oxandale et al.(2018)</label><mixed-citation>Collier-Oxandale, A., Casey, J. G., Piedrahita, R., Ortega, J., Halliday, H., Johnston, J., and Hannigan, M. P.: Assessing a low-cost methane sensor quantification system for use in complex rural and urban environments, Atmos. Meas. Tech., 11, 3569–3594, <ext-link xlink:href="https://doi.org/10.5194/amt-11-3569-2018" ext-link-type="DOI">10.5194/amt-11-3569-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx9"><label>Collier-Oxandale et al.(2020)</label><mixed-citation>Collier-Oxandale, A., Wong, N., Navarro, S., Johnston, J., and Hannigan, M.: Using gas-phase air quality sensors to disentangle potential sources in a Los Angeles neighborhood, Atmos. Environ., 233, 117519, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2020.117519" ext-link-type="DOI">10.1016/j.atmosenv.2020.117519</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx10"><label>Collier-Oxandale et al.(2019)</label><mixed-citation>Collier-Oxandale, A. M., Thorson, J., Halliday, H., Milford, J., and Hannigan, M.: Understanding the ability of low-cost MOx sensors to quantify ambient VOCs, Atmos. Meas. Tech., 12, 1441–1460, <ext-link xlink:href="https://doi.org/10.5194/amt-12-1441-2019" ext-link-type="DOI">10.5194/amt-12-1441-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx11"><label>Commodore et al.(2017)</label><mixed-citation>Commodore, A., Wilson, S., Muhammad, O., Svendsen, E., and Pearce, J.: Community-based participatory research for the study of air pollution: a review of motivations, approaches, and outcomes, Environ. Monit. Assess., 189, 1–30, <ext-link xlink:href="https://doi.org/10.1007/s10661-017-6063-7" ext-link-type="DOI">10.1007/s10661-017-6063-7</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bibx12"><label>Concas et al.(2021)</label><mixed-citation>Concas, F., Mineraud, J., Lagerspetz, E., Varjonen, S., Liu, X., Puolamäki, K., Nurmi, P., and Tarkoma, S.: Low-cost outdoor air quality monitoring and sensor calibration: A survey and critical analysis, ACM Transactions on Sensor Networks (TOSN), 17, 1–44, <ext-link xlink:href="https://doi.org/10.1145/3446005" ext-link-type="DOI">10.1145/3446005</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx13"><label>Considine et al.(2021)</label><mixed-citation>Considine, E. M., Reid, C. E., Ogletree, M. R., and Dye, T.: Improving accuracy of air pollution exposure measurements: Statistical correction of a municipal low-cost airborne particulate matter sensor network, Environ. Pollut., 268, 115833, <ext-link xlink:href="https://doi.org/10.1016/j.envpol.2020.115833" ext-link-type="DOI">10.1016/j.envpol.2020.115833</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx14"><label>De Vito et al.(2008)</label><mixed-citation>De Vito, S., Massera, E., Piga, M., Martinotto, L., and Di Francia, G.: On field calibration of an electronic nose for benzene estimation in an urban pollution monitoring scenario, Sensors and Actuators B: Chemical, 129, 750–757, <ext-link xlink:href="https://doi.org/10.1016/j.snb.2007.09.060" ext-link-type="DOI">10.1016/j.snb.2007.09.060</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx15"><label>Edgerton et al.(1989)</label><mixed-citation>Edgerton, S. A., Holdren, M. W., Smith, D. L., and Shah, J. J.: Inter-urban comparison of ambient volatile organic compound concentrations in US cities, JAPCA, 39, 729–732, <ext-link xlink:href="https://doi.org/10.1080/08940630.1989.10466561" ext-link-type="DOI">10.1080/08940630.1989.10466561</ext-link>, 1989.</mixed-citation></ref>
      <ref id="bib1.bibx16"><label>Fanti et al.(2021)</label><mixed-citation>Fanti, G., Borghi, F., Spinazzè, A., Rovelli, S., Campagnolo, D., Keller, M., Cattaneo, A., Cauda, E., and Cavallo, D. M.: Features and practicability of the next-generation sensors and monitors for exposure assessment to airborne pollutants: a systematic review, Sensors, 21, 4513, <ext-link xlink:href="https://doi.org/10.3390/s21134513" ext-link-type="DOI">10.3390/s21134513</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx17"><label>Frischmon and Hannigan</label><mixed-citation>Frischmon, C. and Hannigan, M.: Low-cost sensor data for VOC and BTEX calibration, Mendeley Data, V1 [data set], <ext-link xlink:href="https://doi.org/10.17632/fs25d4b64k.1" ext-link-type="DOI">10.17632/fs25d4b64k.1</ext-link>, 2026.</mixed-citation></ref>
      <ref id="bib1.bibx18"><label>Frischmon et al.(2025a)</label><mixed-citation>Frischmon, C., Crosslin, J., Burks, L., Weckesser, B., Hannigan, M., and Duderstadt, K.: Detecting air pollution episodes and exploring their impacts using low-cost sensor data and simultaneous community symptom and odor reports, Environ. Res. Lett., <ext-link xlink:href="https://doi.org/10.1088/1748-9326/adc28a" ext-link-type="DOI">10.1088/1748-9326/adc28a</ext-link>, 2025a.</mixed-citation></ref>
      <ref id="bib1.bibx19"><label>Frischmon et al.(2025b)</label><mixed-citation>Frischmon, C., Silberstein, J., Guth, A., Mattson, E., Porter, J., and Hannigan, M.: Improving the quantification of peak concentrations for air quality sensors via data weighting, Atmos. Meas. Tech., 18, 3147–3159, <ext-link xlink:href="https://doi.org/10.5194/amt-18-3147-2025" ext-link-type="DOI">10.5194/amt-18-3147-2025</ext-link>, 2025b.</mixed-citation></ref>
      <ref id="bib1.bibx20"><label>Galar et al.(2011)</label><mixed-citation>Galar, M., Fernandez, A., Barrenechea, E., Bustince, H., and Herrera, F.: A review on ensembles for the class imbalance problem: bagging-, boosting-, and hybrid-based approaches, IEEE T. Syst. Man Cy. C, 42, 463–484, <ext-link xlink:href="https://doi.org/10.1109/TSMCC.2011.2161285" ext-link-type="DOI">10.1109/TSMCC.2011.2161285</ext-link>, 2011.</mixed-citation></ref>
      <ref id="bib1.bibx21"><label>Griffith(1996)</label><mixed-citation>Griffith, D. W.: Synthetic calibration and quantitative analysis of gas-phase FT-IR spectra, Appl. Spectrosc., 50, 59–70, <ext-link xlink:href="https://doi.org/10.1366/0003702963906627" ext-link-type="DOI">10.1366/0003702963906627</ext-link>, 1996.</mixed-citation></ref>
      <ref id="bib1.bibx22"><label>Haagen-Smit et al.(1953)</label><mixed-citation>Haagen-Smit, A. J., Bradley, C., and Fox, M.: Ozone formation in photochemical oxidation of organic substances, Ind. Eng. Chem., 45, 2086–2089, <ext-link xlink:href="https://doi.org/10.5254/1.3543469" ext-link-type="DOI">10.5254/1.3543469</ext-link>, 1953.</mixed-citation></ref>
      <ref id="bib1.bibx23"><label>Hoerl and Kennard(1970)</label><mixed-citation>Hoerl, A. E. and Kennard, R. W.: Ridge regression: Biased estimation for nonorthogonal problems, Technometrics, 12, 55–67, <ext-link xlink:href="https://doi.org/10.2307/1271436" ext-link-type="DOI">10.2307/1271436</ext-link>, 1970.</mixed-citation></ref>
      <ref id="bib1.bibx24"><label>Hong et al.(2023)</label><mixed-citation>Hong, G.-H., Le, T.-C., Lin, G.-Y., Cheng, H.-W., Yu, J.-Y., Dejchanchaiwong, R., Tekasakul, P., and Tsai, C.-J.: Long-term field calibration of low-cost metal oxide VOC sensor: Meteorological and interference gas effects, Atmos. Environ., 310, 119955, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2023.119955" ext-link-type="DOI">10.1016/j.atmosenv.2023.119955</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx25"><label>Johansson et al.(2014)</label><mixed-citation>Johansson, J. K., Mellqvist, J., Samuelsson, J., Offerle, B., Lefer, B., Rappenglück, B., Flynn, J., and Yarwood, G.: Emission measurements of alkenes, alkanes, SO<sub>2</sub>, and NO<sub>2</sub> from stationary sources in Southeast Texas over a 5 year period using SOF and mobile DOAS, J. Geophys. Res.-Atmos., 119, 1973–1991, <ext-link xlink:href="https://doi.org/10.1002/2013JD020485" ext-link-type="DOI">10.1002/2013JD020485</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx26"><label>Johnson et al.(2018)</label><mixed-citation>Johnson, N. E., Bonczak, B., and Kontokosta, C. E.: Using a gradient boosting model to improve the performance of low-cost aerosol monitors in a dense, heterogeneous urban environment, Atmos. Environ., 184, 9–16, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2018.04.019" ext-link-type="DOI">10.1016/j.atmosenv.2018.04.019</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx27"><label>Kampa and Castanas(2008)</label><mixed-citation>Kampa, M. and Castanas, E.: Human health effects of air pollution, Environ. Pollut., 151, 362–367, <ext-link xlink:href="https://doi.org/10.1016/j.envpol.2007.06.012" ext-link-type="DOI">10.1016/j.envpol.2007.06.012</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx28"><label>Karagulian et al.(2019)</label><mixed-citation>Karagulian, F., Barbiere, M., Kotsev, A., Spinelle, L., Gerboles, M., Lagler, F., Redon, N., Crunaire, S., and Borowiak, A.: Review of the performance of low-cost sensors for air quality monitoring, Atmosphere, 10, 506, <ext-link xlink:href="https://doi.org/10.3390/atmos10090506" ext-link-type="DOI">10.3390/atmos10090506</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx29"><label>Krogh(2008)</label><mixed-citation>Krogh, A.: What are artificial neural networks?, Nat. Biotechnol., 26, 195–197, <ext-link xlink:href="https://doi.org/10.1038/nbt1386" ext-link-type="DOI">10.1038/nbt1386</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx30"><label>Kumar and Sahu(2021)</label><mixed-citation>Kumar, V. and Sahu, M.: Evaluation of nine machine learning regression algorithms for calibration of low-cost PM<sub>2.5</sub> sensor, J. Aerosol Sci., 157, 105809, <ext-link xlink:href="https://doi.org/10.1016/j.jaerosci.2021.105809" ext-link-type="DOI">10.1016/j.jaerosci.2021.105809</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx31"><label>Laaksonen et al.(2008)</label><mixed-citation>Laaksonen, A., Kulmala, M., O'Dowd, C. D., Joutsensaari, J., Vaattovaara, P., Mikkonen, S., Lehtinen, K. E. J., Sogacheva, L., Dal Maso, M., Aalto, P., Petäjä, T., Sogachev, A., Yoon, Y. J., Lihavainen, H., Nilsson, D., Facchini, M. C., Cavalli, F., Fuzzi, S., Hoffmann, T., Arnold, F., Hanke, M., Sellegri, K., Umann, B., Junkermann, W., Coe, H., Allan, J. D., Alfarra, M. R., Worsnop, D. R., Riekkola, M.-L., Hyötyläinen, T., and Viisanen, Y.: The role of VOC oxidation products in continental new particle formation, Atmos. Chem. Phys., 8, 2657–2665, <ext-link xlink:href="https://doi.org/10.5194/acp-8-2657-2008" ext-link-type="DOI">10.5194/acp-8-2657-2008</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bibx32"><label>Leidinger et al.(2014)</label><mixed-citation>Leidinger, M., Sauerwald, T., Reimringer, W., Ventura, G., and Schütze, A.: Selective detection of hazardous VOCs for indoor air quality applications using a virtual gas sensor array, J. Sensor. Sensor Syst., 3, 253–263, <ext-link xlink:href="https://doi.org/10.5194/jsss-3-253-2014" ext-link-type="DOI">10.5194/jsss-3-253-2014</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bibx33"><label>Levy Zamora et al.(2023)</label><mixed-citation>Levy Zamora, M., Buehler, C., Datta, A., Gentner, D. R., and Koehler, K.: Identifying optimal co-location calibration periods for low-cost sensors, Atmos. Meas. Tech., 16, 169–179, <ext-link xlink:href="https://doi.org/10.5194/amt-16-169-2023" ext-link-type="DOI">10.5194/amt-16-169-2023</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx34"><label>Lewis et al.(2016)</label><mixed-citation>Lewis, A. C., Lee, J. D., Edwards, P. M., Shaw, M. D., Evans, M. J., Moller, S. J., Smith, K. R., Buckley, J. W., Ellis, M., Gillot, S. R., and White, A.: Evaluating the performance of low cost chemical sensors for air pollution research, Faraday Discuss., 189, 85–103, <ext-link xlink:href="https://doi.org/10.1039/C5FD00201J" ext-link-type="DOI">10.1039/C5FD00201J</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx35"><label>Li et al.(2023)</label><mixed-citation>Li, Z., Ma, Z., Zhang, Z., Zhang, L., Tian, E., Zhang, H., Yang, R., Zhu, D., Li, H., Wang, Z., Zhang, Y., Xu, P., Xu, Y., Wwang, D., Wang, G., Kim, M., Yuan, Y., Qiao, X., Li, M., Xie, Y., and Jiang, J.: High-density volatile organic compound monitoring network for identifying pollution sources, Sci. Total Environ., 855, 158872, <ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2022.158872" ext-link-type="DOI">10.1016/j.scitotenv.2022.158872</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx36"><label>Liang(2021)</label><mixed-citation>Liang, L.: Calibrating low-cost sensors for ambient air monitoring: Techniques, trends, and challenges, Environ. Res., 197, 111163, <ext-link xlink:href="https://doi.org/10.1016/j.envres.2021.111163" ext-link-type="DOI">10.1016/j.envres.2021.111163</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx37"><label>Liu et al.(2020)</label><mixed-citation>Liu, X., Jayaratne, R., Thai, P., Kuhn, T., Zing, I., Christensen, B., Lamont, R., Dunbabin, M., Zhu, S., Gao, J., Wainwright, D., Neale, D., Kan, R., Kirkwood, J., and Morawska, L.: Low-cost sensors as an alternative for long-term air quality monitoring, Environ. Res., 185, 109438, <ext-link xlink:href="https://doi.org/10.1016/j.envres.2020.109438" ext-link-type="DOI">10.1016/j.envres.2020.109438</ext-link>, 2020.</mixed-citation></ref>
      <ref id="bib1.bibx38"><label>Malings et al.(2019)</label><mixed-citation>Malings, C., Tanzer, R., Hauryliuk, A., Kumar, S. P. N., Zimmerman, N., Kara, L. B., Presto, A. A., and R. Subramanian: Development of a general calibration model and long-term performance evaluation of low-cost sensors for air pollutant gas monitoring, Atmos. Meas. Tech., 12, 903–920, <ext-link xlink:href="https://doi.org/10.5194/amt-12-903-2019" ext-link-type="DOI">10.5194/amt-12-903-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx39"><label>Malyan et al.(2024)</label><mixed-citation>Malyan, V., Kumar, V., Moni, M., Sahu, M., Prakash, J., Choudhary, S., Raliya, R., Chadha, T. S., Fang, J., and Biswas, P.: Assessing the spatial transferability of calibration models across a low-cost sensors network, J. Aerosol Sci., 181, 106437, <ext-link xlink:href="https://doi.org/10.1016/j.jaerosci.2024.106437" ext-link-type="DOI">10.1016/j.jaerosci.2024.106437</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx40"><label>Masiol et al.(2018)</label><mixed-citation>Masiol, M., Squizzato, S., Chalupa, D., Rich, D. Q., and Hopke, P. K.: Evaluation and field calibration of a low-cost ozone monitor at a regulatory urban monitoring station, Aerosol Air Qual. Res., 18, 2029–2037, <ext-link xlink:href="https://doi.org/10.4209/aaqr.2018.02.0056" ext-link-type="DOI">10.4209/aaqr.2018.02.0056</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx41"><label>Mellqvist et al.(2017)</label><mixed-citation>Mellqvist, J., Samuelsson, J., Isoz, O., Brohede, S., Andersson, P., Ericsson, M., and Johansson, J.: Emission measurements of VOCs, NO<sub>2</sub> and SO<sub>2</sub> from the refineries in the south coast air basin using solar occultation flux and other optical remote sensing methods, FluxSense/SCAQMD-2015, <uri>https://www.aqmd.gov/docs/default-source/fenceline_monitoring/project_1/fluxsense_scaqmd2015_project1_finalreport(040717).pdf</uri> (last access: 28 April 2026), 2017.</mixed-citation></ref>
      <ref id="bib1.bibx42"><label>Natekin and Knoll(2013)</label><mixed-citation>Natekin, A. and Knoll, A.: Gradient boosting machines, a tutorial, Front. Neurorobot., 7, 21, <ext-link xlink:href="https://doi.org/10.3389/fnbot.2013.00021" ext-link-type="DOI">10.3389/fnbot.2013.00021</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bibx43"><label>Nowack et al.(2021)</label><mixed-citation>Nowack, P., Konstantinovskiy, L., Gardiner, H., and Cant, J.: Machine learning calibration of low-cost NO<sub>2</sub> and PM<sub>10</sub> sensors: non-linear algorithms and their impact on site transferability, Atmos. Meas. Tech., 14, 5637–5655, <ext-link xlink:href="https://doi.org/10.5194/amt-14-5637-2021" ext-link-type="DOI">10.5194/amt-14-5637-2021</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx44"><label>Okorn and Hannigan(2021a)</label><mixed-citation>Okorn, K. and Hannigan, M.: Applications and limitations of quantifying speciated and source-apportioned vocs with metal oxide sensors, Atmosphere, 12, 1383, <ext-link xlink:href="https://doi.org/10.3390/atmos12111383" ext-link-type="DOI">10.3390/atmos12111383</ext-link>, 2021a.</mixed-citation></ref>
      <ref id="bib1.bibx45"><label>Okorn and Hannigan(2021b)</label><mixed-citation>Okorn, K. and Hannigan, M.: Improving Air Pollutant Metal Oxide Sensor Quantification Practices through: An Exploration of Sensor Signal Normalization, Multi-Sensor and Universal Calibration Model Generation, and Physical Factors Such as Co-Location Duration and Sensor Age, Atmosphere, 12, 645, <ext-link xlink:href="https://doi.org/10.3390/atmos12050645" ext-link-type="DOI">10.3390/atmos12050645</ext-link>, 2021b.</mixed-citation></ref>
      <ref id="bib1.bibx46"><label>Okorn and Iraci(2024)</label><mixed-citation>Okorn, K. and Iraci, L. T.: An overview of outdoor low-cost gas-phase air quality sensor deployments: current efforts, trends, and limitations, Atmos. Meas. Tech., 17, 6425–6457, <ext-link xlink:href="https://doi.org/10.5194/amt-17-6425-2024" ext-link-type="DOI">10.5194/amt-17-6425-2024</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx47"><label>Okorn et al.(2021)</label><mixed-citation>Okorn, K., Jimenez, A., Collier-Oxandale, A., Johnston, J., and Hannigan, M.: Characterizing methane and total non-methane hydrocarbon levels in Los Angeles communities with oil and gas facilities using air quality monitors, Sci. Total Environ., 777, 146194, <ext-link xlink:href="https://doi.org/10.1016/j.scitotenv.2021.146194" ext-link-type="DOI">10.1016/j.scitotenv.2021.146194</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx48"><label>Ou-Yang et al.(2018)</label><mixed-citation>Ou-Yang, C.-F., Liao, W.-C., Chang, C.-C., Hsieh, H.-C., and Wang, J.-L.: Guided episodic sampling for capturing and characterizing industrial plumes, Atmos. Environ., 174, 188–193, <ext-link xlink:href="https://doi.org/10.1016/j.atmosenv.2017.11.044" ext-link-type="DOI">10.1016/j.atmosenv.2017.11.044</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bibx49"><label>Raheja et al.(2022)</label><mixed-citation>Raheja, G., Harper, L., Hoffman, A., Gorby, Y., Freese, L., O’Leary, B., Deron, N., Smith, S., Auch, T., Goodwin, M., and Westervelt, D.: Community-based participatory research for low-cost air pollution monitoring in the wake of unconventional oil and gas development in the Ohio River Valley: Empowering impacted residents through community science, Environ. Res. Lett., 17, 065006, <ext-link xlink:href="https://doi.org/10.1088/1748-9326/ac6ad6" ext-link-type="DOI">10.1088/1748-9326/ac6ad6</ext-link>, 2022.</mixed-citation></ref>
      <ref id="bib1.bibx50"><label>Robin et al.(2021)</label><mixed-citation>Robin, Y., Amann, J., Baur, T., Goodarzi, P., Schultealbert, C., Schneider, T., and Schütze, A.: High-performance VOC quantification for IAQ monitoring using advanced sensor systems and deep learning, Atmosphere, 12, 1487, <ext-link xlink:href="https://doi.org/10.3390/atmos12111487" ext-link-type="DOI">10.3390/atmos12111487</ext-link>, 2021.</mixed-citation></ref>
      <ref id="bib1.bibx51"><label>Rothman et al.(2005)</label><mixed-citation>Rothman, L. S., Jacquemart, D., Barbe, A., Benner, D. C., Birk, M., Brown, L., Carleer, M., Chackerian Jr, C., Chance, K., Coudert, L., Dana, V., Devi, V., JM, F., Gamache, R., Goldman, A., Hartmann, J., Jucks, K., Maki, A., Mandin, J., Massie, S., Orphal, J., Perrin, A., Rinsland, C., Smith, M., Tennyson, J., Tochenov, R., Toth, R., Vander Auwera, J., Varanasi, P., and Wagner, G.: The HITRAN 2004 molecular spectroscopic database, J. Quant. Spectrosc. Ra., 96, 139–204, <ext-link xlink:href="https://doi.org/10.1016/j.jqsrt.2004.10.008" ext-link-type="DOI">10.1016/j.jqsrt.2004.10.008</ext-link>, 2005.</mixed-citation></ref>
      <ref id="bib1.bibx52"><label>Sá et al.(2023)</label><mixed-citation>Sá, J., Chojer, H., Branco, P., Alvim-Ferraz, M., Martins, F., and Sousa, S.: Two step calibration method for ozone low-cost sensor: Field experiences with the UrbanSense DCUs, J. Environ. Manage., 328, 116910, <ext-link xlink:href="https://doi.org/10.1016/j.jenvman.2022.116910" ext-link-type="DOI">10.1016/j.jenvman.2022.116910</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx53"><label>Sharpe et al.(2004)</label><mixed-citation>Sharpe, S. W., Johnson, T. J., Sams, R. L., Chu, P. M., Rhoderick, G. C., and Johnson, P. A.: Gas-phase databases for quantitative infrared spectroscopy, Appl. Spectrosc., 58, 1452–1461, <ext-link xlink:href="https://doi.org/10.1366/0003702042641281" ext-link-type="DOI">10.1366/0003702042641281</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx54"><label>Silberstein et al.(2024)</label><mixed-citation>Silberstein, J., Wellbrook, M., and Hannigan, M.: Utilization of a Low-Cost Sensor Array for Mobile Methane Monitoring, Sensors, 24, 519, <ext-link xlink:href="https://doi.org/10.3390/s24020519" ext-link-type="DOI">10.3390/s24020519</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx55"><label>Smola and Schölkopf(2004)</label><mixed-citation>Smola, A. J. and Schölkopf, B.: A tutorial on support vector regression, Stat. Comput., 14, 199–222, <ext-link xlink:href="https://doi.org/10.1023/b:stco.0000035301.49549.88" ext-link-type="DOI">10.1023/b:stco.0000035301.49549.88</ext-link>, 2004.</mixed-citation></ref>
      <ref id="bib1.bibx56"><label>Song and Ying(2015)</label><mixed-citation>Song, Y.-Y. and Ying, L.: Decision tree methods: applications for classification and prediction, Shanghai Archives of Psychiatry, 27, 130, <ext-link xlink:href="https://doi.org/10.11919/j.issn.1002-0829.215044" ext-link-type="DOI">10.11919/j.issn.1002-0829.215044</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bibx57"><label>Spinelle et al.(2017a)</label><mixed-citation>Spinelle, L., Gerboles, M., Kok, G., Persijn, S., and Sauerwald, T.: Performance evaluation of low-cost BTEX sensors and devices within the EURAMET key-VOCs project, in: Proceedings, vol. 1, p. 425, MDPI, <ext-link xlink:href="https://doi.org/10.3390/proceedings1040425" ext-link-type="DOI">10.3390/proceedings1040425</ext-link>, 2017a.</mixed-citation></ref>
      <ref id="bib1.bibx58"><label>Spinelle et al.(2017b)</label><mixed-citation>Spinelle, L., Gerboles, M., Kok, G., Persijn, S., and Sauerwald, T.: Review of portable and low-cost sensors for the ambient air monitoring of benzene and other volatile organic compounds, Sensors, 17, 1520, <ext-link xlink:href="https://doi.org/10.3390/s17071520" ext-link-type="DOI">10.3390/s17071520</ext-link>, 2017b.</mixed-citation></ref>
      <ref id="bib1.bibx59"><label>Srishti et al.(2023)</label><mixed-citation>Srishti, S., Agrawal, P., Kulkarni, P., Gautam, H. C., Kushwaha, M., and Sreekanth, V.: Multiple PM low-cost sensors, multiple seasons’ data, and multiple calibration models, Aerosol Air Qual. Res., 23, 220428, <ext-link xlink:href="https://doi.org/10.4209/aaqr.220428" ext-link-type="DOI">10.4209/aaqr.220428</ext-link>, 2023.</mixed-citation></ref>
      <ref id="bib1.bibx60"><label>Tibshirani(1996)</label><mixed-citation> Tibshirani, R.: Regression shrinkage and selection via the lasso: a retrospective, J. Roy. Stat. Soc. B, 58, 267–288, 1996.</mixed-citation></ref>
      <ref id="bib1.bibx61"><label>Vikram et al.(2019)</label><mixed-citation>Vikram, S., Collier-Oxandale, A., Ostertag, M. H., Menarini, M., Chermak, C., Dasgupta, S., Rosing, T., Hannigan, M., and Griswold, W. G.: Evaluating and improving the reliability of gas-phase sensor system calibrations across new locations for ambient measurements and personal exposure monitoring, Atmos. Meas. Tech., 12, 4211–4239, <ext-link xlink:href="https://doi.org/10.5194/amt-12-4211-2019" ext-link-type="DOI">10.5194/amt-12-4211-2019</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx62"><label>Wang(2024)</label><mixed-citation>Wang, Z.: Evaluating the efficacy of machine learning in calibrating low-cost sensors, Appl. Comput. Eng., 71, 30–38, <ext-link xlink:href="https://doi.org/10.54254/2755-2721/71/20241635" ext-link-type="DOI">10.54254/2755-2721/71/20241635</ext-link>, 2024.</mixed-citation></ref>
      <ref id="bib1.bibx63"><label>Wesolowski and Suchacz(2012)</label><mixed-citation>Wesolowski, M. and Suchacz, B.: Artificial neural networks: theoretical background and pharmaceutical applications: a review, J. AOAC Int., 95, 652–668, <ext-link xlink:href="https://doi.org/10.5740/jaoacint.sge_wesolowski_ann" ext-link-type="DOI">10.5740/jaoacint.sge_wesolowski_ann</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bibx64"><label>Yurko et al.(2019)</label><mixed-citation>Yurko, G., Roostaei, J., Dittrich, T., Xu, L., Ewing, M., Zhang, Y., and Shreve, G.: Real-time sensor response characteristics of 3 commercial metal oxide sensors for detection of BTEX and chlorinated aliphatic hydrocarbon organic vapors, Chemosensors, 7, 40, <ext-link xlink:href="https://doi.org/10.3390/chemosensors7030040" ext-link-type="DOI">10.3390/chemosensors7030040</ext-link>, 2019.</mixed-citation></ref>
      <ref id="bib1.bibx65"><label>Zhang(2016)</label><mixed-citation>Zhang, Z.: A gentle introduction to artificial neural networks, Ann. Trans. Med., 4, 370, <ext-link xlink:href="https://doi.org/10.21037/atm.2016.06.20" ext-link-type="DOI">10.21037/atm.2016.06.20</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bibx66"><label>Zimmerman et al.(2018)</label><mixed-citation>Zimmerman, N., Presto, A. A., Kumar, S. P. N., Gu, J., Hauryliuk, A., Robinson, E. S., Robinson, A. L., and R. Subramanian: A machine learning calibration model using random forests to improve sensor performance for lower-cost air quality monitoring, Atmos. Meas. Tech., 11, 291–313, <ext-link xlink:href="https://doi.org/10.5194/amt-11-291-2018" ext-link-type="DOI">10.5194/amt-11-291-2018</ext-link>, 2018.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Evaluating machine learning model performance in a two-step colocation process for TVOC and BTEX sensor calibration</article-title-html>
<abstract-html/>
<ref-html id="bib1.bib1"><label>Barkjohn et al.(2021)</label><mixed-citation>
      
Barkjohn, K. K., Gantt, B., and Clements, A. L.: Development and application of a United States-wide correction for PM2.5 data collected with the PurpleAir sensor, Atmos. Meas. Tech., 14, 4617–4637, <a href="https://doi.org/10.5194/amt-14-4617-2021" target="_blank">https://doi.org/10.5194/amt-14-4617-2021</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>Breiman(2001)</label><mixed-citation>
      
Breiman, L.: Random forests, Machine learning, 45, 5–32,
<a href="https://doi.org/10.1023/A:1010933404324" target="_blank">https://doi.org/10.1023/A:1010933404324</a>, 2001.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>Cabello-Solorzano et al.(2023)</label><mixed-citation>
      
Cabello-Solorzano, K., Ortigosa de Araujo, I., Peña, M., Correia, L., and
J. Tallón-Ballesteros, A.: The impact of data normalization on the
accuracy of machine learning algorithms: a comparative analysis, in:
International conference on soft computing models in industrial and
environmental applications, pp. 344–353, Springer,
<a href="https://doi.org/10.1007/978-3-031-42536-3_33" target="_blank">https://doi.org/10.1007/978-3-031-42536-3_33</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>Casey and Hannigan(2018)</label><mixed-citation>
      
Casey, J. G. and Hannigan, M. P.: Testing the performance of field calibration techniques for low-cost gas sensors in new deployment locations: across a county line and across Colorado, Atmos. Meas. Tech., 11, 6351–6378, <a href="https://doi.org/10.5194/amt-11-6351-2018" target="_blank">https://doi.org/10.5194/amt-11-6351-2018</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>Castell et al.(2017)</label><mixed-citation>
      
Castell, N., Dauge, F. R., Schneider, P., Vogt, M., Lerner, U., Fishbain, B.,
Broday, D., and Bartonova, A.: Can commercial low-cost sensor platforms
contribute to air quality monitoring and exposure estimates?, Environ.
Int., 99, 293–302, <a href="https://doi.org/10.1016/j.envint.2016.12.007" target="_blank">https://doi.org/10.1016/j.envint.2016.12.007</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>Chen and Guestrin(2016)</label><mixed-citation>
      
Chen, T. and Guestrin, C.: Xgboost: A scalable tree boosting system, in:
Proceedings of the 22nd international conference on knowledge discovery and
data mining, pp. 785–794, <a href="https://doi.org/10.1145/2939672.2939785" target="_blank">https://doi.org/10.1145/2939672.2939785</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>Clements et al.(2017)</label><mixed-citation>
      
Clements, A. L., Griswold, W. G., Rs, A., Johnston, J. E., Herting, M. M.,
Thorson, J., Collier-Oxandale, A., and Hannigan, M.: Low-cost air quality
monitoring tools: from research to practice (a workshop summary), Sensors,
17, 2478, <a href="https://doi.org/10.3390/s17112478" target="_blank">https://doi.org/10.3390/s17112478</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>Collier-Oxandale et al.(2018)</label><mixed-citation>
      
Collier-Oxandale, A., Casey, J. G., Piedrahita, R., Ortega, J., Halliday, H., Johnston, J., and Hannigan, M. P.: Assessing a low-cost methane sensor quantification system for use in complex rural and urban environments, Atmos. Meas. Tech., 11, 3569–3594, <a href="https://doi.org/10.5194/amt-11-3569-2018" target="_blank">https://doi.org/10.5194/amt-11-3569-2018</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>Collier-Oxandale et al.(2020)</label><mixed-citation>
      
Collier-Oxandale, A., Wong, N., Navarro, S., Johnston, J., and Hannigan, M.:
Using gas-phase air quality sensors to disentangle potential sources in a Los
Angeles neighborhood, Atmos. Environ., 233, 117519,
<a href="https://doi.org/10.1016/j.atmosenv.2020.117519" target="_blank">https://doi.org/10.1016/j.atmosenv.2020.117519</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>Collier-Oxandale et al.(2019)</label><mixed-citation>
      
Collier-Oxandale, A. M., Thorson, J., Halliday, H., Milford, J., and Hannigan, M.: Understanding the ability of low-cost MOx sensors to quantify ambient VOCs, Atmos. Meas. Tech., 12, 1441–1460, <a href="https://doi.org/10.5194/amt-12-1441-2019" target="_blank">https://doi.org/10.5194/amt-12-1441-2019</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>Commodore et al.(2017)</label><mixed-citation>
      
Commodore, A., Wilson, S., Muhammad, O., Svendsen, E., and Pearce, J.:
Community-based participatory research for the study of air pollution: a
review of motivations, approaches, and outcomes, Environ. Monit.
Assess., 189, 1–30, <a href="https://doi.org/10.1007/s10661-017-6063-7" target="_blank">https://doi.org/10.1007/s10661-017-6063-7</a>, 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>Concas et al.(2021)</label><mixed-citation>
      
Concas, F., Mineraud, J., Lagerspetz, E., Varjonen, S., Liu, X., Puolamäki,
K., Nurmi, P., and Tarkoma, S.: Low-cost outdoor air quality monitoring and
sensor calibration: A survey and critical analysis, ACM Transactions on
Sensor Networks (TOSN), 17, 1–44, <a href="https://doi.org/10.1145/3446005" target="_blank">https://doi.org/10.1145/3446005</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>Considine et al.(2021)</label><mixed-citation>
      
Considine, E. M., Reid, C. E., Ogletree, M. R., and Dye, T.: Improving accuracy
of air pollution exposure measurements: Statistical correction of a municipal
low-cost airborne particulate matter sensor network, Environ. Pollut.,
268, 115833, <a href="https://doi.org/10.1016/j.envpol.2020.115833" target="_blank">https://doi.org/10.1016/j.envpol.2020.115833</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>De Vito et al.(2008)</label><mixed-citation>
      
De Vito, S., Massera, E., Piga, M., Martinotto, L., and Di Francia, G.: On
field calibration of an electronic nose for benzene estimation in an urban
pollution monitoring scenario, Sensors and Actuators B: Chemical, 129,
750–757, <a href="https://doi.org/10.1016/j.snb.2007.09.060" target="_blank">https://doi.org/10.1016/j.snb.2007.09.060</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>Edgerton et al.(1989)</label><mixed-citation>
      
Edgerton, S. A., Holdren, M. W., Smith, D. L., and Shah, J. J.: Inter-urban
comparison of ambient volatile organic compound concentrations in US cities,
JAPCA, 39, 729–732, <a href="https://doi.org/10.1080/08940630.1989.10466561" target="_blank">https://doi.org/10.1080/08940630.1989.10466561</a>, 1989.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>Fanti et al.(2021)</label><mixed-citation>
      
Fanti, G., Borghi, F., Spinazzè, A., Rovelli, S., Campagnolo, D., Keller,
M., Cattaneo, A., Cauda, E., and Cavallo, D. M.: Features and practicability
of the next-generation sensors and monitors for exposure assessment to
airborne pollutants: a systematic review, Sensors, 21, 4513,
<a href="https://doi.org/10.3390/s21134513" target="_blank">https://doi.org/10.3390/s21134513</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>Frischmon and Hannigan</label><mixed-citation>
      
Frischmon, C. and Hannigan, M.: Low-cost sensor data for VOC and BTEX calibration, Mendeley Data, V1 [data set], <a href="https://doi.org/10.17632/fs25d4b64k.1" target="_blank">https://doi.org/10.17632/fs25d4b64k.1</a>, 2026.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>Frischmon et al.(2025a)</label><mixed-citation>
      
Frischmon, C., Crosslin, J., Burks, L., Weckesser, B., Hannigan, M., and
Duderstadt, K.: Detecting air pollution episodes and exploring their impacts
using low-cost sensor data and simultaneous community symptom and odor
reports, Environ. Res. Lett., <a href="https://doi.org/10.1088/1748-9326/adc28a" target="_blank">https://doi.org/10.1088/1748-9326/adc28a</a>,
2025a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>Frischmon et al.(2025b)</label><mixed-citation>
      
Frischmon, C., Silberstein, J., Guth, A., Mattson, E., Porter, J., and Hannigan, M.: Improving the quantification of peak concentrations for air quality sensors via data weighting, Atmos. Meas. Tech., 18, 3147–3159, <a href="https://doi.org/10.5194/amt-18-3147-2025" target="_blank">https://doi.org/10.5194/amt-18-3147-2025</a>, 2025b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>Galar et al.(2011)</label><mixed-citation>
      
Galar, M., Fernandez, A., Barrenechea, E., Bustince, H., and Herrera, F.: A
review on ensembles for the class imbalance problem: bagging-, boosting-, and
hybrid-based approaches, IEEE T. Syst. Man Cy.
C, 42, 463–484,
<a href="https://doi.org/10.1109/TSMCC.2011.2161285" target="_blank">https://doi.org/10.1109/TSMCC.2011.2161285</a>, 2011.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>Griffith(1996)</label><mixed-citation>
      
Griffith, D. W.: Synthetic calibration and quantitative analysis of gas-phase
FT-IR spectra, Appl. Spectrosc., 50, 59–70,
<a href="https://doi.org/10.1366/0003702963906627" target="_blank">https://doi.org/10.1366/0003702963906627</a>, 1996.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>Haagen-Smit et al.(1953)</label><mixed-citation>
      
Haagen-Smit, A. J., Bradley, C., and Fox, M.: Ozone formation in photochemical
oxidation of organic substances, Ind. Eng. Chem., 45,
2086–2089, <a href="https://doi.org/10.5254/1.3543469" target="_blank">https://doi.org/10.5254/1.3543469</a>, 1953.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>Hoerl and Kennard(1970)</label><mixed-citation>
      
Hoerl, A. E. and Kennard, R. W.: Ridge regression: Biased estimation for
nonorthogonal problems, Technometrics, 12, 55–67, <a href="https://doi.org/10.2307/1271436" target="_blank">https://doi.org/10.2307/1271436</a>,
1970.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>Hong et al.(2023)</label><mixed-citation>
      
Hong, G.-H., Le, T.-C., Lin, G.-Y., Cheng, H.-W., Yu, J.-Y., Dejchanchaiwong,
R., Tekasakul, P., and Tsai, C.-J.: Long-term field calibration of low-cost
metal oxide VOC sensor: Meteorological and interference gas effects,
Atmos. Environ., 310, 119955, <a href="https://doi.org/10.1016/j.atmosenv.2023.119955" target="_blank">https://doi.org/10.1016/j.atmosenv.2023.119955</a>,
2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>Johansson et al.(2014)</label><mixed-citation>
      
Johansson, J. K., Mellqvist, J., Samuelsson, J., Offerle, B., Lefer, B.,
Rappenglück, B., Flynn, J., and Yarwood, G.: Emission measurements of
alkenes, alkanes, SO<sub>2</sub>, and NO<sub>2</sub> from stationary sources in Southeast Texas
over a 5 year period using SOF and mobile DOAS, J. Geophys.
Res.-Atmos., 119, 1973–1991, <a href="https://doi.org/10.1002/2013JD020485" target="_blank">https://doi.org/10.1002/2013JD020485</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>Johnson et al.(2018)</label><mixed-citation>
      
Johnson, N. E., Bonczak, B., and Kontokosta, C. E.: Using a gradient boosting
model to improve the performance of low-cost aerosol monitors in a dense,
heterogeneous urban environment, Atmos. Environ., 184, 9–16,
<a href="https://doi.org/10.1016/j.atmosenv.2018.04.019" target="_blank">https://doi.org/10.1016/j.atmosenv.2018.04.019</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>Kampa and Castanas(2008)</label><mixed-citation>
      
Kampa, M. and Castanas, E.: Human health effects of air pollution,
Environ. Pollut., 151, 362–367, <a href="https://doi.org/10.1016/j.envpol.2007.06.012" target="_blank">https://doi.org/10.1016/j.envpol.2007.06.012</a>,
2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>Karagulian et al.(2019)</label><mixed-citation>
      
Karagulian, F., Barbiere, M., Kotsev, A., Spinelle, L., Gerboles, M., Lagler,
F., Redon, N., Crunaire, S., and Borowiak, A.: Review of the performance of
low-cost sensors for air quality monitoring, Atmosphere, 10, 506,
<a href="https://doi.org/10.3390/atmos10090506" target="_blank">https://doi.org/10.3390/atmos10090506</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>Krogh(2008)</label><mixed-citation>
      
Krogh, A.: What are artificial neural networks?, Nat. Biotechnol., 26,
195–197, <a href="https://doi.org/10.1038/nbt1386" target="_blank">https://doi.org/10.1038/nbt1386</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>Kumar and Sahu(2021)</label><mixed-citation>
      
Kumar, V. and Sahu, M.: Evaluation of nine machine learning regression
algorithms for calibration of low-cost PM<sub>2.5</sub> sensor, J. Aerosol
Sci., 157, 105809, <a href="https://doi.org/10.1016/j.jaerosci.2021.105809" target="_blank">https://doi.org/10.1016/j.jaerosci.2021.105809</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>Laaksonen et al.(2008)</label><mixed-citation>
      
Laaksonen, A., Kulmala, M., O'Dowd, C. D., Joutsensaari, J., Vaattovaara, P., Mikkonen, S., Lehtinen, K. E. J., Sogacheva, L., Dal Maso, M., Aalto, P., Petäjä, T., Sogachev, A., Yoon, Y. J., Lihavainen, H., Nilsson, D., Facchini, M. C., Cavalli, F., Fuzzi, S., Hoffmann, T., Arnold, F., Hanke, M., Sellegri, K., Umann, B., Junkermann, W., Coe, H., Allan, J. D., Alfarra, M. R., Worsnop, D. R., Riekkola, M.-L., Hyötyläinen, T., and Viisanen, Y.: The role of VOC oxidation products in continental new particle formation, Atmos. Chem. Phys., 8, 2657–2665, <a href="https://doi.org/10.5194/acp-8-2657-2008" target="_blank">https://doi.org/10.5194/acp-8-2657-2008</a>, 2008.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>Leidinger et al.(2014)</label><mixed-citation>
      
Leidinger, M., Sauerwald, T., Reimringer, W., Ventura, G., and Schütze, A.:
Selective detection of hazardous VOCs for indoor air quality applications
using a virtual gas sensor array, J. Sensor. Sensor Syst., 3,
253–263, <a href="https://doi.org/10.5194/jsss-3-253-2014" target="_blank">https://doi.org/10.5194/jsss-3-253-2014</a>, 2014.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>Levy Zamora et al.(2023)</label><mixed-citation>
      
Levy Zamora, M., Buehler, C., Datta, A., Gentner, D. R., and Koehler, K.: Identifying optimal co-location calibration periods for low-cost sensors, Atmos. Meas. Tech., 16, 169–179, <a href="https://doi.org/10.5194/amt-16-169-2023" target="_blank">https://doi.org/10.5194/amt-16-169-2023</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>Lewis et al.(2016)</label><mixed-citation>
      
Lewis, A. C., Lee, J. D., Edwards, P. M., Shaw, M. D., Evans, M. J., Moller,
S. J., Smith, K. R., Buckley, J. W., Ellis, M., Gillot, S. R., and White, A.:
Evaluating the performance of low cost chemical sensors for air pollution
research, Faraday Discuss., 189, 85–103, <a href="https://doi.org/10.1039/C5FD00201J" target="_blank">https://doi.org/10.1039/C5FD00201J</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>Li et al.(2023)</label><mixed-citation>
      
Li, Z., Ma, Z., Zhang, Z., Zhang, L., Tian, E., Zhang, H., Yang, R., Zhu, D.,
Li, H., Wang, Z., Zhang, Y., Xu, P., Xu, Y., Wwang, D., Wang, G., Kim, M.,
Yuan, Y., Qiao, X., Li, M., Xie, Y., and Jiang, J.: High-density volatile
organic compound monitoring network for identifying pollution sources,
Sci. Total Environ., 855, 158872,
<a href="https://doi.org/10.1016/j.scitotenv.2022.158872" target="_blank">https://doi.org/10.1016/j.scitotenv.2022.158872</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>Liang(2021)</label><mixed-citation>
      
Liang, L.: Calibrating low-cost sensors for ambient air monitoring: Techniques,
trends, and challenges, Environ. Res., 197, 111163,
<a href="https://doi.org/10.1016/j.envres.2021.111163" target="_blank">https://doi.org/10.1016/j.envres.2021.111163</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>Liu et al.(2020)</label><mixed-citation>
      
Liu, X., Jayaratne, R., Thai, P., Kuhn, T., Zing, I., Christensen, B., Lamont,
R., Dunbabin, M., Zhu, S., Gao, J., Wainwright, D., Neale, D., Kan, R.,
Kirkwood, J., and Morawska, L.: Low-cost sensors as an alternative for
long-term air quality monitoring, Environ. Res., 185, 109438,
<a href="https://doi.org/10.1016/j.envres.2020.109438" target="_blank">https://doi.org/10.1016/j.envres.2020.109438</a>, 2020.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>Malings et al.(2019)</label><mixed-citation>
      
Malings, C., Tanzer, R., Hauryliuk, A., Kumar, S. P. N., Zimmerman, N., Kara, L. B., Presto, A. A., and R. Subramanian: Development of a general calibration model and long-term performance evaluation of low-cost sensors for air pollutant gas monitoring, Atmos. Meas. Tech., 12, 903–920, <a href="https://doi.org/10.5194/amt-12-903-2019" target="_blank">https://doi.org/10.5194/amt-12-903-2019</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>Malyan et al.(2024)</label><mixed-citation>
      
Malyan, V., Kumar, V., Moni, M., Sahu, M., Prakash, J., Choudhary, S., Raliya,
R., Chadha, T. S., Fang, J., and Biswas, P.: Assessing the spatial
transferability of calibration models across a low-cost sensors network,
J. Aerosol Sci., 181, 106437,
<a href="https://doi.org/10.1016/j.jaerosci.2024.106437" target="_blank">https://doi.org/10.1016/j.jaerosci.2024.106437</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>Masiol et al.(2018)</label><mixed-citation>
      
Masiol, M., Squizzato, S., Chalupa, D., Rich, D. Q., and Hopke, P. K.:
Evaluation and field calibration of a low-cost ozone monitor at a regulatory
urban monitoring station, Aerosol Air Qual. Res., 18, 2029–2037,
<a href="https://doi.org/10.4209/aaqr.2018.02.0056" target="_blank">https://doi.org/10.4209/aaqr.2018.02.0056</a>, 2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>Mellqvist et al.(2017)</label><mixed-citation>
      
Mellqvist, J., Samuelsson, J., Isoz, O., Brohede, S., Andersson, P., Ericsson,
M., and Johansson, J.: Emission measurements of VOCs, NO<sub>2</sub> and SO<sub>2</sub> from the
refineries in the south coast air basin using solar occultation flux and
other optical remote sensing methods, FluxSense/SCAQMD-2015, <a href="https://www.aqmd.gov/docs/default-source/fenceline_monitoring/project_1/fluxsense_scaqmd2015_project1_finalreport(040717).pdf" target="_blank"/> (last access: 28 April 2026), 2017.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>Natekin and Knoll(2013)</label><mixed-citation>
      
Natekin, A. and Knoll, A.: Gradient boosting machines, a tutorial, Front.
Neurorobot., 7, 21, <a href="https://doi.org/10.3389/fnbot.2013.00021" target="_blank">https://doi.org/10.3389/fnbot.2013.00021</a>, 2013.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>Nowack et al.(2021)</label><mixed-citation>
      
Nowack, P., Konstantinovskiy, L., Gardiner, H., and Cant, J.: Machine learning calibration of low-cost NO<sub>2</sub> and PM<sub>10</sub> sensors: non-linear algorithms and their impact on site transferability, Atmos. Meas. Tech., 14, 5637–5655, <a href="https://doi.org/10.5194/amt-14-5637-2021" target="_blank">https://doi.org/10.5194/amt-14-5637-2021</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>Okorn and Hannigan(2021a)</label><mixed-citation>
      
Okorn, K. and Hannigan, M.: Applications and limitations of quantifying
speciated and source-apportioned vocs with metal oxide sensors, Atmosphere,
12, 1383, <a href="https://doi.org/10.3390/atmos12111383" target="_blank">https://doi.org/10.3390/atmos12111383</a>, 2021a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>Okorn and Hannigan(2021b)</label><mixed-citation>
      
Okorn, K. and Hannigan, M.: Improving Air Pollutant Metal Oxide Sensor
Quantification Practices through: An Exploration of Sensor Signal
Normalization, Multi-Sensor and Universal Calibration Model Generation, and
Physical Factors Such as Co-Location Duration and Sensor Age, Atmosphere, 12,
645, <a href="https://doi.org/10.3390/atmos12050645" target="_blank">https://doi.org/10.3390/atmos12050645</a>, 2021b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>Okorn and Iraci(2024)</label><mixed-citation>
      
Okorn, K. and Iraci, L. T.: An overview of outdoor low-cost gas-phase air quality sensor deployments: current efforts, trends, and limitations, Atmos. Meas. Tech., 17, 6425–6457, <a href="https://doi.org/10.5194/amt-17-6425-2024" target="_blank">https://doi.org/10.5194/amt-17-6425-2024</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>Okorn et al.(2021)</label><mixed-citation>
      
Okorn, K., Jimenez, A., Collier-Oxandale, A., Johnston, J., and Hannigan, M.:
Characterizing methane and total non-methane hydrocarbon levels in Los
Angeles communities with oil and gas facilities using air quality monitors,
Sci. Total Environ., 777, 146194,
<a href="https://doi.org/10.1016/j.scitotenv.2021.146194" target="_blank">https://doi.org/10.1016/j.scitotenv.2021.146194</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>Ou-Yang et al.(2018)</label><mixed-citation>
      
Ou-Yang, C.-F., Liao, W.-C., Chang, C.-C., Hsieh, H.-C., and Wang, J.-L.:
Guided episodic sampling for capturing and characterizing industrial plumes,
Atmos. Environ., 174, 188–193, <a href="https://doi.org/10.1016/j.atmosenv.2017.11.044" target="_blank">https://doi.org/10.1016/j.atmosenv.2017.11.044</a>,
2018.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>Raheja et al.(2022)</label><mixed-citation>
      
Raheja, G., Harper, L., Hoffman, A., Gorby, Y., Freese, L., O’Leary, B.,
Deron, N., Smith, S., Auch, T., Goodwin, M., and Westervelt, D.:
Community-based participatory research for low-cost air pollution monitoring
in the wake of unconventional oil and gas development in the Ohio River
Valley: Empowering impacted residents through community science,
Environ. Res. Lett., 17, 065006, <a href="https://doi.org/10.1088/1748-9326/ac6ad6" target="_blank">https://doi.org/10.1088/1748-9326/ac6ad6</a>,
2022.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>Robin et al.(2021)</label><mixed-citation>
      
Robin, Y., Amann, J., Baur, T., Goodarzi, P., Schultealbert, C., Schneider, T.,
and Schütze, A.: High-performance VOC quantification for IAQ monitoring
using advanced sensor systems and deep learning, Atmosphere, 12, 1487,
<a href="https://doi.org/10.3390/atmos12111487" target="_blank">https://doi.org/10.3390/atmos12111487</a>, 2021.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>Rothman et al.(2005)</label><mixed-citation>
      
Rothman, L. S., Jacquemart, D., Barbe, A., Benner, D. C., Birk, M., Brown, L.,
Carleer, M., Chackerian Jr, C., Chance, K., Coudert, L., Dana, V., Devi, V.,
JM, F., Gamache, R., Goldman, A., Hartmann, J., Jucks, K., Maki, A., Mandin,
J., Massie, S., Orphal, J., Perrin, A., Rinsland, C., Smith, M., Tennyson,
J., Tochenov, R., Toth, R., Vander Auwera, J., Varanasi, P., and Wagner, G.:
The HITRAN 2004 molecular spectroscopic database, J. Quant.
Spectrosc. Ra., 96, 139–204,
<a href="https://doi.org/10.1016/j.jqsrt.2004.10.008" target="_blank">https://doi.org/10.1016/j.jqsrt.2004.10.008</a>, 2005.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>Sá et al.(2023)</label><mixed-citation>
      
Sá, J., Chojer, H., Branco, P., Alvim-Ferraz, M., Martins, F., and Sousa,
S.: Two step calibration method for ozone low-cost sensor: Field experiences
with the UrbanSense DCUs, J. Environ. Manage., 328, 116910,
<a href="https://doi.org/10.1016/j.jenvman.2022.116910" target="_blank">https://doi.org/10.1016/j.jenvman.2022.116910</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib53"><label>Sharpe et al.(2004)</label><mixed-citation>
      
Sharpe, S. W., Johnson, T. J., Sams, R. L., Chu, P. M., Rhoderick, G. C., and
Johnson, P. A.: Gas-phase databases for quantitative infrared spectroscopy,
Appl. Spectrosc., 58, 1452–1461, <a href="https://doi.org/10.1366/0003702042641281" target="_blank">https://doi.org/10.1366/0003702042641281</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib54"><label>Silberstein et al.(2024)</label><mixed-citation>
      
Silberstein, J., Wellbrook, M., and Hannigan, M.: Utilization of a Low-Cost
Sensor Array for Mobile Methane Monitoring, Sensors, 24, 519,
<a href="https://doi.org/10.3390/s24020519" target="_blank">https://doi.org/10.3390/s24020519</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib55"><label>Smola and Schölkopf(2004)</label><mixed-citation>
      
Smola, A. J. and Schölkopf, B.: A tutorial on support vector regression,
Stat. Comput., 14, 199–222,
<a href="https://doi.org/10.1023/b:stco.0000035301.49549.88" target="_blank">https://doi.org/10.1023/b:stco.0000035301.49549.88</a>, 2004.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib56"><label>Song and Ying(2015)</label><mixed-citation>
      
Song, Y.-Y. and Ying, L.: Decision tree methods: applications for
classification and prediction, Shanghai Archives of Psychiatry, 27, 130,
<a href="https://doi.org/10.11919/j.issn.1002-0829.215044" target="_blank">https://doi.org/10.11919/j.issn.1002-0829.215044</a>, 2015.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib57"><label>Spinelle et al.(2017a)</label><mixed-citation>
      
Spinelle, L., Gerboles, M., Kok, G., Persijn, S., and Sauerwald, T.:
Performance evaluation of low-cost BTEX sensors and devices within the
EURAMET key-VOCs project, in: Proceedings, vol. 1, p. 425, MDPI,
<a href="https://doi.org/10.3390/proceedings1040425" target="_blank">https://doi.org/10.3390/proceedings1040425</a>, 2017a.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib58"><label>Spinelle et al.(2017b)</label><mixed-citation>
      
Spinelle, L., Gerboles, M., Kok, G., Persijn, S., and Sauerwald, T.: Review of
portable and low-cost sensors for the ambient air monitoring of benzene and
other volatile organic compounds, Sensors, 17, 1520, <a href="https://doi.org/10.3390/s17071520" target="_blank">https://doi.org/10.3390/s17071520</a>,
2017b.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib59"><label>Srishti et al.(2023)</label><mixed-citation>
      
Srishti, S., Agrawal, P., Kulkarni, P., Gautam, H. C., Kushwaha, M., and
Sreekanth, V.: Multiple PM low-cost sensors, multiple seasons’ data, and
multiple calibration models, Aerosol Air Qual. Res., 23, 220428,
<a href="https://doi.org/10.4209/aaqr.220428" target="_blank">https://doi.org/10.4209/aaqr.220428</a>, 2023.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib60"><label>Tibshirani(1996)</label><mixed-citation>
      
Tibshirani, R.: Regression shrinkage and selection via the lasso: a
retrospective, J. Roy. Stat. Soc. B, 58, 267–288, 1996.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib61"><label>Vikram et al.(2019)</label><mixed-citation>
      
Vikram, S., Collier-Oxandale, A., Ostertag, M. H., Menarini, M., Chermak, C., Dasgupta, S., Rosing, T., Hannigan, M., and Griswold, W. G.: Evaluating and improving the reliability of gas-phase sensor system calibrations across new locations for ambient measurements and personal exposure monitoring, Atmos. Meas. Tech., 12, 4211–4239, <a href="https://doi.org/10.5194/amt-12-4211-2019" target="_blank">https://doi.org/10.5194/amt-12-4211-2019</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib62"><label>Wang(2024)</label><mixed-citation>
      
Wang, Z.: Evaluating the efficacy of machine learning in calibrating low-cost
sensors, Appl. Comput. Eng., 71, 30–38,
<a href="https://doi.org/10.54254/2755-2721/71/20241635" target="_blank">https://doi.org/10.54254/2755-2721/71/20241635</a>, 2024.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib63"><label>Wesolowski and Suchacz(2012)</label><mixed-citation>
      
Wesolowski, M. and Suchacz, B.: Artificial neural networks: theoretical
background and pharmaceutical applications: a review, J. AOAC
Int., 95, 652–668, <a href="https://doi.org/10.5740/jaoacint.sge_wesolowski_ann" target="_blank">https://doi.org/10.5740/jaoacint.sge_wesolowski_ann</a>, 2012.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib64"><label>Yurko et al.(2019)</label><mixed-citation>
      
Yurko, G., Roostaei, J., Dittrich, T., Xu, L., Ewing, M., Zhang, Y., and
Shreve, G.: Real-time sensor response characteristics of 3 commercial metal
oxide sensors for detection of BTEX and chlorinated aliphatic hydrocarbon
organic vapors, Chemosensors, 7, 40, <a href="https://doi.org/10.3390/chemosensors7030040" target="_blank">https://doi.org/10.3390/chemosensors7030040</a>, 2019.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib65"><label>Zhang(2016)</label><mixed-citation>
      
Zhang, Z.: A gentle introduction to artificial neural networks, Ann.
Trans. Med., 4, 370, <a href="https://doi.org/10.21037/atm.2016.06.20" target="_blank">https://doi.org/10.21037/atm.2016.06.20</a>, 2016.

    </mixed-citation></ref-html>
<ref-html id="bib1.bib66"><label>Zimmerman et al.(2018)</label><mixed-citation>
      
Zimmerman, N., Presto, A. A., Kumar, S. P. N., Gu, J., Hauryliuk, A., Robinson, E. S., Robinson, A. L., and R. Subramanian: A machine learning calibration model using random forests to improve sensor performance for lower-cost air quality monitoring, Atmos. Meas. Tech., 11, 291–313, <a href="https://doi.org/10.5194/amt-11-291-2018" target="_blank">https://doi.org/10.5194/amt-11-291-2018</a>, 2018.

    </mixed-citation></ref-html>--></article>
