<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing with OASIS Tables v3.0 20080202//EN" "journalpub-oasis3.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:oasis="http://docs.oasis-open.org/ns/oasis-exchange/table" xml:lang="en" dtd-version="3.0">
  <front>
    <journal-meta><journal-id journal-id-type="publisher">AMT</journal-id><journal-title-group>
    <journal-title>Atmospheric Measurement Techniques</journal-title>
    <abbrev-journal-title abbrev-type="publisher">AMT</abbrev-journal-title><abbrev-journal-title abbrev-type="nlm-ta">Atmos. Meas. Tech.</abbrev-journal-title>
  </journal-title-group><issn pub-type="epub">1867-8548</issn><publisher>
    <publisher-name>Copernicus Publications</publisher-name>
    <publisher-loc>Göttingen, Germany</publisher-loc>
  </publisher></journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5194/amt-11-4929-2018</article-id><title-group><article-title>Evaluation of a hierarchical agglomerative clustering
method applied to WIBS laboratory data for improved discrimination of
biological particles by comparing data preparation techniques</article-title><alt-title>Evaluation of clustering applied to WIBS bioaerosol data</alt-title>
      </title-group><?xmltex \runningtitle{Evaluation of clustering applied to WIBS bioaerosol data}?><?xmltex \runningauthor{N. J. Savage and J. A. Huffman}?>
      <contrib-group>
        <contrib contrib-type="author" corresp="no" rid="aff1 aff2">
          <name><surname>Savage</surname><given-names>Nicole J.</given-names></name>
          
        </contrib>
        <contrib contrib-type="author" corresp="yes" rid="aff1">
          <name><surname>Huffman</surname><given-names>J. Alex</given-names></name>
          <email>alex.huffman@du.edu</email>
        <ext-link>https://orcid.org/0000-0002-5363-9516</ext-link></contrib>
        <aff id="aff1"><label>1</label><institution>University of Denver, Department of Chemistry and Biochemistry, Denver, USA</institution>
        </aff>
        <aff id="aff2"><label>a</label><institution>now at: Aerosol Devices, Inc., Fort Collins, Colorado, USA</institution>
        </aff>
      </contrib-group>
      <author-notes><corresp id="corr1">J. Alex Huffman (alex.huffman@du.edu)</corresp></author-notes><pub-date><day>30</day><month>August</month><year>2018</year></pub-date>
      
      <volume>11</volume>
      <issue>8</issue>
      <fpage>4929</fpage><lpage>4942</lpage>
      <history>
        <date date-type="received"><day>5</day><month>April</month><year>2018</year></date>
           <date date-type="rev-request"><day>7</day><month>May</month><year>2018</year></date>
           <date date-type="rev-recd"><day>13</day><month>August</month><year>2018</year></date>
           <date date-type="accepted"><day>19</day><month>August</month><year>2018</year></date>
      </history>
      <permissions>
        
        
      <license license-type="open-access"><license-p>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this licence, visit <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link></license-p></license></permissions><self-uri xlink:href="https://amt.copernicus.org/articles/11/4929/2018/amt-11-4929-2018.html">This article is available from https://amt.copernicus.org/articles/11/4929/2018/amt-11-4929-2018.html</self-uri><self-uri xlink:href="https://amt.copernicus.org/articles/11/4929/2018/amt-11-4929-2018.pdf">The full text article is available as a PDF file from https://amt.copernicus.org/articles/11/4929/2018/amt-11-4929-2018.pdf</self-uri>
      <abstract>
    <p id="d1e95">Hierarchical agglomerative clustering (HAC) analysis has been successfully
applied to several sets of ambient data (e.g., Crawford et al., 2015;
Robinson et al., 2013) and with respect to standardized particles in the
laboratory environment (Ruske et al., 2017, 2018). Here
we show for the first time a systematic application of HAC to a comprehensive
set of laboratory data collected for many individual particle types using the
wideband integrated bioaerosol sensor (WIBS-4A)
(Savage et al., 2017). The impact of the ratio of
particle concentrations on HAC results was investigated, showing that
clustering quality can vary dramatically as a function of ratio. Six
strategies for particle preprocessing were also compared, concluding that
using raw fluorescence intensity (without normalizing to particle size) and
logarithmically transforming data values (scenario B) consistently produced
the highest-quality results for the particle types analyzed. A total of 23
one-to-one matchups of individual particles types was investigated. Results
showed a cluster misclassification of &lt; 15 % for 12 of 17 numerical
experiments using one biological and one nonbiological particle type each.
Inputting fluorescence data using a baseline <inline-formula><mml:math id="M1" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula> threshold
produced a lower degree of misclassification than when inputting either all particles
(without a fluorescence threshold) or a baseline <inline-formula><mml:math id="M2" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">9</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula> threshold.
Lastly, six numerical simulations of mixtures of four to seven components
were analyzed using HAC. These results show that a range of 12 %–24 % of
fungal clusters was consistently misclassified by inclusion of a mixture of
nonbiological materials, whereas bacteria and diesel soot were each able to
be separated with nearly 100 % efficiency. The study gives significant
support to clustering analysis commonly being applied to data from commercial
ultraviolet laser/light-induced fluorescence (UV-LIF) instruments used for bioaerosol research across the
globe and provides practical tools that will improve clustering results
within scientific studies as a part of diverse research disciplines.</p>
  </abstract>
    </article-meta>
  </front>
<body>
      

<sec id="Ch1.S1" sec-type="intro">
  <title>Introduction</title>
      <p id="d1e129">Particles of biological origin, or bioaerosols, make up a substantial
fraction of atmospheric aerosols and have the potential to influence
environmental processes and to negatively impact human health (Després et
al., 2012; Douwes et al., 2003; Fröhlich-Nowoisky et al., 2016; Shiraiwa
et al., 2017). In order to understand the impact bioaerosols, such as pollen,
fungal spores, and bacteria, play in various systems, it is important to be
able to identify and characterize these biological particles in the
atmosphere. One common method for the detection of bioaerosols is ultraviolet
laser/light-induced fluorescence (UV-LIF) because it can provide particle
detection in near real time and at a high particle size resolution (Fennelly
et al., 2017; Huffman and Santarpia, 2017; Sodeau and O'Connor, 2016). Many
commercial UV-LIF instruments have become available for bioaerosol detection,
but all of these techniques are challenged with the need to differentiate
between small differences in fluorescence properties in order to detect and
quantify biological aerosols. Recently commercialized instruments show an
improved ability to discriminate between particle types, for example by
utilizing multiple excitation sources or other particle data (e.g.,<?pagebreak page4930?> size and
shape). UV-LIF techniques are, however, inherently limited by the broad
nature of fluorescence spectra, and so instruments face a ubiquitous problem
of poor selectivity between particle types. By applying improved data
thresholding and particle classification techniques, particle
characterization can be further improved, but important limitations still
remain (Hernandez et al., 2016; Huffman et al., 2012; Perring et al., 2015;
Savage et al., 2017; Toprak and Schnaiter, 2013; Wright et al., 2014). One
strategy to improve the quality of differentiation between particles types
has been to collect full, resolved emission spectra, each at multiple
excitation wavelengths. This can lead to a high instrumental purchase cost,
and such instruments have not been widely applied or commercialized (Huffman
et al., 2016; Kiselev et al., 2013; Pan et al., 2009b; Ruske et al., 2017;
Swanson and Huffman, 2018). Most commercial UV-LIF instruments for bioaerosol
detection utilize one to two excitation wavelengths and integrate
fluorescence signals into a small number of emission bands. To extend the
improvements in particle classification for these commercial UV-LIF
instruments, a number of multivariate analysis techniques have been applied
to ambient particle analysis. The most common of these techniques include
principal component analysis, factor analysis, and cluster analysis
strategies. Classification algorithms, including several clustering
techniques in particular, have shown successful results in providing unbiased
insights into the classification of bioaerosols (Crawford et al., 2015;
Pinnick et al., 2013; Robinson et al., 2013; Swanson and Huffman, 2018).</p>
      <p id="d1e132">Cluster analysis is a broad class of data mining methods in which data
objects placed in the same group (or cluster) are more similar to one
another than to those objects placed in other groups. Classification
algorithms can be divided into two central models: (1) supervised and (2) unsupervised learning. Both models have associated advantages and
disadvantages. Supervised learning methods allow the “training” of data
and grouping to better reflect the data observations (Eick et al., 2004;
Ruske et al., 2017, 2018). This type of method enhances
(trains) the classification algorithm in that the output groups are
predetermined rather than discovered, as is the case for unsupervised
methods. Supervision requires the user to have appropriate starting
conditions to put into the model, which are often difficult or impossible to
determine. Supervised training methods are also much more time-efficient
compared to unsupervised methods, which is important when analyzing ambient
data sets where particle counts (individual objects) can be greater than
10<inline-formula><mml:math id="M3" display="inline"><mml:msup><mml:mi/><mml:mn mathvariant="normal">6</mml:mn></mml:msup></mml:math></inline-formula> (Ruske et al., 2017). In contrast, unsupervised training
methods present less bias and can adapt to unique situations because the
resultant clusters are based on models that have not been previously
trained. To access some of the advantages of supervised methods, however, it
is important to first apply unsupervised models to wide collections of
laboratory data of known particle types in order to gain insight into how
these models interpret data inputs and to learn how algorithms can best be
trained (Ruske et al., 2017).</p>
      <p id="d1e144">Hierarchical agglomerative clustering (HAC) is an unsupervised learning
method that has been most commonly applied for bioaerosol-related studies
(e.g., Crawford et al., 2015, 2016; Gosselin et al.,
2016; Pan et al., 2009a, 2007; Pinnick et al., 2004, 2013;  Robinson et al., 2013; Ruske et al., 2017, 2018).
Other unsupervised clustering techniques, such as the <inline-formula><mml:math id="M4" display="inline"><mml:mi>k</mml:mi></mml:math></inline-formula>-means clustering
method, have shown poor results when applied to ambient data sets because
the number of clusters used to represent the data are required a priori, and
this information is usually unknown prior to analysis (Ruske et al.,
2017). There are several different HAC methods or linkages including the following: single, complete, average, weighted, Ward's, centroid, and median
(Crawford et al., 2015; Müllner, 2013). Ruske et
al. (2017) compared a variety of HAC linkages and determined that Ward's
linkage had a higher percentage of correctly classifying particles in
comparison to other HAC methods.</p>
      <p id="d1e154">Recently, Savage et al. (2017) published a
comprehensive laboratory study applying the wideband integrated bioaerosol sensor (WIBS-4A) to a large and diverse set of biological and nonbiological
aerosol types. Following on to that work, the study presented here utilizes
those data as inputs to evaluate and challenge the HAC strategy of particle
differentiation using Ward's linkage of unsupervised clustering.
Previous HAC studies have focused primarily on (a) the analysis of simple
particle standards (i.e., fluorescent microbeads) and (b) the clustering of
particles from ambient data sets. There have been relatively few published
attempts to differentiate between biological particles and interfering
particles by clustering methods using controlled laboratory UV-LIF data or
to separate different kinds of biological particles from one another.
Presented here are results of the HAC method applied to data from a
comprehensive WIBS-4A laboratory study showing that clustering can dramatically
improve the removal of nonbiological particle types from data sets if operated
under appropriate conditions.</p>
</sec>
<sec id="Ch1.S2">
  <title>Experimental and computational methods</title>
      <p id="d1e163">The WIBS-4A (Droplet Measurement Techniques, Longmont, CO, USA) is a commonly
used UV-LIF based instrument for the detection and characterization of
biological particles. The instrument collects particles in the size range
0.8–20 <inline-formula><mml:math id="M5" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>m and interrogates them in real time as particles flow
along the path between optical sources. The WIBS collects information
about fluorescence intensity in three channels (FL1, FL2, and FL3), particle
size, and particle asymmetry for each interrogated particle. The bands of
excitation and fluorescence emission are FL1 (<inline-formula><mml:math id="M6" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi mathvariant="normal">ex</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">280</mml:mn></mml:mrow></mml:math></inline-formula>,
<inline-formula><mml:math id="M7" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi mathvariant="normal">em</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">310</mml:mn></mml:mrow></mml:math></inline-formula>–400 nm), FL2 (<inline-formula><mml:math id="M8" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi mathvariant="normal">ex</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">280</mml:mn></mml:mrow></mml:math></inline-formula>,
<inline-formula><mml:math id="M9" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi mathvariant="normal">em</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">420</mml:mn></mml:mrow></mml:math></inline-formula>–650 nm), and FL3 (<inline-formula><mml:math id="M10" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi mathvariant="normal">ex</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">370</mml:mn></mml:mrow></mml:math></inline-formula>,
<inline-formula><mml:math id="M11" display="inline"><mml:mrow><mml:msub><mml:mi mathvariant="italic">λ</mml:mi><mml:mi mathvariant="normal">em</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:mn mathvariant="normal">420</mml:mn></mml:mrow></mml:math></inline-formula>–650 nm). The excitation and emission
wavelengths chosen for each of the three fluorescence channels were designed to
maximize the information gained about key biological fluorophores present in
a broad range of<?pagebreak page4931?> bioparticles (Kaye et al., 2005; Pöhlker et al., 2012).
Early generations of UV-LIF bioaerosol spectrometers were often interpreted
to be able to detect proteins via channels similar to FL1 and products of
active cellular metabolism (i.e., riboflavin and NAD(P)H) via channels
similar to FL3, but these approximations are gross simplifications that
confound a more detailed investigation of particle types. For more information
on the design, operation, and calibration of this instrument, see, e.g., the
papers listed here and references therein: Foot et al. (2008), Healy
et al. (2012a, b), Hernandez et al. (2016), Kaye et al. (2005), Perring et al. (2015), Robinson et al. (2017), Savage et
al. (2017), and Stanley et al. (2011).</p>
      <p id="d1e264">All aerosol materials utilized have been listed previously in Table 2 of
Savage et al. (2017), where an overview of
size and fluorescence properties of particles utilized for this study are
also reported. No additional laboratory experiments were performed here
beyond the results presented previously.</p>
      <p id="d1e267">The fluorescence threshold applied to the differentiation of fluorescent
from nonfluorescent particles is a key step in UV-LIF data analysis.
Traditionally, a fluorescence threshold has been determined as the average
baseline fluorescence intensity measured in each of the three channels
during the forced trigger (FT) mode when no particles are present plus
3 times the standard deviation (<inline-formula><mml:math id="M12" display="inline"><mml:mrow><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> of that measurement (i.e., FT <inline-formula><mml:math id="M13" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> (Gabey et al., 2010).
Savage et al. (2017) also reported that additional particle
discrimination is possible by using FT <inline-formula><mml:math id="M14" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">9</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula> as the threshold. Both
threshold definitions will be discussed here. After choosing a threshold of
minimum fluorescence, the fluorescence characteristics of a particle can be
classified into seven different particle types introduced by Perring et al. (2015) and summarized in Fig. 1
of Savage et al. (2017).</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F1"><caption><p id="d1e308">Schematic diagram showing the data preparation
process resulting in the generated clustering products. Parameters within
the pink box are the focus of this paper.</p></caption>
        <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://amt.copernicus.org/articles/11/4929/2018/amt-11-4929-2018-f01.pdf"/>

      </fig>

<?xmltex \hack{\newpage}?>
</sec>
<sec id="Ch1.S3">
  <title>Clustering strategy</title>
      <p id="d1e325">Hierarchical clustering methods work by grouping objects from the bottom up,
meaning that each object (particle) starts as its own “cluster,” and
clusters are merged together based on similarities until a greatly reduced
number of clusters are presented as a final solution. Ward's method for
clustering is among the most popular approaches for HAC and is the only
method based on a classical sum-of-squares criterion, minimizing the
within-group sum of squares (or variance) (Müllner, 2013). The
WIBS-4A used here for data collection provides five parameters of information
for each individual particle detected (three fluorescence channels, size, and
asymmetry factor (AF)), resulting in five dimensions of data.</p>
      <p id="d1e328">The clustering analysis was performed using the open-source software R
package “fastercluster” (Müllner, 2013) using a Dell Latitude E7450
laptop computer with an Intel<sup>®</sup> Core<sup>™</sup> Processor
(i7-5600U CPU @ 2.60 GHz, 16 GB RAM).</p>
<sec id="Ch1.S3.SS1">
  <title>Data preparation</title>
      <p id="d1e342">Saturation of fluorescence intensity occurs at 2047 analog-to-digital counts
(ADCs) for each of the three FL channels in the WIBS-4A, at which point the
photomultiplier tube (PMT) reaches its upper limit of detection. A study by
Ruske et al. (2017) investigated whether nonfluorescent (in that case,
particles below the FT <inline-formula><mml:math id="M15" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula> fluorescence threshold) and/or
saturating data points included in the clustering analysis hindered the
efficiency of the cluster output. The authors determined that removing both
saturating and nonfluorescent particles before HAC analysis resulted in a
better clustering performance in terms of correctly classifying ambient
particles. The quality of the clustering results is likely to be impacted by
the types of particles involved and the assumptions placed on those. As shown by
Savage et al. (2017), many biological particles
present a large fraction that saturates one or more of the fluorescence
detectors. Conversely, many nonbiological particles present a large
fraction of very weakly fluorescent particles with an intensity below a given
threshold, which are thus classified as nonfluorescent. To limit
the premodification of particle populations before clustering, the only filter
applied before clustering was to remove particles smaller than the lower
particle size detection limit of the WIBS-4A (0.8 <inline-formula><mml:math id="M16" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>m), similar to
Ruske et al. (2017). In contrast, both saturating and nonfluorescent
particles were analyzed and the clustering results will be evaluated. Figure 1 outlines the data preparation process, including the conceptual process of
normalization, clustering, and validation of data, which is explained in
detail below.</p>
</sec>
<sec id="Ch1.S3.SS2">
  <title>Data normalization</title>
      <p id="d1e370">Normalization of the raw data is necessary before executing the clustering
algorithm because data parameters<?pagebreak page4932?> delivered from the instrument are
measured on different respective scales. For example, fluorescent intensity
values range from 0 to 2047 ADCs, size ranges from 0 to <inline-formula><mml:math id="M17" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">20</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M18" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>m, and AF ranges from 0 to 100 arbitrary units.
Crawford et al. (2015) performed an analysis on polystyrene latex spheres
(PSLs) using several different normalization techniques, concluding that
<inline-formula><mml:math id="M19" display="inline"><mml:mi>z</mml:mi></mml:math></inline-formula>-score normalization was the best technique when looking at cluster
performance using Ward's linkage for the separation of PSLs. As a result, we
utilize the <inline-formula><mml:math id="M20" display="inline"><mml:mi>z</mml:mi></mml:math></inline-formula>-score normalization of Ward's linkage HAC for the presented
study. By this type of normalization, the mean value of all data points is
subtracted from each individual data point, and then each data point is
divided by the standard deviation of all points. Standardization using the
<inline-formula><mml:math id="M21" display="inline"><mml:mi>z</mml:mi></mml:math></inline-formula>-score method compares results to a normal (Gaussian) population, and we
have chosen to standardize our variables to a mean of 0 and a variance of 1
so that the output variables would be on comparable scales.</p>
</sec>
<sec id="Ch1.S3.SS3">
  <title>HAC scenarios</title>
      <p id="d1e418">Hierarchical agglomerative clustering performs optimally if all variables
(1) are independent of one another and (2) can be described well by a normal
(Gaussian) distribution (Norusis, 2011). To achieve meaningful results
from the clustering analysis, data values must, therefore, be input into the
clustering algorithm with an understanding of how specific preparatory
conditions can significantly impact results. To investigate optimal input conditions, a total of six clustering scenarios was explored, with conditions
summarized in Table 1. The impact of two separate variables was explored
within these scenarios by varying (i) whether fluorescence intensity was pre-normalized by particle size and (ii) whether the data values were input
after logarithmic transformation to produce a normal distribution.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T1"><caption><p id="d1e424">Six scenarios explored, with varying combinations of
pre-analysis treatment. “Fluorescence normalization” refers to whether
fluorescence intensity values were input to HAC as reported by the
instrument (No) or after normalizing to particle size (Yes). “Variables
logged” refers to whether data values were input as reported (No) or
manipulated to produce a normal distribution by using log(value) (Yes).</p></caption><oasis:table frame="topbot"><?xmltex \begin{scaleboxenv}{.94}[.94]?><oasis:tgroup cols="7">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="left"/>
     <oasis:colspec colnum="5" colname="col5" align="left"/>
     <oasis:colspec colnum="6" colname="col6" align="left"/>
     <oasis:colspec colnum="7" colname="col7" align="left"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1">Parameters</oasis:entry>
         <oasis:entry colname="col2">A</oasis:entry>
         <oasis:entry colname="col3">B</oasis:entry>
         <oasis:entry colname="col4">C</oasis:entry>
         <oasis:entry colname="col5">D</oasis:entry>
         <oasis:entry colname="col6">E</oasis:entry>
         <oasis:entry colname="col7">F</oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">Fluorescence</oasis:entry>
         <oasis:entry colname="col2">No</oasis:entry>
         <oasis:entry colname="col3">No</oasis:entry>
         <oasis:entry colname="col4">Yes</oasis:entry>
         <oasis:entry colname="col5">Yes</oasis:entry>
         <oasis:entry colname="col6">Yes</oasis:entry>
         <oasis:entry colname="col7">No</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">normalization</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Variables</oasis:entry>
         <oasis:entry colname="col2">No</oasis:entry>
         <oasis:entry colname="col3">Yes</oasis:entry>
         <oasis:entry colname="col4">No</oasis:entry>
         <oasis:entry colname="col5">Yes</oasis:entry>
         <oasis:entry colname="col6">Yes,</oasis:entry>
         <oasis:entry colname="col7">Yes,</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">logged</oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">only</oasis:entry>
         <oasis:entry colname="col7">only</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">AF/size</oasis:entry>
         <oasis:entry colname="col7">AF/size</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">variables</oasis:entry>
         <oasis:entry colname="col7">variables</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup><?xmltex \end{scaleboxenv}?></oasis:table></table-wrap>

      <p id="d1e609">Ambient particle number vs. size distributions can often be well approximated
by lognormal distributions, although specific groups of particles, including
some bacteria, spores, and pollen, may not always exhibit a lognormal
distribution. Further, fluorescence intensity has been shown to scale with
particle size (e.g., Hill et al., 2001; Sivaprakasam et al., 2011).
Several previous studies attempted to utilize HAC for ambient
lognormally distributed particle size data (Crawford et al., 2014, 2015; Robinson et al., 2013) but applied the assumption
that particle fluorescence is normally distributed in a group of particles.
If this assumption is not correct, however, weakly fluorescing
particles are likely to be grouped into a single cluster based on the high
abundance of these particles (Robinson et al., 2013).
Scenarios C, D, and E (Table 1) utilize data input to the clustering
algorithm after fluorescence intensity was normalized to particle size (by
dividing fluorescence intensity value by light scattering signal when a
particle interacts with the diode laser beam) in order to explore the assumption that laboratory data should be treated like previously
explored ambient data sets and not logged. Scenarios B and D take into
account the logging of all parameters, producing normal distributions of all
variables (AF, particle size, three channels of fluorescence). By this process,
data values were input into the algorithm as a log(value) without separately
binning the points. For comparison, scenarios E and F explore log-spaced
distributions of size and AF, while retaining the assumption that the
fluorescence output is normally distributed. Scenario A data are neither
logged nor normalized. For comparison, scenario F represents the input
conditions that have been used frequently (e.g., Crawford et al., 2015;
Ruske et al., 2017).</p>
</sec>
<sec id="Ch1.S3.SS4">
  <title>Cluster validation</title>
      <p id="d1e618">An important feature of HAC is that it provides clusters in an unsupervised
manner, and the user must determine the number of clusters that makes
physical sense. One useful tool to systematically determine the optimal
number of final clusters is the Calinski–Harabasz (CH) index, which uses the
interclass–intraclass distance ratio (Liu et al., 2010). For each
clustering output the CH index was calculated for cluster solutions with 1 through 10 clusters, and the solution with the highest CH value was
generally determined to be the optimal number of clusters. Figure 2 shows an
example CH value versus cluster number plot for a mixture of <italic>Aspergillus niger</italic> fungal spores mixed
with diesel soot particles. The curve suggests the optimal result to be a
two-cluster solution for this trial, as was generally the case for
investigations where two particle types were mixed before clustering. In
order to reduce the length and complexity of discussion, the analysis of results
in Sects. 4.1–4.3 was limited to using cluster products only from the
two-cluster solution. In some cases, a three-cluster solution may have produced
higher-quality results, but these cases were not investigated.</p><?xmltex \hack{\newpage}?>
</sec>
</sec>
<?pagebreak page4933?><sec id="Ch1.S4">
  <title>Results and discussion</title>
      <p id="d1e632">The analysis of clustering quality was performed systematically and with
increasing complexity. Section 4.1 utilizes three pairs of particles types
to explore the effect of particle ratio and normalization strategies on
cluster performance. Using conclusions from this section, Sect. 4.2 then
expands the exploration to 20 additional pairs of particle types. Section 4.3 explores the effect of three different fluorescence thresholding
strategies on cluster output. Finally, Sect. 4.4 investigates the ability
of HAC analysis to separate particle types from mixed populations of
particle types.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F2"><caption><p id="d1e637">Example of a Calinski–Harabasz index plot for the cluster
experiment with input from <italic>Aspergillus niger</italic> and diesel soot (<inline-formula><mml:math id="M22" display="inline"><mml:mrow><mml:mn mathvariant="normal">50</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula> ratio). The optimal number of
clusters is determined by the highest CH value. For Sects. 4.1–4.3 only
two-cluster solutions were analyzed.</p></caption>
        <?xmltex \igopts{width=184.942913pt}?><graphic xlink:href="https://amt.copernicus.org/articles/11/4929/2018/amt-11-4929-2018-f02.pdf"/>

      </fig>

<sec id="Ch1.S4.SS1">
  <title>Investigating pre-normalization scenarios and particle input ratio</title>
      <p id="d1e666">To explore the ability to separate two distinct populations of particles
from one another, three different clustering trials are presented in this
section as one-to-one matchups: (1) <italic>Aspergillus niger </italic> (fungal spores, F2) vs.
standard diesel
soot (S4), (2) <italic>Pseudomonas stutzeri </italic> (bacteria, B3) vs. standard diesel soot (S4), and
(3) <italic>Aspergillus niger </italic> (fungal spores, F2) vs. California sand (mineral dust, D12). These four
particle materials were chosen to represent key classes of coarse particles
observed in ambient air. For each trial, a subset of particles from each
material type was selected randomly for HAC analysis. The clustering process
includes (i) the evaluation of cluster performance based on particle assignment
and cluster composition and (ii) the visual representations of cluster outputs
using the particle type classification introduced by Perring et al. (2015). For each of these three trials, the clustering process
was run separately using each of the six scenarios A–F described in Table 1.
Additionally, while exploring the optimal data preprocessing scenario, the
influence that different concentration ratios of particle types could play
in the clustering output was also explored. The cluster process for each
trial was performed using four different ratios of particles in each
particle set including situations with an equal ratio and where the
concentration of each particle type was significantly mismatched. In total,
this section represents 57 individual clustering experiments (3 trials <inline-formula><mml:math id="M23" display="inline"><mml:mrow><mml:mo>×</mml:mo><mml:mn mathvariant="normal">6</mml:mn></mml:mrow></mml:math></inline-formula> scenarios <inline-formula><mml:math id="M24" display="inline"><mml:mrow><mml:mo>×</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> particle ratios <inline-formula><mml:math id="M25" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> additional ratio trials) exploring three
independent input variables. The results will be utilized to explore many
more individual particle type matchups in the following sections.</p>
      <p id="d1e709">The first two trials include diesel soot particles because light-absorbing
carbon aerosol is commonly observed in aerosol samples with anthropogenic
influence (Bond et al., 2013) and because it can have fluorescence
characteristics difficult to distinguish from small biological particles
(e.g., Huffman et al., 2010; Pan et al., 2012; Savage et al., 2017; Yu et
al., 2016). For example, when excited by photons with a wavelength of 280 nm, diesel soot can be misinterpreted as single bacterial cells using the
WIBS, and so we explored here whether the two particle types could be
clustered separately (Pöhlker et al., 2012). The three trials
include two examples of biological particles, both exhibiting fluorescent
properties but with different excitation–emission characteristics and with a different average particle size.</p>
      <p id="d1e712">The output of the algorithm reports the particle type from which each
particle was input in order to evaluate the accuracy of the clustering. The
resulting output of each particle with an assigned cluster number is then
compared to the originating particle type to determine classification
accuracy. Figure 3 summarizes the relative accuracy of individual clustering
experiments by representing the percent of particles misclassified with
respect to known input identities (blue bar corresponding to correct
classification, red bar and overlaid value corresponding to incorrect
classification). The clustering process was generally effective for
separating particles correctly when two particle types were considered, but
results vary widely across the six scenarios. Several previous studies that
used HAC to separate particles within an ambient data set assumed that
particle fluorescence is already normally distributed (Crawford et al.,
2014, 2015; Robinson et al., 2013). As a result, these
previous studies did not normalize fluorescence data and thus used data
preparation scenario F in their clustering analysis. For comparison,
scenarios B and D were explored to test whether the clustering efficiency
would be improved or hindered by fluorescence normalization. Scenarios A and
F produced inconsistent results, with some experiments (i.e., a <inline-formula><mml:math id="M26" display="inline"><mml:mrow><mml:mn mathvariant="normal">50</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula> ratio of
fungal spores : diesel) producing a misclassification &lt; 1.1 %, whereas
other experiments (i.e., a <inline-formula><mml:math id="M27" display="inline"><mml:mrow><mml:mn mathvariant="normal">20</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">80</mml:mn></mml:mrow></mml:math></inline-formula> ratio of bacterial : diesel) produced a misclassification of up to 80 %. In contrast, scenarios B and D produced
consistently more accurate results. Scenario B, in particular, consistently
exhibited the most accurate classification of particles for almost every
individual experiment. No experiment involving scenario B produced a greater
than 9 % misclassification of particles, regardless of the particle input
ratio, and most experiments produced results with 0.1 %–3 % error.<?pagebreak page4934?> These
observations taken together suggest that particle fluorescence properties
may not be well described by normal distributions and that normalizing
fluorescence data prior to analysis may be more effective.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F3"><caption><p id="d1e741">Cluster misclassification shown for three
computational combinations of fungal spores (F2), bacteria (B3), diesel soot
(S4), and mineral dust (D12). Each combination explored with respect to the ratio of input particle number using scenario B and a two-cluster solution
for each experiment. Scenario letters A–F refers to the scenarios summarized in
Table 1. Red shaded regions (and values) indicate the percent of particles
misclassified. Blue shaded regions represent the percentage of particles
correctly classified.</p></caption>
          <?xmltex \igopts{width=236.157874pt}?><graphic xlink:href="https://amt.copernicus.org/articles/11/4929/2018/amt-11-4929-2018-f03.pdf"/>

        </fig>

      <?xmltex \floatpos{t}?><fig id="Ch1.F4" specific-use="star"><caption><p id="d1e753">Particle type stacked-category size distributions for
input and output clustering results, using FT <inline-formula><mml:math id="M28" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula> threshold
definition. Each experiment (row) shows matchups of two particle types
computationally mixed using <inline-formula><mml:math id="M29" display="inline"><mml:mrow><mml:mn mathvariant="normal">50</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula> ratios, scenario B, and two-cluster
solutions. Panels <bold>(a)</bold>–<bold>(f)</bold> show the properties of input particles; <bold>(g)</bold>–<bold>(l)</bold> show the properties of cluster outputs.</p></caption>
          <?xmltex \igopts{width=398.338583pt}?><graphic xlink:href="https://amt.copernicus.org/articles/11/4929/2018/amt-11-4929-2018-f04.pdf"/>

        </fig>

      <p id="d1e799">The results of these experiments also highlight how important the ratio of
input particles can be. While scenario B was relatively consistent, varying
only between 0.1 % and 3.8 % error for different ratios of the fungal spore
versus diesel matchup, other experiments depended strongly on particle
ratio. It is clear that the input ratio of particle types cannot be
controlled during an ambient study, and so these results suggest that it is
important to keep the possibility of varying concentration ratios in mind
when interpreting time- or air-mass-associated changes in cluster
composition or when relaying the relative confidence in clustering results.
For the remainder of the discussion, experiments will be limited to a <inline-formula><mml:math id="M30" display="inline"><mml:mrow><mml:mn mathvariant="normal">50</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula>
ratio following scenario B. In each case the input particles are a random
subset taken from the pool of particles in the experimental data. As a
result, individual samples selected from the same experiments (i.e., Fig. 4a, e) can show slightly different average properties. In some cases (i.e.,
diesel soot; Fig. 4d) the number of particles originally analyzed was small, and so to keep the input particle ratio at <inline-formula><mml:math id="M31" display="inline"><mml:mrow><mml:mn mathvariant="normal">50</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula>, the corresponding particle
type was also limited to small numbers.</p>
      <p id="d1e826">To extend the investigation of the particle input ratio, the three matchups
presented in Fig. 3 were investigated using scenario B with 1 %
bioparticles and 99 % non-bioparticles in each case. In these
experiments the bacteria : diesel soot and fungal spores : dust particles
separated relatively well (6.6 % and 13.5 % misclassification,
respectively). The fungal spores : diesel soot separation was poor, however,
because the diesel soot particles were nearly evenly split into both
clusters, and the fungal spore particles were too low in concentration to
influence the cluster properties. More investigation is needed to explore
how extreme disparities in particle ratio could negatively influence cluster
quality in real-world settings.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T2" specific-use="star"><caption><p id="d1e832">Misclassification of two-cluster solutions for 23 matchups of two
individual particle types (equal ratio of particle number, B
scenario, no fluorescence threshold applied) computationally combined
before clustering analysis. Misclassification calculated as the sum
percentage of particles misclassified in each cluster divided by the total
number of particles. Three biological particle types (F2, B3, P9) compared
separately to <bold>(a)</bold> nonbiological particle materials and
<bold>(b)</bold> biological particle materials. Particle number input was a
subset of the total population of particles experimentally analyzed. Bold
values show a misclassification &gt; 15 %.</p></caption><oasis:table frame="topbot"><?xmltex \begin{scaleboxenv}{.90}[.90]?><oasis:tgroup cols="10">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="right"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:colspec colnum="9" colname="col9" align="right"/>
     <oasis:colspec colnum="10" colname="col10" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"><bold>(a)</bold></oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry namest="col3" nameend="col10" align="center">Nonbiological particle materials </oasis:entry>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">Diesel</oasis:entry>
         <oasis:entry colname="col4">California</oasis:entry>
         <oasis:entry colname="col5">Arizona</oasis:entry>
         <oasis:entry colname="col6">Suwannee</oasis:entry>
         <oasis:entry colname="col7">Methyl-</oasis:entry>
         <oasis:entry colname="col8">Glyoxal</oasis:entry>
         <oasis:entry colname="col9">White</oasis:entry>
         <oasis:entry colname="col10">Wood</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">soot</oasis:entry>
         <oasis:entry colname="col4">sand</oasis:entry>
         <oasis:entry colname="col5">Test Dust</oasis:entry>
         <oasis:entry colname="col6">River</oasis:entry>
         <oasis:entry colname="col7">glyoxal <inline-formula><mml:math id="M32" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col8"><inline-formula><mml:math id="M33" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> amm.</oasis:entry>
         <oasis:entry colname="col9">t-shirt</oasis:entry>
         <oasis:entry colname="col10">smoke</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">(Soot 4)</oasis:entry>
         <oasis:entry colname="col4">(Dust 2)</oasis:entry>
         <oasis:entry colname="col5">(Dust 12)</oasis:entry>
         <oasis:entry colname="col6">Humic</oasis:entry>
         <oasis:entry colname="col7">glycine</oasis:entry>
         <oasis:entry colname="col8">sulfate</oasis:entry>
         <oasis:entry colname="col9">(Misc. 2)</oasis:entry>
         <oasis:entry colname="col10">(Soot 6)</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">Acid</oasis:entry>
         <oasis:entry colname="col7">aerosol</oasis:entry>
         <oasis:entry colname="col8">aerosol</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">(HULIS 2)</oasis:entry>
         <oasis:entry colname="col7">(Brown</oasis:entry>
         <oasis:entry colname="col8">(Brown</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7">carbon 1)</oasis:entry>
         <oasis:entry colname="col8">carbon 3)</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">S4</oasis:entry>
         <oasis:entry colname="col4">D2</oasis:entry>
         <oasis:entry colname="col5">D12</oasis:entry>
         <oasis:entry colname="col6">H2</oasis:entry>
         <oasis:entry colname="col7">BC1</oasis:entry>
         <oasis:entry colname="col8">BC3</oasis:entry>
         <oasis:entry colname="col9">WT</oasis:entry>
         <oasis:entry colname="col10">WS</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><italic>Aspergillus niger</italic></oasis:entry>
         <oasis:entry colname="col3">(1)</oasis:entry>
         <oasis:entry colname="col4">(3)</oasis:entry>
         <oasis:entry colname="col5">(4)</oasis:entry>
         <oasis:entry colname="col6">(5)</oasis:entry>
         <oasis:entry colname="col7">(6)</oasis:entry>
         <oasis:entry colname="col8">(7)</oasis:entry>
         <oasis:entry colname="col9">(8)</oasis:entry>
         <oasis:entry colname="col10">(9)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(Fungi 2)</oasis:entry>
         <oasis:entry colname="col3"><italic>0.1 %</italic></oasis:entry>
         <oasis:entry colname="col4"><italic>2.6 %</italic></oasis:entry>
         <oasis:entry colname="col5">6.1 %</oasis:entry>
         <oasis:entry colname="col6">4.8 %</oasis:entry>
         <oasis:entry colname="col7">2.5 %</oasis:entry>
         <oasis:entry colname="col8"><bold>23.0</bold> %</oasis:entry>
         <oasis:entry colname="col9"><bold>40.5</bold> %</oasis:entry>
         <oasis:entry colname="col10">7.2 %</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><italic>P. stutzeri</italic></oasis:entry>
         <oasis:entry colname="col3">(2)</oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M34" display="inline"><mml:mspace linebreak="nobreak" width="0.25em"/></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5">(10)</oasis:entry>
         <oasis:entry colname="col6">(11)</oasis:entry>
         <oasis:entry colname="col7">(12)</oasis:entry>
         <oasis:entry colname="col8">(13)</oasis:entry>
         <oasis:entry colname="col9">(14)</oasis:entry>
         <oasis:entry colname="col10">(15)</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(Bacteria 3)</oasis:entry>
         <oasis:entry colname="col3"><italic>1.2 %</italic></oasis:entry>
         <oasis:entry colname="col4"><inline-formula><mml:math id="M35" display="inline"><mml:mspace linebreak="nobreak" width="0.25em"/></mml:math></inline-formula></oasis:entry>
         <oasis:entry colname="col5">1.9 %</oasis:entry>
         <oasis:entry colname="col6">1.2 %</oasis:entry>
         <oasis:entry colname="col7">1.3 %</oasis:entry>
         <oasis:entry colname="col8">6.1 %</oasis:entry>
         <oasis:entry colname="col9"><bold>41.7</bold> %</oasis:entry>
         <oasis:entry colname="col10">4.7 %</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><italic>Phleum pratense</italic></oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">(16)</oasis:entry>
         <oasis:entry colname="col6">(17)</oasis:entry>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8"/>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(Pollen 9)</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"><bold>22.7</bold> %</oasis:entry>
         <oasis:entry colname="col6"><bold>23.2</bold> %</oasis:entry>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8"/>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"><bold>(b)</bold></oasis:entry>
         <oasis:entry colname="col2"/>
         <oasis:entry namest="col3" nameend="col7" align="center">Biological particle materials </oasis:entry>
         <oasis:entry colname="col8"/>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"><italic>S. cerevisiae</italic></oasis:entry>
         <oasis:entry colname="col4"><italic>Phleum pratense</italic></oasis:entry>
         <oasis:entry colname="col5"><italic>P. stutzeri</italic></oasis:entry>
         <oasis:entry colname="col6"><italic>Taxus baccata</italic></oasis:entry>
         <oasis:entry colname="col7"><italic>B. atrophaeus</italic></oasis:entry>
         <oasis:entry colname="col8"/>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">(Fungi 4)</oasis:entry>
         <oasis:entry colname="col4">(Pollen 9)</oasis:entry>
         <oasis:entry colname="col5">(Bacteria 3)</oasis:entry>
         <oasis:entry colname="col6">(Pollen 5)</oasis:entry>
         <oasis:entry colname="col7">(Bacteria 1)</oasis:entry>
         <oasis:entry colname="col8"/>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3">F4</oasis:entry>
         <oasis:entry colname="col4">P9</oasis:entry>
         <oasis:entry colname="col5">B3</oasis:entry>
         <oasis:entry colname="col6">P5</oasis:entry>
         <oasis:entry colname="col7">B1</oasis:entry>
         <oasis:entry colname="col8"/>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><italic>Aspergillus niger</italic></oasis:entry>
         <oasis:entry colname="col3">(18)</oasis:entry>
         <oasis:entry colname="col4">(19)</oasis:entry>
         <oasis:entry colname="col5">(20)</oasis:entry>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8"/>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(Fungi 2)</oasis:entry>
         <oasis:entry colname="col3"><bold>27.9</bold> %</oasis:entry>
         <oasis:entry colname="col4"><bold>36.4</bold> %</oasis:entry>
         <oasis:entry colname="col5">10.3 %</oasis:entry>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8"/>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><italic>P. stutzeri</italic></oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4">(21)</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7">(22)</oasis:entry>
         <oasis:entry colname="col8"/>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(Bacteria 3)</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"><bold>18.3</bold> %</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7"><bold>65.4</bold> %</oasis:entry>
         <oasis:entry colname="col8"/>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"><italic>Phelum pratense</italic></oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">(23)</oasis:entry>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8"/>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2">(Pollen 9)</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"><bold>46.8</bold> %</oasis:entry>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8"/>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup><?xmltex \end{scaleboxenv}?></oasis:table></table-wrap>

      <p id="d1e1605">An important tool readily applied to the analysis of ambient data is the
categorization of particles into eight fluorescent particle types (Perring et
al., 2015). Thus, to further investigate the quality of cluster accuracy,
Fig. 4 shows inputs and cluster outputs from three clustering experiments
stacked as a function of fluorescence particle type and particle size.
Figure 4a, b, g, and h show the input data for <italic>Aspergillus niger </italic>
and diesel soot (Fig. 4a–b) paired with the outputs of the two-cluster
solution (Fig. 4g–h). It can be seen that both particle materials have
predominantly particle type A characteristics, meaning that they are
fluorescent only in channel FL1. The fungal material also presents roughly
one-third of AB (green) and a small minority of nonfluorescent (gray)
characteristics. The size distribution of the fungal spores peaks at <inline-formula><mml:math id="M36" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">3</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M37" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>m, whereas diesel soot peaks at <inline-formula><mml:math id="M38" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">1</mml:mn></mml:mrow></mml:math></inline-formula> <inline-formula><mml:math id="M39" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>m in size.
While not shown in this plot style, the spores exhibit moderately higher FL1
channel fluorescence, with a median of 543 ADCs, whereas diesel soot exhibits
a median of 751 ADCs in this channel (see Savage et al., 2017; Table 2). Both
particle types show almost no fluorescent characteristics in either FL2 or
FL3. In summary, the particle distributions are relatively similar in
fluorescence particle type and their differences are largely related to
particle size, so separation of these particles through Trial 1 was
hypothesized to represent a relatively challenging initial exercise. The
clustering outputs presented in Fig. 4g–h, however, visually highlight the
conclusion represented by Fig. 3, which is that the particles in this trial
separated very well. Cluster 1 was comprised predominantly of fungal
particles and presented fluorescence and size traits qualitatively similar to
the input fungal particles, whereas cluster 2 was comprised predominantly of
diesel soot particles.</p>
      <p id="d1e1646">Results from the <inline-formula><mml:math id="M40" display="inline"><mml:mrow><mml:mn mathvariant="normal">50</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula> ratio of the scenario B experiments for the other
two trials are also shown in Fig. 4c, d, i, and j and Fig. 4e, f, k, and l.
In each case, the qualitative properties of the input particles are extremely
well represented by the corresponding output cluster, corroborating the
conclusion from Fig. 3 that the scenario B cases accurately separated the
particle groups investigated through these experiments. It is also important
to note here that the method of aerosolization for each particle type plays
an important role in the observed size distribution, and so results involving
laboratory particles should be interpreted with this in mind. Observed
fluorescence properties, in contrast, are expected to be conserved at a given
particle size and are intrinsically
related to particle composition.</p>
</sec>
<?pagebreak page4936?><sec id="Ch1.S4.SS2">
  <title>Investigating cluster quality without fluorescence threshold</title>
      <p id="d1e1667">After concluding that scenario B exhibited the most consistently accurate
clustering results using two-cluster solutions from mixtures comprised of two
particle type inputs, the analysis was expanded to include a broader range of
particle types. Using <inline-formula><mml:math id="M41" display="inline"><mml:mrow><mml:mn mathvariant="normal">50</mml:mn><mml:mo>:</mml:mo><mml:mn mathvariant="normal">50</mml:mn></mml:mrow></mml:math></inline-formula> ratios of two types of input particles,
prepared using scenario B (leaving fluorescence data un-normalized and logarithmically transforming data vaules), 20 new individual experiments were
performed. The results of all 23 experiments (3 from Sect. 4.1 and 20
introduced in Sect. 4.2) are summarized in Table 2 as the percentage of
particle misclassification. These trials were chosen to represent a broad
range of individual matchups that might be expected in ambient air. Of the
original 69 types of particles analyzed by Savage et al. (2017), 14 were used
in experiments here: 8 types of nonbiological particles and 6 types of
biological particles (2 each of fungal spores, bacteria, and pollen species).
Supplement Fig. S4 from Savage et al. (2017) shows size distributions stacked
by fluorescence particle type for each of the particle species discussed.</p>
      <p id="d1e1682">Table 2a organizes clustering results into three rows, showing the
misclassification of F2 (<italic>Aspergillus niger</italic> fungal
spores), B3 (<italic>Pseudomonas stutzeri</italic> bacteria), and P9 (<italic>Phelum pratense</italic> pollen) particles with
respect to a variety of other particle types represented by table column. Of
the 15 cluster experiments between fungal spores or bacteria and nonbiological material, only 3 showed a
misclassification greater than 7.5 % (bold text) and 7 were less than
3 %. The three outliers were experiment 7 (F2 vs. BC3;
glyoxal <inline-formula><mml:math id="M42" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> ammonium sulfate brown carbon aerosol), 8 (F2 vs. WT; white
t-shirt particles), and 14 (B3 vs. WT). Looking first at experiment 7, F2
particles show A-type fluorescence characteristics and are dominated by a
mode between 1.5 and 4 <inline-formula><mml:math id="M43" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>m. BC3 particles are primarily
nonfluorescent &lt; 1.5 <inline-formula><mml:math id="M44" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>m but are primarily A-type between
1.5 and 3 <inline-formula><mml:math id="M45" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>m, suggesting similar size and fluorescence properties.
The white t-shirt particles separated poorly (<inline-formula><mml:math id="M46" display="inline"><mml:mrow><mml:mo>∼</mml:mo><mml:mn mathvariant="normal">41</mml:mn></mml:mrow></mml:math></inline-formula> %
misclassification) from both the fungal spore and bacterial particles. All
three particle types (WT, F2, and B3) exhibit medium fluorescent intensity in
the FL1 channel. The poor ability to separate WT from both F2 and B3 was
surprising, however, given that WT exhibited significantly higher mean
fluorescence in each of the FL2 and FL3 channels. As first mentioned by
Savage et al. (2017), great care should be taken when interpreting
fluorescent particle results from indoor environments where increased
concentrations of bleached fibers from clothing, bedding, paper, and cleaning
products may be present.</p>
      <p id="d1e1733">While the results show that the fungal spores and bacterial particles investigated could generally be well
separated from most potentially interfering nonbiological species, the
results were much less successful for differentiation from pollen. P9 pollen
particles separated poorly in all experiments (versus D12, H2, or P5), with a
rate of misclassification ranging from 22 to 47 %. It is important to
keep in mind, however, that the WIBS was operated using a standard gain
setting that limits analysis of particle size to below approximately
20 <inline-formula><mml:math id="M47" display="inline"><mml:mi mathvariant="normal">µ</mml:mi></mml:math></inline-formula>m. As a result, the WIBS is insensitive to whole pollen grains,
and so most of the particles observed during pollen experiments are small
pollen fragments. Any intact pollen grains that navigate the flow system to
be detected are likely to be binned together in the channel representing the
largest particles. Clustering results including pollen should be interpreted
accordingly. Pollen grains can fragment in ambient air as a function of
increased relative humidity (Miguel et al., 2006; Suphioglu et al., 1992;
Taylor et al., 2004), but the relative ratio of whole / fragmented
particles is hard to predict under ambient conditions. Smaller fragments can
also exhibit different fluorescent properties to whole grains (Pöhlker et
al., 2013). O'Connor et al. (2014) operated a WIBS-4 (Univ. Hertfordshire) at
a lower gain in order to improve the pollen detection efficiency, but these
results are not explored directly here.</p>
      <p id="d1e1743">The WIBS instrument is frequently used to differentiate between airborne
biological particles and material of nonbiological origin. A secondary goal
of differentiating more finely between types of biological aerosols is also
frequently pursued. To investigate this goal, six additional experiments
were conducted by pairing two different types of nonbiological particles
(Table 2b). In contrast to the results shown in Table 2a, the clustering
algorithm showed a generally poor ability to separate between two biological
particle types. Only one of the six experiments resulted in an error &lt; 15 % (F2 vs. B3, 10.3 % error), whereas error for the other five
experiments ranged from 18 % to 65 %. The worst accuracy was
demonstrated by experiment 22 (B1 vs. B3) and experiment 23 (P5 vs. P9). Both
of these experiments attempted to separate between different species of a
single particle type (i.e., between two bacteria or two pollen). Overall, these results suggest that the clustering strategy
may be quite useful at aiding the differentiation of biological material
from nonbiological material but that separating more finely to quantify
differences between types of individual biological particles is
significantly more challenging and not likely to be possible in most
situations.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T3" specific-use="star"><caption><p id="d1e1750">Further exploration of two-cluster solutions for the 10
matchups of two individual particle types shown in Table 2 with
a misclassification &gt; 15 %. Each matchup is shown using three
separate fluorescence threshold strategies in advance of particle input into
the cluster algorithm: (I) all particles included (no fluorescence threshold),
(II) particles with fluorescence intensity &lt; FT <inline-formula><mml:math id="M48" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula>
removed, and (III) particles with fluorescence intensity &lt; FT <inline-formula><mml:math id="M49" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">9</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula> removed. <bold>(a)</bold> Particle misclassification. <bold>(b)</bold> Total particle
number used for clustering experiment.</p></caption><oasis:table frame="topbot"><oasis:tgroup cols="8">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="left"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:thead>
       <oasis:row rowsep="1">

         <oasis:entry colname="col1"><bold>(a)</bold></oasis:entry>

         <oasis:entry colname="col2"/>

         <oasis:entry colname="col3"/>

         <oasis:entry colname="col4"/>

         <oasis:entry colname="col5"/>

         <oasis:entry colname="col6"/>

         <oasis:entry colname="col7"/>

         <oasis:entry colname="col8"/>

       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>

         <?xmltex \rotentry?><oasis:entry rowsep="1" colname="col1" morerows="9">Percent misclassified</oasis:entry>

         <?xmltex \rotentry?><oasis:entry rowsep="1" colname="col2" morerows="4">Bio <inline-formula><mml:math id="M50" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> Nonbio</oasis:entry>

         <oasis:entry colname="col3">Input</oasis:entry>

         <oasis:entry colname="col4">(7)</oasis:entry>

         <oasis:entry colname="col5">(8)</oasis:entry>

         <oasis:entry colname="col6">(14)</oasis:entry>

         <oasis:entry colname="col7">(16)</oasis:entry>

         <oasis:entry colname="col8">(17)</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col3">F2 <inline-formula><mml:math id="M51" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> BC3</oasis:entry>

         <oasis:entry colname="col4">F2 <inline-formula><mml:math id="M52" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> WT</oasis:entry>

         <oasis:entry colname="col5">B3 <inline-formula><mml:math id="M53" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> WT</oasis:entry>

         <oasis:entry colname="col6">P9 <inline-formula><mml:math id="M54" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> D12</oasis:entry>

         <oasis:entry colname="col7">P9 <inline-formula><mml:math id="M55" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> H2</oasis:entry>

         <oasis:entry colname="col8"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col3">(I) All particles</oasis:entry>

         <oasis:entry colname="col4">23.0 %</oasis:entry>

         <oasis:entry colname="col5">40.5 %</oasis:entry>

         <oasis:entry colname="col6">41.7 %</oasis:entry>

         <oasis:entry colname="col7">22.7 %</oasis:entry>

         <oasis:entry colname="col8">23.2 %</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col3">(II) Fluor. &gt; FT <inline-formula><mml:math id="M56" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> 3<inline-formula><mml:math id="M57" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col4">10.3 %</oasis:entry>

         <oasis:entry colname="col5">36.2 %</oasis:entry>

         <oasis:entry colname="col6">24.3 %</oasis:entry>

         <oasis:entry colname="col7">19.3 %</oasis:entry>

         <oasis:entry colname="col8">3.4 %</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col3">(III) Fluor. &gt; FT <inline-formula><mml:math id="M58" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> 9<inline-formula><mml:math id="M59" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col4">41.4 %</oasis:entry>

         <oasis:entry colname="col5">32.6 %</oasis:entry>

         <oasis:entry colname="col6">31.8 %</oasis:entry>

         <oasis:entry colname="col7">45.3 %</oasis:entry>

         <oasis:entry colname="col8">14.0 %</oasis:entry>

       </oasis:row>
       <oasis:row>

         <?xmltex \rotentry?><oasis:entry rowsep="1" colname="col2" morerows="4">Bio <inline-formula><mml:math id="M60" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> Bio</oasis:entry>

         <oasis:entry colname="col3">Input</oasis:entry>

         <oasis:entry colname="col4">(18)</oasis:entry>

         <oasis:entry colname="col5">(19)</oasis:entry>

         <oasis:entry colname="col6">(21)</oasis:entry>

         <oasis:entry colname="col7">(22)</oasis:entry>

         <oasis:entry colname="col8">(23)</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col3">F2 <inline-formula><mml:math id="M61" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> F4</oasis:entry>

         <oasis:entry colname="col4">F2 <inline-formula><mml:math id="M62" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> P9</oasis:entry>

         <oasis:entry colname="col5">B3 <inline-formula><mml:math id="M63" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> P9</oasis:entry>

         <oasis:entry colname="col6">B1 <inline-formula><mml:math id="M64" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> B3</oasis:entry>

         <oasis:entry colname="col7">P9 <inline-formula><mml:math id="M65" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> P5</oasis:entry>

         <oasis:entry colname="col8"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col3">(I) All particles</oasis:entry>

         <oasis:entry colname="col4">27.9 %</oasis:entry>

         <oasis:entry colname="col5">36.4 %</oasis:entry>

         <oasis:entry colname="col6">18.8 %</oasis:entry>

         <oasis:entry colname="col7">65.4 %</oasis:entry>

         <oasis:entry colname="col8">46.8 %</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col3">(II) Fluor. &gt; FT <inline-formula><mml:math id="M66" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> 3<inline-formula><mml:math id="M67" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col4">13.3 %</oasis:entry>

         <oasis:entry colname="col5">31.0 %</oasis:entry>

         <oasis:entry colname="col6">20.0 %</oasis:entry>

         <oasis:entry colname="col7">77.5 %</oasis:entry>

         <oasis:entry colname="col8">24.9 %</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col3">(III) Fluor. &gt; FT <inline-formula><mml:math id="M68" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> 9<inline-formula><mml:math id="M69" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col4">29.0 %</oasis:entry>

         <oasis:entry colname="col5">28.6 %</oasis:entry>

         <oasis:entry colname="col6">29.0 %</oasis:entry>

         <oasis:entry colname="col7">66.7 %</oasis:entry>

         <oasis:entry colname="col8">33.9 %</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col1"><bold>(b)</bold></oasis:entry>

         <oasis:entry colname="col2"/>

         <oasis:entry colname="col3"/>

         <oasis:entry colname="col4"/>

         <oasis:entry colname="col5"/>

         <oasis:entry colname="col6"/>

         <oasis:entry colname="col7"/>

         <oasis:entry colname="col8"/>

       </oasis:row>
       <oasis:row>

         <?xmltex \rotentry?><oasis:entry colname="col1" morerows="9">Number of particles</oasis:entry>

         <?xmltex \rotentry?><oasis:entry rowsep="1" colname="col2" morerows="4">Bio <inline-formula><mml:math id="M70" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> Nonbio</oasis:entry>

         <oasis:entry colname="col3">Input</oasis:entry>

         <oasis:entry colname="col4">(7)</oasis:entry>

         <oasis:entry colname="col5">(8)</oasis:entry>

         <oasis:entry colname="col6">(14)</oasis:entry>

         <oasis:entry colname="col7">(16)</oasis:entry>

         <oasis:entry colname="col8">(17)</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col3">F2 <inline-formula><mml:math id="M71" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> BC3</oasis:entry>

         <oasis:entry colname="col4">F2 <inline-formula><mml:math id="M72" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> WT</oasis:entry>

         <oasis:entry colname="col5">B3 <inline-formula><mml:math id="M73" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> WT</oasis:entry>

         <oasis:entry colname="col6">P9 <inline-formula><mml:math id="M74" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> D12</oasis:entry>

         <oasis:entry colname="col7">P9 <inline-formula><mml:math id="M75" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> H2</oasis:entry>

         <oasis:entry colname="col8"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col3">(I) All particles</oasis:entry>

         <oasis:entry colname="col4">1959</oasis:entry>

         <oasis:entry colname="col5">565</oasis:entry>

         <oasis:entry colname="col6">565</oasis:entry>

         <oasis:entry colname="col7">10 359</oasis:entry>

         <oasis:entry colname="col8">8902</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col3">(II) Fluor. &gt; FT <inline-formula><mml:math id="M76" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> 3<inline-formula><mml:math id="M77" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col4">1000</oasis:entry>

         <oasis:entry colname="col5">393</oasis:entry>

         <oasis:entry colname="col6">393</oasis:entry>

         <oasis:entry colname="col7">171</oasis:entry>

         <oasis:entry colname="col8">207</oasis:entry>

       </oasis:row>
       <oasis:row rowsep="1">

         <oasis:entry colname="col3">(III) Fluor. &gt; FT <inline-formula><mml:math id="M78" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> 9<inline-formula><mml:math id="M79" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col4">471</oasis:entry>

         <oasis:entry colname="col5">319</oasis:entry>

         <oasis:entry colname="col6">319</oasis:entry>

         <oasis:entry colname="col7">38</oasis:entry>

         <oasis:entry colname="col8">37</oasis:entry>

       </oasis:row>
       <oasis:row>

         <?xmltex \rotentry?><oasis:entry colname="col2" morerows="4">Bio <inline-formula><mml:math id="M80" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> Bio</oasis:entry>

         <oasis:entry colname="col3">Input</oasis:entry>

         <oasis:entry colname="col4">(18)</oasis:entry>

         <oasis:entry colname="col5">(19)</oasis:entry>

         <oasis:entry colname="col6">(21)</oasis:entry>

         <oasis:entry colname="col7">(22)</oasis:entry>

         <oasis:entry colname="col8">(23)</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col3">F2 <inline-formula><mml:math id="M81" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> F4</oasis:entry>

         <oasis:entry colname="col4">F2 <inline-formula><mml:math id="M82" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> P9</oasis:entry>

         <oasis:entry colname="col5">B3 <inline-formula><mml:math id="M83" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> P9</oasis:entry>

         <oasis:entry colname="col6">B1 <inline-formula><mml:math id="M84" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> B3</oasis:entry>

         <oasis:entry colname="col7">P9 <inline-formula><mml:math id="M85" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> P5</oasis:entry>

         <oasis:entry colname="col8"/>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col3">(I) All particles</oasis:entry>

         <oasis:entry colname="col4">10 000</oasis:entry>

         <oasis:entry colname="col5">8900</oasis:entry>

         <oasis:entry colname="col6">10 000</oasis:entry>

         <oasis:entry colname="col7">10 000</oasis:entry>

         <oasis:entry colname="col8">10 000</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col3">(II) Fluor. &gt; FT <inline-formula><mml:math id="M86" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> 3<inline-formula><mml:math id="M87" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col4">9600</oasis:entry>

         <oasis:entry colname="col5">8500</oasis:entry>

         <oasis:entry colname="col6">9800</oasis:entry>

         <oasis:entry colname="col7">10 000</oasis:entry>

         <oasis:entry colname="col8">10 000</oasis:entry>

       </oasis:row>
       <oasis:row>

         <oasis:entry colname="col3">(III) Fluor. &gt; FT <inline-formula><mml:math id="M88" display="inline"><mml:mo>+</mml:mo></mml:math></inline-formula> 9<inline-formula><mml:math id="M89" display="inline"><mml:mi mathvariant="italic">σ</mml:mi></mml:math></inline-formula></oasis:entry>

         <oasis:entry colname="col4">9200</oasis:entry>

         <oasis:entry colname="col5">8100</oasis:entry>

         <oasis:entry colname="col6">9700</oasis:entry>

         <oasis:entry colname="col7">10 000</oasis:entry>

         <oasis:entry colname="col8">7895</oasis:entry>

       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S4.SS3">
  <title>Investigating the impact of fluorescence thresholding strategy on
cluster quality</title>
      <p id="d1e2597">In previously published studies, removing particles from clustering analysis
that exhibited a particle fluorescence intensity below the threshold (i.e.,
nonfluorescent) or at the saturating point improved the efficiency of
clustering (Crawford et al., 2015; Ruske et al., 2017). In Sects. 4.1–4.2, particles with either of these characteristics were left in the
analysis to prevent the underestimation of the particles clustered. In this
section, however, we investigated whether removing nonfluorescent particles
could improve cluster accuracy for the experiments that performed poorly in
Sect. 4.2. Of the<?pagebreak page4937?> 23 trials represented in Table 2, 10 experiments
exhibited a 15 % or greater misclassification and were subjected to further
analysis in order to investigate whether using a more discriminating
fluorescence thresholding strategy could improve cluster results. In all 10 cases, fluorescence saturating particles were retained, and three separate
thresholding conditions were compared by (i) keeping all nonfluorescent
and saturating particles, (ii) removing nonfluorescent particles by
applying a fluorescence threshold of FT baseline <inline-formula><mml:math id="M90" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula>, and (iii) removing nonfluorescent particles by applying a fluorescence threshold
of FT baseline <inline-formula><mml:math id="M91" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">9</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula>. Savage et al. (2017) showed evidence that applying a FT <inline-formula><mml:math id="M92" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">9</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula> improved WIBS results by removing a higher fraction of
nonbiological material from analysis than the more commonly
used FT <inline-formula><mml:math id="M93" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula>, without negatively impacting observations of
biological particles. Table 3 shows the percentage of particles
misclassified in each of the three scenarios investigated here (Table 3a) as
well as the number of particles subjected to the clustering algorithm (Table 3b).</p>
      <p id="d1e2648">Each scenario, with exception of the B3 vs. B9 experiment 21, shows a
decrease in particle misclassification from scenario I (no fluorescence
threshold applied) to scenario II (FT <inline-formula><mml:math id="M94" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula>. In contrast, 8 of
the 10 scenarios <italic>increase</italic> in particle misclassification when raising the
fluorescence threshold from <inline-formula><mml:math id="M95" display="inline"><mml:mrow><mml:mn mathvariant="normal">3</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula> (II) to <inline-formula><mml:math id="M96" display="inline"><mml:mrow><mml:mn mathvariant="normal">9</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula> (III). The
exceptions to this trend are experiments 8 (F2 vs. WT) and 19 (F2 vs. P9),
which show a nominal improvement in error (2 %–4 % reduction) with an increased
threshold. We hypothesize that the <inline-formula><mml:math id="M97" display="inline"><mml:mrow><mml:mn mathvariant="normal">9</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula> results degrade, in most
cases, because the threshold becomes high enough that most weakly
fluorescing particles have been removed from analysis. This reduces the
ability of the cluster to group into low- and high-fluorescence categories,
and so remaining particles are separated less efficiently. Secondly,
removing particles at higher fluorescence thresholds leads to increasingly
poor counting statistics, as represented in Table 3b by the number of
particles included in each experiment. Overall, these results suggest that
inputting particles into the clustering analysis with at least a nominal
fluorescence threshold (i.e., FT <inline-formula><mml:math id="M98" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> can improve the clustering
results in many cases; however, increasing the threshold further may
decrease cluster quality.</p>

<?xmltex \floatpos{t}?><table-wrap id="Ch1.T4" specific-use="star"><caption><p id="d1e2716">Particle fraction for each type and total particle
number used as inputs for simulated mixtures. PBAP: primary biological aerosol particle.</p></caption><oasis:table frame="topbot"><?xmltex \begin{scaleboxenv}{.93}[.93]?><oasis:tgroup cols="12">
     <oasis:colspec colnum="1" colname="col1" align="left"/>
     <oasis:colspec colnum="2" colname="col2" align="left"/>
     <oasis:colspec colnum="3" colname="col3" align="right"/>
     <oasis:colspec colnum="4" colname="col4" align="right"/>
     <oasis:colspec colnum="5" colname="col5" align="right"/>
     <oasis:colspec colnum="6" colname="col6" align="right"/>
     <oasis:colspec colnum="7" colname="col7" align="right"/>
     <oasis:colspec colnum="8" colname="col8" align="right"/>
     <oasis:colspec colnum="9" colname="col9" align="right"/>
     <oasis:colspec colnum="10" colname="col10" align="right"/>
     <oasis:colspec colnum="11" colname="col11" align="right"/>
     <oasis:colspec colnum="12" colname="col12" align="right"/>
     <oasis:thead>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry rowsep="1" colname="col3">F2</oasis:entry>
         <oasis:entry rowsep="1" colname="col4">B3</oasis:entry>
         <oasis:entry rowsep="1" colname="col5">P9</oasis:entry>
         <oasis:entry rowsep="1" colname="col6">S4</oasis:entry>
         <oasis:entry rowsep="1" colname="col7">D12</oasis:entry>
         <oasis:entry rowsep="1" colname="col8">H2</oasis:entry>
         <oasis:entry rowsep="1" colname="col9">BC1</oasis:entry>
         <oasis:entry rowsep="1" colname="col10">WS</oasis:entry>
         <oasis:entry rowsep="1" colname="col11">WT</oasis:entry>
         <oasis:entry colname="col12"/>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">Mixture</oasis:entry>
         <oasis:entry colname="col2">Mixture</oasis:entry>
         <oasis:entry colname="col3"><italic>Asp. niger</italic></oasis:entry>
         <oasis:entry colname="col4"><italic>P. stutzeri</italic></oasis:entry>
         <oasis:entry colname="col5"><italic>Phleum</italic></oasis:entry>
         <oasis:entry colname="col6">Diesel</oasis:entry>
         <oasis:entry colname="col7">AZ Test</oasis:entry>
         <oasis:entry colname="col8">Suwannee</oasis:entry>
         <oasis:entry colname="col9">Brown</oasis:entry>
         <oasis:entry colname="col10">Wood</oasis:entry>
         <oasis:entry colname="col11">White</oasis:entry>
         <oasis:entry colname="col12">Total</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">number</oasis:entry>
         <oasis:entry colname="col2">name</oasis:entry>
         <oasis:entry colname="col3">(fungi)</oasis:entry>
         <oasis:entry colname="col4">(bacteria)</oasis:entry>
         <oasis:entry colname="col5"><italic>pratense</italic></oasis:entry>
         <oasis:entry colname="col6">soot</oasis:entry>
         <oasis:entry colname="col7">Dust</oasis:entry>
         <oasis:entry colname="col8">River</oasis:entry>
         <oasis:entry colname="col9">carbon 1</oasis:entry>
         <oasis:entry colname="col10">smoke</oasis:entry>
         <oasis:entry colname="col11">t-shirt</oasis:entry>
         <oasis:entry colname="col12">particle</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">(pollen)</oasis:entry>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8">Humic</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
         <oasis:entry colname="col11"/>
         <oasis:entry colname="col12">number</oasis:entry>
       </oasis:row>
       <oasis:row rowsep="1">
         <oasis:entry colname="col1"/>
         <oasis:entry colname="col2"/>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7"/>
         <oasis:entry colname="col8">Acid</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
         <oasis:entry colname="col11"/>
         <oasis:entry colname="col12"/>
       </oasis:row>
     </oasis:thead>
     <oasis:tbody>
       <oasis:row>
         <oasis:entry colname="col1">1</oasis:entry>
         <oasis:entry colname="col2">Four-comp. A</oasis:entry>
         <oasis:entry colname="col3">25 %</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">25 %</oasis:entry>
         <oasis:entry colname="col7">25 %</oasis:entry>
         <oasis:entry colname="col8">25 %</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
         <oasis:entry colname="col11"/>
         <oasis:entry colname="col12">680</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">2</oasis:entry>
         <oasis:entry colname="col2">Four-comp. B</oasis:entry>
         <oasis:entry colname="col3">25 %</oasis:entry>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">25 %</oasis:entry>
         <oasis:entry colname="col7">25 %</oasis:entry>
         <oasis:entry colname="col8"/>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10">25 %</oasis:entry>
         <oasis:entry colname="col11"/>
         <oasis:entry colname="col12">680</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">3</oasis:entry>
         <oasis:entry colname="col2">High PBAP</oasis:entry>
         <oasis:entry colname="col3">25 %</oasis:entry>
         <oasis:entry colname="col4">25 %</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7">20 %</oasis:entry>
         <oasis:entry colname="col8">20 %</oasis:entry>
         <oasis:entry colname="col9">10 %</oasis:entry>
         <oasis:entry colname="col10"/>
         <oasis:entry colname="col11"/>
         <oasis:entry colname="col12">850</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">4</oasis:entry>
         <oasis:entry colname="col2">Low PBAP</oasis:entry>
         <oasis:entry colname="col3">12.5 %</oasis:entry>
         <oasis:entry colname="col4">12.5 %</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6">15 %</oasis:entry>
         <oasis:entry colname="col7">15 %</oasis:entry>
         <oasis:entry colname="col8">15 %</oasis:entry>
         <oasis:entry colname="col9">15 %</oasis:entry>
         <oasis:entry colname="col10">15 %</oasis:entry>
         <oasis:entry colname="col11"/>
         <oasis:entry colname="col12">1134</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">5</oasis:entry>
         <oasis:entry colname="col2">Pollen</oasis:entry>
         <oasis:entry colname="col3"/>
         <oasis:entry colname="col4"/>
         <oasis:entry colname="col5">30 %</oasis:entry>
         <oasis:entry colname="col6">10 %</oasis:entry>
         <oasis:entry colname="col7">20 %</oasis:entry>
         <oasis:entry colname="col8">20 %</oasis:entry>
         <oasis:entry colname="col9">10 %</oasis:entry>
         <oasis:entry colname="col10">10 %</oasis:entry>
         <oasis:entry colname="col11"/>
         <oasis:entry colname="col12">850</oasis:entry>
       </oasis:row>
       <oasis:row>
         <oasis:entry colname="col1">6</oasis:entry>
         <oasis:entry colname="col2">Indoor air</oasis:entry>
         <oasis:entry colname="col3">20 %</oasis:entry>
         <oasis:entry colname="col4">20 %</oasis:entry>
         <oasis:entry colname="col5"/>
         <oasis:entry colname="col6"/>
         <oasis:entry colname="col7">20 %</oasis:entry>
         <oasis:entry colname="col8">20 %</oasis:entry>
         <oasis:entry colname="col9"/>
         <oasis:entry colname="col10"/>
         <oasis:entry colname="col11">20 %</oasis:entry>
         <oasis:entry colname="col12">850</oasis:entry>
       </oasis:row>
     </oasis:tbody>
   </oasis:tgroup><?xmltex \end{scaleboxenv}?></oasis:table></table-wrap>

</sec>
<sec id="Ch1.S4.SS4">
  <title>Investigating the capability to separate particles in
simulations of complex mixtures</title>
      <p id="d1e3172">To this point, our investigation has focused on a variety of individual
matchups between two distinct particle types. To better simulate real-world
scenarios, we computationally<?pagebreak page4938?> simulated six mixtures of particles by pooling
existing WIBS data from selected particle types at prescribed ratios. Each
simulated mixture was assembled to roughly represent a different
hypothetical mixture of particles that might be expected. Also, the
particles in each simulated mixture are assumed to be so diluted that any
agglomeration is negligible. Table 4 provides an overview of the percentage
of each particle type included as well as the total number of particles in
the mixture. Mixtures 1 and 2 were simulated arbitrarily to test if a
minority (25 %) of one type of fungal spores (F2) could be separated from
a majority (75 %) of a mixture of three different nonbiological
materials. Mixtures 3 and 4 synthesized arbitrary mixtures of two types of
bioaerosol (F2 and B3) with three or five types of nonbiological particles,
respectively. Mixture 5 was simulated to examine the separation of pollen
(P9) from a set of five nonbiological particles. Mixture 6 was simulated to
be similar to an indoor environment that might have a mixture of biological
particles (F2 and B3) with nonbiological materials, including bleached
fibers (WT). These mixtures are not intended to closely mimic any set of
individual ambient conditions but are rather used as very rough simulations for discussion and to prompt discussion related to future experiments
within the community. In a real-world sampling environment, one would also
expect a high concentration of nonfluorescent particles (e.g., most
organic aerosols, sea salt, dusts), but these were generally not sampled as
a part of the Savage et al. (2017) study, which
focused on fluorescent particles. As a result, relatively nonfluorescent
particles like D12 and H2 were included here as “fillers” in most mixtures
as surrogates for other types of nonfluorescent particles. Clustering
analysis was performed using the ratios listed in Table 4, the B scenario of
pre-normalization conditions, and the filtering of nonfluorescent particles below
the FT <inline-formula><mml:math id="M99" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula> threshold. In all cases, the number of clusters
retrieved after HAC was pre-defined to be the same as the number of particle
types input.</p>

      <?xmltex \floatpos{t}?><fig id="Ch1.F5" specific-use="star"><caption><p id="d1e3189">Overview
of computationally simulated mixtures. Six
mixtures shown as groups of rows, with input particle fractions defined in
Table 4. Panel <bold>(a)</bold>  shows the particle number retrieved by each
individual cluster (horizontal rows) categorized by each input particle
type (vertical columns). Panel <bold>(b)</bold> shows the particle number
categorized and grouped by particle classes (i.e., nonbiological and
biological). Panel <bold>(c)</bold> shows the misclassification of groups of
particles. Colors: light green – fungal spores; blue – bacteria; pink – pollen; dark green – grouped biologically; brown – all nonbiological.</p></caption>
          <?xmltex \igopts{width=483.69685pt}?><graphic xlink:href="https://amt.copernicus.org/articles/11/4929/2018/amt-11-4929-2018-f05.pdf"/>

        </fig>

      <p id="d1e3207">Cluster results from all six mixtures are summarized in Fig. 5. Figure 5a shows the number of particles from each type assigned to each
cluster, and panels (b) and (c) show results grouped by general particle
classification (brown for nonbiological and dark green for biological).
Overall, the ability of the HAC analysis to separate the biological
particles from the nonbiological particles was high. In some cases, the
quality of separation of one or two biological species from a mixture of
nonbiological materials was even higher than the two-material matchups shown
in Sects. 4.1–4.3. The two four-component mixtures showed a 22.4 % and
14.8 % misclassification of fungal spores. In both cases, a small fraction
of each of the nonbiological materials was mixed into the spore cluster,
whereas almost none (1.5 % and 0.6 %) of the spores were incorrectly
mixed into the sum of the nonbiological clusters.</p>
      <p id="d1e3210">Mixtures 3 and 4 showed a similar misclassification for fungal spores
(11.9 % and 13.8 %, respectively), whereas the bacterial particles
clustered with amazing quality. For Mixture 3, no particles other than
bacterial particles were grouped into Cluster 1, and only 16 of 213
bacterial particles were assigned to other clusters. For Mixture 4, 135 of
137 particles in Cluster 6 were bacterial in origin and 135 of 142 bacterial
particles were assigned to the cluster. The combination of fungal and
bacterial particles in mixtures 3 and 4 resulted in a total of 5.0 % and
5.3 % misclassification of all biological particles.</p>
      <p id="d1e3214">In contrast to the poor separation of pollen from other particle types
discussed in Sect. 4.2, Mixture 5 showed a higher quality of separation
between pollen (9.4 % misclassified) and the sum of five other
nonbiological particle types. Lastly, the mixture designed to roughly mimic
an indoor environment included white t-shirt particles. In this mixture the
WT particles confounded the spore separation, but the bacterial separation
was nearly flawless.</p>
      <p id="d1e3217">Another surprising observation from the analysis of these simulated mixtures
was that the diesel soot particles (mixtures 1, 2, 4, and 5) separated into
their own cluster in almost all cases with very high quality (1.8 %,
2.9 %, 0.6 %, and 9.4 %, respectively, of diesel soot particles
misclassified into a different cluster). The quality of the separation of
bacterial particles and diesel soot (Mixture 4) was especially<?pagebreak page4939?> good,
given the qualitative similarity of the two particle populations. For
example, size distributions of each particle type show primarily A-type
particles with similar mean fluorescent intensity values in FL1, FL2, and
FL3 (Savage et al., 2017).</p>
</sec>
</sec>
<sec id="Ch1.S5" sec-type="conclusions">
  <title>Conclusions</title>
      <p id="d1e3228">The application of results from a recent set of systematic laboratory
experiments (Savage et al., 2017) by the commonly used hierarchical
agglomerative clustering analysis helps to reveal areas where the tool can be
used well and other areas where it struggles. First (Sect. 4.1) it was
observed that differing ratios of particle input into the clustering
algorithm can produce dramatically different results. It will be important
for anyone applying HAC to ambient particle sets, where particle ratios are
not independently verified, to interpret results somewhat loosely. In
Sect. 4.1 the clustering quality of scenario B, where fluorescence intensity
was not normalized to particle size and where all variables were input in
logarithmic space as log(value), was determined to consistently demonstrate
the highest-quality results. Further, the ability of the HAC analysis to
separate between two groups of individual particle types using no
fluorescence threshold (Sect. 4.2) and comparing three separate threshold
strategies (Sect. 4.3) was shown to be relatively high in many cases but
confounded in others. Lastly, Sect. 4.4 explored the ability of<?pagebreak page4940?> HAC analysis
to separate biological components from more complex mixtures of four to seven
types of input particles.</p>
      <p id="d1e3231">A standard fluorescence threshold of FT <inline-formula><mml:math id="M100" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula> has been commonly
applied during WIBS analysis to separate between fluorescent and
nonfluorescent particles. Savage et al. (2017) concluded that the application
of a more aggressive threshold strategy (FT <inline-formula><mml:math id="M101" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">9</mml:mn><mml:mi mathvariant="italic">σ</mml:mi><mml:mo>)</mml:mo></mml:mrow></mml:math></inline-formula> could help
discriminate between biological and nonbiological particles more successfully
in many circumstances; however, certain types of interfering, nonbiological
particle species can still confound WIBS analysis, irrespective of the threshold. Here we have investigated an
orthogonal strategy to separate particle types by subjecting particles to HAC
computer analysis. By comparing the results of the HAC analysis with raw
separation based on fluorescence thresholding alone, the HAC analysis can
clearly increase the quality of differentiation. Interestingly, while Savage
et al. (2017) reported that the FT <inline-formula><mml:math id="M102" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">9</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula> strategy helped improved
differentiation, using the same threshold in conjunction with HAC analysis
actually degraded results. We therefore conclude that if HAC analysis is to
be performed, the standard FT <inline-formula><mml:math id="M103" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">3</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula> threshold is likely to produce
the highest-quality results; however, if HAC is not to be applied, the FT <inline-formula><mml:math id="M104" display="inline"><mml:mrow><mml:mo>+</mml:mo><mml:mn mathvariant="normal">9</mml:mn><mml:mi mathvariant="italic">σ</mml:mi></mml:mrow></mml:math></inline-formula> threshold is probably a better choice to enable the investigation
of biological particles while computationally filtering nonbiological
particles.</p>
      <p id="d1e3296">The overall message here is that HAC can be applied successfully to
differentiate particle types sampled by WIBS instruments and that it is most
successful at separating biological species (i.e., fungal spores and
bacteria) from nonbiological particles. In all cases the HAC method allows
the separation of particles at least at the order-of-magnitude level and
often with a misclassification of &lt; 5 %. As mentioned by Savage
et al. (2017), however, it should always be kept in mind that different
instruments may produce slightly different signals due to physical
differences between instruments (i.e., fluorescence calibration, tuning, and
detector gain sensitivity) and between calibration strategies (Könemann
et al., 2018; Robinson et al., 2017). Results here are also generally
extendable to other UV-LIF instruments, whether they offer single or many
channels of emission spectral resolution, in that the methods of particle
pre-preparation and the impact of the particle number ratio are likely to
relay similar effects to the clustering strategy. Subtle differences in
particles observed in a real-world environment will also complicate HAC analysis or the extension of results
presented here. The UV-LIF community is encouraged to continue laboratory
investigations, including a detailed interrogation of clustering analytical
techniques, to further understand limitations to better differentiating
between particles.</p>
</sec>

      
      </body>
    <back><notes notes-type="dataavailability">

      <p id="d1e3303">Plots in electronic format and experimental results can be provided upon request.</p>
  </notes><notes notes-type="authorcontribution">

      <p id="d1e3309">NJS ran clustering experiments and contributed to writing the paper.
JAH oversaw clustering experiments and contributed to writing the paper.</p>
  </notes><notes notes-type="competinginterests">

      <p id="d1e3315">The authors declare that they have no conflict of
interest.</p>
  </notes><ack><title>Acknowledgements</title><p id="d1e3321">The authors acknowledge the University of Denver for financial support from
the faculty start-up fund. Nicole Savage acknowledges financial support from
the Phillipson Graduate Fellowship at the University of Denver. Martin Gallagher, David Topping, and Simon Ruske in the School of Earth and
Environmental Sciences at the University of Manchester are acknowledged for
initial discussion regarding clustering strategy. Cathy Durso at the
University of Denver Center for Statistics and Visualization is acknowledged
for help running clustering algorithms. All contributors to the Savage et
al. (2017) paper, in which all experimental data discussed here were
originally presented, are acknowledged for their contributions.
<?xmltex \hack{\newline}?><?xmltex \hack{\newline}?>
Edited by: Mingjin Tang<?xmltex \hack{\newline}?>
Reviewed by: three anonymous referees</p></ack><ref-list>
    <title>References</title>

      <ref id="bib1.bib1"><label>1</label><mixed-citation>
Bond, T. C., Doherty, S. J., Fahey, D. W., Forster, P. M., Berntsen, T.,
DeAngelo, B. J., Flanner, M. G., Ghan, S., Karcher, B., Koch, D., Kinne, S.,
Kondo, Y., Quinn, P. K., Sarofim, M. C., Schultz, M. G., Schulz, M.,
Venkataraman, C., Zhang, H., Zhang, S., Bellouin, N., Guttikunda, S. K.,
Hopke, P. K., Jacobson, M. Z., Kaiser, J. W., Klimont, Z., Lohmann, U.,
Schwarz, J. P., Shindell, D., Storelvmo, T., Warren, S. G., and Zender, C.
S.: Bounding the role of black carbon in the climate system: A scientific
assessment, J. Geophys. Res.-Atmos., 118, 5380–5552, 2013.</mixed-citation></ref>
      <ref id="bib1.bib2"><label>2</label><mixed-citation>Crawford, I., Robinson, N. H., Flynn, M. J., Foot, V. E., Gallagher, M. W., Huffman, J. A., Stanley, W. R., and Kaye, P. H.: Characterisation of
bioaerosol emissions from a Colorado pine forest: results from the BEACHON-RoMBAS experiment, Atmos. Chem. Phys., 14, 8559–8578,
<ext-link xlink:href="https://doi.org/10.5194/acp-14-8559-2014" ext-link-type="DOI">10.5194/acp-14-8559-2014</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib3"><label>3</label><mixed-citation>Crawford, I., Ruske, S., Topping, D. O., and Gallagher, M. W.: Evaluation of hierarchical agglomerative cluster analysis methods for
discrimination of primary biological aerosol, Atmos. Meas. Tech., 8, 4979–4991, <ext-link xlink:href="https://doi.org/10.5194/amt-8-4979-2015" ext-link-type="DOI">10.5194/amt-8-4979-2015</ext-link>, 2015.</mixed-citation></ref>
      <ref id="bib1.bib4"><label>4</label><mixed-citation>Crawford, I., Lloyd, G., Herrmann, E., Hoyle, C. R., Bower, K. N., Connolly, P. J., Flynn, M. J., Kaye, P. H., Choularton, T. W.,
and Gallagher, M. W.: Observations of fluorescent aerosol-cloud interactions in the free troposphere at the High-Altitude
Research Station Jungfraujoch, Atmos. Chem. Phys., 16, 2273–2284, <ext-link xlink:href="https://doi.org/10.5194/acp-16-2273-2016" ext-link-type="DOI">10.5194/acp-16-2273-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib5"><label>5</label><mixed-citation>Després, V. R., Huffman, J. A., Burrows, S. M., Hoose, C., Safatov, A.
S., Buryak, G. A., Fröhlich-Nowoisky, J., Elbert, W., Andreae, M. O.,
Pöschl, U., and Jaenicke, R.: Primary Biological Aerosol Particles in
the Atmosphere: A Review, Tellus B,
64, 15598, <ext-link xlink:href="https://doi.org/10.3402/tellusb.v64i0.15598" ext-link-type="DOI">10.3402/tellusb.v64i0.15598</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib6"><label>6</label><mixed-citation>
Douwes, J., Thorne, P., Pearce, N., and Heederik, D.: Bioaerosol health
effects and exposure assessment: Progress and prospects, Ann. Occupa. Hyg., 47, 187–200, 2003.</mixed-citation></ref>
      <ref id="bib1.bib7"><label>7</label><mixed-citation>Eick, C. F., Zeidat, N., and Zhao, Z.: Supervised clustering-algorithms and
benefits, Conference proceedings,
Tools with Artificial Intelligence,  ICTAI 2004, <ext-link xlink:href="https://doi.org/10.1109/ICTAI.2004.111" ext-link-type="DOI">10.1109/ICTAI.2004.111</ext-link>,  774–776, 2004.</mixed-citation></ref>
      <ref id="bib1.bib8"><label>8</label><mixed-citation>Fennelly, M. J., Sewell, G., Prentice, M. B., O'Connor, D. J., and Sodeau,
J. R.: The Use of Real-Time Fluorescence Instrumentation to Monitor Ambient
Primary Biological Aerosol Particles (PBAP), Atmosphere, 9, 1–39, <ext-link xlink:href="https://doi.org/10.3390/atmos9010001" ext-link-type="DOI">10.3390/atmos9010001</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib9"><label>9</label><mixed-citation>Foot, V. E., Kaye, P. H., Stanley, W. R., Barrington, S. J., Gallagher, M.,
and Gabey, A.: Low-cost real-time multi-parameter bio-aerosol sensors,
Proceedings of the SPIE – The International Society for Optical Engineering,
7116, 711601, <ext-link xlink:href="https://doi.org/10.1117/12.800226" ext-link-type="DOI">10.1117/12.800226</ext-link>, 2008.</mixed-citation></ref>
      <ref id="bib1.bib10"><label>10</label><mixed-citation>Fröhlich-Nowoisky, J., Kampf, C. J., Weber, B., Huffman, J. A.,
Pöhlker, C., Andreae, M. O., Lang-Yona, N., Burrows, S. M., Gunthe, S.
S., Elbert, W., Su, H., Hoor, P., Thines, E., Hoffmann, T., Després, V.
R., and Pöschl, U.: Bioaerosols in the Earth system: Climate, health, and
ecosystem interactions, Atmos. Res., 182, 346–376,
<ext-link xlink:href="https://doi.org/10.1016/j.atmosres.2016.07.018" ext-link-type="DOI">10.1016/j.atmosres.2016.07.018</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib11"><label>11</label><mixed-citation>Gabey, A. M., Gallagher, M. W., Whitehead, J., Dorsey, J. R., Kaye, P. H., and Stanley, W. R.: Measurements and comparison of primary
biological aerosol above and below a tropical forest canopy using a dual channel fluorescence spectrometer, Atmos. Chem. Phys., 10,
4453–4466, <ext-link xlink:href="https://doi.org/10.5194/acp-10-4453-2010" ext-link-type="DOI">10.5194/acp-10-4453-2010</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib12"><label>12</label><mixed-citation>Gosselin, M. I., Rathnayake, C. M., Crawford, I., Pöhlker, C., Fröhlich-Nowoisky, J., Schmer, B., Després, V. R., Engling, G.,
Gallagher, M., Stone, E., Pöschl, U., and Huffman, J. A.: Fluorescent bioaerosol particle, molecular tracer, and fungal spore
concentrations during dry and rainy periods in a semi-arid forest, Atmos. Chem. Phys., 16, 15165–15184, <ext-link xlink:href="https://doi.org/10.5194/acp-16-15165-2016" ext-link-type="DOI">10.5194/acp-16-15165-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib13"><label>13</label><mixed-citation>
Healy, D. A., O'Connor, D. J., Burke, A. M., and Sodeau, J. R.: A laboratory
assessment of the Waveband Integrated Bioaerosol Sensor (WIBS-4) using
individual samples of pollen and fungal spore material, Atmos.
Environ., 60, 534–543, 2012a.</mixed-citation></ref>
      <ref id="bib1.bib14"><label>14</label><mixed-citation>
Healy, D. A., O'Connor, D. J., and Sodeau, J. R.: Measurement of the
particle counting efficiency of the “Waveband Integrated Bioaerosol
Sensor” model number 4 (WIBS-4), J. Aerosol Sci., 47, 94–99,
2012b.</mixed-citation></ref>
      <ref id="bib1.bib15"><label>15</label><mixed-citation>Hernandez, M., Perring, A. E., McCabe, K., Kok, G., Granger, G., and Baumgardner, D.: Chamber catalogues of optical and fluorescent
signatures distinguish bioaerosol classes, Atmos. Meas. Tech., 9, 3283–3292, <ext-link xlink:href="https://doi.org/10.5194/amt-9-3283-2016" ext-link-type="DOI">10.5194/amt-9-3283-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib16"><label>16</label><mixed-citation>
Hill, S. C., Pinnick, R. G., Niles, S., Fell, N. F., Pan, Y. L., Bottiger,
J., Bronk, B. V., Holler, S., and Chang, R. K.: Fluorescence from airborne
microparticles: dependence on size, concentration of fluorophores, and
illumination intensity, Appl. Opt., 40, 3005–3013, 2001.</mixed-citation></ref>
      <ref id="bib1.bib17"><label>17</label><mixed-citation>Huffman, D. R., Swanson, B. E., and Huffman, J. A.: A wavelength-dispersive instrument for characterizing fluorescence and scattering
spectra of individual aerosol particles on a substrate, Atmos. Meas. Tech., 9, 3987–3998, <ext-link xlink:href="https://doi.org/10.5194/amt-9-3987-2016" ext-link-type="DOI">10.5194/amt-9-3987-2016</ext-link>, 2016.</mixed-citation></ref>
      <ref id="bib1.bib18"><label>18</label><mixed-citation>
Huffman, J. A. and Santarpia, J.: Online Techniques for Quantification and
Characterization of Biological Aerosols, in: Microbiology of Aerosols, John
Wiley &amp; Sons, Inc., 2017.</mixed-citation></ref>
      <ref id="bib1.bib19"><label>19</label><mixed-citation>Huffman, J. A., Treutlein, B., and Pöschl, U.: Fluorescent biological aerosol particle concentrations and size distributions measured with an
Ultraviolet Aerodynamic Particle Sizer (UV-APS) in Central Europe, Atmos. Chem. Phys., 10, 3215–3233, <ext-link xlink:href="https://doi.org/10.5194/acp-10-3215-2010" ext-link-type="DOI">10.5194/acp-10-3215-2010</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib20"><label>20</label><mixed-citation>Huffman, J. A., Sinha, B., Garland, R. M., Snee-Pollmann, A., Gunthe, S. S., Artaxo, P., Martin, S. T., Andreae, M. O., and Pöschl, U.:
Size distributions and temporal variations of biological aerosol particles in the Amazon rainforest characterized by microscopy and
real-time UV-APS fluorescence techniques during AMAZE-08, Atmos. Chem. Phys., 12, 11997–12019, <ext-link xlink:href="https://doi.org/10.5194/acp-12-11997-2012" ext-link-type="DOI">10.5194/acp-12-11997-2012</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib21"><label>21</label><mixed-citation>
Kaye, P. H., Stanley, W. R., Hirst, E., Foot, E. V., Baxter, K. L., and
Barrington, S. J.: Single particle multichannel bio-aerosol fluorescence
sensor, Opt. Exp., 13, 3583–3593, 2005.</mixed-citation></ref>
      <ref id="bib1.bib22"><label>22</label><mixed-citation>
Kiselev, D., Bonacina, L., and Wolf, J.-P.: A flash-lamp based device for
fluorescence detection and identification of individual pollen grains,
Rev. Sci. Instrum., 84, 2013.</mixed-citation></ref>
      <ref id="bib1.bib23"><label>23</label><mixed-citation>Könemann, T., Savage, N. J., Huffman, J. A., and Pöhlker, C.: Characterization of steady-state fluorescence properties of polystyrene
latex spheres using off- and online spectroscopic methods, Atmos. Meas. Tech., 11, 3987–4003, <ext-link xlink:href="https://doi.org/10.5194/amt-11-3987-2018" ext-link-type="DOI">10.5194/amt-11-3987-2018</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib24"><label>24</label><mixed-citation>Liu, Y., Li, Z., Xiong, H., Gao, X., and Wu, J.: Understanding of internal
clustering validation measures,  IEEE International Conference on Data Mining
INSPEC Accession Number: 11770459,
Sydney, NSW, Australia, 911–916, <ext-link xlink:href="https://doi.org/10.1109/ICDM.2010.35" ext-link-type="DOI">10.1109/ICDM.2010.35</ext-link>, 2010.</mixed-citation></ref>
      <ref id="bib1.bib25"><label>25</label><mixed-citation>
Miguel, A. G., Taylor, P. E., House, J., Glovsky, M. M., and Flagan, R. C.:
Meteorological influences on respirable fragment release from Chinese elm
pollen, Aerosol Sci. Technol., 40, 690–696, 2006.</mixed-citation></ref>
      <ref id="bib1.bib26"><label>26</label><mixed-citation>
Müllner, D.: fastcluster: Fast hierarchical, agglomerative clustering
routines for R and Python, J. Stat. Softw., 53, 1–18, 2013.</mixed-citation></ref>
      <ref id="bib1.bib27"><label>27</label><mixed-citation>
Norusis, M.: Cluster Analysis. In: IBM SPSS Statistics 19 Guide to Data
Analysis, Norusis &amp; SPSS Inc., 2011.</mixed-citation></ref>
      <ref id="bib1.bib28"><label>28</label><mixed-citation>
O'Connor, D. J., Healy, D. A., Hellebust, S., Buters, J. T. M., and Sodeau,
J. R.: Using the WIBS-4 (Waveband Integrated Bioaerosol Sensor) Technique
for the On-Line Detection of Pollen Grains, Aerosol Sci. Technol., 48,
341–349, 2014.</mixed-citation></ref>
      <ref id="bib1.bib29"><label>29</label><mixed-citation>Pan, Y. L., Pinnick, R. G., Hill, S. C., Rosen, J. M., and Chang, R. K.:
Single-particle laser-induced-fluorescence spectra of biological and other
organic-carbon aerosols in the atmosphere: Measurements at New Haven,
Connecticut, and Las Cruces, New Mexico, J. Geophys. Res.-Atmos., 112,
D24S19, <ext-link xlink:href="https://doi.org/10.1029/2007jd008741" ext-link-type="DOI">10.1029/2007jd008741</ext-link>, 2007.</mixed-citation></ref>
      <ref id="bib1.bib30"><label>30</label><mixed-citation>
Pan, Y.-L., Pinnick, R. G., Hill, S. C., and Chang, R. K.:
Particle-Fluorescence Spectrometer for Real-Time Single-Particle
Measurements of Atmospheric Organic Carbon and Biological Aerosol, Environ.
Sci. Technol., 43, 429–434, 2009a.</mixed-citation></ref>
      <ref id="bib1.bib31"><label>31</label><mixed-citation>
Pan, Y. L., Pinnick, R. G., Hill, S. C., and Chang, R. K.:
Particle-Fluorescence Spectrometer for Real-Time Single-Particle
Measurements of Atmospheric Organic Carbon and Biological Aerosol, Environ.
Sci. Technol., 43, 429–434, 2009b.</mixed-citation></ref>
      <?pagebreak page4942?><ref id="bib1.bib32"><label>32</label><mixed-citation>
Pan, Y. L., Huang, H., and Chang, R. K.: Clustered and integrated
fluorescence spectra from single atmospheric aerosol particles excited by a
263-and 351-nm laser at New Haven, CT, and Adelphi, MD, J.
Quant. Spectrosc. Ra., 113, 2213–2221, 2012.</mixed-citation></ref>
      <ref id="bib1.bib33"><label>33</label><mixed-citation>
Perring, A. E., Schwarz, J. P., Baumgardner, D., Hernandez, M. T.,
Spracklen, D. V., Heald, C. L., Gao, R. S., Kok, G., McMeeking, G. R.,
McQuaid, J. B., and Fahey, D. W.: Airborne observations of regional
variation in fluorescent aerosol across the United States, J. Geophys.
Res.-Atmos., 120, 1153–1170, 2015.</mixed-citation></ref>
      <ref id="bib1.bib34"><label>34</label><mixed-citation>
Pinnick, R. G., Hill, S. C., Pan, Y. L., and Chang, R. K.: Fluorescence
spectra of atmospheric aerosol at Adelphi, Maryland, USA: measurement and
classification of single particles containing organic carbon, Atmos.
Environ., 38, 1657–1672, 2004.</mixed-citation></ref>
      <ref id="bib1.bib35"><label>35</label><mixed-citation>
Pinnick, R. G., Fernandez, E., Rosen, J. M., Hill, S. C., Wang, Y., and Pan,
Y. L.: Fluorescence spectra and elastic scattering characteristics of
atmospheric aerosol in Las Cruces, New Mexico, USA: Variability of
concentrations and possible constituents and sources of particles in various
spectral clusters, Atmos. Environ., 65, 195–204, 2013.</mixed-citation></ref>
      <ref id="bib1.bib36"><label>36</label><mixed-citation>Pöhlker, C., Huffman, J. A., and Pöschl, U.: Autofluorescence of atmospheric bioaerosols – fluorescent biomolecules and potential
interferences, Atmos. Meas. Tech., 5, 37–71, <ext-link xlink:href="https://doi.org/10.5194/amt-5-37-2012" ext-link-type="DOI">10.5194/amt-5-37-2012</ext-link>, 2012.</mixed-citation></ref>
      <ref id="bib1.bib37"><label>37</label><mixed-citation>Pöhlker, C., Huffman, J. A., Förster, J.-D., and Pöschl, U.: Autofluorescence of atmospheric bioaerosols: spectral
fingerprints and taxonomic trends of pollen, Atmos. Meas. Tech., 6, 3369–3392, <ext-link xlink:href="https://doi.org/10.5194/amt-6-3369-2013" ext-link-type="DOI">10.5194/amt-6-3369-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib38"><label>38</label><mixed-citation>Robinson, E. S., Gao, R.-S., Schwarz, J. P., Fahey, D. W., and Perring, A. E.: Fluorescence calibration method for single-particle
aerosol fluorescence instruments, Atmos. Meas. Tech., 10, 1755–1768, <ext-link xlink:href="https://doi.org/10.5194/amt-10-1755-2017" ext-link-type="DOI">10.5194/amt-10-1755-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib39"><label>39</label><mixed-citation>Robinson, N. H., Allan, J. D., Huffman, J. A., Kaye, P. H., Foot, V. E., and Gallagher, M.: Cluster analysis of WIBS
single-particle bioaerosol data, Atmos. Meas. Tech., 6, 337–347, <ext-link xlink:href="https://doi.org/10.5194/amt-6-337-2013" ext-link-type="DOI">10.5194/amt-6-337-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib40"><label>40</label><mixed-citation>Ruske, S., Topping, D. O., Foot, V. E., Kaye, P. H., Stanley, W. R., Crawford, I., Morse, A. P., and Gallagher, M. W.: Evaluation
of machine learning algorithms for classification of primary biological aerosol using a new UV-LIF spectrometer,
Atmos. Meas. Tech., 10, 695–708, <ext-link xlink:href="https://doi.org/10.5194/amt-10-695-2017" ext-link-type="DOI">10.5194/amt-10-695-2017</ext-link>, 2017.</mixed-citation></ref>
      <ref id="bib1.bib41"><label>41</label><mixed-citation>Ruske, S., Topping, D. O., Foot, V. E., Morse, A. P., and Gallagher, M. W.: Machine learning for improved data analysis of
biological aerosol using the WIBS, Atmos. Meas. Tech. Discuss., <ext-link xlink:href="https://doi.org/10.5194/amt-2018-126" ext-link-type="DOI">10.5194/amt-2018-126</ext-link>, in review,
2018.</mixed-citation></ref>
      <ref id="bib1.bib42"><label>42</label><mixed-citation>Savage, N. J., Krentz, C. E., Könemann, T., Han, T. T., Mainelis, G., Pöhlker, C., and Huffman, J. A.: Systematic
characterization and fluorescence threshold strategies for the wideband integrated bioaerosol sensor (WIBS)
using size-resolved biological and interfering particles, Atmos. Meas. Tech., 10, 4279–4302, <ext-link xlink:href="https://doi.org/10.5194/amt-10-4279-2017" ext-link-type="DOI">10.5194/amt-10-4279-2017</ext-link>, 2017.
</mixed-citation></ref><?xmltex \hack{\newpage}?>
      <ref id="bib1.bib43"><label>43</label><mixed-citation>
Shiraiwa, M., Ueda, K., Pozzer, A., Lammel, G., Kampf, C. J., Fushimi, A.,
Enami, S., Arangio, A. M., Frohlich-Nowoisky, J., Fujitani, Y., Furuyama,
A., Lakey, P. S. J., Lelieveld, J., Lucas, K., Morino, Y., Poschl, U.,
Takaharna, S., Takami, A., Tong, H. J., Weber, B., Yoshino, A., and Sato,
K.: Aerosol Health Effects from Molecular to Global Scales, Environ. Sci.
Technol., 51, 13545–13567, 2017.</mixed-citation></ref>
      <ref id="bib1.bib44"><label>44</label><mixed-citation>
Sivaprakasam, V., Lin, H.-B., Huston, A. L., and Eversole, J. D.: Spectral
characterization of biological aerosol particles using two-wavelength
excited laser-induced fluorescence and elastic scattering measurements,
Opt. Exp., 19, 6191–6208, 2011.</mixed-citation></ref>
      <ref id="bib1.bib45"><label>45</label><mixed-citation>
Sodeau, J. R. and O'Connor, D. J.: Chapter 16 – Bioaerosol Monitoring of the
Atmosphere for Occupational and Environmental Purposes, in: Comprehensive
Analytical Chemistry, edited by: de la Guardia, M. and Armenta, S., Elsevier,
2016.</mixed-citation></ref>
      <ref id="bib1.bib46"><label>46</label><mixed-citation>
Stanley, W. R., Kaye, P. H., Foot, V. E., Barrington, S. J., Gallagher, M.,
and Gabey, A.: Continuous bioaerosol monitoring in a tropical environment
using a UV fluorescence particle spectrometer, Atmos. Sci. Lett.,
12, 195–199, 2011.</mixed-citation></ref>
      <ref id="bib1.bib47"><label>47</label><mixed-citation>
Suphioglu, C., Singh, M. B., Taylor, P., Knox, R. B., Bellomo, R., Holmes,
P., and Puy, R.: Mechanism of grass-pollen-induced asthma, Lancet, 339,
569–572, 1992.</mixed-citation></ref>
      <ref id="bib1.bib48"><label>48</label><mixed-citation>Swanson, B. E. and Huffman, J. A.: Development and characterization of an
inexpensive single-particle fluorescence spectrometer for bioaerosol
monitoring, Opt. Exp., 26, 3646–3660, <ext-link xlink:href="https://doi.org/10.1364/OE.26.003646" ext-link-type="DOI">10.1364/OE.26.003646</ext-link>, 2018.</mixed-citation></ref>
      <ref id="bib1.bib49"><label>49</label><mixed-citation>
Taylor, P. E., Flagan, R. C., Miguel, A. G., Valenta, R., and Glovsky, M.
M.: Birch pollen rupture and the release of aerosols of respirable
allergens, Clin. Exp. Allergy, 34, 1591–1596, 2004.</mixed-citation></ref>
      <ref id="bib1.bib50"><label>50</label><mixed-citation>Toprak, E. and Schnaiter, M.: Fluorescent biological aerosol particles measured with the Waveband Integrated Bioaerosol
Sensor WIBS-4: laboratory tests combined with a one year field study, Atmos. Chem. Phys., 13, 225–243, <ext-link xlink:href="https://doi.org/10.5194/acp-13-225-2013" ext-link-type="DOI">10.5194/acp-13-225-2013</ext-link>, 2013.</mixed-citation></ref>
      <ref id="bib1.bib51"><label>51</label><mixed-citation>Wright, T. P., Hader, J. D., McMeeking, G. R., and Petters, M. D.: High
Relative Humidity as a Trigger for Widespread Release of Ice Nuclei, Aerosol
Sci. Technol., 48, i-v, <ext-link xlink:href="https://doi.org/10.1080/02786826.2014.968244" ext-link-type="DOI">10.1080/02786826.2014.968244</ext-link>, 2014.</mixed-citation></ref>
      <ref id="bib1.bib52"><label>52</label><mixed-citation>Yu, X., Wang, Z., Zhang, M., Kuhn, U., Xie, Z., Cheng, Y., Pöschl, U., and Su, H.: Ambient measurement of fluorescent aerosol particles
with a WIBS in the Yangtze River Delta of China: potential impacts of combustion-related aerosol particles, Atmos. Chem. Phys., 16,
11337–11348, <ext-link xlink:href="https://doi.org/10.5194/acp-16-11337-2016" ext-link-type="DOI">10.5194/acp-16-11337-2016</ext-link>, 2016.</mixed-citation></ref>

  </ref-list></back>
    <!--<article-title-html>Evaluation of a hierarchical agglomerative clustering method applied to WIBS laboratory data for improved discrimination of biological particles by comparing data preparation techniques</article-title-html>
<abstract-html><p>Hierarchical agglomerative clustering (HAC) analysis has been successfully
applied to several sets of ambient data (e.g., Crawford et al., 2015;
Robinson et al., 2013) and with respect to standardized particles in the
laboratory environment (Ruske et al., 2017, 2018). Here
we show for the first time a systematic application of HAC to a comprehensive
set of laboratory data collected for many individual particle types using the
wideband integrated bioaerosol sensor (WIBS-4A)
(Savage et al., 2017). The impact of the ratio of
particle concentrations on HAC results was investigated, showing that
clustering quality can vary dramatically as a function of ratio. Six
strategies for particle preprocessing were also compared, concluding that
using raw fluorescence intensity (without normalizing to particle size) and
logarithmically transforming data values (scenario B) consistently produced
the highest-quality results for the particle types analyzed. A total of 23
one-to-one matchups of individual particles types was investigated. Results
showed a cluster misclassification of &lt;&thinsp;15&thinsp;% for 12 of 17 numerical
experiments using one biological and one nonbiological particle type each.
Inputting fluorescence data using a baseline +3<i>σ</i> threshold
produced a lower degree of misclassification than when inputting either all particles
(without a fluorescence threshold) or a baseline +9<i>σ</i> threshold.
Lastly, six numerical simulations of mixtures of four to seven components
were analyzed using HAC. These results show that a range of 12&thinsp;%–24&thinsp;% of
fungal clusters was consistently misclassified by inclusion of a mixture of
nonbiological materials, whereas bacteria and diesel soot were each able to
be separated with nearly 100&thinsp;% efficiency. The study gives significant
support to clustering analysis commonly being applied to data from commercial
ultraviolet laser/light-induced fluorescence (UV-LIF) instruments used for bioaerosol research across the
globe and provides practical tools that will improve clustering results
within scientific studies as a part of diverse research disciplines.</p></abstract-html>
<ref-html id="bib1.bib1"><label>1</label><mixed-citation>
Bond, T. C., Doherty, S. J., Fahey, D. W., Forster, P. M., Berntsen, T.,
DeAngelo, B. J., Flanner, M. G., Ghan, S., Karcher, B., Koch, D., Kinne, S.,
Kondo, Y., Quinn, P. K., Sarofim, M. C., Schultz, M. G., Schulz, M.,
Venkataraman, C., Zhang, H., Zhang, S., Bellouin, N., Guttikunda, S. K.,
Hopke, P. K., Jacobson, M. Z., Kaiser, J. W., Klimont, Z., Lohmann, U.,
Schwarz, J. P., Shindell, D., Storelvmo, T., Warren, S. G., and Zender, C.
S.: Bounding the role of black carbon in the climate system: A scientific
assessment, J. Geophys. Res.-Atmos., 118, 5380–5552, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib2"><label>2</label><mixed-citation>
Crawford, I., Robinson, N. H., Flynn, M. J., Foot, V. E., Gallagher, M. W., Huffman, J. A., Stanley, W. R., and Kaye, P. H.: Characterisation of
bioaerosol emissions from a Colorado pine forest: results from the BEACHON-RoMBAS experiment, Atmos. Chem. Phys., 14, 8559–8578,
<a href="https://doi.org/10.5194/acp-14-8559-2014" target="_blank">https://doi.org/10.5194/acp-14-8559-2014</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib3"><label>3</label><mixed-citation>
Crawford, I., Ruske, S., Topping, D. O., and Gallagher, M. W.: Evaluation of hierarchical agglomerative cluster analysis methods for
discrimination of primary biological aerosol, Atmos. Meas. Tech., 8, 4979–4991, <a href="https://doi.org/10.5194/amt-8-4979-2015" target="_blank">https://doi.org/10.5194/amt-8-4979-2015</a>, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib4"><label>4</label><mixed-citation>
Crawford, I., Lloyd, G., Herrmann, E., Hoyle, C. R., Bower, K. N., Connolly, P. J., Flynn, M. J., Kaye, P. H., Choularton, T. W.,
and Gallagher, M. W.: Observations of fluorescent aerosol-cloud interactions in the free troposphere at the High-Altitude
Research Station Jungfraujoch, Atmos. Chem. Phys., 16, 2273–2284, <a href="https://doi.org/10.5194/acp-16-2273-2016" target="_blank">https://doi.org/10.5194/acp-16-2273-2016</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib5"><label>5</label><mixed-citation>
Després, V. R., Huffman, J. A., Burrows, S. M., Hoose, C., Safatov, A.
S., Buryak, G. A., Fröhlich-Nowoisky, J., Elbert, W., Andreae, M. O.,
Pöschl, U., and Jaenicke, R.: Primary Biological Aerosol Particles in
the Atmosphere: A Review, Tellus B,
64, 15598, <a href="https://doi.org/10.3402/tellusb.v64i0.15598" target="_blank">https://doi.org/10.3402/tellusb.v64i0.15598</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib6"><label>6</label><mixed-citation>
Douwes, J., Thorne, P., Pearce, N., and Heederik, D.: Bioaerosol health
effects and exposure assessment: Progress and prospects, Ann. Occupa. Hyg., 47, 187–200, 2003.
</mixed-citation></ref-html>
<ref-html id="bib1.bib7"><label>7</label><mixed-citation>
Eick, C. F., Zeidat, N., and Zhao, Z.: Supervised clustering-algorithms and
benefits, Conference proceedings,
Tools with Artificial Intelligence,  ICTAI 2004, <a href="https://doi.org/10.1109/ICTAI.2004.111" target="_blank">https://doi.org/10.1109/ICTAI.2004.111</a>,  774–776, 2004.
</mixed-citation></ref-html>
<ref-html id="bib1.bib8"><label>8</label><mixed-citation>
Fennelly, M. J., Sewell, G., Prentice, M. B., O'Connor, D. J., and Sodeau,
J. R.: The Use of Real-Time Fluorescence Instrumentation to Monitor Ambient
Primary Biological Aerosol Particles (PBAP), Atmosphere, 9, 1–39, <a href="https://doi.org/10.3390/atmos9010001" target="_blank">https://doi.org/10.3390/atmos9010001</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib9"><label>9</label><mixed-citation>
Foot, V. E., Kaye, P. H., Stanley, W. R., Barrington, S. J., Gallagher, M.,
and Gabey, A.: Low-cost real-time multi-parameter bio-aerosol sensors,
Proceedings of the SPIE – The International Society for Optical Engineering,
7116, 711601, <a href="https://doi.org/10.1117/12.800226" target="_blank">https://doi.org/10.1117/12.800226</a>, 2008.
</mixed-citation></ref-html>
<ref-html id="bib1.bib10"><label>10</label><mixed-citation>
Fröhlich-Nowoisky, J., Kampf, C. J., Weber, B., Huffman, J. A.,
Pöhlker, C., Andreae, M. O., Lang-Yona, N., Burrows, S. M., Gunthe, S.
S., Elbert, W., Su, H., Hoor, P., Thines, E., Hoffmann, T., Després, V.
R., and Pöschl, U.: Bioaerosols in the Earth system: Climate, health, and
ecosystem interactions, Atmos. Res., 182, 346–376,
<a href="https://doi.org/10.1016/j.atmosres.2016.07.018" target="_blank">https://doi.org/10.1016/j.atmosres.2016.07.018</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib11"><label>11</label><mixed-citation>
Gabey, A. M., Gallagher, M. W., Whitehead, J., Dorsey, J. R., Kaye, P. H., and Stanley, W. R.: Measurements and comparison of primary
biological aerosol above and below a tropical forest canopy using a dual channel fluorescence spectrometer, Atmos. Chem. Phys., 10,
4453–4466, <a href="https://doi.org/10.5194/acp-10-4453-2010" target="_blank">https://doi.org/10.5194/acp-10-4453-2010</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib12"><label>12</label><mixed-citation>
Gosselin, M. I., Rathnayake, C. M., Crawford, I., Pöhlker, C., Fröhlich-Nowoisky, J., Schmer, B., Després, V. R., Engling, G.,
Gallagher, M., Stone, E., Pöschl, U., and Huffman, J. A.: Fluorescent bioaerosol particle, molecular tracer, and fungal spore
concentrations during dry and rainy periods in a semi-arid forest, Atmos. Chem. Phys., 16, 15165–15184, <a href="https://doi.org/10.5194/acp-16-15165-2016" target="_blank">https://doi.org/10.5194/acp-16-15165-2016</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib13"><label>13</label><mixed-citation>
Healy, D. A., O'Connor, D. J., Burke, A. M., and Sodeau, J. R.: A laboratory
assessment of the Waveband Integrated Bioaerosol Sensor (WIBS-4) using
individual samples of pollen and fungal spore material, Atmos.
Environ., 60, 534–543, 2012a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib14"><label>14</label><mixed-citation>
Healy, D. A., O'Connor, D. J., and Sodeau, J. R.: Measurement of the
particle counting efficiency of the “Waveband Integrated Bioaerosol
Sensor” model number 4 (WIBS-4), J. Aerosol Sci., 47, 94–99,
2012b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib15"><label>15</label><mixed-citation>
Hernandez, M., Perring, A. E., McCabe, K., Kok, G., Granger, G., and Baumgardner, D.: Chamber catalogues of optical and fluorescent
signatures distinguish bioaerosol classes, Atmos. Meas. Tech., 9, 3283–3292, <a href="https://doi.org/10.5194/amt-9-3283-2016" target="_blank">https://doi.org/10.5194/amt-9-3283-2016</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib16"><label>16</label><mixed-citation>
Hill, S. C., Pinnick, R. G., Niles, S., Fell, N. F., Pan, Y. L., Bottiger,
J., Bronk, B. V., Holler, S., and Chang, R. K.: Fluorescence from airborne
microparticles: dependence on size, concentration of fluorophores, and
illumination intensity, Appl. Opt., 40, 3005–3013, 2001.
</mixed-citation></ref-html>
<ref-html id="bib1.bib17"><label>17</label><mixed-citation>
Huffman, D. R., Swanson, B. E., and Huffman, J. A.: A wavelength-dispersive instrument for characterizing fluorescence and scattering
spectra of individual aerosol particles on a substrate, Atmos. Meas. Tech., 9, 3987–3998, <a href="https://doi.org/10.5194/amt-9-3987-2016" target="_blank">https://doi.org/10.5194/amt-9-3987-2016</a>, 2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib18"><label>18</label><mixed-citation>
Huffman, J. A. and Santarpia, J.: Online Techniques for Quantification and
Characterization of Biological Aerosols, in: Microbiology of Aerosols, John
Wiley &amp; Sons, Inc., 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib19"><label>19</label><mixed-citation>
Huffman, J. A., Treutlein, B., and Pöschl, U.: Fluorescent biological aerosol particle concentrations and size distributions measured with an
Ultraviolet Aerodynamic Particle Sizer (UV-APS) in Central Europe, Atmos. Chem. Phys., 10, 3215–3233, <a href="https://doi.org/10.5194/acp-10-3215-2010" target="_blank">https://doi.org/10.5194/acp-10-3215-2010</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib20"><label>20</label><mixed-citation>
Huffman, J. A., Sinha, B., Garland, R. M., Snee-Pollmann, A., Gunthe, S. S., Artaxo, P., Martin, S. T., Andreae, M. O., and Pöschl, U.:
Size distributions and temporal variations of biological aerosol particles in the Amazon rainforest characterized by microscopy and
real-time UV-APS fluorescence techniques during AMAZE-08, Atmos. Chem. Phys., 12, 11997–12019, <a href="https://doi.org/10.5194/acp-12-11997-2012" target="_blank">https://doi.org/10.5194/acp-12-11997-2012</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib21"><label>21</label><mixed-citation>
Kaye, P. H., Stanley, W. R., Hirst, E., Foot, E. V., Baxter, K. L., and
Barrington, S. J.: Single particle multichannel bio-aerosol fluorescence
sensor, Opt. Exp., 13, 3583–3593, 2005.
</mixed-citation></ref-html>
<ref-html id="bib1.bib22"><label>22</label><mixed-citation>
Kiselev, D., Bonacina, L., and Wolf, J.-P.: A flash-lamp based device for
fluorescence detection and identification of individual pollen grains,
Rev. Sci. Instrum., 84, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib23"><label>23</label><mixed-citation>
Könemann, T., Savage, N. J., Huffman, J. A., and Pöhlker, C.: Characterization of steady-state fluorescence properties of polystyrene
latex spheres using off- and online spectroscopic methods, Atmos. Meas. Tech., 11, 3987–4003, <a href="https://doi.org/10.5194/amt-11-3987-2018" target="_blank">https://doi.org/10.5194/amt-11-3987-2018</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib24"><label>24</label><mixed-citation>
Liu, Y., Li, Z., Xiong, H., Gao, X., and Wu, J.: Understanding of internal
clustering validation measures,  IEEE International Conference on Data Mining
INSPEC Accession Number: 11770459,
Sydney, NSW, Australia, 911–916, <a href="https://doi.org/10.1109/ICDM.2010.35" target="_blank">https://doi.org/10.1109/ICDM.2010.35</a>, 2010.
</mixed-citation></ref-html>
<ref-html id="bib1.bib25"><label>25</label><mixed-citation>
Miguel, A. G., Taylor, P. E., House, J., Glovsky, M. M., and Flagan, R. C.:
Meteorological influences on respirable fragment release from Chinese elm
pollen, Aerosol Sci. Technol., 40, 690–696, 2006.
</mixed-citation></ref-html>
<ref-html id="bib1.bib26"><label>26</label><mixed-citation>
Müllner, D.: fastcluster: Fast hierarchical, agglomerative clustering
routines for R and Python, J. Stat. Softw., 53, 1–18, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib27"><label>27</label><mixed-citation>
Norusis, M.: Cluster Analysis. In: IBM SPSS Statistics 19 Guide to Data
Analysis, Norusis &amp; SPSS Inc., 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib28"><label>28</label><mixed-citation>
O'Connor, D. J., Healy, D. A., Hellebust, S., Buters, J. T. M., and Sodeau,
J. R.: Using the WIBS-4 (Waveband Integrated Bioaerosol Sensor) Technique
for the On-Line Detection of Pollen Grains, Aerosol Sci. Technol., 48,
341–349, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib29"><label>29</label><mixed-citation>
Pan, Y. L., Pinnick, R. G., Hill, S. C., Rosen, J. M., and Chang, R. K.:
Single-particle laser-induced-fluorescence spectra of biological and other
organic-carbon aerosols in the atmosphere: Measurements at New Haven,
Connecticut, and Las Cruces, New Mexico, J. Geophys. Res.-Atmos., 112,
D24S19, <a href="https://doi.org/10.1029/2007jd008741" target="_blank">https://doi.org/10.1029/2007jd008741</a>, 2007.
</mixed-citation></ref-html>
<ref-html id="bib1.bib30"><label>30</label><mixed-citation>
Pan, Y.-L., Pinnick, R. G., Hill, S. C., and Chang, R. K.:
Particle-Fluorescence Spectrometer for Real-Time Single-Particle
Measurements of Atmospheric Organic Carbon and Biological Aerosol, Environ.
Sci. Technol., 43, 429–434, 2009a.
</mixed-citation></ref-html>
<ref-html id="bib1.bib31"><label>31</label><mixed-citation>
Pan, Y. L., Pinnick, R. G., Hill, S. C., and Chang, R. K.:
Particle-Fluorescence Spectrometer for Real-Time Single-Particle
Measurements of Atmospheric Organic Carbon and Biological Aerosol, Environ.
Sci. Technol., 43, 429–434, 2009b.
</mixed-citation></ref-html>
<ref-html id="bib1.bib32"><label>32</label><mixed-citation>
Pan, Y. L., Huang, H., and Chang, R. K.: Clustered and integrated
fluorescence spectra from single atmospheric aerosol particles excited by a
263-and 351-nm laser at New Haven, CT, and Adelphi, MD, J.
Quant. Spectrosc. Ra., 113, 2213–2221, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib33"><label>33</label><mixed-citation>
Perring, A. E., Schwarz, J. P., Baumgardner, D., Hernandez, M. T.,
Spracklen, D. V., Heald, C. L., Gao, R. S., Kok, G., McMeeking, G. R.,
McQuaid, J. B., and Fahey, D. W.: Airborne observations of regional
variation in fluorescent aerosol across the United States, J. Geophys.
Res.-Atmos., 120, 1153–1170, 2015.
</mixed-citation></ref-html>
<ref-html id="bib1.bib34"><label>34</label><mixed-citation>
Pinnick, R. G., Hill, S. C., Pan, Y. L., and Chang, R. K.: Fluorescence
spectra of atmospheric aerosol at Adelphi, Maryland, USA: measurement and
classification of single particles containing organic carbon, Atmos.
Environ., 38, 1657–1672, 2004.
</mixed-citation></ref-html>
<ref-html id="bib1.bib35"><label>35</label><mixed-citation>
Pinnick, R. G., Fernandez, E., Rosen, J. M., Hill, S. C., Wang, Y., and Pan,
Y. L.: Fluorescence spectra and elastic scattering characteristics of
atmospheric aerosol in Las Cruces, New Mexico, USA: Variability of
concentrations and possible constituents and sources of particles in various
spectral clusters, Atmos. Environ., 65, 195–204, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib36"><label>36</label><mixed-citation>
Pöhlker, C., Huffman, J. A., and Pöschl, U.: Autofluorescence of atmospheric bioaerosols – fluorescent biomolecules and potential
interferences, Atmos. Meas. Tech., 5, 37–71, <a href="https://doi.org/10.5194/amt-5-37-2012" target="_blank">https://doi.org/10.5194/amt-5-37-2012</a>, 2012.
</mixed-citation></ref-html>
<ref-html id="bib1.bib37"><label>37</label><mixed-citation>
Pöhlker, C., Huffman, J. A., Förster, J.-D., and Pöschl, U.: Autofluorescence of atmospheric bioaerosols: spectral
fingerprints and taxonomic trends of pollen, Atmos. Meas. Tech., 6, 3369–3392, <a href="https://doi.org/10.5194/amt-6-3369-2013" target="_blank">https://doi.org/10.5194/amt-6-3369-2013</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib38"><label>38</label><mixed-citation>
Robinson, E. S., Gao, R.-S., Schwarz, J. P., Fahey, D. W., and Perring, A. E.: Fluorescence calibration method for single-particle
aerosol fluorescence instruments, Atmos. Meas. Tech., 10, 1755–1768, <a href="https://doi.org/10.5194/amt-10-1755-2017" target="_blank">https://doi.org/10.5194/amt-10-1755-2017</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib39"><label>39</label><mixed-citation>
Robinson, N. H., Allan, J. D., Huffman, J. A., Kaye, P. H., Foot, V. E., and Gallagher, M.: Cluster analysis of WIBS
single-particle bioaerosol data, Atmos. Meas. Tech., 6, 337–347, <a href="https://doi.org/10.5194/amt-6-337-2013" target="_blank">https://doi.org/10.5194/amt-6-337-2013</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib40"><label>40</label><mixed-citation>
Ruske, S., Topping, D. O., Foot, V. E., Kaye, P. H., Stanley, W. R., Crawford, I., Morse, A. P., and Gallagher, M. W.: Evaluation
of machine learning algorithms for classification of primary biological aerosol using a new UV-LIF spectrometer,
Atmos. Meas. Tech., 10, 695–708, <a href="https://doi.org/10.5194/amt-10-695-2017" target="_blank">https://doi.org/10.5194/amt-10-695-2017</a>, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib41"><label>41</label><mixed-citation>
Ruske, S., Topping, D. O., Foot, V. E., Morse, A. P., and Gallagher, M. W.: Machine learning for improved data analysis of
biological aerosol using the WIBS, Atmos. Meas. Tech. Discuss., <a href="https://doi.org/10.5194/amt-2018-126" target="_blank">https://doi.org/10.5194/amt-2018-126</a>, in review,
2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib42"><label>42</label><mixed-citation>
Savage, N. J., Krentz, C. E., Könemann, T., Han, T. T., Mainelis, G., Pöhlker, C., and Huffman, J. A.: Systematic
characterization and fluorescence threshold strategies for the wideband integrated bioaerosol sensor (WIBS)
using size-resolved biological and interfering particles, Atmos. Meas. Tech., 10, 4279–4302, <a href="https://doi.org/10.5194/amt-10-4279-2017" target="_blank">https://doi.org/10.5194/amt-10-4279-2017</a>, 2017.

</mixed-citation></ref-html>
<ref-html id="bib1.bib43"><label>43</label><mixed-citation>
Shiraiwa, M., Ueda, K., Pozzer, A., Lammel, G., Kampf, C. J., Fushimi, A.,
Enami, S., Arangio, A. M., Frohlich-Nowoisky, J., Fujitani, Y., Furuyama,
A., Lakey, P. S. J., Lelieveld, J., Lucas, K., Morino, Y., Poschl, U.,
Takaharna, S., Takami, A., Tong, H. J., Weber, B., Yoshino, A., and Sato,
K.: Aerosol Health Effects from Molecular to Global Scales, Environ. Sci.
Technol., 51, 13545–13567, 2017.
</mixed-citation></ref-html>
<ref-html id="bib1.bib44"><label>44</label><mixed-citation>
Sivaprakasam, V., Lin, H.-B., Huston, A. L., and Eversole, J. D.: Spectral
characterization of biological aerosol particles using two-wavelength
excited laser-induced fluorescence and elastic scattering measurements,
Opt. Exp., 19, 6191–6208, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib45"><label>45</label><mixed-citation>
Sodeau, J. R. and O'Connor, D. J.: Chapter 16 – Bioaerosol Monitoring of the
Atmosphere for Occupational and Environmental Purposes, in: Comprehensive
Analytical Chemistry, edited by: de la Guardia, M. and Armenta, S., Elsevier,
2016.
</mixed-citation></ref-html>
<ref-html id="bib1.bib46"><label>46</label><mixed-citation>
Stanley, W. R., Kaye, P. H., Foot, V. E., Barrington, S. J., Gallagher, M.,
and Gabey, A.: Continuous bioaerosol monitoring in a tropical environment
using a UV fluorescence particle spectrometer, Atmos. Sci. Lett.,
12, 195–199, 2011.
</mixed-citation></ref-html>
<ref-html id="bib1.bib47"><label>47</label><mixed-citation>
Suphioglu, C., Singh, M. B., Taylor, P., Knox, R. B., Bellomo, R., Holmes,
P., and Puy, R.: Mechanism of grass-pollen-induced asthma, Lancet, 339,
569–572, 1992.
</mixed-citation></ref-html>
<ref-html id="bib1.bib48"><label>48</label><mixed-citation>
Swanson, B. E. and Huffman, J. A.: Development and characterization of an
inexpensive single-particle fluorescence spectrometer for bioaerosol
monitoring, Opt. Exp., 26, 3646–3660, <a href="https://doi.org/10.1364/OE.26.003646" target="_blank">https://doi.org/10.1364/OE.26.003646</a>, 2018.
</mixed-citation></ref-html>
<ref-html id="bib1.bib49"><label>49</label><mixed-citation>
Taylor, P. E., Flagan, R. C., Miguel, A. G., Valenta, R., and Glovsky, M.
M.: Birch pollen rupture and the release of aerosols of respirable
allergens, Clin. Exp. Allergy, 34, 1591–1596, 2004.
</mixed-citation></ref-html>
<ref-html id="bib1.bib50"><label>50</label><mixed-citation>
Toprak, E. and Schnaiter, M.: Fluorescent biological aerosol particles measured with the Waveband Integrated Bioaerosol
Sensor WIBS-4: laboratory tests combined with a one year field study, Atmos. Chem. Phys., 13, 225–243, <a href="https://doi.org/10.5194/acp-13-225-2013" target="_blank">https://doi.org/10.5194/acp-13-225-2013</a>, 2013.
</mixed-citation></ref-html>
<ref-html id="bib1.bib51"><label>51</label><mixed-citation>
Wright, T. P., Hader, J. D., McMeeking, G. R., and Petters, M. D.: High
Relative Humidity as a Trigger for Widespread Release of Ice Nuclei, Aerosol
Sci. Technol., 48, i-v, <a href="https://doi.org/10.1080/02786826.2014.968244" target="_blank">https://doi.org/10.1080/02786826.2014.968244</a>, 2014.
</mixed-citation></ref-html>
<ref-html id="bib1.bib52"><label>52</label><mixed-citation>
Yu, X., Wang, Z., Zhang, M., Kuhn, U., Xie, Z., Cheng, Y., Pöschl, U., and Su, H.: Ambient measurement of fluorescent aerosol particles
with a WIBS in the Yangtze River Delta of China: potential impacts of combustion-related aerosol particles, Atmos. Chem. Phys., 16,
11337–11348, <a href="https://doi.org/10.5194/acp-16-11337-2016" target="_blank">https://doi.org/10.5194/acp-16-11337-2016</a>, 2016.
</mixed-citation></ref-html>--></article>
