the Creative Commons Attribution 4.0 License.

the Creative Commons Attribution 4.0 License.

# Flexible approach for quantifying average long-term changes and seasonal cycles of tropospheric trace species

### David D. Parrish

### Richard G. Derwent

### Simon O'Doherty

### Peter G. Simmonds

We present an approach for deriving a systematic
mathematical representation of the statistically significant features of the
average long-term changes and seasonal cycle of concentrations of trace
tropospheric species. The results for two illustrative data sets (time
series of baseline concentrations of ozone and N_{2}O at Mace Head,
Ireland) indicate that a limited set of seven or eight parameter values
provides this mathematical representation for both example species. This
method utilizes a power series expansion to extract more information
regarding the long-term changes than can be provided by oft-employed linear
trend analyses. In contrast, the quantification of average seasonal cycles
utilizes a Fourier series analysis that provides less detailed seasonal
cycles than are sometimes represented as 12 monthly means; including
that many parameters in the seasonal cycle representation is not usually
statistically justified, and thereby adds unnecessary “noise” to the
representation and prevents a clear analysis of the statistical uncertainty
of the results. The approach presented here is intended to maximize the
statistically significant information extracted from analyses of time series
of concentrations of tropospheric species, regarding their mean long-term
changes and seasonal cycles, including nonlinear aspects of the long-term
trends. Additional implications, advantages and limitations of this approach
are discussed.

Utilizing observations to fully characterize the four-dimensional (latitude,
longitude, altitude and time) concentration distribution of a trace
tropospheric species is a daunting prospect. This paper discusses an
analysis approach for quantification of only a small part of the full
distribution – the mean long-term changes and seasonal cycle at a particular
point in the troposphere – but it provides that quantification in an
accurate, precise and simple form. The discussion focuses on monthly mean
baseline ozone (Derwent et al., 2018a) and N_{2}O concentrations reported
for the surface site at Mace Head, Ireland, but the analysis approach is
general, and thus can be applied to other locations and trace species. This
approach extends the techniques developed in earlier publications: Parrish
et al. (2012, 2014, 2017) for long-term changes and Parrish et al. (2016)
and Derwent et al. (2016, 2018b) for seasonal cycles. This extension
provides consistently defined parameters with confidence limits that
quantify these systematic temporal variations.

From a wider, more formal perspective, Bowdalo et al. (2016) discuss the temporal variability of hourly average tropospheric ozone concentrations through an extensive spectral analysis. They identify two distinct scaling regimes of ozone variability, one at high frequencies with periods from 2 h to about 10 d, and a second at lower frequencies with 10 d to 5-year periods (the maximum period they considered). Analogous with the spectral analysis of meteorological variability, Bowdalo et al. (2016) identify the higher frequencies as the “weather” regime, driven by meteorological processes with frequencies up to those of planetary-scale weather systems. Meteorological frequencies with periods greater than ∼10 d are driven by the average of the largest planetary-scale weather systems, which they term “macroweather”. They also suggest that there would be a third “climate” regime beginning at between 10 and 100 years, caused by low-frequency interactions such as solar, volcanic or anthropogenic forcings. Since they limited their consideration to time series of 5 years, they could not identify any evidence of the climate regime. In this work we consider some of the longest available observational records (as long as 30 years) but work with monthly mean concentrations. Thus, we aim to characterize the macroweather regime and the higher-frequency fraction of the climate regime.

The focus of our analysis is on mean seasonal cycles and long-term changes spanning the complete data records. By considering monthly average data we avoid most of the influence from higher-frequency variability, although interannual variability of lower-frequency weather regime phenomena contributes “noise” to the monthly averages, and thereby affects the precision of the mean seasonal cycle and long-term change quantifications. Even when using monthly averaged data, observable autocorrelation remains in the data after accounting for the seasonal cycle and long-term changes. This autocorrelation is estimated from an autoregressive process and from accounting for the autocorrelation results in expanded error estimates for the derived parameters.

An accurate and precise quantification of the mean long-term changes and seasonal cycles of trace species distributions with well-defined confidence limits is the ultimate goal of the analysis. This quantification meets three needs: (1) it provides a robust, minimum set of parameters capturing the statistically significant information in the observational data set regarding these two temporal variations, parameters which can serve as metrics for quantitative comparisons of long-term changes and seasonal cycles between data sets collected at different locations or for different species. (2) These same metrics serve as a basis for evaluation of results from models that simulate these temporal features of atmospheric concentrations. Finally, (3) it provides a coherent, conceptual view of these features of the species' concentration distribution, a view that can provide an indication of the need for more detailed studies of particular aspects of the distribution.

Importantly, no physical model underlies the statistical analysis. Instead
we use two mathematical series to fit the long-term change (a power series)
and the seasonal cycle (a Fourier series). These series provide flexible
fits to these temporal variations, even though the functional form of these
variations is not known a priori. The data sets themselves dictate the
functional form defined by the mathematical series. To avoid over fitting
the data, the number of terms in each series is limited to only those that
are statistically significant. Without an underlying physical model, care
must be exercised in the interpretation of the derived parameter values and
in the attribution of a physical cause to any of the terms in the series.
Parrish et al. (2016) do present evidence of a direct physical cause of the
statistically significant second harmonic of the seasonal cycle of ozone in
the marine boundary layer (MBL) by showing that the photolysis rate of
ozone, i.e., *j*(O^{1}D), which drives the loss of ozone in the MBL, also has
a second harmonic of opposite phase to that of ozone's seasonal cycle.
However, this identification required information and analysis beyond that of
the time series of concentration measurements alone.

The analysis approach discussed here is exemplified through application to
two data sets collected at Mace Head, Ireland: monthly mean concentrations of
ozone (O_{3}) and nitrous oxide (N_{2}O) filtered for baseline
conditions. These two data sets provide an informative contrast – ozone has
a strong seasonal cycle with a relatively small long-term change, while
N_{2}O has a relatively small seasonal cycle superimposed on a pronounced
long-term change. Derwent et al. (2018a) fully describe the ozone data set;
it covers the 30-year period from April 1987 to April 2017. The N_{2}O data
were provided by the AGAGE program and downloaded from the public U.S.
Department of Energy (DOE) Carbon Dioxide Information Analysis Center
(CDIAC) website (https://cdiac.ess-dive.lbl.gov/ftp/ale_gage_Agage/AGAGE/gc-md/monthly/, last access: 3 December 2018); these data
cover the 22-year period from March 1994 to September 2016. All AGAGE data are
available from the public AGAGE website (http://agage.mit.edu/data, last access: 24 June 2019) and the
World Data Center for Greenhouse Gases (WDCGG) in
Japan (https://gaw.kishou.go.jp, last access: 24 June 2019), as well as on the CDIAC website.

The overall goal of the analysis is to quantify the minimum set of
parameters (including robust confidence limits) that mathematically
describes the mean long-term evolution and seasonal cycle of an atmospheric
trace species' concentrations, within the limits of statistical
significance, from a time series of measured concentrations. The minimum set
of parameters is desired because (1) it minimizes the possibility of over
fitting to the data and (2) the analysis can quantify these parameters to the
highest precision, which thus provides the most concise picture of the
variations and the most stringent metrics for comparisons of different data
sets and for model evaluation through model–measurement comparisons. In
Sect. 3.1–3.4 the analysis is illustrated through application to the Mace
Head baseline ozone observations (Derwent et al., 2018a), but the method is
generally applicable to other trace species; Sect. 3.5 illustrates the
application to the Mace Head baseline N_{2}O observations.

Here, we first quantify the long-term changes through a power series fit to the time series of monthly mean ozone concentrations (Sect. 3.1). The derived function defining the long-term changes is then used to detrend the monthly mean data (Sect. 3.2) in order to facilitate characterization of the seasonal cycle through a Fourier transform and harmonic analysis of the detrended monthly means (Sect. 3.3). Finally, a nonlinear regression fit of a function containing the statistically significant terms of both the power and Fourier series gives the most precise determination (i.e., yields the smallest confidence limits) of the derived parameters (Sect. 3.4). Sect. 3.6 through 3.8 discuss additional statistical features of the data, including their influence on the confidence limits of the derived parameter values.

Working with monthly mean data reduces the impact of autocorrelation due to weather regimes and results in residuals that are more Gaussian in nature. As long as the temporal sampling scheme does not introduce any bias to monthly means (e.g., through sparse sampling such that the data are not fully representative of the actual monthly means), restricting the analysis to monthly mean time data rather than working with higher-frequency data does not reduce the statistically significant information regarding the average long-term trends or seasonal cycles. A qualitative explanation for this can be given. Deriving monthly means from higher-frequency data (e.g., hourly or daily mean data) is an averaging process that minimizes the sum of the squares of the deviations of the higher-frequency data from the derived monthly means. The fitting procedures employed in the analysis here minimize the sum of the squares of the deviations of the monthly mean data from the derived long-term changes and seasonal cycles. The overall result is independent of whether the sum of the squares of the deviations is minimized in two steps (monthly mean calculation followed by further fits) or in one step (extracting long-term trends and seasonal cycles directly from the higher-frequency data). One example of this independence is shown in Sect. 3.1, where a fit of the long-term change function to annual mean data gives results equivalent to the fit to monthly mean data. We work with monthly mean data because they provide clearer illustrations of the method and its results, compared to higher-frequency data.

## 3.1 Long-term change analysis

A power series fit is a general and convenient means of quantifying the long-term temporal evolution of a time series of concentration measurements. This is a general approach in that no underlying assumptions are made regarding the functional form of the temporal evolution of the data set, since any continuously varying curve can be fit to any desired accuracy given enough terms in a power series. In practice, the power series fit is obtained through a nonlinear regression fit of monthly mean data to a polynomial, as indicated in Eq. (1):

The fits utilized in this work include all terms in Eq. (1) with coefficients
that are statistically significant at the 95 % confidence level. This means
that as longer data records develop, additional terms can be added and new
insights can be gained. Figure 1 shows a fit (black solid curve) to monthly
mean baseline ozone data (blue solid circles) obtained at Mace Head, Ireland
(Derwent et al., 2018a). The annotation gives the derived values (with
95 % confidence limits) for the first three coefficients of Eq. (1). The fit
to the calendar annual mean data (dotted violet line and larger violet
symbols) are also shown. Table 1 compares the coefficients derived from
these fits. For this data set, only the first three terms of Eq. (1) are
retained, as the coefficients of higher-order terms are not significantly different statistically from zero (see derived *d* parameters in Table 1). All
three parameter values derived from the two fits agree within their
confidence limits; the small differences are due to the exclusion of the
partial years of data at the beginning and end of the data record when
calculating the calendar means. The scatter of the annual means about the
fitted curve is considerably smaller than that of the monthly means (compare
root-mean-square deviation, RMSD, values in Table 1) since the variability
associated with the seasonal cycle has been removed by the annual averaging
period.

To more precisely determine the coefficients, it is important for the time
origin to be well within the time span of all the data series considered.
Here we choose the year 2000 (i.e., *t* in Eq. (1) equals 2000). If the time
origin is selected outside the time spanned by the data (an extreme example
would be year 0), the confidence limits of the derived parameters and the
absolute values of *a* and *b* (but not *c*) change, but the fitted curve does not
change. With the year 2000 chosen as the time origin, the first coefficient
(*a*, with units ppb O_{3}) is the intercept of the fitted curve at the year
2000; it quantifies the absolute magnitude of the average ozone
concentration at that year. The second coefficient (*b*, with units ppb O_{3} yr^{−1}) is the slope of the fitted curve in that same year; it gives the
best estimate of the (continually varying) time rate of change of ozone at
that particular time. Finally, the third coefficient (*c*, with units ppb O_{3} yr^{−2}) is equal to one-half of the (constant) time rate of
change of the slope of the fitted curve. This third term is important for
characterizing the nonlinear aspects of long-term behavior of the data.
Many published studies rely on various approaches to analyze long-term
trends through linear fits; the recent Tropospheric Ozone Assessment Report
(TOAR) project (Chang et al., 2017; Gaudel et al., 2018; Lefohn et al.,
2018) takes this approach. The focus of TOAR is on shorter measurement
records at hundreds of sites where only the first two terms of Eq. (1) are
statistically significant, thus their choice is appropriate. Linear trend
approaches can accurately quantify the average rate of change in
concentrations over a measurement record of any length, but do not fully
quantify the long-term temporal evolution of data sets with strong
nonlinear behavior, as is the case in the Mace Head ozone data illustrated
in Fig. 1. The choice of examining linear behavior or more complex modes
of change depends upon the purpose of the analysis; this study focuses on
deriving scientific insights into concentration changes that may be driven
by nonlinear factors.

The long-term fit to the Mace Head data finds a statistically significant,
negative value for *c*, with ozone concentrations increasing early in the data
record. The polynomial fit reaches a maximum and then decreases later in
the record. When three terms are included, Eq. (2) allows the calculation of
the year that the maximum of the fit was reached, year_{max}:

The year_{max} calculated from Eq. (2) is included in Table 1, which is
within the time period of the observational record. The physical
interpretation of the maximum derived from the fit and any extrapolation to
a maximum year outside the observational time period would depend on the
scientific understanding of the factors driving the concentration changes.
As discussed later, the apparent decrease after the derived maximum of the
fit in Fig. 1 is not statistically significant, and the existence of a
physical maximum of Mace Head ozone concentrations remains an open question.
Extrapolation of fits derived from Eq. (1) is likely misleading, since
polynomials generally diverge to large negative or positive values when
extended outside the data range used to derive the polynomial itself.

In summary, only three parameters are required to describe the long-term
changes in the Mace Head ozone data set; additional terms are not
statistically significant and are therefore omitted from the analysis. This
parameter set is *a*, *b* and *c*, or equivalently *a*, *c* and year_{max}, with the last
derived from Eq. (2). The second parameter set has more direct physical
significance for this time series. Derwent et al. (2018a) conducted a
similar analysis of the long-term change in this data set and obtained
statistically equivalent results.

## 3.2 Detrending monthly mean data

The time series of monthly mean ozone data can be detrended simply by
subtracting the second and third terms of the fit to Eq. (1) from the original
time series. As expected, no significant long-term change remains; the
average of the detrended data (39.8 ppb) agrees with the *a* parameter derived
above (i.e., the year 2000 intercept of the original fit); and their
standard deviation (5.4 ppb) agrees with the RMSD of the original monthly
means about the long-term trend fit to Eq. (1). All three statistically
significant terms of Eq. (1) could be subtracted from the monthly means, which
would give detrended data averaging zero with the same standard deviation;
subtracting only the second and third terms preserves the year 2000
intercept as the mean of the data set.

## 3.3 Seasonal cycle analysis

The quantitative analysis of the seasonal cycle has two steps; first, a Fourier analysis determines the number of statistically significant harmonic contributors to the seasonal cycle of the detrended data, and second, a least-squares fit of those data to the significant harmonic terms provides a set of parameters that quantify the seasonal cycle to the fullest extent that is statistically justified.

A Fourier transform of a time series of data captures the information of
that time series in frequency space, i.e., as a series of sine functions,
whose magnitude and phase are expressed as a sequence of complex numbers.
Plotted in Fig. 2 are results from the Fourier transform of the detrended
data. These are the magnitudes of the real parts of each term, normalized to
give the magnitude of the respective sine functions plotted as a function of
frequency. There is one point that is off-scale at zero frequency, which
gives the magnitude of the average of the detrended monthly means. The
fundamental (frequency = 1 yr^{−1}) and the second harmonic (frequency = 2 yr^{−1}) terms clearly have much greater magnitudes than any of the
other terms of non-zero frequency. Terms of frequencies < 1 yr^{−1} describe the systematic, multi-year variability that contributes
to deviations from a purely repetitive seasonal cycle in Fig. 1. There is
an indication that the third harmonic (frequency = 3 yr^{−1}) may have a
significant magnitude, but it is on the edge of statistical significance; in
the following analysis, only the fundamental and second harmonic terms will
be considered further. This approach is consistent with that of Parrish et al. (2016), who found that, at most, two terms were required to quantify the
seasonal cycle of monthly mean ozone concentrations in the marine boundary
layer and in the lower free troposphere.

The Fourier transform indicates that the seasonal cycle of the detrended data is quantitatively described by two terms – the fundamental and the second harmonic – plus a third constant term equal to the annual average. The second step in this analysis is to fit these three terms to the detrended monthly means through a least-squares regression to Eq. (3):

Figure 3 illustrates this fit for the detrended data. The second and third
terms in Eq. (3) are the fundamental and second harmonic. If the Fourier
transform indicated one or more additional harmonics terms were
statistically significant, an additional term would be added to Eq. 3 for
each additional harmonic, but for the data sets investigated here, no
additional harmonics are statistically significant. Two parameters, the
amplitude, *A*, and the phase angle, *φ*, are required to define each of
these sine functions. *y*_{o} is the annual average ozone concentration over
the entire data set, and from the discussion above, must equal both *a* (the
year 2000 intercept derived from the fit to Eq. 1) and the average of the
detrended data. In Eq. (3) the variable *χ* spans a 1-year time period
in radians from 0 to 2*π*. The parameters derived from the least-squares
fit are annotated in Fig. 3; they agree closely with those derived for
Mace Head by Parrish et al. (2016) (see their Table 2). The small
differences between the results here and in that earlier work are due to the
baseline reanalysis and extra years of measurements (Derwent et al., 2018a),
now available from Mace Head. The extra years of measurements have resulted
in noticeably smaller confidence limits for most of the derived parameters.
Derwent et al. (2018a) conducted a similar analysis of this seasonal cycle
and obtained statistically equivalent results.

## 3.4 Improved confidence limits through simultaneous long-term change and seasonal cycle analysis

It is possible (and preferable) to do a simultaneous fit to the long-term change and seasonal cycle by utilizing an iterative, nonlinear regression to Eq. (4), which combines the statistically significant terms of Eqs. (1) and (3), giving a total of seven (or eight) parameter values:

where residuals represent the unexplained portion of the data and will be examined
in Sect. 3.6. Whether the fourth term (or even additional terms) of Eq. (1)
are included in Eq. (4) depends upon whether each additional term is
statistically significant. The violet curve in Fig. 4 shows the fit of Eq. (4), and the values derived for the seven parameters are annotated. (For ozone
the fourth terms in Eqs. (1) and (4) are not statistically significant, so no
value is given for the *d* parameter.) The results here are nearly identical to
those discussed earlier, except the confidence limits for the $a,b,c$ parameters
are smaller than those derived in the analysis illustrated in Fig. 1 (see
Table 2). This improvement is due to simultaneously treating the two
systematic sources of data variability (i.e., the long-term change and the
seasonal cycle).

## 3.5 Analysis of nitrous oxide time series

The preceding sections developed and illustrated the application of Eqs. (1)
through (4) for a time series of monthly mean ozone concentration data, but in
principle these equations and analysis approaches can be applied to a series
of measurements of any trace species. For example, Fig. 5 illustrates the
analogous analysis of the time series of monthly mean, baseline-selected
nitrous oxide (N_{2}O) measurements from Mace Head. This time series
(Fig. 5a) is significantly different from that of ozone, with the
long-term change dominating the variability of the data, perturbed by only a
relatively small seasonal cycle. Further, with ozone only three terms of Eq. (1) are statistically significant but for N_{2}O the fourth (cubic) term is
also statistically significant (while higher-order terms are not). Figure 5a
shows fits of Eq. (1) for two, three and four terms; it is difficult to discern
the differences between these fits in the figure, but as the annotations
indicate, these differences are statistically significant. For N_{2}O the
quadratic term in the three-term fit is positive, indicating that the rate
of increase in nitrous oxide has, on average, accelerated over the
measurement record in contrast to ozone whose rate of increase decelerated.
The statistically significant cubic term shows that the acceleration of the
rate of increase has not been constant over the measurement record; Sect. 3.7 discusses these issues in more detail.

The N_{2}O data can be detrended as for ozone by subtracting the second,
third and fourth terms of the fit to Eq. (1) from the time series (results not
shown). The Fourier transform of the detrended data (Fig. 5b) is similar
to that of ozone in that the only important harmonic terms are the
fundamental and second harmonic. For comparison the Fourier transform
results are shown for data detrended with both the cubic and quadratic fits;
the magnitudes at frequencies < 1 yr^{−1} are significantly
smaller for the cubic fit compared to the quadratic fit, reflecting the
reduced variability of the monthly mean data about the cubic fit. Also, for
N_{2}O the magnitudes at frequencies < 1 yr^{−1} are relatively
large compared to those for ozone (Fig. 2). These larger magnitudes reflect the
noticeable interannual departures of the N_{2}O monthly means from the
fitted curve (violet) in Fig. 5d. For example, in the years near 2000, the
data appear to be significantly smaller than the fit before 2000, and higher
after that year. Investigating statistically significant departures, such as
this example, may yield additional information regarding sources or sinks of
N_{2}O (or other trace species investigated through this analysis
approach).

The fit of the detrended data to Eq. (3) to define the seasonal cycle (Fig. 5c) is also similar to that of ozone in that the phases of the two harmonics are similar (see Table 2) for these two species, agreeing (or nearly agreeing) within their confidence limits. This close correspondence is consistent with the long-standing observation that many trace gases show a springtime maximum and a summertime minimum at Mace Head (e.g., Derwent et al., 1998); the cause(s) of this correspondence is an issue warranting further investigation. Finally, Fig. 5d illustrates the fit of the original data (plotted in Fig. 5a) to both the long-term change and the seasonal cycle as defined by Eq. (4), with the inclusion of the cubic term from Eq. (1).

## 3.6 Autocorrelation and parameter confidence limits

Through the preceding discussion the statistical fitting ignored any autocorrelation in the data. Systematic intra- or inter-annual variability associated with persistent meteorological and/or climate variability could possibly cause autocorrelation in these data sets. If such autocorrelation is significant, we expect that the derived parameter values would not be significantly affected, but the confidence limits derived for those parameter values would be unrealistically small. Parrish et al. (2016) considered this issue for ozone data sets from several sites within the marine boundary layer throughout the globe and found it to only have small influence. Here we discuss this issue from a more general perspective and illustrate this discussion through the two example Mace Head data sets.

The time series of the residuals (i.e., the deviations between the monthly
mean baseline ozone concentrations and the fit of Eq. (4) to these means) for
the two example data sets are shown in Figs. 6a and 7a. For N_{2}O the
residuals are shown for both the quadratic and cubic fits to the long-term
change. The characteristics of these time series differ noticeably; the
N_{2}O residuals (Fig. 7a) show much more coherent variability than is
apparent in the more chaotic ozone residuals (Fig. 6a).

The autocorrelations of the time series are shown in Figs. 6b and 7b. Each
plot shows the correlation of the time series of the monthly means with a
duplicate of itself as a function of a time offset (i.e., month lag) between
the time series and its duplicate. When the lag is zero, the correlation is
perfect (i.e., autocorrelation coefficient = 1), and as the lag increases,
the autocorrelation coefficient decreases. These plots differ markedly
between the two species. As expected, the more chaotic time series of the
ozone residuals shows smaller autocorrelation coefficients (Fig. 6b),
decreasing in an approximately exponential manner with a time constant (tau)
≈1 month as the time offset increases. The autocorrelation of the
N_{2}O residuals (Fig. 7b) is greater, also decreasing in an
approximately exponential manner with tau ≈4 and 10 months for the
cubic and quadratic fits, respectively). Leith (1973) discusses the degree
to which autocorrelation affects the confidence limits of parameters derived
from observational time series and finds that the confidence limits
increase proportionally to (2 tau)^{1∕2}. Thus, for the two example data
sets discussed here, the confidence limits annotated in Figs. 1 and 3–5
and included in Table 1 must be increased by a “correction factor” of 1.4
and 2.9 for ozone and N_{2}O (for cubic long-term change fit),
respectively; Table 2 gives the corrected confidence limits for the final
values derived for the seven or eight parameters.

As is common to all basic treatments of error propagation, the confidence limit analysis presented here is based on the assumption that the residuals of the fits are Gaussian distributed. The nearly linear relationships in Figs. 6c and 7c (at least for the cubic long-term change fit) give a qualitative indication that this assumption is approximately valid. Each time series has a few apparent outliers of unknown cause; since the prevalence of these outliers is small (no more than 1 % to 2 %), and they are not greatly outside the general distribution, their influence is believed to be minor and will not be considered further.

It is notable that Fig. 7 clearly reflects the improvement made by the
addition of the cubic term to the long-term change fit for the N_{2}O time
series. The standard deviation of the residuals is reduced, the degree of
autocorrelation is reduced, which results in improved confidence limits for
all of the derived parameters, and the residuals are more closely fit by a
Gaussian distribution.

## 3.7 Rate of change of concentrations

Estimates of the rate of change of the mean concentrations of ozone (or other species) can be derived through differentiation of Eq. (1) to give Eq. (5):

where the third term applies when a cubic fit to the long-term change is
statistically justified. Acceleration or deceleration in the rates of change
of a species may contain information regarding changes in the magnitude of
sources or sinks of the species, and thus may lead to improved physical
understanding of the processes that determine the observed atmospheric
concentrations. The quadratic fits to the two species indicate that over
their respective data records, the rate of increase on average decelerated
at ∼0.04 ppb yr^{−2} for ozone and accelerated at
∼0.01 ppb yr^{−2} for N_{2}O. The deceleration reversed
the trend of ozone from an increase of ∼0.8 ppb yr^{−1} to
a decrease of ∼0.3 ppb yr^{−1} over the 30-year data
record, while the acceleration increased the trend of N_{2}O from
∼0.9 to ∼1.1 ppb yr^{−1} over
the 23-year record. The statistically significant value of the cubic term
(i.e., the positive value of the *d* parameter) indicates that the acceleration
of the N_{2}O rate of increase was not uniform; the rate increased more
slowly near the middle (minimum ∼2002) than at the beginning
and end of the data record. In contrast, no statistically significant change
can be discerned in the deceleration of the trend derived from the ozone
data record. A caution to this discussion should be noted – Eq. (5) is
obtained by differentiation of an equation, whose parameters are derived
from fits to data sets that have significant unexplained variability. As
noted previously, Eq. (5) is not based on a physical model; hence, the above
discussion regarding the rate of change must be considered cautiously, as
discussed in Sect. 4.

## 3.8 Sources of variance of data sets

The squares of the standard deviations of the original data sets that are
annotated in Figs. 1 and 5a give the total variance in the original data,
and the square of the RMSD values that are annotated for all of the
illustrated fits provide an approximate measure of the variance remaining in
the data set after accounting for the average long-term change and/or the
seasonal cycle. Table 3 summarizes the fraction of the original variance of
the monthly mean time series due to the average long-term changes and
seasonal cycle. Despite the obvious differences of the data records in
Figs. 1 and 5a, the variance of the ozone and N_{2}O data sets are
similar (36 and 29 ppb^{2}, respectively); however, the source of that
variance is quite different. The average long-term change accounts for only
19 % of the ozone variance but 99.7 % of the N_{2}O variance, while
the seasonal cycle accounts for 58 % and 0.19 % of the ozone and
N_{2}O variance. The residuals thus account for the remaining 23 % and
0.12 % of the variance; these residuals represent systematic interannual
variability, i.e., the lower-frequency macroweather regime of Bowdalo et al. (2016), and any other effects leading to variability in the data record
(including any measurement errors or analysis biases).

The analysis approach presented in this work derives a limited set of
parameter values that defines a mathematical representation of the
statistically significant features of the mean long-term changes and
seasonal cycles of the concentrations of trace tropospheric species. The
results for the two example data sets (baseline concentrations of ozone and
N_{2}O at Mace Head, Ireland) selected to illustrate the analysis show that
no more than the seven or eight parameter values included in Table 2 are
needed for this mathematical representation. Three or four parameters (the
coefficients of the polynomials given by the first three or four terms of
Eq. 1) quantify the long-term changes, including the absolute concentration
in the reference year 2000, and four parameters (the amplitude and phase of
the two harmonic terms of Eq. 3) quantify the seasonal cycle. These
parameters provide a minimum set of parameters that capture the
statistically significant information in the observational data set
regarding these temporal variations. These parameters can serve as metrics
for quantitative comparisons of long-term changes and seasonal cycles
between different locations or for different species and can serve as a
basis for evaluation of model simulations atmospheric concentration
variations.

In the quantification of mean long-term concentration changes, the method
presented provides statistically significant information not given by linear
trend analysis, an approach often employed to quantify long-term trends.
Linear analysis does provide a quantification of the average annual rate of
change of a species' concentration over the time span of the measurement
record, but does not generally provide information about any statistically
significant changes in the rate of concentration change (i.e., acceleration
or deceleration of the rate of concentration increase or decrease) within
the data record. For baseline ozone concentrations, such changes of rate
have been identified as quite important, as shown here in Fig. 1, and have
been quantified in earlier work (Logan et al., 2012; Parrish et al., 2012,
2014, 2017; Derwent et al., 2018a); these analyses show an increase early in
the data record that slows with concentrations reaching maxima, followed by
decreases in the latter part of the record. In such cases, Eq. (2) provides an
estimate of the year when the maximum concentration was reached. The Mace
Head N_{2}O record gives a contrasting result, with a statistically
significant acceleration of the rate of increase throughout the data record,
which is in agreement with an independent analysis of N_{2}O trends and also identifies a significant acceleration of a similar magnitude (Rona
Thompson, personal communication, 2018).

In the quantification of average seasonal cycles, the method presented here provides less detailed seasonal cycles than are sometimes derived in analyses of average seasonal cycles. Here it is shown that only four statistically significant, independent pieces of information are needed to quantify the mean seasonal cycles in the example data sets. Published studies often represent the seasonal cycle as 12 monthly means; such an approach implicitly assumes that there are 12 statistically significant, independent pieces of information available from the mean seasonal cycle. At least for the example data sets examined in this and earlier work, (e.g., Parrish et al., 2016) the mean seasonal cycles are well described with no more than 4 independent pieces of information (i.e., independent parameter values) that can be extracted from the analysis. Including 12 monthly means in the seasonal cycle representation adds statistically insignificant variability to the results, and thus over fits to the available data, preventing a clear analysis of the statistical uncertainty of those results. For the greatest statistical significance of the description of the seasonal cycle, we recommend a harmonic analysis that includes only significant terms, as exemplified in the method presented in this work. However, it is possible that other data sets from different locations may warrant more or fewer terms.

The analysis approach presented here is based on nonlinear regression fits
to Eq. (4), which assumes a number of properties about the behavior of the
data, including that the data behave in a smooth manner, that the long-term
change is independent of season and that the seasonal cycle is stable over
the data record. Should any of these assumptions fail or additional
information be desired, the equation could be modified appropriately. The
analysis derived a minimum set of statistically significant parameters that
capture as much statistically significant information as possible from the
original data sets while avoiding over fitting the data. However, no
physical model underlies Eq. (4), so physical interpretation of the parameter
values and extrapolation of the functional fit must be done only very
cautiously. For example, the fit to the long-term changes in the Mace Head
ozone record indicates that maximum ozone concentrations occurred within 2.2
years of 2008.7, i.e., within ∼26 months of 1 September 2008,
and that after that maximum ozone concentrations have been decreasing. The
question arises as to whether the maximum and the subsequent decrease are
physically real or are simply mathematical implications of the three-term
polynomial utilized to fit the data. Supplementary trend analyses indicate
that (1) there has been no statistically significant trend after the year
2000 (average trend $=-\mathrm{0.015}\pm \mathrm{0.070}$ ppb yr^{−1}), so it is at
least clear that the positive trend in the early years of the data record
has ended, and (2) that any decrease after the derived maximum in the year
2008.7 is not statistically significant. Thus, answering the above question
requires additional information that perhaps may come from additional years
of data collected at Mace Head or analysis of other ozone data sets that
can be considered to reflect the same physical driving forces as those at
Mace Head.

The ozone data are available
from Derwent et al. (2018a) and the N_{2}O data are available from the
AGAGE, CDIAC and WDCGG websites as described in Sect. 2.

DDP conceived and performed the analysis, and prepared the manuscript and figures. RGD, SO'D and PGS developed the data sets over decades. All four authors contributed to the paper's discussion.

The authors declare that they have no conflict of interest.

The authors gratefully acknowledge the cooperation and efforts of the operators of the Mace Head Atmospheric Research Station and their support staff. The research facilities at Mace Head, Ireland, were generously provided by the School of Physics, National University of Ireland, Galway. Betsy Weatherhead of Jupiter, Rona Thompson of NILU and Alistair Manning of the UK Met Office provided helpful discussions.

This research has been supported by the Guangdong Innovative and Entrepreneurial Research Team Program (Research team on atmospheric environmental roles and effects of carbonaceous species, grant no. 2016ZT06N263).

This paper was edited by Keding Lu and reviewed by two anonymous referees.

Bowdalo, D. R., Evans, M. J., and Sofen, E. D.: Spectral analysis of atmospheric composition: application to surface ozone model–measurement comparisons, Atmos. Chem. Phys., 16, 8295–8308, https://doi.org/10.5194/acp-16-8295-2016, 2016.

Chang, K.-L., Petropavlovskikh, I., Cooper, O. R., Schultz, M. G., and Wang, T.: Regional trend analysis of surface ozone observations from monitoring networks in eastern North America, Europe and East Asia, Elem. Sci. Anth., 5, 50, https://doi.org/10.1525/elementa.243, 2017.

Derwent, R. G., Simmonds, P. G., Seuring, S., and Dimmer, C.: Observation and interpretation of the seasonal cycles in the surface concentrations of ozone and carbon monoxide at Mace Head, Ireland from 1990 to 1994, Atmos. Environ., 32, 145–157, 1998.

Derwent, R. G., Parrish, D. D., Galbally, I. E., Stevenson, D. S., Doherty, R. M., Young, P. J., and Shallcross, D. E.: Interhemispheric differences in seasonal cycles of tropospheric ozone in the marine boundary layer: Observation-model comparisons, J. Geophys. Res.-Atmos., 121, 11075–11085, https://doi.org/10.1002/2016JD024836, 2016.

Derwent, R. G., Manning, A. J., Simmonds, P. G., Spain, T. G., and O'Doherty, S.: Long-term trends in ozone in baseline and European regionally-polluted air at Mace Head, Ireland over a 30-year period, Atmos. Environ., 179, 279–287, https://doi.org/10.1016/j.atmosenv.2018.02.024, 2018a.

Derwent, R. G., Parrish, D. D., Galbally, I. E., Stevenson, D. S., Doherty, R. M., Naik, V., and Young, P. J.: Uncertainties in models of tropospheric ozone based on Monte Carlo analysis: Tropospheric ozone burdens, atmospheric lifetimes and surface distributions, Atmos. Environ., 180, 93–102, https://doi.org/10.1016/j.atmosenv.2018.02.047, 2018b.

Gaudel, A. , Cooper, O. R., Ancellet, G., Barret, B., Boynard, A., Burrows, J. P., Clerbaux, C., Coheur, P.-F., Cuesta, J., Cuevas, E., Doniki, S., Dufour, G., Ebojie, F., Foret, G., Garcia, O., Granados Muños, M. J., Hannigan, J. W., Hase, F., Huang, G., Hassler, B., Hurtmans, D., Jaffe, D., Jones, N., Kalabokas, P., Kerridge, B., Kulawik, S. S., Latter, B., Leblanc, T., Le Flochmoën, E., Lin, W., Liu, J., Liu, X., Mahieu, E., McClure-Begley, A., Neu, J. L., Osman, M., Palm, M., Petetin, H., Petropavlovskikh, I., Querel, R., Rahpoe, N., Rozanov, A., Schultz, M. G., Schwab, J., Siddans, R., Smale, D., Steinbacher, M., Tanimoto, H., Tarasick, D. W., Thouret, V., Thompson, A. M., Trickl, T., Weatherhead, E., Wespes, C., Worden, H. M., Vigouroux, C., Xu, X., Zeng, G., and Ziemke, J.: Tropospheric Ozone Assessment Report: Present-day distribution and trends of tropospheric ozone relevant to climate and global atmospheric chemistry model evaluation, Elem. Sci. Anth., 6, 39, https://doi.org/10.1525/elementa.291, 2018.

Lefohn, A. S., Malley, C. S., Smith, L., Wells, B., Hazucha, M., Simon, H., Naik, V., Mills, G., Schultz, M. G., Paoletti, E., De Marco, A., Xu, X., Zhang, L., Wang, T., Neufeld, H. S., Musselman, R. C., Tarasick, D., Brauer, M., Feng, Z., Tang, H., Kobayashi, K., Sicard, P., Solberg, S., and Gerosa, G.: Tropospheric ozone assessment report: Global ozone metrics for climate change, human health, and crop/ecosystem research, Elem. Sci. Anth., 6, 28, https://doi.org/10.1525/elementa.279, 2018.

Leith, C. E.: The standard error of time-averaged estimates of climatic means, J. Appl. Meteorol., 12, 1066–1069 1973.

Logan, J. A., Staehelin, J., Megretskaia, I. A., Cammas, J.-P., Thouret, V., Claude, H., De Backer, H., Steinbacher, M., Scheel, H.-E., Stübi, R., Fröhlich, M., and Derwent, R.: Changes in ozone over Europe: Analysis of ozone measurements from sondes, regular aircraft (MOZAIC) and alpine surface sites, J. Geophys. Res., 117, D09301, https://doi.org/10.1029/2011JD016952, 2012.

Parrish, D. D., Law, K. S., Staehelin, J., Derwent, R., Cooper, O. R., Tanimoto, H., Volz-Thomas, A., Gilge, S., Scheel, H.-E., Steinbacher, M., and Chan, E.: Long-term changes in lower tropospheric baseline ozone concentrations at northern mid-latitudes, Atmos. Chem. Phys., 12, 11485–11504, https://doi.org/10.5194/acp-12-11485-2012, 2012.

Parrish, D. D., Lamarque, J.-F., Naik, V., Horowitz, L., Shindell, D. T., Staehelin, J., Derwent, R., Cooper, O. R., Tanimoto, H., Volz-Thomas, A., Gilge, S., Scheel, H.-E., Steinbacher, M., and Fröhlich, M.: Long-term changes in lower tropospheric baseline ozone concentrations: Comparing chemistry-climate models and observations at northern midlatitudes, J. Geophys. Res.-Atmos., 119, 5719–5736, https://doi.org/10.1002/2013JD021435, 2014.

Parrish, D. D., Galbally, I. E., Lamarque, J.-F., Naik, V., Horowitz, L., Shindell, D. T., Oltmans, S. J.,
Derwent, R., Tanimoto, H., Labuschagne, C., and Cupeiro, M.: Seasonal cycles of O_{3} in the marine boundary
layer: Observation and model simulation comparisons, J. Geophys. Res.-Atmos., 121, 538–557, https://doi.org/10.1002/2015JD024101, 2016.

Parrish, D. D., Petropavlovskikh, I., and Oltmans, S. J.: Reversal of long-term trend in baseline ozone concentrations at the North American West Coast, Geophys. Res. Lett., 44, 10675–10681, https://doi.org/10.1002/2017GL074960, 2017.