Spatio-temporal pattern analysis of MOPITT total column CO using varimax rotation and singular spectrum analysis

McKinnon, John; Roychoudhury, Chayan; Arellano Jr., Avelino; Gaubert, Benjamin

doi:10.5194/amt-19-2529-2026

Articles | Volume 19, issue 7

https://doi.org/10.5194/amt-19-2529-2026

Articles | Volume 19, issue 7

Research article

16 Apr 2026

Research article |

| 16 Apr 2026

Spatio-temporal pattern analysis of MOPITT total column CO using varimax rotation and singular spectrum analysis

John McKinnon, Chayan Roychoudhury, Avelino Arellano Jr., and Benjamin Gaubert

Abstract

In this paper, we apply varimax Empirical Orthogonal Function (EOF) analysis to Measurement of Pollution in the Troposphere (MOPITT) total column CO retrievals to answer the question of whether or not it is possible to disentangle the dominant CO sources associated with inferred modes of variability at regional to global scales. This study aims to highlight the strengths and limitations of EOF analysis, specifically their usage in the field of atmospheric chemistry. In particular, we highlight an instance where varimax rotation fails to reveal additional spatial structure of a dataset. Additionally, we emphasize to the reader that the success of EOF analysis to determine linearly independent physical modes relies on both the statistical distribution of the dataset as well as its temporal covariance structure. We analyzed daily MOPITT Version 8 Level 3 joint (TIR-NIR) products from 2005 to 2018, aggregated every 8 d on a 1° by 1° grid. Our findings show that EOF patterns of MOPITT CO are consistent across various regional subdomains, demonstrating that these spatial patterns are independent of the chosen domain. A comparison of the eigenvalue spectrum reveals that unrotated EOF analysis yields three distinct modes, while varimax rotation reduces these to two. The power spectra of the principal components indicate that the first two unrotated modes are primarily driven by annual and semi-annual cycles, while the third mode reflects seasonal variations occurring over roughly three months. To further isolate these modes, we employed singular spectrum analysis (SSA) at each grid point to generate long-term, seasonal, and residual EOF patterns. The power spectrum analysis of the principal components shows that the long-term EOFs replicate the original two dominant modes, while the seasonal EOFs reveal significant variations over 2 to 3 months, and the residual modes exhibit time scales of 2 months or shorter. By plotting the mean skewness field, we show the dataset is non-Gaussian, leading us to conclude its principal components are time-dependent despite being uncorrelated. The periodic decay observed in the temporal autocorrelation function for each time series suggests a classification of wide-sense cyclostationary behavior. The non-stationarity of each time series and the temporal dependence of modes lead us to conclude that EOF analysis alone cannot fully disentangle individual CO sources. Careful consideration is therefore required when interpreting EOF patterns of other composition datasets with similar characteristics.

Download & links

Article (PDF, 5252 KB)

Download & links

How to cite.

Received: 05 Nov 2024 – Discussion started: 06 Jan 2025 – Revised: 17 Oct 2025 – Accepted: 11 Nov 2025 – Published: 16 Apr 2026

1 Introduction

The Role of Carbon Monoxide in Atmospheric Composition

Carbon monoxide (CO) plays an important role in tropospheric chemistry and composition. Firstly, its reaction with the hydroxyl radical (OH) has a significant impact on the atmospheric oxidizing capacity. CO as well as non-methane volatile organic compounds (NMVOCs) are both responsible for positive climate forcing because they raise tropospheric ozone (O₃) and methane (CH₄) by creating a global sink for OH (Fiore et al., 2012). The spatiotemporal variations in CO (and hence OH) can lead to changes in O₃ abundance depending on whether the concentration of nitrogen oxides (NO_x) is low or high (Sillman et al., 1990; Jin et al., 2017). Regions of high NO_x experience O₃ production while clean areas with low NO_x experience the destruction of O₃ (Holloway et al., 2000). Because OH is a photochemical sink of CO through oxidation, its global distribution determines the lifetime of CO and therefore, values of CO are lowest with the shortest lifetime in regions with the highest OH. Secondly, CO can also serve as a tracer of pollution transport given its medium-length lifetime (weeks to months) (Cicerone, 1988). Mainly a product of incomplete combustion of wood and fossil fuels, CO is co-emitted with nitrogen dioxide (NO₂), black carbon (BC), and carbon dioxide (CO₂). It is more concentrated in the atmospheric boundary layer (with less in the stratosphere) so observing CO can be useful as well to study the exchange of these gases and aerosols across the troposphere. Because CO is the main sink of OH (Gaubert et al., 2020), it is important for quantifying losses of CH₄ in the troposphere as well (Myhre and Shindell, 2014; Gaubert et al., 2016, 2017; Nguyen et al., 2020). The lack of constraints on OH spatiotemporal variability as well as the interannual variability of CH₄ have both made it difficult to accurately determine the global CH₄ budget (Saunois et al., 2016; Turner et al., 2019; Prather and Holmes, 2017). This means there is a need to reduce the uncertainty of primary drivers of OH, including CO, O₃, H₂O, NO_x, and NMVOCs (Gaubert et al., 2020).

Determining the spatial and temporal distribution of CO, along with the distribution of co-emitted species including NO₂, CO₂, CH₄ and aerosols, helps us to understand trends of combustion, biomass burning, and other sources of air pollution. For example, Lin et al. (2020 b) uses a combined mean and standard deviation approach for total column CO to classify local regions of Southeast and East Asia based on combustion activity into areas of intense urbanization, large-scale biomass burning, regions undergoing significant change from one type to another, and clean regions. Raman and Arellano (2017) utilized variations in enhancement ratios of elemental carbon (EC), CO, and NO₂ to investigate processes driving EC levels across the United States. Tang and Arellano Jr. (2017) demonstrated that local enhancement ratios of $Δ CO / Δ {NO}_{2}$ can distinguish different combustion types, identifying whether or not a fire was in a flaming phase or smoldering phase while Silva et al. (2013) used $Δ CO / Δ {CO}_{2}$ to identify combustion efficiency patterns in urban agglomeration. Building on this, Silva and Arellano (2017) combined $Δ CO / Δ {CO}_{2}$ and $Δ CO / Δ {NO}_{2}$ to better characterize regional-scale combustion process while Tang et al. (2019) used $Δ CO / Δ {NO}_{2}$ and $Δ CO / Δ {SO}_{2}$ to track the common combustion emission pathways across China. Recently, Mottungan et al. (2024) demonstrated the value of multi-species enhancement ratios (CO, CH₄, CO₂) along with multivariate regression methods to differentiate between regional and local influences, and identify the dominant processes driving these patterns.

However, determining accurate trends in air pollution is difficult in practice for many different reasons. While there has been an increasing availability of satellite measurements of trace gases and aerosols in the recent decade, most of these remote sensing retrievals cannot, in general, tell us where their source or sink was due to the diffusive nature of atmospheric transport (Murayama et al., 2004). Another challenge is that space-based CO measurements often lack accuracy in areas with persistent cloud cover and near the surface, where thermal infrared sensors struggle due to interference from surface radiation and limited retrieval sensitivity (Clerbaux et al., 2008). These significant sources of uncertainty require us to cross-validate satellites across different instruments as well as with ground and aircraft measurements during field campaigns, and are especially problematic over areas such as Sub-Saharan Africa, where there are very few air quality monitoring stations (Malings et al., 2020; Gualtieri et al., 2024). Determining the background value of constituents is especially difficult on a global scale because it changes across time and space based on the sources and sinks for a given region and chemical regime (Jones et al., 2003; Wang et al., 2018; Borsdorff et al., 2019). Because satellites measure the total amount of a constituent rather than the source of emission for a given region, it is difficult to tell whether local changes in the overall trend are due to anthropogenic or biogenic sources or the background or residual burden (Novelli, 1999; Tang et al., 2022). As discussed in Buchholz et al. (2021) there are no overall trends in either CO or aerosol optical depth (AOD) in Central/Southern Africa despite the increasing trend in anthropogenic combustion in Central Africa as determined by Zheng et al. (2019) which may be counteracting the global decrease in CO. Buchholz et al. (2021) have also noted that the increase in burning in the region as shown in Andela et al. (2017) may be potentially counteracting transport.

There are several complex challenges when determining emission inventories. As discussed by Gaubert et al. (2020), emission inventories in China are low in comparison to in situ observations in the case of both forward and inverse models, and in particular there is an underestimation of CO despite having revised inventories (Li et al., 2017; Feng et al., 2020; Kong et al., 2020) as explained by an underestimation of residential coal combustion from either heating or cooling (Chen et al., 2017; Cheng et al., 2017; Zhi et al., 2017). Even though this underestimation of fossil fuel burning does explain the underestimation of Northern Hemisphere (NH) CO in global models (Shindell et al., 2006), this is confounded by the large difference in modeled variability of the regional distributions of OH in the NH, which could offer an alternative explanation (Naik et al., 2013; Young, 2013). Gaubert et al. (2020) used observations from the Korea-United States Air Quality (KORUS-AQ) field campaign to investigate this underestimate over East Asia and found that assimilation of CO improves the representation of OH in global chemical transport models by correcting OH $/$ HO₂ partitioning. This also implies that assimilation of CO should help study coupled CH₄-CO-OH reactions, which allow for chemical feedback. Effectively distinguishing the signature of secondary CO from the influence of surface emissions – whether anthropogenic, biogenic, or from fires – is crucial for enhancing the reliability of source attribution studies, including inverse modeling for air quality assessments and chemistry-climate simulations (Gaubert et al., 2016, 2017; Worden et al., 2019; Gaubert et al., 2020; Miyazaki et al., 2020; Gaubert et al., 2023). This separation also aids in the physical interpretation of observed air pollution patterns, with important implications for understanding the distributions of both OH and CH₄ (Gaubert et al., 2017; Vimont et al., 2019; Hooghiem et al., 2025).

To address some of these challenges, we attempt to answer the following questions in this study:

How can we use the spatial and temporal distribution of CO to detect the difference between anthropogenic and natural variability of CO?
Can the global background of CO be disentangled from the local-to-regional influences on observed CO abundance?

We use the term “global background” to refer to a mathematical, non-physical quantity broadly represented as an average. This is different from terminology frequently used in literature. For example, Menichini et al. (2007) defines a background concentration as the observed concentration at a site “not affected by local sources of pollution”. More recently, Han et al. (2015) described background concentration as the collective contribution of sources from regional, anthropogenic, natural emissions, and long-range transport.

To attempt an answer to these questions, we use Empirical Orthogonal Function (EOF) analysis, which is a pattern analysis method to determine the dominant spatio-temporal patterns of a dataset (Hannachi et al., 2007). To avoid confusion with language, we clarify that the term EOF analysis refers to the usage of principal component analysis (PCA) (a statistical technique used to reduce dimensionality) when it is applied to the fields of climate science and geophysics. While the two methods are virtually identical, the use of EOF analysis implies that the principal components being found are not just linearly uncorrelated, but they are also being used to explain the maximum amount of variance possible in the original dataset.

EOF analysis has been extensively utilized in climatology, for instance Wallace and Gutzler (1981) performed an EOF analysis of geopotential height to generate what we now refer to as atmospheric teleconnections. It has had fewer applications to atmospheric chemistry, but for instance Li et al. (2009) uses varimax rotated EOF analysis of de-seasonalized aerosol index to isolate major dust and biomass burning regions and dust transport. Li et al. (2013) performed a similar analysis on AOD datasets. Lin et al. (2020 a) also verified the result of their classification of urban and biomass burning regions using EOF analysis. Yin et al. (2019) used EOF analysis to analyze the dominant patterns in summer O₃ pollution over Eastern China and their relationship to atmospheric circulations. EOF analysis has also seen applications to air quality monitoring. For instance, Eder et al. (1993) uses the principal components corresponding to rotated modes to analyze seasonality of surface ozone concentrations in the Eastern United States. Pires et al. (2008) used principal component analysis together with cluster analysis to examine monitoring sites and identify emission sources as well as cities with similar pollution behavior. They successfully applied this method to several species, including CO, O₃, and NO₂.

Our use of EOF analysis is especially motivated by studies that investigate the global CO budget through source attribution, for example, see e.g. Granier et al. (1999); Holloway et al. (2000); Duncan et al. (2007); Zheng et al. (2019); Lichtig et al. (2024). Holloway et al. (2000) used three-dimensional-modeled CO to study the total CO budget by simulating the tagged contributions from fossil fuel, biomass burning, methane oxidation, and biogenic sources. In doing so, Holloway et al. (2000) was able to reproduce global patterns in the CO seasonal cycle, most importantly the interhemispheric gradient observed in total CO measurements. The same gradient was also detected in the fossil fuel tag, which showed that the global CO seasonal cycle can be thought of as a convolution of the seasonal cycles for individual source terms. Our approach in this study is similar; however, we begin with satellite-based total column CO to separate it into patterns that represent the variability of its individual sources and sinks. This is precisely the class of problem that EOF analysis was designed to solve. Our focus in this paper is to explore the use of EOF analysis on satellite data of atmospheric composition in an attempt to separate anthropogenic and biogenic modes of variability from a global background field. Our results find that while EOFs have proven useful in climatological studies before, using it on atmospheric composition has significant limitations that require careful considerations, as detailed in our study.

2 Data and Methods

2.1 Total Column CO from MOPITT

We use total column CO data from the Measurements of Pollution in the Troposphere (MOPITT) instrument, which uses a correlation radiometer from the NASA Terra instrument. Terra flies at a normal altitude of 705 km in a Sun-synchronous polar orbit that passes over the equator at approximately 10:30 and 22:30 local time. The temporal resolution is daily, which has been averaged into 8 d periods, and the spatial pixel resolution is 22 km × 22 km at nadir, resampled to 1° from L3 products. The cross-track scanning angle is ±26° and yields a swath of approximately 640 km with global coverage every 3 d.

The radiometer performs gas correlation spectrometry for broadband measurements in thermal infrared (TIR) at approximately 2140 cm⁻¹ and near-infrared (NIR) around 4725 cm⁻¹ (Drummond et al., 2010; Buchholz et al., 2021). This is done by modulating either pressure or length of a correlation cell filled with the gas of the target to determine spectral line differences. The TIR is measured using terrestrial thermal radiation, while the overtone band from NIR is measured from reflected solar radiation and enhances retrievals, especially near the surface. Products are available in thermal infrared only, near-infrared only, or joint TIR-NIR, although NIR signals are only available during the day and over land (Buchholz et al., 2017). For this study, we use Level 3 data using the V8 retrieval algorithm for TIR-NIR. Level 2 and Level 3 products for MOPITT for the V8 retrieval algorithms (and their most recent version, V9) are publicly available through repositories located at https://terra.nasa.gov/data/mopitt-data (last access: 20 November 2025).

The V8 product includes updates to MODIS cloud cover Collection version 6.1, spectral data for H₂O and N₂ as well as aircraft data from NOAA (Deeter et al., 2019). This enables a radiance-based bias correction and accounts for both temporal bias drift and bias as a result of water vapor. Daytime retrievals of total column CO for V8 are stable and have a nominal drift of $- 0.015 \pm 0.061 %$ relative to NOAA airborne flask sampling during the MOPITT mission. MOPITT measurements have been verified extensively using cross-validation with other satellites, in-situ measurements at the ground, as well as via aircraft, and more recently through ground-based solar Fourier transform infrared spectrometer measurements (George et al., 2015; Buchholz et al., 2017; Martínez-Alonso et al., 2020).

We note that satellite retrievals of tropospheric CO are also available using a multitude of other instruments, including the Atmospheric InfraRed Sounder (AIRS) onboard Aqua (McMillan et al., 2011), the Tropospheric Emission Spectrometer (TES) on Aura (Rinsland et al., 2006), the Infrared Atmospheric Sounder Interferometer (IASI) on the MetOp platform (Clerbaux et al., 2015), the Tropospheric Monitoring Instrument (TROPOMI) (Borsdorff et al., 2018), and the Cross-track Infrared Sounder (CrIS) (Nalli et al., 2020). These instruments are all sun-synchronous, capable of measuring CO in the infrared region, and have been shown to have consistent hemispheric CO variability when compared to MOPITT CO records from 2002 to 2018 (in the case of CrIS, from 2015 onward) (Buchholz et al., 2021). Because CrIS was launched in October 2011, it does not have a long or consistent enough period of measurement to be suitable for the determination of long-term trends of CO. The same applies to TROPOMI, which was launched in 2017. While CO measurements for IASI-A are long enough to determine trends, they have potential discontinuities since CO retrievals are not done using consistent humidity, temperature, and cloud information (Barret et al., 2025). Measurements from TES may be up to 2 orders of magnitude fewer than other instruments and are more difficult to verify due to not being collocated (Buchholz et al., 2021). Measurements from AIRS have a long and stable enough record for trend determination and have been shown to reproduce trends from MOPITT (Yurganov et al., 2008); however, AIRS uses a cloud-clearing algorithm that increases global coverage at the expense of increasing ground pixel sizes to 45 km × 45 km, resulting in coarser resolution (Cho and Staelin, 2006). Our choice of using MOPITT retrievals is because it has the longest available CO record with a verified history of reliable and stable observations and relatively fine spatial resolution.

2.2 Sources and Sinks of CO

Here, we briefly describe the source categories of CO, which include fossil fuel/biofuel burning, biomass burning, biogenic, non-methane hydrocarbon oxidation, and CH₄ oxidation as summarized in Holloway et al. (2000); Edwards et al. (2004, 2006). Describing these categories is useful because surface emissions of CO can display unique signatures that may be linked to specific spatial and temporal patterns in observed CO abundance.

CO from anthropogenic combustion (fossil fuel/biofuel burning) is much larger in the NH compared to the Southern Hemisphere (SH), which is attributed to significantly more fossil fuel consumption in urban and industrial regions. We consider these areas of high emission to be mostly associated with eastern China, India, the eastern United States, and Western Europe. Historically, eastern China has contributed significantly to anthropogenic emissions (Jia et al., 2024) because of a distinct reliance on coal and biofuel burning (Du et al., 2023; Kao et al., 2024). In recent years, however, China has made significant progress in reducing emissions and improving air quality (Greenstone et al., 2021; Li et al., 2024). This has been the case since 2014 when Premier Li Kequiang declared a “war on pollution”, and implemented widespread changes in regulatory policy (Greenstone et al., 2021).

Biomass burning is largest in the SH and a primary source of CO. Biomass burning consists of burning of savanna, forests, agricultural residue, fuelwood, and animal waste and is most concentrated in tropical and sub-tropical forests and grasslands (Holloway et al., 2000). The largest contributions of fire emissions come from Africa and South America, although fires also occur in Indonesia, the Boreal forests of Canada, Alaska, and Russia, as well as Australia. African NH fires occur in the savanna south of the Sahara Desert and in tropical rainforests north of the equator, while SH fires begin burning in the western portion of the continent near Angola and then spread southeast down the eastern coast. Fires in South America happen on the southern edge of the Amazon rain forest, the cerrado grasslands, and drought-deciduous forests in the south (Edwards et al., 2006). Another distinct feature of CO fire emissions is their timing (Wiedinmyer et al., 2023). While long-term drivers such as land-use and land cover changes, as well as drought conditions, heavily influence fire occurrences, seasonal variations are primarily driven by agricultural burning practices and prevailing weather conditions. The anomalously high CO levels from these fires, coupled with their distinct spatial and temporal variability inherent to biomass burning, provide a more reliable signature for distinguishing fire emissions from other CO sources.

The contribution of CH₄ oxidation, as well as oxidation from non-methane hydrocarbons (NMHCs), represents secondary sources of CO that are most apparent in the SH. Oxidation from CH₄ represents roughly 28 % of the global budget (Folberth et al., 2006) while the biogenic component of NMHCs represents roughly 15 % (Duncan et al., 2007). The oxidation of NMHCs includes a wide variety of constituents, which include isoprene, terpenes, acetone, as well as industrial and biomass NMHCs. Temperature-dependent (Sharkey and Yeh, 2001) biogenic emissions of isoprenes are considered the most dominant hydrocarbon emissions due to their large annual contribution (400–600 TgC globally Arneth et al., 2008) and their high reactivity with tropospheric oxidants (Atkinson, 2000). Isoprenes are volatile organic compounds that are directly emitted by trees and shrubs through their leaves. They are also emitted by animals, including humans, and can be measured through various biological processes, including exhaled breath, perspiration, urine, feces, and tear production (Moura et al., 2023). Terpenes are produced mostly from coniferous forests and are released in higher amounts with warmer weather (Holzke et al., 2006). The largest emissions of hydrocarbons occur in the tropics, which are primarily from isoprene and occur because of the large density of biomass and high temperature conditions. During the NH winter, there is a strong gradient resulting from biogenic emissions that are south of the equator, where the intertropical convergence zone is located, and includes emissions from South America, Southern Africa, and Australia. During the NH Summer, biogenic sources contribute more in the NH, resulting in a weaker but still observable gradient. This gradient is significantly stronger for surface observations but is observable at all levels of the atmosphere (Holloway et al., 2000). CO production from CH₄ oxidation can be observed across the troposphere and tends to be spatially homogeneous given the longer lifetime of CH₄ (≈10 years). It does, however, exhibit some seasonality due to chemical loss with its reaction to OH.

The dominant source of oceanic CO is due to sunlight-initiated photolysis of chromophoric dissolved organic matter (CDOM) in the open ocean (Stubbins et al., 2006). CDOM represents the portion of decaying plants and microbes that absorbs light, degrading into CO and other products in the process. The secondary sources of oceanic CO include dark production (the production of CO from CDOM without the presence of sunlight) and emission from phytoplankton (Zhang et al., 2008; Day and Faloona, 2009; Sarda-Esteve A and Bonsang A, 2009). While the physical processes that lead to dark production are not well understood, studies have observed it to be temperature dependent (Von Hobe et al., 2001). The direct emission of phytoplankton is a similar source to CDOM in that CO is produced when exposed to photosynthetic radiation. The primary removal process for CO in the ocean is due to microbial consumption, and secondary removal comes from sea-air exchanges when the ocean releases CO into the atmosphere (Conte et al., 2019). A recent budget for oceanic CO in the Eastern Indian Ocean (Ji et al., 2025) found that CDOM, dark production, and phytoplankton emission accounted for roughly 67 %, 30 %, and 3 % of oceanic emissions, respectively. Microbial consumption accounted for 94 % of the removal of oceanic CO while sea-air exchanges only contributed 6 %. Overall, the oceanic cycle of CO is not very well understood due to a general lack of observations and the fact that microbial consumption is difficult to predict due to its dynamically changing environmental conditions (Xie et al., 2005).

In addition to source signatures, the variability in CO (particularly its seasonality) is significantly influenced by its chemical loss with the OH radical. Notably, CO serves as the primary sink for OH, while CH₄ and NMVOCs both serve as secondary sinks. Reactions between CO and CH₄ have highly complex non-linear chemical feedback (Prather, 1996, 2007; Prather and Holmes, 2017). This is due to the link between OH and CH₄ as well as the production of OH, which is tied to O₃. In addition to the primary production of OH through photolysis with O3, secondary production can also occur through photolysis with VOCs. These secondary reactions cause oxidation with either CO or VOCs to produce O₃ in the presence of NO_x, while also recycling OH in the process (Fiore et al., 2024). The chemical lifetime of CO and, therefore, its abundance depends photochemically on the distribution of OH. Because oxidation increases with sunlight, OH is much more concentrated in the tropics and increases with latitude. This leads to the lifetime of CO being shortest in the tropics and longest near the poles, with an average lifetime of about 2 months.

2.3 EOF Analysis

Here we provide a simplified description of EOF analysis. A more complete description can be found in Björnsson and Venegas (1997); von Storch and Zwiers (2003); Hannachi et al. (2007); Wilks (2011); Hannachi (2021). The exposition presented here is detailed and specifically tailored for the use of EOF analysis in atmospheric chemistry datasets, especially because most references approach the subject from the perspective of meteorological applications. We present key equations relevant to our main discussion, while derivations are provided in Appendix A. We do this to emphasize the use of EOFs in atmospheric chemistry while keeping our discussion focused on their strengths and limitations rather than straying too far into technicalities.

Assume that we have a set of n observations in time of a gridded field with a total of m gridpoints. Let f_ij where $i = 1, 2, \dots n$ $j = 1, 2, \dots m$ represent the value of the field observed at time i and location j, then the data is represented by the n×m matrix F:

\begin{matrix} (1) & F = {(f_{1}, f_{2}, \dots f_{n})}^{T} = (\begin{array}{cccc} f_{11} & f_{12} & \dots & f_{1 m} \\ f_{21} & f_{22} & \dots & f_{2 m} \\ ⋮ & ⋮ & ⋱ & ⋮ \\ f_{n 1} & f_{n 2} & \dots & f_{n m} \end{array}) . \end{matrix}

Every row of F represents a 1×m vector corresponding to a single observation in time across all grid points.

We now construct the anomaly matrix F^′ that represents the departure from normal climatology by removing from each grid point its corresponding temporal mean. Instead of constantly referring to F^′ as the anomaly matrix, we will simply refer to it as F from now on.

The goal of EOF Analysis is to decompose the anomaly matrix into a matrix of orthogonal spatial patterns called the EOF modes C, and a set of uncorrelated time series called principal components (PCs) Φ such that

\begin{matrix} (2) & \begin{aligned} F (x, t) & = λ_{1} ϕ_{1} (t) c_{1} (x)^{T} + λ_{2} ϕ_{2} (t) c_{2} (x)^{T} + \\ \dots λ_{p} ϕ_{p} (t) c_{p} (x)^{T} = \sum_{k = 1}^{p} λ_{k} ϕ_{k} (t) c_{k} (x)^{T} \end{aligned} \end{matrix}

where p≤r, and r=rank(F) which satisfies:

\begin{matrix} (3) & r \leq \min (n, m) . \end{matrix}

The number of modes p to retain in the truncation is always chosen such that the maximum percent variance of the original dataset can be explained using the new set of basis functions. The data is compressed more when fewer modes of variability are kept in the expansion.

The EOF spatial patterns C correspond to the eigenvectors of the spatial covariance matrix ( $R = F^{'} F$ ) of F. We can similarly show that the PC time series Φ corresponds to the eigenvectors of the temporal covariance matrix ( $L = {FF}^{'}$ ) of F. It is important to note that because the covariance matrices are symmetric and positive semi-definite, they have non-negative eigenvalues with well-defined square roots as well as orthogonal eigenvectors. An equivalent formulation is to use singular value decomposition (SVD) on F, so we expand it into unitary matrices Φ and C and a matrix

\begin{matrix} (4) & Σ = Λ^{1 / 2} \end{matrix}

of singular values such that:

\begin{matrix} (5) & F = Φ Σ C^{T} . \end{matrix}

Note that in this case that each column of F is divided by its standard deviation so that the covariance matrix then becomes a correlation matrix. We note, however, that our total column values are large in magnitude (they are measured in molec. cm⁻²), so we choose to use correlation matrices to avoid potentially having large differences in the variance, skewing our interpretations.

In Appendix A1, we detail exactly how the EOF patterns and PC time series can be computed numerically either using singular value decomposition (SVD) or by using eigendecomposition on the spatial or temporal covariance matrices and then projecting the resulting eigenvectors onto the original matrix F. In Appendix A2, we show that these two methods yield equivalent results in the context of our problem space. We note, however, that these methods are NOT in general the same. While SVD can be performed for rectangular matrices, eigendecomposition can only be done for square matrices (Van Loan and Golub, 1996; Trefethen and Bau, 2022). More importantly, while both methods are matrix factorizations for the original matrix F, SVD is generally preferred because it has better precision and is more stable computationally (Trefethen and Bau, 2022). This is because SVD uses a diagonal matrix of singular values (the square roots of the eigenvalues), which is always constructed to be non-negative. Unfortunately, SVD may require too much computing power in the case that F is very large because it requires us to compute eigenvectors of both the spatial covariance R and the temporal covariance L. Because F has m≫n (more gridpoints than observations in time), then dim(R)≫dim(L), so we decided to forego using SVD in favor of an eigendecomposition on L. On the other hand, if the number of observations had been large (n≫m), then dim(L)≫dim(R) and we would have instead used an eigendecomposition on R. Another alternative that is considerably faster for large matrices with no loss of stability is to use Lanczos iteration to solve the eigenvalue problem (Bennani and Braconnier, 1994; Toumazou and Cretaux, 2001). Lanczos iteration is faster for larger matrices because it relies on Krylov subspaces to compute the largest eigenvalue first (Kuijlaars, 2006). While Lanczos iteration is more efficient, it is more complex to execute and therefore goes beyond the scope of our discussion.

2.3.1 Separation of Modes Using the North Test

After generating the matrix of EOF modes C (the matrix of eigenvectors corresponding to R), we need to determine how many of these modes are significant and provide useful information. Assuming that the p eigenvalues have been sorted in descending order of magnitude (this is done automatically in SVD algorithms but not necessarily in eigendecompositions), then the percent variance that is accounted for by the mth mode is given by Hannachi et al. (2007):

\begin{matrix} (6) & % var = \frac{λ_{m}}{\sum_{k = 1}^{p} λ_{k}} \times 100 % . \end{matrix}

Because of correlations between variables (spatial autocorrelation in F), F can't have truly independent samples. The sampling uncertainty for the eigenvalues can be estimated by using a variety of methods, including asymptotic approximation, probabilistic principal component analysis (PPCA), Monte-Carlo resampling Methods, or bootstrapping (North et al., 1982; Kim and North, 1993; Tipping and Bishop, 2002; Hendrikse et al., 2009; Isokääntä et al., 2020; Hannachi, 2021). In our paper, we only consider the use of asymptotic expansions because it is the only method out of the ones listed that does not rely on resampling. We take this approach not only because of simplicity, but because resampling would become computationally cumbersome given the large size of our data matrix.

In this paper, we emphasize the asymptotic method as described by North et al. (1982) who established an estimate of the typical error Δλ_k between two neighboring eigenvalues to be:

\begin{matrix} (7) & Δ λ_{k} \approx \sqrt{\frac{2}{N^{*}}} λ_{k} \end{matrix}

We see that according to Eq. (7), the error in each eigenvalue in is proportional to its magnitude. Since this approximation relies on the central limit theorem and the law of large numbers, it is valid as long as the effective number of independent samples N^* is large. According to Hartmann (2016), N^* can be estimated in terms of a first-order stationary autoregressive Gaussian process (AR1) also known as red noise. By assuming N is large and that our process is AR1, we use the estimate for N^* given by (Bartlett, 1935; Jones, 1975; Bretherton et al., 1999; Hartmann, 2016):

\begin{matrix} (8) & N^{*} = N (\frac{1 - a}{1 + a}) . \end{matrix}

We derive Eq. (8) in Appendix A3, and clarify the reasoning for these assumptions. North et al. (1982) also gives an estimate for the error between eigenmodes (c_k is the EOF for λ_k) as:

\begin{matrix} (9) & Δ c_{k} = (\frac{Δ λ_{k}}{λ_{j} - λ_{k}}) c_{k}, \end{matrix}

which shows that large errors in eigenvalues will also cause large errors in a neighboring EOF mode. When there is a significant overlap between the error bars of adjacent eigenvalues, it is a good indicator that these modes are either degenerate due to being contaminated by error or potentially represent white noise. One downside of using this approach is that this estimation is inexact and inherently subjective. A more robust way to determine if the eigenvalues correspond to modes that have genuine spatial structure is to subject them to a stochastic null-hypothesis test (see e.g. Dommenget, 2007; Montano and Jombart, 2017), which is beyond the scope of this study.

2.3.2 Limitations of EOF Analysis

While EOF analysis can be used as a very powerful tool to compress data and determine dominant modes of climate variability, the method suffers from numerous potential limitations. EOF modes have been extensively found to suffer from the following major drawbacks in comparison to rotated EOFs (REOFs) (Richman, 1986; Hannachi, 2021):

Domain Dependence

Domain dependence is a problem that was first reported by Kaiser (1958). Buell (1978) determined that the primary factor that determines the shape of an EOF was not its covariance structure, but rather the shape and size of the domain (also Lehr and Hohenbrink, 2025). This is an immediate problem, as the covariance structure contains the features of our data and allows us to separate it into different modes of variability. Buell (1978) also showed that it is possible to find several different correlation functions that have a very similar structure over the same domain.
Subdomain Instability (Non-locality)

Sub-domain instability is closely related to the issue of domain dependence and references the lack of invariance when EOFs from a given domain are compared to those generated from localized subdomains (Richman, 1986). For an EOF to be physically insightful, it should, for instance, be the case that EOF patterns over Africa and Asia should reproduce those generated globally. If this were not the case, the modes generated would be impossible to reproduce with any semblance of consistency. This difficulty is caused by the orthogonality constraint of EOFs, which causes the variance criterion to become maximized globally rather than locally, and leads to the inability of EOF analysis to separate independent and localized features that share similar covariance structure (Horel, 1981; Monahan et al., 2009). This non-locality issue is also consistent with the mixing property for EOFs, which refers to their tendency to take a signal that begins as a linear superposition of several different independent and potentially uncorrelated signals, and then combine them to maximize variance (Hannachi, 2021).
Sampling Error

Sampling error between neighboring eigenvalues is an issue that we have discussed in Sect. 2.3.1 of this document, so we will not examine it further here.
Inaccurate Representation of Physical Phenomena

The difficulty EOF analysis suffers in expressing physical phenomena is a complex issue, and here we summarize the main points on this topic as discussed in Monahan et al. (2009). The first major issue is that statistical modes generated in EOF analysis do not, in general, represent dynamical modes that correspond to true physical modes of variability, and in fact hold only under very strict circumstances. This is not unexpected, as the forced orthogonality of modes for EOFs is a mathematical constraint rather than a physical one. In particular, linearized climate models have conditionally orthogonal modes. The linearized barotropic model given in Simmons et al. (1983) does not have orthogonal processes. A more general climate model with conditional orthogonality is given in Monahan et al. (2009).

We should also not expect EOF analysis to produce principal component time series that are truly independent, because in the absence of autocorrelation, we could still have uncorrelated variables that may be related in a nonlinear way. Only in the case that a probability distribution is jointly Gaussian will it have modes that are uncorrelated and independent. This observation is verified in Appendix A4. Without this jointly Gaussian assumption, it does not follow in general that uncorrelated random variables are independent. A simple counterexample to this is given in Hannachi (2021). The degree to which a field F deviates from Gaussian behavior can be estimated by calculating the skewness. Only in specific circumstances can climate data be considered to have low skewness, and therefore be considered Gaussian with independent principal components. For example, mean annual temperature records that do not show abrupt increases in warming or cooling may be considered to be Gaussian (Skelton et al., 2020). Some, but not all of these limitations in EOF analysis can be resolved using REOFs, but even they come with their own drawbacks, which will be discussed in the next section.

2.3.3 Varimax Rotation

We describe rotated EOFs (REOFs) using the varimax criterion, which are EOFs that have been rotated according to a maximization condition. REOFs are motivated by the need to make EOF patterns more localized, easier to interpret in physical space, and less dependent on the domain. This is accomplished by relaxing orthogonality constraints for the EOFs and/or the uncorrelated constraints of the PCs subject to a specified criterion. Rotations can either be oblique or orthogonal and include more than ten different possible criteria according to Richman (1986). We are only concerned with the varimax criterion, which is popular because orthogonal rotation is less sensitive to changes in the number of variables, and it avoids some convergence issues that can happen during matrix inversion in oblique rotation. In simple terms, the varimax criterion attempts to maximize the sum of the spatial variances of the squared and rotated EOF patterns. Orthogonality is preserved while achieving simplicity by forcing patterns to have magnitude near either 0 or 1 so that each pattern can be easily represented by the linear combination of only a few basis vectors. One potential drawback of varimax rotation is that leading patterns produced by globally maximizing variance could potentially be destroyed. Suppose we can obtain the first p leading varimax modes. As p increases and we rotate a higher number of EOF patterns, the rotated EOFs become more localized at the expense of the leading patterns, which may no longer be invariant under the orthogonal transformation. When creating the varimax code for our analysis, we used the dualvarimax code presented in Björnsson and Venegas (1997) as a template.

2.4 Seasonal Decomposition using Singular Spectrum Analysis

Singular spectrum analysis (SSA) is a very powerful tool in time series analysis with a wide variety of applications, some of the more notable ones including forecasting, change point detection, filtering, and detection of spatio-temporal patterns. SSA also has applications to meteorology and air quality monitoring, for instance, Macias et al. (2014) uses SSA on the global surface temperature to separate a multi-decadal oscillation from a secular trend, and Gruszczynska et al. (2019) uses multichannel SSA to perform spatiotemporal analysis for atmospheric, continental hydrology, and non-tidal ocean changes to determine an annual signal for 16 sections depending on the climate zone. Another application is from Espinosa et al. (2022), which uses the time derivative of a resulting SSA signal to detect air quality anomalies.

Broadly, our goal is to take a time series $x = [x_{1}, x_{2}, \dots, x_{N}]$ of length N and then decompose it into a sum of trends, periodic (or quasi-periodic) components, or noise, the sum of which will reconstruct the original series. A detailed description of the method is fairly technical, and we therefore refer the reader to summaries found in Hassani (2007) and Golyandina and Zhigljavsky (2013) for a more complete description. We perform a seasonal decomposition on the mean-centered and standardized time series for each grid point of our dataset A by using the trenddecomp function in Matlab with one mode of seasonal variability. This gives us

\begin{matrix} (10) & A = LT + S + R, \end{matrix}

where LT is the long-term variation, S is the seasonal variation, and R is the remainder.

3 Results and Discussion

We present our results and discussion in 5 parts. In Sect. 3.1, we first show the spatial and temporal mean of our MOPITT dataset in Figs. 1 and 2. We then discuss how the interannual variability of CO is linked to differences in sources and sinks. In Sect. 3.2, we show in Fig. 3 that the first leading EOF mode of MOPITT CO is independent of its domain because it preserves its spatial structure across three regional subdomains. Then, in Fig. 4, we display the percent variance of the first 6 EOFs for both unrotated and varimax modes together with their error bars using the North test. This figure shows that the unrotated modes had better overall modal representation compared to the varimax modes, and is the case both for our original time series as well as for our SSA decomposition into long-term, seasonal, and residual modes (with residual modes being an exception to having better representation). In Sect. 3.3, we discuss the SSA decomposition for our MOPITT dataset used in combination with EOF analysis. In Fig. 5, we show the overall effect of the decomposition on our MOPITT time series changes across different regions of the globe by examining how the magnitude and phase of each mode compare with those of the original series. In Fig. 6, we show spatial maps for the first 3 unrotated EOF modes for both the original dataset as well as the long-term, seasonal, and residual modes and their percent variance. We discuss that while the overall EOF mode 1 clearly shows a spatial pattern indicative of an annual cycle, physical interpretations of higher EOF modes are limited considerably in their usefulness. In Sect. 3.4, we perform a spectral analysis of our EOF modes by examining the time series of their principal components in Fig. 7, and the cumulative percent total power of each component in Fig. 8. We discuss the specific time scales that contribute the most power for each mode, and we emphasize that our long-term modes can reproduce our first two unrotated modes. Finally, in Sect. 3.5, we examine the mean skewness field of our data in Fig. 9, and discuss how it implies that our dataset is non-Gaussian and therefore has time-dependent principal components. In Fig. 10, we show that the autocorrelation for all of our modes is either wide-sense cyclostationary or wide-sense polycyclostationary. Our principal components being time-dependent, together with our modes being non-stationary, suggests that EOF analysis cannot result in modes that are directly associated with sources or sinks of CO, representing a serious limitation on our study.

3.1 Spatio-temporal Variations of CO from MOPITT

As we discussed in Sect. 2.1, we use total column CO data from MOPITT for our study. We use daily data averaged to 8 d periods to give us a sufficient resolution to observe CO during its lifetime while also reducing computation time and giving a more consistent overpass time. We also emphasize that for our specific application, daily data would not be sufficient because MOPITT does not have daily coverage, and EOF analysis fails to produce patterns if there are any gaps in the data. Even in using 8 d coverage, we needed to fill a small number of gaps using the inpaintn function offered by MATLAB. We use measurements for the 14-year duration of 2005–2018 to have sufficient data to study long-term patterns. Our data had long-term trends removed by taking the globally averaged time series and applying a Fourier fit to it. We note that because the trends in the total column data are a non-linear combination of the trends from individual sources, it is not immediately clear what sources may have had their long-term trends removed. Plots are shown below of the spatial (Fig. 1) and temporal (Fig. 2) means of de-trended CO for global data over our stated observation period. We now discuss the seasonal and interannual variability observed by MOPITT, focusing on its relationship to the sources and sinks of CO that we described in Sect. 2.2.

Reports of the interannual variability of CO can be identified as fluctuations in the observed gradient of its abundance between the NH and the SH. This gradient is driven by the significant differences in surface emissions between the hemispheres, along with the associated air mass exchange that occurs on timescales of a year or longer. To a first approximation, CO concentrations are lowest where OH abundance is highest, as CO's chemical lifetime is strongly dependent on reactions with OH. These areas are characterized by having high photochemical activity (solar radiation), as well as high concentrations of O₃ and NO_x, and correspond to continental regions during the summer. On the other hand, oceanic (remote) regions during the winter correspond to areas with the lowest OH and therefore the highest CO. In general, CO loading tends to reach maximum values in April and minimum values in September (Edwards et al., 2004). However, the transport and mixing of surface emissions complicates this first-order relationship.

Seasonal differences in the distribution of OH and the imbalance of emissions in each hemisphere lead to a North-South gradient in CO that is clearly visible during the NH winter but is absent in the NH Summer. During the NH winter, low OH in the NH leads to high CO, while high OH in the SH leads to low CO. This, in turn, reinforces large NH anthropogenic emissions from Europe, North America, and Asia that peak during early springtime, which leads to a steep gradient during the NH winter. During the NH summer, we instead have low CO in the NH, but high CO in the SH, which balances a much lower budget of emissions of CO in the SH.

Significant changes in the annual cycle have been connected to fire emissions due to biomass burning, and limited studies have connected variability in the SH to climate indices where biomass burning is the primary driver. In the NH, fires from Boreal forests in North America and Eurasia have been shown to contribute to significant interannual variability. In the SH, while fires in southern Africa and South America generate overall high CO loadings, a primary driver of variability is due to fires in the Maritime Continent and Northern Australia. Buchholz et al. (2018) were able to use multilinear regression with four climate indices to explain the variability of CO across the SH and tropics, including Maritime SEA, two subregions in each of Australasia, Southern Africa, and South America. The indices used to explain variability included the El Niño Southern Oscillation (ENSO), the Indian Ocean Dipole (IOD), the Tropical South Atlantic Sea Surface Temperature Anomaly (TSA), and the Antarctic Oscillation (AAO) (Trenberth, 1997; Gong and Wang, 1999; Saji, 2003; Kucharski and Joshi, 2017). They found that more than 50 % variability is explained over all regions, and more than 70 % variability was explained across maritime Southeast Asia and North Australasia. The fire intensity and burned area are connected to the amount, type, and dryness of fuel availability, which depend on climate-sensitive water availability and ecosystem responses. This indicates that SH interannual variability of CO loading has a clear but complex dependence on climate.

https://amt.copernicus.org/articles/19/2529/2026/amt-19-2529-2026-f01

Figure 1Spatially averaged time series of de-trended MOPITT CO over the period of 2005–2018 across the globe from 60° S to 60° N (in molec. cm⁻²).

Download

https://amt.copernicus.org/articles/19/2529/2026/amt-19-2529-2026-f02

Figure 2Temporally averaged global map of de-trended MOPITT CO over the period of 2005–2018 across the globe from 60° S to 60° N (in molec. cm⁻²).

3.2 Domain Independence and Separation of Modes for MOPITT CO

As discussed in Sect. 2.3.2, one of the first potential issues that needs to be addressed to produce physical EOF modes is whether or not the modes capture the local structure and if they are reproducible across differing subdomains. Before generating EOF modes, our data matrix of size (longitude, latitude, time) is reshaped into size (time, grid), and every grid point is mean-centered across time and then standardized. In Fig. 3, we show the first leading EOF mode for the dataset across the entire globe (a) and then recompute the first mode after subdividing the domain into North American (b), African (c), and Asian (d). To verify that these spatial patterns did not change over time, we recomputed the EOFs after segmenting the time series by year and determined that the patterns remained unchanged (they only showed negligible differences in magnitude). The spatial structure of the original pattern has been preserved in each of the three subdomains, showing us that the local and global structure for the pattern is the same, and therefore our dataset is independent of its domain.

https://amt.copernicus.org/articles/19/2529/2026/amt-19-2529-2026-f03

Figure 3Spatial map showing the first EOF mode of MOPITT CO for (a) the globe, (b) North America, (c) Africa, and (d) Asia. All magnitudes are unitless, and all modes represented have unit norm.

https://amt.copernicus.org/articles/19/2529/2026/amt-19-2529-2026-f04

Figure 4The percent variance that is explained by the first six normalized EOF modes for (a) the MOPITT dataset and (b) long-term, (c) seasonal, and (d) residual components after using an SSA decomposition. Results are shown globally for both varimax EOFs (blue) and unrotated EOFs (green). Additionally, we display the cumulative percent variance explained for varimax EOFs (orange) and unrotated EOFs (purple). Our error bars, computed using the North test, are shown in red.

Download

In some problems, it is often a good idea to attempt varimax rotation to ensure that the resulting patterns are simple and easy to interpret (Kaiser, 1958; Horel, 1981; Richman, 1986), but in our case, there is not a real need to do so because our problem does not suffer from domain instability (Buell, 1975; Richman, 1986). To demonstrate that varimax does not improve our resulting modes, we show the eigenvalue spectrum of the first six unrotated EOF modes together with the spectrum of our varimax modes in Fig. 4. Each mode is plotted as a fraction of the percent variance, with the exact value shown within the text box in the middle of the plot. The varimax and unrotated modes are shown in blue and green, respectively, and the cumulative percent variance is shown in orange and purple. We compute the error bars using Eq. (9) for Δλ_k in Sect. 2.2.2 and plot the results in red. For our unrotated modes of the original MOPITT dataset in (a), the variance together with error bars (the length above/below each data value) are given by: 61.55±3.43 %, 20.52±1.143 %, 8.24±0.459 %, 3.92±0.218 %, 1.10±0.061 % and 0.69±0.038 %. After using varimax rotation, the percent variance of each mode was: 86.02±4.793 %, 11.56±0.644 %, 1.87±0.104 %, 0.49±0.027 %, 0.03±0.002 % and 0.01±0.001 %.

While our plots of percent variance for the varimax modes show better representation for mode 1 of every dataset compared to our unrotated modes, all of the remaining modes showed significantly worse representation (except the residual modes). None of these eigenvalues represent degenerate modes because their error bars do not overlap with adjacent neighbors. We note, however, that varimax modes 4, 5, and 6 have such a small percent variance that it is unlikely for them to rise above the level of noise and represent independent and important features. On the other hand, the higher modes of our unrotated EOFs have a much larger percent variance and are more likely to represent independent features, with modes 5 and 6 most likely representing noise. Comparing the cumulative percent variance for the first 3 MOPITT modes, we see our unrotated modes accounted for 90.3 % of the variance compared to 99.4 % for varimax. Because our third varimax mode only accounted for 1.87 % variance as opposed to 8.24 % for our unrotated mode, we could not guarantee that it would be well represented. We therefore decide to only analyze spatial patterns of the first 3 unrotated modes in our SSA decomposition, which comes at the cost of explaining 9.1 % less variance overall. We discuss the long-term, seasonal, and residual modes in greater detail in Sect. 3.3.

Finally, we note that our choice to display the first six modes does not represent an “optimal” representation of modes as could be determined using statistical tests for eigenvalues (Tipping and Bishop, 2002; Hendrikse et al., 2009; Isokääntä et al., 2020). This choice is made to ensure we display at least one potentially degenerate mode for both our varimax and unrotated EOFs. We also do this to show there is little motivation to keep more than 5 EOFs in our analysis, since each mode beyond the fifth adds a negligible (<1 %) amount of variance, except the residual modes.

3.3 Seasonal Decomposition for MOPITT CO

To visualize our SSA decomposition, rather than choosing a single grid point as an example, we plot a decomposition for the temporal mean of the dataset in Fig. 5 to show the average overall effect. Our results are plotted only for 2005–2009, as five yearly cycles were sufficient to show the effect of the decomposition. We show the decomposition for 4 different regions: Global (60° S–60° N), Northern Extratropics (25–60° N), Southern Extratropics (25–60° S), and Tropics (24° S–24° N).

The result is that the long-term component (red) has a yearly periodicity that reproduces the peaks of the original MOPITT series (blue), while the seasonal component (green) has a shorter temporal periodicity (6 months). The residual component (orange) shows the shortest periodicity, being on the order of 3 months. Note that in each case, the decomposition is phase shifted with reduced magnitude in comparison to MOPITT. When we compare the decomposition in each region, we see that the long-term component does the best job of reconstructing the original time series over the northern extratropics, as shown in Fig. 5b. The magnitude of the long-term component matches well with the original series, but peaks earlier in time. The same thing can be said regarding the southern extratropics, but with reduced magnitude relative to the original series. In Fig. 5d, we note that the long-term mode in the tropics is significantly shifted forward in time compared to the original series. Finally, we observe that the seasonal mode contributes the most to the reconstruction in the tropics since its magnitude relative to the original time series is largest.

https://amt.copernicus.org/articles/19/2529/2026/amt-19-2529-2026-f05

Figure 5A time series plot of the rescaled MOPITT CO mean (blue) and the long-term (red), seasonal (green) and remainder (orange) components of the SSA decomposition shown from 2005–2009. We show the overall effect of the decomposition in 4 different regions: (a) Global (60° S–60° N), (b) Northern Extratropics (25–60° N), (c) Southern Extratropics (25–60° S), and (d) Tropics (24° S–24° N).

Download

https://amt.copernicus.org/articles/19/2529/2026/amt-19-2529-2026-f06

Figure 6Spatial maps of the first 3 unrotated EOF modes of the (a–c) MOPITT dataset and its (d–f) long-term, (g–i) seasonal, and (j–l) residual components after using a SSA decomposition. The percent variance that is explained for each mode is displayed at the bottom right corner. All magnitudes are unitless, and all modes represented have unit norm.

By implementing SSA on each grid point, we produce 3 matrices, each with the same size as our original dataset, which are again mean-centered and standardized. We then conduct EOF analysis on the resulting matrices, displaying the first 3 global modes for MOPITT in Fig. 6 together with the first 3 modes of the long-term, seasonal, and remainder datasets. While there are some differences in magnitude and percent variance, we note that the first two long-term EOF modes reproduce the spatial pattern of the first two modes for the original MOPITT CO patterns. Examination of the third long-term EOF mode shows that the pattern appears to be noise, as indicated by the low percent variance and the complete lack of any spatial structure. We also note that there is a very similar structure between the third MOPITT EOF mode and the first Seasonal EOF mode; however, the first seasonal mode has a larger contrast between the positive and negative patterns, which is especially noticeable in Northeastern Asia and the east coast of the United States. The structure for seasonal modes 2 and 3, as well as all three residual modes, represents patterns distinct from our MOPITT EOF modes before we performed SSA on the data.

We also note that according to Fig. 6d, our long-term EOF mode 1 has the largest positive and negative anomalies in the extratropics. This observation is consistent with our time series in Fig. 5b and c, which showed that the long-term mode did the best job of matching the magnitude and phase of the original MOPITT time series in this region. Similarly, our seasonal EOF mode 1 in Fig. 6g showed the largest anomaly in the tropics. This is consistent with the time series of our seasonal mode in Fig. 5d, which had the largest magnitude relative to the original time series. A reduced influence of the long-term mode and a corresponding increase in the influence of the seasonal mode in the tropics is consistent with CO having a shorter lifetime in this region.

Our intention in performing EOF analysis was that each EOF mode would represent the variability from either a source or sink in CO; however, the patterns depicted in Fig. 6 do not correlate with physically separable sources. If each mode represented a source of CO, then we would expect to see modes that correlated with variability due to fossil fuel burning, biomass burning, biogenic oxidation, or CH₄ oxidation. However, none of the modes depicted in Fig. 6 show patterns of variability that can be directly connected with source distributions, nor do they appear to directly correspond with synoptic weather patterns or transport pathways.

As an example, let us attempt to connect MOPITT EOF 2 with fossil fuel burning. If we attempt the interpretation that blue areas represent areas with high contributions of fuel burning, this would be inconsistent with our knowledge of fossil fuel burning as a source of CO. While we do see significant areas of blue highlighted on the east coast of the United States which is consistent with combustion, the pattern shows too little activity over Eastern China and India and higher activity over Russia than expected. Similarly, when we examine EOF 3, we see a pattern that has some consistency with biomass burning in Africa and South America; however, the features are far too smeared to be a potential interpretation. For example, the pattern stretches too far across oceanic regions as well as the Sahara and Saudi Arabia to be classified as potential fire activity. For similar reasoning, it would not make sense for us to interpret this as a pattern of biogenic emission.

When examining EOF mode 1, we see a very clear boundary between the NH and SH as indicative of the annual cycle in CO caused by the combination of larger combustion activity and lower hydroxide radicals in the NH during the winter season. This interpretation is consistent with an observation we state in Sect. 3.3 on spectral analysis and time scales, which is that the time scale for EOF 1 is dominated by annual/semiannual variation.

The majority of the areas with the highest contrast in Residual EOF mode 1 are seen to be over oceanic regions, which suggests the possibility of a connection between Residual mode 1 and oceanic variability. However, these patterns do not match the variation of CO due to oceanic cycles, such as patterns as shown in Conte et al. (2019). This could be because we would expect oceanic variability to be strongest near the surface, and patterns due to surface variations would be obscured by the changes in CO throughout other areas of the atmosphere, which we are accounting for by using total column data. This could also be because budgets for oceanic cycles of CO vary widely both spatially and temporally due to changing environmental conditions (Ji et al., 2025).

Examining the remaining seasonal and residual patterns, we see patterns that could be indicative of regional-scale transport; however, verification and further discussion of these modes is beyond the scope of our analysis at this time.

3.4 Spectral Analysis and Time Scales

In Fig. 7, we display the principal component time series for the original MOPITT dataset as well as the long-term, seasonal, and residual components. We first note that the first two long-term principal components have very similar periodic patterns as the original 2 MOPITT modes, while the third mode is, of course, noise because, as we have already seen, the third long-term EOF pattern had no spatial structure. We also see that the first two seasonal components have a similar periodic pattern to the third MOPITT component, even though we saw from Fig. 6 that the second seasonal EOF has a very different spatial structure. The long-term modes have the largest period, while the seasonal components have shorter periodicity, and the residual modes have the shortest.

https://amt.copernicus.org/articles/19/2529/2026/amt-19-2529-2026-f07

Figure 7Time series for the first 3 principal components of the (a–c) MOPITT dataset and its (d–f) long-term, (g–i) seasonal, and (j–l) residual components after using an SSA decomposition. All magnitudes are unitless, and all modes represented have unit norm.

Download

To determine the most dominant time scale for a given mode, we used Fourier transforms to compute the power spectral density of each principal component and then used a numerical trapezoid rule to determine the total cumulative power as a percentage, which is then plotted in Fig. 8 on a logarithmic scale. The percent of total power is labeled for a period of oscillation for 100 d, 1 year, and 1000 d. We note that in this context, having time scales longer than the lifetime of CO still makes physical sense because some variability of CO is driven by large-scale weather patterns. The intrahemispheric mixing time is about 6 months, while interhemispheric mass exchanges can happen across 1–2 years (Bowman and Cohen, 1997). Variability on this time scale could also be indicative of changes to climate drivers. For example, deforestation over several years may lead to long-term reduction of biogenic and fire emissions, while urbanization may lead to an increase in anthropogenic emissions. A dotted line and the value t_max indicate the period of oscillation that contributed the largest amount to the total power. For example, examining MOPITT Spectrum Mode 1, we see that 1.3 % of power is from periods of 100 d or smaller, 26.9 % of power is from periods of 1 year or smaller, and 43.2 % of power is from periods of 1000 d or smaller. We see that the first two MOPITT patterns correspond to variation dominated by an annual/semi-annual cycle, while the third mode is dominated by seasonal changes on the order of 3 months. Our long-term EOFs reproduce our original 2 modes, while our seasonal EOFs show dominant time scales on the order of 3 or 2 months, and our residual modes show dominant time scales on the order of 2 months or less.

https://amt.copernicus.org/articles/19/2529/2026/amt-19-2529-2026-f08

Figure 8The cumulative percent total power for each principal component. Power values are labeled at 100 d, 1 year, and 1000 d. The dotted line and the value t_max indicate the period of oscillation that has contributed the largest amount to the total power.

Download

3.5 Time Dependence and Non-Stationarity of Modes

As discussed in Sect. 2.2.3, one weakness of EOF analysis is that even though the principal component time series are constructed to be uncorrelated, it is not necessarily appropriate to assume that they represent independent processes, which complicates the interpretation of assigning physical meaning to the modes. Only in the special case that the data is Gaussian does it follow that uncorrelated random variables are then deemed independent. In Fig. 9, we plot the mean skewness field for MOPITT CO over the period of interest from 2005–2018. It is standard to consider values of absolute skewness less than 0.5 to have negligible skewness corresponding to a relatively symmetric distribution, while values between 0.5 and 1.0 are moderately skewed, and values larger than 1.0 are highly skewed. Contour lines are shown for skewness values of −1, −0.5, +0.5, and +1.0 to highlight areas that have a notable degree of skewness.

https://amt.copernicus.org/articles/19/2529/2026/amt-19-2529-2026-f09

Figure 9A spatial plot showing the mean skewness field of MOPITT CO from 2005–2018. Contour lines are shown to highlight skewness values of −1.0, −0.5, 0.5, and 1.0, indicating regions that are moderately or heavily skewed.

We note that these sharp gradients in skewness in Fig. 9 are driven by long-term seasonal changes in sources and sinks depending on the region. Regions of low skewness generally indicate places where pollution levels are relatively constant throughout the year, with few extreme highs or lows. High positive skewness values indicate a larger number of extreme pollution events are occurring in the area, and may be interpreted as local enhancement from long-term transport. For example, we interpret the high skewness we see in central Africa as transport due to biomass burning plumes which frequently move westward (Krishnamurti et al., 1996; Fisher et al., 2015). Similarly, large skewness across South America may be indicative of the secondary contribution of isoprene oxidation near the equator. This observation is consistent with Fisher et al. (2015), who discuss how isoprene in South America is uplifted to sufficient levels to easily react with OH, where it is slowly transported far south and into the extratropics. The bands with high skewness across the SH are consistent with the north/south movement of the inter-tropical convergence zone due to exchange of CO through the Hadley circulation (Bowman and Cohen, 1997; Bowman, 2006). The skewness patterns we see in the NH are more diverse and are indicative of the seasonal changes in fossil fuel usage, which vary by region and sector.

Overall, we see from Fig. 9 that our dataset has significant skewness in the SH, especially in South America, as well as over China, Indonesia, and East Asia. This indicates that we cannot consider our dataset to be approximately Gaussian, and this is a potential explanation for why our principal components show significant dependence in the distributions of their spectral responses. EOF analysis assumes that the dataset is stationary so that the mean and temporal covariance remain constant in time, which is done to ensure that the first and second moments (the mean and covariance) of random variables can be determined using a limit of simple summations (or integrals). In practice, this assumption is rarely realistic with physical data. An equivalent statement is that the temporal autocorrelation is constant as a function of lag time τ. If we consider the mean and covariance of CO, it would be realistic to assume that these statistics should be a function of time because their sources and sinks are correlated temporally. If we imagine a smokestack emitting CO at a fixed location, the CO measured in the future is highly correlated with the amount of CO in the present. This brings us to our next main point, which is that another reason why our modes cannot be interpreted in terms of individual sources and sinks lies in the non-stationarity of the MOPITT dataset, which remains unaccounted for in standard EOF analysis and is a fact we will now show.

https://amt.copernicus.org/articles/19/2529/2026/amt-19-2529-2026-f10

Figure 10The autocorrelation for (a) MOPITT CO as well as the autocorrelation of the (b) long-term, (c) seasonal, and (d) residual modes after using an SSA decomposition, plotted as a function of lag time (measured in 8 d periods).

Download

As shown in Fig. 10, the autocorrelation for the original MOPITT time series, as well as each component from the seasonal decomposition, does not decay to zero but instead shows periodic variation that persists across time. The MOPITT series has a positive correlation that peaks for every year exactly and shows two negative peaks, both of which also recur at intervals of 1 year. We could therefore consider the MOPITT autocorrelation as being a sum of three periodic components. The long-term and seasonal autocorrelation are simpler functions, and both have a single periodic component that peaks exactly once per year and 6 months, respectively. The residual autocorrelation, like MOPITT, is a sum of periodic processes. It is important to note that we cannot force the dataset to become stationary by removing an annual cycle, because the autocorrelation represents a superposition of multiple periodic processes. We may therefore consider both the long-term and seasonal series to be wide-sense cyclostationary processes since they have a periodic temporal mean and a periodic temporal autocorrelation. Because the MOPITT series and residual series are composed of multiple periodic processes, we may consider them to be wide-sense polycyclostationary. We conclude this discussion by mentioning that in future studies, it may be worth exploring the use of cyclostationary EOF (CSEOF) analysis on either the original time series or on the seasonal modes. CSEOF analysis is a method that is well-suited for datasets that possess autocorrelation functions of this type (Kim et al., 2015). Unlike traditional EOF methods that assume stationary covariance, CSEOF can better capture evolving physical modes, a feature essential for analyzing atmospheric time series.

4 Conclusions

In this study, we explored the application of EOF analysis to evaluate whether CO source signatures can be discerned from observed CO abundances. We hypothesized that spatial and temporal differences in CO are influenced by surface emission patterns, including contributions from CH₄ and NMHCs, atmospheric transport, and reactions with OH. Rather than interpreting physical patterns, we emphasized the methodological rationale and limitations underlying our findings.

Our results in Fig. 3 demonstrate that EOF modes derived from MOPITT CO columns are not domain-dependent, as generating regional domains yielded spatial patterns nearly identical to those from a global domain. Using the North test, we showed in Fig. 4 that unrotated EOF modes provided better data representation than varimax modes. Consequently, seasonal analysis via SSA was performed exclusively on unrotated modes due to their improved representation and domain-independence, which are characteristics specific to our dataset. However, we emphasize that this decision should not be generalized across all climate datasets.

Plots of cumulative percent total power for the principal components in Fig. 8 revealed that lower-order modes capture long-term variability, while higher-order modes reflect shorter-term fluctuations. Each mode exhibited power distributed across a range of frequencies, suggesting that variability arises from a combination of time scales rather than isolated periodicities.

Seasonal skewness analysis from Fig. 9 confirmed that, although the principal component time series are uncorrelated, they are not independent due to the non-Gaussian nature of the dataset. As a result, they cannot be directly associated with distinct sources or sinks. We also discussed how these sharp gradients in skewness are driven by long-term seasonal changes of CO.

The first unrotated EOF in Fig. 6a and the long-term pattern in Fig. 6d reflect the annual/semiannual cycle of CO. However, the temporal dependence among modes, non-stationarity of the time series, and our use of total column data may confound distinct variability patterns, making it difficult to attribute higher-order modes to specific sources or sinks. Future work should investigate CO statistics across vertical levels to determine whether time-independent modes can be identified and whether Gaussian assumptions are better satisfied.

As demonstrated in the autocorrelation plots (Fig. 10), the MOPITT CO time series exhibits wide-sense polycyclostationarity. Because CSEOF analysis is designed to handle time-evolving covariance structures of this type, we believe it is worthwhile to consider its utility to study patterns in MOPITT CO and other atmospheric data in the future. If CSEOF can successfully separate source-specific variability, methods such as Covariance Discrimination Analysis (CDA) can be utilized, which compare datasets based on covariance structures. This may enable the identification of statistical discrepancies between modeled and observed CO, and help attribute these to biases in source attribution, including fossil fuel combustion, biomass burning, and oxidation of CH₄ and hydrocarbons.

Finally, we acknowledge that these limitations stem from the highly diffusive nature of atmospheric transport, the reactive chemistry of CO and OH, and the limited resolution of column-averaged CO observations. These factors obscure the distinction between regional sources and broader background CO. We propose that a multi-species, multi-scale approach, leveraging synergies among diverse data sources and incorporating advanced methods – such as pattern recognition, dimensionality reduction, and machine learning – offers a promising path forward in attributing source signatures to physical processes. This is particularly relevant given the recent availability of high-resolution atmospheric composition datasets.

Appendix A

A1 Eigenvectors of Temporal and Spatial Covariance Matrices

Suppose F is an n×m matrix where measurements are taken over n points in time and over a spatial grid with m points. We define the m×m spatial covariance matrix as R=F^TF and the n×n temporal covariance matrix as L=FF^T. Note that here we do not normalize our covariance matrices using the sample size purely to make the rest of this discussion (and the subsequent discussion in Appendix A2) easier to follow. In the case that the number of gridpoints is very large and m≫n, it becomes computationally infeasible to perform an eigendecomposition on R. It is instead desirable to decompose the much smaller covariance matrix L and project the resulting PC time series to compute the EOF spatial patterns, which are eigenvectors of R. We begin by performing an eigendecomposition on the spatial covariance matrix R to yield the eigenvector matrix C, which corresponds to the EOF spatial patterns and satisfies:

\begin{matrix} (A1) & RC = C Λ \end{matrix}

Multiplying on the LHS yields:

\begin{matrix} (A2) & FRC = {FF}^{T} FC = LFC \end{matrix}

and multiplication by F on the RHS then gives us:

\begin{matrix} (A3) & L (FC) = (FC) Λ \end{matrix}

And defining A=FC we have:

\begin{matrix} (A4) & LA = A Λ \end{matrix}

which shows that A is the matrix of eigenvectors for L, while the eigenvalues Λ are the same as for R. We note that while the eigenvectors C of R are size m×m and represent a spatial pattern, the eigenvector matrix A of L is size n×n and represents the principal component (PC) time series constructed by projecting the original matrix F onto the EOF patterns C. We can reconstruct the original matrix F using the EOF spatial patterns and the un-normalized PC time series as:

\begin{matrix} (A5) & F = {AC}^{T} \end{matrix}

The fact that A is not normalized is apparent from the PCA property of R:

\begin{matrix} (A6) & \begin{aligned} A^{T} A & = {(FC)}^{T} FC = C^{T} F^{T} FC = C^{T} RC \\ = C^{T} C Λ {CC}^{T} = Λ \end{aligned} \end{matrix}

which shows us that $| | A^{T} A | | = | | Λ | | \neq I$ . We can therefore define a normalized PC time series Φ by dividing each term by $λ^{1 / 2}$ :

\begin{matrix} (A7) & Φ = FC Λ^{- 1 / 2} = A Λ^{- 1 / 2} \end{matrix}

then it follows that Φ is orthogonal:

\begin{matrix} (A8) & Φ^{T} Φ = {(A Λ^{- 1 / 2})}^{T} A Λ^{- 1 / 2} = (A^{T} A) Λ^{- 1} = Λ Λ^{- 1} = I \end{matrix}

This expression of Φ is important, because it allows us to express the singular value decomposition of the matrix F as:

\begin{matrix} (A9) & F = Φ Λ^{1 / 2} C^{T} = Φ Σ C^{T} \end{matrix}

If we use the singular value decomposition for F in the covariance matrices, then we recover the eigendecompositions for both R and L:

\begin{matrix} (A10) & R = F^{T} F = {(Φ Σ C^{T})}^{T} Φ Σ C^{T} = C Λ C^{T} \\ (A11) & L = {FF}^{T} = {(Φ Σ C^{T})}^{T} Φ Σ C^{T} = Φ Λ Φ^{T} \end{matrix}

so we see that orthogonal eigenvector matrices for R and L are the normalized EOF spatial patterns C and the PC time series Φ, respectively. Therefore, when n≫m and we cannot easily compute C, our strategy is to perform eigendecomposition on L to find Φ, and then determine C by rearranging the SVD expression:

\begin{matrix} (A12) & C = (F^{T} Φ) Λ^{- 1 / 2} = G Λ^{- 1 / 2} \end{matrix}

where G=F^TΦ. The original matrix F can then be reconstructed using the full rank expression for the SVD. If we are using correlation matrices so that F has zero mean and has been scaled by its standard deviation, the original matrix in physical space F^* is given by:

\begin{matrix} (A13) & F^{*} = F std (F) + mean (F) = (Φ Σ C^{T}) std (F) + mean (F) . \end{matrix}

A2 Equivalence of Determining EOFs and PCs Using Eigenvector Decomposition and SVD

In the case of eigenvector decomposition, we describe three cases in accordance with the size of F as shown in Table A1. When we have m≫n, so the spatial grid is large, then L=FF^T has size n×n, and as described before, we compute the eigendecomposition of L to determine the PC time series, which are the column vectors of Φ. In MATLAB, we would simply use $[Φ, Λ] = eig (L)$ to compute the normalized eigenvectors and the eigenvalues. We can then compute the matrix G and find the EOF spatial patterns, which are the column vectors of C (or the row vectors of C^T).

In the case that n≫m so the time points are large, then R=F^TF has size m×m and we first compute the EOF spatial patterns C which are the normalized eigenvectors of R, and are found in Matlab as $[C, Λ] = eig (R)$ . We then determine the PC time series by projecting the original matrix using Φ=FC.

When m≈n, then either method from above would be appropriate to compute the EOFs and PCs. It would also be appropriate (and equivalent) to compute the singular value decomposition of F using Eq. (5) which is determined in matlab as $[Φ, Σ, C] = svd (F)$ , where the singular values Σ are the square roots of the eigenvalues Λ. In this case, the left singular vectors are the normalized PC time series, and the right singular vectors are the normalized EOF spatial patterns.

Table A1Computation of EOFs and PCs shown in three cases depending on the size of F. When m≠n, it is appropriate to use eigenvector decomposition on whichever covariance matrix is smallest. When m≈n, eigenvector decomposition and SVD yield equivalent results.

Download Print Version | Download XLSX

A3 Estimate of N^* in Terms of an AR1 Process

The following is a derivation of N^*, which we use for the estimate for the effective sample size as discussed in Hartmann (2016). Red noise is characterized by the property that future observations are predicted using a linear combination of the previous observations plus a random noise term. Red noise has exponential autocorrelation, which decays according to:

\begin{matrix} (A14) & r (τ) = \exp (- τ / T) \end{matrix}

where τ is the lag time and T is the e-folding time (the time interval over which the autocorrelation drops to $1 / e$ ). For a first-order process, note that if we define the autocorrelation at one time step Δt to be:

\begin{matrix} (A15) & a = r (Δ t) = \exp (- Δ t / T) \end{matrix}

then the autocorrelation evaluated n time steps in the future can be written as:

\begin{matrix} (A16) & r_{n} = r (n Δ t) = (\exp (- Δ t / T))^{n} = a^{n} . \end{matrix}

We can express N^* in terms of the true sample size N as follows (Thiébaux and Zwiers, 1984):

\begin{matrix} (A17) & N^{*} = \frac{N}{\sum_{n = - N - 1}^{N + 1} (1 - \frac{| n |}{N}) r_{n}} . \end{matrix}

If we further assume that N is very large and that our process is AR1, Eq. (A17) can be written as:

\begin{matrix} (A18) & N^{*} = \frac{N}{\sum_{n = - \infty}^{\infty} r_{n}} = \frac{N}{1 + 2 (\sum_{n = 1}^{\infty} a^{n})} \end{matrix}

and we see the sum in the denominator of Eq. (A18) is a geometric series with $| a | < 1$ which converges to the value:

\begin{matrix} (A19) & \sum_{n = 1}^{\infty} a^{n} = a + a^{2} + \dots + = \frac{a}{1 - a} . \end{matrix}

Substituting Eq. (A19) into Eq. (A18) then yields our estimate for N^* given by Eq. (8) in our methods section. We reiterate that this estimation is only valid for a large number of independent samples and when we assume our time series is an AR1 stochastic process. This is because we rely on the property given by Eq. (A16) of AR1 processes and large N to both be true to guarantee that Eq. (A18) is valid. We also rely on N to be large for the sum in Eq. (A19) to be accurate.

A4 Jointly Gaussian Distributions Are Independent

In this section, we verify the fact that jointly Gaussian distributions have modes that are uncorrelated and independent. To see this fact, we note that in the case of 2 random variables, if X₁ and X₂ are uncorrelated and jointly Gaussian, then the associated covariance matrix is diagonal and the joint PDF takes the form:

\begin{matrix} (A20) & \begin{aligned} f_{X} & (x_{1}, x_{2}) = \frac{1}{\sqrt{(2 π)^{2} σ_{1}^{2} σ_{2}^{2}}} \exp \\ (- \frac{1}{2} (\frac{(x_{1} - μ_{1})^{2}}{σ_{1}^{2}} + \frac{(x_{2} - μ_{2})^{2}}{σ_{2}^{2}})) \end{aligned} \end{matrix}

which can be written as:

\begin{matrix} (A21) & \begin{aligned} f_{X} (x_{1}, x_{2}) & = \frac{1}{\sqrt{2 π σ_{1}^{2}}} \exp (- \frac{1}{2} (\frac{(x_{1} - μ_{1})^{2}}{σ_{1}^{2}})) \\ \times \frac{1}{\sqrt{2 π σ_{2}^{2}}} \exp (- \frac{1}{2} (\frac{(x_{2} - μ_{2})^{2}}{σ_{2}^{2}})) \end{aligned} \\ (A22) & = f_{X} (x_{1}) \times f_{X} (x_{2}) . \end{matrix}

It follows that X₁ and X₂ are independent.

Data availability

The MOPITT V8 joint TIR-NIR L3 datasets can be accessed at the following DOI: https://doi.org/10.5067/TERRA/MOPITT/MOP03J_L3.008 (NASA Langley Research Center Atmospheric Science Data Center, 2019).

Author contributions

Conceptualization: JMM, AFA Jr.; investigation: JMM; methodology: JMM, CR, AFA Jr.; formal analysis: JMM; data curation: JMM, CR; validation: JMM; visualization: JMM; supervision: AFA Jr.; writing (original draft preparation): JMM; writing (review and editing): JMM, BG, CR, AFA Jr.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Acknowledgements

This research work is supported by NASA ACMAP (grant no. 80NSSC19K0947).

Financial support

This research has been supported by the Earth Sciences Division (grant no. 80NSSC19K0947).

Review statement

This paper was edited by Peer Nowack and reviewed by Santtu Mikkonen and two anonymous referees.

References

Andela, N., Morton, D. C., Giglio, L., Chen, Y., van der Werf, G. R., Kasibhatla, P. S., DeFries, R. S., Collatz, G. J., Hantson, S., Kloster, S., Bachelet, D., Forrest, M., Lasslop, G., Li, F., Mangeon, S., Melton, J. R., Yue, C., and Randerson, J. T.: A human-driven decline in global burned area, Science, 356, 1356–1362, https://doi.org/10.1126/science.aal4108, 2017. a

Arneth, A., Monson, R. K., Schurgers, G., Niinemets, Ü., and Palmer, P. I.: Why are estimates of global terrestrial isoprene emissions so similar (and why is this not so for monoterpenes)?, Atmos. Chem. Phys., 8, 4605–4620, https://doi.org/10.5194/acp-8-4605-2008, 2008. a

Atkinson, R.: Atmospheric chemistry of VOCs and NOx, Atmospheric Environment, 34, 2063–2101, https://doi.org/10.1016/S1352-2310(99)00460-4, 2000. a

Barret, B., Loicq, P., Le Flochmoën, E., Bennouna, Y., Hadji-Lazaro, J., Hurtmans, D., and Sauvage, B.: Validation of 12 years (2008–2019) of IASI-A CO with IAGOS aircraft observations, Atmos. Meas. Tech., 18, 129–149, https://doi.org/10.5194/amt-18-129-2025, 2025. a

Bennani, M. and Braconnier, T.: Stopping criteria for eigensolvers, CERFACS, Toulouse, France, Tech. Rep. TR/PA/94/22, https://www.researchgate.net/profile/Thierry-Braconnier/publication/2793730_Stopping_Criteria_for_Eigensolvers/links/09e415136fcc9a48dd000000/Stopping-Criteria-for-Eigensolvers.pdf (last access: 7 April 2026), 1994. a

Björnsson, H. and Venegas, S. A.: A manual for EOF and SVD analyses of climate data, 1997. a

Borsdorff, T., Aan de Brugh, J., Hu, H., Aben, I., Hasekamp, O., and Landgraf, J.: Measuring Carbon Monoxide With TROPOMI: First Results and a Comparison With ECMWF-IFS Analysis Data, Geophysical Research Letters, 45, 2826–2832, https://doi.org/10.1002/2018GL077045, 2018. a

Borsdorff, T., aan de Brugh, J., Pandey, S., Hasekamp, O., Aben, I., Houweling, S., and Landgraf, J.: Carbon monoxide air pollution on sub-city scales and along arterial roads detected by the Tropospheric Monitoring Instrument, Atmos. Chem. Phys., 19, 3579–3588, https://doi.org/10.5194/acp-19-3579-2019, 2019. a

Bowman, K. P.: Transport of carbon monoxide from the tropics to the extratropics, Journal of Geophysical Research: Atmospheres, 111, https://doi.org/10.1029/2005JD006137, 2006. a

Bowman, K. P. and Cohen, P. J.: Interhemispheric exchange by seasonal modulation of the Hadley circulation, Journal of the Atmospheric Sciences, 54, 2045–2059, 1997. a, b

Buchholz, R. R., Deeter, M. N., Worden, H. M., Gille, J., Edwards, D. P., Hannigan, J. W., Jones, N. B., Paton-Walsh, C., Griffith, D. W. T., Smale, D., Robinson, J., Strong, K., Conway, S., Sussmann, R., Hase, F., Blumenstock, T., Mahieu, E., and Langerock, B.: Validation of MOPITT carbon monoxide using ground-based Fourier transform infrared spectrometer data from NDACC, Atmos. Meas. Tech., 10, 1927–1956, https://doi.org/10.5194/amt-10-1927-2017, 2017. a, b

Buchholz, R. R., Hammerling, D., Worden, H. M., Deeter, M. N., Emmons, L. K., Edwards, D. P., and Monks, S. A.: Links Between Carbon Monoxide and Climate Indices for the Southern Hemisphere and Tropical Fire Regions, Journal of Geophysical Research: Atmospheres, 123, 9786–9800, https://doi.org/10.1029/2018JD028438, 2018. a

Buchholz, R. R., Worden, H. M., Park, M., Francis, G., Deeter, M. N., Edwards, D. P., Emmons, L. K., Gaubert, B., Gille, J., Martínez-Alonso, S., Tang, W., Kumar, R., Drummond, J. R., Clerbaux, C., George, M., Coheur, P.-F., Hurtmans, D., Bowman, K. W., Luo, M., Payne, V. H., Worden, J. R., Chin, M., Levy, R. C., Warner, J., Wei, Z., and Kulawik, S. S.: Air pollution trends measured from Terra: CO and AOD over industrial, fire-prone, and background regions, Remote Sensing of Environment, 256, 112275, https://doi.org/10.1016/j.rse.2020.112275, 2021. a, b, c, d, e

Buell, C. E.: The topography of empirical orthogonal functions., in: Preprints Fourth Conf. on Prob. and Stats, in: Atmos. Sci. Tallahassee, FL, Amer. Meteor. Soc. 188., 1975. a

Buell, C. E.: The Number of Significant Proper Functions of Two-Dimensional Fields, Journal of Applied Meteorology and Climatology, 17, 717–722, https://doi.org/10.1175/1520-0450(1978)017<0717:TNOSPF>2.0.CO;2, 1978. a, b

Chen, S., Xu, L., Zhang, Y., Chen, B., Wang, X., Zhang, X., Zheng, M., Chen, J., Wang, W., Sun, Y., Fu, P., Wang, Z., and Li, W.: Direct observations of organic aerosols in common wintertime hazes in North China: insights into direct emissions from Chinese residential stoves, Atmos. Chem. Phys., 17, 1259–1270, https://doi.org/10.5194/acp-17-1259-2017, 2017. a

Cheng, M., Zhi, G., Tang, W., Liu, S., Dang, H., Guo, Z., Du, J., Du, X., Zhang, W., Zhang, Y., and Meng, F.: Air pollutant emission from the underestimated households' coal consumption source in China, Science of The Total Environment, 580, 641–650, https://doi.org/10.1016/j.scitotenv.2016.12.143, 2017. a

Cho, C. and Staelin, D. H.: Cloud clearing of Atmospheric Infrared Sounder hyperspectral infrared radiances using stochastic methods, Journal of Geophysical Research: Atmospheres, 111, https://doi.org/10.1029/2005JD006013, 2006. a

Cicerone, R.: The Changing Atmosphere cds. FS Rowland and ISA Isaksen, pp. 49-61 John Wiley & Sons Ltd. S. Bernhard, Dahlem Konferenzen, 1988, The Changing Atmosphere, 2, 49, ISBN 0471920479, 1988. a

Clerbaux, C., George, M., Turquety, S., Walker, K. A., Barret, B., Bernath, P., Boone, C., Borsdorff, T., Cammas, J. P., Catoire, V., Coffey, M., Coheur, P.-F., Deeter, M., De Mazière, M., Drummond, J., Duchatelet, P., Dupuy, E., de Zafra, R., Eddounia, F., Edwards, D. P., Emmons, L., Funke, B., Gille, J., Griffith, D. W. T., Hannigan, J., Hase, F., Höpfner, M., Jones, N., Kagawa, A., Kasai, Y., Kramer, I., Le Flochmoën, E., Livesey, N. J., López-Puertas, M., Luo, M., Mahieu, E., Murtagh, D., Nédélec, P., Pazmino, A., Pumphrey, H., Ricaud, P., Rinsland, C. P., Robert, C., Schneider, M., Senten, C., Stiller, G., Strandberg, A., Strong, K., Sussmann, R., Thouret, V., Urban, J., and Wiacek, A.: CO measurements from the ACE-FTS satellite instrument: data analysis and validation using ground-based, airborne and spaceborne observations, Atmos. Chem. Phys., 8, 2569–2594, https://doi.org/10.5194/acp-8-2569-2008, 2008. a

Clerbaux, C., Hadji-Lazaro, J., Turquety, S., George, M., Boynard, A., Pommier, M., Safieddine, S., Coheur, P.-F., Hurtmans, D., Clarisse, L., and Van Damme, M.: Tracking pollutants from space: Eight years of IASI satellite observation, Comptes Rendus Geoscience, 347, 134–144, https://doi.org/10.1016/j.crte.2015.06.001, 2015. a

Conte, L., Szopa, S., Séférian, R., and Bopp, L.: The oceanic cycle of carbon monoxide and its emissions to the atmosphere, Biogeosciences, 16, 881–902, https://doi.org/10.5194/bg-16-881-2019, 2019. a, b

Day, D. A. and Faloona, I.: Carbon monoxide and chromophoric dissolved organic matter cycles in the shelf waters of the northern California upwelling system, Journal of Geophysical Research: Oceans, 114, https://doi.org/10.1029/2007JC004590, 2009. a

Deeter, M. N., Edwards, D. P., Francis, G. L., Gille, J. C., Mao, D., Martínez-Alonso, S., Worden, H. M., Ziskin, D., and Andreae, M. O.: Radiance-based retrieval bias mitigation for the MOPITT instrument: the version 8 product, Atmos. Meas. Tech., 12, 4561–4580, https://doi.org/10.5194/amt-12-4561-2019, 2019. a

Dommenget, D.: Evaluating EOF modes against a stochastic null hypothesis, Climate Dynamics, 28, 517–531, https://doi.org/10.1007/s00382-006-0195-8, 2007. a

Drummond, J. R., Zou, J., Nichitiu, F., Kar, J., Deschambaut, R., and Hackett, J.: A review of 9-year performance and operation of the MOPITT instrument, Advances in Space Research, 45, 760–774, https://doi.org/10.1016/j.asr.2009.11.019, 2010. a

Du, W., Wang, J., Feng, Y., Duan, W., Wang, Z., Chen, Y., Zhang, P., and Pan, B.: Biomass as residential energy in China: Current status and future perspectives, Renewable and Sustainable Energy Reviews, 186, 113657, https://doi.org/10.1016/j.rser.2023.113657, 2023. a

Duncan, B. N., Logan, J. A., Bey, I., Megretskaia, I. A., Yantosca, R. M., Novelli, P. C., Jones, N., and Rinsland, C. P.: The global budget of CO, 1988-1997: source estimates and validation with a global model, Journal of Geophysical Research, 112, D22301-1–D22301-29, 2007. a, b

Eder, B. K., Davis, J. M., and Bloomfield, P.: A characterization of the spatiotemporal variability of non-urban ozone concentrations over the eastern United States, Atmospheric Environment. Part A. General Topics, 27, 2645–2668, https://doi.org/10.1016/0960-1686(93)90035-W, 1993. a

Edwards, D. P., Emmons, L. K., Hauglustaine, D. A., Chu, D. A., Gille, J. C., Kaufman, Y. J., Pétron, G., Yurganov, L. N., Giglio, L., Deeter, M. N., Yudin, V., Ziskin, D. C., Warner, J., Lamarque, J.-F., Francis, G. L., Ho, S. P., Mao, D., Chen, J., Grechko, E. I., and Drummond, J. R.: Observations of carbon monoxide and aerosols from the Terra satellite: Northern Hemisphere variability, Journal of Geophysical Research: Atmospheres, 109, https://doi.org/10.1029/2004JD004727, 2004. a, b

Edwards, D. P., Emmons, L. K., Gille, J. C., Chu, A., Attié, J.-L., Giglio, L., Wood, S. W., Haywood, J., Deeter, M. N., Massie, S. T., Ziskin, D. C., and Drummond, J. R.: Satellite-observed pollution from Southern Hemisphere biomass burning, J. Geophys. Res.-Atmos., 111, https://doi.org/10.1029/2005JD006655, 2006. a, b

Espinosa, F., Bartolomé, A. B., Hernández, P. V., and Rodriguez-Sanchez, M. C.: Contribution of Singular Spectral Analysis to Forecasting and Anomalies Detection of Indoors Air Quality, Sensors, 22, 3054, https://doi.org/10.3390/s22083054, 2022. a

Feng, S., Jiang, F., Wu, Z., Wang, H., Ju, W., and Wang, H.: CO Emissions Inferred From Surface CO Observations Over China in December 2013 and 2017, Journal of Geophysical Research: Atmospheres, 125, e2019JD031808, https://doi.org/10.1029/2019JD031808, 2020. a

Fiore, A. M., Naik, V., Spracklen, D. V., Steiner, A., Unger, N., Prather, M., Bergmann, D., Cameron-Smith, P. J., Cionni, I., Collins, W. J., Dalsøren, S., Eyring, V., Folberth, G. A., Ginoux, P., Horowitz, L. W., Josse, B., Lamarque, J.-F., MacKenzie, I. A., Nagashima, T., O'Connor, F. M., Righi, M., Rumbold, S. T., Shindell, D. T., Skeie, R. B., Sudo, K., Szopa, S., Takemura, T., and Zeng, G.: Global air quality and climate, Chemical Society Reviews, 41, 6663–6683, https://doi.org/10.1039/C2CS35095E, 2012. a

Fiore, A. M., Mickley, L. J., Zhu, Q., and Baublitz, C. B.: Climate and Tropospheric Oxidizing Capacity, Annual Review of Earth and Planetary Sciences, 52, 321–349, https://doi.org/10.1146/annurev-earth-032320-090307, 2024. a

Fisher, J. A., Wilson, S. R., Zeng, G., Williams, J. E., Emmons, L. K., Langenfelds, R. L., Krummel, P. B., and Steele, L. P.: Seasonal changes in the tropospheric carbon monoxide profile over the remote Southern Hemisphere evaluated using multi-model simulations and aircraft observations, Atmos. Chem. Phys., 15, 3217–3239, https://doi.org/10.5194/acp-15-3217-2015, 2015. a, b

Folberth, G. A., Hauglustaine, D. A., Lathière, J., and Brocheton, F.: Interactive chemistry in the Laboratoire de Météorologie Dynamique general circulation model: model description and impact analysis of biogenic hydrocarbons on tropospheric chemistry, Atmos. Chem. Phys., 6, 2273–2319, https://doi.org/10.5194/acp-6-2273-2006, 2006. a

Gaubert, B., Arellano Jr., A. F., Barré, J., Worden, H. M., Emmons, L. K., Tilmes, S., Buchholz, R. R., Vitt, F., Raeder, K., Collins, N., Anderson, J. L., Wiedinmyer, C., Martinez Alonso, S., Edwards, D. P., Andreae, M. O., Hannigan, J. W., Petri, C., Strong, K., and Jones, N.: Toward a chemical reanalysis in a coupled chemistry-climate model: An evaluation of MOPITT CO assimilation and its impact on tropospheric composition, Journal of Geophysical Research: Atmospheres, 121, 7310–7343, https://doi.org/10.1002/2016JD024863, 2016. a, b

Gaubert, B., Worden, H. M., Arellano, A. F. J., Emmons, L. K., Tilmes, S., Barré, J., Martinez Alonso, S., Vitt, F., Anderson, J. L., Alkemade, F., Houweling, S., and Edwards, D. P.: Chemical Feedback From Decreasing Carbon Monoxide Emissions, Geophysical Research Letters, 44, 9985–9995, 2017. a, b, c

Gaubert, B., Emmons, L. K., Raeder, K., Tilmes, S., Miyazaki, K., Arellano Jr., A. F., Elguindi, N., Granier, C., Tang, W., Barré, J., Worden, H. M., Buchholz, R. R., Edwards, D. P., Franke, P., Anderson, J. L., Saunois, M., Schroeder, J., Woo, J.-H., Simpson, I. J., Blake, D. R., Meinardi, S., Wennberg, P. O., Crounse, J., Teng, A., Kim, M., Dickerson, R. R., He, H., Ren, X., Pusede, S. E., and Diskin, G. S.: Correcting model biases of CO in East Asia: impact on oxidant distributions during KORUS-AQ, Atmos. Chem. Phys., 20, 14617–14647, https://doi.org/10.5194/acp-20-14617-2020, 2020. a, b, c, d, e

Gaubert, B., Edwards, D. P., Anderson, J. L., Arellano, A. F., Barré, J., Buchholz, R. R., Darras, S., Emmons, L. K., Fillmore, D., Granier, C., Hannigan, J., Ortega, I., Raeder, K., Soulie, Tang, W., Worden, H., and Ziskin, D.: Global scale inversions from MOPITT CO and MODIS AOD, Remote Sensing, 15, 4813, https://doi.org/10.3390/rs15194813, 2023. a

George, M., Clerbaux, C., Bouarar, I., Coheur, P.-F., Deeter, M. N., Edwards, D. P., Francis, G., Gille, J. C., Hadji-Lazaro, J., Hurtmans, D., Inness, A., Mao, D., and Worden, H. M.: An examination of the long-term CO records from MOPITT and IASI: comparison of retrieval methodology, Atmos. Meas. Tech., 8, 4313–4328, https://doi.org/10.5194/amt-8-4313-2015, 2015. a

Golyandina, N. and Zhigljavsky, A.: Singular Spectrum Analysis for Time Series, SpringerBriefs in Statistics, https://doi.org/10.1007/978-3-642-34913-3_2, 2013. a

Gong, D. and Wang, S.: Definition of Antarctic oscillation index, Geophysical Research Letters, 26, 459–462, 1999. a

Granier, C., Müller, J., Pétron, G., and Brasseur, G.: A three-dimensional study of the global CO budget, Chemosphere – Global Change Science, 1, 255–261, https://doi.org/10.1016/S1465-9972(99)00007-0, 1999. a

Greenstone, M., He, G., Li, S., and Zou, E. Y.: China's War on Pollution: Evidence from the First 5 Years, Review of Environmental Economics and Policy, 15, 281–299, https://doi.org/10.1086/715550, 2021. a, b

Gruszczynska, M., Rosat, S., Klos, A., Gruszczynski, M., and Bogusz, J.: Multichannel Singular Spectrum Analysis in the Estimates of Common Environmental Effects Affecting GPS Observations, in: Geodynamics and Earth Tides Observations from Global to Micro Scale, edited by: Braitenberg, C., Rossi, G., and Geodynamics and Earth Tides Editor group, 211–228, ISBN 978-3-319-96277-1, https://doi.org/10.1007/978-3-319-96277-1_17, 2019. a

Gualtieri, G., Ahbil, K., Brilli, L., Carotenuto, F., Cavaliere, A., Gioli, B., Giordano, T., Katiellou, G. L., Mouhaimini, M., Tarchiani, V., Vagnoli, C., Zaldei, A., and Bacci, M.: Potential of low-cost PM monitoring sensors to fill monitoring gaps in areas of Sub-Saharan Africa, Atmospheric Pollution Research, 15, 102158, https://doi.org/10.1016/j.apr.2024.102158, 2024. a

Han, S., Zhang, Y., Wu, J., Zhang, X., Tian, Y., Wang, Y., Ding, J., Yan, W., Bi, X., Shi, G., Cai, Z., Yao, Q., Huang, H., and Feng, Y.: Evaluation of regional background particulate matter concentration based on vertical distribution characteristics, Atmos. Chem. Phys., 15, 11165–11177, https://doi.org/10.5194/acp-15-11165-2015, 2015. a

Hannachi, A.: Patterns Identification and Data Mining in Weather and Climate, Springer Atmospheric Sciences, Springer International Publishing, ISBN 978-3-030-67072-6 978-3-030-67073-3, https://doi.org/10.1007/978-3-030-67073-3, 2021. a, b, c, d, e

Hannachi, A., Jolliffe, I. T., and Stephenson, D. B.: Empirical orthogonal functions and related techniques in atmospheric science: A review, International Journal of Climatology, 27, 1119–1152, https://doi.org/10.1002/joc.1499, 2007. a, b, c

Hartmann, D.: ATM S 552 notes, https://atmos.uw.edu/~dennis/552_Notes_ftp.html (last access: 20 November 2025), 2016. a, b

Hassani, H.: Singular Spectrum Analysis: Methodology and Comparison, Journal of Data Science, 5, 239–257, https://doi.org/10.6339/JDS.2007.05(2).396, 2007. a

Hendrikse, A., Spreeuwers, L., and Veldhuis, R.: A bootstrap approach to eigenvalue correction, in: 2009 Ninth IEEE International Conference on Data Mining, 818–823, IEEE, https://doi.org/10.1109/ICDM.2009.111, 2009. a, b

Holloway, T., Levy II, H., and Kasibhatla, P.: Global distribution of carbon monoxide, Journal of Geophysical Research: Atmospheres, 105, 12123–12147, https://doi.org/10.1029/1999JD901173, 2000. a, b, c, d, e, f, g

Holzke, C., Hoffmann, T., Jaeger, L., Koppmann, R., and Zimmer, W.: Diurnal and seasonal variation of monoterpene and sesquiterpene emissions from Scots pine (Pinus sylvestris L.), Atmospheric Environment, 40, 3174–3185, https://doi.org/10.1016/j.atmosenv.2006.01.039, 2006. a

Hooghiem, J. J., Gromov, S., Kivi, R., Popa, M. E., Röckmann, T., and Chen, H.: Isotopic source signatures of stratospheric CO inferred from in situ vertical profiles, npj Climate and Atmospheric Science, 8, 110, 2025. a

Horel, J. D.: A Rotated Principal Component Analysis of the Interannual Variability of the Northern Hemisphere 500 mb Height Field, Monthly Weather Review, 109, 2080–2092, https://doi.org/10.1175/1520-0493(1981)109<2080:ARPCAO>2.0.CO;2, 1981. a, b

Isokääntä, S., Kari, E., Buchholz, A., Hao, L., Schobesberger, S., Virtanen, A., and Mikkonen, S.: Comparison of dimension reduction techniques in the analysis of mass spectrometry data, Atmos. Meas. Tech., 13, 2995–3022, https://doi.org/10.5194/amt-13-2995-2020, 2020. a, b

Ji, X., Zhao, M.-L., Ni, J., Xu, G.-B., Zhang, J., and Yang, G.-P.: Distribution, emission, and cycling processes of carbon monoxide in the tropical open ocean, Marine Chemistry, 268, 104482, https://doi.org/10.1016/j.marchem.2024.104482, 2025. a, b

Jia, M., Jiang, F., Evangeliou, N., Eckhardt, S., Stohl, A., Ding, A., Huang, X., Feng, S., He, W., Wang, J., Hengmao, W., and Mousong, W., and Weimin, J.: Anthropogenic carbon monoxide emissions during 2014-2020 in China constrained by in-situ observations, in: AGU Fall Meeting Abstracts, 2024, A43R–04, 2024AGUFMA43R...04J, 2024. a

Jin, X., Fiore, A. M., Murray, L. T., Valin, L. C., Lamsal, L. N., Duncan, B., Folkert Boersma, K., De Smedt, I., Abad, G. G., Chance, K., and Tonnesen, G. S.: Evaluating a Space-Based Indicator of Surface Ozone-NOx-VOC Sensitivity Over Midlatitude Source Regions and Application to Decadal Trends, Journal of Geophysical Research: Atmospheres, 122, 10,439–10,461, https://doi.org/10.1002/2017JD026720, 2017. a

Jones, D. B., Bowman, K. W., Palmer, P. I., Worden, J. R., Jacob, D. J., Hoffman, R. N., Bey, I., and Yantosca, R. M.: Potential of observations from the Tropospheric Emission Spectrometer to constrain continental sources of carbon monoxide, Journal of Geophysical Research: Atmospheres, 108, https://doi.org/10.1029/2003JD003702, 2003. a

Kaiser, H.: The Varimax Criterion For Analytic Rotation in Factor Analysis, Psychometrika, 23, 187–200, https://doi.org/10.1007/BF02289233, 1958. a, b

Kao, X., Liu, Y., Wang, W., Wen, Q., and Zhang, P.: The pressure of coal consumption on China's carbon dioxide emissions: A spatial and temporal perspective, Atmospheric Pollution Research, 15, 102188, https://doi.org/10.1016/j.apr.2024.102188, 2024. a

Kim, K., Hamlington, B., and Na, H.: Theoretical foundation of cyclostationary EOF analysis for geophysical and climatic variables: Concepts and examples, Earth-Science Reviews, 150, 201–218, 2015. a

Kim, K. Y. and North, G. R.: EOF Analysis of Surface Temperature Field in a Stochastic Climate Model, Journal of Climate, 6, 1681–1690, https://doi.org/10.1175/1520-0442(1993)006<1681:EAOSTF>2.0.CO;2, 1993. a

Kong, L., Tang, X., Zhu, J., Wang, Z., Fu, J. S., Wang, X., Itahashi, S., Yamaji, K., Nagashima, T., Lee, H.-J., Kim, C.-H., Lin, C.-Y., Chen, L., Zhang, M., Tao, Z., Li, J., Kajino, M., Liao, H., Wang, Z., Sudo, K., Wang, Y., Pan, Y., Tang, G., Li, M., Wu, Q., Ge, B., and Carmichael, G. R.: Evaluation and uncertainty investigation of the NO2, CO and NH3 modeling over China under the framework of MICS-Asia III, Atmos. Chem. Phys., 20, 181–202, https://doi.org/10.5194/acp-20-181-2020, 2020. a

Krishnamurti, T. N., Sinha, M. C., Kanamitsu, M., Oosterhof, D., Fuelberg, H., Chatfield, R., Jacob, D. J., and Logan, J.: Passive tracer transport relevant to the TRACE A experiment, Journal of Geophysical Research: Atmospheres, 101, 23889–23907, https://doi.org/10.1029/95JD02419, 1996. a

Kucharski, F. and Joshi, M. K.: Influence of tropical South Atlantic sea-surface temperatures on the Indian summer monsoon in CMIP5 models, Quarterly Journal of the Royal Meteorological Society, 143, 1351–1363, 2017. a

Kuijlaars, A. B.: Convergence analysis of Krylov subspace iterations with methods from potential theory, SIAM Review, 48, 3–40, 2006. a

Lehr, C. and Hohenbrink, T. L.: Technical note: An illustrative introduction to the domain dependence of spatial principal component patterns, Hydrol. Earth Syst. Sci., 29, 6735–6760, https://doi.org/10.5194/hess-29-6735-2025, 2025. a

Li, J., Carlson, B. E., and Lacis, A. A.: A study on the temporal and spatial variability of absorbing aerosols using Total Ozone Mapping Spectrometer and Ozone Monitoring Instrument Aerosol Index data, Journal of Geophysical Research: Atmospheres, 114, https://doi.org/10.1029/2008JD011278, 2009. a

Li, J., Carlson, B. E., and Lacis, A. A.: Application of spectral analysis techniques in the intercomparison of aerosol data: 1. An EOF approach to analyze the spatial-temporal variability of aerosol optical depth using multiple remote sensing data sets, Journal of Geophysical Research: Atmospheres, 118, 8640–8648, https://doi.org/10.1002/jgrd.50686, 2013. a

Li, M., Zhang, Q., Kurokawa, J.-I., Woo, J.-H., He, K., Lu, Z., Ohara, T., Song, Y., Streets, D. G., Carmichael, G. R., Cheng, Y., Hong, C., Huo, H., Jiang, X., Kang, S., Liu, F., Su, H., and Zheng, B.: MIX: a mosaic Asian anthropogenic emission inventory under the international collaboration framework of the MICS-Asia and HTAP, Atmos. Chem. Phys., 17, 935–963, https://doi.org/10.5194/acp-17-935-2017, 2017. a

Li, Z., Zhao, K., Yuan, X., Zhou, Y., Yang, L., and Geng, H.: Evolution and Control of Air Pollution in China over the Past 75 Years: An Analytical Framework Based on the Multi-Dimensional Urbanization, Atmosphere, 15, https://doi.org/10.3390/atmos15091093, 2024. a

Lichtig, P., Gaubert, B., Emmons, L. K., Jo, D. S., Callaghan, P., Ibarra-Espinosa, S., Dawidowski, L., Brasseur, G. P., and Pfister, G.: Multiscale CO budget estimates across South America: quantifying local sources and long range transport, Journal of Geophysical Research: Atmospheres, 129, e2023JD040434, https://doi.org/10.1029/2023JD040434, 2024. a

Lin, C., Cohen, J. B., Wang, S., and Lan, R.: Application of a combined standard deviation and mean based approach to MOPITT CO column data, and resulting improved representation of biomass burning and urban air pollution sources, Remote Sensing of Environment, 241, 111720, https://doi.org/10.1016/j.rse.2020.111720, 2020a. a

Lin, C., Cohen, J. B., Wang, S., Lan, R., and Deng, W.: A new perspective on the spatial, temporal, and vertical distribution of biomass burning: quantifying a significant increase in CO emissions, Environmental Research Letters, 15, 104091, https://doi.org/10.1088/1748-9326/abaa7a, 2020b. a

Macias, D., Stips, A., and Garcia-Gorriz, E.: Application of the Singular Spectrum Analysis Technique to Study the Recent Hiatus on the Global Surface Temperature Record, PLOS ONE, 9, 1–7, https://doi.org/10.1371/journal.pone.0107222, 2014. a

Malings, C., Westervelt, D. M., Hauryliuk, A., Presto, A. A., Grieshop, A., Bittner, A., Beekmann, M., and R. Subramanian: Application of low-cost fine particulate mass monitors to convert satellite aerosol optical depth to surface concentrations in North America and Africa, Atmos. Meas. Tech., 13, 3873–3892, https://doi.org/10.5194/amt-13-3873-2020, 2020. a

Martínez-Alonso, S., Deeter, M., Worden, H., Borsdorff, T., Aben, I., Commane, R., Daube, B., Francis, G., George, M., Landgraf, J., Mao, D., McKain, K., and Wofsy, S.: 1.5 years of TROPOMI CO measurements: comparisons to MOPITT and ATom, Atmos. Meas. Tech., 13, 4841–4864, https://doi.org/10.5194/amt-13-4841-2020, 2020. a

McMillan, W. W., Evans, K. D., Barnet, C. D., Maddy, E. S., Sachse, G. W., and Diskin, G. S.: Validating the AIRS Version 5 CO Retrieval With DACOM In Situ Measurements During INTEX-A and -B, IEEE Transactions on Geoscience and Remote Sensing, 49, 2802–2813, https://doi.org/10.1109/TGRS.2011.2106505, 2011. a

Menichini, E., Iacovella, N., Monfredini, F., and Turrio-Baldassarri, L.: Atmospheric pollution by PAHs, PCDD/Fs and PCBs simultaneously collected at a regional background site in central Italy and at an urban site in Rome, Chemosphere, 69, 422–434, https://doi.org/10.1016/j.chemosphere.2007.04.078, 2007. a

Miyazaki, K., Bowman, K., Sekiya, T., Eskes, H., Boersma, F., Worden, H., Livesey, N., Payne, V. H., Sudo, K., Kanaya, Y., Takigawa, M., and Ogochi, K.: Updated tropospheric chemistry reanalysis and emission estimates, TCR-2, for 2005–2018, Earth Syst. Sci. Data, 12, 2223–2259, https://doi.org/10.5194/essd-12-2223-2020, 2020. a

Monahan, A. H., Fyfe, J. C., Ambaum, M. H. P., Stephenson, D. B., and North, G. R.: Empirical Orthogonal Functions: The Medium is the Message, Journal of Climate, 22, 6501–6514, https://doi.org/10.1175/2009JCLI3062.1, 2009. a, b, c

Montano, V. and Jombart, T.: An eigenvalue test for Spatial Principal Component analysis, BMC Bioinformatics, 18, https://doi.org/10.1186/s12859-017-1988-y, 2017. a

Mottungan, K., Roychoudhury, C., Brocchi, V., Gaubert, B., Tang, W., Mirrezaei, M. A., McKinnon, J., Guo, Y., Griffith, D. W. T., Feist, D. G., Morino, I., Sha, M. K., Dubey, M. K., De Mazière, M., Deutscher, N. M., Wennberg, P. O., Sussmann, R., Kivi, R., Goo, T.-Y., Velazco, V. A., Wang, W., and Arellano Jr., A. F.: Local and regional enhancements of CH4, CO, and CO2 inferred from TCCON column measurements, Atmos. Meas. Tech., 17, 5861–5885, https://doi.org/10.5194/amt-17-5861-2024, 2024. a

Moura, P., Raposo, M., and Vassilenko, V.: Breath volatile organic compounds (VOCs) as biomarkers for the diagnosis of pathological conditions: A review, Biomedical Journal, 46, https://doi.org/10.1016/j.bj.2023.100623, 2023. a

Murayama, S., Taguchi, S., and Higuchi, K.: Interannual Variation in the Atmospheric CO2 Growth Rate: Role of Atmospheric Transport in the Northern Hemisphere, Journal of Geophysical Research Atmospheres, https://doi.org/10.1029/2003jd003729, 2004. a

Myhre, G. and Shindell, D.: Anthropogenic and Natural Radiative Forcing, in: Climate Change 2013: The Physical Science Basis. Contribution of Working Group I to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change, Cambridge University Press, 659–740, https://doi.org/10.1017/CBO9781107415324.018, 2014. a

Naik, V., Voulgarakis, A., Fiore, A. M., Horowitz, L. W., Lamarque, J.-F., Lin, M., Prather, M. J., Young, P. J., Bergmann, D., Cameron-Smith, P. J., Cionni, I., Collins, W. J., Dalsøren, S. B., Doherty, R., Eyring, V., Faluvegi, G., Folberth, G. A., Josse, B., Lee, Y. H., MacKenzie, I. A., Nagashima, T., van Noije, T. P. C., Plummer, D. A., Righi, M., Rumbold, S. T., Skeie, R., Shindell, D. T., Stevenson, D. S., Strode, S., Sudo, K., Szopa, S., and Zeng, G.: Preindustrial to present-day changes in tropospheric hydroxyl radical and methane lifetime from the Atmospheric Chemistry and Climate Model Intercomparison Project (ACCMIP), Atmos. Chem. Phys., 13, 5277–5298, https://doi.org/10.5194/acp-13-5277-2013, 2013. a

Nalli, N. R., Tan, C., Warner, J., Divakarla, M., Gambacorta, A., Wilson, M., Zhu, T., Wang, T., Wei, Z., Pryor, K., Kalluri, S., Zhou, L., Sweeney, C., Baier, B. C., McKain, K., Wunch, D., Deutscher, N. M., Hase, F., Iraci, L. T., Kivi, R., Morino, I., Notholt, J., Ohyama, H., Pollard, D. F., Té, Y., Velazco, V. A., Warneke, T., Sussmann, R., and Rettinger, M.: Validation of Carbon Trace Gas Profile Retrievals from the NOAA-Unique Combined Atmospheric Processing System for the Cross-Track Infrared Sounder, Remote Sensing, 12, https://doi.org/10.3390/rs12193245, 2020. a

NASA Langley Research Center Atmospheric Science Data Center: MOPITT CO gridded daily means (Near and Thermal Infrared Radiances) V008, [data set], https://doi.org/10.5067/TERRA/MOPITT/MOP03J_L3.008, 2019. a

Nguyen, N. H., Turner, A. J., Yin, Y., Prather, M. J., and Frankenberg, C.: Effects of Chemical Feedbacks on Decadal Methane Emissions Estimates, Geophysical Research Letters, 47, e2019GL085706, https://doi.org/10.1029/2019GL085706, 2020. a

North, G. R., Bell, T. L., Cahalan, R. F., and Moeng, F. J.: Sampling Errors in the Estimation of Empirical Orthogonal Functions, Monthly Weather Review, 110, 699–706, https://doi.org/10.1175/1520-0493(1982)110<0699:SEITEO>2.0.CO;2, 1982. a, b, c

Novelli, P. C.: CO in the atmosphere: measurement techniques and related issues, Chemosphere – Global Change Science, 1, 115–126, https://doi.org/10.1016/S1465-9972(99)00013-6, 1999. a

Pires, J., Sousa, S., Pereira, M., Alvim-Ferraz, M., and Martins, F.: Management of air quality monitoring using principal component and cluster analysis–Part II: CO, NO2 and O3, Atmospheric Environment, 42, 1261–1274, https://doi.org/10.1016/j.atmosenv.2007.10.041, 2008. a

Prather, M.: Lifetimes and time scales in atmospheric chemistry, Philosophical Transactions Of the Royal Society, 365, 1705–1726, 2007. a

Prather, M. J.: Time scales in atmospheric chemistry: Theory, GWPs for CH4 and CO, and runaway growth, Geophysical Research Letters, 23, 2597–2600, https://doi.org/10.1029/96GL02371, 1996. a

Prather, M. J. and Holmes, C. D.: Overexplaining or underexplaining methane’s role in climate change, Proceedings of the National Academy of Sciences, 114, 5324–5326, https://doi.org/10.1073/pnas.1704884114, 2017. a, b

Raman, A. and Arellano, A. F. J.: Spatial and Temporal Variations in Characteristic Ratios of Elemental Carbon to Carbon Monoxide and Nitrogen Oxides across the United States, Environmental Science & Technology, 51, 6829–6838, https://doi.org/10.1021/acs.est.7b00161, 2017. a

Richman, M. B.: Rotation of principal components, Journal of Climatology, 6, 293–335, https://doi.org/10.1002/joc.3370060305, 1986. a, b, c, d, e

Rinsland, C. P., Luo, M., Logan, J. A., Beer, R., Worden, H., Kulawik, S. S., Rider, D., Osterman, G., Gunson, M., Eldering, A., Goldman, A., Shephard, M., Clough, S., Rodgers, C., Lampel, M., and Chiou, L.: Nadir measurements of carbon monoxide distributions by the Tropospheric Emission Spectrometer instrument onboard the Aura Spacecraft: Overview of analysis approach and examples of initial results, Geophysical Research Letters, 33, https://doi.org/10.1029/2006GL027000, 2006. a

Saji, N.: Possible impacts of Indian Ocean dipole mode events on global climate, Climate Research, 25, 151–169, 2003. a

Sarda-Esteve A. R. and Bonsang A. B.: Carbon monoxide emissions by phytoplankton: evidence from laboratory experiments, Environmental Chemistry, 6, 369–379, https://doi.org/10.1071/EN09020, 2009. a

Saunois, M., Jackson, R. B., Bousquet, P., Poulter, B., and Canadell, J. G.: The growing role of methane in anthropogenic climate change, Environmental Research Letters, 11, 120207, https://doi.org/10.1088/1748-9326/11/12/120207, 2016. a

Sharkey, T. and Yeh, S.: Isoprene emission from plants, Annual Review of Plant Physiology and Plant Molecular Biology, 52, 407–436, 2001. a

Shindell, D. T., Faluvegi, G., Stevenson, D. S., Krol, M. C., Emmons, L. K., Lamarque, J.-F., Pétron, G., Dentener, F. J., Ellingsen, K., Schultz, M. G., Wild, O., Amann, M., Atherton, C. S., Bergmann, D. J., Bey, I., Butler, T., Cofala, J., Collins, W. J., Derwent, R. G., Doherty, R. M., Drevet, J., Eskes, H. J., Fiore, A. M., Gauss, M., Hauglustaine, D. A., Horowitz, L. W., Isaksen, I. S. A., Lawrence, M. G., Montanaro, V., Müller, J.-F., Pitari, G., Prather, M. J., Pyle, J. A., Rast, S., Rodriguez, J. M., Sanderson, M. G., Savage, N. H., Strahan, S. E., Sudo, K., Szopa, S., Unger, N., van Noije, T. P. C., and Zeng, G.: Multimodel simulations of carbon monoxide: Comparison with observations and projected near-future changes, Journal of Geophysical Research: Atmospheres, 111, https://doi.org/10.1029/2006JD007100, 2006. a

Sillman, S., Logan, J. A., and Wofsy, S. C.: The sensitivity of ozone to nitrogen oxides and hydrocarbons in regional ozone episodes, Journal of Geophysical Research: Atmospheres, 95, 1837–1851, https://doi.org/10.1029/JD095iD02p01837, 1990. a

Silva, S. J. and Arellano, A. F.: Characterizing Regional-Scale Combustion Using Satellite Retrievals of CO, NO2 and CO2, Remote Sensing, 9, 744, https://doi.org/10.3390/rs9070744, 2017. a

Silva, S. J., Arellano, A. F., and Worden, H. M.: Toward anthropogenic combustion emission constraints from space-based analysis of urban CO2/CO sensitivity, Geophysical Research Letters, 40, https://doi.org/10.1002/grl.50954, 2013. a

Simmons, A. J., Wallace, J. M., and Branstator, G. W.: Barotropic Wave Propagation and Instability, and Atmospheric Teleconnection Patterns, Journal of Atmospheric Sciences, 40, 1363–1392, https://doi.org/10.1175/1520-0469(1983)040<1363:BWPAIA>2.0.CO;2, 1983. a

Skelton, A., Kirchner, N., and Kockum, I.: Skewness of Temperature Data Implies an Abrupt Change in the Climate System Between 1985 and 1991, Geophysical Research Letters, 47, e2020GL089794, https://doi.org/10.1029/2020GL089794, 2020. a

Stubbins, A., Uher, G., Law, C. S., Mopper, K., Robinson, C., and Upstill-Goddard, R. C.: Open-ocean carbon monoxide photoproduction, Deep Sea Research Part II: Topical Studies in Oceanography, 53, 1695–1705, https://doi.org/10.1016/j.dsr2.2006.05.011, 2006. a

Tang, W. and Arellano Jr., A. F.: Investigating dominant characteristics of fires across the Amazon during 2005–2014 through satellite data synthesis of combustion signatures, Journal of Geophysical Research: Atmospheres, 122, 1224–1245, https://doi.org/10.1002/2016JD025216, 2017. a

Tang, W., Arellano, A. F., Gaubert, B., Miyazaki, K., and Worden, H. M.: Satellite data reveal a common combustion emission pathway for major cities in China, Atmos. Chem. Phys., 19, 4269–4288, https://doi.org/10.5194/acp-19-4269-2019, 2019. a

Tang, Z., Chen, J., and Jiang, Z.: Discrepancy in assimilated atmospheric CO over East Asia in 2015–2020 by assimilating satellite and surface CO measurements, Atmos. Chem. Phys., 22, 7815–7826, https://doi.org/10.5194/acp-22-7815-2022, 2022. a

Thiébaux, H. J. and Zwiers, F. W.: The Interpretation and Estimation of Effective Sample Size, Journal of Applied Meteorology and Climatology, 23, 800–811, https://doi.org/10.1175/1520-0450(1984)023<0800:TIAEOE>2.0.CO;2, 1984. a

Tipping, M. E. and Bishop, C. M.: Probabilistic Principal Component Analysis, Journal of the Royal Statistical Society Series B: Statistical Methodology, 61, 611–622, https://doi.org/10.1111/1467-9868.00196, 2002. a, b

Toumazou, V. and Cretaux, J.-F.: Using a Lanczos Eigensolver in the Computation of Empirical Orthogonal Functions, Monthly Weather Review, 129, 1243–1250, https://doi.org/10.1175/1520-0493(2001)129<1243:UALEIT>2.0.CO;2, 2001. a

Trefethen, L. N. and Bau, D.: Numerical linear algebra, SIAM, 2022. a, b

Trenberth, K. E.: The definition of el nino, Bulletin of the American Meteorological Society, 78, 2771–2778, 1997. a

Turner, A. J., Frankenberg, C., and Kort, E. A.: Interpreting contemporary trends in atmospheric methane, Proceedings of the National Academy of Sciences, 116, 2805–2813, https://doi.org/10.1073/pnas.1814297116, 2019. a

Van Loan, C. F. and Golub, G.: Matrix computations (Johns Hopkins studies in mathematical sciences), Matrix Computations, 5, 32, ISBN 0-8018-5414-8, 1996. a

Vimont, I. J., Turnbull, J. C., Petrenko, V. V., Place, P. F., Sweeney, C., Miles, N., Richardson, S., Vaughn, B. H., and White, J. W. C.: An improved estimate for the δ13C and δ18O signatures of carbon monoxide produced from atmospheric oxidation of volatile organic compounds, Atmos. Chem. Phys., 19, 8547–8562, https://doi.org/10.5194/acp-19-8547-2019, 2019. a

Von Hobe, M., Cutter, G. A., Kettle, A. J., and Andreae, M. O.: Dark production: A significant source of oceanic COS, Journal of Geophysical Research: Oceans, 106, 31217–31226, 2001. a

von Storch, H. and Zwiers, F.: Statistical Analysis in Climate Research, Cambridge University Press, virtual edn., https://doi.org/10.1175/1520-0493(1981)109<0784:TITGHF>2.0.CO;2, 2003. a

Wallace, J. and Gutzler, D.: Teleconnections in the Geopotential Height Field during the Northern Hemisphere Winter, Monthly Weather Review, 109, 784–812, 1981. a

Wang, P., Elansky, N., Timofeev, Y. M., Wang, G., Golitsyn, G., Makarova, M., Rakitin, V., Shtabkin, Y., Skorokhod, A., Grechko, E., Fokeeva, E., Safronov, A., Ran, L., and Wang, T.: Long-term trends of carbon monoxide total columnar amount in urban areas and background regions: ground-and satellite-based spectroscopic measurements, Advances in Atmospheric Sciences, 35, 785–795, 2018. a

Wiedinmyer, C., Kimura, Y., McDonald-Buller, E. C., Emmons, L. K., Buchholz, R. R., Tang, W., Seto, K., Joseph, M. B., Barsanti, K. C., Carlton, A. G., and Yokelson, R.: The Fire Inventory from NCAR version 2.5: an updated global fire emissions model for climate and chemistry applications, Geosci. Model Dev., 16, 3873–3891, https://doi.org/10.5194/gmd-16-3873-2023, 2023. a

Wilks, D. S.: Statistical Method in the Atmospheric Sciences, Elsevier Inc., third edn., ISBN 978-0-12-385022-5, 2011. a

Worden, H. M., Bloom, A. A., Worden, J. R., Jiang, Z., Marais, E. A., Stavrakou, T., Gaubert, B., and Lacey, F.: New constraints on biogenic emissions using satellite-based estimates of carbon monoxide fluxes, Atmos. Chem. Phys., 19, 13569–13579, https://doi.org/10.5194/acp-19-13569-2019, 2019. a

Xie, H., Zafiriou, O. C., Umile, T. P., and Kieber, D. J.: Biological consumption of carbon monoxide in Delaware Bay, NW Atlantic and Beaufort Sea, Marine Ecology Progress Series, 290, 1–14, 2005. a

Yin, Z., Cao, B., and Wang, H.: Dominant patterns of summer ozone pollution in eastern China and associated atmospheric circulations, Atmos. Chem. Phys., 19, 13933–13943, https://doi.org/10.5194/acp-19-13933-2019, 2019. a

Young, P. J., Archibald, A. T., Bowman, K. W., Lamarque, J.-F., Naik, V., Stevenson, D. S., Tilmes, S., Voulgarakis, A., Wild, O., Bergmann, D., Cameron-Smith, P., Cionni, I., Collins, W. J., Dalsøren, S. B., Doherty, R. M., Eyring, V., Faluvegi, G., Horowitz, L. W., Josse, B., Lee, Y. H., MacKenzie, I. A., Nagashima, T., Plummer, D. A., Righi, M., Rumbold, S. T., Skeie, R. B., Shindell, D. T., Strode, S. A., Sudo, K., Szopa, S., and Zeng, G.: Pre-industrial to end 21st century projections of tropospheric ozone from the Atmospheric Chemistry and Climate Model Intercomparison Project (ACCMIP), Atmos. Chem. Phys., 13, 2063–2090, https://doi.org/10.5194/acp-13-2063-2013, 2013. a

Yurganov, L. N., McMillan, W. W., Dzhola, A. V., Grechko, E. I., Jones, N. B., and van der Werf, G. R.: Global AIRS and MOPITT CO measurements: Validation, comparison, and links to biomass burning variations and carbon cycle, Journal of Geophysical Research: Atmospheres, 113, https://doi.org/10.1029/2007JD009229, 2008. a

Zhang, Y., Xie, H., Fichot, C. G., and Chen, G.: Dark production of carbon monoxide (CO) from dissolved organic matter in the St. Lawrence estuarine system: Implication for the global coastal and blue water CO budgets, Journal of Geophysical Research: Oceans, 113, https://doi.org/10.1029/2008JC004811, 2008. a

Zheng, B., Chevallier, F., Yin, Y., Ciais, P., Fortems-Cheiney, A., Deeter, M. N., Parker, R. J., Wang, Y., Worden, H. M., and Zhao, Y.: Global atmospheric carbon monoxide budget 2000–2017 inferred from multi-species atmospheric inversions, Earth Syst. Sci. Data, 11, 1411–1436, https://doi.org/10.5194/essd-11-1411-2019, 2019. a, b

Zhi, G., Zhang, Y., Sun, J., Cheng, M., Dang, H., Liu, S., Yang, J., Zhang, Y., Xue, Z., Li, S., and Meng, F.: Village energy survey reveals missing rural raw coal in northern China: Significance in science and policy, Environmental Pollution, 223, 705–712, https://doi.org/10.1016/j.envpol.2017.02.009, 2017. a

Articles

Short summary

We use a statistical method called Empirical Orthogonal Function (EOF) analysis to analyze complex data, focusing on its strengths and limitations. This method is widely used in climate research, but its use in atmospheric chemistry is relatively new. While EOF analysis can be powerful, it may not be suitable for datasets that do not follow specific statistical assumptions. Our research provides recommendations to improve how we use this technique in analyzing atmospheric chemistry data.

Spatio-temporal pattern analysis of MOPITT total column CO using varimax rotation and singular spectrum analysis

The Role of Carbon Monoxide in Atmospheric Composition

2.1 Total Column CO from MOPITT

2.2 Sources and Sinks of CO

2.3 EOF Analysis

2.3.1 Separation of Modes Using the North Test

2.3.2 Limitations of EOF Analysis

2.3.3 Varimax Rotation

2.4 Seasonal Decomposition using Singular Spectrum Analysis

3.1 Spatio-temporal Variations of CO from MOPITT

3.2 Domain Independence and Separation of Modes for MOPITT CO

3.3 Seasonal Decomposition for MOPITT CO

3.4 Spectral Analysis and Time Scales

3.5 Time Dependence and Non-Stationarity of Modes

A1 Eigenvectors of Temporal and Spatial Covariance Matrices

A2 Equivalence of Determining EOFs and PCs Using Eigenvector Decomposition and SVD

A3 Estimate of N* in Terms of an AR1 Process

A4 Jointly Gaussian Distributions Are Independent

A3 Estimate of N^* in Terms of an AR1 Process