the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.

Assessing nitrogen dioxide monitoring techniques: a comparative analysis of Sentinel-5 Precursor satellite and ground measurements in Catalonia
Danielly Garcia Santos
Maria Eulàlia Parés
Effective monitoring of air pollution is essential for the development of environmental and public health policies. Comprehensive air quality management requires precise tools and strategies to assess the spatial and temporal distribution of pollutants. This study investigates the correlation between nitrogen dioxide (NO2) concentrations detected by the Sentinel-5 Precursor (S5p) satellite and those measured at ground stations by Catalonia's official air quality monitoring network during 2022 and 2023. The methodology integrates satellite and surface data aligned in space and time. The relationship between both measurements is analyzed under different frameworks: (i) global, considering the entire territory; (ii) by geographic zone (urban, suburban, and rural, as well as inside and outside the Barcelona Metropolitan Area (BMA)); (iii) according to the type of stations (traffic, background, industrial); and (iv) at a seasonal level, covering different quarters of the year. Statistical tools are then used to identify patterns and differences based on zones, typology, and seasonality characteristics. The results show a moderate positive correlation at the global level, with r=0.66. By zones, the analysis reveals that suburban (r=0.66) and non-BMA zones (r=0.67) present stronger correlations compared to urban zones (r=0.55), traffic typology (r=0.61), or stations located in the BMA zone (r=0.42). Seasonally, the correlation peaks in winter (r=0.70) and autumn (r=0.66), periods with more stable atmospheric conditions for NO2 concentrations, while it is lowest in spring (r=0.61) and summer (r=0.57). These findings highlight the utility of the S5p satellite as a complement to ground-based networks in NO2 monitoring, while revealing the limitations of applying a direct relationship between both types of data at the regional level and across different geographic zones.
- Article
(9640 KB) - Full-text XML
- BibTeX
- EndNote
Air pollution exposure imposes a significant impact on human health globally, contributing to a substantial burden of disease. It is estimated that millions of deaths and years of healthy life are lost annually due to air pollution, making it a major global health risk, comparable to unhealthy diets and tobacco smoking (World Health Organization, 2021).
To address this issue, effective air quality monitoring is crucial for public health, environmental management, and urban development policies (Balestrini et al., 2021; Verma et al., 2024; World Health Organization, 2016). Traditional monitoring systems, such as fixed ground stations, face challenges due to limited data availability and coverage. In response, there has been a growing interest in advanced remote sensing technologies that provide a broader and more continuous view of the dispersion of atmospheric pollutants globally (e.g., Mauro et al., 2023; Jimenez and Brovelli, 2023; Morillas et al., 2024, 2025; Petetin et al., 2023).
Among the numerous pollutants that affect air quality, nitrogen dioxide (NO2) stands out for its significant impacts on health and the environment, making it the focus of this study. It is primarily emitted from combustion processes such as vehicle engines, power plants, and industrial activities. In the atmosphere, NO2 plays a role in the formation of tropospheric ozone (O3) and particulate matter (PMX), both of which have detrimental effects on human health and the environment (Filonchyk and Peterson, 2024). Gaseous NO2 can also undergo chemical reactions to produce nitric acid and nitrate particles, contributing to acid rain and affecting ecosystems (Dvořák et al., 2010). The reactions of NO2 are influenced by factors such as sunlight intensity, temperature, humidity, and atmospheric stability. Higher temperatures and intense sunlight accelerate photochemical reactions that generate O3, especially in warm and sunny climates. Humidity can modify the concentration of O3 precursors and the efficiency of these reactions, while atmospheric stability can lead to pollutant accumulation, particularly under high-pressure and low-wind conditions (Douros et al., 2023; Han et al., 2011; Morillas et al., 2025; Voiculescu et al., 2020).
Remote sensing techniques enable the measurement of atmospheric NO2 by detecting how these molecules absorb sunlight in the atmosphere (ESA, 2022; Veefkind et al., 2012). Several satellite missions are currently in operation, capturing data on atmospheric gases and pollutants to provide information for air quality monitoring. This landscape is evolving rapidly; satellites with high temporal coverage, such as the Geostationary Environmental Monitoring Spectrometer (GEMS), operating in East Asia, and the Tropospheric Emissions: Monitoring Pollution (TEMPO), operating in North America, are already operational but do not provide data for Europe (Kim et al., 2020; Zoogman et al., 2017). Sentinel-4, launched in July 2025, is the first geostationary air quality monitoring mission in Europe and will provide hourly data on key air pollutants (Copernicus, 2025). In the meantime, the Sentinel-5 Precursor (S5p) satellite remains the primary satellite source for air quality monitoring in Europe. S5p is widely used for regional air quality monitoring, leveraging its advanced TROPOspheric Monitoring Instrument (TROPOMI), which provides daily global maps (3.5 km×5.5 km) of multiple species, including NO2 gases (Douros et al., 2023; Ialongo et al., 2020; Jimenez and Brovelli, 2023; Morillas et al., 2024, 2025; Pinardi et al., 2020; Omrani et al., 2020; Veefkind et al., 2012).
S5p data enhance the monitoring and understanding of air pollution on a regional scale, complementing traditional air quality monitoring systems, which often have limited coverage. Using TROPOMI, S5p measures the spectral signature of sunlight after it has passed through the atmosphere and reflected back into space. These observations are processed to produce vertical column densities (VCDs) of NO2, which represent the total amount of NO2 in a column of air extending from the Earth's surface to the top of the atmosphere (Choo et al., 2023). These measurements provide a comprehensive overview of the horizontal distribution of NO2 and are particularly useful in regions with scarce or non-existent ground monitoring. Nonetheless, satellite measurements may not always accurately reflect actual ground-level NO2 concentrations (Dimitropoulou et al., 2020). Despite the advanced capabilities of satellite remote sensing, a key research question remains: how strongly do NO2 measurements obtained by the S5p satellite correlate with ground-based monitoring data in a geographically diverse region?
In this context, Catalonia, located in the northeast of the Iberian Peninsula, constitutes an ideal case study. The region is characterized by diverse topography, land use, and climate variability within a compact area. These conditions make it an ideal region to assess the correlation between satellite-derived NO2 measurements and ground-based monitoring data. To address this comprehensively, the following research questions were explored: how closely do S5p satellite NO2 data correlate with surface measurements from the official measurement of the air pollution network in Catalonia, considering the geographic diversity; how does the correlation between satellite data and station data vary depending on the zone or station type; how does seasonality (winter, spring, summer, and autumn) influence the correlation between satellite NO2 measurements and surface observations in Catalonia; and what are the benefits and limitations of using S5p data to complement surface-measured NO2 information, and how could they be integrated into monitoring strategies and public policies in Catalonia?
Answering these questions is essential to understanding the reliability and limitations of satellite data when applied to regional air quality studies. Although several initiatives have examined the integration of S5p data with ground-based observations, most have focused on individual urban areas or on regions with environmental and geographical contexts that differ from broader and more diverse areas like Catalonia (e.g., Hu et al., 2024; Jimenez and Brovelli, 2023; Morillas et al., 2024, 2025; Petetin et al., 2023). An exception is the study conducted by Petetin et al. (2023), which included the Iberian Peninsula but did not specifically address the heterogeneity of Catalonia. To the authors' knowledge, no previous research has applied a similar methodology exclusively to the diverse landscape of Catalonia, and compared to existing studies, this research makes three key contributions: it focuses on geographic heterogeneity by analyzing Catalonia's diverse topography and land use within a compact area; it employs a layered approach, going beyond simply comparisons between Sentinel-5P data and surface measurements by analyzing correlations between zones and station typologies; and it examines seasonal variability, studying correlations based on the season of the year, both for the complete dataset and for data grouped by zones and station typologies.
From a scientific perspective, these contributions are crucial for refining air pollution mitigation strategies and public health policies. Integrating observations of the NO2 tropospheric column from the S5p satellite with point measurements from the ground-based air quality monitoring network aims to identify optimal correlation strategies between these datasets to improve the accuracy of air quality assessments.
This paper is structured as follows. Section 2 provides an overview of previous studies on the use of S5p satellite data for NO2 monitoring, with emphasis on data integration approaches and current methodological challenges. Section 3 offers a brief explanation of the methodology and data used in this work. Section 4 presents the obtained results, and Sect. 5 discusses the findings. Section 6 outlines the conclusions and future work for this research.
Recent investigations have explored the efficacy of integrating S5p satellite data with other data sources to accurately estimate ground-level NO2 concentrations. For instance, Mauro et al. (2023) used machine learning approaches, specifically categorical boosting, demonstrating the potential of advanced algorithms in translating satellite observations into meaningful ground-level data (Verma et al., 2024). However, their study does not address the impact of geographical variability or regions with limited ground-based monitoring data. Similarly, Hu et al. (2024) refined these models by examining various machine-learned models for surface NO2 concentration mapping, significantly improving model accuracy by incorporating domain knowledge. Despite these advancements, unresolved challenges remain, such as error propagation in satellite-derived inputs and the need for further algorithmic improvements to enhance prediction reliability.
The work presented in Morillas et al. (2024) investigated the correlation between S5p NO2 data and ground-based observations in the Community of Madrid, Spain. They found a high linear correlation coefficient between daily NO2 data and ground-based measurements, which increased when considering only the data within the satellite overpass time. Expanding on this research, Morillas et al. (2025) examined the relationship between NO2 data from air quality networks and S5p with O3 and meteorological variables across Spain's largest cities. Their findings underscore the complex interactions between various pollutants and environmental factors, further emphasizing the need for integrated approaches in urban air quality management.
In addition, Petetin et al. (2023) explored the capability of the TROPOMI instrument to analyze spatiotemporal variations of NO2 as a function of land use, highlighting anthropogenic factors that influence pollutant concentrations in observations from the Iberian Peninsula. Nevertheless, more information is required to better define the profile of the observations as a function of land use, especially in regions with geographical heterogeneity and varied human activities.
Moreover, Jimenez and Brovelli (2023) estimated NO2 concentrations at ground level in Milan using S5p satellite images and ERA5 meteorological data. Their model, which offered a cost-effective solution for air quality assessment in low- and middle-income countries, achieved a significantly lower root mean square error (RMSE) than the standard deviation of real measurements, indicating precise air quality monitoring from space. However, their study did not fully explore the applicability of such methods in regions with significant geographical diversity or under varying atmospheric conditions.
Collectively, these studies underscore the critical role of satellite data in enhancing the spatial resolution of air quality monitoring systems. While they reveal the potential of integrating satellite data with ground-based observations and advanced modeling techniques, they also highlight important limitations, particularly in addressing geographical heterogeneity and the accuracy of surface-level estimates. These gaps underscore the necessity for further research to refine methodologies and extend their applicability by incorporating geographically diverse areas, measurements across different zones, station typologies, and seasons, providing information to optimize air quality assessment models in complex regions.
This section provides detailed information on the data and methodology of this study. The data from the S5p satellite and the surface data collected by the official air quality monitoring network in Catalonia are presented. Additionally, this section provides an overview of the study area in Catalonia, highlighting the air quality zones (AQZs) and the characteristics of the station network. The section ends with a short description on the mathematical models implemented to examine the correlation between air quality data.
3.1 Study area
The study focuses on Catalonia, a region in northeast Spain with an area of approximately 32 108 km2. Its geographic diversity, illustrated in Fig. 1, which includes mountains over 3000 m high, plains, and coasts, plays a significant role in the region's air quality (Gencat, 2022a, b; Idescat, 2020).

Figure 1Map showing the geographical features of Catalonia and the distribution of air quality monitoring stations.
Catalonia presents a variety of landscapes and climates that influence the dispersion and concentration of atmospheric pollutants. Key geographic features include the Pyrenees, the Central Catalan System, and the Lleida Plain. Climatically, most of Catalonia experiences a Mediterranean climate, while the Pyrenees have a mountainous climate characterized by colder and wetter conditions (Gencat, 2022b). The Pyrenees influence air currents and can act as a barrier to pollutant dispersion. The Central Catalan System, separated by the coastal plain, presents a relief that affects air circulation and pollutant dispersion, creating local microclimates. Catalonia also has a significant urban center, Barcelona, whose surrounding areas are known as the Barcelona Metropolitan Area (BMA), a region of considerable demographic and economic impact. In the interior, Lleida's extensive agricultural activity contributes to thermal inversion conditions. Along the coast, sea breezes can help disperse pollutants in coastal areas, but they can also transport them inland.
Other factors such as hydrography and anthropogenic influences also play a significant role in air quality. Major rivers such as the Ebro, Ter, Llobregat, and Besós are key features of the region. These river valleys frequently experience thermal inversion phenomena during winter, trapping pollutants close to the ground and affecting nearby populations.
3.2 Ground air quality data source
The ground data for this study are obtained from the Xarxa de Vigilància i Previsió de la Contaminació Atmosfèrica (XVPCA) network of Catalonia, which provides near-real-time (NRTI) air quality data across the study area. The XVPCA network operates monitoring stations that measure various pollutants, including NO2, PM10, PM2.5, O3, carbon monoxide (CO), and sulfur dioxide (SO2). These stations are strategically distributed across different geographic zones and typologies, providing hourly data and comprehensive coverage of the region's air quality (Gencat, 2024a).
The XVPCA network follows the European standard EN 14211:2012 for the measurement of NO2 concentrations, which defines the chemiluminescence method for determining NO2 and NOx levels (Gencat, 2024a). This standardized approach ensures typical uncertainties of approximately ±10 % under standard operating conditions (CEN, 2012).
3.2.1 Air quality zones in Catalonia
Catalonia is divided into several air quality zones (AQZs), established to monitor and manage atmospheric pollution levels across the region. The aim of these zones is to be representative of the entire territory due to a combination of geographic, demographic, and economic activity criteria (Gencat, 2024b). The Generalitat de Catalunya regulates these zones to ensure compliance with air quality standards set by the European Union and national legislation. Figure 1 illustrates the geographical features of Catalonia and the distribution of air quality monitoring stations in urban, suburban, and rural zones. The detailed specifications of these stations – including their zones, typology, measured pollutants, technology, and equipment – are provided in Appendix A of this paper, offering further insights into the characteristics of the XVPCA network.
3.3 NO2 satellite data source
The S5p satellite is part of the European Union's Copernicus Earth observation program, and it is the first Copernicus mission specifically dedicated to atmospheric monitoring. The S5p data, freely accessible through the Copernicus platform, are essential for global air quality monitoring (Veefkind et al., 2012).
S5p provides daily and global data on various atmospheric pollutants, including O3, CO, NO2, SO2, methane (CH4), formaldehyde (HCHO), and aerosols (AOD) (Veefkind et al., 2012). It uses the TROPOMI instrument to collect these data, covering a width of 2600 km, allowing the entire planet to be mapped in a 24 h period (ESA, 2024; van Geffen et al., 2022). Of the different data provided by S5p on NO2, in this study, the band containing the information on the tropospheric column of NO2 has been specifically used. This field provides the total amount of NO2 present in the troposphere column, which is the most relevant atmospheric region for air quality monitoring, as it reflects the direct impact of NO2 at ground level (see Fig. 2).

Figure 2Tropospheric nitrogen dioxide column over Catalonia, measured by the Sentinel-5 Precursor satellite during the period 11–17 December 2023. Values are expressed in µmol m−2.
Datasets downloaded directly from the Copernicus platform (level 2 data) have a native spatial resolution of approximately 3.5 km×5.5 km (ESA, 2022). Based on mission requirements and validation reports, the random precision (1σ) of the tropospheric NO2 column is 0.7×1015 molec cm−2, while the systematic uncertainty (bias) ranges from 25 % to 50 %
(ESA, 2015). In our study, however, NO2 values were extracted from the Sentinel-5P level 3 dataset using the Google Earth Engine (GEE) platform. These level 3 data are provided on a regular 0.01°×0.01° grid (approximately 1.1 km×0.85 km over Spain), generated by applying the harpconvert
tool to the level 2 product after filtering out pixels with qa_value
<0.75 (Google Earth Engine, 2019). This process remaps the data spatially but does not recalculate or propagate the precision band, so the official level 2 uncertainties are retained.
It is important to note that these data can occasionally show negative values – an observation that has been documented in previous studies (Jimenez and Brovelli, 2023; Finch et al., 2022; Douros et al., 2023; Tonion and Pirotti, 2022). These negative values in tropospheric NO2 columns can arise due to uncertainties inherent to the data retrieval process, which involves separating the tropospheric component column from the total NO2 column by subtracting the estimated stratospheric contribution using differential optical absorption spectroscopy (DOAS) techniques (ESA, 2022; Veefkind et al., 2012). Since this separation process can be subject to errors, especially under very low NO2 concentrations, negative values are generally considered to be retrieval artifacts or instrumental noise rather than physically meaningful concentrations (Jimenez and Brovelli, 2023; Finch et al., 2022; Douros et al., 2023). In particular, these cases are more likely to appear in regions with minimal NO2 concentrations or complex atmospheric conditions, being within the typical instrument uncertainty range.
The products are available in NRTI and offline (OFFL) modes. NRTI data are typically made available within 2–3 h of acquisition by the satellite (van Geffen et al., 2022), making them ideal for applications requiring almost instantaneous information, such as air quality monitoring, early pollution alerts, or environmental emergencies. On the other hand, OFFL data, typically available 1–2 d after acquisition (Verhoelst et al., 2021), offer more comprehensive processing, with more accurate atmospheric and meteorological models, making them more suitable for detailed scientific analysis and long-term climate studies.
In addition to being available for direct download from the Sentinel-5P Pre-Operations Data Hub, the datasets are also available on Google Earth Engine (GEE), a platform that allows accessing and working with historical and NRTI atmospheric data (Salmani et al., 2023). Since analysis of large volumes of NetCDF data can be computationally expensive and slow, accessing and processing through GEE provides a more efficient alternative. In GEE, S5p data can be integrated with other datasets, such as from the XVPCA network, facilitating broader and faster analysis.
3.4 Methodology
The main objective of this study is to characterize the correlation between NO2 measurements obtained from the S5p satellite and those recorded by the XVPCA surface monitoring network in Catalonia. This analysis takes into account the geographical diversity of the study area, as well as the distinctive characteristics of the monitoring stations and seasonal variations. Previous studies (Morillas et al., 2025; Jimenez and Brovelli, 2023) have explored the integration of meteorological data in air quality analysis. However, incorporating such data at this stage could mask the direct relationship we aim to establish between satellite- and ground-based NO2 measurements. By focusing on this direct relationship, we aim to identify the most appropriate methods to estimate ground-level air quality using satellite data in Catalonia, leaving the integration of meteorological variables for future research.
The methodology is organized into several steps. First, data preprocessing is performed, including spatial alignment of satellite pixels with ground station locations and temporal synchronization of measurement times. Next, the analysis begins with a global approach and becomes more specific, focusing on zone-based, typology-based, and seasonal aspects. Finally, statistical methods, including correlation plots, Pearson correlation coefficients (r), regression analysis, and coefficient of variation (CV), are applied to evaluate the relationship between satellite- and ground-based data.
3.4.1 Data preprocessing
To achieve the objective, it is essential to accurately match satellite- and ground-based data. This matching process involves spatial and temporal alignment of data from the years 2022 and 2023.
-
Spatial matching. For each ground station in the XVPCA network, the corresponding satellite pixel values are assigned based on its geographic coordinates. This ensures that the data accurately reflect the same geographic conditions. In our study, NO2 values were extracted from the S5p level 3 dataset (0.01°×0.01°, approximately 1.1 km×0.85 km) using GEE. Due to this relatively high resolution and the spatial distribution of monitoring stations, we did not encounter cases of multiple stations falling within the same pixel.
However, in other study areas or when using S5p level 2 data, multiple ground monitoring stations could indeed be located within the same pixel. In such scenarios, the presence of multiple stations per pixel can be valuable for exploring within-pixel variability, revealing how surface NO2 concentrations can differ even within the same satellite measurement.
-
Temporal matching. The S5p satellite passes over the study area daily around 12:00 UTC (Verhoelst et al., 2021), with data sometimes recorded as late as 13:00 UTC. Ground-based hourly measurements were converted to UTC to ensure consistency with satellite overpass times and enable direct comparison.
3.4.2 Variable-based correlation analysis
The correlation between satellite- and ground-based data is analyzed globally across the region, as well as by geographical zones, typology, and season. This approach allows us to assess how different environments and seasonal factors affect the relationship between satellite- and ground-based measurements.
-
Global. An overall correlation analysis is performed on all paired data points, providing a comprehensive view of the relationship between satellite and ground NO2 levels. This study utilizes data from 65 XVPCA stations that provide NO2 measurements.
-
Zone-based. Stations within the study area are categorized into two groups. The first classification is based on zones – urban, suburban, and rural – according to the definitions provided by the XVPCA network (see Table 1). The second categorization divides stations into those within the BMA zone, which includes seven urban stations, and those outside the BMA zone (non-BMA), as shown in Table 1. This categorization enables us to analyze the influence of varying population densities, urban infrastructure, and environmental conditions on the correlation between satellite- and ground-based data.
-
Typology-based. To address the localized impact of pollution sources, such as vehicular traffic or industrial activity, which may affect the relationship between satellite and ground-level NO2 measurements, correlation analyses are performed for each station typology: traffic, background, and industrial (Table 1).
-
Seasonal. To investigate the influence of seasonal variations on the correlation, the data were grouped into four quarters of the year as an approximation for dividing the data by seasons (winter, spring, summer, autumn). This approach was necessary because the use of annual data did not allow for precise alignment with the exact start and end dates of each season. This analysis aims to identify whether seasonal factors, such as temperature and sunlight intensity, impact the correlation between satellite- and ground-based measurements.
3.4.3 Statistical tools for correlation analysis
To comprehensively assess the relationship between satellite- and ground-based NO2 measurements, it is essential to employ robust statistical tools. In this section, we describe the methods proposed to analyze the correlation between these two data sources, ensuring a complete understanding of the behavior of the data across geographic regions and timescales.
-
Correlation plots. These plots visually represent the relationship between satellite- and ground-based NO2 measurements, helping identify patterns, trends, and outliers. They provide a visual basis for assessing the linearity and differences across geographic and seasonal contexts (Boslaugh and Watters, 2008; Kutner et al., 2004).
-
Pearson correlation coefficient. The r quantifies the strength and direction of the linear relationship between satellite and ground measurements, ranging from −1 (strong negative correlation) to +1 (strong positive correlation). This metric complements regression analysis by assessing the overall relationship between both data sources (Boslaugh and Watters, 2008; Kutner et al., 2004).
-
Regression analysis. Linear regression models are used to quantify the relationship between satellite- and ground-based NO2 measurements. The slope (β1) indicates changes in ground-level NO2 concentrations per satellite unit, while the intercept (β0) reflects potential biases. The coefficient of determination (R2) measures how well the model explains the relationship between both measurements (Boslaugh and Watters, 2008; Kutner et al., 2004; Chatterjee and Hadi, 2012).
-
Coefficient of variation. The CV is defined as the ratio of the standard deviation to the mean () and provides a measure of the relative dispersion of the data. In this study, it has been applied to seasonal data to compare the variability of NO2 measurements over the course of the year (Boslaugh and Watters, 2008; Kutner et al., 2004).
This section presents the results of the correlation analysis between NO2 concentrations measured during 2022 and 2023 using data from the S5p satellite- and ground-based measurements from the XVPCA network in Catalonia. The analysis examines the overall correlation across the entire study area and evaluates specific correlations for datasets grouped by geographical zone, station typology, and seasonality.
The initial step of the analysis involved calculating the Pearson correlation coefficient (r) between the single daily satellite data and ground-based NO2 concentrations recorded at each hour. Figure 3 presents the resulting correlations for both years, highlighting how the strength of this relationship varies throughout the day. In this figure, the X axis corresponds to measurement times, while the Y axis represents the correlation coefficients, standardized to UTC.

Figure 3Hourly Pearson correlation coefficients between NO2 concentrations measured by the S5p satellite (single daily measurement) and the ground-based XVPCA network (hourly measurement) for 2022 and 2023. The data are color-coded by year, and the subpanels correspond to (a) global correlation; (b) zone-based correlation, BMA; (c) zone-based correlation, non-BMA; (d) zone-based correlation, urban; (e) zone-based correlation, suburban; (f) zone-based correlation, rural; (g) typology-based correlation, traffic; (h) typology-based correlation, background; and (i) typology-based correlation, industrial.
The results indicate that the highest correlation between the datasets predominantly occurs at 13:00 UTC, coinciding with the approximate overpass time of the S5p satellite. A secondary peak is observed at 19:00 UTC in most zones, except in rural zones, where no secondary peaks are detected. The only exception to this pattern is the BMA zone, where correlations at 13:00 and 19:00 UTC are similarly high. Although the analysis was performed for all time slots, for clarity and better understanding, this paper focuses on the results corresponding to 13:00 UTC.
4.1 Global
Before presenting the results we would like to note that in the presentation of the dispersion results and statistical calculations, the original data types have been retained: surface concentration (µg m−3) for ground-based measurements and column-integrated values (mol m−2) for satellite observations. This approach was chosen because the conversion from column-integrated values to surface concentration involves complex calculations that require additional assumptions about the vertical distribution of NO2 within the atmospheric column. Factors such as temperature, pressure, solar radiation, and local meteorological conditions significantly influence this conversion. Including these variables would introduce an additional layer of uncertainty into the analysis, potentially altering the direct relationship between the datasets.
Hereafter we present two scatter diagrams comparing the NO2 concentrations measured by the S5p satellite and by the XVPCA ground-based network (Fig. 4), along with a table containing the correlation coefficient r and regression parameters (Table 2). The global analysis of the data considers all the values measured in Catalonia during the years 2022 and 2023 without applying any filters or divisions to the dataset.

Figure 4Global scatter plots comparing NO2 concentrations measured by Sentinel-5P (mol m−2) and ground-based stations (µg m−3) from the XVPCA network in Catalonia for 2022 (a) and 2023 (b). Each panel includes the linear regression line with its corresponding equation, the number of data points (n), the Pearson correlation coefficient (r), and the coefficient of determination (R2). Calculations are based on daily satellite data matched with surface NO2 measurements recorded at 13:00 UTC.
Table 2Pearson correlation coefficient (r), slope (β1), intercept (β0), and coefficient of determination (R2) for NO2 levels in Catalonia during 2022–2023. Calculations are based on daily satellite data matched with surface NO2 measurements recorded at 13:00 UTC.

In this study, the negative values of satellite measurements were kept in the scatter diagrams and statistical calculations because the main objective is to understand the direct correlation between NO2 measurements from the S5p satellite and those recorded by the XVPCA terrestrial network in Catalonia. Keeping these values in the analyses provided us with valuable information on that correlation in urban zones.
In the global results (Fig. 4 and Table 2), the r was 0.65 in 2022 and 0.66 in 2023, indicating a moderate positive correlation between measurements made by the S5p satellite and surface stations. Table 2 shows that the β1 values remained close between the two years, with a value of in 2022 and in 2023, indicating that the sensitivity of satellite measurements to changes in terrestrial NO2 concentrations is constant.
However, using satellite measurements in their original unit (mol m−2) does not allow a direct interpretation for the analysis of the regression line. For this reason, we present a conversion to equivalent units (µg m−3) to facilitate result interpretation. This conversion does not take into account the atmosphere's complexity and assumes that NO2 is uniformly mixed throughout the tropospheric column. Consequently, the regression lines have been recalculated, this time using values measured by S5p converted to µg m−3. The results of these lines reveal, in general, a low β1 in the analyzed data. For the global dataset, the β1 value of 0.0163 indicates that, for each increase of 1 µg m−3 in the NO2 concentration measured at the surface, the concentration measured by the S5p satellite only increases by 0.0163 µg m−3. Additionally, the β0 of 11.5 µg m−3 suggests that even when surface measurements are zero, the satellite still predicts a positive concentration. The R2 obtained is 0.43 for both years, indicating that 43 % of the variance in satellite measurements is explained by surface measurements.
4.2 Zone-based
A correlation analysis was performed for two groups based on zones: one based on urban, suburban, and rural zones and the other distinguishing between BMA and non-BMA zones. This approach allowed us to assess variations in correlation patterns across different zone types.
4.2.1 Urban, suburban, and rural zones
The first analysis focuses on data grouped by urban, suburban, and rural zones, as shown in Fig. 5, with the statistical results presented in Table 3.

Figure 5Scatter plots comparing NO2 concentrations measured by Sentinel-5P (mol m−2) and ground-based stations (µg m−3) from the XVPCA network in Catalonia for 2022 and 2023, grouped by zone type: (a) urban 2022, (b) urban 2023, (c) suburban 2022, (d) suburban 2023, (e) rural 2022, and (f) rural 2023. Each panel includes the linear regression line with its corresponding equation, the number of data points (n), the Pearson correlation coefficient (r), and the coefficient of determination (R2).
Table 3Pearson correlation coefficient (r), slope (β1), intercept (β0), and coefficient of determination (R2) for NO2 levels in different zones (urban, suburban, and rural) during 2022–2023. Calculations are based on daily satellite data matched with surface NO2 measurements recorded at 13:00 UTC.

In urban zones, moderate correlation coefficients were observed in both 2022 and 2023 (Table 3), with r values of 0.54 and 0.55, respectively. Suburban zones exhibit the strongest correlations in this dataset, with values of 0.68 in 2022 and 0.66 in 2023. Meanwhile rural zones show moderate correlations, with r of 0.58 in 2022 and 0.55 in 2023.
In terms of the regression parameters (Table 3), β1 values are relatively consistent across zones and years, ranging from to , indicating a generally stable relationship between satellite- and ground-based measurements. However, β0 shows greater variability, with higher values in urban zones ( in 2022 and in 2023), suggesting a systematic overestimation of NO2 concentrations by the satellite in these zones when surface values are low. Conversely, suburban zones achieve the highest R2 values (0.46 in 2022 and 0.44 in 2023), reflecting better alignment between satellite and ground measurements in these less complex environments.
4.2.2 BMA and non-BMA zones
The second analysis focuses on data grouped by BMA and non-BMA zones, as shown in Fig. 6, with the corresponding statistical results presented in Table 4.

Figure 6Scatter plots comparing NO2 concentrations measured by Sentinel-5P (mol m−2) and ground-based stations (µg m−3) from the XVPCA network in Catalonia for 2022 and 2023, grouped by BMA and non-BMA zones: (a) BMA 2022, (b) BMA 2023, (c) non-BMA 2022, and (d) non-BMA 2023. Each panel includes the linear regression line with its corresponding equation, the number of data points (n), the Pearson correlation coefficient (r), and the coefficient of determination (R2).
Table 4Pearson correlation coefficient (r), slope (β1), intercept (β0), and coefficient of determination (R2) for NO2 levels in BMA and non-BMA zones during 2022–2023. Calculations are based on daily satellite data matched with surface NO2 measurements recorded at 13:00 UTC.

Despite the difference in the number of monitoring stations (see Table 1) between the urban zone (22 stations) and the BMA zone (7 stations), the r results for these two zones are quite similar. In the BMA zone (Table 4), the values were slightly lower (0.47 and 0.51) for the years 2022 and 2023, respectively. Furthermore, as observed in panels a and b of Fig. 6, no negative satellite values were detected at the locations of the BMA ground-based stations.
4.3 Typology-based
Hereafter, we present the correlation analysis based on different station typologies: traffic, background, and industrial stations. This analysis allows us to assess how NO2 concentrations align with satellite data, depending on the type of activity in the vicinity of the stations. The results for 2022 and 2023 are illustrated in Fig. 7, which displays scatter plots and regression lines for each typology. The corresponding statistical values are summarized in Table 5.

Figure 7Scatter plots comparing NO2 concentrations measured by Sentinel-5P (mol m−2) and ground-based stations (µg m−3) from the XVPCA network in Catalonia for 2022 and 2023, grouped by station typology: (a) traffic 2022, (b) traffic 2023, (c) background 2022, (d) background 2023, (e) industrial 2022, and (f) industrial 2023. Each panel includes the linear regression line with its corresponding equation, the number of data points (n), the Pearson correlation coefficient (r), and the coefficient of determination (R2).
Table 5Pearson correlation coefficient (r), slope (β1), intercept (β0), and coefficient of determination (R2) for NO2 levels based on station typology: traffic, background, and industrial during 2022–2023. Calculations are based on daily satellite data matched with surface NO2 measurements recorded at 13:00 UTC.

For traffic stations, the correlation values were moderate, at 0.63 in 2022 and 0.61 in 2023. Background stations showed slightly higher correlations, with values of 0.66 in 2022 and 0.68 in 2023. Similarly, industrial stations demonstrated correlations of 0.68 in 2022 and 0.66 in 2023.
4.4 Seasonal
Finally, we present in Table 6 a seasonal analysis that covers all recorded values from 2022 and 2023 and provides a detailed view of how seasonal factors – such as temperature and atmospheric conditions – affect both the r correlation and the coefficient of variation (CV) between satellite and surface measurements of NO2 throughout the year. Additionally, Tables 7 and 8 summarize the regression analysis results, including β1, β0, and R2 for NO2 measurements across different seasons and typologies, highlighting the seasonal variability observed in the data.
Table 6Pearson correlation coefficients (r) for NO2 levels across different seasonal periods in 2022 and 2023, categorized by global values, zones, and typologies. The coefficient of variation (CV) provides an additional measure of relative variability across the seasonal periods, calculated using data from both years. Calculations are based on daily satellite data matched with surface NO2 measurements recorded at 13:00 UTC.

The correlation coefficients show marked seasonal variability throughout the year. Overall, winter and autumn stand out with the highest correlations, reaching a value of 0.70 in both years. In contrast, during spring and summer, significant decreases are observed in all zones and typologies, with the global minimum being 0.55 in the summer of 2022 and 0.57 in 2023. This seasonal reduction is especially evident in urban and the BMA zones, where correlations dropped to 0.38 and 0.23, respectively, in the summer of 2022.
The CV supports these observations by providing a measure of the relative variability of the seasonal correlations. The urban and BMA zones showed the highest CVs, at 16.61 % and 31.03 %, respectively, indicating a high degree of fluctuation. By contrast, the non-BMA zone exhibited a much lower CV (7.46 %), reflecting more consistent correlations with reduced variability throughout the year.
Results by station typology indicate that background stations maintained the highest correlations throughout the year, peaking at 0.74 in winter 2022 and a CV of 10.02 %, suggesting stable seasonal performance. Industrial stations also demonstrated strong seasonal stability (CV 11.28 %), with correlations exceeding 0.65 during the colder months. In contrast, traffic stations exhibited the greatest seasonal variability, with correlations falling to 0.47 in summer 2022 but recovering to 0.68 in winter 2022. Nevertheless, their CV of 11.75 % highlights this intermediate level of variability.
The regression analysis, summarized in Tables 7 and 8, supports these findings. β1 and R2 were highest in winter and autumn, reflecting a stronger sensitivity of satellite measurements to ground-level NO2 during these seasons. Conversely, the summer months exhibited lower values for both. The β0 values are consistently positive, with higher values observed in urban zones during summer and autumn.
Table 7Regression analysis results showing slope (β1), intercept (β0), and coefficient of determination (R2) for NO2 levels measured during winter and spring in 2022 and 2023, divided by global values, zones, and typologies. Calculations are based on daily satellite data matched with surface NO2 measurements recorded at 13:00 UTC.

Table 8Regression analysis results showing slope (β1), intercept (β0), and coefficient of determination (R2) for NO2 levels measured during summer and autumn in 2022 and 2023, divided by global values, zones, and typologies. Calculations are based on daily satellite data matched with surface NO2 measurements recorded at 13:00 UTC.

This section discusses the results, focusing on the interpretation across the entire study area, as well as the differences by geographical zones, station typology, and seasonality. Additionally, the implications of these findings are contextualized within existing literature.
The consistently high correlation coefficients (r) at 13:00 UTC (Fig. 3) across most datasets and both years confirm that the strongest correlation between satellite and surface measurements tends to occur at the approximate overpass time of the S5p satellite (Verhoelst et al., 2021). However, the behavior in the BMA zone diverges from this trend, showing similarly high correlations at both 13:00 UTC and 19:00 UTC. This distinct pattern underscores the complexity of representing surface-level NO2 in densely urbanized zones, likely influenced by evening traffic emissions typical of high-density urban areas.
5.1 Global
The 2022 scatter plot (Fig. 4) shows a greater dispersion of the measurements when compared to the 2023 graph. This difference is partly attributed to improvements implemented in the processing and calibration of the S5p data, which have reduced the bias and increased the accuracy of the measurements (ESA, 2023). These refinements demonstrate the impact of continuous updates in satellite data processing on improving the reliability of satellite–ground correlations.
The global results indicate a moderate positive correlation between measurements made by the S5p satellite and surface stations (Table 2). This result verifies that there is a general correspondence between both datasets, although it is not strong enough to be considered a good agreement between the data. Similar observations have been reported in previous studies. For instance, Ialongo et al. (2020) found a moderate positive correlation coefficient (r=0.68) between TROPOMI NO2 total column and ground-based measurements in Helsinki.
In contrast, the results obtained by Morillas et al. (2024) in a study conducted in Madrid reported moderately higher correlation coefficients (r=0.78) compared to those presented in Table 2 of this study. A notable difference between the two studies is that, while the analysis conducted here observed higher correlations in rural zones than in urban zones, the study in Madrid identified the opposite trend. Finally, in the same study, the authors agreed that the geographic distribution of monitoring stations can significantly influence the correlation coefficients obtained. These differences in results can be directly attributed to the size of the study areas. While their research focused exclusively on the city of Madrid (8028 km2), the present analysis covered the entire region of Catalonia (32 108 km2). This larger area introduces variations in geography and area characteristics that influence the results.
Regression analysis, using S5p values converted to µg m−3, reveals a β1 of 0.0163 and a β0 of 11.5 µg m−3. The low β1 suggests that variations in surface measurements are not proportionally captured by the satellite. The positive β0 implies a bias in the data; even when surface measurements are 0, the satellite predicts a positive value. As with r, where the results indicate a general correspondence, the R2 (Table 2) obtained indicates that 43 % of the variance in satellite measurements is explained by surface measurements, leaving 57 % of the variance attributable to other factors not captured by ground-based monitoring. These values are essential for studies that seek to estimate surface concentrations from satellite data. These findings are important for understanding how the correlation between satellite and surface data evolves in response to changes in measured air pollutant levels.
5.2 Zone-based
This section analyzes how the correlation between satellite measurements and surface data varies depending on the two groups defined by zones (urban, suburban, and rural) and (BMA and non-BMA). The discussion focuses on interpreting the patterns observed in the correlation coefficients and in the regression adjustments.
5.2.1 Urban, suburban, and rural zones
The results in Table 3 and Fig. 5 show that the highest correlations are observed at stations classified as suburban zone (r=0.68 in 2022, panel c, and r=0.66 in 2023, panel d), followed by rural and finally urban zones. These correlation results are consistent with the R2 values, which remain moderate – 0.43 for the global results and 0.46 and 0.44 for suburban zones in 2022 and 2023, respectively. This pattern can be attributed primarily to the intensity and distribution of emissions in each zone type. In suburban zones, the moderate NO2 concentrations reflect a mix of emission sources that are more dispersed than those in strictly urban environments. These results align with previous findings highlighting the influence of emissions in less dense environments on satellite–ground consistency (Wang et al., 2022; Munir et al., 2021).
Taken together, these findings underscore the importance of emission source density, source types, and spatial representativeness of measurements when interpreting relationships between NO2 values derived from satellite data and ground stations.
The scatter plots reveal distinctive patterns across the three zone types. Rural stations present a compact cloud of points concentrated at low values, indicating that most measurements fall within the low-concentration range. In contrast, urban stations show very few points near zero, reflecting consistently high NO2 levels characteristic of dense urban environments. Suburban stations, situated between these extremes, display a more balanced distribution across mid-range values, contributing to the strongest correlations with satellite observations. These differences highlight how emission intensity and distribution shape the agreement between satellite-derived and ground-based NO2 measurements.
5.2.2 BMA and non-BMA zones
The second analysis focuses on data grouped by BMA and non-BMA zones (Fig. 6 and Table 4). The BMA scatter plots (panels a and b of Fig. 6) reveal a markedly different data dispersion compared to all other sets. Unlike the other graphs, the BMA plots do not display points clustered around zero in the ground-based NO2 measurements (x axis) and the satellite-derived NO2 columns (y axis). Despite the low correlation values observed, these scatter plots effectively demonstrate the ability of satellite data to capture spatial differences in metropolitan zones.
The low correlation values are consistent with findings from previous studies (Munir et al., 2021; Jamali et al., 2020), which suggest that higher anthropogenic activity, topographic complexity, and urban density – typical of the BMA zone – tend to reduce the agreement between surface NO2 measurements and satellite data. One of the main challenges in air pollution data in metropolitan areas is traffic variability: within the spatial extent of a single S5p pixel, multiple types of neighborhoods can coexist (from traffic-calmed areas to high-density zones), resulting in significant spatial heterogeneity in NO2 concentrations. In contrast, the absence of negative satellite values in the zone contrasts with the findings of Douros et al. (2023) and Tonion and Pirotti (2022), who reported negative satellite concentrations in regions with complex environmental conditions. This discrepancy may be attributed to regional factors unique to Catalonia.
Conversely, non-BMA zones exhibited patterns similar to suburban zones, a finding also highlighted by other authors when comparing the effectiveness of satellite data in different geographic environments (Wang et al., 2022; Munir et al., 2021). The regression coefficient β1 (Table 4) illustrates key differences between these zones, indicating that changes in NO2 concentrations are more effectively captured by satellites in these less complex environments. In contrast, the lower R2 values in urban and BMA zones reflect the greater complexity of these environments, suggesting a less direct relationship with satellite data.
5.3 Typology-based
The typology-based analysis allows for the assessment of how NO2 concentrations align with satellite data depending on the type of activity in the vicinity of the stations. (Fig. 7 and Table 5).
The traffic data, shown in panels (a) and (b) of Fig. 7, do not exhibit negative satellite values at the monitoring station locations – similar to what is observed in the urban and BMA datasets (Figs. 5a and b and 6a and b). This result highlights the importance of retaining negative data measured by the S5p satellite in the analyses. Although traffic stations display the lowest correlations among typologies, they are still higher than those observed in urban and BMA zones.
In contrast, industrial and background stations show slightly stronger correlations compared to traffic stations, which is consistent with the R2 results (Table 5). These findings underline the utility of separating the analysis by station typology to better understand the factors influencing satellite–ground measurement alignment.
5.4 Seasonal
The seasonal analysis, summarized in Table 6, reveals significant variations in the r throughout the four quarters of the year. In general, the highest correlations are observed during the coldest quarters, winter and autumn, with the maximum values reached in the first quarter of both years. This similarity in the results in both years is consistent across most zones and typologies and also indicates the higher correlation between satellite and surface measurements of NO2 during the coldest months of the year. During the second quarter, corresponding to spring, a slight decrease in correlation is observed across all zones and typologies, although it is not significantly pronounced. Similar trends are observed for both zone-based and typology-based stations. In the third quarter, which includes the summer months, correlation coefficients reach their lowest values.
The CV provides additional insight into the stability and variability of these seasonal correlations. Urban and BMA zones exhibit the highest CV values (16.61 % and 31.03 %, respectively), indicating a moderate degree of fluctuation due to environmental and anthropogenic factors, such as increased photochemical reactions. In contrast, non-BMA zones show CVs of 7.46 %, highlighting more stable values across seasons.
Research shows that increased solar radiation during the summer months intensifies the NO2 photolysis process (Yang et al., 2004), reducing its concentration near the ground. This seasonal effect is particularly pronounced during summer due to higher solar intensity and longer daylight hours, which enhance photochemical reactions and accelerate the conversion of NO2 into other compounds. Consequently, the lower correlations observed during this period can be attributed to this intensified photolysis and the resultant variability in NO2 concentrations (Douros et al., 2023; Voiculescu et al., 2020; Han et al., 2011). These seasonal dynamics help to explain the variations in correlations between satellite data and surface measurements across the different quarters analyzed.
As in the zone-based analyses for the entire year, suburban and non-BMA zones have higher correlations than urban and BMA zones, indicating a greater impact of urban-specific factors. Typology-based analysis reveals that background and industrial stations maintain relatively high correlations throughout the year, with CVs of 10.02 % and 11.28 %, respectively. Industrial areas reached their highest correlation in the first quarter of 2023, while traffic stations exhibited slightly greater seasonal variability (CV 11.75 %), with the lowest correlation occurring in the summer months, particularly in 2022, when it dropped to 0.47 (Table 6).
The R2 results also vary seasonally (Tables 7 and 8), similar to the r coefficient, with the highest percentages of explained variance recorded in the first and fourth quarters. This trend highlights the stronger relationship between satellite- and ground-based NO2 measurements during the winter and autumn months. The summer months exhibited lower values for β1 and R2, suggesting greater unexplained variance and reduced comparability between datasets. The higher β0 observed in urban zones and during summer and autumn indicates a systematic overestimation of NO2 concentrations by the satellite.
The consistently positive β0 values across all areas and seasons indicate that satellite measurements capture a broader regional signal, integrating contributions from atmospheric layers above ground level. Meanwhile, variability in CV, β1, and R2 values highlights the influence of local environmental conditions, seasonal changes, and the spatial distribution of NO2 sources on satellite–ground comparability.
5.5 Overall discussion
The strongest correlations are observed at 13:00 UTC – coincident with the S5p overpass – highlighting the need for temporal alignment in any satellite-based estimation model. Correlation strength varies with geographic location and station typology: urban and BMA zones display lower r values and larger seasonal variability than suburban and non-BMA zones, reflecting their greater emission density and environmental complexity.
The diversity of conditions within a satellite pixel can influence the observed correlations. A single TROPOMI pixel may cover diverse land uses, such as urban, suburban, and rural zones, or encompass areas with both point and diffuse emissions. This heterogeneity can reduce the correlation between satellite-derived NO2 values and ground measurements, particularly in complex zones such as urban and BMA zones. In contrast, more homogeneous regions, such as suburban and non-BMA zones, tend to have higher correlations also related to less variability within the pixel. In terms of typological stations, background and industrial stations record higher correlations compared to traffic stations, where local emissions and geographic heterogeneity reduce the representativeness of satellite data.
During winter and autumn, stable atmospheric conditions prevail, with colder air trapped in the lower boundary layers, limiting vertical mixing. This atmospheric stability reduces photochemical reactions, allowing NO2 to accumulate near the surface and contributing more significantly to the observed tropospheric column (Li et al., 2019). As a result, correlations between surface and satellite data are stronger, particularly in background stations and in suburban and non-BMA zones, where lower CVs indicate more consistent seasonal alignment. In contrast, during summer months, atmospheric instability, characterized by anticyclones and higher temperatures, enhances NO2 vertical mixing. This process dilutes NO2 throughout the boundary layer, increases photolysis rates, and reduces the representativeness of ground-level measurements in satellite observations. This seasonal instability introduces greater variability, as reflected in higher CVs for urban and traffic stations.
The results obtained allow us to address the research questions posed at the beginning of this study. Regarding how closely S5p satellite NO2 data correlate with surface measurements from the official measurement of air pollution network in Catalonia, the study found moderate correlations on a global scale (r≈0.65) and highlighted significant geographic variability. Suburban and non-BMA zones showed the strongest correlations, suggesting that satellite data more accurately capture NO2 concentrations in less dense areas. Urban and BMA zones showed lower correlations due to their complexity and the influence of localized emissions. Moreover, the analysis of the correlation between satellite data and station data depending on zone or station type revealed that background and industrial stations align better with satellite data (r≈0.66–0.68) compared to traffic stations (r≈0.61–0.63). This highlights the impact of local emissions and environmental complexity in reducing correlation at traffic monitoring sites.
Regarding the seasonal variation, the analysis showed that winter and autumn reach the highest correlations (up to r=0.70), while summer and spring present lower values. The reduced correlation during summer is attributed to stronger photolysis processes and atmospheric variability, which decrease the representativeness of satellite observations. These findings emphasize the seasonal dependence of satellite–ground agreement and suggest a specific focus on winter data for modeling.
Finally, regarding the benefits and limitations of using S5p data to complement surface-measured NO2 information and how they could be integrated into monitoring strategies and public policies in Catalonia, the results highlight that S5p data provide valuable spatial coverage, especially in suburban zones and outside the BMA zone. These data allow for the identification of pollution trends and regions for prioritizing interventions, as well as the better allocation of resources for installing new ground-based monitoring stations.
This study has explored the correlation between S5p satellite data and ground-based NO2 measurements in Catalonia, addressing geographical diversity, station typologies and seasonal variability. The findings confirm that satellite measurements align moderately with ground-based observations and provide insights for areas with limited ground-based station coverage.
The degree of alignment depends on factors such as geographical context, station typology, and seasonal variability. Suburban and less-built-up zones show better correlations and lower variability, while denser urban zones and those with concentrated emissions present lower correlations. Station typology also influences the relationship, with background and industrial stations showing stronger alignment and more stable seasonal performance.
The impact of seasonality on correlations, both for global data and for data segmented by zone and type, is very significant and demonstrates the viability of generating a model for estimating surface NO2 in Catalonia. The CV values show that urban and traffic stations, as well as the summer months, experience greater seasonal variability, which can introduce additional variance into estimation models. On the one hand, it must be taken into account that including data from months with lower correlation can introduce variance that does not represent the behavior of the pollutant. On the other hand, it reinforces the idea that it is preferable to optimize computational processing by keeping only the data from the winter months, which have the highest correlations and the lowest seasonal variability, as indicated by lower CVs. It is preferable to use data from different years rather than data from the whole year.
The results align with studies that claim that it is possible to estimate surface pollution levels from satellite data. However, this study has shown that in the region of Catalonia, a direct relationship between satellite and surface data is not fully achievable due to geographic and temporal variability. This underscores the significance of this study and the results obtained for further developing an effective surface estimation methodology for a geographically diverse area such as Catalonia.
Despite its limitations, S5p data provide significant opportunities to complement traditional monitoring networks, particularly in remote regions or areas with limited ground station coverage. The coefficient of variation offers a useful tool for assessing the stability of seasonal correlations and identifying areas where satellite data may be more reliable. Integrating satellite and surface data requires careful consideration of geographic and temporal factors to enhance reliability and utility. These findings reinforce the importance of combining multiple data sources to improve the accuracy of air quality assessments and support the development of effective public policies for pollution mitigation.
For future research, if the aim is to develop a model to estimate surface NO2 from satellite data, it is advisable to consider separating the data by season or focusing on data from the winter months, which have the most stable correlations. In addition, it is necessary to integrate atmospheric variables such as temperature and solar radiation. Although the influence of these variables is lower in winter, it is still necessary to take into account the variance of approximately 50 % of the surface data that are not directly influenced by satellite data. Although high accuracy in estimating surface NO2 from satellite data may not be fully achievable, this approach opens the way to a useful tool for visualizing pollution levels and raising public awareness of air quality in Catalonia.
A1 Zone of stations
Classification of measurement points based on their geographical location.
-
Urban: measurement points located in urban areas characterized by continuous construction and urban infrastructure, including public services.
-
Suburban: measurement points located on the outskirts of a city, adjacent to or very close to a highly urbanized and populated area.
-
Rural: measurement points located in areas that are neither urban nor suburban.
A2 Typology of stations
Classification of measurement points based on the type of emission sources they monitor.
-
Traffic: measurement points located in areas directly impacted by traffic emissions.
-
Industrial: measurement points located in areas directly impacted by industrial emissions.
-
Background: measurement points located in areas not directly impacted by traffic or industrial emissions. The air at these points has mixed and comes from various sources.
A3 Measured pollutants
The primary atmospheric pollutants measured by the Xarxa de Vigilància i Previsió de la Contaminació Atmosfèrica (XVPCA) network.
-
Nitrogen dioxide (NO2): primarily emitted by vehicular traffic and some industrial activities.
-
Suspended particles (PM10 and PM2.5): originating from various sources such as traffic, industry, and agricultural activities.
-
Ozone (O3): a secondary pollutant formed from photochemical reactions between precursors like nitrogen oxides and volatile organic compounds (VOCs).
-
Carbon monoxide (CO): emissions from incomplete combustion in vehicles and some industries.
-
Sulfur dioxide (SO2): mainly from burning fossil fuels in power plants and some industries.
A4 Technology and equipment
Technological and operational aspects of the XVPCA network.
-
Automatic sensors: sensors use advanced technologies for continuous pollutant measurement. Data are collected in real-time and sent to a control center for analysis.
-
Manual sensors: samples are collected on-site, analyzed in a laboratory, validated, and then included in the atmospheric pollution database.
-
Calibration and maintenance: XVPCA stations undergo rigorous calibration and maintenance processes to ensure data accuracy and reliability.
-
Near-real-time (NRTI) data: measurement results are published a few hours later on the Generalitat de Catalunya's website, allowing citizens and authorities to access up-to-date air quality information.
The Sentinel-5 Precursor (S5p) NO2 data used in this study are publicly available from the Copernicus Browser (https://browser.dataspace.copernicus.eu/, last access: 8 October 2025). The ground-based air quality monitoring data for Catalonia were obtained from the Portal de Dades Obertes Catalunya and are available at https://mediambient.gencat.cat/ca/05_ambits_dactuacio/atmosfera/qualitat_de_laire/vols-saber-que-respires/descarrega-de-dades/descarrega-dades-automatiques/ (last access: 8 October 2025).
DGS defined the use case and the tests to be done, performed the calculus, compiled the results and analyzed them, and wrote the manuscript. MEP defined the use case and the tests to be done, analyzed the results, and wrote the manuscript.
The contact author has declared that neither of the authors has any competing interests.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
The publications and other results were carried out with the support of the predoctoral program AGAUR-FI grant (2023 FI-1 00297) Joan Oró, funded by the Secretariat of Universities and Research of the Department of Research and Universities of the Generalitat de Catalunya and the European Social Fund Plus. This work has also been partially supported by AGAUR, Generalitat de Catalunya, through the Consolidated Research Group RSE, “Geomatics” (Ref. 2021-SGR-00536). This work has been carried out within the framework of the geography doctoral program of the Universitat Autònoma de Barcelona. The authors would like to thank Eduard Angelats for his time and all the fruitful discussions. The authors acknowledge the support of artificial intelligence tools for linguistic revision.
This research has been supported by the Agència de Gestió d'Ajuts Universitaris i de Recerca (grant no. 2023 FI-1 00297).
This paper was edited by Rainer Volkamer and reviewed by two anonymous referees.
Balestrini, M., Kotsev, A., Ponti, M., and Schade, S.: Collaboration matters: capacity building, up-scaling, spreading, and sustainability in citizen-generated data projects, Humanit. Soc. Sci. Commun., 8, 169, https://doi.org/10.1057/s41599-021-00851-5, 2021. a
Boslaugh, S., and Watters, P. A.: Statistics in a Nutshell: A Desktop Quick Reference, O'Reilly Media, ISBN 978-0-596-51049-7, 2008. a, b, c, d
Chatterjee, S., and Hadi, A. S.: Regression Analysis by Example, 5th edn., Wiley, ISBN 978-0-470-90584-5, 2012. a
Choo, G.-H., Lee, K., Hong, H., Jeong, U., Choi, W., and Janz, S. J.: Highly resolved mapping of NO2 vertical column densities from GeoTASO measurements over a megacity and industrial area during the KORUS-AQ campaign, Atmos. Meas. Tech., 16, 625–644, https://doi.org/10.5194/amt-16-625-2023, 2023. a
Copernicus: OBSERVER: Sentinel-4 – A new era in European air quality monitoring, Copernicus, https://www.copernicus.eu/en/news/news/observer-sentinel-4-new-era-european-air-quality-monitoring, last access: 2 April 2025. a
Dimitropoulou, E., Hendrick, F., Pinardi, G., Friedrich, M. M., Merlaud, A., Tack, F., De Longueville, H., Fayt, C., Hermans, C., Laffineur, Q., Fierens, F., and Van Roozendael, M.: Validation of TROPOMI tropospheric NO2 columns using dual-scan multi-axis differential optical absorption spectroscopy (MAX-DOAS) measurements in Uccle, Brussels, Atmos. Meas. Tech., 13, 5165–5191, https://doi.org/10.5194/amt-13-5165-2020, 2020. a
Douros, J., Eskes, H., van Geffen, J., Boersma, K. F., Compernolle, S., Pinardi, G., Blechschmidt, A.-M., Peuch, V.-H., Colette, A., and Veefkind, P.: Comparing Sentinel-5P TROPOMI NO2 column observations with the CAMS regional air quality ensemble, Geosci. Model Dev., 16, 509–534, https://doi.org/10.5194/gmd-16-509-2023, 2023. a, b, c, d, e, f
Dvořák, R., Chlápek, P., Jecha, D., Puchýř, R., and Stehlík, P.: New approach to common removal of dioxins and NOx as a contribution to environmental protection, J. Clean. Prod., 18, 881–888, https://doi.org/10.1016/j.jclepro.2010.01.024, 2010. a
European Committee for Standardization (CEN): EN 14211:2012 Ambient air – Standard method for the measurement of the concentration of nitrogen dioxide and nitric oxide by chemiluminescence, European Committee for Standardization, Brussels, https://standards.cencenelec.eu/dyn/www/f?p=CEN:110:0::::FSP_PROJECT,FSP_ORG_ID:67486,6245&cs=14FE9683506ED10DD59EF1FF589C897A4 (last access: 8 October 2025), 2012. a
European Space Agency: European Space Agency S5P Level-1b and Level-2 Product Validation Requirements Document, https://sentinels.copernicus.eu/documents/247904/2506504/S5P-Level-1b-L2-numbered-validation-requirements.pdf (last access: 8 October 2025), 2015. a
European Space Agency: Sentinel-5P Level 2 Product User Manual: Nitrogen Dioxide, https://sentinels.copernicus.eu/documents/247904/2474726/Sentinel-5P-Level-2-Product-User-Manual-Nitrogen-Dioxide.pdf (last access: 8 October 2025), 2022. a, b, c
European Space Agency: Sentinel-5P Nitrogen Dioxide Level 2 Product Readme File, https://sentinels.copernicus.eu/documents/247904/3541451/Sentinel-5P-Nitrogen-Dioxide-Level-2-Product-Readme-File (last access: 19 October 2024), 2023. a
European Space Agency: Sentinel-5P Mission Overview, Sentiwiki Copernicus, https://sentiwiki.copernicus.eu/web/s5p-mission, last access: 12 October 2024. a
Filonchyk, M., and Peterson, M. P.: Analysis of air pollution from vehicle emissions for the contiguous United States, J. Geovis. Spat. Anal., 8, 16, https://doi.org/10.1007/s41651-024-00180-6, 2024. a
Finch, D. P., Palmer, P. I., and Zhang, T.: Automated detection of atmospheric NO2 plumes from satellite data: a tool to help infer anthropogenic combustion emissions, Atmos. Meas. Tech., 15, 721–733, https://doi.org/10.5194/amt-15-721-2022, 2022. a, b
Gencat: Politics and economy, https://web.gencat.cat/en/temes/catalunya/coneixer/politica-economia/index.html (last access: 16 July 2024), 2022a. a
Gencat: Territory and people, https://web.gencat.cat/en/temes/catalunya/coneixer/territori-poblacio/index.html (last access: 16 July 2024), 2022b. a, b
Gencat: Xarxa de Vigilància i Previsió de la Contaminació Atmosfèrica (XVPCA) – Conceptes clau, https://mediambient.gencat.cat/ca/05_ambits_dactuacio/atmosfera/qualitat_de_laire/avaluacio/xarxa_de_vigilancia_i_previsio_de_la_contaminacio_atmosferica_xvpca/conceptes_clau/ (last access: 24 July 2024), 2024a. a, b
Gencat: Zones de qualitat de l'aire (ZQA), 2024, https://mediambient.gencat.cat/es/detalls/Articles/Zones-de-qualitat-de-laire_ZQA (last access: 24 July 2024), 2024b. a
Google Earth Engine: COPERNICUS/S5P OFFL L3 NO2: Tropospheric NO2 (Offline, Level 3), Google Earth Engine, https://developers.google.com/earth-engine/datasets/catalog/COPERNICUS_S5P_OFFL_L3_NO2 (last access: 6 May 2025), 2019. a
Han, S., Bian, H., Feng, Y., Liu, A., Li, X., Zeng, F., and Zhang, X.: Analysis of the relationship between O3, NO, and NO2 in Tianjin, China, Aerosol Air Qual. Res., 11, 128–139, https://doi.org/10.4209/aaqr.2010.07.0055, 2011. a, b
Hu, M., Bai, K., Li, K., Zheng, Z., Sun, Y., Shao, L., Li, R., and Liu, C.: Improving machine-learned surface NO2 concentration mapping models with domain knowledge from data science perspective, Atmos. Environ., 322, 120372, https://doi.org/10.1016/j.atmosenv.2024.120372, 2024. a, b
Ialongo, I., Virta, H., Eskes, H., Hovila, J., and Douros, J.: Comparison of TROPOMI/Sentinel-5 Precursor NO2 observations with ground-based measurements in Helsinki, Atmos. Meas. Tech., 13, 205–218, https://doi.org/10.5194/amt-13-205-2020, 2020. a, b
Idescat: Geographical and political boundaries. Counties and Aran, and provinces, 2020, https://www.idescat.cat/indicadors/?id=aec&n=15179&lang=en (last access: 24 July 2024), 2020. a
Jamali, S., Klingmyr, D., and Tagesson, T.: Global-scale patterns and trends in tropospheric NO2 concentrations, 2005–2018, Remote Sens.-Basel, 12, 3526, https://doi.org/10.3390/rs12213526, 2020. a
Jimenez, J., and Brovelli, M. A.: NO2 concentration estimation at urban ground level by integrating Sentinel 5P data and ERA5 using machine learning: the Milan (Italy) case study, Remote Sens.-Basel, 15, 5400, https://doi.org/10.3390/rs15225400, 2023. a, b, c, d, e, f, g
Kim, J., Jeong, U., Ahn, M., Kim, J. H., Park, R. J., Lee, H., Song, C. H., Choi, Y., Lee, K., Yoo, J., Jeong, M., Park, S. K., Lee, K., Song, C., Kim, S., Kim, Y. J., Kim, S., Kim, M., Go, S., Liu, X., Chance, K., Chan Miller, C., Al-Saadi, J., Veihelmann, B., Bhartia, P. K., Torres, O., Abad, G. G., Haffner, D. P., Ko, D. H., Lee, S. H., Woo, J., Chong, H., Park, S. S., Nicks, D., Choi, W. J., Moon, K., Cho, A., Yoon, J., Kim, S., Hong, H., Lee, K., Lee, H., Lee, S., Choi, M., Veefkind, P., Levelt, P. F., Edwards, D. P., Kang, M., Eo, M., Bak, J., Baek, K., Kwon, H., Yang, J., Park, J., Han, K. M., Kim, B., Shin, H., Choi, H., Lee, E., Chong, J., Cha, Y., Koo, J., Irie, H., Hayashida, S., Kasai, Y., Kanaya, Y., Liu, C., Lin, J., Crawford, J. H., Carmichael, G. R., Newchurch, M. J., Lefer, B. L., Herman, J. R., Swap, R. J., Lau, A. K. H., Kurosu, T. P., Jaross, G., Ahlers, B., Dobber, M., McElroy, C. T., and Choi, Y.: New era of air quality monitoring from space: Geostationary Environment Monitoring Spectrometer (GEMS), B. Am. Meteorol. Soc., 101, E1–E22, https://doi.org/10.1175/BAMS-D-18-0013.1, 2020. a
Kutner, M. H., Nachtsheim, C. J., Neter, J., and Li, W.: Applied Linear Statistical Models, 5th edn., McGraw-Hill Education, https://users.stat.ufl.edu/~winner/sta4211/ALSM_5Ed_Kutner.pdf (last access: 8 April 2024), 2004. a, b, c, d
Li, X., Hu, X.-M., Ma, Y., Wang, Y., Li, L., and Zhao, Z.: Impact of planetary boundary layer structure on the formation and evolution of air-pollution episodes in Shenyang, Northeast China, Atmos. Environ., 214, 116850, https://doi.org/10.1016/j.atmosenv.2019.116850, 2019. a
Mauro, F., Russo, L., Janku, F., Sebastianelli, A., and Ullo, S. L.: Estimation of ground NO2 measurements from Sentinel-5P tropospheric data through categorical boosting, in: 2023 IEEE International Conference on Metrology for eXtended Reality, Artificial Intelligence and Neural Engineering (MetroXRAINE), Milan, Italy, 1116–1121, https://doi.org/10.1109/MetroXRAINE58569.2023.10405670, 2023. a, b
Morillas, C., Álvarez, S., Serio, C., Masiello, G., and Martínez, S.: TROPOMI NO2 Sentinel-5P data in the Community of Madrid: a detailed consistency analysis with in situ surface observations, Remote Sens. Appl. Soc. Environ., 33, 101083, https://doi.org/10.1016/j.rsase.2023.101083, 2024. a, b, c, d, e
Morillas, C., Álvarez, S., Pires, J. C. M., García, A. J., and Martínez, S.: Linking satellite and ground observations of NO2 in Spanish cities: influence of meteorology and O3, Nitrogen, 6, 32, https://doi.org/10.3390/nitrogen6020032, 2025. a, b, c, d, e, f
Munir, S., Mayfield, M., and Coca, D.: Understanding spatial variability of NO2 in urban areas using spatial modelling and data fusion approaches, Atmosphere-Basel, 12, 179, https://doi.org/10.3390/atmos12020179, 2021. a, b, c
Omrani, H., Omrani, B., Parmentier, B., and Helbich, M.: Spatio-temporal data on the air pollutant nitrogen dioxide derived from Sentinel satellite for France, Data Brief, 28, 105089, https://doi.org/10.1016/j.dib.2019.105089, 2020. a
Petetin, H., Guevara, M., Compernolle, S., Bowdalo, D., Bretonnière, P.-A., Enciso, S., Jorba, O., Lopez, F., Soret, A., and Pérez García-Pando, C.: Potential of TROPOMI for understanding spatio-temporal variations in surface NO2 and their dependencies upon land use over the Iberian Peninsula, Atmos. Chem. Phys., 23, 3905–3935, https://doi.org/10.5194/acp-23-3905-2023, 2023. a, b, c, d
Pinardi, G., Van Roozendael, M., Hendrick, F., Theys, N., Abuhassan, N., Bais, A., Boersma, F., Cede, A., Chong, J., Donner, S., Drosoglou, T., Dzhola, A., Eskes, H., Frieß, U., Granville, J., Herman, J. R., Holla, R., Hovila, J., Irie, H., Kanaya, Y., Karagkiozidis, D., Kouremeti, N., Lambert, J.-C., Ma, J., Peters, E., Piters, A., Postylyakov, O., Richter, A., Remmers, J., Takashima, H., Tiefengraber, M., Valks, P., Vlemmix, T., Wagner, T., and Wittrock, F.: Validation of tropospheric NO2 column measurements of GOME-2A and OMI using MAX-DOAS and direct sun network observations, Atmos. Meas. Tech., 13, 6141–6174, https://doi.org/10.5194/amt-13-6141-2020, 2020. a
Salmani, B., Ardakani, A., Alavi, S. J., Kiani, M. H., Ebrahimian, S., and Mohammadi, O.: Monitoring trends of CO, NO2, SO2, and O3 pollutants using time-series Sentinel-5 images based on Google Earth Engine, Pollutants, 3, 255–279, https://doi.org/10.3390/pollutants3020019, 2023. a
Tonion, F. and Pirotti, F.: SENTINEL-5P NO2 DATA: CROSS-VALIDATION AND COMPARISON WITH GROUND MEASUREMENTS, Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLIII-B3-2022, 749–756, https://doi.org/10.5194/isprs-archives-XLIII-B3-2022-749-2022, 2022. a, b
van Geffen, J., Eskes, H., Compernolle, S., Pinardi, G., Verhoelst, T., Lambert, J.-C., Sneep, M., ter Linden, M., Ludewig, A., Boersma, K. F., and Veefkind, J. P.: Sentinel-5P TROPOMI NO2 retrieval: impact of version v2.2 improvements and comparisons with OMI and ground-based data, Atmos. Meas. Tech., 15, 2037–2060, https://doi.org/10.5194/amt-15-2037-2022, 2022. a, b
Veefkind, J. P., Aben, I., McMullan, K., Förster, H., de Vries, J., Otter, G., Claas, J., Eskes, H. J., de Haan, J. F., Kleipool, Q., van Weele, M., Hasekamp, O., Hoogeveen, R., Landgraf, J., Snel, R., Tol, P., Ingmann, P., Voors, R., Kruizinga, B., Vink, R., Visser, H., and Levelt, P. F.: TROPOMI on the ESA Sentinel-5 precursor: a GMES mission for global observations of atmospheric composition for climate, air quality, and ozone layer applications, Remote Sens. Environ., 120, 70–83, https://doi.org/10.1016/j.rse.2011.09.027, 2012. a, b, c, d, e
Verhoelst, T., Compernolle, S., Pinardi, G., Lambert, J.-C., Eskes, H. J., Eichmann, K.-U., Fjæraa, A. M., Granville, J., Niemeijer, S., Cede, A., Tiefengraber, M., Hendrick, F., Pazmiño, A., Bais, A., Bazureau, A., Boersma, K. F., Bognar, K., Dehn, A., Donner, S., Elokhov, A., Gebetsberger, M., Goutail, F., Grutter de la Mora, M., Gruzdev, A., Gratsea, M., Hansen, G. H., Irie, H., Jepsen, N., Kanaya, Y., Karagkiozidis, D., Kivi, R., Kreher, K., Levelt, P. F., Liu, C., Müller, M., Navarro Comas, M., Piters, A. J. M., Pommereau, J.-P., Portafaix, T., Prados-Roman, C., Puentedura, O., Querel, R., Remmers, J., Richter, A., Rimmer, J., Rivera Cárdenas, C., Saavedra de Miguel, L., Sinyakov, V. P., Stremme, W., Strong, K., Van Roozendael, M., Veefkind, J. P., Wagner, T., Wittrock, F., Yela González, M., and Zehner, C.: Ground-based validation of the Copernicus Sentinel-5P TROPOMI NO2 measurements with the NDACC ZSL-DOAS, MAX-DOAS and Pandonia global networks, Atmos. Meas. Tech., 14, 481–510, https://doi.org/10.5194/amt-14-481-2021, 2021. a, b, c
Verma, A., Ranga, V., and Vishwakarma, D. K.: BREATH-Net: a novel deep learning framework for NO2 prediction using bi-directional encoder with transformer, Environ. Monit. Assess., 196, https://doi.org/10.1007/s10661-024-12455-y, 2024. a, b
Voiculescu, M., Constantin, D.-E., Condurache-Bota, S., Călmuc, V., Rosu, A., and Dragomir Bălănică, C. M.: Role of meteorological parameters in the diurnal and seasonal variation of NO2 in a Romanian urban environment, Int. J. Environ. Res. Public Health, 17, 6228, https://doi.org/10.3390/ijerph17176228, 2020. a, b
Wang, H., Luo, X., Liu, C., Fu, Q., and Yi, M.: Spatio-temporal variation-induced group disparity of intra-urban NO2 exposure, Int. J. Environ. Res. Public Health, 19, 5872, https://doi.org/10.3390/ijerph19105872, 2022. a, b
World Health Organization: Ambient air pollution: a global assessment of exposure and burden of disease, World Health Organization, 121pp., https://apps.who.int/iris/handle/10665/250141 (last access: 10 April 2024), 2016. a
World Health Organization: WHO global air quality guidelines: particulate matter (PM2.5 and PM10), ozone, nitrogen dioxide, sulfur dioxide, and carbon monoxide, World Health Organization, xxi + 273pp., https://apps.who.int/iris/handle/10665/345329 (last access: 10 April 2024), 2021. a
Yang, J., Honrath, R. E., Peterson, M. C., Parrish, D. D., and Warshawsky, M.: Photostationary state deviation-estimated peroxy radicals and their implications for HOx and ozone photochemistry at a remote northern Atlantic coastal site, J. Geophys. Res.-Atmos., 109, D02312, https://doi.org/10.1029/2003JD003983, 2004. a
Zoogman, P., Liu, X., Suleiman, R. M., Pennington, W. F., Flittner, D. E., Al-Saadi, J. A., Hilton, B. B., Nicks, D. K., Newchurch, M. J., Carr, J. L., Janz, S. J., Andraschko, M. R., Arola, A., Baker, B. D., Canova, B. P., Chan Miller, C., Cohen, R. C., Davis, J. E., Dussault, M. E., Edwards, D. P., Fishman, J., Ghulam, A., González Abad, G., Grutter, M., Herman, J. R., Houck, J., Jacob, D. J., Joiner, B. J., Kerridge, J., Kim, J., Krotkov, N. A., Lamsal, L., Li, C., Lindfors, A., Martin, R. V., McElroy, C. T., McLinden, C., Natraj, V., Neil, D. O., Nowlan, C. R., O'Sullivan, E. J., Palmer, P. I., Pierce, R. B., Pippin, M. R., Saiz-Lopez, A., Spurr, R. J. D., Szykman, J. J., Torres, O., Veefkind, J. P., Veihelmann, B., Wang, H., Wang, J., and Chance, K.: Tropospheric Emissions: Monitoring of Pollution (TEMPO), J. Quant. Spectrosc. Ra., 186, 17–39, https://doi.org/10.1016/j.jqsrt.2016.05.008, 2017. a