Direct-sun versus sky-scan Pandora formaldehyde retrievals: implications for satellite validation and sampling representativeness in Tropical Southeast Asia

Arul, Santanasawry A. L. David; Chang, Jackson Hian-Wui; Wong, Yong Jie; Ooi, Maggie Chel-Gee; Liew, Juneng; Chee, Fuei Pien; Dayou, Jedol; Sentian, Justin; Aryastana, Putu; Lin, Neng-Huei

doi:10.5194/amt-19-3713-2026

Articles | Volume 19, issue 11

https://doi.org/10.5194/amt-19-3713-2026

Articles | Volume 19, issue 11

Research article

04 Jun 2026

Research article |

| 04 Jun 2026

Direct-sun versus sky-scan Pandora formaldehyde retrievals: implications for satellite validation and sampling representativeness in Tropical Southeast Asia

Santanasawry A. L. David Arul, Jackson Hian-Wui Chang, Yong Jie Wong, Maggie Chel-Gee Ooi, Juneng Liew, Fuei Pien Chee, Jedol Dayou, Justin Sentian, Putu Aryastana, and Neng-Huei Lin

Abstract

Ground-based Pandora spectrometers are widely used for validating satellite formaldehyde (HCHO) retrievals, yet the roles of retrieval geometry and sampling representativeness remain poorly constrained in tropical environments. This study evaluates Pandora Level-2 HCHO columns from five Southeast Asian stations (2021–2024), distinguishing between direct-sun (DS) and sky-scan (SS) observations and comparing them with satellite products from Ozone Monitoring Instrument (OMI), TROPOspheric Monitoring Instrument (TROPOMI), and Geostationary Environment Monitoring Spectrometer (GEMS) following uncertainty-based quality control. DS and SS retrievals show strong internal consistency under high-quality conditions (r > 0.7). Besides, DS generally exhibits greater variability and, at several sites, stronger sensitivity to localized variability, whereas SS often yields lower RMSE/MAE and more spatially representative agreement with satellite observations. Satellite comparisons reveal a clear performance hierarchy: OMI shows weak correlations (r < 0.4) and large errors, TROPOMI improves agreement (r ≈ 0.3–0.5), and GEMS further enhances performance in urban environments (r ≈ 0.58–0.65 at Bangkok) with reduced error magnitudes due to higher temporal sampling. However, discrepancies persist even under near-synchronous conditions, indicating that improved temporal resolution – within current satellite capabilities – does not fully resolve satellite–ground differences. These residual differences are consistent with sampling scale mismatches between localized Pandora measurements and spatially averaged satellite footprints. Overall, the results demonstrate that satellite validation in tropical regions is governed by the combined effects of retrieval geometry, spatial sampling, and temporal resolution, providing a framework for interpreting satellite–ground HCHO comparisons and guiding future validation strategies.

Download & links

Article (PDF, 7130 KB)

Supplement (2013 KB)

Download & links

How to cite.

Received: 06 Feb 2026 – Discussion started: 23 Feb 2026 – Revised: 18 May 2026 – Accepted: 19 May 2026 – Published: 04 Jun 2026

1 Introduction

Formaldehyde (HCHO) is a key intermediate in tropospheric photochemistry and one of the most important carbonyl compounds in the atmosphere. It is produced primarily through the oxidation of volatile organic compounds (VOCs) from both biogenic and anthropogenic sources and serves as an effective proxy for VOC emissions at local to regional scales. Through photolysis, HCHO represents a major source of hydroperoxy (HO₂) radicals, thereby enhancing ozone (O₃) production in the presence of nitrogen oxides (NO_x) and contributing to secondary organic aerosol formation. Owing to its short atmospheric lifetime, typically on the order of hours (Lim et al., 2019), HCHO exhibits strong spatial and temporal variability and is highly sensitive to changes in emissions, meteorology, and photochemical activity (Fang et al., 2017; Liao et al., 2021; Lim et al., 2019). Accurate characterization of HCHO is therefore essential for understanding air quality, constraining chemical transport models, and evaluating emission control strategies, particularly in regions with intense photochemistry and episodic pollution.

A range of techniques exists for measuring atmospheric HCHO, including in situ sensors and ground-based remote sensing methods such as differential optical absorption spectroscopy (DOAS) (Liu et al., 2020; Pinardi et al., 2013), Fourier transform infrared spectroscopy (FTIR) (Jones et al., 2009; Vigouroux et al., 2018), and cavity-enhanced absorption spectrometers (Glowania et al., 2021). While these instruments can provide high-precision observations, their deployment is limited by cost, logistical complexity, and maintenance requirements (Lee et al., 2024; Tian et al., 2019). As a result, routine HCHO measurements are rarely included in national air quality monitoring networks, leading to substantial observational gaps, especially in rapidly developing regions such as Southeast Asia. Satellite remote sensing has therefore become a critical tool for monitoring HCHO, offering consistent spatial coverage and long-term observations that enable the identification of emission hotspots, seasonal variability, and regional trends.

Among satellite instruments, the Ozone Monitoring Instrument (OMI) onboard NASA's Aura satellite has provided global HCHO observations since 2004 (Tanskanen et al., 2006), with near-daily coverage and a nadir footprint of approximately 13 × 24 km² (Ahn et al., 2008). OMI HCHO products have been widely used for air quality and atmospheric chemistry studies (Zhu et al., 2017); however, their accuracy is affected by cloud contamination, aerosol loading, surface reflectance, and viewing geometry. Consequently, robust validation using independent ground-based measurements remains essential (Harkey et al., 2021). In recent years, the Pandonia Global Network (PGN) of Pandora spectrometers has emerged as a key resource for satellite validation. Pandora instruments retrieve total vertical columns of trace gases using high-resolution UV–visible spectroscopy and offer standardized, long-term observations across a growing global network.

For HCHO, Pandora spectrometers provide two physically distinct retrieval modes – Direct-sun and Sky-scan – yet their differing sensitivities and implications for satellite validation remain insufficiently quantified (Herman et al., 2015). Direct-sun retrievals sample the atmospheric column along a narrow solar beam, while sky-scan observations integrate scattered radiation across multiple viewing angles. Differences between direct-sun and sky-scan retrievals are primarily associated with sampling characteristics and spatial representativeness. While the two retrieval modes may differ in their effective sensitivity to atmospheric structure, this study focuses on their observational behaviour and consistency rather than explicit vertical sensitivity differences. Direct-sun (DS) and sky-scan (SS) retrievals are often analyzed separately in validation studies due to their differing measurement characteristics. Recent work has proposed approaches to combine DS and SS observations by accounting for systematic differences in bias and sampling (Rawat et al., 2025). However, the extent to which these retrieval geometries influence satellite–ground agreement, particularly in terms of spatial-temporal representativeness, remains insufficiently quantified. This lack of distinction is particularly consequential in tropical environments, where strong emission heterogeneity, biomass burning, and rapid photochemical production amplify sub-pixel variability (Herman et al., 2015). A systematic evaluation of Direct-sun versus Sky-scan Pandora HCHO, and their respective consistency with satellite observations, therefore, represents a critical but largely unexplored gap in current validation frameworks.

Validation efforts for satellite HCHO products have largely focused on mid-latitude regions in North America, Europe, and East Asia (Palmer et al., 2003; Spinei et al., 2018; Zhu et al., 2016), often leveraging intensive field campaigns. Comparatively few studies have examined tropical environments, where high solar irradiance, frequent convection, complex cloud fields, and recurrent biomass burning introduce additional challenges for both satellite and ground-based retrievals (Hansen et al., 2019). Southeast Asia is a particularly critical yet underexplored region, characterized by dense urban emissions, seasonal agricultural burning, and persistent transboundary haze, all of which drive strong variability in HCHO (Cheong et al., 2019; Fu et al., 2007). These conditions amplify the importance of understanding how ground-based sampling geometry interacts with satellite spatial resolution.

Recent satellite instruments such as the TROPOspheric Monitoring Instrument (TROPOMI) provide substantially higher spatial resolution than OMI and enable improved detection of localized HCHO enhancements under favorable conditions (Lee et al., 2024; Su et al., 2020). However, higher spatial resolution alone does not eliminate representativeness errors when comparing satellite and ground-based observations, particularly in heterogeneous tropical environments (Boersma et al., 2016). TROPOMI retrievals remain sensitive to cloud fraction, aerosol loading, and surface reflectance, and their smaller pixel size can increase sensitivity to localized plumes that may not be representative of broader atmospheric columns (Boersma et al., 2016; De Smedt et al., 2018). In addition, OMI's coarser spatial footprint provides a stable reference for diagnosing first-order effects related to spatial representativeness. Complementing these polar-orbiting sensors, the Geostationary Environment Monitoring Spectrometer (GEMS) offers hourly observations over East and Southeast Asia, enabling improved characterization of diurnal variability and reducing temporal sampling mismatches in satellite–ground comparisons (Bak et al., 2019). The combined use of OMI, TROPOMI, and GEMS therefore provides a comprehensive framework to disentangle the relative roles of spatial resolution, temporal sampling, and retrieval geometry in satellite validation. In this context, differences between Pandora Direct-sun and Sky-scan observations can be evaluated more robustly across multiple observational scales, providing improved insight into the factors governing satellite–ground consistency in tropical environments.

In this study, we present a comprehensive evaluation of Pandora HCHO observations across Southeast Asia, explicitly distinguishing between direct-sun and sky-scan retrievals and assessing their consistency with multiple satellite products (OMI, TROPOMI, and GEMS). By applying an uncertainty-based quality-control framework and a unified temporal collocation strategy, this work aims to quantify how retrieval geometry, temporal sampling, and spatial representativeness jointly influence satellite–ground agreement in tropical environments.

2 Method

2.1 Ground-based Pandora formaldehyde observations

This study utilizes formaldehyde (HCHO) measurements from five Pandora spectrometer systems located across Southeast Asia (Fig. 1), a region characterized by a tropical climate and high atmospheric variability. Table 1 summarizes the Pandora monitoring stations used in this study, including its location, altitude, product status, and data availability. While previous validation studies have extensively focused on Pandora stations situated in mid-latitude regions (e.g., North America, Europe, Korea) (Lee et al., 2024; Spinei et al., 2018; Tzortziou et al., 2012), relatively few have addressed stations in low-latitude, equatorial zones. This gap is particularly relevant, as the tropics play a critical role in global atmospheric chemistry, with intense photochemical activity and biomass burning events influencing trace gas distributions.

https://amt.copernicus.org/articles/19/3713/2026/amt-19-3713-2026-f01

Figure 1Geographic distribution of the Pandora observation sites used in this study, including Bangkok (a), Bandung (b), Agam (c), Pontianak (d), and Singapore-NUS (e). All sites utilize Pandora data version rfus5p1-8 (Direct-sun) and rfuh5p1-8 (Sky-scan).

Table 1Summary of Pandora monitoring stations used in this study, including location, altitude, product status, and data availability. Data description: Formaldehyde (HCHO) Level 2, Version: rfus5p1-8 and rfuh5p1-8 (http://data.pandonia-global-network.org, last access: 27 February 2025).

Download Print Version | Download XLSX

All Pandora instruments analyzed in this study operate using both Direct-sun and Sky-scan viewing geometries, enabling a systematic assessment of geometry-dependent retrieval behavior (Herman et al., 2015). We use Level 2 HCHO products from the rfus5p1-8 and rfuh5p1-8 processing version (http://data.pandonia-global-network.org), which provides HCHO columns derived from direct-Sun and diffuse-sky measurements. Direct-sun (DS) retrievals provide total column HCHO along the solar beam, whereas sky-scan (SS) retrievals represent a tropospheric column derived from multi-angle scattered radiation measurements, with sensitivity that depends on retrieval configuration and atmospheric conditions. The rfus5p1-8 product is selected for its improved numerical stability and reduced noise relative to the rfuh5p1-8 product, which incorporates horizon scans and is more susceptible to variability under heterogeneous cloud and aerosol conditions. The inclusion of both Direct-sun and Sky-scan retrievals allows for a robust evaluation of retrieval performance under varying solar geometries and atmospheric conditions, particularly relevant in tropical environments.

The selected stations include Bangkok (190s1, 13.78° N, 100.54° E, 60 m a.s.l.), an urban megacity with heavy traffic and industrial emissions; Bandung (210s1, −6.89° S, 107.59° E, 752 m a.s.l.), a highland city in Indonesia surrounded by volcanic mountains and agricultural activity; Agam (211s1, −0.20° S, 100.32° E, 865 m a.s.l.), a remote and elevated background site in West Sumatra with limited anthropogenic influence; Pontianak (212s1, 0.04° N, 109.34° E, 1 m a.s.l.), a coastal equatorial station in West Kalimantan, Indonesia, known for frequent cloud cover and convective activity; and Singapore-NUS (77s1, 1.30° N, 103.77° E, 77 m a.s.l.), an urban tropical island site with a dense population and significant marine and urban air interactions. All Pandora instruments used in this study are part of the Pandora Global Network; however, data quality is not assumed a priori and is evaluated using uncertainty-based quality control criteria applied in this work. Collectively, this network offers a valuable opportunity to evaluate satellite HCHO products in complex tropical environments that are typically underrepresented in validation studies.

Uncertainty-based quality control protocol

To improve the robustness of ground-based HCHO observations used for intercomparison and satellite validation, an uncertainty-based quality control (QC) protocol following the methodological framework of Rawat et al. (2025) was applied to contemporaneous Pandora direct-sun (DS) and sky-scan (SS) observations. DS and SS retrievals were first paired within a 5 min tolerance window. A high-quality reference subset was then defined using Pandora quality flags QF = 0 or 10 for both DS and SS retrievals, and dynamic absolute uncertainty thresholds were calculated separately for DS and SS as the mean plus three standard deviations of the uncertainty in this subset. Matched observations were retained when both DS and SS absolute uncertainties were below the dynamic thresholds. In addition, observations exceeding the absolute uncertainty thresholds were retained if both relative uncertainties were ≤ 10 %. Additional filters required WRMS < 0.01 for both DS and SS retrievals and, for sky-scan observations, maximum horizontal distance (MHxD) < 20 km when available. Pandora quality flags were subsequently used to classify observations into high-quality (QF = 0, 10), medium-quality (QF = 1, 11), low-quality (QF = 2, 12), and unusable (QF ≥ 20) categories for diagnostic analysis. This procedure reduces the influence of retrieval noise, poor spectral fits, and unfavorable viewing geometry prior to satellite collocation.

Application of this uncertainty-based QC protocol serves two primary purposes. First, it removes retrievals affected by elevated noise, poor spectral fits, or unfavorable viewing geometry, thereby improving internal consistency between DS and SS datasets. Second, it reduces the impact of retrieval artefacts that may otherwise propagate into satellite validation analyses. In this way, the assured/not-assured classification within the Pandora Global Network does not directly determine data usability for this study. Instead, data quality is evaluated using uncertainty-based criteria, including relative uncertainty, spectral fitting residual (WRMS), and additional screening parameters, ensuring consistent selection of physically reliable observations. Importantly, the QC filtering was applied prior to temporal collocation with satellite observations, ensuring that validation statistics reflect physically meaningful retrieval differences rather than artefacts associated with measurement uncertainty. Although the QC procedure reduces the number of available observations, particularly for sky-scan retrievals under heterogeneous atmospheric conditions, it improves the reliability of ground-based reference data used for satellite validation. This is especially critical in tropical regions, where strong variability in cloud cover, boundary-layer dynamics, and photochemical production can introduce substantial retrieval uncertainty if not appropriately constrained.

2.2 Satellite HCHO retrievals from OMI, TROPOMI and GEMS

The OMI HCHO product (OMHCHO Version 003) provides tropospheric vertical column densities at a nominal nadir spatial resolution of approximately 13 × 24 km² (Herman et al., 2018; Lamsal et al., 2014). OMI has a fixed local overpass time near 13:30, enabling comparison with ground-based measurements during the early afternoon photochemical period. Station-level OMI HCHO values were extracted from swath pixels within a 10 km radius of each Pandora site. Standard quality screening included cross-track quality flag XtrackQualityFlags = 0, solar zenith angle SZA < 60°, and cloud fraction AMFCloudFraction < 0.3 (Johnson et al., 2024). Although OMI retrievals are limited by pixel-level noise and susceptibility to cloud contamination, their long-term continuity and global coverage provide valuable insight into atmospheric HCHO variability (Harkey et al., 2021), especially in tropical regions where ground-based observations are scarce.

TROPOMI, launched in 2017, provides substantially finer spatial sampling than OMI and improved signal-to-noise performance. For the product version used here, the nominal pixel size is approximately 5.5 × 3.5 km² (De Smedt et al., 2021). The TROPOMI HCHO product (S5P OFFL HCHO) is derived using a similar DOAS framework but includes updated air-mass factor calculations and surface reflectance treatment (Su et al., 2020). Station-level TROPOMI HCHO values were extracted from pixels within a 10 km radius of each Pandora site. Quality screening followed recommended criteria, including qa_value ≥ 0.5, cloud fraction cloud_fraction_crb < 0.3, and SZA < 60° (De Smedt et al., 2021; Dimitropoulou et al., 2021). TROPOMI can be regarded as the next-generation continuation of the UV–visible trace-gas observing capability established by OMI, providing improved spatial resolution and signal-to-noise performance while maintaining similar measurement principles and orbital sampling. The temporal overlap between OMI and TROPOMI enables consistent long-term validation of satellite HCHO retrievals and facilitates assessment of algorithm evolution across successive instrument generations. The inclusion of both OMI and TROPOMI allows evaluation of retrieval consistency across successive satellite generations. While OMI provides a long-term observational baseline beginning in 2004, TROPOMI extends this record with enhanced spatial resolution and improved sensitivity to sub-pixel variability. The overlap period between the two sensors enables assessment of temporal continuity in satellite HCHO products and supports robust validation of long-term atmospheric composition trends.

Satellite observations from the Geostationary Environment Monitoring Spectrometer (GEMS) onboard the GEO-KOMPSAT-2B platform were additionally used to complement polar-orbiting measurements. GEMS provides hourly hyperspectral observations over East and Southeast Asia, enabling improved characterization of diurnal variability in tropospheric formaldehyde (HCHO) (Lee et al., 2024). In this study, Level-2 HCHO data (GEMS L2 HCHO) from January 2021 to December 2024 were obtained via the National Institute of Environmental Research (NIER) API, with only forward-calculated (FC) retrievals retained to ensure algorithmic consistency and data reliability. Station-level GEMS HCHO values were derived by averaging pixels within a 10 km radius of each Pandora site. Quality control followed conservative filtering criteria, including FinalAlgorithmFlags = 0, cloud radiance fraction < 0.4, and solar zenith angle SZA < 60° (Lee et al., 2024). The inclusion of GEMS provides enhanced temporal sampling relative to polar-orbiting sensors, allowing improved assessment of sub-daily variability and reducing temporal representativeness errors in satellite–ground validation over Southeast Asia.

2.3 Pandora–satellite collocation strategy and validation diagnostics

To evaluate the consistency between ground-based and satellite-derived HCHO columns, filtered Pandora observations were collocated with station-level OMI, TROPOMI and GEMS retrievals using a time-based matching framework designed to account for differences in temporal sampling. The analysis includes observations from OMI, TROPOMI, and GEMS over the period 2021–2024, allowing a more robust and statistically consistent evaluation of satellite–ground agreement across multiple observational platforms. The overall methodology of the study is illustrated in Fig. 2. Two complementary approaches were applied. First, a nearest-time matching method paired each satellite observation with the closest Pandora measurement within a ±2 h tolerance window. Second, an overpass-window averaging method was used, in which all Pandora observations within symmetric windows centered on the satellite overpass time were averaged to form representative ground-based column estimates. Three temporal windows were tested (±30 min, ±1 h, and ±2 h) to assess sensitivity to temporal smoothing. Satellite HCHO columns were calculated by averaging all valid pixels within a 10 km radius of each Pandora site, providing a spatially representative estimate consistent with the effective sampling scale of ground-based observations. For each collocation configuration, mean bias, mean absolute error (MAE), root-mean-square error (RMSE), Pearson correlation coefficient (r), and linear regression parameters were calculated. Time-series and scatter-plot analyses were used to examine temporal consistency and structural agreement across stations and viewing geometries. To further investigate the origin of satellite–ground discrepancies, the relationship between absolute bias and short-timescale Pandora variability was also examined, where Pandora variability was quantified as the range (maximum minus minimum) of Pandora HCHO observations within the nearest-time matching of ±2 h tolerance window.

https://amt.copernicus.org/articles/19/3713/2026/amt-19-3713-2026-f02

Figure 2Flowchart illustrating the satellite–Pandora HCHO validation framework applied in this study. The methodology includes uncertainty-based quality control of Pandora observations following Rawat et al. (2025), standard quality screening of OMI, TROPOMI and GEMS retrievals, temporal collocation using multiple overpass windows, and statistical evaluation of bias, error metrics, and representativeness effects in tropical environments.

Direct-sun versus sky-scan Pandora formaldehyde retrievals: implications for satellite validation and sampling representativeness in Tropical Southeast Asia

2.1 Ground-based Pandora formaldehyde observations

Uncertainty-based quality control protocol

2.2 Satellite HCHO retrievals from OMI, TROPOMI and GEMS

2.3 Pandora–satellite collocation strategy and validation diagnostics

3.1 Impact of uncertainty-based quality control on Pandora HCHO retrievals

3.2 Consistency between direct-sun and sky-scan HCHO retrievals after quality control

3.3 Distributional characteristics and quality-flag behaviour of Pandora HCHO retrievals

3.4 Impact of temporal collocation and sub-pixel variability on OMI and TROPOMI validation

3.5 Role of High-Temporal-Resolution Observations: Insights from GEMS HCHO Retrievals