Nitrogen dioxide and formaldehyde measurements from the GEOstationary Coastal and Air Pollution Events ( GEO-CAPE ) Airborne Simulator over Houston , Texas

The GEOstationary Coastal and Air Pollution Events (GEO-CAPE) Airborne Simulator (GCAS) was developed in support of NASA’s decadal survey GEO-CAPE geostationary satellite mission. GCAS is an airborne pushbroom remote-sensing instrument, consisting of two channels which make hyperspectral measurements in the ultraviolet/visible (optimized for air quality observations) and the visible–near infrared (optimized for ocean color observations). The GCAS instrument participated in its first intensive field campaign during the Deriving Information on Surface Conditions from Column and Vertically Resolved Observations Relevant to Air Quality (DISCOVER-AQ) campaign in Texas in September 2013. During this campaign, the instrument flew on a King Air B-200 aircraft during 21 flights on 11 days to make air quality observations over Houston, Texas. We present GCAS trace gas retrievals of nitrogen dioxide (NO2) and formaldehyde (CH2O), and compare these results with trace gas columns derived from coincident in situ profile measurements of NO2 and CH2O made by instruments on a P-3B aircraft, and with NO2 observations from ground-based Pandora spectrometers operating in direct-sun and scattered light modes. GCAS tropospheric column measurements correlate well spatially and temporally with columns estimated from the P-3B measurements for both NO2 (r2 = 0.89) and CH2O (r2 = 0.54) and with Pandora direct-sun (r2 = 0.85) and scattered light (r2 = 0.94) observed NO2 columns. Coincident GCAS columns agree in magnitude with NO2 and CH2O P-3B-observed columns to within 10 % but are larger than scattered light Published by Copernicus Publications on behalf of the European Geosciences Union. 5942 C. R. Nowlan et al.: GCAS measurements of NO2 and CH2O Pandora tropospheric NO2 columns by 33 % and direct-sun Pandora NO2 columns by 50 %.


Introduction
The GEOstationary Coastal and Air Pollution Events (GEO-CAPE) Airborne Simulator (GCAS) is an airborne hyperspectral remote-sensing instrument that was developed in support of future Earth-observing geostationary satellite missions.GCAS was originally developed by NASA Goddard Space Flight Center's (GSFC) Radiometric Calibration and Flight Development Laboratory as a simulator for GEO-CAPE, a NASA decadal survey mission for observing pollution and ocean color from geostationary orbit (Fishman et al., 2012).GCAS is now also a test bed instrument for the Tropospheric Emissions: Monitoring of POllution (TEMPO) instrument (Chance et al., 2013;Zoogman et al., 2017), which will monitor air quality over North America from a geostationary orbit.TEMPO is the ultraviolet-visible-near-infrared (UV-Vis-NIR) air quality component of GEO-CAPE and is scheduled for launch in the 2019-2021 time frame.As a satellite airborne simulator, GCAS provides an algorithm development test bed for GEO-CAPE and TEMPO, serves as a satellite analogue during field campaigns, and will eventually act as a validation instrument when geostationary satellite instruments are on orbit.
GCAS is a push-broom remote-sensing instrument consisting of two spectrometers.The first spectrometer operates in the UV-Vis region of the spectrum, where observations can be made of several atmospheric constituents of interest to air quality.The second spectrometer operates in the Vis-NIR for measurements focused on ocean color.In this paper, we focus on air quality observations of nitrogen dioxide (NO 2 ) and formaldehyde (CH 2 O) using data from the UV-Vis channel collected during the Deriving Information on Surface Conditions from Column and Vertically Resolved Observations Relevant to Air Quality (DISCOVER-AQ) campaign in Texas during September 2013.NO 2 and CH 2 O have spectral absorption signatures in the UV-Vis channel and are two core operational data products of future geostationary air quality instruments.
Nitrogen oxides (NO x =NO + NO 2 ) are of central importance to air quality and atmospheric chemistry.NO x is involved in the formation of photochemical ozone and fine aerosol particles, with implications for both surface air quality and climate.Both short-and long-term enhanced NO 2 concentrations are associated with increased mortality (Hoek et al., 2013;Mills et al., 2015).NO x emissions can also lead to excess nitrogen deposition (Fowler et al., 2013;Nowlan et al., 2014).Globally, the major sources of NO x are combustion, lightning and soils.In populated regions, sources are typically dominated by combustion of fuel for transportation and industry.The relatively strong NO 2 spectral absorption features at ultraviolet (Yang et al., 2014) and visible (Martin et al., 2002;Boersma et al., 2008;Richter et al., 2011;Bucsela et al., 2013) wavelengths have been used for over 2 decades to derive global maps of NO 2 from several sunsynchronous satellite sensors in low Earth orbit.
Formaldehyde (CH 2 O) is found in the Earth's atmosphere due to the oxidation of both methane and the non-methane volatile organic compounds (NMVOCs) that result from biogenic and anthropogenic activity and fires (Fried et al., 2008(Fried et al., , 2011(Fried et al., , 2016a, and references therein), and references therein).Industrial activity and fires can also be direct sources of CH 2 O (Fried et al., 2016b).The absorption signature of CH 2 O in the ultraviolet has permitted its detection from the same nadir-viewing satellite instruments that measure NO 2 (Chance et al., 2000;De Smedt et al., 2008, 2012;González Abad et al., 2015, 2016).Its short lifetime of ∼ 1.5-3 h (around local noon) means that satellite-observed CH 2 O can be used as a proxy of NMVOC emissions (Barkley et al., 2008;Zhu et al., 2014;Stavrakou et al., 2015).
NO 2 amounts over industrial regions and urban areas have been mapped at high spatial resolution by several recently developed airborne push-broom sensors (Heue et al., 2008;Popp et al., 2012;Schönhardt et al., 2015;Lawrence et al., 2015;Nowlan et al., 2016;Meier et al., 2017;Tack et al., 2017Tack et al., , 2018;;Vlemmix et al., 2017;Broccardo et al., 2018).Airborne remote-sensing CH 2 O measurements have previously been made from aircraft using limb-viewing geometry by airborne multi-axis differential optical absorption spectroscopy (AMAX-DOAS) (Baidar et al., 2013) and by the whisk-broom scanning technique (where the cross-track spatial dimension is provided by mechanical scanning) using the Airborne Compact Atmospheric Mapper (ACAM) (Liu et al., 2015b).Operated by the NASA GSFC Radiometric Calibration and Flight Development Laboratory, ACAM is a precursor instrument to GCAS and has also been used to measure NO 2 and ozone (Liu et al., 2015b;Lamsal et al., 2017).To the best of our knowledge, the GCAS measurements presented here are the first published CH 2 O observations from an airborne push-broom nadir mapper.
GCAS flew in its first field campaign during the DISCOVER-AQ campaign in Texas in 2013.In the following sections, we present and validate trace gas retrievals of NO 2 and CH 2 O from the GCAS instrument during DISCOVER-AQ Texas.Section 2 describes the GCAS instrument and measurement approach.Section 3 describes the DISCOVER-AQ Texas campaign deployment, measurements from GCAS and relevant ground-based spectrometers and in situ aircraft instruments, and the atmospheric models used in data analysis.Section 4 presents the trace gas retrievals, including the spectral fitting used to derive NO 2 and CH 2 O slant columns, and air mass factor (AMF) calculations.Section 5 describes the vertical column results from the campaign.Section 6 presents comparisons of GCAS observations with other coincident observations of NO 2 and CH 2 O.

The GCAS instrument
The GCAS instrument is a nadir-looking hyperspectral instrument consisting of two Offner spectrometers operating at wavelengths 300-490 nm (UV-Vis, air quality channel) and 480-900 nm (Vis-NIR, ocean color channel).The instrument has dimensions of 48 cm × 48 cm × 46 cm and a mass of 36 kg.We briefly describe the GCAS instrument below; a more detailed description of the instrument and laboratory characterization can be found in Kowalewski and Janz (2014).
Both the air quality and ocean color spectrometers in the GCAS instrument use charge-coupled device (CCD) array detectors to measure solar radiation backscattered from the surface and atmosphere.The push-broom technique used by GCAS provides data for constructing two-dimensional maps beneath the aircraft, and it is also employed by satellite instruments such as the Ozone Monitoring Instrument (OMI) (Levelt et al., 2006) and Ozone Mapping Profiler Suite (OMPS) nadir mapper (Flynn et al., 2014).In these instruments, one axis of the CCD array detector provides spectral information, while the other CCD axis provides spatial crosstrack information below the aircraft or satellite.The second spatial dimension is provided by the movement of the aircraft or satellite in its flight track.
The UV-Vis air quality channel consists of a thermoelectrically cooled 1072 × 1024 CCD detector array measuring an image with 1072 wavelengths in the spectral dimension and 1024 positions in the spatial dimension across the flight track, with a spectral sampling of 0.2 nm and spectral resolution of ∼ 0.57 nm.Polarization sensitivity is reduced by the use of a dual-wedge crystal quartz and fused-silica depolarizer fitted between the slit and instrument fore-optics.The Vis-NIR ocean color channel uses a 1004 × 1002 CCD array to collect spectra with a spectral sampling of 0.8 nm and resolution of 2.8 nm, and has an order sorting filter to reduce second-order grating effects.The spectrometer units are operated at a temperature of 20 • C and are stable to 0.25 • C 40 min after a nominal takeoff to a typical cruise altitude (Kowalewski and Janz, 2014).In addition to the two spectrometers, a video camera is also included in the housing for the purpose of collecting relevant scene information.
The GCAS instrument fore-optics collect backscattered light below the aircraft through a common fused-silica window.The full field of view (FOV) of the air quality channel covers 41 • in the cross-track dimension, and the instantaneous FOV (IFOV) along the flight track is 0.8 mrad.At a typical flight altitude of 9 km, this results in a swath width on the ground of about 6.7 km.The ocean color channel full FOV is 70 • , with an IFOV of 1.2 mrad.All observations in this study use the UV-Vis air quality channel.
Spectra are spatially averaged in post-processing to increase the signal-to-noise ratio for air quality trace gas observations.NASA GSFC typically produces averaged Level 1B calibrated spectra at 21 cross-track positions, at a spatial res-olution of 250 m across track and 500 m along track from a ∼ 9 km flight altitude, with a resulting signal-to-noise ratio of ∼ 360 at 340 nm and ∼ 540 at 440 nm.GCAS does not have a zenith sky reference measurement capability, unlike the Geostationary Trace gas and Aerosol Sensor Optimization (GeoTASO) (Nowlan et al., 2016) or ACAM (Liu et al., 2015b) airborne instruments also operated by the NASA GSFC.As a result, the reference spectra required by the GCAS trace gas retrievals must be derived from nadir observations over clean areas with relatively low pollution.

DISCOVER-AQ Texas 2013
DISCOVER-AQ (http://discover-aq.larc.nasa.gov/,last access: 23 October 2018) was a suborbital-class NASA Earth Venture mission consisting of four major field campaigns (Maryland 2011, California 2013, Texas 2013and Colorado 2014) whose goal was to improve air quality monitoring by satellites.During the campaigns, NASA's King Air B-200 (remote sensing) and P-3B (in situ) aircraft made measurements of trace gases, aerosols and meteorological variables, while balloon-borne, ship-based, mobile and stationary instruments collected large amounts of in situ and remotesensing data.
As part of the remote-sensing component of DISCOVER-AQ, NASA GSFC deployed the airborne ACAM scanning instrument during the Maryland 2011 (Liu et al., 2015a, b;Lamsal et al., 2017) and California 2013 campaigns, and the GCAS instrument during the Texas 2013 and Colorado 2014 campaigns.Additionally, the first test flights of the Geo-TASO airborne instrument, another geostationary airborne simulator, were performed during the Texas (Nowlan et al., 2016) and Colorado (Crawford et al., 2016) campaigns from the NASA HU-25C Falcon aircraft.Preliminary GCAS and GeoTASO NO 2 observations were compared in a previous paper (Nowlan et al., 2016).
The DISCOVER-AQ Texas campaign took place in September 2013.The campaign aircraft, sondes and groundbased instruments were based in and around Houston, Texas, an urban area with large emission contributions from both transportation and the petrochemical industry, and air quality often influenced by land-sea breezes.Figure 1 shows the location of the 10 DISCOVER-AQ ground sites with Pandora spectrometers which GCAS overflew and a day of flight tracks from the King Air B-200 and P-3B aircraft.Flight paths were chosen so that the aircraft passed over eight existing ground sites with surface air quality monitors several times per day, in support of the mission goal of investigating the relationship between trace gas columns and surface air quality.

GCAS observations
During the DISCOVER-AQ Texas campaign in September 2013, the NASA King Air B-200 carried the GCAS instrument for remote sensing of trace gases and aerosols, as well as the NASA High Spectral Resolution Lidar-2 (HSRL-2) instrument (Hair et al., 2008) for the purpose of measuring aerosol profiles below the aircraft.The B-200 typically flew at a cruise altitude of ∼ 9 km, with occasional descents to avoid cirrus clouds.Table 1 summarizes the 26 GCAS flights (21 for air quality and 5 for ocean color), which took place on 13 days.Most flights were designed to coincide with P-3B flight paths.The B-200 aircraft was based at Ellington Field in southeast Houston and typically flew a morning flight, refueled and then flew an afternoon flight.Each B-200 flight over Houston consisted of two overpasses of nearly the same flight path, so that there are typically four GCAS overpasses of Houston each day.The ocean color flights involved collecting data over the Gulf of Mexico in support of the ocean color component of GEO-CAPE.In this study, we focus only on the air quality flights over the Houston area.

Pandora observations
Total column observations of NO 2 were made from 15 ground-based Pandora spectrometers viewing in directsun (DS) mode (Herman et al., 2009) at 11 sites during the DISCOVER-AQ Texas campaign.GCAS overflew 14 of these spectrometers at 10 sites, which are summarized in Table 2. Pandora NO 2 is determined at a temporal resolution of 90 s using the ratio of direct-sun spectra to a reference spectrum derived by a top-of-the-atmosphere Langley extrapolation using spectra collected on a clear day with low NO 2 (Herman et al., 2009).Spectra are fit from 400 to 440 nm with NO 2 cross sections interpolated to 264 K ( Vandaele et al., 1998) and O 3 at 225 K (Brion et al., 1993).At solar zenith angles (SZAs) less than 80 • , the observed DS slant column is converted to vertical total column using a simple geometric air mass factor (Herman et al., 2009).Pandora DS NO 2 measurements have a nominal precision of 2.7 × 10 14 molecules cm −2 and accuracy of 2.7 × 10 15 molecules cm −2 .Pandora observations with fitting root mean square < 0.005 and relative error < 10 % are included in this study, to exclude possible cloud-contaminated measurements.
Pandoras also operated in multi-axis sky-scanning mode (MAX-DOAS) measuring lower-tropospheric NO 2 distribution and tropospheric columns at the La Porte, Moody Tower and Smith Point sites.Pandora head sensors sequentially pointed at 1, 2, 3, 4, 6, 8, 10, 15, 20, 30, 40 and 90 • elevation angles from the horizon with a field of view of 1.6 • .Azimuth angles were chosen to ensure an unobstructed view down to the horizon and were 320 • from north at La Porte, 45 • at Moody Tower and 270 • at Smith Point.Differential slant column densities of NO 2 and O 2 −O 2 within a single scan were calculated using a zenith sky reference spectrum.A temperature-dependent NO 2 absorption cross section (lin-  3 for references) were used in the MAX-DOAS fitting window 425-490 nm.The profile inversion was performed using the maximum a posteriori optimal estimation method (Rodgers, 2000) with aerosol and gas weighting functions calculated using the Vector Linearized Discrete Ordinate Radiative Transfer (VLIDORT) model (Spurr, 2008).Tropospheric columns were also estimated using a geometrical approach using NO 2 and O 2 −O 2 columns derived from 15 • elevation angle measurements when inversions failed.
The Pandora dataset contains observations from two Pandora instruments placed at the Moody Tower site at the University of Houston, 70 m above the surface.We correct for the column in the bottom 70 m of the atmosphere using in situ observations at the base and top of the towers collected every 5 min by the University of Houston following Nowlan et al. (2016).The in situ measurements indicated that NO 2 within these altitudes was usually well mixed at the overpasses.This correction varies in magnitude from 0.3 × 10 15 to 3.7×10 15 molecules cm −2 for different GCAS overpasses.

P-3B aircraft observations
The P-3B aircraft carried a suite of in situ instruments in order to profile the atmosphere during the campaign.Profiles were collected during aircraft spirals near eight DISCOVER-AQ ground sites, with each site typically overflown two or three times each day.Depending on the site and flight, the aircraft typically flew between a lowermost altitude of 0-300 m and an uppermost altitude of 3.5-5 km.The typical radius of a spiral was 4-5 km.
The National Center for Atmospheric Research's (NCAR) chemiluminescence instrument (P-CL) (Ridley and Grahek, 1990) measured in situ NO 2 concentrations from the P-3B.P-CL observations of NO 2 have uncertainties of 0.02 ppbv in precision and 10 % in accuracy.
The NCAR Differential Frequency Generation Absorption Spectrometer (DFGAS) (Weibring et al., 2006(Weibring et al., , 2007) ) measured in situ CH 2 O concentrations from the P-3B.The DFGAS instrument collects data with a temporal resolution of 1 s, with a 15 s background zero-air addition period every 60 to 120 s.This addition captures and removes both inlet/sample cell CH 2 O outgassing and optical noise.For a typical spiral, the temporal resolution translates to a vertical resolution of approximately 5 m.The 1 s measurements have a precision of ∼ 0.08 ppbv (upper limit) and an estimated accuracy of 4 % at the 1σ level.

Model simulations
This study uses model-simulated trace gas profiles for radiative transfer calculations in order to determine vertical column densities from observed slant column densities.Tropospheric simulations are performed with the Environmental Protection Agency's (EPA) Community Multiscale Air Quality (CMAQ) version 5.0.2 modeling system (Byun and Schere, 2006) over the campaign domain at a spatial resolution of 4 × 4 km 2 and a temporal resolution of 20 min.The model has 45 vertical levels from the surface to 50 hPa.The model's vertical resolution ranges from 22 m at the surface to ∼ 200 m at an altitude of 2 km, further increasing to ∼ 650 m by the aircraft flight altitude.CMAQ simulations are driven by offline meteorology from the Advanced Research Weather and Forecasting (WRF-ARW) model (Skamarock et al., 2008) via the Meteorology-Chemistry Interface Processor (MCIP) (Otte and Pleim, 2010).Loughner and Follette-Cook (2015) describe the CMAQ and WRF modeling approach used for the DISCOVER-AQ Texas campaign in detail.
Stratospheric NO 2 profiles used in the study are estimated using the PRATMO chemical box model (Prather, 1992;McLinden et al., 2000) from simulated profiles provided as a function of month, solar zenith angle and latitude.Stratospheric ozone profiles are from the September 2013 monthly climatology derived at 1 • × 1 • from the OMI ozone profile product (Liu et al., 2010) up to 0.3 hPa.

GCAS trace gas retrievals
The GCAS vertical column density retrieval uses a two-step approach.First, we derive the slant column density (SCD) by directly fitting a modeled spectrum to the observed spectrum, starting from a reference spectrum derived from observations over an unpolluted area.Second, we convert SCD to a vertical column density (VCD) using an AMF that represents the path of light through the atmosphere based on the viewing geometry and radiative transfer calculations.
The GCAS trace gas retrieval algorithms used in this paper are derived from the Smithsonian Astrophysical Observatory (SAO) trace gas algorithms originally developed for Global Ozone Monitoring Experiment (GOME), and since applied to GOME-2, SCIAMACHY, OMI, OMPS and Geo-TASO for a range of trace gases (Chance, 1998;Chance et al., 2000;Sioris et al., 2004;Nowlan et al., 2011;Chan Miller et al., 2014;Wang et al., 2014;González Abad et al., 2015, 2016;Nowlan et al., 2016).These algorithms are also the basis for the TEMPO trace gas retrieval algorithms.A separate slant column trace gas product at 350 m × 1000 m was provided by the GCAS instrument team at NASA GSFC to the DISCOVER-AQ data archive shortly after the campaign using the publicly available QDOAS spectral fitting package (http://uv-vis.aeronomie.be/software/QDOAS/,last access: 23 October 2018) and preliminary calibrated spectra.This product is not examined in the current study.Nowlan et al. (2016) compared preliminary SAO GCAS NO 2 slant columns with GeoTASO slant columns within 10 min and 500 m from four coincident flights during the DISCOVER-AQ Texas campaign (13, 14, 18 and 24 September) at a resolution of 250 m × 500 m.Overall, slant columns agreed well (r = 0.81, N = 77 320), with GCAS lower than GeoTASO by ∼ 6 %.The current GCAS retrieval algorithm used in this study is similar to the previous algorithm, but the slant column retrieval uses a separate reference spectrum for each cross-track position and a cross-track dependent instrument line shape, so that the results no longer require a crosstrack bias correction.The new NO 2 and CH 2 O products also include improved georegistration.

Spectral calibration
We perform spectral fitting to derive slant columns using radiometrically calibrated spectra (Level 1B), which are geolocated and derived from raw (Level 0) data using characteriza-tion data collected in the laboratory before the campaign, as described in detail by Kowalewski and Janz (2014).The firstguess wavelength calibration was determined from spectra collected in the laboratory using mercury-argon, cadmium, neon and krypton discharge lamps as sources.The pre-flight slit function shape and width as functions of cross-track position, wavelength and temperature were also determined using a tunable laser with an integrating sphere.These laboratory tests indicated the instrument's spectral shift is ∼ 0.004 nm and the change in the slit function's full width at half maximum (FWHM) is less than 0.0013 nm within the instrument's thermal stability range of ±0.25 • C and nominal operating temperature of 20 • C. Pressure changes within the instrument may also shift the wavelength calibration through changes in the index of refraction (Kuhlmann et al., 2016).These changes are minimized in GCAS and are primarily due to changes in ambient temperature, as the instrument is backfilled with gaseous nitrogen and sealed prior to aircraft integration to mitigate moisture.The impact of wavelength shifts on retrievals is further minimized through simultaneous fitting of a wavelength shift for each observation as described in Sect.4.2.3.
We further refine the instrument spectral registration and slit function calibration using spectra collected during the Texas flights, following our calibration approach previously applied to GOME, GOME-2, OCO-2, ACAM and GeoTASO (Liu et al., 2005;Cai et al., 2012;Liu et al., 2015b;Nowlan et al., 2016;K. Sun et al., 2017a, b).As a first step in the spectral fitting, we simultaneously derive a wavelength dispersion and slit function shape by fitting a reference spectrum to a high-spectral-resolution solar atlas (Chance and Kurucz, 2010).This is similar to the approach employed in our satellite retrievals, but, as the airborne nadir reference spectrum contains atmospheric features (which are not present in a satellite-observed exo-atmospheric reference), we also simultaneously fit preliminary amounts of the atmospheric molecular absorbers listed in Table 3 and the Ring effect (ro-tational Raman scattering) to account for these spectral features (Liu et al., 2015b;Nowlan et al., 2016).
We determine a separate wavelength dispersion and slit function shape for each of the 21 cross-track positions.The wavelength dispersion is determined by fitting the coefficients in a fifth-order (NO 2 ) or seventh-order (CH 2 O) polynomial that represents the wavelength as a function of detector pixel.For NO 2 , we model the slit function using an asymmetric super-Gaussian (Beirle et al., 2017).For CH 2 O, we fit parameters describing the shape and width of an asymmetric Gaussian (Cai et al., 2012;Nowlan et al., 2016).While the super-Gaussian works well for NO 2 , it results in a very small increase (∼ 5 %) in fitting residuals for CH 2 O over an asymmetric Gaussian, possibly due to the presence of a double shoulder to one side of the slit function shape, as measured in the laboratory at wavelengths less than 380 nm (Kowalewski and Janz, 2014).
The retrieved slit function in the NO 2 fitting window is nearly symmetric and very similar in width to the FWHM = 0.58 nm (Kowalewski and Janz, 2014) measured in the laboratory.Using in-flight data, the retrieved FWHM is 0.57 nm at the nadir center position, expanding to 0.58 nm at the edges of the swath.The retrieved slit width in the CH 2 O fitting window changes in a similar way but is larger at the edges (0.57 nm at swath center, growing to 0.60 nm at the leftmost cross-track position (no. 1) and to 0.63 nm at the rightmost cross-track position (no.21)).We estimate the uncertainty in the slit width is ∼ 0.01 nm, primarily due to temperature fluctuations during flight.In-flight data show the center detector pixel-to-wavelength registration for both the NO 2 and CH 2 O fitting windows varies nearly linearly as a function of swath cross-track position across the left-hand side of the swath, varying by ∼ 0.1 nm between cross-track positions 1 and 11, but remains approximately constant from positions 11 to 21.The retrieved wavelength calibration is stable to ∼ 0.002 nm after the instrument has thermally stabilized during flight.

Spectral fitting
We determine NO 2 and CH 2 O slant columns using leastsquares minimization to directly fit a modeled radiance spectrum F (x, b) to our observed radiance spectrum.The modeled spectrum is a function of pre-determined model parameters b and the retrieved state vector x.The modeled spectrum is represented by In this equation, I 0 is a reference spectrum determined from clean nadir observations, scaled by a retrieved intensity pa-rameter x a (which represents reflectivity factors such as surface albedo or clouds).The derivation of the reference is discussed in Sect.4.2.2.The term b u (λ) describes a correction for spectral undersampling (Chance et al., 2005), while b r (λ) represents the effects of rotational Raman scattering (Chance and Spurr, 1997).The retrieved differential slant columns are represented by x i .These differential slant columns are the differences between the slant columns in the nadir observation of interest and the slant columns in the reference spectrum.Their absorption cross sections, as listed in Table 3, convolved with the instrument line shape and corrected for the "I 0 effect" (Aliwell et al., 2002), are included as b i (λ).In addition, the retrieval also determines scaling (of order j ) and baseline (of order k) wavelength-dependent polynomial coefficients (x SC and x BL ) that represent lowfrequency wavelength-dependent effects from surface reflectivity, molecular scattering, aerosols and instrument artifacts.

Reference spectrum
Each trace gas retrieval uses a reference spectrum determined from nadir observations over a clean area.We determine a mean reference spectrum for each of the 21 cross-track positions by averaging 40 spectra at 250 m × 500 m resolution for each cross-track position from a cloud-free and clean area over the Gulf of Mexico during the 6 September afternoon flight.This location and date were chosen after CMAQ simulations and preliminary retrievals of NO 2 and CH 2 O predicted relatively low columns of those trace gases.In addition, we found that the use of a reference collected before the instrument was thermally stable (within the first ∼ 40 min of a flight) resulted in cross-track biases in the CH 2 O retrieval.
As observations over the relatively clean Gulf are often collected in the period soon after takeoff, this constraint limited the availability of a suitable reference to a reference spectrum taken late in the flight, close to landing, and with a relatively high solar zenith angle (58 • ).
We use a single reference spectrum at each cross-track position for the entire campaign, instead of a daily or higherfrequency reference, to ensure that all days during the campaign have the same background correction applied for the reference spectrum.Due to the use of a nadir reference, the retrieved differential slant columns must be corrected by the reference background column derived from the model to produce an effective tropospheric column (discussed in further detail in Sect.4.3).We find that the use of a single reference for the campaign removes day-to-day relative background biases in the GCAS column that can result from uncertainties in daily modeled columns and improves the daily consistency of background CH 2 O in the in situ P-3B CH 2 O comparison discussed later in Sect.6.1.The use of a single versus daily reference spectrum has little effect on the NO 2 validation.The NO 2 and CH 2 O slant column density retrievals use the fitting parameters summarized in Table 3. NO 2 is fit at wavelengths 420-465 nm with an NO 2 absorption cross section at 294 K.The NO 2 retrieval also simultaneously fits O 3 at two temperatures, as well as H 2 O vapor and O 2 −O 2 , which all have spectral absorption features in the NO 2 wavelength fitting window.The CH 2 O retrieval is performed at 328.5-356.5 nm and simultaneously fits NO 2 , O 3 , BrO and O 2 −O 2 .Both retrievals also fit the undersampling correction, Ring spectrum, a fifth-order scaling polynomial and a fourth-order baseline polynomial.Each retrieval also determines a wavelength shift that represents the relative difference in the detector pixel to wavelength registration between the radiance and reference spectra.

Conversion to vertical column
For air quality applications, we are interested in the vertical column density, V , of the trace gas (NO 2 or CH 2 O) in the troposphere.The vertical column density can be derived from the slant column density, S, using an air mass factor, A, which describes the mean light path through the atmosphere, by In practice, the retrieval algorithm determines a differential slant column S, which is the difference between the slant column S of the absorber in the spectrum of interest and the slant column S R in the reference spectrum.Each of these slant columns is the sum of the slant column of absorber in the light path above (↑) and below (↓) the aircraft, so that In terms of the air mass factor and vertical column, the vertical column below the aircraft can then be expressed as where the vertical columns V ↑ , V ↓ R and V ↑ R are typically determined from a model.Because the flight altitude of 9 km is well above the majority of tropospheric NO 2 and CH 2 O, we refer to V ↓ and V ↑ as the tropospheric and stratospheric trace gas columns.NO 2 above the aircraft is dominated by stratospheric NO 2 , varies primarily by time of day and ranges within V ↑ = 2.3-3.8 × 10 15 molecules cm −2 .CH 2 O in the model is more variable, with the early part of the campaign (4 to 18 September) seeing levels of V ↑ = 2-25 × 10 14 molecules cm −2 and the latter part seeing levels of V ↑ = 1-3 × 10 14 molecules cm −2 .For our chosen reference location, the modeled vertical columns below the aircraft are V ↓ R = 2.0×10 15 molecules cm −2 for NO 2 and V ↓ R = 7.5 × 10 15 molecules cm −2 for CH 2 O.The modeled vertical columns above the aircraft at the reference location are V ↑ R = 3.6 × 10 15 molecules cm −2 for NO 2 and V ↑ R = 7.9 × 10 14 molecules cm −2 for CH 2 O.

Air mass factor calculation
We calculate the air mass factors on a scene-by-scene basis using the formulation of Palmer et al. (2001) and Martin et al. (2002) with the VLIDORT radiative transfer model (Spurr, 2006(Spurr, , 2008)).In this approach, the radiative transfer model provides scattering weights w as a function of altitude z.The scattering weights describe the sensitivity of the measurement to the different altitude layers and are a function of the viewing geometry, ozone profile, aerosol and molecular scattering, and surface reflectance.These can be used with shape factor s, which is the normalized partial column n of the trace gas at each altitude layer: The AMF is defined as The air mass factor below the aircraft A ↓ is calculated from the surface z 0 to the aircraft altitude z ac as while the air mass factor above the aircraft A ↑ is determined from the aircraft altitude to the top of the atmosphere at z TOA , with

Radiative transfer calculations
We use the radiative transfer algorithm to determine scattering weights in 56 vertical layers.These include the 45 CMAQ layers up to ∼ 19 km and 11 additional layers to 0.3 hPa.We use the MODIS BRDF (bidirectional reflectance distribution functions) gap-filled MCD43GF V005 Band 3 product (Schaaf et al., 2002;Q. Sun et al., 2017) to represent surface reflectance in the VLIDORT model.This BRDF product is provided at a spatial resolution of 30 arcsec (∼ 0.80 km in longitude by 0.92 km in latitude over Houston) every 8 days, based on 16 days of MODIS measurements.
The MODIS Band 3 product is derived at 470 nm.While this is close to the NO 2 fitting window, there currently exists no BRDF climatology at shorter wavelengths.We determine effective BRDFs at 442 nm (NO 2 ) and 342 nm (CH 2 O) by scaling the BRDF functions by the ratio of the 0.5 • × 0.5 • monthly OMI Earth Surface Reflectance Climatology product (OMLER) (Kleipool et al., 2008) at either 442 or 342 nm to its value at 470 nm.These results are typically within 2 %-3 % of the results derived using a black-sky/white-sky approach to estimate surface reflectance (McLinden et al., 2014).
Figure 2 shows profiles for (1) a sample polluted observation at the Moody Tower site in downtown Houston and (2) the reference spectrum.For the AMF calculation, the shape factors are derived from the model profiles shown in Fig. 2a and c and then applied to the corresponding scattering weights.Differences in the scattering weights of the reference and Moody Tower observations at higher altitudes are mainly driven by differences in the solar zenith angles.The smaller CH 2 O scattering weights near the surface relative to those of NO 2 indicate the relatively lower sensitivity of the observations to near-surface CH 2 O.This is due primarily to the wavelength dependency of the AMF, as stronger Rayleigh scattering and ozone absorption at shorter wavelengths decreases the measurement sensitivity to lower altitudes.The AMF is calculated scene by scene for each nadir observation.The reference spectrum AMFs at the swath center are A

Cloud flagging
Only cloud-free measurements are used in this study, and the radiative transfer calculations assume cloud-free conditions.Unlike the case of satellite observations with footprints on the order of tens of square kilometers, GCAS observations are of sufficiently high spatial resolution that cloudy pixels can be discarded without loss of a significant amount of data.We flag as cloudy any pixel that has a mean radiance in the NO 2 fitting window over a threshold of 2 × 10 13 photons cm −2 nm −1 s −1 sr −1 , which is typically only exceeded in the case of a bright cloud.The Ring scattering parameter retrieved simultaneously with NO 2 and a color index (the radiance ratio at wavelengths 320 to 440 nm) are also used to flag less bright pixels where clouds likely occur (Wagner et al., 2014).

Trace gas uncertainties
Uncertainties in the vertical column density result from uncertainties in (1) the slant column fitting; (2) the air mass factor calculation; and (3) the modeled reference and stratospheric columns needed for determining the vertical column below the aircraft using Eq.(4).

Slant column uncertainties
The slant column fitting uncertainty on a single observation is dominated by the random noise in the spectrum.Over a typical day, the mean fitting uncertainty in an NO 2 differential slant column at 250 m × 500 m resolution is 1.3 × 10 15 molecules cm −2 , including all solar zenith angles in the morning and afternoon flights.The typical fitting uncertainty in a CH 2 O differential slant column is 2.5 × 10 16 molecules cm −2 .After AMFs are applied, typical mean vertical column precisions are 1×10 15 molecules cm −2 for NO 2 and 1.9 × 10 16 molecules cm −2 for CH 2 O.The precision requirements of the TEMPO instrument are 1 × 10 15 molecules cm −2 for NO 2 and 1 × 10 16 molecules cm −2 for CH 2 O (Zoogman et al., 2017).(Note that the signalto-noise ratio is higher at CH 2 O wavelengths relative to that at NO 2 wavelengths for TEMPO, which is the opposite of GCAS.)CH 2 O in particular is noisy at the provided GCAS spatial resolution of 250 m × 500 m, with enhanced CH 2 O columns often on the order of the retrieval precision.GCAS CH 2 O must be spatially averaged to meet the TEMPO precision requirement and to improve the detection limit in order to observe polluted columns over Houston.As a result, later in this paper we present CH 2 O maps  Marais et al., 2014;Zhu et al., 2014).Additional errors in NO 2 slant column retrievals can also result from the use of an NO 2 cross section at a single temperature (Boersma et al., 2004).The profile-weighted effective temperature of NO 2 during the Houston campaign in polluted observations was typically within a few degrees of the 294 K cross-section temperature, resulting in an expected bias within 1 %-2 % in the tropospheric slant column.The stratospheric slant column may be biased by ∼ 15 % due to its colder temperature, but the influence of this uncertainty is minimized by the use of a nadir reference spectrum, resulting in a possible systematic bias on the order of 4 × 10 14 molecules cm −2 (an uncertainty of 1 %-2 % for polluted pixels).Uncertainties in the laboratory cross sections introduce additional uncertainties in the slant columns of 2 % for NO 2 (Boersma et al., 2004) and 5 % for CH 2 O (Chance and Orphal, 2011).Uncertainties in the differential slant columns due to uncertainties in the spectral calibration are ∼ 5 × 10 13 molecules cm −2 for NO 2 and ∼ 2 × 10 15 molecules cm −2 for CH 2 O.

Air mass factor uncertainties
The air mass factor uncertainties in cloud-free satellite observations are typically dominated by uncertainties in the surface albedo, trace gas profile shape and aerosols (Boersma et al., 2004).A recent study by Lorente et al. (2017) found an average AMF structural uncertainty of 42 % in polluted observations and 31 % in unpolluted regions when different retrieval groups used different inputs to NO 2 AMF calculations; the most significant impacts overall were from differences in surface albedo, cloud parameters and trace gas profile inputs.
MODIS BRDF comparisons with aircraft observations of the surface indicate an uncertainty in the MODIS BRDF product of 20 % for both accuracy and precision (Román et al., 2011) at GCAS spatial resolutions.We estimate the impact of those uncertainties from the MODIS surface BRDF on our individual AMFs to be 10 % for polluted observations and 5 % for clean observations.Y. Wang et al. (2010) showed that the use of the Lambertian approximation in the derivation of the MODIS products may result in surface reflectance underestimation of 0.008 by MODIS in the green bands.This surface bias on average could cause the GCAS AMF to be underestimated (and the resulting trace gas column to be overestimated) by ∼ 10 %.
The radiative effects of aerosols are not typically included in operational satellite retrieval trace gas AMFs, except as an implicit component of the cloud fraction, and we have not included aerosols in the current study.In reality, the pres-ence of aerosols can increase or decrease the AMF, with effects depending on aerosol type and altitude (Leitão et al., 2010;Lin et al., 2014;Chimot et al., 2016;Kwon et al., 2017;Meier et al., 2017).When scattering aerosols are in the boundary layer, for example, the backscattered light path increases the radiative sensitivity (an enhancement effect), resulting in an increase in the AMF.Ignoring these aerosols in the radiative transfer calculation will cause the retrieved column to be overestimated.When scattering aerosols are aloft, the radiative sensitivity decreases near the surface (a shielding effect), resulting in a decrease in the AMF.Absorbing aerosols aloft or at the altitude of the trace gas can decrease the measurement sensitivity by reducing the number of photons backscattered to the instrument, thereby reducing the AMF.Even when aerosols are considered, assumptions about aerosol optical properties and profiles can cause large uncertainties; Lorente et al. (2017) found different aerosol corrections used by different research groups introduced an average uncertainty of 50 % for polluted satellite observations with high aerosol loading.
Aerosol optical depth (AOD) measured by the HSRL lidar on the B-200 (Sawamura et al., 2017) showed aerosols varying day to day along the flight track and with altitude during the DISCOVER-AQ Texas campaign.The beginning of the campaign saw moderate AOD on the order of 0.2-0.3(532 nm), often with a smoke plume at altitudes 2-4 km which sometimes merged with aerosols from lower layers later in the day.Observed AODs rose sharply on 14 September, with AODs in excess of 0.7 in some areas.Aerosol loading from 18 September onwards was relatively low (< 0.15) and primarily located near the surface, with AOD occasionally reaching 0.25 at some points along the flight track.A full assessment of the effects of aerosols on the AMF is beyond the scope of this paper and the subject of ongoing work, but our simulations with typical AOD profiles from the HSRL lidar show a potential overestimation of the column of 10 %-30 % for individual polluted pixels when scattering aerosols in the planetary boundary layer (PBL) are ignored and a potential 15 % underestimation of the column when the smoke layer aloft is ignored.These results are consistent with Lin et al. (2014), whose satellite biases are typically within ±25 % due to the neglect of aerosols at these AODs.Nowlan et al. (2016) previously compared mean profile shapes from the P-CL observations and the CMAQ simulations for the eight core ground sites during DISCOVER-AQ Texas; mean differences were typically within 20 % for individual sites.Individual total column observations can vary by > 100 % (Nowlan et al., 2016), with differences mostly resulting from the small-scale features of NO 2 plumes, which are difficult to resolve with model resolution.Previous comparisons of DISCOVER-AQ Texas CMAQ CH 2 O 1 km simulations with P-3B DFGAS observations showed agreement between the model and observations for most days of the campaign (Fried et al., 2016b).The average of daily mean biases indicated a low bias of CMAQ relative to DFGAS of Atmos.Meas. Tech., 11, 5941-5964, 2018 www.atmos-meas-tech.net/11/5941/2018/−0.44 ± 0.39 ppbv in the PBL and −0.32 ± 0.40 ppbv overall (−11.8±15.7 %) over all days, excluding 25 September: a unique day characterized by very large CH 2 O levels of up to 25 ppbv as measured by the DFGAS instrument on the P-3B in the boundary layer over petrochemical facilities in Houston and up to 33 ppbv downwind over Galveston Bay and Smith Point later in the day due to photochemical processing (Fried et al., 2016b).From P-3B comparisons discussed later in this paper (Sect.6.3), we estimate these profile shape uncertainties typically result in uncertainties in the AMF of 10 % for NO 2 and 8 % for CH 2 O. Souri et al. (2018) calculated GCAS NO 2 vertical columns independently for our derived slant columns and found a mean tropospheric AMF over all days of 1.26 ± 0.32.This compares closely with our mean AMF of 1.29 ± 0.27.Their inputs included MODIS BRDF for surface reflectance, GEOS-Chem modeled stratospheric profiles, and an independently run CMAQ simulation whose aerosol fields were used to determine aerosol optical depths for input to the VLI-DORT model.The similar AMF from a separate study suggests a low structural uncertainty in AMF calculations using currently available ancillary information.

Modeled column uncertainties
Equation ( 4) requires the modeled vertical column below the aircraft at the reference spectrum location (V ↓ R ) and the modeled vertical columns above the aircraft at the observation location (V ↑ ) and the reference location (V ↑ R ).Systematic uncertainties in the effective slant columns above the aircraft at the observation and the reference may cancel out to some degree, but small uncertainties may still propagate to the final vertical column through the use of different observation and reference times and locations.We estimate an uncertainty of 30 % in the NO 2 stratospheric column, based on PRATMO comparisons with the Optical Spectrograph and InfraRed Imaging System (OSIRIS) limb sounder (Bourassa et al., 2011).We estimate reference location tropospheric vertical column uncertainties of 40 % for NO 2 and 31 % for CH 2 O, based on comparisons of the CMAQ model columns with the P-3B-inferred columns of the four cleanest spirals during the campaign at the coastal sites Galveston and Smith Point.An additional uncertainty is added by uncertainty in the reference AMFs, as discussed in the previous section.

Vertical column results
Retrieved GCAS columns in the Houston area during the campaign show enhanced NO 2 amounts over central Houston (close to Moody Tower), in the vicinity of the Houston Ship Channel industrial area (Pandora sites Channelview, Deer Park and La Porte) and sometimes along the more suburban flight track to the west of and over Manvel Croix, which is the case for morning overpasses on 6 and 13 September (Nowlan et al., 2016).Individual NO 2 plumes can also often be observed from single industrial sites and stacks.Emissions estimates using GCAS and CMAQ indicate the highest source regions for NO x are the Houston metropolitan area (145 t day −1 ), where mobile sources dominate; the Houston Ship Channel region (54 t day −1 ), where many petrochemical plants are concentrated; and, to a lesser extent, the Texas City area (17 t day −1 ), which is home to petroleum refining and petrochemical processing facilities (Souri et al., 2018).
Figures 3 and 4 show examples of retrieved NO 2 and CH 2 O tropospheric vertical columns for two consecutive days during the campaign and illustrate both the day-today and hourly variabilities observed in NO 2 and CH 2 O columns.In general, the largest NO 2 columns are seen in morning flights during all days of the campaign, with the peak columns varying with overpass time and meteorological conditions.The day of 24 September is typical of columns measured during the campaign in terms of magnitude.The 25 September flights show the largest pollution episode of the campaign.
Formaldehyde observations are noisier, but enhanced CH 2 O columns are clearly observable on some days when data are spatially averaged.In particular, 4 and 25 September show the largest CH 2 O enhancements, with peak values on the order of 5 × 10 16 molecules cm −2 at 1 km 2 resolution.Figure 4 shows the significant enhancement in CH 2 O near the Houston Ship Channel industrial area on 25 September.Several other days exhibit enhanced background over land, with the largest values of CH 2 O columns on these days to the north of Houston over the Conroe region, potentially from biogenic sources as well as transport of CH 2 O and its precursors.These days with large background CH 2 O highlight the importance of using the clean reference over the water, where background CH 2 O is typically lower than over land.
The month of September 2013 was relatively dry over Houston, and B-200 flights typically occurred on dry days with little cloud cover.Li et al. (2016) and Loughner and Follette-Cook (2015) describe the overall meteorological conditions present during the campaign as well as detailed www.atmos-meas-tech.net/11/5941/2018/Atmos.Meas.Tech., 11, 5941-5964, 2018  meteorological conditions on certain days.In the early part of September, the area saw little influence from strong synoptic weather systems, with winds mostly light and northeasterly in the early morning, shifting clockwise to southeasterly in the afternoons and resulting in the transport of clean marine air over Houston.The days of 11-14 September were characterized by winds from the northeast, parallel to the coastline.A cold front passed over Houston with northerly transport on 24 September, while 26 September saw stagnant conditions overnight, followed by a sea breeze.
The 25 September pollution episode has been previously examined in several model and in situ measurement studies (Loughner and Follette-Cook, 2015;Li et al., 2016;Souri et al., 2016;Fried et al., 2016b;Mazzuca et al., 2017;Pan et al., 2017).This day saw a light morning land breeze which removed pollutants from the Ship Channel to Galveston Bay.A later bay breeze then brought pollutants from the bay back to land.In the mid-morning, the prevailing winds were northwesterly over most of the city, with northeasterly winds observed at the Houston Ship Channel.When combined with the bay breeze, these winds set up a convergence zone, trapping pollutants from the Ship Channel region.In combination with a suspected emissions event (Fried et al., 2016b;Souri et al., 2018), these meteorological conditions produced very high levels of ozone, NO 2 , CH 2 O and related species.NO 2 columns on this day are largest in the morning flight, while CH 2 O columns are largest in the afternoon.GCAS is likely measuring both directly emitted CH 2 O and secondary CH 2 O produced from other precursors.

Comparisons with coincident measurements
In this section, we compare GCAS observations from all days with coincident observations from Pandoras and the P-3B aircraft.Figures 5 and 6 show enlarged views of NO 2 and CH 2 O observations over the downtown and Ship Channel regions of Houston on 25 September, along with coincident Pandora ground site observations and the P-3B flight track nearest in time.These figures illustrate the typical coverage of P-3B spirals relative to GCAS swaths, as well as the air mass measured in the bottom 2 km of the atmosphere by Pandora DS ground-based instruments.

P-3B airborne in situ measurements
We compare the retrieved GCAS NO 2 and CH 2 O columns with columns derived from in situ observations on the P-3B aircraft.The P-3B profiles are converted into column amounts below the top flight altitude (usually 3.5-5 km) using mixing ratios and pressure/temperature profiles measured on board the P-3B aircraft.Comparisons between GCAS and P-3B columns are shown in Fig. 7.

P-3B and GCAS column preparation
Each P-3B column is calculated by integrating the NO 2 or CH 2 O partial columns derived from observed mixing ratios over the altitude of the spiral.The lowest altitude of each P-3B spiral varies by location.At Deer Park, Galveston and West Houston, the mean minimum spiral altitude is ∼ 20-40 m, while Conroe and Smith Point spirals typically go as low as ∼ 130 m.At Channelview, Manvel Croix and Moody Tower, the lowest spiral altitude is typically ∼ 300 m.To de-termine the NO 2 profile below the lowest P-3B altitude, we estimate the P-3B mixing ratio below the aircraft following Lamsal et al. (2014), by extrapolating the mixing ratio at the lowest aircraft altitude to the surface using the vertical gradient from the CMAQ model at altitudes below the spiral.A large source of error from these extrapolations is the inhomogeneity of the trace gas field, which is particularly strong for NO 2 (see Fig. 5 for example), as the lowest mixing ratio could be measured in or out of an area of high NO 2 and is then extended to the ground.Lamsal et al. (2014) estimated errors in the DISCOVER-AQ Maryland P-3B NO 2 columns of generally less than 20 % from extrapolation of the NO 2 profile below ∼ 300 m, assuming a factor-of-2 error in the extrapolation.CH 2 O DFGAS mixing ratios below the spiral are extrapolated to the ground from the lowest mixing ratio in the bottom 100 m of the spiral, as described by Fried et al. (2018).As CH 2 O gradients near the surface tend to be smaller than those of NO 2 , the extrapolation error is also likely less significant.P-3B CH 2 O columns calculated with an extrapolated model gradient and a direct extrapolation vary by about 5 %.
The GCAS column for the P-3B comparison is calculated by averaging all GCAS columns within 1 h and 5 km of a spiral center.We exclude spirals where there are fewer than 30 GCAS observations within the coincident area; most spirals typically include hundreds of GCAS pixels.The modeled NO 2 column above the top P-3B spiral altitude is subtracted from the retrieved GCAS tropospheric NO 2 column (∼ 3 × 10 14 molecules cm −2 on average).Comparisons of CMAQ and P-3B NO 2 profiles in the free troposphere (3-5 km) suggest a mean absolute error of 70 % in the free troposphere (CMAQ is 10 % higher than the P-3B on average).If we assume similar discrepancies above the highest P-3B altitude, this may lead to an uncertainty of ∼ 2 × 10 14 molecules cm −2 in the GCAS column from the removal of the column above the P-3B.
Free-tropospheric CH 2 O in the model is much larger than that observed by the in situ instrument during several early flights during the 4-14 September period, possibly due to the transport of too much boundary layer air in the model (Fried et al., 2016b).The mean absolute error from CMAQ versus P-3B between 3 and 5 km is 40 %, with much larger biases of ∼ 100 % on certain days.We find its removal introduces daily background biases that reduce the overall correlation between P-3B and GCAS observations; as a result, we do not remove the modeled CH 2 O above the spiral from the GCAS results in these comparisons.This results in an uncertainty on the order of 1-3×10 15 molecules cm −2 , depending on the flight.

NO 2
The overall correlation between the P-3B P-CL and GCAS NO 2 measurements is very good (r 2 = 0.89).The two instruments also agree well in magnitude, with GCAS slightly   lower than the P-3B at larger NO 2 columns by ∼ 10 %.At background levels, GCAS overestimates the P-3B columns by ∼ 1.6 × 10 15 molecules cm −2 .This background offset is most likely due to a combination of uncertainties introduced by the GCAS stratospheric correction and the modeled tropospheric background column in the reference spectrum in Eq. ( 4), with a possible contribution from the uncertainty in the column below the minimum P-3B spiral altitude.Each GCAS vertical column is the mean of all retrieved cloud-free GCAS columns below the aircraft within 5 km and 1 h of its coincident P-3B spiral center.GCAS air mass factors are determined using modeled CMAQ profiles.The solid line represents the 1 : 1 ratio.The dotted line represents the reduced major axis linear regression.Error bars represent the uncertainty in the GCAS mean column due to retrieval noise from the observations used to calculate a mean column (typically several hundred at 250 m × 500 m resolution); in the case of NO 2 , this uncertainty is generally negligible due to low relative error.Column precisions for P-3B observations are approximately 2 × 10 13 molecules cm −2 (NO 2 ) and 6 × 10 13 molecules cm −2 (CH 2 O).Uncertainties from spatial variability and measurement accuracy are discussed in the text.

CH 2 O
The agreement between the P-3B DFGAS and GCAS CH 2 O columns is also reasonably good (r 2 = 0.54), with GCAS on average 8 % larger than DFGAS.There appears to be little background offset bias influence from the reference spectrum, although the GCAS columns are likely overestimated by some small amount as the CH 2 O above the P-3B has not been removed, as discussed previously.Large columns are often seen at Deer Park and Channelview near industrial facilities, and at Conroe and West Houston (likely from biogenic sources as well as transport from the industrial regions).

Pandora NO 2 column measurements
Figures 8 and 9 show comparisons of GCAS tropospheric columns with Pandora NO 2 columns derived from both DS and MAX-DOAS scattered light retrievals, by day and by site.Figure 10 shows the Pandora measurements at four sites as a function of time, and GCAS coincidences with those observations.In the case of the Pandora DS observations, we have estimated the tropospheric Pandora column by subtracting the modeled NO 2 above the GCAS instrument (typically ∼ 2.5-4 × 10 15 molecules cm −2 ) from the closest Pandora observation in time within 3 min.The uncertainty in the stratospheric NO 2 column in our model is estimated at 30 % (see Sect. 4.5.3).In the case of MAX-DOAS comparisons, we compare a single GCAS observation over each site with the closest MAX-DOAS observation within 20 min.For the comparison with DS observations, we have determined the GCAS observation from the mean of GCAS ground pixels intersected by the Pandora line of sight in the bottom 2 km of the atmosphere (shown in Fig. 5).This helps to minimize the influence of the Pandora viewing geometry on the comparison.For instance, GCAS consistently measures large columns over the Deer Park site, with some of the largest NO 2 often to the north of the site; however, when viewing the sun directly, Pandora always looks south into cleaner air.The use of a GCAS NO 2 amount determined along the Pandora DS line of sight reduces the influence of these biased site locations on the results, with an overall reduction in the GCAS-versus-Pandora bias of 20 %.There remain, however, several sites with an obvious difference in GCAS versus Pandora DS measurements, despite considering the field of view.
Overall, GCAS tropospheric NO 2 is larger than Pandora (GCAS/Pandora = 1.50 for DS and 1.33 for MAX-DOAS), although the spatial correlations are very good at r 2 = 0.85 (DS) and r 2 = 0.94 (MAX-DOAS).A background offset of ∼ 2 × 10 15 molecules cm −2 is seen between GCAS and the Pandora DS measurements, similar to that seen in the P-3B comparisons.Again, this is most likely from uncertainties in the modeled stratospheric correction and reference spectrum correction, with a possible contribution from the Pandora reference as well.More surprisingly, GCAS NO 2 is 50 % (DS) and 33 % (MAX-DOAS) larger at high NO 2 values.

AMF from P-3B profiles
In order to assess the dependence of the GCAS observations on the profile uncertainty, we also apply the P-3B profiles in place of model profiles in the GCAS AMF calculations and compare the new GCAS columns with the P-3B and Pandora columns.When the spiral profiles are applied to the GCAS observations within their vicinity, the use of the observed profiles lowers the overall slope of GCAS tropospheric NO 2 columns by 4 % (P-3B) and 2 % (Pandora) and the CH 2 O columns by 2 % (P-3B) as compared with coincident observations.The NO 2 correlations with the P-3B and Pandora remain the same, but the correlation increases to r 2 = 0.62 for P-3B CH 2 O. Individual coincident observations can change by as much as −50 % to +35 % for NO 2 (mean change of +1 ± 10 %) and −15 % to +25 % for CH 2 O (mean change of +3 ± 8 %).The largest mean changes for a single day occur at the Deer Park site in the Pandora comparisons, where the GCAS NO 2 column on 25 September is reduced by 15 % on average.

Discussion of coincident measurement comparisons
Overall, the GCAS observations correlate well spatially and temporally with the P-3B and Pandora observations.The GCAS observations also show agreement in magnitude with the P-3B-inferred NO 2 and CH 2 O columns well within the measurement uncertainties.The GCAS NO 2 observations are significantly higher than those of the Pandora direct-sun in-struments by 50 %.They are also higher than the Pandora MAX-DOAS by 33 %, although there are fewer coincidences with the MAX-DOAS observations, and there is good agreement at the La Porte site.
Differences between GCAS and the P-3B primarily result from errors in the GCAS AMF (surface reflectance, aerosols and profile shape); the inability of the P-3B to capture profiles of near-surface gases below 300 m near the Channelview, Manvel Croix and Moody Tower sites; and spatial variability and P-3B sampling.Much of the variability observed in individual spirals in the NO 2 column comparison in Fig. 7 is due to the large radius of the P-3B spiral, which can mean the P-3B sometimes flies in and out of NO 2 plumes, as seen in Fig. 5.
The differences between GCAS and Pandora NO 2 are much larger.GCAS NO 2 columns could be influenced by several uncertainties that can result in cumulative biases in the AMF calculation (again, primarily from errors in surface reflectance, aerosols and profile shape).Different factors likely dominate the uncertainties at different sites; some sites are located at locations with very inhomogeneous surface reflectance (Smith Point and Moody Tower), and some at locations with large uncertainties in profile shape.The slope is also dominated by the larger polluted measurements on 25 September, which was a day with complicated meteorology, a morning boundary layer of ∼ 200 m (according to HSRL data) and uncertain emissions (Souri et al., 2018).2018) also found a large difference between GCAS and Pandora observations during the Texas campaign.By using a Bayesian inversion to constrain the MODIS BRDF, they reduced the overestimation of GCAS relative to Pandora by 23 % through a 0.023 increase in surface albedo, broadly consistent with studies that have found a low bias in MODIS surface reflectance (Y.Wang et al., 2010;Salomon et al., 2006) at short wavelengths.The exclusion of aerosols in our AMF calculation may cause the AMF to be underestimated (and therefore the vertical column to be overestimated) in some cases, particularly where scattering aerosols are in the lowest part of the boundary layer (see discussion in Sect.4.5.2).We find that the GCAS vertical columns at Pandora coincidences are reduced on average by 10 % when the air mass factor is calculated using the nearest HSRL aerosol optical thickness profiles below the aircraft for scattering aerosols.In the previous section, we saw there is likely a small +2 % bias in the GCAS column from the use of CMAQ-modeled NO 2 profile shapes in measurements coincident with Pandora.Therefore, while profile shape may contribute to large errors on individual observations, it is unlikely to produce a large bias in the GCAS observations overall.
Differences in the GCAS and Pandora slant column retrievals themselves may also play a role, including the wavelength fitting region and atmospheric temperature assumptions.Previous comparisons of Pandora DS total column NO 2 observations with other ground-based observations have shown good agreement (Herman et al., 2009;S. Wang et al., 2010).In contrast, Knepp et al. (2017) compared a year of Pandora zenith sky stratospheric NO 2 slant columns with those from a zenith-looking UV-Vis spectrometer (DOAS M07) from the Network for the Detection of Atmospheric Composition Change (NDACC) using different retrieval settings and found Pandora underestimated the NDACC instrument by 7 %-40 %.The Pandora slant column product used in our study was produced assuming a fixed effective temperature of 264 K, which could result in a low bias in the retrieved Pandora slant column of 10 % (Spinei et al., 2014).
Despite the sources of uncertainty on the GCAS columns, it should be noted that a large reduction in the GCAS vertical columns from the use of different AMF inputs resulting in better agreement with the Pandora columns would mean a significant underestimation by GCAS of both the NO 2 and CH 2 O P-3B columns.Recent comparisons of Pandora DS NO 2 columns and NO 2 inferred from the P-3B P-CL instrument for the four DISCOVER-AQ campaigns (Maryland, California, Texas and Colorado) show the P-3B agrees well with Pandora DS measurements for all campaigns except Texas, where Pandora NO 2 is significantly underestimated (Sungyeon Choi, personal communication).Previous airborne comparisons with the GeoTASO instrument during DISCOVER-AQ Texas on four relatively unpolluted or cloudy days (13, 14, 18, 24 September) also suggested airborne NO 2 larger than Pandora (Nowlan et al., 2016).

Conclusions
We have presented trace gas retrievals of NO 2 and CH 2 O from the GCAS instrument during the DISCOVER-AQ Texas 2013 campaign.In these retrievals, we first use a spectral fit to derive slant column densities from nadir spectra, in combination with reference spectra measured over a clean area.We then convert those slant columns to vertical columns using tropospheric trace gas profiles from the CMAQ model and surface reflectance from the MODIS BRDF product.At a spatial resolution of 250 m × 500 m, the NO 2 product has a mean precision of 1 × 10 15 molecules cm −2 , and the CH 2 O product has a mean precision of 1.9 × 10 16 molecules cm −2 .In order to meet TEMPO precision requirements, and to detect enhanced CH 2 O during the DISCOVER-AQ Texas campaign, we recommend CH 2 O be spatially averaged to 1 km 2 .Uncertainties in NO 2 polluted observations are dominated by air mass factor uncertainties, which result primarily from uncertainties in surface reflectance, aerosol loading and trace gas profile shape.These air mass factor uncertainties also play a role in individual CH 2 O uncertainties but can be similar in magnitude to uncertainties from spectral fitting noise.
Comparisons between GCAS and P-3B and Pandora observations show GCAS data are very well correlated with these coincident measurements, but in some cases they show differences in magnitude.GCAS columns agree well with those inferred from P-3B in situ profiles for both NO 2 (r 2 = 0.89; GCAS/P-3B slope = 0.90 and intercept = 1.6 × 10 15 molecules cm −2 ) and CH 2 O (r 2 = 0.54; GCAS/P-3B slope=1.08 and intercept = −5 × 10 14 molecules cm −2 ).The use of P-3B profiles instead of modeled profile shapes results in a mean difference of 2 %-4 % in GCAS columns in comparisons with coincident observations.GCAS is higher than Pandora MAX-DOAS tropospheric NO 2 columns but shows excellent spatial agreement (r 2 = 0.94; GCAS/Pandora slope = 1.33 and intercept = 4×10 14 molecules cm −2 ); these differences in magnitude, however, remain within the bounds of GCAS systematic error estimates in the AMF.The largest discrepancies in magnitude are seen between GCAS and Pandora direct-sun observations, although spatial correlations are very good (r 2 = 0.85; GCAS/Pandora slope = 1.50 and intercept = 1.9×10 15 molecules cm −2 ).As both Pandora and GCAS are key instruments in planned TEMPO validation activities, there is clearly a need to resolve these differences in magnitude to ensure reliable validation studies.Further opportunities for comparisons over different geographic areas and pollution regimes exist in other campaigns.
Since DISCOVER-AQ Texas in 2013, the airborne GCAS and GeoTASO instruments have been deployed in the DISCOVER-AQ Colorado field campaign (2014), KORUS-AQ field campaign (2016), GOES-R validation campaign (2017), Lake Michigan Ozone Study (2017) and Long Island Sound Tropospheric Ozone Study (2018).These data are currently under study and offer further opportunities to examine the effects of surface characterization, profile shape, aerosols, viewing geometries and trace gas heterogeneity on ground, airborne and satellite remotely sensed trace gas columns.
Data availability.The GCAS and P-3B NO 2 and C 2 HO data and Pandora direct-sun NO 2 columns are publicly available from the DISCOVER-AQ data archive at http://www-air.larc.nasa.gov/missions/discover-aq/discover-aq.html(last access: 23 October 2018) (Nowlan et al., 2017).The archived GCAS data also include coincident model profiles for each observation.
Author contributions.CRN performed the slant column retrievals, air mass factor calculations and instrument comparisons, and wrote the manuscript.CRN, XL and KC initiated and designed the research.SJJ and MGK collected and calibrated the GCAS spectra.HAK assisted in the analysis of spectra and slant column retrievals.GGA provided the radiative transfer model and guided interpretations.MBFC, CPL and KEP provided the CMAQ model simulations.AF, DR, JW, PW and AJW collected and provided the P-3B in situ aircraft data.JRH and ES provided the Pandora data.LMJ provided in situ data for correcting Pandora observations at Moody Tower.All authors contributed to interpretations and edited the manuscript.
Competing interests.The authors declare that they have no conflict of interest.

Figure 1 .
Figure 1.Map of Houston area showing sample flight tracks for the King Air B-200 (GCAS) and the P-3B aircraft on 6 September 2013, and ground sites where Pandora spectrometers were located.Major roads are shown in yellow.ExxonMobil Baytown and Texas City are large petrochemical and petroleum refinery complexes.The Baytown complex lies near the entrance to the main part of the Houston Ship Channel industrial area, which ends 6.5 km to the east of the downtown.The red triangle shows the location of the observations used to calculate the GCAS reference spectra.
C. R. Nowlan et al.: GCAS measurements of NO 2 and CH 2 O 4.2.3NO 2 and CH 2 O fitting

Figure 2 .
Figure 2. Sample mixing ratio (mxr) and scattering weight (w(z)) profiles used in the GCAS AMF calculations.The reference spectrum profiles are taken from the 6 September afternoon flight over the Gulf of Mexico at an average location of 29.126 • N, 94.818 • W at 17:04 LT (local time: UTC time − 5 h) with SZA = 58.0• and VZA = 10.5 • .The profiles at the Moody Tower site in downtown Houston are from the 25 September morning flight at 10:56 LT with SZA = 45.0 • and VZA = 10.7 • .The estimated surface reflectivities at 442 nm (NO 2 ) and 342 nm (CH 2 O) are 0.04 and 0.05 at the reference location and are both 0.07 at Moody Tower.The dashed black line indicates the aircraft flight altitude.

Figure 3 .
Figure 3. Tropospheric NO 2 and CH 2 O vertical columns measured by GCAS over Houston on 24 September 2013.NO 2 observations are at ∼ 250 m × 500 m resolution, and CH 2 O columns are at 0.01 • × 0.01 • (∼ 1 km 2 ) resolution.Times are local time.Black crosses indicate ground sites.

Figure 5 .
Figure 5. GCAS tropospheric NO 2 columns measured near DISCOVER-AQ ground sites in the area of downtown Houston on 25 September 2013.P-3B flight tracks are shown in white.Pandora direct-sun NO 2 tropospheric columns (total column NO 2 minus modeled NO 2 above the aircraft) are shown in filled circles.Black lines represent the line of sight of each Pandora intersecting the bottom 2 km of the atmosphere.The largest NO 2 column observed by GCAS on this day was 16 × 10 16 molecules cm −2 .The NO 2 precision at this resolution is ∼ 1 × 10 15 molecules cm −2 .Periodic cross-track gaps in the data are due to write-to-disk intervals of the instrument.During these periods, the instrument does not acquire data, thus producing small gaps in coverage.

Figure 6 .
Figure 6.GCAS tropospheric CH 2 O columns measured near DISCOVER-AQ ground sites in the area of downtown Houston on 25 September 2013.P-3B flight tracks are shown in white.CH 2 O columns are spatially averaged on a 0.01 • × 0.01 • grid (∼ 1 km 2 ).The CH 2 O precision at this resolution is ∼ 7 × 10 15 molecules cm −2 .

Figure 7 .
Figure7.Columns derived from in situ measurements of (a) NO 2 from the chemiluminescence instrument and (b) CH 2 O from the DFGAS instrument on the P-3B aircraft compared with vertical columns measured by the GCAS instrument, over 9 days during the DISCOVER-AQ Texas campaign.Each GCAS vertical column is the mean of all retrieved cloud-free GCAS columns below the aircraft within 5 km and 1 h of its coincident P-3B spiral center.GCAS air mass factors are determined using modeled CMAQ profiles.The solid line represents the 1 : 1 ratio.The dotted line represents the reduced major axis linear regression.Error bars represent the uncertainty in the GCAS mean column due to retrieval noise from the observations used to calculate a mean column (typically several hundred at 250 m × 500 m resolution); in the case of NO 2 , this uncertainty is generally negligible due to low relative error.Column precisions for P-3B observations are approximately 2 × 10 13 molecules cm −2 (NO 2 ) and 6 × 10 13 molecules cm −2 (CH 2 O).Uncertainties from spatial variability and measurement accuracy are discussed in the text.

Figure 8 .
Figure 8. Pandora direct-sun NO 2 tropospheric columns vs. GCAS NO 2 tropospheric columns by day for cloud-free observations over Houston during DISCOVER-AQ Texas 2013.The Pandora columns are the total NO 2 columns measured by Pandora minus the colocated modeled stratospheric NO 2 columns used in the GCAS analysis.All correlations are statistically significant at the p < 0.001 level except for those of 14 (p = 0.09) and 27 (p = 0.01) September.The solid line represents the 1 : 1 ratio.The dotted line represents the reduced major axis linear regression.

Figure 9 .
Figure 9. Pandora NO 2 tropospheric columns from direct-sun and MAX-DOAS observations vs. GCAS NO 2 tropospheric columns by site for cloud-free observations over Houston during DISCOVER-AQ Texas 2013.The Pandora direct-sun columns are the total NO 2 columns measured by Pandora minus the colocated modeled stratospheric NO 2 columns used in the GCAS analysis.The solid line represents the 1 : 1 ratio.The dotted lines represent the reduced major axis linear regressions.

Figure 10 .
Figure 10.Tropospheric NO 2 columns from Pandora direct-sun (DS) and MAX-DOAS observations as a function of time between 4 and 27 September at Deer Park, La Porte, Moody Tower and Smith Point sites, and GCAS coincidences with those observations, as well as stratospheric NO 2 from a model at Pandora DS measurements.Pandora DS tropospheric columns are derived by removing the modeled stratosphere from the retrieved Pandora total columns.
C. R. Nowlan et al.: GCAS measurements of NO 2 and CH 2 O Souri et al. (

Table 1 .
Summary of GCAS flights during DISCOVER-AQ Texas 2013.Times are local time (LT: UTC − 5 h).Days with P-3B aircraft flights are denoted by an X in the rightmost column.

Table 2 .
DISCOVER-AQ sites with Pandora spectrometers overflown by GCAS.The Pandora ID is a identification number given to each individual Pandora instrument.

Table 3 .
Fitting details and fitted parameters used in GCAS trace gas retrievals."n/a" means not applicable.