Articles | Volume 14, issue 2
Research article
10 Feb 2021
Research article |  | 10 Feb 2021

Assimilation of lidar planetary boundary layer height observations

Andrew Tangborn, Belay Demoz, Brian J. Carroll, Joseph Santanello, and Jeffrey L. Anderson

Lidar backscatter and wind retrievals of the planetary boundary layer height (PBLH) are assimilated into 22-hourly forecasts from the NASA Unified – Weather and Research Forecast (NU-WRF) model during the Plains Elevated Convection at Night (PECAN) campaign on 11 July 2015 in Greensburg, Kansas, using error statistics collected from the model profiles to compute the necessary covariance matrices. Two separate forecast runs using different PBL physics schemes were employed, and comparisons with six independent radiosonde profiles were made for each run. Both of the forecast runs accurately predicted the PBLH and the state variable profiles within the planetary boundary layer during the early morning, and the assimilation had a small impact during this time. In the late afternoon, the forecast runs showed decreased accuracy as the convective boundary layer developed. However, assimilation of the Doppler lidar PBLH observations was found to improve the temperature and V-velocity profiles relative to independent radiosonde profiles. Water vapor was overcorrected, leading to increased differences with independent data. Errors in the U velocity were made slightly larger. The computed forecast error covariances between the PBLH and state variables were found to rise in the late afternoon, leading to the larger improvements in the afternoon. This work represents the first effort to assimilate PBLH into forecast states using ensemble methods.

1 Introduction

The planetary boundary layer (PBL) plays an important role in weather, climate and pollution through its role in land–atmosphere interactions and mediation of Earth's water and energy cycles (Santanello et al., 2018). This layer is where the Earth’s surface interacts with the atmosphere, exchanging momentum, heat, moisture and pollutants. The PBL height (PBLH) is central to these interactions and is controlled by the energy flux from the surface. Under certain conditions during daytime it defines the convective boundary layer (CBL) and during nighttime it is the stable (non-convective) boundary layer (SBL). Trace gases and aerosols emitted from the surface are rapidly transported within the CBL by turbulent atmospheric motion, and transfer of energy and mass into the free troposphere occurs across an interfacial layer at the top of the PBL. The PBL affects convection in the troposphere, which is generally initiated within the boundary layer and then penetrates its top (Hong and Pan, 1998; Browning et al., 2007). Thus, accurate knowledge of the PBLH is essential for weather, pollution and climate forecasting.

The PBLH is defined by thermodynamic properties such as a temperature inversion or hydrolapse which can be measured by radiosonde. Alternatively, the drop-off in aerosol concentration that occurs across the top of the PBL is used, since aerosols are well-mixed throughout the PBL when the CBL is present (Hicks et al., 2019). Atmospheric models rely on parameterization schemes to define the structure of the PBL and compute PBLH. These are generally either local mixing schemes that use local turbulent kinetic energy (TKE; Janjic, 1994) or non-local flux schemes (Hong and Pan, 1996). Generally, these PBL parameterizations have systematically higher PBLH relative to observed values (Hegarty et al., 2018) and also have difficulties modeling the growth of the convective layer during the morning. The variety of definitions of PBLH make it difficult to effectively evaluate existing models or develop new ones.

Observations of PBLH are traditionally made by radiosonde measurements, which have high vertical resolution but are expensive to launch frequently and are thus limited to special experiments and/or ill-timed launches (e.g., 00:00/12:00 UTC National Weather Service launches) with respect to convective and stable PBL development. Likewise, spaceborne measurements of the lower troposphere from passive and active instruments are severely limited in vertical, spatial and/or temporal resolution (Wulfmeyer et al., 2015). Ground-based measurement of PBLH has been proposed for an extensive network of ceilometers by adding to the functionality of instruments that were designed for measuring cloud heights (Hicks et al., 2016). The ceilometer measures the time required for a laser pulse to return to a receiver, from which the height of the scattering is determined. The intensity of the backscatter is correlated with the density of aerosols at a given height, and the PBLH is inferred from the location of the maximum negative gradient of the backscatter intensity. Several algorithms employ wavelet transforms to identify the location of the negative gradient (e.g., Brooks, 2003; Knepp et al., 2017). This existing network of ceilometers could be used to create a relatively dense network of frequent PBLH observations, as was recommended by the 2009 study from the National Research Council (NRC, 2009) and the Thermodynamic Profiling Technologies Workshop (Hardesty and Hoff, 2012).

Since the ceilometer PBLH observations were not yet available for the time period we are studying, we employ Doppler lidar observations made at the Plains Elevated Convection at Night (PECAN) site in Greensburg, Kansas, to demonstrate the methodology. PECAN was an intensive campaign to study organized mesoscale convection systems (MCSs) during the period 1 June–15 July 2015. It employed three aircraft and a large array of ground-based lidar, radar and ground weather stations. The data we are using are from a Leosphere WINDCUBE 200S Doppler lidar owned and operated by the University of Maryland, Baltimore County (Delgado et al., 2016). This lidar operates at an infrared wavelength and hence receives its strongest backscattered signal within the aerosol-laden PBL and is often below the measurement noise floor above the PBL. The Doppler shift of the backscattered signal is used to calculate wind speed as a function of range, which can then be used to produce a multitude of wind and turbulence variables useful for PBL characterization (e.g., vertical velocity variance and signal-to-noise ratio variance). While Doppler lidars and ceilometers are similar in aerosol detection, a Doppler lidar's additional wind measurement capability makes it more broadly applicable and at times more accurate than a ceilometer for PBLH retrievals. The PBLH algorithm applied for this study combines several such aerosol and wind variables, and each PBLH retrieval involves measurement of turbulence intensity, horizontal wind profiles and backscatter intensity. The heights of steep gradients in these quantities are determined using empirical thresholds and wavelet transform techniques, and the three estimates are combined using fuzzy logic. This is described at length in Bonin et al. (2018). Additional lidar parameters and the application of the algorithm to PECAN data were presented in Carroll et al. (2019). The PBLH retrievals were made from a repeating 25 min lidar scan cycle. This Doppler lidar and PBLH algorithm combination are generally well-suited for accurate and precise measurement of the PBLH during the daytime boundary layer, nocturnal boundary layer and morning transition period (Bonin et al., 2018; Carroll et al., 2019). The evening transition is the most challenging for this setup due to difficulties in defining a clear mixing layer during the decay of a turbulent daytime PBL (Lothon et al., 2014).

The question remaining is how to assimilate these observations into a numerical weather prediction (NWP) model. A number of studies have explored assimilating boundary layer wind profile measurements from lidar (Hu et al., 2019, Coniglio et al., 2019a, b; Degelia et al., 2019) and have shown that this increases the accuracy of forecasts due to improvements within the PBL. And further studies (Degelia et al., 2020; Chipilski et al., 2020) found that convective initiation (CI) was enhanced through the assimilation of thermodynamic profiles within the PBL, though the former found that CI was degraded by the assimilation of kinematic (velocity) profiles. This work highlights the important role that the PBL plays in forecasting convective events, so that any observations that can improve estimation of the model state should be an important source of new information. We are interested in assimilating the PBLH observations directly because the ceilometer network described above will focus on these retrievals, and satellite missions which measure PBLH are also planned. PBLH is a diagnostic variable in NWP parameterized physics models. This means any correction to PBLH will be lost during the model forecast unless the PBLH height observation is used to correct state variables such as temperature and moisture. This could be done either by adopting a variational data assimilation scheme or through the use of an ensemble Kalman filter which would determine the error covariances between PBLH and state variables in the model. We choose the latter so as to avoid the task of linearizing the model physics. The structure of the covariance, and how the state variables are changed by assimilating PBLH, will depend on which PBL scheme is used. We will show how such a system could work by conducting a posteriori lidar PBLH observation impact experiments using forecast fields from a NASA Unified Weather and Research Forecast (NU-WRF, Peters-Lidard et al., 2015) model run for 1 d during the Plains Elevated Convection at Night (PECAN) campaign on 11 July 2015. The assimilation is done on 22-hourly WRF forecast fields throughout the day without cycling the analysis fields back into the model, using two different PBL parameterizations. In this paper, we demonstrate a new and promising method that uses the lidar-based aerosol backscatter and wind-derived PBLH to correct model-forecasted state variables. The purpose here is to show how ensemble-computed error covariance can transfer observational information from PBLH to the state variable profiles.

2 Methodology

The assimilation methodology is based on the ensemble Kalman filter (EnKF) (Evenensen, 1994; Burgers et al., 1998; Evensen, 2009), where the analysis state is the estimate with a minimized error norm, relative to the given error statistics. It differs from the EnKF in that the analysis is not used as an initial state for the next model forecast. Rather, two existing 1 d NU-WRF forecasts, with different PBL physics schemes, are used when lidar measurements are available at a single location. These forecasts were produced as a part of the PECAN campaign in 2015, and we reuse them here to demonstrate the assimilation algorithm that we have developed. These were not ensemble forecasts so we cannot build a standard ensemble Kalman filter from them. Instead we use ensemble optimal interpolation (EnOI), in which profiles from neighboring model grid points are used to obtain an estimate of error statistics (Oke et al., 2010; Evensen, 2003, 2009; Keppenne et al., 2014). This approach will allow for the construction of the vertical component of covariance, which is needed in order to understand how PBLH can be used to correct atmospheric profiles through the use of profile and PBLH statistics. We use profiles from nearby model grid points and have tested the system with varying numbers of grid points in the ensemble. An ensemble Kalman filter would likely give different covariance information, but the basic relationship between the state variable profiles and the PBLH is determined by the model in the same manner here.

The NU-WRF simulations, taken from existing forecast runs used for the PECAN campaign (Santanello et al., 2019), are initialized using a National Center for Environmental Prediction (NCEP) Global Forecast System (GFS) reanalysis. The two NU-WRF simulations use the Mellor–Yamada–Janjic (MYJ) (Mellor and Yamada, 1974, 1982; Janjic, 2002) and Mellor–Yamada–Nakanishi–Niino level 2.5 (MYNN) (Nakanishi and Niino, 2009), which are local 1.5- and 2.5-order turbulence closure schemes, respectively. The PBLH in each of these models is estimated using the turbulent kinetic energy (TKE) method. The NU-WRF forecast state variables are temperature (T), specific humidity (Q) and velocity (U, V), and we define the forecast vector xf=[TfQfUfVf (PBLH)f], where we have combined PBLH with the state variables to enable the covariance calculation between them. The vector x is a column vector, so that the error covariance defined below only includes vertical covariances. The forecast runs are initiated from the NOAA global forecast system (GFS) reanalysis interpolated to the local domain of 30–48 N and 84–110 W, with 220×220 lat–long and 54 vertical levels, at 00:00 UTC. At this time, the initial state has assimilated all of the conventional and satellite observations globally. The two WRF forecast experiments start at 00:00 UTC and are run for 22 and 23 h for the MYJ and MYNN experiments, respectively. We use an ensemble of the 20×20 nearest grid points, so that all of the ensemble members are within about 30 km of the lidar observations (since the grid spacing is about 3 km). Generally, larger ensembles using grid points farther away will result in larger forecast error covariance because of the geographic variability. So this ensemble size was chosen as a balance between ensemble size and geographic localization. The forecast standard deviation for PBLH on the chosen ensemble was around 27 m at 22:00 UTC. Lidar PBLH observations were made every 25 min on that day in Greensburg, KS (37.6 N, 99.3 W), while balloon soundings were launched from that location six times as part of the Plains Elevated Convection At Night (PECAN; Gerts et al., 2017).

For an EnKF the generalized analysis equations are

(1) x a = x f + K ( y o - H ( x f ) ) ,

where xa is the analysis state, xf is the forecast state, yo is the observation vector and H is the non-linear observation operator. The gain matrix, K, is defined by

(2) K = P f H T ( HP f H T + ( R ) - 1 ,

and Pf is the forecast error covariance, R is the observation error covariance and H is the linearized observation operator. The matrices PfHT and HPfHT are formed from the ensemble of forecasts. In the present work, we use the EnOI method and assimilate observations one at a time using the ensemble of profiles described above. In this case, xa and xf depend only on vertical level, and yo=yo, R=(σo)2 and HPfHT=(σf)2 become scaler quantities. The analysis equations are then

(3) x a = x f + K ( y o - H ( x f ) )


(4) K = P f H T ( ( σ f ) 2 + ( σ o ) 2 ) - 1 .

The observation error standard deviation supplied by the lidar retrieval is σo, which is determined from the combined uncertainty of the vertical velocity variance, velocity gradient and backscatter gradient. Generally, when these quantities change rapidly at the top of the PBL, then the estimated error is small. The error estimates are larger when (during the evening) the gradients are much more gradual. H is the linearized observation operator for PBLH. Because the PBLH is related to the state variables via the two PBL physics schemes, determining H would require linearizing the PBL physics at every analysis time. Rather, here we use the EnOI described above to get

(5) P f H T ( x f - μ x f ) ( H ( x f - μ x f ) ) T


(6) HP f H T = ( σ f ) 2 H ( x f - μ x f ) ( H ( x f - μ x f ) ) T ,

where μxf is the mean forecast state of the ensemble of profiles. See Houtekamer and Zhang (2016) for a review of ensemble Kalman filter techniques.

We expect the correlation between the air mass within the PBL and the free troposphere to drop away rapidly, because of limited interactions between them. We found that this can cause errors in the analysis profiles if error covariance between the state variables and PBLH is allowed to continue into the troposphere. To reduce these errors, we have added an exponential decay starting at the model level closest to the PBLH (kPBLH) to define a vertical localization factor:

(7) C loc = exp - α ( k - k PBLH k PBLH ) 2 ,

where k is the model level and α=8 is an experimentally determined factor. The factor Cloc is multiplied by the vertical covariance in Eq. (5) to ensure that the covariance between the PBLH and the state variables becomes small within a couple of model levels into the free troposphere.

Equations (3)–(4) are solved at each hour using the nearest lidar profile observation in time, and the resulting analysis fields are compared to radiosonde profiles when the latter are also available. There are 22 or 23 analyses (for each forecast run) and six times where comparison with radiosonde profiles is made. We focus on the impact of the assimilation on the state variables T, Q, U and V rather than the PBLH because only the state variables would be retained by a forecast.

3 Results

This section describes the NU-WRF simulation results, the assimilation of PBLH into these forecasts, and the relationship between the assimilation impact and the time-varying forecast and observation error covariances.

3.1 NU-WRF simulations

The 1 d NU-WRF simulations are presented in this section. Figure 1 shows the PBLH during that day, derived from the two NU-WRF forecasts, lidar observations and soundings. We have determined the sounding PBLH using the parcel method (Holzworth, 1964), which defines the top as the height where the potential temperature first exceeds the ground temperature. The lidar PBLH (black *, derived using the method reported in Bonin, 2018) closely matches the radiosonde estimates (green triangles) in the late evening to nighttime (02:00-07:00 UTC), while it is somewhat lower late afternoon to early evening (18:00–24:00 UTC). The two NU-WRF forecasts differ from the observations depending on the time of day. During nighttime and early morning the MYJ (red triangles) and MYNN (blue squares) forecasts are higher than the observations and then rise less than the lidar observations in the late morning and early afternoon (12:00–17:00 UTC; there are no radiosonde measurements to compare to here) before rising much higher than the observations in the late afternoon (18:00–24:00 UTC).

Figure 1PBLH vs. UTC time for 11 July 2015 for lidar backscatter (black *), WRF model – MYJ (red triangles), WRF model – MYNN (blue squares) and radiosonde observations using the parcel method (green triangles).


Figure 2The rms difference for the lowest eight layers vs. time of forecast (blue x) and analysis (red square) with radiosonde profiles for potential temperature (a), water vapor (b), U velocity (c) and V velocity (d).


Figure 3Same as Fig. 2, but for the MYNN PBL model, with forecast (black X's) and analysis (green squares).


3.2 Impact of assimilation on state variables

Since we are primarily interested in the impact of the assimilation on state variables within the boundary layer, in Figs. 2 and 3 we plot the rms difference between the model and the independent (unassimilated) radiosonde profiles from the surface to roughly the top of the boundary layer in the late afternoon. This corresponds to the first eight layers or about 800 mbar. We use a fixed number of layers so as to make the comparisons of the rms differences consistent during the day, rather than computing the rms over a different number of layers as the PBL grows during the day. For the temperature forecast, the rms difference would be

(8) RMS ( t a ) = 1 8 i = 1 8 ( T i f - T i sonde ) 2 1 / 2 ,

where ta is the analysis time and “i” represents the model level. Figures 2 and 3 show the rms differences with the radiosonde profiles throughout the day for the forecasts (blue x) and analyses (red squares) for potential temperature (a), water vapor mixing ratio WV (b), and the U (c) and V (d) components of velocity.

Figure 4Profiles from radiosonde (green), forecast (blue) and analysis (red) for potential temperature (a), water vapor mixing ratio (b), u velocity (c) and v velocity (d) at 04:00 UTC, 11 July 2015 in Greensburg, KS. The model uses the MYJ physics parameterization.


During the night (02:00–09:00 UTC), the assimilation has a relatively smaller impact on the potential temperature rms differences (upper left) in the early morning (06:00 and 08:00 UTC), and the two forecasts have similar accuracy. By late afternoon (22:00 and 23:00 UTC, note that the MYJ forecast stops at 22:00 UTC) the radiosonde comparisons show that the assimilation reduces rms differences in the potential temperatures by around 1.5 K for MYJ and 2 K for MYNN. The water vapor mixing ratio (upper right) also has little impact from the assimilation between 02:00 and 08:00 UTC, but at 22:00 UTC (the next radiosonde profile) the rms differences for both MYJ and MYNN analyses increase by at least 1.5×10-3 kg/kg in the late afternoon. The U-velocity profiles (lower right) show small differences between the MYJ and MYNN through 08:00 UTC (03:00 local time), and the assimilation increases the rms differences with radiosonde profiles by nearly 1 m/s starting at 22:00 UTC for both models. The V-velocity profiles (d) begin to differ between MYJ and MYNN for the forecasts at 08:00 UTC (0.5 m/s decrease), and assimilation also decreases the rms differences with radiosondes in late afternoon by 1.5–2 m/s.

Figure 5Same as Fig. 4 except using the MYNN model.


Figure 6Same as Fig. 4 except at time 22:00 UTC.


Figure 7Same as Fig. 6 except using the MYNN model.


We would like to understand why there is a smaller impact during nighttime and early morning, whereas there are decreases in the rms differences in temperature and V velocity and increases in moisture and U velocity in the late afternoon. To this end, we plot the forecast, analysis and radiosonde profiles (T, Q, U and V) at 04:00 UTC (23:00 local time) and 22:00 UTC (17:00 local time) in Figs. 47. At 04:00 UTC (Figs. 4, 5) these clearly indicate that there are small corrections made by the assimilation, as the red and blue profiles closely overlap. But it also shows that the profiles (particularly temperature and moisture) more accurately follow the radiosonde profiles (except for the U velocity above the PBL), meaning that any substantial corrections would have made the profiles worse relative the radiosonde profiles and ultimately degrade the next PBLH forecast. In contrast, Fig. 1 shows that the forecast PBLH (particularly MYJ) is quite a bit higher than the lidar observation at 04:00 UTC. In the late afternoon Figs. 6 and 7 indicate that there are large differences between the forecasts and radiosonde profiles for all of the state variables. The forecast PBLH values differ substantially from the lidar measurements as well. The correction to the forecast profiles generally pushes the analyses towards the independent radiosonde profiles, particularly for temperature and V velocity. So the forecasts that predicted both PBLH and state variables with relatively greater accuracy in the early morning were not corrected, while the less accurate afternoon forecast was drawn towards the independent radiosonde measurements. The assimilation also made changes to the vertical velocity (W) in the afternoon, but there are no independent data to compare with so we have not included it.

Figure 8Covariance PfHT between PBLH and temperature (a), water vapor (b), U velocity (c) and V velocity (d), at times 04:00, 08:00, 22:00 and 23:00 UTC, for PBL physics model MYNN.


The WV is shown to be increased by the assimilation (since WV and PBLH are negatively correlated and higher PBLH corresponds to lower WV levels in the PBL models), but the analysis overshoots the radiosonde WV profile for MYNN, hence causing the increase in the water vapor rms difference in Figs. 2 and 3. The MYJ forecast for WV is mostly too high, so the analysis also increases the rms difference. Compared to temperature, WV is highly variable in time and space, and it has been shown in the past that slanted balloon trajectories underestimate the WV present (Demoz et al., 2006; Crook, 1996). The U-velocity difference with the radiosonde is larger for the analysis, but this correction is more difficult because the differences (at least for MYJ) are both positive and negative and the PBLH observation only contains a single piece of information. The V velocity is, on the other hand, greatly improved by the assimilation. These analysis profiles show that, for this one analysis time, the assimilation is pushing the state variables in the proper direction for temperature, V velocity and moisture, though the moisture correction overshoots the radiosonde profile. PBLH is not a prognostic variable, so that the analysis PBLH values are not retained and therefore cannot directly affect the next forecast. But it is important to note that the temperature and moisture profiles are changed by the assimilation in a way that indicates that the next forecast is likely to have a more accurate PBLH estimate. Figures 6 and 7 both show that the level at which the potential temperature begins to rise and the WV mixing ratio begins to drop has been moved to a level much closer to that observed by the lidar. We do not make forecasts from the analysis fields, but these profiles show promise for improved PBLH forecasts when cycling experiments are done in a future implementation.

3.3 Ensemble error covariances

The increasing differences between PBLH and profile forecasts from early morning to late afternoon only partly explain the much larger impact of the assimilation at 22:00 UTC. We can also analyze the assimilation by investigating the error covariance between PBLH and each of the state variables (PfHT) and the relative error variances in observation space (HPfHT and R). We show PfHT in Fig. 8 for the MYNN PBL physics model at the six radiosonde times. The covariance with temperature is always positive and grows by a factor of 4 by late afternoon near the surface. The covariance with WV is mostly negative and grows by roughly a factor of 5, while the covariance with the two components of velocity oscillates between positive and negative and shows less consistent growth. Thus, the largest impact of assimilation on temperature and moisture occurs in late afternoon while more limited velocity corrections are largely constrained by the correlations determined by the ensemble of model forecast states. In addition, the covariance between PBLH and the U velocity is substantially smaller than that with the V velocity. This means that spurious correlations between PBLH and U might be present, given the relatively small ensemble and the geographic variation in the ensemble members. The error variances are also plotted at the radiosonde times in Fig. 9, which shows that the observation errors are much larger than the forecast errors during evening and early morning times (02:00, 04:00, 06:00, 08:00 UTC) and then become relatively smaller in the late afternoon (22:00, 23:00 UTC). This is an additional contributing factor to the minimal impact of PBLH observations early in the day and the much larger impact in the afternoon.

Figure 9Forecast (HPfHt) and observation (R) error covariance for the PBL physics model MYNN at the six radiosonde times.


4 Discussion and conclusions

These offline data assimilation experiments indicate that assimilating ground-based lidar backscatter and wind measurements of PBLH into a regional NWP model will likely lead to corrections to profiles within the PBL, particularly when, in the future, this approach is applied to an EnKF assimilation system with cycling. Using two NU-WRF forecasts over a period of 1 d with different PBL physics models, we show how the state variables, T, WV, U and V can be corrected using an assimilation system with ensemble-based error covariances. During the night and early morning the assimilation has relatively little impact on the state variables, but by late afternoon the temperature field is drawn closer to independent radiosonde measurements. We have shown that the lack of data impact early in the day is due to the relatively higher accuracy of the model and lack of correlation between the forecast PBLH and temperature profiles at that time. Later in the day, when the model is less accurate in predicting the growth of the boundary layer, the data begin to draw the analyses mostly toward the independent radiosonde profiles. The assimilation overcorrected the water vapor mixing ratio in the direction of radiosonde data, and this could likely be tuned in an assimilation system. And it corrected the V-velocity component by a smaller amount and reduced differences with the radiosonde profiles for the V velocity. These corrections are the result of ensemble-computed error covariances between the PBLH and the state variable profiles within the PBL. The results here indicate that this approach has some potential to be used in a forecast system in a way that the PBLH observational information could be carried forward in time so as to impact the forecast accuracy within the PBL. An additional value of assimilating PBLH is its close connection with the PBL scheme used in the model. The ensemble covariances between PBLH and the different state variables are controlled through the PBL physics scheme. This has an impact on the corrections made to the profiles within the PBL, which can be used as another way to evaluate the physics parameterizations. For example, the MYJ and MYNN result in forecast profiles that differ, particularly in WV in the late afternoon. And the differences in response to assimilation are an indication of how the two different PBL schemes affect the covariance between PBLH and the state variables. However, a full evaluation would require that the assimilation be implemented into a cycling data assimilation system.

This work is intended only to demonstrate a necessary first step in terms of how ensemble statistics can help to constrain profiles within the PBL by assimilating PBLH observations. A more complete demonstration of this approach will require the construction of an EnKF, which should be run over many days with a variety of weather patterns, including significantly warmer (cooler) and wetter (drier) days. This is needed to show how the assimilated PBLH observations will impact future forecasts within the PBL. More of the PBL physics schemes need to be investigated as well, since the correlation between PBLH and state variables will vary widely depending on which scheme is used. An estimate of the forward operator error should be included in the algorithm as well. There are also differences in the way PBLH is computed in the PBL physics schemes and the methods used for radiosonde observations (see Hegarty, et al., 2018). This will impact the manner in which the assimilation and resulting forecasts are validated. The larger uncertainty in the lidar PBLH retrievals during nighttime (Fig. 9) means that the assimilation will not significantly constrain the model state within the PBL during this period. So it would be very helpful to complement PBLH observations with thermodynamic and kinematic profile observations, particularly overnight. The fact that PBLH is a non-negative variable means that the observation minus forecast (O-F) statistics will likely be non-Gaussian so that the assimilation algorithm would need to include an extension to handle this possibility (e.g., Cohn, 1997).

In addition, a cycling EnKF will involve spatial covariances in both the horizontal and vertical directions and will allow for both inflation and horizontal localization. This will enable further tuning of the system to optimize the analysis state relative to the independent radiosonde observations. The PBLH assimilation within the EnKF framework could be done in any of numerous existing EnKF assimilation systems that connect with WRF, including NU-WRF (Peters-Lidard et al., 2015) and WRF-DART (Anderson et al., 2009). Future development of PBLH assimilation algorithms will also need to address the effect of the different definitions of PBLH, such as the TKE method used in the physics schemes and the backscatter and wind profile method used in the retrievals.

Data availability

PECAN ( data are archived by NCAR/EOL (2021), which is funded by NSF. The forecast and analysis fields produced for this work are stored at (UMBC, 2021).

Author contributions

AT built the assimilation system, with input from JLA on the algorithm. BD and BJC provided the lidar observations. JS provided background information on PBL physics. All of the authors contributed to writing and revising the paper.

Competing interests

The authors declare that they have no conflict of interest.


The careful reading and comments by Rohith Muraleedharan Thundathil and the three anonymous reviewers have helped to greatly improve the quality of this paper.

Financial support

Andrew Tangborn was funded through the JCET cooperative agreement with NASA Goddard Space Flight Center. Belay Demoz was funded by National Science Foundation award AGS‐1503563 to the University of Maryland, Baltimore County and through NOAA Cooperative Science Center in Atmospheric Sciences and Meteorology, funded by the Educational Partnership Program at NOAA in collaboration with Howard University. Joseph Santanello was funded through a NASA Decadal Survey Study Team grant.

Review statement

This paper was edited by Daniel Perez-Ramirez and reviewed by Rohith Muraleedharan Thundathil and three anonymous referees.


Anderson, J. L., Hoar, T., Raeder, K., Liu, H., Collins, N., Torn R., and Arellano, A.: The Data Assimilation Research Testbed: A Community Facility, B. Am. Meteorol. Soc., 90, 1283–1296,, 2009. 

Bonin, T. A., Carroll, B. J., Hardesty, R. M., Brewer, W. A., Hajney, K., Salmon, O. E., and Shepson P.B.: Doppler Lidar Observations of the Mixing Height in Indianapolis Using an Automated Composite Fuzzy Logic Approach, J. Atmos. Ocean. Tech., 35, 473–490, 2018. 

Brooks, I. M.: Finding Boundary Layer Top: Application of a Wavelet Covariance Transform to Lidar Backscatter Profiles, J. Atmos. Ocean. Tech., 20, 1092–1105, 2003. 

Browning, K. A., Blyth, A. M., Clark, P. A., Corsmeier, U., Morcrette, C. J., Agnew, J. L., Ballard, S. P., Bamber, D., Barthlott, C., Bennett, L. J., Beswick, K. M., Bitter, M., Bozier, K. E., Brooks, B. J., Collier, C. G., Davies, F., Deny, B., Dixon, M. A., Feuerle, T., Forbes, R. M., Gaffard, C., Gray, M. D., Hankers, R., Hewison, T. J., Kalthoff, N., Khodayar, S., Kohler, M., Kottmeier, C., Kraut, S., Kunz, M., Ladd, D. N., Lean, H. W., Lenfant, J., Li, Z. H., Marsham, J., McGregor, J. Mobbs, S. D., Nicol, J., Norton, E., Parker, D. J., Perry, F., Ramatschi, M.,Ricketts, H. M. A., Roberts, N. M., Russell, A., Schulz, H., Slack, E. C., Vaughan, G., Waight, J., Wareing, D. P., Watson, R. J., Webb, A. R., and Wieser, A.: The Convective Storm Initiation Project, B. Am. Meteorol. Soc., 88, 1939–1955,, 2007. 

Burgers, G., Van Leeuwen, P. J., and Evensen, G.: Analysis scheme in the ensemble Kalman filter, Mon. Weather Rev., 126, 1719–1724,<1719:ASITEK>2.0.CO;2, 1998. 

Carroll, B. J., Demoz, B. B., and Delgado, R.: An overview of low‐level jet winds and corresponding mixed layer depths during PECAN, J. Geophys. Res.-Atmos., 124, 9141–9160,, 2019. 

Chipilski, H. G., Wang, X., and Parsons, D. B.: Impact of assimilating PECAN profilers on the prediction of bore-driven nocturnal convection: A multiscale forecast evaluation for the 6 July 2015 case study, Mon. Weather Rev., 148, 1147–1175,, 2020. 

Cohn, S.: An Introduction to Estimation Theory, J. Meteorol. Soc. Jpn., 75, 257–288,, 1997. 

Coniglio, M. C., Romine, G. S., Turner, D. D., and Torn, R. D.: Impacts of Targeted AERI and Doppler Lidar Wind Retrievals on Short-Term Forecasts of the Initiation and Early Evolution of Thunderstorms, Mon. Weather Rev., 147, 1149–1170,, 2019a. 

Coniglio, M. C., Romine, G. S., Turner, D. D., and Torn, R. D.: Impacts of Targeted AERI and Doppler Lidar Wind Retrievals on Short-Term Forecasts of the Initiation and Early Evolution of Thunderstorms, Mon. Weather Rev., 147, 1149–1170, 2019b. 

Crook, N. A.: Sensitivity of moist convection forced by boundary layer processes to low-level thermodynamic fields, Mon. Weather Rev., 124, 1767–1785, 1996. 

Degelia, S. K., Wang, X., and Stensrud, D. J.: An Evaluation of the Impact of Assimilating AERI Retrievals, Kinematic Profilers, Rawinsondes, and Surface Observations on a Forecast of a Nocturnal Convection Initiation Event during the PECAN Field Campaign, Mon. Weather Rev., 147, 2739–2764, 2019. 

Degelia, S. K., Wang, X., Stensrud, D. J., and Turner, D. D: Systematic evaluation of the impacts of assimilating a network of ground-based remote sensing profilers for forecasts of nocturnal convection initiation during PECAN, Mon. Weather Rev., 4703–4728,, 2020. 

Delgado, R., Carroll, B., and Demoz, B.: FP2 UMBC Doppler Lidar Line of Sight Wind Data, Version 1.1, UCAR/NCAR-Earth Observing Laboratory,, 2016. 

Demoz, B., Flamant, C., Weckwerth, T., Whiteman, D., Evans, K., Fabry, F., Di Girolamo, P., Miller, D., Geerts, B., Brown, W., Schwemmer, G., Gentry, B., Feltz, W., and Wang, Z.: The dryline on 22 May 2002 during IHOP-2002: Convective scale measurements at the profiling site, Mon. Weather Rev., 134, 294–310, 2006. 

Evensen, G.: Sequential data assimilation with a nonlinear quasi-geostrophic model using Monte Carlo methods to forecast error statistics, J. Geophys. Res., 99,, 1994. 

Evensen, G.: The Ensemble Kalman Filter: Theoretical formulation and practical implementation, Ocean Dynam., 53, 343–367,, 2003. 

Evensen, G.: Data assimilation: the ensemble Kalman filter, Springer, 2009. 

Geerts, B., Parsons, D. ,Ziegler, C. L., Weckwerth, T. M., Biggerstaff, M. I., Clark, R. D., Coniglio, M. C., Demoz, B. B., Ferrare, R. A., Gallus, W. A., Haghi, K., Hanesiak, J. M., Klein, P. M., Knupp, K. R., Kosiba, K., McFarquhar, G. M., Moore, J. A., Nehrir, A. R., Parker, M. D., Pinto, J. O., Rauber, R. M., Schumacher, R. S., Turner, D. D., Wang, Q., Wang, X. Q., Wang, Z., and Wurman, J.: The 2015 Plains Elevated Convection At Night field project, B. Am. Meteorol. Soc., 98, 767–786,, 2017. 

Hardesty, R. M. and Hoff, R. M.: NCAR Technical Note: Thermodynamic Profiling Technologies Workshop Report to the National Science Foundation and the National Weather Service, National Center for Atmospheric Research, Boulder, CO, USA, 2012. 

Hegarty, J. D., Lewis, J., McGrath-Spangler, E. L., Henderson, J., Scarino, A. J., DeCola, P., Ferrare, R., Hicks, M., Adams-Selin, R. D., and Welton, E. J.: Analysis of the Planetary Boundary Layer Height during DISCOVER-AQ Baltimore-Washington, D.C., with Lidar and High-Resolution WRF Modeling, J. Appl. Meteorol. Clim., 57, 2679–2696, 2018. 

Hicks, M., Atkinson, D., Demoz, B., Vermeesch, K., and Delgado, R.: The National Weather Service Ceilometer Planetary Boundary Layer Project, The 27th International Laser Radar Conference (ILRC 27), 10–14 January 2016, New Orleans, 269454,, 2016. 

Hicks, M., Demoz, B., Vermeesch, K., and Atkinson, D.: Intercomparison of Mixing Layer Heights from the National Weather Service Ceilometer Test Sites and Collocated Radiosondes, J. Atmos. Ocean. Tech., 36, 129–137, 2019. 

Holzworth, G. C.: Estimates of mean maximum mixing depths in the contiguous United States, Mon. Weather Rev., 92, 235–242, 1964. 

Hong, S.-Y. and Pan, H.-L.: Nonlocal boundary layer vertical diffusion in a medium-range forecast model, Mon. Weather Rev., 124, 2332–2339, 1996. 

Hong, S.-Y. and Pan, H.-L.: Convective Trigger Function for a Mass-Flux Cumulus Parameterization Scheme, Mon. Weather Rev., 126, 2599–2620, 1998. 

Houtekamer, P. L. and Zhang, F.: Review of the Ensemble Kalman Filter for Atmospheric Data Assimilation, Mon. Weather Rev., 144, 4489–4532, 2016. 

Hu, J., Yussouf, N., Turner, D. D., Jones, T. A., and Wang, X.: Impact of Ground-Based Remote Sensing Boundary Layer Observations on Short-Term Probabilistic Forecasts of a Tornadic Supercell Event, Weather Forecast., 34, 1453–1476, 2019. 

Janjic, Z. I.: The Step-mountain eta coordinate model: Further developments of the convection, viscous sublayer, and turbulence closure, Mon. Weather Rev., 122, 927–945, 1994. 

Janjic, Z. I.: Nonsingular Implementation of the Mellor-Yamada Level 2.5 Scheme in the NCEP Meso model, NCEP Office Note No. 437, National Centers for Environmental Prediction, College Park, MD, USA, 2002. 

Keppenne, C. L., Rienecker, M. M., Kovach, R. M., and Vernieres, G.: Background Error Covariance Estimation using Information from a Single Model Trajectory with Application to Ocean Data Assimilation into the GEOS-5 Coupled Model, in NASA/TM–2014-104606, Vol. 34, edited by: Kostner, R. D., 2014. 

Knepp, T. N., Szykman, J. J., Long, R., Duvall, R. M., Krug, J., Beaver, M., Cavender, K., Kronmiller, K., Wheeler, M., Delgado, R., Hoff, R., Berkoff, T., Olson, E., Clark, R., Wolfe, D., Van Gilst, D., and Neil, D.: Assessment of mixed-layer height estimation from single-wavelength ceilometer profiles, Atmos. Meas. Tech., 10, 3963–3983,, 2017. 

Lothon, M., Lohou, F., Pino, D., Couvreux, F., Pardyjak, E. R., Reuder, J., Vilà-Guerau de Arellano, J., Durand, P., Hartogensis, O., Legain, D., Augustin, P., Gioli, B., Lenschow, D. H., Faloona, I., Yagüe, C., Alexander, D. C., Angevine, W. M., Bargain, E., Barrié, J., Bazile, E., Bezombes, Y., Blay-Carreras, E., van de Boer, A., Boichard, J. L., Bourdon, A., Butet, A., Campistron, B., de Coster, O., Cuxart, J., Dabas, A., Darbieu, C., Deboudt, K., Delbarre, H., Derrien, S., Flament, P., Fourmentin, M., Garai, A., Gibert, F., Graf, A., Groebner, J., Guichard, F., Jiménez, M. A., Jonassen, M., van den Kroonenberg, A., Magliulo, V., Martin, S., Martinez, D., Mastrorillo, L., Moene, A. F., Molinos, F., Moulin, E., Pietersen, H. P., Piguet, B., Pique, E., Román-Cascón, C., Rufin-Soler, C., Saïd, F., Sastre-Marugán, M., Seity, Y., Steeneveld, G. J., Toscano, P., Traullé, O., Tzanos, D., Wacker, S., Wildmann, N., and Zaldei, A.: The BLLAST field experiment: Boundary-Layer Late Afternoon and Sunset Turbulence, Atmos. Chem. Phys., 14, 10931–10960,, 2014 

Mellor, G. L. and Yamada, T.: A Hierarchy of Turbulence Closure Models for Planetary Boundary Layers, J. Atmos. Sci., 31, 1791–1806, 1974. 

Mellor, G. L. and Yamada, T.: Development of a turbulence closure model for geophysical fluid problems, Rev. Geophys., 20, 851–875, 1982. 

Nakashini, M. and Niino, H.: Development of an improved turbulence closure model for the atmospheric boundary layer, J. Meteorol. Soc. Jpn., 87, 895–912, 2009. 

National Research Council: Observing Weather and Climate from the Ground Up: A Nationwide Network of Networks, edited by: Natlional Academies Press, Washington, USA, 2009. 

NCAR/EOL: PECAN, available at:, last access: 2 February 2021. 

Oke, P. R., Brassington, G. B., Griffin, D. A., and Schiller, A.: Ocean data assimilation: a case for ensemble optimal interpolation, Austr. Meteorol. Ocean., 59, 67–76, 2010. 

Peters-Lidard, C. D., Kemp, E. M., Matsui, T., Santanello, J. A., KumareJossy, S. V., Jacob, P., Clune, T., Taog, W.-K., Chin, M., Hou, A., Case, J. L., Kim, D. C., Kim, K.-M., Laum, W., Liun, Y. Q., Shio, J., Starr, D., Tan, Q., Tao, Z. N., Zaitchik, B. F., Zavodsky, B., Zhang, S. Q, and Zupanski, M.: Integrated modeling of aerosol, cloud and land processes at satellite-resolved scales, Environ. Modell. Softw., 67, 149–159, 2015. 

Santanello, J. A., Dirmeyer, P. A., Ferguson, C. R., Findell, K. L., Tawfik, A. B., Berg, A., Ek, M., Gentine, P., Guillod, B. P., van Heerwaarden, C., Roundy, J., and Wulfmeyer, V.: Land-Atmosphere Interactions: The LoCo Perspective, B. Am. Meteorol. Soc., 99, 1253–1272,, 2018. 

Santanello, J. A., Zhang, S. Q., Turner, D. D., Lawston, P., and Blumberg, W. G.: PBL Thermodynamic Profile Assimilation and Impacts on Land-Atmosphere Coupling, AGU Fall Meeting, San Francisco, CA, USA, 9–13 December 2019, A11T-2834, 2019. 

UMBC: Forecast and analysis fields, available at:, last access: 8 February 2021. 

Wulfmeyer, V., Hardesty, R. M., Turner, D. D., Behrendt, A., Cadeddu, M. P., Di Girolamo, P., Schlussel, P., Van Baelen, J., and Zus, F.: A review of the remote sensing of lower tropospheric thermodynamic profiles and its indispensable role for the understanding and the simulation of water and energy cycles, Rev. Geophys., 53, 819–895,, 2015. 

Short summary
Accurate prediction of the planetary boundary layer is essential to both numerical weather prediction (NWP) and pollution forecasting. This paper presents a methodology to combine these measurements with the models through a statistical data assimilation approach that calculates the correlation between the PBLH and variables like temperature and moisture in the model. The model estimates of these variables can be improved via this method, and this will enable increased forecast accuracy.