Volume 17, issue 10
https://doi.org/10.5194/amt-17-3187-2024
Research article | 28 May 2024

Noise filtering options for conically scanning Doppler lidar measurements with low pulse accumulation

Eileen Päschke and Carola Detring
Abstract

Doppler lidar (DL) applications with a focus on turbulence measurements sometimes require measurement settings with a relatively small number of accumulated pulses per ray in order to achieve high sampling rates. Low pulse accumulation comes at the cost of the quality of DL radial velocity estimates and increases the probability of outliers, also referred to as “bad” estimates or noise. Careful filtering is therefore the first important step in a data processing chain that begins with radial velocity measurements as DL output variables and ends with turbulence variables as the target variable after applying an appropriate retrieval method. It is shown that commonly applied filtering techniques have weaknesses in distinguishing between “good” and “bad” estimates with the sensitivity needed for a turbulence retrieval. For that reason, new ways of noise filtering have been explored, taking into account that the DL background noise can differ from generally assumed white noise. It is shown that the introduction of a new coordinate frame for a graphical representation of DL radial velocities from conical scans offers a different perspective on the data when compared to the well-known velocity–azimuth display (VAD) and thus opens up new possibilities for data analysis and filtering. This new way of displaying DL radial velocities builds on the use of a phase-space perspective. Following the mathematical formalism used to explain a harmonic oscillator, the VAD’s sinusoidal representation of the DL radial velocities is transformed into a circular arrangement. Using this kind of representation of DL measurements, bad estimates can be identified in two different ways: either in a direct way by singular point detection in subsets of radial velocity data grouped in circular rings or indirectly by localizing circular rings with mostly good radial velocity estimates by means of the autocorrelation function. The improved performance of the new filter techniques compared to conventional approaches is demonstrated through both a direct comparison of unfiltered with filtered datasets and a comparison of retrieved turbulence variables with independent measurements.

1 Introduction

Doppler lidars (DLs) are widely used for measurements of atmospheric wind and turbulence variables in different application areas, such as wind energy, aviation, and meteorological research (Liu et al., 2019; Sathe and Mann, 2013; Thobois et al., 2019; Krishnamurthy et al., 2013; Filioglou et al., 2022; Drew et al., 2013; O'Connor et al., 2010; Bodini et al., 2018; Sanchez Gomez et al., 2021; Beu and Landulfo, 2022). The wide application range became possible due to the flexible configuration options of several modern systems benefitting from the all-sky-scanner technique. This technical flexibility allows for the employment of user-defined scan patterns with respect to azimuth and elevation as well as the choice of specific sampling frequencies in order to meet the data requirements for certain application-oriented retrieval processes.

At the Meteorological Observatory Lindenberg – Richard Aßmann Observatory (MOL-RAO) the interest in long-term operational DL profile observations for both wind and turbulence variables is motivated by different application aspects. The data can be helpful in analyzing and interpreting the kinematic properties of the vertical structure of the atmospheric wind and turbulence under different weather conditions and states of the atmospheric boundary layer (ABL) during the course of the day (e.g., stable ABL, convective mixed ABL, transitions between different ABL states). In addition, the profile information can be useful for regular validation purposes of atmospheric numerical models. This includes not only modeled wind profiles but also the performance of turbulence parameterizations (e.g., TKE closure) used to describe subgrid-scale processes. Due to increasingly higher model resolutions and the associated changes in the applicability and relative importance of parameterization schemes, long-term DL-based turbulence measurements are also interesting when it comes to developing appropriately adapted parameterization approaches that meet these new requirements.

A variety of scanning techniques and retrieval methods for vertical profiles of wind and turbulence variables based on DL measurements have been developed (Smalikho, 2003; Päschke et al., 2015; Sathe et al., 2015; Newsom et al., 2017; Bonin et al., 2017; Steinheuer et al., 2022). Several of these methods rely on specific scanning configurations and are tailored towards a specific data product. For the derivation of different data products this implies either the use of more than one DL system or cyclic configuration changes in a single DL. With respect to this limitation, the relatively new scanning and retrieval method introduced by Smalikho and Banakh (2017) stands out from other methods. Their approach is based on a carefully derived set of model equations, describing functional relationships between radial velocity observations measured along a conical scan with high azimuthal and temporal resolution (Δθ∼1°, Δt∼0.2 s) and a set of meaningful wind turbulence variables such as turbulence kinetic energy (TKE), eddy dissipation rate (EDR), momentum fluxes, and the integral scale of turbulence. Hence, the essential benefit of this approach is that it delivers an internally consistent set of simultaneous wind and turbulence profile observations based on just one scan strategy. As a further outstanding feature the method provides correction terms to account for the typical underestimation of the TKE due to the averaging over the pulse volume of the DL. This issue has frequently been mentioned as the most challenging task in turbulence measurements using DL (Sathe and Mann, 2013; Liu et al., 2019).

Because of the strength of the Smalikho and Banakh (2017) approach, the method has been implemented and tested for routine application at MOL-RAO. From the first quasi-routine test measurements with a StreamLine DL from the manufacturer HALO Photonics (now HALO Photonics by Lumibird) three things became apparent: (1) the measurements of radial velocity show an increased level of noise which is noticeable through an increased number of outliers (“bad” estimates) even at rather low height levels in the ABL; (2) the reliability of both retrieved wind and above all turbulence variables strongly depends on the degree of noise contamination, i.e., the number and distribution of bad radial velocity estimates, in the input data; and (3) if just the signal-to-noise ratio (SNR) thresholding technique is used to remove noise from the data, the final turbulence product availability is relatively low. The first finding can be attributed to short accumulation times, which is an inevitable consequence of the technical realization of the scanning strategy with high spatiotemporal resolution. The length of the accumulation time determines how many available spectra of backscattered light can be used to estimate the frequency shift fd (Doppler frequency) and therewith the radial velocity defined through Vr=-λfd/2. The longer a signal is sampled, the more accurate this estimation will be. For that reason it is a common approach to accumulate the spectra of backscattered light from multiple pulses Na (Frehlich, 1995; Rye and Hardesty, 1993; Banakh and Werner, 2005; Li et al., 2012). For the retrieval of wind profiles as proposed in Päschke et al. (2015), for instance, DL measurements have been performed using a comparatively high number of Na = 75 000 pulses. At this point the method of Smalikho and Banakh (2017) requires a sensible compromise. Using a StreamLine DL, for technical reasons a conical scan with the required high azimuthal and temporal resolution can only be achieved with a rather low number of accumulated pulses per measurement ray, i.e., Na∼2000. This in turn has the consequence that the occurrence of bad estimates in the measurements becomes more likely (Frehlich, 1995). Such outliers contain no wind information (Stephan et al., 2018), and, if not excluded from the measured dataset, they may contribute to large errors in the retrieved meteorological variables (Dabas, 1999). The latter explains the aforementioned second finding and indirectly confirms the recommendations given in Banakh et al. (2021) that the method for determining wind turbulence parameters presented in Smalikho and Banakh (2017) is only applicable if the probability Pb of bad estimates of the radial velocity is close to zero. A closer examination of the third finding mentioned above revealed that with the proper choice of the threshold value the SNR thresholding technique is indeed very effective in removing noisy data, but it also bears the risk of discarding a lot of reliable measurements. This in turn proves to be ineffective for the overall product availability and would not justify a routine application of the retrieval method.

To overcome the issues described above, new filter methods were developed in the course of implementing the retrieval method by Smalikho and Banakh (2017) for routine applications at MOL-RAO. In particular, a filter method was sought which allows for a reliable removal of all noise contributions and circumvents an unnecessary refusal of reliable data at the same time. A detailed presentation of these methods is the main objective of this work. In addition, their advantages over commonly used filtering techniques for turbulence-measurement-oriented routine applications are presented. The article is organized as follows: in Sect. 2 technical information on the measuring system used, its configuration, and typical characteristics in measured data due to short accumulation times is given. To motivate the need for new ideas of improved filtering techniques, pros and cons of common filter methods to detect bad estimates are discussed in Sect. 3. In Sect. 4 a new type of visualization for analyzing DL measurements from conical scans is presented. Building on this, ideas for two new filter approaches are developed and discussed. An overview of how the new filter methods affect the quality of retrieved turbulence variables using the method by Smalikho and Banakh (2017) is provided in Sect. 5.

2 Doppler lidar measurements

The DL measurements serving as the basis for this work were taken at the boundary-layer field site Falkenberg (in German: Grenzschichtmessfeld, GM, Falkenberg), which is an open field embedded in a flat landscape, with main wind directions from WSW, located about 5 km to the south of the MOL-RAO observatory site. The flat terrain characteristics meet the requirements for the application of the turbulence measurement approach by Smalikho and Banakh (2017) in non-complex terrain.

2.1 Technical system specifications and configuration

At GM Falkenberg a StreamLine DL from the manufacturer HALO Photonics with the specifications given in Table 1 was used and operated using a conical-scan-mode configuration to apply the turbulence retrieval approach by Smalikho and Banakh (2017). This configuration is defined by three key parameters, namely the elevation angle (ϕ=35.3°), the azimuthal resolution (Δθ∼1°), and the time duration for one single scan (Tscan=72 s). In order to realize this scanning strategy the DL was configured to be in continuous scan motion (CSM) while sampling data. A custom scan file (see Appendix A) has been defined for the scanner configuration including information about the angular rotation rate ωs, the start and end positions of the scanner, and the elevation angle ϕ. In analogy to the work of Smalikho and Banakh (2017) we set ωs=5° s−1 to nearly satisfy Δθ∼1°. Note that the latter implies measurements on an irregular grid which for analysis purposes later on requires the transfer of the data to an equidistant grid with Δθ=1°. The specific value for ϕ goes back to earlier theoretical works of Kropfli (1986) and Eberhard et al. (1989), who focused on Doppler-radar-based turbulence measurements. In addition, Teschke and Lehmann (2017) have shown that using DL this value is also an optimum beam elevation angle for a mean wind retrieval with a minimum in the retrieval error. With the specifications for Δθ and ωs and due to the pulse repetition frequency fp=10 kHz (see Table 1) we had to adjust the configuration setting for the number of pulses per ray to Na=2000 using the relation Na=Δθfp/ωs (Banakh and Smalikho, 2013). This is a minor difference compared to the value suggested in Smalikho and Banakh (2017), i.e., Na=3000, which is due to a higher pulse repetition frequency, i.e., fp=15 kHz, characterizing their DL system. Note that for StreamLine DL systems the system-specific parameter fp cannot be changed by the user. The low value for Na is unfavorable if a high measurement quality is needed. For the best possible measurement quality in the lower ABL it is therefore important to use the focus setting option to improve the signal intensities within a selected height range. For the DL used in our studies (DL78 hereafter) the focus was set to 500 m. Working with StreamLine DL systems, the range resolution ΔR along the line of sight (LOS) can also be adjusted. For reasons of compatibility with the pulse length of τp=180 ns the range resolution was set to ΔR = cτp/2 ≈ 30 m, where c denotes the speed of light.
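The configuration relations above can be illustrated with a short numerical sketch. The following snippet (an illustration only, not part of the instrument software; the function names are ours) evaluates the relation Na = Δθ fp/ωs and the pulse-limited range resolution cτp/2 for the parameter values quoted in the text.

```python
# Minimal sketch of the configuration relations from Sect. 2.1 (illustrative only).
C = 299_792_458.0  # speed of light in m/s

def pulses_per_ray(delta_theta_deg, f_p_hz, omega_s_deg_per_s):
    """Number of accumulated pulses per ray, Na = delta_theta * fp / omega_s."""
    return delta_theta_deg * f_p_hz / omega_s_deg_per_s

def range_resolution(tau_p_s):
    """Pulse-limited range resolution along the LOS, Delta_R = c * tau_p / 2."""
    return C * tau_p_s / 2.0

print(pulses_per_ray(1.0, 10_000.0, 5.0))  # 2000.0 pulses per ray
print(range_resolution(180e-9))            # about 27 m (the text quotes c*tau_p/2 of roughly 30 m)
```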

Table 1. Instrument specifications of the HALO Photonics StreamLine DL operated at MOL-RAO.

Note that with StreamLine XR DL systems HALO Photonics by Lumibird offers a further development of the StreamLine series. XR systems operate with larger pulse lengths in order to increase the range, depending on the presence of scattering particles in the atmosphere. The larger pulse length, however, reduces the spatial resolution of the measurements along the LOS, which is not an option for measurements in the ABL if the focus is on the detection and investigation of small-scale structures.

2.2 Typical measurement examples and their noise characteristics

For the measurements carried out in this work the relevant DL output variables are the radial velocity estimates Vr along each single LOS of the conical scan and the associated SNR values. The estimation of Vr is based on the determination of the Doppler shift fd of the backscattered signal by an onboard signal processor. A number of methods are available to determine fd (Frehlich, 1995), but the DL manufacturers usually do not disclose to the customer the details of the implemented algorithm. The performance of the estimation algorithms and thus the quality of Vr may vary. The assessment of the performance of the estimation algorithms is generally based on the probability density function (PDF) of the velocity estimates. According to Frehlich (1995), the PDF of velocity estimators performing well is characterized by a localized distribution of “good” estimates centered around the true mean velocity and a fraction of uniformly distributed bad estimates. This leads to the following frequent distinction in DL radial wind measurements:

(1) $V_r = \begin{cases} V_r + V_e & \text{in the case of a good estimate} \\ V_b & \text{in the case of a bad estimate} \end{cases}$

with Ve denoting a random instrumental error (Stephan et al., 2018). In the literature, bad estimates are mostly described as random outliers or noise uniformly distributed over the resolved velocity space (Frehlich, 1995; Dabas, 1999). It can be shown that the occurrence of noise in a series of radial velocity measurements based on a conical scan can be determined by means of the autocorrelation function (ACF) evaluated at lag 1 (Appendix B). In particular, for a conical scan with high azimuthal resolution of Δθ∼1° as used in this work, ACF = 1 indicates noise-free measurements, while ACF < 1 gives an indication of the occurrence of noise.
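To make the lag-1 ACF criterion concrete, the following sketch simulates a radial velocity series with the structure of Eq. (1) (a sinusoidal signal with a fraction of uniformly distributed bad estimates) and evaluates its lag-1 autocorrelation. All numbers (amplitude, error level, 30 % bad-estimate probability) are illustrative assumptions, not values from this work.

```python
import numpy as np

def lag1_acf(v):
    """Lag-1 autocorrelation of a radial velocity series (noise indicator, cf. Appendix B)."""
    v = np.asarray(v, dtype=float)
    dv = v - v.mean()
    return np.sum(dv[1:] * dv[:-1]) / np.sum(dv * dv)

rng = np.random.default_rng(0)
theta = np.deg2rad(np.arange(0.0, 360.0, 1.0))                   # 1 deg azimuthal resolution
v_good = 5.0 * np.sin(theta) + rng.normal(0.0, 0.2, theta.size)  # good estimates (+ error Ve)
v_meas = v_good.copy()
bad = rng.random(theta.size) < 0.3                               # assumed bad-estimate probability
v_meas[bad] = rng.uniform(-19.0, 19.0, bad.sum())                # uniformly distributed bad estimates

print(round(lag1_acf(v_good), 2))  # close to 1 -> (nearly) noise-free
print(round(lag1_acf(v_meas), 2))  # clearly below 1 -> noise contamination
```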

Figure 1. Examples of measurements from one and the same conically scanning Doppler lidar. Each column represents measurements during a 30 min interval at different times and range gates (i.e., measurement heights along the line of sight) which are characterized by different kinds of noise (left: noise-free, middle: type A noise, right: type B noise). The plots of each row depict the measurements from different perspectives. The first row (a, e, i) shows a time series plot of the radial velocities Vr. In a similar way the second row (b, f, j) illustrates the corresponding signal intensities (SNR) of the measurements. Here, the horizontal dotted line indicates an SNR threshold level calculated as proposed in Abdelazim et al. (2016) for Na=2000 (see Sect. 3.1). The third row (c, g, k) shows the DL measurements from a VAD perspective, i.e., a display of the radial velocity as a function of the azimuth angle. The ACF value indicates the degree of noise contamination in the measured time series (see Appendix B). The fourth row (d, h, l) shows histograms of Vr.

Typical examples of DL78 measurements that differ in terms of their noise characteristics are shown in Fig. 1. Each measurement example reflects a different 30 min time period and range gate height. Per column, different analysis diagrams are provided for each measurement example. Noise-free measurements indicated by ACF = 1 (Fig. 1c) are shown in the column on the left. Apart from the superimposed small-scale fluctuations which reflect natural turbulent fluctuations, the time series plot of radial velocities (Fig. 1a) and the corresponding velocity–azimuth display (VAD) plot (Fig. 1c) show a clean sinusoidal course without any random outlier. The sinusoidal course is typical for measurements with conically scanning DL systems and manifests itself in a U-shaped bimodal distribution of the radial velocities provided the wind field was stationary (Fig. 1d). In the middle and right columns the measurements are contaminated with noise, which is indicated by ACF = 0.8 and ACF = 0.3, respectively (Fig. 1g, k). Here, the periodic signals are temporarily interrupted by bad estimates randomly representing “any” value in the velocity range ±19 m s−1 (Fig. 1e, i). Furthermore, differences in the distribution of bad estimates are noticeable. In contrast to the measurements in the middle column where the bad estimates appear quite uniformly distributed (type A noise hereafter), an additional higher aggregation of bad estimates around zero (type B noise hereafter) is noticeable in the right column. This becomes particularly clear by comparing the corresponding panels with the VAD diagrams (Fig. 1g, k) and those with the histograms of the radial velocities (Fig. 1h, l). Note that contrary to Fig. 1d the characteristic U-shaped distribution in Fig. 1l can no longer be recognized because it mixes with a Gaussian-like distribution of bad estimates. Finally, for each measurement example a clear difference in the level of the signal intensities is noticeable (Fig. 1b, f, j). With SNR values around −10 dB the signals are strong in the noise-free case, and with values smaller than −15 dB the signals are weak in the noisy cases. It is important to point out that for the noisy cases the signal levels are mostly the same and thus do not provide any indication about the type of noise distribution.

All three measurement examples have been taken with the same DL system under identical configuration (e.g., Na= 2000 pulse accumulations). Despite the low pulse accumulations there are measurement cases with and without noise. This can be explained by the natural variability in the atmospheric aerosol content over the course of a day and with altitude. Aerosols act as backscattering targets and their atmospheric loading influences the quality of the DL signals and therewith the amount of noise in the measurements. A sufficiently large amount of aerosol can contribute to noise-free DL measurements even for low pulse accumulations. Little aerosol combined with low pulse accumulation, however, represents an unfavorable constellation for achieving good data quality.

Concerning the differences in the bad estimate distributions, to the authors' knowledge no user reports about nonuniform bad estimate distributions in DL measurements have been available up to now. A uniform distribution of bad estimates indicates that the noise component of the spectrum of the lidar signal is white noise (Stephan et al., 2018). It is believed that additional non-white DL noise sources such as shot noise, detector noise, relative intensity noise (RIN), and speckles (Hellhammer, 2018) cause these nonuniform type B noise characteristics. At this point more in-depth investigations would be necessary but cannot be carried out within the scope of this work. It is worth pointing out, however, that the occurrence of type B noise is not a system-specific DL78 problem. During the FESSTVaL (Field Experiment on Sub-mesoscale Spatio-Temporal Variability in Lindenberg) campaign (Hohenegger et al., 2023), there was the opportunity to compare the measured data from three StreamLine and four StreamLine XR Doppler lidars (see Sect. 2.1) positioned side by side and configured identically using the scan mode outlined in Smalikho and Banakh (2017). The comparison revealed type-B-like noise contamination within the measured data for several systems albeit to varying degrees (see Figs. C1 and C2 in Appendix C). This suggests that this type B noise issue is at least typical for StreamLine DL systems.

3 Pros and cons of commonly used filtering techniques

In the previous section it has been shown that DL radial velocity measurements obtained using the measurement strategy proposed in Smalikho and Banakh (2017) can show a strikingly high proportion of noise. The successful application of the associated retrieval method to determine turbulence variables from DL measurements, however, requires a probability of bad estimates close to zero (Smalikho and Banakh, 2017; Banakh et al., 2021). Hence, a careful pre-processing of measurement data to detect and remove noise is necessary.

Different filtering techniques to separate reliable data from noisy measurements can be found in the literature. A closer look at the underlying principles of radial velocity quality assessment allows a rough subdivision into two categories of filtering methods: (1) one category makes use of additional parameters from post-processing of Doppler spectra and (2) the other uses statistical analysis tools applied to time series of DL radial velocity estimates. The method behind the first category is the well-known SNR thresholding technique. Methods representing the second category are, for instance, the median absolute deviation (MAD) originating from Gauss (1816), consensus averaging (CNS) introduced by Strauch et al. (1984), the filtered sine wave fit (FSWF) by Smalikho (2003), or the integrated iterative filter approach by Steinheuer et al. (2022). The last two methods mentioned are directly integrated into a retrieval method for wind or wind gusts. A more detailed review of these and further filtering methods that belong to the second category is given in Beck and Kühn (2017). In this section the advantages and disadvantages of the two filtering method categories are examined using the SNR thresholding and the CNS filter as examples.

3.1 SNR thresholding

The signal-to-noise ratio (SNR) is determined from the Doppler spectra and is defined as the ratio between the signal power and the noise power. The former bears the meaningful information in a measurement, and the latter is considered to be an unwanted signal contribution that is blurring this information. The higher the level of signal power and the smaller the level of noise power, the better the SNR and thus the quality of the radial velocity estimate.

In practice, DL users are often faced with deciding on a suitable SNR threshold value (SNRthresh hereafter) to separate good from bad estimates. Depending on how the measurement data are used later on, the expected uncertainty of the radial wind velocity also plays a role in this decision. Pearson et al. (2009) provide a guideline on that issue based on an experimental approach. The results showed good agreement with theoretical results based on an approximate equation introduced by Rye and Hardesty (1993), reading

(2) $\sigma = \left( \frac{2\,\pi^{0.5}}{\alpha} \right)^{0.5} \left( 1 + 0.16\,\alpha \right) \frac{\Delta\nu}{N_p^{0.5}}$,

with α = SNR/((2π)^0.5 (Δν/B)) and Np = M Na SNR. Here, σ denotes the error estimate of the radial velocity in the weak-signal, multipulse-averaged regime, Na the number of accumulated pulses, B the bandwidth, Δν the signal spectral width, and M the gate length in points. For more details see Appendix D. Note that Eq. (2) can be used in two ways. On the one hand, it provides an estimate for the uncertainty of the radial velocity estimate depending on the SNR. On the other hand, it provides guidance to calculate SNRthresh for a prescribed acceptable uncertainty in the Doppler lidar estimate. Examples of an evaluation of Eq. (2) for different numbers of Na are given in Fig. 2. The curves basically show how the uncertainty of the measurements decreases with increasing SNR. Additionally, the effect of pulse accumulation becomes visible. For the same requirement on the uncertainty of the Doppler estimate, e.g., σ<0.5 m s−1 or σ< 0.1 m s−1, the corresponding SNR threshold value for reliable data would be lower for Doppler estimates based on higher pulse accumulations (e.g., SNRthresh=−24 dB or SNRthresh=−17 dB for Na= 30 000) than for Doppler estimates based on a lower number of pulse accumulations (e.g., SNRthresh=−18.5 dB or SNRthresh=−11 dB for Na=2000). Another approximate equation to determine SNRthresh is suggested in Abdelazim et al. (2016). Taking into account the number of accumulated pulses Na only, they propose the following equation for an SNR threshold determination:

(3) $\mathrm{SNR}_{\mathrm{thresh}} = \frac{1}{N_a} + \frac{2}{\sqrt{N_a}}$.

Example results for SNRthresh derived from Eq. (3) for different Na are also given in Fig. 2.
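As a rough numerical cross-check, the following sketch evaluates Eqs. (2) and (3) as reconstructed above, using the system parameters quoted in the caption of Fig. 2 (B = 2 × 19 m s−1, M = 10, Δν = 1.3 m s−1). The function names are ours, and the exact published forms of both equations may differ slightly from this reading; with these assumptions Eq. (3) yields about −13 dB for Na = 2000, close to the −12.7 dB quoted in Sect. 3.1.

```python
import numpy as np

def sigma_rye_hardesty(snr_db, n_a, m_gate=10, delta_nu=1.3, bandwidth=2 * 19.0):
    """Approximate radial velocity error (m/s) following Eq. (2) as reconstructed above."""
    snr = 10.0 ** (snr_db / 10.0)                  # dB -> linear
    alpha = snr / (np.sqrt(2.0 * np.pi) * (delta_nu / bandwidth))
    n_p = m_gate * n_a * snr                       # accumulated photon count Np = M * Na * SNR
    return np.sqrt(2.0 * np.sqrt(np.pi) / alpha) * (1.0 + 0.16 * alpha) * delta_nu / np.sqrt(n_p)

def snr_thresh_abdelazim(n_a):
    """SNR threshold (dB) following Eq. (3) as reconstructed above."""
    return 10.0 * np.log10(1.0 / n_a + 2.0 / np.sqrt(n_a))

print(round(snr_thresh_abdelazim(2000), 1))        # about -13 dB for Na = 2000
print(round(sigma_rye_hardesty(-12.7, 2000), 2))   # roughly 0.1 m/s at that threshold
```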

Figure 2. (a) Examples of calculated SNR threshold values depending on the number of accumulated pulses Na based on the approach by Abdelazim et al. (2016) and the approach by Rye and Hardesty (1993). (b) Example plots for the change in the theoretical standard deviation σ of the Doppler velocity estimates depending on the signal-to-noise ratio (SNR) following the approach by Rye and Hardesty (1993). The curves are valid for different Na and the following system-specific parameters: B = 2 × 19 m s−1, M = 10, Δν = 1.3 m s−1.

The turbulence retrieval proposed by Smalikho and Banakh (2017) requires measurements with a probability of bad estimates close to zero. The example shown in Fig. 3 clearly illustrates that with SNRthresh=−12.7 dB calculated by means of Eq. (3), a universally valid first-guess SNR threshold satisfying this requirement across all measuring height ranges can be obtained. Here, estimates for Vr from range gate number 4 to 99 are displayed against their associated SNR values. While bad estimates randomly filling the entire search band ±19 m s−1 can be observed to the left of the threshold line, none can be found to the right. If one further relaxed this threshold, the probability of bad estimates would increase. From Eq. (2) it can be additionally inferred that with SNRthresh=−12.7 dB measurement uncertainties less than 0.1 m s−1 can be expected. It is important to note, though, that by applying SNRthresh=−12.7 dB to DL radial velocity measurements, as shown with the examples given in Fig. 1 as well as Fig. 5a, c, and e, a huge fraction of obviously good estimates would be discarded. A reduction of data availability would be the consequence, making a representative derivation of wind and turbulence products often difficult or even impossible. This is a limiting factor of this kind of approach to distinguish between good and bad estimates (Dabas, 1999).

Figure 3. Doppler velocity vs. SNR plot from conically scanning Doppler lidar measurements with Na= 2000 accumulated pulses. The plot includes full-day measurements for all range gates between the 4th and 99th range gate. The vertical lines denote different SNR thresholds based on different approaches, namely by Abdelazim et al. (2016) with a Doppler velocity uncertainty of σ<0.1 m s−1 (dotted) and by Rye and Hardesty (1993) with a Doppler velocity uncertainty of σ<0.5 m s−1 (dash–dotted). See Fig. 2 for exact SNR threshold numbers.

3.2 Consensus averaging

Methodically different from the SNR thresholding technique is the consensus averaging (CNS) method introduced by Strauch et al. (1984). The method was originally developed to exclude outliers from radar wind profiler data. A schematic that explains the CNS approach is shown in Fig. 4. Here noise-contaminated measurements of Vr from several single conical scans executed one after the other are displayed using the VAD perspective. Separating the range of measurement directions (0 to 360°) into equidistant intervals Ii=[(i−1)Δθ, iΔθ] (i=1, 2, 3, …, n with n∈ℕ), the basic idea is to seek within each Ii along the Vr axis the subset of data that satisfies both (i) occurrence within a prescribed interval ΔVr, which is assumed to be a typical value for the atmospheric wind variance, and (ii) a maximum data availability Ximax which, however, must not fall below a prescribed value Xthresh. Similar to the SNR thresholding technique, the difficulty lies in a meaningful choice of ΔVr and Xthresh, as will be shown in more detail below.
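To make the procedure concrete, the following sketch implements the consensus idea as just described: per azimuth interval, the ΔVr window containing the most points is sought, and its members are accepted only if they make up at least Xthresh of the interval's data. This is an illustrative implementation under our own assumptions (windows centred on candidate data values, NaN used as the rejection flag), not the original radar wind profiler code.

```python
import numpy as np

def cns_filter(theta_deg, v_r, d_theta=1.0, d_v=3.0, x_thresh=0.6):
    """Consensus averaging (sketch): per azimuth interval of width d_theta, keep only the
    data inside the d_v window holding the most points, provided this subset reaches the
    relative availability x_thresh; all other values are flagged as unreliable (NaN)."""
    theta_deg = np.asarray(theta_deg, dtype=float)
    v_r = np.asarray(v_r, dtype=float)
    v_out = np.full(v_r.shape, np.nan)
    for i in range(int(round(360.0 / d_theta))):
        sel = (theta_deg >= i * d_theta) & (theta_deg < (i + 1) * d_theta)
        v_sec = v_r[sel]
        if v_sec.size == 0:
            continue
        # slide the consensus window over candidate centre values and count members
        counts = [np.sum(np.abs(v_sec - c) <= d_v / 2.0) for c in v_sec]
        best = int(np.argmax(counts))
        if counts[best] / v_sec.size >= x_thresh:
            keep = np.abs(v_sec - v_sec[best]) <= d_v / 2.0
            tmp = np.full(v_sec.size, np.nan)
            tmp[keep] = v_sec[keep]
            v_out[sel] = tmp
    return v_out
```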

Figure 4. Schematic representation of a possible practical implementation of the CNS (consensus) averaging method based on an approach by Strauch et al. (1984) (see Sect. 3.2). The data in the green boxes have already been identified as reliable. The blue box illustrates the process of searching the reliable data, and the gray boxes stand for azimuthal intervals that still need to be analyzed.

Figure 5. VAD plot examples from conical DL measurements with Na= 2000 pulse accumulations illustrating the application of the SNR thresholding (a, c, e) and the CNS (b, d, f) noise filtering method. The examples represent three different 30 min measuring intervals at different range gates and with different levels of noise contamination. The SNR threshold value of −12.7 dB has been calculated using the approach by Abdelazim et al. (2016). Δθ=1° and ΔVr= 3 m s−1 were used for the application of the CNS.

If the focus is on the derivation of turbulence variables, including the determination of variances caused by eddies in turbulent flow, the problem of this CNS approach is that it requires an a priori estimate of the very variance that one is actually trying to derive. If ΔVr does not correspond to the true atmospheric situation, e.g., the assumed value for ΔVr is too small or too large, it may happen that either measurements bearing relevant wind information are rejected or that bad estimates remain in the dataset. Examples of this are given in Fig. 5b, d, and f assuming ΔVr=3 m s−1 and Xthresh=60 %. The early morning 30 min measurement example from 25 May 2021 with the timestamp 04:30 UTC (Fig. 5b) shows noise-contaminated DL measurements during weak wind and turbulence conditions. At this time the actual wind variance was obviously lower than assumed by ΔVr so that the prescribed interval ΔVr gave room for the inclusion of bad estimates which had to be accepted as reliable due to the CNS concept. The afternoon example from 25 May 2021 with the timestamp 16:00 UTC (Fig. 5d) shows DL measurements during stronger wind and turbulence conditions than in the morning. Additionally, at some point during the 30 min interval a change in the wind direction contributes to a phase shift in the sine signal represented by some of the scan circles. Mainly because of this nonstationarity the variability of the Doppler velocity measurements is obviously larger than assumed by ΔVr=3 m s−1 in some azimuth sectors so that relevant information characterizing this nonstationarity remains outside of the interval ΔVr and is discarded by the CNS. Note that at this point the focus is on the performance of the CNS and not on reconstructed wind and turbulence variables. Hence, this non-applicability of the CNS method during nonstationary measuring intervals needs to be considered apart from the question of whether a derivation of wind and turbulence variables is meaningful if nonstationarity occurs. In Sect. 5 it will be shown that a wrong inclusion of bad estimates or a false exclusion of good estimates because of a ΔVr that is incompatible with the actual atmospheric situation has the consequence that turbulence products (e.g., TKE) calculated based on improperly pre-filtered measurement data may be either overestimated or underestimated.

Another limitation of the CNS filtering technique is that it expects a uniform distribution of bad estimates for a successful application. This becomes evident by considering the CNS filtering results for the measurement example shown in Fig. 5f. This example represents the type B noise case shown in Fig. 1. Here, the subsets of Doppler velocities found by the CNS for some of the azimuthal sectors often do not represent the desired good estimates because the high density of bad estimates around zero erroneously shifts the range ΔVr bearing a maximum of data availability towards zero. In this case the nonuniform distribution of the bad estimates makes a successful application of the CNS impossible. Note that the disadvantages worked out here can be generalized to all statistical methods used for outlier detection which require additional assumptions about the distribution and the variance of the quantity of interest.

4 Ideas for new filtering techniques

The filtering techniques discussed in the previous section are not efficient enough if DL measurements with both the highest possible data availability and a probability of bad estimates close to zero are required. New filtering techniques with improved performance concerning this demand are presented in this section. In particular, depending on the measurement's noise characteristics, two different approaches (referred to as approach I and approach II hereafter) for new filter techniques are discussed. Each approach consists of two separate filtering steps which are carried out one after the other, a coarse filter followed by a filter for post-processing (i.e., approach I = coarse filter I + filter for post-processing and approach II = coarse filter II + filter for post-processing), and the steps use different perspectives of data representation to analyze the occurrence of bad estimates in the DL measurements. Both coarse filters make use of the VV90D perspective, which will be introduced in Sect. 4.1. The application of the post-processing filter requires the well-known VAD perspective.

4.1 Framework of the VV90D perspective

The VV90D perspective represents a diagram in a rectangular coordinate system where each radial velocity value Vr obtained from a conically scanning DL is plotted versus its counterpart measured at an azimuthal shift of 90°. In particular, for a time series of radial velocity measurements Vr(R,θ;t) this means that V=Vr(R,θ;t) is plotted along the y axis and V90=Vr(R,θ−90°;t) is plotted along the x axis. Here, R denotes the range gate, θ the azimuth angle along the scan circle, and t the timestamp of the measurement. Note that t denotes the timestamp of the shifted counterpart value. The motivation underlying this graphic representation of DL measurements will be explained in more detail using the two noise-free measurement examples shown in Fig. 6. In particular, measurements from a conically scanning DL visualized in both a Vr–t diagram and a VV90D plot are shown in Fig. 6a and d and Fig. 6b and e, respectively. The plots in the upper row reflect a homogeneous and stationary measurement example (case 1) which can be clearly seen by the smooth sinusoidal course of the radial velocity, i.e., V∼sin (θ), with a nearly constant amplitude (Fig. 6a). The same measurement example visualized in a VV90D diagram shows clear circular patterns (Fig. 6b). The latter can be explained by taking the phase shift identity sin(θ−90°)=−cos θ into account, yielding V90∼−cos θ. Therewith paired data points (x=V90, y=V) plotted in a rectangular coordinate system describe a circle. Note that this way of looking at DL measurements shows analogies to the harmonic oscillator where the time evolution of both displacement and motion are frequently visualized in a phase-diagram plot to show the 90° phase relationship between velocity and position much more clearly (Vogel, 1997). The plots in the lower row of Fig. 6 represent a nonstationary measurement example (case 2). Due to the sinusoidal course superimposed with smaller fluctuations and a varying amplitude (Fig. 6d) the corresponding VV90D diagram (Fig. 6e) of the same measurements shows a slightly wider and more blurred ring structure.
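A minimal sketch of how such (V90, V) pairs can be constructed from a measured series is given below. It assumes that the data have already been transferred to the equidistant 1° azimuth grid (Sect. 2.1), so that the 90° shift corresponds to a shift of 90 samples; the function name and the synthetic test signal are our own illustrations.

```python
import numpy as np

def vv90_pairs(v_r, samples_per_degree=1):
    """Pair each radial velocity V = Vr(theta) with its counterpart V90 = Vr(theta - 90 deg)
    measured a quarter of a scan circle earlier (equidistant 1 deg azimuth grid assumed)."""
    shift = 90 * samples_per_degree
    v_r = np.asarray(v_r, dtype=float)
    v = v_r[shift:]       # V   (the later value)
    v90 = v_r[:-shift]    # V90 (the value measured 90 deg earlier)
    return v90, v

# For stationary flow V ~ sin(theta) and V90 ~ -cos(theta), so the pairs lie on a circle:
theta = np.deg2rad(np.arange(0.0, 720.0, 1.0))   # two full scans
v90, v = vv90_pairs(5.0 * np.sin(theta))
print(np.allclose(np.hypot(v90, v), 5.0))        # True: constant radius equal to the amplitude
```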

Figure 6. Examples of a graphical visualization of radial velocity measurements from a conically scanning DL using the framework of the VV90D perspective for two cases with stationary (a–c) and nonstationary (d–f) winds. The panels in each row illustrate in sequence: a time series plot of the measured radial velocity over a 30 min time period (a, d), the same data plotted using the VV90D perspective (b, e), and the frequency distribution of data points in the VV90 plane binned by circular rings ri−Δr ≤ r ≤ ri+Δr, where i=1, …, n (n∈ℕ) and Δr=0.5 m s−1 (c, f).

A quantitative description of the VV90D ring structures is provided with the diagrams shown in Fig. 6c and f. These diagrams are referred to as ri−count{Vi} diagrams hereafter. Here, ri denotes the radius of a pre-defined circular ring ri−Δr ≤ r ≤ ri+Δr with origin at (x=0, y=0) and width Δr in the VV90 plane. The quantity count{Vi} denotes the number of measurement data that can be found in this ring. Note that these data represent a circular-ring-related subset Vi of the whole measurement series, i.e., Vi ⊂ {V}. Taking the equation of a circle into account, i.e., r = (x² + y²)^0.5, in practice both the subset Vi and count{Vi} can be determined by identifying the radial velocities V=Vr(R,θ;t) of the measurement time series that satisfy the relation

(4) $r = \sqrt{V_{90}^2 + V^2}$,

with the range of radii r defined through the boundaries of the circular ring ri−Δr ≤ r ≤ ri+Δr. In order to generate the ri−count{Vi} diagrams shown in Fig. 6c and f the full area of the VV90 plane has been subdivided into closely spaced circular rings of increasing radius (i=0, …, n with n∈ℕ) with discrete fixed steps Δr=0.5 m s−1. Note that this value turned out to be a viable choice if using the ri−count{Vi} diagram as a tool in a filtering procedure (see Sect. 4.2). For case 1, the data availability of measured radial velocities is constrained to only a few circular rings with ri ranging between 5 and 6.5 m s−1, with the largest fraction of measurements in the circular ring with ri= 5.5 m s−1 (see Fig. 6c). Additionally, due to the stationary wind field conditions the obtained availability distribution is strictly unimodal and symmetric. For case 2 the measurements are distributed over a broader range of circular rings with ri taking values between 0.5 and 12.5 m s−1. The largest fraction of the measurements occurs within a circular ring with ri= 4.5 m s−1 (see Fig. 6f). The distribution of data availability is nearly unimodal but asymmetric. The examples shown in Fig. 6 represent just two specific situations, and a great variety of VV90D and associated ri−count{Vi} diagrams may result for different atmospheric and lidar signal conditions, including multi-modal distributions (not shown here). In the following we refer to the VV90D diagram and the associated ri−count{Vi} diagram as the framework of the VV90D perspective.
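The ri−count{Vi} diagram can be generated with a few lines of code. The sketch below bins the (V90, V) pairs from the previous sketch into circular rings according to Eq. (4); the 0.5 m s−1 ring spacing and half-width Δr follow the text, while the function names and the upper radius limit are our own choices.

```python
import numpy as np

def ring_counts(v90, v, dr=0.5, r_max=20.0):
    """count{Vi}: number of (V90, V) pairs whose radius r = sqrt(V90^2 + V^2) lies in the
    circular ring r_i - dr <= r <= r_i + dr, for ring radii r_i = 0, dr, 2*dr, ..."""
    r = np.hypot(np.asarray(v90, dtype=float), np.asarray(v, dtype=float))
    radii = np.arange(0.0, r_max + dr, dr)
    counts = np.array([np.sum((r >= ri - dr) & (r <= ri + dr)) for ri in radii])
    return radii, counts

def ring_members(v90, v, ri, dr=0.5):
    """Boolean mask selecting the circular-ring-related subset Vi of the series (Eq. 4)."""
    r = np.hypot(np.asarray(v90, dtype=float), np.asarray(v, dtype=float))
    return (r >= ri - dr) & (r <= ri + dr)
```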

Compared to the commonly used VAD visualization technique, the framework of the VV90D perspective represents an alternative way of displaying radial velocity measurements from a conically scanning DL and opens up new possibilities for data analysis at the same time. In the next section it will be shown how this framework can be used to develop suitable filtering techniques of bad estimates in noisy DL data.

4.2 Coarse filtering techniques

Two different coarse filtering techniques are presented next. The underlying ideas are motivated by characteristic features of good and bad estimates in the VV90D diagram, which will be briefly explained. For this purpose the noise-contaminated measurement examples of type A and type B from Sect. 2.2 are used. Compared to noise-free DL measurements, noticeable features of noise-contaminated DL measurements in the VV90D are the greater spread of paired DL data (x=V90,y=V) and a lack of clear circular patterns (Fig. 7a, d). In the associated ri−count{Vi} diagrams (Fig. 7b, e) this also goes along with a broader distribution of available data points over a larger range of circular rings. Furthermore it is obvious that for the type B noise measurement example the more densely distributed bad estimates around zero make a cross-shaped region visible in the VV90D (Fig. 7d) and cause a pronounced secondary peak in the ri−count{Vi} diagram (Fig. 7e). Additionally, by examining in more detail the properties of the data occurring in the three color-coded circular rings, it becomes apparent that circular rings with a high data number mostly contain reliable DL radial velocities, i.e., good estimates. This can be seen from the fact that the data occurring in these rings mostly follow the expected sinusoidal course of the DL measurements (see the black dots in Fig. 7c, f). It is also striking that this happens in a dense sequence of data points. For circular rings with an increasingly lower data number the associated data subsets Vi contain increasingly more measurements that deviate from the sine, i.e., radial velocities which reflect bad estimates taking any value within the velocity space ±19 m s−1 (see the orange dots in Fig. 7c, f). It is noticeable here that bad estimates in such subsets Vi mostly occur as singular points having no further data points in the immediate environment. Note again that the type B measurement example represents an exception here. While the circular ring ri with the global peak in the ri−count{Vi} diagram contains mostly good data, the circular ring with the secondary peak contains mostly bad estimates (see the red dots in Fig. 7f).

Figure 7. Examples of noise-contaminated DL measurements over a 30 min time period analyzed using the framework of the VV90D perspective. The upper (a–c) and lower (d–f) rows show measurements contaminated with type A and type B noise, respectively (see Fig. 1). The plots in each row show, from left to right, the VV90D plot (a, d), the frequency distribution binned by circular rings ri−Δr ≤ r ≤ ri+Δr, where i=1, …, n (n∈ℕ) and with Δr=0.5 m s−1 (b, e), and the time series plot (c, f). Additionally for each measurement example three specific circular rings have been chosen to illustrate where the measurement data contained in the circular ring (highlighted in different colors) are located in the time series plot.

4.2.1 Filtering by single point analysis – coarse filter I

One specific property emerging from the analysis of noisy DL data above is that if subsets Vi of data points binned by circular rings ri−Δr ≤ r ≤ ri+Δr are analyzed individually, good estimates mostly occur in a dense sequence of points following the sinusoidal course of the measurements, while bad estimates mostly occur as singular points having no further data points in the immediate environment and take any value within the velocity space ±20 m s−1. These properties open a first way for the development of a filter technique for bad estimates, namely by detecting and discarding singular points in circular-ring-related subsets Vi of measurements. Practically, this can be implemented as follows. Use the framework of the VV90D perspective. Consider all circular rings spanning the VV90 plane individually. To select the ring-related data points, always start with the original time series and set all measurement points to a non-numeric flag value (e.g., NaN) which do not satisfy Eq. (4). This gives for each circular ring a certain ring-specific time series which has the length of the original one but where only measurement points are allocated with a numerical value which occur in the respective circular ring. Then, for each of the ring-specific time series sequentially check each position of the time series for flagged predecessor and successor positions within a pre-defined azimuthal environment. Positions occupied by an unflagged value, i.e., a numerical value, but with flagged predecessor and successor positions can be regarded as a singular point and discarded. Finally, the resulting circular-ring-related time series have to be merged back to one full time series which then represents a filtered time series where most of the bad estimates should be excluded. This filtering technique is referred to as coarse filter I hereafter. Note, however, that not all bad estimates necessarily occur as singular points. Hence, it is possible that a minor portion of bad estimates will still remain in the measurement series. Those can be discarded using classical outlier detection methods (e.g., the 3σ rule applied to differences of radial velocity measurements of two consecutive azimuthal measurement points) which are only effective if outliers are real outliers in the sense that they represent only a few unusual observations. In the case of noise-contaminated measurements the fraction of bad estimates is too high, which would not justify considering them to be outliers in the original sense.
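The following sketch condenses the procedure just described (coarse filter I) into code. It is our own simplified illustration: the 90° shift is applied over the concatenated series, the first 90 samples without a V90 partner are simply kept, and the size of the azimuthal environment used for the singular-point test is an assumed parameter.

```python
import numpy as np

def coarse_filter_I(v_r, dr=0.5, window=3, r_max=20.0):
    """Coarse filter I (sketch): discard radial velocities that appear as singular points
    within their circular-ring-specific sub-series (Sect. 4.2.1). v_r is assumed to be a
    radial velocity series on an equidistant 1 deg azimuth grid."""
    v = np.asarray(v_r, dtype=float)
    shift = 90
    r = np.full(v.size, np.nan)
    r[shift:] = np.hypot(v[:-shift], v[shift:])   # radius in the VV90 plane (Eq. 4)
    keep = np.zeros(v.size, dtype=bool)
    keep[:shift] = True                           # no V90 partner yet -> not filtered here
    for ri in np.arange(0.0, r_max + dr, dr):
        member = (r >= ri - dr) & (r <= ri + dr)  # ring-specific sub-series (True = member)
        for k in np.flatnonzero(member):
            lo, hi = max(0, k - window), min(v.size, k + window + 1)
            if member[lo:hi].sum() > 1:           # a neighbour exists -> not a singular point
                keep[k] = True
    out = v.copy()
    out[~keep] = np.nan                           # singular points are flagged as NaN
    return out
```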

Figure 8. Examples of the outcome of coarse filter I. The upper (a–c) and lower (d–f) rows show measurements contaminated with type A and type B noise, respectively (see Fig. 1). The plots in each row show, from left to right, a comparison of the time series before (RAW) and after the application of coarse filter I using a Vr–t diagram (a, d), a comparison of the time series before and after filtering using the VAD perspective (b, e), and a comparison of the associated histograms of the radial velocities (c, f).

Results that can be obtained using coarse filter I applied to the measurement examples of Fig. 7 are shown in Fig. 8. It turns out that bad estimates can be best removed from measurements including type A noise (Fig. 8a). During the sub-interval of enhanced noise in the time series of the radial velocities, however, a severe thinning of data reflecting good estimates is striking (Fig. 8b). This erroneous exclusion of good estimates happens when the number of measurement data, i.e., count{Vi}, is comparatively low for a larger number of circular rings. That is because low values of count{Vi} also mean that there is an increased probability that good estimates appear more frequently as singular points. Unfortunately, the latter makes the performance of coarse filter I weaker the more bad estimates are included in the time series of Doppler velocities (Fig. 8d, e) since an increased number of bad estimates increases the number of sparsely filled circular rings extending over the whole VV90 plane. From the VAD perspective, however, it can be seen that despite the strong data thinning the remaining data points represent a suitable first guess for good estimates of the time series which reveal the range of good estimates for each azimuthal direction (Fig. 8b). Considering the results of coarse filter I for the type B noise example, the performance of the filtering technique is not convincing. Here, a huge fraction of bad estimates belonging to the type “noise around zero” remains in the dataset after applying the filtering method (Fig. 8e). From that we conclude that the specific distribution characteristics of type B noise measurements prohibit the possibility to distinguish between good and bad estimates by means of singular point detection.

4.2.2 Filtering based on ACF analysis – coarse filter II

The underlying idea of coarse filter II makes use of another specific property that can be derived from the analysis of noise-contaminated measurements using the framework of the VV90D perspective. It has already been described above that circular rings ri−Δr ≤ r ≤ ri+Δr with a comparatively high data number count{Vi} mostly include good estimates and that for circular rings with a decreasing data number the occurrence of bad estimates increases. Thus, it is to be expected that an evaluation of the ACF of circular-ring-related data subsets Vi with a high data number would yield ACF(τ=1; i) ≈ 1. In contrast, for circular-ring-related data subsets Vi with a low data number this value would be comparably low (see Sect. 2.2 and Appendix B). This property opens up a second way for the development of a filter technique for bad estimates, namely by selecting only circular rings and the associated subsets of the entire time series of measurements with an ACF value not falling below a pre-defined threshold (ACFthresh hereafter). Practically, this can be implemented as follows. Use the framework of the VV90D perspective. Seek in a first step that circular ring ri−Δr ≤ r ≤ ri+Δr in the ri−count{Vi} diagram with an absolute maximum in the data number (i.e., search for ri with MAX(count{Vi})). Next, temporarily set all data points of the original measurement time series V=Vr(R,θ;t) to a non-numeric flag value (e.g., NaN) which do not satisfy Eq. (4) for the circular ring with the central radius ri previously determined. This gives an initial guess for a filtered time series Vf=Vrf(R,θ;t). To check for low noise contamination by means of the ACF, replace the flagged positions of the time series Vf with an estimated numerical value from the respective unflagged predecessor and successor positions using linear interpolation and calculate the ACF. The replacement of flagged positions is a necessary technical step to maintain the length of the time series and therewith the azimuthal distances between the single measurement points of the series. The latter is important since consecutive measurement data with different azimuthal distances would correlate differently with each other, which in turn would affect the ACF (see Appendix B). If the good quality of this filtered time series is reasonably assured, i.e., if ACFf(τ=1) ≈ 1, the unflagged values of Vf=Vrf(R,θ;t) can be regarded as reliable. In the same way as just described, by means of further iteration steps it is possible to gradually increase the number of reliable measurement data by repeating the above-described procedure taking not only the data from subsets Vi at circular rings ri−Δr ≤ r ≤ ri+Δr with MAX(count{Vi}) into account but also those from adjacent circular rings ri±1−Δr ≤ r ≤ ri±1+Δr, where the data number determines the order. This effectively results in the consideration of a wider circular ring with an accordingly higher number of data. The latter are constituents of a newly filtered time series Vfnew=Vrfnew(R,θ;t) after the kth iteration. As long as the added subsets Vi±1 from adjacent circular rings include mostly good estimates, the associated ACF of the newly generated time series will remain close to 1, i.e., ACFfnew(τ=1) ≈ 1, and the iteration can be continued. The iteration has to be stopped if the ACF of the newly generated time series falls below a pre-defined threshold, i.e., if ACFfnew(τ=1) < ACFthresh. That happens when the recently added data represent subsets of circular rings with an increased fraction of bad estimates. In this case, the result from the previous iteration step can be considered the best possible filtered time series with a maximum possible data availability and a low proportion of bad estimates at the same time. More detailed information useful for practical implementation of coarse filter II is given in Appendix F.
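A compact sketch of this iterative procedure is given below. It reuses the VV90 radius and lag-1 ACF logic from the earlier sketches and simplifies the ring selection to a plain ordering by data count (the text adds adjacent rings, with the data number determining the order); the threshold, the ring width, and the handling of the first 90 samples are illustrative assumptions.

```python
import numpy as np

def coarse_filter_II(v_r, dr=0.5, acf_thresh=0.95, r_max=20.0):
    """Coarse filter II (sketch): accept circular rings in the VV90 plane in order of
    decreasing data count as long as the lag-1 ACF of the gap-interpolated series stays
    at or above acf_thresh (Sect. 4.2.2). Rejected samples are returned as NaN."""
    v = np.asarray(v_r, dtype=float)
    shift = 90
    r = np.full(v.size, np.nan)
    r[shift:] = np.hypot(v[:-shift], v[shift:])       # radius in the VV90 plane (Eq. 4)

    def lag1_acf(x):
        d = x - x.mean()
        return np.sum(d[1:] * d[:-1]) / np.sum(d * d)

    def interp_gaps(x):
        y = x.copy()
        good = np.isfinite(y)
        y[~good] = np.interp(np.flatnonzero(~good), np.flatnonzero(good), y[good])
        return y

    radii = np.arange(0.0, r_max + dr, dr)
    counts = np.array([np.sum((r >= ri - dr) & (r <= ri + dr)) for ri in radii])
    order = np.argsort(counts)[::-1]                  # rings ordered by decreasing data count

    accepted = np.zeros(v.size, dtype=bool)
    for ring in order:
        ri = radii[ring]
        candidate = accepted | ((r >= ri - dr) & (r <= ri + dr))
        trial = np.where(candidate, v, np.nan)
        if accepted.any() and lag1_acf(interp_gaps(trial)) < acf_thresh:
            break                                     # adding this ring degrades the ACF -> stop
        accepted = candidate
    return np.where(accepted, v, np.nan)
```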

Figure 9. Example of the treatment of noise-contaminated DL measurements over a 30 min time period illustrated using the framework of the VV90D perspective in combination with intermediate results when coarse filter II has been applied. The upper (a–c) and lower (d–f) rows show measurements contaminated with type A and type B noise, respectively (see Fig. 1). The plots in each row show, from left to right, the VV90D plot (a, d), the frequency distribution binned by circular rings ri−Δr ≤ r ≤ ri+Δr, where i=1, …, n (n∈ℕ) and with Δr=0.5 m s−1 (b, e), and a graphic illustrating the change in the ACF with each iteration step (c, f; for more details see Sect. 4.2.2). Additionally, intermediate results from two consecutive applications of coarse filter II, indicated by A and B, are shown.

Figure 10. Examples of the outcome of coarse filter II. The upper (a–c) and lower (d–f) rows show measurements contaminated with type A and type B noise, respectively (see Fig. 1). The plots in each row show, from left to right, a comparison of the time series before (RAW) and after application of coarse filter II (a, d), a comparison of the time series before and after filtering using the VAD perspective (b, e), and a comparison of the associated histograms of the radial velocities (c, f).

For the measurement examples shown in Fig. 7 relevant technical details concerning coarse filter II are shown in Fig. 9. The change in ACFfnew with each iteration step is illustrated in Fig. 9c and f. The circular rings included with each iteration step until the iteration has been stopped and the associated data points of the subsets Vi are color-coded in Fig. 9b and e and Fig. 9a and d, respectively. The final filter results of the measurement series that can be obtained using ACFthresh=0.95 are shown in Fig. 10. The filter results based on coarse filter II differ from the results obtained based on coarse filter I (see Fig. 8) in two respects. One advantage of coarse filter II over coarse filter I is that fewer good estimates are incorrectly rejected, which is accompanied by a noticeably higher availability of good estimates. This can be verified based on the histograms comparing the distributions of the measurements from both the original and the filtered time series, which are shown in Figs. 8c and f and 10c and f, respectively. One disadvantage of coarse filter II compared to coarse filter I is that its hit rate in rejecting bad estimates is lower. This disadvantage of coarse filter II is more frequently observed for type A noise-contaminated measurements if the conditions during the measurement interval are nonstationary (see Fig. 10a–b) than for stationary intervals (not shown here). In this case an increase in the threshold value (e.g., ACFthresh=0.99) would help to better remove bad estimates; however, this would be at the expense of removing more good estimates which describe the nonstationarity in the wind field. For type B noise-contaminated measurements this issue likewise occurs during nonstationary intervals (see Fig. 10d–e).

4.3 Post-processing filter for optimization

The coarse filter results presented in Sect. 4.2.1 and 4.2.2 are not yet satisfactory for the following reasons. Firstly, the frequently unjustified rejection of good estimates after applying coarse filter I results in an unnecessary reduction of reliable measurement data. Secondly, the number of remaining bad estimates after applying coarse filter II is still too high. Hence, additional efforts are required to further optimize the filter results. Therefore, the results of coarse filters I and II will be treated as intermediate results only at this point. Possible further optimization steps are considered in more detail next. The entirety of these steps represents the filter for post-processing. Note that all analyses are from now on carried out using the VAD perspective.

4.3.1 Two-stage MAD filter

The median absolute deviation (MAD) is a well-known statistical tool for outlier detection in measured datasets having a unimodal and symmetrical distribution (Iglewicz and Hoaglin, 1993; see Sect. 3). Here, the MAD is used as an additional filter step following coarse filter II. The term “outlier” refers to a few uncontrollable and abnormal observations which seem to lie outside the considered population. If X = {x1, x2, …, xn}, with n ∈ ℕ, is a given dataset of measurements that is normally distributed (i.e., 𝒩(μ, σ²) with mean μ and variance σ²), the MAD is defined through

(5) MAD = median(|xi − median(X)|).

According to Iglewicz and Hoaglin (1993), values xi are regarded as outliers if they are not included in an interval given by

(6) median(X) − q MAD/0.6745 ≤ xi ≤ median(X) + q MAD/0.6745.

The cut-off value q is largely a matter of choice; Iglewicz and Hoaglin (1993) suggest q=3.5. A modification of the MAD is the so-called double MAD, which can be used for nonsymmetric distributions (Rosenmai, 2013). The MAD outlier detection method works in analogy to the 3σ rule of thumb (Gränicher, 1996) but is considered more robust. Robust in this context means that the median and the MAD itself are less affected by outliers than the mean and the standard deviation σ. With this in mind, care has to be taken when applying the MAD method to DL radial velocity measurements containing a large fraction of bad estimates. In such a case bad estimates can no longer be considered only a few unusual observations, and it is not unlikely that the median is also influenced by them, so that the requirements for an application of the MAD are no longer met. For this reason, the MAD is only used here as a post-processing filter for DL measurements that were previously filtered with coarse filter II.
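As a minimal sketch (assuming Python/NumPy and illustrative values), Eqs. (5) and (6) can be turned into an outlier mask as follows:

import numpy as np

def mad_outlier_mask(x, q=3.5):
    # boolean mask, True for values kept as "good" according to Eqs. (5)-(6)
    x = np.asarray(x, dtype=float)
    med = np.median(x)
    mad = np.median(np.abs(x - med))             # Eq. (5)
    if mad == 0.0:
        return np.ones(x.size, dtype=bool)       # degenerate case: no spread at all
    return np.abs(x - med) <= q * mad / 0.6745   # Eq. (6)

# illustrative example: one clearly deviating radial velocity is rejected
vr = np.array([3.1, 3.4, 2.9, 3.2, -12.7, 3.3, 3.0])
print(vr[mad_outlier_mask(vr)])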

Looking at the pre-filtered DL measurements from the VAD perspective (Fig. 10b, e), the MAD outlier detection method is applied in two stages. In the first stage (MAD_part_I, hereafter) we apply the MAD azimuth-wise, i.e., to datasets representing measurements from only one direction. In the second stage (MAD_part_II, hereafter) we apply the MAD to a dataset representing squared deviations (Vr − VrFSWF)² of radial velocity measurements Vr from sine wave fit radial velocities VrFSWF. In order to determine the latter, the so-called filtered sine wave fit (FSWF), a wind vector estimation technique introduced by Smalikho (2003), has been used. This technique requires knowledge of the standard deviation σ of good estimates, which has been estimated based on the filter results of MAD_part_I. Intermediate results of the two-stage MAD applied to the outcome of coarse filter II (Fig. 10b) are illustrated in Fig. 11. From our experience we know that employing MAD_part_I, particularly by means of the double MAD filter technique (Rosenmai, 2013), helps to retain the azimuthal variability in Vr. The latter is important, especially when the wind field was inhomogeneous and nonstationary during the 30 min measurement interval. This can be seen in Fig. 11a, illustrating the outcome of MAD_part_I. Here, relevant measurements reflecting the nonstationarity of the wind field still remain in the filtered dataset even if they deviate substantially from the rest of the azimuthal dataset, as can be seen, for instance, around the azimuth angles θ=80° and θ=250°. However, it is also noticeable in Fig. 11b that not all bad estimates could be removed from the dataset by MAD_part_I. This can be explained by the fact that often not enough data per azimuth sector were available for a reliable calculation of the median and MAD. For that reason MAD_part_II becomes necessary to further improve the bad estimate detection rate. The corresponding filter results are illustrated in Fig. 11b and clearly show that the fraction of remaining bad estimates could be substantially reduced. Unfortunately, MAD_part_II also cuts off a large fraction of the directional variability that MAD_part_I had actually managed to retain. This can be attributed to the choice of the cut-off (here: q=3.5) and very clearly shows the fundamental issue with statistical filter methods, namely that cut-off values have to be carefully chosen and cannot be generalized as would be required for a routine application.
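The azimuth-wise stage could, for instance, be sketched as follows; the double-MAD variant (Rosenmai, 2013) uses separate left and right MADs so that an asymmetric azimuthal sample is not over-trimmed. The grouping by rounded azimuth, the minimum sample size, and the cut-off value are illustrative assumptions only.

import numpy as np

def double_mad_mask(x, q=3.5):
    # double-MAD outlier mask for possibly asymmetric samples (Rosenmai, 2013)
    x = np.asarray(x, dtype=float)
    med = np.median(x)
    left_mad = np.median(np.abs(x[x <= med] - med))
    right_mad = np.median(np.abs(x[x >= med] - med))
    mad = np.where(x <= med, left_mad, right_mad)
    mad = np.where(mad == 0.0, np.inf, mad)      # avoid division by zero
    return np.abs(x - med) / (mad / 0.6745) <= q

def mad_part_one(vr, azimuth, q=3.5, min_samples=5):
    # apply the double MAD separately to the data of each (rounded) azimuth direction
    vr, azimuth = np.asarray(vr, float), np.asarray(azimuth, float)
    keep = np.ones(vr.shape, dtype=bool)
    for az in np.unique(np.round(azimuth)):
        sel = np.round(azimuth) == az
        if sel.sum() >= min_samples:             # too few samples: leave sector untouched
            keep[sel] = double_mad_mask(vr[sel], q=q)
    return keep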

https://amt.copernicus.org/articles/17/3187/2024/amt-17-3187-2024-f11

Figure 11Intermediate results of the two-stage MAD filter applied to the outcome of coarse filter II for the type A measurement example shown in Fig. 10b. The outcome of MAD_part_I and MAD_part_II is shown in panels (a) and (b), respectively. Furthermore in panel (a) the results of a filter sine wave fit (FSWF) are shown, which has been calculated based on a standard deviation (here σ=g=2.13 m s−1) obtained from the colored data reflecting the results of MAD_part_I.


4.3.2 Determination of the sinusoidal corridor of good estimates

The main advantage of coarse filter I over coarse filter II is the better performance with respect to the detection of bad estimates, which makes the two-stage MAD filter as a follow-up filter step of coarse filter I redundant at this point. The disadvantage of coarse filter I, however, lies in the strong rejection of many obviously good estimates (see Sect. 4.2.1). Next a possibility is described regarding how to reverse wrong data rejection decisions in order to increase the availability of reliable measurements again.

https://amt.copernicus.org/articles/17/3187/2024/amt-17-3187-2024-f12

Figure 12Results of the filter for post-processing applied to the outcome of coarse filter I (see Fig. 8) for two measurement examples characterized by a different type of noise distribution (top: type A noise, bottom: type B noise). The panels in each row show, from left to right, the identified borders of the area which defines the corridor of good estimates (a, d), the outcome of the re-activation step of previously discarded good estimates (b, e), and the associated histograms which provide an overview of the final data availability of Vr estimates identified as good data (c, f).


It has been shown in Sect. 4.2.1 that the outcome of coarse filter I for noise-contaminated measurements of type A (Fig. 8a–c) is a dataset representing a suitable first guess for good estimates if visualized using the VAD perspective (Fig. 8b). Hence, the roughly filtered data can be used to narrow down the sinusoidal area in the VAD space where most of the good estimates can be found. This in turn makes it possible to re-activate, as good data, radial velocities within the area boundaries that were discarded by coarse filter I. The outcome of such a re-activation as a post-processing step of coarse filter I is shown in Fig. 12. The identified borders of the area which define the corridor of good estimates shown in Fig. 12a and d have been determined in the following two consecutive steps: firstly by calculating the min and max radial velocity values for each azimuthal direction and secondly by calculating the upper envelope of the max values and the lower envelope of the min values over the interval 0 to 360°. The re-activation of falsely rejected good estimates is then done by considering all measurement points within the corridor defined by the upper and lower envelopes to be good estimates. The corresponding results of this step are shown in Fig. 12b and e. Note that the procedure described above to determine the area of good estimates is relatively simple and has its weaknesses, especially in the case of low data availability, which complicates the determination of the envelopes due to the small number of available min and max values. For such conditions a more sophisticated approach is needed. The re-activation results for the measurement example characterized by type A noise shown in Fig. 12b match well, from a visual point of view, the data one would identify as good. Furthermore, the re-activation step is accompanied by a strong increase in reliable data compared to the outcome of coarse filter I (compare Figs. 8c and 12c). Hence the higher data availability achieved in this way may contribute to an improvement of the variance statistics required for a turbulence product retrieval. In contrast, the re-activation step fails if applied to type B noise-contaminated measurements, which is shown in Fig. 12e. This is due to both the poor first-guess results for good estimates after applying coarse filter I, which do not contain enough detail to correctly narrow down the sinusoidal area of good estimates, and some of the remaining bad estimates that belong to the specific class of noise around zero. Note that the two-stage MAD, which could discard the latter, was omitted here as a follow-up filter of coarse filter I. This is because we know from our experience that the generally much lower availability of reliable measurement data after the application of coarse filter I compared to coarse filter II turns out to be unfavorable for a successful MAD application.
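A simple version of the two steps described above (per-azimuth extremes followed by envelopes over 0–360°) might look as follows in Python/NumPy; binning on integer azimuths, the running-extreme envelopes, and the window length are illustrative simplifications rather than the exact procedure used here.

import numpy as np

def reactivation_mask(vr_good, az_good, vr_all, az_all, window=15):
    vr_good, az_good = np.asarray(vr_good, float), np.asarray(az_good, float)
    vr_all, az_all = np.asarray(vr_all, float), np.asarray(az_all, float)
    # step 1: min/max radial velocity of the first-guess good estimates per azimuth degree
    vmin, vmax = np.full(360, np.nan), np.full(360, np.nan)
    az_idx = np.round(az_good).astype(int) % 360
    for az in range(360):
        sel = az_idx == az
        if sel.any():
            vmin[az], vmax[az] = vr_good[sel].min(), vr_good[sel].max()
    # step 2: lower/upper envelopes over 0-360 deg, here simply running extremes
    # with wrap-around to bridge azimuth gaps left by missing first-guess data
    pad = window // 2
    vmax_w = np.concatenate([vmax[-pad:], vmax, vmax[:pad]])
    vmin_w = np.concatenate([vmin[-pad:], vmin, vmin[:pad]])
    upper = np.array([np.nanmax(vmax_w[i:i + window]) for i in range(360)])
    lower = np.array([np.nanmin(vmin_w[i:i + window]) for i in range(360)])
    # re-activate every measurement lying inside the corridor of good estimates
    idx = np.round(az_all).astype(int) % 360
    return (vr_all >= lower[idx]) & (vr_all <= upper[idx])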

https://amt.copernicus.org/articles/17/3187/2024/amt-17-3187-2024-f13

Figure 13Results of the filter for post-processing applied to the outcome of coarse filter II in combination with a follow-up two-stage MAD filter step (see Figs. 10 and 11) for two measurement examples characterized by a different type of noise distribution (top: type A noise, bottom: type B noise). The panels in each row show, from left to right, the identified borders of the area which defines the corridor of good estimates (a, d), the outcome of the re-activation step of previously discarded good estimates (b, e), and the associated histograms which provide an overview of the final data availability of good estimates (c, f).


So far the re-activation step of falsely rejected good estimates introduced above has been discussed in the context of a post-processing of the filter results obtained with coarse filter I. Even though the unjustified loss of good estimates after an application of coarse filter II is less substantial, the above-described re-activation step can also be applied to its outcome. However, for the reasons mentioned in Sect. 4.3.1 this requires a previously executed two-stage MAD filter. The corresponding results are shown in Fig. 13, where the significantly better results for the type B noise example (Fig. 13d–f) are obvious. The disadvantage, however, is that with the re-activation step a substantial number of bad estimates in the region around the zero crossing of the sinusoidal corridor of good estimates is also assigned to the set of good data. At this point the corridor of good estimates and the horizontal band reflecting a higher concentration of noise around zero overlap, and no clear distinction between good and bad estimates is possible. This can also be seen by comparing the histograms shown in Figs. 10f and 13f.

Finally, it should be mentioned that with the re-activation of initially discarded data in the identified corridor of good estimates there is always a risk of returning a certain number of bad estimates if the raw measurements were contaminated with noise. Since bad estimates can be distributed over the whole measurement space of ±19 m s−1 they potentially also occur in the corridor of good estimates. However, as long as the interest is only in mean wind and turbulence statistics, which are primarily obtained using the VAD perspective, the effect of such a small fraction of bad estimates is expected to be negligible.

4.4 Intercomparison of approach I and approach II under different atmospheric wind conditions

In the previous subsections the limits of the usability of approach I and approach II depending on the type of noise have been discussed. The type of noise, however, is not the only factor affecting the applicability of the two different filtering techniques. Their success is also linked to the strength and temporal evolution of the wind during the measurement period. This becomes obvious by comparing the filter results of approach I and approach II for both type A and type B noise (Figs. 14 and 15) while considering the following categories: (I) weak and stationary wind, (II) strong and stationary wind, (III) weak and nonstationary wind, and (IV) strong and nonstationary wind.

https://amt.copernicus.org/articles/17/3187/2024/amt-17-3187-2024-f14

Figure 14Overview of the final filter results of approach I and approach II for DL measurement examples contaminated with type A noise. The examples of each column represent four selected atmospheric conditions with respect to the wind situation: weak and stationary wind (category I), strong and stationary wind (category II), weak and nonstationary wind (category III), and strong and nonstationary wind (category IV). The panels in the first row show the time series of the respective RAW data of the DL radial velocity measurements over a measurement interval of 30 min. The panels in the second (third) row show both the RAW data and the filter results of approach I (approach II) in different colors.


https://amt.copernicus.org/articles/17/3187/2024/amt-17-3187-2024-f15

Figure 15Same as in Fig. 14 except that DL measurements contaminated with type B noise are considered. Categories III and IV are not available.


Comparing the filter results for measurements with type A noise, the outcomes of approach I and approach II are equally good for categories I and II (Fig. 14b, e and c, f), but for categories III and IV the results based on approach I (Fig. 14h, k) turn out better than those based on approach II (Fig. 14i, l). The differences in the results for categories III and IV are not because bad estimates have not been correctly detected by approach II but rather due to the wrong rejection of a substantial number of obviously good estimates. This error can be traced back to a bimodal distribution in the measurement-related ri−count{Vi} diagram (not shown) caused by the nonstationarity of the wind field during the 30 min measurement interval. Here, the secondary peak was incorrectly interpreted as noise around zero (see Sect. 4.2.2). Note that coarse filter II of approach II is not designed to make this distinction and thus fails when applied to situations belonging to categories III and IV. In contrast, when comparing the filter results for type B noise measurements (Fig. 15), approach II performs better than approach I for categories I and II. This can be seen not only visually but also numerically by means of the ACF. While the ACF values were between 0.23 and 0.37 in the unfiltered time series (Fig. 15a, d), after applying the filtering technique the ACF takes values between 0.98 and 0.99 for approach II (Fig. 15c, f) but only values around 0.92 for approach I (Fig. 15b, e). Finally, two more findings are worth mentioning here: firstly, measurements during wind conditions belonging to category III are obviously difficult to manage for both approaches, indicated by the comparably poor ACF values of the fully filtered time series in a range between 0.87 and 0.89 (Fig. 14h, i). Secondly, it becomes obvious that, particularly during weak wind conditions (i.e., categories I and III), the bands of good estimates sometimes seem too narrow from a purely visual point of view. This observation holds for both filter approaches (Fig. 14b, c and h, i). The reason for this lies in the method used to determine the envelopes in connection with the final re-activation step of previously discarded good estimates (see Sect. 4.3.2). As a result, some information on the actual variability is lost in the filtered dataset. This in turn may result in an underestimation of variances, as will be shown in Sect. 5.1.

Knowledge and insights gained from the overview given in this section are important to develop a strategy to implement the filtering techniques for operational use. More detailed information on a strategy that can be used for an implementation of approach I and approach II is given in Appendix G.

5 Turbulence retrieval based on pre-processed DL measurements using traditional and new filtering techniques

Depending on the filtering technique used, the decision about which of the radial velocities are classified as good or bad estimates can turn out very differently. Hence, it is to be expected that differently pre-filtered DL measurements may result in differences in the retrieved turbulence variables. This section aims to demonstrate that, due to the higher sensitivity of the newly introduced filtering techniques concerning both the rejection of bad estimates and the acceptance of good estimates, the quality and data availability of DL-based turbulence measurements (e.g., TKE retrieved following Smalikho and Banakh, 2017) can be improved. For DL measurements with a probability of bad estimates close to zero this method delivers reasonable results (see Appendix H). Thus, if differently pre-filtered DL data are used as input for the retrieval process, large errors in the retrieved TKE can be attributed either to a faulty noise filtering that leaves bad estimates in the filtered dataset or to an overfiltering that removes too many reliable data points.

5.1 Comparisons with sonic anemometer as an independent reference

In order to assess the quality of TKE variables based on differently pre-filtered DL measurements, sonic data from a 99 m tall meteorological mast are used as an independent reference. The measurements were performed with a USA-1 sonic anemometer (Metek GmbH) at a sampling rate of 20 Hz, and the raw data were processed with the EddyPro (LiCor Inc.) software. The mast is operated at GM Falkenberg at a distance of about 80 m towards SSW from the DL system.

https://amt.copernicus.org/articles/17/3187/2024/amt-17-3187-2024-f16

Figure 16Comparison of Doppler-lidar-based TKE at 95 m height with data from a mast equipped with a sonic device at 90 m height. The measurement period was from 18 May to 17 July 2021. Each column represents the comparison of different TKE products based on differently pre-filtered input data. SNR threshold filtered data with SNRthresh=12.7 dB and consensus (CNS) filtered data have been used in the left and middle columns, respectively. Input data obtained using a combined filter (CF) application of approach I and approach II have been used in the right column. The panels of each column represent comparisons between DL and sonic data using different visualization techniques. These are, from top to bottom, scatterplots, time series plots (1 d only, 25 May 2021), Bland–Altman plots, and histograms. The abbreviation “QC: lev_x” in the scatterplots (a, e, i) refers to different levels of product quality control (see Sect. 5.1).


Results of an intercomparison of TKE retrievals based on differently pre-filtered DL data versus sonic TKE measurements are summarized in Fig. 16. Three different cases are analyzed: TKE retrievals calculated by means of (i) SNR-threshold-based filtered input data using SNRthresh=12.7 dB (Fig. 16a–d), (ii) CNS-based filtered data (Fig. 16e–h), and (iii) CF-based filtered data (Fig. 16i–l). Here CF (combined filter) stands for a combined application of approach I and approach II because of the occurrence of type A and type B noise in the DL78 measurements (see Sect. 2.1–2.2 and Appendix G). For each case an additional distinction is made between TKE products which are subject to different quality control (QC) steps. Here, the minimum requirement for a TKE value representing a 30 min mean is that its retrieval is based on more than 60 % of reliable measurements of Vr during the measurement interval (lev_c hereafter). Since the TKE reconstruction method relies on a variety of theoretical assumptions (e.g., see Eq. 22 in Smalikho and Banakh, 2017), a further QC step verifies the fulfillment of these requirements (lev_b hereafter). To obtain meaningful results the evaluations are based on a 2-month dataset of DL measurements at GM Falkenberg, collected from 18 May to 17 July 2021.

Considering the Doppler lidar TKE retrieval based on a pre-filtering of the measurements using the SNR threshold approach, the problems that arise in connection with routine turbulence measurements (see Sect. 1) can be quantified here. Although good data quality is achieved compared to sonic measurements (R² = 0.99 and RMSD = 0.21; Fig. 16a), data availability is very low (36.77 %; Fig. 16d). TKE values based on CF pre-filtered DL data have almost comparable data quality (R² = 0.98 and RMSD = 0.23; Fig. 16i) but additionally offer a significantly higher data availability (82.57 %; Fig. 16l). TKE results based on CNS pre-filtered DL data are also characterized by a higher data availability (78.85 %; Fig. 16h) but have poorer data quality (R² = 0.97 and RMSD = 0.27; Fig. 16e). To explain the latter, the time series comparisons between DL data and sonic data displayed in Fig. 16b and f for the example day 25 May 2021 are helpful. This is a typical example day with noise-contaminated DL measurements over the whole day but with an obviously higher density of bad estimates in the early morning between 04:00 and 07:00 UTC and in the afternoon from 16:00 UTC onwards (see Fig. H1, 2 – Appendix H). Note that during these time intervals no reliable TKE values are available if an SNR threshold pre-filtering is used. In contrast, using CNS pre-filtered data, retrieved TKE values are available for these time periods, which, however, are partly subject to errors if compared with sonic data. For instance, DL-based TKE values obtained at 04:30 and 16:00 UTC show either a pronounced overestimation or underestimation if compared with sonic-based TKE measurements. With a closer look into the DL radial velocity measurements from which these 30 min TKE values have been retrieved, this overestimation and underestimation can be easily explained (see Fig. 5b, d). For the measurement period between 04:00 and 04:30 UTC the prescribed search interval ΔVr=3 m s−1 was too large, so that bad estimates remained in the dataset and introduced an additional variance contribution, yielding an overestimation of the TKE. For the measurement period between 15:30 and 16:00 UTC the value for ΔVr was too small, so that relevant features of the wind field were not captured. As a consequence, the dataset and thus the derived variances are not representative for the 30 min measurement interval, yielding an underestimation of the TKE value. These examples clearly show the weaknesses of the CNS filtering technique if the ΔVr search interval is inadequately selected. With a fixed ΔVr during 24/7 routine measurements such situations occur quite frequently, which explains why the data quality of TKE values retrieved from CNS pre-filtered DL data is worse than that of TKE retrieved from DL data pre-filtered using the SNR thresholding technique. Note that with the newly introduced CF filtering technique the above-described issues can be avoided, and thus more reliable TKE values can be derived (Fig. 16j, l).

One additional interim note should be given on the high sonic TKE value of about 7 m2 s−2 at 16:00 UTC (Fig. 16f). This value is likely caused by a nonstationary wind field during the 30 min measuring interval rather than by eddies in the turbulent flow. For that reason this value should be flagged as non-reliable. The problem, however, is that with the CNS filtering technique relevant wind information characterizing this nonstationarity would be rejected, so that such nonstationarity could not be identified.

Another way to compare TKE variables retrieved from differently pre-filtered DL measurements is based on the use of a so-called Bland–Altman plot (Fig. 16c, g, and k). This kind of plot was introduced by Bland and Altman (1986) to statistically assess the comparability of two measurement methods. The advantage of this statistical evaluation method is that it not only provides insight into systematic deviations (bias) and limits of agreement but also reveals how the differences between two measurement methods depend on the magnitude of the measurement value. For the Bland–Altman plots shown in Fig. 16c, g, and k, the y axis shows the percentage error (PE) between two paired measurements, i.e., 100×(TKEDL−TKESonic)/TKESonic, and the x axis represents the average of these values, i.e., (TKEDL+TKESonic)/2. Additionally, the horizontal lines indicate the mean percentage error (PE) and twice the standard deviation (±2σ) from the mean. The latter denotes the statistical limits of agreement. For the Bland–Altman plot shown in Fig. 16c the mean percentage error is close to zero (PE=0.84 %). This indicates that there are hardly any systematic differences between DL-based TKE retrievals using the SNR threshold method for noise filtering and sonic-based TKE measurements. Additionally it can be seen that there is a nearly symmetric distribution of the paired points around the horizontal line denoting PE, with a slight tendency towards a more frequent underestimation by the DL for small TKE values and a more frequent overestimation for higher TKE values. As a result, in 95 % of the cases the DL delivers a TKE value that is up to 39 % smaller or up to 37 % larger than the one measured by the sonic anemometer. With the exception of an offset of about 2.7 % in the bias and the limits of agreement, qualitatively similar results can be derived when comparing DL-based TKE retrievals using the CF filter with sonic-based TKE (Fig. 16k). The Bland–Altman comparisons for DL-based TKE values using the CNS filter reveal poorer agreement (Fig. 16g). Although the mean percentage error (PE=0.63 %) does not differ significantly from the results shown in Fig. 16c, the range between the limits of agreement is larger (PE±2σ={+40 %, −41 %}). Furthermore, a slightly different point cloud distribution can be seen. Especially for larger TKE values there seems to be a greater underestimation of DL-based TKE retrievals compared to sonic TKE.
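The quantities shown in the Bland–Altman panels can be computed along the following lines (a minimal sketch, assuming paired NumPy arrays of 30 min TKE values; variable names are illustrative):

import numpy as np

def bland_altman(tke_dl, tke_sonic):
    # percentage error (y axis), pairwise mean (x axis), bias, and limits of agreement
    tke_dl, tke_sonic = np.asarray(tke_dl, float), np.asarray(tke_sonic, float)
    pe = 100.0 * (tke_dl - tke_sonic) / tke_sonic
    mean_tke = 0.5 * (tke_dl + tke_sonic)
    bias = pe.mean()
    limits = (bias - 2.0 * pe.std(), bias + 2.0 * pe.std())
    return mean_tke, pe, bias, limits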

5.2 Comparisons with alternative reference data

With increasing altitude the probability of bad estimates in DL radial velocity measurements increases. This is due to the typically decreasing aerosol density, which is also noticeable in weaker SNR values. Therefore, the actual robustness of the filtering methods can be tested much better at larger measuring heights, where the methods have to cope with an increased occurrence of noise. Unfortunately, the tower-based measurements at GM Falkenberg are only available up to 99 m height, so that for higher altitudes no independent references for a comparison are available. For that reason an alternative strategy is used. In the subsequent analysis, DL TKE values obtained using the SNR threshold method for noise filtering serve as an alternative reference intended to replace the missing sonic data at higher altitudes. This can be motivated by two arguments. On the one hand, it could be shown that the comparison of TKE values based on SNR pre-filtering with independent sonic data provided the best results (see Fig. 16). On the other hand, the SNR thresholding approach is a standard method to exclude bad estimates from DL measurements. For these reasons confidence in the validity of TKE products based on SNR pre-filtering is very high, and TKE products based on differently pre-filtered input data should be comparable to them, provided the respective filtering methods perform similarly well.

https://amt.copernicus.org/articles/17/3187/2024/amt-17-3187-2024-f17

Figure 17Comparison of TKE from DL based on differently pre-filtered measurement data prior to the TKE retrieval as proposed in Smalikho and Banakh (2017). Scatterplot (a) showing TKE products based on CNS pre-filtering (TKEDLCNS) against TKE products based on a pre-filtering using the SNR thresholding technique (TKEDLSNR) with SNRthresh=12.7 dB. Similarly, scatterplot (c) showing TKE products based on CF pre-filtering (TKEDLCF) against TKE products based on a pre-filtering using the SNR thresholding technique. The scatterplots include all available DL measurements up to 500 m height over a measurement period from 18 May to 17 July 2021. In analogy, the same applies for the Bland and Altman (BA) plots (b, d). Here, the relative percentage difference between TKEDLCNS (TKEDLCF) and TKEDLSNR products is plotted against its mean value. For more details concerning BA plots see Sect. 5.1.


Comparisons of TKE products based on SNR pre-filtered DL data with those based on CNS filtering and CF filtering for measurement heights between 45 and 500 m are shown in Fig. 17a–b and c–d, respectively. Two issues with the CNS filtering become apparent in Fig. 17a and b: on the one hand, the clear underestimation of larger DL-based TKE values above 2 m2 s−2 and, on the other hand, the occurrence of a significant number of strongly overestimated TKE values in a range of 0.06 m2 s−2 < TKE < 0.5 m2 s−2. These errors are caused by the deficiencies of the CNS filter method already described in Sect. 3.2, i.e., an inappropriate choice of the search interval ΔVr. In contrast, the comparisons shown in Fig. 17c and d reveal better agreement between TKE retrievals based on SNR pre-filtered DL data and those based on CF filtering. This is evident from both a visual and a statistical perspective. For instance, the higher sensitivity of the CF filter with respect to the rejection of bad estimates and the acceptance of good estimates yields much better limits of agreement (PE±2σ={+10 %, −17 %}) than observed for TKE retrievals based on CNS pre-filtered DL data (PE±2σ={+38 %, −39 %}).

6 Summary

First test measurements for a desired routine application of DL turbulence measurements based on the approach outlined in Smalikho and Banakh (2017) revealed unforeseen difficulties concerning the quality of radial velocity estimates. These difficulties turned out to be the consequence of the specific requirements of the scanning strategy and the associated limitation of the pulse accumulation feasible for the measurements. During a 24/7 application over a period of several months, and therewith a naturally varying aerosol load in the atmosphere, this limitation frequently became obvious through comparably weak values of the SNR and an increased fraction of “bad” estimates (random outliers; noise) in the measurements. If not properly filtered, bad estimates can contribute to large errors in the retrieved turbulence variables. This raised the issue of an appropriate noise filtering, i.e., a method that can be used to separate “good” from bad estimates in a series of radial velocity measurements, prior to a turbulence product retrieval. In the search for a suitable noise filtering method, the differences between traditionally used filtering techniques were first worked out and their respective advantages and disadvantages discussed. Using the example of the well-established SNR thresholding technique, a literature-based overview of the different possibilities of an SNR threshold determination was given. In this context a selection of theoretical approaches taking into account the number of accumulated pulses for a radial velocity estimate was verified. In the practical application the approximate equation by Abdelazim et al. (2016) turned out to be the most appropriate one if a complete removal of all bad estimates is essential. However, during the verification it also became clear that the strong increase in the SNR threshold value with decreasing pulse accumulation significantly reduces the availability of reliable (good) radial velocity estimates. This would strongly limit the derivation of turbulence variables and thus the intended routine application. In contrast to the SNR thresholding technique, the advantage of the CNS approach was confirmed, namely the higher data availability after the filtering process. The quality of the filtered time series, however, was often not satisfactory for a turbulence retrieval. Two causes could be identified for this: (1) an a priori assumption about the radial velocity variance as a prerequisite for the application of the CNS method and (2) an application limitation to DL measurements characterized by uniformly distributed noise (i.e., white noise). The first point is critical, since turbulence measurements essentially rely on variance measurements. It has been shown that inappropriate assumptions either reject too many good estimates or leave too many bad estimates in the data. This is particularly the case when the a priori assumed variance is either too small or too large compared to the true atmospheric situation. As a result, errors in the retrieved turbulence variables occur, for instance an underestimation or overestimation of the TKE, which has been shown by comparing the TKE with independent reference measurements. The second point is a serious limitation if the noise distribution in DL measurements with low pulse accumulation does not represent white noise. This was observed with the StreamLine DL system from the manufacturer HALO Photonics, which was used for the DL test measurements in this study.
In particular, another type of noise was identified, showing a pronounced aggregation of noise values around zero. To our knowledge, this type of noise has not yet been described and analyzed in the literature. At this point it is still unclear whether this type of noise results from the way the Doppler spectra are processed by the StreamLine DL signal processor in the case of low SNR or from an instrument-specific technical issue. Overall, we came to the conclusion that the filtering techniques available so far were not appropriate for use in a pre-processing step to generate noise-free data that can serve as a suitable input for the derivation of turbulence variables.

The drawbacks of the frequently used filtering techniques motivated our work to seek new ideas for filter methods that can be applied to noise-contaminated measurements from conically scanning DL systems with low pulse accumulation. They should allow for both accurate noise filtering and the largest possible data availability. Two different approaches (I + II) were pursued in order to account for possible emerging differences in the noise distribution. Their basic structure consists of two parts, namely a coarse filter and a so-called filter for post-processing. Although each approach has a different coarse filter (I + II) they are applied in both cases based on a newly introduced framework of the VV90D perspective. By plotting the time series of radial velocity measurements (V) from conically scanning DL against the same measurement series but with a phase shift by 90° (V90) the graph of noise-free DL measurements shows distinctive circular patterns which are increasingly faint the noisier the data are. Using this perspective on the measurement data, coarse filter I works by identifying bad estimates in terms of singular points occurring in subsets of the full measurement series which are confined to different circular rings of radius R and fixed width in the VV90 plane. Coarse filter II exploits the fact that the autocorrelation function can provide valuable information about the general existence of bad estimates in noise-contaminated measurements. The filter works by means of an iterative consideration of circular rings with increasing width in the VV90 plane and the calculation of the autocorrelation function of the measurement data that can be found in these rings. Within the iterative process circular rings with mostly good data can be easily located if the autocorrelation function is used to define a termination criterion. Generally, both coarse filters give a useful first guess about good estimates. Depending on the type of noise, either coarse filter I or coarse filter II shows a better performance with respect to data availability and reliability of the data which were assessed as good in this way. The filter for post-processing is applied by considering the DL measurements using the well-known VAD perspective, where reliable measurements from a number of subsequent conical scans typically describe a sinusoidal band. Based on the first-guess information about good estimates from the coarse filter (I + II) the filter for post-processing was developed to determine the envelopes of this sinusoidal band and thus to further narrow down the whole area of good estimates.

The results obtained with both newly introduced filter approaches were verified qualitatively and quantitatively. While the qualitative verification was based on a purely visual assessment of the filter results, the quantitative verification was based on an evaluation of the TKE that was calculated using the filtered measurements as input data for the turbulence retrieval. Because bad estimates still included after the filtering process would introduce large errors in the final TKE product, this is an indirect way to verify the filter results. It could be shown that the deficiencies in the filtered time series, and the related problems regarding data availability and quality of the derived turbulence variables that emerge when traditional filter methods are applied, are significantly reduced with the new filter approaches. In this way, we have found a solution for dealing with noise-contaminated DL measurements if low pulse accumulations are used for the radial velocity estimates. This also creates the basis for using the turbulence retrieval outlined in Smalikho and Banakh (2017) in a 24/7 routine application.

Beyond the application described here, this new filter method can generally be applied to other DL applications using conical scans. One example could be DL wind gust retrievals based on a scan mode as described in Steinheuer et al. (2022). Their mode also uses a small number of pulses (3000 pulses per ray) to provide the high temporal resolution necessary for the derivation of wind gusts, which are defined based on 3 s running mean wind data (World Meteorological Organization, 2018). It remains for future work to apply the new filter to such a wind gust retrieval.

Appendix A: StreamLine DL custom scan files

For the configuration of the Doppler lidar to measure turbulence variables as proposed by Smalikho and Banakh (2017), two files (*.txt, *.dss) are required. These were created under the guidance of the user manual for the StreamLine scanning Doppler lidar system (Revision 04, #DOC 0004-01355 by Lumibird) and have the following content.

routine.txt
A.1=50,S.1=694,P.1=0*A.2=50,S.2=5000,P.2=-24514
W0
A.1=50,S.1=694,P.1=12000000*A.2=50,S.2=5000,P.2=-24514
W80000
A.1=50,S.1=694,P.1=0*A.2=50,S.2=5000,P.2=-24514
W0

routine.dss
000000  routine  2  C 	0
010000  routine  2  C 	0
020000  routine  2  C  0

220000  routine  2  C  0
230000  routine  2  C  0

First a configuration file routine.txt is created, which defines all information about acceleration (A), speed (S), and position (P) of the two motors of the DL scanner, as well as wait times (W) if necessary. More detailed explanations can be found in Sect. 6.4.2 of the user manual. For the operation of the Doppler lidar, the scan scheduler is set to use a daily scan file routine.dss, in which the routine.txt file is used (column 2). Additionally, start times of measurements (hhmmss), the number of samples per ray (k), whether the scan is of step/stare (S) or CSM (C) type, and the focus position must be specified. A detailed explanation of the use of the daily scan schedule can be found in Sect. 6.4.3 of the user manual.

Appendix B: Quantification of the occurrence of noise

A quantification of the occurrence of noise in a series of radial velocity measurements based on a conical scan is feasible by means of the ACF. Noise-free radial velocity measurements based on a conical scan geometry follow a sinusoidal curve when plotting Vr against its corresponding azimuthal measurement direction θ. Hence, consecutive measurements along the curve are not independent of each other, and with a high azimuthal resolution of the measurements neighboring radial velocities are highly correlated. The situation is different with noise-contaminated measurements. Here, the sinusoidal course in the series of consecutive measurements is occasionally interrupted by far-off bad estimates such that consecutive measurement values are completely independent of each other; i.e., they do not correlate. The ACF indicates to what degree a signal is similar to a shifted version of itself as a function of the displacement τ. It will be shown next that the ACF proves helpful for distinguishing between measurement periods with and without noise contamination.

Assuming horizontally homogeneous and stationary wind field conditions, DL measurements taken along the scanning circle are described through Vr = u sin(θ) sin(ϕ) + v cos(θ) sin(ϕ) + w cos(ϕ), where u, v, and w denote the 3D wind vector components, θ the azimuth angle determining the measurement direction along the scanning circle, and ϕ the zenith angle of the inclined laser beam (Päschke et al., 2015). Note that for a continuous conical scan over a number of rounds the azimuthal angle in turn has to be regarded as a function of time, i.e., θ=θ(t). This makes the DL measurements over a specified time interval a periodic signal, reading

(B1) Vr = sin(ϕ) [ u sin( (2π/Tscan)(t + t0) ) + v cos( (2π/Tscan)(t + t0) ) ] + w cos(ϕ).

For simplicity let us assume v = 0. Then it can be shown that the ACF of the periodic signal Vr(t) has a cosine shape, i.e.,

(B2) ACF(τ) = (u² sin²(ϕ)/2) cos( (2π/Tscan) τ ) + w² cos²(ϕ),

where τ denotes the time delay (Sienkowski and Kawecka, 2013). Returning to a view of Vr as a function of the azimuthal angle θ and taking into account that in the atmosphere vertical motions are typically much smaller than horizontal ones, i.e., w ≪ u, Eq. (B2) can be further simplified and rewritten in the following normalized form:

(B3) γ ACF(τ′) = cos(τ′),

with γ = 2 u⁻² sin⁻²(ϕ). Using this notation, the parameter τ′ = (2π/Tscan) τ denotes an azimuthal displacement. For discrete and equidistant measurements along the scan circle the latter can generally be written as τ′ = n Δθ, where Δθ denotes the measurement resolution along the scanning circle and n ∈ ℕ denotes the lag index of the displacements in the series of measurements. Hence, using Eq. (B3) it can be easily verified that for noise-free DL measurements with an azimuthal resolution of Δθ=1° the correlation between neighboring measurements, i.e., n=1, along the scan circle is close to 1, i.e., ACF(τ′=1°) ≈ 1 (here, Δθ has to be expressed in radians). Note, however, that with decreasing azimuthal measurement resolution, i.e., an increase in Δθ, the distance between neighboring measurements becomes larger and thus the correlation lower. For instance, with an azimuthal measurement resolution of Δθ=36° a correlation of only ACF(τ′=36°) ≈ 0.8 between neighboring points may be expected (Fig. B1). These theoretically derived results can be verified by calculating the ACF for the noise-free measurements shown in Fig. 1. Note that despite the obvious inhomogeneity and nonstationarity of the wind field the correlation of neighboring measurement points is close to 1, i.e., ACF(τ′=1°) ≈ 1 (see Fig. 1c). In contrast, for the noise-contaminated measurement examples the occasional independence of neighboring measurements due to the occurrence of bad estimates interrupting the sinusoidal course of the measurements is reflected by low correlation values, namely ACF(τ′=1°) ≈ 0.77 for the type A noise example and ACF(τ′=1°) ≈ 0.33 for the type B noise example (see Fig. 1g, k). Hence, these examples show that the ACF can be used as an indicator for the general occurrence of noise in the series of DL measurements, but without precise knowledge of which of the measurements are bad estimates. Generally it holds that

ACF(n=1; Δθ) = ACFmax : no noise,
ACF(n=1; Δθ) < ACFmax : noise.

Here, ACFmax denotes the maximum expected correlation between neighboring measurement points indicated by n=1. Depending on the azimuthal resolution Δθ, ACFmax takes different values, which can be calculated using Eq. (B3).
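As an illustration, the lag-1 ACF check could be computed as follows (a sketch, assuming Python/NumPy; the simple mean removal and the biased normalization are simplifications):

import numpy as np

def lag_acf(vr, lag=1):
    # normalized autocorrelation of a radial velocity series at a given lag
    vr = np.asarray(vr, dtype=float)
    v = vr - vr.mean()
    return np.dot(v[:-lag], v[lag:]) / np.dot(v, v)

def noise_indicator(vr, dtheta_deg=1.0):
    # compare the measured lag-1 ACF with the maximum expected from Eq. (B3);
    # in practice a threshold such as ACFthresh = 0.95 is used for dtheta_deg = 1 (see Appendix F)
    acf_max = np.cos(np.deg2rad(dtheta_deg))
    acf_1 = lag_acf(vr, lag=1)
    return acf_1, acf_max, acf_1 < acf_max   # True hints at the presence of bad estimates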

https://amt.copernicus.org/articles/17/3187/2024/amt-17-3187-2024-f18

Figure B1(a) Two theoretical examples for radial velocity measurements Vr from a conically scanning Doppler lidar with different azimuthal resolution, i.e., Δθ=6° and Δθ=36°. Gray background indicates the corresponding continuously resolved sine curve. (b) Normalized lag kθ autocorrelation values of the continuous sine shown in (a) for azimuth steps between Δθ=1° and Δθ=90°. The dots indicate the maximum expected value of the autocorrelation function ACF (n=1; Δθ) = ACFmax between neighboring azimuthal measurement points for the discrete measurement series of different azimuthal resolution Δθ shown in (a).


Appendix C: Intercomparison of background noise distribution between different StreamLine and StreamLine XR Doppler lidar systems

Background noise characteristics are best analyzed based on DL measurements at high altitudes where atmospheric signals are unlikely due to low aerosol density. DL measurements from three different StreamLine DL systems (DL78, DL172, DL177) at 1729 m height and from four different StreamLine XR DL systems (DL44, DL146, DL143, DL161) at 1737 m height are shown in Figs. C1 and C2, respectively. Note that the differences in the vertical height between the two types of DL systems are due to the different DL range gate lengths (i.e., 30 m for the StreamLine DL and 48 m for the StreamLine XR DL). The measurements represent the same measurement period, i.e., 29 July 2021 between 02:40 and 03:10 UTC. At the time of the measurement, all seven Doppler lidar systems were positioned next to each other at the GM Falkenberg site and were operated with the same configuration (see Sect. 2.1). Type B noise (i.e., a high density of bad estimates around zero; see Sect. 2.2) characteristics are very pronounced in all three StreamLine DL systems (Fig. C1a–c). In the measurements with the StreamLine XR DL systems, type B noise is weak to moderate (Fig. C2a–d). None of the DL systems really shows pure white noise characteristics (Figs. C1a2–c2 and C2a2–d2), i.e., uniformly distributed radial velocity estimates. Note that the DL161 is the only system whose measurement example still shows some signal at 1737 m height (Fig. C2d), although the measurements for all systems took place under the same atmospheric conditions.

https://amt.copernicus.org/articles/17/3187/2024/amt-17-3187-2024-f19

Figure C1Measurements from three different Doppler lidar StreamLine systems at 1729 m height. Panels (a1)–(c1) show radial velocity time series plots from measurements with DL78, DL172, and DL177 over a time interval of about 30 min. Panels (a2)–(c2) represent the associated radial velocity distributions.


https://amt.copernicus.org/articles/17/3187/2024/amt-17-3187-2024-f20

Figure C2Measurements from four different Doppler lidar StreamLine XR systems at 1737 m height. Panels (a1)–(d1) show radial velocity time series plots from measurements with DL44, DL146, DL143, and DL161 over a time interval of about 30 min. Panels (a2)–(d2) represent the associated radial velocity distributions.


Appendix D: Radial velocity uncertainty estimates

An equation for uncertainty estimates of DL radial velocity is given in Pearson et al. (2009) and reads

(D1) σ (m s−1) = √(2√π/α) (1 + 0.16 α) Δν/√Np,

with

(D2) α = SNR/[√(2π) (Δν/B)],
(D3) Np = M Na (SNR).

Here, the system parameters Np, B, and M denote the accumulated photocount, the bandwidth used, and the gate length in points, respectively. The signal spectral width Δν depends on both instrumental and atmospheric conditions (Doviak and Zrnic, 1993), i.e.,

(D4) Δν = σtot = √((σi)² + (σa)²),

with σi given through (see Eqs. 6 and 7 in Frehlich, 2004)

(D5) σi = λω/2,
(D6) ω = √(ln 2/2)/(π Δt) = 0.1873906/Δt,

where λ denotes the wavelength of the Doppler lidar. Here, Δt denotes the pulse width which is used to calculate the spectral width ω and which can be transformed into a spectral width σi in velocity space via Eq. (D5). Note that in Pearson et al. (2009) not the pulse width Δt but the pulse length Δr is given as a lidar parameter. With knowledge of Δr the pulse width Δt can be calculated via

(D7) Δr = cΔt/2,

where c denotes the speed of light (Frehlich, 2004). The value σa in Eq. (D4) denotes the atmospheric broadening factor. In Pearson et al. (2009) it is assumed that σa=1 m s−1.
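A minimal sketch evaluating Eqs. (D1)–(D7) as written above is given below (assuming Python/NumPy; the example parameter values in the call are placeholders rather than the settings of the systems used in this study, and the bandwidth B is assumed to be expressed in velocity units, consistent with Δν):

import numpy as np

def radial_velocity_uncertainty(snr, wavelength, pulse_length, n_accum,
                                gate_points, bandwidth, sigma_a=1.0):
    # uncertainty of a DL radial velocity estimate following Eqs. (D1)-(D7)
    c = 299792458.0                                    # speed of light (m/s)
    dt = 2.0 * pulse_length / c                        # Eq. (D7): pulse width (s)
    omega = (np.log(2.0) / 2.0) ** 0.5 / (np.pi * dt)  # Eq. (D6): spectral width (Hz)
    sigma_i = wavelength * omega / 2.0                 # Eq. (D5): width in velocity space (m/s)
    dnu = np.sqrt(sigma_i ** 2 + sigma_a ** 2)         # Eq. (D4): total signal width (m/s)
    alpha = snr / ((2.0 * np.pi) ** 0.5 * (dnu / bandwidth))   # Eq. (D2)
    n_p = gate_points * n_accum * snr                  # Eq. (D3): accumulated photocount
    return (2.0 * np.pi ** 0.5 / alpha) ** 0.5 * (1.0 + 0.16 * alpha) * dnu / n_p ** 0.5  # Eq. (D1)

# illustrative call with placeholder values (SNR as a linear ratio, lengths in m)
print(radial_velocity_uncertainty(snr=0.05, wavelength=1.5e-6, pulse_length=30.0,
                                  n_accum=3000, gate_points=10, bandwidth=38.0))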

Appendix E: On the determination of the critical radius rc

Using the framework of the VV90D perspective, the critical radius rc can be determined by analyzing DL measurements from higher range gate numbers where only background noise and no true signal characterizes the DL measurements. An example of such a situation is shown in Fig. E1. Here the circular rings with central radii ri below rc=4.5 m s−1 show an increased availability of data which can be clearly assigned to “noise around zero”. For ri>rc the data availability is almost uniformly distributed. We know from our experience of working with various DL systems that the noise characteristics can differ from system to system. For that reason the above-described features are characteristic of the DL used in our studies and cannot be generalized.
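A simple way to produce the ri−count{Vi} diagram for such a high range gate and read off rc could look as follows (a sketch, assuming Python/NumPy; the non-overlapping ring binning and the thresholds used to detect the end of the near-zero enhancement are illustrative assumptions only):

import numpy as np

def ring_count_diagram(v, v90, dr=0.5, r_max=27.0):
    # data count per circular ring of width 2*dr in the (V90, V) plane
    r = np.hypot(v, v90)
    edges = np.arange(0.0, r_max + 2.0 * dr, 2.0 * dr)
    counts, _ = np.histogram(r, bins=edges)
    centres = 0.5 * (edges[:-1] + edges[1:])
    return centres, counts

def estimate_rc(centres, counts, r_background=10.0, factor=2.0):
    # rc: outer edge of the innermost rings whose counts clearly exceed the
    # background level estimated from rings far away from zero
    background = np.median(counts[centres > r_background])
    enhanced = np.where((counts > factor * background) & (centres < r_background))[0]
    half = 0.5 * (centres[1] - centres[0])
    return centres[enhanced[-1]] + half if enhanced.size else 0.0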

https://amt.copernicus.org/articles/17/3187/2024/amt-17-3187-2024-f21

Figure E1Noise characteristics of the Doppler lidar DL78. The data represent a 30 min time interval at range gate number 90. The panels on the left show (a) the VV90D perspective of the radial velocity measurements and (b) the corresponding ri−count{Vi} diagram to describe the availability of data in different circular rings ri−Δr ≤ r ≤ ri+Δr spanning the (V90, V) plane, where i = 1, …, n with n ∈ ℕ and Δr=0.5 m s−1. The panels on the right are, with regard to their order, analogous to the plots shown in Fig. 1, i.e., (c) the time series plot of the DL radial velocity estimates, (d) the corresponding time series plot of the signal-to-noise ratio (SNR), (e) the corresponding VAD plot of the measurements, and (f) the histogram of DL radial velocity estimates.


Appendix F: Guidance on a practical implementation of coarse filter II

From a practical point of view there are four further issues which are worth pointing out for DL users when applying coarse filter II. Firstly, from our experience we know that for an azimuthal resolution of Δθ=1° a value of ACF(τ=1)<0.95 indicates a relatively high fraction of bad estimates (see also the examples shown in Fig. 1). Hence, for a successful application of the method the threshold value ACFthresh= 0.95 is recommended. For DL measurements with an azimuthal resolution lower than Δθ=1° this value can be different (see Appendix B). Secondly, it can be useful to apply coarse filter II a second time for a further improvement of the filter results. Prior to a second application, however, the diagrams representing the VV90D perspective have to be re-drawn based on the filtered time series obtained from the first application of coarse filter II but with a phase shift in the opposite direction, i.e., ΔΘ=+90°. The outcome of a second application of coarse filter II (indicated by capital letter B in the plot legends) in comparison to the results obtained from the first application (indicated by capital letter A in the plot legends) is shown in Figs. 9 and 10. In these cases, however, the second application (i.e., coarse filter II – B) has no significant advantage compared to the first application (i.e., coarse filter II – A) since the ACF values for each iteration step (see Fig. 9c, f) remain above the ACF threshold value. Hence, there is no need to discard subsets Vi from the circular rings ri−Δr ≤ r ≤ ri+Δr under consideration. Note, however, that although no data have been discarded the final filter results after the second application (see Fig. 10) show a reduced data availability in comparison to the first application. This effect is a rather technical consequence of the opposite shift of the filtered time series obtained from the first application. Thirdly, there are also practical limits to the application of coarse filter II. They arise when the data count of the initial circular ring with the maximum number of data is already very low. This is especially the case when the original time series is highly contaminated with noise. In this case the gaps due to flag values in the initially determined time series are too large, so that no meaningful results are obtained with a linear interpolation to fill these gaps. Accordingly, the corresponding ACF value of the time series is not suitable as a trustworthy indicator for the occurrence of bad estimates. Since it is difficult to give a threshold value for the required frequency maximum to exceed in order to get reliable ACF results, it is recommended to always check the ACF of the finally filtered time series and to use only those filtered series where the condition ACF > ACFthresh is met. Fourthly, in the case of pronounced type B noise it may happen that the data availability in circular rings including most of the noise around zero is comparable to the availability of data in rings including most of the true signals, i.e., good estimates (see Fig. 7e). The challenge in such a case is to ensure that for the generation of the initial filtered time series the circular ring ri−Δr ≤ r ≤ ri+Δr with MAX(count{Vi}) has been chosen correctly, i.e., the subset of high data availability containing the good signals instead of the bad signals. To achieve this it may be helpful to always test the data availability distribution per circular ring (i.e., the distribution displayed in the ri−count{Vi} diagram) for multi-modality.
When multi-modality occurs and a circular ring with a central radius ri<rc representing a local maximum of data does exist, all circular rings with central radii ri<rc should be excluded prior to employing coarse filter II. Here, rc denotes a critical radius below which one has to expect a pronounced concentration of bad estimates if the noise is of type B. How this value can be determined is described in more detail in Appendix E. For the DL used in our studies we found rc=4.5 m s−1. This value is also relatively constant with time and explains the missing data in the ri−count{Vi} diagram shown in Fig. 9e for ri<4.5 compared to the ri−count{Vi} diagram shown in Fig. 7e for the same measurement interval. It should be noted, however, that the above-described procedure of circular ring exclusion is not recommended for type-B-contaminated measurements during weak and stationary wind conditions. In such a case multi-modality in the ri−count{Vi} diagram is not expected anyway, since with decreasing wind speeds the circular rings including most of the true signals and those including most of the noise around zero increasingly merge; excluding rings would then carry the risk of discarding a large fraction of good estimates. This would in turn impair the derivation of reliable wind and turbulence values during weak wind conditions.

Appendix G: Strategy for implementation of approach I and approach II

The systematic intercomparison of both filtering techniques in Sect. 4.4 leads to the following general conclusion: for filter approach I the most limiting factor is noise around zero (type B noise), while problems may arise for filter approach II due to nonstationarity in the wind field. These limiting factors are very different in nature. For approach I the limitations are due to issues with the DL system's background noise. This seems to be an instrumental issue and there might be a chance to get this problem solved by the manufacturer. In contrast, the limitation of approach II is that it cannot be used for all wind situations. The DL end user has no influence on this and would therefore have to accept these limitations when using the filter approach. Hence, from our point of view there are two options for an implementation of the filtering techniques into an operational product retrieval process. Provided that the DL's background noise is always of type A, an implementation of approach I appears to be sufficient to detect and reject bad estimates. If the DL system's background noise varies between type A and type B noise both filtering techniques should be implemented. This should be combined with a decision strategy that ensures the employment of approach I in the case of measurements contaminated with type A (or no) noise or the employment of approach II in the case of type B noise. In order to be able to choose between the two options a good understanding of the measurement systems' noise characteristics is required.

The combined application of both approaches presents another challenge for the implementation process. From a visual inspection of the measurement data it is easy to differentiate between type A and type B noise; for routine processing, however, an automated decision-making strategy is required. This could be arranged as follows: first, both filter approaches are applied to each measurement interval under consideration, yielding two differently filtered datasets that are not necessarily identical. Since knowing the good radial velocities also means knowing the bad ones, the distribution of the bad estimates obtained with correct filtering provides useful information about the type of noise occurring over the measurement interval: in the case of type A noise one would expect uniformly distributed bad estimates, whereas in the case of type B noise a maximum close to zero is characteristic (see also Sect. 2.2). On this basis a decision on the appropriate filter method can be made.
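As an illustration of such a decision rule, the following sketch classifies the noise type from the rejected estimates and selects the filter accordingly. It is a minimal example under assumptions not taken from the paper: the mask of accepted estimates is supplied externally, and both the near-zero bound of 1 m s−1 and the 50 % fraction used to declare type B noise are illustrative choices rather than values from our processing.

```python
import numpy as np

def choose_filter(vr, kept_by_approach_II):
    """Select filter approach I or II for one measurement interval.

    vr                  : all radial velocity estimates of the interval
    kept_by_approach_II : boolean mask of estimates accepted by approach II

    Bad estimates roughly uniform over the velocity range suggest type A
    noise (approach I is sufficient); bad estimates clustered around zero
    suggest type B noise (approach II is required).
    """
    vr = np.asarray(vr, dtype=float)
    bad = vr[~np.asarray(kept_by_approach_II, dtype=bool)]
    if bad.size == 0:
        return "approach I"   # nothing rejected, no evidence of type B noise
    near_zero_fraction = np.mean(np.abs(bad) < 1.0)   # assumed 1 m/s bound
    return "approach II" if near_zero_fraction > 0.5 else "approach I"
```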

Appendix H: TKE from noise-free DL measurements

For DL measurements without noise issues (Fig. H1, panels 1a–c), reasonable TKE products can be obtained with the turbulence retrieval method outlined in Smalikho and Banakh (2017) (Fig. H2). The method builds on work by Kropfli (1986), which focused on the determination of TKE from radar measurements. When transferring the approach to DL measurements, Smalikho and Banakh (2017) suggest additional correction terms for both the underestimation of TKE due to pulse volume averaging effects (σt²) and the overestimation due to the instrumental error of the radial velocity estimates (σe²). A comparison of DL-based TKE with independent sonic measurements at a height of 90 m is shown in Fig. H2. To clarify the effect of the σt² and σe² corrections, TKE values without (KR1986 hereafter) and with the suggested corrections (SM2017 hereafter) are shown. With the KR1986 retrieval the TKE underestimation due to the pulse averaging effect is most pronounced during the night, i.e., while the atmospheric boundary layer is in a stable state. Under stable conditions smaller-scale eddies dominate, which evidently cannot be fully resolved over the pulse volume. During the day the corrections are smaller because the main fraction of the observed TKE is associated with larger and thus resolvable eddies. Although SM2017 overestimates TKE on average by 6.3 %, its results agree better with the sonic data than the KR1986 values, which underestimate the TKE by 12.45 % on average. The corrected TKE is thus in good agreement with the sonic TKE, which gives evidence that the retrieval method itself functions properly.
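For reference, the relative error shown in Fig. H2b and its daily mean can be computed as in the short sketch below. It assumes that the relative error is defined with respect to the sonic TKE, i.e., (TKE_DL − TKE_sonic)/TKE_sonic; the exact definition used for Fig. H2 may differ.

```python
import numpy as np

def tke_relative_error(tke_dl, tke_sonic):
    """Relative error (%) of DL-derived TKE with respect to sonic TKE,
    together with its mean over the considered period."""
    tke_dl = np.asarray(tke_dl, dtype=float)
    tke_sonic = np.asarray(tke_sonic, dtype=float)
    rel_err = 100.0 * (tke_dl - tke_sonic) / tke_sonic
    return rel_err, np.nanmean(rel_err)
```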


Figure H1. Time series plots of Doppler lidar radial velocity Vr (1a, 2a) and corresponding SNR values (1b, 2b) from a conically scanning Doppler lidar for a day with good measurement conditions (4 April 2019), reflected in relatively high SNR values, and a day with unfavorable measurement conditions (25 May 2021), reflected in relatively low SNR values. The corresponding (Vr, SNR) plots are shown in panels (1c) and (2c). The dotted lines in panels (1b), (1c), (2b), and (2c) indicate the SNR threshold value SNRthresh = 12.7 dB.



Figure H2. (a) Time series of TKE at 90 m height derived from sonic anemometer measurements and from DL data using the methods proposed by Kropfli (1986) (KR1986) and Smalikho and Banakh (2017) (SM2017), respectively, for 4 April 2019. (b) Relative error between Doppler lidar TKE and sonic TKE for the sample day shown in (a). The horizontal lines indicate the corresponding mean relative error over the whole day.


Data availability

Doppler lidar datasets used for the analysis, including radial velocity measurements (level 1 data) and retrieved wind and turbulence products (level 2 data), are available via the ZFDM repository of the Universität Hamburg (https://doi.org/10.25592/uhhfdm.10559, Päschke, 2022).

Author contributions

CD and EP performed the measurements. EP conceived the investigations, did the formal analysis of the data, and developed the filter methods. EP implemented the filter methods in continuous interaction with CD. CD investigated the transferability of the filter methods to other scan strategies. EP visualized the data and wrote the manuscript draft. CD and EP discussed and finalized the paper together.

Competing interests

The contact author has declared that none of the authors has any competing interests.

Disclaimer

Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors.

Acknowledgements

We thank Ronny Leinweber for his support in configuring the Doppler lidar and creating the level 1 data. We thank Markus Kayser for introducing the idea of a coordinate transformation of VAD data into the phase-space perspective. Frank Beyrich is acknowledged for valuable contributions to the final writing of the paper. Thanks to Volker Lehmann for comments on the paper.

Review statement

This paper was edited by Robin Wing and reviewed by two anonymous referees.

References

Abdelazim, S., Santoro, D., Arend, M., Moshary, F., and Ahmed, S.: Signal to Noise Ratio Characterization of Coherent Doppler Lidar Backscattered Signals, The 27th International Laser Radar Conference (ILRC 27), New York City, USA, 5–10 July 2015, EPJ Web of Conferences, vol. 119, 4 pp., https://doi.org/10.1051/epjconf/201611917014, 2016.

Banakh, V. and Smalikho, I.: Coherent Doppler Wind Lidars in a Turbulent Atmosphere, illustrated edn., Artech House Publishers, Boston, USA, 248 pp., ISBN 978-1608076673, 2013.

Banakh, V. and Werner, C.: Computer simulation of coherent Doppler lidar measurement of wind velocity and retrieval of turbulent wind statistics, Opt. Eng., 44, 071205, https://doi.org/10.1117/1.1955167, 2005.

Banakh, V. A., Smalikho, I. N., and Falits, A. V.: Estimation of the height of the turbulent mixing layer from data of Doppler lidar measurements using conical scanning by a probe beam, Atmos. Meas. Tech., 14, 1511–1524, https://doi.org/10.5194/amt-14-1511-2021, 2021.

Beck, H. and Kühn, M.: Dynamic Data Filtering of Long-Range Doppler LiDAR Wind Speed Measurements, Remote Sens.-Basel, 9, 561, https://doi.org/10.3390/rs9060561, 2017.

Beu, C. M. L. and Landulfo, E.: Turbulence Kinetic Energy Dissipation Rate Estimate for a Low-Level Jet with Doppler Lidar Data: A Case Study, Earth Interact., 26, 112–121, https://doi.org/10.1175/EI-D-20-0027.1, 2022.

Bland, J. and Altman, D.: Statistical methods for assessing agreement between two methods of clinical measurement, Lancet, 327, 307–310, https://doi.org/10.1016/S0140-6736(86)90837-8, 1986.

Bodini, N., Lundquist, J. K., and Newsom, R. K.: Estimation of turbulence dissipation rate and its variability from sonic anemometer and wind Doppler lidar during the XPIA field campaign, Atmos. Meas. Tech., 11, 4291–4308, https://doi.org/10.5194/amt-11-4291-2018, 2018.

Bonin, T. A., Choukulkar, A., Brewer, W. A., Sandberg, S. P., Weickmann, A. M., Pichugina, Y. L., Banta, R. M., Oncley, S. P., and Wolfe, D. E.: Evaluation of turbulence measurement techniques from a single Doppler lidar, Atmos. Meas. Tech., 10, 3021–3039, https://doi.org/10.5194/amt-10-3021-2017, 2017.

Dabas, A.: Semiempirical Model for the Reliability of a Matched Filter Frequency Estimator for Doppler Lidar, J. Atmos. Ocean. Tech., 16, 19–28, https://doi.org/10.1175/1520-0426(1999)016<0019:SMFTRO>2.0.CO;2, 1999.

Doviak, R. and Zrnic, D. S.: Doppler Radar and Weather Observations, 2nd edn., Academic Press, Boston, MA, USA, https://doi.org/10.1016/B978-0-12-221420-2.X5001-7, 1993.

Drew, D., Barlow, J., and Lane, S.: Observations of wind speed profiles over Greater London, UK, using a Doppler lidar, J. Wind Eng. Ind. Aerod., 121, 98–105, https://doi.org/10.1016/j.jweia.2013.07.019, 2013.

Eberhard, W. L., Cupp, R. E., and Healy, K. R.: Doppler lidar measurement of profiles of turbulence and momentum flux, J. Atmos. Ocean. Tech., 6, 809–819, 1989.

FESSTVaL: Field Experiment on Submesoscale Spatio-Temporal Variability in Lindenberg, https://fesstval.de/ (last access: 31 March 2023), 2021.

Filioglou, M., Preissler, J., Troiville, A., Thobois, L., Vakkari, V., Auvinen, M., Fortelius, C., Gregow, E., Hämäläinen, K., Hellsten, A., Järvi, L., O'Connor, E., Schönach, D., and Hirsikko, A.: Evaluating modelled winds over an urban area using ground-based Doppler lidar observations, Meteorol. Appl., 29, e2052, https://doi.org/10.1002/met.2052, 2022.

Frehlich, R.: Simulation of Coherent Doppler Lidar Performance in the Weak-Signal Regime, J. Atmos. Ocean. Tech., 13, 646–658, https://doi.org/10.1175/1520-0426(1996)013<0646:SOCDLP>2.0.CO;2, 1995.

Frehlich, R.: Velocity Error for Coherent Doppler Lidar with Pulse Accumulation, J. Atmos. Ocean. Tech., 21, 905–920, https://doi.org/10.1175/JTECH1596, 2004.

Gauss, C. F.: Bestimmung der Genauigkeit der Beobachtungen, Zeitschrift für Astronomie und Verwandte Wissenschaften, 1, 187–197, 1816.

Gränicher, W. H.: Messung beendet – was nun?, 2nd edn., vdf Hochschulverlag AG an der ETH Zürich, 119 pp., ISBN 978-3-7281-3314-4, 1996.

Hellhammer, J. (Ed.): Air quality – Environmental meteorology – Part 2: Ground-based remote sensing of wind by heterodyne pulsed Doppler lidar (ISO 28902-2:2017), Engl. VDI/DIN-Kommission Reinhaltung der Luft (KRdL) – Normenausschuss, https://doi.org/10.31030/2806319, 2018.

Hohenegger, C., Ament, F., F., B., Löhnert, U., Rust, H., Bange, J., T., B., Ch., B., Boventer, J., Burgemeister, F., Clemens, M., Detring, C., Detring, I., Dewani, N., Duran, I., Fiedler, S., Göber, M., van Heerwaarden, C., Heusinkveld, B., Kirsch, B., Klocke, D., Knist, C., Lange, I., Lauermann, F., Lehmann, V., Lehmke, J., Leinweber, R., Lundgren, K., Masbou, M., Mauder, M., Mol, W., Nevermann, H., Nomokonova, T., Päschke, E., Platis, A., Reichardt, J., Rochette, L., Sakradzija, M., Schlemmer, L., Schmidli, J., Shokri, N., Sobottke, V., Speidel, J., Steinheuer, J., Turner, D., Vogelmann, H., Wedemeyer, C., Weide-Luiz, E., Wiesner, S., Wildmann, N., Wolz, K., and Wetz, T.: FESSTVaL: the Field Experiment on Submesoscale Spatio-Temporal Variability in Lindenberg, B. Am. Meteorol. Soc., 104, E1875–E1892, https://doi.org/10.1175/BAMS-D-21-0330.1, 2023.

Iglewicz, B. and Hoaglin, D.: How to Detect and Handle Outliers, ASQC Basic References in Quality Control, 1st edn., ASQC Quality Press, 87 pp., ISBN 9780873892476, 1993.

Krishnamurthy, R., Choukulkar, A., Calhoun, R., Fine, J., Oliver, A., and Barr, K.: Coherent Doppler lidar for wind farm characterization, Wind Energy, 16, 189–206, https://doi.org/10.1002/we.539, 2013.

Kropfli, R. A.: Single Doppler Radar Measurements of Turbulence Profiles in the Convective Boundary Layer, J. Atmos. Ocean. Tech., 3, 305–314, https://doi.org/10.1175/1520-0426(1986)003<0305:SDRMOT>2.0.CO;2, 1986.

Li, J., Gong, W., and Ma, Y.: Atmospheric LIDAR Noise Reduction Based On Ensemble Empirical Mode Decomposition, Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XXXIX-B8, 127–129, https://doi.org/10.5194/isprsarchives-XXXIX-B8-127-2012, 2012.

Liu, Z., Barlow, J. F., Chan, P.-W., Fung, J. C. H., Li, Y., Ren, C., Mak, H. W. L., and Ng, E.: A Review of Progress and Applications of Pulsed Doppler Wind LiDARs, Remote Sens.-Basel, 11, 2522, https://doi.org/10.3390/rs11212522, 2019.

Newsom, R. K., Brewer, W. A., Wilczak, J. M., Wolfe, D. E., Oncley, S. P., and Lundquist, J. K.: Validating precision estimates in horizontal wind measurements from a Doppler lidar, Atmos. Meas. Tech., 10, 1229–1240, https://doi.org/10.5194/amt-10-1229-2017, 2017.

O'Connor, E. J., Illingworth, A. J., Brooks, I. M., Westbrook, C. D., Hogan, R. J., Davies, F., and Brooks, B. J.: A Method for Estimating the Turbulent Kinetic Energy Dissipation Rate from a Vertically Pointing Doppler Lidar, and Independent Evaluation from Balloon-Borne In Situ Measurements, J. Atmos. Ocean. Tech., 27, 1652–1664, https://doi.org/10.1175/2010JTECHA1455.1, 2010.

Päschke, E.: FESSTVaL Falkenberg Doppler lidar 30 minutes mean wind and turbulence profiles (Version 01), ZFDM Repository of the Universität Hamburg [data set], https://doi.org/10.25592/uhhfdm.10559, 2022.

Päschke, E., Leinweber, R., and Lehmann, V.: An assessment of the performance of a 1.5 µm Doppler lidar for operational vertical wind profiling based on a 1-year trial, Atmos. Meas. Tech., 8, 2251–2266, https://doi.org/10.5194/amt-8-2251-2015, 2015.

Pearson, G., Davies, F., and Collier, C.: An Analysis of the Performance of the UFAM Pulsed Doppler Lidar for Observing the Boundary Layer, J. Atmos. Ocean. Tech., 26, 240–250, https://doi.org/10.1175/2008JTECHA1128.1, 2009.

Rosenmai, P.: Using the Median Absolute Deviation to Find Outliers: Unsymmetric Distributions and the Double MAD, Eureka Statistics [code], https://eurekastatistics.com/using-the-median-absolute-deviation-to-find-outliers/ (last access: 28 May 2023), 2013.

Rye, B. and Hardesty, R.: Discrete spectral peak estimation in incoherent backscatter heterodyne lidar. II. Correlogram accumulation, IEEE T. Geosci. Remote, 31, 28–35, https://doi.org/10.1109/36.210441, 1993.

Sanchez Gomez, M., Lundquist, J. K., Klein, P. M., and Bell, T. M.: Turbulence dissipation rate estimated from lidar observations during the LAPSE-RATE field campaign, Earth Syst. Sci. Data, 13, 3539–3549, https://doi.org/10.5194/essd-13-3539-2021, 2021.

Sathe, A. and Mann, J.: A review of turbulence measurements using ground-based wind lidars, Atmos. Meas. Tech., 6, 3147–3167, https://doi.org/10.5194/amt-6-3147-2013, 2013.

Sathe, A., Mann, J., Vasiljevic, N., and Lea, G.: A six-beam method to measure turbulence statistics using ground-based wind lidars, Atmos. Meas. Tech., 8, 729–740, https://doi.org/10.5194/amt-8-729-2015, 2015.

Sienkowski, S. and Kawecka, E.: Probabilistic properties of sinusoidal signal autocorrelation function, Przegląd Elektrotechniczny, 89, http://pe.org.pl/articles/2013/11/26.pdf (last access: 3 March 2023), 2013.

Smalikho, I.: Techniques of Wind Vector Estimation from Data Measured with a Scanning Coherent Doppler Lidar, J. Atmos. Ocean. Tech., 20, 276–291, https://doi.org/10.1175/1520-0426(2003)020<0276:TOWVEF>2.0.CO;2, 2003.

Smalikho, I. N. and Banakh, V. A.: Measurements of wind turbulence parameters by a conically scanning coherent Doppler lidar in the atmospheric boundary layer, Atmos. Meas. Tech., 10, 4191–4208, https://doi.org/10.5194/amt-10-4191-2017, 2017.

Steinheuer, J., Detring, C., Beyrich, F., Löhnert, U., Friederichs, P., and Fiedler, S.: A new scanning scheme and flexible retrieval for mean winds and gusts from Doppler lidar measurements, Atmos. Meas. Tech., 15, 3243–3260, https://doi.org/10.5194/amt-15-3243-2022, 2022.

Stephan, A., Wildmann, N., and Smalikho, I.: Spatiotemporal visualization of wind turbulence from measurements by a Windcube 200s lidar in the atmospheric boundary layer, Proc. SPIE 10833, 24th International Symposium on Atmospheric and Ocean Optics: Atmospheric Physics, 13 December 2018, Tomsk, Russian Federation, Society of Photo-Optical Instrumentation Engineers (SPIE), 1083357, https://doi.org/10.1117/12.2504468, 2018.

Strauch, R. G., Merritt, D. A., Moran, K. P., Earnshaw, K. B., and de Kamp, D. V.: The Colorado Wind-Profiling Network, J. Atmos. Ocean. Tech., 1, 37–49, https://doi.org/10.1175/1520-0426(1984)001<0037:TCWPN>2.0.CO;2, 1984.

Teschke, G. and Lehmann, V.: Mean wind vector estimation using the velocity–azimuth display (VAD) method: an explicit algebraic solution, Atmos. Meas. Tech., 10, 3265–3271, https://doi.org/10.5194/amt-10-3265-2017, 2017.

Thobois, L., Cariou, J. P., and Gultepe, I.: Review of Lidar-Based Applications for Aviation Weather, Pure Appl. Geophys., 176, 1959–1976, https://doi.org/10.1007/s00024-018-2058-8, 2019.

Vogel, H.: Gerthsen Physik, 19th edn., Springer, 1262 pp., ISBN 9783540629887, 1997.

World Meteorological Organization: Measurement of surface wind, in: Guide to Meteorological Instruments and Methods of Observation, Volume I – Measurement of Meteorological Variables, No. 8, 196–213, https://library.wmo.int/index.php?lvl=notice_display&id=12407#.ZFuobs7P1aS (last access: May 2023), 2018.

Short summary
Even a small amount of noise in Doppler lidar radial velocity measurements can contribute to large errors in retrieved turbulence variables. In order to distinguish between plausible and erroneous measurements, we developed new filter techniques that work independently of the choice of a specific signal-to-noise ratio threshold. The performance of these techniques is discussed both by assessing the filter results and by comparing retrieved turbulence variables against independent measurements.