Algorithm for vertical distribution of boundary layer aerosol components in remote-sensing data

. The vertical distribution of atmospheric aerosol components is vital to the estimation of radiative forcing and the catalysis of atmospheric photochemical processes. Based on the synergy of ground-based lidar and sun-photometer in Generalized Aerosol Retrieval from Radiometer and Lidar Combined data (GARRLiC), this paper developed a new algorithm to get the vertical mass concentration proﬁles of ﬁne-mode aerosol components for the ﬁrst time. Retrieval of aerosol properties was achieved based on the sky radiance at multiple scatter angles, total optical depth (TOD) at 440, 675, 870, and 1020 nm, and lidar signals at 532 and 1064 nm. In addition, the internal mixing model and normalized volume size distribution (VSD) model were estab-lished according to the absorption and water solubility of the aerosol components, to separate the proﬁles of black carbon (BC), water-insoluble organic matter (WIOM), water-soluble organic matter (WSOM), ammonium nitrate-like (AN), and ﬁne aerosol water (AW) content. Results showed a reasonable vertical distribution of aerosol components compared with in situ observations and reanalysis data. The estimated and observed BC concentrations matched well with a correlation coefﬁcient up to 0.91, while there was an evident overestimation of organic matter (OM = WIOM + WSOM, NMB = 0.98). Moreover, the retrieved AN concentrations were closer to the simulated results ( R = 0.85), especially in polluted conditions. The BC and OM correlations were relatively weaker, with a correlation coefﬁcient of ∼ 0.5. Be-sides, the uncertainties caused by the input parameters (i.e., relative humidity (RH), volume concentration, and extinction coefﬁcients) were assessed using the Monte Carlo method. The AN and AW had smaller uncertainties at higher RH. remote-sensing AN


Introduction
Atmospheric aerosols play a key role in the radiation budget and energy balance (Andrews and Forster, 2020;Hasekamp et al., 2019). The aerosols with different optical and physical properties have diverse radiative forcing effects (Boucher et al., 2013). For example, the soot dominated by black carbon (BC) has the most significant effect on cloud cover and precipitation due to its strong absorption (Gu et al., 2006;Xia, 2014), while the negative radiative forcing of sulfate and nitrate is more prominent (Myhre et al., 2013). Especially, different aerosol components can even cause opposite radiation changes vertically (Jiang et al., 2013). On the other hand, aerosols can affect the atmospheric oxidation capacity by changing the photolysis rate of trace gases (Bian et al., 2003;Lou et al., 2014;Xing et al., 2017). Liao et al. (1999) once found that nonabsorbent aerosols generally enhanced photolysis rates, contrary to the soot aerosols. Moreover, the ability of aerosols to reduce the photolysis frequency of O 3 decreases with altitude on a regional scale (Li et al., 2011). Therefore, the vertical distribution of atmospheric aerosol components is of vital importance to reduce the uncertainty of radiative forcing estimation and understand the impact of haze on atmospheric photochemical processes.
At present, there are many studies focusing on the aerosol components on the ground (Han et al., 2015;Huang et al., 2014;Zhao et al., 2013). However, the ways to get the aerosol components in the atmosphere are finite. Although the field campaigns often launched aircraft Zhang et al., 2009) or tethered balloons Ran et al., 2016) to detect the atmospheric structure, there are still many limitations in the resolution and representation due to the restricted aircraft control. In such a situation, continuous remote-sensing technology with high temporal resolution, such as sun-photometer and lidar, provides a powerful tool for the identification of aerosol components. Besides, the establishment of ground-based networks, e.g., AERONET (Dubovik et al., 2002, AD-Net Nishizawa et al., 2017), and MPLNET (Chew et al., 2013;Huang et al., 2011), also improves the spatial detection resolution.
So far, the algorithms that use instantaneous remotesensing measurements to retrieve atmospheric aerosol components have been greatly developed. The available aerosol parameters from sun-photometer make it possible to distinguish the components. Schuster et al. (2005) proposed a three-component model constrained by the refractive index to infer BC, ammonium sulfate, and water. Subsequently, the absorbing organic carbon (OC) and dust were supplemented to the model by Arola et al. (2011) and , respectively. By joint use of the refractive index, single scattering albedo, sphericity, and other measurements, Van Beelen et al. (2014) and Xie et al. (2017) greatly increased the identifiable aerosol components. Besides, internal mixing and hygroscopic growth of aerosols were also considered in the algorithms to reproduce the real state (Schuster et al., 2016;Zhang et al., 2018aZhang et al., , 2020. However, the aerosol vertical distribution is not available from the sun-photometer, which can be made up by the ground-based lidar with its vertical resolution of meters. Burton et al. (2012) and Groß et al. (2011) found the characteristics of different aerosol types in lidar parameters, proving the feasibility of lidar. Nishizawa et al. (2007) made use of dual-wavelength elastic lidar but only separated watersoluble aerosol from dust or sea salt. After that, the application of Raman lidar and more wavelengths made it possible to get the profiles of sea salt, soot, dust, and water-soluble aerosols (Nishizawa et al., 2011Hara et al., 2018). Mamouri and Ansmann (2017) refined the fine and coarse dust and separated them from maritime and anthropogenic aerosols based on a polarization/Raman lidar. However, limited by the available lidar information, there has been no breakthrough in the aerosol types identified by lidar measurements. Therefore, how to use limited lidar channels to distinguish more atmospheric aerosol components is what we need to investigate. And considering the advantages and limitations of sun-photometer and lidar, it may be a good choice to combine them.
In this study, based on the synergy of ground-based Mie lidar and sun-photometer in Generalized Aerosol Retrieval from Radiometer and Lidar Combined data (GARRLiC), a new algorithm to get the vertical profiles of fine-mode aerosol components, including BC, water-insoluble organic matter (WIOM), water-soluble organic matter (WSOM), ammonium nitrate-like (AN), and fine aerosol water (AW) content, is proposed for the first time. The details about the algorithm and the measurement data applied to the algorithm will be described in Sect. 2. Section 3 will present the evaluation, uncertainty analysis, and application of our algorithm to confirm the validity of inversion results.
2 Methodology and data 2.1 Methodology

Aerosol microphysical characteristics
Atmospheric aerosol is a complicated mixture of different components, which have different size distributions and complex refractive indexes (CRIs). This makes it possible to separate aerosol components by remote sensing. However, we should note that the aerosol components in the remotesensing model are not completely equivalent to the chemical compositions traditionally, and there are some limitations in identifying compositions compared with surface chemical measurements. For example, distinguishing sulfate and nitrate seems to be beyond the scope of remote sensing due to their similar optical properties in light scattering, particle size, and shape. Despite all of this, the common remotesensing components, including black carbon (BC), brown carbon (BrC), dust, organic matter (OM), ammonium nitratelike (AN), sea salt, and water uptake, are close to the species defined in the Intergovernmental Panel on Climate Change (IPCC) 2013 (Boucher et al., 2013) and enough to satisfy the need for climate change and environmental monitoring. In fact, there are often different aerosol definitions and classification schemes focusing on some key components, which depend on the limited inputs and specific research purposes. Generally, sea-salt aerosol is considered as coarse particles and neglected in Beijing . Dust and OM have similar light-absorbing characteristics due to the pres-ence of hematite (Formenti et al., 2014). Therefore, they are usually separated by particle size since OM and dust are mostly present in fine-and coarse-mode aerosols, respectively (Schuster et al., 2016;Zhang et al., 2018a). Certainly, fine-mode dust will also be taken into account with sufficient constraints. By assuming the dust volume concentration ratio of fine to coarse mode, Xie et al. (2017) further separated fine-and coarse-mode dust with the aid of spectra refractivity at four wavelengths, sphericity, and single scattering albedos. Besides, they also defined OM as non-absorbing and hydrophobic aerosols to be separated from inorganic salt and absorbing carbon (BC and BrC), while Zhang et al. (2018a) divided the OM into two categories (WIOM and WSOM) to better model complex liquid systems, and BrC is considered as a part of WIOM.
In this paper, considering the limited lidar constraints (only two wavelengths), we only focus on fine-mode aerosol and treat it as a mixture of five components like Zhang et al. (2018a): BC, OM (WIOM + WSOM), AN, and AW, and omit the presence of dust in fine mode to optimize the algorithm performance, although it seems to not cover all possible aerosol types in the atmosphere. Exactly as in Table 1, BC has the largest CRI at different wavelengths, which indicates its strong optical absorption (Mueller et al., 2007;Burton et al., 2012). On the contrary, AN (denoting the inorganic salt such as nitrate and sulfate) is mainly characterized by scattering with the smallest CRI, except for AW Xu and Penner, 2012). Generally, AW content directly depends on the hygroscopic AN at a certain ambient relative humidity (RH), especially in heavy haze episodes . With the properties of spectral absorbing, WIOM is significantly different from BC in water-insoluble matter. While in water-soluble ones, hygroscopicity is the key to distinguishing WSOM and AN. According to the summary in Zhang et al. (2018a), the growth factors of inorganic salts are all above 1.5, much larger than that of WSOM. Thus, it is considered that the aerosol hygroscopicity only comes from AN rather than WSOM in this algorithm to separate them.
Based on the distinct aerosol microphysical characteristics, we retrieve the profiles of the fine-mode aerosol components by constructing the aerosol model and microphysical parameterization schemes. Figure 1 shows the flowchart of our algorithm proposed in this study and the details will be described below.

Fine-mode aerosol properties from GARRLiC
In fact, there have been developed algorithms that combine the sun-/sky-photometer with lidar. For pursuing an even deeper synergy of lidar and sun-photometer, GARRLiC was created by modification of AERONET algorithms to adapt them for inclusion of lidar data (Lopatin et al., 2013). As a part of the extensive Generalized Retrieval of Atmosphere and Surface Properties (GRASP), it can get the property profiles separately for fine-and coarse-mode particles. So far, Table 1. GARRLiC has been applied for the characterization of atmospheric aerosols (Lopatin et al., 2013;Tsekeri et al., 2017;Bovchaliuk et al., 2016) and evaluated by airborne in situ measurements (Benavent-Oltra et al., 2021. Although GARRLiC was able to quantitatively retrieve aerosol components (L. , it still stayed on the columnar level and cannot get the flexible volume proportion of different components vertically. Note particularly that the volume concentration profiles provided by GARRLiC build a bridge between the retrieval of sun-photometer and lidar. In previous lidar algorithms, lidar parameters such as lidar ratio (the ratio of extinction to backscattered coefficients) were employed to avoid the use of volume concentration. But now, due to the accessible volume concentration, complex refractive index (CRI) and volume size distribution (VSD) can be directly used to construct the aerosol model in lidar algorithms based on the Mie theory (Bohren and Huffman, 1998). Consequently, the combination of ground-based remote-sensing technology not only enriches the inversion output but also provides a new idea for lidar inversion.
However, the Mie theory is only applicable to spherical particles, which is in contradiction with the irregular shape of dust aerosols (Mamouri and Ansmann, 2014;Sugimoto et al., 2002). In retrieval algorithms, it is generally assumed that the aerosol size distribution is bimodal and the dust aerosol is distributed in the coarse mode (Nishizawa et al., 2007(Nishizawa et al., , 2011Schuster et al., 2016;Xie et al., 2017;Zhang et al., 2018a). Therefore, in this study, we only focus on the profiles of fine aerosol components based on the outputs of GAR-RLiC, which include the aerosol extinction and volume concentration profiles in the fine mode. Similar to AERONET , the radius of 0.576 µm was used as a separation point in GARRLiC. According to the field experiments, the retrieved fine-mode aerosol components, including BC, WIOM, WSOM, and AN, were almost distributed in PM 1 (particles with the aerodynamic diameter less than 1 µm) (Liu et al., 2020;Reddington et al., 2013;Zhang et al., 2018b). To some extent, the fine modal truncation radius of 0.576 µm is reasonable for inversion.

Aerosol modeling
In the actual atmosphere, the internal mixing of aerosols is very common due to aerosol collision, condensation, and chemical reactions. Generally, the Maxwell Garnett (MG) mixing rule is more appropriate for the mixture of waterinsoluble matter embedded in the host environment (Choi and Ghim, 2016;Dey et al., 2006;Schuster et al., 2005). The effective permittivity of the mixture ε mix can be expressed as follows: where f j is the volume fraction of water-insoluble components: BC and WIOM. Here, the host environment represents the mixture of water-soluble matter, including AN, WSOM, and AW. The effective permittivities of the host environment and insoluble matter are ε host and ε j , respectively, which can be calculated from the corresponding CRIs by Eq. (2): where m and ε are the CRI and effective permittivity, respectively. For the CRI of the host environment m host , which refers to the water-soluble matter, can be obtained by the volume-averaged (VA) mixing rule to strengthen the physical constraints between multi-component liquid systems (Zhang et al., 2018a).
where m j and f j are the CRI and volume fraction of soluble components, respectively. Then the CRI of the aerosol mixture m mix can be acquired by combining Eqs.
(1)-(3). In addition to CRI, VSD is the other requirement for the Mie theory. Here, the normalized VSD of each component can be simulated according to the lognormal distribution parameters in Table 1, which are all in the dry state. Considering the hygroscopicity of AN, the growth factor is introduced to fit the AN normalized VSD under ambient RH (AW is taken into account at the same time). Then, we can model the normalized VSD of the aerosol mixture based on the assumed component volume fraction f j as follows: where dV N (lnr) dlnr is the normalized VSD of the aerosol mixture and dV j (lnr) dlnr is the normalized VSD of component j . It can be expressed by Eq. (5): where σ j and r j are the geometric standard deviation and mean radius of component j , respectively, which are listed in Table 1.
Combining the fine-mode volume concentration profiles V (h) from GARRLiC, the extinction coefficients at different wavelengths and levels σ m (λ, h) can be modeled according to the Mie theory: where Q ext is the Mie efficiency factor, which is related to lidar wavelength, particle size, and CRI (Bohren and Huffman, 1998); dV (lnr) dlnr can be obtained by V (h) dV N (lnr) dlnr . Finally, the residual between modeled extinction σ m and fine-mode extinction from GARRLiC σ c is quantified by the iterative kernel function χ 2 to find the optimal combination of component volume fractions: where g (λ, h) is the relative fitting residual between lidar measurement and modeled lidar signal from GARRLiC at different wavelengths, which is added to avoid the interference of the uncertainty resulting from GARRLiC modeling. Further, the component volume fractions can be transformed to the mass concentrations M j (h) by the density (ρ j ) of aerosol component j :

Microphysical parameterization scheme
The matched number of input parameters and the output aerosol types is the prerequisite for a reasonable aerosol model. Due to the limitation of lidar wavelengths, the input parameters of lidar are not as many as sun-photometer. Therefore, the aerosol parameterization scheme should be constructed to establish the relationship between aerosol components, thereby reducing the number of unknowns. In our algorithm, we separated water-soluble and waterinsoluble aerosols firstly by the parameterization scheme of Zhang et al. (2018a), which was re-parameterized with RH based on Schuster et al. (2009). The volume ratio of water-F. Wang et al.: Retrieval of the aerosol components profiles insoluble to water-soluble matter can be expressed as follows: where f i and f s are the water-insoluble and water-soluble volume fractions respectively. ϕ (RH) is the re-parameterized part of the function with RH. ε (D) is the climatological function of water-soluble volume fraction and D is the aerosol diameter. ε 0 , ε v , d 0 and σ log are the average fitting parameters in Kandler and Schuetz (2007), which can represent the general aerosol properties. Moreover, f i +f s = 1 is an important guarantee for the success of retrieval.
For the water-soluble matter, we assumed that AN was the only hygroscopic component as mentioned in Sect. 2.1.1. For enhancing the interaction between AN and AW, the relationship between solute mass concentration and water activity was applied in our algorithm, which was investigated in Tang (1996). And the volume ratio of AN to AW can be obtained by combining the Eqs. (12)-(15): where a w is the water activity, which can be approximately regarded as RH due to the lower curvature effect (Tang, 1996). ρ s is the density of solution and x is the weight percent of AN. C k and A k are the polynomial coefficient of ammonium nitrate from Tang (1996), which is considered as the representative of inorganic salt. f AN and f AW are the volume fractions of AN and AW respectively. With that, the growth factor (GF) of AN can also be acquired, which plays a vital role in the aerosol normalized volume distribution model of Sect. 2.1.3.
where r dry is the dry particle radius; r wet is the particle radius under the ambient RH. Fine-mode aerosols are categorized into water-insoluble aerosols and water-soluble aerosols according to the above relationship, which can be quantified with the help of the climatological parameterization scheme in Eq. (9). For water-insoluble aerosols (BC and WIOM), once the volume fraction of one is known, the volume fraction of other is determined. Meanwhile, for water-soluble aerosols (AN, AW, and WSOM), the relationship between AN and AW is established by Eq. (15), so that AN and AW can be considered as a whole. Thus, only two unknowns, one from watersoluble species and the other from water-insoluble species, are enough to achieve our requirement. In our algorithm, we iteratively changed the volume fractions of WIOM and WSOM and those of AN, AW, and BC can be correspondingly obtained. What's more, we also constrained the relationship between WSOM and WIOM to ensure the reliability of inversion. The ratio of WSOM mass concentration to the total OM is limited to 0.44 to 0.77, which has been applied in Zhang et al. (2018a) according to the statistics of observation experiments.
In summary, as presented in Fig. 1, if the volume fractions of WIOM and WSOM are initialized, the other species would be determined with the aid of parameterization schemes. Then the extinction coefficient can be calculated by the constructed aerosol model. Through multiple iterations and the constraints of fine-mode extinction coefficients from GAR-RLiC, the optimal combination of volume fractions will be found. Subsequently, the optimal mass concentration results are compared with surface components measurements and model products, including OM, BC, and AN, which verifies the inversion performance of our algorithm. Besides, the possible sources of error are discussed and the uncertainties from these sources are assessed in Sect. 3.2.

The input data of GARRLiC
The input data of GARRLiC consist of sun-photometer sun and sky radiance, and lidar signals. Here, the sun-photometer measurements were from the Beijing station (39.977 • N, 116.381 • E) of the AERONET (Aerosol Robotic Network, https://aeronet.gsfc.nasa.gov/, last access: 28 September 2022) in February of 2021. The sky radiance (raw almucantar with 26 scattering angles) and the version 3 level 1.5 product (i.e., automatically cloud cleared but may not have final calibration applied) of total optical depth (TOD) at 440, 675, 870, and 1020 nm were applied to drive GARRLiC. Besides, the AERONET products of fine aerosol optical depth (AOD) and fine volume concentration were employed to validate the outputs from GARRLiC.
For the available sky radiance sequence, the correlative lidar signal data were chosen from a dual-wavelength elastic lidar in the corresponding ±15 min time window, which was set up on the roof of a 28 m high building in the tower of the Institute of Atmospheric Physics at the Chinese Academy of Sciences (39.976 • N, 116.378 • E). The normalized lidar signals at 532 and 1064 nm were used to run the GARRLiC with the sun-photometer data. In advance, the lidar signals were averaged for 15 min and computed for 60 log-spaced heights between 150 and 6000 m above the ground to avoid the instrumental error just as the Eq. (17): where P k is the normalized lidar signal and P k is the raw averaged lidar signal. The upper and lower height limits are represented by Z max and Z min , respectively. Besides, for the accuracy of GARRLiC, the cases with the relative residual larger than 15 % in the inversion process have been eliminated according to Benavent-Oltra et al. (2021). Consequently, there were 133 retrievals remaining in February of 2021.

Relative humidity data
The vertical profile of relative humidity (RH) data used in our algorithm was interpolated linearly from the European Centre for Medium-Range Weather Forecast (ECMWF) Reanalysis v5 (ERA5) hourly data from 1000 to 300 hPa, which has been verified by the sounding data from the University of Wyoming (http://weather.uwyo.edu/upperair/bufrraob.shtml, last access: 20 September 2022) in Fig. S1 of Supplement.

Components data
The mass concentrations of aerosol components near the surface on 8-15 February 2021, including water-soluble inorganic salt, BC, and OC, were provided by the China National Environmental Monitoring Center to validate the results of the retrieved components. Besides, the Nested Air Quality Prediction Model System (NAQPMS), a three-dimensional chemistry transport model developed by the Institute of Atmospheric Physics (IAP; Li et al., 2012), was also employed to verify the reliability of estimated component profiles. The meteorology field was provided by the Weather Research and Forecasting model (WRF), which is driven by Final Analysis data (FNL) from the National Center for Environmental Prediction (NCEP). The outputs of the NAQPMS used in this paper have been assimilated through the Parallel Data Assimilation Framework (PDAF) system, which has a fairly good correlation with measurements (Wang et al., 2022). For comparable aerosol components, we used the sum of the water-soluble inorganic salt from surface measurements and NAQPMS products, such as sulfate, nitrate, and ammonium, were used to compare with AN. Due to the limited available data, the mass concentration of OC multiplied by the conversion factor of 1.7 (Bürki et al., 2020) was considered as the observed organics to compare with the total retrieved OM (WIOM + WSOM). The estimated BC can be directly validated by the observation data and model products. In this study, in addition to the correlation coefficient (R), two statistics of root-mean-square error (RMSE) and normalized mean bias (NMB) were introduced to evaluate the algorithm performance, which can be expressed as follows: where X r represents different aerosol components of BC, AN, and OM; n is the sample size; X o is the corresponding components from surface measurements and NAQPMS products. As an index to measure the deviation from true values (Wang et al., 2021), NMB > 0 indicates the overestimation of estimated results -the larger the value, the greater the overestimation.
3 Results and discussion

Evaluation of the outputs from GARRLiC
Since GARRLiC provides the input and constraints for our algorithm, whether the GARRLiC outputs are reliable directly determines the accuracy of the component inversion. Therefore, the GARRLiC outputs have to be validated based on the products of AERONET, which are widely used in the validation of remote-sensing results (Che et al., 2009). Due to the unavailable volume concentration profile of AERONET, Fig. 2a presents the comparison of fine columnar volume concentration between GARRLiC and AERONET. It's clear that the correlation coefficient can be up to 0.94 and RMSE was only 0.017. The mean percentage error (MPE), which is the average percent of error from the truth, was about 42 %. This deviation was acceptable since the estimated uncertainty for CRI in the Level 2 AERONET products is about 50 % . Moreover, the extinction coefficients from GARRLiC were also compared with the results retrieved by the Fernald method (Fernald, 1984) with the lidar ratio of 50 sr (Wang et al., 2020). From Fig. 2b we can see that the two results were highly consistent and the correlation coefficient was close to 1. Figure 2c shows the vertical distribution of extinction coefficients from GARRLiC and lidar. Obviously, the extinction average and standard deviation profiles of the two almost coincided, confirming the validity of the GARRLiC outputs. In fact, the extinction profiles from GARRLiC depend directly on the fine-mode AOD and the aerosol vertical profiles (unit: km −1 ), which are retrieved by lidar signal. Therefore, we validated the fine-mode AOD and fitting lidar signal with AERONET and lidar signal  measurements, respectively. As shown in Fig. 3, not only the fine-mode AOD but also the fitting lidar signal was in good agreement with their respective reference, with the correlation coefficient greater than 0.99. And the total MPEs of fine extinction at two wavelengths were both about 14 %, largely dependent on the fine-mode AOD due to the little error in vertical lidar signal fitting. All of the above analyses indicate that the fine volume concentration and extinction profiles from GARRLiC are reliable enough to drive the component retrieval.

Comparison with surface observations
In order to validate the estimated mass concentration of components, the comparison with observations on 8-15 February 2021 is presented here. During the test experiment period, the number of samples was limited by the availability of AERONET data. As shown in Fig. 4a, the fine volume concentration between GARRLiC and AERONET matched well, with a correlation coefficient of 0.89. The NMB of 0.031 indicated the credible results from GARRLiC; but there was still a slight overestimation at high aerosol load-ing. On 11-12 February 2021, the RH dropped from the peak value companied with decreased extinction coefficients ( Fig. 4b-c), and the RH in the experiment period changed from 20 % to 70 %, which was enough to reflect the general atmospheric situation. Due to the lack of observed component profile, the observed mass concentrations of water-soluble inorganic salt, BC, and OC near the surface were used to verify the remotely sensed results preliminarily. The estimated components from remote sensing at 150 m were employed for the verification. Figure 5 gives the comparable results of AN, BC, and OM between observation and retrieval results at the available time. An encouraging coherence in the variation trend of AN between estimation and observation (R = 0.67) was found although there was an underestimation on 12 February. Besides, there was a better consistency between the estimated and observed BC. The correlation coefficient can be up to 0.91. However, the overestimation of BC was obvious on 12 February, which was just the opposite of AN. This deviation can be attributed to the decreasing RH from 11 to 12 February, which influences the parameterization schemes as mentioned in Sect. 2.1.4. Moreover, when the extinction coefficients changed little (Fig. 4b), the decreased fine volume concentrations after 12 February also had responsibility for the error, which led to the underestimated total mass concentration relative to observation. As shown in Fig. 5c, the overestimation of OM was evident, and the mass concentrations of WSOM were closer to the observation. We should note that the components in the remote-sensing models are not equivalent to the concepts in chemical research (Z. , which is the primary error of comparisons. On the other hand, there must be differences in the mass concentration of aerosol between the surface and 150 m due to the influence of the atmospheric mixing state and emission sources of different aerosol components.

Verification of estimated vertical profiles
Similar to the comparisons with surface observations, the mass concentration of OC multiplied by 1.7 was chosen to compare with OM, and the sum of the nitrate, sulfate, and ammonium salt was used to compare with AN. The estimated AN from remote sensing had the best correlation with that from NAQPMS (R = 0.85 in Table 2), and there was a slight underestimation (NMB = −0.19). The BC and OM correlations were relatively weaker, with a correlation coefficient of ∼ 0.5, which corresponded to the relationship of OM in Zhang et al. (2018a). The deviation can be explained by the different input RH data of our algorithm and NAQPMS. Moreover, the differences in the results from two different principles are reasonable. After all, our classification of aerosol components is based on their optical characteristics. Here, in order to evaluate the performance of vertical profiles, we present two cases with lower and higher aerosol loading in Fig. 6. It can be seen that the mass concentration profiles of aerosol components from remote sensing and NAQPMS were comparable. In the relatively clean condition, OM had a higher mass concentration than AN, and the vertical distribution of the estimated OM was similar to that from NAQPMS, with the smallest deviation. There were fluctuations in both estimated and simulated AN profiles < 1 km. Subsequently, the maximum local concentration of the estimated AN occurred at 2.5 km, while for simulated AN pro- files, it occurred at ∼ 2 km. The two BC profiles had similar distributions. In the case of a higher aerosol load (Fig. 6df), AN estimation performed best according to the simulated AN profile from NAQPMS, but still with a little underestimation. In comparison, the overestimation of OM and BC at ∼ 1 km was obvious. It is noteworthy that the vertical distribution of different aerosol components was synchronous in both remote sensing and NAQPMS. Therefore, the vertical patterns of components depend largely on that of total extinction profiles, which is why the two results from remote sensing and NAQPMS cannot match exactly.

Uncertainty assessment of component estimation
In fact, the uncertainties of component retrieval mainly come from the errors of input parameters, i.e., RH, volume concentration, and extinction coefficients. Among them, RH influences the component estimation indirectly by the parameterization schemes, which are closely related to RH. Zhang et al. (2018a) discussed the uncertainty of f i f s caused by ϕ (RH) and the mean error is about 31.6 % when the RH is no more than 85 %. Since the influence of RH on a parameterization scheme always exists, here, we take 55 % as the critical point of higher and lower RH to evaluate the uncertainty from input microphysical parameters. In this paper, the Monte Carlo method was employed based on the random generation of input parameters by a Gaussian distribution with the original values and errors as mean and standard deviation, respectively. The error of RH was considered as about 10 % according to the uncertainty from ERA5 (Gamage et al., 2020). Additionally, MPEs of the fine volume concentration and extinction coefficient mentioned in Sect. 3.1.1 were applied in the Monte Carlo method. Each input parameter was sampled with 30 iterations at different heights (Mattis et al., 2016). Relative uncertainty was characterized by the ratio of standard deviation to mean values of 30 iteration results. The uncertainties of AN, AW, WSOM, BC, and WIOM from RH are given in Fig. 7a, with the mean values of 34.5 %, 48 %, 16.5 %, 40 %, and 7 % under the low-RH condition, and 24.7 %, 40 %, 57 %, 68.8 %, and 65.8 % under the high-RH condition, respectively. For other parameters, there were similar quantitative relationships of the component estimation uncertainties. The uncertainties of AN and AW at higher RH were smaller than those at lower RH for all parameters. That is because the parameterization scheme described in Sect. 2.1.4 is closer to the actual condition at higher RH (Tang, 1996). On the contrary, the higher RH made the larger uncertainties for WIOM, BC, and WSOM, which may be due to the increasing error of f i f s caused by ϕ (RH) below the RH of 85 % (Zhang et al., 2018a). As shown in Fig. 7b, the larger error of fine volume concentration with 42 % brought greater uncertainty to component estimation. Similarly, with the input CRI varying by more than 1 order of magnitude, Schuster et al. (2016) found that the uncertainty of BrC can change from 50 % to 440 %. Obviously, the estimation of BC was more sensitive to the input parameters. This may be attributed to the smaller amount of BC, the volume fraction of which is 1-2 orders of magnitude less than that of other components. The uncertainties caused by the constraints of the extinction coefficients were mainly below 50 % for different components, which is comparable with the uncertainty of retrievals by remote sensing . It should be noted that the uncertainty of aerosol components, such as BC in emission inventories, can be 200 % and more (Schuster et al., 2005). Therefore, it is valuable to retrieve by our algorithm.
In fact, some errors exist exactly but are difficult to quantify realistically. Just as the assumption of internal mixing does not apply to all situations, so do the microphysical parameters. Cheng et al. (2012) observed that the number fraction of internally mixed soot in total soot particles had pronounced diurnal cycles. When the aging process converts externally mixed soot into internally mixed ones, emissions tend to emit more fresh and externally mixed soot particles. Another unquantifiable error is from the Mie theory based on the spherical hypothesis, which idealizes aerosol particles. However, the uncertainties related to assumptions are endemic to all retrievals by remote sensing, as well as the chemistry transport models . We should mention that field measurement also cannot avoid inconsistent assumptions.

Optical closure test
Based on the measurement data of lidar and sun-photometer in February of 2021, the mass concentration profiles of aerosol components in Beijing were retrieved. Figure 8 shows a quantitative optical closure test under different RH conditions to validate the consistency between recovered extinction and the constraints from GARRLiC. It can be seen that the modeled extinctions at 532 and 1064 nm both had a good correlation with the reference values. The correlation coefficients were both close to 1. However, there were still some large residuals at the two wavelengths, especially at the RH between 70 %-80 %. It seems to underestimate the extinction when the RH was larger than 70 %. That is probably because the water-insoluble fraction is limited at high RH, and BC in water-insoluble matter tends to contribute greatly to extinction. On the contrary, the overestimation of 532 nm at the RH of about 30 % can be attributed to the larger proportion of water-insoluble matter. We should realize that the parameterization scheme of water-soluble and water-insoluble matter may have trouble in reflecting the real atmosphere situation. But for now, there are still not sufficient observation experiments to construct a more realistic scheme. Moreover, the added constraint of the relationship between WIOM and WSOM can also limit the BC fraction, although ignoring the constraint could bring about a well-matched closure result but might lead to unreasonable component volume fractions. Figure 9 shows two typical cases under different situations. It can be seen that the vertical distribution of aerosol components can be quantified even with extremely low aerosol loading (Fig. 9a). Under the clean condition, aerosols were mainly distributed below 3 km with different patterns of component profiles. There were similar peak values in the mass concentrations of AN and WIOM but at different heights. The mass concentration of AW was the smallest throughout the vertical direction due to the low RH. As shown in Fig. 9b, the two local maximums of RH being about 40 % appeared at about 700 m and 2.4 km, respectively, which was consistent with extinction profiles below 3 km. Besides, there was a fairly good relationship between the optical fit extinc-  (d-f) same as (a)-(c) but for 12:29 LT on 26 February 2021, which was under polluted conditions. tions and inputs from GARRLiC with a mean relative error of 4.63 % at 532 nm and −1.99 % at 1064 nm (Fig. 9c).

Vertical profiles of aerosol components in Beijing
In the polluted case, aerosol components were concentrated below 1 km due to the weak atmospheric diffusion capacity as shown in Fig. 9d. The maximum mass concentration of AN occurred near the surface, being about 125 µg m −3 . Subsequently, there was a decreasing trend with a fluctuation between about 300 to 600 m, while the mass concentration of WIOM and WSOM peaked at 672 m and were only about 10 µg m −3 near the ground. According to Lou et al. (2017), sulfate and nitrate, transformed from SO 2 and NO 2 , were mainly responsible for the fine particle pollution at the RH of about 70 %, which explains the high proportion of AN in Fig. 9d. Generally, pollution is usually accompanied by high RH. As shown in Fig. 9e, the maximum RH matched the large value of extinction below 1 km well. Moreover, the well-recovered extinction profiles at two lidar wavelengths indicated the stability of our algorithm.

Conclusions
By combining ground-based lidar and sun-photometer, we develop a new algorithm to get the vertical profiles of finemode aerosol components, including black carbon (BC), water-insoluble organic matter (WIOM), water-soluble organic matter (WSOM), ammonium nitrate-like (AN), and fine aerosol water (AW) content, which increases the retrieved aerosol types from the dual-wavelength Mie lidar. On this basis, the vertical profiles of aerosol components obtained in February 2021 in Beijing are retrieved and compared with in situ measurements and simulated results from NAQPMS, proving the validity of our component estimation. There is the best consistency between the estimated and observed BC with a correlation coefficient up to 0.91. The trend of AN between estimation and observation is accordant but with a little underestimation. Meanwhile, compared with the simulated results, the retrieved AN from remote sensing had the best correlation (R = 0.85) with a slight underestimation (NMB = −0.19). The BC and OM correlations were relatively weaker, with a correlation coefficient of ∼ 0.5. The vertical distribution of different aerosol components was synchronous in both remote sensing and NAQPMS. Considering the distinct principles, the differences between remote sensing and simulated results are reasonable to some extent. In addition, the reliability of the retrieval algorithm is also verified by the well-recovered extinction coefficients in the quantitative optical closure test.
Based on the products of AERONET, the mean errors obtained with respect to the input parameters are introduced to assess the uncertainty of component estimation by the Monte Carlo method. The uncertainties caused by extinction coefficients are mainly less than 50 % for different components. Additionally, increasingly better component estimation results can be obtained with the increasing accuracy of the in-put parameters. However, errors from the assumptions, such as internal mixing and spherical hypothesis, are difficult to quantify realistically. Further, we should mention that the assumptions are endemic to all retrievals by remote sensing. Meanwhile, the parameterization schemes and aerosol microphysical parameters used in the algorithm, which vary over time and place, still need to be improved by conducting sufficient observation experiments. In the future, the distinguishable aerosol types will be increased by upgrading parameterization schemes, employing more lidar wavelengths, and considering the irregular shape of dust.
Data availability. All data in this manuscript are freely available upon request through the corresponding author (tingyang@mail.iap.ac.cn).
Author contributions. FW and TY designed the whole structure of this work. FW analyzed the data and wrote the manuscript. HW provided the components data from NAQPMS. TY, XC, and ZW helped polish the manuscript. YS provided constructive suggestions for the revision of the paper; JL, GT and WC provided aerosol components observation data to verify and improve the algorithm.
Competing interests. The contact author has declared that none of the authors has any competing interests.
Disclaimer. Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Review statement. This paper was edited by Cheng Liu and reviewed by Francisco Molero and one anonymous referee.