Articles | Volume 19, issue 3
https://doi.org/10.5194/amt-19-1059-2026
© Author(s) 2026. This work is distributed under the Creative Commons Attribution 4.0 License.
Improved estimation of diurnal variations in near-global PBLH through a hybrid WCT and transfer learning approach
Download
- Final revised paper (published on 16 Feb 2026)
- Supplement to the final revised paper
- Preprint (discussion started on 15 Oct 2025)
- Supplement to the preprint
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on egusphere-2025-4918', Anonymous Referee #1, 07 Nov 2025
- AC1: 'Reply on RC1', Yarong Li, 11 Jan 2026
-
RC2: 'Comment on egusphere-2025-4918', Anonymous Referee #2, 23 Nov 2025
- AC2: 'Reply on RC2', Yarong Li, 11 Jan 2026
Peer review completion
AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
AR by Yarong Li on behalf of the Authors (12 Jan 2026)
Author's response
Author's tracked changes
Manuscript
ED: Referee Nomination & Report Request started (20 Jan 2026) by Meng Gao
RR by Anonymous Referee #2 (22 Jan 2026)
RR by Anonymous Referee #3 (31 Jan 2026)
ED: Publish as is (31 Jan 2026) by Meng Gao
AR by Yarong Li on behalf of the Authors (08 Feb 2026)
This study reports a new hybrid approach that combines the wavelet covariance transform (WCT) with a transfer-learning deep residual network to estimate the diurnal evolution of planetary boundary layer height (PBLH) at near-global scale from the non-sun-synchronous CATS spaceborne lidar. The proposed transfer-learning strategy is both novel and practical, effectively leveraging the large-sample coverage of reanalysis products and the high accuracy of radiosonde measurements. The methodology is sound, the experiments are comprehensive, and the results clearly demonstrate substantial performance gains over conventional algorithms. The findings are valuable for improving boundary-layer parameterizations and advancing our understanding of global PBL diurnal variability, and they fall well within AMT’s scope.
However, several aspects require deeper discussion and additional evidence to further strengthen the reliability of some results and enhance the paper’s scientific contribution and technical impact. I therefore recommend acceptance after minor revision.
Specific comments and suggestions
Pretraining with MERRA-2-constrained pseudo-labels inherently injects reanalysis systematic biases into the learned representation. Even after fine-tuning with 4,662 radiosonde-matched samples, residual biases may persist (as also suggested by the closer agreement of the pretrained model with MERRA-2). Please discuss and, if possible, quantify this effect and its impact on the final estimates.
The manuscript states that 2016 data are used for pretraining and that the transfer stage uses a 4,000/662 split, but it does not clarify whether the test set is strictly separated by station and time window. To avoid information leakage from adjacent or same-station samples, please clarify the split strategy and consider a station- and season-stratified (or leave-one-site-out) evaluation.
You conclude that the model exhibits a weaker afternoon decay and better agreement with radiosondes, while morning correlations are slightly lower but accuracy is higher (smaller bias). I recommend a joint assessment of statistical consistency (e.g., R, MAE) and physical consistency (e.g., decay rate after the diurnal peak and the timing of the peak). This would help reconcile performance metrics with expected PBL diurnal physics.
The diagnosis for reduced performance over high-elevation and desert regions is reasonable, but quantitative uncertainty information is missing. Please provide uncertainty maps and/or tables, for example seasonal and hourly MAE/bias boxplots specifically for these regions.
Since “candidate PBLH” and temperature together contribute >50% of the importance, with LST/elevation next and TAB/WCT shape metrics relatively low, the conclusions should more explicitly articulate the implication for classical algorithms: rather than further refining profile-shape heuristics, incorporating thermodynamic and terrain-related diagnostics appears more beneficial.
In Figure 10, please annotate each land-cover curve with the peak time and amplitude to aid interpretation. For Table 1 (hourly R/MAE/RMSE), consider adding 95% confidence intervals or bootstrap-based uncertainty bands.
Although 480 m is identified as the optimal dilation, it would be helpful to include in the Supplement a systematic comparison table showing (i) five-peak hit rates under different dilation factors and (ii) “largest-peak only” vs. “multi-peak candidate” performance.