Articles | Volume 19, issue 9
https://doi.org/10.5194/amt-19-3095-2026
© Author(s) 2026. This work is distributed under the Creative Commons Attribution 4.0 License.
A hybrid optimal estimation and machine learning approach to predict atmospheric composition
Download
- Final revised paper (published on 11 May 2026)
- Preprint (discussion started on 07 Oct 2025)
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on egusphere-2025-4864', Daniel Miller, 22 Jan 2026
- AC1: 'Reply on RC1', Frank Werner, 26 Mar 2026
-
RC2: 'Comment on egusphere-2025-4864', Anonymous Referee #2, 05 Feb 2026
- AC2: 'Reply on RC2', Frank Werner, 26 Mar 2026
Peer review completion
AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
AR by Frank Werner on behalf of the Authors (26 Mar 2026)
Author's response
Author's tracked changes
Manuscript
ED: Referee Nomination & Report Request started (27 Mar 2026) by Zhao-Cheng Zeng
RR by Anonymous Referee #2 (20 Apr 2026)
ED: Publish as is (20 Apr 2026) by Zhao-Cheng Zeng
AR by Frank Werner on behalf of the Authors (20 Apr 2026)
Manuscript
Review AMT - egusphere-2025-4864
Title: A hybrid optimal estimation and machine learning approach to predict atmospheric composition
First author: Frank Werner
Summary
This paper describes the development and data product of HYbrid REtrieval Framework (HYREF), which predicts sub-column carbon monoxide (CO) concentrations from Cross-track Infrared Sounder (CrIS) observations. This model was trained on the lower spatial resolution optimal estimation (OE) retrievals from TRopospheric Ozone and its Precursors from Earth System Sounding (TROPESS). The resulting machine learning (ML) data product for CrIS combines high spatial resolution and characterization of degrees of freedom and retrieval errors – which are critical to the ability to compare datasets to other observations, models, and use in data assimilation.
Overall Feedback
I think that this paper is great and worthy of publication with only minor revisions. Most of the feedback below consists of recommendations about different analytical techniques that may help improve the paper.
One specific point of feedback is a rather general critique of the value of the simple linear regression analysis for statistically comparing similar datasets. In the limit that correlations approach 1 linear regression plots start to provide limited visual evaluation ability – or to put it more bluntly everyone has seen a good looking regression and they often look less than usefully similar. There is a more robust approach to construct comparisons like this known as the “Bland-Altman Plot”, that also helps to incorporate additional statistical information about the data and better poses the fundamental question: “could variable B statistically replace variable A”. Figure 2 in your paper currently does a reasonable job of displaying the other dimensions that can matter for such a regression; given that they display retrievals, errors, and degrees of freedom.
I would recommend at least looking at Bland-Altman plots in response to this review and potentially including such analysis in the paper itself. In particular, Bland-Altman offers a better visual framework for handling data comparison tasks if the distribution of data is not linearly distributed or has uniquely variable uncertainty in either of the compared datsets. This is particularly true for variables with non-gaussian variability (e.g., logarithmic distributed variables such as optical thickness) or heteroscedastic uncertainty/variability. It looks to me as though these considerations might matter for the datasets in panels a,b, and c of Figure 2 – whereas panel d appears clearly normally distributed at all scales. One further relevant concern here is that neural network architectures such as yours are largely tuned toward gaussian process prediction and can struggle (without adequate consideration) to handle heteroscedastic variability in datasets because of the common isotropic noise assumption [Stirn et al., 2022].
An example of demonstrating a situation where analysis with Bland-Altman can significantly improve your analytical toolkit can be found in Knobelspiesse, et al. (2019). This paper explores an instrument intercomparison for radiometric polarimeters – which exhibit non-gaussian distributions in observed radiances as well as heteroscedastic variability in in the degree of linear polarization (DoLP) uncertainty. The example therein is discussed in section 3.C and summarized visually in Figure 8 and Figure 9. The links below summarize the methodology and has a python notebook demonstrating examples.
https://github.com/knobelsp/BlandAltman?tab=readme-ov-file
https://colab.research.google.com/github/knobelsp/BlandAltman/blob/main/BlandAltman.ipynb
Furthermore, as a ML retrieval example of how heteroscedasticity can cause issues with application of machine learning methods - the cloud microphysics retrievals in Miller et al., 2020 struggle to handle retrievals across the whole range of variability of the retrieval datasets. This is because of the statistical distributions of radiances and DoLP have rather heteroscedastic dependencies on the geophysical variables attempting to be retrieved.
Specific Feedback
Citations
Stirn, A., Wessels, H.-H., Schertzer, M., Pereira, L., Sanjana, N. E., and Knowles, D. A., “Faithful Heteroscedastic Regression with Neural Networks”, arXiv e-prints, Art. no. arXiv:2212.09184, 2022. doi:10.48550/arXiv.2212.09184.
Knobelspiesse K, Tan Q, Bruegge C, Cairns B, Chowdhary J, van Diedenhoven B, Diner D, Ferrare R, van Harten G, Jovanovic V, Ottaviani M, Redemann J, Seidel F, Sinclair K. Intercomparison of airborne multi-angle polarimeter observations from the Polarimeter Definition Experiment. Appl Opt. 2019 Jan 20;58(3):650-669. doi: 10.1364/AO.58.000650. PMID: 30694252; PMCID: PMC6996873.
Miller, D. J., Segal-Rozenhaimer, M., Knobelspiesse, K., Redemann, J., Cairns, B., Alexandrov, M., van Diedenhoven, B., and Wasilewski, A.: Low-level liquid cloud properties during ORACLES retrieved using airborne polarimetric measurements and a neural network algorithm, Atmos. Meas. Tech., 13, 3447–3470, https://doi.org/10.5194/amt-13-3447-2020, 2020.