Articles | Volume 19, issue 10
https://doi.org/10.5194/amt-19-3539-2026
© Author(s) 2026. This work is distributed under the Creative Commons Attribution 4.0 License.
Extraction of spatially confined small-scale waves from high-resolution all-sky airglow images based on machine learning
Download
- Final revised paper (published on 29 May 2026)
- Supplement to the final revised paper
- Preprint (discussion started on 13 Oct 2025)
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on egusphere-2025-4611', Anonymous Referee #1, 13 Nov 2025
- AC1: 'Reply on RC1', Sabine Wüst, 29 Jan 2026
-
RC2: 'Comment on egusphere-2025-4611', Anonymous Referee #2, 15 Nov 2025
- AC2: 'Reply on RC2', Sabine Wüst, 29 Jan 2026
Peer review completion
AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
AR by Sabine Wüst on behalf of the Authors (13 Feb 2026)
Author's response
Author's tracked changes
Manuscript
ED: Referee Nomination & Report Request started (22 Feb 2026) by Jorge Luis Chau
RR by Anonymous Referee #1 (11 Mar 2026)
RR by Anonymous Referee #2 (17 Mar 2026)
ED: Publish subject to technical corrections (17 Mar 2026) by Jorge Luis Chau
AR by Sabine Wüst on behalf of the Authors (10 Apr 2026)
Author's response
Manuscript
This manuscript presents a method for detecting small-scale airglow wave structures using a modified YOLOv7. The paper is well written, scientifically sound, and a welcome contribution. A few concerns regarding the methodology need to be addressed, however. Below are my itemized comments.
Pretraining using BYOL may not be particularly beneficial for this application, and the manuscript provides no evidence that BYOL improves performance. Nevertheless, the authors should at least include further details on the implementation of BYOL and YOLOv7. In particular:
Has the author tried using YOLOv7 in its original form and compared the numbers?
This is not an appropriate way to report regression performance. Regression tasks should be evaluated using continuous error metrics such as MSE or RMSE, and wavelength and orientation should be reported separately with their respective error distributions. Using a binary threshold to count predictions as “correct” obscures the actual performance and does not provide enough information to assess model accuracy.
The reported performance is subpar for a task that should not be particularly difficult for a modern neural network. This suggests there might be issues with the data, the net config, and/or training. I suggest that the authors retrain the network without the additional regression features, expand the training data if possible, and include a validation set. If the size of the training dataset is the main constraint, using the testing set as the validation set and reporting the validation metrics is also acceptable.
The orientation and wavelength can be handled much more effectively by a dedicated CNN or ViT that processes the image content within the bounding box. Or even better, a DETR-based model would be more suitable for predicting both the bounding box and the orientation. However, adapting the method to DETR would require substantial additional work and is not strictly necessary here.
This is not a fair comparison. The 2D-FFT results are evaluated using an error threshold of 2.5° for orientation and 3 percent for wavelength, while the YOLOv7 results are evaluated using a much looser threshold of 10° and 10 percent. Because the criteria differ by a large factor, the “78 percent correct” numbers for the two methods cannot be directly compared.
While I understand the authors are trying to show that 2D-FFT performs better on normal images, the comparison is still pretty weird. It would be better to compare both methods under the same benchmark. MSE or RMSE is the standard metrics for regression tasks like these.