This work is distributed under the Creative Commons Attribution 4.0 License.
A cloud screening algorithm for ground-based sun photometry using all-sky images and deep transfer learning
Abstract. Aerosol optical depth (AOD) is used to characterize aerosol loadings within Earth’s atmosphere. Sun photometers measure AOD from the Earth’s surface based on direct-sunlight intensity readings by spectrally narrow light detectors. However, when the solar disk is partially obscured by cloud cover, sun photometer measurements can be biased due to the interaction of sunlight with cloud constituents. We present a novel deep transfer learning model trained on all-sky images to support more accurate AOD retrievals. We used three independent image datasets for training and testing: the novel Northern Colorado All-Sky Image (NCASI) dataset, the Whole Sky Image SEGmentation (WSISEG) dataset, and the METCRAX-II dataset from the National Center for Atmospheric Research (NCAR). We visually partitioned all-sky images into three categories: 1) clear sky around the solar disk, 2) thin cirrus obstructing the solar disk, and 3) thick, non-cirrus clouds obstructing the solar disk. Two-thirds of the images were allocated for training and one-third for testing. We trained models based on all possible combinations of the training sets. The best-performing model successfully classified 95.5 %, 96.9 %, and 89.1 % of testing images from the NCASI, METCRAX-II, and WSISEG datasets, respectively. Our results demonstrate that all-sky imaging with deep transfer learning can be applied toward cloud screening, which would aid ground-based AOD measurements.
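The abstract gives no implementation details. As a rough illustration only, the following sketch shows how a VGG-16 transfer learning classifier (the architecture named in the discussion below) could be set up for this three-class task; the framework choice (TensorFlow/Keras), input size, directory layout, and training settings are all assumptions, not taken from the preprint.

```python
# Illustrative sketch only: VGG-16 transfer learning for three-class
# all-sky image classification. Framework, input size, directory layout
# (images/<class>/*.jpg), and hyperparameters are assumptions.
import tensorflow as tf
from tensorflow.keras import layers, models
from tensorflow.keras.applications import VGG16
from tensorflow.keras.applications.vgg16 import preprocess_input

# Pretrained convolutional base, frozen so only the new head is trained.
base = VGG16(weights="imagenet", include_top=False, input_shape=(224, 224, 3))
base.trainable = False

model = models.Sequential([
    base,
    layers.Flatten(),
    layers.Dense(256, activation="relu"),
    layers.Dropout(0.5),                    # guards against overfitting on a small dataset
    layers.Dense(3, activation="softmax"),  # clear / thin cirrus / thick cloud
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

# Two-thirds training / one-third testing, as stated in the abstract.
train_ds = tf.keras.utils.image_dataset_from_directory(
    "images", validation_split=1/3, subset="training",
    seed=42, image_size=(224, 224))
test_ds = tf.keras.utils.image_dataset_from_directory(
    "images", validation_split=1/3, subset="validation",
    seed=42, image_size=(224, 224))

# VGG-16 expects its own channel-wise preprocessing.
train_ds = train_ds.map(lambda x, y: (preprocess_input(x), y))
test_ds = test_ds.map(lambda x, y: (preprocess_input(x), y))

model.fit(train_ds, epochs=10)
print(model.evaluate(test_ds))
```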
Withdrawal notice
This preprint has been withdrawn.
Interactive discussion
Status: closed
RC1: 'Comment on amt-2022-217', Anonymous Referee #1, 27 Sep 2022
The manuscript by Wendt et al. uses a deep transfer learning model to develop a cloud screening algorithm for ground-based sun photometry applications. All-sky images from three different sites are used for training and testing, and the images are classified as clear, thin cirrus, or thick clouds. The algorithm achieves hit rates of about 90 % for the three datasets. The manuscript is overall reasonably presented. However, as for any model based on deep learning, there are still some major problems with the model development and validation. My detailed comments are listed below.
1. A total of approximately 1500 images is used for both training and testing, which is very small for deep learning model development. With such a small number of samples, overfitting is a likely problem, as demonstrated by the significant drop in accuracy across datasets in Table 2 (a minimal sketch of this cross-dataset evaluation protocol follows these comments). There are other problems related to small sample size as well. Thus, I do not think such a small sample size can yield any solid deep-learning-based algorithm.
2. Images from three different sites are used. What are the essential differences among them during training and testing? It is noticeable that the fractions of images in the different classes (clear, cirrus, or cloud) differ considerably. Again, this further demonstrates the insufficiency of the datasets for deep learning.
3. How are the prepared images labeled before learning? As noted in Section 2.2, the procedure is automated. If this is true, the authors have already developed a physical model for the classification, and it is no longer necessary to develop a deep learning model: if the physical model described in Section 2.2 is taken as the truth, the deep-learning-based one can never beat it. Thus, the preparation of the image classification has to be better discussed.
4. Neither the model development nor the results are discussed in detail. For an AMT article, readers expect sufficiently detailed techniques and results to fully understand and reproduce the methods. The current manuscript is very concise, which makes it difficult to evaluate the method.
5. The manuscript mostly discusses the authors' own algorithm. How does the current algorithm compare with traditional ones? Does the deep learning algorithm show any advantages over conventional approaches? In other words, the current model should be compared with similar ones.
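For context on comment 1: the abstract states that models were trained on all possible combinations of the three training sets and tested against each dataset's held-out third. A minimal sketch of what such a cross-dataset evaluation loop might look like follows; the directory layout (`<dataset>/<split>/<class>/`) and the `build_model` helper (e.g., the frozen VGG-16 model sketched above) are assumptions, not taken from the preprint.

```python
# Sketch of the cross-dataset evaluation protocol described in the
# abstract: one model per non-empty combination of training sets,
# each evaluated on every test set to expose cross-dataset accuracy drops.
from functools import reduce
from itertools import combinations
import tensorflow as tf

DATASETS = ["NCASI", "WSISEG", "METCRAX-II"]

def load_split(name, split):
    # Hypothetical layout: <dataset>/<split>/<class>/*.png
    return tf.keras.utils.image_dataset_from_directory(
        f"{name}/{split}", image_size=(224, 224))

results = {}
for r in range(1, len(DATASETS) + 1):
    for combo in combinations(DATASETS, r):
        train_ds = reduce(lambda a, b: a.concatenate(b),
                          [load_split(d, "train") for d in combo])
        model = build_model()  # hypothetical helper, e.g. the VGG-16 sketch above
        model.fit(train_ds, epochs=10)
        # Accuracy on each test set, including datasets never seen in training.
        results[combo] = {d: model.evaluate(load_split(d, "test"))[1]
                          for d in DATASETS}

for combo, accuracies in results.items():
    print(combo, accuracies)
```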
Citation: https://doi.org/10.5194/amt-2022-217-RC1
RC2: 'Comment on amt-2022-217', Anonymous Referee #2, 10 Oct 2022
Review of “A cloud screening algorithm for ground-based sun photometry using all-sky images and deep transfer learning”, by Eric A. Wendt et al.
The manuscript introduces a new machine learning approach for cloud screening in sun photometer measurements using an additional low-cost all-sky camera. It describes in some detail the setup of the camera system and the machine learning approach. A training data set from three different camera systems is presented, including a validation using parts of the data not used for training.
Major points of criticism:
- In my opinion there is a lack of motivation for the introduction of this system. It might be low cost, but it still requires additional instrumentation, while cloud screening in sun photometer data is usually done using the sun photometer data itself (by spatial or temporal variation tests, e.g., for the AERONET network; a toy sketch of such a variability test follows the specific comments below).
- The manuscript presents only a limited validation of the method and no comparison to established methods.
- The assumed better instrument independence of your approach, compared to standard methods, is at least questionable unless you show clear evidence. You are using compressed camera images of variable quality and, in the end, state yourself that this has a clear effect.
- At the same time, I doubt that the remaining presentation, i.e., the setup of a low-cost camera system from standard parts and the adjustment of an existing machine learning technique for general imagery (VGG-16, University of Oxford) to the all-sky image cloud detection task, is sufficient to justify a scientific publication in an atmospheric science journal like AMT.
In the present form I recommend the rejection of the manuscript. Resubmission after extension of the validation and comparison to other methods could be interesting.
More specific:
- Reading the first part of the introduction, I already ask myself why you do not announce a comparison of your new method with a standard one based on the sun photometer itself. An improvement could justify your approach.
- In line 47 you mention the “instrument-specific nature” of existing techniques, but in what follows you show that your method is very much instrument-specific itself. In contrast to your method's dependence on the camera system, the mentioned existing method is AOD-based and thus should be, by design, instrument independent.
- In lines 56-104 you describe a long list of existing machine learning approaches for analyzing sky images for cloud classification and detection. I am missing reasons why the community would consequently need your new technique. The fact that no other method has been used in the context of cloud detection for sun photometer purposes does not seem enough.
- Lines 124ff: You list the imagery of the three camera systems you will use without stating the image format provided. For WSISEG I could check that it is PNG format. This already casts doubt on the instrument independence of your method, as quite some processing and different types of compression have been applied to the data.
- Lines 156ff: The lengthy description of the partial automation of the preparation of the “truth” training data is confusing. You should state at the beginning of this paragraph what is “manual” and what is “automated”.
- Table 1: What is a “sample”? One image, isn’t it? This all sounds like a small data set. More problematic: a small data set with just 300 samples makes it impossible to compare to other methods validated in other specific or less specific situations.
- Table 2: Quite a few of the numbers here do not seem “good”. The problem is that you never state what “sufficient” or “good” would be, nor how limited other techniques are.
- Line 263: The statement “performed well relative to prior AOD screening algorithms” sounds very soft and is not corroborated by any shown data, neither for your own “prior algorithms” nor for standard methods of the community (AERONET). The word “well” is used again further down without any supporting data.
- In your limitations section you honestly state important points, but the manuscript does not provide the necessary remedy or discussion. You are depending on image processing steps that are not within your control, i.e., camera configuration (white balance, contrast and color enhancement, compression, …), which makes your method instrument-specific! And the selection of your small validation data set (e.g., without situations of high aerosol load) makes your scores hard to compare to other methods’ results.
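For reference, the AOD-based screening the referee refers to (e.g., AERONET's triplet and smoothness checks) rejects samples whose optical depth varies too quickly in time. The following is a deliberately simplified toy illustration of that idea, not AERONET's actual algorithm; the window length and threshold here are arbitrary assumptions.

```python
# Toy temporal-variability cloud screen for an AOD time series: clouds
# vary much faster than aerosol, so samples whose local AOD range is
# large are flagged as cloud-contaminated. Simplified illustration only;
# not the AERONET algorithm, and the thresholds are arbitrary.
import numpy as np

def variability_screen(times_s, aod, window_s=60.0, max_range=0.02):
    """Return a boolean mask: True where the sample passes the screen."""
    times_s = np.asarray(times_s, dtype=float)
    aod = np.asarray(aod, dtype=float)
    keep = np.ones(aod.size, dtype=bool)
    for i, t in enumerate(times_s):
        in_window = np.abs(times_s - t) <= window_s / 2
        local = aod[in_window]
        if local.max() - local.min() > max_range:  # cloud-like variability
            keep[i] = False
    return keep

# Example: a cloud spike at t = 120 s; it and its neighbors are rejected.
t = np.arange(0, 300, 30.0)
aod = np.full_like(t, 0.15)
aod[4] = 0.40
print(variability_screen(t, aod))
# [ True  True  True False False False  True  True  True  True]
```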
Citation: https://doi.org/10.5194/amt-2022-217-RC2
Viewed
HTML | PDF | XML | Total | Supplement | BibTeX | EndNote
---|---|---|---|---|---|---
485 | 187 | 46 | 718 | 98 | 49 | 50
Eric A. Wendt
Bonne Ford
John Volckens