Preprints
https://doi.org/10.5194/amt-2022-296
https://doi.org/10.5194/amt-2022-296
19 Dec 2022
 | 19 Dec 2022
Status: a revised version of this preprint was accepted for the journal AMT and is expected to appear here in due course.

Gap-Filling of Turbulent Heat Fluxes over Rice–Wheat-Rotation Croplands Using the Random Forest Model

Jianbin Zhang, Zexia Duan, Shaohui Zhou, Yubin Li, and Zhiqiu Gao

Abstract. This study investigated the accuracy of the Random Forest (RF) model in gap-filling the sensible (H) and latent heat (LE) fluxes, by using the observation data collected at a site over rice–wheat-rotation croplands in Shouxian County of eastern China from 15 July 2015 to 24 April 2019. Firstly, the variable significances of the machine learning (ML) model’s five input variables, including the net radiation (Rn), winds speed (WS), temperature (T), relative humidity (RH), and air pressure (P), were examined, and it was found that Rn accounted for 78 % and 76 % of the total variable significance in H and LE calculating, respectively, showing that it was the most important input variable. Secondly, the RF model's accuracy with the five-variable (Rn, WS, T, RH, P) input combination was evaluated, and the results showed that the RF model could reliably gap-fill the H and LE with mean absolute errors (MAEs) of 5.88 Wm−2 and 20.97 Wm−2, and root mean square errors (RMSEs) of 10.67 Wm−2 and 29.46 Wm−2, respectively. Thirdly, 4-variable input combinations were tested, and it was found that the best input combination was (Rn, WS, T, P) with the MAE of H and LE reduced by 12.65 % and 7.12 %, respectively, after removing RH from the input list. At last, through the Taylor diagram, H and LE gap-filling accuracy of the RF model, the support vector machine (SVM) model, the k-nearest neighbor (KNN) model, and the gradient boosting decision tree (GBDT) model was inter-compared, and the statistical metrics showed that RF was the most accurate for both H and LE gap-filling, while the LR and KNN model performed the worst for H and LE gap-filling, respectively.

Jianbin Zhang et al.

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on amt-2022-296', Anonymous Referee #2, 04 Jan 2023
    • AC1: 'Reply on RC1', Yubin Li, 01 Feb 2023
  • RC2: 'Comment on amt-2022-296', Anonymous Referee #1, 01 Feb 2023
    • AC2: 'Reply on RC2', Yubin Li, 01 Feb 2023

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on amt-2022-296', Anonymous Referee #2, 04 Jan 2023
    • AC1: 'Reply on RC1', Yubin Li, 01 Feb 2023
  • RC2: 'Comment on amt-2022-296', Anonymous Referee #1, 01 Feb 2023
    • AC2: 'Reply on RC2', Yubin Li, 01 Feb 2023

Jianbin Zhang et al.

Jianbin Zhang et al.

Viewed

Total article views: 334 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
267 52 15 334 5 4
  • HTML: 267
  • PDF: 52
  • XML: 15
  • Total: 334
  • BibTeX: 5
  • EndNote: 4
Views and downloads (calculated since 19 Dec 2022)
Cumulative views and downloads (calculated since 19 Dec 2022)

Viewed (geographical distribution)

Total article views: 328 (including HTML, PDF, and XML) Thereof 328 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 25 Mar 2023
Download
Short summary
In this paper, we used Random Forest Model to fill the observation gaps of the fluxes measured during 2015–2019. We found that the net radiation was the most important input variable. And we justified the reliability of the model. Further, it was revealed that the model performed better after relative humidity was removed from the input. At last, we compared the results of the model with those of the other three machine learning models, and we found that the model outperformed all of them.