Articles | Volume 15, issue 20
Research article
19 Oct 2022
Research article |  | 19 Oct 2022

Comparison of GRUAN data products for Meisei iMS-100 and Vaisala RS92 radiosondes at Tateno, Japan

Shunsuke Hoshino, Takuji Sugidachi, Kensaku Shimizu, Eriko Kobayashi, Masatomo Fujiwara, and Masami Iwabuchi

A total of 99 dual soundings with Meisei iMS-100 radiosonde and Vaisala RS92 radiosondes were carried out at the Aerological Observatory of the Japan Meteorological Agency, known as Tateno (36.06 N, 140.13 E, 25.2 m; the World Meteorological Organization, WMO, station number 47646), from September 2017 to January 2020. Global Climate Observing System (GCOS) Reference Upper-Air Network (GRUAN) data products (GDPs) from both sets of radiosonde data for 59 flights were subsequently created using a documented processing programme along with the provision of optimal estimates for measurement uncertainty. Differences in radiosonde performance were then quantified using these GDPs. For daytime observations, the iMS-100 temperature is around 0.5 K cooler than RS92-GDP in the stratosphere, with significant differences in the upper troposphere and lower stratosphere in consideration of combined uncertainties. For nighttime observations, the difference is around −0.1K, and data are mostly in agreement. For relative humidity (RH), iMS-100 is around 1 % RH–2 % RH higher in the troposphere and 1 % RH smaller in the stratosphere than RS92, but both GDPs are in agreement for most of the profile. The mean pressure difference is ≤0.1hPa, the wind speed difference is from −0.04 to +0.14m s−1, the wind direction difference is 6.4, and the root mean square vector difference (RMSVD) for wind is ≤1.04m s−1.

1 Introduction

The Aerological Observatory of the Japan Meteorological Agency (JMA), called Tateno (located at 36.06 N, 140.13 E, 25.2 m above mean sea level), was established in 1920 and has played a leading role in the operation of all JMA radiosonde stations. The Tateno station was chosen as a candidate site for the Global Climate Observation System (GCOS) Reference Upper-Air Network (GRUAN; Seidel et al.2009; Bodeker et al.2016) in 2009 and was certified as a GRUAN site in 2018. The Vaisala RS92-SGP radiosonde (referred to here as RS92; Dirksen et al.2014) was used for routine observations at the Tateno site from December 2009. While RS92-SGP provides data with a high time resolution, i.e. high-altitude resolution, and highly reliable data and has (GRUAN) data products (GDPs), it was necessary to seek an alternative radiosonde model for operational reasons. This is because the payloads often fall within the greater Tokyo metropolitan region (i.e. highly populated area), and therefore, for safety reasons, the use of lighter instrumentation (the weight of RS92 is 290 g) is necessary. In response to this request, the Meisei RS-11G radiosonde (Kizu et al.2018; Kobayashi et al.2019; Hoshino et al.2022a), whose weight is 85 g, was released in 2012 and started being used at Tateno in July 2013. The GDP for RS-11G was developed and certificated in 2019. Following the release of the Meisei iMS-100 radiosonde (Kizu et al.2018; Hoshino et al.2022a), smaller (55×53×131mm3) and lighter (40 g), model in 2014, it has been used since September 2017. The temperature and humidity sensors of iMS-100 are basically identical to that of RS-11G, but the relative humidity sensor of iMS-100 has a dedicated thermometer, and the global positioning system (GPS) module is different between the two. RS-11G has also been used in operation at Syowa station (Antarctica) since March 2018, while the iMS-100 has been used at Minamitorishima and other JMA stations since 2017 and at stations of other meteorological service providers and numerous research institutes and universities. Syowa and Minamitorishima are currently candidate GRUAN sites.

Figure 1 gives an overview of the GRUAN data stream. The raw data output from the ground system (dc3cb or MWX files for RS92 radiosonde data and JFMT files for iMS-100 radiosonde data) and meta-data files are submitted to the GRUAN Lead Centre (LC) for archiving, and GRUAN LC issues a data processing ticket file, including the data and meta-data file names, IDs for the measurement and the product, the output file name, and other elements for processing. The data processing centre (PC; Lindenberg for RS92 and Tateno for iMS-100 radiosondes) collects raw data, meta-data, and ticket files from GRUAN LC and performs processing to create GDP files in a NetCDF format. The output GDP file are submitted to GRUAN LC. The GDP files are distributed at the National Centers for Environmental Information (NCEI;, last access: 12 October 2022) after quality checking.

Figure 1GRUAN data file stream.


The evaluation of GDP for RS-11G (RS-11G-GDP.1; Kizu et al.2019), using GDP for RS92 (RS92-GDP.2; Sommer et al.2012) in dual flight, was discussed in Kobayashi et al. (2019). In Kobayashi et al. (2019), the RS-11G-GDP.1 temperature is around −0.4K lower than RS92-GDP.2 in the daytime measurement in the stratosphere, while nighttime measurements generally agree well. The RS-11G-GDP.1 relative humidity (RH) was 2 % RH smaller than the RS92-GDP.2 for 90 % RH–100 % RH, and the RS-11G-GDP.1 was around 5 % RH larger than the RS92-GDP.2 at values lower than 50 % RH, so the RH difference exceeds 2 % RH between 500 and 150 hPa in both the daytime and nighttime data. The pressure difference was 0.5 hPa in the troposphere, and the geopotential height difference was around 10–20 m in the stratosphere.

Although the sensors of iMS-100 are almost identical with RS-11G, the data processing algorithms have some distinct updates for IMS-100-GDP, such as the correction of the hysteresis effect of the RH sensor and the correction of geoid model used in GPS module.

In this paper, Sect. 2 describes the instrumentation used and GRUAN data products (including a brief description of the data processing) for iMS-100 and RS92. Section 3 outlines the dual soundings, and Sect. 4 describes the comparison analysis method. The comparison results are given in Sect. 5, and Sect. 6 discusses outcomes from a dual flight of iMS-100 and a chilled-mirror hygrometer, Meisei SKYDEW (Sugidachi2019). Section 7 summarises the findings. The related abbreviations are listed in Appendix A.

2 Instrumentation

2.1 Sensor material and specifications

Table 1 shows the specifications of iMS-100 and RS92. More detailed specifications for each model are described in Vaisala Oyj. (2013), Dirksen et al. (2014), Meisei Electric Co., Ltd. (2020), and Hoshino et al. (2022a). The ground system for iMS-100 is Meisei MGPS2 and that for RS92 was Vaisala DigiCORA III and was replaced with Vaisala MW41 in September 2019.

(Hyland and Wexler1983)

Table 1Specifications of the radiosondes (Vaisala Oyj.2013; Meisei Electric Co., Ltd.2020; Dirksen et al.2014; Hoshino et al.2022a). Note: GBAS is ground-based augmentation system, SBAS is Satellite-Based Augmentation System, and DWH is depth, width, and height.

a Wind also can be derived from Doppler shift in GPS signal, like RS-11G. b Weight for biodegradable materia model, while 38 g for EPS model.

Download Print Version | Download XLSX

2.2 GRUAN data processing for iMS-100

Figure 2 gives an overview of GRUAN data processing for iMS-100. The ticket file (*.gpt) contains the raw data file name (*JFMT.DAT) and meta-data file name (*.gmd), the individual identification number for the observation in the GRUAN data archive, and the file name for processed data. GMDB is the abbreviation for the GRUAN Meta-data Data Base, which contains information on payload equipment (such as radiosondes, balloons, parachutes, unwinders, and rigs). In pre-processing, the lag of the data acquisition timing in the processor between temperature/humidity and GPS-related data (time and positioning) are adjusted, relative humidity is recalibrated with additional ground check data (at 0 % RH and 100 % RH conditions, where available), geometric altitude is corrected with the finer geoid model (see Sect. 2.2.3), and the initial data for ascending are extracted by identifying the start and end of ascent. Usually, the end of ascent is when the radiosonde reached its maximum altitude (thus, the start of the descent or the end of radiosonde signals), but ascent data are truncated to ensure their reliability in the event of missing temperature values (i.e. gaps in observation) over a period of 180 s or more. Initial data are processed at each step, as outlined below, to produce the final data. Derived data related to humidity (such as partial water vapour pressure, frost point temperature, and precipitable water vapour) are also calculated. To support climate record quality, GDP data include their uncertainty at each measurement point. The processed data are stored in NetCDF format, and the GDP for iMS-100 radiosonde data is denoted as iMS-100-GDP. The processing algorithm is documented in Hoshino et al. (2022a) for detail, so the outline and difference from RS-11G-GDP.1 (Kizu et al.2018; Kobayashi et al.2019) is described in this paper.

Figure 2GRUAN data processing for iMS-100 (GMDB from GRUAN Meta-data Data Base). The circle is the process starting point. Parallelograms and rectangles with double lines on the left and right sides represent the data and sub-processes for calculation, respectively.


2.2.1 Temperature

An overview of the temperature calculation is given in Fig. 3. T0, U0, Psurf, and Hfin are uncorrected temperature values, uncorrected relative humidity values, surface pressure, and corrected geopotential height (see Sect. 2.2.3). If the length of the string or the unwinder is equal to or less than 10 m, then temperature spike filtering is applied when pressure is below 30 hPa. For this filtering and radiation correction based on the heat balance equation (JMA1995), provisional pressure P0 is derived using T0, U0, surface pressure data, Psurf, and Hfin (Sect. 2.2.3). T1, the temperature after spike filtering, is T0 if no filtering is applied. The ascent rate, asc, at the previous sampling time is used as the ventilation speed, vent, for the calculation of the radiation correction. The ventilation speed at the release time is set as 0 m s−1. The moving averages of T1 within ±1 s are calculated to determine the uncorrected temperature data with radiation correction, T2. The amount of radiation correction, Trad_corr, is calculated using T2, P0, Psurf, vent, and the solar elevation angle, which is derived from GPS time, ϕ0 (latitude), and λ0 (longitude). Trad_corr is subtracted from T2, to determine T3. As the thermistor is not completely spherical, there is a small dependence on the angle to sunlight. Accordingly, the Kaiser filter is applied to T3 to minimise this orientation effect on obtaining the final temperature value, Tfin. The uncertainty budget for temperature is shown in Table 2. The uncertainties are classified as being either correlated or uncorrelated when the source of the uncertainty is systematic or random, respectively (Dirksen et al.2014). For example, the uncertainties due to the smoothing processes are classified as uncorrelated uncertainties, and uncertainties due to calibration or solar radiation correction are considered to be correlated uncertainties. Figure 4a and b show the vertical profiles of temperature and its uncertainty at 00:00 UTC on 15 September 2017. The dashed line in Fig. 4a indicates the tropopause height. The blue, green, and red lines in Fig. 4b are correlated, uncorrelated, and total uncertainty, respectively. The blue and red lines almost overlap, but the correlated uncertainty is dominant. For daytime observation, the correlated uncertainty increases with altitude because the amount of the solar radiation correction increases, and the uncertainty from it also increases.

Figure 3Temperature processing for iMS-100 (L is the total string, or unwinder, length; SEA is the Sun elevation angle derived from time and the radiosonde position). The meanings of the shapes are same as in Fig. 2. Diamonds represent the decision.


Table 2Sources contributing to iMS-100 temperature measurement uncertainty.

Download Print Version | Download XLSX

Figure 4The profile of (a) temperature, (b) temperature uncertainty, (c) RH, (d) RH uncertainty, and (e) pressure uncertainty at 00:00 UTC on 15 September 2017. The dashed line in panel (a) is the tropopause height. The blue, green, and red lines in panels (b), (d), and (e) are correlated, uncorrelated, and total uncertainty, respectively. Black lines in panels (b), (d), and (e) show the criteria for screening described in Sect. 4.1.


2.2.2 Relative humidity (RH)

An overview of the RH calculation is given in Fig. 5. Thum and L are the temperature of the humidity sensor and length of unwinder or string, respectively. For the time lag correction, which corrects the effect of the time constant delay, the response time coefficients are constant for the RS-11G-GDP.1 but are dependent on whether the RH sensor is absorbing or desorbing water vapour molecules (i.e. whether RH is increasing or decreasing, respectively) for the iMS-100-GDP. The trend of RH change is estimated from the slope of the tangent of U0, using the method proposed by Savitzky and Golay (1964), using the U0 values within ±5 s values. The response time, τhum, is calculated from the obtained trend value together with the temperature of the RH sensor, Thum. Time lag correction is applied to U0 to determine U1 values, which are separated into low- and high-frequency components, U1low and U1high, via the digital filtering with a cutoff period set as 4 times the period of pendulum motion, Tpend. High-frequency components sometimes exhibit spike-like variations, which are considered to be associated with water droplets or ice attached on the humidity sensor. These spike-like variations are removed by using a minimum-pass filter, moving averages, and an infinite impulse response (IIR) filter to determine U2high. The low- and high-frequency components, U1low and U2high, are summed to create U2. For the RS-11G-GDP.1, the hysteresis errors, i.e. the small error remaining after the time lag correction, possibly due to the inhomogeneity of change in the thin-film polymer RH sensor (termed the “slow-regime” in Dupont et al.2020), are not corrected but only considered to be an uncertainty component. For the iMS-100-GDP, the estimation of the hysteresis errors are formulated as follows: the slow part ratio and its response times for absorption and for desorption are assumed as 3 % and 300 and 12 000 s, respectively. The RH of the slow part, Udelay, and the hysteresis-corrected RH, U3 are derived via iterative calculation with the following:


where α is the slow part ratio, τhys is the response time for the slow part, and U3 is the corrected RH.

Figure 5As per Fig. 3 but for RH.


The temperature–humidity dependence (TUD) correction is applied to minimise RH biases at lower temperatures in the same way as for RS-11G-GDP.1 by Eqs.(3) and (4), but the coefficients are redetermined via additional chamber experiments and flight comparison results with the Cryogenic Frostpoint Hygrometer (Vömel et al.2007, 2016; Table 3).

(3) Δ U = K 0 , 0 + K 0 , 1 × U 3 + K 0 , 2 × U 3 2 1 - K 1 , 0 + K 1 , 1 × U 3 exp - T hum / K 2 , 0 + K 2 , 1 × U 3 + K 3 , 0 + K 3 , 1 × U 3 + K 3 , 2 × U 3 2 ,

(4) U 4 = U 3 - Δ U .

Table 3Coefficients for TUD correction (Eq. 3).

Download Print Version | Download XLSX

During soundings, the temperature of the RH sensor is not necessarily equal to the ambient air temperature because of solar heating and the thermal lag of the RH sensor. Values monitored with an RH sensor that are higher (lower) than those of the ambient air would result in dry (wet) biases. Accordingly, U4 values need to be translated to RH with respect to the saturation vapour pressure associated with air temperature using the following equation (referred as the Ts/Ta correction).

(5) U fin = U 4 × e s ( T hum ) e s ( T a ) ,

where es(T) is saturated water vapour pressure over liquid water, based on the formulation by Hyland and Wexler (1983), as follows:

(6) log e s = a 0 T + a 1 + a 2 T + a 3 T 2 + a 4 T 3 + a 5 T 4 + a 6 log T ,

where T is the temperature (K) with the coefficients, ai, as shown in Table 4.

Table 4Coefficients for saturated water vapour in Eq. (6).

Download Print Version | Download XLSX

Ufin is the final value of RH. The uncertainty budget for RH is shown in Table 5. The uncertainties due to the calibration and time lag correction are classified as correlated uncertainties. The uncertainty due to the low-pass filter is uncorrelated uncertainty. The uncertainty due to the IIR filter has correlated components propagating from the time lag correction and uncorrelated components from the low-pass filter. The uncertainties from the hysteresis correction, the TUD correction, and the Ts/Ta correction are calculated by propagation from the uncertainties from all previous steps. The final correlated uncertainty is obtained from synthesising the calibration uncertainty and the correlated uncertainty after the Ts/Ta correction is applied. Figure 4c and d show the vertical profiles of RH and its uncertainty at 00:00 UTC on 15 September 2017. The blue, green, and red lines in Fig. 4d are correlated, uncorrelated, and total uncertainty, respectively. The green and red lines almost overlap at most altitudes. This means that the uncorrelated uncertainty is dominant. The correlated uncertainty has some magnitude (1 % RH–4 % RH) in the troposphere but is nearly negligible above 50 hPa level. If Ufin+u(U) is less than 0 % RH, then these are treated as missing values (NaN) in output.

Table 5Sources contributing to iMS-100 relative humidity measurement uncertainty.

Download Print Version | Download XLSX

2.2.3 Geopotential height

An overview of the geopotential height calculation flow is given in Fig. 6. The raw geometric altitude, Z0, is calculated from ellipsoidal height derived with GPS signal and the internal geoid model of iMS-100, but the grid resolution of the internal geoid model with 10×10 is too coarse, and the interpolated geoid height may differ from the actual geoid height by 20 m or more in some regions (near Japan is the one of the regions with large differences; see Fig. 7). Accordingly, geometric altitude is recalculated using the geoid model (Pavlis et al.2012) with a finer grid (5×5) to derive Z1. The results of the verification using the Global Navigation Satellite System (GNSS) simulator show that Z0 was found to have a ≃1s delay with respect to the assumed altitude. Z1 is corrected for this delay to obtain Z2. Also, this verification shows that there is a delay of several seconds in measurements just after the launch. This delay is attributed to Kalman filtering in positioning by the GPS module, and the difference becomes negligible several seconds after launch. Accordingly, the altitude in this transition period is interpolated with the known release altitude and the observed altitude at the end of transition to obtain Z3, which is then applied with a moving average to determine the final geometric altitude, Zfin. The geopotential height, Hfin, is calculated using the following:

(7) H fin = g ϕ g 0 R × Z fin R + Z fin ,

where R is the radius of the Earth (6 378 136.0 m), g0 is the standard gravity acceleration (9.80665 m s−2), and gϕ is the gravity acceleration at the observation latitude.

Figure 6As per Fig. 3 but for geometric altitude, Z, and geopotential height, H.


Figure 7(a) Geoid height for iMS-100-GDP (EGM2008; 5×5), (b) iMS-100 original (10×10), and (c) the difference (northwestern Pacific basin).

The geopotential height uncertainty consists of the components due to vertical positioning precision (correlated) and smoothing (uncorrelated).

2.2.4 Pressure

The iMS-100 radiosonde is not equipped with a pressure sensor. In recent years, radiosondes without a pressure sensor have been widely used. This is in part because of reduction in the weight and manufacturing costs. Also, the accuracy of the atmospheric pressure derived using geopotential height obtained from GPS measurements and hydrostatic equation-based approximation is sufficient in the troposphere and superior in the stratosphere (e.g. Nash et al.2011).

Pressure is calculated from the geopotential height, Hfin (Sect. 2.2.3), temperature, Tfin, and RH, Ufin, using the hydrostatic equation.

(8) P i = P i - 1 exp - g 0 T v m R d Δ H ,

where Pi is the calculated pressure (hPa), Pi−1 is the pressure at the previous level (or time step, i.e. 1 s before), ΔH is the thickness (m) between the two consecutive levels, ΔH=Hi-Hi-1, g0 is the standard gravity acceleration, Tvm is the average virtual temperature between the two consecutive levels (K), and Rd is the specific gas constant of dry air. The virtual temperature is calculated as follows:

(9) T v = T ( 1 + r v / ε ) 1 + r v ,


(10) r v = ε e P - e ,

where Tv is the virtual temperature (K), rv is the mixing ratio of water vapour, ε is the ratio of the molecular weight between water vapour and dry air (=0.622), P is pressure, and e is water vapour pressure calculated from temperature and RH based on the water vapour pressure equation by Hyland and Wexler (1983). For P, Pi−1 is used. The pressure uncertainty is calculated at each data point based on the error propagation rule from temperature, RH, and geopotential height uncertainties from the previous time step. The propagation for correlated and uncorrelated uncertainties is calculated separately and combined to derive the total uncertainty. An example of the pressure uncertainty profile is shown in Fig. 4e.

2.2.5 Wind

An overview of the wind calculation flow is given in Fig. 8. While the initial (unsmoothed) wind speed and direction are derived from GPS Doppler shift for RS-11G-GDP.1, the initial wind speed wspeed0 and direction wdir0 are derived as motion vectors from longitude (λ; lon0 in Fig. 8) and latitude (ϕ; lat0 in Fig. 8) based on GPS positioning for iMS-100-GDP. The great circular distance, d, and direction, θ, between the position at ti (λi, ϕi) and ti+1 (λi+1, ϕi+1) are given with spherical trigonometry by the following:

(11) d = R arccos sin ϕ i sin ϕ i + 1 + cos λ i cos λ i + 1 cos λ i + 1 - λ i ,

(12) θ = 90 - arctan cos λ i tan λ i + 1 - sin ϕ i cos λ i + 1 - λ i sin λ i + 1 - λ i ,

where R is the radius of the Earth, and d and θ are rendered as wspeed0 and wdir0, respectively. The smoothing process is same as RS-11G-GDP.1. The wind vectors with a wind speed of wspeed0 and a wind direction of wdir0 are decomposed to the zonal and meridional wind components, u0 and v0. Each of these components are smoothed by a low-pass filter using a Kaiser window with a cutoff frequency of fcutoff=1/Tpend, for which Tpend is the cycle of the pendulum motion by string with the length L to derive the final wind components of ufin and vfin, respectively. The final wind components are synthesised to wind speed, wspeedfin, and direction, wdirfin.

Figure 8As per Fig. 3 but for wind.


Uncertainties in the zonal and meridional wind components consist of those related to the positioning uncertainty and those from smoothing by the Kaiser filter. Both components are uncorrelated uncertainties. The positioning component is estimated as 2.5 m s−1, which is statistically derived using the GPS simulator. The uncertainties in wind speed and direction are calculated from the partial differentiation of the formula that converts wind components to wind speed or direction.

2.3 GRUAN data processing for RS92

As data processing for RS92-GDP version 2 has been detailed by Dirksen et al. (2014), only a brief description is provided here.

Raw temperature data are corrected for solar radiation and heat spike errors. Solar radiation errors relate to the overall direct and scattered solar irradiance, ambient pressure, and ventilation and are estimated at the GRUAN LC from a radiative transfer model that takes into account the solar elevation angle at the time of monitoring. Vaisala radiation error correction data are also available in table form. GRUAN data processing for RS92 involves an application of the average of the two, as it remains unclear which of the two correction models is more appropriate. Heat spike errors are removed via a low-pass digital filter with a cutoff frequency of 0.1 Hz.

RS92 RH sensors have a temperature-dependent dry bias. GRUAN data processing corrects for this based on multiplication with an empirical correction factor before other forms of the correction are applied. Raw RH data are corrected for radiation dry bias, sensor time lag, and temperature-dependence errors. Radiation dry bias is caused by solar heating on the RH sensors, and the same approach as for the temperature sensor is used to estimate the amount of the correction required. The RH sensor response slows at low temperatures, and time lag becomes significant below −40C. This is corrected based on the relationship between a time constant and temperature using a low-pass filter in the GRUAN data product for RS92 (Dirksen et al.2014).

The RS92 used at Tateno has a pressure sensor and a GPS receiver, both of which can be used to calculate geopotential height. Pressure measurement data are used to derive geopotential height in the lower part of the profile where the signal-to-noise performance of the pressure sensor is sufficiently good, and measurements from the GPS sensor are used in the upper part of the profile. The altitude of the switch is typically between 9 and 17 km (Sommer et al.2016). The pressure sensor is recalibrated against the reference value from a station barometer during the ground check, and a calculation is performed to determine the correction factor for application to the entire pressure profile during sounding (Dirksen et al.2014). U and V data are retrieved from the Doppler shift in the GPS carrier signal, and noise is removed using a low-pass digital filter. The smoothed data are converted into wind speed and direction values (Dirksen et al.2014). Uncertainties in each parameter for RS92 are described in Dirksen et al. (2014).

While the authors used version 2 of the RS92 GDP, version 3 is supposed to be available in the near future (Sommer et al.2016), and it would be useful to redo the analysis with it.

3 Method used for dual sounding

Dual soundings with iMS-100 and RS92 for intercomparison have been conducted from September 2017 to January 2020 at 00:00 UTC (09:00 LT; daytime) or 12:00 UTC (21:00 LT; nighttime) once a week, except for wind conditions in which a payload may fall to populated areas around metropolitan Tokyo (in general, from July to mid-September, with relatively weak westerly winds in the upper troposphere due to the northward displacement of the subtropical jet stream and the stratospheric easterly winds). There were 99 flights during this period (52 daytime and 47 nighttime).

Payload configuration for dual sounding is shown in Fig. 9. A 1200 g balloon was used for all dual soundings. The iMS-100 and RS92 radiosondes were hung on both ends of a 0.9–1 m corrugated plastic board or bamboo rod, and this rig was covered with aluminium tape to reduce the effects of radiation, based on the proposal of von Rohden et al. (2016).

Figure 9Payload configurations for dual soundings.


Figure 10The profile of (a) temperature, (b) RH, and (c) the uncertainties for RH at 00:00 UTC on 9 November 2018. The horizontal dashed line in panel (a) represents the tropopause. The blue and red lines in panel (c) show the criteria of screening, according to Eqs. (14) and (16). Panels (d) and (e) show whether the data within (blue) or exceeding (red) the criteria by Eqs. (14) and (16) at each altitude, respectively. Panels (f) and (g) show the percentage of data within or exceeding in the whole profile and are, thus, equivalent to the rearrangement of panels (d) and (e).


4 Method for comparison

GDP data both for iMS-100 and RS92 are collected at 1 s intervals. Temporally simultaneous measurements were compared using the two statistical approaches adopted by Kobayashi et al. (2019) to evaluate differences in the data products. The first method, which is the layer mean in the ensemble of dual flights (described in Sect. 4.4), is to obtain a profile of the differences between the two data products. This is necessary to create a homogenised data set for the climatological discussion at a site where there were instrument changes. The second method, which is the consistency verification using the uncertainty per single measurement level following the Type B evaluation in Immler et al. (2010, described in Sect. 4.5), is necessary to know if the results obtained in the first method can be regarded as being significantly different.

4.1 Screening with quality assessment

Prior to statistical comparison, irregular or inappropriate data for comparison should be excluded. RS92-GDP, processed by the GRUAN LC, has been checked with a quality assessment algorithm by the GRUAN LC. As the algorithm for iMS-100-GDP in this study is designed with reference to that for RS92-GDP, the algorithm for RS92-GDP is described before that for iMS-100-GDP.

For RS92-GDP, the first quality screening is based on differences between the radiosonde and reference sensor in the ground check. The thresholds are 1.5 hPa for pressure, 1.0 K for temperature, and 1.5 % RH for RH (Sommer2013). Data with larger difference are not used. The second screening is based on the quantified uncertainty estimates (Sommer2013). This screening is based on the idea that data with uncertainties exceeding the criteria are of questionable reliability and need to be verified individually. The thresholds for uncertainty, uspecRS92, are calculated as follows:


When uncertainties for 95 % or more of data in the entire profile are within ulimit, the profile is approved and otherwise labelled as “Checked”.

For RH, however, the criterion in Eq. (14) is too strict because it is lower than actual typical values, especially for 50 % RH–70 % RH. Thus, the original formula, as per Eq. (16), is used here.

(16) u limit RS 92 ( U ) = - 0.000578 U 2 + 0.0925 U + 1.457 .

However, the lower limit of ulimitRS92(U) is set to 2.5 % RH. Figure 4a–c show the profiles of temperature, RH, and u(U) at 00:00 UTC on 9 November 2018. The blue and red line represent the criteria calculated by Eqs. (14) and (16), respectively. Figure 4d and e show whether the data are within (blue) or exceeding (red) the criteria by Eqs. (14) and (16) at each altitude, respectively. Figure 10f and g show the percentage of data within or exceeding in the whole profile and are, thus, equivalent to the rearrangement of Fig. 10d and e. The percentage of data within the criteria, using Eqs. (14) and (16), is 90 % and 98 % in Fig. 10d and e, respectively. Therefore, this case is classified as “Checked” with the LC criteria (i.e. 95 %). However, it is included in the analysis in this study.

For iMS-100-GDP, the first screening is performed at the ground check under room conditions during preparation. The thresholds of the differences from reference sensors for temperature and RH are ±0.5C and 7 % RH, respectively. Sensor values exceeding these criteria are not used for observation.

The second screening checks the contamination or changes in the RH sensor specifications associated with icing. In some soundings with iMS-100, abnormal RH profiles, such as the one shown in Fig. 11 (for 12:00 UTC on 22 September 2017), are sometimes observed. For RS92, there is little need to consider the potential freezing of its humidity sensor since it is heated. On the other hand, the very slow RH decreasing after its passage through the supercooled layer (red shading in Fig. 11b) and relatively high RH values in the stratosphere for iMS-100-GDP are not explained by the hysteresis but are probably due to contamination and changes in the RH sensor specifications related to icing or freezing during passage through supercooled droplet clouds. As checking to determine whether the radiosonde has passed through such clouds is impractical from the RH profile, ice-saturated regions (pink shaded layer in Fig. 11b) are considered. However, as not all data associated with ice-saturated clouds' passage are contaminated, the length of the ice-supersaturated region (ISSR) and the RH and water vapour mixing ratio at the top of the upper troposphere and lower stratosphere (UTLS) are used for screening in this study. ISSR determination is based on saturated water vapour for liquid water and ice that is calculated using Hyland and Wexler equation (Hyland and Wexler1983). The observed RH is the ratio of water vapour pressure to es_liq and is limited to Us_ice=esat_ice/esat_liq for the ice phase. Accordingly, ISSR is the layer in which U>Usat_ice. In this study, two adjacent ISSRs, with intervals of ≤60 s or where the minimum of UUsat_ice is more than −10 % RH (i.e. with no dry layer in the interval), are merged and treated as a single ISSR.

Figure 11Temperature (a) and RH (b) profiles for RS92-GDP (orange) and iMS-100-GDP (blue) at 12:00 UTC on 22 September 2017. The dashed line in panel (a) represents tropopause pressure. The black line in panel (b) shows the RH profile for ice-saturated air with iMS-100-GDP temperature; air is supersaturated for ice if the observed RH exceeds this value. The pink shaded layer (453.2–232.4 hPa) shows the ISSR for iMS-100-GDP.


For the screening of ice-contaminated profiles, the probability of icing, Price, is derived using logistic regression analysis (e.g. Cox1958) after variable selection from the length of ISSR, RH, and the volume mixing ratio at several levels. The 452 routine observations (twice per day) with a single payload taken from April to November 2018 at Tateno are used as training data. The Python scikit-learn package (Pedregosa et al.2011) was used for the actual coefficient derivation. The Price is calculated with the following:

(17) Pr ice = 1 1 + exp - 22.38 + 7.18 × 10 - 3 T ISSR _ max + 0.517 U ST 1 + 0.977 U ST 2 + 0.105 W ST 1 + 0.427 W ST 2 ,

where subscripts ST1 and ST2 represent the data points at 2000 and 4000 m above the tropopause in geopotential height, U (% RH) is RH, W (parts per million by volume, ppmv) is the water vapour volume mixing ratio, and TISSR_max is the pass-through period of maximum ISSR in seconds. In this study, profiles with Price>0.5 are considered to be potentially ice-contaminated data and excluded from the RH analysis below.

The third screening is based on uncertainties for temperature, RH, and pressure as with RS92. The thresholds for these values are derived from Eqs. (18)–(20), respectively. Coefficients are determined empirically.

(18) u limit iMS ( T ) = 0.7 .

(19) u limit iMS ( U ) = - 1.438 × 10 - 6 U 3 + 9.867 × 10 - 4 U 2 - 1.020 × 10 - 2 U + 3.081 ( for daytime ) - 1.195 × 10 - 5 U 3 + 2.631 × 10 - 3 U 2 - 7.308 × 10 - 2 U + 3.351 ( for nighttime ) .

However, the lower limit of ulimitiMS(U) is set to 3.3 % RH for both cases.

(20) u limit iMS ( P ) = 1.609 × 10 - 9 P 3 - 4.589 × 10 - 6 P 2 + 4.120 × 10 - 3 P + 3.810 × 10 - 2 .

For example, the criteria for temperature and RH uncertainty are shown as black lines in Figs. 4b and d, respectively. In this case, the RH uncertainty exceeds the criteria near 370 and 250 hPa. The ratio criterion is set to 90 %; thus, if more than 90 % of the whole profile has data whose uncertainty does not exceed the threshold, the iMS-100-GDP is included in the following analysis. For the case in Fig. 4, the ratio of data for which the uncertainty is within the threshold is greater than 90 %. On the other hand, Fig. 12 (at 00:00 UTC on 6 December 2019) shows the rejected case with only 84 % of the data within the RH criteria due to the large uncertainty above the 15 hPa level. For the case in Fig. 13 (at 00:00 UTC on 24 November 2017), the large pressure uncertainty layer exceeding the criteria is found between 540 and 230 hPa. Only 82 % of the data are within the pressure criteria, and thus this case is rejected. In this case, the large correlated uncertainty in the middle troposphere is caused by a large vertical component of the dilution of precision (VDOP), i.e. the precision of GPS positioning (not shown). It should be noted that this quality assessment and screening are for this study only and are not authorised as the standard for GDPs. The standard method for quality assessment of GDPs is currently under discussion by the quality task force in the GRUAN community.

Figure 12As per Fig. 4 but for the case at 00:00 UTC on 6 December 2019.


Figure 13As per Fig. 4 but for the case at 00:00 UTC on 24 November 2017.


As a result of the screening process described above, the 59 dual sounding flights (29 daytime and 30 nighttime) shown in Table 6) are used as a suitable data set for comparison. Seasonal profiles for temperature, RH, and wind speed are shown in Figs. 1517, with the seasonal classification (March–April, MA, May–July, MJJ, August–November, ASON, and December–February, DJF) described in Sect. 4.4. The major factors associated with data screening are illustrated in Fig. 14 for each season and daytime/nighttime. The rejection rates are low in DJF but high in MA and MJJ. The primary reasons are the large uncertainties in humidity or pressure, the icing of the humidity sensor for iMS-100, and the bad RS92 sensors, i.e. humidity differences exceeding thresholds to the reference sensor in the ground check. Thus, the data are rejected mainly due to humidity sensor issues. It maybe is related to the climatology at Tateno, where it tends to be dry in winter and humid and highly variable with altitude in MA and MJJ (Fig. 16). The highly variable humidity may cause large uncertainties for the time lag correction or hysteresis correction. Passing through ISSRs increases the likelihood of the humidity sensor icing. For RS92, the ground check of the humidity sensor is performed with the assumption that the humidity of the chamber filled with molecular sieves is 0 % RH, but the molecular sieves in the chamber gradually becomes damp because they take in water vapour from the atmosphere. At Tateno, molecular sieves were replaced when the humidity difference between RS92 sensors, which exceeded 2 % RH and which may have exceeded the RS92-GDP screening criteria.

Table 6List of dual sounding events with iMS-100 and RS92 used for comparison. The Weather code is according to the World Meteorological Organization (WMO) code table 4677.

* Truncated due to excessive temperature gaps. RS92 achieved 34 434.6 m (6.1 hPa).

Download XLSX

Figure 14Major factors in the data screening. The term “Adopted” is used for comparison.


Figure 15Seasonal profiles of iMS-100-GDP. Grey lines represent profiles for individual flights, red represents profile means, and black represents profile medians. Shading represents 25th–75th percentiles.


Figure 16As per Fig. 15 but for RH.


Figure 17As per Fig. 15 but for wind speed.


4.2 Timestamp adjustment

Radiosonde observations have timestamps from the relevant sounding system (for iMS-100, this is based on received GPS clock data). As there may be minor discrepancies in balloon launch timestamps, these data are time-adjusted using the temperature profile. In this study, the shift registration in functional data (Ramsay and Silverman2005) is used for the adjustment using the temperature data from between 5 and 20 min of the launch time of RS92. Temperature data for each radiosonde, Ti(t), in this period are converted to functional data, xi(t). Shift values, δi, are calculated to minimise the registered curve sum of square error (REGSEE), which is defined as follows:

(21) REGSEE = i x i t + δ i - μ ^ ( t ) 2 d t ,

where μ^(t) is the mean function of xi(t). For the calculation of δi, the scikit-fda package (Carreño et al.2020) for Python is used, and actual adjustment values (seconds to shift) are derived with the following:

(22) δ t = δ t RS 92 - δ t iMS .

When δt>0, the timestamps for iMS-100 are shifted backward by |δt|, and vice versa.

4.3 Data pretreatment

Profiles with >10% of abnormal data points in the whole profile are excluded via screening, as described in Sect. 4.1, while abnormal data are also seen at individual points in the overall normal profiles (e.g.  superadiabatic lapse rates). Such data points should be excluded from statistical comparison. In this study, superadiabatic lapse rate layers and abnormal wind data immediately before balloon burst are pre-treated for masking.

4.4 Statistical comparison for binned layers based on pressure

After timestamp adjustment, the per second differences between iMS-100 and RS92 measurements were calculated, and the differences were allocated to the 13 pressure layers based on RS92 pressure data (PiRS92, where i indicates the time step) based on Kobayashi et al. (2012, 2019). The bins for the 13 layers are listed in Table 7.

Table 7Pressure range for allocation of iMS-100–RS92 differences based on RS92 pressure (i.e. bottomPiRS92>top).

Download Print Version | Download XLSX

Table 8Terminology for pair checking in independent measurements with identical quantities for consistency.

Download Print Version | Download XLSX

Ai,jiMS and Ai,jRS92 are values at the time step i for iMS-100 and RS92 in the jth dual sounding data set (j=1,,M; M is the number of data sets, here 59), respectively. The differences between the two radiosonde types, ΔAi,j=Ai,jiMS-Ai,jRS92, are averaged for each pressure layer with the following:

(23) Δ A k , j = i = i b , j i t , j Δ A i , j N k , j ,

where k is the layer number (k=1,,13), Nk,j is the number of data points in the layer k, and ib,j and it,j are the bottom and top time step at the jth data set, respectively. The ensemble mean of the difference in each pressure layer is calculated as follows:

(24) Δ A k = j = 1 M Δ A k , j M .

The ensemble standard deviation of the mean difference for individual pressure layers is as follows:

(25) σ ( A k ) = j = 1 M ( Δ A k , j - Δ A k ) 2 M .

A comparison of the wind data is performed with the indices of the wind speed bias, BIAS, and the root mean square error of the vector difference, RMSVD (CGMS2003), for each pressure layer at each dual sounding. BIAS, vector difference (VD), mean vector difference (MVD), standard deviation vector difference (SD), and RMSVD are calculated as follows:

(26) BIAS k , j = 1 N ( V i , j iMS - V i , j RS 92 )

(27) VD i , j = ( u i , j iMS - u i , j RS 92 ) 2 + ( v i , j iMS - v i , j RS 92 ) 2

(28) MVD k , j = 1 N k , j VD i , j

(29) SD k , j = 1 N k , j ( VD i , j - MVD k , j ) 2

(30) RMSVD k , j = MVD k , j 2 + SD k , j 2 ,

where V, u, and v are the wind speed, zonal wind speed, and meridional wind speed, respectively. The mean difference in wind direction (Szantai et al.2007), ΔDIR, is also used.

(31) Δ DIR k , j = 180 N π arccos V i , j iMS V i , j RS 92 V i , j iMS V i , j RS 92 ,

where V is the wind vector. The ensemble mean of these parameters is then calculated as for Eq. (24).

Figure 18(a) Profiles of ensemble mean temperature differences (iMS-100-GDP minus RS92-GDP) with standard deviations (error bars) for all seasons combined. Red, blue, and black lines represent daytime, nighttime, and all data, respectively. Points are vertically shifted for ease of viewing. Panel (b) is as per panel (a) but represents seasonal profiles for daytime data. The green, red, orange, blue, and black lines represent MA, MJJ, ASON, DJF, and all seasons, respectively. Panel (c) is as per panel (b) but for nighttime data.


Statistics for each pressure layer are calculated for daytime, nighttime, and individual seasons. Due to the safety consideration described in Sect. 3, few dual soundings are performed in July and August, and the weather conditions for August flights are categorised as for autumn rather than summer. Thus, in this study, the flights from August to November are categorised as those for autumn (here, ASON), and flights in previous studies for the seasonal comparison campaigns of radiosondes at Tateno (Kobayashi2015; Kobayashi and Hoshino2018) are categorised as follows: the summer covered the period from May to June, flights in March and April are categorised with spring (here, MA), those from May to July are categorised with summer (MJJ), and those from December to February are categorised with winter (DJF).

Figure 19Temperature consistency rank for individual pressure layers.


Figure 20Distribution of temperature differences for daytime observations. aX is the histogram, bX is the box plot, and cX is the quantile–quantile plot. X indicates the layer for (1) L09 (50–30 hPa), (2) L07 (100–70 hPa), (3) L05 (200–150 hPa), and (4) L03 (500–300 hPa).


4.5 Method for verification of consistency with uncertainties

The uncertainty estimation for RS92 and iMS-100 GDPs are described in Dirksen et al. (2014) and Hoshino et al. (2022a), respectively.

Immler et al. (2010) proposed the terminology for comparing pairs of independent measurements of the same quantity for consistency using estimated uncertainties, as described in the following. Consider two independent measurements, m1 and m2, of the same measurand with standard uncertainties, u1 and u2, respectively. Under the assumptions that the measurements m1 and m2 have independent errors with measurement uncertainties u1 and u2, and the difference m1m2 has a normal distribution, the probability that

(32) m 1 - m 2 > k u 1 2 + u 2 2

occurs only by chance is roughly 4.5 % for k=2 and 0.27% for k=3. If Eq. (32) is true for k=2, then it is very likely that the two measurements did in fact not measure the same thing, probably due to an unrecognised or unaccounted-for systematic effect in either one or both measurements. Immler et al. (2010) proposed an expression for the degree of consistency, as shown in Table 8. This approach is a Type B evaluation of uncertainty. For the Type A evaluation, Immler et al. (2010) concluded that it is not expected to play an important role within GRUAN, and thus it is not considered in this study.

For a statistical consistency check, the total consistency ranks shown in Table 8 (k<1 is consistent, 1k<2 is in agreement, 2k<3 is significantly different, or k≥3 is inconsistent) between RS92 and iMS-100 within a specific pressure layer for a particular parameter are estimated as being the 95 % percentile value of consistency ranking numbers within the layer.

5 Results

5.1 Temperature

Figure 18 shows the ensemble mean (lines) and standard deviation (error bar) of temperature differences for daytime and nighttime for all seasons (Fig. 18a), seasonal daytime (Fig. 18b), and seasonal nighttime (Fig. 18c). In the stratosphere, the iMS-100-GDP values are around −0.5K lower than RS92-GDP for the daytime. For the nighttime, differences are around −0.1K below the 10 hPa level. Seasonal differences are small.

Figure 19 shows the percentage of the consistency rank in each layer for daytime (Fig. 19a) and nighttime (Fig. 19b). The percentages of significantly different or inconsistent data exceed 50 % in L07 (100–70 hPa) and 30 % in L08 (70–50 hPa) and L09 (50–30 hPa) for daytime observations (Fig. 19a). However, 80 % of the data are in agreement for nighttime comparison (Fig. 19b). Figure 20 shows the distribution of temperature at L09 (50–30 hPa), L07 (100–70 hPa), L05 (200–150 hPa), and L03 (500–300 hPa). Figure 20a1, a2, and a3 show that, in general, differences in L09, L07, and L05 are normally distributed with a sufficient sample size. Accordingly, daytime temperature differences in the stratosphere and the upper troposphere are associated with systematic effects. These differences (especially between 100 and 30 hPa) are attributed to the difference in the solar radiation correction models. Further discussion about the contributions of different radiation heating correction methods to the temperature difference needs other observation-data-like satellites or other types of radiosondes, like RS41. But GNSS-RO-based temperature data are very limited, and no comparative observations have been made at Tateno between the three sondes (iMS-100, RS92, and RS41). Therefore, additional discussion is expected after the results of the comparisons between iMS-100 vs. RS41 and RS92 vs. RS41 are published. However, some samples show significant differences (>±0.5K) even in the troposphere or for nighttime soundings (not shown), which are associated either with issues during flights or calibration problems.

Kobayashi et al. (2019) found that the RS-11G-GDP.1 temperature data are about −0.4K lower than RS92-GDP.2 data for daytime observations in the stratosphere. This means that the iMS-100-GDP temperatures show larger differences from RS92-GDP.2 than the RS-11G-GDP.1 temperatures do. On the other hand, the ratio of data that are evaluated as consistent or in agreement with RS92-GDP.2 temperatures is greater for iMS-100-GDP than RS-11G-GDP.1. This is probably due to the newly included correction for the sensor orientation effects in iMS-100-GDP, which will increase the uncorrelated uncertainty. Further investigation is needed, using intercomparison results between RS-11G and iMS-100 radiosondes.

5.2 Relative humidity

Figure 21 shows the ensemble mean (lines) and standard deviation (error bars) of RH differences for daytime and nighttime for all seasons (Fig. 21a), seasonal daytime (Fig. 21b), and seasonal nighttime (Fig. 21c). This shows that iMS-100 RH is around 1 % RH–2 % RH larger than the RS92 RH around the tropopause and −1 % RH smaller in the stratosphere. Unlike temperature, systematic differences (i.e. biases) between daytime and nighttime soundings are small, but seasonal variations are large in the upper troposphere and the lower stratosphere. Figure 22 shows the ensemble mean of RH differences for six RH ranks. In the lower troposphere (below 500 hPa level), differences are below 2 % RH and exhibit limited correspondence with RH values. In the middle and upper troposphere (500–100 hPa), the difference increases with altitude for 100 % RH–90 % RH. For data with RH ≤10 % RH, the difference is within ±1 % RH for layers above 500 hPa level; iMS-100 RH is wetter in the troposphere and drier in the stratosphere. For data with RH >90 % RH, the data set is limited above 300 hPa and drier than RS92 figures for the middle troposphere (500–300 hPa).

Figure 21As per Fig. 18 but for RH.


Figure 22Mean RH difference for six RH ranks.


Kobayashi et al. (2019) found that RS-11G-GDP.1 RH data show about 2 % RH dry tendencies for conditions with RH >90 % RH and about 1 % RH wet tendencies for conditions with RH ≤10 % RH. These different behaviours between RS-11G-GDP.1 and iMS-100-GDP with respect to RS92-GDP.2 in very dry and very wet conditions make the ΔU profiles different in the lower troposphere (below 700 hPa) and in the stratosphere (above 50 hPa; see Fig. 21 in this study and Fig. 11 in Kobayashi et al.2019). In particular, the 1 % RH wet bias of RS-11G-GDP.1 with respect to RS92-GDP.2 and the dry bias of iMS-100-GDP in the stratosphere has notable differences. The major reason for these differences could be the inclusion of the hysteresis correction in the iMS-100-GDP. The ratio of data that are evaluated as consistent or in agreement with RS92-GDP.2 RH data in the troposphere are greater for RS-11G-GDP.1. This is also due to the inclusion of the hysteresis correction in the iMS-100-GDP because the RH profiles in the troposphere often show rapid changes, as shown in Fig. 16. Further investigation on the differences between RS-11G and iMS-100, resulting from intercomparison flights, is a future task.

Figure 23 shows the percentage of consistency rank in each layer for daytime (Fig. 23a) and nighttime (Fig. 23b). In the troposphere and lower stratosphere (L1–L7, below 70 hPa level), around 10 %–20 % of data are significantly different or inconsistent. The RH profiles for individual flights (Fig. 16) show that RH often shows rapid changes. For the flight at 12:00 UTC (21:00 LT) on 19 April 2019 (Fig. 24), iMS-100 shows a slow tendency in relation to the rapid decrease in RH (e.g. at about 330 hPa) compared to RS92. This difference in the response to rapid changes, especially in case from high to low humidity, is considered to be a reason for the inconsistency of 1 s RH values between the two radiosondes. As described in Sect. 2.2.2, the iMS-100 RH sensor has hysteresis with the large time constant, but the RS92 RH sensor is heated, and its hysteresis is negligible. This difference in the characteristics of the RH sensor could cause a large difference, especially in the rapidly decreasing RH case. In the stratosphere (L8–L13, above the 70 hPa level), RH data from iMS-100 and RS92 seem to be almost in agreement.

Figure 23As per Fig. 19 but for RH.


Figure 24Comparison between 500 and 150 hPa levels (L3 to L5) at 12:00 UTC (21:00 LT) on 19 April 2019. (a) Temperature. (b) RH. (c) RH difference.


5.3 Pressure

Figure 25 shows the ensemble mean (lines) and standard deviation (error bars) of pressure differences for daytime and nighttime for all season (Fig. 25a), seasonal daytime (Fig. 25b), and seasonal nighttime (Fig. 25c). The absolute value of the ensemble mean difference is less than 0.4 hPa, but there are cases with large differences in the lower troposphere (below 700 hPa level). This may be attributable to the effect of the pressure differences between the RS92 pressure sensor and the barometer used for surface observation. The pressure of iMS-100, with no pressure sensor, is derived from a recursive calculation via the hydrostatic equation so that the surface pressure is equal to that observed using ground-based barometer. Meanwhile, RS92 involves independent pressure sensor usage, meaning that near-surface pressure may differ between GDPs. A histogram of the RS92 GDP surface pressure error for ground-based barometer content (Fig. 26) shows that the difference is not normally distributed around zero. RS92 pressure tends to be slightly higher than the ground-based barometer content, with a difference median of 0.33 hPa, which is greater than the barometer uncertainty (0.06 hPa for k=1). The consistency check for surface pressure between the barometer and RS92 resulted in significantly different or inconsistent results in 20 cases. The effect of this difference decreases with height but is more noticeable near the ground.

Figure 25As per Fig. 18 but for pressure.


Figure 26Histogram for the surface pressure difference (RS92 minus barometer values).


For RS-11G-GDP.1, the pressure is about −0.5hPa lower for daytime observation in the middle of troposphere, but the difference between iMS-100-GDP and RS92-GDP.2 is statistically small in those layers. At this time, the reason for this difference in pressure is not clear.

5.4 Geopotential height

Figure 27 shows the ensemble mean (lines) and standard deviation (error bars) of geopotential height differences for daytime and nighttime for all seasons (Fig. 27a), seasonal daytime (Fig. 27b), and seasonal nighttime (Fig. 27c). The difference in geopotential height is around 2–3 m in the lower and middle troposphere but becomes larger with altitude above 100 hPa level and becomes about 10 m at 20 hPa. This tendency is attributed to the difference in geoid height, as referenced by iMS-100-GDP and RS92-GDP. As described in Sect. 2.2.3, the grid size of the original geoid model used for the iMS-100 GPS module is 10×10, which is replaced with a 5×5 model for geometric height calculation in GDP processing. The grid size difference causes a significant discrepancy in geoid and geometric height, especially for the northwestern Pacific basin around Japan. Figure 28a and b show the geoid height for iMS-100-GDP and original iMS-100, respectively, Fig. 28c shows differences in geoid height and typical radiosonde track for each season (green for MA, red for MJJ, orange for ASON and blue for DJF), and Fig. 28d shows geoid correction values for typical seasonal tracks. The difference increases with height for all seasons. The grid size of the geoid model for RS92-GDP is unknown, but as geometric height values are used without modification, the geoid model differences may have caused geopotential height differences.

Figure 27As per Fig. 18 but for geopotential height.


Figure 28(a) Geoid height for iMS-100-GDP, (b) iMS-100 original values, (c) difference in geoid height and seasonal typical tracks, and (d) correction values along typical tracks.

The geopotential height difference between iMS-100-GDP and RS92-GDP.2 seems to have no noticeable difference with that between RS-11G-GDP.1 and RS92-GDP.2, although the RS-11G-GDP.1 is not a corrected geoid model. This implies that the geoid model used in RS-11G has enough resolution for GDP.

5.5 Wind

Figure 29 compares wind for the wind speed bias (Fig. 29a), RMSVD (Fig. 29b), and wind direction difference for all seasons (Fig. 29c). The difference is small enough, with BIAS from −0.04 to +0.14m s−1, RMSVD is less than 1.04 m s−1, except for L13 (above 10 hPa), and ΔDIR is less than 6.4.

Figure 29Seasonal wind difference parameters with colouring, as per Fig. 18b. (a) BIAS. (b) RMSVD. (c) ΔDIR.


Similar to the verification for temperature and relative humidity, the consistency checks in each pressure-binned layer for wind speed and direction are conducted (see Fig. 30). The figure shows that more than 20 % of the data for wind speed are significantly different or inconsistent in the lowest layer (below 700 hPa level) and between 100 and 20 hPa levels. For wind direction, more than 20 % of the data in most layers and more than 50 % of the data in the lowest layer and between 100 and 20 hPa levels are significantly different or inconsistent. This result appears to be inconsistent with the verification result, using the mean differences for each layer mentioned above.

Figure 30Wind consistency rank for individual pressure layers. (a) Wind speed. (b) Wind direction.


To understand this difference, the wind speed difference and the uncertainty profile at 12:00 UTC on 12 July 2019 are shown in Fig. 31 as an example. The inset figures are an enlarged view of L07 (100–70 hPa), where the wind speed was estimated as being significantly different. In Fig. 31a, which shows the wind speed profile, the overall change trends are consistent for both GDPs, but the RS92-GDP (the orange line) looks smoother than the iMS-100-GDP (the blue line). This difference in the smoothing causes the variability in wind speed differences (black line in Fig. 31b), resulting in some of the data exceeding the range of the synthetic uncertainty threshold (the green line in Fig. 31b for k=2). As a result, the ratio of the data that are estimated as consistent or in agreement is less than 95 % in the layer, and the consistency check gives significantly different or inconsistent results. Although the two GDPs are not in agreement in the layer, the differences due to this smoothing are cancelled out when averaged and, therefore, do not appear as statistical errors. Similar significant difference problems due to smoothing are found in the wind direction profiles. Figure 32 shows the wind direction difference and the uncertainty profile for the same sounding, and the inset figures show the enlarged view of L02 (700–500 hPa). Figure 32 suggests that the significant difference in wind direction, caused by the difference in the degree of the smoothing method, is as with the wind speed. Additionally, for the lowest level (below 700 hPa level, especially in the boundary layer), the differences in the positioning performance of the GPS modules immediately after launch, or the possible unstable posture of the payload due to the double pendulum motion, rig rotation, or swaying, may have caused a slight difference in the wind calculation.

Figure 31(a) Profiles of wind speed with uncertainty (shades) at 12:00 UTC on 12 July 2019. Blue and orange lines represent iMS-100-GDP and RS92-GDP, respectively. (b) Profiles of wind speed difference (iMS-100-GDP minus RS92-GDP; black line) and synthetic uncertainty (blue and green lines for k=1 and k=2, respectively) are shown.


Figure 32As per Fig. 31 but for wind direction. The differences in wind direction are calculated as the angle between the wind vectors for both GDPs, which is not considered to be clockwise/anticlockwise.


6 Comparison with a frost point hygrometer

Due to the technical limitations of the RH sensors mounted on operational radiosondes in low temperatures and dry conditions, GRUAN requires comparisons of RH data with values from reference instruments. At Tateno, the comparison flights with the radiosondes and cryogenic frost point hygrometer (CFH; Vömel et al.2007, 2016) have been conducted twice a year since 2015. However, since the R-23 (HFC-23) liquid cryogen material used to cool the mirror of CFH was regulated under the Montreal Protocol, a new frost point hygrometer, Meisei SKYDEW (Sugidachi2019), was adopted in 2020 for cooling the mirror with a Peltier element instead. Figure 33 shows results from the comparison conducted at 06:00 UTC (15:00 LT) on 21 October 2020. Although the differences in RH are significant (with ΔU exceeding the extended uncertainty of k=2) around the tropopause (92.2 hPa, 16 983.0 m, and −73.4C) and in the lower part of the troposphere (800–400 hPa), more than 80 % of iMS-100-GDP RH data, except for L02 (700–500 hPa) and L07 (100–70 hPa), are consistent or in agreement. In particular, almost all data above 70 hPa are consistent with SKYDEW. In this study, the uncertainty in SKYDEW RH is not implemented in the consistency check. If the uncertainty in SKYDEW RH is estimated, then the result is expected to be more consistent between the RH of iMS-100-GDP and SKYDEW. Figure 33c shows that the RH of iMS-100-GDP and SKYDEW is generally in agreement for the entire profile and consistent for most of the troposphere.

Figure 33Comparison of RH profiles between iMS-100-GDP and SKYDEW at 06:00 UTC (15:00 LT) on 21 October 2020. (a) Air temperature from iMS-100. The dashed line shows the tropopause height (92.2 hPa; 16 983.0 m in geopotential height). (b) RH from both. The blue and black lines show iMS-100-GDP and SKYDEW, respectively. SKYDEW RH is calculated from SKYDEW frost/dew point temperature and iMS-100 air temperature. (c) RH difference in iMS-100 from SKYDEW (black) and RH uncertainty for iMS-100-GDP (blue/green for k=1 and 2, respectively).


7 Summary

To characterise GDPs for iMS-100 and RS92, data from dual soundings conducted at Tateno from September 2017 to January 2020 are analysed in this study. The iMS-100 temperature is around 0.5 K lower than RS92-GDP for daytime observations in the stratosphere, and over 50 % of data from between 100 and 70 hPa and over 30 % from between 70 and 30 hPa show significant differences from RS92-GDP. For nighttime observations, the difference is around −0.1K, with over 80 % of data showing agreement both in the troposphere and the stratosphere. The difference for daytime measurements in the stratosphere is attributed to the correction procedures for solar radiation heating and differences in sensor characteristics.

The iMS-100 RH is around 1 % RH–2 % RH higher in the troposphere and 1 % RH lower in the stratosphere than RS92, but both GDPs are generally in agreement in the troposphere and stratosphere. The difference may be larger in places where rapid RH change occurs. A comparison flight with the SKYDEW and frost point hygrometer, shows that iMS-100 RH agrees well with SKYDEW both in the troposphere and stratosphere.

While there are some cases where significant differences for pressure are observed in the lower troposphere (≥700hPa) in the consistency check, the mean pressure difference is less than 0.4 hPa. The difference in geopotential height is around 2–3 m in the lower and middle troposphere but increases with altitude above 100 hPa level, from 10 m at 20 hPa. This relationship between height and related differences may stem from differences in the geoid model used for the two GDPs.

In wind comparison, although the consistency check based on uncertainty may estimate the wind speed and direction for both GDPs as being significantly different for each simultaneous data, the comparison parameters show good correspondence; BIAS is between −0.04 and +0.14m s−1, RMSVD is lower than 1.04 m s−1, except for above 10 hPa, and ΔDIR is smaller than 6.4 in the statistical comparison for each pressure-binned layer. This seemingly contradictory result is mainly due to the difference in the degree of smoothing for the two GDPs.

The modified data processing described in Sects. 2.2.1 and 2.2.2 will be implemented to processing for RS-11G to create new version of RS-11G-GDP, which will be evaluated with intercomparison campaigns written in Kobayashi et al. (2019, RS-11G and RS92) and Kobayashi and Hoshino (2018, iMS-100 and RS-11G). Further direct comparisons between RS-11G-GDP and iMS-100-GDP will be discussed in a future study.

This study involved the evaluation of the characteristics of iMS-100-GDP values with RS92-GDP as a reference, as the latter is certified as a GRUAN data product. GRUAN certification for iMS-100 is underway, and an ongoing analysis of GDP data is considered important for the provision of high-quality products to the climate research/monitoring community. Since February 2020, regular dual soundings of iMS-100 and Vaisala RS41 (the successor to RS92) have been ongoing. RS41-GDP is under development (von Rohden et al.2022), and the comparison results will be published when sufficient iMS-100-GDP and RS41-GDP data are available. In addition, iMS-100-GDP, RS41-GDP, and M10-GDP (Dupont et al.2020) are also candidates of the reference data in the WMO 2022 Upper-Air Instrument Intercomparison Campaign (CIMO Task Team on Upper-air Intercomparison2020) which will be conducted in 2022. These data will support further evaluation and improvements in iMS-100-GDP.

The interpolation and the estimation of uncertainty for data missing periods are discussed in some articles. For example, Fassò et al. (2020) proposed a method for temperature data, using the Gaussian process, and Colombo and Fassò (2022) attempted to apply it to RH data. These studies will be considered for future improved versions of the iMS-100-GDP.

Appendix A: Abbreviations
EPS Expanded polystyrene Styrofoam
GBAS Ground-based augmentation system
GCOS Global Climate Observing System
GDP GRUAN data product
GMDB GRUAN Meta-data Data Base
GNSS Global Navigation Satellite System
GPS Global positioning system
GRUAN GCOS Reference Upper-Air Network
ISSR Ice-supersaturated region
JMA Japan Meteorological Agency
JFMT JMA transmission format
LC Lead Centre
Meisei Meisei Electric Co., Ltd.
NCEI NOAA National Centers for
Environmental Information
RH Relative humidity
RMSVD Root mean square vector difference
SBAS Satellite-Based Augmentation System
WMO World Meteorological Organization
Code availability

The scikit-fda pacakge is available at (last access: 19 October 2022) and (Carreño et al.2020).

Data availability

The GRUAN data products for iMS-100-GDP.2 are available from (Hoshino et al.2022b). The GRUAN data products for RS-11G-GDP.1 are available from (Kizu et al.2019). The GRUAN data products for RS92-GDP.2 are available from (Sommer et al.2012).

Author contributions

SH, TS, KS, and EK developed the iMS-100 GRUAN data product, with SH and EK performing the data analysis and creating the figures. All authors provided ideas and contributed to interpretation of the results. The first draft of the paper was written by SH, with all authors, including MF and MI, contributing to improving the paper.

Competing interests

The contact author has declared that none of the authors has any competing interests.


Publisher’s note: Copernicus Publications remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.


The authors are grateful to the Aerological Observatory, the Observation Division of JMA's Atmosphere and Ocean Department, and Meisei Electric Co., Ltd., for their support and helpful advice. Thanks are also due to GRUAN Lead Centre staff, for their support.

Review statement

This paper was edited by Roeland Van Malderen and reviewed by three anonymous referees.


Bodeker, G. E., Bojinski, S., Cimini, D., Dirksen, R. J., Haeffelin, M., Hannigan, J. W., Hurst, D. F., Leblanc, T., Madonna, F., Maturilli, M., Mikalsen, A. C., Philipona, R., Reale, T., Seidel, D. J., Tan, D. G. H., Thorne, P. W., Vömel, H., and Wang, J.: Reference Upper-Air Observations for Climate: From Concept to Reality, B. Am. Meteorol. Soc., 97, 123–135,, 2016. a

Carreño, C. R., Suárez, A., Torrecilla, J. L., Berrocal, M. C., Manchón, P. M., Manso, P. P., Bernabé, A. H., Fernández, D. G., and Hong, Y.: GAA-UAM/scikit-fda: Version 0.4 (0.4), Zenodo [code],, 2020. a, b

CGMS: Consolidated report of CGMS activities (10th edition, V10), The Coordination Group for Meteorological Satellites (CGMS), Tech. rep., (last access: 3 December 2020), 2003. a

CIMO Task Team on Upper-air Intercomparison: Project Plan for the WMO Upper-Air Instrument Intercomparison,, (last access: June 2021) 2020. a

Colombo, P. and Fassò, A.: Quantifying the interpolation uncertainty of radiosonde humidity profiles, Meas. Sci. Technol., 33, 074001,, 2022. a

Cox, D. R.: The regression analysis of binary sequences, J. R. Stat. Soc. B, 20, 215–232, 1958. a

Dirksen, R. J., Sommer, M., Immler, F. J., Hurst, D. F., Kivi, R., and Vömel, H.: Reference quality upper-air measurements: GRUAN data processing for the Vaisala RS92 radiosonde, Atmos. Meas. Tech., 7, 4463–4490,, 2014. a, b, c, d, e, f, g, h, i, j

Dupont, J.-C., Haeffelin, M., Badosa, J., Clain, G., Raux, C., and Vignelles, D.: Characterization and Corrections of Relative Humidity Measurement from Meteomodem M10 Radiosondes at Midlatitude Stations, J. Atmos. Ocean. Tech., 37, 857–871,, 2020. a, b

Fassò, A., Sommer, M., and von Rohden, C.: Interpolation uncertainty of atmospheric temperature profiles, Atmos. Meas. Tech., 13, 6445–6458,, 2020. a

Hoshino, S., Sugidachi, T., Shimizu, K., Iwabuchi, M., Kobayashi, E., and Fujiwara, M.: Technical Characteristics and GRUAN Data Processing for the Meisei RS-11G and iMS-100 Radiosondes, GRUAN Lead Centre, GRUAN Technical Document 5 (Rev. 2.0), in review, 2022a. a, b, c, d, e, f

Hoshino, S., Sugidachi, T., Shimizu, K., Kobayashi, E., Fujiwara, M., and Iwabuchi, M.: iMS-100 GRUAN Data Product Version 2, GRUAN [data set],, 2022b. a

Hyland, R. and Wexler, A.: Formulations for the Thermodynamic Properties of the Saturated Phases of H2O from 173.15 K to 473.15 K, ASHRAE Trans., 89, 500–519, 1983. a, b, c, d

Immler, F. J., Dykema, J., Gardiner, T., Whiteman, D. N., Thorne, P. W., and Vömel, H.: Reference Quality Upper-Air Measurements: guidance for developing GRUAN data products, Atmos. Meas. Tech., 3, 1217–1231,, 2010. a, b, c, d

JMA: Guidelines of Radiosonde Soundings, 1995. a

Kizu, N., Sugidachi, T., Kobayashi, E., Hoshino, S., Shimizu, K., Maeda, R., and Fujiwara, M.: Technical Characteristics and GRUAN Data Processing for the Meisei RS-11G and iMS-100 Radiosondes, GRUAN Technical Document 5, GRUAN Lead Centre, (last access: 12 October 2022), 2018. a, b, c

Kizu, N., Sugidachi, T., Kobayashi, E., Hoshino, S., Shimizu, K., Maeda, R., and Fujiwara, M.: RS-11G GRUAN Data Product Version 1, GRUAN [data set],, 2019. a, b

Kobayashi, E.: Quantitative comparison of the Meisei RS-11G radiosonde and the Vaisala RS92-SGP radiosonde for characterization of routine soundings, Journal of the Aerological Observatory, 73, 11–24, 2015. a

Kobayashi, E. and Hoshino, S.: Quantitative comparison of the iMS-100 and the RS-11G GPS sondes for characterization of routine soundings, Journal of the Aerological Observatory, 75, 17–38, 2018. a, b

Kobayashi, E., Noto, Y., Wakino, S., Yoshii, H., Ohyoshi, T., Saito, S., and Baba, Y.: Comparison of Meisei RS2-91 Rawinsondes and Vaisala RS92-SGP Radiosondes at Tateno for the Data Continuity for Climatic Data Analysis, J. Meteorol. Soc. Jpn., 90, 923–945,, 2012. a

Kobayashi, E., Hoshino, S., Iwabuchi, M., Sugidachi, T., Shimizu, K., and Fujiwara, M.: Comparison of the GRUAN data products for Meisei RS-11G and Vaisala RS92-SGP radiosondes at Tateno (36.06 N, 140.13 E), Japan, Atmos. Meas. Tech., 12, 3039–3065,, 2019. a, b, c, d, e, f, g, h, i, j

Meisei Electric Co., Ltd.: GPS Radiosonde iMS-100,, last access: 24 November 2020. a, b

Nash, J., Oakley, T., Vömel, H., and Wei, L.: WMO Intercomparison of High Quality Radiosonde Systems, Yangjiang, China, 12 July–3 August 2010, WMO, Tech. rep., (last access: 10 June 2022) 2011. a

Pavlis, N. K., Holmes, S. A., Kenyon, S. C., and Factor, J. K.: The Development and Evaluation of the Earth Gravitational Model 2008 (EGM2008), J. Geophys. Res.-Sol. Ea., 117, B04406,, 2012. a

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E.: Scikit-learn: Machine Learning in Python, J. Mach. Learn. Res., 12, 2825–2830, 2011. a

Ramsay, J. O. and Silverman, B. W.: The Registration and Display of Functional Data, in: Functional Data Analysis, 2nd edn., Springer New York, New York, NY, 127–145,, 2005. a

Savitzky, A. and Golay, M. J. E.: Smoothing and Differentiation of Data by Simplified Least Squares Procedures, Anal. Chem., 36, 1627–1639,, 1964. a

Seidel, D. J., Berger, F. H., Immler, F., Sommer, M., Vömel, H., Diamond, H. J., Dykema, J., Goodrich, D., Murray, W., Peterson, T., Sisterson, D., Thorne, P., and Wang, J.: Reference Upper-Air Observations for Climate: Rationale, Progress, and Plans, B. Am. Meteorol. Soc., 90, 361–369,, 2009. a

Sommer, M.: GRUAN Data Flow and RS92 Data Product V2, in: 5th GRUAN Implementation and Coordination Meeting, De Bilt, Netherlands, (last access: 8 December 2020), 2013. a, b

Sommer, M., Dirksen, R., and Immler, F.: RS92 GRUAN Data Product Version 2, GRUAN [data set],, 2012. a, b

Sommer, M., Dirksen, R., and von Rohden, C.: Brief Description of the RS92 GRUAN Data Product (RS92-GDP), GRUAN Lead Centre, GRUAN Technical Document 4 (Rev. 2.0), (last access: 8 December 2020), 2016. a, b

Sugidachi, T.: Meisei SKYDEW Instrument: Analysis of Results from Lindenberg Campaign, in: GRUAN ICM-11, Singapore, (last access: 19 October 2022), 2019. a, b

Szantai, A., Mémin, E., Cuzol, A., Papadakis, N., Héas, P., Wieneke, B., Alvarez, L., Becker, F., and Lopes, P.: Comparison of MSG dense atmospheric motion vector fields produced by different methods, in: 2007 EUMETSAT Meteorological Satellite Conference/15th AMS Satellite Meteorology & Oceanography Conference, EUMETSAT, 24–28 September 2007, Amsterdam, The Netherlands, (last access: 12 October 2022), 2007. a

Vaisala Oyj.: Vaisala Radiosonde RS92-SGP, (last access: 24 November 2020), 2013. a, b

Vömel, H., David, D. E., and Smith, K.: Accuracy of Tropospheric and Stratospheric Water Vapor Measurements by the Cryogenic Frost Point Hygrometer: Instrumental Details and Observations, J. Geophys. Res.-Atmos., 112, D08305,, 2007. a, b

Vömel, H., Naebert, T., Dirksen, R., and Sommer, M.: An update on the uncertainties of water vapor measurements using cryogenic frost point hygrometers, Atmos. Meas. Tech., 9, 3755–3768,, 2016. a, b

von Rohden, C., Sommer, M., and Dirksen, R.: Rigging Recommendations for Dual Radiosonde Soundings, GRUAN Lead Centre, GRUAN Technical Note 7, Lindenberg, (last access: 29 March 2020) 2016. a

von Rohden, C., Sommer, M., Naebert, T., Motuz, V., and Dirksen, R. J.: Laboratory characterisation of the radiation temperature error of radiosondes and its application to the GRUAN data processing for the Vaisala RS41, Atmos. Meas. Tech., 15, 383–405,, 2022. a

Short summary
GRUAN data products (GDPs) from Meisei iMS-100 and Vaisala RS92 were compared with 59 dual sounding data. For daytime observations, the iMS-100 temperature is around 0.5 K lower than RS92-GDP in the stratosphere, but for nighttime observations, the difference is around −0.1 K, and data are mostly in agreement. For relative humidity (RH), iMS-100 is around 1–2 % RH higher in the troposphere and 1 % RH smaller in the stratosphere than RS92, but both GDPs are in agreement for most of the profile.