1 Introduction

The imbalance of the measured fluxes at the earth’s surface is known as the energy balance closure problem and micrometeorologists have been aware of it since the late 1980s (Foken 2008a; Leuning et al. 2012). Many micrometeorological experiments over low vegetation reveal that the available energy, which is the sum of the net radiation and the ground heat flux, is larger than the sum of the sensible and latent heat fluxes. These experiments include the EBEX experiment (EBEX, ‘Energy Balance Experiment’, Oncley et al. 2007), which was especially designed to study the energy balance at the earth’s surface, and the LITFASS-2003 experiment (LITFASS, ‘Lindenberg Inhomogeneous Terrain – Fluxes between Atmosphere and Surface: a long-term Study’, Beyrich and Mengelkamp 2006), which aimed to study the effect of surface heterogeneity. To conserve energy, the residual was added to the energy budget equation over low vegetation at the earth’s surface, in which for the stationary and horizontally homogeneous surface layer,

$$\begin{aligned} Res = -Q^* - (Q_\mathrm{G} + Q_\mathrm{H} + Q_\mathrm{E}), \end{aligned}$$
(1)

where \(Res\) is the residual or missing energy, \(Q^*\) is the net radiation, \(Q_\mathrm{G}\) is the ground heat flux, \(Q_\mathrm{H}\) is the sensible heat flux, and \(Q_\mathrm{E}\) is the latent heat flux. Each term in Eq. 1 is positive, as the energy is transported away from the ground.

In the past few years, despite improvements in measuring and data processing techniques (see review in Foken 2008a; Foken et al. 2011), the energy balance closure problem still remains. According to several studies using large-eddy simulation (LES), the energy balance is significantly improved with contributions from secondary circulations or turbulent organized structures (Kanda et al. 2004; Inagaki et al. 2006; Steinfeld et al. 2007). These secondary circulations are large-scale eddies (several km) and relatively stationary (either static or move very slowly). They are generated by surface heterogeneity (Stoy et al. 2013) and normally move away from the ground . Due to their large size and slow motion, their contributions to the low frequency part of the turbulent spectrum cannot be detected by the eddy-covariance (EC) measurement, which is typically averaged over a period of 30 min. This results in the underestimation of \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\), when normally measured by the EC technique.

An extension of the averaging time was suggested and expected to result in a greater contribution from the low frequency parts. Two different approaches were introduced for this task: the ogive analysis (Desjardins et al. 1989; Oncley et al. 1990) and the block ensemble average (Bernstein 1966, 1970; Finnigan et al. 2003). The ogive analysis, which requires a steady-state condition, uses the turbulent spectrum to estimate the turbulent fluxes for different frequency ranges, allowing assessment of the contribution of the low frequencies to the turbulent fluxes measured by the EC method. In Foken et al. (2006), the ogive analysis was applied to the data measured over a maize field of the LITFASS-2003 experiment and was focused mainly on data from three selected days, where the averaging time was extended up to 4 h. It was found that the time extension did not significantly increase the turbulent fluxes overall.

For the block ensemble average, which does not require a steady-state condition, low frequency contributions were added to the turbulent fluxes from long-term fluctuations over several hours to days. Detailed discussion of the block ensemble average can be found in Aubinet et al. (2012). In Mauder and Foken (2006), the block ensemble average was also applied to the dataset from the same maize field of the LITFASS-2003 experiment. The length of the selected dataset was 15 days, while the block ensemble averaging period varied from 5 min to 5 days. This study showed that the block ensemble average can close the energy balance at longer averaging times. Extensive discussion of the energy balance closure of the LITFASS-2003 experiment can be found in Foken et al. (2010).

In our study, we applied both ogive analysis and block ensemble average to the data from the LITFASS-2003 experiment, which was collected by multiple EC towers over a large heterogeneous area. This surface heterogeneity may induce secondary circulations, some of which may still exist in the vicinity of the measuring stations and influence the measured fluxes. We examine the impact of averaging time extension over different types of surface that were broadly exposed to the same atmospheric condition. However, the data obtained from different measuring stations do not always have the same sampling rate, thus minor modifications in both ogive analysis and block ensemble average were made. These modifications were validated by repeating the ogive analysis in Foken et al. (2006) and the block ensemble average in Mauder and Foken (2006). Since the energy balance of this dataset was previously analyzed over 30 min (Beyrich et al. 2006; Foken et al. 2010), we could mainly concentrate on the averaging time beyond 30 min, which is more related to the low frequency contributions.

2 Material and Method

2.1 LITFASS-2003 Experiment and Data Processing

The LITFASS-2003 experiment was performed between 19 May 2003 and 18 June 2003 near the Richard-Aßmann-Observatory of the German Meteorological Service in Lindenberg, Germany. The local time zone in this area is UTC + 1. During the experiment, there were 14 ground-based micrometeorological measuring stations over 13 sites, and two elevated measuring stations on the tower at 50 and 90 m heights. This experiment covered an area of 20 \(\times \) 20 km\(^2\) and made up of six major land-use types: grass, maize, rape, cereals (include rye, barley and triticale), lake and forest. More information about the LITFASS-2003 experiment can be found in Beyrich and Mengelkamp (2006).

To cover the most important land-use types for the LITFASS-2003 experiment, we selected the following measuring stations for our study: grass (NV2 and NV4), maize (A6), rape (A7), rye (A5), lake (FS) and forest (HV). Note that NV2 and NV4 were actually installed on the same field, and were oriented to different wind sectors to monitor turbulence at this field from all wind directions. We combined these two stations according to the wind direction and they were reported as the single station NV. Information of these selected stations can be found in Table 1.

Table 1 Summary of selected measuring stations from the LITFASS-2003 experiment during 20 May 2003, 1200 UTC–18 June 2003, 0000 UTC

All selected stations were equipped with EC systems as listed in Table 1. Four-component net radiometers and soil heat flux plates were also installed, therefore all the energy balance components in Eq. 1 were measured at each station. Details of these measurements were well described in Mauder et al. (2006) and Liebethal et al. (2005). These measurements allow estimation of the residual, which, on average, reached its maximum at 1000–1200 UTC. For low vegetation, its average value during this time ranged from 75 to 145 W m\(^{-2}\) (or 20–30 % of the available energy as shown in Table 1).

All kinds of plants store energy in their canopies. This canopy heat storage has two main contributions from the plant material (or biomass) and the air between plants. In Oncley et al. (2007), it was shown that over low vegetation, such as a cotton field, both contributions of canopy heat storage are relatively small and negligible. According to the study in maize and soybean (Meyers and Hollinger 2004), the stored energy in biomass is significant when a canopy is fully developed, while \(Q_\mathrm{G}\) is very low. During the LITFASS-2003 experiment, the maize field grew from bare soil up to approximately 0.5 m height at the end of the experiment. Therefore, the stored energy in biomass can be neglected in our analyses. However, a forest’s canopy heat storage is significant (Lindroth et al. 2010) and needs to be included in the energy budget equation (Eq. 1). Unfortunately, we did not collect all required biomass properties of the forest during the LITFASS-2003 experiment, so a forest’s canopy heat storage could not be precisely estimated. Hence, all analyses of this site were done without a canopy heat storage term. Since a forest’s canopy heat storage during the daytime would release heat back to the atmosphere during the nighttime, it is more important at the sub-diurnal scale (Haverd et al. 2007). Therefore, the omission of a forest’s canopy heat storage would have a minimal effect over a long-term basis. For the lake, due to its characteristics (e.g. large heat capacity), its energy budget cannot be described by Eq. 1. Therefore, its residual is not reported in Table 1.

During the LITFASS-2003 campaign, the raw data were processed and averaged over 30 min. For this task, all the participating groups agreed to use the software package TK2 (Mauder and Foken 2004), which has been tested and compared internationally (Mauder et al. 2008). During flux calculation, several corrections were applied. Cross-correlation analysis was used for fixing the time delay between the sonic anemometer and hygrometer, and a correction was used for the spectral loss in the high frequency range (Moore 1986). The planar-fit rotation was used to align the sonic anemometer with a long-term mean streamline (Wilczak et al. 2001); a correction was used to convert the sonic temperature, as recorded by the sonic anemometer, to the actual temperature (Schotanus et al. 1983); a correction was used to correct for density fluctuation (Webb et al. 1980). Crosswind correction was used to account for a different type of sonic anemometer (Liu et al. 2001), and a correction was used for the cross sensitivity between \({\hbox {H}_2\hbox {O}}\) and \({\hbox {O}_2}\) molecules (Tanner et al. 1993), which was only applied for the Krypton hygrometer KH20 (deployed in A5 and A7). More details of these corrections can be found in Mauder et al. (2006) and Foken et al. (2012).

After all flux corrections, quality flags were assigned to each 30-min period. These quality flags are the steady-state flag, the integral turbulence characteristic (ITC) flag (Foken and Wichura 1996) and combined flag. The steady-state flag is a result of the steady-state test and represents the stationarity of the data. The ITC flag represents the development of turbulent conditions, which is the result of the flux variance similarity test. The combined flag is the combination of the steady-state and ITC flags. All these flags range from 1 to 9 (from best to worst). High quality data, considered suitable for fundamental scientific research, have flag values of 1–3. More details of the data quality analysis can be found in Foken et al. (2004, 2012).

Besides flux calculations, flux corrections and assignment of data quality flags, the package TK2 can also generate short-term averages and covariances at 5-min or 10-min intervals. Due to the limited storage capacity, these short-term average data points were stored instead of the raw data from several measuring stations. However, the statistics for longer periods can be reconstructed from this short-term information with the following (Foken 2008b),

$$\begin{aligned} \overline{a^{\prime } b^{\prime }} = \frac{1}{M-1} \left[ (U-1) \sum \limits _{j=1}^N \left( \overline{a^{\prime }b^{\prime }} \right) _j + U \sum \limits _{j=1}^N \overline{a }_j \,\, \overline{b}_j - \frac{U^2}{M-1} \sum \limits _{j=1}^N \overline{a }_j \sum \limits _{j=1}^N \overline{b}_j \right] , \end{aligned}$$
(2)

where \(\overline{a^{\prime } b^{\prime }}\) is the long-term covariance and \(M\) is the number of measurement points of the long-term time series. This long-term time series consists of \(N\) short-term time series, whose number of measurement points is \(U\); \(\left( \overline{a^{\prime } b^{\prime }} \right) _j \) is the short-term covariance, and \(\overline{a }_j\) and \(\overline{b}_j\) are the short-term averages. These short-term averages are derived from raw data, to which no flux correction has been applied. Therefore, any necessary flux corrections must be included when using these short-term averages for flux calculations. These short-term average data points from selected stations were used for both ogive analysis and block ensemble average calculations. Short-term averaging intervals of selected stations are shown in Table 1.

2.2 Data Selection

In most selected measuring stations, ground heat flux and radiation data are only available from 20 May 2003, 1200 UTC, so the period 20 May 2003, 1200 UTC–18 June 2003, 0000 UTC was used in our study. To ensure high data quality, as well as to minimize the irrelevant factors that might influence turbulent fluxes, we imposed sets of data selection criteria to the ogive analysis and block ensemble average separately. For the ogive analysis, we increased the averaging time to up to 4 h. This 4-h period consists of eight consecutive subperiods (or blocks) of 30 min. We performed the ogive analysis over any 4-h period only if all blocks satisfied the selection criteria.

The first selection criterion is identical to Mauder et al. (2006), which is that the sonic anemometers must not be disturbed by either the internal boundary layer resulting from the heterogeneity of the surface, or the flow distortion caused by obstacles. The internal boundary-layer height was estimated from

$$\begin{aligned} z_\mathrm{m} \le \delta =0.3 \sqrt{x}, \end{aligned}$$
(3)

(Raabe 1983) where \(z_\mathrm{m}\) is the measurement height, \(\delta \) is the internal boundary-layer height and \(x\) is the distance from the sensor to boundary of the next land-use class. To keep the measurement undisturbed, \(z_\mathrm{m}\) must not exceed \(\delta \). Hence, we rejected any wind direction whose corresponding \(x\) did not satisfy Eq. 3. The undisturbed wind sectors (\(\theta \)) of each measuring station, from both internal boundary layer and flow distortion, are listed in Table 1. Additionally, footprint analysis was used to confirm that the target land-use type has a significant contribution to our measurement. This contribution varied over the stability range. We further rejected any wind sectors whose contribution from target land-use type is less than 80 %.

The next data selection criterion is a steady-state condition of the time series, which is indicated by the steady-state flag (Sect. 2.1). We only accepted data with high quality flags (flag 1–3). In this study, we did the ogive analysis of the energy balance components (\(Q_\mathrm{H}\) and \(Q_\mathrm{E}\)) and CO\(_2\) flux (\(F_\mathrm{c}=\overline{w^{\prime } c^{\prime }}_\mathrm{CO_{2}}\)) separately. For the energy balance components, we only considered the steady-state flags of friction velocity (\(u_{*}\)), \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\). We only performed the ogive analysis on any periods during which these three steady-state flags qualified simultaneously. For \(F_\mathrm{c}\), we considered only steady-state flags of \(u_{*}\) and \(\mathrm{C}O_{2}\) flux, and performed the ogive analysis on any periods where these two steady-state flags were accepted simultaneously.

We avoided the transition period by excluding from our analysis the time period covering one hour before to one hour after both sunrise and sunset. We also specified the threshold value of each turbulent flux as a minimum requirement in our analysis. For \(u_*\), which indicates the intensity of turbulence (Massman and Lee 2002), the threshold value is 0.1 m s\(^{-1}\); this was set to rule out very small turbulent fluxes, which can result from instrumentation noise. This limit normally excludes periods with very weak flow as well. For \(Q_\mathrm{H}\), \(Q_\mathrm{E}\) and \(F_\mathrm{c}\), threshold values were formulated to avoid complication with their measurement errors. According to Mauder et al. (2006), based on a 30-min averaging time, the measurement errors of \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\) are 10–20 % of the turbulent flux at 30 min, or 10–20 W m\(^{-2}\), whichever is larger. For \(u_*\) and \(F_\mathrm{c}\), the measurement errors are 0.02–0.04 m s\(^{-1}\) and 0.5–1 \(\upmu \)mol m\(^{-2} \)s\(^{-1}\), respectively (Meek et al. 2005). Therefore, we set the threshold values of \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\) to be 20 W m\(^{-2}\), and the threshold value of \(F_\mathrm{c}\) to be 1 \(\upmu \)mol m\(^{-2}\) s\(^{-1}\). Unusually large uncertainty of \(F_\mathrm{c}\) during the nighttime was taken into account by using only data periods with \(u_* >\) 0.25 m s\(^{-1}\) (Hollinger and Richardson 2005).

Similar selection criteria cannot apply to the block ensemble average, as this involves averaging times of several hours to days. Therefore, the quality control of this part was done by discarding any periods with more than 10 % of missing raw data. This missing data could result from various factors, e.g. electrical failure.

2.3 Modified Ogive Analysis

The ogive analysis was introduced by Desjardins et al. (1989) and Oncley et al. (1990) to investigate the flux contribution from each frequency range as well as to determine suitable averaging periods to capture most of the turbulent fluxes. The ogive function of the turbulent flux (\(og_{w,c}\)) is defined as the cumulative integral of the cospectrum of the turbulent flux (\(Co_{w,c}\)) starting with the highest frequency,

$$\begin{aligned} og_{w,c} (f_0) = \int ^{f_0}_{\infty } Co_{w,c} (f) \;\hbox {d}f, \end{aligned}$$
(4)

where \(w\) is the vertical wind velocity, \(c\) is the horizontal wind velocity component or a scalar quantity (i.e. temperature and humidity), and \(f\) is a frequency that corresponds to a time period (\(\tau \)) as

$$\begin{aligned} \tau = \frac{1}{f}. \end{aligned}$$
(5)

This analysis was applied to data obtained over the maize field (A6) of the LITFASS-2003 in Foken et al. (2006), where the ogive function was calculated from raw 20 Hz data over a 4-h period and mainly focused on three selected days (7–9 June 2003). It was shown that the ogive curves could be classified into three cases. Case 1 is where the ogive curve exhibits an asymptotic behavior toward the low frequency within a 30-min period. This indicates that the 30-min averaging time is sufficient to capture most of the turbulent fluxes. Case 2 is where the ogive curve shows the extreme value (peak) within a 30-min period, meaning that the total turbulent fluxes have been reached before 30 min. Hence a longer averaging time obviously reduces the flux and a period shorter than 30 min would be sufficient to capture most of the turbulent fluxes. Case 3 is where the ogive curve does not converge within a 30-min period. This implies that there is a significant contribution from the low frequency part of the turbulent spectrum and a 30-min averaging time is not sufficient to capture most of the fluxes.

As mentioned in Sect. 2.1, only the short-term average data at every 5 or 10 min exist for all selected sites, and we therefore developed a modified ogive analysis to deal with these data. According to the spectral analysis, the spectra calculated from high and low frequency data behave similarly in the low frequency region (Kaimal et al. 1972). This means that we can use the turbulent spectrum calculated from the short-term average data to estimate the change in turbulent fluxes after 30 min, without any information prior to 30 min (Fig. 1).

Fig. 1
figure 1

The short-term average time series can estimate the turbulent flux at a 30-min period \((\overline{F}_{30})\) and its evolution after that (grey solid lines in a grey band). The error band of width \(2\eta \) (grey band) was defined for identifying the ogive case. See Table 2 for ogive case definition

Table 2 Ogive case definition in analogy to Foken et al. (2006)

We calculated the turbulent cospectra of the short-term average data with a standard fast Fourier transform (FFT) method. To avoid influences from the diurnal effect, we still keep the time extension up to 4 h as in Foken et al. (2006). We then determined the change in turbulent fluxes after a 30-min period from the cumulative integral of the cospectra starting from the frequency that corresponds to a period of 30 min, and set its maximum value to be the maximum flux difference (\({\Delta }_{\max }\)), i.e.,

$$\begin{aligned} {\Delta }_{\max } = \mathrm{max} \left( \int ^{\tau }_{\tau =30} Co_{w,c} (f) \hbox {d}f \right) . \end{aligned}$$
(6)

We compared \({\Delta }_{\max }\) with the turbulent flux at 30 min (\(\overline{F}_{30}\)), which we can estimate in two different ways. Firstly, by averaging the fluxes from each 30-min block, \((\overline{w^{\prime } c^{\prime }})_j\), together as

$$\begin{aligned} \overline{F}_{30}=\frac{1}{8} \sum \limits _{j=1}^8 (\overline{w^{\prime } c^{\prime }})_j. \end{aligned}$$
(7)

Secondly, we calculated the total flux over a 4-h period (\(\overline{F}_{4\,\mathrm{h}}\)) from short-term average data with the help of Eq. 2 and the turbulent flux after a 30-min period (\(F_{\tau > 30}\)) from the cumulative integral of the cospectra from the lowest frequency (\(f_{\min }\)) to the frequency corresponding to a 30-min period,

$$\begin{aligned} F_{\tau > 30} =\int ^{\tau =30}_{f_{\min }} Co_\mathrm{w,c} (f) \hbox {d}f. \end{aligned}$$
(8)

The difference between \(\overline{F}_{4\,\mathrm{h}}\) and \(F_{\tau >30}\) can give us the estimation of \(\overline{F}_{30}\) as

$$\begin{aligned} \overline{F}_{30}= \overline{F}_{4\,\mathrm{h}} - F_{\tau >30}. \end{aligned}$$
(9)

Both estimations in Eqs. 7 and 9 give compatible values of \(\overline{F}_{30}\). We then set the error band of width \(2\eta \) for the turbulent flux at a 30-min period (Fig. 1). If \({\Delta }_{\max }\) is still confined within this band, it indicates that the turbulent flux difference after 30 min is not significant, which conforms to case 1 in Foken et al. (2006). If the maximum flux difference after a 30-min period exceeds this band, this means the turbulent flux difference is significant and could be classified into two cases, depending on the changes in turbulent fluxes after a 30-min period. It is equivalent to case 2 in Foken et al. (2006), when the size of turbulent flux decreases and case 3, when the size of turbulent flux increases. In this study, we set \(\eta \) to be 10 % (or 20 %) of \(\overline{F}_{30}\), which must not be smaller than the measurement errors of each turbulent flux (Sect. 2.2). The ogive case definition in analogy to Foken et al. (2006) is shown in Table 2.

To extend the investigation beyond three golden days and cover more land-use types, the modified ogive analysis was applied to a selected dataset (as described in Sect. 2.2) from selected ground-based stations of the LITFASS-2003 experiment (Table 1). The modified ogive analysis was applied to all selected sites for the energy balance component, while for \(F_\mathrm{c}\), it was only applied to these sites: maize, grass and forest. This is because \(F_\mathrm{c}\) measurements were not available at the rye (A5) and rape (A7) sites (both equipped with KH20), and \(F_\mathrm{c}\) was very low over the lake. Note that in this study, we did not apply the flux correction as mentioned in Sect. 2.1. Since each point of the cospectrum corresponds to the turbulent flux at a different duration, the choice of suitable duration for the flux corrections would be ambiguous. According to Mauder and Foken (2006), these flux corrections would reduce the residual by 17 %, and we may therefore assume that this reduction would have reflected in an increase of the sensible and latent heat fluxes.

2.4 Block Ensemble Average

The averaging operator that we apply to the single tower EC measurement is the time average. Over a period \(P\), the time average of any variable \(a(t)\) is (see Fig. 2 in Finnigan et al. 2003)

$$\begin{aligned} \overline{a(t)}=\frac{1}{P} \int _0^P a(t) \;\hbox {d}t. \end{aligned}$$
(10)

This averaging operator can apply to the mass balance equation as long as it satisfies the Reynolds averaging rules (Feriet 1951; Bernstein 1966, 1970). A standard approach is to impose a steady-state condition over a period \(P\), which makes the time average constant in this period (\(\overline{a(t)}=\overline{a}\)). Then any variable \(a(t)\) can be decomposed into mean (\(\overline{a}\)) and fluctuation parts (\(a'(t)\)) with the Reynolds decomposition as

$$\begin{aligned} a(t)=\overline{a}+a'(t). \end{aligned}$$
(11)

When Eq. 11 is applied to the product of vertical velocity \(w\) and variable \(c\), which can be horizontal wind velocity or a scalar quantity, we obtain

$$\begin{aligned} \overline{w(t)c(t)} = \overline{w } \, \overline{c } + \overline{w^{\prime }c^{\prime }}, \end{aligned}$$
(12)

which represents the mean vertical transport of a scalar or momentum over the period \(P\). This period must be long enough to capture most of the atmospheric turbulence, yet it must not violate a steady-state condition. The typical value of \(P\) in most EC measurements is 30 min. We can further simplify Eq. 12 by applying coordinate rotation, which sets \(\overline{w}\) to zero, e.g., the double rotation (Kaimal and Finnigan 1994).

When the averaging period \(P\) is extended to be much longer than 30 min, it is very difficult to maintain a steady-state condition. Without a steady-state condition, the Reynolds averaging rules no longer hold, in which case the time average is no longer a good representative statistic. Finnigan et al. (2003) and Bernstein (1966, 1970) proposed using the block ensemble average as it always obeys the Reynolds averaging rules, allowing the formulation to be carried out without a steady-state condition.

Suppose we extend our period of interest to \(NP\), which consists of \(N\) consecutive blocks (or subperiods, or runs) of period \(P\). Let a subscript \(n\) represent the \(n\)th block, whose time average of any variable \(a_n(t)\) in this block is \(\overline{a}_n(t)\). This time average becomes a function of time because it can vary from block to block. The block ensemble average of all \(N\) blocks (denoted by \(\langle \,\, \rangle \)) of \(a_n(t)\) over period \(NP\) is

$$\begin{aligned} \left\langle \overline{a} \right\rangle =\frac{1}{N} \sum \limits _{n=1}^N \overline{a}_n(t), \end{aligned}$$
(13)

where \(\left\langle \overline{a} \right\rangle \) is always constant over the period \(NP\) and obeys Reynolds averaging rules. This allows us to use the block ensemble average operator with the mass balance equation. The time average of each block \(\overline{a}_n(t)\) deviates from \(\left\langle \overline{a} \right\rangle \) by \(\tilde{a}_n(t)\),

$$\begin{aligned} \tilde{a}_n(t) = \overline{a}_n(t) - \left\langle \overline{a} \right\rangle . \end{aligned}$$
(14)

Hence we replace the Reynolds decomposition by the triple decomposition, in which any variable in the \(n\)th block can be separated into three parts as (see Fig. 3 in Finnigan et al. 2003)

$$\begin{aligned} a_n(t)=\left\langle \overline{a} \right\rangle + \tilde{a}_n(t) +a^{\prime }(t), \end{aligned}$$
(15)

where \(\tilde{a}_n(t)\) is the block-to-block fluctuation with contributions from eddies with time scales from \(P\) to \(NP\), while \(a^{\prime }(t)\) is an instantaneous fluctuation. This triple decomposition leads to the block ensemble average of the vertical transport of momentum or scalar over \(N\) blocks of period \(P\) as (we drop the subscript \(n\) and omit \(t\))

$$\begin{aligned} \left\langle \overline{w(t)c(t)} \right\rangle =\left\langle \overline{wc} \right\rangle =\left\langle \overline{w} \right\rangle \left\langle \overline{c} \right\rangle +\left\langle \tilde{w} \tilde{c} \right\rangle +\left\langle \overline{w^{\prime }c^{\prime }} \right\rangle . \end{aligned}$$
(16)

This shows that the mean vertical flux average over a period \(NP\) does not only depend on the usual turbulent flux \(\overline{w^{\prime }c^{\prime }}\), but also depends on the flux caused by block to block variations \(\tilde{w} \tilde{c}\) (also known as ‘mesoscale flux’ in the literature, e.g. Nakamura and Mahrt 2006). In Bernstein (1966), the moving average or overlapped block ensemble average was used instead of a non-overlapped one as in Finnigan et al. (2003).

To use the block ensemble average, every single block in period \(NP\) must be in the same coordinate system: the long-term coordinate. It has been shown in Finnigan et al. (2003) that a period-to-period rotation, e.g. the double rotation, is not a long-term coordinate. It sets \(\overline{w}\) of each \(n\)th period to zero and acts as a high-pass filter. In our analysis, we obtained the long-term coordinate through the planar fit rotation (Paw et al. 2000; Wilczak et al. 2001), which determines the rotation angle from multiple periods. This rotation set the block ensemble average of vertical velocity of the period \(NP\) to zero (\(\left\langle \overline{w} \right\rangle = 0 \)), while the mean vertical velocity in each period \(P\) is not necessarily zero. Thus the block ensemble average of the vertical flux becomes

$$\begin{aligned} \left\langle \overline{wc} \right\rangle = \left\langle \tilde{w}\tilde{c} \right\rangle + \left\langle \overline{w^{\prime }c^{\prime }} \right\rangle , \end{aligned}$$
(17)

where, according to Finnigan et al. (2003), \(\tilde{w}\tilde{c}\) has two roles, viz.

  1. 1.

    To balance the unsteady horizontal flux divergence and transient changes in source and storage terms.

  2. 2.

    To carry the low frequency contribution to the long-term vertical flux.

The first role can cause \(\tilde{w}\tilde{c}\) to become very large in any arbitrary period, which can be much larger than the mean vertical flux itself. It was suggested that the period \(NP\) must be long enough to suppress and minimize the effect of the first role. Then only the second role would contribute to the vertical flux. In our case, in addition to suppressing the transient effect, we would like to suppress the diurnal effect as well, so an observation period over a few days would help to balance the strong daytime fluxes with the weak nighttime fluxes as well as suppress any extreme days in between. Therefore, only the low frequency part of the diurnal effects would remain. However, an observation period over a few days would also intensify any errors in \(\tilde{w}\tilde{c}\), typically from instrumentation drift, gaps and synoptic-scale events. Since our observation period lasted only about a month and all sensors were carefully checked, instrumentation drift can be neglected. Hence, we need to select an observation period not influenced by any synoptic events, and with minimum gaps, to minimize the errors.

This approach was applied with the dataset from the Amazonian rain forest in Finnigan et al. (2003), where it was shown that the residual goes to zero at around 4 h. A similar strategy was applied to the 15-day dataset from the maize field (A6) of the LITFASS-2003 experiment during the period 2 June 2003, 1800 UTC–18 June 2003, 0000 UTC (Mauder and Foken 2006), which we also used in our study as a period \(NP\). Overlapping blocks ensemble averages were used, with the starting point of each consecutive block being shifted by 5 min, where the period \(P\) of the block ensemble average was varied from 5 min to 5 days. Flux corrections were applied as described in Sect. 2.1 in each individual block. It was shown that the energy balance was closed within a day and mainly due to the increase of \(\left\langle Q_\mathrm{H} \right\rangle \).

In the present study, to investigate whether this approach could generally close the energy balance, we applied the block ensemble average to selected ground-based stations of the LITFASS-2003 experiment as listed in Table 1 and used an identical period as in Mauder and Foken (2006). We made slight changes compared to Mauder and Foken (2006), because the turbulent data from lake and forest as well as the ground heat fluxes and net radiations from most selected stations are only available every 10 min. Our block ensemble period \(P\) was varied from 10 min to 5 days, with the starting points of each consecutive block being shifted by 10 min. We also used the same flux corrections as in Mauder and Foken (2006).

2.5 Scale Analysis

Since we are interested in how secondary circulations contribute to the low frequency part of the turbulent spectra, we used the wavelet analysis to resolve the underlying scale of motion. The wavelet analysis routine employed in this study is similar to Mauder et al. (2007), which is based on the algorithm described in Torrence and Compo (1998). The Morlet wavelet was used in this routine due to its localization strength in the frequency domain. We can only apply the wavelet analysis to the data from rye, maize and grassland, whose high frequency data are available. The CO\(_2\) flux is not discussed, as it is not related to the energy balance. Other than the high frequency data requirement, the wavelet analysis also consumes large amounts of computing resources, so it is almost impossible to apply over quite large datasets. Therefore, a specific period when large-scale structures exist needs to be identified before the wavelet analysis is performed over this specific period.

3 Results and Discussion

3.1 Modified Ogive Analysis

As mentioned in Sect. 2.3, we investigated the impact of averaging time extension with the modified ogive analysis on energy balance components and \(F_\mathrm{c}\) separately. Our data selection criteria (Sect. 2.2) ruled out most nighttime periods in both analyses, because their turbulent fluxes were below thresholds. We expected measuring stations with broader undisturbed wind sectors, namely rye (A5), grass (NV) and forest (HV), to have more qualified periods. This was confirmed by the highest number of qualified periods from grass and rye stations, however, the number of qualified periods of the forest station for the modified ogive analysis of the energy balance components was less than for the other two measuring stations. This is because measurements of \(Q_\mathrm{E}\) at the forest were often rejected due to poor steady-state conditions during daytime. This contrasts with data from the lake (FS), where \(Q_\mathrm{H}\) was often rejected for the same reason. Over low vegetation, steady-state flags of \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\) were normally qualified during 0600–1600 UTC. Some random unsteady periods mostly appeared in the afternoon. For all selected measuring stations, steady-state flags of \(F_\mathrm{c}\) (if measurements were available) were randomly disqualified throughout the day, while steady-state flags of \(u_*\) were mostly qualified. Hence, passing the steady-state criterion is mainly dependent on the stationarity of \(Q_\mathrm{H}\), \(Q_\mathrm{E}\) and \(F_\mathrm{c}\). In the end, at each measuring station, only 5–20 % of available periods were left for the modified ogive analysis, and these periods occurred mainly during 0600–1600 UTC. For the energy balance components, all periods had unstable stratification, while for \(F_\mathrm{c}\), there were a few periods with stable stratification.

The results of the modified ogive analysis of energy balance components and \(F_\mathrm{c}\) are shown in Tables 3 and 4, respectively. The different analyses both gave very similar results for \(u_*\). Hence, only the results of \(u_*\) from the modified ogive analysis of \(F_\mathrm{c}\) (Table 4) are shown. In these two Tables, each measuring station is identified by the canopy type and the station code (column 1). We report the number of qualified periods for the modified ogive analysis (Tot no., column 1), the average of \(\overline{F}_{30}\) (\(\left\langle \overline{F}_{30} \right\rangle \), column 4, 6 and 9), and the percentage of qualified periods in each ogive case (no., column 5, 8 and 11). All sets of information are reported at two different sizes of error bands (\(\eta \)), 10 and 20 %, which must be larger than the threshold fluxes (Sect. 2.2). For cases 2 and 3, the average of maximum flux difference (\(\left\langle {\Delta }_{\max } \right\rangle \), columns 7 and 10) is also presented.

Table 3 Results from the modified ogive analysis of the energy balance components (\(Q_\mathrm{H}\) and \(Q_\mathrm{E}\)) from selected stations of the LITFASS-2003 experiment between 20 May 2003, 1200 UTC–18 June 2003, 0000 UTC
Table 4 Results from the modified ogive analysis of friction velocity (\(u_*\)) and CO\(_2\) flux (\(F_\mathrm{c}\))

\(\left\langle \overline{F}_{30} \right\rangle \) of \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\) were closely grouped over low vegetation. \(\left\langle \overline{F}_{30} \right\rangle \) of \(Q_\mathrm{H}\) was largest over the forest and smallest over the lake, and vice versa for \(Q_\mathrm{E}\). Over lake and low vegetation, the modified ogive analysis classified most qualified periods of both \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\) as case 1. This suggests that a 30-min averaging time is generally sufficient to capture most turbulent fluxes. However, there were significant numbers of cases 2 and 3 of both \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\) from rye, grass, maize and, remarkably, forest stations. These periods of cases 2 and 3 of rye, grass and maize sites were closely related to the stationarity of \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\) over a 4-h period. For these three sites, periods of case 1 usually had a 4-h steady-state flag of 1 for \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\), while cases 2 and 3 usually had steady-state flags of 2 or greater. This relation was not readily apparent in the forest site, implying that when the atmosphere becomes less stationary at longer averaging time, the measured fluxes over low vegetation can be either increased or decreased. If we restrict our consideration to rye, grass, maize and forest sites, we found that the number of case 3 was normally greater than the number of case 2 in both \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\). This would tell us that the averaging time extension most likely increases \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\). The average maximum flux difference (\(\left\langle {\Delta }_{\max } \right\rangle \)) for \(Q_\mathrm{H}\) was mostly higher than for \(Q_\mathrm{E}\). \(\left\langle {\Delta }_{\max } \right\rangle \) is increased with larger size of an error band (\(\eta \)), while lower numbers of cases 2 and 3 were observed. This would indicate that the fewer periods left had a larger \(\left\langle {\Delta }_{\max } \right\rangle \). However, even with the greatest \(\left\langle {\Delta }_{\max } \right\rangle \) added on top of flux corrections, the energy increase is still not sufficient to close the energy balance. Furthermore, from scalar similarity of \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\), we expected these fluxes to increase or decrease together. This means we should see case 2 or case 3 in both \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\) simultaneously, which was rarely observed.

Values of \(\left\langle \overline{F}_{30} \right\rangle \) of \(u_*\) were highest over the forest and smallest over the lake, and were closely grouped together over low vegetation. Our modified ogive analysis classified most periods from all sites as case 1. This suggests that the time extension has almost no impact on \(u_*\), regardless of canopy types.

For \(F_\mathrm{c}\), all sites gave comparable values of \(\left\langle \overline{F}_{30} \right\rangle \). Case 1 was still in the majority, with a larger fraction of cases 2 and 3 than the energy balance components. Forest also had larger fraction of cases 2 and 3 than did low vegetation. Overall, there was a greater number of case 3 than number of case 2, and \(\left\langle {\Delta }_{\max } \right\rangle \) also increased with \(\eta \). The 4-h steady-state flags were normally 1 for case 1 and higher for cases 2 and 3. However, case 2 generally had higher steady-state flags than case 3. This suggests that when the atmosphere becomes less stationary at longer averaging times, the measured values of \(F_\mathrm{c}\) tend to increase. However, when the degree of unsteadiness becomes stronger, the measured values of \(F_\mathrm{c}\) start to decrease.

3.2 Block Ensemble Average

The block ensemble averages (Eq. 17) for all selected sites during 2 June 2003, 1800 UTC–18 June 2003, 0000 UTC, are shown in Fig. 2. We chose this period as our observation period \(NP\) to repeat Mauder and Foken (2006), with some minor modifications (Sect. 2.4). We found that our result from the maize station (Fig. 2d) differed from the original by less than the measurement errors of \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\). Therefore, these modifications still give the same results and we can confidently apply them to the data from other selected measuring stations.

Fig. 2
figure 2

Block ensemble averages of sensible heat flux and latent heat flux (Eq. 17 with \(c\) as temperature and absolute humidity), and their corresponding residuals, during 2 June 2003, 1800 UTC–18 June 2003, 0000 UTC of selected sites in the LITFASS-2003 experiment: a lake, b forest, c rye, d maize (from Mauder and Foken (2006)), e grass and f barley

The outcome of the block ensemble averaging was quite unexpected to us. It could close the energy balance over only maize, rye and rape sites. For maize and rye sites, the closures were at around 15–30 h, which is close to the results obtained in Mauder and Foken (2006). These closures were mainly caused by an increase of \(\left\langle Q_\mathrm{H} \right\rangle \) with a longer block ensemble averaging period \(P\). For the rape site, both \(\left\langle Q_\mathrm{H} \right\rangle \) and \(\left\langle Q_\mathrm{E} \right\rangle \) were approximately constant at all values of \(P\). During the observation period, this site was also influenced by rain events in the southern part of the LITFASS area. Therefore, its closure at very large values of \(P\) was not enhanced by the block ensemble average. For grassland and lake, \(\left\langle Q_\mathrm{H} \right\rangle \) was decreased with greater \(P\), which was canceled by the increase in \(\left\langle Q_\mathrm{E} \right\rangle \), and caused the residual to be approximately constant at all values of \(P\). For lake and forest sites, we must interpret the results carefully, because the lake has different characteristics from other terrain sites and we cannot precisely estimate the canopy heat storage (Sect. 2.1) of the forest from our data.

At all sites, both \(\left\langle Q_\mathrm{H} \right\rangle \) and \(\left\langle Q_\mathrm{E} \right\rangle \) were approximately constant within the first few hours. Over longer \(P\), \(\left\langle Q_\mathrm{E} \right\rangle \) was more steady than \(\left\langle Q_\mathrm{H} \right\rangle \). The inflection at the diurnal scale was found at all sites of both \(\left\langle Q_\mathrm{H} \right\rangle \) and \(\left\langle Q_\mathrm{E} \right\rangle \). As all these selected sites are practically in the same \(20 \times 20\) km\(^2\) area, the diurnal effects should not be very different and the degree of inflection should be comparable. Therefore, the stronger inflection over some sites and fluxes may not be entirely caused by the diurnal effects.

As the block ensemble average could not close the energy balance for all selected sites from 2 June 2003, 1800 UTC to 18 June 2003, 0000 UTC, we need to determine the reason for this and whether it would be the same in a different observation period \(NP\). We know that the \(\tilde{w}\tilde{c}\) term of the block ensemble average is related to the low frequency flux contribution. In principle, \(\tilde{w}\tilde{c}\) represents the flux contributions beyond the averaging period \(P\). If we set \(P\) to be 30 min, \(\tilde{w}\tilde{c}\) would represent additional flux after the 30-min averaging time. Hence, long-term observation of \(\tilde{w}\tilde{c}\) would show variation of additional fluxes from low frequency contributions, which may be related to observed block ensemble average fluxes. These variations can be observed more clearly when the observation period \(NP\) is sufficiently long to suppress any transient effects in the block ensemble average fluxes.

Our observation period \(NP\), which covered an entire period of the LITFASS-2003 experiment, was 20 May 2003, 1200 UTC–18 June 2003, 0000 UTC. We used \(\tilde{w}\tilde{c}\) from all 30-min non-overlapping blocks (\(P=30\) min) within this period \(NP\) to construct the Hovmøller diagrams of \(\tilde{Q}_\mathrm{H}\) (mesoscale flux of sensible heat or \(\tilde{w}\tilde{T}\) in units of energy, \(T\) is temperature) and \(\tilde{Q}_\mathrm{E}\) (mesoscale flux of latent heat or \(\tilde{w}\tilde{a}\) in units of energy, \(a\) is absolute humidity). These diagrams show the variation of additional fluxes beyond the 30-min averaging interval. According to Sect. 2.4, \(\tilde{w}\tilde{c}\) can be very large in any arbitrary blocks. We therefore expected to observe some randomly occurring large \(\tilde{Q}_\mathrm{H}\) and \(\tilde{Q}_\mathrm{E}\) in these diagrams.

The Hovmøller diagrams of \(\tilde{Q}_\mathrm{H}\) for rye and grassland are shown in Fig. 3. Throughout the entire experiment, we found large \(\tilde{Q}_\mathrm{H}\) more often than large \(\tilde{Q}_\mathrm{E}\). We started with the period during 2 June 2003, 1800 UTC–18 June 2003, 0000 UTC. Within this period, large \(\tilde{Q}_\mathrm{H}\) were mainly positive for the rye (Fig. 3a) and maize (not shown) sites, and mainly negative over the grassland (Fig. 3b). These observations are consistent with the observed block ensemble average fluxes, in which \(\left\langle Q_\mathrm{H} \right\rangle \) increased with increasing \(P\) for the rye and maize sites, and vice versa for the grassland (Fig. 2). The lake site is more dominated by large negative \(\tilde{Q}_\mathrm{H}\) (not shown), which is consistent with the decrease of \(\left\langle Q_\mathrm{H} \right\rangle \) at longer \(P\). There were only few large \(\tilde{Q}_\mathrm{E}\) at all sites, which are consistent with approximately constant \(\left\langle Q_\mathrm{E} \right\rangle \) at all values of \(P\) over rye, maize, rape and forest sites. However, over lake and grassland, these few large \(\tilde{Q}_\mathrm{E}\) were extremely large when compared to their own block ensemble averages, and caused \(\left\langle Q_\mathrm{E} \right\rangle \) to increase with increasing \(P\).

Fig. 3
figure 3

The Hovmøller diagrams of \(\tilde{Q}_\mathrm{H}\) (mesoscale flux of sensible heat) from a rye, b grass and c 90-m tower, representing \(\tilde{w}\tilde{T}\) of each 30-min block in units of energy. Series of large \(\tilde{Q}_\mathrm{H}\) were observed in rye (positive) and grass (mainly negative) during 1 June 2003–5 June 2003. These patterns are related to secondary circulations, which is consistent with frequent observations of large \(\tilde{Q}_\mathrm{H}\) at 90-m height. Each colour depicts the flux in W m\(^{-2}\).

More interestingly, large \(\tilde{Q}_\mathrm{H}\) were observed consecutively for a few days during 1 June 2003–5 June 2003, over rye, maize, grass and lake. This period was the dry period between the rain events and was not influenced by any significant synoptic events. These large \(\tilde{Q}_\mathrm{H}\) were positive for rye and maize, and mainly negative for grass and lake. Large \(\tilde{Q}_\mathrm{E}\) was not found in this same period. As described in Sect. 2.4, large \(\tilde{Q}_\mathrm{H}\) (or large \(\tilde{w}\tilde{T}\)) could compensate for a strong horizontal divergence in an individual block. However, consecutive occurrences indicate that they were certainly not transient effects. A strong horizontal divergence would imply a strong horizontal advection, which is related to secondary circulations. Hence, we believe that these patterns of large \(\tilde{Q}_\mathrm{H}\) were caused by near-surface secondary circulations. To support this statement, we inspected the Hovmøller diagram of \(\tilde{Q}_\mathrm{H}\) obtained from the measurement at 90 m height (M90). At this height, there always exist secondary circulations, which means that we should observe series of large \(\tilde{Q}_\mathrm{H}\) more often than in ground measurements. We did actually observe series of large positive and negative \(\tilde{Q}_\mathrm{H}\) throughout the entire period of the LITFASS-2003 experiment (Fig. 3c).

To observe the effect of near-surface secondary circulations more clearly, we chose 1 June 2003, 1500 UTC–5 June 2003, 1500 UTC as the new observation period \(NP\). We used the beginning and ending time of 1500 UTC to avoid gaps in the data from the maize field on the morning of 1 June 2003 and the rain event on the evening of 5 June 2003. In addition, we wanted to complete a daily cycle. Since this period only lasted for 4 days, the block ensemble averaging period \(P\) was varied from 10 min to 3 days. The block ensemble averages of this new observation period are shown in Fig. 4. As expected, we found that during this period, the energy balance is closed when averaging times of half a day are used for rye and maize sites. Over grassland and lake, \(\left\langle Q_\mathrm{H} \right\rangle \) decreased at larger \(P\), which corresponds to the large negative \(\tilde{Q}_\mathrm{H}\). \(\left\langle Q_\mathrm{E} \right\rangle \) were approximately constant at all \(P\) at all sites, which is consistent with the absence of large \(\tilde{Q}_\mathrm{E}\).

Fig. 4
figure 4

Block ensemble averages of sensible heat flux and latent heat flux (Eq. 17 with \(c\) as temperature and absolute humidity), and their corresponding residuals, during 1 June 2003, 1500 UTC–5 June 2003, 1500 UTC for selected sites in the LITFASS-2003 experiment: a lake, b forest, c rye, d maize, e grass and f barley

Accepted models state that secondary circulations can only reach down to levels near the earth’s surface under the free convection condition, which occurs when the buoyancy term dominates the shear production term, as \(z/L \le -1\). This situation is also accompanied by low friction velocity (Eigenmann et al. 2009). As we did not observe any free convection during 1 June 2003, 1500 UTC–5 June 2003, 1500 UTC, we believe that these near-surface secondary circulations were caused (either thermally or mechanically) by the surface heterogeneity between different land-use types (Stoy et al. 2013).

3.3 Scale Analysis

We used the wavelet analysis to resolve the scales of motion during 1 June 2003, 1500 UTC–5 June, 2003 1500 UTC with data from the rye, maize and grassland stations. The wavelet analyses of rye and grassland stations are shown in Figs. 5 and 6, respectively. Results for the maize field and the rye field are very similar. From these wavelet cross-scalograms, we found small and large scales of motion. The duration of the small scales is around a few min, which should be captured by the eddy-covariance measurement over a 30-min averaging period. They are present during the daytime at all sites and transport both \(Q_\mathrm{H}\) and \(Q_\mathrm{E}\). The size of the larger scale is approximately one day, and mainly transports \(Q_\mathrm{H}\). It tends to increase \(Q_\mathrm{H}\) in the maize and rye sites, while decreasing \(Q_\mathrm{H}\) over grass. This is consistent with the patterns of \(\tilde{Q}_\mathrm{H}\) and the block-ensemble average fluxes. This scale of motion would not be captured by the eddy-covariance measurement over the 30-min averaging period.

Fig. 5
figure 5

Wavelet cross-scalograms over the rye field during 1 June 2003, 1500 UTC–5 June 2003, 1500 UTC of a sensible heat flux and b latent heat flux. The colour indicates values in W m\(^{-2}\). The black solid line represents the cone of influence

Fig. 6
figure 6

Wavelet cross-scalograms over the grassland during 1 June 2003, 1500 UTC–5 June 2003, 1500 UTC of a sensible heat flux and b latent heat flux. The colour indicates values W m\(^{-2}\). The black solid line represents the cone of influence

Both patterns from the Hovmøller diagram and the wavelet cross-scalograms show the increase or decrease of \(\tilde{Q}_\mathrm{H}\). However, they do not actually show what contributes to these changes. For the turbulent fluxes (\(\overline{w^{\prime }c^{\prime }}\)), which are caused by instantaneous fluctuations, we can carry out a quadrant analysis by dividing instantaneous contributions of \(\overline{w^{\prime }c^{\prime }}\) into four quadrants of \(w^{\prime }\) and \(c^{\prime }\) (Shaw 1985). Our findings suggest that the main contribution to closing the energy balance is made by \(\tilde{Q}_\mathrm{H}\), which results from block-to-block fluctuations (\(\tilde{w}\) and \(\tilde{c}\)). We therefore divided block-to-block contributions of \(\tilde{w}\tilde{c}\) into four quadrants of \(\tilde{w}\) and \(\tilde{c}\). We used \(\tilde{T}\) (temperature) and \(\tilde{a}\) (absolute humidity) as horizontal axes and \(\tilde{w}\) (vertical velocity) as a vertical axis, which gave our four quadrants (\(Q_i, i=1,\ldots ,4\)) as

\({Q_1\!:}\) :

\(\tilde{w}>0\) and \(\tilde{T} >0\) or \(\tilde{a} >0\) warm air rising or moist air rising,

\({Q_2\!:}\) :

\(\tilde{w}>0\) and \(\tilde{T} <0\) or \(\tilde{a} <0\) cold air rising or dry air rising,

\({Q_3\!:}\) :

\(\tilde{w}<0\) and \(\tilde{T} <0\) or \(\tilde{a} <0\) cold air sinking or dry air sinking,

\({Q_4\!:}\) :

\(\tilde{w}<0\) and \(\tilde{T} >0\) or \(\tilde{a} >0\) warm air sinking or moist air sinking.

\(Q_1\) and \(Q_3\) contribute to the positive flux, while \(Q_2\) and \(Q_4\) contribute to the negative flux. We then normalized each axis by its standard deviation and set the hyperbolic hole size to 0.5 (\(H=0.5\)). We neglect the weak contributions inside the hole and only consider any points which satisfy

$$\begin{aligned} \left| \frac{\tilde{w} \tilde{T}}{\sigma _{\tilde{w}} \sigma _{\tilde{T}}} \right| \;\; \mathrm {or} \;\; \left| \frac{\tilde{w} \tilde{a}}{\sigma _{\tilde{w}} \sigma _{\tilde{a}}} \right| >H. \end{aligned}$$
(18)

With the quadrant analysis, we expect to see which types of turbulence actually contribute to the increasing or decreasing of \(\tilde{w}\tilde{c}\). To make it consistent with our Hovmøller diagrams, we used the same observation period \(NP\), which is 20 May 2003, 1200 UTC–18 June 2003, 0000 UTC, and set \(P\) to 30 min (non-overlapped). Therefore, any points on the quadrant analysis diagram represent the normalized \(\tilde{w}\tilde{c}\) from each non-overlapped 30-min period.

The results of the quadrant analysis of rye and grassland stations are shown in Fig. 7. In this figure, we used red dots to highlight all points obtained during 1 June 2003, 1500 UTC–5 June 2003, 1500 UTC. By considering only strong contributions outside a hyperbolic hole (blue line), we found that during this period, \(\tilde{Q}_\mathrm{H}\) (via \(\tilde{w}\tilde{T}\)) has a higher contribution from \(Q_1\) (warm air rising) for the rye sites (Fig. 7a), while there is greater contribution from \(Q_4\) (warm air sinking) for the grassland (Fig. 7b). There was no significant contribution outside the hyperbolic hole for \(\tilde{Q}_\mathrm{E}\) (via \(\tilde{w}\tilde{a}\)) in both rye and grass stations. Over the maize field, the quadrant analysis is similar to that of the rye field, while the lake is similar to the grassland. For both rape and forest (not shown), \(Q_1\) and \(Q_4\) contributed equally to \(\tilde{Q}_\mathrm{H}\), with no significant contribution outside the hole for \(\tilde{Q}_\mathrm{E}\). These results tell us that the increase of \(\left\langle Q_\mathrm{H} \right\rangle \) at larger \(P\) at rye and maize fields was caused by warm air near the surface rising, while the decrease of \(\left\langle Q_\mathrm{H} \right\rangle \) at grassland and lake was caused by warm air aloft sinking. For forest and rape stations, both contributions from \(Q_1\) and \(Q_4\) cancel each other and keep \(\left\langle Q_\mathrm{H} \right\rangle \) approximately constant at all \(P\). The absence of significant contributions outside the hyperbolic hole keeps \(\left\langle Q_\mathrm{E} \right\rangle \) approximately constant at all sites.

Fig. 7
figure 7

Quadrant analysis of \(\tilde{w}\tilde{T}\) (left panels represent the sensible heat flux) and \(\tilde{w}\tilde{a}\) (right panels represent the latent heat flux) of a rye and b grass during 20 May 2003, 1200 UTC–18 June 2003, 0000 UTC. The period from 1 June 2003, 1500 UTC to 5 June 2003, 1500 UTC has been highlighted using red dots. The blue solid lines represent the hyperbolic hole (\(H=0.5\))

4 Conclusions

The modified ogive analysis, which requires steady-state conditions, reveals that extension of the averaging time by a few hours does not significantly improve the energy balance. The time extension has a greater impact over tall vegetation. Therefore, the 30-min averaging time is still, overall, sufficient for the eddy-covariance calculation. Sensible heat flux, latent heat flux and CO\(_2\) flux are more sensitive to the time extension than is friction velocity. Over low vegetation, when the atmosphere becomes less stationary with greater averaging times, these three turbulent fluxes tend to increase. However, an increase in the degree of unsteadiness tends to decrease the CO\(_2\) flux. The increase of sensible heat flux is generally greater than the increase of latent heat flux. Over a longer period, the increases or decreases of sensible and latent heat fluxes do not always behave according to the scalar similarity as expected. And lastly, the sizes of the increases in both sensible and latent heat fluxes are not sufficient to close the energy balance at all sites.

Without assuming steady-state conditions, the block ensemble average can extend the averaging time to several days through the inclusion of the period-to-period fluctuations (\(\tilde{w}\tilde{c}\), \(c\) is temperature or humidity) in the mean vertical flux. However, this does not usually close the energy balance. The Hovmøller diagram, which shows variations of \(\tilde{w}\tilde{c}\) over a long period, can help to identify when secondary circulations exist in the vicinity of the sensor by exhibiting consecutive large \(\tilde{w}\tilde{T}\). From our findings, secondary circulations that exist near the earth’s surface mainly transport sensible heat. This result also supports the poor scalar similarity between the sensible and latent heat fluxes in the low frequency region (Ruppert et al. 2006; Foken et al. 2011).

Since secondary circulations move very slowly and are relatively large in size, a single-tower EC measurement averaging over 30 min is unable to detect them. If the sensor is, coincidentally, at the right time and location when secondary circulations transport near-ground warm air upwards, the block ensemble average at a longer period yields higher sensible heat flux, which improves the energy balance closure. However, when these near-surface secondary circulations transport warm air aloft downwards, the block ensemble average yields a lower sensible heat flux at a longer averaging time. This suggests that near-surface secondary circulations do transport significant amounts of energy, and these are responsible for the energy balance closure problem rather than sensor deficiencies.

To account for low frequency contributions to turbulent fluxes caused by near-surface secondary circulations, we must accept that the scalar similarity between the sensible and latent heat fluxes is no longer valid at all scales. Therefore, the widely used energy balance correction in Twine et al. (2000), EBC-Bo, which assumes the scalar similarity between sensible and latent heat fluxes by preserving the Bowen ratio, would not generally hold.

As we found that near-surface secondary circulations transport more sensible heat, the EBC-Bo correction may attribute less residual to the sensible heat flux than expected. Hence, we propose an alternative energy balance correction for a near-surface EC measurement through the buoyancy flux ratio (EBC-HB), in which convection plays a key role. The buoyancy flux, \(Q_\mathrm{B}\), is defined as

$$\begin{aligned} Q_\mathrm{B} = \rho c_\mathrm{p} \, \overline{w^{\prime }T^{\prime }_\mathrm{v}}, \end{aligned}$$
(19)

where \(\rho \) is the air density, \(c_\mathrm{p}\) is the specific heat capacity of air at constant pressure, and \(T_\mathrm{v}\) is the virtual temperature, which can be replaced by the sonic temperature (\(T_\mathrm{S}\)) with negligible loss of accuracy (Kaimal and Gaynor 1991). This means that sonic anemometers can be used to directly measure \(Q_\mathrm{B}\) with a good accuracy. The virtual temperature is related to the actual temperature (\(T\)) and specific humidity (\(q\)) in the same way as the sonic temperature (Schotanus et al. 1983), which leads to

$$\begin{aligned} Q_\mathrm{B} = \rho c_\mathrm{p} \, \overline{w^{\prime }T^{\prime }_\mathrm{v}}&= \rho c_\mathrm{p} \, \left( \overline{w^{\prime }T^{\prime }} + 0.61 \, \overline{T} \, \overline{w^{\prime }q^{\prime }}\right) \nonumber \\&= Q_\mathrm{H} \,\left( 1+ 0.61 \, \overline{T} \, \frac{c_\mathrm{p}}{\lambda \, Bo} \right) , \end{aligned}$$
(20)

where \(\lambda \) is the heat of evaporation of water and \(Bo\) is the Bowen ratio. We next partition the residual via the buoyancy flux ratio, which contains both sensible and latent heat fluxes. The fraction of the residual attributed to the sensible heat flux depends on the relative contribution of the sensible heat flux to the buoyancy flux. The remainder is then added to the latent heat flux. Therefore the corrected sensible and latent heat fluxes obtained with EBC-HB (\(Q_\mathrm{H}^\mathrm{EBC-HB}\) and \(Q_\mathrm{E}^\mathrm{EBC-HB}\) respectively) are,

$$\begin{aligned} Q_\mathrm{H}^\mathrm{EBC-HB}&= Q_\mathrm{H} + f_\mathrm{HB} Res,\end{aligned}$$
(21)
$$\begin{aligned} Q_\mathrm{E}^\mathrm{EBC-HB}&= Q_\mathrm{E} + \left( 1 - f_\mathrm{HB} \right) Res, \end{aligned}$$
(22)

with

$$\begin{aligned} f_\mathrm{HB} = \frac{Q_\mathrm{H}}{Q_\mathrm{B}} = \left( 1+ 0.61 \, \overline{T} \, \frac{c_\mathrm{p}}{\lambda \, Bo} \right) ^{-1} . \end{aligned}$$
(23)

Since this method does not preserve the Bowen ratio, Eqs. 2123 must be calculated iteratively until the Bowen ratio in Eq. 23 converges. The comparison between EBC-Bo and EBC-HB corrections is shown in Fig. 8. Both approaches are identical at very high Bowen ratios, i.e. all the residual is shifted to the sensible heat flux. For the typical range of the Bowen ratio, however, the EBC-HB correction would attribute more residual to the sensible heat flux than EBC-Bo correction, which is more consistent with our findings.

Fig. 8
figure 8

Fraction of the residual attributed to the sensible heat flux at different Bowen ratios from two different approaches. The Bowen ratio approach (EBC-Bo, black line) assumes the scalar similarity between the sensible and latent heat fluxes by preserving the Bowen ratio (Twine et al. 2000). The buoyancy flux ratio approach (EBC-HB, grey lines) partitions the residual according to the ratio between the sensible heat flux and the buoyancy flux, and is shown at different temperatures from \(-\)30 to 30 \(^{\circ }\)C. Although both approaches are identical at very large Bowen ratios, in most cases EBC-HB would attribute a larger fraction of the residual to the sensible heat flux than would EBC-Bo