Reconstructing long-term wind data at an offshore met-mast location using cyclostationary empirical orthogonal functions

Reconstructing long-term wind data at an offshore met-mast location using cyclostationary empirical orthogonal functions

J. Wind Eng. Ind. Aerodyn. 156 (2016) 146–158 Contents lists available at ScienceDirect Journal of Wind Engineering and Industrial Aerodynamics jour...

3MB Sizes 0 Downloads 18 Views

J. Wind Eng. Ind. Aerodyn. 156 (2016) 146–158

Contents lists available at ScienceDirect

Journal of Wind Engineering and Industrial Aerodynamics journal homepage: www.elsevier.com/locate/jweia

Reconstructing long-term wind data at an offshore met-mast location using cyclostationary empirical orthogonal functions Ji-Young Kim a,n, Kwang-Yul Kim b a b

Korea Electric Power Research Institute, Daejeon, Republic of Korea School of Earth and Environmental Science, Seoul National University, Seoul, Republic of Korea

art ic l e i nf o

a b s t r a c t

Article history: Received 22 June 2015 Received in revised form 28 July 2016 Accepted 28 July 2016

For the development of a wind power plant, plant design and its project feasibility analysis are implemented with wind data observed by a met-mast at a target location. Since observation period/time of a met-mast is normally about one year before the plant design, correlation with long-term data existing in the neighborhood of the target location is used for hindcasting past met-mast data to reduce uncertainty in the feasibility analysis, which is called the Measure-Correlate-Predict (MCP) method. In this study, cyclostationary empirical orthogonal function (CSEOF) analysis as a new approach is employed to extend the 1.5-year offshore met-mast HeMOSU-1 data into 34-year long-term data based on the MERRA reanalysis dataset. Both the one- and two-dimensional CSEOF results are compared with that of the widely-used MCP method. The CSEOF method shows a similar level of accuracy to the existing method for mean wind speed, while the former exhibits a slightly better accuracy for the frequency distribution of wind speed and the capacity factor as an index related to the estimation of wind power generation. In additional hypothetical test based on reanalysis datasets, the 1D-CSEOF method shows, in general, a better performance than the conventional MCP method in terms of the accuracy of statistical properties of wind. & 2016 Elsevier Ltd. All rights reserved.

Keywords: Offshore met-mast Wind data Reconstructing long-term data CSEOF

1. Introduction Wind power is an energy source that is in the spotlight, spurring the development of new and renewable energy all around the world. Korea is also experiencing a shortage of electricity, but it is not easy to construct thermal or nuclear power plants anymore because of its small land area. Considering the geographic characteristics of the Korean Peninsula surrounded by the seas on the three sides, there is a rising interest in the development of the offshore areas with relatively rich resources. The Korean government has recently established a development plan for a 2.5 GW offshore wind power plant in the West (Yellow) Sea as an effort to leap as a leading country in offshore wind power. The project and research now has been proceeding with the phase 1 of the test site construction on a scale of 100 MW. An optimal location that is economically feasible and available for a large-scale development has been chosen considering wind resource distribution according to a numerical wind resource map, topographic characteristics such as sea floor depth, and distance from the coastline (Kim et al., 2013). Korea's first offshore met-mast, HeMOSU-1, has been set up n

Corresponding author. E-mail address: [email protected] (J.-Y. Kim).

http://dx.doi.org/10.1016/j.jweia.2016.07.017 0167-6105/& 2016 Elsevier Ltd. All rights reserved.

and operated near Buan-gun, Jeollabuk-do, and Yeonggwang-gun, Jeollanam-do, where the selected target site was placed in 2010 as shown in Fig. 1. Various meteorological data including wind direction and speed observed at HeMOSU-1 are used in assessing the project feasibility, wind farm layout design, and turbine structure design (Oh et al., 2012). When evaluating the feasibility of wind power project or estimating design load for structure, it is important to characterize the long-term trends and year-to-year variability of observational data. Since met-mast data generally has a short observation period, around one to two years, past data are reconstructed by extrapolation through regression, known as the measure-correlatepredict (MCP) technique, using neighborhood long-term observation data. Several MCP algorithms have been suggested based on the linear regression method proposed by Derrick (1992), which estimates the relationship between the wind speeds at target and reference sites. Rogers et al. (2005) compared four MCP algorithms in terms of the accuracy of the estimated wind speed and the effect of the length of employed data on the accuracy of estimation. Thøgersen et al. (2007) implemented general-purpose software for four of the MCP methods including a linear regression model; this model is universally accepted so far. Various methods were developed to overcome limitations of the linear regression method, but they do not consistently show improved results in the context

J.-Y. Kim, K.-Y. Kim / J. Wind Eng. Ind. Aerodyn. 156 (2016) 146–158

147

Fig. 1. Locations of the measurements (HeMOSU-1) and one-point reference (MERRA).

of the target data characteristics (García-Rojo, 2004; Sreevalsan et al., 2007; Clive, 2008; Carta and Velázquez, 2011; SaavedraMoreno et al., 2013). Since most existing methods consider correlation of the wind speed for each bin of wind directions, it is difficult to understand how the principal physical components, such as the diurnal cycle of wind, in the data are calibrated by the regression method. In the present study, a new approach is developed by using cyclostationary empirical orthogonal functions (CSEOFs) in order to carry out regression analysis with respect to individual processes instead of individual wind direction bins. Kim et al. (1996); Kim and North (1997); Kim and Wu (1999), and Kim and Chung (2001) introduced the concept of CSEOF analysis to identify and extract individual spatio-temporal modes from a given dataset. By carrying out CSEOF analysis, data are decomposed into cyclostationary loading vectors (CSLVs), each of which is modulated by corresponding amplitude or principal component (PC) time series following the naming convention of Kim et al. (1996). Each CSLV reflects a distinct process in the data and is rendered as temporally-varying spatial patterns together with their amplitude fluctuations on longer time scales. This provides a unique means of matching common processes between the target and reference data in terms of their evolution patterns and amplitudes. Hamlington et al. (2011) used the CSEOF technique to extend significantly the period of sea level reconstructions based on tide gauge data. In a similar manner, the CSEOF technique is applied in this study for the purpose of extending the short-term measurements at HeMOSU-1 in conjunction with long-term reference data. By carrying out regression analysis in CSEOF space and comparing the regressed CSLVs against those derived from the target data, the accuracy of the reference data can be addressed more clearly in the context of major physical modes in the data. Long-term hindcasting or forecasting of wind data is very difficult due to its irregular variation and various attempts are needed to improve the accuracy of estimation. As mentioned above, a new approach is introduced in the present study as an alternative to the existing MCP methods and to improve the estimation accuracy. This new concept will be addressed in detail in the Method section.

2. Data HeMOSU-1, which is an offshore met-mast to promote the west coast offshore wind power project, is located on the West Sea about 30 km offshore as shown in Fig. 1. It performs various meteorological observations including wind direction and speed at

several altitudes as shown in Fig. 2. HeMOSU-1 has accumulated approximately 2.5 years of observational data starting from February 2011. Since there are few offshore meteorological observatories operating for an appreciable length of time, it is unavoidable to use reanalysis reference data to estimate the longterm wind characteristics at the test site. In this study, Modern Era-Retrospective analysis for Research and Application (MERRA) data are used as a reference dataset, which is of relatively high spatial and temporal resolutions among the long-term reanalysis datasets (GMAO, 2014). MERRA is a reprocessed dataset, into which various satellite-era datasets were blended by using the GEOS-5 atmospheric data assimilation system at Global Modeling and Assimilation Office (GMAO) of NASA. In this study, reference data are used in two different manners – only single-point data near HeMOSU-1 and 2-dimensional spatial array to consider the spatial distribution around the Korean Peninsula. A MERRA grid point at approximately 12.2 km west outbound from HeMOSU-1 is used for a one-dimensional (1D) analysis (see Fig. 1b and Table 1). For a 2-dimensional (2D) analysis, the domain includes offshore area around the Korean Peninsula (32°–39°N  120°–135.33°E; see Table 1). Data at 50 m above the sea level, which is closest to the hub height of the wind turbine, are used. The details of the MERRA reference data are shown in Table 1. Meanwhile, the observational data at 46.3 m height from the sea level are taken from HeMOSU-1; this level is closest to the elevation of the MERRA data used in the present study. Since there are missing data due to data logger defect for about one month from June 5 to July 14, 2011, only the data after the missing period are used for training, leaving the earlier data before the missing period for verification. We also used the 1.5°  1.5° European Center for Medium-Range Weather Forecasts Reanalysis (ERA) Interim product (Dee et al., 2011) in a hypothetical test.

3. Method 3.1. CSEOF analysis It is useful to decompose complex physical system into simpler basis functions in order to understand the basic physical processes in the physical system. The CSEOF technique is introduced to separate individual physical processes from the datasets. This technique has been used in many previous studies and the details can be found in Kim et al. (1996, 2015); Kim and North (1997); Kim and Wu (1999), and Kim and Chung (2001). In CSEOF analysis, space-time data, T ( r , t ), are expressed as a unique linear superposition

148

J.-Y. Kim, K.-Y. Kim / J. Wind Eng. Ind. Aerodyn. 156 (2016) 146–158

Fig. 2. HeMOSU-1 offshore met-mast and the elevations of various observational devices.

3.2. Regression analysis in CSEOF space

Table 1 Characteristics of the target and reference data. Data

Target data

Reference (Predictor) data

Name

HeMOSU-1 measurement data 16 Feb. 2011–31 Dec. 2012 Hourly mean of 10minute data 46.3 m 35.4653°N, 126.1293°E

MERRA reanalysis data

Period Time interval Height Location Spatial interval Missing period Training period Verification period

2 Jan. 1979 – 31 Dec. 2012 1h

50 m 35.5°N, 126.0°E for 1d, 32.0– 39.0°N, 120.0–135.33°E for 2d – 1/2° in latitude, 2/3° in longitude 5 Jun. 2011–14 Jul. 2011 (only target data) 15 Jul. 2011–31 Dec. 2012 16 Feb. 2011–4 Jun. 2011

T ( r , t )= ∑ LVn ( r , t ) PCn ( t ), n

T ( r , t )= ∑ Bn ( r , t ) Tn ( t ), n

(1)

where LVn ( r , t ) and PCn (t ) are cyclostationary loading vectors (CSLVs) and principal component (PC) time series, respectively. Unlike EOF analysis, LVs are time dependent and periodic in CSEOF analysis, that is,

LVn ( r , t )=LVn (r , t +d),

Similar to the conventional MCP method, regression analysis is conducted between two sets of CSEOF PC time series to generate physically consistent evolution patterns between two variables or datasets. This procedure is called the regression analysis in CSEOF space. A crucial difference from the existing methods is that regression analysis is conducted on the PC time series of CSEOF modes instead of the raw data. In other words, amplitude variation of a physical process in one variable is matched with that in another variable. This is a sensible approach since physical processes are rendered differently in two different variables (datasets), and henceforth statistical relationship should be identified for each mode (physical process) instead of regression relationship between two raw datasets. Let T ( r , t ) be a target variable and P ( r , t ) be a predictor variable. CSEOF analysis yields:

(2)

where d is called the nested period and represents the periodicity of the space-time covariance function. Each CSLV expresses a temporally evolving physical process, and corresponding PC time series denotes the amplitude modulation of the physical process. In this study, the nested period is set to 24 h in order to understand the diurnal structures of variation of wind. The corresponding PC time series, then, show the amplitude fluctuations on time scales longer than a day.

(3)

and

P ( r , t )= ∑ Cn ( r , t ) Pn ( t ), n

(4)

where Bn (r , t ) and Cn (r , t ) are respectively the CSLVs of the target and predictor variables, and Tn (t ) and Pn (t ) are respectively the PC time series of the target and predictor variables, n is the mode number, and r and t denote space and time. In general, there is no one-to-one correspondence between the two sets of PC time series. That is, Tn (t ) and Pn (t ) are generally not identical for each n. Thus, regression analysis should be conducted in CSEOF space to derive physically consistent evolutions of the predictor variable with those of the target variable. The procedure consists of two steps:

Tn ( t )= ∑

M m=1

αm(n) Pm ( t )+ε ( n) (t ),

(5)

J.-Y. Kim, K.-Y. Kim / J. Wind Eng. Ind. Aerodyn. 156 (2016) 146–158

149

and

Dn ( r , t )= ∑

M m=1

αm(n) Cm ( r , t ),

(6)

where αm(n) are regression coefficients and ε( n) (t ) is regression error time series. As a result of (5) and (6), one can write the predictor variable as

P ( r, t ) ≑

∑ Dn (r , t ) Tn ( t ). n

(7)

The right-hand side is not exactly equal to the left-hand side because the regression error in (5) is not exactly zero. In fact, this deviation becomes greater as regression error variance, which is measured by 1 − R2 value, increases. The R2 value is defined by

R2=1 −

∑t ε (n)(t ) ε (n)(t ) ∑t Tn (t ) Tn (t )

,

(8)

which measures the magnitude of regression error variance with respect to that of the total variance of the target time series. Thus, regression error variance approaches zero as R approaches 1. In CSEOF analysis, R2 values are, in general, close to unity, which implies that physical relationship between two variables is well established. Forecasting (hindcasting) wind field at a met-mast location proceeds in the following manner. Once the regression relationship in (5) is determined for the training period, met-mast data can be extended by using

^ ^ T ( r , t )= ∑ Bn ( r , t ) Tn (t ), n

(9)

where Bn (r , t ) is the CSLV derived from the target variable (metmast), and T^n (t ) is the target PC time series estimated from the predictor PC time series of the reference data (MERRA) by using (5). While the loading vectors, representing specific physical evolutions, derived from the target and the reference data as in (3) and (7) may not necessarily be identical, long-term modulation may be close to each other if the reference dataset is reasonably accurate; this, of course, is an important caveat. Another important caveat is that physical processes, Bn (r , t ), remain identical in the forecasting (hindcasting) period. Thus, the amplitude time series can be derived from the reference data for the prediction interval, which in conjunction with the target loading vectors are used to generate “synthetic” wind field for the prediction interval. We will test the validity and accuracy of this procedure in the next section.

4. Results 4.1. Comparison between the target and reference data Two datasets, HeMOSU-1 measurements and MERRA reanalysis data, are compared to evaluate their proximity before regression analysis. A strong correlation is anticipated since the observation site is located offshore with little influence from the Korean Peninsula or China, and the MERRA grid point is relatively close to the observation site. Fig. 3 shows the time series of the wind components at the reference point against the target data (HeMOSU-1). As shown, correlations for both the wind components are reasonably high. The reference data, however, tend to show lower amplitudes than the target data, which is a general characteristic of model datasets. Fig. 4 shows the spatial patterns of correlations, which show the spatial correlation scales of the zonal and meridional winds around the Korean Peninsula with respect to the target time series. As shown in Fig. 4, relatively high correlation values are found over a domain of considerable size; correlation length scale increases slightly over the marginal seas

Fig. 3. Comparison of (a) zonal wind and (b) meridional wind between the HeMOSU (target) and the MERRA-1D (reference) data. In each panel, R1 denotes correlation for the verification period and R2 correlation for the training period.

due most likely to weaker influence by topography. The zonal component is weaker than the meridional component of wind (Fig. 3), and the former is less correlated with the target data than the latter (Fig. 4). Fig. 4 shows that the reference data in the region of high correlations can be used to improve long-term hindcasting based on one-point reference data. Since regression analysis in CSEOF space is conducted for individual modes, CSLV and PC time series of each mode are compared between the target data and reference data. Because of the relatively short length of the target data, it is not plausible to compare the long-term trend in the datasets. Instead, short-term (daily) physical evolutions including the diurnal cycle and longerterm evolutions of their amplitudes have been extracted by using the nested period of one day. The first two modes together explain more than 70% of the total variability both in the target and reference data. As shown in Fig. 5, the first CSEOF mode represents a predominant meridional wind throughout the day whereas the second CSEOF mode depicts a predominant zonal wind throughout the day both in the target and reference datasets. Diurnal variation is relatively small in the two modes. The PC time series measuring the amplitudes of the LVs are also similar between the two datasets for the first two modes (see Table 2). Fig. 6 shows the PC time series of the first CSEOF mode. As can be seen, the first PC time series derived from the 1D MERRA data is similar in evolution and magnitude to that of the target data. For the 2D MERRA data, on the other hand, the amplitude of the PC time series often differs appreciably from the target data while the evolution pattern is similar. This may reflect that the strength of wind often varies irregularly in the analysis domain. Considering the complex terrain of the Korean Peninsula, such non-uniform variation of the wind speed should be expected. The PC time series of the third CSEOF mode derived from the 2D MERRA data exhibits negligible correlation with that of the target data; this indicates that there is, in general, no one-to-one correspondence between two sets of CSEOF modes derived from two

150

J.-Y. Kim, K.-Y. Kim / J. Wind Eng. Ind. Aerodyn. 156 (2016) 146–158

Fig. 4. Correlation of (a) zonal and (b) meridional wind components between the HeMOSU (target) and the MERRA-2D data (reference).

similar but different datasets. Regression analysis in CSEOF space is needed to establish one-to-one correspondence between the two sets of CSEOF modes. 4.2. Regression analysis Regression analysis in CSEOF space was conducted as delineated in (5) and (6). Regression coefficients were computed for each PC time series over the training period, in which the target and the reference data coexist. It needs to be determined how many target PC time series should be used for regression in order to reconstruct optimal wind data. Correlation coefficients between the target and the reconstructed data for each wind component were calculated as a function of the number of modes used for regression (Fig. 7 and Table 3). In reconstructing the wind field, the regressed PC time series are used in conjunction with the target CSLVs as in (9). In other words, regressed PC time series are constructed based on (5) from the reference data and are multiplied with the target CSLVs in order to generate data over the validation or prediction period. As can be seen in Table 3, increase in correlation for each wind component is less than 0.01 when additional mode is added beyond the mode number 5; results are similar for both the 1D and 2D cases. The first 5 modes of the target data together explain more than 90% of the total variability. Consequently, the first 5 modes are used to reconstruct the regressed data for both the 1D and 2D cases. Specifically, correlations of u and v increased respectively by 5.6% and 4.3% for MERRA-1D analysis, and 27.6% and 14.0% for MERRA-2D analysis based on the 5-mode reconstructions (Table 3).

As should be expected, correlation increases significantly after the regression as a comparison between Figs. 6 and 8 shows (see also Tables 2 and 4). The correlation for the verification period is not less than that of the training period for the MERRA-1D analysis. This reflects that the regression relationship established over the training period holds fairly well over the verification period. Meanwhile, correlation over the verification period is somewhat lower than that over the training period for the third CSEOF mode in the MERRA-2D analysis; on the other hand, it is a significant improvement over the value before regression. It is worth mentioning that the PC time series of the MERRA-2D data can be fitted to the PC time series of HeMOSU-1 data nearly as well as those of the MERRA-1D data via regression in CSEOF space. Thus, in theory, it is possible to generate reasonable spatial patterns based on measurements at a single station, or use neighboring measurements to improve reconstruction at a target station. The results of regression in CSEOF space can easily be understood by comparing Figs. 9 and 10, and Figs. 11 and 12 (see also Table 3). The regression relationship is fairly reasonable and the accuracy of the 5-mode reconstructions is generally improved both for the MERRA-1D and MERRA-2D experiments. Figs. 10 and 12 clearly show that the regression in CSEOF space improved the 5-mode reconstructions both in terms of the correlation and the magnitude of the wind components in both the verification and training periods. When using the MERRA-1D reference data, correlation of u and v increased respectively by 4.5% and 3.5%, and 1.6% and 1.0% for the MERRA-2D data (Table 5). Although the increase after the regression is relatively low, there is not much room for a significant improvement; correlation of the reference data is already rather high before regression. It should be noted that correlation of v is over 90% after the regression for the MERRA-1D data. When using the MERRA-2D data, correlation is somewhat lower than the MERRA-1D data, but correlation over the analysis domain increased after regression as a comparison between Figs. 4 and 13 shows. Consequently, the area with a close proximity to the target data can be expanded; correlation for u and v exceeds 80% and 90% respectively along the entire southwestern coast of South Korea. 4.3. Comparison with the MCP method It has been demonstrated that a wind field having a higher correlation than the raw reference data could be reconstructed based on regression analysis in CSEOF space. In this section, the performance of the CSEOF-based method is compared with that of the widely-used MCP method. In the MCP method, linear regression is conducted between two sets of wind speed data according to the wind directional sector. Fig. 14 shows the monthly mean values of the reconstructed wind speed after regression against those of the HeMOSU data. As shown in Fig. 14a, all three results are fairly consistent with the target data and reproduce the windspeed variability in a reasonable manner. A close examination shows that the period of the best performance differs from one method to another. There were some missing data in June and July 2011; as a result, wind speed is underestimated in this period. During a strong typhoon, Bolaven, reaching a maximum wind speed of 50 m s  1 in August 2012, wind speed again is underestimated. Long-term variability at the target station is estimated from the MERRA-1D data as shown in Fig. 14b. As shown in the figure, it is possible to improve reliability and assess uncertainty of the expected energy production based on a longer estimation of wind data in feasibility assessment of wind power development. Various statistics have been compared in order to evaluate the three different methods quantitatively (Table 6). Mean Bias in the MERRA raw data (reference) has been reduced in all three methods. The regressed wind also exhibits slightly higher correlations

J.-Y. Kim, K.-Y. Kim / J. Wind Eng. Ind. Aerodyn. 156 (2016) 146–158

151

Fig. 5. Loading vectors for zonal (left) and meridional (right) components of wind in the target and the reference data for the first two CSEOF modes.

Table 2 Correlation of the PC time series between the target and reference data for the first five CSEOF modes: R1 denotes the verification period and R2 the training period. Mode

1 2 3 4 5

With MERRA-1D

With MERRA-2D

R1

R2

R1

R2

0.9397 0.8086 0.6518 0.6647 0.5788

0.9063 0.8905 0.7633 0.7195 0.5972

0.8516 0.5936 0.0658 0.1898 0.1014

0.8119 0.7230 0.0766 0.1384 0.0443

and lower relative RMSE except for the 2D CSEOF method. For the MERRA-2D data, CSEOF modes depict variability over a wide area and may not solely reflect variability at the met-mast location. As a result, both correlation and RMSE deteriorate slightly. Skewness measures the asymmetry of the probability distribution; positive skewness indicates that the distribution has longer or fatter tail on the right side and vice versa. All the data including the MERRA raw data (reference) exhibit the same sign of skewness. Kurtosis is a measure of “tailedness” of a probability distribution; distributions with kurtosis less than 3 are associated with fewer and less extremes and vice versa. The CSEOF-based methods produce the same sign of “excess” kurtosis (kurtosis minus 3) for all three variables (zonal wind, meridional wind, and wind speed). Fig. 15 shows a comparison of the HeMOSU data and the regressed data from the MCP and CSEOF-1D methods. The figure shows that both the methods reproduce the histograms of wind reasonably. The

Fig. 6. The PC time series of the first CSEOF mode for the target and the (a) MERRA1D, and (b) MERRA-2D reference data.

152

J.-Y. Kim, K.-Y. Kim / J. Wind Eng. Ind. Aerodyn. 156 (2016) 146–158

Fig. 7. Correlation coefficients between the target and the regressed data as a function of the number of modes used for reconstruction for each wind component: (a) MERRA-1D, and (b) MERRA-2D data.

Table 3 Correlation between the target and the reference data for each wind component before (columns 2-5) and after (columns 6–9) the regression as a function of the number of modes used for reconstruction (column 1).

Table 4 Correlation between the target and the regressed PC time series for the first five modes: R1 denotes the verification period and R2 the training period. Mode

Mode Target vs. reconstructed renumber ference data (before regression) MERRA-1D

1 2 3 4 5 6 7

MERRA-2D

Target vs. reconstructed regressed data (after regression) MERRA-1D

MERRA-2D

u

v

u

v

u

v

u

v

0.5256 0.7111 0.7589 0.7745 0.7798 0.7871 0.7887

0.7614 0.8284 0.8451 0.8616 0.8704 0.8739 0.8757

0.4992 0.6052 0.6091 0.6253 0.6274 0.6265 0.6437

0.6731 0.7380 0.7390 0.7433 0.7775 0.8104 0.8138

0.3373 0.7520 0.7948 0.8110 0.8235 0.8331 0.8355

0.8507 0.8645 0.8853 0.9029 0.9079 0.9103 0.9114

0.3393 0.7375 0.7769 0.7934 0.8007 0.8053 0.8067

0.8425 0.8558 0.8706 0.8835 0.8862 0.8871 0.8876

1 2 3 4 5

MERRA-1D

MERRA-2D

R1

R2

R1

R2

0.9717 0.8926 0.7011 0.7333 0.6723

0.9548 0.9143 0.7886 0.7439 0.6712

0.9545 0.8578 0.5197 0.6437 0.3708

0.9468 0.8915 0.6807 0.6568 0.4063

Fig. 9. Comparison between the HeMOSU and the MERRA-1D data based on the 5-mode reconstructions before regression: (a) zonal wind, and (b) meridional wind.

Fig. 8. A comparison of the target and the regressed PC time series for the first CSEOF mode: (a) MERRA-1D, and (b) MERRA-2D data.

histogram of wind speed is skewed to the right and exhibits a long tail. The histograms of zonal and meridional components of wind are nearly Gaussian with small excess kurtosis. The directional distribution of winds from the regressed data also looks reasonably similar to that of the HeMOSU (target) data. Winds are predominantly in the

J.-Y. Kim, K.-Y. Kim / J. Wind Eng. Ind. Aerodyn. 156 (2016) 146–158

Fig. 10. Comparison between the HeMOSU and the MERRA-1d data based on the 5-mode reconstructions after regression: (a) zonal and (b) meridional wind components.

153

Fig. 12. Comparison between the HeMOSU and the MERRA-2d data based on the 5-mode reconstructions after regression: (a) zonal and (b) meridional wind components.

Table 5 Correlation between the target data and the 5-mode reconstructions of the regressed data for MERRA-1D (columns 4 and 5) and MERRA-2D (columns 6-7) experiments in comparison with the correlation between the target and the MERRA1D reference data (columns 2 and 3) before regression. Variable Before regression

u v

After regression, MERRA-1d

After regression, MERRA-2d

R1

R2

R1

R2

R1

R2

0.7289 0.8987

0.7882 0.8775

0.8375 0.9227

0.8235 0.9079

0.8064 0.8912

0.8007 0.8862

estimating the amount of generation, the frequency distribution of wind speed and the resulting power generation are compared for individual estimates. A wind speed frequency distribution model most frequently used in estimating wind power generation is the Weibull distribution (IEC, 2005a):

f ( u)=

⎡ ⎛ u ⎞k ⎤ k ⎛⎜ u ⎞⎟(k −1) exp ⎢ −⎜ ⎟ ⎥, c⎝c⎠ ⎣ ⎝c⎠ ⎦

(10)

where u is wind speed, c is the scale factor, k is the shape factor. Amount of power generation is calculated by the Annual Energy Production (AEP) (IEC, 2005b): Fig. 11. Comparison between the HeMOSU and the MERRA-2d data based on the 5-mode reconstructions before regression: (a) zonal wind, and (b) meridional wind.

east-south-east (ESE) direction with slightly less frequent easterlies (W direction). The wind speed and its standard deviation as a function of wind direction also look fairly reasonable with strong winds and their variability in the W direction. Actual wind power generation is assessed for each method. Since the frequency distribution of wind speed is used when

AEP = 24 × 365 ×

u

P (u) f (u), ∑ ucutout =0

(11)

where P (u) is turbine output (kW) when wind speed is u, and ucutout is the maximum wind speed for turbine operation. The capacity factor, then, is the ratio of the calculated AEP at the target site according to the wind distribution given by (10) to the theoretical maximum AEP; it is considered a crucial indicator for feasibility assessment of wind power plants. Comparison of the mean wind speed, parameters for the Weibull distribution, and the

154

J.-Y. Kim, K.-Y. Kim / J. Wind Eng. Ind. Aerodyn. 156 (2016) 146–158

Table 6 Mean bias, correlation, relative RMSE, skewness, and kurtosis of the hourly zonal wind, meridional wind, and wind speed for the MERRA raw data and the 5-mode reconstructions of regressed data derived from the three different methods in comparison with the HeMOSU (target) data. This comparison is for the verification period. Data

Fig. 13. Correlation of (a) zonal and (b) meridional wind components between the HeMOSU and the 5-mode reconstruction based on the regressed MERRA-2d data.

capacity factor are shown in Table 7 for each method against those of the met-mast measurements. The capacity factor is calculated by applying the power curve of a 3 MW turbine of company D. As

Mean Bias

Corr.

Rel. RMSE

Skewness

Kurtosis

0.692  0.421 0.072

0.494  0.159  0.049

HeMOSU

u v Speed

MERRA raw

u v Speed

0.459  0.681 0.222

0.729 0.899 0.812

0.441 0.434 0.591

0.390  0.439 0.210

 0.289  0.273 0.021

MCP

u v Speed

0.145 0.206 0.102

0.800 0.925 0.832

0.389 0.357 0.560

0.529  0.392 0.061

0.079  0.434 0.138

1D-CSEOF

u v Speed

0.387 0.111 0.012

0.837 0.923 0.815

0.395 0.317 0.592

0.615  0.432 0.064

0.117  0.324  0.267

2D-CSEOF

u v Speed

0.405 0.103 0.175

0.806 0.891 0.751

0.474 0.355 0.702

0.663  0.488 0.020

0.611  0.144  0.277

can be seen in Fig. 16, the frequency distribution of the estimated wind speed is reasonably similar to that of the HeMOSU data. As can be seen in Table 7, the CSEOF reconstruction based on the MERRA-1D data is closest to the target data. The accuracy of capacity factor derived from the existing MCP method is 94.8% whereas the 1D CSEOF method yields 96.5%; there is a slight improvement in the accuracy of the reconstructed wind speed. Overall accuracy is lowest for the wind field generated by the CSEOF method based on the MERRA-2D data. Nonetheless, the accuracy of estimated capacity factor is 93.5%, indicating that the wind field based on the CSEOF analysis of the MERRA-2D data is still useful for estimating wind power generation.

Fig. 14. Comparison of monthly mean wind speed between the target (HeMOSU) and the regressed data for (a) the training and verification periods and (b) the prediction period.

J.-Y. Kim, K.-Y. Kim / J. Wind Eng. Ind. Aerodyn. 156 (2016) 146–158

155

Fig. 15. (upper panel) The histogram of wind speed (a), zonal wind (b), and meridional wind (c). (lower panel) The frequency of wind (d; maximum scale: 10%), mean wind speed (e; maximum scale: 10 m s  1), standard deviation of wind speed (f; maximum scale: 5 m s  1) as a function of wind direction: HeMOSU data (black), reconstructed data based on the CSEOF method (red), and regressed data based on the MCP method (blue). The bin size is 1 in the upper panel, and is 10 degrees in the lower panel. This analysis is based on the data over the verification period.

Table 7 Mean, Weibull factors and capacity factor for each method. Variables

HeMOSU

CSEOF-1D

CSEOF-2D

Linear MCP

Mean wind speed

Value Ratio

6.334

6.283 0.992

6.179 0.976

6.253 0.987

Weibull c

Value Ratio

7.124

7.083 0.994

6.961 0.977

7.052 0.990

Weibull k

Value Ratio

1.807

1.920 1.063

1.884 1.043

1.957 1.083

C.F. [%]

Value Ratio

24.64

23.78 0.965

23.04 0.935

23.37 0.948

4.4. Additional test in different settings As shown in the test results derived from the offshore met-mast and MERRA data, the performance of the new CSEOF method applied to the MERRA-1D data is as good as the conventional MCP method. In order to confirm that the performance of the 1D CSEOF method is not merely coincidental, additional hypothetical test was conducted. Since we do not have another set of tower measurements, MERRA data near the HeMOSU location is used as the target variable and ERA-Interim data (Dee et al., 2011) at a remote location is used as the reference variable (Fig. 17 and Table 8). This is a rather stringent test, since the two locations, separated by  400 km, exhibit fairly distinct wind patterns. MERRA data at station 1 are generated for the period of 2000–2015 using the ERA-Interim data at station 2 as the predictor variable using both the CSEOF and conventional MCP methods. Data for a 2-year period (2014–2015) are used for training. Fig. 18 shows the zonal and meridional components of wind reproduced by the two methods in comparison with the target and

Fig. 16. Comparison of the Weibull distribution for the regressed wind speeds against that of the HeMOSU data.

reference data for a two-month period. Correlations of u and v increased respectively by 12.0% and 10.6% for the results by CSEOF method, whereas correlation of u decreased by 3.6% and v increased by 5.9% for the results by MCP method (Table 9). Relative RMSE also decreased slightly for both the modes. Table 10 shows a comparison between the reconstructed wind speed against the actual wind speed in terms of the mean and standard deviation. The statistics comparing the sample means is defined by the twosample t statistic:

156

J.-Y. Kim, K.-Y. Kim / J. Wind Eng. Ind. Aerodyn. 156 (2016) 146–158

Fig. 17. Locations of the target (MERRA_st.1) and reference (ERA-interim_st.2) data for the additional test.

Table 8 Characteristics of the target and reference data for additional test.

Fig. 18. Time series of the regressed data derived from 2 different methods, the target and reference data: (a) zonal wind, (b) meridional wind.

Data

Target data

Reference (Predictor) data

Name Period Time interval Height Location

MERRA_st.1 2000–2015 3h 10 m 36.0°N, 126.0°E

ERA-interim_st.2 2000–2015 3h 10 m 37.5°N, 129.0°E

Training period Verification period

2014–2015 (2 years) 2000–2013 (14 years)

Table 10 The 90%, 95%, and 99% confidence intervals for x1̅ −x2̅ and S12/S22 for the wind speed data, and the actual values derived from the CSEOF method and the MCP method.

Table 9 Correlation and relative RMSE between the target data (MERRA St. 1) and the regressed data derived from two different methods in comparison with the correlation and relative RMSE between the target and the reference data (ERA-Interim St. 2) before regression. This comparison is for the 14-year verification period (2000–2013). Variable Correlation Before Regression

0.612 0.642

u v

t~

Relative RMSE After Regression MCP

CSEOF

0.590 0.678

0.686 0.710

Before Regression

0.963 0.762

After Regression MCP

CSEOF

0.885 0.742

0.800 0.743

( x1̅ −x2̅ )−( μ1−μ2 ) , S12 n

+

S22 m

(12)

where x1̅ and x2̅ are sample means, S12 and S22 are sample variances, μ1 and μ2 are true means, and n and m are sample sizes. The degree of freedom is k, which is either the smaller of n−1 and m −1 or the calculated degree of freedom. The test statistic

F=

S12/σ12 , S22/σ22

(13)

has an F distribution with n−1 and m −1 degrees of freedom. Here

99%

95%

90%

 0.047

 0.033

 0.026  0.011

0.94

0.96

0.97

90%

95%

99%

x1̅ −x2̅ CSEOF MCP

0.026

0.033

0.047

S12/S22

1.03

1.04

1.06

CSEOF MCP

0.269 1.06 2.06

σ12 and σ22 are true variances of two samples. The degree of freedom for both distributions is assumed to be 5845 (¼40,912/7), since the autocorrelation time scale is close to 7 points and total data points over the valid period are 40,912. The mean of wind speed for the validation period is respectively 5.369, 5.380 and 5.100 for the MERRA data, the reconstructed data based on the CSEOF method, and the regressed data based on the MCP method. The variance of the wind speed is given by 8.476, 7.973, and 4.114, respectively. Then, the test results are summarized as in Table 10. As can be seen in the table, the wind speed data derived from the CSEOF method agree better with the MERRA data. Fig. 19 shows the comparison of the MERRA station 1 data (target) and the regressed data from the ERA-Interim station 2 data (reference) based on the MCP and CSEOF-1D methods. The histogram of the MERRA wind speed shows that the MCP method overestimates the occurrence of average wind speed. The histograms of the zonal and meridional components of wind show that the MCP method significantly underestimates the very weak wind events and overestimates the slightly stronger wind events. This is the reason for a significant overestimation in the wind speed range of 3–7 m s  1. The directional distributions of winds from the regressed data look different from that of the MERRA data. The

J.-Y. Kim, K.-Y. Kim / J. Wind Eng. Ind. Aerodyn. 156 (2016) 146–158

157

Fig. 19. (upper panel) The histogram of wind speed (a), zonal wind (b), and meridional wind (c). (lower panel) The frequency of wind (d; maximum scale 10%), mean wind speed (e; maximum scale: 7 m s  1), standard deviation of wind speed (f; maximum scale: 3 m s  1) as a function of wind direction: MERRA (target) data (black), reconstructed data based on the CSEOF method (red), and regressed data based on the MCP method (blue) using ERA interim as the reference data. The bin size is 1 in the upper panel, and is 10 degrees in the lower panel. This analysis is based on the data over the verification period.

CSEOF method produces a higher occurrence of wind in the SE direction and to a lesser extent in the SSW direction instead of prevalent wind in the SSE direction. The MCP method produces a large fraction of wind in the NE direction and to a lesser extent in the SW direction, which is not consistent with the MERRA data (target). The angular distribution of wind speed seems reasonable although the direction of maximum wind speed differs among the three datasets. The regressed data generally underestimates the standard deviation of wind speed. This underestimation is more significant for the MCP method.

5. Summary and conclusions In order to extend the wind measurements from an offshore met-mast, HeMOSU-1, a new method was developed to construct wind at the target position based on the MERRA data. The MERRA data, which is a long-term reanalysis dataset, were used as the reference data in the present study. Based on the regression analysis in CSEOF space, wind field was constructed at the target location using the reference dataset. The accuracy of the constructed wind field was evaluated against that of the commonly used MCP method. For the wind field based on the regression analysis in CSEOF space, correlation of u and v increased from that of the raw reference data by 4.5% and 3.5% respectively when using the MERRA-1D data, and 1.6% and 1.0% respectively when using the MERRA-2D data. It should be pointed out that the improvement in correlation is small, since the raw MERRA data are already fairly close to the met-mast data. A comparison with the existing MCP method shows that the accuracy, measured in terms of correlation and RMSE, of the reconstructed wind field is similar between the two methods. In the accuracy of the Weibull frequency

distribution of wind speed, the CSEOF-based method is quite comparable to the MCP method. The capacity factor, an important indicator for wind power generation, is slightly improved by CSEOF analysis of the MERRA-1D data. CSEOF analysis based on the MERRA-2D data results in the lowest accuracy, even lower than the raw MERRA data. CSEOF modes derived from the MERRA-2D data reflect variability not only at the met-mast location but also in the entire data domain. It is obvious, therefore, that the long-term variability in the PC time series derived from the MERRA-2D data may not necessarily be strongly correlated with the variability at the met-mast location. Thus, regression error, in general, is larger for 2D CSEOF analysis than for 1D analysis. It should be pointed out, however, that the emphasis of the 2D CSEOF analysis is different from that of the 1D CSEOF analysis. In some applications, wind field and other physical variables in the area surrounding the target location provide beneficial information for power plant operations. Through 2D CSEOF analysis, optimal locations of future met-masts can be determined without much redundancy with existing met-masts. As demonstrated in this study, the performance of the new CSEOF method applied to the MERRA-1D data is as good as the conventional MCP method in explaining the wind variability at the HeMOSU site. As a new method, however, more tests and comparisons in different settings are certainly necessary. In particular, the CSEOF-based method, both one- and two-dimensional versions, needs to be evaluated in more stringent settings, such as the estimation of wind field at a mountainous location, which is currently not possible due to the lack of available met-mast measurements around the Korean Peninsula. In a hypothetical and more stringent experiment, the 1D-CSEOF method reproduces wind in the MERRA data (target) reasonably based on the ERAInterim reference data located  400 km away from the target location. The 1D-CSEOF method generally performs better than

158

J.-Y. Kim, K.-Y. Kim / J. Wind Eng. Ind. Aerodyn. 156 (2016) 146–158

the MCP method in terms of the veracity of histograms and directional properties of wind.

Acknowledgments This work was supported by a New & Renewable Energy of the Korea Institute of Energy Technology Evaluation and Planning (KETEP) grant funded by the Korea government Ministry of Knowledge Economy (Project no. 20113040020010).

References Carta, J.A., Velázquez, S., 2011. A new probabilistic method to estimate the longterm wind speed characteristics at a potential wind energy conversion site. Energy 36, 2671–2685. Clive, P.J.M., 2008. Non-linearity in MCP with Weibull distributed wind speeds. Wind Eng. 32, 319–324. Dee, D.P., et al., 2011. The ERA-Interim reanalysis: configuration and performance of the data assimilation system. Q. J. Meteorol. Soc. 137, 553–597. Derrick, A., 1992. Development of the measure-correlate-predict strategy for site assessment. Proc. BWEA, 1992. García-Rojo, R., 2004. Algorithm for the estimation of the long-term wind climate at a meteorological mast using a joint probabilistic approach. Wind Eng. 28, 213–224. GMAO, 2014. Global Modeling and Assimilation Office. 〈http://gmao.gsfc.nasa.gov/ merra/〉 (accessed 19.08.14). Hamlington, B.D., Leben, R.R., Nerem, R.S., Han, W., Kim, K.Y., 2011. Reconstructing sea level using cyclostationary empirical orthogonal functions. J. Geophys. Res. 116, C12015.

IEC, 2005a. IEC 61400 Wind Turbines – Part 1: Design Requirements. IEC, 2005b. IEC 61400 Wind Turbines – Part 12-1: Power Performance Measurements of Electricity Producing Wind Turbines. Kim, J.Y., Oh, K.Y., Kang, K.S., Lee, J.S., 2013. Site selection of offshore wind farms around the Korean Peninsula through economic evaluation. J. Renew. Energy 54, 189–195. Kim, K.Y., North, G.R., Huang, J., 1996. EOFs of one-dimensional cyclostationary time series: Computations, examples, and stochastic modeling. J. Atmosp. Sci. 53, 1007–1017. Kim, K.Y., North, G.R., 1997. EOFs of harmonizable cyclostationary processes. J. Atmosp. Sci. 54, 2416–2427. Kim, K.Y., Wu, Q., 1999. A comparison of study of EOF techniques: analysis of nonstationary data with periodic statistics. J. Clim. 12, 185–199. Kim, K.Y., Chung, C., 2001. On the evolution of the annual cycle in the tropical Pacific. J. Clim. 14, 991–994. Kim, K.Y., Hamlington, B.D., Na, H., 2015. Theoretical foundation of cyclostationary EOF analysis for geophysical and climatic variables: Concepts and examples. Earth. Sci. Rev. 150, 201–218. Oh, K.Y., Kim, J.Y., Lee, J.K., Ryu, M.S., Lee, J.S., 2012. An assessment of wind energy potential at the demonstration offshore wind farm in Korea. J. Energy 46, 555–563. Rogers, A.L., Rogers, J.W., Manwell, J.F., 2005. Comparison of the performance of four measure-correlate-predict algorithms. J. Wind Eng. Ind. Aerodyn. 93, 243–264. Saavedra-Moreno, B., Salcedo-Sanz, S., Carro-Calvo, L., Gascón-Moreno, J., JiménezFernández, S., Prieto, L., 2013. Very fast training neural-computation techniques for real measure-correlate-predict wind operations in wind farms. J. Wind Eng. Ind. Aerodyn. 116, 49–60. Sreevalsan, E., Das, S.S., Sasikumar, R., Ramesh, M.P., 2007. Wind farm site assessment using measure-correlate-predict (MCP) analysis. Wind Eng. 31 (2), 111–116. Thøgersen, M.L., Motta, M., Sørensen, T., Nielsen, P., 2007. Measure-Correlate-Predict Methods: Case Studies and Software Implementation. Proc. EWEC 2007.