Assessing fruit-tree crop classification from Landsat-8 time series for the Maipo Valley, Chile

Remote Sensing of Environment 171 (2015) 234–244 Contents lists available at ScienceDirect Remote Sensing of Environment journal homepage: www.elsev...

Download PDF

2MB Sizes 0 Downloads 85 Views

Report

PDF Reader
Full Text

Remote Sensing of Environment 171 (2015) 234–244

Contents lists available at ScienceDirect

Remote Sensing of Environment journal homepage: www.elsevier.com/locate/rse

Assessing fruit-tree crop classiﬁcation from Landsat-8 time series for the Maipo Valley, Chile M.A. Peña a,⁎, A. Brenning b a b

Departamento de Geografía, Universidad Alberto Hurtado, Cienfuegos 41, Santiago, Chile Department of Geography, Friedrich Schiller University, Jena, Löbdergraben 32, 07737 Jena, Germany

a r t i c l e

i n f o

Article history: Received 24 April 2015 Received in revised form 28 September 2015 Accepted 25 October 2015 Available online 3 November 2015 Keywords: Satellite image time series NDVI temporal proﬁle Landsat-8 Crop type classiﬁcation

a b s t r a c t Satellite image time series (SITS) provide spectral–temporal features that describe phenological changes in vegetation over the growing season, which is expected to facilitate the classiﬁcation of crop types. While most SITS-based crop type classiﬁcations were focused on NDVI (normalized difference vegetation index) temporal proﬁles, less attention has been paid to using the complete image spectral resolution of the time series. In this work we assessed different approaches to SITS-based classiﬁcation of four major fruit-tree crops in the Maipo Valley, central Chile, during the 2013–14 growing season. We compared four feature sets from a time series comprised of eight cloud-free Landsat-8 images: the full-band SITS, the NDVI and NDWI (normalized difference water index) temporal proﬁles, and an image stack with all the feature sets combined. State-of-the-art classiﬁers (linear discriminant analysis, LDA; random forest; and support vector machine) were applied on each feature set at different training sample sizes (N = 100, 200, 400, 800 and 2291 ﬁelds), and classiﬁcation results were assessed by cross-validation of the misclassiﬁcation error rate (MER). For all the feature sets overall results were good (MERs ≤ 0.21) although substantially improved classiﬁcation accuracies were achieved when the full-band SITS was employed (MER 0.14–0.05). Classiﬁcations applied on the NDVI temporal proﬁle consistently had the worst performance. For a sample size of 200 ﬁelds, LDA using the full-band SITS of image dates 1, 3, 6 and 8 produced the best tradeoff between the number of images and classiﬁcation accuracy (MER = 0.06), being the green, red, blue and SWIR (short-wave infrared) bands of image date 1 (acquired at the early greenup stage) the most relevant for crop type discrimination. Our results show the importance of considering the complete image spectral resolution for SITS-based crop type classiﬁcations as the commonly used NDVI temporal proﬁle and their red and near infrared bands were not found the most signiﬁcant to discriminate the crop types of interest. Furthermore, in light of the good results obtained, the methodology used here might be transferred to similar agricultural lands cultivated with the same crop types, thus providing a reliable and relatively efﬁcient methodology for creating and updating crop inventories. © 2015 Elsevier Inc. All rights reserved.

1. Introduction Passive optical remote sensing may provide meaningful information for monitoring and managing agricultural lands. A particularly signiﬁcant application is the spectral classiﬁcation of crop types from satellite-based multispectral images, which makes it possible to create or update crop inventories based on limited ﬁeld data. It may furthermore help to direct management practices (e.g., site-speciﬁc water and nutrient supply) and assist in yield forecasting, particularly where little historical ﬁeld data is available. Nonetheless, different vegetation species may show quite similar spectral behavior (or low inter-speciﬁc spectral variability), especially during some particular phenological stages and at the typical spectral resolution and bandwidth of satellite⁎ Corresponding author. E-mail addresses: [email protected] (M.A. Peña), [email protected] (A. Brenning).

http://dx.doi.org/10.1016/j.rse.2015.10.029 0034-4257/© 2015 Elsevier Inc. All rights reserved.

based multispectral images (Esch, Metz, Marconcini, & Keil, 2014; Galvão, Epiphanio, Breunig, & Formaggio, 2012). This issue may strongly limit the accuracy of crop types classiﬁed from single image dates (Jewell, 1989; Lo, Scarpace, & Lillesand, 1986; Murakami, Ogawa, Ishitsuka, Kumagai, & Saito, 2001; Van Niel & McVicar, 2004). The use of satellite image time series (SITS) arises as a promising approach in remote sensing-based crop type classiﬁcation. By using SITS, spectral–temporal proﬁles of the crops of interest are retrieved from a set of multispectral images acquired within a temporal window of interest (usually the entire growing season). This multi-temporal spectral behavior is related to the phenology of the crop, i.e., their seasonal dynamics or intra-annual developmental stages, whose onset and offset dates usually differ from other crop types planted on the same agricultural land. Thus, distinctive spectral–temporal features of each crop type may be extracted from SITS, increasing the chances of classifying them correctly (Jakubauskas, Legates, & Kastens, 2002; Masialeti, Egbert, & Wardlow, 2010; Odenweller & Johnson, 1984;

M.A. Peña, A. Brenning / Remote Sensing of Environment 171 (2015) 234–244

Ozdogan, 2010; Wardlow, Egbert, & Kastens, 2007). Within a growing season the main phenological stages driving the bulk spectral behavior of a crop are: (1) greenup: the date of onset of photosynthetic activity, (2) maturity: the date at which plant green leaf area is maximum, and (3) senescence: the date at which photosynthetic activity and green leaf area begin to rapidly decrease (Zhang et al., 2003). Although there is a growing body of work and practical experience in agricultural land cover classiﬁcations using multi-temporal image sets as part of land use and land cover (LULC) mapping and monitoring (see references in Giri, 2012, and Xie, Sha, & Yu, 2008), we center our attention more narrowly on SITS-based classiﬁcation of individual types of crop. While LULC classiﬁcations are often focused on tracking longterm changes of major classes, crop type classiﬁcation primarily deals with ﬁner thematic granularity and subtle spectral differences. In this case, the spectral separability between crop types is pursued by tracking their phenological differences across a set of images regularly acquired within a given growing season. SITS have been widely used to classify single and multiple crops types mainly by retrieving NDVI (normalized difference vegetation index) temporal proﬁles as indicators of crop phenology from coarse spatial resolution images such as MODIS (Moderate Resolution Imaging Spectroradiometer) (Arvor et al., 2011; Chenab, Sonb, Changab, & Chenb, 2011; Jakubauskas et al., 2002; Mingwei et al., 2008; Masialeti et al., 2010; Sakamoto et al., 2005; Shao, Lunetta, Ediriwickrema, & Liames, 2010; Sibanda & Murwira, 2012; Sun, Xu, Lin, Zhang, & Mei, 2012; Wardlow & Egbert, 2010; Xavier, Rudorff, Shimabukuro, Berka, & Moreira, 2006; Zhong, Hawkins, Biging, & Gong, 2011) and from ﬁner spatial resolution images such as Landsat or SPOT (Satellite Pour l'Observation de la Terre) (Badhwar, Gargantini, & Redondo, 1987; Jewell, 1989; Murakami et al., 2001; Nguyen, De Bie, Ali, Smaling, & Chu, 2011; Simonneaux et al., 2007; Turker & Arikan, 2005; Zhong, Gong, & Biging, 2014; Zheng, Myint, Thenkabail, & Aggarwal, 2015). The NDVI is constructed from a red band (R) sensitive to foliar chlorophyll content, and a near infrared (NIR) band sensitive to canopy foliage amount, as (NIR − R) / (NIR + R). This spectral index represents in a convenient and synthetic fashion the physiological and structural condition of vegetation, often referred as greenness, which from a temporal perspective relates to vegetation phenology (Odenweller & Johnson, 1984; Jones & Vaughan, 2010). The widespread reliance on MODIS for crop type classiﬁcation is mainly due to the free availability of 16- and 32-day NDVI image composites, as well as 8-day surface reﬂectance products that enable to construct 8-day NDVI composites. These composite products strongly increase the availability of cloud-free imagery while reducing the need for radiometric and geometric processing by the end user. Furthermore, they are particularly useful in agricultural lands with relatively large and homogeneous crop ﬁelds, considering MODIS spatial resolutions ranging from 250 to 1000 m (Murakami et al., 2001; Ozdogan, 2010; Roy et al., 2014; Wardlow et al., 2007; Zhong et al., 2011). The phenology of a given crop type may change in space and time in response to variation in environmental conditions (e.g., temperature, precipitation, soil type and humidity) and management practices (e.g., fertilization, irrigation, crop rotation). As a result, spectral–temporal proﬁles extracted from ﬁelds corresponding to the same crop type may show some intra-annual differences (within-crop phenological variability) as well as inter-annual differences (early or delayed onset of phenological stages), even at local scales (Arvor et al., 2011; Masialeti et al., 2010; Odenweller & Johnson, 1984; Wardlow et al., 2007). Intra-annual variation in the amplitude of the NDVI temporal proﬁles of two ﬁelds of the same crop type (with phase unchanged) indicates different vegetation conditions. Meanwhile, intra-annual variation in the phase of temporal proﬁles (with amplitude unchanged) of ﬁelds of the same crop type indicates shifts in developmental stages (Jakubauskas et al., 2002). To support and enhance the crop type classiﬁcation based on NDVI temporal proﬁles, reference ﬁeld data about crop phenology have been used (Sakamoto et al., 2005; Shao et al.,

235

2010; Sibanda & Murwira, 2012; Zheng et al., 2015; Zhong et al., 2011, 2014), although the success of these approaches may strongly depend on the quality and quantity of such data (Masialeti et al., 2010). Although the use of the complete spectral resolution of SITS in the realm of agricultural land cover classiﬁcations is not a new approach (see early references in Lo et al., 1986), quantitative assessments of its potential are still lacking, and it is necessary to update earlier results considering the state of the art in data mining and statistical learning. Full-band Landsat-7 time series were previously used by Van Niel and McVicar (2004) to classify four cereal crop types in the Coleambally Irrigation Area, Australia. In their study the SITS only included image dates that previously yielded the highest classiﬁcation accuracies for each single crop in an iterative multi-date classiﬁcation process of the whole time series. So, even though they did not utilize all the images of the SITS, their results showed per-class accuracies mostly above 80%. The scarce attempts to classify crop types by full-band SITS may be attributed to some extent to the general assumption that using the complete spectral resolution of a multispectral image (e.g., from blue to short-wave infrared, SWIR, bands) may provide redundant or little additional information about crop condition compared with a synthetic measure such as the commonly used NDVI. However, the underlying assumption of an autocorrelation between band absorption and reﬂection features may not necessarily hold true for all vegetation types. For instance, throughout the phenological stages of some vegetation targets it has been found that variations in foliar chlorophyll and water content (sensitive at SWIR wavelengths) may not necessarily be proportional due to the numerous environmental factors affecting their physiological and structural properties (Ceccato, Flasse, & Gregoire, 2002; Ceccato, Flasse, Tarantola, Jacquemoud, & Grégoire, 2001). On the other hand, some image bands are usually avoided for many applications because they may contain a large amount of noise (e.g., blue and SWIR bands) even after atmospheric correction, yielding unreliable spectral signals (Jones & Vaughan, 2010; Peña & Altmann, 2009; Roberto et al., 2012). However, new satellite remote sensing technologies provide improved signal-to-noise ratios, and the capabilities of novel atmospheric correction algorithms to remove extraneous path radiance under a wide range of environmental conditions have been continuously enhanced (Liang, Li, & Wang, 2012; Roy et al., 2014). These advances have improved the quality of bands traditionally considered noisy. In this paper we assessed the SITS-based classiﬁcation of four major fruit-tree crop types in a study area located in central Chile during the 2013–14 growing season. We used all the cloud-free Landsat-8 images available for the season of interest. Three state-of-the-art statisticallearning classiﬁers were selected and applied at different training sample sizes on the full-band SITS, as well as on NDVI and NDWI (normalized difference water index) temporal proﬁles, and their accuracies were then assessed. The methods to be employed for the study area must consider the relatively small ﬁeld size of the crops of interest, which precludes the use of MODIS products, as well as the absence of reliable auxiliary data on the phenology of the crops of interest. We expect to ﬁnd optimal combinations of feature sets, classiﬁcation techniques and training sample sizes to classify the crop types of interest, which may be transferable to nearby similar agricultural lands. This could result in the construction of less costly and more frequent crop inventories, which are currently updated in our study area at least every four years mostly by ﬁeld campaigns. 2. Materials and methods 2.1. Study area The study area is located in the Maipo River basin (15,157 km2) in the Metropolitan Region of Chile south of the capital Santiago. The Mediterranean climate in which the basin is located is characterized by hot, dry summers and cool, wet winters, which results in hydrological regimes controlled by snowmelt in the Andean headwaters in spring

236

M.A. Peña, A. Brenning / Remote Sensing of Environment 171 (2015) 234–244

and rain in winter. Agriculture is its main land use, although native sclerophyllic vegetation still remains as forest and shrubland formations mainly in steep terrain. Most of its agricultural land is occupied by fruit

trees (36%), vegetables (17%) and cereals (11%) (ODEPA (Oﬁcina de Estudios y Políticas Agrarias), 2014). The study area administratively comprises nine counties situated along the middle reaches of the

Fig. 1. Distribution of the tree-fruit crops within the study area (top), located in the Maipo Valley in central Chile. Bottom: Maipo River basin (thick line) and administrative boundaries of the Metropolitan Region (thin line) highlighting the study area's counties in gray.

M.A. Peña, A. Brenning / Remote Sensing of Environment 171 (2015) 234–244

Maipo River: Padre Hurtado, Peñaﬂor, Calera de Tango, Talagante, Isla de Maipo, Buin, Paine and Pirque (Fig. 1). Together, these counties reach an area of 1878 km2, mostly cultivated with four tree-fruit species: walnut (Juglans regia L.), table grape (Vitis vinifera L.), almond (Prunus dulcis Mill.) and European plum (Prunus domestica L.) (Table 1), heterogeneously distributed in the form of relatively small ﬁelds (N = 3120; mean size 44,000 m2).Greenup stage of these species broadly starts during October (southern spring), while senescence stage starts in early May (southern autumn). Inter- and intra-speciﬁc onset dates of these stages show shifts of several days according to the variety or management practice applied on the crop. The mentioned species were the target crops of this study. 2.2. Satellite image time series All cloud-free Landsat-8 images acquired from early October, 2013 to late April, 2014 (every 16 days, the revisit time of the satellite) were selected and downloaded from the Global Visualization Viewer (http://glovis.usgs.gov/) of the USGS (United States Geological Survey). The list of image acquisition dates used in this study is presented in Table 2. Landsat-8 satellite was launched on February 11th, 2013, carrying the multispectral pushbroom sensor OLI (Operational Land Imager), which incorporates numerous technical improvements in regard with their predecessor instrument, ETM + (Enhanced Thematic Mapper Plus), on board Landsat-7. Particularly noteworthy are the higher signal-to-noise ratio (increased by a factor of at least eight in comparison with ETM +) and radiometric resolution (12 bits rather than 8 bits used by ETM+) as well as the improved geometric ﬁdelity (by an on-board global positioning system, GPS) and the better combination of pre-launch, on-board and vicarious calibration procedures (Roy et al., 2014). In this study, the panchromatic band 8 (0.5–0.68 μm) was not used because of its broad spectral width, and bands 1 (0.43–0.45 μm) and 9 (1.36–1.39 μm) were omitted because they were designed for coastal water and atmospheric aerosol applications (Roy et al., 2014). Meanwhile, thermal bands 10 (10.60– 11.19 μm) and 11 (11.50–12.51 μm) acquired by the TIRS (Thermal Infrared Sensor) instrument on board this satellite were also discarded, mainly because their relatively large original image pixel size of 100 m limits their utility in the classiﬁcation of the crop types targeted in this study. Table 3 lists the main technical properties of the Landsat-8 bands used in this study. 2.3. Image pre-processing Landsat-8 images are originally distributed as Level 1 terraincorrected (L1T) products. The procedures applied on these images include a systematic geometric correction, precision correction assisted by ground control chips, and the use of a digital elevation model (DEM) to correct parallax error due to terrain relief. The output coordinate system assigned is the Universal Transverse Mercator (UTM) with World Geodetic System 1984 (WGS84) datum, and the circular geolocation error is b12 m at the 90% conﬁdence level (Roy et al., 2014). As products subject to the same geometric correction procedures, L1T images Table 1 Area occupied by the fruit-tree species of interest and the rest of the tree fruits cultivated in the study area. Tree-fruit crop

Walnut Table grape Almond European plum Others Total

Area Km2

%

55.35 42.21 21.64 18.66 47.7 182.9

29.2 24.6 10.1 10.1 26.1 100

237

Table 2 Acquisition dates of the Landsat-8 images used in this study. Image

Season in the study area

#

Acquisition date

1 2 3 4 5 6 7 8

October 5, 2013 October 21, 2013 November 22, 2013 January 9, 2014 January 25, 2014 February 10, 2014 February 26, 2014 April 15, 2014

Spring Spring Spring Summer Summer Summer Summer Autumn

acquired at different dates over the same path/row satisfy the spatial match required for time series analysis. In spite of this, image geometric coregistration was visually assessed by checking the spatial match of some randomly selected ﬁeld boundaries within the study area across the time series. The spatial shifts in the target boundaries were found to be far below half the pixel size and therefore negligible for our purposes. The conversion of image digital numbers to absolute at-sensor radiances was automatically performed by applying to each band the OLI radiometric calibration parameters available in the ENVI© (Environment for Visualizing Images) 5.0.3 software (Exelis Visual Information Solutions, Inc., Boulder, USA). Afterwards, images were atmospherically corrected using its FLAASH (Fast Line-of-Sight Atmospheric Analysis of Spectral Hypercubes) module. This is a MODTRAN (Moderate Resolution Atmospheric Transmission) based algorithm that aims to reduce the extraneous path radiance affecting the pixel's at-sensor radiometry (i.e., adjacency and haze effects), while modeling the at-surface irradiance. To do that, the algorithm requires a set of user-deﬁned scene and study area parameters (image acquisition date and time, geographic location, terrain elevation, visibility, atmosphere and aerosol type, among others). The output of this procedure is an image in apparent surface reﬂectance pixel units (resulting from rationing at-surface irradiance to at-surface radiance). 2.4. Training stage Fields required to train the classiﬁcation of the crops of interest were taken from the Fruit Cadastre of the study area, carried out by Chile's Oﬁcina de Estudios y Políticas Agrarias (ODEPA). This is an online digital database (http://icet.odepa.cl/) that contains crop ﬁeld boundaries as polygons updated by ﬁeld campaigns to the 2013–2014 season. The training sites were manually selected from within-ﬁeld areas in which a continuous vegetation cover was observed. To reﬁne this task, we ﬁrst removed too small ﬁelds (≤22,500 m2 or ≤5 × 5 pixels) or excessively narrow ﬁelds from the original database because they are mainly comprised of mixed edge (Landsat-8) pixels useless as training sites and detrimental for the subsequent classiﬁcation accuracy assessments. As a result, the original number of polygons per crop type (walnut: 1,121, table grape: 1,106, almond: 477, European plum: 416) was somewhat reduced (796, 889, 308 and 298 ﬁelds, respectively; Table 4). Table 3 Main technical characteristics of the Landsat-8 OLI (Operational Land Imager) image bands used in this study. Band #

Spectral region

Spectral width (μm)

Spatial resolution (m)

Radiometric resolution (bits)

2 3 4 5 6 7

Blue Green Red Near infrared Shortwave infrared Shortwave infrared

0.45–0.51 0.53–0.59 0.64–0.67 0.85–0.88 1.57–1.65 2.11–2.19

30 m

12

238

M.A. Peña, A. Brenning / Remote Sensing of Environment 171 (2015) 234–244

Table 4 Number and size of the crop ﬁelds used in this study. Crop type

Field #

Walnut Table grape Almond European plum Total

Size

Polygons

Pixels

Mean (m2)

Standard deviation (m2)

796 889 308 298 2291

8310 7374 3468 3533 22,685

63,525 43,677 63,173 57,096 59,940

50,531 21,982 41,402 35,726 39,442

2.5. Classiﬁcation and validation stages Crop type classiﬁcation was carried out for four feature sets with and without full-band SITS data, using three different classiﬁers and ﬁve different training sample sizes in order to identify efﬁcient classiﬁcation procedures using a methodology similar to Brenning, Long, and Fieguth (2012). Backward and forward image selection was furthermore carried out for the best-performing classiﬁer and feature set in order to determine whether a reduced number of image dates can achieve competitive results. The following feature sets were derived from the Landsat-8 time series: (1) NDVI temporal proﬁle (i.e., eight NDVI values as predictors), (2) NDWI temporal proﬁle, (3) full-band SITS (i.e., a total of 8 × 6 predictors) and (4) an image stack with all the previously mentioned feature sets combined (64 predictors). NDVI was calculated as mentioned in the introduction chapter of this work, while NDWI was calculated using the same formula, but replacing the red band with the SWIR band comprising 1.57–1.65 μm (Gao, 1996). Since different classiﬁcation techniques differ in their ability to leverage nonlinear or otherwise complex relationships between features and crop type, we used three different statistical and machine learning techniques that are representative of the state–of-the-art in image classiﬁcation (Brenning et al., 2012): linear discriminant analysis (LDA), random forest (RF) and support vector machine (SVM). While LDA is less ﬂexible than the other methods as it is only able to separate the classes linearly, it is less prone to overﬁtting and computationally very efﬁcient. RF is a bootstrap-aggregated tree-based classiﬁer that is particularly suited to detect nonlinear interactions among predictors (Breiman, 2001). SVM is also a relatively novel technique that is based on nonlinear kernel transformations of the predictors, projecting them into a higher-dimensional feature space. In this space, an optimal separating hyperplane is computed (Moguerza & Muñoz, 2006). We refer the reader to James, Witten, Hastie, and Tibshirani (2013) for further details on the classiﬁcation techniques used. RF was applied with its default settings in the R implementation used in this study (500 trees, square root of no. of predictors selected randomly; Liaw & Wiener, 2002). SVM in its implementation in the ‘e1071’ package (Meyer, Dimitriadou, Hornik, Weingessel, & Leisch, 2014) was used for C-classiﬁcation with radial basis function kernels. The costs parameter C and the kernel bandwidth parameter γ were tuned using an inner cross-validation grid search on each training data set, resulting in a computationally highly expensive procedure. In order to reduce noise and obtain an unambiguous class prediction for each ﬁeld polygon, pixel-level predictions were ﬁltered by picking each ﬁeld's predicted majority class. All analyses were performed in the data analysis software R (version 3.1.1; R Core Team, 2014). Classiﬁcation accuracy for each classiﬁer and feature set was assessed for ﬁve training sample sizes (N = 100, 200, 400 and 800 randomly selected ﬁelds, and all 2291 ﬁelds) using cross-validation estimation of the misclassiﬁcation error rate (MER). To do that, MERs were estimated by cross-validation. MER is deﬁned as the total proportion of objects in a data set that a classiﬁer assigns to the wrong class. Cross-validation is a resampling-based bias-reduced technique for the

estimation of predictive performance. It repeatedly generates training and test data sets from the available set of ﬁelds with known class membership, effectively using the entire data set for performance estimation. k-fold cross-validation consists of partitioning the set of ﬁelds into k equally-sized subsets; a classiﬁer is trained on all but one of these subsets and evaluated on the remaining one. To be independent of a particular partitioning, the whole procedure is repeated r times (r-repeated k-fold cross-validation). Error rates are averaged over all test data sets. In this study, we used k = 10 and r = 100 with the exception of SVM where only r = 20 was feasible due to the high computational complexity of the additional inner cross-validation step on each of the k × r training data sets. Cross-validation was performed on the ﬁeld level, not on the pixel level, as the latter would result in a contamination of test data sets with pixels that are pseudoreplications of training pixels, which would yield over-optimistic accuracy estimates and encourage over-ﬁtting (Brenning, Kaden, & Itzerott, 2006). Field-level cross-validation was carried out using the ‘sperrorest’ package in R (Brenning, 2012). Considering a 5% level of signiﬁcance and a Bonferroni correction to control the overall error rate of a family of tests, an estimate for the critical difference in cross-validated performance estimates was determined to be 0.007 (i.e., 0.7 percentage points) based on 100 crossvalidation repetitions and 0.017 based on 20 cross-validation repetitions. This critical difference was calculated for a two-sample t-test and using the largest observed standard deviation of cross-validated MERs to obtain a conservative signiﬁcance threshold. In the Bonferroni correction for multiple comparisons we accounted for all possible tests between situations where only one of the factors changed, e.g. LDA versus SVM performance for the same feature set and training sample size. Using the best feature set and classiﬁer from the performance comparison, forward and backward image selection were performed in order to explore if similar performances can be achieved with a smaller set of image dates. In forward (backward) selection, predictors corresponding to one image date were added to (removed from) the feature set, starting with an empty feature set (the full feature set). In each step, the image combination that achieved the best performance was selected. Image selection was performed for a training sample size of 200 ﬁelds only based on intermediate results from the previous steps. In order to assess which of the features within the resulting feature set contributed most to the classiﬁer's predictive capability, a permutation-based variable accuracy importance measure was furthermore calculated (Brenning et al., 2012). In this approach, one predictor at a time is permuted or “messed up” in order to calculate how much the MER increases compared to predictions using undisturbed data. This technique was embedded in the ﬁeld-level spatial cross-validation using ‘sperrorest’ (Brenning, 2012). For each variable, permutation was repeated 100 times within each cross-validation repetition and fold. 3. Results 3.1. Overall classiﬁcation performance Regardless of the feature set and training sample size used, all the classiﬁcations achieved good accuracies (MER ≤ 0.21; Fig. 2 and Table 5). For all the feature sets, classiﬁcation accuracies consistently increased as the training sample size increased from 100 to all 2291 ﬁelds (MER differences of 0.02–0.1, depending on the classiﬁer; mean differences N0.007 are signiﬁcant at the 5% level while accounting for multiplicity of tests). However, performance improvements with training sample sizes increasing above 200 ﬁelds were relatively small in full-band SITS classiﬁcation and when using all feature sets combined (MER differences of 0.01–0.05 compared to a training sample size of 200 ﬁelds). Classiﬁcation performances obtained with the full-band SITS and all the feature sets combined were almost identical (MER differences ≤0.01

M.A. Peña, A. Brenning / Remote Sensing of Environment 171 (2015) 234–244

239

Fig. 2. Cross-validated MERs for fruit-tree ﬁelds using (a) image stacks and (b) only NDVI/NDWI temporal proﬁles for different classiﬁers and training sample sizes. Mean differences of N0.007 (SVM: N0.017) should be considered signiﬁcant at the 5% level while accounting for multiple comparisons.

for same classiﬁer and training sample size), providing higher accuracies across the ﬁve training sample sizes than classiﬁcations using only NDVI or NDWI temporal proﬁles (MER differences of 0.05–0.1, depending on the classiﬁer and the training sample size used). LDA using the full-band SITS was the best-performing classiﬁer at each of the training sample sizes, providing the maximum overall accuracy value at all 2291 ﬁelds (MER = 0.05). RF using the full-band SITS started with poorer classiﬁcation performances for smaller training sample sizes but caught up with LDA when using the largest training sample size. SVM using the full-band SITS yielded classiﬁcation accuracies between LDA and RF but came at a high computational cost, making its application impractical for the larger training sample sizes. Classiﬁcations based on NDVI temporal proﬁle consistently had the worst performance, being LDA the classiﬁer that produced the lowest overall accuracies at each of the training sample sizes. By comparison, NDWI temporal proﬁle consistently achieved a somewhat better performance (up to 3 percentage points). RF trained on all 2291 ﬁelds and using NDWI temporal proﬁle produced the highest accuracy among index-based classiﬁcations (MER = 0.1). Based on these results, we consider that LDA using (at least) the full-band SITS provided the most satisfying classiﬁcation performance overall, taking into account also its computational efﬁciency and relative insensitivity to training

Table 5 Cross-validated MERs provided by three classiﬁers applied on four feature sets at ﬁve training sample sizes. Differences of N0.007 (SVM: N0.017) are signiﬁcant at the 5% level while accounting for multiplicity of comparisons. Classiﬁer

Feature set

Misclassiﬁcation error rate

sample size. With this classiﬁer, nearly optimal results were achieved with a training sample of only 200 ﬁelds (MER = 0.06). 3.2. Classiﬁcation performance using linear discriminant analysis When used with the full-band SITS or all the feature sets combined, LDA had the best overall performance (MERs 0.08–0.05; Fig. 2 and Table 5). Classiﬁcation accuracies between both features sets were about the same (MER b 0.01), showing a slight increasing as the training sample size increased from 100 to all 2291 ﬁelds (MER difference of 0.03). Conversely, LDA had the worst overall performance when used with index-based feature sets, particularly with NDVI temporal proﬁle (MERs 0.21–0.17). For both index-based feature sets, classiﬁcation accuracies showed a slight increase as the training sample size increased from 100 to all 2291 ﬁelds (MER difference of 0.04). 3.3. Classiﬁcation performance using support vector machine Classiﬁcation accuracies obtained from SVM were somewhat lower than those achieved with LDA when used with the full-band SITS or all the features combined (Fig. 2 and Table 5). Both feature sets provided the highest classiﬁcation accuracies (MERs 0.06–0.01), showing a very similar performance (MER ≤ 0.01) and a slight increasing as the training sample size increased from 100 to 400 ﬁelds (MER difference of 0.05). For comparison, index-based feature sets and especially NDVI temporal proﬁle yielded poorer performances (MERs 0.20–0.13). Classiﬁcation accuracies obtained from NDVI and NDWI temporal proﬁles slightly increased as the training sample size increased from 100 to 400 ﬁelds (MER difference between 0.05 and 0.04, respectively).

Training sample size

Linear discriminant analysis

Random forest

Support vector machine

NDVI NDWI Full-band SITS All feature sets NDVI NDWI Full-band SITS All feature sets NDVI NDWI Full-band SITS All feature sets

100

200

400

800

All 2291 ﬁelds

0.21 0.18 0.08 0.08 0.20 0.18 0.13 0.14 0.20 0.17 0.11 0.10

0.19 0.17 0.06 0.06 0.17 0.15 0.10 0.11 0.17 0.14 0.08 0.08

0.18 0.16 0.06 0.06 0.14 0.13 0.08 0.09 0.15 0.13 0.06 0.06

0.17 0.15 0.05 0.05 0.12 0.11 0.06 0.07 – – – –

0.17 0.14 0.05 0.05 0.10 0.10 0.05 0.06 – – – –

3.4. Classiﬁcation performance using random forest RF had the worst overall performance when used in classiﬁcations that included full-band SITS; it did, however, exploit index-based feature sets more effectively than LDA especially for large sample sizes (Fig. 2 and Table 5). Classiﬁcation accuracies obtained from full-band SITS and all the feature sets combined were also in this case higher than those retrieved by index-based feature sets. Full-band SITS classiﬁcation accuracies at all training sample sizes were only one percentage point higher than those produced by combining all the feature sets (MERs 0.13–0.05). Both feature sets showed a more substantial performance improvement with training sample size than LDA (MER difference of 0.08).

240

M.A. Peña, A. Brenning / Remote Sensing of Environment 171 (2015) 234–244

3.5. Optimal image selection and band importance A forward image selection based on the best-performing classiﬁer and feature set (LDA using full-band SITS feature set) at a training sample size of 200 ﬁelds showed that the combination of only four images: 1, 3, 6 and 8 (see acquisition dates in Table 1), achieved a nearly optimal MER (0.06), arising as the best tradeoff between number of images and classiﬁcation accuracy (Fig. 3; backward selection not shown because of nearly identical results). The map resulting from this classiﬁcation is shown in Fig. 4. Classiﬁcation accuracies increased only slightly when the remaining images were progressively added to the model. Classiﬁcation accuracies started to decrease more substantially when three or fewer images were considered. The image acquired on April 15th, 2014 (image no. 8, Table 1) provided the best single date-based classiﬁcation accuracy (MER = 0.19, Fig. 3), which equals the result yielded by this classiﬁer when applied on the NDVI temporal proﬁle of all eight images at the same training sample size. A further examination of variable importance in LDA using full-band SITS of image dates 1, 3, 6 and 8 and a training sample size of 200 ﬁelds revealed that the most important bands belonged to image date 1, with the green, red, blue and SWIR bands leading the ranking (Fig. 5). Meanwhile, the less important bands were mostly concentrated in image dates 6 and 8, with a mixed combination of visible to SWIR bands from both dates contributing the least to predictive performance. The cross-validated confusion matrix of LDA using full-band SITS from image dates 1, 3, 6 and 8 and a sample size of 200 ﬁelds yielded very good overall and per-class results, with user's and producer's accuracies showing only little variations among crop types (Table 6). The highest commission and omission errors (i.e, the complement of user and producer accuracies, respectively) were obtained for European plum, while both error measures were very low for table grape. 4. Discussion 4.1. Classiﬁcation performance and feature sets The three classiﬁers used in this work had a similar overall performance, providing classiﬁcation accuracies that in the realm of remote sensing-based classiﬁcations are commonly considered as good. Pal (2005) obtained good overall accuracies (88%) and per-class accuracies (≥81%) from SVM and RF applied on a Landsat image to classify seven crop types, but did not compare these computationally intensive statistical learning methods with simpler linear discriminant techniques. Pal and Mather (2005) found that SVM had a slightly better performance than a neural network classiﬁer when applied on an airborne hyperspectral image to classify eight land cover types, including some crop types (overall accuracies of 93.6 and 86%, respectively). The fairly consistent classiﬁcation accuracies found in our work were useful for the pursued analysis as instead of comparing the performance

of a set of classiﬁers we rather focused on assessing the performance of different SITS-derived feature sets to classify the crop types of interest. We selected the classiﬁers simply based on their representativeness as state-of–the-art machine-learning techniques. During the last years, these techniques have gained increasing interest in the remote sensing community because of their potential to deal with high-dimensional and complex relationships that are often required to discriminate spectrally similar targets (Gislason et al., 2006; Mountrakis, Im, & Ogole, 2011; Szuster, Chen, & Borger, 2011). Nevertheless, in our study linear techniques outperformed nonlinear SVM and RF classiﬁers, which underlines the utility of linear techniques and their variants (e.g., with lasso penalties) for computationally efﬁcient processing of remote sensing data with often competitive predictive performances (e.g., Zandler, Brenning, & Samimi, 2015). A particular strength conferred to machine learning classiﬁers has been their relatively low sensitivity to training sample size reduction (Huang, Davis, & Townshend, 2002; Mathur & Foody, 2008; Mountrakis et al., 2011). Sample size effects have rarely been addressed in crop or similar vegetation land cover classiﬁcations using SITS. Zheng et al. (2015) classiﬁed nine single and double crop types applying SVM on a NDVI temporal proﬁle constructed with 22 Landsat-5 and − 7 images acquired throughout one year. They compared classiﬁcation accuracies using training sites selected by the aid of auxiliary data (n = 44) and by a stratiﬁed random sampling technique (n = 174). Although in some cases producer's and user's accuracies showed significant variations, overall accuracies remained stable (90 and 86%, respectively). Similarly, Rodriguez-Galiano, Ghimire, Rogan, Chica-Olmo, and Rigol-Sanchez (2012) classiﬁed 14 land covers (including some crops and other vegetation targets) applying RF on two Landsat-5 images acquired in spring and summer. They found that the reduction of training sample size did not have a signiﬁcant effect on the classiﬁer's accuracy until reaching a 50% data reduction. Our results agree with these ﬁndings, as regardless of the feature set and classiﬁer used, classiﬁcation accuracies did not signiﬁcantly diminish when the training sample size decreased even to 200 ﬁelds. Unlike our work, most of the previously published SITS-based crop type classiﬁcations have relied on the use of MODIS-derived temporal proﬁles of greenness indices like the NDVI. These classiﬁcations have been commonly applied at regional scales on relatively large graminaceous crop ﬁelds. Using Maximum Likelihood algorithm, Van Niel and McVicar (2004) classiﬁed four crop types with an overall accuracy of 95.8%, and Arvor et al. (2011) obtained an overall accuracy of 74% in the classiﬁcation of three crops types planted in single or double cropping systems. With decision trees, Wardlow and Egbert (2008) classiﬁed four crop types with an overall accuracy of 84%, and Zhong et al. (2011) obtained overall accuracies of 74.6 and 78.6% in the classiﬁcation of two areas with eight and 12 crop types, respectively. By using Fourier analysis Jakubauskas et al. (2002) classiﬁed three crop types with an overall accuracy of 52%, and Mingwei et al. (2008) obtained a

Fig. 3. MERs obtained in a forward image selection based on the best-performing classiﬁer and feature set: LDA applied on the full-band SITS, at a training sample size of 200 ﬁelds.

M.A. Peña, A. Brenning / Remote Sensing of Environment 171 (2015) 234–244

241

Fig. 4. Crop type classiﬁcation using LDA on the full-band SITS comprised by the image dates 1, 3, 6 and 8 at a training sample size of 200 ﬁelds.

RMSE (root mean square error) of 48.11 and 82 km2 in the area estimation of cotton and maize, respectively. By using temporal unmixing and independent component analysis Ozdogan (2010) obtained RMSE values between 15 and 30% in the area estimation of three crops types

subject to different management practices. In spite of the good results reported in these works, our study showed that substantial information is lost when a SITS-based crop type classiﬁcation is performed by an index-based temporal proﬁle rather than by the full-band SITS, having

Fig. 5. Permutation-based variable importance of each band in terms of the mean increase in MER for image dates 1, 3, 6 and 8 previously selected for LDA with a training sample size of 200 ﬁelds. Y axis labels identify the image number (ﬁrst digit) and the band number (second number in variable name).

242

M.A. Peña, A. Brenning / Remote Sensing of Environment 171 (2015) 234–244

Table 6 Confusion matrix of crop types classiﬁed by LDA applied on the full-band SITS comprised by the image dates 1, 3, 6 and 8 at a training sample size of 200 ﬁelds. Numbers in the table are pixel counts (table margins: fractions) averaged over 100 cross-validation repetitions.

Walnut Table grape European plum Almond Producer's accuracy Overall accuracy

Walnut

Table grape

European plum

Almond

User's accuracy

7902 80 191 137 0.95

177 7130 41 27 0.97

295 13 3052 173 0.86

131 25 149 3163 0.91

0.92 0.98 0.89 0.9

Overall accuracy

0.94

a detrimental impact on the classiﬁcation accuracies. As a matter of fact, we found that even the best-performing single-image classiﬁer was able to achieve the same performance as a classiﬁer using NDVI temporal proﬁle based on all eight images. Our study area comprised relatively small ﬁelds of fruit-tree crops which technically preclude the use of the typical MODIS-derived NDVI composite products. Instead, we decided to use a Landsat-8 time series for a given growing season to examine if the full-band SITS or the temporal proﬁle of a wetness spectral index (NDWI) would yield improved predictive performances. Even when the acquisition dates of the images comprising the time series did not necessarily match with critical phenological dates of the crops of interest and some images had to be missed due to cloud cover, the results obtained here are quite promising. Classiﬁcation accuracies were the highest when the full-band SITS was employed, regardless of the classiﬁer and training sample size used (MER 0.14–0.05). The best classiﬁcation accuracies were obtained with the simple and less ﬂexible LDA classiﬁer, which is consistent with the results obtained by Brenning et al. (2006), who used Landsat time series from 1996 and 2000 to classify seven and nine crop types, respectively, in Havelland, Germany. In that work, classiﬁcation accuracy obtained for LDA (mean MER = 20.1) was among the best results obtained for the set of classiﬁers used, which included SVM. As in our work, this classiﬁer also stood out by achieving very good classiﬁcation with a small training sample size of 200 ﬁelds. Moreover, in our work classiﬁcations performed using the NDWI temporal proﬁle provided slightly better results than using the NDVI temporal proﬁle. This result highlights the potential that a wetness spectral index may have to track changes in vegetation phenology, which for SITS-based crop type classiﬁcations has been overlooked. NDVI temporal proﬁles of graminaceous crop types have shown good separabilities due to the clear differences in some of the phenological stages of these crops, resulting in good classiﬁcation accuracies. As annual crops, ﬁrst interclass differences in the NDVI temporal proﬁles of gramineous may be found as a result of the different times at which some of them are planted (e.g., winter wheat versus summer crops). Budburst and green-up times therefore offer good opportunities to discriminate them. Moreover, many of them are cultivated in annual rotations or double cropping systems. Harvesting and regrowth therefore improve the ability to discriminate them by NDVI temporal proﬁles based on their distinctive bimodal patterns (Arvor et al., 2011; Masialeti et al., 2010; Mingwei et al., 2008; Murakami et al., 2001; Ozdogan, 2010; Wardlow et al., 2007; Zheng et al., 2015). Although studies addressing SITS-based classiﬁcation of fruit-tree crop types are still rare (e.g., Simonneaux et al., 2007; Zhong et al., 2011), it is possible to argue that classiﬁcation based on NDVI temporal proﬁles may be more challenging than for graminaceous crops, because as perennial summer crops they may exhibit similar interclass timings in their phenological stages. Zhong et al. (2011) found similar NDVI temporal proﬁles for almonds, pistachio and vineyard crops, with increasing values during spring (green-up), constant high values throughout the summer (maturity), and a gradual decrease in the autumn (senescence and defoliation). Further complications may arise if the temporal resolution of

the NDVI temporal proﬁles is too broad to clearly distinguish the interclass differences. 4.2. Optimal image selection and band importance Previous studies have found a decreasing tendency in classiﬁcation accuracies when data dimensionality increases, a widely-known issue in remote sensing that refers the so-called Hughes phenomenon: the reduction of the predictive power as the data dimensionality increases. Pal and Mather (2003, 2005) classiﬁed eight land covers applying several parametric and non-parametric classiﬁers (including RF, SVM and an artiﬁcial neural network classiﬁer) on an airborne hyperspectral image. They examined classiﬁcation accuracies for a ﬁxed training sample size by progressively increasing the band number and found that values declined or almost unchanged when the band number used in all the classiﬁers exceeded a few tens. Given the high temporal dimensionality of the SITS deﬁned in our work we addressed the question of how many images comprising the deﬁned SITS are enough to achieve an optimal classiﬁcation result. Taking into account the classiﬁer and the feature set that yielded the best accuracy at a training sample size of 200 ﬁelds we found that a combination of only four images arise as the best tradeoff between number of images and classiﬁcation accuracy. This result suggests that temporal resolution of the SITS is not so critical to achieve a reliable classiﬁcation of the crop types of interest. This ﬁnding contrasts with results of Ozdogan (2010), who found that the ability to classify crop types smaller than the image pixels of a MODIS time series (500 m) by applying a temporal unmixing technique (based on independent component analysis) was dependent on temporal resolution more than spectral resolution. In his work, winter wheat, corn and soybean crops subject to different management practices were classiﬁed by using only the red or NIR reﬂectance or their arithmetic combinations in the form of simple ratio (SR = NIR/red) and NDVI. Classiﬁcation accuracy results were assessed in terms of per-crop area estimation by RMSE, which produced values around 15 and 30% depending on the crop type. As previously discussed, in cases where timings of the phenological stages of a given set of crops markedly differ between them it is likely to expect a relatively more important role in the image temporal resolution. The optimal image acquisition dates found in our work are located toward the extremes of the considered growing season, i.e., the greenup and senescence phenological stages of the crops of interest. As stated before, during these stages the chances of discriminate crop types may increase as a result of its different growing and senescing timings, which are expressed as different intensities in the spectral signals of the wavelengths sensitive to the vegetation state (Jones & Vaughan, 2010). For instance, Arvor et al. (2011); Wardlow et al. (2007) and Wardlow and Egbert (2008), found some of the best separabilities between the NDVI temporal proﬁles of different crop types in spring, as a result of the different crop planting dates, resulting in different interclass foliage amounts and hence, different interclass NDVI signals. However, these characteristic signals may not be necessarily the strongest at greenness wavelengths (i.e., red and NIR bands). As a matter of fact, for the previously mentioned image combination we found some of the most meaningful bands in terms of crop type discrimination at the green, red, blue and SWIR (1.57–1.65 μm) bands of image date 1 (acquired at the early greenup stage: October 5th, 2013, Table 1), while the NIR band of that image was ranked far away from the top list (Fig. 5). As mentioned in the introduction of this work, the visible and SWIR bands are well-known for their sensitive to the bulk pigment and water content in the foliage, respectively. Regarding the former, green band has been used to construct the GNDVI (green normalized difference vegetation index), whose formulation resembles the NDVI but replacing the red band with the green band (Gitelson, Kaufman, & Merzlyak, 1996). The reason to do that is because the green band saturates at higher foliar chlorophyll concentrations than the red band. According to this, foliar chlorophyll content of the crops of interest

M.A. Peña, A. Brenning / Remote Sensing of Environment 171 (2015) 234–244

might be relatively high at the greenup stage, explaining the best performance of the green band. Results obtained in this work should be compared with SITS-based classiﬁcations of the same crop types subject to similar and different agrological settings throughout different years in order to assess the transferability of the methods employed here to a wider range of the crop types of interest. Regarding this, attempts to compare NDVI temporal proﬁles of graminaceous crop types from different years have shown to be consistent if scene dependent factors are controlled (e.g., using the same type of images subject to the same preprocessing procedures) (Masialeti et al., 2010). 5. Conclusions We addressed the SITS-based classiﬁcation of tree-fruits crops by adopting as feature sets not only the widely used NDVI temporal proﬁle but also a NDWI temporal proﬁle and the complete spectral resolution of a Landsat-8 time series corresponding to the 2013–2014 growing season of the crops of interest. For all the feature sets the overall results were good (MER ≤ 0.21), but showed that classiﬁcation accuracies are substantially improved – typically by 5–10 percentage points – if fullband SITS are used (MER 0.14–0.05). Meanwhile, the worst overall classiﬁcation accuracies were obtained by using the NDVI temporal proﬁle. For a training sample size of 200 ﬁelds, LDA using the full-band SITS of image dates 1, 3, 6 and 8 produced the best tradeoff between the number of images and classiﬁcation accuracy (MER = 0.06), being the green, red, blue and SWIR bands of image date 1 (acquired at the early greenup stage), the most relevant for crop type discrimination. These results emphasize the usefulness that the complete image spectral resolution may have for SITS-based crop type classiﬁcations, a methodological approach that is often overlooked in the scientiﬁc literature. Moreover, NDWI temporal proﬁle performed slightly better than NDVI temporal proﬁle, which highlights a potential has also been overlooked in SITS-based crop type classiﬁcation. In light of the encouraging classiﬁcation accuracies obtained in this work, the proposed methodology constitutes a reliable and less costly approach to the design and updating of crop inventories compared with costly ﬁeld mapping. Acknowledgments The use of facilities of the Shared Hierarchical Academic Research Computing Network (SHARCNET; www.sharcnet.ca) and Compute/ Calcul Canada is acknowledged. References Arvor, D., Jonathan, M., Simões, M., Meirelles, P., Dubreuil, V., & Durieux, L. (2011). Classiﬁcation of MODIS EVI time series for crop mapping in the state of Mato Grosso, Brazil. International Journal of Remote Sensing, 32(22), 7821–7847. Badhwar, G.D., Gargantini, C.E., & Redondo, F.V. (1987). LANDSAT classiﬁcation of Argentina summer crops. Remote Sensing of Environment, 21, 111–117. Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. Brenning, A. (2012). Spatial cross-validation and bootstrap for the assessment of prediction rules in remote sensing: The R package ‘sperrorest’. 2012 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), 23–27 July 2012 (pp. 5372–5375). Brenning, A., Kaden, K., & Itzerott, S. (2006). Comparing classiﬁers for crop identiﬁcation based on multitemporal Landsat TM/ETM data. Proceedings, Second Workshop of the EARSeL Special Interest Group on Remote Sensing of Land Use and Land Cover, Bonn, 28–30 September 2006 (pp. 64–71). Brenning, A., Long, S., & Fieguth, P. (2012). Detecting rock glacier ﬂow structures using Gabor ﬁlters and IKONOS imagery. Remote Sensing of Environment, 125, 227–237. Ceccato, P., Flasse, S., & Gregoire, J. (2002). Designing a spectral index to estimate vegetation water content from remote sensing data: Part 2. Validation and applications. Remote Sensing of Environment, 82, 198–207. Ceccato, P., Flasse, S., Tarantola, S., Jacquemoud, S., & Grégoire, J. -M. (2001). Detecting vegetation leaf water content using reﬂectance in the optical domain. Remote Sensing of Environment, 77, 22–33. Chenab, C.F., Sonb, N.T., Changab, L.Y., & Chenb, C.R. (2011). Classiﬁcation of rice cropping systems by empirical mode decomposition and linear mixture model for time-series MODIS 250 m NDVI data in the Mekong Delta, Vietnam. International Journal of Remote Sensing, 32(18), 5115–5134.

243

Esch, T., Metz, A., Marconcini, M., & Keil, M. (2014). Combined use of multi-seasonal high and medium resolution satellite imagery for parcel-related mapping cropland and grassland. International Journal of Applied Earth Observation and Geoinformation, 28, 230–237. Galvão, L.S., Epiphanio, J.C.N., Breunig, F.M., & Formaggio, A.R. (2012). Crop type discrimination using hyperspetral data. In P.S. Thenkabail, J.G. Lyon, & A. Huete (Eds.), Hyperspectral remote sensing of vegetation (pp. 397–421). Boca Raton, FL: CRC Press. Gao, B. (1996). NDWI—A normalized difference water index for remote sensing of vegetation liquid water from space. Remote Sensing of Environment, 58(3), 257–266. Giri, C.P. (2012). Remote sensing of land use and land cover, principles and applications. Boca Raton, Florida: CRC Press (477 pp.). Gislason, P.O., Benediktsson, J.A., & Sveinsson, J.R. (2006). Random Forests for land cover classiﬁcation. Pattern Recognition Letters, 27, 294–300. Gitelson, A., Kaufman, Y.J., & Merzlyak, M.N. (1996). Use of a green channel in remote sensing of global vegetation from EOS-MODIS. Remote Sensing of Environment, 58, 289–298. Huang, C., Davis, L.S., & Townshend, J.R.G. (2002). An assessment of support vector machines for land cover classiﬁcation. International Journal of Remote Sensing, 23(4), 725–749. Jakubauskas, M.E., Legates, D.R., & Kastens, H. (2002). Crop identiﬁcation using harmonic analysis of time-series AVHRR NDVI data. Computers and Electronics in Agriculture, 37, 127–139. James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning with applications in R. New York: Springer (426 pp.). Jewell, N. (1989). An evaluation of multi-date SPOT data for agriculture and land use mapping in the United Kingdom. International Journal of Remote Sensing, 10, 939–951. Jones, H., & Vaughan, R. (2010). Remote sensing of vegetation: Principles, techniques and applications. New York: Oxford University Press (353 pp.). Liang, S., Li, X., & Wang, J. (2012). Advanced remote sensing: Terrestrial information extraction and applications. Oxford: Academic Press (799 pp.). Liaw, A., & Wiener, M. (2002). Classiﬁcation and regression by randomForest. R News, 2(3), 18–22. Lo, T.H.C., Scarpace, F.L., & Lillesand, T.M. (1986). Use of multitemporal spectral proﬁles in agricultural land-cover classiﬁcation. Photogrammetric Engineering and Remote Sensing, 52(4), 535–544. Masialeti, I., Egbert, S., & Wardlow, B.D. (2010). A comparative analysis of phenological curves for major crops in Kansas. GI Sciences and Remote Sensing, 47(2), 241–259. Mathur, A., & Foody, G. (2008). Crop classiﬁcation by support vector machine with intelligently selected training data for an operational application. International Journal of Remote Sensing, 29(8), 2227–2240. Meyer, D., Dimitriadou, E., Hornik, K., Weingessel, A., & Leisch, F. (2014). e1071: Misc functions of the Department of Statistics (e1071), TU Wien. R package version 1.6-4. http://CRAN.R-project.org/package=e1071 Mingwei, Z., Qingbo, Z., Zhongxin, C., Jia, L., Yong, Z., & Chongfa, C. (2008). Crop discrimination in northern China with double cropping systems using Fourier analysis of time-series MODIS data. International Journal of Applied Earth Observation and Geoinformation, 10, 476–485. Moguerza, J.M., & Muñoz, A. (2006). Support vector machines with applications. Statistical Science, 21(3), 322–336. Mountrakis, G., Im, J., & Ogole, C. (2011). Support vector machines in remote sensing: A review. ISPRS Journal of Photogrammetry and Remote Sensing, 66, 247–259. Murakami, T., Ogawa, S., Ishitsuka, K., Kumagai, K., & Saito, G. (2001). Crop discrimination with multitemporal SPOT/HRV data in the Saga Plains, Japan. International Journal of Remote Sensing, 22(7), 1335–1348. Nguyen, T.T.H., De Bie, C.A.J.M., Ali, Smaling, E.M.A., & Chu, T.H. (2011). Mapping the irrigated rice cropping patterns of the Mekong Delta, Vietnam, through hypertemporal SPOT NDVI image analysis. International Journal of Remote Sensing, 33(2), 415–434. Odenweller, J.B., & Johnson, K.I. (1984). Crop identiﬁcation using Landsat temporalspectral proﬁles. Remote Sensing of Environment, 14(1–3), 39–54. ODEPA (Oﬁcina de Estudios y Políticas Agrarias) (2014). Región metropolitana: Información regional 2014. Santiago, Chile: Ministerio de Agricultura (14 pp.). Ozdogan, M. (2010). The spatial distribution of crop types from MODIS data: Temporal unmixing using independent component analysis. Remote Sensing of Environment, 114, 1190–1204. Pal, M. (2005). Random forest for classiﬁcation in remote sensing. International Journal of Remote Sensing, 26(1), 217–222. Pal, M., & Mather, P.M. (2003). An assessment of the effectiveness of decision tree methods for land cover classiﬁcation. Remote Sensing of Environment, 86, 554–565. Pal, M., & Mather, P.M. (2005). Support vector machines for classiﬁcation in remote sensing. International Journal of Remote Sensing, 26(5), 1007–1011. Peña, M.A., & Altmann, S.H. (2009). Use of satellite-derived hyperspectral indices to identify stress symptoms in an Austrocedrus chilensis forest infested by the aphid Cinara cupressi. International Journal of Pest Management, 55(3), 197–206. R Core Team (2014). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing (URL http://www.R-project.org/). Roberto, C., Lorenzo, B., Michele, M., Micol, R., & Cinzia, P. (2012). In P. S. Thenkabail, J. G. Lyon, & A. Huete (Eds.), Optical remote sensing of vegetation water content. In Hyperspectral Remote Sensing of Vegetation (pp. 227–244). Boca Raton: FL: CRC Press. Rodriguez-Galiano, V.F., Ghimire, B., Rogan, J., Chica-Olmo, M., & Rigol-Sanchez, J.P. (2012). An assessment of the effectiveness of a random forest classiﬁer for land-cover classiﬁcation. ISPRS Journal of Photogrammetry and Remote Sensing, 67, 93–104. Roy, D.P., Wulder, M.A., Loveland, T.R., Woodcock, C.E., Allen, R.G., Anderson, M.C., ... Zhu, Z. (2014). Landsat-8: science and product vision for terrestrial global change research. Remote Sensing of Environment, 145, 154–172.

244

M.A. Peña, A. Brenning / Remote Sensing of Environment 171 (2015) 234–244

Sakamoto, T., Yokozawa, M., Toritani, H., Shibayama, M., Ishitsuka, N., & Ohno, H. (2005). A crop phenology detection using time-series MODIS data. Remote Sensing of Environment, 96, 366–374. Shao, J., Lunetta, R.S., Ediriwickrema, J., & Liames, J. (2010). Mapping cropland and major crop types across the Great Lakes Basin using MODIS-NDVI data. Photogrammetric Engineering and Remote Sensing, 75(1), 73–84. Sibanda, M., & Murwira, A. (2012). The use of multi-temporal MODIS images with ground data to distinguish cotton from maize and sorghum ﬁelds in smallholder agricultural landscapes of southern Africa. International Journal of Remote Sensing, 33(16), 4841–4855. Simonneaux, V., Duchemin, B., Helson, D., Er-Raki, S., Olioso, A., & Chehboun, A.G. (2007). The use of high-resolution image time series for crop classiﬁcation and evapotranspiration estimate over an irrigated area in central Morocco. International Journal of Remote Sensing, 29(1), 95–116. Sun, H., Xu, A., Lin, H., Zhang, L., & Mei, Y. (2012). Winter wheat mapping using temporal signatures of MODIS vegetation index data. International Journal of Remote Sensing, 33(16), 5026–5042. Szuster, B.W., Chen, Q., & Borger, M. (2011). A comparison of classiﬁcation techniques to support land cover and land use analysis in tropical coastal zones. Applied Geography, 31, 525–532. Turker, M., & Arikan, M. (2005). Sequential masking classiﬁcation of multi-temporal Landsat7 ETM + images for ﬁeld-based crop mapping in Karacabey, Turkey. International Journal of Remote Sensing, 26(17), 3813–3830. Van Niel, T.G., & McVicar, T.R. (2004). Determining temporal windows for crop discrimination with remote sensing: A case study in south-eastern Australia. Computers and Electronics in Agriculture, 45(1–3), 91–108. Wardlow, B., & Egbert, S. (2010). A comparison of MODIS 250-m EVI and NDVI data for crop mapping: A case study for southwest Kansas. International Journal of Remote Sensing, 31(3), 805–830.

Wardlow, B.D., & Egbert, S.L. (2008). Large-area crop mapping using time-series MODIS 250 m NDVI data: an assessment for the U.S. Central great plains. Remote Sensing of Environment, 112, 1096–1116. Wardlow, B., Egbert, S., & Kastens, J.H. (2007). Analysis of time-series MODIS 250 m vegetation index data for crop classiﬁcation in the U.S. Central Great Plains. Remote Sensing of Environment, 108, 290–310. Xavier, A.C., Rudorff, B.F.T., Shimabukuro, Y.E., Berka, L.M.S., & Moreira, M.A. (2006). Multi-temporal analysis of MODIS data to classify sugarcane crop. International Journal of Remote Sensing, 27(4), 755–768. Xie, Y., Sha, Z., & Yu, M. (2008). Remote sensing imagery in vegetation mapping: A review. Journal of Plant Ecology, 1(1), 9–23. Zandler, H., Brenning, A., & Samimi, C. (2015). Quantifying dwarf shrub biomass in an arid environment: Comparing empirical methods in a high dimensional setting. Remote Sensing of Environment, 158, 140–155. Zhang, X., Friedl, M.A., Schaaf, C.B., Strahler, A.H., Hodges, J.C.F., Gao, F., ... Huete, A. (2003). Monitoring vegetation phenology using MODIS. Remote Sensing of Environment, 84, 471–475. Zheng, B., Myint, S.W., Thenkabail, P.S., & Aggarwal, R.M. (2015). A support vector machine to identify irrigated crop types using time-series Landsat NDVI data. International Journal of Applied Earth Observation and Geoinformation, 34, 103–112. Zhong, L., Gong, P., & Biging, G. (2014). Efﬁcient corn and soybean mapping with temporal extendability: A multi-year experiment using Landsat imagery. Remote Sensing of Environment, 140, 1–13. Zhong, L., Hawkins, T., Biging, G., & Gong, P. (2011). A phenology-based approach to map crop types in the San Joaquin Valley, California. International Journal of Remote Sensing, 32(22), 7777–7804.

Assessing fruit-tree crop classification from Landsat-8 time series for the Maipo Valley, Chile

Assessing fruit-tree crop classification from Landsat-8 time series for the Maipo Valley, Chile

Recommend Documents