2,449
Views
8
CrossRef citations to date
0
Altmetric
Research Article

Comparison of high-resolution NAIP and unmanned aerial vehicle (UAV) imagery for natural vegetation communities classification using machine learning approaches

ORCID Icon &
Article: 2177448 | Received 26 Sep 2022, Accepted 02 Feb 2023, Published online: 21 Feb 2023

ABSTRACT

To map and manage forest vegetation including wetland communities, remote sensing technology has been shown to be a valid and widely employed technology. In this paper, two ecologically different study areas were evaluated using free and widely available high-resolution multispectral National Agriculture Imagery Program (NAIP) and ultra-high-resolution multispectral unmanned aerial vehicle (UAV) imagery located in the Upper Great Lakes Laurentian Mixed Forest. Three different machine learning algorithms, random forest (RF), support vector machine (SVM), and averaged neural network (avNNet), were evaluated to classify complex natural habitat communities as defined by the Michigan Natural Features Inventory. Accurate training sets were developed using both spectral enhancement and transformation techniques, field collected data, soil data, texture, spectral indices, and expert knowledge. The utility of the various ancillary datasets significantly improved classification results. Using the RF classifier, overall accuracies (OA) between 83.8% and 87.7% with kappa (k) values between 0.79 and 0.85 for the NAIP imagery and between 87.3% and 93.7% OA with k values between 0.83 and 0.92 for the UAV dataset were achieved. Based on the results, we concluded RF to be a robust choice for classifying complex forest vegetation including surrounding wetland communities.

1. Introduction

A key goal of forest management is to maintain and preserve natural biodiversity and protect the pristine landscape (Lindenmayer and Franklin Citation2002) by using efficient, affordable science-based practices. In order to maintain a balance between biological diversity and societal needs, it is important to develop management plans capable of achieving these goals (Bettinger et al. Citation2016). Within the natural resource community, there are numerous techniques used for management, monitoring and conservation planning. Field surveys and aerial photography are traditional techniques for obtaining information about forest conditions, identifying tree species or evaluating habitats. However, they are expensive, time-consuming, and have limitations (Ruiliang and Landry Citation2012). Aerial photography and satellite imagery interpretations provide areal coverage but are constrained by temporal, spectral, and spatial resolutions. Free or low-cost multispectral imagery commonly has spatial resolutions between 10 and 30 m. This information is employed for vegetation monitoring, predicting species abundance in association with environmental changes, and mapping habitats for species distribution modeling (Taylor et al. Citation2000; Buchanan et al. Citation2005; Prasad, Iverson, and Liaw Citation2006; Bradter et al. Citation2011; Monahan et al. Citation2022).

Within the remote sensing community, forest land use/cover classification using satellite imagery, piloted aircraft imagery and/or unmanned aerial vehicle (UAV) data is well documented (Homer et al. Citation2004; Hansen et al. Citation2010; Hayes, Miller, and Murphy Citation2014; Yang et al. Citation2018; Maxwell et al. Citation2019; Bhatt et al. Citation2022b). Land use/cover classification schemes typically use well defined, non-overlapping categories (Anderson Citation1976; Vogelmann et al. Citation2001; Bailey Citation2004, Citation2009) to classify forests, wetlands, grasslands, etc. These frameworks are uniform in the characterization of land use/cover and are based on canopy cover as detected by airborne sensors and include human activities.

By contrast, natural community habitats have a much broader definition and are based not only on the canopy but also understory vegetation, soils, and landform (Bradter et al. Citation2011; Cohen et al. Citation2014). “Protecting, managing, and restoring these communities is critical to biodiversity conservation, since native organisms are best adapted, to environmental and biotic forces with they have survived and evolved over the millennia” (Cohen et al. Citation2014). Natural community descriptions are derived from “physiography, hydrology, soils, natural processes and vegetation and do not include modern anthropogenic disturbances” (Cohen et al. Citation2014). The classification is organized hierarchically by ecological class, group, and type. Each community differs in its physical environment and species composition and the same plant species (canopy cover) often occurs in more than one community (Cohen et al. Citation2014; Cohen Citation2020). This increases class complexity and classification challenges due to a lack of distinct identifying spectral signatures (Jensen Citation2015; Lillesand et al. Citation2015). Delineating and mapping these communities require high spatial resolution imagery for improved feature delineation, image enhancements, and transformations as well as the incorporation of biogeophysical data.

Along with high spatial resolution imagery, classification accuracy depends on selecting the correct classification algorithm. Use of machine learning (ML) classification approaches has exponentially increased in the last decade (Schulz et al. Citation2018) and applied to a variety of environmental and natural resource applications (Mahesh and Mather Citation2003; Mountrakis et al. Citation2011; Rodriguez-Galiano et al. Citation2012b; Hayes, Miller, and Murphy Citation2014; Maxwell, Warner, and Fang Citation2018; Yang et al. Citation2018). The algorithms (MLAs) use a nonparametric approach to model and classify data and do not require normally distributed data. Numerous land use/cover classification studies highlight the advantages of using MLAs such as random forest (RF) and support vector machine (SVM) (Gunn Citation1998; Huang et al. Citation2002; Hayes, Miller, and Murphy Citation2014). AvNNet (Burton Citation1993) has been used in studies for predicting soil organic carbon, groundwater quality index, and land use/cover classifications (Taghizadeh-Mehrjardi et al. Citation2020; Kavhu et al. Citation2021; Ahn et al. Citation2022). MLAs were utilized in the classification of the 2001 National Land Cover Database (NLCD) [16]. They have also been used with NAIP imagery for accurate land cover classification (Kulkarni and Lowe Citation2016; Maxwell et al. Citation2019).

Useful ancillary data are equally important given the depth and breadth of the natural community class definitions. These datasets help overcome spectral limitations of the imagery and provide information beyond the bird’s eye view of the canopy. Researchers have used various environmental and geomorphological variables to improve classification results (Anderson Citation1976; Corcoran et al. Citation2013; Hayes, Miller, and Murphy Citation2014; Juel et al. Citation2015; Berhane et al. Citation2018; Kumar et al. Citation2020). Therefore, it is important to understand the contribution each ancillary dataset provides to the classification. Widely available feature selection methods evaluate ancillary datasets’ importance (Guyon and Elisseeff Citation2003) by reducing data complexity (Hughes Citation1968) and improving computational times (Maxwell, Warner, and Fang Citation2018). The robustness of the approach used in this study expanded on a previous study done by the authors (Bhatt et al. Citation2022b), which included the use of NAIP coupled with biogeophysical variables and widely used machine learning algorithms like RF and SVM.

Researchers have used NAIP and UAV datasets for delineating and mapping land cover, identifying tree species, delineating wetlands, habitat mapping, and invasive species mapping (Maxwell et al. Citation2017; Bhatt Citation2018, Citation2022; Hogland et al. Citation2018; Bhatt et al. Citation2022a; Monahan et al. Citation2022, Citation2022 2022). Each has advantages and disadvantages in terms of spatial, spectral, and temporal resolution. To date, a direct comparison between the two sets of imagery for natural habitat community classification has not been completed. The key objective of this research is to compare and contrast the utility of ultra-high spatial resolution UAV imagery versus high spatial resolution NAIP imagery to delineate and map complex natural community habitats.

2. Materials

2.1. Study areas

Upper Midwest forests are classified as Laurentian Mixed Forest (LMF) which is made up of complex geomorphology, climate, soils, fauna, and vegetation due to the extensive glaciation which occurred over thousands of years. Natural community boundaries may be sharply defined or change gradually. The area has a climatic tension zone, and sites along the Great Lakes shoreline support vegetation with northern and southern affinities which pose classification challenges. Dividing this landscape into natural communities provides guidance to better describe, understand, and restore the native community diversity (Cohen et al. Citation2014; Cohen Citation2020).

The study areas are within the Hiawatha National Forest and fall under the International Union for Conservation of Nature (IUCN) category IV and contain diverse upland and lowland ecosystems, including extensive pristine coastal forests and wetlands. Two study sites, Point aux Chenes (PAC) Bay () and Carp River Mouth (CRM) (), were selected as there are numerous natural community habitats which are unique in vegetation, soil, and landform within relatively small geographic areas (Cohen et al. Citation2014). Several of the communities (Interdunal Wetlands, Open Dunes and Wooded Dune and Swale Complex) are considered imperiled due to rarity and/or vulnerability (Cohen et al. Citation2014). The PAC site encompasses 420 ha (1,038 ac) and 790 ha (1,952 ac) for CRM. The study areas are located within glacial lake and outwash plain landforms, respectively (Jerome Citation2006). Current threats to these areas include unauthorized off-road vehicle use, poorly designed or degraded road and stream crossing structures which create physical barriers to hydrologic function, roads that parallel coastlines with inadequate drainage structures, and the presence and/or expansion of non-native invasive species.

Figure 1. Pointe aux Chenes Bay study area. The shoreline is adjacent to Lake Michigan. NAIP imagery was collected in August 2018. UAV imagery was collected in August 2019.

Figure 1. Pointe aux Chenes Bay study area. The shoreline is adjacent to Lake Michigan. NAIP imagery was collected in August 2018. UAV imagery was collected in August 2019.

Figure 2. Carp River Mouth study area. The shoreline is adjacent to Lake Huron. NAIP imagery was acquired September 2018. UAV imagery was collected in August 2019.

Figure 2. Carp River Mouth study area. The shoreline is adjacent to Lake Huron. NAIP imagery was acquired September 2018. UAV imagery was collected in August 2019.

2.1.1. Datasets and software

High-resolution multispectral NAIP and ultra-high-resolution UAV imagery were used for the study. NAIP imagery has four bands (Blue (420–492 nm), Green (533–587 nm), Red (604–664 nm), and Near-Infrared (683–920 nm)) (USDA Citation2022) and was acquired 4,877 m (16,000 feet) above ground level (AGL) with a Leica ADS100 airborne digital sensor. The imagery has 8-bit radiometric resolution with 0.6 m spatial resolution. Imagery tiles dated 11 August 2018 and 6 September 2018 were downloaded from USGS Earth Explorer for PAC and CRM, respectively.

UAV data were collected in August 2019 using a fixed-wing Trimble U×5-AG aircraft. The U×5-AG has a 1 m wingspan with 2.5 kg weight and is capable of flying up to 45 minutes with a cruise speed of 80 km/h. Imagery with 80% overlap was acquired using a five-band (Blue (475 ± 20 nm), Green (560 ± 20 nm), Red (668 ± 10 nm), Red Edge (717 ± 10 nm), and Near-Infrared (840 ± 40 nm)) MicaSense camera mounted onboard. Flying height was between 104 and 134 m (341 ft–440 ft) with a 7 cm spatial resolution for the PAC and 9 cm for the CRM. The spatial resolution varied due to the flying height and differences in terrain geometry. Days with optimum sunlight conditions and minimum clouds were selected for flying to minimize illumination and shadowing inconsistencies. Using the onboard high-accuracy Global Navigation Satellite System (GNSS) positioning data, the UAV images were processed and mosaicked with Agisoft Metashape 1.5.3 software using the standard workflow procedure provided by the USFS UAV office (Sloan Citation2017).

ERDAS IMAGINE (Hexagon Geospatial) was used to generate Principal Component Analysis (PCA), spectral indices, and Gray-Level Co-Occurrence Matrix (GLCM) texture layers for both sets of imagery. Random training points were generated using ArcPro software. Machine learning algorithms were implemented using the “caret” (Kuhn et al. Citation2020) package within R (Team Citation2013) programming language.

3. Methods

Spectral variability and similarity within and between the vegetative components of the natural community habitats created classification challenges and was documented by Bhatt et al. (Citation2022a). However, the high spatial resolution NAIP and UAV imagery combined with ML permitted utilization of the variability (Maxwell et al. Citation2017). All NAIP and UAV spectral bands were utilized for training set generation. An integrated classification approach incorporating ancillary data (image transformation and enhancement techniques), field data, and expert knowledge was developed (Whittaker Citation1962; Adam, Mutanga, and Rugege Citation2009; Corcoran et al. Citation2013; Cohen et al. Citation2014; Lane et al. Citation2014; Berhane et al. Citation2018; Congalton and Green Citation2019). Accurately delineated training area polygons were critical for optimal performance of Machine Learning Algorithms (MLAs) (Maxwell, Warner, and Fang Citation2018).

3.1. Image transformation techniques

PCA is commonly used for various classification applications and is one of the most widely used transformation techniques and generates uncorrelated components (Dunteman Citation1989; Jensen Citation2015). It has been used by natural resource managers to delineate vegetation, map change detection, and observe vegetation distribution (Almeida and Souza Filho Citation2004; Munyati Citation2004; Lasaponara Citation2006; Dronova et al. Citation2015). By contrast, Independent Component Analysis (ICA) uses higher-order statistics and considers each component to be non-Gaussian (Shah et al. Citation2002). The transformation highlights minute details in the imagery even when the feature occupies a small area (Hyvärinen and Oja Citation2000). However, it has been used minimally to map vegetation and for land use/cover classification (Shah et al. Citation2007a, Citation2007b; Fangfang and Xiao Citation2011). Components from both transformations were visually assessed for edge detection within and between the natural habitat communities to generate valid training sets. With the multispectral data, the UAV imagery was preprocessed and mosaicked using Metashape and the NAIP imagery was preprocessed by the contractor after acquisition.

3.2. Topographic data

Land surface characteristics for the study areas are greatly influenced by extensive glaciation and influence elevation and soil characteristics such as drainage and pH which influence natural community development. A high-resolution (1 m) LiDAR Digital Elevation Model (DEM) was used to identify topographic details of the natural community habitats for the NAIP imagery. Additionally, an ultra-high-resolution DEM was generated in Metashape from the point cloud data to use with the UAV imagery.

3.3. Texture

Similar spectral signatures occur between the natural habitat communities and increase the difficulty of accurate separation during training set development and classification. However, the communities do display various types of texture traditionally used in manual interpretation. Different texture statistics can detect unique information and spatial patterns for features which are hard to separate using only spectral information (Haralick, Shanmugam, and Dinstein Citation1973; Maillard Citation2003; Lane et al. Citation2014; Hall-Beyer Citation2017). In the past, texture-based variables have been incorporated by researchers into species detection, for fine-scale wetland classification, and in land use/cover classification (Rodriguez-Galiano et al. Citation2012a; Feng et al. Citation2015; Berhane et al. Citation2018; Franklin and Ahmed Citation2018; Tassi and Vizzari Citation2020). GLCM texture measures were calculated from the first and second PCA components and created two uncorrelated texture datasets. For both the NAIP and the UAV imagery, first (55.38% to 67.72%) and second (27.02% to 38.32%) principal components contributed the highest to explaining the data variability (Dunteman Citation1989). Four GLCM texture measures (contrast, entropy, standard deviation, dissimilarity) were calculated. Contrast measures the local variations present in the image, entropy measures the randomness within the data, standard deviation looks at its frequency of occurrence with reference and neighboring pixel values, and dissimilarity measures the differences in elements of the GLCM from each other (Haralick, Shanmugam, and Dinstein Citation1973; Hall-Beyer Citation2017). Data were generated with a 32-bit grayscale level and two Euclidean geometry offsets (2, 2 and 2, −2). Window sizes of 3 × 3, 5 × 5, 7 × 7, and 9 × 9 were evaluated.

3.4. Spectral indices

Spectral indices have been used extensively to map and monitor vegetation (Bannari et al. Citation1995; Berhane et al. Citation2018; Bhatt et al. Citation2022b). The normalized vegetation index (NDVI) is widely used by researchers to look at vegetation growth, phenology extraction, and landcover classification (Rouse et al. Citation1974; Tucker Citation1979; Shuang et al. Citation2021) and was employed in this study to classify the natural communities. NDVI calculation takes ratio between the red (R) and near-infrared (NIR), while the two modified water indices based on the WorldView water index (WV-WI) were developed by Wolf (Wolf Citation2012). The water index for the NAIP imagery (WINAIP) using its blue and near-IR bands, and a water index for the UAV imagery (WIUAV) using the blue and near-IR bands of the Micasense camera created customized indices.

3.5. Evaluation of ancillary datasets

When classifying natural community habitats, inputting multispectral imagery alone was not adequate for accurate data classification. With manual interpretations, ancillary data such as soil maps are traditionally used for improved boundary delineation and vegetation classification. It made sense to provide the ML classifiers with this type of information as well. DEMs, GLCM-textures (contrast, entropy, standard deviation, dissimilarity), and spectral indices (NDVI, WINAIP, WIUAV) were calculated. The next step was to understand each ancillary dataset’s contribution to classification improvement as using all of them does not guarantee the best result. Ancillary input dataset selection approaches have been used in many remote sensing applications (i.e. data mining, natural language processing, bioinformatics, image processing, crop classification, crop yield prediction, and mineral mapping) (Guyon and Elisseeff Citation2003; Hoque et al. Citation2014; Zhong et al. Citation2019; Kumar et al. Citation2020; Momm, ElKadiri, and Porter Citation2020; Zheng et al. Citation2021), but have not been extensively used in natural habitat community classification (Bhatt et al. Citation2022a). Input data selection (also known as variable or feature selection) approaches are fast, cost-effective, and provide insight into the contribution of each ancillary dataset (Guyon and Elisseeff Citation2003). For this study, joint mutual information maximization (JMIM) (Bennasar et al. Citation2015), a filter-based method, was used.

3.6. Training set development

Training polygons, from which the ML training points are selected, were manually drawn with careful consideration given to vegetation community species, soil drainage classes and pH, elevation and landform, information highlighted in the PCA and ICA components, ground truth data, and expert knowledge of the areas. Eight natural community classes and three non-community classes were identified ().

Table 1. Natural community habitats and associated vegetation components. Communities 9–11 were developed for land uses not natural community habitats.

Accurate training sets are critically important for the classification accuracies (Maxwell, Warner, and Fang Citation2018). Multiple training sets created with randomly selected training points were developed across the study sites to capture spectral and spatial variability. More training polygons (812 for UAV vs 136 for NAIP) were needed to classify the UAV imagery because the higher spatial resolution provided greater detail and it was important to incorporate the spectral details of different natural community classes. Boundaries of the training set polygons () are verified from extensive field assessment and from existing stand maps. This approach was selected based on its successful use in previous research (Bhatt et al. Citation2022b).

Figure 3. Natural habitat communities training and testing polygons for CRM study area.

Figure 3. Natural habitat communities training and testing polygons for CRM study area.

Figure 4. Natural habitat communities training and testing polygons for CRM study area.

Figure 4. Natural habitat communities training and testing polygons for CRM study area.

3.7. Image classification

RF, SVM, and avNNet are widely used classifiers within the remote sensing community for land use/cover, agriculture crop classifications, and groundwater quality assessment (Gong et al. Citation1997; Huang et al. Citation2002; Clark, Roberts, and Clark Citation2005; Bandos, Bruzzone, and Camps-Valls Citation2009; Bradter et al. Citation2011; Hayes, Miller, and Murphy Citation2014; Berhane et al. Citation2018; Mahdianpari et al. Citation2018; Gómez et al. Citation2019; Ahn et al. Citation2022). RF was developed by Brieman (Citation1984) as an ensemble classifier which utilizes nonparametric classification and regression tree (CART) (Breiman and Cutler Citation2007) rules to predict. It can work with spatially large and complex datasets which are highly correlated and can work robustly without having optimization parameters (Breiman Citation1999; Maxwell, Warner, and Fang Citation2018). SVM, a supervised machine learning algorithm which identifies an optimal hyperplane separating two classes in a feature space and was developed by Vapnik (Citation2013). Having a higher dimensional feature space to project the non-linear and noisy data distributions can help overcome the overfitting issue and classify the real-world data better (Boser et al. Citation1992; Maxwell, Warner, and Fang Citation2018). Averaged neural network (avNNet) functions by applying averaging technique to the neural network (Ripley Citation2007). avNNet can be used for both regression and classification by applying the modified ordinary differential equations to the neural network (Ripley Citation2007). For classifying the data, model scores are averaged first and then applied it to the predicted class (Burton Citation1993).

In a recent study completed by Bhatt et al. (Citation2022a) with NAIP imagery, RF was shown to be a better classifier for natural community habitats compared to SVM. Along with RF and SVM, another ML algorithm, averaged neural network (avNNet) from the caret (Kuhn et al. Citation2020) package was tested. Within the training polygons, 75% of the randomly selected training points were used to develop the training sets and the remaining 25% were reserved for accuracy assessment. To avoid any overfitting issues, a 10-fold cross validation was applied to the data (Kohavi Citation1995). All three classifiers were ran with “center” and “scale” pre-processing parameters to standardize the ancillary datasets (Kuhn Citation2015; Kuhn et al. Citation2020). Classifications were executed using the “caret” package (Kuhn et al. Citation2020) in the “R” programming language (Team Citation2013). Results for the three classifiers were compared using Overall Accuracy (OA) and kappa coefficient (k) (Congalton and Green Citation2019). Individual communities were evaluated employing User’s Accuracy (UA), Producer’s Accuracy (PA), and F1 Scores (Congalton and Green Citation2019). Between 10 and 15 ground truth observations were made for each natural habitat community class during field visits (August 2019) to each study site.

3.8. Accuracy assessment and post-classification refinement

Accuracy assessment was completed using the reserved test points plus independently collected field points. These are referred to as validation points. Evaluations were completed by comparing classification values against the validation points (Congalton and Green Citation2019). Using the resulting accuracy assessment matrices, UA and PA values were calculated for each habitat class.

“Salt and pepper” effects (Lillesand et al. Citation2015) were smoothed to create a more easily interpreted final classification map. A majority filter using a 7 × 7 moving window was run based on previous research (Munyati Citation2004).

4. Results

Pixel-based image classifications were run using RF, SVM, and avNNet with RF producing the best classifications for the various combinations of imagery and variables. show the classification results for both sets of imagery at each study site. The figures show zoomed-in classification snippets highlighting various mapped details. Visual assessment of the classifications showed that the UAV classifications delineated finer boundary detail for each community. This is due to the finer spatial resolution when compared to the NAIP derived boundaries. The NAIP imagery also presented a more generalized natural community habitat map with less “salt-and-pepper” artifacts (Lillesand et al. Citation2015).

Figure 5. Selected areas of the PAC classification delineated by the RF classification for NAIP and UAV imageries highlighting detail differences for the natural community classes.

Figure 5. Selected areas of the PAC classification delineated by the RF classification for NAIP and UAV imageries highlighting detail differences for the natural community classes.

Figure 6. Selected areas of the CRM study site delineated by the RF classification for NAIP and UAV imageries highlighting detail differences for the natural community classes.

Figure 6. Selected areas of the CRM study site delineated by the RF classification for NAIP and UAV imageries highlighting detail differences for the natural community classes.

The PAC NAIP-based classification achieved OAs between 86.28% and 88.33% using all input variables with RF. When the highest contributing variables based on the JMIM scores were used, the OAs ranged between 85.28% and 87.74%. Reducing the input variables from 15 to 9 did not significantly affect the OAs and kappa (k) (). However, accuracies decreased significantly using only the 4 NAIP bands and ranged from 72.49% to 77.15% (). This illustrates the important contribution made by the variables. The final classification had an OA of 87.74% and a k of 0.85. Similar results occurred with the PAC UAV imagery. Using the 16 available inputs with the UAV imagery, the OA was slightly lower () compared to inputting the best 10 JMIM selected inputs. Once again, using just the five UAV reflectance bands decreased OA between 5% and 8% with all three classifiers (). The final UAV imagery classification using RF achieved an OA of 93.74% and 0.92 k. Overall, the UAV classifications provided better end products than the NAIP classifications by a 6% increase in OA (0.7 k) using RF ().

Table 2. Accuracy assessments of the NAIP derived classifications for PAC and CRM.

Table 3. Accuracy assessments of the UAV derived classifications for PAC and CRM.

The NAIP and UAV classifications for the CRM study site achieved higher accuracies with RF compared to SVM and avNNet. Final NAIP-CRM classification OA was 83.85% with 0.79 k (). The UAV-CRM OA and k were 87.31% and 0.83, respectively (). Both classifications performed best using JMIM scores to select input variables compared to using all available data (). Using only the NAIP spectral bands decreased the OA by 16% (), and with the UAV the accuracy decreased 11.8% (). These results are similar to those seen with the PAC study site and the work completed by Bhatt et al. (Citation2022b). They indicate the robustness of the classification approach across the diverse natural communities.

Kappa values were categorized into three groups, good (k < 0.80) (Landis and Koch Citation1977; Altman Citation1990), strong (k 0.80–0.90), and almost perfect (k > 0.90) (McHugh Citation2012). With this study, the classifications showed strong to almost perfect relationships between “truth” and the classified results (). Along with the OA and k, F1 scores were compared for each natural community class. Within the remote sensing community, F1 scores are widely considered when the dataset does not have a balanced accuracy. It is essentially the harmonic mean of precision (UA) and recall (PA) (Etten et al. Citation2018; Zhang et al. Citation2020). Both precision (UA) and recall (PA) should be 1 for a good classification, though this seldom occurs with complex real-world datasets. show the F1 scores by community class for the RF classifier. For PAC NAIP classification, lower F1 scores () are observed with Great Lakes Marsh (0.71), Interdunal Wetland (0.77), and SGB (0.74). These lower scores for Great Lakes Marsh and Interdunal Wetland can be attributed to the similarities in the vegetation with other natural habitat communities like Wooded Dune and Swale Complex and Emergent Marsh. With the PAC UAV classification, lower accuracies () were observed with Great Lakes Marsh (0.66) due to spectral similarities in natural habitat communities with Emergent Marsh and Wooded Dune and Swale Complex.

Figure 7. RF classifier F1 scores for the natural communities between NAIP and UAV imagery for Pointe aux Chenes.

Figure 7. RF classifier F1 scores for the natural communities between NAIP and UAV imagery for Pointe aux Chenes.

Figure 8. RF classifier F1 scores for the natural communities between NAIP and UAV imagery for Carp River Mouth.

Figure 8. RF classifier F1 scores for the natural communities between NAIP and UAV imagery for Carp River Mouth.

Lowest F1 Scores for CRM classification were observed with Rich Conifer Swamp natural community class for both NAIP (0.52) and UAV (0.47) imagery (). Major confusion for Rich Conifer Swamp was observed with Wooded Dune and Swale Complex (Appendix - ) Emergent Marsh natural community showed lower F1 scores () for NAIP imagery, whereas for UAV it was high. Major confusion for Emergent Marsh was observed with Great Lakes Marsh, Rich Conifer Swamp and Wooded Dune and Swale Complex (Appendix ). The higher Emergent Marsh F1 score for UAV could be attributed to the more precise training data collection and higher spatial resolution. Similarly, Great Lakes Marsh class had a much better F1 score for NAIP than UAV. Majority of confusion for Great Lakes Marsh in UAV classification was with Northern Shrub Thicket and Open Water classes (Appendix ). Highest F1 scores for both study areas were seen with Open Water, Open Land, and Impervious Surface, due to their uniform surface reflectance (). Overall, the F1 scores for both the study areas were reasonably well given the complexity and spectral similarities of the natural habitat communities.

PAC NAIP classification () was more generalized and contributed to Wooded Dune & Swale Complex and Emergent Marsh over prediction when compared to field observations. The UAV classification delineated the natural communities well except for Emergent Marsh, Interdunal Wetlands, and Great Lakes Marsh. This confusion is due to the same vegetation components and water being found in all of them (); hence spectral (Appendix ) and textural similarities between them. These communities are smaller in size and intermixed with more commonly occurring, larger area, communities and therefore have fewer validation points. Fewer points mean misclassifications have a greater impact on UA and PA and were lower compared to the other classes (natural communities) (Appendix ).

The confusion matrix for NAIP CRM classification (Appendix ) shows both lower UAs and PAs for Emergent Marsh, Northern Shrub Thicket, and Rich Conifer Swamp. The matrix shows confusion between Emergent Marsh and Northern Shrub Thicket. Evaluation of the training sets showed large standard deviations and these training sets were replaced with ones with less variability (smaller standard deviations). The misclassification of the Rich Conifer Swamp is primarily with Wooded Dune & Swale Complex. This confusion is the result of a high percentage of the same conifer species in both communities including northern white cedar, tamarack, white pine, and tag alder (). Landform is also a strong indicator for Wooded Dune & Swale Complex (). However, if the swales are widely spaced between the dunes, the texture changes and contributes to the error particularly with the coarser NAIP spatial resolution (). The UAV classification shows lower UAs and PAs for Great Lakes Marsh, Northern Shrub Thicket, and Rich Conifer Swamp. Again, the poor classification of Great Lakes Marsh and Northern Shrub Thicket is due to high variability in the training sets, and sets were retaken.

Figure 9. Landforms influence on natural communities for the PAC and CRM study sites are highlighted when draped over a multi directional oblique weighted (MDOW) hillshade. Classifications derived using Random Forest.

Figure 9. Landforms influence on natural communities for the PAC and CRM study sites are highlighted when draped over a multi directional oblique weighted (MDOW) hillshade. Classifications derived using Random Forest.

Ancillary data sets were evaluated using JMIM feature selection. JMIM scores range between 0 and 2 regardless of the measurement units of the input variables. This allows direct comparison between the variables in ascertaining the unique contribution each input makes to the classification. Plots of the JMIM scores () show that the input variables (ancillary data) maintain the same pattern of importance for both sets of imagery across the study sites. The same results were seen with work completed by Bhatt et al. (Citation2022a). These included DEMs, GLCM-textures (contrast, entropy, standard deviation, dissimilarity), NDVIs, and modified water indexes specific to the NAIP and UAV imagery. The water index for each set of imagery differed due to the NAIP having four spectral bands (B, G, R, near-IR) while the UAV has five bands (B, G, R, red edge, and near-IR). JMIM scores were calculated for all of the spectral bands as well (). At both study sites, the most important input features with the NAIP imagery were all NAIP spectral bands, NDVI, WINAIP, DEM and contrast texture (). For the UAV data, the five Micasense spectral bands, NDVI, WIUAV, DEM, and contrast texture () showed high importance. It is important to remember that JMIM-based selection values do not guarantee a more accurate classification but provide quantitative guidance to input variable selection.

Figure 10. Ancillary dataset (variable) importance scores using JMIM feature selection method. B-blue, C1-contrast texture (PC1, 7×7 moving window), C2-contrast texture (PC2, 7×7), DEM-digital elevation model, Dissim1-Dissimilarity texture (PC1, 7×7), Dissim2-Dissimilarity texture (PC2, 7×7), Ent1-entropy texture (PC1, 7×7), Ent2-entropy texture (PC2, 7×7), G-green, NDVI-normalized difference vegetation index, NIR-near-infrared, R-red, SD1-standard deviation texture (PC1, 7×7), SD2-standard deviation texture (PC2, 7×7), WINAIP- NAIP modified water index, WIUAV-UAVmodified water index.

Figure 10. Ancillary dataset (variable) importance scores using JMIM feature selection method. B-blue, C1-contrast texture (PC1, 7×7 moving window), C2-contrast texture (PC2, 7×7), DEM-digital elevation model, Dissim1-Dissimilarity texture (PC1, 7×7), Dissim2-Dissimilarity texture (PC2, 7×7), Ent1-entropy texture (PC1, 7×7), Ent2-entropy texture (PC2, 7×7), G-green, NDVI-normalized difference vegetation index, NIR-near-infrared, R-red, SD1-standard deviation texture (PC1, 7×7), SD2-standard deviation texture (PC2, 7×7), WINAIP- NAIP modified water index, WIUAV-UAVmodified water index.

Both NDVI and WINAIP, two of the highest JMIM scores, provided information to accurately delineate the community habitat classes. Classes such as Wooded Dune and Swale Complex, Rich Conifer Swamp, and Northern Shrub Thicket showed higher values with NDVI compared to the remainder of the habitat community classes. Incorporation of the DEM, NDVI, WINAIP, contrast textures 1, and 2 increased the OA of the natural community classifications an average of 13.59% for NAIP and 18.22% for the UAV when compared to only using the spectral bands of each set of imagery (). Overall, the UAV-based classification outperformed the NAIP classification due to its high-spatial resolution and greater texture details.

5. Discussion

Most studies classifying land use/cover are for specific resource management purposes and categorize the imagery into narrowly defined, non-overlapping classes. However, classifying imagery into well defined, robust natural community habitats provides a holistic approach to resource management and is more representative of field condition variability. Until recently, there were no natural community habitat classification studies of the complex Laurentian Mixed Forest in the Upper Great Lakes. Inventorying, monitoring, and preserving these pristine habitats, particularly along coastlines, are increasingly important given the impacts of climate change. Field-based monitoring alone is not able to complete these tasks in a timely, cost-effective manner. The approach presented in this study is applicable to other areas across United States and Canada (White and Host Citation2008; Brown et al. Citation2013). New Hampshire (Sperduto and Nichols Citation2004), North Carolina (Schafale and Weakley Citation1990), Wisconsin (Curtis Citation1959; Noss Citation1987), Massachusetts (Whitlatch Citation1977; Kearsley Citation1999), Indiana (Jackson Citation1979; Homoya et al. Citation1984), Minnesota (Hanson and Hargrave Citation1996; Aaseng et al. Citation2011; Wilson and Ek Citation2017), and (Inventory, Florida Natural Areas Citation1990) are a few of the states that have developed Natural Communities classification schemes. While the hierarchy of the classification varies from state to state, the foundation of the schemes is the same: group recurring assemblages of flora and fauna found in particular physical environments into a descriptive classification scheme. These maps provide information essential to conservation planning and protecting biodiversity by communicating a story to resource managers, landowners, land-use planners, and scientists (Cohen Citation2020).

Which imagery is best for natural community classification is unclear. Variations in reflectance values between the UAV and NAIP for the same areas are shown in . This variation is due, in part, to the different bandwidths for the RGB bands. Imagery pair A in shows a gray textured area (red arrow) in the UAV imagery which is Princess Pine (Dendrolycopodium obscurum) an endangered club moss and indicates a small, but important area of Wooded Dune Swale Complex. It is not visible in the NAIP; rather, the entire area is classified as Interdunal Wetland and does not provide the spatial and spectral detail required to manage and protect this at-risk species. Pair B () highlights underwater seepage and drainage patterns from springs not detectable on the UAV imagery. The presence of a small Great Lakes Marsh trapped by a sand bar is visible on the UAV imagery (Pair C, ) but appears as Sand and Gravel Beach on the NAIP. These small “trapped” Great Lakes Marsh Areas support bird’s eye primrose (Primula mistassinica) and blue vervain (Verbena hastata). Both species are important food sources for native bees and need to be monitored frequently given the imperiled status of many bee species. On the UAV imagery, the bright magenta areas found in Pair D () are Phragmites and are not visible in the NAIP. Common reed (Phragmites australis ssp. americanus) is a native species. However, Phragmites australis ssp. australis is an invasive species and is considered to be problematic in North America. It invades marsh meadows and cattail zones and reduces the number of bird species found in these habitats (Robichaud and Rooney Citation2017). In , Pair A shows individual stems of downed coarse woody debris which provides food and habitat for a wide range of organisms, slows water flow in river and streams, and recycles nutrients trapped in the wood. Pair B () shows dead tree in both images, but the UAV imagery permits counting of the snags and species identification. Snags are critical landscape features for numerous wildlife species.

Figure 11. Spectral reflectance differences between UAV and NAIP imagery for PAC due to differences in the red, green and blue bandwidths.

Figure 11. Spectral reflectance differences between UAV and NAIP imagery for PAC due to differences in the red, green and blue bandwidths.

Figure 12. Spectral reflectance differences between UAV and NAIP imagery for CRM due to differences in the red, green and blue bandwidths.

Figure 12. Spectral reflectance differences between UAV and NAIP imagery for CRM due to differences in the red, green and blue bandwidths.

For areal coverage, NAIP is more comprehensive as it is acquired by aircraft flying at a constant speed and altitude. It undergoes rigorous radiometric and geometric corrections, and mosaics are easy to create. By contrast, UAV imagery has variations in lighting conditions due to acquisition times often spaced throughout the day, and the vehicle is more susceptible to pitch, yaw, and roll distortions due to its lighter weight. Radiometric and geometric conditions are applied to the UAV imagery, but the user must be willing to accept quality variation across the mosaicked imagery. A study completed by Citation2022b), recommends an image overlap of 80% to help overcome geometric variability and produce acceptable quality orthoimages. The finer spatial resolution of the UAV imagery maps textural changes very well, which are important to the accurate delineation of spectrally similar natural communities such as Northern Shrub Thicket, Rich Conifer Swamp and Wood Dune & Swale Complex ().

Performing radiometric and geometric corrections on the UAV imagery is time-consuming. Current processing software has limitations in photogrammetric robustness and ease of use. The physical size of the UAV imagery is also an important consideration, and adequate computing resources and storage space are prerequisites. Processing times need to be considered. With the two study areas, it took 88 hours to preprocess the UAV imagery and generate the final orthomosaics for the two study areas.

For NAIP, temporal resolution is not ideal as it is collected every 5 years especially for catastrophic events such as fires and flooding as well as phenological timed events. UAV imagery offers advantages to these types of studies as the platform can be airborne in a short-time frame and is ideal for data collection over small geographic areas where the vehicle can be kept in the operator’s line of sight.

Accurate training sets and informative ancillary datasets are equally important to the success of mapping natural communities. Spectral signatures (Appendix ) alone do not provide enough clear separation to accurately delineate the natural communities. This study used PCA, ICA, soil data, landform, field data, and expert image interpretation to outline training set polygons for each natural habitat community. Training points for the MLAs were extracted from within the polygons ( and 4). It is important to note that the difference between the number of training sets for NAIP and UAV is attributed to the spatial resolution. Larger training polygons within the ultra-fine UAV imagery contain too much variation (large standard deviations) and result in noisy training sets and lead to misclassification of the natural communities.

The natural habitat communities at both study sites exhibited significant amounts of local variation. The finer spatial resolution of the UAV imagery emphasizes the high degree of texture created within and between the natural communities and contributed to the accurate delineation of the natural community boundaries. Each community exhibits distinctive patterns and shapes such as the well-defined ridge and valley complex associated with the Wooded Dune and Swale Complex. Contrast texture, using a 7 × 7 window size, was the most informative variable when evaluated against entropy, standard deviation, and dissimilarity using JMIM feature selection (). Low contrast values were observed with smooth texture classes such as Open Water, Open Land, and Impervious Surface. Highly textured community classes (i.e. Wooded Dune and Swale Complex, Rich Conifer Swamp, Northern Shrub Thicket) have a rougher texture represented by higher contrast values as illustrated in . Due to the small area of some of the natural communities and having multiple vegetation species within each community, it is not possible to run large texture windows (i.e. 21 × 21, 33 × 33). The detailed texture image degrades dramatically as window size increases ( – Appendix) and it would add more confusion to the classification (Cohen et al. Citation2014; Hall-Beyer Citation2017).

This discussion presents advantages and disadvantages to using each set of imagery and recommending using one to the exclusion of the other would be wrong. How will the final map be used is the question(s) which should dictate to mix of UAV imagery and NAIP. RF is recommended as the classifier of choice when working with ecologically complex natural community habitats. SVM and avNNet are less efficient compared to RF and always produce lower accuracies in this study. It may be argued that tuning the RF parameters may further improve the classifications. However, parameter tuning is time-consuming and not cost-effective given the acceptable accuracies achieved for the natural community delineation and mapping.

6. Conclusions

Natural community mapping with high and ultra-high spatial resolution imagery along with informative ancillary data is challenging and critically important for the forest management and policy. The data sets are physically large and have high computational requirements along with lengthy processing times. The development of good training sets requires time and detailed knowledge of the study area. Widely used and robust machine learning algorithms like RF overcome these intricacies. However, acceptable results are achieved in this study not only in identifying the correct natural community label but also mapping accurate and detailed boundaries between the communities.

This research has many unexplored and unanswered questions to be further investigated. In the future studies, efficacy of the red edge band and derived indices needs exploring and compared with traditional indices like NDVI. Red edge indices have been used in the past with Sentinel-2A, Rapid-Eye, WorldView-2, and UAV imagery to observe vegetation phenology, spatial variability of crop growth, leaf area index, and burn severity assessments (Hill Citation2013; Shang et al. Citation2015; Fernández-Manso et al. Citation2016; Zhu et al. Citation2017; Guo et al. Citation2021) but not in mapping natural habitat communities. Investigation of training data size and quality should also be explored in reference to two different imagery datasets.

Due to the increasing need for sustainable planning and management practices, mapping natural communities at finer scales is important and requires robust workflows such as one provided in this study and use of advanced remote sensing techniques. Future work should involve testing the robustness of this workflow for larger areas and other regions. Lastly, advanced methods like deep learning should be considered and compared with machine learning algorithms.

Description of authors’ responsibilities

PB collected and analyzed the data, designed the study approach, performed the experiments, did field work and wrote the original manuscript draft; AM wrote the original project proposal, did field work, advised on methodology improvements, revised and edited the manuscript.

Acknowledgments

We would like to thank the College of Forest Resources and Environmental Science, Michigan Technological University for their support. We also thank Ian Anderson (Chief Product Owner) of Hexagon Geospatial for his crucial help at the beginning of this project, Jim Ozenberger of the Hiawatha National Forest for assistance with field work and Emily Clegg of The Nature Conservancy for providing technical support. We would like to acknowledge Dr. Curtis Edson and Ben Miller who helped in UAV flight planning, and data collection.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The data that support the findings of this study are available from the corresponding author, PB, upon reasonable request.

Additional information

Funding

This research was funded by the US Forest Service, Hiawatha National Forest (Grant Number 17-PA-11091000-023), The Nature Conservancy (Grant Number R45984) and the College of Forest Resources and Environmental Science.

References

  • Aaseng, N. E., Almendinger J. C, Dana R. P., Hanson D. S., Lee M. D., Rowe E. R., Rusterholz K. A., Wovcha D. S. 2011. “Minnesota’s Native Plant Community Classification: A Statewide Classification of Terrestrial and Wetland Vegetation Based on Numerical Analysis of Plot Data.” Biological Report 108: 1–24.
  • Adam, E., O. Mutanga, and D. Rugege. 2009. “Multispectral and Hyperspectral Remote Sensing for Identification and Mapping of Wetland Vegetation: A Review.” Wetlands Ecology and Management 18 (3): 281–296. doi:10.1007/s11273-009-9169-z.
  • Ahn, S. H., Jeong D.H., Kim M., Lee T. K., Kim H. K. 2022. “Prediction of Groundwater Quality Index to Assess Suitability for Drinking Purpose Using Averaged Neural Network and Geospatial Analysis.” Hydrology and Earth System Sciences Discussions 1–30.
  • Almeida, T. I. R. D., and D. Souza Filho. 2004. “Principal Component Analysis Applied to Feature-Oriented Band Ratios of Hyperspectral Data: A Tool for Vegetation Studies.” International Journal of Remote Sensing 25 (22): 5005–5023. doi:10.1080/01431160412331270812.
  • Altman, D. G. 1990. Practical Statistics for Medical Research. USA: CRC press.
  • Anderson, J. R. 1976. A Land Use and Land Cover Classification System for Use with Remote Sensor Data. Vol. 964. Washington, USA: US Government Printing Office.
  • Bailey, R. G. 2004. “Identifying Ecoregion Boundaries.” Environmental Management 34 (1): S14–26. doi:10.1007/s00267-003-0163-6.
  • Bailey, R. G. 2009. Ecosystem Geography: From Ecoregions to Sites. New York, USA: Springer Science & Business Media.
  • Bandos, T. V., L. Bruzzone, and G. Camps-Valls. 2009. “Classification of Hyperspectral Images with Regularized Linear Discriminant Analysis.” IEEE Transactions on Geoscience and Remote Sensing 47 (3): 862–873. doi:10.1109/TGRS.2008.2005729.
  • Bannari, A., Morin D., Bonn F., Huete A. 1995. “A Review of Vegetation Indices.” Remote Sensing Reviews 13 (1–2): 95–120. doi:10.1080/02757259509532298.
  • Bennasar, M., Hicks Y., Setchi R. 2015. “Feature Selection Using Joint Mutual Information Maximisation.” Expert Systems with Applications 42 (22): 8520–8532. doi:10.1016/j.eswa.2015.07.007.
  • Berhane, T. M., C. Lane, Q. Wu, B. Autrey, O. Anenkhonov, V. Chepinoga, and H. Liu. 2018. “Decision-Tree, Rule-Based, and Random Forest Classification of High-Resolution Multispectral Imagery for Wetland Mapping and Inventory.” Remote Sens (Basel) 10 (4): 580. doi:10.3390/rs10040580.
  • Bettinger, P., Boston K., Siry J., Grebner D. L. 2016. Forest Management and Planning. Cambridge, MA: Academic press.
  • Bhatt, P. 2018. Mapping Coastal Wetland and Phragmites on the Hiawatha National Forest Using Unmanned Aerial System (UAS) Imagery: Proof of Concepts. Houghton, MI, USA: Michigan Technological University. doi:10.37099/mtu.dc.etdr/711.
  • Bhatt, P. 2022. 'FINE SCALE MAPPING OF LAURENTIAN MIXED FOREST NATURAL HABITAT COMMUNITIES USING MULTISPECTRAL NAIP AND UAV DATASETS COMBINED WITH MACHINE LEARNING METHODS', Open Access Dissertation, Michigan Technological University. 10.37099/mtu.dc.etdr/1503
  • Bhatt, P., Edson C., Maclean A. 2022b. “Image Processing in Dense Forest Areas Using Unmanned Aerial System (UAS)“. Michigan Tech Publications. doi:10.37099/mtu.dc.michigantech-p/16366.
  • Bhatt, P., Maclean A., Dickinson Y., Kumar C. 2022a. “Fine-Scale Mapping of Natural Ecological Communities Using Machine Learning Approaches.” Remote Sensing 14 (3): 563. doi:10.3390/rs14030563.
  • Boser, B. E., Guyon I. M., Vapnik V. N. 1992. “A Training Algorithm for Optimal Margin Classifiers.” Paper presented at the Proceedings of the fifth annual workshop on Computational learning theory, July 27 - 29, 1992, Pennsylvania, Pittsburgh, USA.
  • Bradter, U., Thom T. J., Altringham J. D., Kunin W. E., Benton T. G. 2011. “Prediction of National Vegetation Classification Communities in the British Uplands Using Environmental Data at Multiple Spatial Scales, Aerial Images and the Classifier Random Forest.” The Journal of Applied Ecology 48 (4): 1057–1065. doi:10.1111/j.1365-2664.2011.02010.x.
  • Breiman, L. 1999. “Random Forests.” UC Berkeley TR567.
  • Breiman, L., and A. Cutler. 2007. “Random Forests-Classification Description.” Department of Statistics, Berkeley 2.
  • Brieman, L., 1984. Classification and Regression Tree Analysis. Boca, Raton, FL: CRC Press. doi:10.1201/9781315139470.
  • Brown, T., Meysembourg P., Host G. E. 2013. “Geospatial Modeling of Native Plant Communities of Minnesota’s Laurentian Mixed Forest.“ University of Minnesota Duluth. https://hdl.handle.net/11299/187333
  • Buchanan, G., Pearce‐Higgins J., Grant M., Robertson D., Waterhouse T. 2005. “Characterization of Moorland Vegetation and the Prediction of Bird Abundance Using Remote Sensing.” Journal of Biogeography 32 (4): 697–707. doi:10.1111/j.1365-2699.2004.01187.x.
  • Burton, T. A. 1993. “Averaged Neural Networks.” Neural Networks 6 (5): 677–680. doi:10.1016/S0893-6080(05)80111-X.
  • Clark, M. L., D. A. Roberts, and D. B. Clark. 2005. “Hyperspectral Discrimination of Tropical Rain Forest Tree Species at Leaf to Crown Scales.” Remote Sensing of Environment 96 (3–4): 375–398. doi:10.1016/j.rse.2005.03.009.
  • Cohen, J. G., Albert D. A., Slaughter B. S., Kost M. A. 2014. A Field Guide to the Natural Communities of Michigan. Michigan, USA: Michigan State University Press.
  • Cohen, J.G., Kost, M.A., Slaughter, B.S., Albert, D.A., Lincoln, J.M., Kortenhoven, A.P., Wilton, C.M., Enander, H.D., Korroch, K.M. 2020. Michigan Natural Community Classification [web application]. Michigan Natural Features Inventory, Michigan State University Extension, Lansing, Michigan. https://mnfi.anr.msu.edu/communities/classification
  • Congalton, R. G., and K. Green. 2019. Assessing the Accuracy of Remotely Sensed Data : Principles and Practices. Third Edition ed. Milton: Chapman and Hall/CRC.
  • Corcoran, J., Knight J. F., Gallant A. L. 2013. “Influence of Multi-Source and Multi-Temporal Remotely Sensed and Ancillary Data on the Accuracy of Random Forest Classification of Wetlands in Northern Minnesota.” Remote Sensing 5 (7): 3212–3238. doi:10.3390/rs5073212.
  • Curtis, J. T. 1959. The Vegetation of Wisconsin: An Ordination of Plant Communities. Madison, WI: University of Wisconsin Pres.
  • Dronova, I., Gong P., Wang L., Zhong L. 2015. “Mapping Dynamic Cover Types in a Large Seasonally Flooded Wetland Using Extended Principal Component Analysis and Object-Based Classification.” Remote Sensing of Environment 158: 193–206. doi:10.1016/j.rse.2014.10.027.
  • Dunteman, G. H. 1989. In Lewis-Beck, Michael(ed) Basic Concepts of Principal Components Analysis. London: SAGE Publications Ltd. 15–22.
  • Fangfang, L., and B. Xiao. 2011. “Aquatic Vegetation Mapping Based on Remote Sensing Imagery: An Application to Honghu Lake.” Paper presented at the 2011 International Conference on Remote Sensing Nanjing, China, Environment and Transportation Engineering.
  • Feng, Q., Liu J., Gong J. 2015. “UAV Remote Sensing for Urban Vegetation Mapping Using Random Forest and Texture Analysis.” Remote Sensing 7 (1): 1074–1094. doi:10.3390/rs70101074.
  • Fernández-Manso, A., Fernández-Manso O., Quintano C. 2016. “SENTINEL-2A Red-Edge Spectral Indices Suitability for Discriminating Burn Severity.” International Journal of Applied Earth Observation and Geoinformation 50: 170–175. doi:10.1016/j.jag.2016.03.005.
  • Franklin, S. E., and O. S. Ahmed. 2018. “Deciduous Tree Species Classification Using Object-Based Analysis and Machine Learning with Unmanned Aerial Vehicle Multispectral Data.” International Journal of Remote Sensing 39 (15–16): 5236–5245. doi:10.1080/01431161.2017.1363442.
  • Gómez, D., . 2019. “Potato Yield Prediction Using Machine Learning Techniques and Sentinel 2 Data.” Remote Sensing 11 (15): 1745. doi:10.3390/rs11151745.
  • Gong, P., Pu R., Yu B. 1997. “Conifer Species Recognition: An Exploratory Analysis of in situ Hyperspectral Data.” Remote Sensing of Environment 62 (2): 189–200. doi:10.1016/S0034-4257(97)00094-1.
  • Gunn, S. R. 1998. “Support Vector Machines for Classification and Regression.” ISIS Technical Report 14 (1): 5–16.
  • Guo, X., Wang M., Jia M., Wang W. 2021. “Estimating Mangrove Leaf Area Index Based on Red-Edge Vegetation Indices: A Comparison Among UAV, WorldView-2 and Sentinel-2 Imagery.” International Journal of Applied Earth Observation and Geoinformation 103: 102493. doi:10.1016/j.jag.2021.102493.
  • Guyon, I., and A. Elisseeff. 2003. “An Introduction to Variable and Feature Selection.” Journal of Machine Learning Research 3 (Mar): 1157–1182.
  • Hall-Beyer, M. 2017. “Practical Guidelines for Choosing GLCM Textures to Use in Landscape Classification Tasks Over a Range of Moderate Spatial Scales.” International Journal of Remote Sensing 38 (5): 1312–1338. doi:10.1080/01431161.2016.1278314.
  • Hansen, M. C., R. S. Defries, J. R. G. Townshend, and R. Sohlberg. 2010. “Global Land Cover Classification at 1 Km Spatial Resolution Using a Classification Tree Approach.” International Journal of Remote Sensing 21 (6–7): 1331–1364. doi:10.1080/014311600210209.
  • Hanson, D. S., and B. Hargrave. 1996. “Development of a Multilevel Ecological Classification System for the State of Minnesota.” Environmental Monitoring and Assessment 39 (1): 75–84. doi:10.1007/BF00396137.
  • Haralick, R. M., K. Shanmugam, and I. Dinstein. 1973. “Textural Features for Image Classification.” IEEE Transactions on Systems, Man, and Cybernetics 6: 610–621. doi:10.1109/TSMC.1973.4309314.
  • Hayes, M. M., S. N. Miller, and M. A. Murphy. 2014. “High-Resolution Landcover Classification Using Random Forest.” Remote Sensing Letters 5 (2): 112–121. doi:10.1080/2150704X.2014.882526.
  • Hill, M. J. 2013. “Vegetation Index Suites as Indicators of Vegetation State in Grassland and Savanna: An Analysis with Simulated SENTINEL 2 Data for a North American Transect.” Remote Sensing of Environment 137: 94–111. doi:10.1016/j.rse.2013.06.004.
  • Hogland, J., N. Anderson, J. St. Peter, J. Drake, and P. Medley. 2018. “Mapping Forest Characteristics at Fine Resolution Across Large Landscapes of the Southeastern United States Using NAIP Imagery and FIA Field Plot Data.” ISPRS International Journal of Geo-Information 7 (4). doi:10.3390/ijgi7040140.
  • Homer, C., Huang C., Yang L., Wylie B. K., Coan M. 2004. “Development of a 2001 National Land-Cover Database for the United States.” Photogrammetric Engineering & Remote Sensing 70 (7): 829–840. doi:10.14358/PERS.70.7.829.
  • Homoya, M. A., Abrell D. B., Aldrich J. A., Post T. W. 1984. “The Natural Regions of Indiana.” Paper presented at the Proceedings of the Indiana Academy of Science.
  • Hoque, N., Bhattacharyya D. K., Kalita J. K. 2014. “MIFS-ND: A Mutual Information-Based Feature Selection Method.” Expert Systems with Applications 41 (14): 6371–6385. doi:10.1016/j.eswa.2014.04.019.
  • Huang, C., Davis L. S., Townshend J. R. 2002. “An Assessment of Support Vector Machines for Land Cover Classification.” International Journal of Remote Sensing 23 (4): 725–749. doi:10.1080/01431160110040323.
  • Hughes, G. 1968. “On the Mean Accuracy of Statistical Pattern Recognizers.” IEEE Transactions on Information Theory 14 (1): 55–63. doi:10.1109/TIT.1968.1054102.
  • Hyvärinen, A., and E. Oja. 2000. “Independent Component Analysis: Algorithms and Applications.” Neural Networks 13 (4–5): 411–430. doi:10.1016/S0893-6080(00)00026-5.
  • Inventory, Florida Natural Areas. 1990. Guide to the Natural Communities of Florida. Florida, USA: Florida Natural Areas Inventory and Florida Department of Natural Resources.
  • Jackson, M. T. 1979. “A Classification of Indiana Plant Communities.” Paper presented at the Proceedings of the Indiana Academy of Science.
  • Jensen, J. R. 2015. Introductory Digital Image Processing: A Remote Sensing Perspective, 4th edition. Glenview, IL, USA: Pearson.
  • Jerome, D. S. 2006. Landforms of the Upper Peninsula, Michigan. Marquette, MI, USA: Natural Resources Conservation Service. In, 56.
  • Juel, A., Groom G. B., Svenning J. C., Ejrnaes R. 2015. “Spatial Application of Random Forest Models for Fine-Scale Coastal Vegetation Classification Using Object Based Analysis of Aerial Orthophoto and DEM Data.” International Journal of Applied Earth Observation and Geoinformation 42: 106–114. doi:10.1016/j.jag.2015.05.008.
  • Kavhu, B., Mashimbye Z. E., Luvuno L. 2021. “Climate-Based Regionalization and Inclusion of Spectral Indices for Enhancing Transboundary Land-Use/cover Classification Using Deep Learning and Machine Learning.” Remote Sensing 13 (24): 5054. doi:10.3390/rs13245054.
  • Kearsley, J. B. 1999. “Inventory and Vegetation Classification of Floodplain Forest Communities in Massachusetts.” Rhodora, Vol. 101, No. 906, 105–135. https://www.jstor.org/stable/23313353
  • Kohavi, R. 1995. “A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection.” Paper presented at the Ijcai.
  • Kuhn, M. 2015. Caret: Classification and Regression Training. Astrophysics Source Code Library.
  • Kuhn, M., P. Balbo, M. E. Banker, R. M. Czerwinski, M. Kuhn, T. S. Maurer, J. -B. Telliez, F. Vincent, and A. J. Wittwer. 2020. The Advantages of Describing Covalent Inhibitor in vitro Potencies by IC50 at a Fixed Time Point. IC50 Determination of Covalent Inhibitors Provides Meaningful Data to Medicinal Chemistry for SAR Optimization. The R Journal29. 10.1016/j.bmc.2020.115865
  • Kulkarni, A. D., and B. Lowe. 2016. “Random Forest Algorithm for Land Cover Classification. Computer Science Faculty Publications and Presentations. http://hdl.handle.net/10950/341
  • Kumar, C., S. Chatterjee, T. Oommen, and A. Guha. 2020. “Automated Lithological Mapping by Integrating Spectral Enhancement Techniques and Machine Learning Algorithms Using AVIRIS-NG Hyperspectral Data in Gold-Bearing Granite-Greenstone Rocks in Hutti, India.” International Journal of Applied Earth Observation and Geoinformation 86: 102006. doi:10.1016/j.jag.2019.102006.
  • Landis, J. R., and G. G. Koch. 1977. “The Measurement of Observer Agreement for Categorical Data.” Biometrics 33: 159–174. doi:10.2307/2529310.
  • Lane, C., Liu H., Autrey B. C., Anenkhonov O. A., Chepinoga V. V., Wu Q. 2014. “Improved Wetland Classification Using Eight-Band High Resolution Satellite Imagery and a Hybrid Approach.” Remote Sensing 6 (12): 12187–12216. doi:10.3390/rs61212187.
  • Lasaponara, R. 2006. “On the Use of Principal Component Analysis (PCA) for Evaluating Interannual Vegetation Anomalies from SPOT/VEGETATION NDVI Temporal Series.” Ecological Modelling 194 (4): 429–434. doi:10.1016/j.ecolmodel.2005.10.035.
  • Lillesand, T., Kiefer R. W., Chipman J. 2015. Remote Sensing and Image Interpretation, Seventh Edition. Hoboken, NJ, USA: John Wiley & Sons.
  • Lindenmayer, D. B., and J. F. Franklin. 2002. Conserving Forest Biodiversity: A Comprehensive Multiscaled Approach. Washington DC, USA: Island press.
  • Mahdianpari, M., Salehi B., Mohammadimanesh F., Brisco B., Mahdavi S., Amani M., Granger J. E. 2018. “Fisher Linear Discriminant Analysis of Coherency Matrix for Wetland Classification Using PolSar Imagery.” Remote Sensing of Environment 206: 300–317. doi:10.1016/j.rse.2017.11.005.
  • Mahesh, P., and P. M. Mather. 2003. “An Assessment of the Effectiveness of Decision Tree Methods for Land Cover Classification.” Remote Sensing of Environment 86 (4): 554–565. doi:10.1016/s0034-4257(03)00132-9.
  • Maillard, P. 2003. “Comparing Texture Analysis Methods Through Classification.” Photogrammetric Engineering & Remote Sensing 69 (4): 357–367. doi:10.14358/PERS.69.4.357.
  • Maxwell, A. E., M. P. Strager, T. A. Warner, C. A. Ramezan, A. N. Morgan, and C. E. Pauley. 2019. “Large-Area, High Spatial Resolution Land Cover Mapping Using Random Forests, GEOBIA, and NAIP Orthophotography: Findings and Recommendations.” Remote Sensing 11 (12). doi:10.3390/rs11121409.
  • Maxwell, A. E., T. A. Warner, and F. Fang. 2018. “Implementation of Machine-Learning Classification in Remote Sensing: An Applied Review.” International Journal of Remote Sensing 39 (9): 2784–2817. doi:10.1080/01431161.2018.1433343.
  • Maxwell, A. E., T. A. Warner, B. C. Vanderbilt, and C. A. Ramezan. 2017. “Land Cover Classification and Feature Extraction from National Agriculture Imagery Program (NAIP) Orthoimagery: A Review.” Photogrammetric Engineering & Remote Sensing 83 (11): 737–747. doi:10.14358/PERS.83.10.737.
  • McHugh, M. L. 2012. “Interrater Reliability: The Kappa Statistic.” Biochemia Medica 22 (3): 276–282. doi:10.11613/BM.2012.031.
  • Momm, H. G., R. ElKadiri, and W. Porter. 2020. “Crop-Type Classification for Long-Term Modeling: An Integrated Remote Sensing and Machine Learning Approach.” Remote Sensing 12 (3): 449. doi:10.3390/rs12030449.
  • Monahan, W. B., C. E. Arnspiger, P. Bhatt, Z. An, F. J. Krist, T. Liu, R. P. Richard, 2022. “A Spectral Three-Dimensional Color Space Model of Tree Crown Health.” PLoS One 17 (10): e0272360. doi:10.1371/journal.pone.0272360.
  • Monahan, W., C. Arnspiger, P. Bhatt, A. Zhongming, F. Krist Jr., T. Liu, R. Richard, 2022. Data and Code From: A Spectral Three-Dimensional Color Space Model of Tree Crown Health. DRYAD. doi:10.5061/dryad.wm37pvmpp.
  • Mountrakis, G., Im J., Ogole C. 2011. “Support Vector Machines in Remote Sensing: A Review.” ISPRS Journal of Photogrammetry and Remote Sensing 66 (3): 247–259. doi:10.1016/j.isprsjprs.2010.11.001.
  • Munyati, C. 2004. “Use of Principal Component Analysis (PCA) of Remote Sensing Images in Wetland Change Detection on the Kafue Flats, Zambia.” Geocarto International 19 (3): 11–22. doi:10.1080/10106040408542313.
  • Noss, R. F. 1987. “From Plant Communities to Landscapes in Conservation Inventories: A Look at the Nature Conservancy (USA).” Biological Conservation 41 (1): 11–37. doi:10.1016/0006-3207(87)90045-0.
  • Prasad, A. M., L. R. Iverson, and A. Liaw. 2006. “Newer Classification and Regression Tree Techniques: Bagging and Random Forests for Ecological Prediction.” Ecosystems (New York, NY) 9 (2): 181–199. doi:10.1007/s10021-005-0054-1.
  • Ripley, B. D. 2007. Pattern Recognition and Neural Networks. Cambridge, England: Cambridge university press.
  • Robichaud, C. D., and R. C. Rooney. 2017. “Long-Term Effects of a Phragmites Australis Invasion on Birds in a Lake Erie Coastal Marsh.” Journal of Great Lakes Research 43 (3): 141–149. doi:10.1016/j.jglr.2017.03.018.
  • Rodriguez-Galiano, V. F., Chica-Olmo M., Abarca-Hernandez F., Atkinson P. M., Jeganathan C. 2012b. “Random Forest Classification of Mediterranean Land Cover Using Multi-Seasonal Imagery and Multi-Seasonal Texture.” Remote Sensing of Environment 121: 93–107. doi:10.1016/j.rse.2011.12.003.
  • Rodriguez-Galiano, V. F., Ghimire B., Rogan J., Chica-Olmo M., Rigol-Sanchez J. P. 2012a. “An Assessment of the Effectiveness of a Random Forest Classifier for Land-Cover Classification.” ISPRS Journal of Photogrammetry and Remote Sensing 67: 93–104. doi:10.1016/j.isprsjprs.2011.11.002.
  • Rouse, J. W., Haas R. H., Schell J. A., Deering D. W. 1974. “Monitoring Vegetation Systems in the Great Plains with ERTS.” NASA Special Publication 351: 309.
  • Ruiliang, P., and S. Landry. 2012. “A Comparative Analysis of High Spatial Resolution IKONOS and WorldView-2 Imagery for Mapping Urban Tree Species.” Remote Sensing of Environment 124: 516–533. doi:10.1016/j.rse.2012.06.011.
  • Schafale, M. P., and A. S. Weakley. 1990. “Classification of the Natural Communities of North Carolina.” In Third Approximation. North Carolina Department of Environment, Health, and Natural Resources, Division of Parks and Recreation. Raleigh, NC: Natural Heritage Program, Raleigh 326.
  • Schulz, K., Hänsch R., Sörgel U. 2018. “Machine Learning Methods for Remote Sensing Applications: An Overview.” Earth Resources and Environmental Remote Sensing/GIS Applications IX 10790: 1079002.
  • Shah, C. A., Anderson I., Gou Z., Hao S., Leason A. 2007b. “Towards the Development of Next Generation Remote Sensing Technology–Erdas Imagine Incorporates a Higher Order Feature Extraction Technoque Based on Ica.” Paper presented at the Proceedings of the ASPRS 2007 Annual Conference May 7-11, 2007 Tampa, Florida.
  • Shah, C. A., Arora M. K., Robila S. A., Varshney P. K. 2002. “ICA Mixture Model Based Unsupervised Classification of Hyperspectral Imagery.” Paper presented at the Applied Imagery Pattern Recognition Workshop, 2002. Proceedings Washington, DC, USA.
  • Shah, C. A., Varshney P. K., Arora M. K. 2007a. “ICA Mixture Model Algorithm for Unsupervised Classification of Remote Sensing Imagery.” International Journal of Remote Sensing 28 (8): 1711–1731. doi:10.1080/01431160500462121.
  • Shang, J., Liu, J., Ma, B., Zhao, T., Jiao, X., Geng, X., Huffman, T., Kovacs, J., Walters, D., et al. 2015. “Mapping Spatial Variability of Crop Growth Conditions Using RapidEye Data in Northern Ontario, Canada.” Remote Sensing of Environment 168: 113–125. doi:10.1016/j.rse.2015.06.024.
  • Shuang, L., L. Xu, Y. Jing, H. Yin, X. Li, and X. Guan. 2021. “High-Quality Vegetation Index Product Generation: A Review of NDVI Time Series Reconstruction Techniques.” International Journal of Applied Earth Observation and Geoinformation 105: 102640. doi:10.1016/j.jag.2021.102640.
  • Sloan, J. L. 2017. National Unmanned Aircraft Systems Project Office: U.S. Geological Survey. Denver, Colorado: Agisoft PhotoScan Workflow.
  • Sperduto, D. D., and W. F. Nichols. 2004. Natural Communities of New Hampshire: UNH Cooperative Extension. Durham, NH: New Hampshire Natural Heritage Bureau.
  • Taghizadeh-Mehrjardi, R., Schmidt, K., Amirian-Chakan, A., Rentschler, T., Zeraatpisheh, M., Sarmadian, F., Valavi, R., Davatgar, N., Behrens, T., Scholten, T., et al. 2020. “Improving the Spatial Prediction of Soil Organic Carbon Content in Two Contrasting Climatic Regions by Stacking Machine Learning Models and Rescanning Covariate Space.” Remote Sensing 12 (7): 1095. doi:10.3390/rs12071095.
  • Tassi, A., and M. Vizzari. 2020. “Object-Oriented Lulc Classification in Google Earth Engine Combining Snic, Glcm, and Machine Learning Algorithms.” Remote Sensing 12 (22): 3776. doi:10.3390/rs12223776.
  • Taylor, J. C., Brewer, T.R., Bird, A.C., et al. 2000. “Monitoring Landscape Change in the National Parks of England and Wales Using Aerial Photo Interpretation and GIS.” International Journal of Remote Sensing 21 (13–14): 2737–2752. doi:10.1080/01431160050110269.
  • Team, R. C. 2013. “R: A Language and Environment for Statistical Computing.“ R Foundation for Statistical Computing. https://www.r-project.org/index.html
  • Tucker, C. J. 1979. “Red and Photographic Infrared Linear Combinations for Monitoring Vegetation.” Remote Sensing of Environment 8 (2): 127–150. doi:10.1016/0034-4257(79)90013-0.
  • USDA. 2022. “2018 Michigan Image Dates.” USDA, Accessed 8 July 2022. https://naip-image-dates-usdaonline.hub.arcgis.com/datasets/8abca94b0db34143b3b5cdd2c99e7fe9_0/about.
  • Van Etten Adam, Lindenbaum Dave, Bacastow Todd M. 2018. “Spacenet: A Remote Sensing Dataset and Challenge Series.” arXiv preprint arXiv: 180701232 doi:https://doi.org/10.48550/arXiv.1807.01232.
  • Vapnik, V. 2013. “The Nature of Statistical Learning theory: Springer Science & Business Media Springer 1–334 .”
  • Vogelmann, J. E., Howard, S.M., Yang, L., Larson, C.R., Wylie, B.K., Van Driel, N. 2001. “Completion of the 1990s National Land Cover Data Set for the Conterminous United States from Landsat Thematic Mapper Data and Ancillary Data Sources.” Photogrammetric Engineering and Remote Sensing 67: 650-655, 657-659, 661–662.
  • White, M. A., and G. E. Host. 2008. “Forest Disturbance Frequency and Patch Structure from Pre-European Settlement to Present in the Mixed Forest Province of Minnesota, USA.” Canadian Journal of Forest Research 38 (8): 2212–2226. doi:10.1139/X08-065.
  • Whitlatch, R. B. 1977. “Seasonal Changes in the Community Structure of the Macrobenthos Inhabiting the Intertidal Sand and Mud Flats of Barnstable Harbor, Massachusetts.” The Biological Bulletin 152 (2): 275–294. doi:10.2307/1540565.
  • Whittaker, R. H. 1962. “Classification of Natural Communities.” The Botanical Review 28 (1): 1–239. doi:10.1007/BF02860872.
  • Wilson, D. C., and A. R. Ek. 2017. “Imputing Plant Community Classifications for Forest Inventory Plots.” Ecological Indicators 80: 327–336. doi:10.1016/j.ecolind.2017.04.043.
  • Wolf, A. F. 2012. “Using WorldView-2 Vis-NIR Multispectral Imagery to Support Land Mapping and Feature Extraction Using Normalized Difference Index Ratios.” Paper presented at the Algorithms and Technologies for Multispectral, Hyperspectral, and Ultraspectral Imagery XVIII.
  • Yang, L., Jin, S., Danielson, P., Homer, C., Gass, L., Bender, S.M., Case, A., Costello, C., Dewitz, J., Fry, J., Funk, M. 2018. “A New Generation of the United States National Land Cover Database: Requirements, Research Priorities, Design, and Implementation Strategies.” ISPRS Journal of Photogrammetry and Remote Sensing 146: 108–123. doi:10.1016/j.isprsjprs.2018.09.006.
  • Zhang, W., Liu, H., Wu, W., Zhan, L., Wei, J. 2020. “Mapping Rice Paddy Based on Machine Learning with Sentinel-2 Multi-Temporal Data: Model Comparison and Transferability.” Remote Sensing 12 (10): 1620. doi:10.3390/rs12101620.
  • Zheng, C., Abd-Elrahman A., Whitaker V. 2021. “Remote Sensing and Machine Learning in Crop Phenotyping and Management, with an Emphasis on Applications in Strawberry Farming.” Remote Sensing 13 (3): 531. doi:10.3390/rs13030531.
  • Zhong, L., Hu L., Zhou H. 2019. “Deep Learning Based Multi-Temporal Crop Classification.” Remote Sensing of Environment 221: 430–443. doi:10.1016/j.rse.2018.11.032.
  • Zhu, Y., Liu K., Liu L., Myint S. W., Wang S., Liu H., He Z. 2017. “Exploring the Potential of Worldview-2 Red-Edge Band-Based Vegetation Indices for Estimation of Mangrove Leaf Area Index with Machine Learning Algorithms.” Remote Sensing 9 (10): 1060. doi:10.3390/rs9101060.

Appendix A

Table A1. NAIP and UAV classification accuracy assessment matrices for PAC. EM – Emergent Marsh, SM – Submergent Marsh, GLM – Great Lakes Marsh, IW – Interdunal Wetlands, WDSC – Wooded Dune & Swale Complex, SGB – Sand & Gravel Beach, OW – Open Water, IS – Impervious Surface.

Table A2. NAIP and UAV classification accuracy assessment matrices for the CRM. EM – Emergent Marsh, GLM – Great Lakes Marsh, NST – Northern Shrub Thicket, RCS – Rich Conifer Swamp, WDSC – Wooded Dune & Swale Complex, OW – Open Water, OL Open Land, IS – Impervious Surface.

Figure A1. Spectral reflectance signatures for PAC study area shown using NAIP (8-bit-unsigned) and UAV (16-bit-unsigned) imagery DN Values.

Figure A1. Spectral reflectance signatures for PAC study area shown using NAIP (8-bit-unsigned) and UAV (16-bit-unsigned) imagery DN Values.

Figure B1. Spectral reflectance signatures for CRM study area shown using NAIP (8-bit-unsigned) and UAV (16-bit-unsigned) imagery DN Values.

Figure B1. Spectral reflectance signatures for CRM study area shown using NAIP (8-bit-unsigned) and UAV (16-bit-unsigned) imagery DN Values.

Figure A2. GLCM texture (Contrast) differences in details caused by the window sizes.

Figure A2. GLCM texture (Contrast) differences in details caused by the window sizes.