476
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Predictions of Spartina alterniflora leaf functional traits based on hyperspectral data and machine learning models

, , , , , , , , & show all
Article: 2294951 | Received 26 Jul 2023, Accepted 09 Dec 2023, Published online: 22 Dec 2023

ABSTRACT

Investigating the functional traits of Spartina alterniflora can provide insights towards understanding its invasion mechanism, and developing a method leaves can improve its management in coastal wetlands. Here, we examined the relationship between 11 leaf functional traits of S. alterniflora and hyperspectral data and investigated the feature bands through importance score analysis. Using original spectral and first-order differential conversion data of feature bands, we established four prediction models: random forest (RF), support vector machine (SVM), eXtreme Gradient Boosting (XGBoost), and back propagation neural network (BPNN). The study results showed that: (1) the SVM model based on Random Forest Importance Score is well-suited for S. alterniflora leaf functional trait inversion; (2) the importance score of leaf functional traits differed, and first-order differential spectral data produced more bands with high scores compared with the original hyperspectral reflectance data; (3) first-order differential data modelling effects were slightly better than those of the original spectral data. However, the first-order differential treatment did not show a significant improvement in the validation accuracy compared with the original data, and the accuracy of some traits decreased. Our study provides a new methodological approach for improving the monitoring and management of S. alterniflora in coastal wetlands.

Introduction

Coastal wetlands are ecologically dynamic areas where the sea and land intersect, and they play important roles in protecting coastlines, preventing erosion, regulating climate, purifying water, and providing habitats for a diverse range of organisms. Thus, these wetlands are among the most abundant ecosystems in terms of species richness worldwide (Barbier et al., Citation2011; Cai et al., Citation2009; Sheaves et al., Citation2014). However, with the increasing impact of human activities and climate change, these wetlands are facing serious threats, resulting in considerable damage to their ecosystem functions (Blankespoor et al., Citation2014; Kirwan & Megonigal, Citation2013; Tian et al., Citation2016). Spartina alterniflora (S. alterniflora) is a perennial herb commonly found in intertidal wetlands. It is native to the southeastern coast of the United States and provides ecological functions such as soil improvement, air purification, greening, wind and wave protection for soil, beach sand suppression, and wetland area expansion (Liu et al., Citation2021; Yu et al., Citation2022). Introduced to China’s coast in 1979, S. alterniflora initially played an important ecological role in resisting typhoons and ameliorating wave-induced damage to the coastline (An et al., Citation2007). However, recent studies have identified it as a threat to the biodiversity of coastal wetland ecosystems (Chen et al., Citation2018; Liu et al., Citation2018). Therefore, studies on S. alterniflora can help provide a theoretical basis for the conservation and monitoring of local biodiversity.

Plant functional traits are defined as a series of plant morphological and physiological attributes that can significantly affect their colonization, survival, and growth (Violle et al., Citation2007; Westoby & Wright, Citation2006). Leaf functional traits are among the most important components of plant functional traits, and they can reflect the adaptive strategies of plants to environmental changes (Jankowska & Włodarska-Kowalczuk, Citation2022; Jiang et al., Citation2022). For instance, the specific leaf area (SLA) and leaf nutrient content are closely related to the relative growth rate (Reich & Oleksyn, Citation2004). Additionally, the chlorophyll concentration, leaf nitrogen, SLA, and nitrogen to phosphorus ratio are indicative of the survival strategies of plants to optimize their access to resources (Carlucci et al., Citation2015; Liu et al., Citation2017).

Research on the functional traits of wetland plants has increased in recent years. Zhang (Citation2022) analysed the functional traits of seven dominant plants in the lakeside wetland of the Inner Mongolian Plateau, including Phragmites australis and Suaeda glauca. They discovered that environmental changes significantly affect plant height, leaf carbon content, and leaf nitrogen content. Similarly, Zhou et al. (Citation2022) studied the leaf functional traits of Phragmites australis, a representative wetland plant, and observed that the traits leaf thickness (LT) and SLA decreased from wetland to desert habitats and soil bulk density impacted the leaf functional traits in wetland habitats. In another study, Zhu (Citation2023) conducted a statistical comparison of the functional characteristics of 12 wetland plants, including Hemerocallis fulva and Isachne globose, and found significant correlations between certain traits and rich variations in the functional traits of wetland plants. Leaf functional traits of wetland plants are closely related to their resource utilization efficiency, which can reflect the adaptation strategies of wetland plants under different resource conditions, and are of great ecological importance.

Traditional measurement methods for leaf functional traits involve the transportation of leaf samples back to the laboratory for chemical measurement immediately after collection. Although the measurement accuracy of this processing method is high, it is time-consuming and laborious. Moreover, due to the complexity of the geographic conditions of coastal wetlands and high temperature and humidity conditions in summer, leaf traits are difficult to measure in a timely manner, and dried, curly leaves lead to measurement deviations. However, the increasingly mature hyperspectral remote sensing technology has provided new ideas and methods for solving the current problem of measuring plant functional traits. The spectral data acquired by this technology are high-resolution, information-rich, and characterized by rapid, nondestructive, repeatable and large-area monitoring, which has enabled high-throughput and high-precision monitoring of plant functional traits at the leaf scale (Asner et al., Citation2016; Li et al., Citation2020; Pacheco-Labrador et al., Citation2022; Rebelo et al., Citation2018).

Currently, significant progress has been made in the use of hyperspectral data for the inversion of vegetation functional traits in forest and grassland ecosystems (Gitelson et al., Citation2001; Yendrek et al., Citation2017). In wetlands, several studies have also explored the inversion of functional traits using hyperspectral techniques. For example, the correcting normalized difference nitrogen index (NDNI) has been proved to have great potential in the prediction of canopy and leaf nitrogen content of aquatic vegetation in wetlands (Wang & Wei, Citation2016; Zhang et al., Citation2019). Dou et al. (Citation2019) used linear and partial least squares regressions to establish an inversion model for Suaeda salsa biomass and found that the partial least squares regression model had good accuracy. In addition, random forest (RF) model has been shown to be effective in inverting the ecological chemometric characteristics of Phragmites communis and Kandelia candel (Cui et al., Citation2020; Tang et al., Citation2022). And Li et al. (Citation2020) estimated the total nitrogen (TN) and total phosphorus (TP) contents of artificial wetlands using two wetland plant canopy spectral reflectance data, thus confirming the feasibility of using plant canopy spectral reflectance data to estimate water and sediment TN and TP contents. Furthermore, some scholars have also explored the inversion of functional traits of wetland plants using hyperspectral techniques. For instance, Rebelo et al. (Citation2018) modelled the relationship between the canopy spectra and functional traits of 22 dominant plant species in South African wetlands, thus providing an effective tool for the differentiation of plant functional groups and even species. Zuo (Citation2021) conducted an inversion study of the functional traits of S. alterniflora using machine learning models, such as RF, and they achieved good accuracy and provided theoretical support for the inversion of functional traits using hyperspectral techniques.

An increasing number of studies have shown that plant functional traits play an important role in mechanistic analysis and process prediction at numerous ecological scales in organs, species, and ecosystems, and represent a reliable method of addressing important ecological questions at the population, community, and ecosystem scales (He et al., Citation2022; Levine, Citation2016; Sack et al., Citation2013). Systematic studies on the functional traits of S. alterniflora have revealed the growth status of S. alterniflora and analysed its growth strategies and adaptive mechanisms. Although traditional trait measurement methods are difficult to apply for the large-scale measurement of S. alterniflora functional traits, the rapid development of hyperspectral technology provides technical possibilities for realizing the inversion of functional traits of S. alterniflora. Exploring methods for inverting plant functional traits through hyperspectral techniques can pave the way for direct predictions of a wide range of plant functional traits from hyperspectral data and analyses of ecological problems based on the predicted trait values. In this study, we aimed to identify the optimal hyperspectral techniques and machine learning models to invert the functional traits of S. alterniflora. Specifically, we measured the leaf functional traits of S. alterniflora and used hyperspectral data to establish prediction models of the main leaf functional traits. This study provides a theoretical basis for further understanding the invasion mechanism of S. alterniflora and its environmental response strategies and proposes a new approach for monitoring and managing S. alterniflora in coastal wetlands.

Materials and methods

Study area

The Yancheng coastal wetland in Jiangsu Province, China (32°56’–33°36’ N, 120°13’–120°56’ E), is an extensive intertidal system covering a 582 km coastline, and it is located along the lower reaches of the Huaihe River and faces the Yellow Sea to the east. The region experiences northerly winds from high-latitude regions in winter and southerly winds from the Pacific Ocean in summer, resulting in a regulated annual average temperature with limited variations. The area is characterized by various types of coastal wetland ecosystems that include salt marsh wetlands, mudflats, and shallow water areas. The study site was located in the Dafeng Elk Nature Reserve of Yancheng and the Jiangsu Yancheng Wetland National Nature Reserve Rare Birds, which are known for their diverse halophytic vegetation, including S. alterniflora, P. communis, and Imperata cylindrica. However, the rapid expansion of S. alterniflora since its initial invasion has become a major threat to the local ecology and economy, thus highlighting the need for effective management and monitoring of this invasive species.

Acquisition of hyperspectral data

In August 2019 and August 2020, 238 sampling points were randomly arranged and sampled in the Yancheng coastal wetland, with 120 sampling sites located in Dafeng Elk and 118 sampling sites in Rare Birds. Sampling sites were separated by 10 m, and each site consisted of a single species: S. alterniflora. Spectral information was acquired using an FS4 portable ground object spectrometer produced by the Analytical Spectral Device (ASD) company (United States), and it had a spectral range of 350–2500 nm and a sampling interval of 1 nm. Three unfurled, intact, and disease-free healthy leaves were randomly collected from the upper, middle, and lower parts of S. alterniflora at each sampling site. An ASD FS4 with leaf clips was utilized to measure the spectral reflectance from the middle part of the leaves, avoiding the main leaf veins. Ten spectral data points were collected for each leaf, and the spectral reflectance values of the three leaves were averaged to represent the spectral reflectance of the leaves at the sampling point.

Determination of leaf functional traits

The freshly collected leaves were sealed in aluminum envelopes after collection, stored in a portable cooler, and measured for their fresh weight (FW), LT, soil plant analysis development (SPAD) index, and leaf area within 4 h of collection. Subsequently, the leaves were blanched at 105°C for 15 min and then dried to a constant weight at 85°C for 48 h to obtain the dry weight. Other indicators were determined after crushing and sieving the dried leaves. The TN and total carbon (TC) contents were simultaneously determined using an elemental analyser (vario PYRO cube; Elemental, Germany). The plant samples were decomposed via high-temperature combustion, and the mixed gas was automatically measured using a thermal conductivity detection system. The organic phosphorus in the plants was converted into phosphate by digestion with H2SO4–H2O2 and measured using a spectrophotometer. To determine the TP content, a standard curve was generated. lists the methods used for determining the various leaf functional traits of S. alterniflora.

Table 1. Methods for determining the main leaf functional traits of Spartina alterniflora.

Spectral data conversion

Viewspec Pro software (ASD, Inc., Boulder, Colorado, USA) was used to process the collected hyperspectral data of the S. alterniflora leaves. The spectral curve had to be corrected because the acquisition of the spectrum may cause pulsation of the connection points.

In this experiment, two spectral conversion methods, namely, the original spectral and first-order differential (FD) reflectance methods, were used to study the relationship between spectral reflectance and leaf functional traits. The FD processing of the spectrum can amplify the difference in the spectral characteristics among the samples (Chen et al., Citation2019), and its applicability for functional trait inversion has been demonstrated (Zuo, Citation2021). The FD conversion formula is as follows:

(1) FDRλi=Rλi+1Rλi1Δλ(1)

where λi is the wavelength at band i, FDR(λi) is the first-order differential spectral value at wavelength λi, and λ is the wavelength difference between band i +1 and band i.

Feature selection

Utilizing hyperspectral data directly can pose challenges in optimization and lead to extended computational times due to the characteristics of the data, including a large number of bands, high correlation, and data redundancy. Therefore, we used the special importance calculation methods that come with both Random Forest (RF) and eXtreme Gradient Boosting (XGBoost) algorithms to calculate the importance scores of the input variables for each leaf functional trait of S. alterniflora, respectively. We arranged them in descending order and filtered out the bands with importance scores exceeding the standard deviation, identifying them as the characteristic bands. The computation of RF feature importance scores was implemented in MATLAB, while the computation of XGBoost feature importance scores was implemented in Python using the xgboost package.

Model establishment

Based on the principle that the ratio between the number of validation and training sets is 1/3, 238 samples were randomly divided into 179 modelling and 59 verification samples. We chose four models, RF, Back Propagation Neural Network (BPNN), Support Vector Machine (SVM) and XGBoost, to realize the inversion of functional traits of S. alterniflora for hyperspectral data. SVM is a kernel-based learning method that uses a kernel function to map input variables to a high-dimensional feature space and extracts a linear hyperplane from the feature space as a decision function to solve the regression problem (Beltrami & da Silva, Citation2020). The learning rule of BPNN is to minimize the sum of squared errors of the network by using the most rapid descent method, which continuously adjusts the weights and winks of the network through backpropagation (Wan et al., Citation2018). RF is an integrated algorithm based on decision trees. Its fundamental concept involves the random selection of multiple samples, with replacement, from the training set to construct a set of entirely independent decision trees. The final prediction result is independent of each other and the final prediction result is decided by the majority voting principle (Breiman, Citation2001; Liaw & Wiener, Citation2002). XGBoost is a machine learning algorithm for regression, classification, and ranking. It is an efficient implementation of Gradient Boosting Decision Trees (GBDT) that can run on large-scale datasets with strong generalization capabilities (Chen & Guestrin, Citation2016). The three models RF, BPNN, and SVM are implemented relying on the Weka 3.8.0 software, while the XGBoost model is implemented based on the xgboost package for Python. Default values were used for the parameters of all four models. Weka is an open-source data mining software in the field of machine learning (Hall et al., Citation2009; Suykens, Citation2014).

The accuracy of the models was assessed by calculating the determination coefficient (R2) of the regression model and the root mean square error (RMSE) of the model simulation value. The model with the highest accuracy was selected as the optimal model. The calculation formulas are as follows:

(2) R2=i=1n(yyi)2i=1n(yˉyi)2(2)
(3) RMSE=i=1n(yyi)2n(3)

where y represents the measured values of leaf functional traits, yi represents the model predictive value of leaf functional traits, yˉ represents the measured average value of the leaf functional traits, and n represents the number of samples.

Results

Leaf functional traits

gThe leaf functional traits of S. alterniflora were analysed, and the results are presented in . The FW of a single leaf ranged from 0.68–5.84 g, with a standard deviation of 0.84. The largest LT value was 0.74 mm, and the standard deviation was 0.09. The minimum, maximum, and standard deviation values of the SPAD index were 14.97, 54.00, and 5.38, respectively. Among the nutrient indicators, the TC content had the highest average value of 360.96 g/kg and the TP content had the lowest average value of 1.90 g/kg. The mean N/P, C/N, and C/P values were 8.00, 25.31, and 298.63, respectively, with N/P being the lowest. The average MC of the leaves was 68.18, with a standard deviation of 5.75. The maximum observed SLA value was 87.42 cm2/g, with a standard deviation of 28.33. SLA data showed the largest dispersion, with a coefficient of variation of 32.40%, while MC data showed the smallest, with a coefficient of variation of 8.43%. Overall, the range of functional traits and inclusion of the standard deviation indicate the differences among the leaves of S. alterniflora, thus providing conditions for the inversion of functional traits based on hyperspectral data.

Table 2. Leaf functional traits of Spartina alterniflora.

Extraction of spectral features

In our study, we used two importance score methods to extract characteristic bands for the functional traits of S. alterniflora leaves. The importance scores of the original and FD spectra for each functional trait based on RF are shown in , while shows the importance scores of OR and FD spectra for each functional trait based on the XGBoost method. Each functional trait had specific importance score bands.

Figure 1. Analysis of random forest (RF) importance scores for the original (OR) and first-order differential (FD) hyperspectral data: (a) FW, (b) LT, (c) SPAD, (d) MC, (e) SLA, (f) TN, (g) TP, (h) TC, (i) C/N, (j) C/P, and (k) N/P.

Figure 1. Analysis of random forest (RF) importance scores for the original (OR) and first-order differential (FD) hyperspectral data: (a) FW, (b) LT, (c) SPAD, (d) MC, (e) SLA, (f) TN, (g) TP, (h) TC, (i) C/N, (j) C/P, and (k) N/P.

Figure 2. Analysis of eXtreme Gradient Boosting (XGBoost) importance scores for the or and FD hyperspectral data: (a) C/N, (b) C/P, (c) N/P, (d) TC, (e) TN, (f) TP, (g) FW, (h) LT, (i) MC, (j) SLA, and (k) SPAD.

Figure 2. Analysis of eXtreme Gradient Boosting (XGBoost) importance scores for the or and FD hyperspectral data: (a) C/N, (b) C/P, (c) N/P, (d) TC, (e) TN, (f) TP, (g) FW, (h) LT, (i) MC, (j) SLA, and (k) SPAD.

Feature bands based on RF importance scores

shows that the number of bands with high-importance scores increased significantly for the FD data compared with that for the original spectral data, and the largest importance scores were observed. Specifically, TN, TP, C/P, C/N, MC, and SPAD had more high-importance bands for the original spectral data while other functional traits had fewer. However, for the FD processed spectral data, each functional trait index had bands with higher importance scores. It should be noted that TC and SLA had relatively fewer bands with high importance scores for the FD processed spectral data.

Bands with values larger than the standard deviation of the importance score were selected as characteristic bands (Supplementary table S1). The high-importance score bands of TN, TP, C/N, C/P, and N/P were distributed across all ranges; nevertheless, it mainly concentrated in the 300–700 nm band. The high-importance score bands of the FD processing data of TC were mainly distributed between 2200 and 2500 nm, while those of the original spectral data of TC were mainly distributed between 300 and 500 nm. The high-importance score bands of FW and SLA were mainly distributed between 300 and 500 nm, while those of SPAD were distributed between 400 and 800 nm. For LT, the high-importance score bands of the FD processing data were mainly distributed between 900 and 1300 nm while those of the original spectral data were between 300 and 400 nm and 1400 and 1500 nm. For MC, the high-importance score bands of the FD processing data were distributed in the 1600–1900 nm range, while those of the original spectral data were in the 300–500 nm range.

Feature bands based on XGBoost importance scores

Combining and Supplementary table S2, we find that FW screens the least number of feature bands, which is due to the fact that it has bands with importance scores greater than 0.3 in both OR spectral data and FD spectral data. The feature bands of the FD spectra of LT are mainly distributed in the 400–1400 band range, while the feature bands of the OR spectra, although distributed in all ranges, have fewer feature bands after 2000 nm. The feature bands of SLA and SPAD are mainly concentrated in the region of the visible wavelength. In terms of the distribution of the characteristic bands of chemical element content, TC and TN have fewer characteristic bands, while C/N has more characteristic bands. In addition, the importance scores of the characteristic bands of TP, C/N, C/P and N/P are generally low, which leads to the high number of feature bands.

Evaluation of leaf functional trait modelling

Model accuracy based on random forest band importance scores

The remaining 59 validation data samples were used to evaluate the performance of the four models in predicting the functional traits of S. alterniflora leaves. The validation results are shown in , which shows that the RF model had the best inversion accuracy for TC, with R2 values of 0.56 and 0.69 and RMSEs of 27.46 and 26.38 for the original and FD data, respectively. However, N/P showed the lowest prediction accuracy. The SVM model had the best inversion accuracy for MA, with R2 values of 0.92 and 0.90 for the original and FD data, respectively. The BPNN model had the best inversion accuracy for LT using the original data, with an R2 of 0.88; nevertheless, it had the best inversion accuracy for FW using the FD data, with an R2 of 0.76. The XGBoost model had the best inversion accuracy for TC using the FD data, with an R2 of 0.66; however, it showed but poor prediction accuracy for eco-chemometric features, such as TP and N/P, with R2 values of less than 0.1, which is similar to the validation by RF.

Table 3. Modelling accuracy verification.

Overall, the SVM model had better validation accuracy than the other three models, however, it showed the lowest modelling accuracy. In addition, the difference in RMSE between the predicted values of each functional trait after FD processing was not significant compared with that for the original data. However, in the modelling, the difference in RMSE between the FD and original data for each functional trait was significant.

In addition, the FD treatment improved the accuracy of the RF and XGBoost models, and the inversion accuracy of the FD treatment data for each functional trait was higher than that of the original data, although the effect was not apparent for the SVM and BPNN models. In the SVM model, the inversion accuracy of the original data for MC and SLA was significantly higher than that of the FD processing data, while the inversion accuracy of the original data for TC was significantly lower. Similarly, in the BPNN model, the inversion accuracy of the original data for LT, SLA, TN, and C/N was also greater than that of the FD processing data. The accuracy of the four models for most functional traits decreased in the validation phase to varying degrees, with the RF and XGBoost model being the most affected and showing high modelling accuracy but poor validation results. Nonetheless, some traits did show improvements in validation accuracy, such as the MC prediction based on the SVM model, which presented R2 values of 0.92 and 0.90 for the original and FD processed data, respectively.

Model accuracy based on xgboost band importance scores

RF model

The accuracy of the RF model is shown in , where a larger R2 and smaller RMSE indicate better model prediction ability and higher model ability. Among the six traits of carbon, nitrogen and phosphorus and their ecological stoichiometric characteristics, TC exhibited the highest validation accuracy, whose R2 of the spectral data of OR and FD was about 0.6, and the R2 of the other traits was below 0.2. Meanwhile, the validation accuracies of FW and MC were relatively good, and the R2 of the two data treatments were above 0.5. Overall, the data of each trait after FD processing showed better validation effects, except for SLA. The R2 of the FD data of SLA was 0.26 and the RMSE was 23.81, whereas the R2 of the OR data was 0.41 and the RMSE was 20.98. The validation accuracy of the FD data showed a decrease in the R2 and an increase in the RMSE compared with that of the original data.

Figure 3. Scatter plots of RF model accuracy.

Figure 3. Scatter plots of RF model accuracy.

SVM model

The results of the SVM validation are shown in . The highest accuracy of OR data validation is FW with R2 of 0.57, RMSE of 0.59, and the worst validation of C/P with R2 of only 0.07 and RMSE of 50.82. The best validation of FD data is MC with R2 of 0.65, RMSE of 2.85, and the lowest accuracy is TP with R2 of 0.13, RMSE of 0.36. The SVM validation results show that the validation accuracy of hyperspectral data after first-order differentiation is significantly improved compared with that of the original spectral data. Overall, the validation accuracy of the SVM model is lower than that of the RF model for both the OR data and the FD data.

Figure 4. Scatter plots of SVM modelling accuracy.

Figure 4. Scatter plots of SVM modelling accuracy.

BPNN model

The effect of the BPNN model is illustrated in , which shows that the accuracy of the neural network model was relatively different between the original and FD processing spectra. The validation accuracy of the BPNN model is poor, with R2 mostly below 0.3. Overall, the validation accuracy has not improved much after first-order differentiation, and even C/N and FW have the phenomenon of smaller R2. Combined with the scatter distribution in the figure, only FW and TC have slightly better model validation accuracies after FD processing, with R2 of 0.53 and 0.54, respectively. In addition, traits such as TC and C/P exhibit a situation where both R2 and RMSE increase after the FD processing. This contradicts the principle that the larger R2 and the smaller RMSE indicate higher model accuracy, suggesting lower model reliability. Comparing the validation accuracies of different models for the same trait, the RMSE of the BPNN model is larger than that of RF and SVM, indicating that the model is less stable.

Figure 5. Scatter plots of BPNN modelling accuracy.

Figure 5. Scatter plots of BPNN modelling accuracy.

Xgboost model

The accuracy of the XGBoost model is shown in . After the FD processing, the XGBoost model demonstrates improved predictions for the four indicators: TC, FW, MC, and SPAD, with R2 values exceeding 0.50, while the R2 and RMSE values had a slightly lower accuracy than that of the RF model; however, they outperform SVM and BPNN. C/N, C/P, and N/P had the lowest prediction accuracy, and the R2 of both data processing methods is lower than 0.2. After the first-order differential transform, the accuracy of each model built by the XGBoost algorithm improved, among which the LT had the highest improvement, and the R2 and RMSE of the original spectral data are 0.18 and 0.09, respectively, while those of the FD spectra are 0.40 and 0.07, and the R2 is apparently more improved. Overall, the modelling accuracy of XGBoost is slightly lower than that of RF, and more accurate and stable than SVM and BPNN.

Figure 6. Scatter plots of XGBoost modelling accuracy.

Figure 6. Scatter plots of XGBoost modelling accuracy.

Discussion

Analysis of the feature bands extracted using the importance score method

The high correlation between consecutive bands of hyperspectral data causes an inevitable dimensionality disaster. Therefore, dimensionality reduction of the data is required before constructing the model, which reduces the complexity of the model, improves computational efficiency, and reduces the risk of overfitting (Ding et al., Citation2020). The importance score method is used to rank the input features of a prediction model according to their relative importance in making predictions. In this study, we used this method to extract feature bands that were strongly correlated to the target parameters from the hyperspectral data, which reduced the dimensionality of the data and eliminated redundancy. We found that the importance score of different leaf functional traits and hyperspectral data differed, which led to variations in the extracted characteristic bands. IIn general, the results of the feature bands screened by the importance value ranking based on the two methods of Random Forest Fand XGBoost exhibited similarities. Both methods screened a small number of bands with high importance scores for FW, TC, and SLA. For SPAD, the selected feature bands screened are concentrated in the visible light zone, while chemical elements such as TN, TP, and ecological stoichiometric characteristics are distributed uniformly throughout the zone. This indicates that the two methods, RF and XGBoost, are consistent in the screening of feature bands and show certain regularity for different features. This provides important guidance and reference for further research and application. While the characteristic bands extracted from the original spectrum were significantly different from those extracted from the FD processing data, the importance score after FD processing was greatly improved compared to that for the original reflectance. This may be because the original hyperspectral data presented certain issues that hindered the extraction of relevant information, such as lack of linear correlation between reflectivity and measured traits and a low signal-to-noise ratio. The original reflectivity information is easily obscured by redundant information; thus, the dimension must be reduced by data conversion. The FD treatment eliminated noise and enhanced the subtle changes in spectral reflectivity (Cui et al., Citation2021; Liu et al., Citation2020), which facilitation the extraction of characteristic bands with high-importance scores. However, we used two spectral downscaling methods, XGBoost and RF, respectively, for the subsequent analysis, which may have an impact on the accuracy of other models. In future studies, we will further explore the fusion of multiple spectral dimensionality reduction methods and synthesize the advantages of each method to obtain more reliable and robust feature selection results.

Spectral modelling of functional traits in the leaves of S. alterniflora

In this study, the data of 11 leaf traits of S. alterniflora were modelled uniformly with the corresponding spectra using four methods, namely RF, SVM, BPNN and XGBoost. In the final trait-spectrum models obtained, the accuracy of TC, FW, and MC improved, followed by LT, SLA, and SPAD, and the inversion accuracy of TN, TP, and three ecological chemometric traits was poor. In this study, the leaf water content model was fitted better, and the R2 of only two of the eight models established by the two characteristic band screening methods after FD processing was lower than 0.50, while the others were higher than 0.55, which was better than the previous model fit for leaf water content of wetland plants (R2 = 0.49; Yao et al., Citation2022), indicating that the model had some predictive ability. The inverse modelling effect for SPAD of S. alterniflora leaves was poor, only the SVM and BPNN models based on the high importance score bands of random forests had better accuracy, with R2 higher than 0.60, and the R2 of the rest of the models was below 0.50, which was poorer than the inverse study of SPAD of mangrove leaves (R2 >0.70; Dou et al., Citation2019). This may be due to the different absorption, scattering and transmission characteristics of different plant leaves, coupled with the differences in the environmental conditions in which S. alterniflora grows, which may lead to a more complex and unstable relationship between leaf SPAD values and other environmental factors, thus reducing the effectiveness of the inversion model.

In this study, we also conducted an inversion study on the chemical element content of S. alterniflora leaves, and the results showed that only the TC had a better inversion effect, while the inversion effect of the other indexes was poor. This may be due to the failure to consider proper model selection and algorithm optimization in this inversion study. Previous studies have shown that the PLS model has a better performance in inversion of elemental content (Tang et al., Citation2022; Wold et al., Citation2001). In our study, the four regression models were less effective in predicting chemical elemental content, but it does not mean that the chemical elemental content of S. alterniflora leaves could not be predicted by hyperspectral data, and it is necessary to continue to explore more model combinations and parameter optimization. Therefore, we will further explore the performance of different models in the inversion of chemical elemental content and select more appropriate algorithms and models for experiments to improve the inversion effect and accuracy in future studies.

As a whole, when we divided the dataset, we divided it into training and testing sets separately, which may lead to a large difference in data distribution between the training and testing sets. Thus, we obtained a model with a better accuracy during training and not achieved the expected results during testing due to the inability of the testing dataset to adapt to the model, which is one of the larger decreases in the validation accuracy of this study compared to the modelling accuracy possible reason (Große‐Stoltenberg et al., Citation2018).

Analysis and comparison of inverse studies on functional traits in S. alterniflora

Currently, numerous studies have inverted the functional characteristics of S. alterniflora, as shown in . For example, Chen et al. (Citation2022) used a generative adversarial network with a constrained factor (GAN-CF) model and hyperspectral data to invert the ground biomass of S. alterniflora and obtained a good inversion effect. Similarly, González Trilla et al. (Citation2013) predicted the biomass, leaf area index, percent canopy cover, and other biophysical indicators of S. alterniflora in the Bahia Blanca estuary, Argentina, based on the relationship between spectral reflectance data and the normalized vegetation index and achieved good prediction results. Zhu et al. (Citation2020) used vegetation indices to predict the crude protein, fibre, and fat contents of S. alterniflora leaves based on hyperspectral data, and the findings confirmed the applicability of these data in the inversion of S. alterniflora characteristics. Furthermore, Ai et al. (Citation2015) used the correlation between the spectral and vegetation indices to invert the photosynthetic pigment content of S. alterniflora leaves in the Minjiang estuary wetland and found that FD spectral data had a good prediction effect on the photosynthetic pigment content of S. alterniflora leaves.

Table 4. Inversion of the functional traits.

In this study, we obtained the importance scores of 11 leaf functional traits from hyperspectral data of S. alterniflora leaves. Using these scores, we established inversion models for the leaf functional traits using the RF importance score based on the original and FD spectral data and the RF, SVM, XGboost, and BPNN models. Our results showed that the SVM model had good prediction ability for the functional traits of S. alterniflora leaves. Previous studies on S. alterniflora have focused more on component content traits such as chlorophyll and protein, paying comparatively less attention on elemental content and morphological structural traits, specifically at the leaf scale. While the accuracy of the present study is slightly lower than theirs, it distinguishes itself by employing a systematic approach that relies on four machine learning models to study morphological structural traits (e.g. FW, LT, SLA) and component content traits (e.g. C, N, P), which provides ideas for studying the functional traits of S. alterniflora. However, we acknowledge some limitations in this study, such as the lack of analysis of the best inversion model for some functional traits, which should be further explored in future research.

Conclusion

Our study focused on the hyperspectral inversion of functional traits of S. alterniflora leaves from the Yancheng coastal wetland, Jiangsu Province, China. We identified the characteristic bands between original spectrum and FD hyperspectral data and the functional traits of each leaf using the RF and XGBoost importance score. Inversion models for FW, LT, SPAD, TN, TP, TC, MC, SLA, N/P, C/N, and C/P of S. alterniflora were established, and the prediction results of XGBoost, BPNN, SVM, and RF were compared to identify the optimal data type and model. The following conclusions were drawn.

  1. The validation results demonstrated that the SVM model built based on the feature bands screened by RF importance score outperformed the remaining three models in terms of accuracy and stability for inverting the functional traits of S. alterniflora leaves.

  2. In terms of carbon, nitrogen, and phosphorus and their ecological chemometric traits, only the model of TC had predictability, and the model built based on the XGBoost importance score method showed better performance.

  3. Using the extracted characteristic bands from the original spectral and FD hyperspectral data, prediction models for different functional traits were developed and compared. The results showed that the FD hyperspectral data-based models generally outperformed the models based on the original data for most functional traits.

Supplemental material

Supplemental Material

Download MS Word (22.3 KB)

Supplemental Material

Download MS Word (23.3 KB)

Acknowledgments

We would like to express our gratitude to Deputy Director Chen Hao, Section Chief Zhang Yanan and Mr. Li Zhenghao of the reserve for their valuable assistance in conducting our field experiments.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

Hyperspectral data and functional trait data used in this study are available from the corresponding author upon request.

Supplementary material

Supplemental data for this article can be accessed online at https://doi.org/10.1080/22797254.2023.2294951

Additional information

Funding

This work was supported by the National Key R&D Program of China under Grant [2017YFC0506200].

References

  • Ai, J., Chen, W., Chen, L., Zhang, Y., Zhou, Y., Guo, X., & Chu, W. (2015). Hyperspectral remote sensing estimation models for foliar photosynthetic pigment contents at canopy level in an invasive species, Spartina alterniflora. Acta Ecologica Sinica, 35(4), 1175–15. https://doi.org/10.5846/stxb201304270834
  • An, S. Q., Gu, B. H., Zhou, C. F., Wang, Z. S., Deng, Z. F., Zhi, Y. B., Li, H. L., Chen, L., Yu, D. H., & Liu, Y. H. (2007). Spartina invasion in China: implications for invasive species management and future research. Weed Research, 47, 183–191. https://doi.org/10.1111/j.1365-3180.2007.00559.x
  • Asner, G. P., Knapp, D. E., Anderson, C. B., Martin, R. E., & Vaughn, N. (2016). Large-scale climatic and geophysical controls on the leaf economics spectrum. Proceedings of the National Academy of Sciences, 113(28), E4043–E4051. https://doi.org/10.1073/pnas.1604863113
  • Barbier, E. B., Hacker, S. D., Kennedy, C., Koch, E. W., Stier, A. C., & Silliman, B. R. (2011). The value of estuarine and coastal ecosystem services. Ecological Monographs, 81(2), 169–193. https://doi.org/10.1890/10-1510.1
  • Beltrami, M., & da Silva, A. C. L. (2020). A grid-quadtree model selection method for support vector machines. Expert Systems with Applications, 146, 113172. https://doi.org/10.1016/j.eswa.2019.113172
  • Blankespoor, B., Dasgupta, S., & Laplante, B. (2014). Sea-level rise and coastal wetlands. Ambio, 43(8), 996–1005. https://doi.org/10.1007/s13280-014-0500-4
  • Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324
  • Cai, F., Su, X., Liu, J., Li, B., & Lei, G. (2009). Coastal erosion in China under the condition of global climate change and measures for its prevention. Progress in Natural Science, 19(4), 415–426. https://doi.org/10.1016/j.pnsc.2008.05.034
  • Carlucci, M. B., Debastiani, V. J., Pillar, V. D., & Duarte, L. D. (2015). Between‐and within‐species trait variability and the assembly of sapling communities in forest patches. Journal of Vegetation Science, 26(1), 21–31. https://doi.org/10.1111/jvs.12223
  • Chen, T., & Guestrin, C. (2016). Xgboost: A scalable tree boosting system. Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, 785–794. https://doi.org/10.1145/2939672.2939785
  • Chen, C., Ma, Y., Ren, G., & Wang, J. (2022). Aboveground biomass of salt-marsh vegetation in coastal wetlands: Sample expansion of in situ hyperspectral and sentinel-2 data using a generative adversarial network. Remote Sensing of Environment, 270, 112885. https://doi.org/10.1016/j.rse.2021.112885
  • Chen, X. P., Sun, J., Wang, Y., Zhang, H. Y., He, C. Q., Liu, X. Y., Bu, N. S., & Long, X. E. (2018). Temporal and spatial impact of Spartina alterniflora invasion on methanogens community in Chongming Island, China. Journal of Microbiology, 56(7), 507–515. https://doi.org/10.1007/s12275-018-8062-y
  • Chen, Y., Wang, J., Liu, G., Yang, Y., Liu, Z., & Deng, H. (2019). Hyperspectral estimation model of forest soil organic matter in northwest Yunnan Province, China. Forests, 10(3), 217. https://doi.org/10.3390/f10030217
  • Cui, L., Dou, Z., Liu, Z., Zuo, X., Lei, Y., Li, J., Zhao, X., Zhai, X., Pan, X., & Li, W. (2020). Hyperspectral inversion of Phragmites communis carbon, nitrogen, and phosphorus stoichiometry using three models. Remote Sensing, 12(12), 1998. https://doi.org/10.3390/rs12121998
  • Cui, L., Zuo, X., Dou, Z., Huang, Y., Zhao, X., Zhai, X., Lei, Y., Li, J., Pan, X., & Li, W. (2021). Plant identification of Beijing Hanshiqiao wetland based on hyperspectral data. Spectroscopy Letters, 54(5), 381–394. https://doi.org/10.1080/00387010.2021.1928224
  • Ding, X., Zhang, S., Li, H., Wu, P., Dale, P., Liu, L., & Cheng, S. (2020). A restrictive polymorphic ant colony algorithm for the optimal band selection of hyperspectral remote sensing images. International Journal of Remote Sensing, 41(3), 1093–1117. https://doi.org/10.1080/01431161.2019.1655810
  • Dou, Z., Li, Y., Cui, L., Pan, X., Ma, Q., Huang, Y., Lei, Y., Li, J., Zhao, X., & Li, W. (2019). Hyperspectral inversion of Suaeda salsa biomass under different types of human activity in liaohe estuary wetland in north-eastern China. Marine and Freshwater Research, 71(4), 482–492. https://doi.org/10.1071/MF19030
  • Gitelson, A. A., Merzlyak, M. N., & Chivkunova, O. B. (2001). Optical properties and nondestructive estimation of anthocyanin content in plant leaves. Photochemistry and Photobiology, 74(1), 38–45. https://doi.org/10.1562/0031-8655(2001)074<0038:OPANEO>2.0.CO;2
  • González Trilla, G., Pratolongo, P., Beget, M. E., Kandus, P., Marcovecchio, J., & DiBella, C. (2013). Relating biophysical parameters of coastal marshes to hyperspectral reflectance data in the Bahia Blanca estuary, Argentina. Journal of Coastal Research, 29(1), 231–238. https://doi.org/10.2112/JCOASTRES-D-11-00214.1
  • Große‐Stoltenberg, A., Hellmann, C., Thiele, J., Oldeland, J., Werner, C., & Schmidtlein, S. (2018). Invasive acacias differ from native dune species in the hyperspectral/biochemical trait space. Journal of Vegetation Science, 29(2), 325–335. https://doi.org/10.1111/jvs.12608
  • Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., & Witten, I. H. (2009). The WEKA data mining software: An update. Acm sigkdd Explorations Newsletter, 11(1), 10–18. https://doi.org/10.1145/1656274.1656278
  • He, N., Yan, P., Liu, C., Xu, L., Li, M., Van Meerbeek, K., Zhou, G., Zhou, G., Liu, S., Zhou, X., Li, S., Niu, S., Han, X., Buckley, T. N., Sack, L., & Yu, G. (2022). Predicting ecosystem productivity based on plant community traits. Trends in Plant Science, 28(1), 43–53. https://doi.org/10.1016/j.tplants.2022.08.015
  • Jankowska, E., & Włodarska-Kowalczuk, M. (2022). Habitat-builders complexity boosts associated fauna functional trait richness (zostera marina meadows, Baltic Sea). Ecological Indicators, 144, 109512. https://doi.org/10.1016/j.ecolind.2022.109512
  • Jiang, F., Cadotte, M. W., & Jin, G. (2022). Size-and environment-driven seedling survival and growth are mediated by leaf functional traits. Proceedings of the Royal Society B, 289(1983), 20221400. https://doi.org/10.1098/rspb.2022.1400
  • Kirwan, M. L., & Megonigal, J. P. (2013). Tidal wetland stability in the face of human impacts and sea-level rise. Nature, 504(7478), 53–60. https://doi.org/10.1038/nature12856
  • Levine, J. (2016). A trail map for trait-based studies. Nature, 529(7585), 163–164. https://doi.org/10.1038/nature16862
  • Liaw, A., & Wiener, M. (2002). Classification and regression by random forest. International Journal of Offshore and Polar Engineering, 2(3), 18–22.
  • Li, W., Dou, Z., Cui, L., Wang, R., Zhao, Z., Cui, S., Lei, Y., Li, J., Zhao, X., & Zhai, X. (2020). Suitability of hyperspectral data for monitoring nitrogen and phosphorus content in constructed wetlands. Remote Sensing Letters, 11(5), 495–504. https://doi.org/10.1080/2150704X.2020.1734247
  • Liu, M. C., Kong, D. L., Lu, X. R., Huang, K., Wang, S., Wang, W. B., Qu, B., & Feng, Y. L. (2017). Higher photosynthesis, nutrient‐and energy‐use efficiencies contribute to invasiveness of exotic plants in a nutrient poor habitat in northeast China. Physiologia plantarum, 160(4), 373–382. https://doi.org/10.1111/ppl.12566
  • Liu, C., Li, P., Xie, W., Sha, M., & Ding, W. (2021). Changes of sulfur fractions in sediment following Spartina alterniflora invasion in a seaward direction in a temperate salt marsh, China. Ecological Indicators, 131, 108217. https://doi.org/10.1016/j.ecolind.2021.108217
  • Liu, M., Mao, D., Wang, Z., Li, L., Man, W., Jia, M., Ren, C., & Zhang, Y. 1933. (2018). Rapid invasion of Spartina alterniflora in the coastal zone of mainland China: New observations from landsat OLI images. Remote Sensing, 10(12). https://doi.org/10.3390/rs10121933.
  • Liu, S., Yu, H. Y., Piao, Z. J., Chen, M. C., Yu, T., Kong, L. J., Zhang, L., Dang, J. M., & Sui, Y. Y. (2020). Study on extracting characteristic wavelength of soybean physiological information based on hyperspectral technique. Spectroscopy and Spectral Analysis, 40(11), 3542–3548. https://doi.org/10.3964/j.issn.1000-0593(2020)11-3542-07
  • Pacheco-Labrador, J., Migliavacca, M., Ma, X., Mahecha, M., Carvalhais, N., Weber, U., Benavides, R., Bouriaud, O., Barnoaiea, I., Coomes, D. A., Bohn, F. J., Kraemer, G., Heiden, U., Huth, A., & Wirth, C. (2022). Challenging the link between functional and spectral diversity with radiative transfer modeling and data. Remote Sensing of Environment, 280, 113170. https://doi.org/10.1016/j.rse.2022.113170
  • Rebelo, A. J., Somers, B., Esler, K. J., & Meire, P. (2018). Can wetland plant functional groups be spectrally discriminated? Remote Sensing of Environment, 210, 25–34. https://doi.org/10.1016/j.rse.2018.02.031
  • Reich, P. B., & Oleksyn, J. (2004). Global patterns of plant leaf N and P in relation to temperature and latitude. Proceedings of the National Academy of Sciences, 101(30), 11001–11006. https://doi.org/10.1073/pnas.0403588101
  • Sack, L., Scoffoni, C., John, G. P., Poorter, H., Mason, C. M., Mendez-Alonzo, R., & Donovan, L. A. (2013). How do leaf veins influence the worldwide leaf economic spectrum? review and synthesis. Review and Synthesis Journal of Experimental Botany, 64(13), 4053–4080. https://doi.org/10.1093/jxb/ert316
  • Sheaves, M., Brookes, J., Coles, R., Freckelton, M., Groves, P., Johnston, R., & Winberg, P. (2014). Repair and revitalisation of Australia׳ s tropical estuaries and coastal wetlands: Opportunities and constraints for the reinstatement of lost function and productivity. Marine Policy, 47, 23–38. https://doi.org/10.1016/j.marpol.2014.01.024
  • Suykens, J. A. (2014). Introduction to machine learning. In Academic press library in signal processing (Vol. 1, pp. 765–773). Elsevier. https://doi.org/10.1016/B978-0-12-396502-8.00013-9
  • Tang, X., Dou, Z., Cui, L., Liu, Z., Gao, C., Wang, J., Li, J., Lei, Y., Zhao, X., Zhai, X., & Li, W. (2022). Hyperspectral prediction of mangrove leaf stoichiometries in different restoration areas based on machine learning models. Journal of Applied Remote Sensing, 16(3), 034525–034525. https://doi.org/10.1117/1.JRS.16.034525
  • Tian, B., Wu, W., Yang, Z., & Zhou, Y. (2016). Drivers, trends, and potential impacts of long-term coastal reclamation in China from 1985 to 2010. Estuarine, Coastal and Shelf Science, 170, 83–90. https://doi.org/10.1016/j.ecss.2016.01.006
  • Violle, C., Navas, M. L., Vile, D., Kazakou, E., Fortunel, C., Hummel, I., & Garnier, E. (2007). Let the concept of trait be functional. Oikos, 116(5), 882–892. https://doi.org/10.1111/j.0030-1299.2007.15559.x
  • Wang, L., & Wei, Y. (2016). Revised normalized difference nitrogen index (NDNI) for estimating canopy nitrogen concentration in wetlands. Optik, 127(19), 7676–7688. https://doi.org/10.1016/j.ijleo.2016.05.115
  • Wan, R., Wang, P., & Wang, X. (2018). Modeling wetland aboveground biomass in the poyang lake national nature reserve using machine learning algorithms and landsat-8 imagery. Journal of Applied Remote Sensing, 12(4), 046029–046029. https://doi.org/10.1117/1.JRS.12.046029
  • Westoby, M., & Wright, I. J. (2006). Land-plant ecology on the basis of functional traits. Trends in Ecology & Evolution, 21(5), 261–268. https://doi.org/10.1016/j.tree.2006.02.004
  • Wold, S., Sjöström, M., & Eriksson, L. (2001). PLS-regression: A basic tool of chemometrics. Chemometrics and Intelligent Laboratory Systems, 58(2), 109–130. https://doi.org/10.1016/S0169-7439(01)00155-1
  • Yao, Y., Wang, X., Tan, X., & Shan, Y. (2022). Models building between leaf spectral reflectance and traits for typical marshland wetland plants based on plsr: A case of sanjiang national nature reserve. Scientia geographica sinica, 42(9), 1638–1645. https://doi.org/10.13249/j.cnki.sgs.2022.09.013
  • Yendrek, C. R., Tomaz, T., Montes, C. M., Cao, Y., Morse, A. M., Brown, P. J., McIntyre, L. M., Leakey, A. D. B., & Ainsworth, E. A. (2017). High-throughput phenotyping of maize leaf physiological and biochemical traits using hyperspectral reflectance. Plant Physiology, 173(1), 614–626. https://doi.org/10.1104/pp.16.01447
  • Yu, C., Cao, J., Du, W., Zhu, Z., & Xu, M. (2022). Changes in the population and functional profile of bacteria and fungi in the rhizosphere of Suaeda salsa is driven by invasion of spartina alterniflora. Ecological Indicators, 144, 109516. https://doi.org/10.1016/j.ecolind.2022.109516
  • Zhang, M., Li, M., Liu, W., Cui, L., Li, W., Wang, H., Wei, Y., Guo, Z., Wang, D., Hu, Y., Xu, W., Yang, S., Xiao, H., & Long, S. (2019). Analyzing the performance of statistical models for estimating leaf nitrogen concentration of phragmites australis based on leaf spectral reflectance. Spectroscopy Letters, 52(9), 483–491. https://doi.org/10.1080/00387010.2019.1619584
  • Zhang, R., Liu, H. M., Kou, X., Xu, Z. C., Yu, X. W., Cao, X. A., Wen, L., Ma, L. Q, & Wang, L. X. (2022). Functional traits of dominant plants and their adaptations in lakeshore wetlands of the inner mongolia plateau. Acta Ecologica Sinica, 42(19), 7773–7784. https://doi.org/10.5846/stxb202110092802
  • Zhou, Y., Jiao, L., Qin, H. J., Wu, J. J., & Che, X. C. (2022). Responses of leaf functional traits of clonal plant phragmites australis to heterogeneous environments. The Journal of Applied Ecology, 33(8), 2171–2177. https://doi.org/10.13287/j.1001-9332.202208.010
  • Zhu, Y., Wu, Y., Zhou, Z., Zhang, Z., & An, Y. (2020). Inversion of nutrient components of Spartina alterniflora based on hyperspectral data. Journal of Beijing Forestry University, 42(9), 92–99. https://doi.org/10.12171/j.1000−1522.20200008
  • Zhu, H., Yang, L., Li, D. B., He, L. P., Yue, C. L., & Li, H. P. (2023). Functional traits of twelve species of marsh plants in siming mountain, zhejiang Province. Acta Ecologica Sinica, 43(7), 2881–2890. https://doi.org/10.5846/stxb202203300788
  • Zuo, X. Y., Cui, L. J., Li, W., Dou, Z. G., Cai, Y., Liu, Z. J., & Lei, Y. R. (2021). Inversion of functional traits of spartina alterniflora leaves based on hyperspectral data. Acta Ecologica Sinica, 41(15), 6159–6269. https://doi.org/10.5846/stxb202004020781