Prediction of plant-level tomato biomass and yield using machine learning with unmanned aerial vehicle imagery

Tatsumi, Kenichi; Igarashi, Noa; Mengxue, Xiao

doi:10.1186/s13007-021-00761-2

Research
Open access
Published: 15 July 2021

Prediction of plant-level tomato biomass and yield using machine learning with unmanned aerial vehicle imagery

Plant Methods volume 17, Article number: 77 (2021) Cite this article

4133 Accesses
19 Citations
2 Altmetric
Metrics details

Abstract

Background

The objective of this study is twofold. First, ascertain the important variables that predict tomato yields from plant height (PH) and vegetation index (VI) maps. The maps were derived from images taken by unmanned aerial vehicles (UAVs). Second, examine the accuracy of predictions of tomato fresh shoot masses (SM), fruit weights (FW), and the number of fruits (FN) from multiple machine learning algorithms using selected variable sets. To realize our objective, ultra-high-resolution RGB and multispectral images were collected by a UAV on ten days in 2020’s tomato growing season. From these images, 756 total variables, including first- (e.g., average, standard deviation, skewness, range, and maximum) and second-order (e.g., gray-level co-occurrence matrix features and growth rates of PH and VIs) statistics for each plant, were extracted. Several selection algorithms (i.e., Boruta, DALEX, genetic algorithm, least absolute shrinkage and selection operator, and recursive feature elimination) were used to select the variable sets useful for predicting SM, FW, and FN. Random forests, ridge regressions, and support vector machines were used to predict the yield using the top five selected variable sets.

Results

First-order statistics of PH and VIs collected during the early to mid-fruit formation periods, about one month prior to harvest, were important variables for predicting SM. Similar to the case for SM, variables collected approximately one month prior to harvest were important for predicting FW and FN. Furthermore, variables related to PH were unimportant for prediction. Compared with predictions obtained using only first-order statistics, those obtained using the second-order statistics of VIs were more accurate for FW and FN. The prediction accuracy of SM, FW, and FN by models constructed from all variables (rRMSE = 8.8–28.1%) was better than that from first-order statistics (rRMSE = 10.0–50.1%).

Conclusions

In addition to basic statistics (e.g., average and standard deviation), we derived second-order statistics of PH and VIs at the plant level using the ultra-high resolution UAV images. Our findings indicated that our variable selection method reduced the number variables needed for tomato yield prediction, improving the efficiency of phenotypic data collection and assisting with the selection of high-yield lines within breeding programs.

Background

Tomato (Solanum lycopersicum L.) is one of the most widely and globally grown vegetables in the world and plays an important role in human health maintenance [1]. In 2018, the annual production of fresh tomatoes was about 180 million tons globally [2]. Approximately a quarter of those were cultivated for processing and were consumed as pastes, ketchup, salsa, and juice [3, 4]. The main production countries are China, India, Pakistan, Turkey, and the U.S., which account for approximately 60% of the world tomato production. Tomato production and harvested area are increasing every year [2]. In terms of health, tomato is a source of vitamin C, potassium, folate, and vitamin K, which have been linked to many health benefits, such as antioxidant protection against cancer, strengthening the heart, and constipation prevention [5].

Over the past few years, unmanned aerial vehicles (UAVs) have been receiving much attention as ways to measure secondary traits, such as plant height (PH) and spectral reflectance, in a wide area because of the UAV advantages: ease of operation, highly flexible and timely control, super-high spatial resolution, and quick retrieval of wide-area field information owing to reduced planning time [6, 7]. A UAV can be equipped with a wide range of sensors useful in agricultural applications, such as RGB [8] and multispectral cameras [9]. In addition, UAVs have attracted much attention in the field of agricultural remote sensing due to the development of low-cost UAVs and imaging sensors. In particular, UAVs provide an entire new perspective to the agricultural landscape by collecting remote sensing data at very low altitudes. Regarding tomato, Senthilnath et al. [10] used a UAV to acquire RGB imagery of a tomato field and to classify tomato and non-tomato plants. However, they found that many fruits were pretermitted because they were visually obscured by leaves and stems. Johansen et al. [11] used a time series of RGB and multispectral datasets to delineate tomato plants using an automated object-based image analysis and to assess phenotypic traits of tomatoes including plant area, growth rates, condition, and plant projective cover. Furthermore, they used the mapped traits to identify tomato plant accessions that performed the best in terms of yield. Johansen et al. [12] researched the predictability of fresh shoot mass (SM), number of fruits (FN), and yield mass at harvest using UAV-based imagery and indicated that plant area, border length, width, and length of plant had the highest importance in the random forest approach to modeling of biomass and yield. Candiago et al. [13] examined the vegetation vigor of vineyards and tomatoes using three different vegetation indices (VIs) based on orthoimages and demonstrated the great potential of high-resolution UAV data. Enciso et al. [14] indicated that canopy cover estimated using a UAV was correlated with measured leaf area index. In other crops, UAV imagery for plant phenotyping has been applied for plant height assessment [15,16,17,18], crop growth and biomass, and yield [19,20,21,22,23]. In addition, machine learning (ML) approach with UAV imagery has been used to estimate biomass of crops including wheat [24], rice [25, 26], maize [22], and barley [27]. Except for studies by Moeckel et al. [28] and Johansen et al. [11, 12], we did not identify any studies that used UAV-based time series to predict tomato plant biomass and yield at harvest at the plant level.

The UAV-based studies on yield prediction with remotely sensed phenotypic traits during the growing period used a variety of artificial intelligence approaches, such as ML techniques, and obtained useful findings [29]. In contrast, collecting the required datasets of multitemporal traits needed for large-scale application of ML approaches remains time-consuming and computationally expensive. Currently, if a few principal UAV-derived phenotypic traits and growth stages for crop yield are usable, the data collection and processing effort can be efficient. Furthermore, to reduce computational complexity, improve efficient analysis of data and data understanding, determine essential phenotypic traits or growth stages, variable selection methods involve evaluating important phenotypic traits on yield.

Therefore, although a relatively high number of investigations of optimal variable selection of UAV-derived phenotypic traits and ML for prediction of grain yield have been conducted, only few studies have addressed the leading variable selection on UAV-derived phenotypic traits for prediction of tomato yield. Furthermore, higher-level feature information can be extracted from the ultra-high spatial resolution UAV-acquired imagery at plant level rather than extracting only basic statistics, such as mean and standard deviation, at the plot level. ML algorithms should provide the best predictive results for tomato yield using the selected principal variables. Accurate prediction of tomato yield using sensor-derived secondary traits, such as PH and spectral reflectance, will improve the accuracy genotype selection, shorten the breeding cycle, and reduce the labors in field phenotyping collection and data preprocessing. The specific aims of this study were as follows: (1) to select optimal feature variables for the yield prediction from the UAV-derived PH and VI maps; (2) to evaluate the predictive power for tomato SM, defined as the aboveground biomass except fruit part, fruit weight (FW), and FN using multiple ML algorithms with the set of the selected variables.

Materials and methods

Study site and experimental design

This study was conducted in an experimental research field at the Field Museum Fuchu, Tokyo University of Agriculture and Technology (35.68°N, 139.48°E). The tomato variety was “Natsunoshun,” which is suitable for processing tomato plants in open fields. The field is of predominantly andosol soil type. Tomato was grown during the growing season from May 13 to July 30, 2020 with three replications and a plot size of 5 × 5 m (Fig. 1). The plants were sown at a greenhouse nursery a month before transplanting, and they were transplanted directly into the ground without staking or trellising tomato plants with bamboo poles or wood stakes. They were arranged in five rows of approximately 0.85 m length with 0.40 m spacing between hills, producing a combined total of 210 plants. Pre-planting N, P, and K fertilizer (N:P:K = 10:10:10 kg 10 a⁻¹) was supplied in a split application before transplanting. Drip irrigation was applied through tubing into each plot in the amount of 500 ml for 30 min in the morning and evening for one week after transplanting. Following the initial irrigation period, plots were fed only by rainfall. For each plot, plastic agricultural mulch film was used and weed was periodically manually mowed.

Data acquisition by UAV

A gimbal-stabilized Zenmuse X5S camera (DJI Co., Ltd., Shenzhen, China) and multispectral image sensor camera Altum (MicaSense Co., Ltd., SEA, USA) mounted on a DJI Matrice 210 V2 (DJI Co., Ltd., Shenzhen, China) were used to collect the aerial RGB imagery and multispectral images using six bands (blue: 475 nm center, green: 560 nm center, red: 668 nm center, red edge: 717 nm center, near infrared: 842 nm center, and thermal infrared: 8–14 μm). The sensor resolution of each RGB megapixel and individual spectral image (except thermal infrared) was 5280 × 3956 and 2064 × 1554 pixels, respectively. The flight height was 12 m above ground to extract the features of each plant with ultra-high resolution and without being affected by wind caused by the UAV, and the forward and side overlaps were set as 90% and 70%, respectively. The RGB imagery and multispectral reflectance image data were acquired on May 24, May 30, June 5, June 11, June 18, June 26, July 2, July 12, July 16, and July 24. Both types of image data were obtained between 10:00 and 11:00 local time. Aerial multispectral images were radiometrically calibrated with a MicaSense’s Calibrated Reflectance Panel and MicaSense downwelling light sensor mounted on top of the UAV facing up towards the sky (Fig. 2). The reflectance values of the calibrated panel across blue, green, red, near infrared reflectance (nir), and red edge were 0.528, 0.531, 0.531, 0.529, and 0.531, respectively.

Flight parameter settings, including flight path, were designated using the flight planning software Pix4Dcapture (Pix4D S.A., Lausanne, Switzerland), and the ground sampling distance was 0.3 cm/pixel for RGB images and 0.52 cm/pixel for spectral images. Each sensor acquired 80 RGB and 400 spectral images on average per flight. Ground control points, for which we used a black and white cross-centered board, were placed at each of the four corners of the target field. Geometric calibration was conducted during the orthomosaic imagery process in Pix4Dmapper Pro version 4.6.3 (Pix4D S.A., Lausanne, Switzerland) using the ground control points. Digital surface model (DSM), which represents the elevation of plant structures, was created by Pix4Dmapper automatically. The digital terrain model (DTM) which represents the elevation of the soil surface, was estimated by interpolating segmented soil pixels. In this study, the threshold value for segmenting soil and vegetation pixels was set to NDVI = 0.1. For multispectral images, radiometric calibration was performed in Pix4Dmapper during the orthorectification process using the calibration data by panel reflectance values collected during the flight with the downwelling light sensor. Finally, the reflectance map of six-band with GeoTIFF format was obtained automatically using Pix4Dmapper.

Yield survey

SM, FW, and FN of individual plants in the tomato field were harvested on July 29 and July 30. These yield components were used for variable selection, prediction model training, and prediction accuracy analysis.

PH and VI calculation

PH maps were calculated by subtracting the DTM from the DSM. The three VIs typically used for measurements of leaf chlorophyll content, plant height, biomass, and crop growth indicators [9, 30,31,32] were calculated from the multispectral maps: green normalized difference vegetation index (GNDVI) [33], normalized difference vegetation index (NDVI) [34], and weighted difference vegetation index (WDVI) [35]. We selected these three VIs because (1) the use of several types of VIs is not necessarily efficient as time-series analysis takes time, and high correlation coefficients among the VIs causes multicollinearity; (2) NDVI and GNDVI are correlated with leaf chlorophyll content and are widely used for yield predictions [9, 30,31,32]; 3) WDVI can take into account the soil background influences.

$${\text{GNDVI}} = \left( {{\text{NIR}}{-}{\text{Green}}} \right)/\left( {{\text{NIR}} + {\text{Green}}} \right).$$

(1)

$${\text{NDVI}} = \left( {{\text{NIR}}{-}{\text{Red}}} \right)/\left( {{\text{NIR}} + {\text{Red}}} \right).$$

(2)

$${\text{WDVI}} = {\text{NIR}}{-}a \times {\text{Red}}{\text{.}}$$

(3)

Here, NIR is crop reflectance in the near infrared band, Green is crop reflectance in the green band, Red is crop reflectance in the red band, and a is the slope of the soil line.

Variable extraction from PH and VI maps

In order to extract the variables that may be related to the tomato biomass and yield of each plant, the following preprocessing was performed on the PH and VI maps.

(1)
Extraction of plant part: Tomato plant parts for each plant were extracted from the pixels with NDVI > 0.5 using the orthomosaic photo image from May 14. Some pixels identified as weeds were manually removed.
(2)
Centroid determination of each plant: Using the closed vector line of each plant sample obtained in step 1, the centroid of each plant was determined.
(3)
Region of interest (ROI) extraction of each plant: To calculate the feature variables for each plant, a circle with a radius of 20 cm centered on the centroid of each plant estimated in steps 1 and 2 was extracted as ROI.

In the present study, pixel statistics and dynamic growth rate were extracted as candidates for explanatory variables. In the ROI of each plant, five first-order statistics (average (AVE), standard deviation (SD), skewness (SKEW), range (RANGE), and maximum (MAX)) were extracted as basis statistics from PH and VI maps. Next, thirteen second-order statistics, which only considered the spatial pattern based on gray-level co-occurrence matrix (GLCM) [36], sum average (SA), entropy (Ent), different entropy (DE), sum entropy (SE), variance (Var), difference variance (DV), sum variance (SV), angular second moment (ASM), inverse difference moment (IDM), contrast (Con), correlation (Cor), and information measures of correlation (MOC-1, MOC-2) were derived from PH and VIs maps as GLCM features. Table 1 shows calculation formulas of the 13 extracted feature texture metrics. Next, dynamic average growth rates of PH and VIs were included as second-order statistics in this study and were considered as explanatory variables. In this study, growth rates were calculated for each plot as the change of PH and VIs over two consecutive measurement days divided by the measurement interval. As a result, a total of 756 variables (18 features (first- and second-order statistics) × 10 dates od PH (180 variables); 18 features × 10 days of three VIs (540 variables); 9 dynamic growth rates from PH (9 variables); 9 dynamic growth rates from three VIs (27 variables)) were extracted for variable selection.

Table 1 Selected gray-level co-occurrence matrix (GLCM) texture measures and their abbreviations and equations

Full size table

Variable selection for tomato yield prediction

To reduce computational complexity, promote efficient data analysis and data understanding, determine critical phenotypic traits or growth stages, variable selection methods involve evaluating important phenotypic traits on yield. In addition, variable selection is a fundamental step in ML algorithm and regression modeling. In the present study, the two groups of extracted variables, a total of 200 (5 first-order statistics × 10 dates for PH and 3 VI maps) and all 756 available variables, were handled as candidates in variable selection methods to explain tomato SM, FW, and FN. The reason we prepared two groups of variables is to determine if there is a difference in the prediction accuracy of tomato biomass and yield when using only basic statistics and all 756 variables as candidates. After generating first- and second-order statistics and dynamic growth rates from PH and VI maps, normalization was conducted before each variable selection procedure as preprocessing. Next, we applied five effective and powerful variable selection techniques to extract candidate variables. These were Boruta [37], DALEX [38], genetic algorithm (GA) [39], least absolute shrinkage and selection operator (LASSO) [40], and recursive feature elimination (RFE) [41].

Boruta is a non-parametric feature ranking and selection algorithm based on random forest algorithms that can decide if a variable is important and contributes to selection of statistically significant confirmed variables. DALEX is a potent non-parametric tool that explains various attributes such as implemented loss functions about the variables used in a machine learning model. GA is a non-parametric stochastic method for function optimization based on the mechanics of natural genetics and biological evolution. LASSO is a method of automatic parametric variable selection that eventually reduces the coefficients of certain unwanted features to zero due to penalization with L1-norm and minimizes the prediction error. RFE is a non-parametric feature selection that fits a model and removes the weakest feature until the specified number of features is reached. Furthermore, it attempts to eliminate dependencies and collinearity that may exist in the model. All variable selection processes were conducted using R (Version 3.6.3).

In this study, top five variables with the highest importance scores were selected as useful in predicting tomato SM, FW, and FN by five variable selection methods. Each of the variable groups selected from 200 and 756 variables were used as input for random forest (RF), ridge regression (RI), and support vector machine (SVM) in R to predict SM, FW, and FN. RF model is a machine learning method that can be used for a variety of tasks including classification and regression. It consists of a large number of decision trees and combines the predictors of the estimators to produce a more accurate prediction [42]. RI model is a technique for analyzing multiple regression data that suffer from multicollinearity, and it performs L2 regularization [43,44,45]. SVM model is a supervised machine learning algorithm that is used for classification, regression, and detection of outliers and is capable of addressing the collinearity issue [46]. Hyperparameters for RF, RI, and SVM were optimized using “tuneRF” function in randomForest package, tenfold cross validation, and “tune svm” functions in e1071 package, respectively. To evaluate the model performance, 80% of all observation data were used as training data and the remaining 20% was used for the model evaluation. The prediction performance of each model was evaluated using coefficient of determination (R²) and relative root mean square error (rRMSE).

Results and discussion

Temporal change of PH and VIs

Multitemporal UAV-derived data allow the quantitative evaluation of tomato growth with PH and VIs using DSM, DTM, and multispectral reflectance images. Figures 3 and 4 show the multitemporal growth change of PH and GNDVI, respectively, during the growing period in the entire field. Additional file 2: Figures S1 and S2 indicate the multitemporal maps of NDVI and WDVI, respectively. Yellower and greener pixels in Figs. 3 and 4, Additional file 2: Figures S1 and S2 mean taller PH and tomato vigor, respectively. PH increased linearly until flowering [around 160 DOY (day of year)]; after that, although the leaves spread horizontally, PH grew at a very small rate and remained almost unchanged during mature fruiting. In contrast, NDVI and GNDVI peaked on 184 DOY, and WDVI peaked on 194 DOY (Fig. 5). The growth trend of PH, which is a sigmoid curve, was also found in other research [47]. In addition, the phenomenon that VIs begin to decline at the end of the growing period is due to the leaf aging and yellowing. In summary, relatively large growth rate of PH and small growth rate of VIs were found until flowering date (mid-June); subsequently, the growth rate of PH slowed down and the growth rates of VIs increased. The ripening stage was characterized by a decrease in the VIs.

Variable selection

Tables 2, 3, and 4 show the selected top five variables according to averaged importance score estimated by Boruta, DALEX, GA, LASSO, and RFE for SM, FW, and FN, respectively. As for SM, first- and second-order statistics related to PH and VIs during mid-fruit formation stage (from late-June to mid-July) were selected by all variable selection methods. AVE and MAX of PH and RANGE of GNDVI selected from extracted first-order statistics, and SV and DV of NDVI selected from all statistics were ranked in top five by all variable selection methods (Table 2). Moreover, other second-order statistics features such as MOC-1, DV, SE, Entr, IDM, MOC-2, and Con were also ranked as selected variables. Selected first- and second-order statistics of VIs and PH from extracted basic and all variables describe homogeneity and heterogeneity in the entire field. Although basic statistics and GLCM features of PH and VIs are important for SM estimation, all variable selection methods selected more PH-related variables after flowering compared with results of FW and FN. Although these results may vary depending on the employed explanatory variables and the variable selection method, in this study, the importance of PH in SM estimation was confirmed. Prediction of SM is important for estimating the assimilation potential by leaf photosynthesis. Therefore, the relationship of PH and SM has been an interesting issue for researchers and breeders. It was potentially shown that the variables of PH and VIs in the later growth period were significant for the final SM estimation.

Table 2 Top five variables selected from first-order statistics and all variables by Boruta, DALEX, genetic algorithm (GA), least absolute shrinkage and selection operator (LASSO), and recursive feature elimination (RFE) for shoot mass (SM)

Full size table

Table 3 Top five variables selected from first-order statistics and all variables by Boruta, DALEX, genetic algorithm (GA), least absolute shrinkage and selection operator (LASSO), and recursive feature elimination (RFE) for fruit weight (FW)

Full size table

Table 4 Top five variables selected from first-order statistics and all variables by Boruta, DALEX, genetic algorithm (GA), least absolute shrinkage and selection operator (LASSO), and recursive feature elimination (RFE) for number of fruit (FN)

Full size table

In contrast, most selected variables by all variable selection models for FW were VI-related variables. RANGE of WDVI on June 18 selected by Boruta, DALEX, GA, and RFE from basic and all variables was ranked in top five variables (Table 3). Moreover, AVE of NDVI was also ranked in top five variables by all variable selection models except for GA from all available variables. Interestingly, the results show that VI variables about one month prior to harvest are critical for estimating FW. The finding that the VIs at the onset of fruiting are determinants of the final FW can be relevant for field management. Similar to FW, FN is an important factor for determining yield. Technology to estimate the FN on each plant, which is not visible in aerial images, can contribute to cultivation management. Although the selected variables by variable selection methods are different, AVE of NDVI or WDVI was selected in top five variables from both basic and all available variables (Table 4). In addition, it can be seen that the first- and second-order statistics of VIs on early to mid-fruit formation period are useful for estimating FN. However, unlike with SM, PH-related variables were found to be unnecessary variables for estimating FN.

Tomato yield prediction by models using selected variables

RF, RI, and SVM were built by using 80% (n = 120) of the data for training with a total of five sets of selected important variables. The performance of constructed models was evaluated using the remaining 20% for testing by tenfold cross validation. Figures 6, 7, and 8 show the relationship between observed and simulated SM, FW, and FN using RF, RI, and SVM models with the selected variables set, respectively. Table 5 indicates rRMSE between the observed and simulated values by RF, RI, and SVM models with the five sets of selected variables, reflecting the prediction accuracy of the validated model with test data.

Table 5 relative Root mean square error (rRMSE) value of tomato shoot mass (SM), fruit weight (FW), and number of fruits (FN) using random forest (RF), ridge regression (RI), and support vector machine (SVM) models with selected variables set from first-order statistics and all variables

Full size table

Comparing among the five different variable sets for SM prediction, variable sets from first-order statistics selected by Boruta and DALEX had better goodness of fit with RF model (R² = 0.41 for Boruta; R² = 0.45 for DALEX) (Fig. 6a) than the other combinations of variable selection method and prediction model. In the rRMSE, GA-selected variable set from all variables with RI and SVM had smaller absolute error of a model (rRMSE = 11.0% for RI; rRMSE = 11.2% for SVM). For SM prediction, RF with selected variable sets from first-order basic statistics had better performance in R²; whereas second-order statistics decreased the rRMSE value for many combinations of prediction models and variable selection methods (Table 5). For the comparison with R², it was an interesting result that prediction of SM does not require the GLCM texture information, and only the first-order statistics are sufficient to obtain the certain prediction accuracy.

Regarding FW, the following combinations had superior performance among the variable sets and prediction models combinations: Boruta-, DALEX-, and GA-selected variable sets from all variables with RI (R² = 0.73, 0.76, and 0.70, respectively) (Fig. 7d), LASSO-selected variable set from all variables with SVM (R² = 0.75) (Fig. 7f), and RFE-selected variable set from first-order statistics variables with RI (R² = 0.55) (Fig. 7c). In particular, focusing on the RI, the prediction accuracy of the models, except RFE, with the selected variables using all variables was greatly improved compared with that using selected variable set from first-order statistics only. For example, GA-selected variable set from first-order statistics with RI model had lower R² value (0.45), whereas that from all variables had higher R² value (0.70). IDM of NDVI from all variables was ranked as top one variable by GA (Table 3). IDM feature relates inversely to the contrast measure and is a direct measure of the local homogeneity of a digital image. Therefore, this result shows the importance of second-order statistics for predicting FW. For FN prediction, simulated value with selected variable sets from all variables by all prediction models had significantly higher goodness of fit compared with selected variable sets from first-order statistics. In particular, Boruta-, DALEX-, GA-, and RFE-selected important variable sets with RF (Fig. 8b), and LASSO-selected feature variable sets with SVM (Fig. 8f) achieved higher prediction performances compared with the other combinations of selected feature variable sets and prediction models (R² = 0.81 for Boruta; R² = 0.83 for DALEX; R² = 0.82 for RFE; R² = 0.77 for GA; R² = 0.82 for RFE; R² = 0.90 for LASSO).

In the present study, RF with Boruta-selected variable set from first-order statistics, RI with DALEX-selected variable set from all variables, and SVM with LASSO-selected variable set from all variables had best prediction performance R² with the observed SM, FW, and FN, respectively. Although it is difficult to make simple comparisons due to the differences of cultivation environments, varieties, and extracted variables, Li et al. [48] suggested that non-parametric (parametric) prediction model is adopted to match the non-parametric (parametric) variable selection. In this study, there were no clear effect relationships between parametric (non-parametric) variable selection method and parametric (non-parametric) model on tomato yield prediction accuracy. Furthermore, prediction accuracy of FW and FN using the selected variable set from all variables was significantly better compared with that using selected variable set from first-order statistics. Moreover, statistic variables of the VIs about one month before harvest were found to be important in predicting tomato yield.

Narrowing the focus to secondary traits and growth stages that affect tomato yield will contribute to more effective phenotypic data collection. In addition, the super-high-resolution field images obtained from the UAV provided helpful traits, such as temporal change of plant height and vegetation indices including secondary-order statistics of the field. Although we only used these three VIs in this study, we intend to use more VIs, including blue, red edge, and thermal bands, to predict the yield in the near future. Our next goal is to extract the features necessary to build a robust prediction model by testing the proposed variable selection with more data collected in multipoint and multiple years and thus contribute to the efficient selection of high-yield lines in breeding process.

Conclusion

In this study, we sought to examine the prediction accuracy of SM, FW, and FN using RF, RI, and SVM using variable sets selected by Boruta, DALEX, GA, LASSO, and RFE. PH and VIs (NDVI, GNDVI, and WDVI) from UAV-derived imagery were used for extraction of first-order basic statistics and second-order statistics (GLCM features and dynamic growth rate). First-order statistics of PH and VIs at early to mid-fruit formation period were ranked as important variables for prediction of SM by all feature selection methods. GLCM features of NDVI and WDVI from June 18 were significantly important for prediction of FW. Similar to FW prediction, GLCM features of VIs one month before harvest were significant to predict FN. Furthermore, all prediction models with the selected variable sets from all variables achieved good performance for FW and FN prediction compared with selected variable sets using basic statistics only. In particular, RF with Boruta-selected variable set from the basic statistics, RI with DALEX-selected variable set from all variables, and SVM with LASSO-selected variable set from all variables were best combinations for predicting SM, FW, and FN, respectively. These results indicate that filtering secondary traits and growth stages that contribute to the prediction of tomato yield can contribute to saving of time and labor required for phenotypic data collection and processing. In addition, it is possible to obtain useful features for breeding, other than first-order basic statistics, such as second-order statistics in PH and VIs for each plant, from the ultra-high-resolution image obtained by UAV. Overall, our findings indicate that reduced features needed for tomato yield prediction by variable selection method will help improve the efficiency of phenotypic data collection and assist with the selection of high-yield lines in breeding programs.

In traditional machine learning techniques, it is difficult to apply a trained model to other tomato cultivars or other crop species. In a future study, we will apply transfer learning techniques to extract the phenotype of different domains quantitatively using an expanded UAV dataset, and we will compare the results with those of the process in this study.

Availability of data and materials

The all datasets used analyzed in this study are available from the corresponding author on reasonable request.

References

Li N, Wu X, Zhuang W, Xia L, Chen Y, Wu C, et al. Tomato and lycopene and multiple health outcomes: umbrella review. Food Chem. 2021;343:128396.
Article CAS PubMed Google Scholar
FAO. FAOSTAT. http://www.fao.org/faostat/en/#home. Accessed 18 Jan 2021.
Ramasamy S, Ravishankar M. Integrated pest management strategies for tomato under protected structures. In: Sustainable management of arthropod pests of tomato. Elsevier; 2018, pp. 313–322.
Islam J, Kabir Y. Effects and mechanisms of antioxidant-rich functional beverages on disease prevention. In: Functional and medicinal beverages. Elsevier; 2019, pp. 157–198.
Megan Ware RDNLD. Everything you need to know about tomatoes. https://medilinkblog.com/everything-you-need-to-know-about-tomatoes/. Accessed 27 Dec 2020.
Shi Y, Thomasson JA, Murray SC, Pugh NA, Rooney WL, Shafian S, et al. Unmanned aerial vehicles for high-throughput phenotyping and agronomic research. PLoS ONE. 2016;11(7):e0159781.
Article PubMed PubMed Central CAS Google Scholar
Barbedo JGA. A review on the use of unmanned aerial vehicles and imaging sensors for monitoring and assessing plant stresses. Drones. 2019;3(2):40.
Article Google Scholar
Du M, Noguchi N. Monitoring of wheat growth status and mapping of wheat yield’s within-field spatial variations using color images acquired from UAV-camera system. Remote Sens. 2017;9(3):289.
Article Google Scholar
Duan T, Chapman SC, Guo Y, Zheng B. Dynamic monitoring of NDVI in wheat agronomy and breeding trials using an unmanned aerial vehicle. Field Crop Res. 2017;210:71–80.
Article Google Scholar
Senthilnath J, Dokania A, Kandukuri M, Ramesh KN, Anand G, Omkar SN. Detection of tomatoes using spectral-spatial methods in remotely sensed RGB images captured by UAV. Biosyst Eng. 2016;146:16–32.
Article Google Scholar
Johansen K, Morton MJL, Malbeteau YM, Aragon B, Al-Mashharawi SK, Ziliani MG, et al. Unmanned aerial vehicle-based phenotyping using morphometric and spectral analysis can quantify responses of wild tomato plants to salinity stress. Front Plant Sci. 2019;10:370.
Article PubMed PubMed Central Google Scholar
Johansen K, Morton MJL, Malbeteau Y, Aragon B, Al-Mashharawi S, Ziliani M, et al. Predicting biomass and yield at harvest of salt-stressed tomato plants using UAV imagery. Int Arch Photogramm Remote Sens Spatial Inf Sci. 2019;XLII-2/W13:407–11.
Article Google Scholar
Candiago S, Remondino F, De Giglio M, Dubbini M, Gattelli M. Evaluating multispectral images and vegetation indices for precision farming applications from UAV images. Remote Sens. 2015;7(4):4026–47.
Article Google Scholar
Enciso J, Avila CA, Jung J, Elsayed-Farag S, Chang A, Yeom J, et al. Validation of agronomic UAV and field measurements for tomato varieties. Comput Electron Agric. 2019;158:278–83.
Article Google Scholar
Holman F, Riche A, Michalski A, Castle M, Wooster M, Hawkesford M. High throughput field phenotyping of wheat plant height and growth rate in field plot trials using UAV based remote sensing. Remote Sens. 2016;8(12):1031.
Article Google Scholar
Hu P, Chapman SC, Wang X, Potgieter A, Duan T, Jordan D, et al. Estimation of plant height using a high throughput phenotyping platform based on unmanned aerial vehicle and self-calibration: example for sorghum breeding. Eur J Agron. 2018;95:24–32.
Article Google Scholar
Wang X, Singh D, Marla S, Morris G, Poland J. Field-based high-throughput phenotyping of plant height in sorghum using different sensing technologies. Plant Methods. 2018;14(1):53.
Article PubMed PubMed Central Google Scholar
Fathipoor H, Arefi H, Shah-Hosseini R, Moghadam H. Corn forage yield prediction using unmanned aerial vehicle images at mid-season growth stage. J Appl Rem Sens. 2019;13(03):1.
Article Google Scholar
Tattaris M, Reynolds MP, Chapman SC. A direct comparison of remote sensing approaches for high-throughput phenotyping in plant breeding. Front Plant Sci. 2016;7:1131.
Article PubMed PubMed Central Google Scholar
Yang G, Liu J, Zhao C, Li Z, Huang Y, Yu H, et al. Unmanned aerial vehicle remote sensing for field-based crop phenotyping: current status and perspectives. Front Plant Sci. 2017;8:1111.
Article PubMed PubMed Central Google Scholar
Ballesteros R, Ortega JF, Hernandez D, Moreno MA. Onion biomass monitoring using UAV-based RGB imaging. Precision Agric. 2018;19(5):840–57.
Article Google Scholar
Han L, Yang G, Dai H, Xu B, Yang H, Feng H, et al. Modeling maize above-ground biomass based on machine learning approaches using UAV remote-sensing data. Plant Methods. 2019;15(1):10.
Article PubMed PubMed Central Google Scholar
Nevavuori P, Narra N, Lipping T. Crop yield prediction with deep convolutional neural networks. Comput Electron Agricult. 2019;163:104859.
Article Google Scholar
Lu N, Zhou J, Han Z, Li D, Cao Q, Yao X, et al. Improved estimation of aboveground biomass in wheat from RGB imagery and point cloud data acquired with a low-cost unmanned aerial vehicle system. Plant Methods. 2019;15(1):17.
Article PubMed PubMed Central Google Scholar
Jiang Q, Fang S, Peng Y, Gong Y, Zhu R, Wu X, et al. UAV-based biomass estimation for rice-combining spectral, TIN-based structural and meteorological features. Remote Sens. 2019;11(7):890.
Article Google Scholar
Yang Q, Shi L, Han J, Zha Y, Zhu P. Deep convolutional neural networks for rice grain yield estimation at the ripening stage using UAV-based remotely sensed images. Field Crop Res. 2019;235:142–53.
Article Google Scholar
Escalante HJ, Rodríguez-Sánchez S, Jiménez-Lizárraga M, Morales-Reyes A, De La Calleja J, Vazquez R. Barley yield and fertilization analysis from UAV imagery: a deep learning approach. Int J Remote Sens. 2019;40(7):2493–516.
Article Google Scholar
Moeckel T, Dayananda S, Nidamanuri R, Nautiyal S, Hanumaiah N, Buerkert A, et al. Estimation of vegetable crop parameter by multi-temporal UAV-borne images. Remote Sens. 2018;10(5):805.
Article Google Scholar
Liakos K, Busato P, Moshou D, Pearson S, Bochtis D. Machine learning in agriculture: a review. Sensors. 2018;18(8):2674.
Article PubMed Central Google Scholar
Kyratzis AC, Skarlatos DP, Menexes GC, Vamvakousis VF, Katsiotis A. Assessment of vegetation indices derived by UAV imagery for durum wheat phenotyping under a water limited and heat stressed mediterranean environment. Front Plant Sci. 2017;8:1114.
Article PubMed PubMed Central Google Scholar
Guan S, Fukami K, Matsunaka H, Okami M, Tanaka R, Nakano H, et al. Assessing correlation of high-resolution NDVI with fertilizer application level and yield of rice and wheat crops using small UAVs. Remote Sens. 2019;11(2):112.
Article Google Scholar
Hassan MA, Yang M, Rasheed A, Yang G, Reynolds M, Xia X, et al. A rapid monitoring of NDVI across the wheat growth cycle for grain yield prediction using a multi-spectral UAV platform. Plant Sci. 2019;282:95–103.
Article CAS PubMed Google Scholar
Gitelson AA, Merzlyak MN. Remote sensing of chlorophyll concentration in higher plant leaves. Adv Space Res. 1998;22:689–92.
Article CAS Google Scholar
Rouse JW, Haas RH, Schell JA, Deering DW. Monitoring vegetation systems in the great plains with ERTS. In: Third Earth Resources Technology Satellite (ERTS) Symposium, NASA SP-351; 1973. pp 309–317.
Qi J, Chehbouni A, Huete AR, Kerr YH, Sorooshian S. A modified soil adjusted vegetation index. Remote Sens Environ. 1994;48:119–26.
Article Google Scholar
Haralick RM, Shanmugam K, Dinstein I. Textural features for image classification. IEEE Trans Syst Man Cybern. 1973;SMC-3(6):610–21.
Article Google Scholar
Kursa MB, Jankowski A, Rudnicki WR. Boruta—a system for feature selection. Fund Inform. 2010;101(4):271–85.
Google Scholar
Biecek P. DALEX: explainers for complex predictive models. J Mach Learn Res. 2018;19:1–14.
Google Scholar
Scrucca L. GA: a package for genetic algorithms in R. J Stat Soft. 2013;53(4):1–37.
Article Google Scholar
Tibshirani R. Regression shrinkage and selection via the lasso. J Roy Stat Soc: Ser B (Methodol). 1996;58(1):267–88.
Google Scholar
Guyon I, Weston J, Barnhill S, Vapnik V. Gene selection for cancer classification using support vector machines. Mach Learn. 2002;46(1/3):389–422.
Article Google Scholar
Breiman L. Random Forests. Mach Learn. 2001;45(1):5–32.
Article Google Scholar
Hoerl AE, Kennard RW. Ridge regression: applications to nonorthogonal problems. Technometrics. 1970;12(1):69–82.
Article Google Scholar
Hoerl AE, Kennard RW. Ridge regression: biased estimation for nonorthogonal problems. Technometrics. 1970;12(1):55–67.
Article Google Scholar
de Vlaming R, Groenen PJF. The current and future use of ridge regression for prediction in quantitative genetics. Biomed Res Int. 2015;2015:1–18.
Article Google Scholar
Cortes C, Vapnik V. Support-vector networks. Mach Learn. 1995;20(3):273–97.
Article Google Scholar
Esyanti RR, Dwivany FM, Almeida M, Swandjaja L. Physical, chemical and biological characteristics of space flown tomato (Lycopersicum esculentum) seeds. J Phys Conf Ser. 2016;771:012046.
Article CAS Google Scholar
Li J, Veeranampalayam-Sivakumar A-N, Bhatta M, Garst ND, Stoll H, Stephen Baenziger P, et al. Principal variable selection to explain grain yield variation in winter wheat from features extracted from UAV imagery. Plant Methods. 2019;15(1):123.
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank Shogo Takahashi and Makoto Niida from the Tokyo University of Agricultural and Technology, Japan, for their support with measurements.

Funding

This study was funded in part by JST PRESTO Grant Number JPMJPR1603, Japan, and JSPS KAKENHI Grant Number 16KK0169 and 19K15944.

Author information

Authors and Affiliations

Department of Environmental and Agricultural Engineering, Tokyo University of Agriculture and Technology, 3-5-8 Saiwai-cho, Fuchu, Tokyo, 183-8509, Japan
Kenichi Tatsumi
Faculty of Agriculture Tokyo University of Agriculture and Technology, 3-5-8 Saiwai-cho, Fuchu, Tokyo, 183-8509, Japan
Noa Igarashi
Graduate School of Agriculture, Tokyo University of Agriculture and Technology, 3-5-8 Saiwai-cho, Fuchu, Tokyo, 183-8509, Japan
Xiao Mengxue

Authors

Kenichi Tatsumi
View author publications
You can also search for this author in PubMed Google Scholar
Noa Igarashi
View author publications
You can also search for this author in PubMed Google Scholar
Xiao Mengxue
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

KT conceptualized the study, collected data, modelling, data analysis, wrote the paper—original draft. IN, and XM provided the field truth data. KT conducted the aerial data collections. KT, IN, and XM reviewed, edited, and finalized the paper. All authors have read and agreed to the published version of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Kenichi Tatsumi.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Competing interests

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Auxiliary formula for calculation of gray-level co-occurrence matrix (GLCM) features.

Additional file 2: Figure S1.

Spatial multitemporal normalized difference vegetation index (NDVI) (−). Figure S2. Spatial multitemporal weighted difference vegetation index (WDVI) (−).

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Tatsumi, K., Igarashi, N. & Mengxue, X. Prediction of plant-level tomato biomass and yield using machine learning with unmanned aerial vehicle imagery. Plant Methods 17, 77 (2021). https://doi.org/10.1186/s13007-021-00761-2

Download citation

Received: 18 March 2021
Accepted: 04 June 2021
Published: 15 July 2021
DOI: https://doi.org/10.1186/s13007-021-00761-2

Prediction of plant-level tomato biomass and yield using machine learning with unmanned aerial vehicle imagery