Skip to main content

An automatic method for counting wheat tiller number in the field with terrestrial LiDAR

Abstract

Background

The tiller number per unit area is one of the main agronomic components in determining yield. A real-time assessment of this trait could contribute to monitoring the growth of wheat populations or as a primary phenotyping indicator for the screening of cultivars for crop breeding. However, determining tiller number has been conventionally dependent on tedious and labor-intensive manual counting. In this study, an automatic tiller-counting algorithm was developed to estimate the tiller density under field conditions based on terrestrial laser scanning (TLS) data. The novel algorithm, which is named ALHC, involves two steps: (1) the use of an adaptive layering (AL) algorithm for cluster segmentation and (2) the use of a hierarchical clustering (HC) algorithm for tiller detection among the clusters. Three field trials during the 2016–2018 wheat seasons were conducted to validate the algorithm with twenty different wheat cultivars, three nitrogen levels, and two planting densities at two ecological sites (Rugao & Xuzhou) in Jiangsu Province, China.

Result

The results demonstrated that the algorithm was promising across different cultivars, years, growth stages, planting densities, and ecological sites. The tests from Rugao and Xuzhou in 2016–2017 and Rugao in 2017–2018 showed that the algorithm estimated the tiller number of the wheat with regression coefficient (R2) values of 0.61, 0.56 and 0.65, respectively. In short, tiller counting with the ALHC generally underestimated the tiller number and performed better for the data with lower plant densities, compact plant types and the jointing stage, which were associated with overlap and noise between plants and inside the dense canopy.

Conclusions

Differing from the previous methods, the ALHC proposed in this paper made full use of 3D crop information and developed an automatic tiller counting method that is suitable for the field environment.

Background

Wheat is one of the primary food crops, feeding more than half of the world’s population [1, 2]. With a predicted world population of 9 billion in 2050, the demand for wheat is expected to increase by 60%–110% [3,4,5]. To meet this demand, the annual wheat yield increment must rise from the current level of below 1% to at least 1.6% [4, 5]. At present, wheat yield mainly relies on the increase in grain yield per unit area [6]. The increase in productive tiller number generally enhances yield potential over a range of environmental conditions [7]. Therefore, an accurate and efficient method for acquiring the tiller number in real time under field conditions can be helpful for determining a reasonable group density; assessing the efficiency of crop management, e.g., irrigation and applying fertilizer; or functioning as a phenotyping trait for breeding practices in the field.

To date, the most common method used for measuring tiller numbers is manual counting, which is extremely time-consuming and labor-intensive and is constrained by human error [8]. Remote sensing provides an alternative for the high-throughput evaluation of tiller number and has been widely used in the past several decades [10,11,12,13,14,15,16, 29]. The common practice is to establish a linear relationship between the tiller number and spectral features from optical remote sensing imagery. Flowers et al. [10] used spectral information from aerial photographs to predict wheat tiller density through correlation with a series of vegetation indexes (VIs). The results demonstrated that the VIs were reliable indicators for estimating wheat tiller density in the field. Since then, studies have tried to further prove the accuracy and reliability of VIs, especially the normalized difference vegetation index (NDVI), to forecast the tiller number [8,9,10,11]. However, the method for measuring tiller number using correlation with VIs is influenced by environmental factors and changes in the external conditions, such as crop varieties and field management.

With the development of image processing technology, image-based methods have been successfully applied to overcome some shortcomings of VIs [10, 11]. Boyle et al. [12] collected RGB images of a single spring wheat plant at different filming angles (0°, 45°, 90°) and calculated the number of wheat tillers using the Frangi filter algorithm. More recently, Constantino et al. [13] used an image processing system, Seight, to measure the height and number of tillers of a single plant, in which the Canny edge detection algorithm was utilized to detect the tillers. However, similar to the VIs, the counting accuracy with images from two-dimensional (2D) cameras is constrained by the illumination conditions at the time of image acquisition. Moreover, RGB images lack spatial and volumetric information, which is more closely related to plant function and yield-related traits [14].

More advanced than 2D RGB cameras, some sensors have the capability to collect three-dimensional information from the target objects. Innovative research was performed by Yang et al. [15] in which a high-throughput system (H-SMART) based on a conventional X-ray computed tomography (CT) system that was developed to automatically measure rice tillers could achieve a performance with a mean absolute error (MAE) of 0.22 at the tillering stage. Despite the high accuracy, the CT system requires a rigorous experimental environment to ensure the safety of the experimenter and to prevent genetic mutation in experimental objects. Inspired by the work of Yang’s team, Huang et al. [16] applied magnetic resonance imaging (MRI), which features nondestructive methods without radiation to acquire the number of rice tillers. However, these experiments are only applicable to the laboratory or the assembly line, and very few studies have explored methods to acquire tiller information automatically and efficiently in field environments.

Light detection and ranging (LiDAR), which is an active sensor, overcomes many of the disadvantages of passive sensing because it is capable of operation regardless of the ambient light conditions and directly provides highly accurate three-dimensional (3D) information [17,18,19]. While LiDAR use in forestry has been well established for decades [20,21,22,23], its application to cereal crops is still in the early stages [24]. Recently, LiDAR has drawn extensive attention in plant phenotyping, which has made the method popular as the key technical component for developing next-generation plant phenotyping techniques [19]. Currently, research is focused on rapid determination of biomass, plant height, and leaf area [20, 24,25,26,27,28]. However, there are few reports about the detection of tiller number using LiDAR. To our knowledge, Guo et al. [29] developed a high-throughput crop phenotyping platform, Crop 3D, which included a fractional module that used LiDAR to scan single rice plants into point clouds. The classical k-means clustering algorithm was used to calculate the tiller number, with an accuracy (R2) that reached 0.80. Although the k-means algorithm is simple and efficient, it is necessary to define the number of classes in advance, which does not apply to cases where the class number is unknown. Moreover, unlike the individual plants on the assembly line in Guo’s research, the canopy is much denser in the field, which is compounded by other uncontrollable factors such as wind, which also means much more severe occlusion between plants. Overall, direct tiller segmentation or detection in a field population based on large quantities of LiDAR point data and the inevitable accompanying noise is still a major challenge.

Therefore, the main objectives of this study were (1) to propose an automatic approach suitable for counting tiller number under field conditions based on TLS data for wheat and (2) to validate the applicability of the method under different treatments and growth stages. For the second objective, different wheat cultivars, plant densities, and nitrogen fertilization rates were designed in three field trials. This method is expected to provide a new perspective for the detection of wheat tiller number in three dimension and to enhance the practical use of TLS in precision agriculture.

Methods

Plant materials and growth conditions

2016–2017 field trial configuration

Experiment A (Exp. A): The field trial was conducted at the Rugao Experimental Demonstration Base (32°15′ N, 120°38′ E) of the National Information Agricultural Engineering Technology Center in Rugao City, Jiangsu Province. Two wheat cultivars, ‘Yangmai 15′ (V1) and ‘Yangmai 16′ (V2), were selected to represent compact and diffuse plant types, respectively. Three nitrogen rates of 0 kg/ha (N0), 150 kg/ha (N1, 484.91 g/plot), and 300 kg/ha (N2, 969.83 g/plot) were set, among which N1 was consistent with the average nitrogen level. Fifty percent of the N fertilizers were applied on the sowing day, and 50% were applied at the jointing stage. Two planting density levels were set for the experiment: 25 cm (2.4 × 106 seedlings/ha) for D1 and 40 cm (1.5 × 106 seedlings/ha) for D2, which corresponded to 26 rows/plot and 17 rows/plot. For each set of growing conditions, the experimental design was established with randomized blocks with three replicates, for a total of 36 plots, each of which had an area of 30 m2 (6 m × 5 m).

Experiment B (Exp. B): The second field trial was conducted at the Xuzhou Agricultural Research Institute in Xuzhou City, Jiangsu Province (34°19′ N, 117°2′ E). A total of 17 varieties were used: ‘Jimai 211′, ‘Saidema 5′, ‘Cunmai 11′, ‘Zhengmai 119′, ‘Loumai 956′, ‘Anke 1405′, ‘Zhumai 328′, ‘Xinong 528′, ‘Ruihua 1426′, ‘Luomai 6010′, ‘Xinong 501′, ‘Huaichuan 365′, ‘Zhengmai 1836′, ‘Zhumai 305′, ‘Xinong 364′, ‘Zhoumai 18CK’, and ‘Yanzhan 4110CK1′. In this trial, the nitrogen rate was set at 225 kg/ha (348.60 g/plot). Fifty percent of the N fertilizers were applied on the sowing day, and 50% were applied at the jointing stage. The field density was set with a unified line spacing (23 cm, 2.2 × 106 seedlings/ha, 38 rows/plot). The experimental design was established in randomized blocks with three replicates for a total of 51 plots, each of which had an area of 13 m2 (8.3 m × 1.6 m).

2017–2018 field trial configuration

Experiment C (Exp. C): The field trial was conducted at the Rugao Experimental Demonstration Base. Two wheat cultivars, ‘Shengxuan 6′ (V1) and ‘Yangmai 16′ (V2), were selected to represent compact and diffuse plant types, respectively. All other experimental conditions were identical to those in Exp. A.

The specific conditions for each trial (Exp. A, B, C) are shown in Table 1. Orthophotographs of the trials (A, B, C) are shown in Figs. 1, 2.

Table 1 Details of the field experiments
Fig. 1
figure 1

Images of the field experiment area in Rugao (Exp. A &C with two varieties, two planting density levels and three nitrogen rates) & Xuzhou (Exp. B with the same nitrogen rate and planting density level for 17 varieties); the left: the location of experiment sites; the right: the orthophotos captured by the UAV system

Fig. 2
figure 2

a RIEGL-VZ 1000 working in the field at Rugao at a height of 1.5 m; b the TLS data of the whole experimental area (the jointing stage, Rugao, 2017–2018) displayed in the supporting software RiCSAN PRO; ScanPos001-ScanPos008 represent the locations of the sites (colors indicate the canopy height); a circular blind zone with a diameter of 1 m around the sites

Data acquisition

Terrestrial laser scanning measurements

The terrestrial LiDAR system used in this study is a RIEGL-VZ 1000 (RIEGL, Austria, https://www.riegl.com), which is a pulsed 3D scanner that emits near-infrared lasers. The instrument operates on the principle that laser pulses are emitted from the instrument and, using a series of built-in rotating prisms, the laser lights are projected at different angles. The detector receives the signal reflected from the target object and records the target information. The specifications of the RIEGL-VZ 1000 are shown in Table 2.

Table 2 The specifications of the RIEGL-VZ 1000

To mitigate the occlusion effects, a multiple-scan scheme was conducted in all trials. According to previous research on the impact of the number and position of the scanning site [30], we settled on an 8-station scanning strategy (Fig. 3). Scanning sites outside the plots could not only supplement the point cloud information of the edge plots, but also obtained sufficient information about surrounding objects. This off-plot information served the registration of the point cloud between different scanning sites. The scan mode was set to mode 60, which indicated an angular resolution of 0.06° and corresponded to a point spacing of 10 cm at the 100 m scanning range.

Fig. 3
figure 3

The scanner position schemes: a in Rugao and b in Xuzhou. The varieties in the set of three rows are consistent; the red solid circle points stand for the scanning positions of the LiDAR instrument. Note: D represents the line spacings (D1: 25 cm, 2.4 × 106 seedlings/ha and D2: 40 cm, 1.5 × 106 seedlings/ha), V represents the varieties (V1: Shengxuan 6 and V2: Yangmai 16), and N represents the three levels of pure nitrogen (N0: 0 kg/ha, N1: 150 kg/ha, N2: 300 kg/ha)

Manual field measurements

In all three trials, the total tiller numbers were counted in two one-meter row sections per plot, and then, the number of tillers per unit area was calculated. The manual-counting data were used as the observed value to evaluate the algorithm. The manual measurements were performed at two stages (tillering and jointing), and the terrestrial LiDAR measurements were collected simultaneously (Table 3).

Table 3 Summary of the field sampling for the wheat experiments

Data processing and analysis

The preprocessing of TLS data was carried out in professional software bundled with the scanner. The coordinate registration was the first step. An iterative closest point (ICP) algorithm was applied to register each independent scanner coordinate to the same reference coordinate system. ICP calculated the transformation matrix by a least-squares method based on the corresponding points [31]. The coordinate registration was completed with an average error of 0.006 m for each campaign. Abnormal points floating above, which was caused by insects or small airborne particles, was removed manually. Then we manually intercepted a random row of data from each plot. Finally, the data of each area was exported into separate files.

In this study, the 3D point cloud data for the tillering and jointing stages obtained from the LiDAR measurements are used to estimate the tiller number after preprocessing, e.g., point cloud registration and detour. The processed data for a random row per plot are input into our algorithm to calculate the tiller number of this row (Trow). Then, the tiller density for the whole plot (Tplot) is calculated as follows: Tplot = Trow * r / S where r and S are the number of rows and area of the plot, respectively.

The automatic tiller-counting algorithm (ALHC)

The tiller-counting algorithm employs algorithms in two steps: (1) the adaptive layering (AL) algorithm and (2) the hierarchical clustering (HC) algorithm. We developed our algorithm in MATLAB (version 2016, MathWorks®, USA). A flowchart describing the process is provided in Fig. 4.

Fig. 4
figure 4

Workflow of the proposed tiller number estimation method (ALHC) using the TLS data

Description of the adaptive layering (AL) algorithm

This step takes a random row of preprocessed point cloud data from each plot as the input, and the number of wheat clusters is the output. In this study, the principle of the adaptive layering algorithm is to identify the interspaces between the wheat stems and to obtain the number of wheat clusters accordingly. A cluster of wheat refers to densely aggregated wheat plants, which usually contain several tillers and stems that may be from the same plant or different plants. It should be noted that the term "stem" here is borrowed for the tillering stage and redefined as the trunk of the aboveground tillers, excluding the leaves.

The specific implementation steps are as follows.

After preprocessing, a random row of wheat point data is obtained for each plot. The X axis is defined as parallel to the direction of the row, which means that the Y axis is perpendicular to the direction of the row and the direction of the plant height is the Z axis.

  1. 1.

    Layering & labeling. A row of wheat points is divided into 1 to n layers according to the plant height (H) from the top to bottom. The first layer (the 1st layer) along the Z axis and the last layer (the nth layer) are removed. The variable ‘n’ is determined as an optimal value based on the degree of overcounting and undercounting of the clusters observed in the layers compared with the planting density after the trial runs. Then, starting from the second layer, the smaller serial number layer is labeled as the ‘leaf’, and the adjacent layer in the sequence is labeled as the ‘stem’. Here, the labels are only used to distinguish among the layers. For example, layers 2 and 3 demonstrate one leaf-stem combination, with 2 as the leaf and 3 as the stem in Fig. 5A b.

    Fig. 5
    figure 5

    A illustrates the algorithm: a one row of wheat was layered; b a leaf layer was extracted and then a stem layer was extracted; c two layers were superimposed onto one mixed layer; (d) the number of wheat clusters (\({T}_{i}\) represented the number of wheat clusters) was calculated. b The workflow of the adaptive layering algorithm

  2. 2.

    Iterating. One leaf-stem combination participates in each iteration. The combination is projected along the Z axis to the XOY plane to form one layer of the superimposed point cloud, which retains only the X and Y coordinates of the points. The overlapping leaves allow the leaf point cloud to be considered continuous, while the stem point cloud is discontinuous due to the intervals between the wheat clusters. When the leaf and stem layers are projected along the Z axis to the XOY plane, the intervals are filled by the leaf points called ‘the continuous leaf part’, while the part where the leaf layer meets the stem layer is called ‘the mixed part’ (Fig. 5A c). The criterion for determining a cluster of wheat is to find a continuous leaf part that is located between two adjacent mixed parts. Once an eligible part is detected, the number of clusters (Ti-(i-1), where i represents the layer number) is added. The entire row is traversed to obtain Ti-(i-1), which completes an iteration.

  3. 3.

    Counting. A total of (n-3) iteration results (T2-3, T3-4, T4-5, …, T(n-2)-(n-1)) are compared to determine the leaf layer combination with the largest Ti-(i-1) as the separation layer between the stem and leaf. All layers above the separation layer are marked (excluding itself) as ‘leaf’, and the other layers are marked as ‘stem’. The final output of this step is the result of the iteration in step (2), in which one of the leaf layers and one of the stem layers are selected as the input.

    Thus far, the approximate number of clusters in a row has been calculated.

Description of the hierarchical clustering (HC) algorithm

An approximate wheat cluster number is obtained in the previous step. When the tillers are too close to each other, they cannot be distinguished with the layering algorithm. Therefore, the hierarchical clustering algorithm is introduced to further calculate the tiller number in each cluster, which may contain several tillers the same plant or different plants.

The specific steps used in the hierarchical clustering [32,33,34] are as follows.

(1) Calculate the similarity matrix of all points within a cluster. If one cluster contains N points, the similarity measurement matrix would be produced by N*N. Taking six random points (N = 6, and the matrix is 6*6) from the same cluster containing coordinate information (xpoint, ypoint, zpoint) as example to comprise a group of raw data (Fig. 6b). We obtain the similarity measurement matrix by calculating the similarity between each variable in the group.

Fig. 6
figure 6

Point clouds clustered based on hierarchical clustering: a 3D point cloud of a cluster of wheat as an abstract sample, where the red dots marked with numbers represent sample points; b table of the original data; c the similarity matrix metric and the shortest Euclidean distance are marked with italicized numbers; d clustering order; and e the clustering diagram

The similarity measurement determines the similarity between two classes. Commonly used similarity metrics include the distance coefficient, correlation coefficient, and the cosine of angle. The appearance of tillers is very similar, where the cosine of angle may not be suitable for. Moreover, it is hard to define correlation within the unstructured and unordered point data, which is also not our choice. However, there are differences in spatial position. Therefore, we choose the Euclidean distance, a frequently used distance coefficient, as the similarity metric in this study. The specific formula is as follows

$${d}_{ij}=\sqrt{\sum\nolimits_{k=1}^{n}{\left({x}_{ik}-{x}_{jk}\right)}^{2}}, i, j=\mathrm{1,2},3\dots ,m$$
(1)

where dij is the Euclidean distance between the variables i and j, n is the dimension of variables i and j, and xik and xjk are the data of the kth dimension for variables x and j, respectively.

(2) Calculate the Euclidean distance between all points, then merge the nearest two points into one class (Fig. 6c).

(3) Calculate the Euclidean distance between the newly generated class and other points, and end if the setting condition is met. Otherwise, continue the iteration until all points have been classified (Fig. 6d).

In our algorithm, the smallest unit is a cluster rather than a point. And the Euclidean distance d equaling 2 cm is set as the end condition of the iteration. We consider 2 cm as the minimum distance between two tillers based on the characteristics of wheat in field. To be discreet, we conduct a series of experiments to test the sensitivity of the algorithm to d. The total number of tillers per row is the sum of the number of tillers in each cluster, which is the cumulative value of all iteration result.

Accuracy assessment

We evaluate the agreement between the tiller density computed with our method and the reference tiller density (field measurements) by the coefficient of determination (R2), calculated in Eq. (2). We also assess the precision of the computed tiller density with respect to the reference data using the root mean square error (RMSE) and the root mean square relative error (RRMSE), which are calculated with Eqs. (3) and (4), respectively, as follows:

$${\text{R}}^{2}\text{=1-}\frac{\sum_{\text{i} = \text{1}}^{\text{n}}{\left({\text{O}}_{\text{i}}-{\text{P}}_{\text{i}}\right)}^{2}}{{\sum }_{\text{i} = \text{1}}^{\text{n}}{\left({\text{O}}_{\text{i}}-\stackrel{\mathrm{-}}{\text{P}}\right)}^{2}}$$
(2)
$$\mathrm{RMSE}=\sqrt{\frac{1}{n}\sum_{i=1}^{n}{\left({O}_{i}-{P}_{i}\right)}^{2}}$$
(3)
$$\mathrm{RRMSE}=\frac{\sqrt{\frac{1}{n}\sum_{i=1}^{n}{\left({O}_{i}-{P}_{i}\right)}^{2}}}{\frac{1}{n}\sum_{i=1}^{n}\left({O}_{i}-{P}_{i}\right)}\times 100\%$$
(4)

where Oi is the ith reference attribute measurement, Pi is the ith computed attribute measurement and n is the total number of measurements being compared.

Results

Estimation of the wheat tiller with terrestrial LiDAR

The three experiments were used as examples of the estimation of the wheat tiller through terrestrial LiDAR with ALHC. Regarding the efficiency, the total calculation time was around 18 min on a PC with Intel Core i5-8265U CPU @ 1.6 GHz and 8 GB RAM, comparing with six hours one person of manual counting in our case. The time consumption of manual and TLS methods is summarized in Table 4.

Table 4 The time consumption of manual and TLS methods

The results showed performance (Fig. 7). Exp. C achieved the highest accuracy, with R2, RMSE, and RRMSE values of 0.65, 102 tillers/m2, and 22.84%, respectively. The lowest accuracy was observed in Exp. B (R2 = 0.56, RMSE = 143 tillers/m2 and RRMSE = 26.26%). The accuracy of Exp. A was of R2 = 0.61, RMSE = 106 tillers/m2 and RRMSE = 34.53%, which was at the same ecological site as Exp. C. Overall, there was a general underestimation of the ALHC estimation over all three experiments.

Fig. 7
figure 7

The tiller number estimation results for a Exp. A, b Exp. B, and c Exp. C; the dotted line is a 1:1 line; different colors represent different levels of nitrogen (Orange: N0, Green: N1, Purple: N2, Red: one nitrogen level between N1 and of N2 in Xuzhou)

Accuracy of the estimated tiller numbers in the various treatments

To evaluate the performance of the algorithm across multiple treatments, growth stages, and planting seasons, we formed a complete database with 2 years of data (Exp. A & C).

First, to study the influence of the growth stages on the algorithm, we evaluated two years of data from the same ecological site (Rugao) for detailed comparison (Fig. 8). The results demonstrated that the correlation between the reference data and the jointing stage data (R2 = 0.77, RMSE = 70 tillers/m2, and RRMSE = 18.73%) was stronger than that with the tillering stage (R2 = 0.67, RMSE = 122 tillers/m2, and RRMSE = 33.23%). The growth stages had an important effect on the estimation accuracy, which revealed that the ALHC algorithm was more suitable for the jointing stage than the tillering stage.

Fig. 8
figure 8

Comparison between the observed values and estimated values for the different growth stages (a). Field images of the plots under the same treatments (b: tillering stage; c: jointing stage)

Second, to explore the influences of the planting densities (Fig. 9), two years of data of shared V2 with the two planting densities in the jointing stage were employed. As expected, higher accuracy was seen with lower planting density (R2 = 0.67, RMSE = 107 tillers/m2, and RRMSE = 28.13%). As the planting density increased, the accuracy was reduced (R2 = 0.59, RMSE = 115 tillers/m2 and RRMSE = 37.49%). The results indicated that the ALHC algorithm was negatively affected by planting density.

Fig. 9
figure 9

Comparison between the observed values and estimated values for the different planting densities (a). Field images of the plots under the same treatments (b: 25 cm; c: 40 cm) in the jointing stage

Finally, to explore the influences of the varieties, we employed two years of data of the jointing stage for the loose (Yangmai 16) and compact types (Shengxuan 6, Yangmai 15) (Fig. 10). Differences could be observed between the three varieties, which indicated that the loose plant type negatively impacted the estimation accuracy. Although Yangmai 15 and Shengxuan 6 were of the same plant type, the latter was estimated more accurately than the former.

Fig. 10
figure 10

Comparison between the observed values and estimated values of the different varieties (b, Shengxuan 6: RMSE: 114 tiller/m2, RRMSE: 26.23%; Yangmai 15: RMSE: 167 tiller/m2, RRMSE: 38.43%; Yangmai 16: RMSE: 218 tiller/m2, RRMSE: 38.75%). Indoor images of the plants of different varieties (a: compact plant type: Shengxuan 6; c: loose plant type: Yangmai 16)

Overall, although the different treatments affected the estimation accuracy of the ALHC, the accuracy fluctuated within a certain range.

In addition, we performed a series of experiments to explore the influence of the parameter (the Euclidean distance threshold d) in the algorithm on the results. Tanking the data from Exp. A (V2: Yangmai 16, D2: 40 cm, the jointing stage) as an example, the results were shown in Table 4. According to the numbers, the algorithm was sensitive to the changes of parameter. When d changed from small to large, the accuracy decreases first increased and then decreased. The trend implied that there was an optimal value to be explored (Table 5).

Table 5 The accuracy assessments of the ALHC on the example data from Exp. A (V2: Yangmai 16, D2: 40 cm, the jointing stage) with different d

Discussion

Pros and cons of the ALHC method with TLS data

We automatically counted the tiller number based on TLS data under field conditions for the first time. Although improvements in the accuracy are required, the ALHC could provide a new perspective for subsequent studies. Compared with a previous study that used a single clustering algorithm [29], the ALHC considered the spatial structure of the wheat plants in the adaptive layering step, where the dense canopy was segmented into small groups according to the distribution of the spaces between plants. Another consequence of this step was to sort the disordered point clouds along the X axis, which made the subsequent clustering step more efficient. Although the classical k-means algorithm adopted in Guo’s study [29] was versatile, the hierarchical clustering algorithm had the advantage of fewer input parameters and the lack of a need to define the number of classes in advance [35, 36], which was more suitable for tiller counting. Combining these two steps, we upgraded from a single plant scale to a plot scale and from indoor laboratories to outside fields. In fact, the theoretical max measurement range of the TLS can be expanded by adding scanning sites. The stitching of point clouds from different sites would also achieve the broad coverage. For vast fields prepared for breeding selection, TLS is very practical. Moreover, the ALHC is expected to have the potential to be transferred to other cereal crops (e.g., rice) with similar spatial structures in the field.

The primary limitation of the algorithm is the method of determining the threshold. First, in the first part of the adaptive layering step, the height of every layer was chosen based on the degree of overcounting and undercounting of clusters that were observed in the layers compared with the planting density after the trail runs, which was an inefficient and biased method. Then, in the clustering step, the Euclidean distance threshold of ending the iteration in the clustering step was settled at 2 cm, which was considered to be the minimum distance between the clusters of wheat after field observation. However, the distance between the individual wheat plants will change as the crop grows, especially under different treatments. We could conclude from the trends (Table 4) that there were optimal values for the distances, which could be determined by the appropriate methods. A one-size-fits-all and subjective distance threshold may cause underestimation in high-density plots or overestimation in low-density plots, resulting in a decrease in overall accuracy. Future work will be devoted to developing objective methods for determining dynamically changing thresholds to eliminate empirical bias, such as the Otsu thresholding method [37] and the probability density function (pdf). Moreover, a few variants and alternatives of the hierarchical clustering algorithm have been suggested for improving the function of the algorithm [33, 38], which may help with the algorithm performance under large numbers of disordered points and the accompanying noise.

Uncertainty of the ALHC counts of tiller number based on TLS data

This method has not only remedied the deficiency of passive remote sensing methods through using spatial information of the 3D point clouds but also filled the gap in counting the tiller number in the field. However, there were still some shortcomings. Compared with the previous studies [8,9,10,11,12,13, 15, 16, 29], the accuracy of the TLS data was unexpectedly low. The main reasons could be concluded from two aspects: (1) noise and (2) occlusion, which are currently inevitable in the TLS data.

It has been proven that noise negatively impacts TLS-derived parameters [39]. Noise may be generated by the common registration differences of point clouds, multiple echoes for one emitted shot, or instrumental errors associated with the measurement range [40]. Because it is more complicated than a single background laboratory, the field environment was surrounded by multisource noise, such as wind-caused noise. In our case, the denser the canopy was (the higher the planting density was), the more noise points it contained, which was intuitively reflected in the reduced accuracy. In contrast, more noise came from the edges of the crop foliage and the ground in the tillering stage, where it was covered by leaves in the jointing stage, which would contribute to higher accuracy [26]. In addition, noise was also one of the main reasons for overestimation. And the other was due to the incorrect recognition of leaves as tillers in the clustering step. Robust noise filtering algorithms are needed to make TLS a more powerful tool for crop field phenotyping.

Occlusion causes a significant loss of data. When the laser beam was intercepted by canopy elements, the space behind these elements was not sampled, which is called occlusion. In contrast to the individual plants monitored in the indoor laboratory, the overlap between and inside the plants was extremely challenging in the outdoor field, which prevents the laser from penetrating the canopy to obtain information from the middle and lower parts of the plant. In our case, for example, the accuracy declined with the changes in wheat varieties from a compact type to a diffuse type, where the occlusion was more serious in the latter. The same decrease in accuracy could also be observed with increased planting density. The incomplete canopy information caused by occlusion could also explain the general underestimation among our experiments (Fig. 7). Since the TLS was mounted on a tripod with a height of 2 m, the laser pulses stroke the plants at large incident angles (from 40° to 90°). To improve the laser’s penetration of the dense canopy, it would be beneficial to use a close to a nadir viewpoint instead of a gentle oblique viewpoint [41]. The airborne LiDAR could provide a smaller incident angle, which could penetrate the canopy more powerfully. Compensating for the loss of point cloud information by increasing the instrument height and increasing the number of scanning sites could also alleviate the problem [38].

In addition to the above methods, it is worth paying attention to methods based on machine learning, which could be tailored to learn complex features from large amounts of high-dimension data automatically [42]. These have the potential to outperform traditional methods.

Conclusion

This study proposed an automatic and nondestructive method for automatically counting wheat tillers in the field with terrestrial LiDAR data, which first separated the clusters with the adaptive layering algorithm and then detected the tillers in every cluster with the hierarchical clustering algorithm. The results showed that the ALHC method was promising and had the adaptability across different years, growth stages, planting densities, and ecological sites. However, due to the serious occlusion between and inside the plants and the numerous noise points, there was a general underestimation of the wheat tillers. The counting accuracy still needs to be improved in the ongoing work of replacing the missing points in the cloud and eliminating the noise points in the cloud. It would be meaningful to transfer the improved ALHC method to other cereal crops (e.g., rice) to potentially enhance the universality of the tiller counting algorithm.

Availability of data and materials

The datasets used and/or analyzed during the current study available from the corresponding author on reasonable request.

References

  1. Long SP, Ort DR. More than taking the heat: crops and global change. Curr Opin Plant Biol. 2010;13(3):240–7.

    Article  Google Scholar 

  2. Timothy GR, Graeme T, Gordon R. Save and Grow in practice: maize, rice, wheat. Rome: FAO; 2016.

    Google Scholar 

  3. Godfray HCJ, Beddington JR, Crute IR, Haddad L, Lawrence D, Muir JF, Pretty J, Robinson S, Thomas SM, Toulmin C. Food security: the challenge of feeding 9 billion people. Science. 2010;327(5967):812–8.

    Article  CAS  PubMed  Google Scholar 

  4. Tilman D, Balzer C, Hill J, Befort BL. Global food demand and the sustainable intensification of agriculture. Proc Natl Acad Sci USA. 2011;108(50):20260–4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Ray DK, Mueller ND, West PC, Foley JA. Yield trends are insufficient to double global crop production by 2050. PLoS ONE. 2013;8(6):e66428.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Cai T, Xu HC, Peng DL, Yin YP, Yang WB, Ni YL, Chen XG, Xu CL, Yang DQ, Cui ZY, Wang ZL. Exogenous hormonal application improves grain yield of wheat by optimizing tiller productivity. Field Crops Research. 2014;155:172–83.

    Article  Google Scholar 

  7. Naruoka Y, Talbert LE, Lanning SP, Blake NK, Martin JM, Sherman JD. Identification of quantitative trait loci for productive tiller number and its relationship to agronomic traits in spring wheat. Theor Appl Genet. 2011;123:1043–53.

    Article  CAS  PubMed  Google Scholar 

  8. Scotford IM, Miller PCH. Estimating tiller density and leaf area index of winter wheat using spectral reflectance and ultrasonic sensing techniques. Biosys Eng. 2004;89:395–408.

    Article  Google Scholar 

  9. Flowers M, Weisz R, Heiniger R. Remote sensing of winter wheat tiller density for early nitrogen application decisions. Agron. 2001;93:783–9.

    Article  Google Scholar 

  10. Phillips SB, Keahey DA, Warren JG, Mullins GL. Estimating winter wheat tiller density using spectral reflectance sensors for early-spring, variable-rate nitrogen applications. Agron. 2004;96:591–600.

    Google Scholar 

  11. Zhang M, Sun H, Li MZ, Zhang Q, Zheng LH. Prediction of wheat tiller number based on 4-band crop spectrometer. Chinese Society of Agricultural Machinery. 2016. https://doi.org/10.6041/j.issn.1000-1298.2016.09.046.

    Article  Google Scholar 

  12. Boyle RD, Corke FMK, Doonan JH. Automated estimation of tiller number in wheat by ribbon detection. Mach Vis Appl. 2016;27:637–46.

    Article  Google Scholar 

  13. Constantino KP, Gonzales EJ, Lazaro LM, Serrano EC, Samson BP. Towards an automated plant height measurement and tiller segmentation of rice crops using image processing. Mechatronics and Machine Vision in Practice. 2018;3:155–68.

    Article  Google Scholar 

  14. Yang WN, Duan LF, Chen G, Xiong L, Liu Q. Plant phenomics and high-throughput phenotyping: accelerating rice functional genomics using multidisciplinary technologies. Curr Opin Plant Biol. 2013;16(2):180–7.

    Article  CAS  PubMed  Google Scholar 

  15. Yang WN, Xu XC, Duan LF, Luo QM, Chen SB, Zeng SQ, Liu Q. High-throughput measurement of rice tillers using a conveyor equipped with x-ray computed tomography. Rev Sci Instrum. 2011;82(2):025102.

    Article  PubMed  Google Scholar 

  16. Huang ZF, Gong L, Liu CL, Huang YX, Niu QL. Measurement of rice tillers based on magnetic resonance imaging. IFAC-PapersOnLine. 2016;49(16):254–8.

    Article  Google Scholar 

  17. Deery D, Jimenez-Berni J, Jones H, Sirault X, Furbank R. Proximal remote sensing buggies and potential applications for field-based phenotyping. Agronomy. 2014;4:349–79.

    Article  Google Scholar 

  18. Tilly N, Hoffmeister D, Schiedung H, Hütt C, Brands J, Bareth G. Terrestrial laser scanning for plant height measurement and biomass estimation of maize. Int Arch Photogr Remote Sens Spat Inform Sci. 2014;7(7):181–7.

    Article  Google Scholar 

  19. Lin Y. LiDAR: An important tool for next-generation phenotyping technology of high potential for plant phenomics? Comput Electr Agric. 2015;119:61–73.

    Article  Google Scholar 

  20. Drakea JB, Dubayaha RO, Clarkb DB, Knoxd RG, Blaird JB, Hoftona MA, Chazdone RL, Weishampelf JF, Prince SD. Estimation of tropical forest structural characteristics using large-footprint lidar. Remote Sens Environ. 2002;79:305–19.

    Article  Google Scholar 

  21. Dassot M, Constant T, Fournier M. The use of terrestrial LiDAR technology in forest science: application fields, benefits and challenges. Ann For Sci. 2011;68:959–74.

    Article  Google Scholar 

  22. Wulder MA, White JC, Nelson RF, Næsset E, Ørka HO, Coops NC, Hilker T, Bater CW, Gobakken T. Lidar sampling for large-area forest characterization: A review. Remote Sens Environ. 2012;121:196–209.

    Article  Google Scholar 

  23. Chen YM, Zhang WM, Hu RH, Qi JB, Shao J, Li D, Wan P, Qiao C, Shen AJ, Yan GJ. Estimation of forest leaf area index using terrestrial laser scanning data and path length distribution model in open-canopy forests. Agric For Meteorol. 2018;263:323–33.

    Article  Google Scholar 

  24. Jimenez-Berni JA, Deery DM, Rozas-Larraondo P, Condon AG, Rebetzke GJ, James RA, Bovill WD, Furbank RT, Sirault XRR. High throughput determination of plant height, ground cover, and above-ground biomass in wheat with LiDAR. Fornt Plant Sci. 2018;9:237–55.

    Article  Google Scholar 

  25. Tripathi P, Behera MD. Plant height profiling in western India using LiDAR data. Curr Sci. 2013;105(7):970–7.

    Google Scholar 

  26. Eitel JUH, Magney TS, Vierling LA, Brown TT, Huggins DR. LiDAR based biomass and crop nitrogen estimates for rapid, non-destructive assessment of wheat nitrogen status. Field Crops Res. 2014;159:21–322.

    Article  Google Scholar 

  27. Sun SP, Li CY, Paterson AH. In-field high-throughput phenotyping of cotton plant height using LiDAR. Remote Sens. 2017;9(4):377–98.

    Article  Google Scholar 

  28. Madec S, Baret F, de Solan B, Thomas S, Dutartre D, Jezequel S, Hemmerlé M, Colombeau G, Comar A. High-throughput phenotyping of plant height: comparing unmanned aerial vehicles and Ground LiDAR Estimates. Front Plant Sci. 2017;8:2002–166.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Guo QH, Wu FF, Pang SX, Zhao XQ, Chen LH, Liu J, Xue BL, Xu GC, Li L, Jing H, Chu C. Crop 3D—a LiDAR based platform for 3D high-throughput crop phenotyping. Sci China. 2018;61(3):328–39.

    Article  CAS  Google Scholar 

  30. Guo T, Fang Y, Cheng T, Tian YC, Zhu Y, Chen Q, Qiu XL, Yao X. Detection of wheat height using optimized multi-scan mode of LiDAR during the entire growth stages. Comput Electron Agric. 2019;165:104959–68.

    Article  Google Scholar 

  31. Besl PJ, McKay ND. A method for registration of 3-D shapes. IEEE Trans Pattern Anal Mach Intell. 1992;14(2):239–56.

    Article  Google Scholar 

  32. Zhou JX, Yan XM, Jiao JY. MATLAB from entry to proficient. 2nd ed. Beijing: People's Posts and Telecommunications Press; 2008.

    Google Scholar 

  33. Murtagh F, Contreras P. Algorithms for hierarchical clustering: an overview. WIREs Data Mining Knowl Discov. 2012;2:86–97.

    Article  Google Scholar 

  34. Cai RC, Zhang ZJ, Tung AKH, Dai CY, Hao ZF. A general framework of hierarchical clustering and its applications. Inf Sci. 2014;272:29–48.

    Article  Google Scholar 

  35. Saxena A, Prasad M, Gupta A, Bharill N, Patel OP, Tiwari A, Joo EM, Ding WP, Lin CT. A review of clustering techniques and developments. Neurocomputing. 2017;267:664–81.

    Article  Google Scholar 

  36. Mondal SA. An improved approximation algorithm for hierarchical clustering. Pattern Recogn Lett. 2018;104:23–8.

    Article  Google Scholar 

  37. Otsu N. A threshold selection method from gray-level histograms. IEEE Trans.Syst., Man, Cybernet. 1979;9:62–66.

  38. Ros F, Guillaume S. A hierarchical clustering algorithm and an improvement of the single linkage criterion to deal with noise. Expert Syst Appl. 2019;128:96–108.

    Article  Google Scholar 

  39. Malamboa L, Popescua SC, Horneb DW, Pughb NA, Rooney WL. Automated detection and measurement of individual sorghum panicles using density-based clustering of terrestrial lidar data. ISPRS. 2019;149:1–13.

    Google Scholar 

  40. Cuartero A, Armesto J, Rodríguez PG, Arias P. Error analysis of terrestrial laser scanning data by means of spherical statistics and 3D graphs. Sensors. 2010;10:10128–45.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Hosoi F, Omasa K. Estimation of vertical plant area density profiles in a rice canopy at different growth stages by high-resolution portable scanning lidar with a lightweight mirror. ISPRS J Photogramm Remote Sens. 2012;74:11–9.

    Article  Google Scholar 

  42. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521:436–44.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

We would like to thank Xiao Zhang and Miaomiao Zhai for their help in the data collection.

Funding

This work was supported by grants from the Key Projects (Advanced Technology) of Jiangsu Province (BE 2019383), National Natural Science Foundation of China (31671582, 31971780), the Jiangsu Collaborative Innovation Center for Modern Crop Production (JCICMCP), the Qinghai Project of Transformation of Scientific and Technological Achievements (2018-NK-126), Xinjiang Corps Great Science and Technology Projects (2018AA00403), and the 111 project (B16026).

Author information

Authors and Affiliations

Authors

Contributions

FY and XLQ were first co-authors. FY, YX and QX designed and wrote the algorithms to process the point cloud data; XQ, TG, and YW were involved in the development of algorithms. TG, CT, ZY, CQ, CW, YX and MZ designed and performed the experimental work, and acquired data. FY, NQ, HY, GL drafted the manuscript, and all authors contributed equally to the editing and improving the final manuscript.

Corresponding author

Correspondence to Xia Yao.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Consent and approval for publication was obtained from all authors. Consent for the use of images and manual measurement was obtained from Nanjing Agricultural University.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Fang, Y., Qiu, X., Guo, T. et al. An automatic method for counting wheat tiller number in the field with terrestrial LiDAR. Plant Methods 16, 132 (2020). https://doi.org/10.1186/s13007-020-00672-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13007-020-00672-8

Keywords