Article

A Study on Canopy Volume Measurement Model for Fruit Tree Application Based on LiDAR Point Cloud

1
College of Agricultural Engineering and Food Science, Shandong University of Technology, Zibo 255000, China
2
Shandong Academy of Agricultural Machinery Sciences, Jinan 252100, China
3
Shandong Key Laboratory of Intelligent Agricultural Equipment in Hilly and Mountainous Areas, Jinan 252100, China
*
Author to whom correspondence should be addressed.
Agriculture 2025, 15(2), 130; https://doi.org/10.3390/agriculture15020130
Submission received: 13 December 2024 / Revised: 2 January 2025 / Accepted: 7 January 2025 / Published: 9 January 2025
(This article belongs to the Section Digital Agriculture)
Figure 1. LZY604 driverless wheeled tractor.
Figure 2. Point cloud data acquisition. (a) Acquisition site. (b) Multi-view point cloud image.
Figure 3. SOR filter denoising before and after comparison. (a) Top view of before and after denoising. (b) Front view before and after denoising.
Figure 4. Point cloud of single fruit trees.
Figure 5. 3D point cloud plane projection. (a) X-axis projection. (b) Y-axis projection. (c) Z-axis projection.
Figure 6. Triangular sectioning and edge circle drawing.
Figure 7. 2D point cloud contour lines. (a) Comparison of contours in X-projection direction. (b) Comparison of contours in the Y-projection direction. (c) Comparison of contours in Z-projection direction.
Figure 8. Comparison of manual labeling Alpha Shape contour extraction algorithm with actual contour lengths. (a) Alpha Shape contour extraction algorithm manually labeled lengths. (b) Manual marking of actual contour lengths.
Figure 9. Comparison of before and after improvement of grid integral volume method. (a) Grid integral volume method before improvement. (b) Improved grid integral volume method.
Figure 10. Body element and point cloud presentation diagrams. (a) The body element is taken as 0.5 m. (b) The body element is taken as 0.1 m.
Figure 11. Comparison of results between the body element method and the improved grid integration method.
Figure 12. Test set results of a model test set for canopy volume measurement based on PLSR. (a) Plot of predicted versus true values for the test set. (b) Plot of test set scatter points versus residual fit.
Figure 13. BP neural network-based canopy volume measurement model test set results. (a) Plot of predicted versus true values for the test set. (b) Plot of test set scatter points versus residual fit.
Figure 14. Test set results of canopy volume measurement model based on GBDT. (a) Plot of predicted versus true values for the test set. (b) Plot of test set scatter points versus residual fit.
Figure 15. Comparison of sycamore canopy contour extraction and volume acquisition. (a) Comparison of contours in X-projection direction. (b) Comparison of contours in the Y-projection direction. (c) Comparison of contours in Z-projection direction. (d) Volume acquisition comparison.

Abstract

The accurate measurement of orchard canopy volume serves as a crucial foundation for wind regulation and dosage adjustments in precision orchard management. However, existing methods for measuring canopy volume fail to satisfy the high precision and real-time requirements necessary for accurate variable-rate applications in fruit orchards. To address these challenges, this study develops a canopy volume measurement model for orchard spraying using LiDAR point cloud data. For point cloud feature extraction, an improved Alpha Shape algorithm is proposed for extracting point cloud contours. This method improves the validity judgment for contour line segments, effectively reducing contour length errors on each 3D point cloud projection plane. Additionally, improvements to the grid integration volume method incorporate the effects of canopy gaps in height difference calculations, significantly enhancing the accuracy of canopy volume estimation. For feature selection, a random forest-based recursive feature elimination method with cross-validation was employed to screen 10 candidate features. Ultimately, five key features were retained for model training: the number of points in the point cloud, the 2D point cloud contour along the X- and Z-projection directions, the 2D width in the Y-projection direction, and the 2D length in the Z-projection direction. During model construction, the study optimized the hyperparameters of partial least squares regression (PLSR), backpropagation (BP) neural networks, and gradient boosting decision trees (GBDT) to build canopy volume measurement models tailored to the dataset. Experimental results indicate that the PLSR model outperformed the other approaches, achieving optimal performance with three principal components. The resulting canopy volume measurement model achieved an R² of 0.9742, an RMSE of 0.1879, and an MAE of 0.1161. These results demonstrate that the PLSR model exhibits strong generalization ability, minimal prediction bias, and low average prediction error, offering a valuable reference for precision control of canopy spraying in orchards.

1. Introduction

Chemical pest and disease control is essential in orchard management, with fruit trees requiring 8–15 pesticide applications annually [1], comprising about 30% of overall orchard management activities [2]. Traditional pesticide application methods, mainly relying on “high-capacity, rain-type” spraying, suffer from low utilization rates of merely 20% to 40% [3]. This inefficient method leads to substantial pesticide waste, excessive residues in agricultural products, and environmental contamination. Orchard precision variable pesticide application technology utilizes sensors to identify targets and regulates spray volume and wind power based on parameters such as target location, volume, and density [4]. This approach effectively overcomes the limitations of traditional spraying methods.
The precise application of sprays using wind sprayers requires both wind regulation and dosage control [5]. Wind power and dosage must be adjusted based on the canopy volume and branch density within the canopy to meet specific requirements [6]. The ratio of the volume of the canopy void to the total canopy volume can, to some extent, characterize the density of the branches inside the canopy. If the measured volume of the canopy gap is small, it may result in insufficient wind and inadequate pesticide application, leading to ineffective control. Conversely, if the measured volume of the canopy void is large, it can cause excessive wind and an over-application of pesticides, causing the spray to drift outside the canopy of the fruit tree, thereby wasting pesticides and severely polluting the agricultural environment. Therefore, accurate measurement of canopy volume is of great significance for the precise application of pesticides in orchards.
The traditional method for measuring canopy volume relies on manual measurements, primarily using tools such as tape measures and altimeters [7]. This approach measures canopy width, height, and other data, calculating the canopy volume through established volume formulas [8]. While this method is simple to operate, it suffers from low measurement accuracy [9], lacks real-time capability, and cannot be applied for precision variable pesticide application. However, the introduction of technologies such as ultrasonic sensors and LiDAR sensors for canopy volume measurement [10,11,12,13,14] has significantly enhanced both the accuracy and real-time performance of these measurements. Yu Long et al. [15] established a canopy parameter detection system for fruit trees utilizing an attitude heading reference system, RTK-DGPS, and a planar laser rangefinder. They employed a slicing method to obtain the canopy volume, demonstrating reduced error when compared to manual measurements. Li Qiujie et al. [16] utilized 2D LiDAR to collect frame-by-frame data for tree detection, segmentation, and trunk identification. Those data were then used to calculate the thickness of single-tree canopies, which were discretized into small rectangles to obtain the volume for each frame, subsequently accumulating these to derive the total canopy volume. Li Peng et al. [17] proposed a method for measuring fruit tree canopy volume based on an irregular triangular prism module. Their irregular triangular prism segmentation method accounted for the spatial variation of the canopy’s outer edge, in contrast to the rectangular prism segmentation method. The studies mentioned above have effectively addressed the issues of low accuracy associated with manual measurements and the lack of real-time performance, thereby enhancing the automation of canopy volume measurement. 
However, the various segmentation methods employed in these studies treat the canopy as a dense entity, which leads to measurement errors by failing to account for the branches and voids within the fruit tree canopy [7]. Zhou et al. [18] developed a multivariate regression analysis model to predict canopy volume. This model utilized real measurements of trees, inputting crown height and trunk radius calculated from LiDAR point clouds to estimate canopy volume. However, the actual crown values were derived from manual measurements and geometric volume approximations, resulting in significant errors for trees with large internal gaps. Consequently, the trained model exhibited similar inaccuracies for these types of trees. James P. Underwood et al. [19] discretized the point cloud into cubic voxels, multiplying the total voxel count by the voxel volume to estimate canopy volume. Given that LiDAR scanning is typically one-sided, they calculated the volumes of the two halves of the canopy separately and summed them to obtain the total canopy volume. Although this method is highly precise, it becomes computationally expensive for large point clouds, making real-time canopy volume detection challenging and limiting its application in precision spraying.
In precision variable pesticide application, canopy volume significantly influences both wind regulation and pesticide application volume. However, existing methods for measuring canopy volume often overlook canopy gaps, leading to substantial measurement errors and inadequate real-time performance, which renders them unsuitable for real-time precision variable pesticide application in orchards. To address these issues, this study develops a canopy volume measurement model utilizing LiDAR point cloud data. The input features utilized in the model comprise the number of point clouds, the 2D point cloud contour in the X-projection direction, the 2D point cloud contour in the Z-projection direction, the 2D width in the Y-projection direction, and the 2D length in the Z-projection direction. The canopy volume, derived from the grid integration volume method, serves as the output variable. Hyperparameter optimization is conducted for partial least squares regression (PLSR), backpropagation neural networks (BP), and gradient boosting decision trees (GBDT) utilizing the dataset from this study. Models are constructed using the optimal hyperparameters to provide support for an accurate canopy volume measurement model for wind regulation and dosage control in precision variable pesticide application technology.

2. Materials and Methods

2.1. Acquisition of Point Cloud Data

2.1.1. Experimental Platform

This paper employs an LZY604 wheeled tractor (manufacturer: Shandong Weifang Luzhong Tractor Co., Weifang, China) equipped with an autonomous driving function to collect LiDAR point cloud data from the fruit tree canopy. The tractor is equipped with a 16-line LiDAR system, and the installation setup is illustrated in Figure 1. The installation height is 205 cm, and the LiDAR system has a horizontal field of view of 360°, a measurement accuracy of ±2 cm, a rotation frequency set to 10 Hz, and a horizontal angle resolution of 0.2°. The tractor operates at a travel speed of 3.6 km/h.

2.1.2. Fruit Tree LiDAR Point Cloud Data Acquisition

The point cloud collection site is located in Jinniu Mountain Orchard, Tai’an City, Shandong Province (116°94′~116°97′ E, 36°13′~36°36′ N). This region is classified within the warm temperate semi-humid monsoon climate zone. The orchard primarily cultivates Fuji apple trees using a dwarf rootstock dense planting system. The row spacing between fruit trees is 4 m, with a plant spacing of 3 m. A total of 121 trees were sampled. The collected samples effectively reflect the current growth status of the fruit tree varieties in the experimental field. The point cloud collection process is illustrated in Figure 2, with measurements recorded in meters (m).

2.2. Point Cloud Data Pre-Processing

2.2.1. Point Cloud Denoising

During the actual point cloud data collection process, various factors, such as vibrations from the tractor, environmental wind, and the inherent characteristics of the fruit trees, can introduce unavoidable noise into the point cloud data. These noise points increase the volume of the point cloud data, which can negatively impact the accuracy of modeling and information extraction. Common point cloud denoising techniques include median filtering [20], mean filtering [21], Gaussian filtering [21], bilateral filtering [22], and statistical outlier removal (SOR) [23]. Among these methods, Gaussian and mean filtering smooth the data to achieve denoising but can cause edge distortion, making them unsuitable for this study since boundary profiles are critical features. Median filtering assumes a uniform data distribution, making it less flexible in handling regions with density variations, such as edges or sparse points. Bilateral filtering has complex parameters, making it challenging to achieve an appropriate balance between smoothing and edge preservation. In contrast, SOR is based on statistical analysis and assesses whether a point is an outlier by calculating the average distance between the point and its neighboring points. This technique effectively removes points that deviate significantly from the data distribution without smoothing, thereby preserving edge and detail information. Furthermore, SOR maintains the characteristics of point cloud data well in both sparse and dense areas, and its straightforward principle makes it suitable for the dataset used in this study. This study employs the Statistical Outlier Removal (SOR) technique to eliminate outliers from the point cloud data. The specific steps are as follows:
  • Calculate the average distance from each point to its K nearest neighbors. The value of K should be chosen according to the scale of the data and the density of the point cloud; after multiple trials, the best results were obtained with K = 20.
  • Calculate the global mean and standard deviation of these average distances over all points in the dataset.
  • Establish a distance threshold from the global mean and standard deviation and compare it with the average distance of each point. Points whose average distance exceeds the threshold are marked as outliers and removed from the dataset.
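The three steps above can be sketched in a few lines of Python. This is a minimal illustration rather than the implementation used in the study: the function name `sor_filter` is hypothetical, the neighbor search is brute force (a KD-tree would be preferable for full scans), and the multiplier `std_ratio` applied to the standard deviation is an assumed value, since the text does not state how the threshold is formed from the mean and standard deviation.

```python
import math

def sor_filter(points, k=20, std_ratio=1.0):
    """Statistical outlier removal on a list of (x, y, z) tuples.

    k=20 follows the text; std_ratio is an assumed multiplier that is
    not given in the paper.
    """
    n = len(points)
    if n <= k:
        raise ValueError("need more than k points")
    # Step 1: mean distance from each point to its k nearest neighbours
    # (brute-force search, adequate only for small examples).
    mean_dists = []
    for i, p in enumerate(points):
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        mean_dists.append(sum(dists[:k]) / k)
    # Step 2: global mean and standard deviation of those mean distances.
    mu = sum(mean_dists) / n
    sigma = math.sqrt(sum((d - mu) ** 2 for d in mean_dists) / n)
    # Step 3: keep only points whose mean distance is within the threshold.
    threshold = mu + std_ratio * sigma
    return [p for p, d in zip(points, mean_dists) if d <= threshold]
```

A larger `std_ratio` removes fewer points, so in practice it would be tuned together with K against the visual result, as in Figure 3.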
The comparison of the statistical outlier removal (SOR) denoising technique before and after application is presented in Figure 3. In Figure 3a, the top view comparison indicates that the SOR technique effectively removed outliers from the background, significantly reducing the impact of noise on subsequent processing. Figure 3b demonstrates that the SOR technique has a minimal impact on the canopy profile, effectively preserving edge and detail information, thereby ensuring the completeness and reliability of the subsequent canopy profile processing.

2.2.2. Target Area Extraction

The point cloud data contains a significant number of background points that are unrelated to the fruit trees, increasing the difficulty of subsequent processing. Since the precise application targets individual fruit trees, it is essential to obtain parameters for each fruit tree based on LiDAR point cloud data. To achieve this, CloudCompare (2.13.1) software is employed for the manual segmentation of the point cloud. The process begins with trimming from the frontal view, followed by a careful examination from multiple angles, including top and side views, to ensure the accuracy of the obtained point cloud data for a single fruit tree. Figure 4 illustrates a schematic representation of the point cloud for a single fruit tree after the cropping process.

2.3. Point Cloud Data Feature Information Acquisition

2.3.1. Canopy Boundary Contour Line Extraction

The alpha shapes algorithm proposed by H. Edelsbrunner [24] is a simple and effective method for rapidly extracting boundary points. For a planar point cloud of any shape, the contour is traced by a circle of fixed radius, set by the parameter α, as it rolls around the outside of the point cloud. However, the alpha shapes algorithm can only detect contour points in two-dimensional point clouds. Therefore, to extract the contour points of a three-dimensional point cloud, the points must first be projected onto a two-dimensional plane, typically the XOY plane.
A 3D point cloud consists of a collection of unordered points in 3D space, which can be projected onto any 2D plane to obtain 2D views and related feature information. A plane in three-dimensional space is written in the general form:
$$a x + b y + c z + d = 0 \tag{1}$$
where $(a, b, c)$ is the normal vector of the plane, which determines its orientation, and $d$ is the negative of the distance from the plane to the origin, defining the translation of the plane. Since this study projects onto the X, Y, and Z planes, which pass through the origin, $d$ is set to 0. For instance, when the projection plane is oriented in the Z direction (XOY), the values of $a$, $b$, and $c$ are 0, 0, and 1, respectively, and the other projection directions are handled analogously.
Assuming that $(x_1, y_1, z_1)$ and $(x_2, y_2, z_2)$ are two points on the plane, substituting both into Equation (1) and subtracting (so that $d$ cancels) gives:
$$a (x_1 - x_2) + b (y_1 - y_2) + c (z_1 - z_2) = 0 \tag{2}$$
Since $(x_1 - x_2, y_1 - y_2, z_1 - z_2)$ is a vector in the plane, Equation (2) shows that $(a, b, c)$ is perpendicular to this vector, i.e., $(a, b, c)$ is normal to the plane.
Suppose $(x_0, y_0, z_0)$ is any point in space and $(x, y, z)$ is its projection onto the plane. The vector joining these two points, $(x - x_0, y - y_0, z - z_0)$, must be parallel to the normal vector $(a, b, c)$, thus obtaining:
$$\frac{x - x_0}{a} = \frac{y - y_0}{b} = \frac{z - z_0}{c} = t \tag{3}$$
where $t$ is the signed parameter describing how far the point $(x_0, y_0, z_0)$ moves along the direction vector $(a, b, c)$.
The parametric form of this line is:
$$x = a t + x_0, \qquad y = b t + y_0, \qquad z = c t + z_0 \tag{4}$$
Substituting Equation (4) into the plane equation, Equation (1), yields:
$$t = -\frac{a x_0 + b y_0 + c z_0 + d}{a^2 + b^2 + c^2} \tag{5}$$
Substituting Equation (5) back into Equation (4) gives the projected coordinates; the projected point clouds are shown in Figure 5.
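As a worked illustration of Equations (4) and (5), the following minimal Python sketch projects a single point onto a plane given by its normal vector and offset. The function name `project_point` is hypothetical, and the sign of t is taken from substituting the parametric line into the plane equation.

```python
def project_point(point, normal, d=0.0):
    """Orthogonally project (x0, y0, z0) onto the plane a*x + b*y + c*z + d = 0."""
    x0, y0, z0 = point
    a, b, c = normal
    denom = a * a + b * b + c * c
    if denom == 0:
        raise ValueError("normal vector must be non-zero")
    # Equation (5): parameter of the foot of the perpendicular.
    t = -(a * x0 + b * y0 + c * z0 + d) / denom
    # Equation (4): move from the point along the normal by t.
    return (x0 + a * t, y0 + b * t, z0 + c * t)

# Projection in the Z direction (onto the XOY plane) simply zeroes z.
print(project_point((1.0, 2.0, 3.0), (0, 0, 1)))  # (1.0, 2.0, 0.0)
```

For the three axis-aligned planes used in the study, this reduces to dropping one coordinate, but the general form also covers oblique projection planes.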
Figure 7 presents a comparison between the standard Alpha Shape contour extraction algorithm and the improved Alpha Shape contour extraction algorithm across the X-, Y-, and Z-projection directions. The standard Alpha Shape contour extraction algorithm [25] first performs Delaunay triangulation on the 2D point cloud and then traverses every edge of every triangle to assess its validity. For each edge, the two possible centers of a circle of fixed radius passing through both endpoints are given by:
$$c = m \pm \sqrt{r^2 - \left(\frac{L}{2}\right)^2}\; n \tag{6}$$
where $c$ is the center of the circle; $m$ is the midpoint of the edge; $r$ is the radius of the circle, $r = 1/\alpha$ with $\alpha = 1.95$; $L$ is the length of the edge; and $n$ is the unit normal vector of the edge.
A KD-tree is used to count the points lying within radius r of each of the two circle centers. An edge is deemed valid if, for either circle center, the KD-tree query returns no more than 2 neighboring points; otherwise, it is considered invalid. In practice, however, where the point cloud is dense or the contour is irregular, the circle of a genuine contour edge can contain more than 2 points, and the criterion above fails to identify that part of the boundary. Figure 6 shows a triangulated section of the contour together with the circle drawn for one edge. Although this edge lies on the boundary contour, its circle contains 6 points, so it is incorrectly classified as a non-boundary edge. Consequently, the computed contour length is underestimated, resulting in a significant error.
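The standard validity test described above can be sketched as follows. This is an illustrative sketch under stated assumptions, not the authors' code: the function names are hypothetical, the neighbor count is done by brute force instead of a KD-tree, and a small tolerance is assumed so that the two edge endpoints, which lie exactly on the circle, are counted as inside.

```python
import math

def circle_centers(p, q, r):
    """Two centres of the radius-r circles passing through edge endpoints p and q."""
    length = math.dist(p, q)
    if length == 0 or length > 2 * r:
        return []  # no circle of radius r passes through both endpoints
    mx, my = (p[0] + q[0]) / 2, (p[1] + q[1]) / 2             # edge midpoint m
    nx, ny = -(q[1] - p[1]) / length, (q[0] - p[0]) / length  # unit normal n
    h = math.sqrt(max(r * r - (length / 2) ** 2, 0.0))
    return [(mx + h * nx, my + h * ny), (mx - h * nx, my - h * ny)]

def is_boundary_edge(p, q, points, r, tol=1e-9):
    """Standard test: the edge is valid if either circle holds at most 2 points."""
    for c in circle_centers(p, q, r):
        inside = sum(1 for a in points if math.dist(a, c) <= r + tol)
        if inside <= 2:
            return True
    return False

# Unit square with one interior point; r = 1/alpha with alpha = 1.95.
pts = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0), (0.5, 0.5)]
r = 1 / 1.95
print(is_boundary_edge((0.0, 0.0), (1.0, 0.0), pts, r))  # outer edge
print(is_boundary_edge((0.0, 0.0), (0.5, 0.5), pts, r))  # interior edge
```

In this toy case the outer edge passes because the circle on its outward side contains only its two endpoints, whereas both circles of the interior edge contain a third point.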
To address these problems, this paper improves the Alpha Shape contour extraction algorithm, focusing primarily on the edge validity judgment that follows the triangulation. The improvement uses the cross product: the signs of two cross products determine on which side of the polyline formed by the two endpoints of the edge and the center of the circle each neighboring point within the circle lies. The formulas are as follows:
$$\begin{aligned} \overrightarrow{P c_1} \times \overrightarrow{P A} &= (x_c - x_p)(y_a - y_p) - (y_c - y_p)(x_a - x_p) \\ \overrightarrow{c_1 Q} \times \overrightarrow{c_1 A} &= (x_q - x_c)(y_a - y_c) - (y_q - y_c)(x_a - x_c) \end{aligned} \tag{7}$$
where $P$ and $Q$ are the two endpoints of the edge, with coordinates $(x_p, y_p)$ and $(x_q, y_q)$; $c_1$ is the center of the circle, with coordinates $(x_c, y_c)$; and $A$ is a neighboring point within the circle, with coordinates $(x_a, y_a)$. If the two cross products have the same sign, the neighboring point lies on one side of the polyline $P c_1 Q$. For an edge with more than two points within its circle, if all neighboring points lie on the same side of the polyline $P c_1 Q$, the edge is deemed a valid contour boundary.
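The same-side test can be illustrated with a short Python sketch. The reading adopted here is an assumption: a neighboring point counts as lying on one side of the polyline when its two cross products share a sign, and the edge is accepted only when every neighbor shares the same sign. The function names are hypothetical.

```python
def cross(o, u, v):
    """z-component of the cross product (u - o) x (v - o)."""
    return (u[0] - o[0]) * (v[1] - o[1]) - (u[1] - o[1]) * (v[0] - o[0])

def all_on_one_side(p, q, c1, neighbours):
    """True if every neighbour lies on one common side of the polyline P-c1-Q."""
    side = 0
    for a in neighbours:
        if a == p or a == q:
            continue                  # the edge endpoints themselves are skipped
        s1 = cross(p, c1, a)          # Pc1 x PA
        s2 = cross(c1, q, a)          # c1Q x c1A
        if s1 * s2 <= 0:
            return False              # signs differ (or the point is on the polyline)
        sign = 1 if s1 > 0 else -1
        if side == 0:
            side = sign               # remember the side of the first neighbour
        elif sign != side:
            return False              # neighbours fall on opposite sides
    return True

p, q, c1 = (0.0, 0.0), (2.0, 0.0), (1.0, 1.0)
print(all_on_one_side(p, q, c1, [(1.0, 0.2), (0.8, 0.3)]))  # True
print(all_on_one_side(p, q, c1, [(1.0, 0.2), (1.0, 1.5)]))  # False
```

In a full pipeline this check would be applied only to edges that fail the standard count-based test, so that dense boundary regions are not discarded.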
The comparison of the Alpha Shape contour extraction algorithm before and after improvement is illustrated in Figure 7. The unimproved Alpha Shape contour extraction algorithm exhibits a significant deviation from the actual contour and lacks regularity. In contrast, the improved Alpha Shape contour extraction algorithm effectively reduces instances of actual boundaries being misclassified as non-boundaries. However, it still presents cases where non-boundary points are incorrectly identified as boundaries.
To correct for these remaining errors, a relationship is fitted between the contour length obtained by the algorithm and the true contour length, yielding a more accurate contour boundary length. The true value is determined through analysis using CAD (2022) software, based on the following proportion:
$$\frac{l_{scad}}{l_s} = \frac{l_{tcad}}{l_t} \tag{8}$$
where $l_{scad}$ is the length of the contour produced by the Alpha Shape contour extraction algorithm as manually marked in SolidWorks, $l_s$ is the contour length obtained by the Alpha Shape contour extraction algorithm, which has a large error, $l_{tcad}$ is the length of the actual boundary manually marked in CAD, and $l_t$ is the more accurate real contour length; $l_{scad}$ and $l_{tcad}$ are shown in Figure 8. Fitting $l_t$ against $l_s$ in SPSSAU software gave the following equation:
$$l_t = 4.57 + 0.0046\, l_s + 0.0082\, l_s^2 \tag{9}$$
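The proportion and the fitted quadratic above translate directly into two small helper functions. The sketch below is illustrative only: the function names are hypothetical, the proportion is read as l_scad / l_s = l_tcad / l_t, and the coefficients are those reported in the text, so they apply only to lengths in the same unit as the original fit.

```python
def true_length_from_marks(l_tcad, l_scad, l_s):
    """Solve the proportion l_scad / l_s = l_tcad / l_t for l_t."""
    if l_scad == 0:
        raise ValueError("l_scad must be non-zero")
    return l_tcad * l_s / l_scad

def corrected_contour_length(l_s):
    """Fitted quadratic mapping the algorithm's length l_s to the true length l_t."""
    return 4.57 + 0.0046 * l_s + 0.0082 * l_s ** 2

# Example: a contour measured as 10.0 by the algorithm.
print(corrected_contour_length(10.0))
```

The first function would be used to build the training pairs (l_s, l_t) for the fit, and the second to correct new contour lengths at run time.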

2.3.2. Canopy Volume Acquisition

The grid integration method [26] is a straightforward approach for estimating the volume of a point cloud. The core concept involves projecting the 3D point cloud onto a 2D plane and dividing that plane into regular grid cells. For each cell, the vertical distance between the highest and lowest points falling within it is calculated, giving the height difference. This height difference is multiplied by the cell area to obtain the approximate volume of that cell. Finally, the volumes of all cells are summed to give the total volume of the object represented by the point cloud.
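A minimal Python sketch of the standard grid integration method is given below. It assumes projection onto the XOY plane with Z as the height axis; the function name `grid_volume` is hypothetical, and the default cell size of 0.11 m is the grid size the study reports as optimal.

```python
import math
from collections import defaultdict

def grid_volume(points, cell=0.11):
    """Standard grid integration: sum of (z_max - z_min) * cell area over all cells."""
    cells = defaultdict(list)
    for x, y, z in points:
        # Bin each point by the grid cell that its XOY projection falls in.
        cells[(math.floor(x / cell), math.floor(y / cell))].append(z)
    area = cell * cell
    return sum((max(zs) - min(zs)) * area for zs in cells.values())
```

A cell containing a single point contributes zero volume, which is one reason the result is sensitive to the chosen grid size.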
The standard grid integration calculation does not account for gaps within the height range of a grid cell. The height difference is determined directly from the highest and lowest points, on the assumption that the canopy is continuous in height. However, an actual tree canopy contains numerous gaps and is often discontinuous, so this assumption leads to significant errors in the volume calculation. In this study, different grid sizes were tested and compared, and the optimal grid size was determined to be 0.11 m. To address the shortcoming of the grid integration method, the height difference calculation was improved by introducing a threshold equal to three times the grid size (0.33 m). Where the difference in Z value between two points in a cell exceeds this threshold, the interval between them is classified as a gap and excluded from the volume integration. Typically, the central region of the tree canopy is densely branched, while the outer area has sparse foliage, resulting in greater height differences in the center and smaller differences at the periphery. Figure 9 illustrates the comparison of crown height difference distributions between the standard grid integration volume method and the improved method. Analysis shows that the height difference distribution produced by the improved algorithm aligns more closely with actual conditions. However, the improved grid volume integration method requires extensive grid operations, mask filtering, and sorting, resulting in longer run times and a significant memory requirement to store intermediate results from point cloud calculations. This complexity poses challenges for real-time applications in orchard spraying. Therefore, the volume results obtained from the improved grid volume integration method are used as labels for volume prediction in a machine learning model, facilitating the development of an accurate and efficient real-time canopy volume measurement model for precision spraying.
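One way to realize the gap-aware height calculation is sketched below. It rests on an assumption not spelled out in the text: within each cell the Z values are sorted, and only the intervals between consecutive points that do not exceed the threshold are accumulated. The function names and the `gap_factor` parameter are hypothetical; the values 0.11 m and 3 are those given in the study.

```python
import math
from collections import defaultdict

def column_height(zs, gap_threshold):
    """Sum the vertical intervals between consecutive points, skipping gaps."""
    zs = sorted(zs)
    return sum(hi - lo for lo, hi in zip(zs, zs[1:]) if hi - lo <= gap_threshold)

def gap_aware_grid_volume(points, cell=0.11, gap_factor=3.0):
    """Grid integration in which intervals larger than gap_factor * cell are excluded."""
    threshold = gap_factor * cell
    cells = defaultdict(list)
    for x, y, z in points:
        cells[(math.floor(x / cell), math.floor(y / cell))].append(z)
    return sum(column_height(zs, threshold) for zs in cells.values()) * cell * cell

# One column with a 0.8 m void between two clusters of points.
column = [(0.05, 0.05, z) for z in (0.0, 0.1, 0.2, 1.0, 1.1)]
print(gap_aware_grid_volume(column))
```

For this column the standard method would integrate the full 1.1 m height, whereas the gap-aware version counts only the 0.3 m occupied by points, which illustrates why the improved method gives smaller and more realistic volumes for open canopies.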
Since the fruit tree canopy is an irregular three-dimensional structure, its overall spatial morphology cannot be precisely described by a mathematical model. Additionally, the manual measurement process does not account for the effects of voids, making it impossible to directly determine the theoretical value of the canopy volume. Wei Xuehua et al. [27] proposed the body element simulation method, which involves calculating the canopy volume using body elements with a side length of 0.2 m. A comparative analysis of traditional canopy volume calculation methods concluded that the body element simulation method is more accurate and objective. Therefore, this paper builds upon the body element simulation method, utilizing body element sizes of 0.15 m and 0.1 m when estimating results to serve as theoretical references for assessing volume acquisition accuracy. Furthermore, 30 samples were randomly selected for accuracy analysis.
In Figure 10, which displays the canopy body elements alongside the point cloud, the red squares represent the body elements, while the blue dots represent the canopy points. Figure 10a presents the body elements and point cloud for a canopy with a body element size of 0.15 m. In this case, the body elements enclose essentially all of the points within the canopy; however, they also enclose all of the internal gaps, which theoretically results in an overestimation of the measured volume. Conversely, Figure 10b depicts the body elements and point cloud for a canopy with a body element size of 0.1 m. Here, the body elements fail to cover all of the points in the canopy, leaving several large gaps within the canopy that are clearly not typical of normal canopy gaps. Consequently, the measured volume is theoretically underestimated.
From Figure 11, it is evident that the volume fluctuations of the body element simulation method and the improved grid integration method are largely consistent, demonstrating a high degree of agreement. This similarity arises from the fact that both volume measurement methods share the same fundamental principles, with the primary difference being their consideration of voids. In the body element simulation method, when the body element size is set to 0.15 m, it encompasses the voids within the canopy, leading to an inflated volume measurement. Conversely, when the body element size is reduced to 0.1 m, it fails to cover the entire point cloud, resulting in a diminished volume estimate. The volume obtained using the improved grid integration method in this study falls between the values derived from the body element simulation method for 0.15 m and 0.1 m, respectively. This observation aligns with the analysis that takes voids into account, indicating that the measurement accuracy is relatively higher.
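The body element reference volumes discussed above can be approximated with a simple occupancy count, sketched below in Python. This is an illustrative sketch rather than the cited method's exact procedure: it assumes that a body element is counted whenever it contains at least one point, and the function name `voxel_volume` is hypothetical.

```python
import math

def voxel_volume(points, size):
    """Occupied-voxel volume: number of cubes containing a point times the cube volume."""
    occupied = {tuple(math.floor(c / size) for c in p) for p in points}
    return len(occupied) * size ** 3

# Upper and lower reference volumes, as in the comparison with Figure 11.
cloud = [(0.01, 0.01, 0.01), (0.02, 0.02, 0.02), (0.5, 0.5, 0.5)]
print(voxel_volume(cloud, 0.15), voxel_volume(cloud, 0.1))
```

Evaluating the same cloud at 0.15 m and 0.1 m gives the two reference values between which the improved grid integration result is expected to fall.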

2.4. Feature Selection

Feature selection is a crucial aspect of feature engineering: it identifies the optimal subset of features and eliminates irrelevant or redundant ones. This reduces the number of features and can also improve model accuracy and shorten running time [28]. Feature selection methods fall into three categories: filter methods, wrapper methods, and embedded methods [29]. Filter methods score individual features by their dispersion or relevance and keep those that exceed a threshold, or a fixed number of top-ranked features. Because they rely on statistical properties (e.g., correlation, variance), they may overlook complex patterns and relationships between features and thus omit important ones. Wrapper methods evaluate feature subsets through model performance, which accounts for interactions between features and captures more complex relationships. They can determine the optimal number of features automatically through cross-validation, although at significant computational cost. Embedded methods integrate feature selection into model construction and exploit the model’s inherent characteristics; they therefore depend heavily on the chosen model, and a poorly chosen model may yield a suboptimal feature subset. Given the relatively small number of features in this study, the computational cost of a wrapper method is acceptable and it is expected to perform better; this paper therefore adopts the wrapper approach.
Recursive feature elimination with cross-validation (RFE-CV) combines cross-validation with recursive feature elimination, a wrapper approach that uses sequential selection with backward search [30]. The procedure selects a base model, trains it, eliminates the least important features step by step, and scores each resulting feature subset by cross-validation until the optimal number of features is identified. RFE-CV is widely used, exhibits good stability, and effectively captures complex relationships between features.
The goal of the feature selection phase is to identify a subset of features that are strongly related to the target variable, independently of the assumptions that the subsequent models make about feature relationships. Random forests are highly robust and make few assumptions about these relationships; in particular, they do not assume that the relationships are linear or non-linear. Therefore, this paper selects random forest as the base model for recursive elimination.
The dataset utilized in this study was input into the RFE-CV method combined with the Random Forest model to evaluate the following ten features: the number of points in the point cloud, the 2D point cloud outline in the X-projection direction, the 2D point cloud outline in the Y-projection direction, the 2D point cloud outline in the Z-projection direction, the 2D point cloud length in the X-projection direction, the 2D point cloud width in the Y-projection direction, the 2D point cloud length in the Z-projection direction, the 2D point cloud density in the X-projection direction, the 2D point cloud density in the Y-projection direction, and the 2D point cloud density in the Z-projection direction.
RFE-CV with the Random Forest base model identifies five important features: the number of points in the point cloud, the 2D point cloud outline in the X-projection direction, the 2D point cloud outline in the Z-projection direction, the 2D point cloud width in the Y-projection direction, and the 2D point cloud length in the Z-projection direction.
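The elimination loop described in this section can be sketched as follows. This is a minimal skeleton, not the study’s implementation: `rfe_cv`, `cv_score`, and `importances` are hypothetical names, and the two toy callables merely stand in for "cross-validate a random forest on this subset" and "read the feature importances of a random forest fitted on this subset". The toy feature names and scores are illustrative only.

```python
def rfe_cv(features, cv_score, importances):
    """Backward elimination that remembers the best cross-validated subset.

    cv_score(subset) -> float, higher is better (hypothetical callable).
    importances(subset) -> dict mapping each feature in subset to its importance.
    """
    current = list(features)
    best_subset, best_score = list(current), cv_score(current)
    while len(current) > 1:
        ranking = importances(current)
        weakest = min(current, key=lambda name: ranking[name])
        current.remove(weakest)        # drop the least important feature
        score = cv_score(current)      # re-score the reduced subset
        if score >= best_score:        # ties favour the smaller subset
            best_subset, best_score = list(current), score
    return best_subset, best_score

# Toy stand-ins for a random forest's importances and cross-validated score.
TOY_IMPORTANCE = {"n_points": 0.50, "outline_x": 0.30, "density_x": 0.15, "density_y": 0.05}
USEFUL = {"n_points", "outline_x"}

def toy_importances(subset):
    return {name: TOY_IMPORTANCE[name] for name in subset}

def toy_cv_score(subset):
    # Reward informative features and lightly penalise subset size.
    return len(USEFUL & set(subset)) - 0.01 * len(subset)

selected, selected_score = rfe_cv(list(TOY_IMPORTANCE), toy_cv_score, toy_importances)
print(selected, selected_score)
```

Passing the scoring and importance steps in as callables keeps the loop independent of the base model, so a real random forest could be substituted without changing the elimination logic.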

2.5. Model Construction for Canopy Volume Measurement

In practice, the most suitable model for each study area is not unique, and there is no single optimal machine learning model [31]. Therefore, model applicability screening must be considered when training canopy volume measurement models using machine learning. Given the variability of data characteristics, this paper selects three different types of models—partial least squares regression (PLSR), backpropagation (BP) neural networks, and gradient boosting decision trees (GBDT)—that can accommodate a wide range of datasets for training and evaluation. This study comprehensively analyzes the performance of these models on the current dataset to identify the optimal model for canopy volume measurement.

2.5.1. Model Construction for Canopy Volume Measurement Based on Partial Least Squares Regression

The partial least squares regression (PLSR) model [32,33] integrates the advantages of principal component analysis, canonical correlation analysis, and multiple linear regression analysis. The core idea is to identify new principal components (orthogonal projection directions) that maximize the covariance between the projected independent variables and the dependent variable, and to build the predictive model on them. Model development proceeds as follows. First, the covariance between the independent and dependent variables is calculated, and the first principal component is extracted with an iterative algorithm. Second, the extracted principal component is treated as a new independent variable, and the dependent variable is regressed on it. Finally, the extraction and regression steps are repeated on the remaining unexplained variation until a predetermined stopping criterion is met.
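The extract, regress, and repeat cycle can be illustrated with a small single-response PLS sketch. It assumes the NIPALS-style PLS1 variant, which the text does not specify, and the function names `pls1_fit` and `pls1_predict` and the toy data are hypothetical.

```python
import math

def pls1_fit(X, y, n_components):
    """Fit single-response PLS by repeated component extraction and deflation."""
    n, p = len(X), len(X[0])
    x_mean = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    E = [[row[j] - x_mean[j] for j in range(p)] for row in X]  # centred inputs
    f = [value - y_mean for value in y]                        # centred response
    components = []
    for _ in range(n_components):
        # Direction of maximum covariance between the inputs and the response.
        w = [sum(E[i][j] * f[i] for i in range(n)) for j in range(p)]
        norm = math.sqrt(sum(v * v for v in w))
        if norm < 1e-12:            # nothing left to explain
            break
        w = [v / norm for v in w]
        t = [sum(E[i][j] * w[j] for j in range(p)) for i in range(n)]  # scores
        tt = sum(v * v for v in t)
        loading = [sum(E[i][j] * t[i] for i in range(n)) / tt for j in range(p)]
        q = sum(f[i] * t[i] for i in range(n)) / tt   # regress response on scores
        # Deflate: remove what this component already explains.
        E = [[E[i][j] - t[i] * loading[j] for j in range(p)] for i in range(n)]
        f = [f[i] - q * t[i] for i in range(n)]
        components.append((w, loading, q))
    return x_mean, y_mean, components

def pls1_predict(model, x):
    x_mean, y_mean, components = model
    e = [x[j] - x_mean[j] for j in range(len(x))]
    y_hat = y_mean
    for w, loading, q in components:
        score = sum(e[j] * w[j] for j in range(len(e)))
        y_hat += q * score
        e = [e[j] - score * loading[j] for j in range(len(e))]
    return y_hat

# Toy, noise-free data: y = 1 + 2*x1 + 3*x2.
X_toy = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, 1.0]]
y_toy = [1.0, 3.0, 4.0, 6.0, 8.0]

def max_abs_error(n_components):
    model = pls1_fit(X_toy, y_toy, n_components)
    return max(abs(pls1_predict(model, x) - t) for x, t in zip(X_toy, y_toy))

error_one, error_two = max_abs_error(1), max_abs_error(2)
print(error_one, error_two)
```

On this toy data, two components reproduce the linear relation essentially exactly, while one component leaves a residual, which is the trade-off that the number of components controls.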
In the PLSR model, the number of principal components is a crucial hyperparameter. Selecting an appropriate number of principal components can effectively balance information retention and model simplification, leading to improved predictive performance [34]. If the number of principal components is too low, the model may fail to capture the nuances of data variation, resulting in underfitting. Conversely, if the number is too high, the model may incorporate noise or random fluctuations in the data, resulting in overfitting and increased model complexity. To address these challenges, this study employs a 10-fold cross-validation method to determine the optimal number of principal components. Given that the dataset contains a total of five features, the range for hyperparameter tuning is set from 1 to 5 for the number of principal components.
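The selection procedure itself, scoring each candidate value by 10-fold cross-validation and keeping the best, can be sketched independently of the model. In the sketch below the candidate range 1 to 5 mirrors the range given above, but the model is a toy nearest-neighbour averager rather than PLSR, the score is the negative mean squared error, and all function names and data are hypothetical.

```python
import random
from statistics import mean

def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle the sample indices and deal them into k folds of near-equal size."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]

def cross_val_score(fit, predict, X, y, k=10, seed=0):
    """Mean negative squared error over the k held-out folds (higher is better)."""
    fold_scores = []
    for held_out in k_fold_indices(len(X), k, seed):
        held = set(held_out)
        train = [i for i in range(len(X)) if i not in held]
        model = fit([X[i] for i in train], [y[i] for i in train])
        errors = [(predict(model, X[i]) - y[i]) ** 2 for i in held_out]
        fold_scores.append(-mean(errors))
    return mean(fold_scores)

# Toy stand-in model: average the targets of the nearest training inputs.
def fit_neighbours(X_train, y_train):
    return list(zip(X_train, y_train))

def make_predict(n_neighbours):
    def predict(model, x):
        nearest = sorted(model, key=lambda pair: abs(pair[0] - x))[:n_neighbours]
        return mean(target for _, target in nearest)
    return predict

X_demo = [i / 10 for i in range(40)]
y_demo = [2.0 * x + 1.0 for x in X_demo]

candidates = [1, 2, 3, 4, 5]
cv_scores = {
    c: cross_val_score(fit_neighbours, make_predict(c), X_demo, y_demo)
    for c in candidates
}
best_candidate = max(candidates, key=cv_scores.get)
print(best_candidate, cv_scores[best_candidate])
```

Every sample is held out exactly once, so each candidate is judged only on data that its fitted model has not seen.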

2.5.2. Canopy Volume Measurement Model Construction Based on BP Neural Network

The backpropagation (BP) neural network is a feedforward neural network that is trained and optimized with the backpropagation algorithm [35,36]. In model development, the first step is to set hyperparameters such as the learning rate and the number of neurons in each layer and to initialize the weights and biases. Sample data are then fed into the network through the input layer and propagated layer by layer until the final output is obtained. The error between the predicted and actual results is computed and propagated backward through the network, from the output layer to the input layer, using the chain rule. The gradient of the error with respect to each weight and bias is calculated from this backward pass, and the weights and biases are updated in the direction that reduces the error, with a step size set by the learning rate. These steps are repeated until the maximum number of iterations is reached.
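The forward pass, backward pass, and update cycle can be sketched for a network with a single hidden layer. The sketch is illustrative only: it assumes a tanh hidden layer, a mean squared error loss, full-batch gradient descent, and no regularization term, none of which the text specifies, and `train_bp` and the toy data are hypothetical.

```python
import math
import random

def train_bp(xs, ys, hidden=4, lr=0.05, epochs=300, seed=0):
    """Train a 1-input, 1-output network with one tanh hidden layer."""
    rng = random.Random(seed)
    w1 = [rng.uniform(-0.5, 0.5) for _ in range(hidden)]  # input-to-hidden weights
    b1 = [0.0] * hidden
    w2 = [rng.uniform(-0.5, 0.5) for _ in range(hidden)]  # hidden-to-output weights
    b2 = 0.0
    n = len(xs)
    loss_history = []
    for _ in range(epochs):
        gw1, gb1, gw2, gb2 = [0.0] * hidden, [0.0] * hidden, [0.0] * hidden, 0.0
        loss = 0.0
        for x, y in zip(xs, ys):
            # Forward pass.
            h = [math.tanh(w1[j] * x + b1[j]) for j in range(hidden)]
            pred = sum(w2[j] * h[j] for j in range(hidden)) + b2
            err = pred - y
            loss += err * err / n
            # Backward pass: chain rule from the output back to the input layer.
            d_pred = 2.0 * err / n
            gb2 += d_pred
            for j in range(hidden):
                gw2[j] += d_pred * h[j]
                d_hidden = d_pred * w2[j] * (1.0 - h[j] ** 2)
                gw1[j] += d_hidden * x
                gb1[j] += d_hidden
        loss_history.append(loss)
        # Gradient step scaled by the learning rate.
        for j in range(hidden):
            w1[j] -= lr * gw1[j]
            b1[j] -= lr * gb1[j]
            w2[j] -= lr * gw2[j]
        b2 -= lr * gb2
    return (w1, b1, w2, b2), loss_history

# Toy data: y = x^2 on five points.
xs_demo = [-1.0, -0.5, 0.0, 0.5, 1.0]
ys_demo = [x * x for x in xs_demo]
params, loss_history = train_bp(xs_demo, ys_demo)
print(loss_history[0], loss_history[-1])
```

The recorded loss makes the effect of the learning rate visible: a much larger `lr` can make the loss oscillate, while a much smaller one slows its decrease.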
Choosing the appropriate number of layers and neurons is crucial for model performance. Increasing the number of hidden layers and neurons can enhance the network’s ability to learn complex patterns; however, it may also lead to overfitting and increased computational burden. A learning rate that is too high can cause training instability and may prevent convergence, while a rate that is too low can result in a slow training process and may halt before reaching the optimal solution. Additionally, a regularization coefficient that is too large can lead to a smoother model, helping to avoid overfitting, but may result in underfitting. Conversely, a coefficient that is too small may cause overfitting. To optimize the hyperparameters of the BP neural network—such as the number of hidden layers, the number of neurons in each layer, the learning rate, and the regularization coefficient—ten-fold cross-validation is employed [37].
Based on the complexity of the dataset, an appropriate range of hyperparameters has been selected for the BP neural network model. The candidate values are as follows: the hidden-layer configurations (number of hidden layers and neurons in each layer) are [32, 64, 128, (32, 32), (64, 64), (128, 128), (64, 32), (128, 64), (128, 32)]; the learning rates are [0.01, 0.001, 0.0001]; and the regularization coefficients are [0.001, 0.0001, 0.00001]. These ranges cover common settings while leaving enough flexibility to fit a simple dataset with reasonable parameters. The 9 × 3 × 3 candidate values combine into 81 different configurations, and this study employs a grid search strategy [38] to evaluate each configuration through cross-validation and to identify the optimal combination.

2.5.3. Canopy Volume Measurement Model Construction Based on Gradient Boosting Decision Tree

Gradient boosting decision trees (GBDT) are an advanced ensemble learning model built upon the CART algorithm [39]. In the process of developing the GBDT model, an initial prediction model is created first. In each iteration, the residuals of the previous model are calculated, and a new decision tree is constructed to fit these residuals on the training data. The newly trained decision tree is then incorporated into the current model, typically adjusted by a learning rate to control its contribution. Through multiple iterations, new decision trees are continuously added, with each iteration aiming to minimize the current model’s loss and gradually converge toward the optimal solution.
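The residual-fitting loop can be sketched with the simplest possible trees. The sketch assumes a squared error loss, a single input feature, and depth-1 trees (stumps), whereas the models in this study use deeper trees; `fit_stump`, `fit_gbdt`, `gbdt_predict`, and the toy data are hypothetical.

```python
from statistics import mean

def fit_stump(xs, residuals):
    """Best single-split regression stump on one feature (least squares)."""
    best = None
    values = sorted(set(xs))
    for low, high in zip(values, values[1:]):
        threshold = (low + high) / 2
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        left_mean, right_mean = mean(left), mean(right)
        sse = sum((r - left_mean) ** 2 for r in left)
        sse += sum((r - right_mean) ** 2 for r in right)
        if best is None or sse < best[0]:
            best = (sse, threshold, left_mean, right_mean)
    return best[1:]

def stump_predict(stump, x):
    threshold, left_mean, right_mean = stump
    return left_mean if x <= threshold else right_mean

def fit_gbdt(xs, ys, n_trees=100, learning_rate=0.1):
    base = mean(ys)                      # initial prediction model
    predictions = [base] * len(xs)
    stumps = []
    for _ in range(n_trees):
        residuals = [y - p for y, p in zip(ys, predictions)]
        stump = fit_stump(xs, residuals)  # new tree fitted to the residuals
        stumps.append(stump)
        predictions = [
            p + learning_rate * stump_predict(stump, x)
            for p, x in zip(predictions, xs)
        ]
    return base, learning_rate, stumps

def gbdt_predict(model, x):
    base, learning_rate, stumps = model
    return base + learning_rate * sum(stump_predict(s, x) for s in stumps)

def training_mse(model, xs, ys):
    return mean((gbdt_predict(model, x) - y) ** 2 for x, y in zip(xs, ys))

# Toy step-like data.
xs_demo = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
ys_demo = [1.0, 1.2, 0.9, 3.1, 3.0, 3.3, 5.9, 6.1]

mse_baseline = mean((mean(ys_demo) - y) ** 2 for y in ys_demo)
mse_one_tree = training_mse(fit_gbdt(xs_demo, ys_demo, n_trees=1), xs_demo, ys_demo)
mse_many_trees = training_mse(fit_gbdt(xs_demo, ys_demo, n_trees=100), xs_demo, ys_demo)
print(mse_baseline, mse_one_tree, mse_many_trees)
```

Because each tree contributes only a fraction of its fit, the training error falls gradually as trees are added, which is why a smaller learning rate generally calls for more trees.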
The main hyperparameters of GBDT are the number of decision trees, the learning rate, and the maximum depth of the trees [40]. Ten-fold cross-validation is employed for hyperparameter tuning. Too few decision trees may fail to capture the complexity of the data, while too many increase the computational load and can lead to overfitting. The learning rate controls the contribution of each decision tree to the final prediction; a smaller learning rate can improve model accuracy but requires more trees, whereas a larger learning rate may cause the model to overshoot the optimal solution during training. A greater maximum depth makes the trees more complex and better suited to intricate datasets, but it may also lead to overfitting, whereas a smaller depth can result in underfitting.
A suitable range of hyperparameters is selected according to the complexity of the dataset. The candidate values for the gradient boosting decision tree model are as follows: the number of decision trees: [100, 150, 200, 250, 300]; the learning rate: [0.01, 0.1, 0.2]; and the maximum depth of the decision trees: [3, 4, 5]. The grid search strategy evaluates all 5 × 3 × 3 = 45 parameter combinations, each by cross-validation, and the optimal combination is determined from the cross-validation scores. These ranges cover common values for the number of decision trees, learning rate, and maximum depth, which makes them suitable for a wide array of problems and allows the performance of the various combinations to be explored effectively.
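The two search spaces of Sections 2.5.2 and 2.5.3 can be enumerated as a quick check on the stated numbers of combinations. The sketch assumes that the (128, 128) architecture reported in the results belongs to the BP search space, which is what makes the total 81; the dictionary keys and the `expand` helper are hypothetical names.

```python
from itertools import product

bp_grid = {
    "hidden_layers": [
        (32,), (64,), (128,),
        (32, 32), (64, 64), (128, 128),  # (128, 128) is assumed to be part of the grid
        (64, 32), (128, 64), (128, 32),
    ],
    "learning_rate": [0.01, 0.001, 0.0001],
    "regularization": [0.001, 0.0001, 0.00001],
}

gbdt_grid = {
    "n_trees": [100, 150, 200, 250, 300],
    "learning_rate": [0.01, 0.1, 0.2],
    "max_depth": [3, 4, 5],
}

def expand(grid):
    """Return every combination of the candidate values as a list of dicts."""
    names = list(grid)
    return [
        dict(zip(names, values))
        for values in product(*(grid[name] for name in names))
    ]

bp_candidates = expand(bp_grid)
gbdt_candidates = expand(gbdt_grid)
print(len(bp_candidates), len(gbdt_candidates))  # 81 45
```

In a grid search, each of these dictionaries would be cross-validated in turn and the best-scoring one retained.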

3. Results

Optimal hyperparameters are tuned for the PLSR, BP neural network, and GBDT models. The 121 sets of data in the dataset used in this study are divided into a training set and a test set in a 7:3 ratio. This division ratio is widely applied in machine learning projects [4] and ensures that both sets contain a sufficient amount of data, resulting in 45 sets of data for the training set and 20 for the test set. Each model is fitted to the training set with its optimal hyperparameter combination, yielding a canopy volume multiple regression model, which is then evaluated on the test set to assess its final performance and generalization ability. The evaluation metrics are R2 (coefficient of determination), RMSE (root mean square error), and MAE (mean absolute error). R2, which typically lies between 0 and 1, evaluates how well the model fits; the closer the value is to 1, the more of the variation in the data the model explains and the better the fit. RMSE and MAE are error measures: RMSE is the square root of the mean of the squared errors, and MAE is the mean of the absolute errors, between the true and predicted values. R2 primarily measures the explanatory power of the model, RMSE emphasizes the impact of large errors, and MAE provides an intuitive assessment of predictive accuracy. Taken together, the three metrics give a comprehensive picture of the model’s predictive performance.
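The three metrics follow directly from their definitions. The sketch below uses the standard formulas; `regression_metrics` and the four-value example are hypothetical and are not data from this study.

```python
import math

def regression_metrics(y_true, y_pred):
    """Return R2, RMSE, and MAE for paired true and predicted values."""
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)                  # residual sum of squares
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)   # total sum of squares
    return {
        "R2": 1.0 - ss_res / ss_tot,
        "RMSE": math.sqrt(ss_res / n),
        "MAE": sum(abs(e) for e in errors) / n,
    }

# Hypothetical example: three exact predictions and one that is off by 2.
example = regression_metrics([1.0, 2.0, 3.0, 4.0], [1.0, 2.0, 3.0, 6.0])
print(example)
```

In this example the single large error gives an RMSE of 1.0 but an MAE of only 0.5, which illustrates how RMSE weights large errors more heavily than MAE does.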

3.1. Results of Canopy Volume Measurement Model Construction Based on Partial Least Squares Regression

Table 1 shows the average R2, RMSE, and MAE values obtained through 10-fold cross-validation for different numbers of principal components. When the number of principal components is set to 3, 4, or 5, the 10-fold cross-validation scores are similar and yield relatively better results. In contrast, with only one principal component, the average R2 value is 0.7007, which is approximately 0.25 lower than the optimal hyperparameter performance. This finding underscores the importance of hyperparameter tuning in the partial least squares regression (PLSR) model.
The models were trained with principal component numbers 3, 4, and 5, respectively, and the test set was used to evaluate the final model performance. With three principal components, the test set of the PLSR model achieved an R2 of 0.9742, RMSE of 0.1879, and MAE of 0.1161. With four principal components, the test set of the PLSR model yielded an R2 of 0.9739, RMSE of 0.1892, and MAE of 0.1228. Finally, with five principal components, the test set of the PLSR model recorded an R2 of 0.9737, RMSE of 0.1898, and MAE of 0.1229.
A comparative analysis of the training results of the partial least squares regression (PLSR) model with varying numbers of principal components indicates that the cross-validation results are quite similar when the number of principal components is set to 3, 4, and 5. However, the test set results, including R2, RMSE, and MAE, indicate a slight advantage for the trained canopy volume measurement model when the number of principal components is set to 3. Given the relative simplicity of the model with three principal components, this study selects 3 as the optimal hyperparameter for the PLSR model and proceeds with model training to derive the canopy volume measurement model. Figure 12a illustrates the comparison between the predicted and actual values of the test set for the canopy volume measurement model, while Figure 12b presents the scatter plot and residual fitting plot corresponding to the test set. The figures clearly illustrate that the canopy volume measurement model trained using the PLSR model with three principal components achieves a low prediction error and demonstrates strong generalization capability for the dataset analyzed in this study.

3.2. Results of Canopy Volume Measurement Model Construction Based on BP Neural Network

The cross-validation results for the 81 parameter combinations of the BP neural network model are ranked from best to worst by average R2. The worst hyperparameter combination yields an average R2 of only 0.5833, nearly 0.4 lower than the optimal combination’s R2 of 0.9532, which underscores the importance of selecting the best hyperparameters for the BP neural network model. The top three hyperparameter combinations in cross-validation were selected for further analysis. Their values are as follows: combination 1 uses the hidden-layer configuration (128, 128), a learning rate of 0.01, and a regularization coefficient of 0.0001; combination 2 uses the hidden-layer configuration (128, 64), a learning rate of 0.01, and a regularization coefficient of 0.00001; combination 3 uses the hidden-layer configuration (128, 64), a learning rate of 0.01, and a regularization coefficient of 0.001. Table 2 presents the average R2, RMSE, and MAE values obtained through 10-fold cross-validation for these top three hyperparameter combinations of the BP neural network regression model.
For hyperparameter combination 1, the test set results for the BP neural network model yield an R2 of 0.9699, an RMSE of 0.2032, and an MAE of 0.1261. In combination 2, the model achieves an R2 of 0.9703, an RMSE of 0.2016, and an MAE of 0.1317. Finally, for combination 3, the model’s test set results indicate an R2 of 0.9720, an RMSE of 0.1959, and an MAE of 0.1190.
A comparative analysis of the training results of the BP neural network model under various parameter combinations revealed that the model trained with hyperparameter combination 3 achieved the best performance for canopy volume estimation. Figure 13a presents a comparison between the predicted and actual values for the test set of the canopy volume estimation model, while Figure 13b illustrates the scatter plot of the test set predictions along with the residual fitting. Utilizing hyperparameter combination 3, the BP neural network algorithm generated a canopy volume estimation model with minimal error between the predicted and actual values, demonstrating strong generalization capability.

3.3. Results of Canopy Volume Measurement Model Construction Based on Gradient Boosting Decision Tree

The average R2 of the parameter combination with the worst cross-validation performance for the GBDT model is only 0.7682, approximately 0.17 lower than the average R2 of 0.9394 for the optimal parameter combination. This result underscores the significance of hyperparameter optimization for the GBDT model. The top three hyperparameter combinations in cross-validation were selected for further analysis. Their values are as follows: combination 1 has 100 decision trees, a learning rate of 0.1, and a maximum depth of 3; combination 2 has 150 decision trees, a learning rate of 0.1, and a maximum depth of 3; and combination 3 has 250 decision trees, a learning rate of 0.1, and a maximum depth of 3. The cross-validation results for these three hyperparameter combinations are presented in Table 3.
When hyperparameter combination 1 is applied, the GBDT model yields a test set R2 of 0.9315, with an RMSE of 0.3062 and an MAE of 0.1732. In the case of hyperparameter combination 2, the model produces slightly improved results, with an R2 of 0.9319, an RMSE of 0.3055, and an MAE of 0.1716. Finally, when hyperparameter combination 3 is utilized, the gradient boosting decision tree model achieves an R2 of 0.9319, an RMSE of 0.3054, and an MAE of 0.1704.
Comparative analysis of the GBDT model training results under various parameter combinations reveals that the canopy volume measurement model trained with hyperparameter combination 3 demonstrates the highest effectiveness. Figure 14a illustrates the comparison between the predicted values and the actual values for the test set of the canopy volume measurement model, while Figure 14b presents the scatter plot and residual fitting plot for the same test set. The error between the predicted and actual values of the canopy volume measurement model obtained through GBDT algorithm training with hyperparameter combination 3 is minimal, indicating strong generalization capability.

3.4. Summary of Canopy Volume Measurement Models and Comparative Analysis of Effects

In this dataset, the optimal hyperparameter for the PLSR model is three principal components. The trained canopy volume measurement model has a coefficient of determination (R2) of 0.9742, a root mean square error (RMSE) of 0.1879, and a mean absolute error (MAE) of 0.1161. For the BP neural network model, the optimal hyperparameters are a hidden-layer configuration of (128, 64), a learning rate of 0.01, and a regularization coefficient of 0.001; the trained model has an R2 of 0.9720, an RMSE of 0.1959, and an MAE of 0.1190. For the GBDT model, the optimal hyperparameters are 250 decision trees, a learning rate of 0.1, and a maximum tree depth of 3; the trained model has an R2 of 0.9319, an RMSE of 0.3054, and an MAE of 0.1704.
Comparative analysis shows that the PLSR model fits more accurately than both the BP neural network and the GBDT models. The R2 of the canopy volume measurement model obtained from the PLSR model is higher by 0.0022 than that of the BP neural network model and by 0.0423 than that of the GBDT model. Its RMSE is lower by 0.0080 and 0.1175, respectively, and its MAE is lower by 0.0029 and 0.0543, respectively. These results indicate that the canopy volume measurement model derived from the PLSR model outperforms the other two models in generalization ability, prediction bias, and mean prediction error.
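The differences quoted in this comparison can be reproduced from the reported test-set metrics. The sketch below only restates those reported values; the dictionary layout and the helper `gap_to_plsr` are hypothetical.

```python
# Test-set metrics of the three final models, as reported in this section.
reported = {
    "PLSR": {"R2": 0.9742, "RMSE": 0.1879, "MAE": 0.1161},
    "BP": {"R2": 0.9720, "RMSE": 0.1959, "MAE": 0.1190},
    "GBDT": {"R2": 0.9319, "RMSE": 0.3054, "MAE": 0.1704},
}

def gap_to_plsr(other):
    """Advantage of the PLSR model over another model, rounded to four decimals."""
    plsr, rival = reported["PLSR"], reported[other]
    return {
        "R2_gain": round(plsr["R2"] - rival["R2"], 4),
        "RMSE_reduction": round(rival["RMSE"] - plsr["RMSE"], 4),
        "MAE_reduction": round(rival["MAE"] - plsr["MAE"], 4),
    }

gap_bp = gap_to_plsr("BP")
gap_gbdt = gap_to_plsr("GBDT")
print(gap_bp)
print(gap_gbdt)
```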

4. Discussion

This study establishes a canopy volume estimation model based on LiDAR point cloud data, which can effectively and accurately predict canopy volume, providing significant guidance for wind regulation and dosage control in precision spraying within orchards. Compared to traditional manual measurements of canopy volume, the method presented in this study reduces subjective human factors by analyzing point cloud features to obtain canopy volume, resulting in higher and more stable model accuracy [7]. In contrast to conventional slicing methods that assume canopies are solid, the approach taken in this study accounts for the presence of canopy gaps, thereby reducing measurement errors. The improved Alpha Shape point cloud contour extraction algorithm introduced in the feature extraction section significantly enhances the accuracy of canopy profile estimation. Beyond its focus on fruit tree spraying, this research also has important implications for studies in fruit yield prediction and forest monitoring.
In practical applications within orchards, the canopy volume measured in this study, which accounts for gaps, can effectively reflect both plant density and material density within the canopy, including leaves and branches. During spraying, the presence of leaves and branches can lead to wind loss and droplet adhesion. The next step could involve investigating the mechanisms of wind loss and droplet adhesion within the canopy based on the canopy volume obtained in this study, along with factors such as leaf density and planting patterns. This research could lead to the development of precise variable spraying prescription maps for fruit trees. By utilizing these prescription maps and related strategies to control wind force and dosage during spraying, it would be possible to achieve precision variable spraying in orchards, addressing the issue of excessive application associated with traditional spraying methods while reducing pesticide residues, waste, and environmental pollution caused by over-application.
This study primarily focuses on improving methods for obtaining canopy profiles and volumes, as well as on optimizing hyperparameters and training the canopy volume estimation model on the dataset used in this research. It does not consider other factors, such as the automatic clipping of point clouds. In addition, variations in crop species, growth environments, growth stages, and planting methods all affect canopy sparsity and density, so the relationship between canopy volume and certain features (such as canopy profile length) differs from case to case, and the relationship models may vary accordingly. For a specific plant species, the improved feature extraction methods proposed in this study can be used to retrain the volume estimation model. The impact of different plant species and canopy densities on the accuracy of the enhanced Alpha Shape point cloud contour extraction algorithm and the improved canopy volume algorithm presented in this study is expected to be relatively minor. For instance, Figure 15 illustrates a comparison of canopy profile extraction and volume estimation for sycamore trees. Further research is essential to confirm the specific effects of the improved algorithms proposed in this study on different plant species.

5. Conclusions

  • The general Alpha Shape contour extraction algorithm suffers from significant errors because its judgment of effective boundaries is limited. To address this issue, this paper proposes an improved Alpha Shape contour extraction algorithm that strengthens the validity judgment of contour line segments, thereby significantly improving the accuracy of the contour extraction results.
  • This study employs the grid integration method for canopy volume measurement. However, conventional grid integration methods frequently disregard canopy gaps when calculating height differences, leading to significant errors. To resolve this issue, an improved grid integration method is introduced, which accounts for the influence of gaps in height calculations, thereby substantially improving the accuracy of canopy volume measurements.
  • This study employs a cross-validated recursive feature elimination method based on random forest to select features from a total of 10. Five features were retained for the model training dataset: the number of points in the point cloud, the 2D point cloud contour in the X-projection direction, the 2D point cloud contour in the Z-projection direction, the 2D width in the Y-projection direction, and the 2D length in the Z-projection direction. By eliminating irrelevant or redundant features, this approach enhances the accuracy and predictive capability of the canopy volume measurement model.
  • The selection of optimal hyperparameters for PLSR, BP neural networks, and GBDT was conducted using the dataset from this study, resulting in the construction of canopy volume measurement models for each method under their respective optimal hyperparameters. Analysis of the results revealed that the PLSR model with three principal components yielded the best performance for canopy volume measurement. This model achieved an R2 value of 0.9742, with the lowest RMSE and MAE at 0.1879 and 0.1161, respectively. These results indicate that the model possesses strong generalization ability, low prediction bias, and minimal average prediction error, facilitating fast and accurate measurement of fruit tree canopy volume.

Author Contributions

Conceptualization, N.G. and N.X.; methodology, N.G., N.X., J.K., Q.M., M.N. and W.W.; software, N.G., Q.M. and X.Z.; validation, N.G., N.X. and J.K.; formal analysis, N.G.; investigation, N.G., N.X., Q.M. and X.Z.; resources, N.G. and N.X.; data curation, N.G., Q.M. and X.Z.; writing—original draft preparation, N.G., N.X. and J.K.; writing—review and editing, N.G., N.X., J.K., G.Z., Q.M., M.N. and W.W.; visualization, N.G.; supervision, N.X., J.K. and G.Z.; project administration, N.X. and J.K.; funding acquisition, N.X. and J.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Shandong Province Key Research and Development Plan (Major Science and Technology Innovation Project) (2022CXGC020706), the Shandong Model construction project for rural revitalization (2022DXAL0208), and the Shandong Province Agricultural Mechanization R&D, Manufacturing, Promotion and Application Integration Pilot Project (NJYTHSD-202326).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data presented in this study are available upon request from the corresponding author.

Acknowledgments

Thanks to all the authors cited in this article and the reviewers for their helpful comments and suggestions.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Xu, T.; Lu, X.L.; Qi, Y.N.; Zeng, J.; Lui, X.H.; Yuan, Q.C. Research status on equipment and technology of orchard air-assisted sprayer in China. J. Chin. Agric. Mech. 2023, 44, 69–77. [Google Scholar]
  2. Niu, C.Q.; Zhang, W.J.; Wang, Q.; Zhao, X.X.; Fan, G.J.; Jiang, H.H. Current status and trends of research on adjusting air volume of orchard air spray. J. Chin. Agric. Mech. 2020, 41, 48–54. [Google Scholar]
  3. He, X. Research progress and developmental recommendations on precision spraying technology and equipment in China. Smart Agric. 2020, 2, 133–146. [Google Scholar]
  4. Gu, C.C.; Zhai, C.Y.; Chen, L.P.; Li, Q.; Hu, L.N.; Yang, Z.F. Detection Model of Tree Canopy Leaf Area Based on LiDAR Technology. Trans. Chin. Soc. Agric. Mach. 2021, 52, 278–286. [Google Scholar]
  5. Zhao, C.Y.; Zhao, C.J.; Ning, W.; Long, J.; Wang, X.; Weckler, P.; Zhang, H.H. Research progress on precision control methods of air-assisted spraying in orchards. Trans. Chin. Soc. Agric. Eng. 2018, 34, 1–15. [Google Scholar]
  6. He, X.Q. Plant protection precision application technology and equipment. Agric. Eng. Technol. 2017, 37, 22–26. [Google Scholar]
  7. Ding, W.M.; Zhao, S.Q.; Zhao, S.Q.; Gu, J.B.; Qiu, W.; Guo, B.B. Measurement Methods of Fruit Tree Canopy Volume Based on Machine Vision. Trans. Chin. Soc. Agric. Mach. 2016, 47, 1–10+20. [Google Scholar]
  8. Wang, Q.; Hu, H.; Wu, Y.L.; Kang, R.H.; Xu, B.X.; Liang, Z.Y. Automatic Tree Crown Volume Calculation Method Based on Point Cloud Data. J. Chin. J. Northwest For. Univ. 2017, 32, 242–246. [Google Scholar]
  9. He, C.; Zhang, S.Y.; Matteo, C.; Zhou, A.M.; Hong, F.X. Algorithm of Crown Volume with Square Grid-based Method. Trans. Chin. Soc. Agric. Mach. 2014, 45, 90–97. [Google Scholar]
  10. Yu, D.H.; Feng, Z.K. Tree crown volume measurement method based on oblique aerial images of UAV. Trans. Chin. Soc. Agric. Eng. 2019, 35, 90–97. [Google Scholar]
  11. Shu, Y.P.; Li, Q.J.; Zhou, H.P.; Tao, R.; Xu, L.Y. Design of variable rate spray control system based on LiDAR detection. J. For. Eng. 2020, 5, 139–147. [Google Scholar]
  12. Ma, W.F.; Wu, X.D.; Wang, C.; Wen, P.; Wang, J.L.; Cao, L.; Xiao, Z.L. A spherical coordinate integration method for extracting crownvolumes of individual trees based on the TLS point clouds. Remote Sens. Nat. Resour. 2024, 36, 81–87. [Google Scholar]
  13. Wang, Y.T.; Wang, J.; Niu, L.W.; Cheng, S.P.; Sun, L. Comparative analysis of extraction algorithms for crown volume and surface area using UAV tilt photogrammetry. J. For. Eng. 2022, 7, 166–173.
  14. Cheng, G.; Wang, J.G.; Yang, J.; Zhao, Z.Z.; Wang, L. Calculation Method of 3D Point Cloud Canopy Volume Based on Improved α-shape Algorithm. Trans. Chin. Soc. Agric. Mach. 2021, 52, 175–183.
  15. Yu, L.; Huang, J.; Zhao, Z.X.; Zhang, L.; Sun, D.Z. Laser Measurement and Experiment of Hilly Fruit Tree Canopy Volume. Trans. Chin. Soc. Agric. Mach. 2013, 44, 224–228.
  16. Li, Q.J.; Zheng, J.Q.; Zhou, H.P.; Zhang, H.; Shu, P.Y.; Xu, B. Online Measurement of Tree Canopy Volume Using Vehicle-borne 2-D Laser Scanning. Trans. Chin. Soc. Agric. Mach. 2016, 47, 309–314.
  17. Li, P.; Zhang, M.; Dai, X.S.; Wang, T.; Zheng, J.Q.; Yi, L.S.; Lyu, Q. Real-Time Estimation of Citrus Canopy Volume Based on Laser Scanner and Irregular Triangular Prism Module Method. Sci. Agric. Sin. 2019, 52, 4493–4504.
  18. Zhou, H.; Zhang, J.; Ge, L.; Yu, X.; Wang, Y.; Zhang, C. Research on volume prediction of single tree canopy based on three-dimensional (3D) LiDAR and clustering segmentation. Int. J. Remote Sens. 2021, 42, 738–755.
  19. Underwood, J.P.; Hung, C.; Whelan, B.; Sukkarieh, S. Mapping almond orchard canopy volume, flowers, fruit and yield using lidar and vision sensors. Comput. Electron. Agric. 2016, 130, 83–96.
  20. Xu, L.; Hou, J.; Cheng, Q.L.; Qing, Y.Q.; Peng, Y.C.; Huang, G. A new adaptive median denoising model combined with cyclic iterative method. J. Sichuan Univ. (Nat. Sci. Ed.) 2022, 59, 51–60.
  21. Lei, K.; Tang, X.; Li, X.; Lu, Q.; Long, T.; Zhang, X.; Xiong, B. Research and Preliminary Evaluation of Key Technologies for 3D Reconstruction of Pig Bodies Based on 3D Point Clouds. Agriculture 2024, 14, 793.
  22. Zhang, L.; Hao, Q.; Mao, Y.; Su, J.; Cao, J. Beyond Trade-Off: An Optimized Binocular Stereo Vision Based Depth Estimation Algorithm for Designing Harvesting Robot in Orchards. Agriculture 2023, 13, 1117.
  23. Rusu, R.B.; Cousins, S. 3D is here: Point Cloud Library (PCL). In Proceedings of the 2011 IEEE International Conference on Robotics and Automation, Shanghai, China, 9–13 May 2011; pp. 1–4.
  24. Edelsbrunner, H.; Kirkpatrick, D.; Seidel, R. On the shape of a set of points in the plane. IEEE Trans. Inf. Theory 1983, 29, 551–559.
  25. Alpha Shapes Extract 2D Point Cloud Boundaries. Available online: https://blog.csdn.net/a394467238/article/details/132556295 (accessed on 7 December 2023).
  26. Calculation of Point Cloud Volume Based on Python Grid Method. Available online: https://blog.csdn.net/m0_56729804/article/details/135017822 (accessed on 2 April 2024).
  27. Wei, X.H.; Wang, Y.G.; Zheng, J.; Wang, M.; Feng, Z.K. Tree Crown Volume Calculation Based on 3-D Laser Scanning Point Clouds Data. Trans. Chin. Soc. Agric. Mach. 2013, 44, 235–240.
  28. Shen, Y.; Li, Q.Z.; Du, X.; Wang, H.Y.; Zhang, Y. Indicative features for identifying corn and soybean using remote sensing imagery at middle and later growth season. Natl. Remote Sens. Bull. 2022, 26, 1410–1422.
  29. Dash, M.; Liu, H. Feature selection for classification. Intell. Data Anal. 1997, 1, 131–156.
  30. Sun, D.L.; Chen, D.L.; Mi, C.L.; Chen, X.Y.; Mi, S.W.; Li, X.Q. Evaluation of landslide susceptibility in the gentle hill-valley areas based on the interpretable random forest-recursive feature elimination model. J. Geomech. 2023, 29, 202–219.
  31. Zhang, L.; Guo, Z.G.; Qi, S.; Wu, B.C.; Li, P. Predicting shallow landslides in highly vegetation-covered areas using machine learning models. Trans. Chin. Soc. Agric. Eng. 2024, 40, 149–160.
  32. Zhang, G.Q.; Gan, Z.L.; Yan, Y.; Gao, W.M. Rapid detection of the physicochemical properties of honey based on infrared spectroscopy. Trans. Chin. Soc. Agric. Eng. 2023, 39, 275–284.
  33. Zhao, J.; Shen, M.S.; Pu, Y.G.; Chen, A.; Li, H. Out-of-warehouse Evaluation and Prediction Model of Apple Based on Near-infrared Spectroscopy Combined with Multiple Quality Indexes. Trans. Chin. Soc. Agric. Mach. 2023, 54, 386–395.
  34. Chen, W.J.; Zhou, D.Q.; Cui, C.H.; Ren, Z.J.; Zuo, W.J. Prediction Model of Farinograph Characteristics of Wheat Flour Based on Near Infrared Spectroscopy. Spectrosc. Spectr. Anal. 2023, 43, 3089–3097.
  35. An, X.F.; Dai, J.Y.; Li, L.W.; Lu, H.; Yin, Y.X.; Meng, Z.J. Research on wheat moisture content prediction model for combine harvester based on GA-BP method. Trans. Chin. Soc. Agric. Mach. 2024, 1–10. Available online: http://kns.cnki.net/kcms/detail/11.1964.S.20241016.1603.004.html (accessed on 12 December 2024).
  36. He, B.; Li, L.Q.; Chengjiang, Y.C.; Zhou, Z.X.; Zhang, L.; Liang, X.C. Performance prediction of PV/T cogeneration system based on artificial neural network. Trans. Chin. Soc. Agric. Eng. 2024, 40, 309–318.
  37. Liu, T.C.; Dong, K.Y.; Zhang, B.; Song, Y.S.; Liang, Z.L.; Li, J.W.; Wang, Y.B.; Zhang, L.; Yan, G.Q.; Hu, W.Y. A Grid-Based BP Neural Network Positioning Method for a Space Optical Communication Spot Center. Chin. J. Lasers 2024, 51, 206–213.
  38. Li, H.X.; Song, D.L.; Kong, J.N.; Song, Y.F.; Chang, H.Y. Evaluation of Hyperparameter Optimization Techniques for Traditional Machine Learning Models. Comput. Sci. 2024, 51, 242–255.
  39. Meng, R.; Shen, W.; Luan, Q.F.; Ji, Q.; Rao, Y.L. Water depth retrieval based on gradient boosting decision tree algorithm. Trans. Oceanol. Limnol. 2023, 45, 45–50.
  40. Zeng, P.Z.; Wang, W.; Yuan, M.X.; Yang, Z.F. Research on Inversion of Nitrogen Content in Apple Tree Canopy Based on Remote Sensing of Unmanned Aerial Vehicles. Shandong Agric. Sci. 2024, 56, 167–173.
Figure 1. LZY604 driverless wheeled tractor.
Figure 2. Point cloud data acquisition. (a) Acquisition site. (b) Multi-view point cloud image.
Figure 3. Comparison before and after SOR filter denoising. (a) Top view before and after denoising. (b) Front view before and after denoising.
Figure 4. Point cloud of single fruit trees.
Figure 5. 3D point cloud plane projection. (a) X-axis projection. (b) Y-axis projection. (c) Z-axis projection.
Figure 6. Triangular sectioning and edge circle drawing.
Figure 7. 2D point cloud contour lines. (a) Comparison of contours in the X-projection direction. (b) Comparison of contours in the Y-projection direction. (c) Comparison of contours in the Z-projection direction.
Figure 8. Comparison of manually labeled contour lengths from the Alpha Shape contour extraction algorithm with actual contour lengths. (a) Manually labeled lengths of contours extracted by the Alpha Shape algorithm. (b) Manually marked actual contour lengths.
Figure 9. Comparison of the grid integral volume method before and after improvement. (a) Grid integral volume method before improvement. (b) Improved grid integral volume method.
Figure 10. Body element and point cloud presentation diagrams. (a) The body element is taken as 0.5 m. (b) The body element is taken as 0.1 m.
Figure 11. Comparison of results between the body element method and the improved grid integration method.
Figure 12. Test set results of the canopy volume measurement model based on PLSR. (a) Plot of predicted versus true values for the test set. (b) Plot of test set scatter points versus residual fit.
Figure 13. BP neural network-based canopy volume measurement model test set results. (a) Plot of predicted versus true values for the test set. (b) Plot of test set scatter points versus residual fit.
Figure 14. Test set results of canopy volume measurement model based on GBDT. (a) Plot of predicted versus true values for the test set. (b) Plot of test set scatter points versus residual fit.
Figure 15. Comparison of sycamore canopy contour extraction and volume acquisition. (a) Comparison of contours in the X-projection direction. (b) Comparison of contours in the Y-projection direction. (c) Comparison of contours in the Z-projection direction. (d) Volume acquisition comparison.
Table 1. Table of cross-validation scores for PLSR model.
Number of Principal Components | Average R² | Average RMSE | Average MAE
1 | 0.7007 | 0.4331 | 0.3163
2 | 0.8925 | 0.2389 | 0.1882
3 | 0.9559 | 0.1437 | 0.1069
4 | 0.9585 | 0.1393 | 0.1040
5 | 0.9592 | 0.1378 | 0.1030
Table 2. Table of cross-validation scores for BP neural network regression models.
Combination | Average R² | Average RMSE | Average MAE
1 | 0.9488 | 0.1613 | 0.1170
2 | 0.9532 | 0.1592 | 0.1179
3 | 0.9521 | 0.1596 | 0.1158
Table 3. GBDT model cross-validation score table.
Combination | Average R² | Average RMSE | Average MAE
1 | 0.9394 | 0.1874 | 0.1359
2 | 0.9382 | 0.1889 | 0.1361
3 | 0.9372 | 0.1900 | 0.1365
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Guo, N.; Xu, N.; Kang, J.; Zhang, G.; Meng, Q.; Niu, M.; Wu, W.; Zhang, X. A Study on Canopy Volume Measurement Model for Fruit Tree Application Based on LiDAR Point Cloud. Agriculture 2025, 15, 130. https://doi.org/10.3390/agriculture15020130
