Using Synthetic Tree Data in Deep Learning-Based Tree Segmentation Using LiDAR Point Clouds
<p>Example tree point clouds: (<b>a</b>) real LiDAR scanned point clouds of trees taken from the three study sites and, (<b>b</b>) Synthetic tree point clouds generated using the simulation process.</p> "> Figure 2
<p>Simulation process for randomised synthetic tree data: (<b>a</b>) Example tree main stem cylinder model. (<b>b</b>) randomised meshes used for the placement of smaller branches, foliage and other tree canopy. (<b>c</b>) Simulated LiDAR point cloud using point-based sampling of the tree structure.</p> "> Figure 3
<p>Performance (mean IoU of stem class) of real data-trained models on real data evaluation sets with increasing number of real examples N used during training. Plots show the average IoU across five different models trained for each site and N combination, with error bars indicating the standard deviation over these different models.</p> "> Figure 4
<p>Performance (mean IoU of stem class) of synthetic data-trained models on real data evaluation sets with increasing number of synthetic examples used during training.</p> "> Figure 5
<p>Examples of predicted segmentations of real tree point clouds using models trained using real vs. synthetic training data. (<b>a</b>) ground truth class labels, (<b>b</b>) predicted classes by model trained on limited real data (Real N50), (<b>c</b>) predicted classes by model trained on all real data, and (<b>d</b>) predicted classes by model trained on synthetic data.</p> ">
Abstract
:1. Introduction
- 1.
- Does the use of simulated training examples help boost performance of point segmentation models on real data, compared to training with limited real examples only?
- 2.
- How does the accuracy of trained models depend on the amount of real or simulated data?
- 3.
- What level of sophistication in the simulation of individual tree point clouds is necessary to achieve a boost in performance?
1.1. Related Work
1.1.1. Individual Tree Segmentation and Forest Point Cloud Deep Learning
1.1.2. Deep Learning Using Synthetic Data
1.2. Contributions of This Paper
- 1.
- We develop a tree point cloud simulation framework that generates realistic synthetic data of forest trees that can be used for training deep learning-based segmentation models. Our simulation code is open-source and made available to the research community (https://github.com/mitchbryson/SimpleSynthTree (accessed on 3 April 2023)).
- 2.
- We demonstrate an approach to deep learning-based tree stem point detection that can use our synthetic data, and demonstrate its effectiveness on real forest LiDAR point clouds collected from a variety of data sources (aerial and terrestrial) and different forest sites.
- 3.
- We demonstrate that models trained on synthetic data have competitive performance, or out-perform models built by training on limited amounts of real data or when using real data from the non-target forest sites.
2. Materials and Methods
2.1. Real LiDAR Datasets
2.1.1. Study Areas and Data Capture
- Tumut: Commercial forest located outside of Tumut, NSW, Australia consisting primarily of mature Radiata pine trees, collected using airborne scanning.
- HQP: A commercial plantation consisting primarily of Pinus Caribaea spp. trees, located in Queensland, Australia, captured using mobile ground-based scanning.
- DogPark: Recreational forest (various species) in Rotorua, New Zealand, captured using mobile ground-based scanning.
2.1.2. Initial Point Cloud Processing
2.2. Development of a Simulator for Generating Synthetic Tree Point Cloud Data
2.2.1. Tree Mesh Model Simulation
2.2.2. Mesh Point Sampling
2.2.3. Simulation Implementation
2.3. Supervised Deep Learning Point Cloud Semantic Segmentation Model
3. Results
3.1. Models Trained on Real Data
3.2. Models Trained on Synthetic-Only Data
3.3. Comparison of Real vs. Synthetic Data-Trained Models
3.4. Cross-Dataset Performance of Real Data-Trained Models vs. Synthetic-Only Trained Model
3.5. Synthetic Data Simulator Ablation Study
- 1.
- No sweep and taper (ST): tree stems were modelled as straight cylinders with a constant diameter () along the height of the tree and no cubic spline parameters.
- 2.
- No foliage distribution (FD): the distribution of tree branching and foliage was generated along the height of the tree with a uniform random distribution and constant branch length of 2 m.
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Ayrey, E.; Hayes, D. The Use of Three-Dimensional Convolutional Neural Networks to Interpret LiDAR for Forest Inventory. Remote Sens. 2018, 10, 649. [Google Scholar] [CrossRef]
- Hamraz, H.; Jacobs, N.; Contreras, M.; Clark, C. Deep learning for conifer/deciduous classification of airborne LiDAR 3D point clouds representing individual trees. ISPRS J. Photogramm. Remote Sens. 2018, 158, 219–230. [Google Scholar] [CrossRef]
- Chen, J.; Chen, Y.; Liu, Z. Classification of Typical Tree Species in Laser Point Cloud Based on Deep Learning. Remote Sens. 2021, 13, 4750. [Google Scholar] [CrossRef]
- Liu, B.; Chen, S.; Huang, H.; Tian, X. Tree species classification of backpack laser scanning data using the PointNet++ point cloud deep learning method. Remote Sens. 2022, 14, 3809. [Google Scholar] [CrossRef]
- Xi, Z.; Hopkinson, C.; Chasmer, L. Filtering Stems and Branches from Terrestrial Laser Scanning Point Clouds Using Deep 3-D Fully Convolutional Networks. Remote Sens. 2018, 10, 1215. [Google Scholar] [CrossRef]
- Windrim, L.; Bryson, M. Detection, segmentation, and model fitting of individual tree stems from airborne laser scanning of forests using deep learning. Remote Sens. 2020, 12, 1469. [Google Scholar] [CrossRef]
- Krisanski, S.; Taskhiri, M.; Gonzalez-Aracil, S.; Herries, D.; Turner, P. Sensor Agnostic Semantic Segmentation of Structurally Diverse and Complex Forest Point Clouds Using Deep Learning. Remote Sens. 2021, 13, 1413. [Google Scholar] [CrossRef]
- Wu, B.; Zheng, G.; Chen, Y. An Improved Convolution Neural Network-Based Model for Classifying Foliage and Woody Components from Terrestrial Laser Scanning Data. Remote Sens. 2020, 12, 1010. [Google Scholar] [CrossRef]
- Wang, F.; Bryson, M. Tree Segmentation and Parameter Measurement from Point Clouds Using Deep and Handcrafted Features. Remote Sens. 2023, 15, 1086. [Google Scholar] [CrossRef]
- Lines, E.; Allen, M.; Cabo, C.; Calders, K.; Debus, A.; Greive, S.; Miltiadou, M.; Noach, A.; Owen, H.; Puliti, S. AI applications in forest monitoring need remote sensing benchmark datasets. arXiv 2022, arXiv:2212.09937. [Google Scholar]
- Ros, G.; Sellart, L.; Materzynska, J.; Vazquez, D.; Lopez, A. The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016. [Google Scholar]
- Johnson-Roberson, M.; Barto, C.; Mehta, R.; Sridhar, S.N.; Rosaen, K.; Vasudevan, R. Driving in the Matrix: Can Virtual Worlds Replace Human-Generated Annotations for Real World Tasks? In Proceedings of the International Conference on Robotics and Automation (ICRA), Singapore, 29 May–3 June 2017.
- Ward, D.; Moghadam, P.; Hudson, N. Deep Leaf Segmentation Using Synthetic Data. In Proceedings of the British Machine Vision Conference (BMVC), Newcastle, UK, 3–6 September 2018. [Google Scholar]
- Georgakis, G.; Mousavian, A.; Berg, A.; Kosecka, J. Synthesizing Training Data for Object Detection in Indoor Scenes. In Proceedings of the Robotics: Science and Systems, Cambridge, MA, USA, 12–16 July 2017. [Google Scholar]
- Wood, E.; Baltrusaitis, T.; Hewitt, C. Fake it till you make it: Face analysis in the wild using synthetic data alone. In Proceedings of the International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10–17 October 2021. [Google Scholar]
- Weber, J.; Penn, J. Creation and rendering of realistic trees. In Proceedings of the 22nd Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), Los Angeles, CA, USA, 6–11 August 1995. [Google Scholar]
- Hewitt, C. Procedural Generation of Tree Models for Use in Computer Graphics. Undergraduate Dissertation, Trinity Hall, Dublin, Ireland, 2017. [Google Scholar]
- Westling, F.; Bryson, M.; Underwood, J. SimTreeLS: Simulating aerial and terrestrial laser scans of trees. Comput. Electron. Agric. 2021, 187, 106277. [Google Scholar] [CrossRef]
- Bryson, M. PointcloudITD: A software package for individual tree detection and counting. In Deployment and Integration of Cost-Effective, High Spatial Resolution, Remotely Sensed Data for the Australian Forestry Industry; FWPA Technical Report; Forest & Wood Products Australia: Melbourne, VIC, Australia, 2017; pp. 1–19. [Google Scholar]
- Vandendaele, B.; Fournier, R.; Vepakomma, U.; Pelletier, G.; Lejeune, P.; Martin-Ducup, O. Estimation of northern hardwood forest inventory attributes using UAV laser scanning (ULS): Transferability of laser scanning methods and comparison of automated approaches at the tree- and stand-level. Remote Sens. 2021, 13, 2796. [Google Scholar] [CrossRef]
- Neuville, R.; Bates, J.; Jonard, F. Estimating forest structure from UAV-mounted LiDAR point cloud using machine learning. Remote Sens. 2021, 13, 352. [Google Scholar] [CrossRef]
- Hao, Y.; Widagdo, F.; Liu, X.; Liu, Y.; Dong, L.; Li, F. A hierarchical region-merging algorithm for 3-D segmentation of individual trees using UAV-LiDAR point clouds. IEEE Trans. Geosci. Remote Sens. 2022, 69, 5701416. [Google Scholar] [CrossRef]
- Persson, P.; Olofsson, K.; Holmgren, J. Two-phase forest inventory using very-high-resolution laser scanning. Remote Sens. Environ. 2022, 271, 112909. [Google Scholar] [CrossRef]
- Gobakken, T.; Naesset, E. Estimation of diameter and basal area distributions in coniferous forest by means of airborne laser scanner data. Scand. J. For. Res. 2004, 19, 529–542. [Google Scholar] [CrossRef]
- Maltamo, M.; Suvanto, A.; Packalén, P. Comparison of basal area and stem frequency diameter distribution modelling using airborne laser scanner data and calibration estimation. For. Ecol. Manag. 2007, 247, 26–34. [Google Scholar] [CrossRef]
- Olofsson, K.; Holmgren, J.; Olsson, H. Tree stem and height measurements using terrestrial laser scanning and the RANSAC algorithm. Remote Sens. 2014, 6, 4323–4344. [Google Scholar] [CrossRef]
- Lamprecht, S.; Stoffels, J.; Dotzler, S.; Hab, E.; Udelhoven, T. aTrunk—An ALS-Based Trunk Detection Algorithm. Remote Sens. 2015, 7, 9975–9997. [Google Scholar] [CrossRef]
- Raumonen, P.; Kaasalainen, M.; Akerblom, M.; Kaasalainen, S.; Kaartinen, H.; Vastaranta, M.; Holopainen, M.; Disney, M.; Lewis, P. Fast Automatic Precision Tree Models from Terrestrial Laser Scanner Data. Remote Sens. 2013, 5, 491–520. [Google Scholar] [CrossRef]
- Qi, C.; Yi, L.; Su, H.; Guibas, L. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. In Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017. [Google Scholar]
- Qi, C.; Yi, L.; Su, H.; Guibas, L. PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space. In Proceedings of the Conference on Neural Information Processing Systems (NIPS), Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
- Allworth, J.; Windrim, L.; Bennett, J.; Bryson, M. A transfer learning approach to space debris classification using observational light curve data. Acta Astronaut. 2021, 181, 301–315. [Google Scholar] [CrossRef]
- Griffiths, D.; Boehm, J. SynthCity: A large-scale synthetic point cloud. arXiv 2019, arXiv:1907.04758. [Google Scholar]
- Uggla, G.; Horemuz, M. Towards synthesized training data for semantic segmentation of mobile laser scanning point clouds: Generating level crossings from real and synthetic point cloud samples. Autom. Constr. 2021, 130, 103839. [Google Scholar] [CrossRef]
- Kingma, D.; Ba, J. Adam: A method for stochastic optimization. In Proceedings of the International Conference on Learning Representations (ICLR), San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
- Xiao, A.; Huang, J.; Guan, D.; Zhan, F.; Lu, S. Transfer Learning from Synthetic to Real LiDAR Point Cloud for Semantic Segmentation. In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), Vancouver, BC, Canada, 20–28 February 2022. [Google Scholar]
- Tzeng, E.; Hoffman, J.; Saenko, K.; Darrell, T. Adversarial Discriminative Domain Adaptation. In Proceedings of the Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017. [Google Scholar]
- Tsai, Y.; Sohn, K.; Schulter, S.; Chandraker, M. Domain Adaptation for Structured Output via Discriminative Patch Representations. In Proceedings of the International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019. [Google Scholar]
Parameter | Values Used | Parameter | Values Used |
---|---|---|---|
tree height range () | 30–50 m | Maximum canopy width | 7 m |
base diameter range () | 0.5–1 m | Max canopy width height | 0.4 –0.8 |
stem split/fork height range () | 0.15 –0.5 | tree top x-y distance from base range | ±2.5 m |
stem split probability | 0.15 | tree mid x-y distance from base range | ±0.5 m |
number of small branches | 60–100 | foliage point randomised height | 0.5 m |
Minimum canopy height | 0.2 –0.5 | stem point ratio () | 0.1–0.3 |
Stem | Foliage | ||||||
---|---|---|---|---|---|---|---|
Test Dataset | Model Trained on | IoU | Precision | Recall | IoU | Precision | Recall |
Tumut | Real (N50) | 0.598 | 0.927 | 0.638 | 0.974 | 0.979 | 0.995 |
Real (all) | 0.761 | 0.884 | 0.841 | 0.985 | 0.991 | 0.994 | |
Synthetic | 0.675 | 0.790 | 0.806 | 0.978 | 0.989 | 0.989 | |
HQP | Real (N50) | 0.817 | 0.858 | 0.947 | 0.945 | 0.989 | 0.955 |
Real (all) | 0.849 | 0.865 | 0.978 | 0.952 | 0.995 | 0.957 | |
Synthetic | 0.830 | 0.852 | 0.968 | 0.947 | 0.993 | 0.953 | |
DogPark | Real (N50) | 0.790 | 0.872 | 0.897 | 0.791 | 0.911 | 0.863 |
Real (all) | 0.875 | 0.917 | 0.950 | 0.880 | 0.954 | 0.918 | |
Synthetic | 0.810 | 0.854 | 0.941 | 0.797 | 0.943 | 0.837 |
Test Dataset | Model Trained On | IoU | Precision | Recall |
---|---|---|---|---|
Tumut | HQP Model | 0.596 | 0.875 | 0.644 |
DogPark Model | 0.602 | 0.859 | 0.658 | |
Synthetic Model | 0.675 | 0.790 | 0.806 | |
HQP | Tumut Model | 0.862 | 0.875 | 0.984 |
DogPark Model | 0.882 | 0.899 | 0.980 | |
Synthetic Model | 0.830 | 0.852 | 0.968 | |
DogPark | Tumut Model | 0.789 | 0.855 | 0.914 |
HQP Model | 0.798 | 0.893 | 0.882 | |
Synthetic Model | 0.810 | 0.854 | 0.941 |
Test Dataset | Synthetic Data-Trained Models | IoU | Precision | Recall |
---|---|---|---|---|
Combined Real Sites | Full Sim | 0.769 | 0.832 | 0.903 |
Full Sim -ST | 0.725 | 0.804 | 0.875 | |
Full Sim -FD | 0.709 | 0.850 | 0.803 | |
Full Sim -ST -FD | 0.591 | 0.654 | 0.830 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Bryson, M.; Wang, F.; Allworth, J. Using Synthetic Tree Data in Deep Learning-Based Tree Segmentation Using LiDAR Point Clouds. Remote Sens. 2023, 15, 2380. https://doi.org/10.3390/rs15092380
Bryson M, Wang F, Allworth J. Using Synthetic Tree Data in Deep Learning-Based Tree Segmentation Using LiDAR Point Clouds. Remote Sensing. 2023; 15(9):2380. https://doi.org/10.3390/rs15092380
Chicago/Turabian StyleBryson, Mitch, Feiyu Wang, and James Allworth. 2023. "Using Synthetic Tree Data in Deep Learning-Based Tree Segmentation Using LiDAR Point Clouds" Remote Sensing 15, no. 9: 2380. https://doi.org/10.3390/rs15092380
APA StyleBryson, M., Wang, F., & Allworth, J. (2023). Using Synthetic Tree Data in Deep Learning-Based Tree Segmentation Using LiDAR Point Clouds. Remote Sensing, 15(9), 2380. https://doi.org/10.3390/rs15092380