JOURNAL OF NANJING FORESTRY UNIVERSITY ›› 2024, Vol. 48 ›› Issue (4): 93-103. doi: 10.12302/j.issn.1000-2006.202209055

Special Issue: Special Report III: Forest Visualization Research in Smart Forestry


UAV forestry land-cover image segmentation method based on attention mechanism and improved DeepLabV3+

ZHAO Yugang1, LIU Wenping1,*, ZHOU Yan1, CHEN Riqiang1, ZONG Shixiang2, LUO Youqing2

  1. School of Information, Beijing Forestry University, Engineering Research Center for Forestry-oriented Intelligent Information Processing of National Forestry and Grassland Administration, Beijing 100083, China
  2. School of Forestry, Beijing Forestry University, Beijing 100083, China
  • Received: 2022-09-24 Revised: 2022-11-01 Online: 2024-07-30 Published: 2024-08-05
  • Contact: LIU Wenping E-mail: 15621377528@163.com; wendyl@vip.163.com

Abstract:

【Objective】This study proposes Tree-DeepLab, a land-cover segmentation method for unmanned aerial vehicle (UAV) forest images that combines attention mechanisms with the DeepLabV3+ semantic segmentation network, to extract the distribution of the main land-cover types in forest areas.【Method】First, the forest images were annotated into six land-cover categories (Platanus orientalis, Ginkgo biloba, Populus sp., grassland, road, and bare ground) to build a semantic segmentation dataset. Second, the following improvements were made to the semantic segmentation network: (1) Xception, the backbone of DeepLabV3+, was replaced by ResNeSt101, which uses a split attention mechanism; (2) the atrous convolutions with different dilation rates in the atrous spatial pyramid pooling module were connected in a combination of serial and parallel forms, and the set of dilation rates was changed; (3) a shallow feature fusion branch was added to the decoder; (4) spatial attention modules were added to the decoder; and (5) efficient channel attention modules were added to the decoder.【Result】Training and testing were performed on an in-house dataset. The Tree-DeepLab model achieved a mean pixel accuracy (mPA) of 97.04% and a mean intersection over union (mIoU) of 85.01%, exceeding the original DeepLabV3+ by 4.03 and 14.07 percentage points, respectively, and outperforming U-Net and PSPNet.【Conclusion】The Tree-DeepLab semantic segmentation model can effectively segment UAV aerial images of forest areas and recover the distribution of the main land-cover types in them.
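
The abstract reports mPA and mIoU without defining them. The sketch below shows how the two metrics are commonly computed from a confusion matrix; it is an illustration under stated assumptions, not the authors' evaluation code. The integer label order in CLASSES, the helper names, and the choice to skip classes that never occur are all assumptions made for this example, and the toy labels are not data from the paper.

```python
from typing import List, Sequence, Tuple

# Six land-cover classes named in the abstract; the integer label order
# below is an assumption made for this illustration.
CLASSES = ["Platanus orientalis", "Ginkgo biloba", "Populus sp.",
           "grassland", "road", "bare ground"]


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int],
                     num_classes: int) -> List[List[int]]:
    """cm[t][p] counts pixels whose true class is t and predicted class is p."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def mean_pa_and_miou(cm: List[List[int]]) -> Tuple[float, float]:
    """Return (mPA, mIoU) under the common per-class definitions.

    PA_c  = TP_c / (pixels whose ground truth is c)
    IoU_c = TP_c / (ground-truth pixels of c + predicted pixels of c - TP_c)
    Classes with an empty denominator are skipped; this is an assumption,
    and other evaluation scripts may count them as zero instead.
    """
    n = len(cm)
    pas, ious = [], []
    for c in range(n):
        tp = cm[c][c]
        gt = sum(cm[c])                         # row sum: ground-truth pixels
        pred = sum(cm[r][c] for r in range(n))  # column sum: predicted pixels
        union = gt + pred - tp
        if gt > 0:
            pas.append(tp / gt)
        if union > 0:
            ious.append(tp / union)
    mpa = sum(pas) / len(pas) if pas else 0.0
    miou = sum(ious) / len(ious) if ious else 0.0
    return mpa, miou


if __name__ == "__main__":
    # Toy flattened label maps (six pixels), not data from the paper.
    y_true = [0, 0, 1, 1, 2, 2]
    y_pred = [0, 1, 1, 1, 2, 0]
    cm = confusion_matrix(y_true, y_pred, len(CLASSES))
    mpa, miou = mean_pa_and_miou(cm)
    print(f"mPA = {mpa:.4f}, mIoU = {miou:.4f}")  # mPA = 0.6667, mIoU = 0.5000
```

In the toy example, mIoU falls below mPA because IoU also penalizes false positives: class 0 gets half of its pixels right (PA 0.5) but also absorbs one pixel of class 2, so its IoU drops to 1/3. The same effect is consistent with the gap between the reported 97.04% mPA and 85.01% mIoU.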

Key words: unmanned aerial vehicle (UAV), land-cover image segmentation, forestry images, DeepLabV3+, attention mechanism, ResNeSt
