JOURNAL OF NANJING FORESTRY UNIVERSITY ›› 2020, Vol. 44 ›› Issue (4): 215-221.doi: 10.3969/j.issn.1000-2006.201904004

Previous Articles     Next Articles

Transfer learning based recognition for forestry business images

LIN Chaojian1(), ZHANG Guangqun1, YANG Jie2, XU Peng2, LI Yingjie2, WANG Hangjun2()   

  1. 1.School of Information Engineering, Zhejiang A & F University, Hangzhou 311300, China
    2.Jiyang College, Zhejiang A & F University, Zhuji 311800, China
  • Received:2019-04-01 Revised:2019-06-27 Online:2020-07-22 Published:2020-08-13
  • Contact: WANG Hangjun E-mail:980368108@qq.com;whj@zafu.edu.cn

Abstract: Objective

The recognition and classification of forestry business images are helpful for forestry administrative departments to make reasonable disposal plans, instruct others, and communicate decisions for relevant events. Therefore, full information can be provided to the forest protection personnel, which will improve the level of forest management and protection. Then, the protection of forest resources and ecological security can be achieved.

Method

We proposed an automatic classification model of convolutional neural networks based on the transfer learning for forestry business images. Four convolutional neural network models were pre-trained using a large-scale auxiliary image dataset, called ImageNet. Then, the forestry business image data, with a relatively small number of images, were used to transfer the trained model. The classification accuracy was compared and analyzed to select the model with the best classification effect. In this study, four convolutional neural network models were selected.The forestry business image dataset used for the model training and testing contained four classes, including animal death (AD), forest harvesting (FH), forest fire (FF), and forest pests (FP). The dataset included a total of 280 images, with 70 images in each class. The forestry business image dataset was divided into a training and test set, which accounted for 90% and 10%, respectively. The 10-fold cross-validation method was used to obtain the final test accuracy rate. A new fully connected layer was used to replace the original fully connected layer, while the other layers remained unchanged. The transfer learning method was divided into two steps. The Adam optimization algorithm was used to train the models. In addition, to compare the impacts of different data enhancement methods on the classification effect, three experimental scenarios were used in this study.

Result

The transfer learning models that we used with the four pre-trained convolutional neural network structures all had a higher classification accuracy compared with the traditional method. In particular, Inception-v3 had the highest test accuracy rate, reaching 96.4%, and the unit inference time was only 1.38 s. Inception-v1’s accuracy rate was slightly higher than Inception-v2’s. VGG-16 had the lowest accuracy rate, at only 92.1%, and the unit inference time was also the longest, at 2.42 s. Concerning the four categories of forestry business image datasets, the Inception-v3-based transfer learning model had the highest recognition accuracy. Compared with the VGG-16 model, the classification accuracy of Inception-v3 had a better recognition performance at a shorter recognition time. The experimental results concerning the different data enhancement methods showed that, compared with the test accuracy rate without data enhancement, the test accuracy rate after enhancing the data by 4 times was still 96.4% after 6 000 training steps. However, the test accuracy rate was approximately 96.8% between 1 000 and 3 000 steps, which was a slight improvement. Comparing the curves of the models with no-data enhancement and data enhanced 16 times, we found that the test accuracy rate of the enhanced dataset decreased, and as the number of training iterations increased the test accuracy rate became lower. In addition, the fluctuation of the test accuracy rate curve of the dataset enhanced 16 times was more pronounced compared with that of the other two cases. To analyze the “understanding” of the forestry business image dataset based on the Inception-v3 transfer learning model, a confusion matrix was constructed. It was found that FH was more likely to be confused with AD, while FP was more likely to be confused with FH. In contrast, FF was rarely confused with the other classes. There were obvious red flame features on the FF images, which were rare in the images of the other classes, so it was easy to distinguish between them. However, the other three classes did not have such distinguishing features, so there was confusion on some occasions. For the traditional classification methods, the extracted features were manually selected and had poor generalization characteristics. In this study, the traditional methods had a relatively poor classification accuracy concerning forestry business image classification.

Conclusion

Compared with the traditional feature extraction recognition methods, the convolutional neural network model had stronger classification capabilities. By modifying the network structure, the accuracy rate can be effectively improved. Therefore, it is feasible to use convolutional neural network models based on the transfer learning for forestry business image classification. With the improved recognition rate of convolution neural network models, forest rangers can play a greater role in forest protection.

Key words: forestry business image, transfer learning, forest manage and protect, convolutional neural network, image recognition

CLC Number: