
融合Swin Transformer的虫害图像实例分割优化方法研究
高家军, 张旭, 郭颖, 刘昱坤, 郭安琪, 石蒙蒙, 王鹏, 袁莹
南京林业大学学报(自然科学版) ›› 2023, Vol. 47 ›› Issue (3) : 1-10.
融合Swin Transformer的虫害图像实例分割优化方法研究
Research on the optimized pest image instance segmentation method based on the Swin Transformer model
【目的】为了实现对虫害的精准监测,提出了一种融合Swin Transformer的图像实例分割优化方法,以期有效解决复杂真实场景下多幼虫个体图像识别分割困难的问题。【方法】选用Swin Transformer模型,改进Mask R-CNN实例分割模型的主干网部分,对黄野螟幼虫虫害图像进行识别分割。针对不同结构参数的Swin Transformer模型与ResNet模型,调整各层的输入输出维度,将其分别设置为Mask R-CNN的主干网进行对比实验,从定量与定性两个角度分析不同主干网的Mask R-CNN模型对黄野螟幼虫的识别分割精度与效果,确定最佳模型结构。【结果】①该方法在虫害识别框选方面的测度(F1)分数可达89.7%,平均精度(AP)可达88.0%;在虫害识别分割方面的F1分数可达84.3%,AP可达82.2%。相较于Mask R-CNN,在目标框选与目标分割方面分别提升8.75%与8.40%。②对于小目标虫害识别分割任务,该方法在虫害识别框选方面的F1分数可达88.4%,AP可达86.3%;在虫害识别分割方面的F1分数可达84.0%,AP可达81.7%。相较于Mask R-CNN,在目标框选与目标分割方面分别提升9.30%与9.45%。【结论】对于复杂真实场景下的图像实例分割任务,其识别分割效果极大地依赖于模型对图像特征的提取能力,而融合了Swin Transformer的Mask R-CNN实例分割模型,在主干网的特征提取能力更强,模型整体的识别分割效果更好,可为虫害的识别监测提供技术支撑,同时为保护农、林、牧等产业资源提供解决方案。
【Objective】To achieve accurate pest monitoring, the author proposes an optimized instance segmentation method based on the Swin Transformer to effectively solve the difficulty in image recognition and segmentation of multi-larval individuals under complex real scenarios.【Method】The Swin Transformer model was selected to improve the backbone network of the Mask R-CNN instance segmentation model and to identify and segment Heortia vitessoides larvae which harmed Aquilaria sinensis. The input and output dimensions of all layers of the Swin Transformer and ResNet models with different structural parameters were adjusted. Both models were set as the backbone networks of Mask R-CNN for comparative experiments. H. vitessoides moore larvae identification and segmentation performances for different backbone networks were quantitatively and qualitatively analyzed using Mask R-CNN models to determine the best model structure.【Result】(1) Using this method, the F1 score and AP were 89.7% and 88.0%, respectively, in terms of pest identification framing, and 84.3% and 82.2%, respectively, in terms of pest identification and segmentation, increasing by 8.75% and 8.40%, respectively, compared to that of the Mask R-CNN model in terms of target framing and segmentation. (2) For small target pest identification and segmentation tasks, the F1 score and AP were 88.4% and 86.3%, respectively, in terms of pest identification framing, 84.0% and 81.7%, respectively, in terms of pest identification and segmentation, and increased by 9.30% and 9.45%, respectively, compared to that of the Mask R-CNN model in terms of target framing and segmentation.【Conclusion】In segmentation tasks under complex real scenarios, the recognition and segmentation effects depend to a large extent on the model’s ability to extract image features. By integrating the Swin Transformer, the mask R-CNN instance segmentation model has a stronger ability to extract features in the backbone network, with a better overall recognition and segmentation effect. It could provide technical support for the identification and monitoring of pests and solutions for the protection of agriculture, forestry, animal husbandry, and other industrial resources.
虫害识别 / Swin Transformer / Mask R-CNN / 实例分割 / 土沉香 / 黄野螟
pest recognition / Swin Transformer / Mask R-CNN / instance segmentation / Aguilaria sinensis / Heortia vitessoides
[1] |
徐锡祥. 林业病虫害特点、原因及综合防治解析[J]. 新农业, 2021(22):22.
|
[2] |
张旭. 森林病虫害的发生特点及综合防治技术[J]. 农业灾害研究, 2021, 11(7):21-22.
|
[3] |
樊巍, 苑静, 赵波. 森林病虫害的发生特点及综合防治技术[J]. 农业与技术, 2019, 39(23):70-71.
|
[4] |
|
[5] |
|
[6] |
|
[7] |
李静, 陈桂芬, 安宇. 基于优化卷积神经网络的玉米螟虫害图像识别[J]. 华南农业大学学报, 2020, 41(3):110-116.
|
[8] |
|
[9] |
|
[10] |
|
[11] |
田洪宝. 基于深度卷积神经网络的林区航拍图像虫害区域分割[D]. 北京: 北京林业大学, 2019.
|
[12] |
陈冬梅, 张赫, 魏凯华, 等. 复杂背景下昆虫图像的快速分割与识别[J]. 江苏农业科学, 2021, 49(24):195-204.
|
[13] |
王卫民, 符首夫, 顾榕蓉, 等. 基于卷积神经网络的虫情图像分割和计数方法[J]. 计算机工程与科学, 2020, 42(1):110-116.
|
[14] |
张善文, 邵彧, 齐国红, 等. 基于多尺度注意力卷积网络的作物害虫检测[J]. 江苏农业学报, 2021, 37(3):579-588.
|
[15] |
谭富祥, 钱育蓉, 孔钰婷, 等. 基于Transformer的多分支单图像去雨方法[J]. 计算机应用研究, 2022, 39(8):2500-2505,2519.
|
[16] |
陈敏, 王君, 董明利, 等. 改进的Mask R-CNN多尺度实例分割算法研究[J]. 激光杂志, 2020, 41(5):40-44.
|
[17] |
|
[18] |
田应仲, 卜雪虎. 基于注意力机制与Swin Transformer模型的腰椎图像分割方法[J]. 计量与测试技术, 2021, 48(12):57-61.
|
[19] |
张重生, 陈杰, 纵瑞星, 等. 基于Transformer的低质场景字符检测算法[J]. 北京邮电大学学报, 2022, 45(2):124-130.
|
[20] |
|
[21] |
|
[22] |
蒋开彬, 祝文娟, 潘文, 等. 基于SRAP标记的土沉香遗传多样性分析[J]. 中南林业科技大学学报, 2020, 40(1):131-136.
|
[23] |
庞圣江, 张培, 杨保国, 等. 林隙大小对土沉香人工更新幼树生长发育的影响[J]. 西北农林科技大学学报(自然科学版), 2020, 48(4):83-88.
|
[24] |
张小霞. 土沉香开发利用研究进展[J]. 防护林科技, 2020(4):63-66.
|
[25] |
宋晓琛, 王西洋, 杨光, 等. 无机盐与激素混合对土沉香结香的诱导[J]. 林业科学, 2020, 56(8):121-130.
|
[26] |
洪仁辉, 尹吉锋, 陈彧, 等. 白木香重要害虫黄野螟研究进展[J]. 热带林业, 2019, 47(3):66-68.
|
[27] |
王忠, 谢伟忠, 朱诚棋, 等. 黄野螟的羽化和生殖行为节律[J]. 中国森林病虫, 2018, 37(1):24-27,30.
|
[28] |
茅裕婷, 张蒙, 靳秀芳, 等. 土沉香对黄野螟的抗性研究[J]. 华南农业大学学报, 2017, 38(6):89-96.
|
[29] |
严珍, 岳建军. 温度及补充营养对黄野螟生长发育和繁殖的影响[J]. 热带作物学报, 2019, 40(9):1789-1795.
|
[30] |
高新波, 莫梦竟成, 汪海涛, 等. 小目标检测研究进展[J]. 数据采集与处理, 2021, 36(3):391-417.
|
[31] |
|
[32] |
|
[33] |
张海燕, 徐心语, 马雪芬, 等. 超声图像中复合材料褶皱形态的Mask-RCNN识别方法[J]. 物理学报, 2022, 71(7):074302.
|
[34] |
|
[35] |
周维, 牛永真, 王亚炜, 等. 基于改进的YOLOv4-GhostNet水稻病虫害识别方法[J]. 江苏农业学报, 2022, 38(3):685-695.
|
[36] |
刘晨曦, 刘大铭, 杨芳, 等. 基于改进水平集的水稻虫害分割算法[J]. 宁夏大学学报(自然科学版), 2019, 40(3):246-254.
|
[37] |
卜俊怡, 孙国祥, 王迎旭, 等. 基于诱虫板图像的温室番茄作物害虫识别与监测方法[J]. 南京农业大学学报, 2021, 44(2):373-383.
|
/
〈 |
|
〉 |