Peduncle Detection of Ripe Strawberry to Localize Picking Point using DF-Mask R-CNN and Monocular Depthopen access
- Authors
- Tamrakar, Niraj; Paudel, Bhola; Karki, Sijan; Deb, Nibas Chandra; Arulmozhi, Elanchezhian; Kook, Jung Hoo; Kang, Myeong Yong; Kang, Dae Yeong; Ogundele, Oluwasegun Moses; Nakarmi, Bikash; Byung-Eun, Moon; Kim, Hyeon Tae
- Issue Date
- Apr-2025
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- Instance segmentation; Mask R-CNN; Monocular depth estimation; Peduncle detection; Picking Point estimation; Ripeness Detection
- Citation
- IEEE Access, v.13, pp 73889 - 73902
- Pages
- 14
- Indexed
- SCIE
SCOPUS
- Journal Title
- IEEE Access
- Volume
- 13
- Start Page
- 73889
- End Page
- 73902
- URI
- https://scholarworks.gnu.ac.kr/handle/sw.gnu/78319
- DOI
- 10.1109/ACCESS.2025.3564288
- ISSN
- 2169-3536
2169-3536
- Abstract
- Accurate localization of picking points and depth estimation is critical for implementing a robotic strawberry harvesting system. Due to the delicate nature of strawberries, harvesting must be performed without bruising or damage, typically by grasping and cutting the peduncle of the ripe strawberry. However, accurately detecting and localizing the thin peduncle in a cluttered environment is a significant challenge. This study proposed depth fused Mask R-CNN (DF-Mask R-CNN), which integrates depth information of the scene with the RGB image to enhance the detection, localization, and segmentation of strawberries and their peduncles in a greenhouse environment. To generate a dense depth map, a cutting-edge monocular depth estimator, ZoeDepth was used. The proposed DF-Mask R-CNN with ResNet101-FPN exhibited superior instance segmentation performance, with an overall mAP of 81.9%, with mAPsmall at 33.3%, mAPmedium at 78.79%, mAPlarge at 88.8 and APIOU=0.5 at 98.1%. In tests with 300 ripe strawberry samples, the method demonstrated a robust picking point detection, with a mean absolute error and root mean square error of 1.98 cm and 2.12 cm, respectively. These results highlight the effectiveness of the DF-Mask R-CNN model combined with the ZoeDepth estimator in enhancing the detection, localization, and segmentation of strawberries and their peduncles. This approach enables precise picking point localization and depth estimation for efficient vision systems for robotic strawberry harvesting. © 2013 IEEE.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - 농업생명과학대학 > 생물산업기계공학과 > Journal Articles
- 학과간협동과정 > 스마트팜학과 > Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.