SwiftMOS: A Fast and Lightweight Moving Object Segmentation via Feature Flowing Direct View Transformation
Citations

WEB OF SCIENCE

0
Citations

SCOPUS

0

초록

Autonomous vehicles must recognize their surroundings and distinguish between dynamic and static objects to avoid collisions. Most recent moving object segmentation (MOS) studies project LiDAR point-cloud streams into multiple views to capture spatio-temporal cues. When a single view proves insufficient, the common strategy is to combine or convert views. However, existing methods rebuild features in full 3D space during conversion, significantly increasing time and memory costs. They also depend on high-end graphics processing units, which limits their on-board deployment. We introduce SwiftMOS, a lightweight framework centered on a direct view transformation module that rapidly converts bird's-eye view to range view without restoring 3D features. Direct view transformation leverages precomputed grids and view-specific hint maps to efficiently map coordinates, and an addition-fusion feature flow quickly merges past and current frames to capture the spatio-temporal context. Extensive quantitative and qualitative experiments on the SemanticKITTI dataset confirm the validity and real-time capability of the proposed method. SwiftMOS achieves a substantial latency reduction while maintaining competitive performance. It operates within the typical LiDAR scan rate on on-board hardware and also achieves robust performance on the Sipailou Campus Dataset, underscoring its practicality for real-world autonomous driving.

키워드

Computer Vision for TransportationDeep Learning for Visual PerceptionObject Detection, Segmentation and Categorization
제목
SwiftMOS: A Fast and Lightweight Moving Object Segmentation via Feature Flowing Direct View Transformation
저자
Lee, MinjaeKim, UngsikKim, Gun-WooLee, Suwon
DOI
10.1109/LRA.2026.3655199
발행일
2026-03
유형
Article
저널명
IEEE Robotics and Automation Letters
11
3
페이지
2873 ~ 2880