DSORT-MCU: Detecting Small Objects in Real-Time on Microcontroller Units


METADATA ONLY
Loading...

Date

2024-12-15

Publication Type

Journal Article

ETH Bibliography

yes

Citations

Altmetric
METADATA ONLY

Data

Rights / License

Abstract

Advances in lightweight neural networks have revolutionized computer vision in a broad range of Internet of Things (IoT) applications, encompassing remote monitoring and process automation. However, the detection of small objects, which is crucial for many of these applications, remains an underexplored area in current computer vision research, particularly for low-power embedded devices that host resource-constrained processors. To address said gap, this paper proposes an adaptive tiling method for lightweight and energy-efficient object detection networks, including YOLO-based models and the popular Faster Objects More Objects (FOMO) network. The proposed tiling enables object detection on low-power Microcontroller Units (MCUs) with no compromise on accuracy compared to large-scale detection models. The benefit of the proposed method is demonstrated by applying it to FOMO and TinyissimoYOLO networks on a novel RISC-V -based MCU with built-in Machine Learning (ML) accelerators. Extensive experimental results show that the proposed tiling method boosts the F1-score by up to 225% for both FOMO and TinyissimoYOLO networks while reducing the average object count error by up to 76% with FOMO and up to 89% for TinyissimoYOLO. Furthermore, the findings of this work indicate that using a soft F1 loss over the popular binary cross-entropy loss can serve as an implicit non-maximum suppression for the FOMO network. To evaluate the real-world performance, the networks are deployed on the RISC-V based GAP9 microcontroller from GreenWaves Technologies , showcasing the proposed method’s ability to strike a balance between detection performance (58% − 95% F1 score), low latency (0.6ms/Inference - 16.2ms/Inference), and energy efficiency (31 μJ/Inference - 1.27mJ/Inference) while performing multiple predictions using high-resolution images on a MCU.

Publication status

published

Editor

Book title

Volume

24 (24)

Pages / Article No.

40231 - 40239

Publisher

IEEE

Event

Edition / version

Methods

Software

Geographic location

Date collected

Date created

Subject

IoT; Microcontrollers; Object detection; TinyML

Organisational unit

03996 - Benini, Luca / Benini, Luca check_circle

Notes

Funding

Related publications and datasets