Paper
7 June 2023 Deep learning based compressed sensing in machine vision: an iterative approach to multi object detection
A. Birk, K. Frenner, W. Osten
Author Affiliations +
Proceedings Volume 12701, Fifteenth International Conference on Machine Vision (ICMV 2022); 1270109 (2023) https://doi.org/10.1117/12.2683929
Event: Fifteenth International Conference on Machine Vision (ICMV 2022), 2022, Rome, Italy
Abstract
The use of Compressed Sensing (CS) and Deep Learning (DL) techniques in Machine Vision (MV) is an area of significant research interest. It is especially promising in the area of image free MV, where one deliberately skips the image reconstruction step and performs the information extraction directly on a set of highly compressed raw data, as provided e.g. through CS schemes. These approaches tend to perform well on simplified data sets where there is a single object to detect with otherwise little background. To make them useful in practice, they need to be able to detect multiple objects from this raw data. In this work, we present an expansion to our own DL based detection scheme that satisfies this condition. Its defining feature is that it works without requiring extra data acquisition steps compared to the single object case. We will discuss trainability and robustness aspects as well as the mathematical background that enables the concept. Finally, we will show this implementation in action.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
A. Birk, K. Frenner, and W. Osten "Deep learning based compressed sensing in machine vision: an iterative approach to multi object detection", Proc. SPIE 12701, Fifteenth International Conference on Machine Vision (ICMV 2022), 1270109 (7 June 2023); https://doi.org/10.1117/12.2683929
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Object detection

Neural networks

Compressed sensing

Binary data

Pose estimation

Deep learning

Machine vision

Back to Top