Complex behaviours can make it difficult for human observers to maintain a coherent understanding of a high-dimensional system’s state due to the large number of degrees of freedom that have to be monitored and reasoned about. This problem can lead to cognitive overload in operators who are monitoring these systems. An example of this is the problem of observing drone swarms to determine their behaviour and infer possible goals. Generative artificial intelligence techniques, such as variational autoencoders (VAEs), can be used to assist operators in understanding these complex behaviours by reducing the dimensionality of the observations.
This paper presents a modified boid simulation that produces data representative of a swarm of coordinated drones. A sensor model is employed to simulate observation noise. A VAE architecture is proposed that can encode data from observations of homogeneous swarms and produce visualisations detailing the potential states of the swarm, the current state of the swarm, and the goals that these states relate to. One of the challenges addressed in this paper is the permutation variance problem of working with large datasets of points which represent interchangeable, unlabelled objects. The proposed VAE architecture addresses this through a PointNet-inspired layer that implements a symmetric function approximation, together with a chamfer distance loss function. An ablation study of the proposed permutation invariance modifications and a sensitivity analysis of the algorithm’s behaviour with respect to sensor noise are presented. The use of the decoder to create goal boundaries on the visualisation, the use of the visualisation for swarm trajectories, and the explainability of the visualisation are discussed.
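The two permutation-invariance ingredients named above can be illustrated with a minimal NumPy sketch. It is not the paper's architecture: the function names, the untrained random weights, the feature width of 16, and the 2D swarm positions are all assumptions made for illustration. The sketch shows a shared per-point map followed by max pooling (a symmetric function in the PointNet style) and a symmetric chamfer distance between two unordered point sets.

```python
import numpy as np

def chamfer_distance(a, b):
    """Symmetric chamfer distance between point sets a (n, d) and b (m, d)."""
    # Pairwise squared Euclidean distances, shape (n, m).
    diff = a[:, None, :] - b[None, :, :]
    d2 = np.sum(diff ** 2, axis=-1)
    # Mean nearest-neighbour squared distance in each direction.
    return d2.min(axis=1).mean() + d2.min(axis=0).mean()

def symmetric_encode(points, weights):
    """PointNet-style feature: shared per-point map, then max pooling."""
    per_point = np.maximum(points @ weights, 0.0)  # shared linear map + ReLU
    return per_point.max(axis=0)                   # pooling ignores point order

rng = np.random.default_rng(0)
swarm = rng.normal(size=(50, 2))         # hypothetical 2D drone positions
shuffled = swarm[rng.permutation(50)]    # same swarm, different ordering
weights = rng.normal(size=(2, 16))       # hypothetical, untrained weights

same_set_loss = chamfer_distance(swarm, shuffled)
noisy_loss = chamfer_distance(swarm, swarm + 0.1 * rng.normal(size=swarm.shape))
```

Because both functions depend only on the set of points and not on their order, reordering the swarm leaves the encoding unchanged and gives a chamfer distance of zero, whereas a per-index loss such as mean squared error would penalise the reordering.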
This work uses a convolutional neural network to develop an optimal sampling strategy for LIDAR remote sensing. Detecting the distance to objects is important for autonomous vehicles, surveying, and other remote sensing applications. LIDAR systems detect distances using a pulsed laser and a time-of-flight system to measure the position of all objects in a scene; however, they are limited in the maximum distance they can measure due to low signal return. A convolutional neural network has been used to develop both a sampling basis that samples the scene effectively and the reconstruction algorithm that recreates the 3D scene.
We present a prototype light detection and ranging (lidar) system that compressively samples the scene using our deep learning optimised sampling basis and reconstruction algorithms. This approach improves scene reconstruction quality compared to an orthogonal sampling method, with reflectivity and depth accuracy improvements for one frame per second acquisition rates. This method may pave the way for improved scan-free lidar systems for driverless cars and for fully optimised sampling through to decision-making pipelines.
The requirement for 3D imaging is a challenge across a range of sectors including gaming, robotics, healthcare and automotive industries. Mature technologies such as radar and ultrasound sensing are effective at long and short ranges respectively. Because lidar is capable of millimetric depth precision and good spatial resolution at ranges of around 100 m, it has become a key technology in this area, with depth information typically gained through time-of-flight photon-counting measurements of a scanned laser spot. Single-pixel imaging (SPI) is an alternative imaging modality for recovering spatial information. SPI methods offer an alternative approach to spot-scanning, which allows a choice of sampling basis. Unlike scanning systems, the freedom to choose the sampling basis in SPI provides the opportunity to use compressed sensing techniques, where a high-quality image can be reconstructed from a number of measurements that is fewer than the number of pixels in the image. Compressed sensing has been demonstrated using an optimised imaging basis and reconstruction algorithm derived from a trained convolutional neural network. This deep learning approach achieves a 4% compression ratio, enabling lidar imaging using 25 times fewer measurements such that faster acquisition times can be used.
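The relationship between the 4% compression ratio and the factor of 25 can be checked with a short sketch of a single-pixel measurement model. The 32 × 32 image size, the random binary patterns, and the function name are assumptions for illustration only; the text describes a sampling basis learned by a convolutional neural network, which is not reproduced here.

```python
import numpy as np

def single_pixel_measure(scene, patterns):
    """Each measurement is the total light collected through one pattern."""
    return patterns @ scene.ravel()

side = 32                                  # hypothetical image side length
n_pixels = side * side                     # 1024 unknowns to recover
compression_ratio = 0.04                   # figure quoted in the text
n_measurements = round(compression_ratio * n_pixels)

rng = np.random.default_rng(0)
# Stand-in random binary patterns; the learned basis would replace these.
patterns = rng.integers(0, 2, size=(n_measurements, n_pixels)).astype(float)
scene = rng.random((side, side))           # hypothetical reflectivity map

signal = single_pixel_measure(scene, patterns)
reduction_factor = 1.0 / compression_ratio  # 25 for a 4% ratio
```

For this assumed image size, 41 patterns replace the 1024 measurements a full orthogonal basis would need, which is where the shorter acquisition time comes from. Recovering the scene from so few measurements is the job of the reconstruction algorithm and is outside this sketch.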
Gathering information about objects hidden from the field of view is an extremely relevant problem in many areas of science and technology. Some state-of-the-art techniques are able to detect and image an object behind an obstacle at the cost of high computational and processing times. Alternatively, other methods can track the object in real time without giving information on the object's shape. Here we make use of a non-scanning ultrashort pulsed light source, a Single-Photon Avalanche Diode (SPAD), and artificial neural networks (ANNs) to demonstrate a system that can detect, identify, and track objects hidden from view. SPAD technology, characterised by a temporal resolution of 100 ps, provides us with the time traces of the light back-scattered by the environment (including the hidden object). By using different known objects placed at different known positions, we generate a library of time traces that are used to train the ANN algorithm. The application of the trained ANN algorithm in an experimental scenario allows us to identify unknown objects hidden from view in real time with cm resolution. These results open new routes for exciting novel machine learning applications with high impact in the fields of machine vision, self-driving cars, and defence.
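The link between the 100 ps temporal resolution and the quoted cm-scale resolution can be made concrete with a small time-of-flight calculation. This is a simplified sketch that assumes a direct out-and-back light path; the hidden-object geometry described above involves additional scattering surfaces, so the figure is indicative only. The function name is hypothetical.

```python
C = 299_792_458.0  # speed of light in vacuum, m/s

def round_trip_range(delay_s):
    """Distance for a direct round trip lasting delay_s seconds (assumed geometry)."""
    return C * delay_s / 2.0

timing_resolution_s = 100e-12              # temporal resolution quoted in the text
range_resolution_m = round_trip_range(timing_resolution_s)
range_resolution_cm = 100.0 * range_resolution_m
```

Under this assumption a 100 ps timing bin corresponds to roughly 1.5 cm of range, which is consistent in order of magnitude with the cm resolution reported.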