Presentation + Paper
Constrained deep reinforcement learning for maritime platform defense
7 June 2024
Abstract
We present a method for maritime platform defense using constrained deep reinforcement learning (DRL), showing how competing desires to reliably defend a fleet and conserve inventory may be managed through a dual optimization strategy. Against persistent and variable raids of threats, our agents minimize inventory expenditure subject to a constraint on the average time before a threat impacts the fleet being defended. Critically, the additional inventory consideration is introduced only after the agent has learned to defend the fleet well enough to consistently satisfy the constraint. In evaluations against a realistic simulation environment and with variable multi-ship geometries, we find that our strategy may be tuned to either (1) enable the agent to make significant gains in efficiency while losing very little in terms of reliability or (2) closely track specified reliability constraints while reducing inventory expenditure even further. The result is an agent with considerably stronger long-term viability, since the conserved inventory may be used for future engagements. We speculate on the potential of this method to provide a tunable, trustworthy artificial assistant to human decision-makers tasked with defense scheduling.
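The gating idea in the abstract, where the inventory term is introduced only once the reliability constraint is consistently met, can be sketched as a dual-style weight schedule. This is a minimal illustration under stated assumptions, not the paper's implementation: the class `InventoryWeightScheduler`, the function `shaped_reward`, the parameters `min_survival_time`, `step_size`, and `warmup_patience`, and the additive form of the penalty are all hypothetical names and choices introduced here.

```python
from dataclasses import dataclass


@dataclass
class InventoryWeightScheduler:
    """Hypothetical dual-style schedule for the inventory penalty weight."""

    min_survival_time: float   # assumed constraint: average time before a threat impacts the fleet
    step_size: float = 0.01    # assumed step size for the weight update
    warmup_patience: int = 3   # consecutive satisfied evaluations required before activation
    weight: float = 0.0        # current inventory penalty weight (starts at zero)
    active: bool = False       # whether the inventory term has been switched on
    _streak: int = 0           # consecutive evaluations that satisfied the constraint

    def update(self, avg_survival_time: float) -> float:
        """Update the weight from the latest measured average survival time."""
        slack = avg_survival_time - self.min_survival_time
        if not self.active:
            # Phase 1: pure defense. Count how long the constraint has held.
            self._streak = self._streak + 1 if slack >= 0 else 0
            if self._streak >= self.warmup_patience:
                self.active = True
            return self.weight
        # Phase 2: raise the weight while there is slack, lower it on violation,
        # and never let it go negative.
        self.weight = max(0.0, self.weight + self.step_size * slack)
        return self.weight


def shaped_reward(defense_reward: float, inventory_used: float, weight: float) -> float:
    """Combine the defense reward with a weighted inventory cost (assumed additive form)."""
    return defense_reward - weight * inventory_used


if __name__ == "__main__":
    scheduler = InventoryWeightScheduler(min_survival_time=100.0, warmup_patience=2)
    for measured in (90.0, 110.0, 120.0, 150.0, 80.0):
        w = scheduler.update(measured)
        print(f"avg survival {measured:6.1f} -> inventory weight {w:.3f}")
```

In this sketch the weight stays at zero until the constraint has held for `warmup_patience` consecutive evaluations, so the agent first learns to defend; afterwards the weight grows while the constraint has slack and shrinks when it is violated. How aggressively the weight tracks the constraint is governed by `step_size`, which loosely mirrors the tuning trade-off the abstract describes.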
(2024) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Jared Markowitz, "Constrained deep reinforcement learning for maritime platform defense," Proc. SPIE 13051, Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications VI, 1305114 (7 June 2024); https://doi.org/10.1117/12.3014007
KEYWORDS
Defense and security
Network architectures
Reliability
Neural networks
Safety
Control systems
Defense systems