Modern air defense confrontations demand rapid, precise task assignments in environments where threats evolve within seconds.
This paper proposes a missile manoeuvring algorithm based on hierarchical proximal policy optimization (PPO) reinforcement learning, which enables a missile to guide itself to a target and ...