Abstract: Applications of reinforcement learning (RL) have become common in many decision-making problems. One of these applications is the air combat maneuver decision problem. It is still an open ...