Abstract: This article proposes a novel Q-learning algorithm that relies solely on input-output data to address the output regulation control problem of complex discrete-time systems affected by ...