News

In this letter, we consider the application of max-plus-linear approximators for Q-function in offline reinforcement learning of discounted Markov decision processes. In particular, we incorporate ...
An approach to safe and fast online learning of constraints for a continuous-time linear system subject to linear inequality constraints is developed, assuming that the number of constraints is known ...