News
In this letter, we consider the application of max-plus-linear approximators for the Q-function in offline reinforcement learning of discounted Markov decision processes. In particular, we incorporate ...
An approach to safe and fast online learning of constraints for a continuous-time linear system subject to linear inequality constraints is developed, assuming that the number of constraints is known ...