News
In this letter, we consider the application of max-plus-linear approximators for Q-function in offline reinforcement learning of discounted Markov decision processes. In particular, we incorporate ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results