News
The formulation of the problem and various versions of dynamic programming method are discussed. These versions are experimentally evaluated on randomly generated instances. We draw conclusions about ...
In this work, a novel value function-based reinforcement learning (RL) approach, descending dynamic policy programming (DDPP) is proposed to address the issues of sample-efficiency and learning ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results