News

The formulation of the problem and various versions of the dynamic programming method are discussed. These versions are experimentally evaluated on randomly generated instances. We draw conclusions about ...
In this work, a novel value function-based reinforcement learning (RL) approach, descending dynamic policy programming (DDPP), is proposed to address the issues of sample efficiency and learning ...