How do I build the following neural network for stochastic optimal control in Keras?
I’m a NN newbie. I would like to emulate the following NN architecture, taken from this paper. The goal of the stochastic optimal control problem is to minimise a total cost C_T
. In the diagram below, s_i
, a_i
and xi_i
are the state of the system, the action, and the noise at step i
respectively. The system evolves according to