REINFORCE algo don’t learn
Im doing the reinforce algo to solve gym env, I try it for discrete action space and everything seems good. But when I tried it on continious action space my algo don’t learn. The thing that i dont understand that there is only one function (and one network) that differ for discrete and continious action space.