Model-based RL | How to convert a custom Gym environment with discrete actions to continuous
I am working on a project where I have a Gym environment with a discrete action space and a continuous observation space similar to the CartPole environment from Gym. I am trying to convert the discrete action space to continuous so that the environment will work with model-based RL algos that only work with continuous actions and observations. What is the way to do this? Thank you in advance.