Training Error with DDPG/TD3/PPO/TRPO #1

LeeEuShane · 2021-09-12T03:49:13Z

Describe the bug
1)I am not able to train using DDPG/TD3/PPO/TRPO. Everything is default except the agent and timestep which was chosen by me.

To Reproduce
Steps to reproduce the behavior:
1)python train.py -agent DDPG --timesteps 100000

Expected behavior
The script should be able to run and train using the chosen agent. It worked for option_critic and dac_ppo.

Screenshots

Desktop (please complete the following information):

OS: Linux (Workstation) ; Windows 10 (PC)
Browser: Chrome
Version: 93.0.4577.63

Smartphone (please complete the following information):

Device: iphone 11
OS: iOS 14.71
Browser: Safari
Version: Can't find but it's the latest

Additional context
I think the issue is with the environment being 1 dimensional instead of giving 3 dimensions. I'm not sure how to troubleshoot this part as I don't know where the environment is taken from.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Training Error with DDPG/TD3/PPO/TRPO #1

Training Error with DDPG/TD3/PPO/TRPO #1

LeeEuShane commented Sep 12, 2021

Training Error with DDPG/TD3/PPO/TRPO #1

Training Error with DDPG/TD3/PPO/TRPO #1

Comments

LeeEuShane commented Sep 12, 2021