How many hours did you take to train agents in each substrate? #15
Hello,

Estimating training time is very difficult, since it entirely depends on the training stack, available compute, etc. There is typically a fundamental tradeoff between wall-clock time and compute. On our side, we have tried two very different training stacks: one trained populations in a bit under a week, and the other took just one day. The number of workers was also quite different in the two stacks. We recognise that compute is likely a limiting factor in training these populations, which is why we are actively working on improving the performance of the substrates, including reducing the time spent in Python and instead delegating to the underlying C++ implementation of the substrate engine (Lab2D) as soon as possible. Hope this helps.
Dear @duenez, thanks for the detailed and helpful reply. I appreciate your team's efforts to make MeltingPot a great testbed for MARL research.
@YetAnotherPolicy I am curious to know how long it takes you to train these populations! In my case, I can train 1e6 steps in almost exactly an hour using 4 RLlib workers on a machine with 64GB of RAM and an Nvidia RTX 3060 GPU.
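For reference, my setup is roughly equivalent to the sketch below (the environment id is just a placeholder, and the exact config keys may differ depending on your RLlib version, so this is not my exact code):

```python
# Rough sketch of a 4-worker PPO run with RLlib/Tune.
import ray
from ray import tune

ray.init()
tune.run(
    "PPO",
    config={
        "env": "my_meltingpot_env",  # placeholder: whatever id the substrate is registered under
        "num_workers": 4,            # parallel rollout workers
        "num_gpus": 1,               # single RTX 3060
        "framework": "torch",
    },
    stop={"timesteps_total": 1_000_000},  # ~1e6 environment steps
)
```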
Hi, in my case I use 32 workers and it takes about 8 minutes to run 1M steps. Note that this depends on the simulation speed.
I use very common Intel CPUs, 40 in total. Since the observations are RGB images, I use an A100, which can be faster than a 3090. RAM is 256GB.
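Putting the two setups side by side, the implied throughput is roughly as follows (this is nothing more precise than the numbers quoted above):

```python
# Rough steps/sec implied by the figures mentioned in this thread.
steps = 1_000_000
setup_4_workers = steps / 3600        # ~1e6 steps per hour  -> ~278 steps/s
setup_32_workers = steps / (8 * 60)   # 1M steps in ~8 min   -> ~2083 steps/s
print(f"{setup_4_workers:.0f} vs {setup_32_workers:.0f} steps/s "
      f"(~{setup_32_workers / setup_4_workers:.1f}x)")
```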
@YetAnotherPolicy Sorry, I am back with more questions! Which algorithm are you using to train?
Hi, I use PPO. Note that there is an inner training loop in each PPO update; see this link: https://github.com/openai/spinningup/blob/master/spinup/algos/pytorch/ppo/ppo.py#L265. Please also check whether RLlib uses this trick. Training with PPO takes about 1.5 days for 200M steps.
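If the inner loop is unclear: each collected batch is reused for several SGD epochs before new data is gathered (the knob is train_pi_iters / train_v_iters in Spinning Up, num_sgd_iter in RLlib's PPO). A toy sketch of that structure, where the policy, data, and loss are stand-ins rather than my actual training code:

```python
# Toy illustration of PPO's inner loop: the same rollout batch is reused for
# several gradient epochs before new data is collected. The MSE loss below is
# a stand-in for the real clipped-surrogate + value loss.
import torch

policy = torch.nn.Linear(4, 2)                        # stand-in policy network
optimizer = torch.optim.Adam(policy.parameters(), lr=3e-4)

for update in range(3):                               # outer loop: collect, then update
    obs = torch.randn(512, 4)                         # stand-in rollout batch
    target = torch.randn(512, 2)                      # stand-in PPO targets
    for epoch in range(10):                           # inner loop: reuse the same batch
        loss = ((policy(obs) - target) ** 2).mean()   # placeholder for the PPO loss
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
```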
Hello @YetAnotherPolicy, I got confused by your last message. I would like to know: did you use the RLlib library to train with those workers?
Hi, @yesfon, I did not use RLlib. |
May I ask what you used?
Hi, I use multiprocessing as well as Ray's remote actors to collect data. RLlib is also good, but it takes a lot of time to learn its APIs.
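As a rough illustration of what I mean by using Ray's remote actors to collect data (the worker below just produces dummy transitions; it is not my actual code, and a real worker would build the substrate and step a policy):

```python
# Minimal sketch of parallel rollout collection with Ray remote actors.
import random
import ray

ray.init()

@ray.remote
class RolloutWorker:
    def __init__(self, seed):
        self.rng = random.Random(seed)

    def collect(self, num_steps):
        # Pretend to step an environment; return (obs, reward)-like tuples.
        return [(self.rng.random(), self.rng.random()) for _ in range(num_steps)]

workers = [RolloutWorker.remote(i) for i in range(8)]
futures = [w.collect.remote(1000) for w in workers]
batches = ray.get(futures)  # block until all workers return their rollouts
print(sum(len(b) for b in batches), "transitions collected")
```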
Dear authors,
Thanks for building such ambitious environments for MARL research. In your paper, I found that the simulation runs for 10^9 steps per agent. To train the agents, how many rollout workers did you use, and how many hours did it take to get the final results in Table 1 (focal per-capita returns)?
Thank you.