Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
EdanToledo authored Mar 10, 2024
1 parent 01f1e92 commit 4c81ff8
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -73,9 +73,9 @@ Stoix natively supports logging to json files which adhere to the standard sugge
As the code in Stoix (at the time of creation) was in essense a port of [Mava][mava], for further speed comparisons we point to their repo. Additionally, we refer to the PureJaxRL blog post [here](https://chrislu.page/blog/meta-disco/) where the speed benefits of end-to-end JAX systems are discussed.

Below we provide some plots illustrating that Stoix performs equally to that of [PureJaxRL][purejaxrl] but with the added benefit of the code being already set up for `pmap` distribution over devices as well as the other features provided (algorithm implementations, logging, config system, etc).

<img src="docs/images/ppo_compare.png" alt="ppo" width="500"/> <img src="docs/images/dqn_compare.png" alt="dqn" width="500"/>

<p align="center">
<img src="docs/images/ppo_compare.png" alt="ppo" width="45%"/> <img src="docs/images/dqn_compare.png" alt="dqn" width="45%"/>
</p>
I've also included a plot of the training time for 5e5 steps of PPO as one scales the number of environments. PureJaxRL does not pmap and thus runs on a single a device.

<p align="center">
Expand Down

0 comments on commit 4c81ff8

Please sign in to comment.