Model-Based Reparameterization Policy Gradient Methods

Code for Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms. Paper accepted at NeurIPS 2023!

Authors: Shenao Zhang, Boyi Liu, Zhaoran Wang* , Tuo Zhao* (* indicates equal advising)

Installation

The code can be set up by:

git clone https://github.com/agentification/RP_PGM.git
cd RP_PGM
python setup.py develop

Basic Example

After setup, the following example can be run to train RP-DP-SN in the ant environment.

python train.py env=mbpo_ant device=cuda:0 seed=0

To train in other environments, change the env argument to the ones in ./config/env. Our code is adapted from the repository of the SVG-SAC algorithm.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.idea		.idea
config		config
dmc2gym		dmc2gym
nbs		nbs
rp_pgm		rp_pgm
.DS_Store		.DS_Store
README.md		README.md
img.png		img.png
setup.py		setup.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Model-Based Reparameterization Policy Gradient Methods

Installation

Basic Example

About

Uh oh!

Releases

Packages

Uh oh!

Languages

agentification/RP_PGM

Folders and files

Latest commit

History

Repository files navigation

Model-Based Reparameterization Policy Gradient Methods

Installation

Basic Example

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages