Skip to content

agentification/RP_PGM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Model-Based Reparameterization Policy Gradient Methods

Code for Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms. Paper accepted at NeurIPS 2023!

Authors: Shenao Zhang, Boyi Liu, Zhaoran Wang* , Tuo Zhao* (* indicates equal advising)

RP algo

Installation

The code can be set up by:

git clone https://github.com/agentification/RP_PGM.git
cd RP_PGM
python setup.py develop

Basic Example

After setup, the following example can be run to train RP-DP-SN in the ant environment.

python train.py env=mbpo_ant device=cuda:0 seed=0 

To train in other environments, change the env argument to the ones in ./config/env. Our code is adapted from the repository of the SVG-SAC algorithm.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published