[rllib] Improve IMPALA examples and premerge #59927

Merged
aslonnie merged 55 commits into ray-project:master from pseudo-rnd-thoughts:impala-premerge-nightly
Jan 17, 2026

Conversation

@pseudo-rnd-thoughts
Member

Description

Updates the IMPALA examples and premerge with CartPole and TicTacToe.
We only have a minimal number of examples as most users should use PPO or APPO rather than IMPALA.

Mark Towers and others added 30 commits November 21, 2025 17:06
Signed-off-by: Mark Towers <mark@anyscale.com>
# Conflicts:
#	rllib/BUILD.bazel
# Conflicts:
#	rllib/BUILD.bazel
#	rllib/examples/algorithms/bc/cartpole_bc.py
#	rllib/examples/algorithms/bc/cartpole_bc_with_offline_evaluation.py
#	rllib/examples/algorithms/bc/pendulum_bc.py
#	rllib/examples/algorithms/iql/pendulum_iql.py
#	rllib/examples/algorithms/marwil/cartpole_marwil.py
# Conflicts:
#	rllib/BUILD.bazel
#	rllib/examples/algorithms/appo/cartpole_appo.py
#	rllib/examples/algorithms/appo/halfcheetah_appo.py
#	rllib/examples/algorithms/appo/multi_agent_cartpole_appo.py
#	rllib/examples/algorithms/appo/multi_agent_pong_appo.py
#	rllib/examples/algorithms/appo/multi_agent_stateless_cartpole_appo.py
#	rllib/examples/algorithms/appo/pendulum_appo.py
#	rllib/examples/algorithms/appo/pong_appo.py
#	rllib/examples/algorithms/appo/stateless_cartpole_appo.py
# Conflicts:
#	rllib/BUILD.bazel
# Conflicts:
#	rllib/examples/algorithms/appo/halfcheetah_appo.py
#	rllib/examples/algorithms/appo/multi_agent_cartpole_appo.py
#	rllib/examples/algorithms/appo/multi_agent_pong_appo.py
#	rllib/examples/algorithms/appo/multi_agent_stateless_cartpole_appo.py
#	rllib/examples/algorithms/appo/pendulum_appo.py
#	rllib/examples/algorithms/appo/pong_appo.py
#	rllib/examples/algorithms/appo/stateless_cartpole_appo.py
#	rllib/utils/test_utils.py
@@ -1,4 +1,51 @@
"""Example showing how to run IMPALA on the CartPole environment.
Contributor

Great descriptions!

Signed-off-by: Kamil Kaczmarek <kamil@anyscale.com>
@kamil-kaczmarek
Contributor

IMPALA uses Aggregator Actors, so I added them to the premerge args.

@kamil-kaczmarek
Contributor

We also need the "exclusive" tag, so I added it.

@kamil-kaczmarek kamil-kaczmarek requested a review from a team as a code owner January 17, 2026 00:17
Contributor

@kamil-kaczmarek kamil-kaczmarek left a comment

LGTM!

@aslonnie aslonnie merged commit 1d5f9f8 into ray-project:master Jan 17, 2026
6 checks passed
limarkdcunha pushed a commit to limarkdcunha/ray that referenced this pull request Jan 18, 2026
## Description
Updates the IMPALA examples and premerge with CartPole and TicTacToe.
We only have a minimal number of examples as most users should use PPO
or APPO rather than IMPALA.

---------

Signed-off-by: Mark Towers <mark@anyscale.com>
Signed-off-by: Kamil Kaczmarek <kamil@anyscale.com>
Co-authored-by: Mark Towers <mark@anyscale.com>
Co-authored-by: Kamil Kaczmarek <kamil@anyscale.com>
Signed-off-by: Limark Dcunha <limarkdcunha@gmail.com>
jinbum-kim pushed a commit to jinbum-kim/ray that referenced this pull request Jan 29, 2026
Signed-off-by: jinbum-kim <jinbum9958@gmail.com>
ryanaoleary pushed a commit to ryanaoleary/ray that referenced this pull request Feb 3, 2026
peterxcli pushed a commit to peterxcli/ray that referenced this pull request Feb 25, 2026
Signed-off-by: peterxcli <peterxcli@gmail.com>
Labels

docs: An issue or change related to documentation
go: add ONLY when ready to merge, run all tests
rllib: RLlib related issues

3 participants