Skip to content

WIP - RL Token#3201

Draft
s1lent4gnt wants to merge 10 commits intomainfrom
feat/RL-token
Draft

WIP - RL Token#3201
s1lent4gnt wants to merge 10 commits intomainfrom
feat/RL-token

Conversation

@s1lent4gnt
Copy link
Member

Type / Scope

  • Type: RL algorithm
  • Scope: RL Token algorithm

Summary / Motivation

…nd SAC restructuring

- Add RLAlgorithm base class and RLAlgorithmConfig with draccus.ChoiceRegistry
- Add RLTrainer for unified training orchestration with iterator pattern
- Add DataMixer and OnlineOfflineMixer for online/offline data mixing
- Restructure SAC algorithm with batch iterator and factory pattern
- Add observation normalization pre/post processors
- Add comprehensive tests for all new components
@s1lent4gnt s1lent4gnt self-assigned this Mar 23, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant