Skip to content

GPU Monitoring bug when step advances faster than train loop steps #83

@terrykong

Description

@terrykong

The GPU monitoring advances the wandb step faster than the main training loop causing the metrics to be ignored of past steps. The solution is to use wandb custom log axes.

https://docs.wandb.ai/guides/track/log/customize-logging-axes/

Metadata

Metadata

Assignees

Labels

bugSomething isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions