Does anyone have a recommendation for how I can fit the model across multiple 24 GB GPUs?