The rationale around asking for something like this would be to burden the main machine I am actually from a bit less compared to the machine I use only for AI usage. I am sure this is something other users are thinking too and want.
Would be a great feature to optimize / let the user choose the number of layers per device or at least some % of usage per device.