We need to run discovery once per cluster and API version, then reuse the result for parallel clients.
For example, dup a cached, already-discovered client and replace its connection. Ideally, also store the discovered endpoints in Rails.cache and refresh them only once per hour.
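
A rough sketch of the idea in plain Ruby, not tied to any real client library. `TtlCache`, `ApiClient`, `ClientPool`, `DISCOVERY_TTL` and the `discoverer` lambda are all hypothetical names made up for illustration. `TtlCache` only stands in for `Rails.cache.fetch(key, expires_in: ...)` so the example runs without Rails.

```ruby
require "monitor"

DISCOVERY_TTL = 60 * 60 # seconds; refresh discovery once per hour

# Hypothetical stand-in for Rails.cache.fetch(key, expires_in:) { ... }.
class TtlCache
  def initialize(clock: -> { Process.clock_gettime(Process::CLOCK_MONOTONIC) })
    @clock = clock
    @store = {}
    @lock = Monitor.new
  end

  # Returns the cached value, or runs the block and caches its result.
  def fetch(key, expires_in:)
    @lock.synchronize do
      value, expires_at = @store[key]
      return value if expires_at && @clock.call < expires_at

      value = yield
      @store[key] = [value, @clock.call + expires_in]
      value
    end
  end
end

# Hypothetical client: discovery is the expensive step,
# the connection is per-caller state.
class ApiClient
  attr_accessor :connection
  attr_reader :endpoints

  def initialize(cluster, api_version, discoverer)
    @cluster = cluster
    @api_version = api_version
    @discoverer = discoverer
    @connection = Object.new
  end

  def discover
    @endpoints = @discoverer.call(@cluster, @api_version).freeze
    self
  end
end

class ClientPool
  def initialize(cache:, discoverer:)
    @cache = cache
    @discoverer = discoverer
  end

  # Returns a client that shares discovery data but has its own connection.
  def client_for(cluster, api_version)
    template = @cache.fetch([cluster, api_version], expires_in: DISCOVERY_TTL) do
      ApiClient.new(cluster, api_version, @discoverer).discover
    end
    copy = template.dup              # shallow copy, endpoints are shared
    copy.connection = Object.new     # placeholder for a real connection
    copy
  end
end
```

Here the whole discovered client is kept in an in-process cache. With `Rails.cache` it is probably safer to store only the discovered endpoint data (something serializable) and build the template client from that, since not every cache store can hold a live client object.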