Machine creation retry too frequently for machines with `ResourceExhausted`

**How to categorize this issue?**
/area robustness
/kind enhancement
/priority 3

**What would you like to be added**:

- Add a check for `codes.ResourceExhausted` that applies a longer retry period.

https://github.com/gardener/machine-controller-manager/blob/1e2563f85f5b6912aeb7ea42b5c5264ca67d667a/pkg/util/provider/machinecontroller/machine_util.go#L503-L515

**Why is this needed**:
Currently, machines that fail with `codes.ResourceExhausted` are retried using `machineutils.MediumRetry`, which is every 3 minutes.
When a resource in the underlying infrastructure is exhausted, it's unlikely that this will change in that short period of time.

`provider-openstack` first creates the `volume` and then the `machine`.
Every retry repeats this sequence, so depending on the size of the `nodePool`, this can lead to a large number of unnecessary create/delete API calls.

	func (c controller) machineCreateErrorHandler(ctx context.Context, machine v1alpha1.Machine, createMachineResponse *driver.CreateMachineResponse, err error) (machineutils.RetryPeriod, error) {
		var (
			retryRequired  = machineutils.MediumRetry
			lastKnownState string
		)
		machineErr, ok := status.FromError(err)
		if ok {
			switch machineErr.Code() {
			case codes.Unknown, codes.DeadlineExceeded, codes.Aborted, codes.Unavailable:
				retryRequired = machineutils.ShortRetry
				lastKnownState = machine.Status.LastKnownState
			}
		}

Machine creation retry too frequently for machines with `ResourceExhausted` #977