Pool tasks for `sp_tasks::spawn` and add execution limits

Currently [the way we implement the `sp_tasks::spawn` runtime API](https://github.com/paritytech/substrate/blob/c9af3c6ae49c3720f4320ebbd6c5f936951dfdcf/client/executor/src/native_executor.rs#L413) (which is roughly equivalent to a `std::thread::spawn`, just called from *within* the runtime and without any shared memory between the caller and the callee) has a few problems from what I can see:

 - it is unbound**: you could exhaust system resources by spawning enough tasks
 - it uses `spawn` instead of `spawn_blocking` yet it *is* blocking (it uses a blocking channel, and calling into the runtime is also effectively a blocking operation)
 - you can call it recursively without any limit

It can deadlock the whole async loop when called enough times in parallel, or called enough times recursively.

[Its docs also say that it's dangerous when used incorrectly](https://github.com/paritytech/substrate/blob/c9af3c6ae49c3720f4320ebbd6c5f936951dfdcf/primitives/tasks/src/lib.rs#L50).

(** - although due to it using `spawn` you'll probably exhaust all of the tokio worker threads and deadlock the whole process first)

What we should do to fix this:
- Soft-limit the number of tasks which can be concurrently executed; add the rest into a queue. There's no point in supporting an unbound number of tasks executing at the same time since calling into the runtime is an inherently blocking operation, and you don't have an infinite number of CPU cores to run them all anyway. We should pick a preset number of supported parallel tasks and execute only at most that many simultaneously. (This also reduces the risk of triggering network partitions if this is used in a place which can affect consensus.)
- Hard-limit the number of levels the tasks can be spawned recursively, or (preferably) even just disallow it completely; otherwise you'll still risk a deadlock

It would also be nice to add a hard-limit for the number of tasks which can be queued at the same time, but unfortunately that could be a source of non-determinism depending on how fast the already queued tasks are executed; we could however add a potential hard-limit to the *total* number of tasks.

Fortunately from what I can see no one currently really uses this API? (At least it isn't used in `polkadot`, if my grepping is correct; not sure about other chains though.)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pool tasks for `sp_tasks::spawn` and add execution limits #11227

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Pool tasks for sp_tasks::spawn and add execution limits #11227

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Pool tasks for `sp_tasks::spawn` and add execution limits #11227