Conversation
Force-pushed from eaede77 to 906b9dc.
inqueue left a comment:
Thank you for this. I left some minor suggestions.
Co-authored-by: Jason Bryan <jbryan@elastic.co>
Co-authored-by: Brad Deam <54515790+b-deam@users.noreply.github.com>
gbanasiak left a comment:
Many thanks! This helps in building the Rally mental map.
I went through PrepareBenchmark so far. A few remarks, mostly a wish list for later:
- For consistency, `TrackPreparator` should be renamed to `TrackPreparationActor` in the charts, as that's the name of the class, I think.
- It would be great to include a 10,000-foot description of what each component is doing (see the sketch after this list), e.g. 1) `TrackPreparationActor` loads a track and track plugins (custom extensions such as parameter sources and runners), gets a list of track processors (the `TrackProcessor` interface) from `TrackProcessorRegistry` and stores them in a queue. It then sends each processor for execution to `TaskExecutionActor`. 2) `TaskExecutionActor` executes each processor in a background task and checks its status periodically. Once the task completes, it reports its readiness with `WorkComplete`. I'm not sure what level of detail makes sense here: too much is just a repetition of the code, too little forces us to go through the flow again if we revisit this after enough time.
- It would be nice to explain how individual components grow as the scale of the system increases, e.g. the set of `TrackPreparationActor`s grows with the number of load driver hosts, while the set of `TaskExecutionActor`s can grow with the number of cores declared per load driver.
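To make that flow concrete, here is a minimal, hypothetical sketch using Thespian-style actors. Only `TrackPreparationActor`, `TaskExecutionActor`, `WorkComplete` and `TrackProcessorRegistry` are names taken from the comment above; the `ExecuteProcessor` message, the `"prepare"` trigger and the inline processors are invented for illustration, and the processor runs synchronously here instead of in a polled background task as described for Rally.

```python
# Hypothetical sketch, not Rally's actual code: only the class/message names
# TrackPreparationActor, TaskExecutionActor and WorkComplete follow the comment above.
from thespian.actors import Actor, ActorSystem


class WorkComplete:
    """Sent back by TaskExecutionActor once a processor has finished."""


class ExecuteProcessor:
    """Carries one track processor (here just a callable) to the executor."""
    def __init__(self, processor):
        self.processor = processor


class TaskExecutionActor(Actor):
    def receiveMessage(self, msg, sender):
        if isinstance(msg, ExecuteProcessor):
            # Rally runs the processor in a background task and polls its status;
            # this sketch runs it synchronously for brevity.
            msg.processor()
            self.send(sender, WorkComplete())


class TrackPreparationActor(Actor):
    def receiveMessage(self, msg, sender):
        if msg == "prepare":
            # Stand-ins for the processors obtained from TrackProcessorRegistry.
            self.pending = [lambda: print("default processor"),
                            lambda: print("corpus preparation")]
            self.executor = self.createActor(TaskExecutionActor)
            self.send(self.executor, ExecuteProcessor(self.pending.pop(0)))
        elif isinstance(msg, WorkComplete) and self.pending:
            # Feed the next queued processor to the executor.
            self.send(self.executor, ExecuteProcessor(self.pending.pop(0)))


if __name__ == "__main__":
    system = ActorSystem()  # default single-process actor system base
    try:
        system.tell(system.createActor(TrackPreparationActor), "prepare")
    finally:
        system.shutdown()
```

In the real system, one such executor would exist per configured core on each load driver host, which is where the scaling remark in the last bullet comes from.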
docs/architecture/actor_system.md (Outdated)

> At its heart, Rally is a distributed system. It has been designed that way to
> allow using multiple load drivers in the same benchmark, to ensure that Rally
> is never a bottleneck. In the vast majority of cases, using a powerful load
Another reason (the first one, IIRC) is that distributing Rally also provides an easy way to handle multiple target hosts and ensure that they all build/install/start the required ES version via the race subcommand (this is, for example, the mode used on the Hetzner nightly benchmarks).
Later on we got dedicated esrally build/install/start/stop subcommands (which we use e.g. with cloud-based benchmarks), and these are decoupled from the actor system.
The current formulation isn't correct then; however, my understanding is that this is achievable without a full-fledged actor system, while we could easily end up re-implementing Thespian badly to support multiple load drivers.
> however my understanding is that this is achievable without a full-fledged actor system, while we could easily end up re-implementing Thespian badly to support multiple load drivers.
I mostly agree. I don't think supporting more than one load driver without the actor system would necessarily result in re-implementing Thespian, but as always, the devil is in the details.
I fixed this.
dliappis left a comment:
This LGTM -- so great to see these internal aspects of Rally documented.
One thing I wanted to ask is how come this isn't living under the official docs, e.g. in the Reference Documentation or Additional Information sections? (Is it because Mermaid isn't available?)
It should be possible to use Mermaid with Sphinx, but it would require an additional package, see https://github.com/mgaitan/sphinxcontrib-mermaid. That's a work in progress. If/when this matures, we can include this in the official documentation.
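For reference, a minimal sketch of what enabling it could look like, assuming the sphinxcontrib-mermaid package is installed (e.g. `pip install sphinxcontrib-mermaid`); diagrams would then be written with its `.. mermaid::` directive in the reST sources. The snippet below is illustrative, not Rally's actual docs/conf.py:

```python
# docs/conf.py -- hypothetical snippet; the real conf.py lists other extensions too
extensions = [
    # ...existing Sphinx extensions...
    "sphinxcontrib.mermaid",
]
```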
You can see this live at https://github.com/pquentin/rally/blob/document-actor-system/docs/architecture/actor_system.md.