Skip to content

feat(docs): Multiple datasets#2228

Open
valekjo wants to merge 14 commits intomasterfrom
feature/multiple-datasets
Open

feat(docs): Multiple datasets#2228
valekjo wants to merge 14 commits intomasterfrom
feature/multiple-datasets

Conversation

@valekjo
Copy link
Member

@valekjo valekjo commented Feb 5, 2026

The feature is now supported in both SDKs and API docs is out - we can finally publish the respective section in docs :)

@github-actions github-actions bot added this to the 133rd sprint - Console team milestone Feb 5, 2026
@github-actions github-actions bot added the t-console Issues with this label are in the ownership of the console team. label Feb 5, 2026
@apify-service-account
Copy link

Preview for this PR was built for commit 93a1efe and is ready at https://pr-2228.preview.docs.apify.com!

@apify-service-account
Copy link

Preview for this PR was built for commit 29d954f and is ready at https://pr-2228.preview.docs.apify.com!

@apify-service-account
Copy link

Preview for this PR was built for commit 9bd61b87 and is ready at https://pr-2228.preview.docs.apify.com!

@apify-service-account
Copy link

Preview for this PR was built for commit dbf37acf and is ready at https://pr-2228.preview.docs.apify.com!

@apify-service-account
Copy link

Preview for this PR was built for commit da0b5f0f and is ready at https://pr-2228.preview.docs.apify.com!

@apify-service-account
Copy link

Preview for this PR was built for commit 97effaca and is ready at https://pr-2228.preview.docs.apify.com!

@apify-service-account
Copy link

Preview for this PR was built for commit 652317c4 and is ready at https://pr-2228.preview.docs.apify.com!

@valekjo valekjo force-pushed the feature/multiple-datasets branch from 652317c to 97effac Compare March 5, 2026 08:58
@apify-service-account
Copy link

Preview for this PR was built for commit 97effaca and is ready at https://pr-2228.preview.docs.apify.com!

@danpoletaev danpoletaev force-pushed the feature/multiple-datasets branch from 97effac to bf23549 Compare March 6, 2026 21:56
@apify-service-account
Copy link

Preview for this PR was built for commit bf23549c and is ready at https://pr-2228.preview.docs.apify.com!

@metalwarrior665 metalwarrior665 force-pushed the feature/multiple-datasets branch from bf23549 to 97effac Compare March 7, 2026 00:02
@apify-service-account
Copy link

Preview for this PR was built for commit 97effaca and is ready at https://pr-2228.preview.docs.apify.com!

@apify-service-account
Copy link

Preview for this PR was built for commit c0dd0c4b and is ready at https://pr-2228.preview.docs.apify.com!

@valekjo valekjo requested a review from janbuchar March 12, 2026 08:49
@valekjo valekjo requested a review from Pijukatel March 12, 2026 08:49
@valekjo valekjo marked this pull request as ready for review March 12, 2026 08:50
@valekjo valekjo requested a review from TC-MO as a code owner March 12, 2026 08:50
@valekjo valekjo requested a review from jgagne March 12, 2026 08:51
Copy link
Contributor

@TC-MO TC-MO left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Focused on grammar and file structure now, once those are dealt with I'll do another pass for other issues

When the Javascript SDK is used outside of Apify Platform, aliases fallback to names - using alias would be the same as using named dataset.
There is one difference - when alias is used, the dataset is purged on first access (if the default dataset should be purged).

The Python SDK behaves slightly differently, it uses the [aliasing mechanism](https://crawlee.dev/python/docs/guides/storages#named-and-unnamed-storages) specific to Python SDK. Aliases are created as unnamed, but also purged on Actor start.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Grammar fix & also we are talking about Apify Python SDK specifically yet linking to Crawlee?

Suggested change
The Python SDK behaves slightly differently, it uses the [aliasing mechanism](https://crawlee.dev/python/docs/guides/storages#named-and-unnamed-storages) specific to Python SDK. Aliases are created as unnamed, but also purged on Actor start.
The Python SDK behaves differently, it uses the [aliasing mechanism](https://crawlee.dev/python/docs/guides/storages#named-and-unnamed-storages) specific to Python SDK. Aliases are created as unnamed, but also purged on Actor start.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

True, changing to "uses the aliasing mechanism specific to Crawlee for Python"


### Run Storages tab

The Storage tab of Actor run view is displaying all the dataset defined by Actor and datasets that were used by the run (up to some limit).
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some limit is pretty vague, can we offer anything more concrete here?

Suggested change
The Storage tab of Actor run view is displaying all the dataset defined by Actor and datasets that were used by the run (up to some limit).
The Storage tab of Actor run view displays all datasets defined by the Actor and datasets that were used by the run (up to some limit).

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Changing to "up to 10 runs"

@apify-service-account
Copy link

Preview for this PR was built for commit 1f7afe07 and is ready at https://pr-2228.preview.docs.apify.com!

@valekjo
Copy link
Member Author

valekjo commented Mar 12, 2026

@TC-MO Thanks! I should have caught the "the"s and JavaScripts etc.

@valekjo valekjo requested a review from TC-MO March 12, 2026 14:20
@apify-service-account
Copy link

Preview for this PR was built for commit 1b23702d and is ready at https://pr-2228.preview.docs.apify.com!

@TC-MO
Copy link
Contributor

TC-MO commented Mar 12, 2026

@valekjo no worries! A lot of those can be now caught using our review skill for Claude-code (we might enhance them to be agent agnostic, but for now it's Claude-centric)

@TC-MO
Copy link
Contributor

TC-MO commented Mar 12, 2026

I'll make another pass that focus more on the doc itself later today instead of low-hanging fruits

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

t-console Issues with this label are in the ownership of the console team.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants