Add cohort data exporter from PlantsModel by davidorme · Pull Request #912 · ImperialCollegeLondon/virtual_ecosystem

davidorme · 2025-06-21T12:03:07Z

Description

This PR introduces the models.plants.exporter module, which provides the CommunityDataExporter tool for plant community data. This data could be worked into the data object and exported by the standard mechanism, but it is not used by any other model and consists of sets of highly ragged arrays of community data per cell, which are much better suited to a data-frame-style export than to the DataArray format.

There are three kinds of data:

cohort data - one line per cohort per cell
community canopy data - one line per canopy layer per cell
stem canopy data - one line per canopy layer per cohort per cell

Each of those different classes has a bunch of attributes that are stored within the communities, canopies and stem_allocation attributes within the Plants Model.
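As a rough illustration of why ragged per-cell data suits a long, data-frame-style layout, here is a minimal sketch that flattens per-cell cohort arrays into one row per cohort per cell. The `communities` dictionary, the attribute names (`pft`, `dbh`, `n_individuals`) and the `cohort_rows` helper are all hypothetical and are not taken from the PlantsModel code.

```python
import csv
import io

# Hypothetical stand-in for per-cell community data: each cell holds a
# different number of cohorts, so the arrays are ragged across cells.
communities = {
    0: {"pft": ["tree", "shrub"], "dbh": [0.25, 0.05], "n_individuals": [10, 40]},
    1: {"pft": ["tree"], "dbh": [0.31], "n_individuals": [7]},
}


def cohort_rows(communities: dict) -> list:
    """Flatten ragged per-cell cohort arrays into one row per cohort per cell."""
    rows = []
    for cell_id, cohorts in communities.items():
        n_cohorts = len(cohorts["pft"])
        for cohort_index in range(n_cohorts):
            row = {"cell_id": cell_id, "cohort_index": cohort_index}
            for name, values in cohorts.items():
                row[name] = values[cohort_index]
            rows.append(row)
    return rows


rows = cohort_rows(communities)

# Write the long-format rows as CSV text to an in-memory buffer.
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=list(rows[0]))
writer.writeheader()
writer.writerows(rows)
print(buffer.getvalue())
```

The same pattern would extend to the canopy data by adding a layer index alongside the cell and cohort indices.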

So, this PR:

Adds a new section of configuration to the plants model schema to specify output paths for each kind of data and an attribute list to subset the data written out.

[plants.community_data_export]
required_data = ['canopy', ... ]
cohort_attributes = []
community_canopy_attributes = []
stem_canopy_attributes = []

Adds the new exporter class CommunityDataExporter with:
- an __init__ method that takes the settings above directly
- a from_config factory method that creates an instance from the settings loaded in a Config object.
- a dump method that can be called to compile and output the three data types to CSV files. It switches from write to append mode after first use to avoid duplicating column headers.
- some private validation methods for paths and attribute subsets.

Updates PlantsModel to require a CommunityDataExporter object at __init__ and then to call the dump() method at the end of model setup and in each update.

Adds a bunch of tests in tests.models.plants.test_explorer.py
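The write-then-append behaviour of the dump method can be sketched roughly as follows. This is not the code from exporter.py: the `AppendingCsvDumper` class, its arguments, and the convention that an empty attribute list means "export everything" are assumptions made for illustration.

```python
import csv
import tempfile
from pathlib import Path
from typing import Optional


class AppendingCsvDumper:
    """Hypothetical, simplified stand-in for the exporter's dump behaviour."""

    def __init__(self, path: Path, attributes: Optional[list] = None) -> None:
        self.path = path
        # Assumption of this sketch: an empty subset means "export everything".
        self.attributes = attributes or []
        # First call writes a fresh file with a header row.
        self._write_mode = "w"

    def dump(self, rows: list) -> None:
        if not rows:
            return
        fieldnames = self.attributes or list(rows[0])
        with open(self.path, self._write_mode, newline="") as handle:
            # extrasaction="ignore" drops attributes outside the subset.
            writer = csv.DictWriter(
                handle, fieldnames=fieldnames, extrasaction="ignore"
            )
            if self._write_mode == "w":
                writer.writeheader()
            writer.writerows(rows)
        # Later calls append, so the header is not duplicated.
        self._write_mode = "a"


with tempfile.TemporaryDirectory() as tmp:
    out_path = Path(tmp) / "cohorts.csv"
    dumper = AppendingCsvDumper(out_path, attributes=["cell_id", "dbh"])
    dumper.dump([{"cell_id": 0, "dbh": 0.25, "pft": "tree"}])  # e.g. after setup
    dumper.dump([{"cell_id": 0, "dbh": 0.27, "pft": "tree"}])  # e.g. after an update
    lines = out_path.read_text().splitlines()

print(lines)
```

Keeping the mode as instance state means the caller can invoke dump at setup and at every update without tracking whether the file already exists.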

Fixes #911 (issue)

Type of change

New feature (non-breaking change which adds functionality)
Optimization (back-end change that speeds up the code)
Bug fix (non-breaking change which fixes an issue)

Key checklist

Make sure you've run the pre-commit checks: $ pre-commit run -a
All tests pass: $ poetry run pytest

Further checks

Code is commented, particularly in hard-to-understand areas
Tests added that prove fix is effective or that feature works
Relevant documentation reviewed and updated


codecov-commenter · 2025-06-21T12:08:41Z

Codecov Report

❌ Patch coverage is 98.07692% with 3 lines in your changes missing coverage. Please review.
✅ Project coverage is 94.00%. Comparing base (31cb58d) to head (38e7977).
⚠️ Report is 1738 commits behind head on develop.

Files with missing lines	Patch %	Lines
virtual_ecosystem/models/plants/exporter.py	97.93%	3 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop     #912      +/-   ##
===========================================
+ Coverage    93.89%   94.00%   +0.10%     
===========================================
  Files           77       78       +1     
  Lines         5912     6067     +155     
===========================================
+ Hits          5551     5703     +152     
- Misses         361      364       +3


dalonsoa

Conceptually, I think this approach makes sense, leaving it up to the model to export some of the data relevant to it. My only concern is whether the data it exports might ever be updated by a model run after plants within the same timestep. I wonder if it would be better to have an export step that is run for every model after all the updates are done.

davidorme · 2025-06-24T09:00:08Z

My only concern is whether the data it exports might ever be updated by a model run after plants within the same timestep. I wonder if it would be better to have an export step that is run for every model after all the updates are done.

After all the updates are done within a time step. That's an interesting idea that I think might be useful. In this specific case, we intend to export the internal cohort structure and tree allometry, which are only under the control of the plants model and indeed aren't exposed to the other models through data.

But... I think we could revisit this same structure and make it part of the BaseModel to create model exporter methods for dumping internal model state. I'm just not keen to do that now 😄
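To make the suggested ordering concrete, here is a minimal sketch of a time loop in which every model updates first and only then exports. The `SketchModel` class, its `export` method and the `run` function are hypothetical and do not reflect the actual BaseModel API; the model names are purely illustrative.

```python
class SketchModel:
    """Hypothetical stand-in for a model with separate update and export steps."""

    def __init__(self, name: str, log: list) -> None:
        self.name = name
        self.log = log

    def update(self, time_index: int) -> None:
        self.log.append((time_index, self.name, "update"))

    def export(self, time_index: int) -> None:
        self.log.append((time_index, self.name, "export"))


def run(models: list, n_steps: int) -> None:
    """Run all updates for a time step before any model exports its state."""
    for time_index in range(n_steps):
        for model in models:
            model.update(time_index)
        # Exports only start once every model has finished updating, so no
        # later model can change state that has already been written out.
        for model in models:
            model.export(time_index)


log: list = []
run([SketchModel("plants", log), SketchModel("soil", log)], n_steps=1)
print(log)
```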


dalonsoa

I have a couple of comments, but it is broadly looking good.

virtual_ecosystem/models/plants/exporter.py

dalonsoa · 2025-07-02T04:45:09Z

virtual_ecosystem/models/plants/plants_model.py

+            time=self.model_timing.start_time
+            + time_index * self.model_timing.update_interval,


It might come in handy to add a simple method in the base model that returns the current time:

    def current_time(self, time_index: int) -> int:  # was this in epoch?
        return self.model_timing.start_time + time_index * self.model_timing.update_interval

We need to revisit the timing in more detail but this is a good idea.
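A self-contained sketch of that helper is shown below. It assumes, purely for illustration, that the start time is a `datetime` and the update interval a `timedelta`; the review comment itself is unsure of the actual types, and `SketchTiming` and `SketchBaseModel` are hypothetical names.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class SketchTiming:
    """Hypothetical timing container; the real types may differ."""

    start_time: datetime
    update_interval: timedelta


class SketchBaseModel:
    """Hypothetical base model exposing the suggested helper."""

    def __init__(self, model_timing: SketchTiming) -> None:
        self.model_timing = model_timing

    def current_time(self, time_index: int) -> datetime:
        # Start time plus the elapsed number of whole update intervals.
        return (
            self.model_timing.start_time
            + time_index * self.model_timing.update_interval
        )


model = SketchBaseModel(SketchTiming(datetime(2025, 1, 1), timedelta(days=30)))
times = [model.current_time(index) for index in range(3)]
print(times)
```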


davidorme · 2025-07-02T19:06:51Z

@sallymatson I've finished updating from @dalonsoa's review now. Diego - your suggestions have simplified the code and testing quite a bit. Nothing hugely changed but just more polished, I think.

dalonsoa

Much better :)


arne-exe · 2025-07-30T08:50:34Z

Hi @davidorme, I just had a look at this, the exporter looks very neat and will be super useful for looking at the outputs!

davidorme added 5 commits June 20, 2025 19:30

First sketch of exporter structure

514cf9d

Add exporter to PlantsModel __init__ and integrate into model - no function yet

74af04f

Config redesign: TOML does not like NULL

66c9573

Update exporter to new config args

afcbf03

Tweaking exporter __init__ and from_config methods

24c0427

davidorme linked an issue Jun 21, 2025 that may be closed by this pull request

Add cohort data exporter from PlantsModel #911

Closed

Add new module to API docs

16cedd1

davidorme requested review from dalonsoa and sallymatson June 21, 2025 19:02

dalonsoa reviewed Jun 24, 2025


davidorme added 18 commits June 27, 2025 11:59

Merge branch '918-update-pyrealm-version' into 911-add-cohort-data-exporter-from-plantsmodel

a2a5271

Building out exporter path validation and testing

8b716c2

Export cohort data with standalone tests and from PlantsModel

b5777a1

Capture the stem allocation data as a model attribute and export it, logging

490144c

Apparently did not add the new test unit file - so... new tests

fce4238

Updated to output community level canopy layer data

27efc99

Sharing test code for CSV file checks

f3ddea0

Updating exporter to three output files - part 1

75a4010

Updating tests to new 3 path export

6756a8b

Exporting stem canopy data implemented, test outputs

5b067c0

Updating attribute subset code and test

f15d65d

Attribute subset testing complete

2ae0ac4

Better typing of missing paths in exporter __init__

0c41708

Docstrings

cbcc603

Adding extensive testing of settings; bug fixes

99ecad3

Doc tweaks

e594f47

Bloody useless intersphinx

4bed267

Windows test bug fix attempt

0d03f14

davidorme added 2 commits July 1, 2025 16:46

Skip test on Windows

0ad4661

Skip test on Windows

246f18a

davidorme marked this pull request as ready for review July 1, 2025 17:55

davidorme requested a review from dalonsoa July 1, 2025 17:55

dalonsoa requested changes Jul 2, 2025


davidorme and others added 16 commits July 2, 2025 09:35

Update virtual_ecosystem/models/plants/exporter.py

30d39f9

Co-authored-by: Diego Alonso Álvarez <6095790+dalonsoa@users.noreply.github.com>

[pre-commit.ci] auto fixes from pre-commit.com hooks

fedd2ed

for more information, see https://pre-commit.ci

Refactor following @dalonsoa review

5c8ef63

Tidying - docstrings and minor attributes before retest and snagging

d3e9ca3

Snagging and testing check_and_set_paths

ddb9606

Snagging and testing _check_attribute_subsets

5ba0fce

Testing new _dump_cohort_data

b3be338

Similar testing for canopy data dumper methods

dde518e

Fixing simplified direct and model dump tests

725dfbd

Fixing JSONSchema and from_config

e7b538a

Minor attribute checks

f5f32c5

Output directory test and more config testing

c8193c5

Merge branch '911-add-cohort-data-exporter-from-plantsmodel' of https://github.com/ImperialCollegeLondon/virtual_rainforest into 911-add-cohort-data-exporter-from-plantsmodel

7a1cfd5

CI bugs

b2dfab3

Bloody windows paths and escape characters

0dd73de

Attempting TOML literal strings

0007d85

davidorme requested a review from dalonsoa July 2, 2025 19:05

dalonsoa approved these changes Jul 3, 2025


Merge branch 'develop' into 911-add-cohort-data-exporter-from-plantsmodel

38e7977

davidorme merged commit 189340b into develop Jul 3, 2025
29 of 30 checks passed

davidorme deleted the 911-add-cohort-data-exporter-from-plantsmodel branch July 3, 2025 14:11

davidorme restored the 911-add-cohort-data-exporter-from-plantsmodel branch July 3, 2025 14:14

davidorme mentioned this pull request Jul 3, 2025

Missed push from #911 #929

Merged

8 tasks

davidorme merged 46 commits into develop from 911-add-cohort-data-exporter-from-plantsmodel
