Skip to content

Conversation

@niekdejonge
Copy link
Collaborator

Added a max inchikey sampling as setting

To make it a bit more consistent I changed average_pairs_per_bin to average_inchikey_sampling_count

@sonarqubecloud
Copy link

sonarqubecloud bot commented Nov 7, 2024

self.val_spectra_per_inchikey = 1
self.random_seed: Optional[int] = None
self.max_pair_resampling = 1
self.max_pair_resampling = 10000
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why this high number?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@florian-huber Basically meant as infinite. As in there is no max. I noticed that the amount of resampling was never a lot. And any maximum imposed, could have quite bad effect on the balance of pairs selection. I will share the analysis that came to this conclusion later today.

@florian-huber florian-huber merged commit 036adc7 into main Nov 8, 2024
@niekdejonge niekdejonge deleted the update_sampling_algorithm branch November 11, 2024 09:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants