fix bug that mistakenly checks for positive ndts on missing data by AlexanderFengler · Pull Request #782 · lnccbrown/HSSM

AlexanderFengler · 2025-08-17T20:10:44Z

This fixes an issue where we mistakenly check for ndt - rt > 0 for missing data, which wrongly set the log likelihood to the lower bound in those cases.

cpaniaguam

Just a few observations to consider for readability and maintainability:

Possibly refactor some conditionals and logic for clarity
Could be worth adjusting logging to defer string formatting

cpaniaguam · 2025-08-19T13:58:12Z

src/hssm/distribution_utils/dist.py

+    if params_is_reg is not None:
+        params_only = True if (not any(params_is_reg)) else False
+    else:
+        params_only = False


Suggested change

if params_is_reg is not None:

params_only = True if (not any(params_is_reg)) else False

else:

params_only = False

params_only = params_is_reg is not None and not any(params_is_reg)

cpaniaguam · 2025-08-19T14:11:24Z

src/hssm/data_validator.py

        if missing_data and not deadline:
            network = MissingDataNetwork.CPN
-        elif not missing_data and deadline:
+        elif missing_data and deadline:


There is a change in logic here. Is it intentional?

cpaniaguam · 2025-08-19T14:18:15Z

src/hssm/hssm.py

            if self.model_config.backend != "pytensor":
                missing_data_callable = make_missing_data_callable(
-                    self.loglik_missing_data, "jax", params_is_reg, params_only
+                    self.loglik_missing_data, "jax", params_is_reg, None
                )
            else:
                missing_data_callable = make_missing_data_callable(
                    self.loglik_missing_data,
                    self.model_config.backend,
+                    params_is_reg,
                    None,


Why not determine the backend first and keep a single call to make_missing_data_callable? It could look like this:

backend = "jax" if self.model_config.backend != "pytensor" else self.model_config.backend missing_data_callable = make_missing_data_callable( self.loglik_missing_data, backend, params_is_reg, None, )

cpaniaguam · 2025-08-19T14:22:17Z

src/hssm/hssm.py

+            _logger.info(
+                f"Re-arranging data to put split missing and observed datapoints. "
+                f"Missing data (rt == {self.missing_data_value}) will be on top, "
+                f"observed datapoints follow."
+            )


Suggested change

_logger.info(

f"Re-arranging data to put split missing and observed datapoints. "

f"Missing data (rt == {self.missing_data_value}) will be on top, "

f"observed datapoints follow."

)

_logger.info(

"Re-arranging data to put split missing and observed datapoints. "

"Missing data (rt == %s) will be on top, observed datapoints follow.",

self.missing_data_value

)

codecov · 2025-08-24T15:42:12Z

Codecov Report

❌ Patch coverage is 97.50000% with 7 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
src/hssm/hssm.py	71.42%	6 Missing ⚠️
src/hssm/distribution_utils/dist.py	87.50%	1 Missing ⚠️

Files with missing lines	Coverage Δ
src/hssm/data_validator.py	`96.29% <100.00%> (-1.33%)`	⬇️
src/hssm/distribution_utils/onnx.py	`94.82% <ø> (-1.61%)`	⬇️
src/hssm/param/regression_param.py	`94.57% <100.00%> (+0.08%)`	⬆️
src/hssm/plotting/model_cartoon.py	`91.45% <ø> (ø)`
tests/slow/test_mcmc.py	`100.00% <100.00%> (ø)`
tests/slow/test_missing_data_and_deadline_mcmc.py	`100.00% <100.00%> (ø)`
tests/slow/test_missing_data_and_deadline_vi.py	`100.00% <100.00%> (ø)`
tests/slow/test_missing_data_mcmc.py	`100.00% <100.00%> (ø)`
tests/slow/test_missing_data_vi.py	`100.00% <100.00%> (ø)`
tests/slow/test_vi.py	`100.00% <ø> (ø)`
... and 9 more

... and 3 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

digicosmos86

LGTM! I have forgotten what conversations we had about go-nogo and how to branch different use cases, but as far as this PR goes, this looks good

src/hssm/param/regression_param.py

digicosmos86 · 2025-09-02T14:43:31Z

tests/slow/test_mcmc.py


 @pytest.mark.slow
-@pytest.mark.parametrize(parameter_names, parameter_grid)
+@pytest.mark.parametrize(PARAMETER_NAMES, PARAMETER_GRID)


This is more a stylistic choice, but why we are using all caps for variables that are not constants?

hm from my perspective these are global constants, defined up top.
weird case.

digicosmos86 · 2025-09-02T14:44:35Z

tests/test_data_sanity.py

-        match="You have no missing data in your dataset, "
-        + "which is not allowed when `missing_data` or `deadline` is set to "
-        + "True.",
+        match=r"Missing data is provided as True, "


Maybe use missing_data to signify that it is an argument?

digicosmos86 · 2025-09-02T14:45:04Z

tests/test_data_sanity.py

-    )
+    with pytest.raises(
+        ValueError,
+        match="Missing data provided as False. \n"


Same with this

fix bug that mistakenly checks for positive ndts on missing data

7fbf9c6

AlexanderFengler requested review from cpaniaguam and digicosmos86 August 17, 2025 20:10

AlexanderFengler added 4 commits August 17, 2025 22:11

drop some unnecessary comments

9c7e70f

work on test failures

be74cc1

fix tests

c8961ee

fix rlddm tests, curveball

d011606

cpaniaguam requested changes Aug 19, 2025

View reviewed changes

AlexanderFengler added 8 commits August 19, 2025 17:41

drop some cpn tests that need to get reactivated in a follow up PR

01375db

mark failing tests as xfail

61926bf

fix tests

b696b82

typo in logger string

8877673

fix more tests

2633304

more test fixing

05018b8

tests

3d7f224

fix tests

e4b82be

digicosmos86 approved these changes Sep 2, 2025

View reviewed changes

AlexanderFengler added 7 commits September 23, 2025 11:11

reorganize tests and reactivate cpns

cb3bbff

merge main

824fe4d

small adjustment to rldm tests

78ef256

drop bin_dim arg from simulator calls...deprecated on ssms side

c6942a6

drop a bunch of print statements

67fad5a

tiny improvements to address comments

b62ea97

fix mypy issue

1931067

AlexanderFengler merged commit a72abb6 into main Sep 26, 2025
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix bug that mistakenly checks for positive ndts on missing data#782

fix bug that mistakenly checks for positive ndts on missing data#782
AlexanderFengler merged 20 commits intomainfrom
fix-opns

AlexanderFengler commented Aug 17, 2025

Uh oh!

cpaniaguam left a comment

Uh oh!

cpaniaguam Aug 19, 2025

Uh oh!

cpaniaguam Aug 19, 2025

Uh oh!

cpaniaguam Aug 19, 2025

Uh oh!

cpaniaguam Aug 19, 2025

Uh oh!

codecov bot commented Aug 24, 2025 •

edited

Loading

Uh oh!

digicosmos86 left a comment

Uh oh!

Uh oh!

digicosmos86 Sep 2, 2025

Uh oh!

AlexanderFengler Sep 26, 2025

Uh oh!

digicosmos86 Sep 2, 2025

Uh oh!

digicosmos86 Sep 2, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

AlexanderFengler commented Aug 17, 2025

Uh oh!

cpaniaguam left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

codecov bot commented Aug 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

digicosmos86 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov bot commented Aug 24, 2025 •

edited

Loading