Support for serialising detectors with keops backends#681
ascillitoe merged 9 commits into SeldonIO:master from ascillitoe:feature/keops_saving_2
Conversation
Codecov Report
Additional details and impacted files

```
@@            Coverage Diff             @@
##           master     #681      +/-   ##
==========================================
+ Coverage   79.76%   80.16%   +0.39%
==========================================
  Files         131      133       +2
  Lines        9133     9176      +43
==========================================
+ Hits         7285     7356      +71
+ Misses       1848     1820      -28
```
Flags with carried forward coverage won't be shown. Click here to find out more.
jklaise
left a comment
A few minor comments but otherwise LGTM!
alibi_detect/utils/keops/kernels.py
Outdated
```python
cfg = self.config.copy()
if isinstance(cfg['sigma'], torch.Tensor):
    cfg['sigma'] = cfg['sigma'].detach().cpu().numpy().tolist()
```
Are we certain we don't need a deep copy here (it depends on the dict values)? It wouldn't be ideal if self.config turned out not to be idempotent (i.e. different depending on whether get_config has been called or not).
I guess the same question applies to every component for which we have self.config and which may have mutable values - do we need to do an audit / have tests?
Upon review of this I think you're right. We seem to get away with it in our tests (and in the informal tests I did previously) because when we update items in config we typically redefine the entire item rather than mutating it (e.g. we change sigma from None to a Tensor, rather than mutating an element of sigma). I think this means the shallow copy is OK since the reference is broken, e.g.:

```python
orig_a = [1, 2, 3]
dict1 = {'a': deepcopy(orig_a), 'b': 'cat'}
dict2 = dict1.copy()
dict1['a'] = [1, 1, 1]
dict2['a'] == orig_a
>>> True
```

However, as you say, if we were to mutate an item in config, we would have a problem:

```python
orig_a = [1, 2, 3]
dict1 = {'a': deepcopy(orig_a), 'b': 'cat'}
dict2 = dict1.copy()
dict1['a'][1] = 1
dict2['a'] == orig_a
>>> False
```

I might be missing something, but it seems we should add deepcopy to all of these. I can update these ones in this PR and then do a follow-up for the rest of them?
Yea I think better safe than sorry.
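For reference, the deepcopy fix discussed above survives the in-place mutation case as well. A pure-stdlib sketch (same toy dict as the examples above):

```python
from copy import deepcopy

orig_a = [1, 2, 3]
dict1 = {'a': deepcopy(orig_a), 'b': 'cat'}
dict2 = deepcopy(dict1)   # deep copy: the nested list is duplicated too
dict1['a'][1] = 1         # in-place mutation of dict1's list
print(dict2['a'] == orig_a)  # True: dict2 is unaffected
```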
alibi_detect/utils/keops/kernels.py
Outdated
```python
kernel_a: Union[nn.Module, str] = 'rbf',
kernel_b: Optional[Union[nn.Module, str]] = 'rbf',
```
I would say to use Literals but I know you'll point me to the other open issue... :)
Ha ha, I hope I'm not developing a reputation for deflecting to other issues!
I can see why Literal might be a good idea here, since we really do only accept 'rbf' as a str. Since we'd want to do this for the pytorch and tensorflow kernels too, it might be best for me to follow up straight after this PR?
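To illustrate the suggestion, a framework-free sketch of what the Literal-typed signature could look like (the `Module` stand-in and `deep_kernel_init` name are illustrative, not the actual kernels.py code):

```python
from typing import Literal, Optional, Union


class Module:
    """Stand-in for torch.nn.Module, to keep the sketch framework-free."""


def deep_kernel_init(
    kernel_a: Union[Module, Literal['rbf']] = 'rbf',
    kernel_b: Optional[Union[Module, Literal['rbf']]] = 'rbf',
):
    # With Literal['rbf'] instead of str, a static checker such as mypy
    # flags e.g. deep_kernel_init(kernel_a='gaussian') at type-check time.
    return kernel_a, kernel_b
```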
alibi_detect/utils/keops/kernels.py
Outdated
```python
        return similarity

    def get_config(self) -> dict:
        return self.config.copy()
```
Bear with me on this one! We have flavour in ModelConfig etc (see saving.schemas) because we use it to decide which load_model_... function to use (i.e. _tf, _pt or _sk). I've kept flavour decoupled from the detector backend here so that we have the option of using a preprocessing flavour different from the model backend (e.g. a scikit-learn preprocessor with a tensorflow MMD detector). Plus, we might have a model even when there is no backend (e.g. any preprocessing model with a KS detector).
The situation is a little different for kernels. These are only used for detectors that have backends, and it doesn't make sense to decouple their flavour from backend. Ideally we would not have flavour at all for kernels (we actually decide which load_kernel_... function to use based on the detector backend). The only reason we have flavour is so that the coerce_2_tensor pydantic validator knows what type of Tensor to coerce sigma to (this happens before we get to load_kernel_...).
We don't need to do any coercion for DeepKernel, hence no flavour in DeepKernelConfig. I could add it just for consistency? But it wouldn't actually be used at load time.
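To make the role of flavour concrete: the same raw sigma value from a config file must be coerced to a different tensor type per flavour. A hypothetical, framework-free sketch of that dispatch (the names TENSOR_CTOR and coerce_sigma are illustrative; the real coerce_2_tensor validator in alibi_detect.saving.schemas differs in detail):

```python
# Map each backend flavour to a tensor constructor. Tuples stand in for
# the real constructors (tf.convert_to_tensor / torch.as_tensor) so the
# sketch has no framework dependencies.
TENSOR_CTOR = {
    'tensorflow': lambda v: ('tf.Tensor', v),
    'pytorch':    lambda v: ('torch.Tensor', v),
    'keops':      lambda v: ('torch.Tensor', v),  # keops kernels are torch-based
}


def coerce_sigma(sigma, flavour):
    """Coerce a raw config value to the flavour's tensor type; None passes through."""
    if sigma is None:
        return None
    return TENSOR_CTOR[flavour](sigma)
```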
alibi_detect/saving/tests/models.py
Outdated
```python
from alibi_detect.cd.pytorch import HiddenOutput as HiddenOutput_pt
from alibi_detect.cd.tensorflow import HiddenOutput as HiddenOutput_tf
from alibi_detect.utils.frameworks import has_keops
if has_keops:
```
Is this conditional purely because we skip keops on some platforms in CI? Should we add a comment for this? Same question for the other test module.
Yep precisely. I'll add notes to both modules now!
Edit: Done in 5bb7314
@jklaise would you be able to look into the responses to your comments when you have a chance, please?
Following on from #656, this PR adds support for serialising detectors with backend='keops'. Since keops detectors are still pytorch-based, this is a relatively simple addition. The main changes involve removing `NotImplementedError`s and adding support for saving keops kernels.
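A minimal sketch of the workflow this enables (assumes torch and pykeops are installed; the detector choice, data, and path here are illustrative):

```python
import numpy as np
from alibi_detect.cd import MMDDrift
from alibi_detect.saving import save_detector, load_detector

# Illustrative reference data for an MMD drift detector
x_ref = np.random.randn(100, 10).astype(np.float32)
cd = MMDDrift(x_ref, backend='keops')

save_detector(cd, './keops_mmd')   # previously raised NotImplementedError for keops
cd = load_detector('./keops_mmd')  # round-trip the detector from the saved config
```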