Fix legacy loading of detectors with tf models (v0.10.5 patch)#729
ascillitoe merged 6 commits into SeldonIO:patch/v0.10.5 from ascillitoe:fix/tf_model_legacy_load
Conversation
```python
def load_model(filepath: Union[str, os.PathLike],
               load_dir: str = 'model',
               filename: str = 'model',
```
`load_dir` is removed, and any subdirectory (i.e. `model/`) is just added to `filepath` before passing to `load_model`.
`filename` is added instead, since the legacy saving used to save models with various filenames such as `encoder.h5` or `model.h5`.
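As a sketch of what this means in practice (the helper below is illustrative, not the actual alibi-detect code): the caller appends the subdirectory to `filepath` up front, and `filename` only selects which `.h5` file inside that directory to load.

```python
import os
from pathlib import Path
from typing import Union


def resolve_model_path(filepath: Union[str, os.PathLike], filename: str = 'model') -> Path:
    # Hypothetical helper: `filepath` already includes any subdirectory
    # (e.g. `model/`); only the filename varies between legacy artifacts.
    return Path(filepath) / f'{filename}.h5'


# A legacy VAE outlier detector keeps its encoder at model/encoder.h5:
path = resolve_model_path(Path('my_detector') / 'model', filename='encoder')
```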
A basic test script to test loading of artifacts in https://console.cloud.google.com/storage/browser/seldon-models is included below. We might want to have a separate conversation about how best to generate new artifacts, and test new (and old) ones.

```python
from alibi_detect.saving import load_detector
from alibi_detect.utils.fetching.fetching import fetch_detector, fetch_state_dict, fetch_tf_model
from alibi_detect.utils.url import _join_url
import tensorflow as tf
import pytest
import itertools
from cloudpathlib import CloudPath

TESTS = [
    # Outlier detectors
    {'detector_type': 'outlier', 'detector_name': 'IForest', 'dataset': ['kddcup']},
    {'detector_type': 'outlier', 'detector_name': 'LLR', 'dataset': ['fashion_mnist', 'genome']},
    {'detector_type': 'outlier', 'detector_name': 'Mahalanobis', 'dataset': ['kddcup']},
    {'detector_type': 'outlier', 'detector_name': 'OutlierAE', 'dataset': ['cifar10']},
    {'detector_type': 'outlier', 'detector_name': 'OutlierAEGMM', 'dataset': ['kddcup']},
    {'detector_type': 'outlier', 'detector_name': 'OutlierProphet', 'dataset': ['weather']},
    {'detector_type': 'outlier', 'detector_name': 'OutlierSeq2Seq', 'dataset': ['ecg']},
    {'detector_type': 'outlier', 'detector_name': 'OutlierVAE', 'dataset': ['adult', 'cifar10', 'kddcup']},
    {'detector_type': 'outlier', 'detector_name': 'OutlierVAEGMM', 'dataset': ['kddcup']},
    # Adversarial detectors
    {'detector_type': 'adversarial', 'detector_name': 'model_distillation', 'dataset': ['cifar10'], 'model': ['resnet32']},
    # Drift detectors (not supported by `fetch_detector`...)
    {'detector_type': 'drift', 'detector_name': 'ks', 'dataset': ['cifar10', 'imdb'], 'version': ['0.6.2']},
    {'detector_type': 'drift', 'detector_name': 'mmd', 'dataset': ['cifar10'], 'version': ['0.8.1']},
    {'detector_type': 'drift', 'detector_name': 'tabular', 'dataset': ['income'], 'version': ['0.7.0', '0.8.1']},
]


def dict_product(dicts):
    return (dict(zip(dicts, x)) for x in itertools.product(*dicts.values()))


# Expand each TESTS entry into one trial per combination of its list-valued fields
trials = []
for test in TESTS:
    for k, v in test.items():
        if not isinstance(v, list):
            test[k] = [v]
    trials += list(dict_product(test))
n_tests = len(trials)


@pytest.fixture
def unpack_trials(request):
    return trials[request.param]


@pytest.mark.parametrize("unpack_trials", list(range(n_tests)), indirect=True)
def test_fetch_detector(unpack_trials, tmp_path):
    kwargs = unpack_trials
    print(kwargs)
    if kwargs['detector_type'] in ('outlier', 'adversarial'):
        _ = fetch_detector(tmp_path, **kwargs)
    else:
        # Create URL of detector (drift detectors are not supported by
        # `fetch_detector`, so the bucket directory is downloaded directly)
        version = kwargs.get('version', '')
        version = version.replace('.', '_')
        url = 'gs://seldon-models/alibi-detect/'
        url += 'cd/' + kwargs['detector_name'] + '/' + kwargs['dataset'] + '-' + version + '/'
        # Download bucket directory
        cloudpath = CloudPath(url)
        cloudpath.copytree(tmp_path)
    dd = load_detector(tmp_path)
    # Unwrap (possibly doubly) wrapped detectors
    dd = dd._detector if hasattr(dd, '_detector') else dd
    dd = dd._detector if hasattr(dd, '_detector') else dd
    if kwargs['detector_name'] != 'tabular':
        assert dd.preprocess_fn is not None
```
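For clarity, the `dict_product` helper in the script expands each `TESTS` entry into one trial per combination of its list-valued fields. A minimal standalone example:

```python
import itertools


def dict_product(dicts):
    # Cartesian product over the dict's values, re-keyed with the original keys
    return (dict(zip(dicts, x)) for x in itertools.product(*dicts.values()))


trials = list(dict_product({'detector_name': ['ks'], 'dataset': ['cifar10', 'imdb']}))
# → [{'detector_name': 'ks', 'dataset': 'cifar10'},
#    {'detector_name': 'ks', 'dataset': 'imdb'}]
```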
```python
def save_model(model: tf.keras.Model,
               filepath: Union[str, os.PathLike],
               save_dir: Union[str, os.PathLike] = 'model',
               filename: str = 'model',
```
Saving must also be updated, since `save_model` was using an incorrect naming convention: everything was saved to `model.h5` instead of allowing `encoder.h5` etc.
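A minimal sketch of the corrected save-path logic (the function name and body here are illustrative, not the actual `save_model` implementation):

```python
import os
from pathlib import Path
from typing import Union


def save_model_sketch(model, filepath: Union[str, os.PathLike],
                      save_dir: Union[str, os.PathLike] = 'model',
                      filename: str = 'model') -> Path:
    # Save to <filepath>/<save_dir>/<filename>.h5, so that e.g. an encoder
    # can be written to model/encoder.h5 rather than always model/model.h5.
    model_dir = Path(filepath) / save_dir
    model_dir.mkdir(parents=True, exist_ok=True)
    target = model_dir / f'{filename}.h5'
    model.save(target)  # tf.keras.Model.save infers HDF5 format from the .h5 suffix
    return target
```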
LGTM, is there any reason we don't include the test for remote artefacts? Perhaps we could have it run only on nightly CI? Maybe this requires more thought though?
Yeah, we should include something IMO. But I think we need to have more of a think about:
This PR fixes a bug in the legacy load functionality (for `.pickle`/`.dill` files generated by old `alibi-detect` versions). The bug meant that the detectors stored in https://console.cloud.google.com/storage/browser/seldon-models/alibi-detect could not be loaded successfully, because models were expected to be loaded from `filepath/encoder/model.h5` instead of `filepath/model/encoder.h5`.

This was not picked up in the `test_saving_legacy.py` tests, because these operate by instantiating a detector, saving it, and then reloading it (hence the incorrect naming convention was adopted when saving).

Testing
CI runs: https://github.com/ascillitoe/alibi-detect/actions/runs/3995538470
Backward compatibility has been tested by loading (and checking) the following test matrix of artefacts from https://console.cloud.google.com/storage/browser/seldon-models/alibi-detect (see #729 (comment)):
where `{'detector_type': 'outlier', 'detector_name': 'IForest', 'dataset': ['kddcup']}` corresponds to `seldon-models/alibi-detect/od/IForest/kddcup`, and `{'detector_type': 'drift', 'detector_name': 'ks', 'dataset': ['cifar10', 'imdb'], 'version': ['0.6.2']}` to `seldon-models/alibi-detect/cd/ks/imdb-0_6_2`.

Results
All the above artifacts are loaded successfully except for `{'detector_type': 'outlier', 'detector_name': 'LLR', 'dataset': 'genome'}`, which fails with:

This artifact is very old. It is saved as `.pickle` instead of `.dill`, and the `likelihood_fn` defined in https://docs.seldon.io/projects/alibi-detect/en/stable/examples/od_llr_genome.html is not found at load time.

Note: artifacts with version numbers `<v0.6` are not included in `TESTS`, since they use `.pickle`, and objects they reference, such as `preprocess_drift`, have since been moved to different locations in `alibi_detect`.
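The mapping from a test-matrix entry to a bucket path can be sketched as follows (this mirrors the URL construction for drift detectors in the test script above; the helper name is ours, not part of alibi-detect):

```python
def drift_artifact_url(detector_name: str, dataset: str, version: str) -> str:
    # Drift artifacts live under cd/, with dots in the version replaced by underscores
    return ('gs://seldon-models/alibi-detect/cd/'
            f'{detector_name}/{dataset}-{version.replace(".", "_")}/')


url = drift_artifact_url('ks', 'imdb', '0.6.2')
# → 'gs://seldon-models/alibi-detect/cd/ks/imdb-0_6_2/'
```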