Add unit tests for the output of problem.evaluate() and .evaluateS1() to cover the expected shapes and types.
Align the method argument type with the docstring. Use either dict[str, np.ndarray] or dict[str, np.ndarray[np.float64]] - as suggested by Brady here:
#676 (review)