I have tried the model on several images. For higher resolutions, tile decoding is required.
I am posting here several images that got weird results (using the vanilla inference script) - are these just model limitations or bugs in the script?
Do the model expect a specific resolution and on other resolutions tiling is needed?
(The order of the images is: top - original, bottom - generated)



