You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for open sourcing this project - it's really helpful!
I wanted to evaluate your checkpoint at a lower resolution (default is 1280x704, I try to evaluate at 576x320), and to make your gen3c_single_image.py run, I added on this lineself.model.state_shape=[16,16,40,72] which I believe correspond to the latent shape at the lower resolution. Then I ran the following command, specifying the height and width arguments.
And the video below is the result I got. It follows the overall desired trajectory, but has obvious artefacts around the edges and some disoccluded parts. I wonder:
If the way I am inferring at lower resolutions looks correct to you;
If so, are the artefacts expected from the model at lower resolutions;
If so, how would you suggest I evaluate the checkpoint on low resolution benchmarks, e.g. RE10K at 576x320?
Thanks for open sourcing this project - it's really helpful!
I wanted to evaluate your checkpoint at a lower resolution (default is 1280x704, I try to evaluate at 576x320), and to make your
gen3c_single_image.pyrun, I added on this lineself.model.state_shape=[16,16,40,72]which I believe correspond to the latent shape at the lower resolution. Then I ran the following command, specifying theheightandwidtharguments.And the video below is the result I got. It follows the overall desired trajectory, but has obvious artefacts around the edges and some disoccluded parts. I wonder:
Thank you a lot for your help!
test_single_image.mp4