I want to replace the OneFormer used in your demo with some other segmentation models (for instance, SAM), now suppose I have ``` input_img: H1 * W1 * 3, uint8 input_mask H1 * W1, binary 0,1 ref_img: H2 * W2 * 3, uint8 ref_mask: H2 * W2 binary 0,1 ``` How should I modify the demo code to get correct structure and appearance guidance features? Thanks for your help in advance!