Thanks for your great work. I have one question:  Why do you use CLIP encoder for foreground encoding but use VAE encoder for background encoding? Hope for your response. Thank you! Best Wishes
This issue appears to be discussing a feature request or bug report related to the repository. Based on the content, it seems to be resolved. The issue was opened by CHNxindong and has received 2 comments.