I tried to get SD-XL to generate an image of a frog with its eyes closed. It refused. I even cranked up the attention on closed to an absurd level, and it seemed to get sassy with me.

    • nul@programming.dev
      link
      fedilink
      English
      arrow-up
      4
      ·
      11 months ago

      Did you try putting (eyes open) in the negative prompt instead? I find that when it doesn’t have a strong understanding of a compound phrase, it sometimes focuses more on the individual words. So, “eyes closed” may have been impeded by a stronger influence from “eyes”.

    • wewbull@feddit.uk
      link
      fedilink
      English
      arrow-up
      1
      ·
      11 months ago

      The problem here is that you have the token “eyes” with very heavy weighting, and it’s showing you eyes. Another way of thinking about it is…

      What do you see when somebody closes their eyes? Eyelids