This is straight from the SDXL beta on discord… can’t wait to have this to try on my own PC with Auto1111, I’m sure tweaking the step and cfg settings would fix the texture details. Already loving how much better it is at text and at composition, and from what I’ve read should be really easy to train and tweak.

    • pablonaj@feddit.deOP
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 年前

      Yes, it was:

      Photo of a lemming with a welcome sign, “welcome” written with marker.

      No negatives, no fancy adjectives…

      I made 8 images and this one was the best (some had no text, some had uglier lemmings, some had misspelled signs) but most were acceptable just not what I wanted. Also the cfg and steps and sampler are random so once we can control that it will be much easier.

      • j4k3@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 年前

        Any word on the real world hardware requirements? I’m currently shopping for a machine

        • pablonaj@feddit.deOP
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 年前

          Not 100% sure, but from what I read it’s not too far from what 2.1 needs. They have even fine tuned it on normal GPUs. I’m not sure if they will have different versions like they had with 2.1 where they released a 512 and a 768 trained version, which would require less VRAM.