526
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/jmbirn on 2024-09-17 20:46:27+00:00.

527
1
WPAP Style LoRA [FLUX] (www.reddit.com)
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/jenza1 on 2024-09-17 20:02:23+00:00.

528
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/tom83_be on 2024-09-17 18:21:14+00:00.


Update: Now runs with about 7 GB VRAM, see bold text on updated settings below!

I posted a guide (basically working settings) for OneTrainer LoRA/DoRA training here. There was a question concerning support for 8 GB VRAM. I tried a few settings and it seems to run at just below 8 GB VRAM. Since I do not own such a card I need people with these cards to validate it (maybe there are spikes that I do not see).

Please do the folkowing:

  • Use the settings provided here:
  • EMA OFF (training tab) => maybe not needed, see update below
  • Rank = 16, Alpha = 16 (LoRA tab)
  • activating "fused back pass" in the optimizer settings (training tab) seems to yield another 100MB of VRAM saving => maybe not needed, see update below
  • "LoRA weight data type" (LoRA tab) to bfloat16 again saves some VRAM. => maybe not needed, see update below
  • Update: You can also set "gradient checkpointing" to "CPU_OFFLOADED" in the "training"-tab. After that it runs with less than 7 GB VRAM, but a bit slower for me (3,7 s/it vs. 3.4 s/it). Thanks to u/setothegreat for that idea! If you keep EMA enabled, still use float32 as the "LoRA weight data type" and also do not activate "fused back pass", it still runs at 7,2 GB VRAM and 3,9 s/it for me. So it might be enough to

It now trains with just below 7,8 / 7,9 GB of VRAM. I would like to get feedback from 8 GB VRAM users if this works.

I can also give no guarantee on quality/success of the training! Let's find out together!

PS: I am using my card for training/AI only; the operating system is using the internal GPU, so all of my VRAM is free. For 8 GB VRAM users this might be crucial to get it to work...

529
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Secure-Message-8378 on 2024-09-18 00:52:17+00:00.

530
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/RalFingerLP on 2024-09-17 21:11:07+00:00.

531
1
Deep Sea [FLUX] (www.reddit.com)
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/jenza1 on 2024-09-17 20:26:24+00:00.

532
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Equal_Couple6552 on 2024-09-17 20:01:11+00:00.

533
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/FugueSegue on 2024-09-17 19:19:17+00:00.


The rule the angry moderator cited was: "Your post/comment has been removed because it contains content created with closed source tools. OP has stated they used Photoshop and Topaz on some elements."

This is the message I just sent to all the moderators of this subreddit:

Why did you delete my post? According to the message I received:

"Your post/comment has been removed because it contains content created with closed source tools. OP has stated they used Photoshop and Topaz on some elements."

THERE IS NO RULE ABOUT THAT. If you're referring to rule #1:

"All posts must be Open-Source / Local AI image generation related. All tools used to create post content must be open source/local AI image generation. Comparisons with other AI generation platforms are accepted."

You're saying I violated that rule?!?!? THAT'S INSANE! Are one of your moderators really THAT vindictive? Almost EVERYONE uses Photoshop and any other image processor to get their work done! This includes preparing datasets, inpainting with SD plugins, to final presentation. ALL of the work that was done to create that image was done with Stable Diffusion models and LoRAs! I use Photoshop to do my inpainting with ComfyUI! ALMOST ALL WORKING DIGITAL ARTISTS USE PHOTOSHOP! It's a standard tool! I use Topaz whenever I need to enlarge an element that I send through img2img!

Are you really going to be THAT dogmatic about rule #1? Because if you do, then you'll have to delete half the images posted here! You'll have to start a massive, ugly inquisition.

Did it ever occur to you to ASK me about these things? Or asking if I used Adobe's generative fill? Because I didn't! Did you consider making even the SLIGHTEST inquiry? Instead of just deleting the post about a painting I worked on? On my cake day, no less.

Do you want generative AI art accepted in the rest of the art world? Because this isn't the way to do it.

534
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Patient-Librarian-33 on 2024-09-17 17:00:20+00:00.

535
1
Sakura tree (i.redd.it)
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/EcoPeakPulse on 2024-09-17 18:03:11+00:00.

536
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Disastrous-Hope-2537 on 2024-09-17 11:32:21+00:00.


Paper:

537
1
Under The Red Moon (www.reddit.com)
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/TheArchivist314 on 2024-09-17 03:20:07+00:00.

538
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/DustWorlds on 2024-09-17 00:28:12+00:00.

539
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/StelfieTT on 2024-09-17 08:03:44+00:00.

540
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/tom83_be on 2024-09-17 07:37:21+00:00.

541
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/ToastersRock on 2024-09-16 23:54:21+00:00.


Here it is. It is not perfect and does require writing prompts that describe a scene with miniature people. Check some of the sample images for examples.

542
1
animatediff mwseo (old.reddit.com)
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/mwseo_ai on 2024-09-16 22:55:27+00:00.

543
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Hot_Opposite_1442 on 2024-09-17 04:35:36+00:00.

544
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Agreeable_Effect938 on 2024-09-16 23:19:57+00:00.

545
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/dropitlikeitshot999 on 2024-09-16 22:57:49+00:00.

546
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/jenza1 on 2024-09-16 21:15:46+00:00.

547
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/ninjasaid13 on 2024-09-16 18:27:42+00:00.

548
2
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/phr00t_ on 2024-09-16 18:17:45+00:00.


I found where the Image2Video CogVideo 5B model has been released:

清华大学云盘 (tsinghua.edu.cn)

Found on this commit:

llm-flux-cogvideox-i2v-tools · THUDM/CogVideo@b410841 (github.com)

It looks like this branch has the latest repository changes:

THUDM/CogVideo at CogVideoX_dev (github.com)

The pull request to update the Gradio app is here (with example images used to I2V):

gradio app update by zRzRzRzRzRzRzR · Pull Request #290 · THUDM/CogVideo (github.com)

The model is a pt, so it may need some massaging into a safetensors or quantization. However, it appears like all of the pieces of the puzzle are available now -- just need to be put together (ideally as ComfyUI nodes, hehe).

EDIT: The webspace demo has been updated with I2V!!

CogVideoX-5B - a Hugging Face Space by THUDM

EDIT2: Looks like the PyTorch file for download is corrupted:

Image2Video Support (CogVideo recent update) · Issue #54 · kijai/ComfyUI-CogVideoXWrapper (github.com)

... but has been uploaded to HuggingFace, just private. I did file an issue with CogVideo about the corrupted model, but probably need to wait (again) for a working model download. Looks like we can play with the Gradio demo in the meantime.

549
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/oodelay on 2024-09-16 17:05:20+00:00.

550
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Novokhrodiivka on 2024-09-16 16:34:35+00:00.

view more: ‹ prev next ›

StableDiffusion

98 readers
1 users here now

/r/StableDiffusion is back open after the protest of Reddit killing open API access, which will bankrupt app developers, hamper moderation, and...

founded 1 year ago
MODERATORS