326
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Cute_Ride_9911 on 2024-10-02 16:59:30+00:00.

327
1
The original was posted on /r/stablediffusion by /u/Devajyoti1231 on 2024-10-02 16:35:15+00:00.

328
1
The original was posted on /r/stablediffusion by /u/CeFurkan on 2024-10-02 14:26:37+00:00.


The text below is quoted from the source: https://huggingface.co/ostris/OpenFLUX.1

Beta Version v0.1.0

After numerous iterations and spending way too much of my own money on compute to train this, I think it is finally at the point I am happy to consider it a beta. I am still going to continue to train it, but the distillation has been mostly trained out of it at this point. So phase 1 is complete. Feel free to use it and fine tune it, but be aware that I will likely continue to update it.

What is this?

This is a fine tune of the FLUX.1-schnell model that has had the distillation trained out of it. Flux Schnell is licensed Apache 2.0, but it is a distilled model, meaning you cannot fine-tune it. However, it is an amazing model that can generate amazing images in 1-4 steps. This is an attempt to remove the distillation to create an open source, permissively licensed model that can be fine tuned.

How to Use

Since the distillation has been fine tuned out of the model, it uses classic CFG. Since it requires CFG, it will require a different pipeline than the original FLUX.1 schnell and dev models. This pipeline can be found in open_flux_pipeline.py in this repo. I will be adding example code in the next few days, but for now, a cfg of 3.5 seems to work well.
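
As a rough illustration of what "classic CFG" means here, the sketch below blends an unconditional and a conditional prediction with the standard classifier-free guidance formula. It is not the code in open_flux_pipeline.py: the function name `apply_cfg` and the toy values are assumptions made for illustration, and only the scale of 3.5 comes from the quoted text.

```python
def apply_cfg(uncond, cond, scale=3.5):
    """Classifier-free guidance: move from the unconditional prediction
    towards the conditional one, scaled by the guidance factor."""
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]


# Toy per-element predictions (hypothetical values, not real model output).
uncond_pred = [0.0, 1.0, -2.0]
cond_pred = [1.0, 1.0, 0.0]

guided = apply_cfg(uncond_pred, cond_pred, scale=3.5)
print(guided)
```

With a scale of 1.0 the result is just the conditional prediction; larger values push further along the direction from the unconditional to the conditional output. Because both predictions are needed at every step, a CFG pipeline typically runs the model twice per step, which is one reason a separate pipeline is required.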

329
1
The original was posted on /r/stablediffusion by /u/ampp_dizzle on 2024-10-02 14:24:00+00:00.

330
1
The original was posted on /r/stablediffusion by /u/ThinkDiffusion on 2024-10-02 13:23:01+00:00.

331
1
The original was posted on /r/stablediffusion by /u/ItsCreaa on 2024-10-02 11:16:43+00:00.

332
1
The original was posted on /r/stablediffusion by /u/ZyloO_AI on 2024-10-02 15:09:24+00:00.

333
1
The original was posted on /r/stablediffusion by /u/a_beautiful_rhind on 2024-10-02 12:25:37+00:00.

334
1
The original was posted on /r/stablediffusion by /u/Viking_Baboon on 2024-10-02 11:13:31+00:00.

335
1
The original was posted on /r/stablediffusion by /u/Devajyoti1231 on 2024-10-02 06:02:13+00:00.

336
1
The original was posted on /r/stablediffusion by /u/ninjasaid13 on 2024-10-02 05:15:57+00:00.

337
1
A giant dog (old.reddit.com)
The original was posted on /r/stablediffusion by /u/xavier047 on 2024-10-02 03:17:37+00:00.

338
1
The original was posted on /r/stablediffusion by /u/Total-Resort-3120 on 2024-10-01 23:56:31+00:00.

339
1
The original was posted on /r/stablediffusion by /u/terminusresearchorg on 2024-10-01 21:02:31+00:00.


Performance

  • Improved launch speed for large datasets (>1M samples)
  • Improved speed for quantising on CPU
  • Optional support for directly quantising on GPU near-instantly (--quantize_via)

Compatibility

  • SDXL, SD1.5 and SD2.x compatibility with LyCORIS training
  • Updated documentation to make multiGPU configuration a bit more obvious.
  • Improved support for torch.compile(), including automatically disabling it when an incompatible option such as fp8-quanto is enabled
    • Enable via accelerate config or config/config.env via TRAINER_DYNAMO_BACKEND=inductor
  • TorchAO for quantisation as an alternative to Optimum Quanto for int8 weight-only quantisation (int8-torchao)
  • f8uz-quanto, a compatibility level for AMD ROCm users to experiment with FP8 training dynamics
  • Support for multigpu PEFT LoRA training with Quanto enabled (not fp8-quanto)
    • Previously, only LyCORIS would reliably work with quantised multigpu training sessions.
  • Ability to quantise models when full-finetuning, without warning or error. Previously, this configuration was blocked. Your mileage may vary, it's an experimental configuration.
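
For readers unfamiliar with the int8 weight-only quantisation mentioned above, here is a minimal sketch of the general idea using symmetric absmax scaling. It is not how TorchAO or Optimum Quanto implement it; the helper names `quantize_int8` and `dequantize_int8` and the sample weights are hypothetical.

```python
def quantize_int8(weights):
    """Symmetric absmax quantisation of one row of weights to int8.

    Returns the integer codes and the single float scale needed to undo it.
    """
    max_abs = max(abs(w) for w in weights)
    # Guard against an all-zero row, where any scale would do.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(w / scale))) for w in weights]
    return codes, scale


def dequantize_int8(codes, scale):
    """Recover approximate float weights from int8 codes."""
    return [c * scale for c in codes]


row = [0.5, -1.27, 0.0, 1.0]  # hypothetical weight row
codes, scale = quantize_int8(row)
restored = dequantize_int8(codes, scale)
print(codes, scale, restored)
```

Only the weights are stored as 8-bit integers; they are scaled back to floating point when used, so the reconstruction error per weight is at most half a quantisation step.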

Integrations

  • Images now get logged to tensorboard (thanks u/anhi)
  • FastAPI endpoints for integrations (undocumented)
  • "raw" webhook type that sends a large number of HTTP requests containing events, useful for push notification type service

Optims

  • SOAP optimiser support
    • Uses fp32 gradients: nice and accurate, but uses more memory than other optims. By default it slows down every 10 steps as it preconditions.
  • New 8bit and 4bit optimiser options from TorchAO (ao-adamw8bit, ao-adamw4bit etc)

Schnell

Recently we discovered that a LyCORIS LoKr trained on Flux.1 Dev works perfectly fine on Flux.1 Schnell at just 4 steps, and that the problems with transferring adapters from Dev to Schnell are specific to LoRA.

No special training is needed, other than to just train on Dev instead of Schnell.

The release:

The quickstart:

Some docs have been updated for v1.1, mostly OPTIONS.md and the FLUX quickstart.

340
1
The original was posted on /r/stablediffusion by /u/an303042 on 2024-10-01 14:31:17+00:00.

341
1
The original was posted on /r/stablediffusion by /u/uisato on 2024-10-01 16:34:24+00:00.

342
1
The original was posted on /r/stablediffusion by /u/Corinstit on 2024-10-01 16:11:44+00:00.

343
1
The original was posted on /r/stablediffusion by /u/missing-in-idleness on 2024-10-01 12:42:17+00:00.

344
1
The original was posted on /r/stablediffusion by /u/3deal on 2024-10-01 11:58:55+00:00.

345
1
The original was posted on /r/stablediffusion by /u/FortranUA on 2024-10-01 11:58:52+00:00.

346
1
The original was posted on /r/stablediffusion by /u/OkSpot3819 on 2024-10-01 09:22:59+00:00.


  • Interesting find of the week: Kat, an engineer who built a tool to visualize time-based media with gestures.
  • Flux updates:
    • Outpainting: ControlNet Outpainting using FLUX.1 Dev in ComfyUI demonstrated, with workflows provided for implementation.
    • Fine-tuning: Flux fine-tuning can now be performed with 10GB of VRAM, making it more accessible to users with mid-range GPUs.
    • Quantized model: Flux-Dev-Q5_1.gguf quantized model significantly improves performance on GPUs with 12GB VRAM, such as the NVIDIA RTX 3060.
    • New ControlNet models: New depth, upscaler, and surface normals models released for image enhancement in Flux.
    • CLIP and Long-CLIP models: Fine-tuned versions of CLIP-L and Long-CLIP models now fully integrated with the HuggingFace Diffusers pipeline.
  • James Cameron joins Stability AI: Renowned filmmaker James Cameron has joined Stability AI's Board of Directors, bringing his expertise in merging cutting-edge technology with storytelling to the AI company.
  • Put This On Your Radar:
    • MIMO: Controllable character video synthesis model for creating realistic character videos with controllable attributes.
    • Google's Zero-Shot Voice Cloning: New technique that can clone voices using just a few seconds of audio sample.
    • Leonardo AI's Image Upscaling Tool: New high-definition image enlargement feature rivaling existing tools like Magnific.
    • PortraitGen: AI portrait video editing tool enabling multi-modal portrait editing, including text-based and image-based effects.
    • FaceFusion 3.0.0: Advanced face swapping and editing tool with new features like "Pixel Boost" and face editor.
    • CogVideoX-I2V Workflow Update: Improved image-to-video generation in ComfyUI with better output quality and efficiency.
    • Ctrl-X: New tool for image generation with structure and appearance control, without requiring additional training or guidance.
    • Invoke AI 5.0: Major update to open-source image generation tool with new features like Control Canvas and Flux model support.
    • JoyCaption: Free and open uncensored vision-language model (Alpha One Release) for training diffusion models.
    • ComfyUI-Roboflow: Custom node for image analysis in ComfyUI, integrating Roboflow's capabilities.
    • Tiled Diffusion with ControlNet Upscaling: Workflow for generating high-resolution images with fine control over details in ComfyUI.
    • 2VEdit: Video editing tool that transforms entire videos by editing just the first frame.
    • Flux LoRA showcase: New FLUX LoRA models including Simple Vector Flux, How2Draw, Coloring Book, Amateur Photography v5, Retro Comic Book, and RealFlux 1.0b.
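
To give a feel for why the Q5_1 quantised model mentioned in the Flux updates fits on a 12GB card, the back-of-the-envelope sketch below estimates the weight storage. It rests on two assumptions that are not in the newsletter: that the model has roughly 12 billion parameters, and that every weight is stored in the GGML Q5_1 block layout (32 weights per 24-byte block). Real GGUF files usually keep some tensors at higher precision, and text encoders, the VAE, and activations need memory on top of this.

```python
def gguf_q5_1_bytes(n_params):
    """Estimate storage for n_params weights in Q5_1 blocks.

    Assumed block layout: fp16 scale (2 bytes) + fp16 minimum (2 bytes)
    + 32 high bits (4 bytes) + 32 low 4-bit nibbles (16 bytes).
    """
    block_weights = 32
    block_bytes = 2 + 2 + 4 + 16
    n_blocks = -(-n_params // block_weights)  # ceiling division
    return n_blocks * block_bytes


n_params = 12_000_000_000  # assumption: approximate parameter count
size = gguf_q5_1_bytes(n_params)
print(f"{size / 1e9:.1f} GB for weights, {size * 8 / n_params:.1f} bits per weight")
```

Under these assumptions the weights take about 9 GB, or 6 bits per weight, compared with about 24 GB at 16-bit precision.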

📰 Full newsletter with relevant links, context, and visuals available in the original document.

🔔 If you're having a hard time keeping up in this domain - consider subscribing. We send out our newsletter every Sunday.

347
1
The original was posted on /r/stablediffusion by /u/Opening-Ad5541 on 2024-10-01 07:51:06+00:00.

348
1
The original was posted on /r/stablediffusion by /u/formalsystem on 2024-10-01 03:43:29+00:00.

349
1
Just the Police. (www.reddit.com)
The original was posted on /r/stablediffusion by /u/EldrichArchive on 2024-10-01 01:54:45+00:00.

350
1
The original was posted on /r/stablediffusion by /u/MikirahMuse on 2024-09-30 18:33:51+00:00.
