601
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/kopasz7 on 2024-09-13 15:04:19+00:00.

602
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/CeFurkan on 2024-09-13 15:00:59+00:00.

Original Title: Tried Expressions with FLUX LoRA training with my new training dataset (includes expressions and used 256 images (image 19) as experiment) - even learnt body shape perfectly - prompts, workflow and more information at the oldest comment

603
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/TerryCrewsHasacrew on 2024-09-13 12:56:44+00:00.

604
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/ZALIA_BALTA on 2024-09-13 12:03:51+00:00.

605
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/haofanw on 2024-09-13 11:23:29+00:00.

606
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/OkSpot3819 on 2024-09-13 09:22:22+00:00.


  • Open-source of Qwen2-VL (VLM) coming soon (GITHUB) via NielsRogge on X
  • FineVideo: 66M words across 43K videos spanning 3.4K hours - CC-BY licensed video understanding dataset. It enables advanced video understanding, focusing on mood analysis, storytelling, and media editing in multimodal settings (HUGGING FACE)
  • Fluxgym Update: automatically generates sample images during training; use ANY resolution, not just 512 or 1024 (for example 712, etc.) via cocktailpeanut on X (creator)
  • Fish Speech 1.4: text to speech model trained on 700K hours of speech, multilingual (8 languages); voice cloning; low latency; ~1GB model weights (OPEN WEIGHTS) (HUGGING FACE SPACES)
  • Out of Focus v1.0: uses diffusion inversion for prompt-based image manipulation using Gradio UI, requires a high-end GPU for optimal performance (GITHUB)
  • Google NotebookLM launches "Audio Overview" feature: can turn any document into a podcast conversation. Once you upload the document and hit the generate button, two AI moderators will kick off a conversation-like discussion, diving deep into the main takeaways from the document (LINK)
  • Video Model is coming to Adobe Firefly via icreatelife on X
  • Midjourney is pioneering a new 3D exploration format for images, led by Alex Evans, innovator behind Dreams' graphics via MartinNebelong on X
  • FBRC & AWS present Culver Cup GenAI film competition at LA Tech Week via me :) on X
  • Coming soon: Vchitect 2.0 - A new text-to-video and Image-to-video model.
  • UVR5 UI: Ultimate Vocal Remover with Gradio UI (GITHUB)
  • Vidu AI Update: new "Reference to Video" feature, you can now apply consistency to anything—whether real or fictional (LINK)
  • Vchitect 2.0: new image2video/text2video model soon (LINK)
  • and slightly unrelated, but special mention: 🍓!

Wednesday's updates - link

Last week's updates - link

607
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/hackerzcity on 2024-09-13 00:40:59+00:00.


Now you Can Create a Own LoRAs using FluxGym that is very easy to install you can do it by one click installation and manually

This step-by-step guide covers installation, configuration, and training your own LoRA models with ease. Learn to generate and fine-tune images with advanced prompts, perfect for personal or professional use in ComfyUI. Create your own AI-powered artwork today!

You just have to follow Step to create Own LoRs so best of Luck

608
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/DiienOfficial on 2024-09-13 06:43:10+00:00.

609
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/deadlyorobot on 2024-09-13 04:51:32+00:00.

610
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/hudsonreaders on 2024-09-13 03:23:38+00:00.

611
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/theroom_ai on 2024-09-12 18:11:57+00:00.

612
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/ChristinaTreasure on 2024-09-12 16:32:54+00:00.

613
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/phr00t_ on 2024-09-12 23:57:28+00:00.


A commit yesterday to the CogVideo repo added Image2Video support!

Merge pull request #272 from THUDM/CogVideoX_dev · THUDM/CogVideo@87ad61b (github.com)

I added a feature request on the ComfyUI wrapper:

Image2Video Support (CogVideo recent update) · Issue #54 · kijai/ComfyUI-CogVideoXWrapper (github.com)

EDIT: This isn't Image2Video yet, it is work towards supporting Image2Video. The developer said it will be released within the month:

hope for image to video · Issue #270 · THUDM/CogVideo (github.com)

614
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/wonderflex on 2024-09-12 21:40:22+00:00.

615
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Z3ROCOOL22 on 2024-09-12 21:18:04+00:00.


616
1
AI 10 years ago: (i.redd.it)
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/CyberEcho777 on 2024-09-12 22:23:33+00:00.

617
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/EndlessSeaofStars on 2024-09-12 16:25:30+00:00.


I know a lot of people here poke gentle fun of r/restofthefuckingowl but Flux actually did a decent job of it :)

numbered step-by-step drawing from sketched pencil outline to drawing of an owl on a tree branch

Steps: 24, Sampler: Euler, Schedule type: Beta, CFG scale: 1, Distilled CFG Scale: 3.5, Seed: 337531687, Size: 832x1280, Model hash: 275ef623d3, Model: flux1-dev-fp8, Template: numbered step-by-step drawing from sketched pencil outline to drawing of an owl on a tree branch, Beta schedule alpha: 0.6, Beta schedule beta: 0.6, Version: f2.0.1v1.10.1-previous-528-ge55cde9b, Module 1: ae

618
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Patient-Librarian-33 on 2024-09-12 16:09:11+00:00.

619
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/an303042 on 2024-09-12 17:54:33+00:00.

620
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/seekingforwhat on 2024-09-12 17:45:53+00:00.


PuLID-FLUX provides a tuning-free ID customization solution for FLUX.1-dev model.

github link:

description about the model:

visual results:

Showcase of PuLID-FLUX

621
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/RepresentativeJob937 on 2024-09-12 15:04:56+00:00.


Code:

Writeup:

622
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/MooseBoys on 2024-09-12 07:47:07+00:00.

623
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Howlesh on 2024-09-12 09:14:20+00:00.

624
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/jonbristow on 2024-09-12 07:29:52+00:00.


100% of the top posts are about flux now

625
1
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Eveune on 2024-09-12 01:31:28+00:00.

view more: ‹ prev next ›

StableDiffusion

98 readers
1 users here now

/r/StableDiffusion is back open after the protest of Reddit killing open API access, which will bankrupt app developers, hamper moderation, and...

founded 1 year ago
MODERATORS