This week in SD - all the major developments in a nutshell
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/OkSpot3819 on 2024-09-08 09:16:45+00:00.
- FluxMusic: New text-to-music generation model using VAE and mel-spectrograms, with about 4 billion parameters.
- Fine-tuned CLIP-L text encoder: Aimed at improving text and detail adherence in Flux.1 image generation.
- SimpleTuner v1.0: Major update to the AI model training tool, including improved attention masking and multi-GPU step tracking.
- LoRA Training Techniques: Tutorial on training Flux.1 Dev LoRAs using "ComfyUI Flux Trainer" with a 12 GB VRAM requirement.
- Fluxgym: Open-source web UI for training Flux LoRAs with low VRAM requirements.
- Realism Update: Improved training approaches and inference techniques for creating realistic "boring" images using Flux.
⚓ Links, context, visuals for the section above ⚓
- AI in Art Debate: Ted Chiang's essay "Why A.I. Isn't Going to Make Art" critically examines AI's role in artistic creation.
- AI Audio in Parliament: Taiwanese legislator uses ElevenLabs' voice cloning technology for parliamentary questioning.
- Old Photo Restoration: Free guide and workflow for restoring old photos using ComfyUI.
- Flux Latent Upscaler Workflow: Enhances image quality through latent space upscaling in ComfyUI.
- ComfyUI Advanced Live Portrait: New extension for real-time facial expression editing and animation.
- ComfyUI v0.2.0: Update brings improvements to queue management, node navigation, and overall user experience.
- Anifusion.AI: AI-powered platform for creating comics and manga.
- Skybox AI: Tool for creating 360° panoramic worlds using AI-generated imagery.
- Text-Guided Image Colorization Tool: Combines Stable Diffusion with BLIP captioning for interactive image colorization.
- ViewCrafter: AI-powered tool for high-fidelity novel view synthesis.
- RB-Modulation: AI image personalization tool for customizing diffusion models.
- P2P-Bridge: 3D point cloud denoising tool.
- HivisionIDPhotos: AI-powered tool for creating ID photos.
- Luma Labs: Camera Motion in Dream Machine 1.6
- Meta's Sapiens: Body-Part Segmentation in Hugging Face Spaces
- Melyns SDXL LoRA 3D Render V2
⚓ Links, context, visuals for the section above ⚓
- FLUX LoRA Showcase: Icon Maker, Oil Painting, Minecraft Movie, Pixel Art, 1999 Digital Camera, Dashed Line Drawing Style, Amateur Photography [Flux Dev] V3