StableDiffusion
- Yamato-e style Flux lora
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/Devajyoti1231 on 2024-09-29 08:47:42+00:00.
- lorakit: A Simple Toolkit for Rapid Prototyping SDXL LoRA Modelsold.reddit.com lorakit: A Simple Toolkit for Rapid Prototyping SDXL LoRA Models
Hey guys, So I've been working on this thing I'm calling lorakit. It's just a little toolkit I threw together for training SDXL LoRA models. It is...
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/os75 on 2024-09-29 02:53:10+00:00. *** Hey guys, So I've been working on this thing I'm calling lorakit. It's just a little toolkit I threw together for training SDXL LoRA models. It is heavily based on DreamBooth from AutoTrain but with similar configuration style as ai-toolkit. Nothing fancy, but it's been pretty handy for quick experiments and prototyping. Thought some of you might wanna check it out:
- How do I make realistic animals like this in Flux?
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/smusamashah on 2024-09-29 13:06:03+00:00.
- Minecraft for nothing (AD unsampling)old.reddit.com Minecraft for nothing (AD unsampling)
Posted in r/StableDiffusion by u/stbl_reel • 38 points and 5 comments
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/stbl_reel on 2024-09-29 12:15:43+00:00.
- When will SD3.1 medium be released, if at all?
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/reditor_13 on 2024-09-29 08:49:35+00:00.
- Testing depth-aware image-to-image animation with Flux + Controlnetold.reddit.com Testing depth-aware image-to-image animation with Flux + Controlnet
Posted in r/StableDiffusion by u/rolux • 20 points and 2 comments
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/rolux on 2024-09-29 08:16:48+00:00.
- I wanted to achieve some natural look with FLUX and some mix of LORAs. Does it look good?
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/kozakfull2 on 2024-09-29 00:02:25+00:00.
- Audio Reactive Playhead in COMFYUIold.reddit.com Audio Reactive Playhead in COMFYUI
Posted in r/StableDiffusion by u/ryanontheinside • 29 points and 12 comments
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/ryanontheinside on 2024-09-28 21:51:32+00:00.
- InvokeAI New Update is Crazy
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/NeededMonster on 2024-09-29 00:11:27+00:00.
- Local video generation has come a long way. Flux Dev+CogVideoold.reddit.com Local video generation has come a long way. Flux Dev+CogVideo
1. Generate image with Flux 2. Use as starter image for CogVideo 3. Run image batch through upscale workflow 4. Interpolate from 8fps to 60fps
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/LocoMod on 2024-09-28 22:39:45+00:00.
- Retro Comic Flux LoRA
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/renderartist on 2024-09-28 22:36:15+00:00.
- Comfyui Tutorial: Outpainting using flux & SDXL lightning (Workflow and Tutorial in comments)
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/cgpixel23 on 2024-09-28 17:54:59+00:00.
- Instagram Edition - v5 - Amateur Photography Lora [Flux Dev]
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/Major_Specific_23 on 2024-09-28 21:05:00+00:00.
- Man carrying 100 tennis balls
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/TabCompletion on 2024-09-28 15:54:01+00:00.
- What trainer for LoRA is better for you and why (Flux version) ?old.reddit.com What trainer for LoRA is better for you and why (Flux version) ?
As I was trying to save time while having good results, I tried 3 different ones (Kohya_SS, ComfyUI/Kohya and Ai-toolkit) I still think Ai-toolkit...
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/TableFew3521 on 2024-09-28 11:03:53+00:00. *** As I was trying to save time while having good results, I tried 3 different ones (Kohya\_SS, ComfyUI/Kohya and Ai-toolkit) I still think Ai-toolkit is way better than Kohya, and I think is because the shceduler "Flowmatch", is the only different config, and even with bad quality images you can achieve amazing skin texture on LoRAs, but in Kohya even tho I save like 5 hours (wich is crazy), I get good results but with this plastic skin texture of Flux no matter the resolution of the images I use. What is your experience? You agree or disagree with me? You think there's a better trainer than the ones I mentioned?
- 🖼 Advanced Live Portrait 🔥 Jupyter Notebook 🥳old.reddit.com 🖼 Advanced Live Portrait 🔥 Jupyter Notebook 🥳
Posted in r/StableDiffusion by u/camenduru • 33 points and 5 comments
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/camenduru on 2024-09-28 16:03:48+00:00.
- Some very surprising pages from the 14th century "Golden Haggadah" illuminated manuscript
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/an303042 on 2024-09-28 13:45:35+00:00.
- New, Improved Flux.1 Prompt Dataset - Photorealistic Portraits
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/kastmada on 2024-09-28 11:34:36+00:00.
- Steve Mould randomly explains the inner workings of Stable Diffusion better than I've ever heard beforeold.reddit.com Steve Mould randomly explains the inner workings of Stable Diffusion better than I've ever heard before
[https://www.youtube.com/watch?v=FMRi6pNAoag](https://www.youtube.com/watch?v=FMRi6pNAoag) I already liked Steve Mould...a dude that's appeared...
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/AdQuirky7106 on 2024-09-28 02:14:31+00:00. ***
I already liked Steve Mould...a dude that's appeared on Numberphile many times. But just now watching a video on a certain kind of dumb little visual illusion, he unexpectedly launched into the most thorough and understandable explanation of how CLIP-inferred diffusion models work that I've ever seen. Like, by far. It's just incredible. For those that haven't seen this, enjoy the little epiphanies from connecting diffusion-based image models, LLMs, and CLIP, and how they all work together with cross-attention!!
Starts at about 2 minutes in.
- WINAMP SKIN V1 - Flux LoRAold.reddit.com WINAMP SKIN V1 - Flux LoRA
Posted in r/StableDiffusion by u/blankey1337 • 22 points and 9 comments
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/blankey1337 on 2024-09-27 13:56:33+00:00. ***
Download it at civitai:
Have fun!
- Ctrl-X code released, controlnet without finetuning or guidance.old.reddit.com Ctrl-X code released, controlnet without finetuning or guidance.
Posted in r/StableDiffusion by u/NunyaBuzor • 28 points and 10 comments
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/NunyaBuzor on 2024-09-27 23:46:54+00:00. *** Code:
Project Page:
Note: Everything information you see below comes from the project page, please take the results with a grain of salt on its quality.
Ctrl-X is a simple tool for generating images from text without the need for extra training or guidance. It allows users to control both the structure and appearance of an image by providing two reference images—one for layout and one for style. Ctrl-X aligns the image’s layout with the structure image and transfers the visual style from the appearance image. It works with any type of reference image, is much faster than previous methods, and can be easily integrated into any text-to-image or text-to-video model.
Ctrl-X works by first taking the clean structure and appearance data and adding noise to them using a diffusion process. It then extracts features from these noisy versions through a pretrained text-to-image diffusion model. During the process of removing the noise, Ctrl-X injects key features from the structure data and uses attention mechanisms to transfer style details from the appearance data. This allows for control over both the layout and style of the final image. The method is called "Ctrl-X" because it combines structure preservation with style transfer, like cutting and pasting.
Results of training-free and guidance-free T2I diffusion with structure and appearance control
Results of training-free and guidance-free T2I diffusion with structure and appearance control
Ctrl-X is capable of multi-subject generation with semantic correspondence between appearance and structure images across both subjects and backgrounds. In comparison, ControlNet + IP-Adapter often fails at transferring all subject and background appearances.
Ctrl-X also supports prompt-driven conditional generation, where it generates an output image complying with the given text prompt while aligning with the structure of the structure image. Ctrl-X continues to support any structure image/condition type here as well. The base model here is Stable Diffusion XL v1.0.
- Epic Movie Poster LoRA - [FLUX]
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/jenza1 on 2024-09-27 18:07:49+00:00.
- Greek Pantheon
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/Much_Can_4610 on 2024-09-27 14:56:14+00:00.
- hszd_copper | FLUX art Copper Wire Style
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/zazaoo19 on 2024-09-27 11:20:49+00:00.
- I wanted to see how many bowling balls I could prompt a man holding
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/rwbronco on 2024-09-27 21:17:11+00:00.
- CogVideoX-I2V updated workflow
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/lhg31 on 2024-09-27 21:08:37+00:00.
- New Upscaler, depth and normal maps ControlNets for FLUX.1-dev are now available on Hugging Face hub.
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/SideMurky8087 on 2024-09-27 14:33:32+00:00.
- Tests on infinite loops with CogVIdeoX-FUNold.reddit.com Tests on infinite loops with CogVIdeoX-FUN
Posted in r/StableDiffusion by u/jjjnnnxxx • 40 points and 7 comments
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/jjjnnnxxx on 2024-09-27 09:11:55+00:00.
- Flux and/or SD Outpaint Wallpaper Generator
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/wonderflex on 2024-09-27 04:51:38+00:00.
- AI Video Avatarold.reddit.com AI Video Avatar
Hey together! I’m working on an AI avatar right now using mimic motion. Do you have any ideas how to do this more realistic?
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/137nft on 2024-09-27 08:54:41+00:00.
- 32 GB, 512-Bit, GDDR7, Leaked by Kopite7kimi
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/yasashikakashi on 2024-09-27 08:07:51+00:00.
- Google Street View × DynamiCrafter-interpold.reddit.com Google Street View × DynamiCrafter-interp
Posted in r/StableDiffusion by u/nomadoor • 56 points and 5 comments
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/nomadoor on 2024-09-27 07:31:50+00:00.
- If you had to start as a raw beginner today, what would you start with?old.reddit.com If you had to start as a raw beginner today, what would you start with?
I'm interested to hear from the both the semi-experienced and the veterans out there. I'm just curious if you were starting over from scratch and...
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/0260n4s on 2024-09-27 03:01:06+00:00. *** I'm interested to hear from the both the semi-experienced and the veterans out there. I'm just curious if you were starting over from scratch and had to relearn everything, which environment, models and tools would you choose to dive headfirst into...and why?
- Elektroschutz ⚡styled warnings nobody asked for
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/an303042 on 2024-09-27 05:34:34+00:00.
- Movie Scene to Figure Style
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/BigRub7079 on 2024-09-27 04:48:58+00:00.
- Emu3: open source multimodal models for Text-to-Image & Video and also Captioning
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/hinkleo on 2024-09-27 04:14:03+00:00.
- Dragonball poster created with Flux and Photoshop
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/chaindrop on 2024-09-26 08:51:09+00:00.
- Flux based Upscaler - Demo on Huggingface and new Controlnet Model - Anyon have a Comfy Workflow?huggingface.co Flux.1-dev Upscaler - a Hugging Face Space by jasperai
Discover amazing ML apps made by the community
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/CliffDeNardo on 2024-09-26 22:17:33+00:00.
- group of friends (sdxl)
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/PixarCEO on 2024-09-26 17:44:15+00:00.
- I trained a vintage movie_poster Lora (Link in comments)
This is an automated archive made by the Lemmit Bot.
The original was posted on /r/stablediffusion by /u/44Beatzz on 2024-09-26 13:25:53+00:00.