If they charge by the image, and there is any expectation that AI image-generators might produce garbage results, then AI providers are incentivized to make you pay for their experimentation
without you ever knowing about it, as well as (perhaps) swapping in a cheaper-to-operate model some percentage of the time, perhaps as request loads peak, hoping you'll just roll the dice and try again.
I'm still unsure why anyone would pay for AI image generation purely because of the trial and error it takes. I get that not everyone has a GPU that can do it, but I use stable diffusion through automatic 1111 and I'll likely be about 2-300 generations of text to image, image to image, some inpainting and editing, then some more image to image and upscaling before I get a representation of what's in my head down.
I love the process of it all, but paying for tokens would completely limit me. Is there a specific reason that people use paid models? Or is it just because a lot of people are limited by their gpu?
GPU limiting and a general lack of either knowledge or wanting to put in the effort to do it themselves. Even just going into Github in the first place is enough of a barrier for a lot of people, unfortunately
Yep, I've a mobile 3070 in my laptop, and whilst I feasibly could run some of the smallest models around, paying on a per-use basis gets me way better quality results for relatively cheaply.
Besides, running it locally isn't free either. Your hardware deprecates and depreciates over time, in addition to non-negligible power costs.
Easiest GitHub install I have ever done - works well on both Linux and windows per my own testing. Go onto the Automatic1111 stable-diffusion-webui github and find the git clone link, and then, from a command shell, use “cd” to get into your directory of choice and then do a “git clone <link>”. After that, go into said folder and literally just run the webui.bat/.sh file (might be called webui or might be called setup, can’t remember). That’s it - it should install all of the packages and python stuff and everything for you, and from there, honestly just fuck around with it for a while. On Linux, I had to install a few extra things/screw around a bit to get it working fully, but it wasn’t too tough. I highly recommend also installing the “dynamic prompts” extension and playing around with wildcards and such. Civitai has some great models and LoRAs you can play around with, too - Dreamshaper is a great one. If you’ve not got a 4000-series GPU, I would suggest editing the webui-user.bat/sh file and adding “—medvram” or “—lowvram” to help you get higher res images at the cost of a little bit of speed. Let me know if you’ve got any more questions! If nothing else, it’s honestly just a lot of fun to use.
Not sure if YouTube links are allowed here, but if you just search for automatic 1111 install tutorials, Sebastian Kamph's channel is excellent and pretty much everything you'll need to know is on there!
Keep an eye for his updated videos though, as you can imagine the tech moves so quickly that some of his more popular videos have been remade and updated!