"Out of 100 avatars I generated, 16 were topless, and in another 14 it had put me in extremely skimpy clothes and overtly sexualized poses,” Heikkilä wrote.
Just to qualify first; asian women are certainly highly festishized in general, so I'd totally believe there's some gross bias, so I don't discount the possibility.
But, that said, when I messed around with AI image generators pretty much any kind of prompt that included woman or female designations tended towards sexualized versions, even to the point of violating its own content policy.
I asked it to create a sort of witchy, sorceress character and many of the generations she was fully topless with her boobs out, despite me not asking that or even explicitly putting "fully clothed" into the prompt. There was one image that the system created and then removed and threatened me with a ban for it being too sexualized despite me putting no sexual language in the prompt and it being all the AI.
All that to say, I think AI like this is reflective of societies hyper-sexualization of women of all kinds, it doesn't come as a great surprise.
But, that said, when I messed around with AI image generators pretty much any kind of prompt that included woman or female designations tended towards sexualized versions, even to the point of violating its own content policy.
Tried it on the copilot app and one result had an asian but wasnt sexual but indeed very sexy in style.
Prompt: Generate me a picture of a female wizard reading a massive book of spells
Pictures:
Edit:
Female wizard: Kinda magical fantasy. Has good intentions
Witch: Spooky and mysterious. Halloween themes
Sorceress: Same as wizard but with my selfish/bad intentions.
That's DALL-E. DALL-E is different than Stable Diffusion, which is different from Midjourney, which is different from the many NAI anime models out there.
We need to stop treating LD models like they are all the same thing. Models are based on the data they are trained on. Sure, a lot of them started out from a Stable Diffusion model, but that's not always the case, and enough training can have them go off in specialized directions.
Yeah, if you go back through hundreds of years of artwork, most of it are pictures of women. Some of them are nude. There are many many artists that only draw women, modern or classical. And there's a ton of male Japanese artists from centuries ago that did the same thing.
I asked it to create a sort of witchy, sorceress character and many of the generations she was fully topless with her boobs out, despite me not asking that or even explicitly putting “fully clothed” into the prompt. There was one image that the system created and then removed and threatened me with a ban for it being too sexualized despite me putting no sexual language in the prompt and it being all the AI.
That's just one model, and obviously not Stable Diffusion. LD models are just based on whatever they were trained on. If you don't like it, download another model trained on something else and try it out. Or train one yourself.
Also, I wish everybody would download a SD client and just use this software locally. All of these toy websites are shit, and local clients aren't going to threaten to ban you because of what you generated. It's a good learning experience to figure out the software, and these tools are useful for more things than just bitching about the tech on the web.
Scroll through the trained models on civit.ai and you'll quickly get a feeling of the dystopian level of "prettifying" everything in the AI-generation world.
I also once searched for "brown" just to see if any models were trained to create non-white-skinned people, and got shocked when the result was filled with models trained on Millie Bobby Brown from Stranger Things. I don't even want to know what those models are used for.
You're absolutely correct, yet ask someone who's very pro AI and they might dismiss such claims as "needing better prompts". Also many people may not be as tech informed as you are, and bringing light to algorithmic bias can help them understand and navigate the world we now live in. Dismissing the article just because you already know the answer doesn't really encourage people to participate in a discussion.
It's really hard getting dark skin sometimes. A lot of the time it's not even just the model, LoRAs and Textual Inversions make the skin lighter again so you have to try even harder. It's going to take conscious effort from people to tune models that are inclusive. With the way media is biased right now, I feel like it's going to take a lot of effort.
And every single Asian game and anime tends to go for skimpy or virtual softcore with it's female characters. Rarely you see a female character in full armor.
For reference, check out how that same negative prompt turns a chubby-ish poorly shaved average guy, into a male pornstar, or a valet into a rich daddy's boy.
Can we please collectively get into the habit of editing these borderline-clickbait titles or at least add sub-titles explaining the real article? This isn't reddit where you can't edit anything and can't add explanatory text!
Looking at some of the replied that tried to dismiss the issue and the general lack of concern from moderators against aggressive replies from AI apologists (in this thread but also other AI related threads) are disheartening.
If I had to guess, they probably did a shit job labeling training data or used pre labeled images, now where in the world could they have found huge amounts of pictures of women on the internet with the specific label of “Asian”?
Almost like, most of what determines the quality of the output is not “prompt engineering” but actually the back end work of labeling the training data properly, and you’re not actually saving much labor over more traditional methods, just making the labor more anonymous, easier to hide, and thus easier to exploit and devalue.
Almost like this shit is a massive farce just like the “meta verse” and crypto that will fail to be market viable and waist a shit ton of money that could have been spent on actually useful things.
They did literally nothing and seem to use the default stable diffusion model which is supposed to be a techdemo. Would have been easy to put "(((nude, nudity, naked, sexual, violence, gore)))" as the negative prompt
I'm not exposed to a huge amount of media coming out of Asia, outside of a handful of Korean shows that Netflix has picked up and anime. But like, if anime is any indicator, I'm not really surprised that the training data for Asian women is leaning more toward overt sexualization. Even setting aside the whole misogynistic 'fan service' thing, I don't feel like I see as much representation of women who defy traditional gender roles as the last twenty or so years of Western media.
It certainly could be that anime is actually a huge outlier here, but if the training data is primarily from the English speaking web, it might be overrepresented anyway. But like, when it comes to weird AI image behaviors, it pays to think about the probable training data.
Like, stable diffusion seems to do a better job of rendering jewelry if you tell it to surround it with berries. Given the output, this seems to be due to Christmas themed jewelry ads. They also tend to add a lot of bokeh for the same reason.
Absolutely this. The reason AI defaults female into "female armor mode" is the same reason Excel has January February Maruary. Our spicy autocorrect overlords cannot extrapolate data in a direction that it's training has no knowledge of.
While i agree there is a big issue with the bad biased and sexist training data this entire article is about the lensa app which uses (i assume) the default stable diffusion model laion-5b.
Intentional creating sexualized pictures is banned in their guidelines. And yet no one thought of creating a good negative prompt that negates any kind of nudity or eroticism? It still doesn't properly fix the training data but at least people aren't unwillingly presented porn of their own images.
Also everyone can create a dataset and build a stable diffusion model, so why is lensa relying on the default model which is more like a quick and dirty tech demo. They had all the tools to do this right but decided to not even uses the easy lazy ones.