Skip Navigation
Civitai Joins the Open Model Initiative | Civitai
civitai.com Civitai Joins the Open Model Initiative | Civitai

Today, we’re excited to announce the launch of the Open Model Initiative, a new community-driven effort to promote the development and adoption of ...

Civitai Joins the Open Model Initiative | Civitai
0
Izutsumi - Dungeon Meshi

(motimalu) (2024)

Image description: A girl with feline features dressed in an adventurous outfit with a red scarf, leather breastplate, and gauntlets. She has large yellow eyes and black hair covering her head and parts of her body.

Full Generation Parameters:

score_9,score_8_up,score_7_up,score_6_up, source_anime, izutsumi,1girl, solo, animal ears, tail, cat girl, black hair, yellow eyes, black fur, body fur, furry, cat tail, cat ears, dungeon meshi, furry female, looking at viewer, white fur, short hair, slit pupils, fingernails, sharp fingernails, claws, izutsumi original outfit, sleeveless, brown skirt, miniskirt, scarf, armor, crop top, red scarf, arm guards, standing, midriff, from above, cowboy shot, white background, simple background, googly eyes, masterpiece, best quality, very aesthetic, absurdres <lora:dungeon_meshi_collection_v23:1>

Negative prompt: score_4, score_3, score_2, score_1, source_pony, source_furry, source_cartoon, 3d, monochrome, greyscale

Steps: 28, CFG scale: 7, Sampler: Euler a, Seed: 421035125, Size: 832x1216, Model: mugenmalumixSDXL_v42, Version: f0.0.17v1.8.0rc-latest-276-g29be1da7, Model hash: 3b561e8947, Hires steps: 10, Hires upscale: 2, Hires upscaler: 4x-UltraSharp,

ADetailer model: face_yolov8n.pt, ADetailer version: 24.5.1, Denoising strength: 0.3, ADetailer mask blur: 4, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer inpaint padding: 32, ADetailer denoising strength: 0.4, ADetailer inpaint only masked: True, Clip skip: 2

0
Nebular Cat

(me_miserum) (2024)

Image description: A person standing in a grassy field at twilight, with a colossal, nebular cat that blends into a starry sky above them. The cat's body is a fiery orange, while its head is a contrasting cool blue.

Full Generation Parameters:

Bosstyle, a Woman fighting a giant cute kitty-kite Boss By Wes Anderson and HR Giger: Capture the essence of nostalgia and color in this vintage photograph of a kite festival. Dominating the scene is a meticulously detailed, translucent multicolored fire-kitty-kite, adorned with vibrant patterns, blending seamlessly with its surroundings. Its long tail, composed of intricately woven ribbons in a spectrum of colors, trails behind, adding to the spectacle of the moment. Amidst the quietude of the night, its presence exudes a sense of ancestral strength and resilience. Behind it, the low-toned hues of the Milky Way cast a mesmerizing backdrop, a cosmic tapestry weaving tales of both past and future. In this moment, the convergence of tradition and technology, of ancient wisdom and industrial progress, hangs palpably in the air. In the depths of this nocturnal tableau lies the promise of epic tales yet untold, where the spirit of the Indianer melds seamlessly with the diesel-powered pulse of a world in flux.

Negative prompt: NEG-fixl-2, easynegative, bad proportions, low resolution, bad, ugly, terrible, painting, 3d, render, comic, anime, manga, unrealistic, flat, watermark, signature, worst quality, low quality, normal quality, lowres, simple background, inaccurate limb, extra fingers, fewer fingers, missing fingers, extra arms, (extra legs:1.3), inaccurate eyes, bad composition, bad anatomy, error, extra digit, fewer digits, cropped, low res, worst quality, low quality, normal quality, jpeg artifacts, extra digit, fewer digits, trademark, watermark, artist's name, username, signature, text, words, human,

Steps: 40, CFG scale: 5, Sampler: DPM++ 2M Karras, Seed: 180040627, Size: 832x1216, Created Date: 2024-06-24T0154:16.7061736Z, Clip skip: 2

1
The Open Model Initiative - Invoke, Comfy Org, Civitai and LAION, and others coordinating a new next-gen model. - r/StableDiffusion

Quoted from Reddit:

Today, we’re excited to announce the launch of the Open Model Initiative, a new community-driven effort to promote the development and adoption of openly licensed AI models for image, video and audio generation.

We believe open source is the best way forward to ensure that AI benefits everyone. By teaming up, we can deliver high-quality, competitive models with open licenses that push AI creativity forward, are free to use, and meet the needs of the community.

Ensuring access to free, competitive open source models for all.

With this announcement, we are formally exploring all available avenues to ensure that the open-source community continues to make forward progress. By bringing together deep expertise in model training, inference, and community curation, we aim to develop open-source models of equal or greater quality to proprietary models and workflows, but free of restrictive licensing terms that limit the use of these models.

Without open tools, we risk having these powerful generative technologies concentrated in the hands of a small group of large corporations and their leaders. ‍ From the beginning, we have believed that the right way to build these AI models is with open licenses. Open licenses allow creatives and businesses to build on each other's work, facilitate research, and create new products and services without restrictive licensing constraints. ‍ Unfortunately, recent image and video models have been released under restrictive, non-commercial license agreements, which limit the ownership of novel intellectual property and offer compromised capabilities that are unresponsive to community needs.

Given the complexity and costs associated with building and researching the development of new models, collaboration and unity are essential to ensuring access to competitive AI tools that remain open and accessible.

We are at a point where collaboration and unity are crucial to achieving the shared goals in the open source ecosystem. We aspire to build a community that supports the positive growth and accessibility of open source tools.

For the community, by the community

Together with the community, the Open Model Initiative aims to bring together developers, researchers, and organizations to collaborate on advancing open and permissively licensed AI model technologies.

The following organizations serve as the initial members:

  • Invoke, a Generative AI platform for Professional Studios
  • ComfyOrg, the team building ComfyUI
  • Civitai, the Generative AI hub for creators
  • LAION, one of the largest open source data networks for model training

To get started, we will focus on several key activities:

•Establishing a governance framework and working groups to coordinate collaborative community development.

•Facilitating a survey to document feedback on what the open-source community wants to see in future model research and training

•Creating shared standards to improve future model interoperability and compatible metadata practices so that open-source tools are more compatible across the ecosystem

•Supporting model development that meets the following criteria: ‍

  • True open source: Permissively licensed using an approved Open Source Initiative license, and developed with open and transparent principles
  • Capable: A competitive model built to provide the creative flexibility and extensibility needed by creatives
  • Ethical: Addressing major, substantiated complaints about unconsented references to artists and other individuals in the base model while recognizing training activities as fair use.

‍We also plan to host community events and roundtables to support the development of open source tools, and will share more in the coming weeks.

Join Us

We invite any developers, researchers, organizations, and enthusiasts to join us.

If you’re interested in hearing updates, feel free to join our Discord channel.

If you're interested in being a part of a working group or advisory circle, or a corporate partner looking to support open model development, please complete this form and include a bit about your experience with open-source and AI.

Sincerely,

Kent Keirsey CEO & Founder, Invoke

comfyanonymous Founder, Comfy Org

Justin Maier CEO & Founder, Civitai

Christoph Schuhmann Lead & Founder, LAION

2
Demon Lord - Level 1 Demon Lord and One Room Hero

(turkish8) (2024)

Image description: A boy with long blue hair and pointed ears dressed in a typical school uniform consisting of a white shirt with a blue collar, red necktie, and pleated skirt. He has light blue long hair and three eyes with the third on his forehead, and two large red horns on each side of his head.

Full Generation Parameters:

score_9, score_8_up, score_7_up, score_6_up, score_5_up, score_4_up, source_anime,rating_safe, 1girl,solo, <lora:lv1maou_b_v1:1>, lv1maou_b, horns, pointy ears, aqua hair, purple eyes, school uniform, pleated skirt, neckerchief, white legwear, smile,open mouth, shoes, standing, blush,looking at viewer

Negative prompt: source_furry, source_pony, source_cartoon, 3d, realistic, monochrome, text, watermark, signature, jpeg artifacts, sepia, (censored:1.1), lip, nose, rouge, lipstick, (3d, photo, hyperrealistic, rough sketch:1.1), censored, furry, bad anatomy, bad, bad hands, sketch, low quality, lowres, watermark, signature, simple background, white background, long torso,ahegao,watermark,signature,watermark,signature

Steps: 34, CFG scale: 7, Sampler: Euler A Turbo, Seed: 1841830709, VAE: sdxl_vae.safetensors, Size: 1024x1366, Model: autismmixSDXL_autismmixConfetti, Version: f0.0.17v1.8.0rc-latest-269-gef35383b, Mask blur: 4, Model hash: ac006fdd7e,

Inpaint area: Only masked, ADetailer model: face_yolov8n.pt, ADetailer version: 24.3.0, Denoising strength: 0.4, ADetailer mask blur: 4, Masked area padding: 32, ADetailer confidence: 0.81, ADetailer dilate erode: 4, ADetailer inpaint padding: 32, ADetailer denoising strength: 0.4, ADetailer inpaint only masked: True, ADetailer mask only top k largest: 1, Clip skip: 2

0
Aglow in the Forest

(LordTerror) (2024)

Image description: A coyote standing in the center of a forest path bathed in golden beams of sunlight.

Full Generation Parameters:

by [Henry Raeburn:Frank Xavier Leyendecker:0.25] and [Janek Sedlar|Jeremiah Ketner] in the style of Terry Redlin

Negative prompt: poorly drawn, deviant, mess, low quality, deformed, unprofessional

Steps: 30, CFG scale: 4, Sampler: DPM++ 3M SDE Karras, Seed: 1210910949, VAE: sdxl_vae.safetensors, Size: 768x1152, Model: newrealityxlAllInOne_Newreality40, Version: v1.8.0, Model hash: 32b09fe763, Hires steps: 12, Hires upscale: 1.5, Hires upscaler: 4x-UltraSharp,

ADetailer model: face_yolov8n.pt, ADetailer prompt: [object Object], ADetailer VAE 3rd: vae-ft-mse-840000-ema-pruned.ckpt, ADetailer version: 24.3.0, Denoising strength: 0.45, ADetailer mask blur: 12, ADetailer model 2nd: hand_yolov8n.pt, ADetailer model 3rd: female-breast-v4.0-fantasy.pt, ADetailer steps 3rd: 20, ADetailer confidence: 0.5, ADetailer prompt 2nd: [object Object], ADetailer prompt 3rd: [object Object], ADetailer sampler 3rd: DPM++ 2M Karras, ADetailer dilate erode: 12, ADetailer CFG scale 3rd: 7.0, ADetailer inpaint width: 1152, ADetailer mask blur 2nd: 12, ADetailer mask blur 3rd: 12, ADetailer checkpoint 3rd: SD1.5\wafflemix7.safetensors [4844f773bb], ADetailer confidence 2nd: 0.5, ADetailer confidence 3rd: 0.4, ADetailer inpaint height: 1152, ADetailer mask min ratio: 0.001, ADetailer inpaint padding: 256, ADetailer dilate erode 2nd: 12, ADetailer dilate erode 3rd: 12, ADetailer inpaint width 2nd: 896, ADetailer inpaint width 3rd: 1024, ADetailer denoising strength: 0.45, ADetailer inpaint height 2nd: 896, ADetailer inpaint height 3rd: 1024, ADetailer mask min ratio 2nd: 0.001, ADetailer mask min ratio 3rd: 0.001, ADetailer inpaint only masked: True, ADetailer inpaint padding 2nd: 256, ADetailer inpaint padding 3rd: 256, ADetailer negative prompt 3rd: [object Object], ADetailer use separate VAE 3rd: True, ADetailer denoising strength 2nd: 0.45, ADetailer denoising strength 3rd: 0.45, ADetailer use separate steps 3rd: True, ADetailer inpaint only masked 2nd: True, ADetailer inpaint only masked 3rd: True, ADetailer mask only top k largest: 5, ADetailer use inpaint width height: True, ADetailer use separate sampler 3rd: True, ADetailer use separate CFG scale 3rd: True, ADetailer mask only top k largest 2nd: 6, ADetailer mask only top k largest 3rd: 4, ADetailer use separate checkpoint 3rd: True, ADetailer use inpaint width height 2nd: True, ADetailer use inpaint width height 3rd: True

0
Francesca and Chingling - Pokémon

(Https18) (2024)

Image description: A girl dressed in a vibrant red suit with a matching wide-brimmed hat, accented by a blue shirt and pink bow tie. She is holding what appears to be a wadded up handkerchief in her raised hand. Accompanying the character is a small, cheerful creature with orange and cream coloring.

Full Generation Parameters:

score_9, score_8_up, score_7_up, source anime, anime, animification, anime screencap, anime coloring, vivid colors, shiny skin, skindentation, deep skin, depth of field, high quality, highres, (dynamic angle,dynamic pose), francesca \(pokemon\), 1girl, long hair, looking at viewer, smile, shirt, long sleeves, hat, ribbon, holding, closed mouth, purple eyes, jacket, earrings, green hair, collared shirt, pokemon, (creature), buttons, blue shirt, headwear removed, hat removed, top hat, holding clothes, holding hat BREAK masterpiece,high quality <lora:Francesca_XL-10:0.9>

Negative prompt: source_pony, source_furry, source_explicit, child, loli, deformed, bad anatomy, disfigured, poorly drawn face, mutated, extra limb, ugly, poorly drawn hands, missing limb, floating limbs, disconnected limbs, disconnected head, malformed hands, long neck, mutated hands and fingers, bad hands, missing fingers, cropped, worst quality, low quality, mutation, poorly drawn, huge calf, bad hands, fused hand, missing hand, disappearing arms, disappearing thigh, disappearing calf, disappearing legs, missing fingers, fused fingers, abnormal eye proportion, Abnormal hands, abnormal legs, abnormal feet, abnormal fingers, drawing, painting, crayon, sketch, graphite, impressionist, noisy, blurry, soft, deformed, ugly, cartoon, graphic, text, painting, crayon, graphite, abstract, glitch, 3D, cartoon, photorealistic, realistic, censored

Steps: 30, CFG scale: 6, Sampler: Euler a, Seed: 4268872312, : [object Object], RNG: CPU, VAE: sdxl_vae.safetensors, Size: 768x1160, Model: autismmixSDXL_autismmixConfetti, RP Flip: False, Version: v1.6.0-2-g4afaaf8a, RP Active: True, RP Ratios: [object Object], Model hash: ac006fdd7e, RP Options: [object Object], Hires steps: 10, RP Use Base: False, RP Calc Mode: Attention, RP threshold: 0.4, Hires upscale: 1.6, RP Use Common: True, Hires upscaler: R-ESRGAN 4x+ Anime6B, RP Base Ratios: 0.2, RP Divide mode: Matrix, RP Use Ncommon: False, RP Mask submode: Mask, RP LoRA Stop Step: 0, RP Matrix submode: Columns, RP Prompt submode: Prompt, Denoising strength: 0.45, RP LoRA Neg U Ratios: 0, RP LoRA Neg Te Ratios: 0, RP LoRA Hires Stop Step: 0, Clip skip: 2

0
Cliff Beacon

(unfazedanomaly964) (2024)

Image description: A lone figure standing at the base of a towering cliff with a bright green star etched into it that emits beams of light. The scene is dimly lit, with white flowers partially framing the view.

Full Generation Parameters:

1girl, gwen tennyson (/Ben10)/,(ultra HD quality details), bright orange hair, short hair, (green eyes) concept art Under a perpetually stormy sky, in a realm where darkness reigns supreme, a single, bright, lonely light pierces the blackness, emanating from a lone glowing flower at the edge of a cliff. This flower, with petals that radiate a soft, golden light, stands as the last beacon of hope in a world consumed by sadness and ruin. The landscape below is a desolate expanse of cracked earth and dead vegetation, stretching into the horizon under the weight of eternal twilight. The flower's glow casts long, dramatic shadows, illuminating the tears in the earth and the remnants of a forgotten past. This poignant image tells a story of resilience, where even in the depths of despair, a spark of light can bring a sense of hope and beauty. digital artwork, illustrative, painterly, matte painting, highly detailed

Negative prompt: (drawn), (bland), (inactive), CyberRealistic_Negative, (watermark), (name)

Steps: 18, CFG scale: 4.5, Sampler: Euler a, Seed: 1815237583, Size: 832x1216, Created Date: 2024-06-23T1452:45.7405035Z, Clip skip: 2

0
Jahy-sama - Jahy-sama wa Kujikenai!

(interfusor) (2024)

Image description: A girl with vibrant purple hair and striking amber eyes making a playful expression with her mouth open. She is wearing a white top with garbled text on it and has her hands raised in a claw-like gesture.

Full Generation Parameters:

score_9, score_8_up, score_8, source_anime, 1girl, <lora:Jahy:0.85> solo, chibi, dark skin, dark-skinned female, long hair, hair between eyes, purple hair, pointy ears, shirt, white shirt, clothes writing, oversized clothes, open mouth, laugh, single fang, upper body, light blue background, simple background,

Negative prompt: score_5, score_4, 3d, render, censored, source_cartoon, source_western, source_furry, source_pony

Steps: 20, CFG scale: 7, Sampler: Euler a, Seed: 3721670105, VAE: sdxl_vae.safetensors, Size: 832x1216, Model: autismmixSDXL_autismmixConfetti, Version: v1.7.0, Model hash: ac006fdd7e, Hires upscale: 2, Hires upscaler: R-ESRGAN 4x+,

ADetailer model: face_yolov8n.pt, ADetailer version: 24.5.1, Denoising strength: 0.3, ADetailer mask blur: 4, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer inpaint padding: 32, ADetailer denoising strength: 0.4, ADetailer inpaint only masked: True, Clip skip: 2

1
Blossoms in the Storm

(Channel_42) (2024)

Image description: A solitary tree with pink blossoms standing tall on a slope against a stormy nocturnal sky. The ground is carpeted with green grass, sprinkled with colorful wildflowers.

Full Generation Parameters:

*Channel_42* *中* *Tree* *Cloud* *Outdoors* *Sky* *Grass* *Scenery* *Neon* *Mischievous* *Dark* *Smoosh* *Silver* *Midnight* *Flowers* *Raining* *Multicolor* *Sparkle*

Negative prompt: *I* *Do* *Not* *Use* *Negative* *Prompts*

Steps: 42, CFG scale: 4.2, Sampler: DPM++ 2M Karras, Seed: 435884914, Size: 832x1216, Created Date: 2024-06-20T1208:29.6778250Z, Clip skip: 2

0
Mochida Arisa - THE IDOLM@STER CINDERELLA GIRLS

(pelagic) (2024)

Image description: A cheerful young woman wearing a pink bunny onesie, sitting on a blue seat. The onesie features long ears, and she is making bunny ears with both hands near their face. A plush bunny toy sits beside her, mirroring her pose.

Full Generation Parameters:

score_9, score_8_up,score_7_up,1girl,solo,bunny pose,arms up,looking at viewer,(smile:0.6),(embarrassed:0.2),indoors,couch,open mouth, <lora:mochidaarisa_ponyXLV6:0.8>,cgmar,brown hair,long hair,low ponytail,hair scrunchie,brown eyes,pink bunny hood

Negative prompt: realistic,lowres,worst quality,bad anatomy,bad hands,high-contrast,monochrome

Steps: 30, CFG scale: 7, Sampler: Euler a, Seed: 3287128779, Size: 896x1152, Model: ponyDiffusionV6XL, Version: v1.7.0, Model hash: 67ab2fd8ec, Hires steps: 10, VAE Encoder: TAESD, Hires upscale: 2, Hires upscaler: 4x_fatal_Anime_500000_G, Variation seed: 1101, Denoising strength: 0.4, Variation seed strength: 0.11, Clip skip: 2

0
Shelled Wanderer

(OdiousStooge) (2024)

Image description: A large tortoise in the heart of a dense jungle. Green light filters through the canopy above onto the tortoise’s orange shell, providing a stark contrast against its dark blue-green skin.

Full Generation Parameters:

A vast, overgrown jungle where a giant turtle makes its way through the dense foliage. The turtle's massive shell is a miniature ecosystem, covered in giant, bioluminescent mushrooms that emit a soft, ethereal glow. Birds and small creatures nest among the fungi, adding life and movement to the scene. The turtle's slow, deliberate steps cause the ground to tremble slightly, and the glowing mushrooms light up its path through the dark, verdant jungle, in the style of decaying animatronic, claymation, VHS screen grab, grainy, old footage, in the style of an old 80's VHS dark fantasy, VHS screen grab, grainy, old footage , the scene is captured in dimly lit dark fantasy but vibrant colors, with bold ink lines defining form against the watercolor wash of the aged paper

Negative prompt: BadDream, worst quality, low quality, normal quality, monochrome, grayscale, (navel, lace), text, signature, watermark

Steps: 37, CFG scale: 3.5, Sampler: DPM++ 2M Karras, Seed: 352905922, Size: 832x1216, Created Date: 2024-06-23T2041:17.9999258Z, Clip skip: 2

0
Takano Akira - School Rumble

(dude_) (2024)

Image description: A young woman with short dark hair, sitting in a tranquil forest setting. She is dressed in a sleeveless dress and is positioned with her hands resting on the knees. Dappled sunlight filters through the canopy of green leaves above, casting patterned shadows across the scene and the character. The character gazes off to the side with a neutral expression, suggesting a moment of introspection or quiet contemplation.

Full Generation Parameters:

<lora:takano_akira_ponyxl_v2:0.9>, takira, short hair, dark red hair, brown hair, purple eyes, half-closed eyes, solo, looking at viewer, (giraffe costume:1.2), animal costume, outdoors, sitting on tree stump, dappled sunlight, tree, tree shade, large tree, foliage, grass, bush, nature, backlighting, score_9, score_8_up, score_7_up, score_6_up, anime coloring, uncensored, <lora:UrushiharaSatoshi_XL_PONY_V2:0.4>

Negative prompt: line art, watermark, logo, (worst quality:1.5), (low quality:1.5), (normal quality:1.5), lowres, bad anatomy, bad hands, multiple eyebrow, (cropped), extra limb, missing limbs, deformed hands, long neck, long body, signature, username, artist name, conjoined fingers, deformed fingers, error, (deformed|distorted|disfigured), poorly drawn, wrong anatomy, mutation, mutated, (mutated hands AND fingers), bad fingers, loss of a limb, extra limb, missing limb, floating limbs, amputation, deformed, black and white, disfigured, low contrast

Steps: 28, CFG scale: 7, Sampler: Euler a, Seed: 2017541030, VAE: ponyDiffusionV6XL_v6StartWithThisOne.vae.safetensors, Size: 1040x1520, Model: ponyDiffusionV6XL_v6StartWithThisOne, Model hash: 67ab2fd8ec, kohya_hrfix_enabled: True, kohya_hrfix_end_percent: 0.35, kohya_hrfix_block_number: 3, kohya_hrfix_start_percent: 0, kohya_hrfix_upscale_method: bicubic, kohya_hrfix_downscale_factor: 1.5, kohya_hrfix_downscale_method: bicubic, kohya_hrfix_downscale_after_skip: True

0
Gentle Cast

(jice) (2024)

Image description: A man, dressed in neutral tones and wearing a hat, stands at the edge of a calm stream dotted with blooming water lilies, engrossed in fishing. The surrounding landscape is lush with greenery, and the sunlight filtering through the trees casts a soft glow on the scene.

Full Generation Parameters:

a fisherman,lake,cristal clear water,beautiful trees,beautiful flowers,blue light,romanticism art,(landscape art stylized by Karol Bak:1.3),Paul Gauguin,Cyberpop,short lighting,F/1.8,extremely beautiful,<lora:oil-painting:2>, oil painting of

Steps: 10, CFG scale: 2, Sampler: Euler, Seed: 1784836563, Size: 900x1200, Model: CreaPrompt_Ultimate3, Version: v1.9.4, Model hash: c83f76acb6, Schedule type: Uniform,

ADetailer model: face_yolov8n.pt, ADetailer version: 24.6.0, ADetailer x offset: 4, ADetailer y offset: -2, ADetailer mask blur: 8, ADetailer model 2nd: mediapipe_face_mesh_eyes_only, ADetailer confidence: 0.44, ADetailer dilate erode: 4, ADetailer mask blur 2nd: 4, ADetailer confidence 2nd: 0.3, ADetailer inpaint padding: 32, ADetailer dilate erode 2nd: 4, ADetailer denoising strength: 0.36, ADetailer inpaint only masked: True, ADetailer inpaint padding 2nd: 32, ADetailer denoising strength 2nd: 0.49, ADetailer inpaint only masked 2nd: True, ADetailer mask only top k largest: 1

0
Nanako - Pokémon

(TecnoIA) (2024)

Image description: A girl with blue hair striking an enthusiastic pose is dressed in an outfit consisting of a white cap with a yellow symbol, a yellow-striped jacket, a blue t-shirt, and striped shorts.

Full Generation Parameters:

score_9, score_8_up, score_7_up, score_6_up, score_5_up, score_4_up, BREAK source_anime, anime screencap, <lora:CaseyPkmnXL-08:0.7> CaseyPkmnXL, 1girl, solo, striped shorts, clenched hand, open mouth, hat, shirt, blue hair, eyelashes, baseball cap, striped, smile, shorts, pink background, jacket, :d, knees, tongue, vertical stripes, looking at viewer, outstretched arm, open clothes, open jacket

Negative prompt: score_6, score_5, score_4, holding, monochrome, simple background, 3d,

Steps: 8, CFG scale: 3, Sampler: DPM++ 2S a, Seed: 703577234, Size: 896x1152, Model: autismmixSDXL_autismmixLightning, Version: f0.0.17v1.8.0rc-latest-276-g29be1da7, Model hash: f1f226aa36, Clip skip: 2

0
Solitude and Scarlet Leaves

(LordTerror) (2024)

Image description: A lone figure standing on the edge of a cliff overlooking a misty seascape. Above them towers an imposing tree with vibrant red foliage that contrasts starkly against the surrounding subdued tones.

Full Generation Parameters:

by (((Nathan Wirth) and Thomas Cole) and Becky Cloonan) and James Gilleard, surreal art, full body length, cinematic , hyperdetailed photography, , shallow depth of field, <lora:more_art:0.40><lora:faetastic:0.25><lora:midjourney:0.15>

Negative prompt: low quality, bad anatomy, mess, bizarre, cartoon, painting, illustration

Steps: 24, CFG scale: 7, Sampler: DPM++ 2M Karras, Seed: 2142316041, VAE: sdxl_vae.safetensors, Size: 768x1152, Model: zavychromaxl_v40, Version: v1.7.0, Template: [object Object], Model hash: 63a3752da1, Hires steps: 10, Hires upscale: 1.4, Hires upscaler: 4x-UltraSharp,

ADetailer model: face_yolov8n.pt, ADetailer prompt: [object Object], ADetailer VAE 3rd: vae-ft-mse-840000-ema-pruned.ckpt, ADetailer version: 24.1.2, Negative Template: [object Object], Denoising strength: 0.5, ADetailer mask blur: 12, ADetailer model 2nd: hand_yolov8n.pt, ADetailer model 3rd: female_breast_v3.2.pt, ADetailer steps 3rd: 20, ADetailer confidence: 0.5, ADetailer prompt 2nd: __prompt__ __detailer/hands__, ADetailer prompt 3rd: __prompt__ __detailer/breasts__, ADetailer sampler 3rd: DPM++ 2M Karras, Hires negative prompt: [object Object], ADetailer dilate erode: 12, ADetailer CFG scale 3rd: 7.0, ADetailer inpaint width: 1024, ADetailer mask blur 2nd: 12, ADetailer mask blur 3rd: 12, ADetailer checkpoint 3rd: SD1.5\wafflemix7.safetensors [4844f773bb], ADetailer confidence 2nd: 0.5, ADetailer confidence 3rd: 0.5, ADetailer inpaint height: 1024, ADetailer mask min ratio: 0.001, ADetailer inpaint padding: 256, ADetailer negative prompt: [object Object], ADetailer dilate erode 2nd: 12, ADetailer dilate erode 3rd: 12, ADetailer inpaint width 2nd: 896, ADetailer inpaint width 3rd: 1024, ADetailer denoising strength: 0.45, ADetailer inpaint height 2nd: 896, ADetailer inpaint height 3rd: 1024, ADetailer mask min ratio 2nd: 0.001, ADetailer mask min ratio 3rd: 0.001, ADetailer inpaint only masked: True, ADetailer inpaint padding 2nd: 256, ADetailer inpaint padding 3rd: 256, ADetailer negative prompt 2nd: [object Object], ADetailer negative prompt 3rd: [object Object], ADetailer use separate VAE 3rd: True, ADetailer denoising strength 2nd: 0.35, ADetailer denoising strength 3rd: 0.45, ADetailer use separate steps 3rd: True, ADetailer inpaint only masked 2nd: True, ADetailer inpaint only masked 3rd: True, ADetailer mask only top k largest: 5, ADetailer use inpaint width height: True, ADetailer use separate sampler 3rd: True, ADetailer use separate CFG scale 3rd: True, ADetailer mask only top k largest 2nd: 6, ADetailer mask only top k largest 3rd: 4, ADetailer use separate checkpoint 3rd: True, ADetailer use inpaint width height 2nd: True, ADetailer use inpaint width height 3rd: True

1
SD.Next Release for 2024-06-23
github.com GitHub - vladmandic/automatic: SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models

SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models - vladmandic/automatic

GitHub - vladmandic/automatic: SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models

Highlights for 2024-06-23

Following zero-day SD3 release, a 10 days later here's a refresh with 10+ improvements including full prompt attention, support for compressed weights, additional text-encoder quantization modes.

But there's more than SD3:

  • support for quantized T5 text encoder FP16/FP8/FP4/INT8 in all models that use T5: SD3, PixArt-Σ, etc.
  • support for PixArt-Sigma in small/medium/large variants
  • support for HunyuanDiT 1.1
  • additional NNCF weights compression support: SD3, PixArt, ControlNet, Lora
  • integration of MS Florence VLM/VQA Base and Large models
  • (finally) new release of Torch-DirectML
  • additional efficiencies for users with low VRAM GPUs
  • over 20 overall fixes
0
Y'shtola Rhul - FFXIV

(Manityro) (2024)

Image description: A woman with cat-like features standing in a moonlit forest. She is holding an ornate staff, topped with a glowing purple orb and adorned with smaller gems of the same color. Dressed in a dark, feathered outfit, she exudes an air of mystique. The full moon casts a soft glow on the forest and the character.

Full Generation Parameters:

score_9, score_8_up, score_7_up, 1girl, solo, shtla, dark-skinned female, animal ears, tail, whisker markings, short hair, large breasts, <lora:YshtolaRhulPDXL_V1-Manityro-CAME:1.0>, grey eyes, white hair, shaOtft, feather earrings, feathers, brooch, black halterneck, black choker, black dress, long dress, fur-trimmed dress, wide sleeves, black gloves, partially fingerless gloves, claw ring, looking at viewer, serious, holding staff, wood staff, outdoors, dark, night, moon, spooky forest, purple magic, purple aura

Negative prompt: monochrome, text, signature, artist name, watermark, patreon, moles, mole, pubes, tanlines,

Steps: 24, CFG scale: 6, Sampler: Euler A AYS, Seed: 3622385744, VAE: sdxlVAE_sdxlVAE.safetensors, Size: 832x1216, Model: snowpony_v10, Version: f0.0.17v1.8.0rc-latest-287-g77bdb920, Model hash: d6f941b46b, Hires steps: 10, Hires upscale: 1.5, Hires upscaler: 4x-UltraSharp,

ADetailer model: Anzhc Face seg 640 v2 y8n.pt, ADetailer version: 24.6.0, Denoising strength: 0.4, ADetailer mask blur: 8, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer inpaint width: 1024, ADetailer inpaint height: 1024, ADetailer inpaint padding: 32, ADetailer denoising strength: 0.4, ADetailer inpaint only masked: True, ADetailer use inpaint width height: True, Clip skip: 2

0
Twilight Gazer

(CivitFowl) (2024)

Image description: A fantastic stag standing in a moonlit forest. Its body reflects the glow of the bioluminescent plants around it, and a puddle mirrors its striking form below.

Full Generation Parameters:

(8k digital photography, raw photo:1.2), a enchanted beautiful summer landscape at (dawn with a fantastical creature, magical lights in the sky:1.1) <lora:Bio-Luminescence:1> bioluminescent

Steps: 7, CFG scale: 3, Sampler: DPM++ SDE Karras, Seed: 2368414684, VAE: sdxl_vae.safetensors, Size: 612x1024, Model: stableFusionXLTurbo_v161, Version: f0.0.17v1.8.0rc-latest-276-g29be1da7, Model hash: 913421be31, Hires steps: 7, Hires upscale: 1.5, Hires upscaler: 4X-NMKD.pth, Denoising strength: 0.7

0
AI trained on photos from kids’ entire childhood without their consent
  • This isn't undermining artists, it's expanding access and knowledge, enabling individuals to take control of their own destinies. Open-source AI will empower artists, existing artists and newly active or returning artists who give this new medium a shot, by giving them the new tools that will push the frontiers of self-expression and redefine creativity this decade.

    100 years ago photographers and filmmakers significantly disrupted the careers of most illustrators, story tellers, and theater companies of the time. Despite this, storytelling and image making exploded, entering a new golden age. Musicians panicked over the use of synthesizers in the 80s too often refusing to work with people involved with synthesizers. As a result, there are fewer drummers today than in 1970, but out of that came hip hop and house. Suppressing that tool would have been a huge cultural loss. Generative art hasn't found its Marley Marl or Frankie Knuckles yet, but they're out there, and they're going to do stuff that will blow our minds. Cutting edge tools and techniques have always propelled art and artists forward. Every advancement a leap forward, leaving behind constraints and enabling more people to pursue their creative aspirations.

    That reminds me of a presentation I saw a little while back.

    If you want to fight against people's right to freely communicate and express themselves, be my guest, but it's not a fight you can win.

  • AI trained on photos from kids’ entire childhood without their consent
  • Giving all people a tool to help them more effectively communicate, express themselves, learn, and come together is something everyone should get behind.

    I firmly believe in the public's right to access and use information, while acknowledging artists should retain specific rights over their creations. I also accept that the rights they don't retain have always enabled ethical self-expression and productive dialogue.

    Imagine if copyright owners had the power to simply remove whatever wasn't profitable for them from existence. We'd be hindering critical functions such as critique, investigation, reverse engineering, and even the simple cataloging of knowledge. In place of all that good, we'd have an ideal world for those with money, tyrants, and all those who seek control, and the undermining of the free exchange of ideas.

  • AI trained on photos from kids’ entire childhood without their consent
  • Taking artists’ work without consent or compensation goes against the spirit of open source, though, doesn’t it?

    It doesn't. Making observations about others' works is a well-established tool for any researchers, reviewers, and people inventing new works. A concept which work perfectly within the open source framework. That's all these models are, original analysis of its training set in comparison with one another. Because it's a step one must necessarily take when doing anything, doing this doesn't require anyone's permission and is itself a right we all have.

  • AI trained on photos from kids’ entire childhood without their consent
  • I'm not fighting for the extremely wealthy, I'm fighting for the existence of competitive open source models. Something that can't happen with what you've proposed. That would just hand corporations a monopoly of a public technology by making it prohibitively expensive to for regular people to keep up with the megacorporations that already own vast troves of data and can afford to buy even more.

    This article by Katherine Klosek, the director of information policy and federal relations at the Association of Research Libraries does a good job of explaining what I'm talking about.

  • AI trained on photos from kids’ entire childhood without their consent
  • So the question that comes to mind is exactly how, on a practical level, it would work to make sure that when a company scrapes data, trains and AI, and then makes billions of dollars, the thousands or millions of people who created the data all get a cut after the fact. Because particularly in the creative sector, a lot of people are freelancers who don’t have a specific employer they can go after. From a purely practical perspective, paying artists before the data is used makes sure all those freelancers get paid. Waiting until the company makes a profit, taxing it out of them, and then distributing it to artists doesn’t seem practical to me.

    This isn't labor law.

  • AI trained on photos from kids’ entire childhood without their consent
  • Creating same-y pieces with AI will not improve the material conditions of artists’ lives, either. All that does is drag everyone down in a race to the bottom on who can churn out the most dreck the most quickly. “If we advance the technology enough, everybody can have it on their device and make as much AI-generated crap as they like” does not secure stable futures for artists.

    If you're worried about labor issues, use labor law to improve your conditions. Don't deny regular people access to a competitive, corporate-independent tool for creativity, education, entertainment, and social mobility for your monetary gain.

    Art ain't just a good; it's self-expression, communication, inspiration, joy – rights that belong to every human being. The kind of people wanting to relegate such a significant part of the human experience to a domain where only the few can benefit aren't the kind of people that want things to get better. They want to become the proverbial boot. The more people can participate in these conversations, the more we can all learn.

    I understand that you are passionate about this topic, and that you have strong opinions. However, insults, and derisive language aren't helping this discussion. They only create hostility and resentment, and undermine your credibility. If you’re interested, we can continue our discussion in good faith, but if your next comment is like this one, I won’t be replying.

  • AI trained on photos from kids’ entire childhood without their consent
  • The point is that It's not an activity you can force someone to pay for. Everyone that can run models on their own can benefit, and that group can expand with time as research makes it more feasible on more devices. But that can never come to pass if we destroy the rights that allow us to make observations and analyze data.

    counting words and measuring pixels are not activities that you should need permission to perform, with or without a computer, even if the person whose words or pixels you're counting doesn't want you to. You should be able to look as hard as you want at the pixels in Kate Middleton's family photos, or track the rise and fall of the Oxford comma, and you shouldn't need anyone's permission to do so.

    Creating an individual bargainable copyright over training will not improve the material conditions of artists' lives – all it will do is change the relative shares of the value we create, shifting some of that value from tech companies that hate us and want us to starve to entertainment companies that hate us and want us to starve.

  • InitialsDiceBearhttps://github.com/dicebear/dicebearhttps://creativecommons.org/publicdomain/zero/1.0/„Initials” (https://github.com/dicebear/dicebear) by „DiceBear”, licensed under „CC0 1.0” (https://creativecommons.org/publicdomain/zero/1.0/)EV
    Even_Adder @lemmy.dbzer0.com
    Posts 2.1K
    Comments 1K