Stable Diffusion IMG2IMG: EVERYTHING you need to know IN ONE PLACE!

111.93k views1756 WordsCopy TextShare

Incite AI

Img2img, inpainting, inpainting sketch, even inpainting upload, I cover all the basics in todays vid...

Video Transcript:

in today's video I'm going to show you everything you need to know about the imaged image tool in automatic 1111 image to image sketch in painting in paint sketch even in paint upload this video has it all ready let's get started the image to image tab is an essential tool in your air toolbox it lets you create a new image or new elements of an image from an existing picture that you provide there are so many powerful tools here but let's start with the image damage tab itself really you're going to use this tool if you have an image that you want to use as a starting point you can use that image to pull elements of composition and color into a brand new image so I've got an image here that I've generated earlier it's a portrait of a girl on a city street I use the sdxl base model to generate this image but you can use any image you want an existing photo a painting whatever so to start we'll drop that into the image to image box you can also send images here from the text image tab just click on send to image to image under your generation at the top just like we would for text image we can add in positive and negative prompts if you did send your image across from text to image the original prompt will appear just like you see here below the image box you'll see all the settings we can play with now a lot of these settings in this tab are shared with text image so I'll only touch on the image to image specific ones here for a more in-depth explanation of settings check out the video in the description the First new setting is resize mode use this if your new image will have a different size or aspect ratio than your original image for example if your original image is landscape and you want your new image to be portrait the default is just resize this will stretch or shrink your original image to fit the new image you create crop and resize will keep the aspect of your original image and crop it to fit the new canvas size resize and fill takes the original image fits it into the canvas and fills in the blanks with colors from the input image I don't see much use for this one but let me know in the comments if I'm missing something just resize latent upscale is just like just resize but also upscales the image if necessary note that this setting is a bit tricky to get right and there are easier ways to upscale your images below resize modes you'll find settings you're already familiar with including sampling method sampling steps your size and batch settings and the CFG scale the most important setting here is denoising strength stable diffusion makes images by taking random noise and converging it into an image based on your prompt this setting will control how much extra noise is added to the picture and in turn will determine how different the new image is from the original lower settings won't change the image that much and higher settings will make bigger changes I like to use 0. 7 as a good starting point okay so now for the fun part once you've sent over an image you can start tweaking some settings to refine your picture we sent this image over from text to image so we have all of our settings and the original prompt now we could just start tweaking our prompt here but bear in mind that the entire prompt plus the image will be used to create a new image so if you aren't using the same model seed and sampler from the original image you may get some weird results so instead we're just going to use some simpler instructions to see how it will affect the image so let's get rid of the prompt and put in pink and and blue hair and let's keep the same sampling method and steps we'll tick restore faces and we'll keep the default on just resize and keep our Dimensions the same and we'll set our denoising strength to 0. 7 so we should see a bit of a variation in our output foreign seven that's had a pretty big effect so if we want something a little more subtle let's try lowering the denoiser strength to 0.

5 [Music] and yeah that's a much more subtle impact on the image so you can keep on adding more detail here and reworking things until you get an image that you're looking for when we use this method though we can see that the prompt has had an impact on the entire image the hair and eyes have changed but so has the face and the clothes let's say we just want to change a particular part of the image and keep the rest just as it is that's where in paint comes in in paint is a powerful tool that allows us to paint over the specific parts of images that we want to change this is especially useful when you like the overall composition of your image but you get one of those Cosmic nightmare faces from the hell Dimension just no to start you can click on the in paint tab or just click in paint under the image to copy the image over and all of its settings so now we can just paint over the particular part of the image we want to change in this case the hair we can use the settings in the top right of the box to change the brush size or clear our selection if we go wrong so let's just paint over the hair and then we can check our settings we've seen some of these already so we'll keep resize mode on default mask blur is the amount of blurring that happens around your painted mask lower numbers will give you harder edges while cranking this up will affect how many pixels beyond your mask will be affected by your prompt the default of four is usually fine but feel free to play with this if you're getting hard edges mask mode determines what is actually changed in the image in paint mask will change the parts you've painted while in paint not masked will change everything but the parts you painted this is great if you want to change the background or large amounts of your image in one go next you can choose the masked content setting these settings tell stable diffusion what method it should use as its basis for generating the new image part The Fill option will take the impainted area blur it then generate using that as its base the original setting will use the original image unaltered as its base to generate a new image from latent noise will fill the impainted area with new random noise based on the seed number this can be used if you want to result completely different from the original Late nothing will fill in the painted area with a blend of the colors from the surrounding pixels and base the generation of that these settings will have a dramatic effect on the output but if you're making Minor Adjustments the original setting will usually work great the in-paint area setting is easily confused with mask mode but they are different choosing whole Picture Tells stable diffusion to use the whole picture as inspiration for the in-paint generation while only masked will treat the Mast area in isolation and not draw from the rest of the image typically whole image is the best setting if you want the painted area to blend in but if you do choose only masked you can change the padding size right next to it which will tell stable diffusion how many neighboring pixels it should take into consideration as inspiration for the new generation and of course below that we have the CFG scale and the noising strength let's stick this on 5. 5 and 0. 6 and see what happens now that looks pretty good but we can do even more within painting sketch so let's say we want to give our model a scarf we can send this image to in-paint sketch just like we did before we can paint over the image but we now have the option of choosing colors so let's go with red here and just paint in the area we want to add our scarf to and in the prompt we'll just type in red Woolen scarf for this you usually have to bump up the denoising strength so let's go to 0.