Composition Control is a series of settings, which gives users targeted and nuanced control of their outputs. Users simply put in a reference image, select a mode, and retain important information from their image that other tools fail to keep
In the past, it was difficult to communicate to an AI model what is should be referencing within an input image. Composition Control solves this problem by introducing an efficient method for Scenario Generators to produce high quality results with additional input conditions. By employing Composition Control, users can provide reference images to clearly communicate specific details about their image goals.
When Should You Use Composition Control?The best time to use Composition Control Modes is when you are trying to emulate the structure, linework, or general architecture of your reference image. Different modes can pick up poses, edges, lines, and even depth!
Img2Img is recommended if you are trying to retain color values, such a background, garment color, or otherwise.
Composition Control ModesYou can read more about our Composition Control Modes in our Advanced User article. However, we'd like to share with you four powerful modes to get you started. During the Composition Control process, images are converted into Mode Maps, which are then are used to create your images. We recommend starting with one of these four options, and have provided a quick reference guide to help you choose.
Pose mode is ideal for character creation. It works best with realistic or semi-realistic human images, as that is what it was trained on. Pose mode is not as useful for non-character work, but incredibly powerful at detecting faces and poses!
Depth Mode is a wonderful tool for differentiating the background and foreground of your reference image, as well as the leveled elements in an image. As you can see below, it retains both the outer structure, as well as many of the finer details, of the original image.
Structure Mode picks out and highlights all the fine edgework in an image, focusing mainly on what it considers the subject. Structure mode looks the most like the original input, but as is true with Composition Control, it will not carry over any of the original reference colors.
Segmentation mode only notices the areas of space taken up by subjects in an image. It tends to recognize the difference between the foreground and background, as well as typically being able to tell what different objects are in your reference image. However, it will pull in most of it's composition information from your generator, and less from the image itself.