Start with 5-15 images, potentially expanding up to 30 or more. Ensure the dataset is large enough for the model to learn relevant patterns but not so large that it becomes unwieldy.
Maintain consistency in the aspects you wish to train, like subject or aesthetics, but introduce diversity for elements you don't want the model to specifically remember.
High-resolution images are preferred. Square images work best and can be cropped and resized in the Scenario web app.
Experiment with various datasets to understand what works best for your specific needs.
Ensure not to have too many similar images to prevent the model from becoming overly specialized.
Include images with various poses, lighting conditions, and body shots for flexibility and accuracy in outcomes. Focus on different angles, facial expressions, and body positions.
Incorporate the character in multiple contexts and settings to help the model understand different scenarios.
The dataset should uniformly represent the style you're training for. If it's an anime style, for example, use specific anime tagging systems for descriptions.
Only include images that match the desired style. For instance, if you are training your own unique art style, only include images from your own style's collection.
Remove unwanted elements like stray objects or signatures to keep images clean.
For more information on image formatting follow this link
Building a good dataset is a craft, and it will take time to learn. We recommend checking out our tutorials, looking at the sample training sets we’ve included, and testing a few different datasets when you train a new generator
By adhering to these guidelines, you can effectively curate a dataset that helps train a LoRA model tailored to your specific requirements, whether it's for specific characters or general styles. The process involves a balance between consistency in key elements and diversity in secondary aspects to ensure the model learns effectively without biases or narrow focus.