A data scientist is training a generative AI model to produce realistic images of cats. The training dataset consists of millions of cat images, none of which has explicit tags or annotations (e.g., "Siamese," "tabby," "sitting," "playing"). The model learns to generate new cat images by identifying patterns and features directly from this raw image data. What type of data is primarily being used for this training?