Problem with that is that taking away even specific parts of the dataset can have a large impact of performance as a whole… Like when they removed NSFW from an image generator dataset and suddenly it sucked at drawing bodies in general
Problem with that is that taking away even specific parts of the dataset can have a large impact of performance as a whole… Like when they removed NSFW from an image generator dataset and suddenly it sucked at drawing bodies in general