I want to personally train the controlnet, but I find it inconvenient to prepare the datasets. As I follow the huggingface tutorial available at this link: https://huggingface.co/blog/train-your-controlnet, I believe I should organize the dataset in the huggingface datasets format. My intention is to train the controlnet using various prompt settings and compare the outcomes. However, I realize that I will need to create multiple datasets for each experiment, which is time-consuming and space-inefficient because the images and conditional images in each dataset remain the same.
Should I create multiple datasets that differ solely in the prompt column, or is there a more efficient approach to accomplish this?