I'm pretty new at CNN and have I need to build a pipeline that loads the images and also get them ready for the CNN. The thing is that I need to build a dataset formed by images. There are three classes of images: COVID-19, Healthy lungs and Pneumonia. The files that I have are:
- 1 folder containing images of lungs with covid-19
- 1 folder containing images of healthy lungs
- 1 folder containing images with pneumonia
- 1 .txt file that has all the images for which the training dataset will be formed
- 1 .txt file that has all the images for which the validation dataset will be formed
- 1 .txt file that has all the images for which the text dataset will be formed
I´ve been searching on Internet but I don´t reach to find a way to build a dataset made by all the images but not even how to relate them to the .txt files and build the related training, test and validation dataset. Any suggestion? Please, find below the structure of the .txt file as an example:
2 PNEUMONIA/person888_bacteria_2812.jpeg
2 PNEUMONIA/person1209_bacteria_3161.jpeg
2 PNEUMONIA/person1718_bacteria_4540.jpeg
2 PNEUMONIA/person549_bacteria_2303.jpeg
2 PNEUMONIA/person831_bacteria_2742.jpeg
2 PNEUMONIA/person1571_bacteria_4108.jpeg
2 PNEUMONIA/person1310_bacteria_3300.jpeg