I have a bash-script that takes two input samples (IPDID_DNA and IPDID_RNA) and a purity value (purity) which are analysed in pairs. I would like to make a loop based on a samplesheet (samplesheet.txt) that loops throuigh the pairs which are specified in rows. Tips on how to automate this in a loop?
My input samplesheet could look something like this:
dna_sample_id rna_tumor_sample_id purity
Sample1_DNA Sample1_RNA 0.9
Sample2_DNA Sample2_RNA 0.1
This is how it looks when inputtin two samples manually for one run where I specify IPDID_DNA and IPDID_RNA and purity manually:
IPDID_DNA="Sample1_DNA"
IPDID_RNA="Sample1_RNA"
IPDID_Folder="IPDID_DNA"
dna_sample_id=${IPDID_DNA}
dna_sample_pair_id=${IPDID_DNA}
dna_sample_output_id=${IPDID_DNA}
rna_tumor_sample_id=${IPDID_RNA}
rna_tumor_sample_pair_id=${IPDID_RNA}
rna_tumor_sample_output_id=${IPDID_RNA}
purity="0.9"
singularity exec \
--no-home \
-B /data:/data \
-W /data \
docker.sif \
bash process.sh \
--output_directory ${IPDID_Folder} \
--reference_fasta_file genome.fa \
--dna_tumor_id ${dna_sample_id} \
--dna_tumor_pair_id ${dna_sample_pair_id} \
--dna_tumor_output_id ${dna_sample_output_id} \
--dna_tumor_purity ${purity} \
--rna_tumor_id ${rna_tumor_sample_id} \
--rna_tumor_pair_id ${rna_tumor_sample_pair_id} \
--rna_tumor_output_id ${rna_tumor_sample_output_id} \
--rna_tumor_localapp_run_directory ${ANALYSIS_OUTPUT_DIR}