After starting bioinformatics workflow in Coretex, I am getting the following message even though data seems to be in order:
“Failed to determine which column contains sampleIDs/names..
.” and then the list of available names, but I am using one from the list.
I am trying to run a microbiome sequencing task in Coretex, and I have used standard microbiome sequencing data in .fastq.gz
format. Run should have been successful but it is failing every time.
The code for the data import I use is, what is missing:
def main() -> None:
taskRun: TaskRun[CustomDataset] = currentTaskRun()
dataset = taskRun.dataset
dataset.download()
outputDir = folder_manager.createTempFolder("import_output")
multiplexedFastqs = ctx_qiime2.getFastqMPSamples(dataset)
if len(multiplexedFastqs) > 0:
importMultiplexed(multiplexedFastqs, taskRun, outputDir)
else:
importDemultiplexed(SequenceDataset.decode(dataset.encode()), taskRun, outputDir)
axcac is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
Check out our Code of Conduct.