Possibly the most important part of your workflow is finding samples.
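Samples, like variables, come in two flavors: they are either files or directories. The first example below finds samples as files, and the second finds them as directories.
In the first example, each sample is a file named after the sample, with a .csv extension. The parenthesized group in file_rule captures the sample name.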
/home/user/workflow/
    /data
        /raw
            sample1.csv
            sample2.csv
---
global:
    - indir: /home/user/workflow/data/raw
    - outdir: /home/user/workflow/data/analysis
    - file_rule: (sample.*).csv$
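To make the matching concrete, here is a minimal Python sketch of how a file_rule like the one above could pick sample names out of indir. It is an illustration only, not the module's own implementation: the function names find_samples_by_file and demo_files are hypothetical, and the sketch assumes that only the top level of indir is scanned and that the first capture group is the sample name.

```python
import re
import tempfile
from pathlib import Path


def find_samples_by_file(indir, file_rule):
    """Return sorted sample names for files in indir matching file_rule.

    Assumption: the first capture group of file_rule is the sample name.
    """
    pattern = re.compile(file_rule)
    samples = []
    for entry in Path(indir).iterdir():
        if not entry.is_file():
            continue  # directories are ignored in file mode
        match = pattern.search(entry.name)
        if match:
            samples.append(match.group(1))
    return sorted(samples)


def demo_files():
    """Build a throwaway indir with two samples and one unrelated file."""
    with tempfile.TemporaryDirectory() as tmp:
        for name in ("sample1.csv", "sample2.csv", "notes.txt"):
            (Path(tmp) / name).touch()
        return find_samples_by_file(tmp, r"(sample.*).csv$")


if __name__ == "__main__":
    print(demo_files())
```

Note that the dot before csv in the rule is unescaped, so it matches any character; writing it as \.csv$ would be stricter.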
In the second example, the sample names come from directories, each of which holds that sample's files.
/path/to/indir
    /sample1
        billions_of_small_files
        from_the_Sequencer
    /sample2
        billions_of_small_files
        from_the_Sequencer
---
global:
    - indir: /home/user/workflow/data/raw
    - outdir: /home/user/workflow/data/analysis
    - file_rule: (sample.*)
    - find_by_dir: 1
    - by_sample_outdir: 1
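The directory case can be sketched the same way in Python. Again this is only an illustration under stated assumptions, not the module's code: find_samples_by_dir and demo_dirs are hypothetical names, only the immediate subdirectories of indir are considered, and the effect of by_sample_outdir on the output layout is not modeled.

```python
import re
import tempfile
from pathlib import Path


def find_samples_by_dir(indir, file_rule):
    """Return sorted sample names for subdirectories of indir matching file_rule.

    Assumption: with find_by_dir set, the rule is applied to directory
    names and the first capture group is the sample name.
    """
    pattern = re.compile(file_rule)
    samples = []
    for entry in Path(indir).iterdir():
        if not entry.is_dir():
            continue  # plain files are ignored in directory mode
        match = pattern.search(entry.name)
        if match:
            samples.append(match.group(1))
    return sorted(samples)


def demo_dirs():
    """Build a throwaway indir with two sample directories and one other."""
    with tempfile.TemporaryDirectory() as tmp:
        for name in ("sample1", "sample2", "logs"):
            sample_dir = Path(tmp) / name
            sample_dir.mkdir()
            (sample_dir / "from_the_Sequencer").touch()
        return find_samples_by_dir(tmp, r"(sample.*)")


if __name__ == "__main__":
    print(demo_dirs())
```

The files inside each sample directory are never matched against the rule here; only the directory names are.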
Before version 0.03:
This module was originally developed at and for Weill Cornell Medical College in Qatar, within the ITS Advanced Computing Team. With approval from WCMC-Q, this information was generalized and put on GitHub, for which the authors would like to express their gratitude.
As of version 0.03:
This module's continuing development is supported by NYU Abu Dhabi in the Center for Genomics and Systems Biology. With approval from NYUAD, this information was generalized and put on Bitbucket, for which the authors would like to express their gratitude.