diff --git a/.gitignore b/.gitignore new file mode 100644 index 0000000..b3e1100 --- /dev/null +++ b/.gitignore @@ -0,0 +1,20 @@ +*.class + +# Mobile Tools for Java (J2ME) +.mtj.tmp/ + +# Package Files # +*.jar +*.war +*.ear + +# virtual machine crash logs, see http://www.java.com/en/download/help/error_hotspot.xml +hs_err_pid* + +# NextFlow Specific +.nextflow.log* +.nextflow + +# Docker Specific +*apt.list +*conda.list diff --git a/.zenodo.json b/.zenodo.json new file mode 100644 index 0000000..9aa1f86 --- /dev/null +++ b/.zenodo.json @@ -0,0 +1,24 @@ + +{ + "creators": [ + { + "name": "Ozadam, Hakan", + "affiliation": "UT, Austin, TX, USA" + }, + { + "name": "Cenik, Can", + "affiliation": "UT, Austin, TX, USA" + } + ], + "keywords": [ + "bioinformatics", + "genomics", + "ribosome", + "ribo-seq", + "Python" + ], + "description": "

RiboFlow is a Nextflow-based pipeline for processing ribosome profiling data.

", + "access_right": "open", + "license": "MIT", + "upload_type": "software" +} diff --git a/README.md b/README.md new file mode 100644 index 0000000..e87f436 --- /dev/null +++ b/README.md @@ -0,0 +1,176 @@ +# RiboFlow + +RiboFlow is a [Nextflow](https://www.nextflow.io/) based pipeline +for processing ribosome profiling data. + +## Installation + +### Requirements + +* [Nextflow](https://www.nextflow.io/) +* [Docker](https://docs.docker.com/install/) (Optional) +* [Conda](https://conda.io/en/latest/miniconda.html) (Optional) + +First, follow the instructions in [Nextflow website](https://www.nextflow.io/) and install Nextflow. + +The easiest way of using RiboFLow is using Docker. +If using Docker is not an option, you can install the dependencies using Conda +and run RiboFlow without Docker. + +### Docker Option + +Install [Docker](https://docs.docker.com/install/). +Here is a [tutorial for Ubuntu.](https://www.digitalocean.com/community/tutorials/how-to-install-and-use-docker-on-ubuntu-18-04) + +All remaining dependencies come in the Docker image [ceniklab/riboflow](https://hub.docker.com/r/ceniklab/riboflow). +This image is automatically pulled by RiboFlow when run with Docker (see test runs below). + +### Conda Option + +This option has been tested on Linux systems only. + +Install [Conda](https://conda.io/en/latest/miniconda.html). + +All other dependencies can be installed using the environment file, +environment.yaml, in this repository. +``` +git clone https://github.com/ribosomeprofiling/riboflow.git +conda env create -f riboflow/environment.yaml +``` + +The above command will create a conda environment called _ribo_ +and install dependencies in it. +To start using RiboFlow, you need to activate the _ribo_ environment. + +`conda activate ribo` + +## Test Run + +For fresh installations, before running RiboFlow on actual data, +it is recommended to do a test run. + +Clone this repository in a new folder and change your working directory to the RiboFlow folder. +``` +mkdir rf_test_run && cd rf_test_run +git clone https://github.com/ribosomeprofiling/riboflow.git +cd riboflow +``` + +Obtain a copy of the sample data in the working directory. +``` +git clone https://github.com/ribosomeprofiling/rf_sample_data.git +``` + +### Run Using Docker + +Provide the argument `-profile docker_local` to Nextflow to indicate Docker use. + +`nextflow RiboFlow.groovy -params-file project.yaml -profile docker_local` + +### Run Using Conda Environment + +Make sure that you have created the conda environment, called _ribo_, +using the instructions above. Then activate the conda environment. + +`conda activate ribo` + +If the above command fails to activate the ribo environment, try +`source activate ribo` + +Now RiboFlow is ready to run. + +`nextflow RiboFlow.groovy -params-file project.yaml` + +## Output + +Pipeline run may take several minutes. +When finished, the resulting files are in the `./output` folder. + +Mapping statistics are compiled in a csv file called `stats.csv` + +``` +ls output/stats/stats.csv +``` + +Ribosome occupancy data is in a single +[ribo file](https://ribopy.readthedocs.io/en/latest/ribo_file_format.html) called `all.ribo`. + +`ls output/ribo/all.ribo` + +You can use +[RiboR](https://github.com/ribosomeprofiling/ribor) or +[RiboPy](https://github.com/ribosomeprofiling/ribopy) to work with ribo files. + + +## Actual Run + +For running RiboFlow on actual data, files must be organized and a parameters file must be prepared. +You can examine the sample run above to see an example. 
+ +1. Organize your data. The following files are required for RiboFlow: +* **Ribosome profiling sequencing data:** gzipped fastq files +* **Transcriptome Reference:** Bowtie2 index files +* **Filter Reference:** Bowtie2 index files (typically for rRNA sequences) +* **Annotation:** A bed file defining CDS, UTR5 and UTR3 regions. +* **Transcript Lengths:** A two-column tsv file containing transcript lengths + +2. Prepare a custom `project.yaml` file. +You can use the sample file `project.yaml`, provided in this repository, +as a template (a minimal sketch is also given at the end of this README). + +3. In `project.yaml`, provide RiboFlow parameters such as `clip_arguments`, alignment arguments, etc. +You can simply modify the arguments in the sample file `project.yaml` in this repository. + +4. You can adjust the hardware and computing environment settings in the Nextflow configuration file(s). +For the Docker option, see `configs/docker_local.config`. If you are not using Docker, +see `configs/local.config`. + +5. RNA-Seq data is optional for RiboFlow. If you do NOT have RNA-Seq data, set the following in the project file: + +`do_rnaseq: false` + +If you have RNA-Seq data to be paired with ribosome profiling data, see __Advanced Features__ below. + + +6. Metadata is optional for RiboFlow. If you do NOT have metadata, set the following in the project file: + +`do_metadata: false` + +If you have metadata, see __Advanced Features__ below. + +7. Run RiboFlow using the new parameters file `project.yaml`. + +Using Docker: + +`nextflow RiboFlow.groovy -params-file project.yaml -profile docker_local` + +Without Docker: + +`nextflow RiboFlow.groovy -params-file project.yaml` + +## Advanced Features + +### RNA-Seq Data + +If you have RNA-Seq data that you want to pair with ribosome profiling experiments, +provide the paths of the RNA-Seq (gzipped) fastq files in the project file under +_rnaseq -> fastq_. See the file `project.yaml` in this repository for an example. +Note that the names used in defining RNA-Seq files must match the names used in defining ribosome profiling data. +Also set the `do_rnaseq` flag to true in the project file: + +`do_rnaseq: true` + +Transcript abundance data will be stored in the output ribo file. + +### Metadata + +If you have metadata files for the ribosome profiling experiments, +provide the paths of the metadata files (in yaml format) in the project file under +_input -> metadata_. See the file `project.yaml` in this repository for an example. +Note that the names used in defining metadata files must match the names used in defining ribosome profiling data. +Also set the `do_metadata` flag to true in the project file: + +`do_metadata: true` + +Metadata will be stored in the output ribo file. 
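+
+### Parameters File Sketch
+
+For orientation only, here is a minimal, hypothetical sketch of a `project.yaml`.
+The key names mirror the parameters read by `RiboFlow.groovy`
+(`input`, `clip_arguments`, `alignment_arguments`, `do_rnaseq`, `do_metadata`, `rnaseq`, ...);
+all paths, sample names and argument values below are placeholders,
+so treat the sample `project.yaml` shipped with this repository as the authoritative reference.
+
+```
+# Hypothetical skeleton -- every path, sample name and value below is a placeholder.
+do_fastqc:               true
+do_check_file_existence: true
+do_rnaseq:               false     # set to true and fill in the rnaseq section to pair RNA-Seq data
+do_metadata:             false     # set to true and fill in input -> metadata to attach metadata
+deduplicate:             true
+clip_arguments:          "-a <adapter-sequence>"    # passed verbatim to cutadapt
+mapping_quality_cutoff:  2
+
+alignment_arguments:
+  filter:        "<bowtie2 options for the filter step>"
+  transcriptome: "<bowtie2 options for the transcriptome step>"
+
+input:
+  fastq_base: ./fastq                         # optional common prefix for the files below
+  fastq:
+    sample_A:                                 # one entry per sample; a list allows multiple lanes
+      - sample_A.lane_1.fastq.gz
+      - sample_A.lane_2.fastq.gz
+  reference:
+    filter:             ./reference/filter/rrna*            # Bowtie2 index (rRNA, tRNA, ...)
+    transcriptome:      ./reference/transcriptome/main*     # Bowtie2 index
+    regions:            ./reference/annotation.bed          # CDS, UTR5 and UTR3 regions
+    transcript_lengths: ./reference/transcript_lengths.tsv  # two-column tsv
+    # optional: reference -> genome (HISAT2 index) and reference -> post_genome (Bowtie2 index)
+  metadata:                                   # only needed when do_metadata is true
+    base: ./metadata
+    files:
+      sample_A: sample_A.yaml
+
+rnaseq:                                       # only needed when do_rnaseq is true
+  fastq_base: ./rnaseq_fastq
+  fastq:
+    sample_A:                                 # names must match the ribosome profiling samples
+      - sample_A.rnaseq.fastq.gz
+```
+
+Nextflow receives this file through `-params-file project.yaml`, so the layout above is what the
+`params` object in `RiboFlow.groovy` sees at run time.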
+ diff --git a/RiboFlow.groovy b/RiboFlow.groovy new file mode 100644 index 0000000..96abfa8 --- /dev/null +++ b/RiboFlow.groovy @@ -0,0 +1,2044 @@ +/* +vim: syntax=groovy +-*- mode: groovy;-*- +*/ + +/* +Developed and tested on: +N E X T F L O W ~ version 19.04.1 +*/ + +//////////////////////////////////////////////////////////////////////////////// +////// General Function Definitions //////////////////////////////////////////// + +String get_storedir(output_type){ + new File( params.output.intermediates.base, + params.output.intermediates.get(output_type, output_type) ) + .getCanonicalPath() +} + +String get_publishdir(output_type){ + new File( params.output.output.base, + params.output.output.get(output_type, output_type) ) + .getCanonicalPath() +} + +////// General Function Definitions //////////////////////////////////////////// +//////////////////////////////////////////////////////////////////////////////// + +fastq_base = params.input.get("fastq_base", "") +if(! fastq_base.endsWith("/") && fastq_base != ""){ + fastq_base = "${fastq_base}/" +} + +// Group input files into a list of tuples where each item is +// [ sample, fileindex, path_to_fastq_file] +Channel.from(params.input.fastq.collect{k,v -> + v.collect{ z -> [k, v.indexOf(z) + 1, + file("${fastq_base}${z}")] } }) + .flatten().collate(3).into{ INPUT_SAMPLES_VERBOSE; + INPUT_SAMPLES_MD5; + INPUT_SAMPLES_EXISTENCE; + INPUT_SAMPLES_FASTQC; + INPUT_SAMPLES_CLIP; + INPUT_SAMPLES_LOG; + INPUT_SAMPLES_READ_LENGTH; + INPUT_FOR_METADATA} + + + +// Create a log file of index <-> fastq-file correspondence + +INPUT_SAMPLES_LOG.flatMap{ sample, index, fastq -> "${sample}\t${index}\t${fastq}" } + .collectFile(name: 'correspondence.txt', newLine: true) + .set{INPUT_SAMPLES_LOG_FILES} + +// Move the above correspondence file to an output folder via a process +process write_fastq_correspondence{ + + executor 'local' + + publishDir get_publishdir("stats"), mode: 'move' + + input: + file(correspondence) from INPUT_SAMPLES_LOG_FILES + + output: + file("index_fastq_correspondence.txt") + + """ + cat ${correspondence} > index_fastq_correspondence.txt + """ +} + + +//////////////////////////////////////////////////////////////////////////////// +////// Check File Existence //////////////////////////////////////////////////// + +boolean file_exists(file_path) { + this_file = file(file_path) + assert this_file.exists() + return true +} + +boolean hisat2_ref_exists(hisat2_ref) { + Channel.from( ["1.ht2", "2.ht2", "3.ht2","4.ht2","5.ht2","6.ht2", "7.ht2", "8.ht2"]) + .map{ this_suffix -> file_exists( "${hisat2_ref}.${this_suffix}".replaceAll('\\*', "") ) } + return true +} + +boolean bt2_ref_exists(bt2_ref) { + Channel.from( ["1.bt2", "2.bt2", "3.bt2","4.bt2","rev.1.bt2","rev.2.bt2"]) + .map{ this_suffix -> file_exists( "${bt2_ref}.${this_suffix}".replaceAll('\\*', "") ) } + return true +} + +if(params.do_check_file_existence){ + // Make Sure Fastq Files Exist + INPUT_SAMPLES_EXISTENCE.map{ sample, index, this_file -> file_exists(this_file) } + + // Make Sure bt2 and hisat reference files exist. 
+ bt2_ref_exists( params.input.reference.filter ) + bt2_ref_exists( params.input.reference.transcriptome ) + if( params.input.reference.get("genome", false) ){ + hisat2_ref_exists( params.input.reference.genome ) + } + + if( params.input.reference.get("post_genome", false) ){ + bt2_ref_exists( params.input.reference.post_genome ) + } + + file_exists(params.input.reference.regions) + file_exists(params.input.reference.transcript_lengths) + + root_meta_file = params.input.get("root_meta", false) + if( root_meta_file ){ + file_exists(root_meta_file) + } +} + +////// Check File Existence //////////////////////////////////////////////////// +//////////////////////////////////////////////////////////////////////////////// + + +//////////////////////////////////////////////////////////////////////////////// +//////////////////////// P R O C E S S E S ///////////////////////////// +//////////////////////////////////////////////////////////////////////////////// + +/////////////////////////////////////////////////////////////////////////////////////// +/* RAW_FASTQC */ + +process raw_fastqc{ + + publishDir get_publishdir("fastqc")+"/raw", mode: 'copy' + + input: + set val(sample), val(index), file(fastq) from INPUT_SAMPLES_FASTQC + + output: + set val(sample), file("${sample}.${index}_fastqc.html"), + file("${sample}.${index}_fastqc.zip") into RAW_FASTQC_OUT + + when: + params.do_fastqc + + """ + if [ ! -f ${sample}.${index}.fastq.gz ]; then + ln -s $fastq ${sample}.${index}.fastq.gz + fi + fastqc ${sample}.${index}.fastq.gz --outdir=\$PWD -t ${task.cpus} + """ + +} + +// RAW_FASTQC +/////////////////////////////////////////////////////////////////////////////////////// + +/////////////////////////////////////////////////////////////////////////////////////// +/* CLIP */ + +process clip{ + + storeDir get_storedir("clip") + + input: + set val(sample), val(index), file(fastq) from INPUT_SAMPLES_CLIP + + output: + set val(sample), val(index), file("${sample}.${index}.clipped.fastq.gz") into CLIP_OUT + set val(sample), val(index), file("${sample}.${index}.clipped.log") into CLIP_LOG + + """ + cutadapt --cores=${task.cpus} ${params.clip_arguments} ${fastq} 2>${sample}.${index}.clipped.log \ + | gzip -c > ${sample}.${index}.clipped.fastq.gz + """ + +} + +// CLIP +/////////////////////////////////////////////////////////////////////////////////////// + +CLIP_OUT.into{ CLIP_OUT_FASTQC; CLIP_OUT_FILTER ; CLIP_OUT_READ_LENGTH} + +/////////////////////////////////////////////////////////////////////////////////////// +/* CLIPPED FASTQC */ + +process clipped_fastqc{ + + publishDir get_publishdir("fastqc") + "/clipped", mode: 'copy' + + input: + set val(sample), val(index), file(fastq) from CLIP_OUT_FASTQC + + output: + set val(sample), file("${sample}.${index}.clipped_fastqc.html"), + file("${sample}.${index}.clipped_fastqc.zip") into CLIPPED_FASTQC_OUT + + when: + params.do_fastqc + + """ + if [ ! 
-f ${sample}.${index}.clipped.fastq.gz ]; then + ln -s $fastq ${sample}.${index}.clipped.fastq.gz + fi + fastqc ${sample}.${index}.clipped.fastq.gz --outdir=\$PWD -t ${task.cpus} + """ +} + +// CLIPPED FASTQC +/////////////////////////////////////////////////////////////////////////////////////// + +/////////////////////////////////////////////////////////////////////////////////////// + +/////////////////////////////////////////////////////////////////////////////////////// +/* FILTER */ + +// Reads are mapped against (typically) rRNA, tRNA, adapter sequneces and etc +// that have no use for downstream analysis +// So we take the UNaligned reads from this process and use it for downstream processing + +FILTER_INDEX = Channel.from([[ + params.input.reference.filter + .split('/')[-1] + .replaceAll('\\*$', "") + .replaceAll('\\.$', ""), + file(params.input.reference.filter), + ]]) + +process filter{ + + storeDir get_storedir("filter") + + input: + set val(sample), val(index), file(fastq) from CLIP_OUT_FILTER + set val(bowtie2_index_base), file(bowtie2_index_files) from FILTER_INDEX.first() + + output: + set val(sample), val(index), file("${sample}.${index}.filter.bam") \ + into FILTER_BAM + set val(sample), val(index), file("${sample}.${index}.filter.bam.bai") \ + into FILTER_BAI + set val(sample), val(index), file("${sample}.${index}.aligned.filter.fastq.gz") \ + into FILTER_ALIGNED + set val(sample), val(index), file("${sample}.${index}.unaligned.filter.fastq.gz") \ + into FILTER_UNALIGNED + set val(sample), val(index), file("${sample}.${index}.filter.log") \ + into FILTER_LOG + set val(sample), val(index), file("${sample}.${index}.filter.stats") \ + into FILTER_STATS + + + """ + bowtie2 ${params.alignment_arguments.filter} \ + -x ${bowtie2_index_base} -q ${fastq} \ + --threads ${task.cpus} \ + --al-gz ${sample}.${index}.aligned.filter.fastq.gz \ + --un-gz ${sample}.${index}.unaligned.filter.fastq.gz \ + 2> ${sample}.${index}.filter.log \ + | samtools view -bS - \ + | samtools sort -@ ${task.cpus} -o ${sample}.${index}.filter.bam \ + && samtools index -@ {task.cpus} ${sample}.${index}.filter.bam \ + && samtools idxstats -@ {task.cpus} ${sample}.${index}.filter.bam > \ + ${sample}.${index}.filter.stats + """ + +} + +FILTER_ALIGNED.into{FILTER_ALIGNED_FASTQ_READ_LENGTH; + FILTER_ALIGNED_FASTQ_FASTQC} +FILTER_UNALIGNED.into{FILTER_UNALIGNED_FASTQ_READ_LENGTH; + FILTER_UNALIGNED_FASTQ_FASTQC; + FILTER_UNALIGNED_TRANSCRIPTOME} + +// FILTER +/////////////////////////////////////////////////////////////////////////////////////// + + +/////////////////////////////////////////////////////////////////////////////////////// +/* TRANSCRIPTOME ALIGNMENT */ + +TRANSCRIPTOME_INDEX = Channel.from([[ + params.input.reference.transcriptome + .split('/')[-1] + .replaceAll('\\*$', "") + .replaceAll('\\.$', ""), + file(params.input.reference.transcriptome), + ]]) + + +process transcriptome_alignment{ + + storeDir get_storedir("transcriptome_alignment") + "/" + params.output.individual_lane_directory + + input: + set val(sample), val(index), file(fastq) from FILTER_UNALIGNED_TRANSCRIPTOME + set val(transcriptome_reference), file(transcriptome_Reference_files) \ + from TRANSCRIPTOME_INDEX.first() + + output: + set val(sample), val(index), file("${sample}.${index}.transcriptome_alignment.bam") \ + into TRANSCRIPTOME_ALIGNMENT_BAM_PRE + set val(sample), val(index), file("${sample}.${index}.transcriptome_alignment.bam.bai") \ + into TRANSCRIPTOME_ALIGNMENT_BAI + set val(sample), val(index), 
file("${sample}.${index}.aligned.transcriptome_alignment.fastq.gz") \ + into TRANSCRIPTOME_ALIGNMENT_ALIGNED + set val(sample), val(index), file("${sample}.${index}.unaligned.transcriptome_alignment.fastq.gz") \ + into TRANSCRIPTOME_ALIGNMENT_UNALIGNED + set val(sample), val(index), file("${sample}.${index}.transcriptome_alignment.log") \ + into TRANSCRIPTOME_ALIGNMENT_LOG + set val(sample), val(index), file("${sample}.${index}.transcriptome_alignment.stats") \ + into TRANSCRIPTOME_ALIGNMENT_STATS + + """ + bowtie2 ${params.alignment_arguments.transcriptome} \ + -x ${transcriptome_reference} -q ${fastq} \ + --threads ${task.cpus} \ + --al-gz ${sample}.${index}.aligned.transcriptome_alignment.fastq.gz \ + --un-gz ${sample}.${index}.unaligned.transcriptome_alignment.fastq.gz \ + 2> ${sample}.${index}.transcriptome_alignment.log \ + | samtools view -bS - \ + | samtools sort -@ ${task.cpus} -o ${sample}.${index}.transcriptome_alignment.bam \ + && samtools index -@ {task.cpus} ${sample}.${index}.transcriptome_alignment.bam \ + && samtools idxstats -@ {task.cpus} ${sample}.${index}.transcriptome_alignment.bam > \ + ${sample}.${index}.transcriptome_alignment.stats + """ +} + +// TRANSCRIPTOME ALIGNMENT +/////////////////////////////////////////////////////////////////////////////////////// + +TRANSCRIPTOME_ALIGNMENT_BAM_PRE.into{ TRANSCRIPTOME_ALIGNMENT_BAM; + TRANSCRIPTOME_ALIGNMENT_BAM_MERGE; + TRANSCRIPTOME_ALIGNMENT_BAM_FOR_QUALITY} + +/////////////////////////////////////////////////////////////////////////////////////// +/* QUALITY FILTER */ + +process quality_filter{ + + storeDir get_storedir("quality_filter") + + input: + set val(sample), val(index), file(bam) from TRANSCRIPTOME_ALIGNMENT_BAM_FOR_QUALITY + + output: + set val(sample), val(index), file("${sample}.${index}.transcriptome_alignment.qpass.bam") \ + into TRANSCRIPTOME_ALIGNMENT_QPASS_BAM_PRE + set val(sample), val(index), file("${sample}.${index}.transcriptome_alignment.qpass.bam.bai") \ + into TRANSCRIPTOME_ALIGNMENT_QPASS_BAI + set val(sample), val(index), file("${sample}.${index}.qpass.count") \ + into TRANSCRIPTOME_QPASS_COUNTS + set val(sample), val(index), file("${sample}.${index}.transcriptome_alignment.qpass.stats") \ + into TRANSCRIPTOME_ALIGNMENT_QPASS_STATS + + """ + samtools view -b -q ${params.mapping_quality_cutoff} ${bam}\ + | samtools sort -@ ${task.cpus} -o ${sample}.${index}.transcriptome_alignment.qpass.bam \ + && samtools view -b -c ${sample}.${index}.transcriptome_alignment.qpass.bam > ${sample}.${index}.qpass.count \ + && samtools index -@ {task.cpus} ${sample}.${index}.transcriptome_alignment.qpass.bam \ + && samtools idxstats -@ {task.cpus} ${sample}.${index}.transcriptome_alignment.qpass.bam > \ + ${sample}.${index}.transcriptome_alignment.qpass.stats + """ +} + +TRANSCRIPTOME_ALIGNMENT_QPASS_BAM_PRE.into{ QPASS_BAM_READ_LENGTH; + TRANSCRIPTOME_ALIGNMENT_QPASS_BAM} + +// QUALITY FILTER +/////////////////////////////////////////////////////////////////////////////////////// + + +TRANSCRIPTOME_QPASS_COUNTS.into{TRANSCRIPTOME_QPASS_COUNTS_FOR_INDEX; + TRANSCRIPTOME_QPASS_COUNTS_FOR_TABLE} + +// We need to copy output channels of transcriptome alignment +// for merging and variaous steps of downstream processing + +TRANSCRIPTOME_ALIGNMENT_BAI.into{ TRANSCRIPTOME_ALIGNMENT_BAI_MERGE ; + TRANSCRIPTOME_ALIGNMENT_BAI_REGION_COUNT} + +TRANSCRIPTOME_ALIGNMENT_ALIGNED.into{ TRANSCRIPTOME_ALIGNMENT_ALIGNED_MERGE ; + TRANSCRIPTOME_ALIGNMENT_ALIGNED_LENGTH ; + TRANSCRIPTOME_ALIGNMENT_ALIGNED_FASTQC } + 
+TRANSCRIPTOME_ALIGNMENT_UNALIGNED.into{ TRANSCRIPTOME_ALIGNMENT_UNALIGNED_MERGE ; + TRANSCRIPTOME_ALIGNMENT_UNALIGNED_GENOME ; + TRANSCRIPTOME_ALIGNMENT_UNALIGNED_LENGTH ; + TRANSCRIPTOME_ALIGNMENT_UNALIGNED_FASTQC } + +TRANSCRIPTOME_ALIGNMENT_LOG.into{ TRANSCRIPTOME_ALIGNMENT_LOG_MERGE ; + TRANSCRIPTOME_ALIGNMENT_LOG_TABLE } + +TRANSCRIPTOME_ALIGNMENT_STATS.into{ TRANSCRIPTOME_ALIGNMENT_STATS_MERGE ; + TRANSCRIPTOME_ALIGNMENT_STATS_TABLE } + +/////////////////////////////////////////////////////////////////////////////////////// +/* MERGE TRANSCRIPTOME ALIGNMENT */ + +TRANSCRIPTOME_ALIGNMENT_BAM_MERGE.map{sample, index, bam -> [sample, bam]}.groupTuple() + .set{ TRANSCRIPTOME_ALIGNMENT_GROUPED_BAM } + +TRANSCRIPTOME_ALIGNMENT_ALIGNED_MERGE.map{sample, index, fastq -> [sample, fastq]}.groupTuple() + .set{ TRANSCRIPTOME_ALIGNMENT_GROUPED_ALIGNED } + +TRANSCRIPTOME_ALIGNMENT_UNALIGNED_MERGE.map{sample, index, fastq -> [sample, fastq]}.groupTuple() + .set{ TRANSCRIPTOME_ALIGNMENT_GROUPED_UNALIGNED } + +TRANSCRIPTOME_ALIGNMENT_LOG_MERGE.map{sample, index, log -> [sample, log]}.groupTuple() + .set{ TRANSCRIPTOME_ALIGNMENT_GROUPED_LOG } + + +TRANSCRIPTOME_ALIGNMENT_GROUPED_BAM.join(TRANSCRIPTOME_ALIGNMENT_GROUPED_ALIGNED) + .join(TRANSCRIPTOME_ALIGNMENT_GROUPED_UNALIGNED) + .join(TRANSCRIPTOME_ALIGNMENT_GROUPED_LOG) + .into{TRANSCRIPTOME_ALIGNMENT_GROUPED_JOINT; + TRANSCRIPTOME_ALIGNMENT_GROUPED_JOINED_VERBOSE} + + +process merge_transcriptome_alignment{ + + storeDir get_storedir("transcriptome_alignment") + "/" + params.output.merged_lane_directory + + input: + set val(sample), file(bam), file(aligned_fastq), + file(unaligned_fastq), file(alignment_log) from\ + TRANSCRIPTOME_ALIGNMENT_GROUPED_JOINT + + output: + set val(sample), file("${sample}.transcriptome.bam") into \ + TRANSCRIPTOME_ALIGNMENT_MERGED_BAM + set val(sample), file("${sample}.transcriptome.bam.bai") into \ + TRANSCRIPTOME_ALIGNMENT_MERGED_BAI + set val(sample), file("${sample}.transcriptome.aligned.fastq.gz") into \ + TRANSCRIPTOME_ALIGNMENT_MERGED_ALIGNED + set val(sample), file("${sample}.transcriptome.unaligned.fastq.gz") into \ + TRANSCRIPTOME_ALIGNMENT_MERGED_UNALIGNED + set val(sample), file("${sample}.transcriptome.log") into \ + TRANSCRIPTOME_ALIGNMENT_MERGED_LOG + + """ + samtools merge ${sample}.transcriptome.bam ${bam} && \ + samtools index ${sample}.transcriptome.bam && \ + zcat ${aligned_fastq} | gzip -c > ${sample}.transcriptome.aligned.fastq.gz && \ + zcat ${unaligned_fastq} | gzip -c > ${sample}.transcriptome.unaligned.fastq.gz && \ + rfc merge bowtie2-logs --out ${sample}.transcriptome.log ${alignment_log} + """ + +} + +// MERGE TRANSCRIPTOME ALIGNMENT +/////////////////////////////////////////////////////////////////////////////////////// + +/////////////////////////////////////////////////////////////////////////////////////// +/* TRANSCRIPTOME INDIVIDUAL FASTQC */ + +process transcriptome_aligned_individual_fastqc{ + publishDir get_publishdir("fastqc") + "/transcriptome_aligned", mode: 'copy' + + input: + set val(sample), val(index), file(fastq) from TRANSCRIPTOME_ALIGNMENT_ALIGNED_FASTQC + + output: + set val(sample), file("${sample}.${index}.transcriptome.aligned_fastqc.html"), + file("${sample}.${index}.transcriptome.aligned_fastqc.zip") \ + into TRANSCRIPTOME_ALIGNED_INDIVIDUAL_FASTQC_OUT + + when: + params.do_fastqc + + """ + if [ ! 
-f ${sample}.${index}.transcriptome.aligned.fastq.gz ]; then + ln -s ${fastq} ${sample}.${index}.transcriptome.aligned.fastq.gz + fi + fastqc ${sample}.${index}.transcriptome.aligned.fastq.gz --outdir=\$PWD -t ${task.cpus} + """ + +} + +process transcriptome_unaligned_individual_fastqc{ + publishDir get_publishdir("fastqc") + "/transcriptome_unaligned", mode: 'copy' + + input: + set val(sample), val(index), file(fastq) from TRANSCRIPTOME_ALIGNMENT_UNALIGNED_FASTQC + + + output: + set val(sample), file("${sample}.${index}.transcriptome.unaligned_fastqc.html"), + file("${sample}.${index}.transcriptome.unaligned_fastqc.zip") \ + into TRANSCRIPTOME_UNALIGNED_INDIVIDUAL_FASTQC_OUT + when: + params.do_fastqc + + """ + if [ ! -f ${sample}.${index}.transcriptome.unaligned.fastq.gz ]; then + ln -s ${fastq} ${sample}.${index}.transcriptome.unaligned.fastq.gz + fi + fastqc ${sample}.${index}.transcriptome.unaligned.fastq.gz --outdir=\$PWD -t ${task.cpus} + """ + +} + +// TRANSCRIPTOME INDIVIDUAL FASTQC +/////////////////////////////////////////////////////////////////////////////////////// + +/////////////////////////////////////////////////////////////////////////////// +/////////////////////////////////////////////////////////////////////////////// +/////////////////////////////////////////////////////////////////////////////// +/* GENOME ALIGNMENT */ + +do_align_genome = params.input.reference.get("genome", false) + +if(do_align_genome){ + +GENOME_INDEX = Channel.from([[ + params.input.reference.genome + .split('/')[-1] + .replaceAll('\\*$', "") + .replaceAll('\\.$', ""), + file(params.input.reference.genome), + ]]) + + +process genome_alignment{ + + storeDir get_storedir("genome_alignment") + "/" + params.output.individual_lane_directory + + input: + set val(sample), val(index), file(fastq) from TRANSCRIPTOME_ALIGNMENT_UNALIGNED_GENOME + set val(genome_base), file(genome_files) from GENOME_INDEX.first() + + output: + set val(sample), val(index), file("${sample}.${index}.genome_alignment.bam") \ + into GENOME_ALIGNMENT_BAM + set val(sample), val(index), file("${sample}.${index}.genome_alignment.bam.bai") \ + into GENOME_ALIGNMENT_BAI + set val(sample), val(index), file("${sample}.${index}.genome_alignment.aligned.fastq.gz") \ + into GENOME_ALIGNMENT_ALIGNED + set val(sample), val(index), file("${sample}.${index}.genome_alignment.unaligned.fastq.gz") \ + into GENOME_ALIGNMENT_UNALIGNED + set val(sample), val(index), file("${sample}.${index}.genome_alignment.log") \ + into GENOME_ALIGNMENT_LOG + set val(sample), val(index), file("${sample}.${index}.genome_alignment.csv") \ + into GENOME_ALIGNMENT_CSV + + """ + hisat2 ${params.alignment_arguments.genome} \ + -x ${genome_base} -U ${fastq} \ + -p ${task.cpus} \ + --al-gz ${sample}.${index}.genome_alignment.aligned.fastq.gz \ + --un-gz ${sample}.${index}.genome_alignment.unaligned.fastq.gz \ + 2> ${sample}.${index}.genome_alignment.log \ + | samtools view -bS - \ + | samtools sort -@ ${task.cpus} -o ${sample}.${index}.genome_alignment.bam \ + && samtools index -@ {task.cpus} ${sample}.${index}.genome_alignment.bam \ + && rfc bt2-log-to-csv -o ${sample}.${index}.genome_alignment.csv \ + -n ${sample} -p genome -l ${sample}.${index}.genome_alignment.log + """ + +} + +GENOME_ALIGNMENT_ALIGNED.into{ GENOME_ALIGNMENT_ALIGNED_FASTQ_READ_LENGTH; + GENOME_ALIGNMENT_ALIGNED_MERGE; + GENOME_ALIGNMENT_ALIGNED_FASTQ_FASTQC } + +GENOME_ALIGNMENT_UNALIGNED.into{ GENOME_ALIGNMENT_UNALIGNED_FASTQ_READ_LENGTH; + GENOME_ALIGNMENT_UNALIGNED_MERGE; + 
GENOME_ALIGNMENT_UNALIGNED_FASTQ_FASTQC; + FOR_POST_GENOME } + + + +// GENOME ALIGNMENT +/////////////////////////////////////////////////////////////////////////////////////// + +/////////////////////////////////////////////////////////////////////////////////////// +/* MERGE GENOME ALIGNMENT */ +GENOME_ALIGNMENT_LOG.into{ GENOME_ALIGNMENT_LOG_MERGE; GENOME_ALIGNMENT_LOG_TABLE } + +GENOME_ALIGNMENT_LOG_TABLE + .map{ sample, index, genome_log -> [ [sample, index], genome_log ] } + .set{GENOME_ALIGNMENT_LOG_TABLE_INDEXED} + + +GENOME_ALIGNMENT_BAM.map{sample, index, bam -> [sample, bam]}.groupTuple() + .set{ GENOME_ALIGNMENT_GROUPED_BAM } + +GENOME_ALIGNMENT_ALIGNED_MERGE.map{sample, index, fastq -> [sample, fastq]}.groupTuple() + .set{ GENOME_ALIGNMENT_GROUPED_ALIGNED_FASTQ } + +GENOME_ALIGNMENT_UNALIGNED_MERGE.map{sample, index, fastq -> [sample, fastq]}.groupTuple() + .set{ GENOME_ALIGNMENT_GROUPED_UNALIGNED_FASTQ } +GENOME_ALIGNMENT_LOG_MERGE.map{sample, index, log -> [sample, log]}.groupTuple() + .set{ GENOME_ALIGNMENT_GROUPED_LOG } + + +GENOME_ALIGNMENT_GROUPED_BAM.join( GENOME_ALIGNMENT_GROUPED_ALIGNED_FASTQ ) + .join(GENOME_ALIGNMENT_GROUPED_UNALIGNED_FASTQ) + .join(GENOME_ALIGNMENT_GROUPED_LOG) + .set{ GENOME_ALIGNMENT_GROUPED_JOINT } + + +process merge_genome_alignment{ + + storeDir get_storedir("genome_alignment") + "/" + params.output.merged_lane_directory + + input: + + set val(sample), file(bam), file(aligned_fastq), \ + file(unaligned_fastq), file(alignment_log) from GENOME_ALIGNMENT_GROUPED_JOINT + + + output: + set val(sample), file("${sample}.genome.bam") \ + into GENOME_ALIGNMENT_MERGED_BAM + set val(sample), file("${sample}.genome.bam.bai") \ + into GENOME_ALIGNMENT_MERGED_BAI + set val(sample), file("${sample}.genome.aligned.fastq.gz") \ + into GENOME_ALIGNMENT_MERGED_ALIGNED_FASTQ + set val(sample), file("${sample}.genome.unaligned.fastq.gz") \ + into GENOME_ALIGNMENT_MERGED_UNALIGNED_FASTQ + set val(sample), file("${sample}.genome.log") \ + into GENOME_ALIGNMENT_MERGED_LOG + set val(sample), file("${sample}.genome.csv") \ + into GENOME_ALIGNMENT_MERGED_CSV + + """ + samtools merge ${sample}.genome.bam ${bam} && samtools index ${sample}.genome.bam && \ + zcat ${aligned_fastq} | gzip -c > ${sample}.genome.aligned.fastq.gz && \ + zcat ${unaligned_fastq} | gzip -c > ${sample}.genome.unaligned.fastq.gz && \ + rfc merge bowtie2-logs -o ${sample}.genome.log ${alignment_log} && \ + rfc bt2-log-to-csv -n ${sample} -l ${sample}.genome.log -p genome \ + -o ${sample}.genome.csv + """ + +} + +GENOME_ALIGNMENT_CSV + .map{ sample, index, stats_file -> stats_file } + .toSortedList().set{GENOME_ALIGNMENT_CSV_INDIVIDUAL_LIST} + +GENOME_ALIGNMENT_MERGED_CSV + .map{ sample, stats_file -> stats_file } + .toSortedList().set{GENOME_ALIGNMENT_CSV_MERGED_LIST} + +process combine_individual_genome_stats{ + storeDir get_storedir("genome_alignment") + "/logs" + + input: + file(stats_input_files) from GENOME_ALIGNMENT_CSV_INDIVIDUAL_LIST + file(stats_input_files_merged) from GENOME_ALIGNMENT_CSV_MERGED_LIST + + output: + file("genome_individual_stats.csv") \ + into GENOME_ALIGNMENT_CSV_INDIVIDUAL_COMBINED + file("genome_merged_stats.csv") \ + into GENOME_ALIGNMENT_CSV_MERGED_COMBINED + + """ + rfc merge overall-stats -o genome_individual_stats.csv ${stats_input_files} ; \ + rfc merge overall-stats -o genome_merged_stats.csv ${stats_input_files_merged} + """ + +} + + + +// MERGE GENOME ALIGNMENT +/////////////////////////////////////////////////////////////////////////////// + +} // end of 
if(do_align_genome){ +// END OF GENOME ALIGNMENT +/////////////////////////////////////////////////////////////////////////////// +/////////////////////////////////////////////////////////////////////////////// +/////////////////////////////////////////////////////////////////////////////// +/////////////////////////////////////////////////////////////////////////////// + +/////////////////////////////////////////////////////////////////////////////////////// +/* BAM TO BED */ + + +/* +We assume that duplicates are coming from PCR. So for each sample, +we merge the sequencing lanes , +Add a sample.index column to the bed file. +sort the entire bed file by the first 3 columns ( sort -k1,1 -k2,2n -k3,3n ) +Then we can deduplicate the entire file +Then separate the file based on the additional column that we added +*/ + +process bam_to_bed{ + + storeDir get_storedir("bam_to_bed") + "/" + params.output.individual_lane_directory + + input: + set val(sample), val(index), file(bam) from TRANSCRIPTOME_ALIGNMENT_QPASS_BAM + + output: + set val(sample), val(index), file("${sample}.${index}.bed") into BAM_TO_BED + set val(sample), val(index), file("${sample}.${index}_nodedup_count.txt") \ + into INDIVIDUAL_DEDUP_COUNT_WITHOUT_DEDUP + + """ + if [ `samtools view -c ${bam}` -eq 0 ]; + then + touch ${sample}.${index}.bed + else + bamToBed -i ${bam} > ${sample}.${index}.bed + fi + + wc -l ${sample}.${index}.bed > ${sample}.${index}_nodedup_count.txt + """ + +} + +BAM_TO_BED.into{ BED_NODEDUP; BED_FOR_DEDUP; BED_FOR_INDEX_SEP_PRE } + + +process add_sample_index_col_to_bed{ + + storeDir get_storedir("bam_to_bed") + "/" + params.output.individual_lane_directory + + input: + set val(sample), val(index), file(bed) from BED_FOR_DEDUP + + output: + set val(sample), file("${sample}.${index}.with_sample_index.bed")\ + into BED_FOR_DEDUP_INDEX_COL_ADDED + + """ + awk -v newcol=${sample}.${index} '{print(\$0"\\t"newcol)}' ${bed}\ + > ${sample}.${index}.with_sample_index.bed + """ +} + +BED_FOR_DEDUP_INDEX_COL_ADDED.groupTuple() + .set{ BED_FOR_DEDUP_INDEX_COL_ADDED_GROUPED } + +process merge_bed{ + + storeDir get_storedir("bam_to_bed") + "/" + params.output.merged_lane_directory + + input: + set val(sample), file(bed_files) from BED_FOR_DEDUP_INDEX_COL_ADDED_GROUPED + + output: + set val(sample), file("${sample}.merged.pre_dedup.bed") \ + into MERGE_BED_OUT + + """ + cat ${bed_files} | sort -k1,1 -k2,2n -k3,3n > ${sample}.merged.pre_dedup.bed + """ +} + +MERGE_BED_OUT.into{BED_FOR_DEDUP_MERGED_PRE_DEDUP; + MERGED_BED_FOR_RIBO} + +process deduplicate{ + + storeDir get_storedir("alignment_ribo") + "/" + params.output.merged_lane_directory + + input: + set val(sample), file(bed) from BED_FOR_DEDUP_MERGED_PRE_DEDUP + + output: + set val(sample), file("${sample}.merged.post_dedup.bed") \ + into BED_FOR_DEDUP_MERGED_POST_DEDUP + + when: + params.get("deduplicate", false) + + """ + rfc dedup -i ${bed} -o ${sample}.merged.post_dedup.bed + """ +} + +BED_FOR_DEDUP_MERGED_POST_DEDUP.into{BED_FOR_DEDUP_MERGED_POST_DEDUP_FOR_SEP; + BED_FOR_DEDUP_MERGED_POST_DEDUP_FOR_RIBO} + +BED_FOR_INDEX_SEP_PRE.map{ sample,index,file -> [sample, index] } + .combine(BED_FOR_DEDUP_MERGED_POST_DEDUP_FOR_SEP, by:0) + .set{ BED_FOR_INDEX_SEP_POST_DEDUP } + + +process separate_bed_post_dedup{ + + storeDir get_storedir("alignment_ribo") + "/" + params.output.individual_lane_directory + + input: + set val(sample), val(index), file(bed) from BED_FOR_INDEX_SEP_POST_DEDUP + + output: + set val(sample), val(index), 
file("${sample}.${index}.post_dedup.bed") \ + into BED_DEDUPLICATED + set val(sample), val(index), file("${sample}.${index}.count_after_dedup.txt")\ + into INDIVIDUAL_DEDUP_COUNT_WITH_DEDUP + + when: + params.deduplicate + + """ + awk -v this_sample=${sample}.${index} \ + '{ if(\$7 == this_sample ){print(\$1"\\t"\$2"\\t"\$3"\\t"\$4"\\t"\$5"\\t"\$6)} }' ${bed} \ + > ${sample}.${index}.post_dedup.bed \ + && wc -l ${sample}.${index}.post_dedup.bed > ${sample}.${index}.count_after_dedup.txt + """ +} + + +if(params.get("deduplicate", false)){ + BED_FOR_DEDUP_MERGED_POST_DEDUP_FOR_RIBO + .into{BED_FOR_SEPARATION; BED_FOR_RIBO; BED_FOR_RIBO_VERBOSE} + + INDIVIDUAL_DEDUP_COUNT_WITH_DEDUP + .set{INDIVIDUAL_DEDUP_COUNT} +} +else{ + MERGED_BED_FOR_RIBO + .into{BED_FOR_SEPARATION; BED_FOR_RIBO; BED_FOR_RIBO_VERBOSE} + + INDIVIDUAL_DEDUP_COUNT_WITHOUT_DEDUP + .set{INDIVIDUAL_DEDUP_COUNT} +} + + + +/////////////////////////////////////////////////////////////////////////////////////// + + + +/////////////////////////////////////////////////////////////////////////////////////// +/* INDIVIDUAL ALIGNMENT STATS TABLE */ + + +// We need to group the log files by sample name and index +// than flatten that list and group again so that each +// entry can be emmited in groups of 6 for each task + + +CLIP_LOG.map{ sample, index, clip_log -> [ [sample, index], clip_log ] } + .set{CLIP_LOG_INDEXED} +FILTER_LOG.map{ sample, index, filter_log -> [ [sample, index], filter_log ] } + .set{FILTER_LOG_INDEXED} +TRANSCRIPTOME_ALIGNMENT_LOG_TABLE + .map{ sample, index, transcriptome_log -> [ [sample, index], transcriptome_log ] } + .set{TRANSCRIPTOME_ALIGNMENT_LOG_TABLE_INDEXED} + +TRANSCRIPTOME_QPASS_COUNTS_FOR_INDEX + .map{ sample, index, qpass_count -> [ [sample, index], qpass_count ] } + .set{TRANSCRIPTOME_QPASS_COUNTS_INDEXED} +INDIVIDUAL_DEDUP_COUNT + .map{ sample, index, dedup_count -> [ [sample, index], dedup_count ] } + .set{ INDIVIDUAL_DEDUP_COUNT_INDEXED } + +CLIP_LOG_INDEXED.join(FILTER_LOG_INDEXED) + .join(TRANSCRIPTOME_ALIGNMENT_LOG_TABLE_INDEXED) + .join(TRANSCRIPTOME_QPASS_COUNTS_INDEXED) + .join(INDIVIDUAL_DEDUP_COUNT_INDEXED) + .flatten() + .collate(7) + .set{ INDIVIDUAL_ALIGNMENT_STATS_INPUT } + +process individual_alignment_stats{ + /* + Compiles statistics coming from the individual steps: + cutadapt, filter, transcriptome and genome alignment, + quality filtering and deduplication + */ + + executor 'local' + + storeDir get_storedir("stats") + + input: + set val(sample), val(index), file(clip_log), file(filter_log),\ + file(transcriptome_log), file(qpass_count),\ + file(dedup_count)\ + from INDIVIDUAL_ALIGNMENT_STATS_INPUT + + output: + set val(sample), val(index), file("${sample}.${index}.overall_alignment.csv") \ + into INDIVIDUAL_ALIGNMENT_STATS + + """ + rfc compile-step-stats \ + -n ${sample}.${index} -c ${clip_log} \ + -f ${filter_log} -t ${transcriptome_log} \ + -q ${qpass_count} \ + -d ${dedup_count} \ + -o ${sample}.${index}.overall_alignment.csv + """ + +} + +// INDIVIDUAL ALIGNMENT STATS +/////////////////////////////////////////////////////////////////////////////////////// + +INDIVIDUAL_ALIGNMENT_STATS + .into{ INDIVIDUAL_ALIGNMENT_STATS_FOR_COLLECTION; + INDIVIDUAL_ALIGNMENT_STATS_FOR_GOUPING} + +/////////////////////////////////////////////////////////////////////////////////////// +/* COMBINE INDIVIDUAL ALIGNMENT STATS */ + +INDIVIDUAL_ALIGNMENT_STATS_FOR_COLLECTION + .map{ sample, index, stats_file -> stats_file } + .toSortedList().set{INDIVIDUAL_ALIGNMENT_STATS_COLLECTED} + +process 
combine_individual_alignment_stats{ + + executor 'local' + + storeDir get_storedir("stats") + + input: + file(stat_table) from INDIVIDUAL_ALIGNMENT_STATS_COLLECTED + + output: + file("essential_individual_stats.csv") \ + into COMBINED_INDIVIDUAL_ALIGNMENT_STATS + + """ + rfc merge overall-stats \ + -o raw_combined_individual_aln_stats.csv \ + ${stat_table} && \ + rfc stats-percentage \ + -i raw_combined_individual_aln_stats.csv \ + -o essential_individual_stats.csv + """ +} + + +// COMBINE INDIVIDUAL ALIGNMENT STATS +/////////////////////////////////////////////////////////////////////////////////////// + +/////////////////////////////////////////////////////////////////////////////////////// +/* SUM INDIVIDUAL ALIGNMENT STATS */ + +/* +For each sample, sums up the stats coming from individual lanes +*/ + +INDIVIDUAL_ALIGNMENT_STATS_FOR_GOUPING + .map{ sample, index, file -> [ sample, file ] } + .groupTuple() + .into{ INDIVIDUAL_ALIGNMENT_STATS_GROUPED ; + INDIVIDUAL_ALIGNMENT_STATS_GROUPED_VERBOSE } + +process sum_individual_alignment_stats{ + + executor 'local' + + storeDir get_storedir( "log/" + params.output.merged_lane_directory ) + + input: + set val(sample), file(stat_files) from INDIVIDUAL_ALIGNMENT_STATS_GROUPED + + output: + set val(sample), file("${sample}.merged.alignment_stats.csv")\ + into MERGED_ALIGNMENT_STATS + + """ + rfc sum-stats -n ${sample}\ + -o ${sample}.merged.alignment_stats.csv ${stat_files} + """ +} + +// SUM INDIVIDUAL ALIGNMENT STATS +//////////////////////////////////////////////////////////////////////////////// + +MERGED_ALIGNMENT_STATS.map{ sample, stats_file -> stats_file } + .toSortedList() + .set{ MERGED_ALIGNMENT_STATS_COLLECTED } + +//////////////////////////////////////////////////////////////////////////////// +/* COMBINE MERGED ALIGNMENT STATS */ + +process combine_merged_alignment_stats{ + + storeDir get_storedir("stats") + + executor 'local' + + input: + file(stat_files) from MERGED_ALIGNMENT_STATS_COLLECTED + + output: + file("essential_stats.csv") into COMBINED_MERGED_ALIGNMENT_STATS + + """ + rfc merge overall-stats \ + -o raw_combined_merged_aln_stats.csv \ + ${stat_files} && \ + rfc stats-percentage \ + -i raw_combined_merged_aln_stats.csv \ + -o essential_stats.csv + """ +} + +// COMBINE MERGED ALIGNMENT STATS +//////////////////////////////////////////////////////////////////////////////// + +/////////////////////////////////////////////////////////////////////////////// +/* METADATA CHANNELS */ +do_metadata = params.get("do_metadata", false) && params.input.get("metadata", false) + +if( do_metadata ){ + meta_base = params.input.metadata.get("base", "") + if(meta_base != "" && !meta_base.endsWith("/") ){ + meta_base = "${meta_base}/" + } + + Channel.from(params.input.metadata.files.collect{k,v -> + [k, file("${meta_base}${v}") ] }) + .into{METADATA_PRE; METADATA_PRE_VERBOSE } + + BED_FOR_RIBO + .join(METADATA_PRE, remainder: true) + .into{METADATA_RIBO; METADATA_VERBOSE} +} +else { + METADATA_PRE = Channel.from([null]) + METADATA_PRE_VERBOSE = Channel.from([null]) + + BED_FOR_RIBO + .map{ sample, bed -> [sample, bed, null] } + .into{ METADATA_RIBO; METADATA_VERBOSE } +} + + +if (params.input.get("root_meta", false)){ + ROOT_META = Channel.from([file(params.input.root_meta)]) +} +else { + ROOT_META = Channel.from([null]) +} + +// METADATA CHANNELS +/////////////////////////////////////////////////////////////////////////////// + +// FOR DEBUGGING +// QUICK WAY TO CANCEL RIBO CREATION +//METADATA_RIBO = Channel.empty() + 
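+
+/*
+For reference, the create_ribo process below is driven by the `ribo` block of the
+parameters file (params.ribo.*). A hypothetical block -- all values are placeholders,
+the authoritative ones are in the sample project.yaml -- would look like:
+
+    ribo:
+      ref_name:        sample_reference   # passed to ribopy create as --reference
+      metagene_radius: 50                 # --radius
+      left_span:       35                 # -l
+      right_span:      10                 # -r
+      read_length:
+        min: 15                           # --lengthmin
+        max: 40                           # --lengthmax
+      coverage: true                      # when false, --nocoverage is added (see coverage_argument below)
+*/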
+//////////////////////////////////////////////////////////////////////////////// +/* CREATE RIBO FILES */ + +Channel.from( file(params.input.reference.transcript_lengths) ). +into{T_LENGTHS_FOR_RIBO; T_LENGTHS_FOR_IND_RIBO} + +Channel.from( file(params.input.reference.regions) ). +into{ANNOTATION_FOR_RIBO; ANNOTATION_FOR_IND_RIBO} + +if(params.ribo.coverage){ + coverage_argument = "" +} +else{ + coverage_argument = "--nocoverage" +} + + + + +process create_ribo{ + + publishDir get_publishdir("ribo") + "/experiments", mode:'copy' + storeDir get_storedir("ribo") + "/experiments" + + input: + set val(sample), file(bed_file), file(meta_file) from METADATA_RIBO + file(transcript_length_file) from T_LENGTHS_FOR_RIBO.first() + file(annotation_file) from ANNOTATION_FOR_RIBO.first() + file(root_meta_file) from ROOT_META.first() + + output: + set val(sample), file("${sample}.ribo") into RIBO_MAIN + + script: + if (meta_file != null){ + sample_meta_argument = "--expmeta ${meta_file}" + } + else { + sample_meta_argument = "" + } + + if(root_meta_file == null){ + root_meta_argument = "" + } + else{ + root_meta_argument = "--ribometa ${root_meta_file}" + } + + + """ + ribopy create -n ${sample} \ + --reference ${params.ribo.ref_name} \ + --lengths ${transcript_length_file} \ + --annotation ${annotation_file} \ + --radius ${params.ribo.metagene_radius} \ + -l ${params.ribo.left_span} -r ${params.ribo.right_span} \ + --lengthmin ${params.ribo.read_length.min} \ + --lengthmax ${params.ribo.read_length.max} \ + ${sample_meta_argument} \ + ${root_meta_argument} \ + ${coverage_argument} \ + -n ${task.cpus} \ + --alignmentfile ${bed_file} \ + ${sample}.ribo + """ + + +} + + +RIBO_MAIN.into{RIBO_FOR_RNASEQ; RIBO_AFTER_CREATION} + + +// CREATE RIBO FILES +//////////////////////////////////////////////////////////////////////////////// + + + + +//////////////////////////////////////////////////////////////////////////////// +/* Post Genome */ + +do_post_genome = params.input.reference.get("post_genome", false) + +if(do_align_genome && do_post_genome ){ + + POST_GENOME_INDEX = Channel.from([[ + params.input.reference.post_genome + .split('/')[-1] + .replaceAll('\\*$', "") + .replaceAll('\\.$', ""), + file(params.input.reference.post_genome), + ]]) + + + +process post_genome_alignment{ + + storeDir get_storedir("post_genome_alignment") + "/" + params.output.individual_lane_directory + + input: + set val(sample), val(index), file(fastq) from FOR_POST_GENOME + set val(post_genome_base), file(post_genome_files) from POST_GENOME_INDEX.first() + + output: + set val(sample), val(index), file("${sample}.${index}.postgenome_alignment.bam") \ + into POST_GENOME_ALIGNMENT_BAM + set val(sample), val(index), file("${sample}.${index}.postgenome_alignment.bam.bai") \ + into POST_GENOMEE_ALIGNMENT_BAI + set val(sample), val(index), file("${sample}.${index}.aligned.postgenome_alignment.fastq.gz") \ + into POST_GENOME_ALIGNMENT_ALIGNED + set val(sample), val(index), file("${sample}.${index}.unaligned.postgenome_alignment.fastq.gz") \ + into POST_GENOME_ALIGNMENT_UNALIGNED + set val(sample), val(index), file("${sample}.${index}.postgenome_alignment.log") \ + into POST_GENOME_ALIGNMENT_LOG + set val(sample), val(index), file("${sample}.${index}.postgenome_alignment.csv") \ + into POST_GENOME_ALIGNMENT_CSV + set val(sample), val(index), file("${sample}.${index}.postgenome_alignment.stats") \ + into POST_GENOME_ALIGNMENT_STATS + + """ + bowtie2 ${params.alignment_arguments.transcriptome} \ + -x ${post_genome_base} -q ${fastq} \ + 
--threads ${task.cpus} \ + --al-gz ${sample}.${index}.aligned.postgenome_alignment.fastq.gz \ + --un-gz ${sample}.${index}.unaligned.postgenome_alignment.fastq.gz \ + 2> ${sample}.${index}.postgenome_alignment.log \ + | samtools view -bS - \ + | samtools sort -@ ${task.cpus} -o ${sample}.${index}.postgenome_alignment.bam \ + && samtools index -@ {task.cpus} ${sample}.${index}.postgenome_alignment.bam \ + && samtools idxstats -@ {task.cpus} ${sample}.${index}.postgenome_alignment.bam > \ + ${sample}.${index}.postgenome_alignment.stats \ + && rfc bt2-log-to-csv -o ${sample}.${index}.postgenome_alignment.csv \ + -n ${sample} -p post_genome -l ${sample}.${index}.postgenome_alignment.log + """ + +} + + + +POST_GENOME_ALIGNMENT_ALIGNED.into{ POST_GENOME_ALIGNMENT_ALIGNED_FASTQ_READ_LENGTH; + POST_GENOME_ALIGNMENT_ALIGNED_MERGE; + POST_GENOME_ALIGNMENT_ALIGNED_FASTQ_FASTQC } + +POST_GENOME_ALIGNMENT_UNALIGNED.into{ POST_GENOME_ALIGNMENT_UNALIGNED_FASTQ_READ_LENGTH; + POST_GENOME_ALIGNMENT_UNALIGNED_MERGE; + POST_GENOME_ALIGNMENT_UNALIGNED_FASTQ_FASTQC} + +// POST_GENOME ALIGNMENT +/////////////////////////////////////////////////////////////////////////////////////// + +/////////////////////////////////////////////////////////////////////////////////////// +/* MERGE POST_GENOME ALIGNMENT */ +POST_GENOME_ALIGNMENT_LOG.into{ POST_GENOME_ALIGNMENT_LOG_MERGE; POST_GENOME_ALIGNMENT_LOG_TABLE } + + +POST_GENOME_ALIGNMENT_BAM.map{sample, index, bam -> [sample, bam]}.groupTuple() + .set{ POST_GENOME_ALIGNMENT_GROUPED_BAM } + +POST_GENOME_ALIGNMENT_ALIGNED_MERGE.map{sample, index, fastq -> [sample, fastq]}.groupTuple() + .set{ POST_GENOME_ALIGNMENT_GROUPED_ALIGNED_FASTQ } + +POST_GENOME_ALIGNMENT_UNALIGNED_MERGE.map{sample, index, fastq -> [sample, fastq]}.groupTuple() + .set{ POST_GENOME_ALIGNMENT_GROUPED_UNALIGNED_FASTQ } +POST_GENOME_ALIGNMENT_LOG_MERGE.map{sample, index, log -> [sample, log]}.groupTuple() + .set{ POST_GENOME_ALIGNMENT_GROUPED_LOG } + + +POST_GENOME_ALIGNMENT_GROUPED_BAM.join( POST_GENOME_ALIGNMENT_GROUPED_ALIGNED_FASTQ ) + .join(POST_GENOME_ALIGNMENT_GROUPED_UNALIGNED_FASTQ) + .join(POST_GENOME_ALIGNMENT_GROUPED_LOG) + .set{ POST_GENOME_ALIGNMENT_GROUPED_JOINT } + + +process merge_post_genome_alignment{ + + storeDir get_storedir("post_genome_alignment") + "/" + params.output.merged_lane_directory + + input: + set val(sample), file(bam), file(aligned_fastq), \ + file(unaligned_fastq), file(alignment_log) from POST_GENOME_ALIGNMENT_GROUPED_JOINT + + output: + set val(sample), file("${sample}.post_genome.bam") \ + into POST_GENOME_ALIGNMENT_MERGED_BAM + set val(sample), file("${sample}.post_genome.bam.bai") \ + into POST_GENOME_ALIGNMENT_MERGED_BAI + set val(sample), file("${sample}.post_genome.aligned.fastq.gz") \ + into POST_GENOME_ALIGNMENT_MERGED_ALIGNED_FASTQ + set val(sample), file("${sample}.post_genome.unaligned.fastq.gz") \ + into POST_GENOME_ALIGNMENT_MERGED_UNALIGNED_FASTQ + set val(sample), file("${sample}.post_genome.log") \ + into POST_GENOME_ALIGNMENT_MERGED_LOG + set val(sample), file("${sample}.post_genome.csv") \ + into POST_GENOME_ALIGNMENT_MERGED_CSV + + """ + samtools merge ${sample}.post_genome.bam ${bam} && samtools index ${sample}.post_genome.bam && \ + zcat ${aligned_fastq} | gzip -c > ${sample}.post_genome.aligned.fastq.gz && \ + zcat ${unaligned_fastq} | gzip -c > ${sample}.post_genome.unaligned.fastq.gz && \ + rfc merge bowtie2-logs -o ${sample}.post_genome.log ${alignment_log} && \ + rfc bt2-log-to-csv -o ${sample}.post_genome.csv \ + -n ${sample} -p 
post_genome -l ${sample}.post_genome.log + """ + +} + +POST_GENOME_ALIGNMENT_CSV + .map{ sample, index, stats_file -> stats_file } + .toSortedList().set{POST_GENOME_ALIGNMENT_CSV_INDIVIDUAL_LIST} + +POST_GENOME_ALIGNMENT_MERGED_CSV + .map{ sample, stats_file -> stats_file } + .toSortedList().set{POST_GENOME_ALIGNMENT_CSV_MERGED_LIST} + +process combine_individual_postgenome_stats{ + storeDir get_storedir("post_genome_alignment") + "/logs" + + input: + file(stats_input_files) from POST_GENOME_ALIGNMENT_CSV_INDIVIDUAL_LIST + file(stats_input_files_merged) from POST_GENOME_ALIGNMENT_CSV_MERGED_LIST + + output: + file("postgenome_individual_stats.csv") \ + into POST_GENOME_ALIGNMENT_CSV_INDIVIDUAL_COMBINED + file("postgenome_merged_stats.csv") \ + into POST_GENOME_ALIGNMENT_CSV_MERGED_COMBINED + + """ + rfc merge overall-stats -o postgenome_individual_stats.csv ${stats_input_files} ; \ + rfc merge overall-stats -o postgenome_merged_stats.csv ${stats_input_files_merged} + """ + +} + +// MERGE POST GENOME ALIGNMENT +//////////////////////////////////////////////////////////////////////////////// + +} // if( params.input.reference.get("post_genome", false) ) + +// Post Genome +//////////////////////////////////////////////////////////////////////////////// + + +if(do_align_genome){ + + process append_genome_stats{ + storeDir get_storedir("stats") + + executor 'local' + echo true + + input: + file(genome_alignment_individual) from GENOME_ALIGNMENT_CSV_INDIVIDUAL_COMBINED + file(genome_alignment_merged) from GENOME_ALIGNMENT_CSV_MERGED_COMBINED + file(individual_alignment_stats) from COMBINED_INDIVIDUAL_ALIGNMENT_STATS + file(merged_alignment_stats) from COMBINED_MERGED_ALIGNMENT_STATS + + + output: + file("individual_stats_with_genome.csv") \ + into COMBINED_INDIVIDUAL_ALIGNMENT_STATS_WITH_GENOME + file("merged_alignment_stats_with_genome.csv") \ + into COMBINED_MERGED_ALIGNMENT_STATS_WITH_GENOME + + + """ + rfc merge concat-csv -o individual_stats_with_genome.csv \ + ${individual_alignment_stats} ${genome_alignment_individual} && \ + rfc merge concat-csv -o merged_alignment_stats_with_genome.csv \ + ${merged_alignment_stats} ${genome_alignment_merged} + """ + } + + if(do_post_genome){ + + process append_post_genome_stats{ + storeDir get_storedir("stats") + + executor 'local' + + input: + file(post_genome_alignment_individual) from POST_GENOME_ALIGNMENT_CSV_INDIVIDUAL_COMBINED + file(post_genome_alignment_merged) from POST_GENOME_ALIGNMENT_CSV_MERGED_COMBINED + file(individual_alignment_stats) from COMBINED_INDIVIDUAL_ALIGNMENT_STATS_WITH_GENOME + file(merged_alignment_stats) from COMBINED_MERGED_ALIGNMENT_STATS_WITH_GENOME + + output: + file("individual_stats_with_post_genome.csv") \ + into COMBINED_INDIVIDUAL_ALIGNMENT_STATS_WITH_POST_GENOME + file("merged_alignment_stats_with_post_genome.csv") \ + into COMBINED_MERGED_ALIGNMENT_STATS_WITH_POST_GENOME + + """ + rfc merge concat-csv -o individual_stats_with_post_genome.csv \ + ${individual_alignment_stats} ${post_genome_alignment_individual} ; + rfc merge concat-csv -o merged_alignment_stats_with_post_genome.csv \ + ${merged_alignment_stats} ${post_genome_alignment_merged} + """ + + } // process append_post_genome_stats + + COMBINED_INDIVIDUAL_ALIGNMENT_STATS_WITH_POST_GENOME + .set{ULTIMATE_INDIVIDUAL_STATS} + + COMBINED_MERGED_ALIGNMENT_STATS_WITH_POST_GENOME + .set{ULTIMATE_MERGED_STATS} + + } //if(do_post_genome){ + else{ + COMBINED_INDIVIDUAL_ALIGNMENT_STATS_WITH_GENOME + .set{ULTIMATE_INDIVIDUAL_STATS} + + 
COMBINED_MERGED_ALIGNMENT_STATS_WITH_GENOME + .set{ULTIMATE_MERGED_STATS} + } // (else of) //if(do_post_genome){ + +} // end of if(do_align_genome) +else{ + //publish results + COMBINED_INDIVIDUAL_ALIGNMENT_STATS.set{ULTIMATE_INDIVIDUAL_STATS} + COMBINED_MERGED_ALIGNMENT_STATS.set{ULTIMATE_MERGED_STATS} +} //(else of) if(do_align_genome) + +process publish_stats{ + + publishDir get_publishdir("stats"), mode: "copy" + + executor 'local' + + input: + file(individual_stats) from ULTIMATE_INDIVIDUAL_STATS + file(merged_stats) from ULTIMATE_MERGED_STATS + + output: + file("individual_stats.csv") into INDIVIDUAL_STATS_PUBLISHED + file("stats.csv") into MERGED_STATS_PUBLISHED + + """ + cp ${individual_stats} individual_stats.csv && \ + cp ${merged_stats} stats.csv + """ +} + + +//////////////////////////////////////////////////////////////////////////////// +//////////////////////////////////////////////////////////////////////////////// +/////// /* RNA-Seq */ ///////// +//////////////////////////////////////////////////////////////////////////////// +//////////////////////////////////////////////////////////////////////////////// + +//////////////////////////////////////////////////////////////////////////////// +////// General Function Definitions //////////////////////////////////////////// + +String get_rnaseq_storedir(output_type){ + new File( params.output.intermediates.base + "/rnaseq", + params.output.intermediates.get(output_type, output_type) ) + .getCanonicalPath() +} + +String get_rnaseq_publishdir(output_type){ + new File( params.output.output.base + "/rnaseq", + params.output.output.get(output_type, output_type) ) + .getCanonicalPath() +} + +////// General Function Definitions //////////////////////////////////////////// +//////////////////////////////////////////////////////////////////////////////// + + + +// Both the boolean flag 'do_rnaseq' +// AND actual rnaseq node must be set to perform +// rnaseq data processing steps. +do_rnaseq = params.get("do_rnaseq", false) && \ + params.get("rnaseq", false) + +// This outer if clause contains the rest of the RNASEQ +if (do_rnaseq){ + +rnaseq_fastq_base = params.rnaseq.get("fastq_base", "") +if(! rnaseq_fastq_base.endsWith("/") && rnaseq_fastq_base != "") { + rnaseq_fastq_base = "${rnaseq_fastq_base}/" +} + +// Group input files into a list of tuples where each item is +// [ sample, fileindex, path_to_fastq_file] + +Channel.from(params.rnaseq.fastq.collect{k,v -> + v.collect{ z -> [k, v.indexOf(z) + 1, + file("${rnaseq_fastq_base}${z}")] } }) + .flatten().collate(3).into{ RNASEQ_FASTQ; + RNASEQ_FASTQ_VERBOSE; + RNASEQ_FASTQ_FASTQC; + RNASEQ_FASTQ_CLIP; + RNASEQ_FASTQ_EXISTENCE} + +if(params.do_check_file_existence){ + // Make Sure Fastq Files Exist + RNASEQ_FASTQ_EXISTENCE + .map{ sample, index, this_file -> file_exists(this_file) } +} + +process rnaseq_raw_fastqc{ + + publishDir get_rnaseq_publishdir("fastqc"), mode: 'copy' + + input: + set val(sample), val(index), file(fastq) from RNASEQ_FASTQ_FASTQC + + output: + set val(sample), file("${sample}.${index}_fastqc.html"), + file("${sample}.${index}_fastqc.zip") into RNASEQ_FASTQC_OUT + + when: + params.do_fastqc && do_rnaseq + + """ + if [ ! 
-f ${sample}.${index}.fastq.gz ]; then + ln -s $fastq ${sample}.${index}.fastq.gz + fi + fastqc ${sample}.${index}.fastq.gz --outdir=\$PWD -t ${task.cpus} + """ +} + +process rnaseq_clip{ + storeDir get_rnaseq_storedir("clip") + + input: + set val(sample), val(index), file(fastq) from RNASEQ_FASTQ_CLIP + + output: + set val(sample), val(index), file("${sample}.${index}.clipped.fastq.gz") \ + into RNASEQ_CLIP_OUT + set val(sample), val(index), file("${sample}.${index}.clipped.log") \ + into RNASEQ_CLIP_LOG + + """ + cutadapt --cores=${task.cpus} ${params.rnaseq.clip_arguments} ${fastq} 2>${sample}.${index}.clipped.log \ + | gzip -c > ${sample}.${index}.clipped.fastq.gz + """ +} + +RNASEQ_FILTER_INDEX = Channel.from([[ + params.input.reference.filter + .split('/')[-1] + .replaceAll('\\*$', "") + .replaceAll('\\.$', ""), + file(params.input.reference.filter), + ]]) + +process rnaseq_filter{ + + storeDir get_rnaseq_storedir("filter") + + input: + set val(sample), val(index), file(fastq) \ + from RNASEQ_CLIP_OUT + set val(bowtie2_index_base), file(bowtie2_index_files) \ + from RNASEQ_FILTER_INDEX.first() + + output: + set val(sample), val(index), file("${sample}.${index}.filter.bam") \ + into RNASEQ_FILTER_BAM + set val(sample), val(index), file("${sample}.${index}.filter.bam.bai") \ + into RNASEQ_FILTER_BAI + set val(sample), val(index), file("${sample}.${index}.aligned.filter.fastq.gz") \ + into RNASEQ_FILTER_ALIGNED + set val(sample), val(index), file("${sample}.${index}.unaligned.filter.fastq.gz") \ + into RNASEQ_FILTER_UNALIGNED + set val(sample), val(index), file("${sample}.${index}.filter.log") \ + into RNASEQ_FILTER_LOG + set val(sample), val(index), file("${sample}.${index}.filter.stats") \ + into RNASEQ_FILTER_STATS + + + """ + bowtie2 ${params.rnaseq.filter_arguments} \ + -x ${bowtie2_index_base} -q ${fastq} \ + --threads ${task.cpus} \ + --al-gz ${sample}.${index}.aligned.filter.fastq.gz \ + --un-gz ${sample}.${index}.unaligned.filter.fastq.gz \ + 2> ${sample}.${index}.filter.log \ + | samtools view -bS - \ + | samtools sort -@ ${task.cpus} -o ${sample}.${index}.filter.bam \ + && samtools index -@ {task.cpus} ${sample}.${index}.filter.bam \ + && samtools idxstats -@ {task.cpus} ${sample}.${index}.filter.bam > \ + ${sample}.${index}.filter.stats + """ + +} + +RNASEQ_FILTER_UNALIGNED.into{RNASEQ_FILTER_UNALIGNED_FASTQ_READ_LENGTH; + RNASEQ_FILTER_UNALIGNED_FASTQ_FASTQC; + RNASEQ_FILTER_UNALIGNED_TRANSCRIPTOME} + + +rnaseq_bt2_arguments = params.rnaseq.get("bt2_argumments", "") + +RNASEQ_TRANSCRIPTOME_INDEX = Channel.from([[ + params.input.reference.transcriptome + .split('/')[-1] + .replaceAll('\\*$', "") + .replaceAll('\\.$', ""), + file(params.input.reference.transcriptome), + ]]) + + +process rnaseq_transcriptome_alignment{ + + storeDir get_rnaseq_storedir("transcriptome_alignment") + "/" +\ + params.output.individual_lane_directory + + input: + set val(sample), val(index), file(fastq) \ + from RNASEQ_FILTER_UNALIGNED_TRANSCRIPTOME + set val(transcriptome_reference), file(transcriptome_Reference_files) \ + from RNASEQ_TRANSCRIPTOME_INDEX.first() + + output: + set val(sample), val(index), file("${sample}.${index}.transcriptome_alignment.bam") \ + into RNASEQ_TRANSCRIPTOME_ALIGNMENT_BAM_PRE + set val(sample), val(index), file("${sample}.${index}.transcriptome_alignment.bam.bai") \ + into RNASEQ_TRANSCRIPTOME_ALIGNMENT_BAI + set val(sample), val(index), file("${sample}.${index}.aligned.transcriptome_alignment.fastq.gz") \ + into RNASEQ_TRANSCRIPTOME_ALIGNMENT_ALIGNED + set 
val(sample), val(index), file("${sample}.${index}.unaligned.transcriptome_alignment.fastq.gz") \ + into RNASEQ_TRANSCRIPTOME_ALIGNMENT_UNALIGNED + set val(sample), val(index), file("${sample}.${index}.transcriptome_alignment.log") \ + into RNASEQ_TRANSCRIPTOME_ALIGNMENT_LOG + set val(sample), val(index), file("${sample}.${index}.transcriptome_alignment.stats") \ + into RNASEQ_TRANSCRIPTOME_ALIGNMENT_STATS +
+ """ + bowtie2 ${rnaseq_bt2_arguments} \ + -x ${transcriptome_reference} -q ${fastq} \ + --threads ${task.cpus} \ + --al-gz ${sample}.${index}.aligned.transcriptome_alignment.fastq.gz \ + --un-gz ${sample}.${index}.unaligned.transcriptome_alignment.fastq.gz \ + 2> ${sample}.${index}.transcriptome_alignment.log \ + | samtools view -bS - \ + | samtools sort -@ ${task.cpus} -o ${sample}.${index}.transcriptome_alignment.bam \ + && samtools index -@ ${task.cpus} ${sample}.${index}.transcriptome_alignment.bam \ + && samtools idxstats -@ ${task.cpus} ${sample}.${index}.transcriptome_alignment.bam > \ + ${sample}.${index}.transcriptome_alignment.stats + """ +} + + +
+RNASEQ_TRANSCRIPTOME_ALIGNMENT_BAM_PRE +.into{ RNASEQ_TRANSCRIPTOME_ALIGNMENT_BAM; + RNASEQ_TRANSCRIPTOME_ALIGNMENT_BAM_MERGE; + RNASEQ_TRANSCRIPTOME_ALIGNMENT_BAM_FOR_QUALITY} +
+process rnaseq_quality_filter{ + + storeDir get_rnaseq_storedir("quality_filter") + + input: + set val(sample), val(index), file(bam) \ + from RNASEQ_TRANSCRIPTOME_ALIGNMENT_BAM_FOR_QUALITY + + output: + set val(sample), val(index), + file("${sample}.${index}.transcriptome_alignment.qpass.bam") \ + into RNASEQ_TRANSCRIPTOME_ALIGNMENT_QPASS_BAM_PRE + set val(sample), val(index), + file("${sample}.${index}.transcriptome_alignment.qpass.bam.bai") \ + into RNASEQ_TRANSCRIPTOME_ALIGNMENT_QPASS_BAI + set val(sample), val(index), + file("${sample}.${index}.qpass.count") \ + into RNASEQ_TRANSCRIPTOME_QPASS_COUNTS + set val(sample), val(index), + file("${sample}.${index}.transcriptome_alignment.qpass.stats") \ + into RNASEQ_TRANSCRIPTOME_ALIGNMENT_QPASS_STATS +
+ """ + samtools view -b -q ${params.mapping_quality_cutoff} ${bam}\ + | samtools sort -@ ${task.cpus} -o ${sample}.${index}.transcriptome_alignment.qpass.bam \ + && samtools view -b -c ${sample}.${index}.transcriptome_alignment.qpass.bam > ${sample}.${index}.qpass.count \ + && samtools index -@ ${task.cpus} ${sample}.${index}.transcriptome_alignment.qpass.bam \ + && samtools idxstats -@ ${task.cpus} ${sample}.${index}.transcriptome_alignment.qpass.bam > \ + ${sample}.${index}.transcriptome_alignment.qpass.stats + """ +} +
+RNASEQ_TRANSCRIPTOME_ALIGNMENT_QPASS_BAM_PRE +.into{ RNASEQ_QPASS_BAM_READ_LENGTH; + RNASEQ_TRANSCRIPTOME_ALIGNMENT_QPASS_BAM} + +// QUALITY FILTER +/////////////////////////////////////////////////////////////////////////////////////// +
+RNASEQ_TRANSCRIPTOME_QPASS_COUNTS +.into{RNASEQ_TRANSCRIPTOME_QPASS_COUNTS_FOR_INDEX; + RNASEQ_TRANSCRIPTOME_QPASS_COUNTS_FOR_TABLE} + +// We need to copy output channels of transcriptome alignment +// for merging and various steps of downstream processing +
+RNASEQ_TRANSCRIPTOME_ALIGNMENT_BAI +.into{ RNASEQ_TRANSCRIPTOME_ALIGNMENT_BAI_MERGE ; + RNASEQ_TRANSCRIPTOME_ALIGNMENT_BAI_REGION_COUNT} + +RNASEQ_TRANSCRIPTOME_ALIGNMENT_ALIGNED +.into{ RNASEQ_TRANSCRIPTOME_ALIGNMENT_ALIGNED_MERGE ; + RNASEQ_TRANSCRIPTOME_ALIGNMENT_ALIGNED_LENGTH ; + RNASEQ_TRANSCRIPTOME_ALIGNMENT_ALIGNED_FASTQC } +
+RNASEQ_TRANSCRIPTOME_ALIGNMENT_UNALIGNED +.into{ RNASEQ_TRANSCRIPTOME_ALIGNMENT_UNALIGNED_MERGE ; + RNASEQ_TRANSCRIPTOME_ALIGNMENT_UNALIGNED_GENOME ; + 
RNASEQ_TRANSCRIPTOME_ALIGNMENT_UNALIGNED_LENGTH ; + RNASEQ_TRANSCRIPTOME_ALIGNMENT_UNALIGNED_FASTQC } + +RNASEQ_TRANSCRIPTOME_ALIGNMENT_LOG +.into{ RNASEQ_TRANSCRIPTOME_ALIGNMENT_LOG_MERGE ; + RNASEQ_TRANSCRIPTOME_ALIGNMENT_LOG_TABLE } + +RNASEQ_TRANSCRIPTOME_ALIGNMENT_STATS +.into{ RNASEQ_TRANSCRIPTOME_ALIGNMENT_STATS_MERGE ; + RNASEQ_TRANSCRIPTOME_ALIGNMENT_STATS_TABLE } + + +process rnaseq_bam_to_bed{ + + storeDir get_rnaseq_storedir("bam_to_bed") + "/" + params.output.individual_lane_directory + + input: + set val(sample), val(index), file(bam) from RNASEQ_TRANSCRIPTOME_ALIGNMENT_QPASS_BAM + + output: + set val(sample), val(index), file("${sample}.${index}.bed") into RNASEQ_BAM_TO_BED + set val(sample), val(index), file("${sample}.${index}_nodedup_count.txt") \ + into RNASEQ_INDIVIDUAL_DEDUP_COUNT_WITHOUT_DEDUP + + """ + if [ `samtools view -c ${bam}` -eq 0 ]; + then + touch ${sample}.${index}.bed + else + bamToBed -i ${bam} > ${sample}.${index}.bed + fi + + wc -l ${sample}.${index}.bed > ${sample}.${index}_nodedup_count.txt + """ +} + + RNASEQ_BAM_TO_BED.into{ RNASEQ_BED_NODEDUP; + RNASEQ_BED_FOR_DEDUP; + RNASEQ_BED_FOR_INDEX_SEP_PRE } + +do_rnaseq_dedup = params.rnaseq.get("deduplicate", false) + +process rnaseq_add_sample_index_col_to_bed{ + + storeDir get_rnaseq_storedir("bam_to_bed") + "/" + params.output.individual_lane_directory + + input: + set val(sample), val(index), file(bed) from RNASEQ_BED_FOR_DEDUP + + output: + set val(sample), file("${sample}.${index}.with_sample_index.bed")\ + into RNASEQ_BED_FOR_DEDUP_INDEX_COL_ADDED + + """ + awk -v newcol=${sample}.${index} '{print(\$0"\\t"newcol)}' ${bed}\ + > ${sample}.${index}.with_sample_index.bed + """ +} + +RNASEQ_BED_FOR_DEDUP_INDEX_COL_ADDED.groupTuple() + .set{ RNASEQ_BED_FOR_DEDUP_INDEX_COL_ADDED_GROUPED } + +process rnaseq_merge_bed{ + + storeDir get_rnaseq_storedir("bam_to_bed") + "/" + params.output.merged_lane_directory + + input: + set val(sample), file(bed_files) from RNASEQ_BED_FOR_DEDUP_INDEX_COL_ADDED_GROUPED + + output: + set val(sample), file("${sample}.merged.pre_dedup.bed") \ + into RNASEQ_BED_MERGED_PRE_DEDUP + + + """ + cat ${bed_files} | sort -k1,1 -k2,2n -k3,3n > ${sample}.merged.pre_dedup.bed + """ +} + +RNASEQ_BED_MERGED_PRE_DEDUP +.into{RNASEQ_BED_FOR_DEDUP_MERGED_PRE_DEDUP; + RNASEQ_BED_NODEDUP_FOR_RIBO} + + +process rnaseq_deduplicate{ + + storeDir get_rnaseq_storedir("alignment_ribo") + "/" + params.output.merged_lane_directory + + input: + set val(sample), file(bed) from RNASEQ_BED_FOR_DEDUP_MERGED_PRE_DEDUP + + output: + set val(sample), file("${sample}.merged.post_dedup.bed") \ + into RNASEQ_BED_FOR_DEDUP_MERGED_POST_DEDUP + + when: + do_rnaseq_dedup + + """ + rfc dedup -i ${bed} -o ${sample}.merged.post_dedup.bed + """ +} + +RNASEQ_BED_FOR_DEDUP_MERGED_POST_DEDUP +.into{RNASEQ_BED_FOR_DEDUP_MERGED_POST_DEDUP_FOR_SEP; + RNASEQ_BED_FOR_DEDUP_MERGED_POST_DEDUP_FOR_RIBO} + +RNASEQ_BED_FOR_INDEX_SEP_PRE +.map{ sample,index,file -> [sample, index] } +.combine(RNASEQ_BED_FOR_DEDUP_MERGED_POST_DEDUP_FOR_SEP, by:0) +.set{ RNASEQ_BED_FOR_INDEX_SEP_POST_DEDUP } + +process rnaseq_separate_bed_post_dedup{ + + storeDir get_rnaseq_storedir("alignment_ribo") + "/" + params.output.individual_lane_directory + + input: + set val(sample), val(index), file(bed) from RNASEQ_BED_FOR_INDEX_SEP_POST_DEDUP + + output: + set val(sample), val(index), file("${sample}.${index}.post_dedup.bed") \ + into RNASEQ_BED_DEDUPLICATED + set val(sample), val(index), file("${sample}.${index}.count_after_dedup.txt")\ + into 
RNASEQ_INDIVIDUAL_DEDUP_COUNT_WITH_DEDUP + + """ + awk -v this_sample=${sample}.${index} \ + '{ if(\$7 == this_sample ){print(\$1"\\t"\$2"\\t"\$3"\\t"\$4"\\t"\$5"\\t"\$6)} }' ${bed} > ${sample}.${index}.post_dedup.bed \ + && wc -l ${sample}.${index}.post_dedup.bed > ${sample}.${index}.count_after_dedup.txt + """ +} +
+if(do_rnaseq_dedup){ + RNASEQ_BED_FOR_DEDUP_MERGED_POST_DEDUP_FOR_RIBO + .into{RNASEQ_BED_FOR_SEPARATION; RNASEQ_BED_FOR_RIBO_FINAL} + RNASEQ_INDIVIDUAL_DEDUP_COUNT_WITH_DEDUP.set{RNASEQ_INDIVIDUAL_DEDUP_COUNT} +} +else{ + RNASEQ_BED_NODEDUP_FOR_RIBO + .into{RNASEQ_BED_FOR_SEPARATION; RNASEQ_BED_FOR_RIBO_FINAL} + RNASEQ_INDIVIDUAL_DEDUP_COUNT_WITHOUT_DEDUP.set{RNASEQ_INDIVIDUAL_DEDUP_COUNT} +} + +
+//////////////////////////////////////////////////////////////////////////////// + + +// We need to group the log files by sample name and index, +// then flatten that list and group again so that each +// entry can be emitted in groups of 7 for each task + +
+RNASEQ_CLIP_LOG.map{ sample, index, clip_log -> [ [sample, index], clip_log ] } + .set{RNASEQ_CLIP_LOG_INDEXED} +RNASEQ_FILTER_LOG.map{ sample, index, filter_log -> [ [sample, index], filter_log ] } + .set{RNASEQ_FILTER_LOG_INDEXED} +RNASEQ_TRANSCRIPTOME_ALIGNMENT_LOG_TABLE + .map{ sample, index, transcriptome_log -> [ [sample, index], transcriptome_log ] } + .set{RNASEQ_TRANSCRIPTOME_ALIGNMENT_LOG_TABLE_INDEXED} +RNASEQ_TRANSCRIPTOME_QPASS_COUNTS_FOR_INDEX + .map{ sample, index, qpass_count -> [ [sample, index], qpass_count ] } + .set{RNASEQ_TRANSCRIPTOME_QPASS_COUNTS_INDEXED} +RNASEQ_INDIVIDUAL_DEDUP_COUNT + .map{ sample, index, dedup_count -> [ [sample, index], dedup_count ] } + .set{ RNASEQ_INDIVIDUAL_DEDUP_COUNT_INDEXED } +
+RNASEQ_CLIP_LOG_INDEXED.join(RNASEQ_FILTER_LOG_INDEXED) + .join(RNASEQ_TRANSCRIPTOME_ALIGNMENT_LOG_TABLE_INDEXED) + .join(RNASEQ_TRANSCRIPTOME_QPASS_COUNTS_INDEXED) + .join(RNASEQ_INDIVIDUAL_DEDUP_COUNT_INDEXED) + .flatten() + .collate(7) + .set{ RNASEQ_INDIVIDUAL_ALIGNMENT_STATS_INPUT } + +
+process rnaseq_individual_alignment_stats{ + + //Compiles statistics coming from the individual steps: + //cutadapt, filter, transcriptome alignment, + //quality filtering and deduplication + + + executor 'local' + + storeDir get_rnaseq_storedir("stats") + + input: + set val(sample), val(index), file(clip_log), file(filter_log),\ + file(transcriptome_log), file(qpass_count),\ + file(dedup_count)\ + from RNASEQ_INDIVIDUAL_ALIGNMENT_STATS_INPUT + + output: + set val(sample), val(index), file("${sample}.${index}.rnaseq_overall_alignment.csv") \ + into RNASEQ_INDIVIDUAL_ALIGNMENT_STATS +
+ """ + rfc compile-step-stats \ + -n ${sample}.${index} \ + -c ${clip_log} \ + -f ${filter_log} \ + -t ${transcriptome_log} \ + -q ${qpass_count} \ + -d ${dedup_count} \ + -o ${sample}.${index}.rnaseq_overall_alignment.csv + """ + +} +
+// INDIVIDUAL ALIGNMENT STATS +/////////////////////////////////////////////////////////////////////////////////////// + +RNASEQ_INDIVIDUAL_ALIGNMENT_STATS + .into{ RNASEQ_INDIVIDUAL_ALIGNMENT_STATS_FOR_COLLECTION; + RNASEQ_INDIVIDUAL_ALIGNMENT_STATS_FOR_GOUPING} + +/////////////////////////////////////////////////////////////////////////////////////// +/* COMBINE INDIVIDUAL ALIGNMENT STATS */ +
+RNASEQ_INDIVIDUAL_ALIGNMENT_STATS_FOR_COLLECTION + .map{ sample, index, stats_file -> stats_file } + .toSortedList().set{RNASEQ_INDIVIDUAL_ALIGNMENT_STATS_COLLECTED} + +process rnaseq_combine_individual_alignment_stats{ + + executor 'local' + + publishDir get_rnaseq_publishdir("stats"), mode: 
'copy' + + input: + file(stat_table) from RNASEQ_INDIVIDUAL_ALIGNMENT_STATS_COLLECTED + + output: + file("rnaseq_individual_stats.csv") \ + into RNASEQ_COMBINED_INDIVIDUAL_ALIGNMENT_STATS + + """ + rfc merge overall-stats \ + -o raw_combined_individual_aln_stats.csv \ + ${stat_table} && \ + rfc stats-percentage \ + -i raw_combined_individual_aln_stats.csv \ + -o rnaseq_individual_stats.csv + """ +} + +
+// COMBINE INDIVIDUAL ALIGNMENT STATS +/////////////////////////////////////////////////////////////////////////////////////// + +/////////////////////////////////////////////////////////////////////////////////////// +/* SUM INDIVIDUAL ALIGNMENT STATS */ + +/* +For each sample, sums up the stats coming from individual lanes +*/ +
+RNASEQ_INDIVIDUAL_ALIGNMENT_STATS_FOR_GOUPING + .map{ sample, index, file -> [ sample, file ] } + .groupTuple() + .into{ RNASEQ_INDIVIDUAL_ALIGNMENT_STATS_GROUPED ; + RNASEQ_INDIVIDUAL_ALIGNMENT_STATS_GROUPED_VERBOSE } +
+process rnaseq_sum_individual_alignment_stats{ + + executor 'local' + + storeDir get_rnaseq_storedir( "log/" + params.output.merged_lane_directory ) + + input: + set val(sample), file(stat_files) from RNASEQ_INDIVIDUAL_ALIGNMENT_STATS_GROUPED + + output: + set val(sample), file("${sample}.rnaseq.merged.alignment_stats.csv")\ + into RNASEQ_MERGED_ALIGNMENT_STATS + + """ + rfc sum-stats -n ${sample}\ + -o ${sample}.rnaseq.merged.alignment_stats.csv ${stat_files} + """ +} +
+// SUM INDIVIDUAL ALIGNMENT STATS +//////////////////////////////////////////////////////////////////////////////// + +RNASEQ_MERGED_ALIGNMENT_STATS +.map{ sample, stats_file -> stats_file } +.toSortedList() +.set{ RNASEQ_MERGED_ALIGNMENT_STATS_COLLECTED } + +//////////////////////////////////////////////////////////////////////////////// +/* COMBINE MERGED ALIGNMENT STATS */ +
+process rnaseq_combine_merged_alignment_stats{ + + publishDir get_rnaseq_publishdir("stats"), mode: 'copy' + + executor 'local' + + input: + file(stat_files) from RNASEQ_MERGED_ALIGNMENT_STATS_COLLECTED + + output: + file("rnaseq_stats.csv") into RNASEQ_COMBINED_MERGED_ALIGNMENT_STATS + + """ + rfc merge overall-stats \ + -o raw_combined_merged_aln_stats.csv \ + ${stat_files} && \ + rfc stats-percentage \ + -i raw_combined_merged_aln_stats.csv \ + -o rnaseq_stats.csv + """ +} +
+// COMBINE MERGED ALIGNMENT STATS +//////////////////////////////////////////////////////////////////////////////// + +RNASEQ_FOR_RIBOPY = Channel.create() +RIBO_FOR_RNASEQ_EXCLUDED = Channel.create() + +/* +Separate the ribo files which have rnaseq data from those which don't +*/ +RIBO_FOR_RNASEQ +.join(RNASEQ_BED_FOR_RIBO_FINAL, remainder: true) +.choice( RNASEQ_FOR_RIBOPY, RIBO_FOR_RNASEQ_EXCLUDED ) +{it[2] != null ? 0 : 1} +
+RIBO_FOR_RNASEQ_EXCLUDED.map{ sample, ribo, bed_null -> [sample, ribo]} +.set{ RIBO_FOR_RNASEQ_EXCLUDED_FOR_MERGE } +
+process put_rnaseq_into_ribo{ + publishDir get_publishdir("ribo") + "/experiments", mode: 'copy' + + input: + set val(sample), file(ribo), file(rnaseq) from RNASEQ_FOR_RIBOPY + + output: + set val(sample), file(ribo) into RIBO_WITH_RNASEQ_PRE + + """ + ribopy rnaseq set -n ${sample} -a ${rnaseq} -f bed --force ${ribo} + """ + +} +
+// For the downstream "merge_ribos" process, +// we need to combine the ribos with and without rnaseq data. 
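+// Illustration only (the sample names are simply those used in project.yaml):
+// if only GSM1606107 had RNA-Seq fastq files, the join/choice above would emit
+//   [ GSM1606107, <ribo file>, <merged bed file> ]  -> RNASEQ_FOR_RIBOPY
+//   [ GSM1606108, <ribo file>, null ]               -> RIBO_FOR_RNASEQ_EXCLUDED
+// and the concat below re-unites both branches into [sample, ribo] pairs.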
+RIBO_WITH_RNASEQ_PRE.concat( RIBO_FOR_RNASEQ_EXCLUDED_FOR_MERGE ) +.into{RIBO_WITH_RNASEQ; RIBO_WITH_RNASEQ_VERBOSE} + +} // if (do_rnaseq) +// RNA-Seq +//////////////////////////////////////////////////////////////////////////////// +//////////////////////////////////////////////////////////////////////////////// +//////////////////////////////////////////////////////////////////////////////// +//////////////////////////////////////////////////////////////////////////////// + +/* Merge Ribos*/ + +if(do_rnaseq){ + RIBO_WITH_RNASEQ.set{RIBO_FOR_MERGE_PRE} +} +else{ + RIBO_AFTER_CREATION.set{RIBO_FOR_MERGE_PRE} +} + +RIBO_FOR_MERGE_PRE.map{ sample, ribo -> [ribo]}.flatten().collect() + .set{RIBO_FOR_MERGE} + +process merge_ribos{ + + publishDir get_publishdir("ribo"), mode:'copy' + + input: + file(sample_ribo) from RIBO_FOR_MERGE + + output: + file("all.ribo") into ALL_RIBO + + """ + ribopy merge all.ribo ${sample_ribo} + """ + +} + +// Merge Ribos +//////////////////////////////////////////////////////////////////////////////// diff --git a/VERSION b/VERSION new file mode 100644 index 0000000..245f903 --- /dev/null +++ b/VERSION @@ -0,0 +1 @@ +version = '0.0.0' diff --git a/configs/docker_local.config b/configs/docker_local.config new file mode 100644 index 0000000..0c597b4 --- /dev/null +++ b/configs/docker_local.config @@ -0,0 +1,63 @@ + +// Default configuration for running the pipeline on a local machine + + +process { + // if the process name is not listed separately below + // the following settings are used + executor='local' + cpus = 1 + maxRetries = 1 + errorStrategy = 'retry' + + cpus = 1 + + // Override the following defaults + // by specifying the process name + + withName: quality_filter{ + cpus = 4 + } + + withName: clip{ + cpus = 4 + } + + withName: filter{ + cpus = 4 + } + + withName: transcriptome_alignment{ + cpus = 4 + } + + withName: quality_filter{ + cpus = 4 + } + + withName: genome_alignment{ + cpus = 4 + } + + withName: create_ribo{ + cpus = 4 + } + + withName: post_genome_alignment{ + cpus = 4 + } + +} + + +// Total number of CPUs reserved for nextflow +executor { + cpus = 4 +} + + +docker { + enabled = true + runOptions = '-u $(id -u):$(id -g)' + temp = 'auto' +} diff --git a/configs/local.config b/configs/local.config new file mode 100644 index 0000000..e16dbea --- /dev/null +++ b/configs/local.config @@ -0,0 +1,62 @@ + +// Default configuration for running the pipeline on a local machine + + +process { + // if the process name is not listed separately below + // the following settings are used + executor='local' + cpus = 1 + maxRetries = 1 + errorStrategy = 'retry' + + cpus = 1 + + // Override the following defaults + // by specifying the process name + + withName: quality_filter{ + cpus = 4 + } + + withName: clip{ + cpus = 4 + } + + withName: filter{ + cpus = 4 + } + + withName: transcriptome_alignment{ + cpus = 4 + } + + withName: quality_filter{ + cpus = 4 + } + + withName: genome_alignment{ + cpus = 4 + } + + withName: create_ribo{ + cpus = 4 + } + + withName: post_genome_alignment{ + cpus = 4 + } + +} + + +// Total number of CPUs reserved for nextflow +executor { + cpus = 4 +} + + +docker { + enabled = false + runOptions = '-u $(id -u):$(id -g)' +} diff --git a/configs/stampede_local.config b/configs/stampede_local.config new file mode 100644 index 0000000..3aa1e71 --- /dev/null +++ b/configs/stampede_local.config @@ -0,0 +1,67 @@ + +// Default configuration for running the pipeline on a node of TACC Stampede2 + + +process { + // if the process name is not 
listed separately below + // the following settings are used + executor='local' + cpus = 1 + maxRetries = 1 + errorStrategy = 'retry' + + cpus = 1 + + + // Override the following defaults + // by specifying the process name + + withName: md5sum { + cpus = 1 + } + + withName: quality_filter{ + cpus = 4 + } + + withName: clip{ + cpus = 4 + } + + withName: filter{ + cpus = 8 + } + + withName: transcriptome_alignment{ + cpus = 8 + } + + withName: quality_filter{ + cpus = 8 + } + + withName: genome_alignment{ + cpus = 8 + } + + withName: create_ribo{ + cpus = 8 + } + + withName: post_genome_alignment{ + cpus = 8 + } + +} + + +// Total number of CPUs reserved for nextflow +executor { + cpus = 48 +} + + +docker { + enabled = false + runOptions = '-u $(id -u):$(id -g)' +} diff --git a/docker/Dockerfile b/docker/Dockerfile new file mode 100644 index 0000000..711d272 --- /dev/null +++ b/docker/Dockerfile @@ -0,0 +1,31 @@ +FROM ubuntu:18.04 + +RUN apt-get update --fix-missing && \ + apt-get install -q -y wget curl bzip2 libbz2-dev git build-essential zlib1g-dev locales vim fontconfig ttf-dejavu + + +# Set the locale +RUN locale-gen en_US.UTF-8 +ENV LANG en_US.UTF-8 +ENV LANGUAGE en_US:en +ENV LC_ALL en_US.UTF-8 + +# Install conda +RUN curl -LO http://repo.continuum.io/miniconda/Miniconda3-latest-Linux-x86_64.sh && \ + bash Miniconda3-latest-Linux-x86_64.sh -p /miniconda3 -b && \ + rm Miniconda3-latest-Linux-x86_64.sh +ENV PATH=/miniconda3/bin:${PATH} + +# Install conda dependencies +ADD environment.yaml / +ADD VERSION / +RUN pwd +RUN conda config --set always_yes yes --set changeps1 no && \ + conda config --add channels conda-forge && \ + conda config --add channels defaults && \ + conda config --add channels bioconda && \ + conda config --get && \ + conda update -q conda && \ + conda info -a && \ + conda env update -q -n root --file environment.yaml && \ + conda clean --tarballs --index-cache --lock diff --git a/docker/build.sh b/docker/build.sh new file mode 100644 index 0000000..180e2e5 --- /dev/null +++ b/docker/build.sh @@ -0,0 +1,19 @@ +set -ex + +cp ../VERSION ./VERSION +cp ../environment.yaml ./environment.yaml + +version=$(cat ./VERSION | sed -nre 's/^[^0-9]*(([0-9]+\.)*[0-9]+).*/\1/p') + +function cleanup { + rm ./VERSION + rm ./environment.yaml +} + +trap cleanup EXIT + + +docker build -t ceniklab/riboflow:latest . 
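+# Document what went into the image that was just built: dump the apt and conda
+# package lists into apt.list and conda.list (the sed call strips ANSI color
+# codes from the apt output).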
+docker run -it ceniklab/riboflow:latest apt list | sed 's/\x1b\[[0-9;]*m//g' > ./apt.list +docker run -it ceniklab/riboflow:latest conda list > ./conda.list +docker images diff --git a/docker/deploy.sh b/docker/deploy.sh new file mode 100644 index 0000000..7c416be --- /dev/null +++ b/docker/deploy.sh @@ -0,0 +1,9 @@ + +docker login -u ceniklab + +version=$(cat ../VERSION | sed -nre 's/^[^0-9]*(([0-9]+\.)*[0-9]+).*/\1/p') +echo "version: $version" + +# push the image +docker push ceniklab/riboflow:latest +docker push ceniklab/riboflow:$version diff --git a/docker/tag.sh b/docker/tag.sh new file mode 100644 index 0000000..c3be5db --- /dev/null +++ b/docker/tag.sh @@ -0,0 +1,8 @@ + +set -ex + +version=$(cat ../VERSION | sed -nre 's/^[^0-9]*(([0-9]+\.)*[0-9]+).*/\1/p') +echo "version: $version" + +# tag it +docker tag ceniklab/riboflow:latest ceniklab/riboflow:${version} diff --git a/environment.yaml b/environment.yaml new file mode 100644 index 0000000..115021a --- /dev/null +++ b/environment.yaml @@ -0,0 +1,26 @@ +name: ribo +channels: + - conda-forge + - defaults + - bioconda + - cyclus +dependencies: + - python=3.6 + - nextflow=19.04.1 + - openjdk=8 + - samtools>=1.4 + - sra-tools>=2.8.1 + - fastqc=0.11.8 + - setuptools + - pip + - numpy + - pandas + - matplotlib + - bowtie2=2.3.4.3 + - hisat2=2.1.0=py36h2d50403_1 + - pysam + - bedtools=2.27.1 + - cutadapt=1.18 + - pip: + - git+https://github.com/ribosomeprofiling/ribopy.git + - git+https://github.com/ribosomeprofiling/rfcommands.git diff --git a/project.yaml b/project.yaml new file mode 100644 index 0000000..56e3b76 --- /dev/null +++ b/project.yaml @@ -0,0 +1,200 @@ +# N E X T F L O W +########################################################################## +##### SAMPLE RIBOFLOW ARGUMENTS FILE WITH RNASEQ AND METADATA ######## +########################################################################## + +# Tested on version 19.04.1 + +# Perform fastqc at several stages of the pipeline +do_fastqc: true + +# Check existnece of fastq.gz files and bowtie2 reference files +do_check_file_existence : true + +# Remove duplicate reads based on their length +# and mapped position +deduplicate: true + +# If you have RNA-Seq data additionally, +# that you want to pair with your ribosome profiling data, +# you can set this flag to true +# AND PROVIDE RNA-Seq data +# under the key rnaseq in this file. See below. +# If you don't have RNA-Seq data, set this flag to false +do_rnaseq: true + +# If you don't have metadata set do_metadata to false. +# If you have metadata, provide yaml files for the experiments +# under input -> metadata below. +do_metadata: true + +# These arguments are used for clipping adapters by cutadapt. +# (see https://cutadapt.readthedocs.io/en/stable/guide.html ) +clip_arguments: '-u 1 -a CTGTAGGCACCATCAAT --overlap=4 --trimmed-only --maximum-length=40 --minimum-length=15 --quality-cutoff=28' + +# If you don't want to perform and adapter clipping, +# you can comment the above option and use the option below. +#clip_arguments: '--quality-cutoff=0' + +# Transcriptome alignments are filtered based on mapping quality. 
+# This is the threshold that the alignments need to pass for +# downstream quantification +mapping_quality_cutoff: 2 +
+############################################################################### +# Arguments for the aligner for +# corresponding steps +alignment_arguments: + # bowtie2 arguments for rtRNA filtering step + filter: '-L 15 --no-unal --norc' + + # bowtie2 arguments for transcriptome alignment step + transcriptome: '-L 15 --norc --no-unal' + + # hisat2 arguments + # use -k 1 so that each aligned read is reported once. + # otherwise, our read length analysis values might be inflated. + genome: '--no-unal -k 1' +
+############################################################################### +# RiboPy parameters for ribo file generation. +ribo: + ref_name: "appris-v1" + metagene_radius: 50 + left_span: 35 + right_span: 10 + read_length: + min: 28 + max: 32 + coverage: true +
+############################################################################### +# Output folder settings +# These entries typically don't need modifications. +# Note that everything is placed as a subfolder under the *base* folder +# *base* gives the actual folder location +# The other parameters are folder names that are going to be under the *base* +output: + individual_lane_directory: 'individual' + merged_lane_directory: 'merged' + intermediates: + # base is the root folder for the intermediate files + base: 'intermediates' + clip: 'clip' + log: 'log' + transcriptome_alignment: 'transcriptome_alignment' + filter: 'filter' + genome_alignment: 'genome_alignment' + bam_to_bed: 'bam_to_bed' + quality_filter: 'quality_filter' + # alignment_ribo folder contains the bed files + # that are used as input to RiboPy to create ribo files. + alignment_ribo: 'alignment_ribo' + output: + # base is the root folder for the output files + base: 'output' + log: 'log' + fastqc: 'fastqc' + ribo: 'ribo' +
+############################################################################### +# In this example we have two experiments with the names +# GSM1606107 and GSM1606108 +# These names are first introduced when providing fastq files +# for ribosome profiling data. (input -> fastq -> GSM1606107) and (input -> fastq -> GSM1606108) +# +# If metadata or RNA-Seq data are provided, they must match these names +# See below as an example. + +
+input: + reference: + # filter indicates bowtie2 index files + # * is used as a wild card to match all bowtie2 index files: + # human_rtRNA.1.bt2, human_rtRNA.2.bt2, .... + filter: ./rf_sample_data/filter/human_rtRNA* + + # transcriptome indicates bowtie2 index files + # Generated from isoform sequences. + transcriptome: ./rf_sample_data/transcriptome/appris_human_24_01_2019_selected* + + # Main annotation file. + # CDS and UTR regions are defined in this file. + regions: ./rf_sample_data/annotation/appris_human_24_01_2019_actual_regions.bed + + # Transcript lengths + transcript_lengths: ./rf_sample_data/annotation/appris_human_24_01_2019_selected.lengths.tsv + + ## Genome Alignment Reference + # Sequences that are NOT aligned to the transcriptome + # are mapped to the genome + # This parameter (and the corresponding step) is optional. + # Comment the line below to skip this step + #genome: ./rf_sample_data/genome/mock_hg38* + + # Reads NOT aligned to the genome are mapped to this reference + # This parameter (and the corresponding step) is optional. 
+ # Comment the line below to skip this step + #post_genome: ./rf_sample_data/post_genome/post_genome* +
+ # This will be prefixed to the file paths below + # You can leave it as empty "" if you provide complete paths. + fastq_base: "" + fastq: + # We have two ribosome profiling experiments called + # GSM1606107 and GSM1606108 + GSM1606107: + - ./rf_sample_data/fastq/ribosome_profiling/GSM1606107/SRR1795425.fastq.gz + - ./rf_sample_data/fastq/ribosome_profiling/GSM1606107/SRR1795426.fastq.gz + + GSM1606108: + - ./rf_sample_data/fastq/ribosome_profiling/GSM1606108/SRR1795427.fastq.gz + - ./rf_sample_data/fastq/ribosome_profiling/GSM1606108/SRR1795428.fastq.gz +
+ ## INPUTS BELOW THIS LINE ARE OPTIONAL + + # This is the metadata file stored at the root of the ribo file + # In this example, we are storing this yaml file + # Any valid yaml file can be stored as metadata. + root_meta: "./project.yaml" + + # The following metadata is stored under individual experiments. + metadata: + + # This will be prefixed to the file paths below + # You can leave it as empty "" if you provide complete paths. + base: "" + + # file keys (left hand side of ":") + # must match experiment names for ribosome profiling data above. + files: + GSM1606108: ./rf_sample_data/metadata/GSM1606108.yml + GSM1606107: ./rf_sample_data/metadata/GSM1606107.yml +
+############################################################################### +# If you have no RNA-Seq data to process, +# remove the "rnaseq" node from this yaml tree +rnaseq: + clip_arguments: '-u 1 --quality-cutoff=28' + + # This will be prefixed to the file paths below + # You can leave it as empty "" if you provide complete paths. + fastq_base: "" + + deduplicate: false + filter_arguments: '-L 15 --no-unal' + bt2_argumments: "-L 15 --no-unal" + + # Keys must match the experiment names for the ribosome profiling data + # In this particular example, valid keys are + # GSM1606107 and GSM1606108 + fastq: + GSM1606107: + - ./rf_sample_data/fastq/rnaseq/GSM1606099/SRR1795409.fastq.gz + - ./rf_sample_data/fastq/rnaseq/GSM1606099/SRR1795410.fastq.gz + GSM1606108: + - ./rf_sample_data/fastq/rnaseq/GSM1606100/SRR1795411.fastq.gz + - ./rf_sample_data/fastq/rnaseq/GSM1606100/SRR1795412.fastq.gz + +
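+# Illustration only (not used by the sample run above): since fastq_base is
+# simply prefixed to each entry, the RNA-Seq files could equivalently be listed
+# with a shared prefix, e.g.
+#
+# fastq_base: "./rf_sample_data/fastq/rnaseq"
+# fastq:
+#   GSM1606107:
+#     - GSM1606099/SRR1795409.fastq.gz
+#     - GSM1606099/SRR1795410.fastq.gz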