2 years ago · 5f7ebec844
--- a/1-Preprocessing/README.md
+++ b/1-Preprocessing/README.md
 ### Preprocess data
 TODO
 There are a number of scripts involved in this stage.
 - `expressionSummarizer.r` prepares expression data columns and Id values
 - `geneIdConverter.r` uses a Bioconductor annotation package to unify gene symbols
 - `prepare_mutation.py` prepares mutation data by removing useless columns and unwanted rows.
 - `preprocessing.py` this code generates a summary for each cancer that will be further used to aggregate in the next stage
 by running each python script with a `--help` flag you will learn more about input args.