| ### Preprocess data | ### Preprocess data | ||||
| TODO | |||||
| There are a number of scripts involved in this stage. | |||||
| - `expressionSummarizer.r` prepares expression data columns and Id values | |||||
| - `geneIdConverter.r` uses a Bioconductor annotation package to unify gene symbols | |||||
| - `prepare_mutation.py` prepares mutation data by removing useless columns and unwanted rows. | |||||
| - `preprocessing.py` this code generates a summary for each cancer that will be further used to aggregate in the next stage | |||||
| by running each python script with a `--help` flag you will learn more about input args. |