# DeepTraCDR: Prediction Cancer Drug Response using multimodal deep learning with Transformers
<!-- 
## Abstract
<div align="justify">

</div> -->


## Method
![final result](https://git.dml.ir/zahra.asgari/DeepTraCDR/raw/branch/master/main/assest/final.png)

### Requirements 

To run this project, you need to install the required dependencies first. Execute the following command in your terminal or command prompt:

```bash
pip install -r requirements.txt
```


## DeepTraCDR Model Overview

DeepTraCDR is a modular model consisting of **Common Modules** and **Experimental Modules**.

### Common Modules
- **Data**: Includes datasets for model training and evaluation:
  - **GDSC**: Contains `cell_drug.csv` (log IC50 matrix), `cell_drug_binary.csv` (binary matrix), `merged_file_GDSC.csv` (gene expression), `drug_feature.csv` (drug fingerprints), `null_mask.csv` (null values), and `threshold.csv` (sensitivity threshold).
  - **CCLE**: Similar to GDSC with `cell_drug.csv`, `cell_drug_binary.csv`, `merged_file.csv`(gene expression), and `drug_feature.csv`.
  - **PDX**: Includes `pdx_response.csv` (binary patient-drug matrix), `pdx_exprs.csv` (gene expression), `pdx_null_mask.csv` (null values), and `drug_feature.csv`.
  - **TCGA**: Contains `patient_drug_binary.csv` (binary matrix), `tcga_exprs.csv` (gene expression), `tcga_null_mask.csv` (null values), and `drug_feature.csv`.


### Experimental Modules
The experimental modules are organized into the following directories, each containing a `main.py` script to run the respective experiment:

- **`case_study`**: Contains scripts for case study experiments (e.g., `main_case_study.py`).
- **`Scenario1`**: Includes experiments for random clearing cross-validation (`Random`) and regression (`Regression`).
- **`Scenario2`**: Includes experiments for single row/column clearing (`new`) and targeted drug experiments (`Target`).
- **`Scenario3`**: Includes external validation experiments from in vitro to in vivo (`External`).

Each `main.py` script outputs true and predicted test data values after multiple cross-validations. The `utils.py` file supports performance analysis with metrics like AUC, AUPRC, ACC, F1, and MCC. The model is built using PyTorch with CUDA support.