PDX

Overview
Software Requirements
Installation Guide
How to Run
License

Overview

This project establishes an optimal predictive model to screen lung cancer patients for NOG/PDX models. It also offers a general machine-learning approach to building prediction models from small, unbalanced biomedical samples.

Software Requirements

OS Requirements

This project supports Linux and Windows and has been tested on the following systems:

Linux: Ubuntu 16.04
Windows: Windows 10

Python Dependencies

The code is compatible with Python 3.7. The following dependencies are needed to run the training or testing tasks:

numpy
pandas
scikit-learn (imported as sklearn)
smote-variants
xgboost
catboost
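
Before running the scripts, it can help to confirm that each dependency is importable in the active environment. The snippet below is a minimal sketch, not part of the repository: the helper name missing_modules and the list REQUIRED_MODULES are hypothetical, and the import names are assumed from the package list above.

```python
import importlib.util

# Import names assumed for the packages listed above. Pip package names can
# differ from import names (for example, smote-variants is imported as
# smote_variants).
REQUIRED_MODULES = [
    "numpy",
    "pandas",
    "sklearn",
    "smote_variants",
    "xgboost",
    "catboost",
]


def missing_modules(names):
    """Return the top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_modules(REQUIRED_MODULES)
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All required modules are importable.")
```

If any module is reported missing, installing the dependencies as described in How to Run should resolve it.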

Installation Guide

Clone the repository from GitHub:

git clone https://github.com/dddtqshmpmz/PDX.git

How to Run

Install all the dependencies.
- pip install -r requirements.txt
Pre-process data.
- The original dataset (original_data.csv) is randomly resampled to generate 100 datasets, which are saved in the /tmpData100 directory. The datasets are not provided due to data privacy.
Use different machine learning methods (CatBoost, XGBoost, SVM and LR) to train the PDX prediction models.
- python classify_with_smote.py: Train and test different models with or without SMOTE. The mean scores (AUC, precision, recall, accuracy and F1-score) of the models are saved in the /score directory.
- python cross_validation.py: Use K-fold cross-validation to get classification scores on the train/val/test datasets.
- python ensemble_learning.py: Integrate multiple models to get the final test scores.
- Results for different models with different feature selections are in the /scoreWithDifferentFeatures directory.
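
To illustrate the idea behind SMOTE, which the training step can apply to the unbalanced data, here is a minimal pure-Python sketch of its basic interpolation step. It is not the repository's code and not the smote-variants implementation: the function names smote_oversample and balance, the toy data, and the default k are all assumptions made for illustration.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two feature vectors."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def smote_oversample(minority, n_new, k=3, seed=0):
    """Create n_new synthetic rows by interpolating between minority rows."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Indices of the k nearest minority neighbours of `base`, excluding itself.
        neighbours = sorted(
            (j for j in range(len(minority)) if j != i),
            key=lambda j: squared_distance(base, minority[j]),
        )[:k]
        other = minority[rng.choice(neighbours)]
        gap = rng.random()  # position along the segment from base to other
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic


def balance(X, y, minority_label=1, k=3, seed=0):
    """Append synthetic minority rows until both classes have equal counts."""
    minority = [row for row, label in zip(X, y) if label == minority_label]
    n_new = (len(X) - len(minority)) - len(minority)
    if n_new <= 0:
        return list(X), list(y)
    new_rows = smote_oversample(minority, n_new, k=k, seed=seed)
    return list(X) + new_rows, list(y) + [minority_label] * len(new_rows)


# Hypothetical toy data: six majority rows (label 0), two minority rows (label 1).
X = [[0.0, 0.0], [0.1, 0.2], [0.2, 0.1], [0.3, 0.3], [0.2, 0.4], [0.4, 0.2],
     [1.0, 1.0], [1.2, 0.8]]
y = [0, 0, 0, 0, 0, 0, 1, 1]

X_bal, y_bal = balance(X, y)
print(y_bal.count(0), y_bal.count(1))  # prints "6 6"
```

In a typical workflow, oversampling of this kind is applied to the training split only, so that the test scores are computed on real samples.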

License

This project is licensed under the Apache License 2.0.
