- python3.6
- pandas
- sklearn
- matplotlib
- imblearn
Update:
- TRAINFILENAME: The CSV file of training data
- TEST_FR: Fraction of training data to be kept aside for test data
- DISC_FEATURES_COL_TO_USE: Python list of column names containing discrete features in TRAINFILENAME
- CONT_FEATURES_COL_TO_USE: Python list of column names containing continuous features in TRAINFILENAME
- DISC_TARGET_COL_TO_USE: Name of the column containing the target variable in TRAINFILENAME
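
As a rough illustration, the settings above might be filled in as follows. The file name and every column name here are hypothetical placeholders, and `check_settings` is an illustrative helper that is not part of the script; it only shows the kind of sanity check that can catch a mistyped value before training.

```python
# Hypothetical example values; replace them with your own file and column names.
TRAINFILENAME = "train.csv"
TEST_FR = 0.2  # hold out 20% of the rows as test data
DISC_FEATURES_COL_TO_USE = ["gender", "region"]
CONT_FEATURES_COL_TO_USE = ["age", "income"]
DISC_TARGET_COL_TO_USE = "churned"


def check_settings(test_fr, disc_cols, cont_cols, target_col):
    """Return a list of problems found in the settings (empty if none).

    Illustrative helper only; the script itself may not perform these checks.
    """
    problems = []
    if not 0.0 < test_fr < 1.0:
        problems.append("TEST_FR must be strictly between 0 and 1")
    overlap = set(disc_cols) & set(cont_cols)
    if overlap:
        problems.append(
            "columns listed as both discrete and continuous: {}".format(sorted(overlap))
        )
    if target_col in disc_cols or target_col in cont_cols:
        problems.append("target column must not also be a feature")
    return problems


if __name__ == "__main__":
    print(check_settings(
        TEST_FR,
        DISC_FEATURES_COL_TO_USE,
        CONT_FEATURES_COL_TO_USE,
        DISC_TARGET_COL_TO_USE,
    ))
```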
- Type: argument passed to the training function in main(). Selects the model to use:
  - LR: Logistic Regression
  - SVM: Support Vector Classifier
  - RF: Random Forest
  - GBC: Gradient Boosting Classifier
  - Default (any other value): Naive Bayes
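
The mapping from the type argument to a model could look roughly like the sketch below. The function name `make_model` is hypothetical, and using `GaussianNB` for the Naive Bayes default is an assumption, since the variant is not specified here; the script's actual training function may be organised differently.

```python
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import GaussianNB
from sklearn.svm import SVC


def make_model(model_type=None):
    """Return an untrained scikit-learn classifier for the given type code.

    Hypothetical helper: unknown or missing codes fall back to Naive Bayes
    (GaussianNB is assumed here).
    """
    constructors = {
        "LR": LogisticRegression,
        "SVM": SVC,
        "RF": RandomForestClassifier,
        "GBC": GradientBoostingClassifier,
    }
    # dict.get supplies the Naive Bayes fallback for any unrecognised code.
    return constructors.get(model_type, GaussianNB)()


if __name__ == "__main__":
    for code in ("LR", "SVM", "RF", "GBC", "anything else"):
        print(code, "->", type(make_model(code)).__name__)
```

All models are created with scikit-learn's default hyperparameters in this sketch; pass keyword arguments to the constructors if you need to tune them.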
Run: As a usual Python script