Using NIFty to Train a Classification Model

It is possible to just train a classification model when running NIFty. Running NIFty in this mode presupposes that, at some point, you have used NIFty to generate relevant features.

The code and descriptive examples below will help you run NIFty is find_features only mode:

Minimum Required Changes to Configurations:

To run NIFty in this mode, the following are the minimum required changes to the default configuration file:

find_features = false
train_model = true
apply_model = false
reference_quant_file = "your/path/to/reference/quant.tsv"
reference_meta_file = "your/path/to/reference/meta.tsv"
feature_file = "your/path/to/selected_features.tsv"

NOTE: NIFty, by default, only requires one reference dataset to both train and validate a classifier. When run in this mode, NIFty internally splits the reference dataset into two, non-overlapping sets: a training/testing set, and a validation set. Alternatively, if you want to split your reference data yourself (or use multiple reference datasets for different portions of the pipeline), you can make these additional changes to the configuration file:

input_files = "individual"
train_quant_file = "your/path/to/training/testing/quant.tsv"
train_meta_file = "your/path/to/training/testing/meta.tsv"
validate_quant_file = "your/path/to/validation/quant.tsv"
validate_meta_file = "your/path/to/validation/meta.tsv"

Expected Output Files:

trained_model_and_model_metadata.pkl
model_information.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using NIFty to Train a Classification Model

Minimum Required Changes to Configurations:

Expected Output Files:

FilesExpand file tree

run_model_training.md

Latest commit

History

run_model_training.md

File metadata and controls

Using NIFty to Train a Classification Model

Minimum Required Changes to Configurations:

Expected Output Files: