Configuring Machine Learning Models | |
Configuring a model to make it ready for training involves specifying the data dimension fields, a label field if necessary, any required transformers and a learner. You can get to the model configuration page by finding the model in the Machine Learning List Models page, click on the model's actions button and select Edit.
|
Selecting Dimension and Label Fields | |
Dimension fields are chosen from a list of fields gotten from the data source tracker. These are shown in a multiselect list interface. Select a field by clicking on it. Select multiple fields by holding down the
The label field is the data attribute that contains the target to be predicted. A label field is required if the chosen learner is a classifier.
|
Handling Empty Data Values | |
Before a sample is used for training, Tiki by default will replace empty numeric fields with 0. Empty categorical fields will remain as empty strings. If you do not want this behaviour, you can make Tiki to simply ignore samples with empty fields by checking the Ignore items with empty values option.
|
Adding Transformers and Learners | |
You use transformers to preprocess data before model training. A learner is a machine learning algorithm on which the machine learning model will be based. The type of transformers and learner you choose will depend on the structure and format of the training data and the type of target that you want to predict.
Tiki internally uses Rubix ML for its Machine Learning functionality, so only transformers and learners available in Rubix ML are supported by Tiki. Due to Tiki Tracker's robust nature, some data transformations might not be necessary. For example, Numeric String Converter works by converting all numeric values that have been given as categorical values to their equivalent integer and floating point types. Tiki will handle this automatically if the given values belong to a numeric field type in the source tracker. Applying the least possible number of transformers will help reduce model latency. |
Related links | |