Selecting rows in the Model wizard
The Select pane on the right side of the Model wizard has commands for quickly selecting rows in the table, so that you can act on them while creating the model.
The Select rows by buttons offer various commands for selecting rows in the table:
- Interval: select evenly spaced rows
- Class: select rows matching a class
- Property value: select rows that have certain property values
- Random percentage: select a percentage of rows randomly
- Model binning: select a percentage of rows by using the model space to distribute the selection evenly. This can be used to remove sparse outliers (noise) and keep the most representative data, resulting in a more stable model.
  This works by grouping the scatter plot points into a 2D grid (similar to a 2D histogram). Density is the number of points that fall within the same grid cell. The percentage threshold selects only points whose density is above the given fraction of the maximum density.
- Included, Excluded: select rows that are Included in the model, or Excluded from the model.
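The density threshold behind Model binning can be illustrated with a short NumPy sketch. This is not the wizard's actual implementation: the function name `select_by_model_binning`, the `bins` grid size, and the `threshold` parameter are hypothetical, and the grid resolution the software uses is not stated here. The sketch only shows the idea of counting points per grid cell and keeping the points that lie in sufficiently dense cells.

```python
import numpy as np

def select_by_model_binning(x, y, threshold=0.2, bins=20):
    """Return a boolean mask of points that lie in dense grid cells.

    x, y      : coordinates of each row in the 2D model space (scatter plot)
    threshold : fraction of the maximum cell density, 0..1 (hypothetical name)
    bins      : number of grid cells per axis (hypothetical, assumed value)
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)

    # Count how many points fall into each cell of the 2D grid.
    counts, x_edges, y_edges = np.histogram2d(x, y, bins=bins)

    # Look up the cell index of every point. Using only the interior
    # edges keeps the indices in the range 0..bins-1.
    ix = np.digitize(x, x_edges[1:-1])
    iy = np.digitize(y, y_edges[1:-1])

    # Density of a point = number of points sharing its grid cell.
    density = counts[ix, iy]

    # Keep points whose density exceeds the given fraction of the maximum.
    return density > threshold * counts.max()
```

Calling `select_by_model_binning(x, y, threshold=0.2)` on the two score columns returns a mask that can be used to index the rows. Isolated points in sparsely populated cells fall below the threshold and are left unselected.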
You can also select rows in the table manually by clicking. Use Shift + Click to select a region, or Ctrl + Click to select disjoint rows.
After the rows are selected, use the toolbar above the table to act on them, for example to exclude them from the model or to set them as test data.
Subpages
- Cross-validation: evaluates model performance by predicting unseen data, using methods such as random, stratified, and leave-one-out to check robustness and generalizability.