Classification of nuts step 2

In this session, we'll use the same images as in the tutorial “Classification of nuts step 1 - basic” but add a new class variable to classify the type of nut. You will also test two classification model types (PLS-DA and SIMCA).

If this is new to you, we recommend going through step 1 first. See Intro to Breeze: Classification of nuts step 1

Steps included in the tutorial

Import known class information to samples

Import the “true values” to be able to model and predict data later.

Press the “Import/Export” tab

For this tutorial, we already classified the true values in the previous step. So now we need to fetch those files.

Select “Import variables and id data”

Press Next

Choose Nuts_Classification_Train.CSV

Press Next

For this tutorial, leave the settings unchanged.

For other uses, choose the correct segmentation to import.

Press Finish.

In the Table view, you should see the new “Nut type” class variable that was imported. The reference values were automatically matched with the correct sample object.

The spreadsheet .CSV file that you imported looks like this when opened in Excel. The column “Measurement” matches the class data (“Nut or shell” and “Nut type”) to the correct images and samples.
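To make the matching concrete, here is a minimal Python sketch of how a file with these three columns could be read and keyed by measurement name. The column names come from the file described above. The rows, the comma delimiter, and the measurement names are made-up placeholders, not the contents of Nuts_Classification_Train.CSV, and this is not how Breeze itself performs the import.

```python
import csv
import io

# Hypothetical stand-in for the CSV contents (illustrative rows only).
CSV_TEXT = """Measurement,Nut or shell,Nut type
Almond_01,Nut,Almond
Almond_shell_01,Shell,Almond
Hazelnut_01,Nut,Hazelnut
"""


def load_class_table(text):
    """Map each measurement name to its class values."""
    reader = csv.DictReader(io.StringIO(text))
    table = {}
    for row in reader:
        table[row["Measurement"]] = {
            "Nut or shell": row["Nut or shell"],
            "Nut type": row["Nut type"],
        }
    return table


class_table = load_class_table(CSV_TEXT)
print(class_table["Hazelnut_01"]["Nut type"])
```

Keying the table on “Measurement” mirrors the role that column plays in the import: it is the link between a row of class values and the image it belongs to.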

Create classification model (PLS-DA)

Now you will create a Classification model for “Nut type”.

For this tutorial: In the 2nd and 3rd steps of the model wizard just press “Next” (use the default) so that you come to the 4th step (“Model”).

Create classification model (SIMCA)

Let’s compare the PLS-DA model with a different classification model type.

In the SIMCA method, one PCA model is created for each class.

All samples are then compared to that class model to determine if they belong to that class. In the Coomans plot, you can set the critical distance for each class model. If a sample is inside the critical distance it belongs to the class.
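The membership rule described above can be sketched in a few lines of Python. This is an illustration only: it assumes the distance from a sample to each class model has already been computed, it treats a sample exactly on the limit as inside, and the class names other than “Almond” and all numbers are hypothetical. It does not reproduce how Breeze computes the distances.

```python
def class_memberships(distances, critical):
    """Return the classes whose critical distance the sample falls within.

    distances: {class name: distance from the sample to that class model}
    critical:  {class name: critical distance chosen for that class model}
    """
    return sorted(name for name, d in distances.items() if d <= critical[name])


# Hypothetical distances for one sample and hypothetical limits.
sample_distances = {"Almond": 0.8, "Hazelnut": 2.4, "Walnut": 3.1}
critical_distances = {"Almond": 1.0, "Hazelnut": 1.2, "Walnut": 1.5}

print(class_memberships(sample_distances, critical_distances))
```

Because each class model is tested separately, a sample can end up inside one class, several classes, or none, which is why the function returns a list rather than a single class name.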

a) Select the class model for “Almond” by using the tabs under the Coomans plot

b) Drag the vertical red line to adjust the limit to include all “Almond” samples (but as few as possible of the other samples). Samples to the left of the red line are included in the Almond model.

Press the tab for each of the classes and repeat the steps in a) and b)

(“Overview (Total for all Y)” only shows how well each class model explains the samples in its own class. It does not show how well the model classifies other samples.)

Press “Finish” to complete the model

Screenshot: drag the lines to include samples.
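The trade-off made when dragging the red line can be expressed numerically. The sketch below is a rough illustration, not part of Breeze: it assumes you have a list of distances to the “Almond” class model together with the known class of each sample, and all values and the other class names are hypothetical. It finds the smallest limit that still includes every “Almond” sample and counts how many samples from other classes end up inside that limit.

```python
def smallest_inclusive_limit(distances, labels, target):
    """Smallest limit that still includes every sample of the target class."""
    return max(d for d, lab in zip(distances, labels) if lab == target)


def foreign_samples_inside(distances, labels, target, limit):
    """Count samples of other classes that fall inside the limit."""
    return sum(
        1 for d, lab in zip(distances, labels) if lab != target and d <= limit
    )


# Hypothetical distances to the "Almond" class model.
distances = [0.4, 0.7, 0.9, 1.6, 0.8, 2.3]
labels = ["Almond", "Almond", "Almond", "Hazelnut", "Walnut", "Walnut"]

limit = smallest_inclusive_limit(distances, labels, "Almond")
extra = foreign_samples_inside(distances, labels, "Almond", limit)
print(limit, extra)
```

If the count of foreign samples is high, moving the limit further left trades a few excluded samples of the target class for fewer false inclusions.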

Press the “Classification” tab to see how well the samples in the training data were classified.

Press the arrow to maximize the table view and the arrow to open the preview image.

Click on a field in the table to see the corresponding samples in the preview image.

In this example, only 1 sample was misclassified with the SIMCA classification (you might get slightly different results depending on how you set the critical distance in the previous step). This can be compared to 14 misclassified samples for the PLS-DA model.
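If you export the known and predicted classes, the same comparison can be made outside Breeze. The following Python sketch is a hedged example with made-up labels for two hypothetical models; it does not reproduce the counts reported above. It counts the misclassified samples and lists which class was mistaken for which.

```python
from collections import Counter


def count_misclassified(true_labels, predicted_labels):
    """Number of samples whose predicted class differs from the known class."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    return sum(t != p for t, p in zip(true_labels, predicted_labels))


def confusion_pairs(true_labels, predicted_labels):
    """Count each (known class, predicted class) pair that is wrong."""
    return Counter(
        (t, p) for t, p in zip(true_labels, predicted_labels) if t != p
    )


# Hypothetical known classes and predictions from two hypothetical models.
known = ["Almond", "Almond", "Hazelnut", "Hazelnut", "Walnut", "Walnut"]
model_a = ["Almond", "Almond", "Hazelnut", "Walnut", "Walnut", "Walnut"]
model_b = ["Almond", "Hazelnut", "Hazelnut", "Walnut", "Walnut", "Almond"]

print(count_misclassified(known, model_a), count_misclassified(known, model_b))
print(confusion_pairs(known, model_a))
```

The pair counts show where a model goes wrong, which a single misclassification total hides.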

Import the known class information for the test samples

To validate the model, use an external test set to see how well it classifies samples that were not in the training data set. We will now add the known class information to the image “Mix” in the “Test” group.

Create workflow and Import Record test data

Now let’s test the new models for “Nut type” in the Play mode. Press “Play”

Click on the first “Nut type” model to see the settings menu for that model on the right side (pull the vertical line to expand the size). In this menu, you can see the settings for the selected “Node” in your “Analyse Tree”.
In the “Model” drop-down menu you can see that this is the SIMCA model. So in the “Alias” field, write “SIMCA” and press enter on your keyboard.

Click on the 2nd “Nut type” model in the Analyse Tree and change to using the PLS-DA model in the “Model” drop-down menu. Write the Alias as “PLS-DA” and press enter. As you can see the text for each model has now been updated in the “Analyse Tree”.

Nice job! You have reached the end of step 2 of the “Classification of Nuts” tutorial. See step 3 at:

Classification of nuts step 3