Behavioral Testing Summary

The Behavioral Testing Summary displays a report with the failure rate of each test, on both the evaluation set and the training set. Currently, invariance tests are supported: the modification applied to an utterance should not change the predicted class.

Behavioral Testing in Key Concepts gives detailed information on the tests.

Screenshot

Table Content

  • Each test is identified by a family and a name. Tests are then further broken down by modification type, which is named after the verb in the modification's description.
  • The failure rate (FR) is the number of failed utterances divided by the total number of modified utterances generated for each test. Hover over the FR to see the average delta (absolute difference) in the prediction confidence, computed over the modified utterances whose predicted class remains the same as for the original utterance.
    • The failure rate on the training set is obtained by running the trained model on the modified training set. This can be useful to check the robustness of the model on data points it has already seen, before taking the extra step of assessing robustness on new data.
  • The last column shows one example from the dataset of a modification made to an original utterance.
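
The two numbers in the table can be illustrated with a minimal sketch. It is not the application's own implementation: the `ModifiedUtterance` record and both function names are hypothetical, and it assumes that an invariance test fails exactly when the predicted class changes after the modification.

```python
from dataclasses import dataclass


@dataclass
class ModifiedUtterance:
    """Hypothetical record pairing an original prediction with the
    prediction obtained on the modified utterance."""
    original_class: str
    original_confidence: float
    modified_class: str
    modified_confidence: float


def failure_rate(results):
    """Failed utterances divided by all modified utterances.

    Assumption: a failure means the predicted class changed.
    """
    if not results:
        return 0.0
    failed = sum(1 for r in results if r.modified_class != r.original_class)
    return failed / len(results)


def average_confidence_delta(results):
    """Mean absolute confidence difference, restricted to utterances
    whose predicted class stayed the same after the modification."""
    deltas = [
        abs(r.modified_confidence - r.original_confidence)
        for r in results
        if r.modified_class == r.original_class
    ]
    return sum(deltas) / len(deltas) if deltas else 0.0


# Made-up results for one test, for illustration only.
results = [
    ModifiedUtterance("A", 0.9, "A", 0.8),  # same class, delta 0.1
    ModifiedUtterance("A", 0.7, "B", 0.6),  # class changed: failure
    ModifiedUtterance("B", 0.6, "B", 0.9),  # same class, delta 0.3
    ModifiedUtterance("C", 0.5, "C", 0.5),  # same class, delta 0.0
]

print(f"FR: {failure_rate(results):.2%}")
print(f"Average delta: {average_confidence_delta(results):.3f}")
```

Running the same computation once on the modified evaluation set and once on the modified training set would give the two failure rate columns described above.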

Sort the table

Click the failure rate headers to sort the values in ascending or descending order.

Download

A summary of the test results can be downloaded, as well as the modified sets generated for both the evaluation set and training set. Download the required file by clicking Export in the upper right corner of the table and then selecting the appropriate item from the menu.

Augment custom utterances

See our Custom Utterances page to learn how to augment additional data.