Releases
[2.5.3] - 2022-02-16
Fixed
- Support truncation for HF pipelines.
[2.5.2] - 2022-12-20
Fixed
- Show long utterances fully on hover in similar and perturbed utterances tables.
- Fixed webapp crash when there is no pipeline (with
"pipeline": null
configured). - Fixed sort utterance table by confidence or prediction without post-processing.
- Fixed crash on Safari and iOS browsers as they don't support lookbehind in regular expressions.
[2.5.1] - 2022-12-05
Fixed
- Ignore
language
environment variable as we have seen conflicts with it on certain machines.
[2.5.0] - 2022-12-05
Added
- Support for french: Azimuth now works on French datasets (and pipelines)! Language can be selected in the config, and language-specific defaults for syntax-tagging and behavioral tests (neutral tokens) will be set dynamically (or can be altered manually). See Reference.
- Class Overlap: New class overlap detection section. More details are available in the User Guide and in the Key Concepts.
- Pre/Post-processing Steps: The details of the pipeline pre/post-processing steps are now visible in the utterance details page.
- Accuracy metric: Accuracy was added as a default metric.
Changed
- The default port for the front end will now be 3000, the React default, instead of 3005.
Fixed
- ONNX models on GPU.
- Preserve white spaces in utterances. That includes
\n
s,\t
s, and consecutive spaces. - Fix predictions with 100% confidence not showing up in the confidence histogram and in the ECE.
[2.4.1] - 2022-10-21
Fixed
- Fixed Smart Tab Analysis table
- Column headers exceeding plots' width
- Links to Prediction Overview
Total
row squished when transposed and with a high number of rows
[2.4.0] - 2022-10-20
Added
- New dataset warning: Added new class imbalance warnings.
- Pipeline Comparison: Added a new pipeline comparison mode in the pipeline metrics table to compare the metrics on different pipelines.
- New Smart Tag Analysis: Added a new plot where smart tag patterns over classes can be easily examined in one view.
- New loading screen: When the start-up tasks are running, a new loading screen displays the status of all the tasks.
Changed
- Renaming: Some sections were renamed in the UI, such as:
- Dataset Class Distribution Analysis -> Dataset Warnings
- Performance Analysis -> Pipeline Metrics by Data Subpopulation
- Performance Overview -> Prediction Overview
- Proposed actions: We added a new action,
merge_classes
, and renamedconsider_new_class
todefine_new_class
. - Improved Confusion Matrix: The order of the classes in the confusion matrix is now smarter: classes where the model gets similarly confused will be closer to one another. The rejection class is always the last row/column in the confusion matrix. A toggle allows the user to keep the original order from the dataset if preferred.
- Refactoring: We improved the
MetricsPerFilter
module (which generates the pipeline metrics by data subpopulation table). It now takes ~5 times less time to compute. - New config fields: The memory of the dask cluster can now be set to large (12GB) for bigger models. The config can also be in read-only mode, to prevent users from changing its values.
- Offline Mode: Azimuth can now be launched without internet.
Fixed
- Fixed an issue related to HuggingFace where filtering on an empty dataset would result in an error.
[2.3.0] - 2022-08-17
Added
- Rows of Performance Analysis table now link to exploration page with filters applied, when the user clicks.
- Added visual bars to the similarity column in the semantically similar utterances.
- New attribute in the config allows users to change the thresholds that determine a short or long sentence.
Fixed
- Fixed utterances table poorly showing ids greater than 9999.
- Fixed filtering of aggregation modules without post-processing.
- Fixed high_epistemic_uncertainty smart tag which wasn't showing in the UI.
- Fixed crash when hitting
See more
when it would show over 100 rows in Performance Analysis table.
[2.2.3] - 2022-07-25
Fixed
- Fixed losing hidden columns from Performance Analysis table when changing view (Label, Prediction, etc.).
- Fixed utterances table poorly showing ids greater than 99, now supporting up to 9999.
[2.2.2] - 2022-07-19
Fixed
- Fixed losing hidden columns when sorting Performance Analysis table, clicking
See more
/less
, or switching dataset split or pipeline.
[2.2.1] - 2022-07-15
Fixed
- Fixed pipeline comparison smart tag family not working as a filter on the Exploration space.
[2.2.0] - 2022-07-08
Added
- Contextual information has been added to most functionalities, linking directly to the relevant documentation.
- The F1 metric was added as a default metric.
Changed
- Smart tags are now grouped in families, so it is easier to digest them. This also impacts how filters are displayed.
- The confusion matrix can be shown with raw values (not just normalized).
- We now show the top 20, instead of 10, most similar examples in the utterances details.
- Renamed our docker images to use
servicenowdocker
registry on Docker Hub.
Performance Overview Table
- Columns can be temporarily hidden.
- Use outcomes' icons as headers instead of the outcomes' full names.
- Added visual bars, so it is easier to analyze the model's performance.
[2.1.1] - 2022-06-06
Changed
- Our Docker images are now available through Docker Hub
- Renamed our docker images to use
servicenowdocker
registry on Docker Hub. - Users can now use
make launch
instead ofmake compose
.
- Renamed our docker images to use
- Add option
DEVICE=auto
to automatically run Azimuth on GPU if available. As such,DEVICE=cpu
orDEVICE=gpu
does not need to be specified. - We added analytics in our documentation, which will add a pop-up to accept cookies when users first access the page.
Fixed
- Fixed issue where the performance analysis table showed empty results for some smart tags.
- Fixed
BEHAVIORAL_TESTING
environment variable. - Fixed unexpected borders in Performance Analysis table.
- Fixed proposed actions which couldn't be applied on an utterance in the latest release.
[2.1.0] - 2022-05-27
- Ability to get predictions without postprocessing in the exploration space. See section "Excluding Post-Processing" here.
- New Smart Tags
pipeline_disagreement
andincorrect_for_all_pipelines
as a first step for pipeline comparison. See section "Pipeline Comparison" here. - Links on top words to filter utterances that contain it. See the section "Word Clouds" here.
[2.0.0] - 2022-04-12
First public release.