fiftyone.brain¶

Module contents¶

The brains behind FiftyOne: a powerful package for dataset curation, analysis, and visualization.

See https://github.com/voxel51/fiftyone for more information.

Copyright 2017-2023, Voxel51, Inc.
voxel51.com

Functions:

`compute_hardness`(samples, label_field[, …])	Adds a hardness field to each sample scoring the difficulty that the specified label field observed in classifying the sample.
`compute_mistakenness`(samples, pred_field, …)	Computes the mistakenness (likelihood of being incorrect) of the labels in `label_field` based on the predcted labels in `pred_field`.
`compute_uniqueness`(samples[, …])	Adds a uniqueness field to each sample scoring how unique it is with respect to the rest of the samples.
`compute_visualization`(samples[, …])	Computes a low-dimensional representation of the samples’ media or their patches that can be interactively visualized.
`compute_similarity`(samples[, patches_field, …])	Uses embeddings to index the samples or their patches so that you can query/sort by similarity.
`compute_exact_duplicates`(samples[, …])	Detects duplicate media in a sample collection.

fiftyone.brain.compute_hardness(samples, label_field, hardness_field='hardness', progress=None)¶

Adds a hardness field to each sample scoring the difficulty that the specified label field observed in classifying the sample.

Hardness is a measure computed based on model prediction output (through logits) that summarizes a measure of the uncertainty the model had with the sample. This makes hardness quantitative and can be used to detect things like hard samples, annotation errors during noisy training, and more.

All classifications must have their logits attributes populated in order to use this method.

Note

Runs of this method can be referenced later via brain key hardness_field.

Parameters

samples – a fiftyone.core.collections.SampleCollection
label_field – the fiftyone.core.labels.Classification or fiftyone.core.labels.Classifications field to use from each sample
hardness_field ("hardness") – the field name to use to store the hardness value for each sample
progress (None) – whether to render a progress bar (True/False), use the default value fiftyone.config.show_progress_bars (None), or a progress callback function to invoke instead

fiftyone.brain.compute_mistakenness(samples, pred_field, label_field, mistakenness_field='mistakenness', missing_field='possible_missing', spurious_field='possible_spurious', use_logits=False, copy_missing=False, progress=None)¶

Computes the mistakenness (likelihood of being incorrect) of the labels in label_field based on the predcted labels in pred_field.

Mistakenness is measured based on either the confidence or logits of the predictions in pred_field. This measure can be used to detect things like annotation errors and unusually hard samples.

For classifications, a mistakenness_field field is populated on each sample that quantifies the likelihood that the label in the label_field of that sample is incorrect.

For objects (detections, polylines, keypoints, etc), the mistakenness of each object in label_field is computed, using fiftyone.core.collections.SampleCollection.evaluate_detections() to locate corresponding objects in pred_field. Three types of mistakes are identified:

(Mistakes) Objects in label_field with a match in pred_field are assigned a mistakenness value in their mistakenness_field that captures the likelihood that the class label of the object in label_field is a mistake. A mistakenness_field + "_loc" field is also populated that captures the likelihood that the object in label_field is a mistake due to its localization (bounding box).
(Missing) Objects in pred_field with no matches in label_field but which are likely to be correct will have their missing_field attribute set to True. In addition, if copy_missing is True, copies of these objects are added to the ground truth label_field.
(Spurious) Objects in label_field with no matches in pred_field but which are likely to be incorrect will have their spurious_field attribute set to True.

In addition, for objects, the following sample-level fields are populated:

(Mistakes) The mistakenness_field of each sample is populated with the maximum mistakenness of the objects in label_field
(Missing) The missing_field of each sample is populated with the number of missing objects that were deemed missing from label_field.
(Spurious) The spurious_field of each sample is populated with the number of objects in label_field that were given deemed spurious.

Note

Runs of this method can be referenced later via brain key mistakenness_field.

Parameters

samples – a fiftyone.core.collections.SampleCollection
pred_field – the name of the predicted label field to use from each sample. Can be of type fiftyone.core.labels.Classification, fiftyone.core.labels.Classifications, fiftyone.core.labels.Detections, fiftyone.core.labels.Polylines, fiftyone.core.labels.Keypoints, or fiftyone.core.labels.TemporalDetections
label_field – the name of the “ground truth” label field that you want to test for mistakes with respect to the predictions in pred_field. Must have the same type as pred_field
mistakenness_field ("mistakenness") – the field name to use to store the mistakenness value for each sample
missing_field ("possible_missing) – the field in which to store per-sample counts of potential missing objects
spurious_field ("possible_spurious) – the field in which to store per-sample counts of potential spurious objects
use_logits (False) – whether to use logits (True) or confidence (False) to compute mistakenness. Logits typically yield better results, when they are available
copy_missing (False) – whether to copy predicted objects that were deemed to be missing into label_field
progress (None) – whether to render a progress bar (True/False), use the default value fiftyone.config.show_progress_bars (None), or a progress callback function to invoke instead

fiftyone.brain.compute_uniqueness(samples, uniqueness_field='uniqueness', roi_field=None, embeddings=None, model=None, model_kwargs=None, force_square=False, alpha=None, batch_size=None, num_workers=None, skip_failures=True, progress=None)¶

Adds a uniqueness field to each sample scoring how unique it is with respect to the rest of the samples.

This function only uses the pixel data and can therefore process labeled or unlabeled samples.

If no embeddings or model is provided, a default model is used to generate embeddings.

Note

Runs of this method can be referenced later via brain key uniqueness_field.

Parameters

samples – a fiftyone.core.collections.SampleCollection
uniqueness_field ("uniqueness") – the field name to use to store the uniqueness value for each sample
roi_field (None) – an optional fiftyone.core.labels.Detection, fiftyone.core.labels.Detections, fiftyone.core.labels.Polyline, or fiftyone.core.labels.Polylines field defining a region of interest within each image to use to compute uniqueness
embeddings (None) –
if no model is provided, this argument specifies pre-computed embeddings to use, which can be any of the following:
- a num_samples x num_dims array of embeddings
- if roi_field is specified, a dict mapping sample IDs to num_patches x num_dims arrays of patch embeddings
- the name of a dataset field containing the embeddings to use
If a model is provided, this argument specifies the name of a field in which to store the computed embeddings. In either case, when working with patch embeddings, you can provide either the fully-qualified path to the patch embeddings or just the name of the label attribute in roi_field
model (None) – a fiftyone.core.models.Model or the name of a model from the FiftyOne Model Zoo to use to generate embeddings. The model must expose embeddings (model.has_embeddings = True)
model_kwargs (None) – a dictionary of optional keyword arguments to pass to the model’s Config when a model name is provided
force_square (False) – whether to minimally manipulate the patch bounding boxes into squares prior to extraction. Only applicable when a model and roi_field are specified
alpha (None) – an optional expansion/contraction to apply to the patches before extracting them, in [-1, inf). If provided, the length and width of the box are expanded (or contracted, when alpha < 0) by (100 * alpha)%. For example, set alpha = 1.1 to expand the boxes by 10%, and set alpha = 0.9 to contract the boxes by 10%. Only applicable when a model and roi_field are specified
batch_size (None) – a batch size to use when computing embeddings. Only applicable when a model is provided
num_workers (None) – the number of workers to use when loading images. Only applicable when a Torch-based model is being used to compute embeddings
skip_failures (True) – whether to gracefully continue without raising an error if embeddings cannot be generated for a sample
progress (None) – whether to render a progress bar (True/False), use the default value fiftyone.config.show_progress_bars (None), or a progress callback function to invoke instead

fiftyone.brain.compute_visualization(samples, patches_field=None, embeddings=None, points=None, brain_key=None, num_dims=2, method=None, model=None, model_kwargs=None, force_square=False, alpha=None, batch_size=None, num_workers=None, skip_failures=True, progress=None, **kwargs)¶

Computes a low-dimensional representation of the samples’ media or their patches that can be interactively visualized.

The representation can be visualized by calling the visualize() method of the returned fiftyone.brain.visualization.VisualizationResults object.

If no embeddings or model is provided, the following default model is used to generate embeddings:

import fiftyone.zoo as foz

model = foz.load_zoo_model("mobilenet-v2-imagenet-torch")

You can use the method parameter to select the dimensionality reduction method to use, and you can optionally customize the method by passing additional parameters for the method’s fiftyone.brain.visualization.VisualizationConfig class as kwargs.

The builtin method values and their associated config classes are:

"umap": fiftyone.brain.visualization.UMAPVisualizationConfig
"tsne": fiftyone.brain.visualization.TSNEVisualizationConfig
"pca": fiftyone.brain.visualization.PCAVisualizationConfig
"manual": fiftyone.brain.visualization.ManualVisualizationConfig

Parameters

samples – a fiftyone.core.collections.SampleCollection
patches_field (None) – a sample field defining the image patches in each sample that have been/will be embedded. Must be of type fiftyone.core.labels.Detection, fiftyone.core.labels.Detections, fiftyone.core.labels.Polyline, or fiftyone.core.labels.Polylines
embeddings (None) –
if no model is provided, this argument specifies pre-computed embeddings to use, which can be any of the following:
- a dict mapping sample IDs to embedding vectors
- a num_samples x num_embedding_dims array of embeddings corresponding to the samples in samples
- if patches_field is specified, a dict mapping label IDs to to embedding vectors
- if patches_field is specified, a dict mapping sample IDs to num_patches x num_embedding_dims arrays of patch embeddings
- the name of a dataset field containing the embeddings to use
- a fiftyone.brain.similarity.SimilarityIndex from which to retrieve embeddings for all samples/patches in samples
If a model is provided, this argument specifies the name of a field in which to store the computed embeddings. In either case, when working with patch embeddings, you can provide either the fully-qualified path to the patch embeddings or just the name of the label attribute in patches_field
points (None) –
a pre-computed low-dimensional representation to use. If provided, no embeddings will be used/computed. Can be any of the following:
- a dict mapping sample IDs to points vectors
- a num_samples x num_dims array of points corresponding to the samples in samples
- if patches_field is specified, a dict mapping label IDs to points vectors
- if patches_field is specified, a num_patches x num_dims array of points whose rows correspond to the flattened list of patches whose IDs are shown below:
```
# The list of patch IDs that the rows of `points` must match
_, id_field = samples._get_label_field_path(patches_field, "id")
patch_ids = samples.values(id_field, unwind=True)
```
brain_key (None) – a brain key under which to store the results of this method
num_dims (2) – the dimension of the visualization space
method (None) – the dimensionality reduction method to use. The supported values are fiftyone.brain.brain_config.visualization_methods.keys() and the default is fiftyone.brain.brain_config.default_visualization_method
model (None) –
a fiftyone.core.models.Model or the name of a model from the FiftyOne Model Zoo to use to generate embeddings. The model must expose embeddings (model.has_embeddings = True)
model_kwargs (None) – a dictionary of optional keyword arguments to pass to the model’s Config when a model name is provided
force_square (False) – whether to minimally manipulate the patch bounding boxes into squares prior to extraction. Only applicable when a model and patches_field are specified
alpha (None) – an optional expansion/contraction to apply to the patches before extracting them, in [-1, inf). If provided, the length and width of the box are expanded (or contracted, when alpha < 0) by (100 * alpha)%. For example, set alpha = 1.1 to expand the boxes by 10%, and set alpha = 0.9 to contract the boxes by 10%. Only applicable when a model and patches_field are specified
batch_size (None) – an optional batch size to use when computing embeddings. Only applicable when a model is provided
num_workers (None) – the number of workers to use when loading images. Only applicable when a Torch-based model is being used to compute embeddings
skip_failures (True) – whether to gracefully continue without raising an error if embeddings cannot be generated for a sample
progress (None) – whether to render a progress bar (True/False), use the default value fiftyone.config.show_progress_bars (None), or a progress callback function to invoke instead
**kwargs – optional keyword arguments for the constructor of the fiftyone.brain.visualization.VisualizationConfig being used

Returns

a fiftyone.brain.visualization.VisualizationResults

fiftyone.brain.compute_similarity(samples, patches_field=None, embeddings=None, brain_key=None, model=None, model_kwargs=None, force_square=False, alpha=None, batch_size=None, num_workers=None, skip_failures=True, progress=None, backend=None, **kwargs)¶

Uses embeddings to index the samples or their patches so that you can query/sort by similarity.

Calling this method only creates the index. You can then call the methods exposed on the retuned fiftyone.brain.similarity.SimilarityIndex object to perform the following operations:

sort_by_similarity(): Sort the samples in the collection by similarity to a specific example or example(s)

All indexes support querying by image similarity by passing sample IDs to sort_by_similarity(). In addition, if you pass the name of a model from the FiftyOne Model Zoo like model="clip-vit-base32-torch" that can embed prompts to this method, then you can query the index by text similarity as well.

In addition, if the backend supports it, you can call the following duplicate detection methods:

find_duplicates(): Query the index to find all examples with near-duplicates in the collection
find_unique(): Query the index to select a subset of examples of a specified size that are maximally unique with respect to each other

If no embeddings or model is provided, the following default model is used to generate embeddings:

import fiftyone.zoo as foz

model = foz.load_zoo_model("mobilenet-v2-imagenet-torch")

Parameters

samples – a fiftyone.core.collections.SampleCollection
patches_field (None) – a sample field defining the image patches in each sample that have been/will be embedded. Must be of type fiftyone.core.labels.Detection, fiftyone.core.labels.Detections, fiftyone.core.labels.Polyline, or fiftyone.core.labels.Polylines
embeddings (None) –
embeddings to feed the index. This argument’s behavior depends on whether a model is provided, as described below.

If no model is provided, this argument specifies pre-computed embeddings to use:
- a num_samples x num_dims array of embeddings
- if patches_field is specified, a dict mapping sample IDs to num_patches x num_dims arrays of patch embeddings
- the name of a dataset field from which to load embeddings
- None: use the default model to compute embeddings
- False: do not compute embeddings right now
If a model is provided, this argument specifies where to store the model’s embeddings:
- the name of a field in which to store the computed embeddings
- False: do not compute embeddings right now
In either case, when working with patch embeddings, you can provide either the fully-qualified path to the patch embeddings or just the name of the label attribute in patches_field
brain_key (None) – a brain key under which to store the results of this method
model (None) –
a fiftyone.core.models.Model or the name of a model from the FiftyOne Model Zoo to use, or that was already used, to generate embeddings. The model must expose embeddings (model.has_embeddings = True)
model_kwargs (None) – a dictionary of optional keyword arguments to pass to the model’s Config when a model name is provided
force_square (False) – whether to minimally manipulate the patch bounding boxes into squares prior to extraction. Only applicable when a model and patches_field are specified
alpha (None) – an optional expansion/contraction to apply to the patches before extracting them, in [-1, inf). If provided, the length and width of the box are expanded (or contracted, when alpha < 0) by (100 * alpha)%. For example, set alpha = 1.1 to expand the boxes by 10%, and set alpha = 0.9 to contract the boxes by 10%. Only applicable when a model and patches_field are specified
batch_size (None) – an optional batch size to use when computing embeddings. Only applicable when a model is provided
num_workers (None) – the number of workers to use when loading images. Only applicable when a Torch-based model is being used to compute embeddings
skip_failures (True) – whether to gracefully continue without raising an error if embeddings cannot be generated for a sample
progress (None) – whether to render a progress bar (True/False), use the default value fiftyone.config.show_progress_bars (None), or a progress callback function to invoke instead
backend (None) – the similarity backend to use. The supported values are fiftyone.brain.brain_config.similarity_backends.keys() and the default is fiftyone.brain.brain_config.default_similarity_backend
**kwargs – keyword arguments for the fiftyone.brian.SimilarityConfig subclass of the backend being used

Returns

a fiftyone.brain.similarity.SimilarityIndex

fiftyone.brain.compute_exact_duplicates(samples, num_workers=None, skip_failures=True, progress=None)¶

Detects duplicate media in a sample collection.

This method detects exact duplicates with the same filehash. Use compute_similarity() to detect near-duplicate images.

If duplicates are found, the first instance in samples will be the key in the returned dictionary, while the subsequent duplicates will be the values in the corresponding list.

Parameters

samples – a fiftyone.core.collections.SampleCollection
num_workers (None) – an optional number of processes to use
skip_failures (True) – whether to gracefully ignore samples whose filehash cannot be computed
progress (None) – whether to render a progress bar (True/False), use the default value fiftyone.config.show_progress_bars (None), or a progress callback function to invoke instead

Returns

a dictionary mapping IDs of samples with exact duplicates to lists of IDs of the duplicates for the corresponding sample