Welcome to the latest installment of our ongoing blog series where we highlight notebooks and datasets from the FiftyOne Examples GitHub repository. The fiftyone-examples repository contains over 30 notebooks that make it easy to try various computer vision workflows and explore their associated datasets. In this post, we take a look at how to use the Segment Anything Model (SAM) for predictions and segmentations on Kaggle’s Football Player Segmentation dataset.

Wait, what’s FiftyOne?

FiftyOne is an open source machine learning toolset that enables data science teams to improve the performance of their computer vision models by helping them curate high quality datasets, evaluate models, find mistakes, visualize embeddings, and get to production faster.

About the Football Player Segmentation Dataset

The Football Player Segmentation dataset is intended for use on computer vision tasks related to player detection and segmentation in football matches. The dataset contains images of players in different playing positions, such as goalkeepers, defenders, midfielders, and forwards, captured from various angles and distances. The images are annotated with pixel-level masks that indicate the player's location and segmentation boundaries, making it ideal for training deep learning models for player segmentation.

Maintainer: Yaroslav Isaienkov
Download: Kaggle
License: CC0 - Public Domain
Size: 333 MB (527 images)

Can’t wait? You can preview the dataset in your browser at try.fiftyone.ai.

What is SAM?

In the realm of computer vision, segmentation helps us identify which image pixels belong to an object. This is a common task in computer vision applicable to a broad range of applications, from analyzing scientific imagery to editing photos. Previously, creating an accurate segmentation model for specific tasks required deep technical expertise, plus access to computing infrastructure and large volumes of carefully annotated data.

Earlier this year, Meta announced the Apache 2.0 licensed Segment Anything Model project that aims to democratize segmentation by making it more accessible to data scientists. The project includes both a task dataset and model for image segmentation.

About the Football Player Segmentation notebook

The Football Player Segmentation notebook was authored by Kishan Savant and can be cloned, downloaded (or viewed in nbviewer or Google Colab) here.

Step 1: Install FiftyOne

If you don’t already have FiftyOne installed on your laptop, it takes just a few minutes! For example on MacOs:

Verify your version of Python
Create and activate a virtual environment
Install IPython (optional)
Upgrade your Setuptools
Install FiftyOne

Learn more about how to get up and running with FiftyOne in the Docs.

Step 2: Install the required Python libraries

Now that you have FiftyOne installed, let’s install the Python libraries (including SAM) that we’ll need to successfully work with the notebook.

Step 3: Imports

We’ll need to import the FiftyOne Brain, which is a separate Python package for FiftyOne that gives you the ability to visualize embeddings, find similar images, uniqueness, and mistakenness. We’ll also need to import a variety of utilities like os, cv2, wget, matplotlib, torchvision, PIL and numpy.

Step 4: Get the current working directory

Next, let’s get the current working directory.

Step 5: Download and extract dataset from Kaggle

If you are not already a Kaggle user, you’ll need to create a Kaggle account. After the creation of the account, go to your Kaggle Account page and scroll down to the API section. Here you’ll need to click on “Create New API Token.” A new API token will be created in the form of a kaggle.json file. This kaggle.json file contains your Kaggle username and key.

The final steps here are to download this kaggle.json file to your current working directory, download and extract the dataset.

Step 6: Load the dataset

The football player segmentation dataset is already formatted in the COCO dataset format. This means we can import the dataset into FiftyOne using the fo.types.COCODetectionDataset common format.

Step 7: Add embeddings

Next, let’s use the FiftyOne Brain's embedding similarity capability to visualize several scenarios in a football game and launch the results inside the FiftyOne App.

Overview: Football Player Segmentation dataset in the FiftyOne App

The sample detail view of the Football Player Segmentation dataset in the FiftyOne App

Filtering data

Filter by detection field

With the dataset loaded in the FiftyOne App and the embeddings computed, let’s start exploring some interesting views, filters and segmentations. First, let’s create a simple filter by detection field.

Overview: Filter by detection field in the FiftyOne App

Filter by detection field in the sample detail view of the FiftyOne App

Filter by segmentations field

Let’s create another simple filter. In this case let’s do so by segmentations.

Overview: Filter by segmentations field in the FiftyOne App

Filter by segmentations field in the sample detail view of the FiftyOne App

Filter by id

Finally, let’s filter by the different positions of the people on the field. In this case, we’ll filter by an Id which is associated with one of the referees.

Filter by Id in the sample detail view of the FiftyOne App

Selecting clusters of samples in the embeddings view

Up next, let’s use the lasso tool in the FiftyOne App’s embeddings view to investigate some interesting clusters of samples.

In the GIF above, we lasso a selection that shows 13 samples of what looks like positions of footballers during a corner kick at a certain side. This set of similar images helps to track the positions of the players before and after the kick. Similar clusters can be used to analyze the player tracking for other corner kicks taken on same and opposite sides during the game.

In the second GIF we lasso a cluster that contains similar images showing the player positions during a throw-in.

Working with SAM

Cool! Now, let’s add segmentation predictions to a subset of dataset using SAM and evaluate them against ground_truths. First thing to do is download the Segment Anything model.

Next, let’s create a predictions view from a subset of the dataset.

A predictions view comprised of 30 samples

Now, let’s load the SAM model and predictor.

Since we have the bounding box available from the detection, we can use the bounding boxes to generate segmentation masks. For an in-depth explanation on the code and how instance segmentation with SAM works, check out this article written by Jacob Marks.

Instance segmentations predictions view in the FiftyOne App

Next, let’s evaluate the predictions.

Example output:

View the patches in the FiftyOne App.

Viewing the patches in the FiftyOne App

Talk to a computer vision expert