Segmenting with pre-trained Mask R-CNN

In this exercise, you will use the pre-trained Mask R-CNN model to perform instance segmentation on the following image of two cats.

two cats image

The model you will use has been pre-trained on the COCO dataset, which contains images of common objects, including animals. Thanks to this, the model should be able to recognize cats out of the box, without the need to fine-tune it.

Your task is to load the model and the two cats image, prepare the image, and pass it to the model to obtain the predictions. Image from PIL, torch, and transforms from torchvision have been imported for you.

Import maskrcnn_resnet50_fpn from the appropriate torchvision module.
Load the pretrained Mask R-CNN to model.
Transform the two cats image to a tensor and unsqueeze it.
Perform inference by passing the image to the model and assign the output to prediction.

Image classification with CNNs

Object recognition

Image Segmentation

Image Generation with GANs

Ubung

Segmenting with pre-trained Mask R-CNN

Anweisungen