VOC2012 Segmentation#

This topic describes how to manage the VOC2012 Segmentation dataset, which is a dataset with SemanticMask and InstanceMask labels (Fig. 4 and Fig. 5).

Fig. 4 The preview of a semantic mask from “VOC2012 Segmentation” (../../_images/example-semanticmask.png).

Fig. 5 The preview of an instance mask from “VOC2012 Segmentation” (../../_images/example-instancemask.png).

Authorize a Client Instance#

An AccessKey is needed to authenticate your identity when using TensorBay.

from tensorbay import GAS

# Please visit `https://gas.graviti.com/tensorbay/developer` to get the AccessKey.
gas = GAS("<YOUR_ACCESSKEY>")
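
To avoid hardcoding the key in scripts, one option is to read it from an environment variable. The sketch below is only an illustration: the variable name TENSORBAY_ACCESSKEY and the helper load_access_key are assumptions made for this example and are not part of the TensorBay SDK.

```python
import os


def load_access_key(env_var: str = "TENSORBAY_ACCESSKEY") -> str:
    """Return the AccessKey stored in an environment variable.

    The variable name is an assumption made for this example;
    TensorBay itself does not define it.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"Set the {env_var} environment variable to your AccessKey.")
    return key
```

The returned string can then be passed to GAS in place of the literal key, for example gas = GAS(load_access_key()).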

Create Dataset#

gas.create_dataset("VOC2012Segmentation")

Organize Dataset#

Normally, dataloader.py and catalog.json are required to organize the “VOC2012 Segmentation” dataset into a Dataset instance. In this example, they are stored in the same directory:

VOC2012 Segmentation/
    catalog.json
    dataloader.py

Organizing the “VOC2012 Segmentation” dataset into a Dataset instance takes the following steps.

Step 1: Write the Catalog#

A Catalog contains all label information of one dataset and is typically stored in a JSON file such as catalog.json.

{
    "SEMANTIC_MASK": {
        "categories": [
            { "name": "background", "categoryId": 0 },
            { "name": "aeroplane", "categoryId": 1 },
            { "name": "bicycle", "categoryId": 2 },
            { "name": "bird", "categoryId": 3 },
            { "name": "boat", "categoryId": 4 },
            { "name": "bottle", "categoryId": 5 },
            { "name": "bus", "categoryId": 6 },
            { "name": "car", "categoryId": 7 },
            { "name": "cat", "categoryId": 8 },
            { "name": "chair", "categoryId": 9 },
            { "name": "cow", "categoryId": 10 },
            { "name": "diningtable", "categoryId": 11 },
            { "name": "dog", "categoryId": 12 },
            { "name": "horse", "categoryId": 13 },
            { "name": "motorbike", "categoryId": 14 },
            { "name": "person", "categoryId": 15 },
            { "name": "pottedplant", "categoryId": 16 },
            { "name": "sheep", "categoryId": 17 },
            { "name": "sofa", "categoryId": 18 },
            { "name": "train", "categoryId": 19 },
            { "name": "tvmonitor", "categoryId": 20 },
            { "name": "void", "categoryId": 255 }
        ]
    },
    "INSTANCE_MASK": {
        "categories": [
            { "name": "background", "categoryId": 0 },
            { "name": "void", "categoryId": 255 }
        ]
    }
}

The annotation types for “VOC2012 Segmentation” are SemanticMask and InstanceMask. There are 22 categories for SemanticMask and 2 categories for InstanceMask, where category 0 represents the background and category 255 represents the border of instances.
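
As a quick consistency check, the category ids in a catalog can be inspected with the standard library alone. The sketch below assumes the catalog.json structure shown above; the helper name category_map is hypothetical, and a trimmed inline copy of the catalog stands in for the real file.

```python
import json

# A trimmed stand-in for catalog.json; the real file lists all 22 semantic categories.
CATALOG_TEXT = """
{
    "SEMANTIC_MASK": {
        "categories": [
            { "name": "background", "categoryId": 0 },
            { "name": "aeroplane", "categoryId": 1 },
            { "name": "void", "categoryId": 255 }
        ]
    },
    "INSTANCE_MASK": {
        "categories": [
            { "name": "background", "categoryId": 0 },
            { "name": "void", "categoryId": 255 }
        ]
    }
}
"""


def category_map(catalog: dict, label_type: str) -> dict:
    """Map each categoryId to its name for one label type."""
    mapping = {}
    for category in catalog[label_type]["categories"]:
        category_id = category["categoryId"]
        if category_id in mapping:
            raise ValueError(f"Duplicate categoryId {category_id} in {label_type}")
        mapping[category_id] = category["name"]
    return mapping


catalog = json.loads(CATALOG_TEXT)
semantic = category_map(catalog, "SEMANTIC_MASK")
instance = category_map(catalog, "INSTANCE_MASK")
print(semantic[255], sorted(instance))  # void [0, 255]
```

When the full catalog.json is read with json.load() instead of the inline text, len(semantic) should be 22 and len(instance) should be 2.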

Note

By passing the path of catalog.json, load_catalog() loads the catalog into the dataset.
The categories in InstanceMaskSubcatalog describe the pixel values that are not instance ids.

Important

See catalog table for more catalogs with different label types.

Step 2: Write the Dataloader#

A dataloader is needed to organize the dataset into a Dataset instance.

#!/usr/bin/env python3
#
# Copyright 2021 Graviti. Licensed under MIT License.
#
# pylint: disable=invalid-name

"""Dataloader of VOC2012Segmentation dataset."""

import os

from tensorbay.dataset import Data, Dataset
from tensorbay.label import InstanceMask, SemanticMask

_SEGMENT_NAMES = ("train", "val")
DATASET_NAME = "VOC2012Segmentation"


def VOC2012Segmentation(path: str) -> Dataset:
    """`VOC2012Segmentation <http://host.robots.ox.ac.uk/pascal/VOC/voc2012/>`_ dataset.

    The file structure should be like::

        <path>/
            JPEGImages/
                <image_name>.jpg
                ...
            SegmentationClass/
                <mask_name>.png
                ...
            SegmentationObject/
                <mask_name>.png
                ...
            ImageSets/
                Segmentation/
                    train.txt
                    val.txt
                    ...
                ...
            ...

    Arguments:
        path: The root directory of the dataset.

    Returns:
        Loaded :class:`~tensorbay.dataset.dataset.Dataset` instance.

    """
    root_path = os.path.abspath(os.path.expanduser(path))

    image_path = os.path.join(root_path, "JPEGImages")
    semantic_mask_path = os.path.join(root_path, "SegmentationClass")
    instance_mask_path = os.path.join(root_path, "SegmentationObject")
    image_set_path = os.path.join(root_path, "ImageSets", "Segmentation")

    dataset = Dataset(DATASET_NAME)
    dataset.load_catalog(os.path.join(os.path.dirname(__file__), "catalog.json"))

    for segment_name in _SEGMENT_NAMES:
        segment = dataset.create_segment(segment_name)
        with open(os.path.join(image_set_path, f"{segment_name}.txt"), encoding="utf-8") as fp:
            for stem in fp:
                stem = stem.strip()
                data = Data(os.path.join(image_path, f"{stem}.jpg"))
                label = data.label
                mask_filename = f"{stem}.png"
                label.semantic_mask = SemanticMask(os.path.join(semantic_mask_path, mask_filename))
                label.instance_mask = InstanceMask(os.path.join(instance_mask_path, mask_filename))

                segment.append(data)

    return dataset

See SemanticMask annotation and InstanceMask annotation for more details.
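
Before running a dataloader like the one above, it can help to confirm that every stem listed in a split file has its image and both masks on disk. The following is a minimal standard-library sketch, assuming the directory layout described in the docstring; find_missing_files is a hypothetical helper and not part of TensorBay.

```python
import os
from typing import List


def find_missing_files(root_path: str, segment_name: str) -> List[str]:
    """List the files a split refers to that do not exist under root_path."""
    split_file = os.path.join(root_path, "ImageSets", "Segmentation", f"{segment_name}.txt")
    # Each stem is expected to have one image and two masks.
    expected = (
        ("JPEGImages", ".jpg"),
        ("SegmentationClass", ".png"),
        ("SegmentationObject", ".png"),
    )
    missing = []
    with open(split_file, encoding="utf-8") as fp:
        for line in fp:
            stem = line.strip()
            if not stem:  # skip blank lines
                continue
            for directory, suffix in expected:
                file_path = os.path.join(root_path, directory, stem + suffix)
                if not os.path.isfile(file_path):
                    missing.append(file_path)
    return missing
```

An empty list for both "train" and "val" suggests that the layout matches what the dataloader expects.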

TensorBay SDK already includes a number of dataloaders provided by the community. Instead of writing one, importing an available dataloader is also an option.

from tensorbay.opendataset import VOC2012Segmentation

dataset = VOC2012Segmentation("<path/to/dataset>")

Note

Catalogs are loaded automatically by the available dataloaders, so users do not have to write them again.

Important

See dataloader table for dataloaders with different label types.

Upload Dataset#

The organized “VOC2012 Segmentation” dataset can be uploaded to TensorBay for sharing, reuse, etc.

dataset_client = gas.upload_dataset(dataset, jobs=8)
dataset_client.commit("initial commit")

Similar to Git, the commit step after uploading records the changes to the dataset as a version. If needed, make further modifications and commit again. See Version Control for more details.

See the visualization on the TensorBay website.

Read Dataset#

Now the “VOC2012 Segmentation” dataset can be read from TensorBay.

from tensorbay.dataset import Dataset

dataset = Dataset("VOC2012Segmentation", gas)

The “VOC2012 Segmentation” dataset has two segments: train and val. Get a segment by passing the required segment name or its index.

segment_names = dataset.keys()
segment = dataset[0]

In the train segment, there is a sequence of data, which can be obtained by index.

data = segment[0]

Each data has one SemanticMask annotation and one InstanceMask annotation.

from PIL import Image

label_semantic_mask = data.label.semantic_mask
semantic_all_attributes = label_semantic_mask.all_attributes
semantic_mask = Image.open(label_semantic_mask.open())
semantic_mask.show()

label_instance_mask = data.label.instance_mask
instance_all_attributes = label_instance_mask.all_attributes
instance_mask_url = label_instance_mask.get_url()

There are two label types in the “VOC2012 Segmentation” dataset: semantic_mask and instance_mask. The mask can be opened with Image.open(), or its URL can be obtained with get_url(). SemanticMask.all_attributes stores the attributes for every category in the categories list of SEMANTIC_MASK, and InstanceMask.all_attributes stores the attributes for every instance. See SemanticMask and InstanceMask label formats for more details.
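
To make the pixel semantics concrete: following the catalog, pixel values 0 and 255 in an instance mask mark the background and instance borders, and the remaining values act as instance ids. The sketch below counts pixels per instance from a flat sequence of pixel values. It assumes the mask has already been decoded into integers (for example with Pillow); count_instance_pixels is a hypothetical helper, and the sample values are made up.

```python
from collections import Counter
from typing import Dict, Iterable

BACKGROUND_ID = 0  # "background" in the INSTANCE_MASK catalog
VOID_ID = 255  # "void" in the INSTANCE_MASK catalog (instance borders)


def count_instance_pixels(pixels: Iterable[int]) -> Dict[int, int]:
    """Count pixels per instance id, ignoring background and void values."""
    counts = Counter(pixels)
    counts.pop(BACKGROUND_ID, None)
    counts.pop(VOID_ID, None)
    return dict(counts)


# Made-up 2x4 mask flattened row by row: two instances separated by a border.
sample = [0, 1, 255, 2,
          0, 1, 255, 2]
print(count_instance_pixels(sample))  # {1: 2, 2: 2}
```

The number of keys in the result gives the number of instances in the image, which is one simple way to sanity-check a downloaded mask.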

Delete Dataset#

gas.delete_dataset("VOC2012Segmentation")