Coco注释：将RLE转换为多边形分割

Question

我有coco风格的注释（json格式），带有分段和bbox。
大多数分割都以像素（多边形）的列表列表的形式给出。

问题在于，某些分段以表示 RLE 值的字典（带有“counts”和“size”键）的形式给出，在这些情况下，“iscrowd”键等于 1（通常等于 0）。

我想将所有带有 iscrowd==1 的“注释”转换为多边形而不是 RLE。

我不需要掩码如建议的那样，但只需要 json 文件以仅具有多边形形状的分段。

这是一些注释的示例（来自同一图像），请注意前两个注释中的分割是多边形形状，后两个是 RLE 形状：

{'id': 53, 'image_id': 4, 'category_id': 2037037930, 'segmentation': [[344.51, 328.83, 316.02, 399.73, 358.3, 399.78, 375.85, 336.07]], 'area': 2561.4049499999965, 'bbox': [316.02, 328.83, 59.83000000000004, 70.94999999999999], 'iscrowd': 0, 'extra': {}} 

{'id': 54, 'image_id': 4, 'category_id': 2037037930, 'segmentation': [[376.43, 233.52, 368.93, 250.71, 375.96, 252.89, 369.4, 269.76, 378.62, 273.83, 372.21, 292.42, 400.09, 302.34, 400.09, 302.11, 400.1, 242.04]], 'area': 1596.5407000000123, 'bbox': [368.93, 233.52, 31.170000000000016, 68.81999999999996], 'iscrowd': 0, 'extra': {}} 

{'id': 67, 'image_id': 4, 'category_id': 2037037930, 'segmentation': {'counts': [55026, 2, 396, 4, 394, 7, 391, 9, 389, 12, 386, 14, 384, 17, 381, 19, 379, 21, 377, 24, 374, 26, 372, 29, 369, 31, 367, 33, 365, 36, 362, 38, 360, 41, 357, 43, 355, 46, 352, 48, 350, 50, 348, 53, 345, 55, 343, 58, 340, 38, 1, 21, 338, 37, 5, 21, 335, 37, 7, 21, 335, 34, 10, 19, 338, 32, 12, 16, 340, 33, 11, 14, 342, 33, 11, 11, 346, 33, 11, 8, 348, 33, 10, 7, 350, 33, 8, 8, 351, 34, 5, 11, 351, 33, 3, 13, 351, 49, 351, 49, 352, 49, 351, 49, 351, 49, 352, 48, 352, 49, 351, 49, 352, 46, 354, 44, 356, 41, 359, 39, 362, 36, 364, 35, 365, 35, 366, 35, 365, 35, 365, 35, 366, 34, 366, 34, 366, 35, 366, 34, 366, 34, 366, 32, 368, 29, 372, 25, 375, 23, 377, 20, 381, 18, 382, 19, 381, 19, 382, 18, 382, 18, 382, 19, 382, 18, 382, 18, 382, 19, 381, 19, 382, 16, 384, 13, 387, 9, 392, 5, 395, 2, 73808], 'size': [400, 400]}, 'area': 2598.0, 'bbox': [137, 174, 79, 65], 'iscrowd': 1, 'extra': {}} 

{'id': 68, 'image_id': 4, 'category_id': 2037037930, 'segmentation': {'counts': [76703, 2, 396, 4, 394, 7, 391, 9, 389, 11, 387, 14, 384, 16, 382, 19, 379, 21, 377, 23, 375, 26, 372, 28, 370, 30, 368, 33, 365, 35, 364, 37, 363, 37, 364, 36, 364, 37, 364, 36, 364, 36, 364, 37, 364, 36, 364, 37, 363, 37, 364, 36, 364, 37, 364, 36, 364, 36, 364, 37, 364, 15, 1, 20, 364, 13, 4, 19, 365, 10, 6, 20, 363, 9, 8, 20, 361, 9, 11, 20, 358, 9, 13, 20, 356, 11, 14, 19, 354, 14, 13, 20, 351, 16, 13, 20, 348, 20, 13, 19, 346, 22, 13, 20, 343, 24, 13, 20, 341, 27, 13, 20, 338, 29, 13, 20, 336, 32, 13, 19, 334, 34, 13, 20, 331, 37, 12, 20, 331, 37, 13, 19, 332, 36, 12, 21, 331, 37, 8, 24, 332, 36, 5, 28, 331, 37, 1, 31, 331, 69, 332, 69, 331, 69, 332, 68, 332, 69, 331, 69, 332, 68, 332, 69, 332, 68, 332, 69, 331, 69, 332, 68, 332, 48, 1, 20, 331, 45, 5, 19, 332, 41, 8, 19, 332, 38, 12, 19, 332, 36, 13, 19, 332, 37, 12, 20, 331, 37, 13, 19, 332, 36, 13, 19, 332, 37, 13, 19, 332, 36, 13, 19, 332, 37, 12, 19, 332, 37, 13, 19, 332, 36, 13, 19, 332, 37, 13, 19, 332, 36, 12, 20, 332, 36, 10, 22, 332, 37, 6, 26, 332, 36, 4, 28, 332, 37, 1, 28, 335, 63, 337, 61, 339, 59, 342, 56, 344, 53, 348, 50, 350, 48, 352, 46, 355, 43, 357, 40, 360, 38, 363, 35, 365, 33, 368, 30, 370, 28, 372, 25, 376, 22, 378, 20, 381, 17, 383, 15, 385, 12, 389, 9, 391, 7, 394, 4, 396, 2, 40521], 'size': [400, 400]}, 'area': 4551.0, 'bbox': [191, 253, 108, 82], 'iscrowd': 1, 'extra': {}}

测试 1 失败：

我已经尝试过以下方法：

for annotation in coco_data['annotations']:
    if type(annotation['segmentation']) == dict:
        # Get the values of the dictionary
        height = annotation['segmentation']['size'][0]
        width = annotation['segmentation']['size'][1]
        counts = annotation['segmentation']['counts']

        # Decode the RLE encoded counts
        rle = np.array(counts).reshape(-1, 2)
        starts, lengths = rle[:, 0], rle[:, 1]
        starts -= 1
        ends = starts + lengths
        pixels = []
        for lo, hi in zip(starts, ends):
            pixels.extend(range(lo, hi))
        pixels = np.array(pixels)

        # Convert the 1D pixels array to a 2D array
        segments = np.zeros((height, width), dtype=np.uint8)
        segments[pixels // width, pixels % width] = 1
        segments = np.where(segments == 1)

        # Update the segmentation and iscrowd fields
        annotation['segmentation'] = [segments[1].tolist(), segments[0].tolist()]
        annotation['iscrowd'] = 0

但出现以下错误：

ValueError                                Traceback (most recent call last)
<ipython-input-29-1bf7f4af292c> in <module>
     16 
     17         # Decode the RLE encoded counts
---> 18         rle = np.array(counts).reshape(-1, 2)
     19         starts, lengths = rle[:, 0], rle[:, 1]
     20         starts -= 1

ValueError: cannot reshape array of size 183 into shape (2)

afaik，它期望 RLE 的长度是偶数？不知道问题出在哪里以及如何解决。

测试 2 失败：

然后我尝试了一些与

import pycocotools.mask as mask

和

import skimage.measure as measure

有点不同的东西以及以下功能：

def rle_to_polygon(rle, height, width):
    if isinstance(rle, list):
        rle = mask.frPyObjects(rle, height, width)
    rle = mask.decode(rle)
    contours = measure.find_contours(rle, 0.5)
    polygon = []
    for contour in contours:
        contour = np.fliplr(contour) - 1
        contour = contour.clip(min=0)
        contour = contour.astype(int)
        if len(contour) >= 4:
            polygon.append(contour.tolist())
    return polygon

收到了

<ipython-input-43-84d17a601509> in rle_to_polygon(rle, height, width)
     79 def rle_to_polygon(rle, height, width):
     80     if isinstance(rle, list):
---> 81         rle = mask.frPyObjects(rle, height, width)
     82     rle = mask.decode(rle)
     83     contours = measure.find_contours(rle, 0.5)

pycocotools/_mask.pyx in pycocotools._mask.frPyObjects()

TypeError: object of type 'int' has no len()

如有任何建议，我们将不胜感激！

Answer 1

这是我的任务代码：

import logging
import cv2
from pycocotools import mask as cocomask
import copy

def rle_to_coco(annotation: dict) -> list[dict]:
    """Transform the rle coco annotation (a single one) into coco style.
    In this case, one mask can contain several polygons, later leading to several `Annotation` objects.
    In case of not having a valid polygon (the mask is a single pixel) it will be an empty list.
    Parameters
    ----------
    annotation : dict
        rle coco style annotation
    Returns
    -------
    list[dict]
        list of coco style annotations (in dict format)
    """

    annotation["segmentation"] = cocomask.frPyObjects(
        annotation["segmentation"],
        annotation["segmentation"]["size"][0],
        annotation["segmentation"]["size"][1],
    )

    maskedArr = cocomask.decode(annotation["segmentation"])
    contours, _ = cv2.findContours(maskedArr, cv2.RETR_TREE, cv2.CHAIN_APPROX_SIMPLE)
    segmentation = []

    for contour in contours:
        if contour.size >= 6:
            segmentation.append(contour)

    if len(segmentation) == 0:
        logging.debug(
            f"Annotation with id {annotation['id']} is not valid, it has no segmentations."
        )
        annotations = []

    else:
        annotations = list()
        for i, seg in enumerate(segmentation):

            single_annotation = copy.deepcopy(annotation)
            single_annotation["segmentation_coords"] = (
                seg.astype(float).flatten().tolist()
            )
            single_annotation["bbox"] = list(cv2.boundingRect(seg))
            single_annotation["area"] = cv2.contourArea(seg)
            single_annotation["instance_id"] = annotation["id"]
            single_annotation["annotation_id"] = f"{annotation['id']}_{i}"

            annotations.append(single_annotation)

    return annotations

您需要

opencv

和

pycocotools

才能使用此代码：

```
pip install opencv-python
```
```
pip install pycocotools
```

请注意，输入注释是 coco dict

annotations

键内的项目之一。像这样的东西：

{
    "image_id": 1,
    "category_id": 1,
    "bbox": [
        485.0489501953125,
        660.6587524414062,
        13.161041259765625,
        10.61248779296875
    ],
    "score": 0.8787025809288025,
    "segmentation": {
        "size": [
            960,
            1280
        ],
        "counts": "jXX>1mm03O1N10000000000001NY]mf0"
    },
    "id": 1,
    "iscrowd": 0,
    "attributes": {
        "occluded": false
    }
},

如果 rle 格式的掩码包含多个未连接的掩码，该函数将返回一个列表，其中每个掩码均采用 coco 格式。

希望有帮助！

Answer 2

我也遇到同样的问题，请问你找到解决办法了吗？

Coco注释：将RLE转换为多边形分割

问题描述投票：0回答：2

测试 1 失败：

测试 2 失败：

2个回答

最新问题

Coco注释：将RLE转换为多边形分割

问题描述 投票：0回答：2

测试 1 失败：

测试 2 失败：

2个回答

最新问题

问题描述投票：0回答：2