文章/答案/技术大牛

发布

社区首页 >问答首页 >如何添加额外的Conv2DTranspose层以获得56x56掩码

问如何添加额外的Conv2DTranspose层以获得56x56掩码
EN

Stack Overflow用户

提问于 2020-08-08 22:46:39

回答 1查看 80关注 0票数 0

def build_fpn_mask_graph(rois, feature_maps, image_meta,
                         pool_size, num_classes, train_bn=True):
    """Builds the computation graph of the mask head of Feature Pyramid Network.

    rois: [batch, num_rois, (y1, x1, y2, x2)] Proposal boxes in normalized
          coordinates.
    feature_maps: List of feature maps from different layers of the pyramid,
                  [P2, P3, P4, P5]. Each has a different resolution.
    image_meta: [batch, (meta data)] Image details. See compose_image_meta()
    pool_size: The width of the square feature map generated from ROI Pooling.
    num_classes: number of classes, which determines the depth of the results
    train_bn: Boolean. Train or freeze Batch Norm layers

    Returns: Masks [batch, num_rois, MASK_POOL_SIZE, MASK_POOL_SIZE, NUM_CLASSES]
    """
    # ROI Pooling
    # Shape: [batch, num_rois, MASK_POOL_SIZE, MASK_POOL_SIZE, channels]
    x = PyramidROIAlign([pool_size, pool_size],
                        name="roi_align_mask")([rois, image_meta] + feature_maps)

    # Conv layers
    x = KL.TimeDistributed(KL.Conv2D(256, (3, 3), padding="same"),
                           name="mrcnn_mask_conv1")(x)
    x = KL.TimeDistributed(BatchNorm(),
                           name='mrcnn_mask_bn1')(x, training=train_bn)
    x = KL.Activation('relu')(x)

    x = KL.TimeDistributed(KL.Conv2D(256, (3, 3), padding="same"),
                           name="mrcnn_mask_conv2")(x)
    x = KL.TimeDistributed(BatchNorm(),
                           name='mrcnn_mask_bn2')(x, training=train_bn)
    x = KL.Activation('relu')(x)

    x = KL.TimeDistributed(KL.Conv2D(256, (3, 3), padding="same"),
                           name="mrcnn_mask_conv3")(x)
    x = KL.TimeDistributed(BatchNorm(),
                           name='mrcnn_mask_bn3')(x, training=train_bn)
    x = KL.Activation('relu')(x)

    x = KL.TimeDistributed(KL.Conv2D(256, (3, 3), padding="same"),
                           name="mrcnn_mask_conv4")(x)
    x = KL.TimeDistributed(BatchNorm(),
                           name='mrcnn_mask_bn4')(x, training=train_bn)
    x = KL.Activation('relu')(x)

    x = KL.TimeDistributed(KL.Conv2DTranspose(256, (2, 2), strides=2, activation="relu"),
                           name="mrcnn_mask_deconv")(x)
    x = KL.TimeDistributed(KL.Conv2D(num_classes, (1, 1), strides=1, activation="sigmoid"),
                           name="mrcnn_mask")(x)
    return x

上面的代码是我从matterport掩码rcnn得到的。我需要添加一个额外的Conv2DTranspose层来获得一个56x56的蒙版。我该怎么做呢？我对TensorFlow一无所知。我认为上面的代码是为28x28掩模设计的。但我需要56x56分辨率的蒙版。

tensorflow

keras

deep-learning

回答 1

Stack Overflow用户

回答已采纳

发布于 2020-08-08 22:58:06

您应该添加另一个

 x = KL.TimeDistributed(KL.Conv2DTranspose(256, (2, 2), strides=2, activation="relu"),
                           name="mrcnn_mask_deconv")(x)

在最后一层与sigmoid激活之前。

您是否了解TensorFlow或PyTorch都无关紧要。把TransposedConvolution()看作是Convolution()的对立面。

当在4x4的图像上应用具有步长=2的内核( 2，2)的convolution时，我们获得了2x2维的最终输出特征图。当涉及到TransposedConvolution时，情况正好相反；核为(2,2)、步长为2的转置卷积将使输出图像的大小加倍。因此将尺寸为2x2的输入图像转换为4x4的输出图像。

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/63316700

复制

相似问题

问如何添加额外的Conv2DTranspose层以获得56x56掩码
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问如何添加额外的Conv2DTranspose层以获得56x56掩码EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问如何添加额外的Conv2DTranspose层以获得56x56掩码
EN