MobileNet

1. MobileNet
- 1.1. MobileNetV1
- 1.2. MobileNetV2
  - 1.2.1. Linear Bottleneck
  - 1.2.2. Network

1. MobileNet

moblienet 是在 Vgg 的基础上把 3x3 conv2d 换成 SeparatableConv2D, 使得它的参数和 flops 降低很多, 更适合移动端使用.

1.1. MobileNetV1

https://arxiv.org/pdf/1704.04861.pdf 2017/4, google

1.2. MobileNetV2

https://arxiv.org/pdf/1801.04381.pdf 2018/01, google

MobileNetV2 引入了 ResNet 的 bottleneck, 但做了一点修改, 称为 `linear bottleneck`

1.2.1. Linear Bottleneck

当 feature map 很小时直接做 relu 会丢失很多数据, 那就先用 1x1 conv2d 增大 channel, 然后再做 DepthwiseConv2d+relu, 最后用 1x1 conv2d 恢复成较小 channel.

这个做法和 PointNet 处理 pooling 的作法类似?

但这个做法与 resnet 的 bottleneck 是反的: resnet 的 bottleneck 是先用 1x1 减小 channel, 然后 conv 完再用 1x1 增大 channel, 所以称为 inverted residuals.

Figure 1: linear bottleneck

mobilenet 的 bottleneck 有两种:

当 stride = 1 时, 和 resnet 的 bottleneck 类似, 但 1x1 conv2d 做 expandsion 增大 channel 而不是减小. 同时最后的 1x1 conv2d 没有再接 relu.
当 stride = 2 时, 没有 skip connection, 可能是因为 shape 不匹配

1.2.2. Network

t 表示 expansion 的系数
n 表示 layer 有这多少个重复的 bottleneck
s 表示 stride, 同一个 layer 只有第一个 bottlenect 的 stride 为 s, 其它的为 1
c 输出 channel 数, 同一个 layer 的每个 bottleneck 输出的 channel 均为 c

Backlinks

ShuffleNet (ShuffleNet): shuffnet 也是针对移动端的模型, 和 MobileNetV2 类似, 也使用了 resnet 的 bottleneck 和 depthwise conv, 不同的是它认为 1x1 conv 代价太高, 所以把 1x1 conv 变成 1x1 group conv + channel shuffle

Backlinks

DepthwiseConv2D (CNN > DepthwiseConv2D): DepthwiseConv2D 最早是在 MobileNet 的应用的, 后续针对移动端的 ShuffleNet 也都会使用它, 以提高推理速度

Image Classification (Image Classification > MobileNet): MobileNet

SSD (SSD > anchor): ssd 使用的 mobilenet 的 `out_relu` 层的 feature map 大小为 [1, 19, 19, 576], 接一个 conv2d (4*4, kernel=3,strides=1,padding=same), 输出为 19*19*4*4, 对应 19*19*4 个 anchor

MobileNet

Table of Contents

1. MobileNet

1.1. MobileNetV1

1.2. MobileNetV2

1.2.1. Linear Bottleneck

1.2.2. Network

Backlinks

Backlinks