International Core Journal of Engineering 2020-26 | Page 153
So far, the basic module of YOLO V3 model has been
introduced. The structure of the model is described below.
Among them, and are the number of grids migrated by
the current grid relative to the upper left corner grid. function
is a logistic function, which normalizes coordinates to 0-1.
and
are normalized values relative to grid
The final
cells. The width and height of anchor box with the greatest
and
coincidence of , ΰ and groundtruth. In practice,
are also normalized to 0-1. In actual program,
and ΰ are
anchor's width, high and featuremap's width and high ratio.
and are normalized values relative to anchor.
The final
416*416*3
DBL
RES1
RES8
DBL
DBL
RES2
RES8
RES4
Upsampling
DBL*5
Darknet-53
concat DBL*5
concat DBL*5
Upsampling
Figure 3 Anchor Candidate Box
The two parts are composed of many small modules. The
modules used in the model are introduced separately below.
DBL
A. Darknetconv2D BN Leaky
DBL module is the basic module of YOLO V3 model, and
its composition is shown in Figure 3.
DBL
=
conv
conv Y1(13*13*15)
DBL conv Y2(26*26*15)
DBL conv Y3(52*52*15)
Leaky relu
BN
Figure 3 DBL module
Among them, conv refers to convolution layer, BN refers
to BN layer, Leaky relu refers to the activation layer whose
activation function is Leaky relu function.
Figure 6 Algorithmic Structural Diagram
B. Res Unit
RU module is the residual structure of Darknet-53
network. It is precisely because of the use of this module that
Darknet becomes deeper and more possible. Its composition
is shown in Figure 4.
RU
=
DBL
DBL
+
Figure 4 RU module
C. RESN Unit
RESN module is a module using residual structure in
Darknet-53 network. Among them, n represents the number,
that is, the positive RESN module uses several layers of RU
module, whose composition is shown in Figure 5.
RESN
=
Zero
padding
DBL
Figure 7 Darknet-53 Network Architecture
As shown in Fig. 6, except that the output y_n is slightly
modified, the other structures are consistent with the structure
of YOLO V3 model, in which upsampling refers to up-
sampling and concat refers to tensor splicing. Unlike the
addition of residual structures, tensor splicing is not a simple
numerical addition, but an expansion of tensor dimension.
RU*n
Figure 5 RESN module
Zero padding refers to zero padding and RU*n refers to
RU modules that repeat n layers.
131