YoloToBoundingBoxStepis used to convert an NDArray for the predictions of a YOLO model to a list of bounding box.The NDArray is assumed to be in standard YOLO output format, after activation functions (sigmoid/softmax) have been applied.
[minibatch, B*(5+C), H, W]if
[minibatch, H, W, B*(5+C)]if
falsewhere B is number of bounding box priors, C is number of classes, H is output/label height and W is output/label width.
416x416input, 32 down sampling by the network we have
13x13grid cells (each corresponding to 32 pixels in the input image). Thus, a center of X of 5.5 would be
xPixels = 5.5x32 = 176 pixelsfrom left. Widths and heights are similar: in this example, a width of 13 would be the entire image (416 pixels), and a height of 6.5 would be
6.5/13 = 0.5of the image (208 pixels).