International Core Journal of Engineering 2020-26 | Page 91

TABLE I. Identifier Right Left Straight right turn D ATABASE COMPOSITION sample graph Quantity 9771 9733 7718 Straight left turn 7636 Straight 14645 Crosswalk 12647 The images of the data set were randomly scrambled before training, and the expanded data was divided into training set, verification set and test set according to the ratio of 6:2:2, and the number was 22087, 7362, 7362 respectively. conclusions can be reflected. Here we named the test set that was randomly divided proportionally as test set 1, and the test set taken on Huashan Road was named test set 2. 3) Experimental results and analysis SSD, SSD (improved loss function), FSSD, and R-FCN models are all based on the Python interface under the Caffe framework. The operating system is Ubuntu 16.04 and the GPU is Nvidia 1080Ti. Test the two test sets separately under the premise of model convergence. The test results are shown in Table II and Table III. This paper prepares another test set that captures 848 images taken on a car camera on Huashan Road. Different from the previous data set, the test set randomly selects the road information of a specific location in China, which is more similar to the actual detection scenario. If the results of the two test sets are similar, the general applicability of the TABLE II. RESULTS OF DIFFERENT MODELS IN SET 1 model SSD SSD(Focal loss) FSSD R-FCN 0.619 0.623 0.620 0.631 0.859 0.897 0.708 33 0.627 0.659 0.638 0.620 0.860 0.906 0.718 33 0.668 0.657 0.669 0.659 0.887 0.916 0.743 41 0.911 0.905 0.898 0.899 0.906 0.924 0.907 56 category Right Left Straight right Straight left Straight Crosswalk mAP speed (ms) T ABLE III RESULTS OF DIFFERENT MODELS IN SET 1 model SSD SSD(Focal loss) FSSD R-FCN 0.619 0.623 0.620 0.631 0.859 0.897 0.708 33 0.627 0.659 0.638 0.620 0.860 0.906 0.718 33 0.668 0.657 0.669 0.659 0.887 0.916 0.743 41 0.911 0.905 0.898 0.899 0.906 0.924 0.907 56 category Right Left Straight right Straight left Straight Crosswalk mAP speed (ms) Through the above results, it can be noticed that the single- stage detection algorithm such as R-FCN has higher overall accuracy, and the two-stage detection algorithm such as SSD has lower overall accuracy. According to the average detection accuracy, the order from high to low is R-FCN, FSSD, SSD (Focal loss), SSD. SSD, SSD (Focal loss), FSSD algorithm is poor in detecting arrow marks. In terms of speed, the single-stage algorithm is faster and can meet real-time performance. The R-FCN is slower, but the gap is not large. and one-stage algorithms such as SSD do not utilize shallow features. The SSD/FSSD input fixed size of 300*300 increases the speed, but has a large impact on the small target detection of large images. For large target detection RFCN and ssd/fssd performance is equivalent. Using two different data sets for model testing, it can be seen from the test results that the accuracy is basically the same, which shows the accuracy and universal adaptability of the test results. In general, the R-FCN model is the best for road marking detection. V. C ONCLUSION R EFERENCES For small targets, detecting R-FCN is better. For large targets such as crosswalks, the two accuracy rates are comparable, but the onestage method is faster. Small target detection tends to rely more on shallow features since shallow features have high resolution but poor semantic distinction, [1] [2] 69 H. Vokhidov, H. Hong, K. Jin, , et al, “Recognition of Damaged Arrow-Road Markings by Visible Light Camera Sensor Based on Convolutional Neural Network”, Sensors, vol. 16, no. 12, 2016. R. Girshick, J. Donahue, T. Darrell, J. Malik, “Rich feature hierarchies