International Core Journal of Engineering 2020-26 | Page 171

four possible transform pairs of primary transform: (DCT2, DCT2), (DCT2, DST7), (DST7, DCT2), (DST7, DST7). The type of transform pair is uniquely determined according to the size of the current block, and there is no need to traverse all of transform pairs to select the best transform. Implicit MTS is cut-down version of explicit MTS, these two modes cannot coexist at the same time. When implicit MTS is enabled, type of horizontal and vertical transform can be uniquely determined using the size of the input block, DCT2 is used for the long side and DST7 for the short side. transform both to DCT2. It is judged whether to change the transform type to DST7 according to the size of the input block. When the conditions of the length of width or height are satisfied with greater or equal to 4 and less than or equal to 16, select DST7 as the corresponding type of horizontal or vertical transform. After the primary transform is completed, according to the value of LFNST index, determine whether the LFNST is enabled or disabled. There are three states of LFNST: LFNST0, LFNST1 and LFNST2. Where LFNST0 means LFNST disabled, LFNST1 means LFNST enabled and the transform kernel selects the first one in the transform set that selected by intra prediction mode, LFNST2 means LFNST enabled and the transform kernel selects the second one in the transform set that selected by intra prediction mode. Choose the best transform between 3 kinds of LFNST options, and the coefficients of the best transform are further compressed by quantization and entropy coding. Implicit MTS can be classified into two cases: ISP (Intra sub-partitions) mode or turn on implicit MTS switch [7-8]. ISP mode is an intra-block partition mode , which divides the intra luma prediction block into 2/4 sub-regions according to the size of vertical or horizontal, each sub-region has a minimum of 16 sampling points, and the minimum CU allowed to use ISP mode is 4*8/8*4. In the encoder, the sub- regions are sequentially encoded in order from top to bottom (left to right). After the coding of the previous sub-regions was completed, the inverse transform and inverse quantization were performed, and the predicted pixels are added to reconstruct the pixels so that the next sub-regions can be predicted. All sub-regions use the same intra- prediction mode. The other case is to turn on the implicit MTS switch and the Explicit MTS switch to turn off in the code. Although the MTS considers the dynamic transformation characteristics of the residual, the MTS is a separable transform and the compression capability is limited. In 2016, a Non-Separable Secondary Transforms (NSST) was proposed and implemented on top of HEVC [9]. The proposed method performs small size non-separable transforms on lower frequency transform coefficients of primary transform, so the transform efficiency was further improved and greatly enhances the overall performance of encoder. In 2019 Geneva meeting, LG Electronics came out Low Frequency Non-Separable Secondary Transform (LFNST) and has been adopted in VTM5.0. On the basis of NSST, the proposal proposed only keep the low-frequency coefficients of the secondary transforms and the high- frequency coefficients are zeroed, which greatly reduces the storage space and the amount of calculation. [10]. There are four transformation sets and each transform set contains two non-separable transform matrices (kernels) in LFNST. Among them, the selection of the transform set is related to the intra prediction mode, and the selection of the transform kernel is related to the LFNST index. Conclusions can be drawn from previous related research as follows: the coding efficiency of LFNST depends on the LFNST core, and the LFNST kernel is trained by the main transform coefficients when Explicit MTS is turned on [9- 10]. Therefore, the LFNST kernels are more suitable for the primary transform types when the explicit MTS is enabled, but these types do not contain the combination of DCT2 and DST7 when the implicit MTS is enabled. So that implicit MTS does not perform very efficiently for LFNST compared with explicit MTS. Fig. 1. The combination of implicit MTS and LFNST in VTM5.0. III. P ROPOSED COMBINATION OF IMPLICIT MTS AND LFNST According to the above analysis, this paper puts forward two improving schemes for the implicit MTS. The main thought of these schemes is: remove the type of transform pairs that may affect the performance of the LFNST and turn them into the transform pair suitable for LFNST. That is when the primary transform pair type is a combination of DCT2 and DST7, the DST7 is changed to DCT2. Two experimental plans were designed as shown in Fig. 2 and Fig. 3. Implicit MTS is simpler than explicit MTS, the correlation research are not enough. In addition, the performance of LFNST did not take into account the difference between implicit MTS and explicit MTS. Therefore in this paper, the influence of the difference in implicit MTS and explicit MTS on the performance of LFNST was studied, and puts forward some improvement schemes. Experimental results shown that encoding efficiency could be improved and coding complexity was almost unaffected The first experimental plan proposes as long as the primary transform pair is a combination of DCT2 and DST7, the DST7 is changed to DCT2, and then does the additional processing. The second experimental plan proposes that when LFNST index was 0, the LFNST was disabled, implicit MTS does not make any changes regardless of the type combination of the main transform pair, when LFNST index was greater than zero, the LFNST was enabled, as long as the primary transform pair is a combination of DCT2 and DST7, the DST7 is changed to DCT2, and then does the additional processing. II. C URRENT COMBINATION OF IMPLICIT MTS AND LFNST The combination of implicit MTS and LFNST in the VTM5.0 reference software of the next generation video coding standard is shown in Fig. 1. When implicit MTS is enabled, initialize the type of horizontal and vertical 149