International Core Journal of Engineering 2020-26 | Page 27

IV. P OWER M ARKET S ETTLEMENT D ATA R ISK P ROCESSING i is settlement interval. A. Analysis of Risk Processing of Power market Settlement Data In the process of settlement of the power market, data risks always exist. After the abnormal data is identified by the evidence theory, the identified abnormal data needs to be processed. In the spot market, due to the short settlement interval, more settlement data, how to process the identified abnormal data quickly becomes an important part of the settlement data risk. The paper will study the risk processing of settlement data in combination with the actual situation in the power market settlement and the above risk identification results. After the anomaly data identification of the data samples by the evidence theory, the degree of support is obtained. Therefore, the paper proposes the concept of reconstruction threshold T. If the reliability of a certain data is higher than the reconstruction threshold T, but lower than the lower limit of support, it means that the evidence is abnormal data but its abnormality can accepted. At this time, in order to improve the settlement efficiency, the abnormal data may still be settled, but it needs to be identified. After the settlement, analyze the cause of the abnormal data and retrieve the correct settlement data for error settlement. If the reliability of a certain data is lower than the reconstruction threshold T, the deviation from the normal level is excessive. At this time, if still settle by abnormal data, there is a big risk. Therefore, it is necessary to use a certain processing method to replace the abnormal data with the reliability lower than T, so that reduce the settlement risk. The polynomial curve fitting method only needs to reconstruct the data near the data, which is easy to obtain. However, when the settlement data fluctuates greatly, the reconstructed data fitted by the polynomial will have a large difference from the actual value. The historical correlation- based method is applied to historical contemporaneous data, which is more accurate when the settlement data fluctuates greatly, but when the market entity operates for a short time, it will lack relevant data. According to the above analysis, the paper proposes a combination of polynomial fitting and historical correlation. The steps are described. a) Perform polynomial curve fitting on the data needed to be reconstructed; b) Calculate the curvature of the fitted curve; c) If the curvature is less than the threshold value Q, the settlement data is considered to be less fluctuating, and the fitting curve is used as the reconstruction data; d) If the curvature is larger than the threshold value Q, it is considered that the settlement data fluctuates greatly, and the reconstructed data is calculated using the historical contemporaneous data using Equation (21). V. S IMULATION A NALYSIS A. Power market Settlement Data and Processing The paper selects a domestic power market as an example. In addition, abnormal data needs to be set in the settlement data. The paper selects each user to generate abnormal data for a certain month. Taking User 1 as an example, the settlement data is shown in Table I, in which March 2018 is the generated abnormal data. B. Power Market Settlement Data Reconstruction Method Reconstruction methods for settlement anomaly data generally have polynomial curve fitting [12] and reconstruction methods based on historical correlation of settlement data. TABLE I. T ABLE F OR C ONSUMER ’ S S ETTLEMENT D ATA 1) Polynomial curve fitting reconstruction method: In essence, polynomial curve fitting is a kind of linear regression method, and its mathematical expression is Equation (20). y x , w M ¦ Z x j (20) j j 0 Where M is the highest number of polynomials. 2) Reconstruction method based on historical correlation of settlement data: The settlement price and electricity are regularly followed, and closely related to time. So historical contemporaneous data can be used instead when resolving anomalous or missing data. In addition, the paper considers the characteristics of settlement data over time, and processes historical synchronic data through Equation (21). n f x f c x  k f c x  i 1 i 1 n Settlement price /(¥/kWh) Market electricity /kWh Actual electricity /kWh June 2017 0.21306 105000 105000 July 2017 0.18097 143000 143000 August 2017 0.12008 119084 119084 September 2017 0.1 160000 115098 October 2017 0.11699 140000 140000 November 2017 0.185 134000 117277 December 2017 0.22338 115000 113679 January 2018 0.232 110000 99120 February 2018 0.229 120000 115890 March 2018 0.12 178000 164980 April 2018 0.23937 81000 80360 May 2018 0.23655 99362 97227 The three-dimensional matrix X(I×J×K) will be expanded in the direction of the settlement interval to obtain X(20×36). PCA and ICA are required for the data samples. When performing PCA, the percentage of each principal component is calculated to determine the number of variables that need to be retained after dimension reduction. The paper selects the first 8 variables as the retained principal components, and the retained information can n ¦ f x  ¦ f c x Settlement interval (21) Where f x is refactoring data, f c x is historical data, 5