CN118570334A

CN118570334A - A method for reconstructing power load curve

Info

Publication number: CN118570334A
Application number: CN202411034879.1A
Authority: CN
Inventors: 吕家慧; 程铁军; 孔健沣; 冯明全; 陈近; 黄良栋
Original assignee: Yantai New And Old Kinetic Energy Conversion Research Institute And Yantai Demonstration Base For Transfer And Transformation Of Scientific And Technological Achievements; Yantai Dongfang Wisdom Electric Co Ltd
Current assignee: Yantai New And Old Kinetic Energy Conversion Research Institute And Yantai Demonstration Base For Transfer And Transformation Of Scientific And Technological Achievements; Yantai Dongfang Wisdom Electric Co Ltd
Priority date: 2024-07-31
Filing date: 2024-07-31
Publication date: 2024-08-30
Anticipated expiration: 2044-07-31
Also published as: CN118570334B

Abstract

The present invention discloses a method for reconstructing a power load curve, and belongs to the field of data processing. The method comprises the following steps: decomposing the original power load curve into multiple intrinsic mode component subsequences by variational mode decomposition; spatially reconstructing all intrinsic mode component subsequences to obtain a corresponding information table, wherein each row corresponds to a row vector composed of data points; for each information table, clustering the row vectors therein respectively, and then performing cross clustering to obtain the final clustering result; dividing each type of cluster into different sets according to the number of elements in each type of cluster in the final clustering result, and interpolating the corresponding rows in different ways respectively; obtaining the component reconstruction curve according to the interpolated information table and superimposing and merging them to obtain the final reconstruction curve. The present invention can better adapt to the complex changes of the curve, greatly improve the accuracy of reconstruction, and does not require training data, and also saves more computing resources.

Description

A method for reconstructing power load curve

技术领域Technical Field

本发明属于数据处理领域，具体涉及一种电力负荷曲线重构方法。The invention belongs to the field of data processing, and in particular relates to a method for reconstructing a power load curve.

背景技术Background Art

高分辨率的负荷曲线对于深入挖掘负荷波动的规律至关重要，它不仅能够精确预测新能源发电（如太阳能、风能）的波动，而且对新能源的消纳和电力现货交易具有显著的影响。High-resolution load curves are crucial for in-depth exploration of the laws of load fluctuations. They can not only accurately predict the fluctuations of renewable energy power generation (such as solar and wind power), but also have a significant impact on the consumption of renewable energy and spot electricity trading.

通过硬件虽然能实现高频度数据采集与上传，但需要对已有的硬件设备和通信网络进行改造，以支持高频度的数据采集任务和高速的数据传输，成本较高，且主要适用于关口计量数据采集，对于非关口计量场景则难以推广。Although high-frequency data collection and uploading can be achieved through hardware, the existing hardware equipment and communication networks need to be modified to support high-frequency data collection tasks and high-speed data transmission. The cost is relatively high and it is mainly suitable for gateway metering data collection. It is difficult to promote for non-gateway metering scenarios.

鉴于硬件改造的局限性，通过软件重构成为了当前的主流方案。目前，电力负荷曲线的重构主要有以下两种方式：Given the limitations of hardware transformation, software reconstruction has become the current mainstream solution. Currently, there are two main ways to reconstruct the power load curve:

一、通过单一的插值算法重构：针对低分辨率数据通过插值函数（如拉格朗日插值、牛顿插值等）在数据点之间进行插值操作，得到高分辨率的负荷曲线。虽然该方式可以保持原有规律，但它进行全量填充时会导致较大的误差，影响负荷预测的准确性，进而对新能源调控和电网运行计划的制定造成不利影响，因此只适合对少量缺失数据的填充。1. Reconstruction through a single interpolation algorithm: For low-resolution data, interpolation functions (such as Lagrange interpolation, Newton interpolation, etc.) are used to interpolate between data points to obtain a high-resolution load curve. Although this method can maintain the original rules, it will cause large errors when filling in the full amount, affecting the accuracy of load forecasting, and thus adversely affecting the regulation of new energy and the formulation of power grid operation plans. Therefore, it is only suitable for filling in a small amount of missing data.

二、通过有监督学习或深度学习方法重构：该方式理论上可以达到较高的准确率，但是训练所需的大量的高分辨率的数据样本难以获取。如果选择其他曲线作为样本，数据规模较小时会导致重构曲线不够平滑，规模较大时则需要大量的计算资源。2. Reconstruction through supervised learning or deep learning methods: This method can theoretically achieve a higher accuracy, but the large amount of high-resolution data samples required for training is difficult to obtain. If other curves are selected as samples, the reconstructed curve will not be smooth enough when the data scale is small, and a large amount of computing resources will be required when the scale is large.

发明内容Summary of the invention

本发明提出了一种电力负荷曲线重构方法，其目的是：解决现有重构方法存在的准确性差、训练数据难以获取、耗费大量计算资源的问题。The present invention proposes a power load curve reconstruction method, the purpose of which is to solve the problems of poor accuracy, difficulty in obtaining training data, and consumption of a large amount of computing resources in existing reconstruction methods.

本发明技术方案如下：The technical solution of the present invention is as follows:

一种电力负荷曲线重构方法，步骤包括：A method for reconstructing a power load curve, comprising the steps of:

步骤1、对原始电力负荷曲线中进行预处理；Step 1: preprocessing the original power load curve;

步骤2、通过变分模态分解将原始电力负荷曲线分解为个固有模态分量子序列； Step 2: Decompose the original power load curve into The intrinsic mode component subsequences;

步骤3、对所有固有模态分量子序列分别通过拆分的方式进行空间重构，得到各固有模态分量子序列所分别对应的信息表；信息表中的各行依次对应固有模态分量子序列中按序分割的各曲线段，信息表中的每一行包含对应的曲线段中所有顺序排列的、归一化后的数据点，即每一行对应一个由数据点组成的行向量；Step 3, spatially reconstruct all the intrinsic mode component subsequences by splitting them, and obtain the information tables corresponding to the extrinsic mode component subsequences; each row in the information table corresponds to each curve segment divided in sequence in the extrinsic mode component subsequence, and each row in the information table contains all the normalized data points in the corresponding curve segment arranged in sequence, that is, each row corresponds to a row vector composed of data points;

步骤4、对于各信息表，分别对其中的行向量进行聚类，从而得到个固有模态分量子序列所各自对应的行向量聚类结果； Step 4: For each information table, cluster the row vectors therein to obtain The row vector clustering results corresponding to each of the intrinsic mode component subsequences;

步骤5、对所有固有模态分量子序列的行向量聚类结果进行交叉聚类，得到最终聚类结果；Step 5: Perform cross clustering on the row vector clustering results of all intrinsic mode component subsequences to obtain the final clustering results;

步骤6、按最终聚类结果中各类簇的元素数量将各类簇划分至集合和集合：如果最终聚类结果中某类簇的元素数量为1，则将该类簇作为集合的子类簇，否则将该类簇作为集合的子类簇； Step 6: Divide each cluster into sets according to the number of elements in each cluster in the final clustering result and collection ： If the number of elements in a cluster in the final clustering result is 1, then the cluster is taken as a set , otherwise the cluster is treated as a set The subclass cluster of

步骤7、对所有信息表的各行分别进行插值，如果某一行对应集合中的某子类簇的元素，则该行在相邻的数据点之间采用线性插值；如果某一行对应集合中的某子类簇的其中一个元素，则该行在相邻的数据点之间采用近邻插值； Step 7: Interpolate each row of all information tables. If a row corresponds to a set If a row corresponds to a subclass cluster in the set, the row uses linear interpolation between adjacent data points; If it is an element of a subclass cluster in , the row uses nearest neighbor interpolation between adjacent data points;

步骤8、将所有插值后的信息表中的各行分别进行反归一化，然后将信息表中各行的数据点组成与所在行对应的曲线段，再将信息表中各行的曲线段按序组成该信息表所对应的固有模态分量子序列的分量重构曲线，最后将所有分量重构曲线叠加合并，得到最终的重构曲线。Step 8. Denormalize each row in the interpolated information table respectively, then combine the data points of each row in the information table into a curve segment corresponding to the row, and then combine the curve segments of each row in the information table in sequence into a component reconstruction curve of the inherent modal component subsequence corresponding to the information table, and finally superimpose and merge all the component reconstruction curves to obtain the final reconstructed curve.

作为所述的电力负荷曲线重构方法的进一步改进：步骤1中，预处理包括删除异常数据和对缺失的数据进行填充。As a further improvement of the power load curve reconstruction method: in step 1, the preprocessing includes deleting abnormal data and filling in missing data.

作为所述的电力负荷曲线重构方法的进一步改进：步骤3中，设各信息表种均包含行，每一行均包含个数据点； As a further improvement of the power load curve reconstruction method: in step 3, it is assumed that each information table contains rows, each containing data points;

所述归一化是指按行归一化，设第行中的第个数据点原始值为，则该点归一化后的数值为： The normalization is row-by-row normalization. The first The original value of the data point is , then the normalized value of this point is:

。 .

作为所述的电力负荷曲线重构方法的进一步改进：步骤5中，所述交叉聚类的过程为：将第一个固有模态分量子序列的行向量聚类结果和第二个固有模态分量子序列的行向量聚类结果相交得到的聚类结果作为第二个固有模态分量子序列所对应的相交后聚类结果，从第三个固有模态分量子序列开始，将当前固有模态分量子序列的聚类结果与前一个固有模态分量子序列的相交后聚类结果进行相交，得到当前固有模态分量子序列的相交后聚类结果，将最后一个固有模态分量子序列的相交后聚类结果作为所述最终聚类结果。As a further improvement of the power load curve reconstruction method: in step 5, the cross-clustering process is: the clustering result obtained by intersecting the row vector clustering result of the first eigenmode component subsequence and the row vector clustering result of the second eigenmode component subsequence is used as the intersected clustering result corresponding to the second eigenmode component subsequence, and starting from the third eigenmode component subsequence, the clustering result of the current eigenmode component subsequence is intersected with the intersected clustering result of the previous eigenmode component subsequence to obtain the intersected clustering result of the current eigenmode component subsequence, and the intersected clustering result of the last eigenmode component subsequence is used as the final clustering result.

作为所述的电力负荷曲线重构方法的进一步改进：步骤5中，对于任意两个聚类结果，其相交过程为：遍历第一个聚类结果中的类簇，对于每个类簇：将其与第二个聚类结果中的所有类簇分别相交，再将各相交结果组合成为该类簇对应的相交后类簇集合；然后将第一个聚类结果中的原所有类簇各自所对应的相交后类簇集合求并、得到两个聚类结果的相交后聚类结果。As a further improvement of the power load curve reconstruction method: in step 5, for any two clustering results, the intersection process is: traverse the clusters in the first clustering result, for each cluster: intersect it with all the clusters in the second clustering result respectively, and then combine the intersection results into the intersection cluster set corresponding to the cluster; then merge the intersection cluster sets corresponding to all the original clusters in the first clustering result to obtain the intersection clustering result of the two clustering results.

作为所述的电力负荷曲线重构方法的进一步改进，步骤7中，线性插值过程为：假设在相邻的数据点之间插入个数据点，前一个数据点是该行插值前第个数据点、值为，后一个数据点是该行插值前第个数据点、值为，则插入的第个新数据点的值为。 As a further improvement of the power load curve reconstruction method, in step 7, the linear interpolation process is: assuming that the linear interpolation between adjacent data points is data points, the previous data point is the interpolation value before the row data points, with values , the next data point is the first data point before the interpolation of the line data points, with values , then the inserted The value of the new data point is .

作为所述的电力负荷曲线重构方法的进一步改进，步骤7中，线性插值过程为：As a further improvement of the power load curve reconstruction method, in step 7, the linear interpolation process is:

对于任一信息表，假设当前进行插值的行对应集合中的某子类簇中的一个元素，将该子类簇中所有元素对应的行作为信息表的操作行； For any information table, assume that the row currently being interpolated corresponds to the set A subclass cluster in An element in the subclass cluster The rows corresponding to all elements in are used as the operation rows of the information table;

在对上述操作行中任意两个相邻数据点之间的区域进行插值时，设前一个数据点是该行插值前第个数据点，后一个数据点是该行插值前第个数据点，则设置搜索范围为本信息表内上述所有操作行中原有第个数据点及第个数据点的值；When interpolating the area between any two adjacent data points in the above operation row, let the previous data point be the first data point before the interpolation of the row. data points, and the next data point is the first data point before the interpolation of the row data points, then set the search range to the original Data points and The value of the data point;

沿从第个数据点至第个数据点的方向顺序插入插值点时，将搜索范围中与前一个原有数据点或插值点的值的欧式距离第二近的值作为当前插值点的值，直至完成所有操作行的近邻插值操作。 Along from Data points to When inserting the interpolation point in the direction of the data points, the value with the second closest Euclidean distance to the previous original data point or interpolation point in the search range is used as the value of the current interpolation point until the nearest neighbor interpolation operation of all operation rows is completed.

作为所述的电力负荷曲线重构方法的进一步改进，步骤8中，反归一化过程为：As a further improvement of the power load curve reconstruction method, in step 8, the denormalization process is:

设某信息表插值后第行中的第个数据点值为，则该点反归一化后的数值为： Suppose that after interpolation of a certain information table The first The data point value is , then the value of the point after denormalization is:

； ;

其中，和分别为该行归一化之前的数据点最大值和最小值。 in, and They are the maximum and minimum values of the data points in this row before normalization.

相对于现有技术，本发明具有以下积极效果：Compared with the prior art, the present invention has the following positive effects:

本发明通过模态分解将原始曲线分解为多个频率分量，保证曲线非平稳性和非线性的适应性，再通过分段处理精细地捕捉到各分量各段的特征，然后通过聚类和交叉分类将各段分为线性插值和近邻插值两种情况，充分利用各个分量的特征信息挖掘曲线全局和局部特征，选择各段最适合的插值方法分别插值。本发明相对于单一的插值方法，可以更好地适应曲线的复杂变化，大大提高重构的准确性；相对于有监督学习或深度学习方法，本发明不需要训练数据，也不需要参照其它大规模数据样本进行复杂的计算，更加节省计算资源。The present invention decomposes the original curve into multiple frequency components through modal decomposition to ensure the non-stationarity and nonlinear adaptability of the curve, and then finely captures the characteristics of each component and each segment through segment processing, and then divides each segment into two cases of linear interpolation and nearest neighbor interpolation through clustering and cross-classification, making full use of the characteristic information of each component to mine the global and local characteristics of the curve, and selecting the most suitable interpolation method for each segment for interpolation. Compared with a single interpolation method, the present invention can better adapt to the complex changes of the curve and greatly improve the accuracy of reconstruction; compared with supervised learning or deep learning methods, the present invention does not require training data, nor does it require complex calculations with reference to other large-scale data samples, which saves more computing resources.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1为本发明方法的流程图。FIG. 1 is a flow chart of the method of the present invention.

具体实施方式DETAILED DESCRIPTION

下面结合附图详细说明本发明的技术方案。显然，所描述的实施例仅仅是本发明一部分实施例，而不是全部的实施例。The technical solution of the present invention is described in detail below in conjunction with the accompanying drawings. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments.

如图1，一种电力负荷曲线重构方法，步骤包括：As shown in FIG1 , a method for reconstructing a power load curve includes the following steps:

步骤1、对原始电力负荷曲线中进行预处理。具体包括，删除异常数据（如变化幅值明显过大的数据），并对缺失的数据进行填充（可采用线性插值等方式）。Step 1: Preprocess the original power load curve, including deleting abnormal data (such as data with obviously large change amplitude) and filling in missing data (using linear interpolation or other methods).

一般地，原始电力负荷曲线的数据频度选择15分钟、1小时或1日均可，本实施例以1小时频度，大约7日的数据为例。Generally, the data frequency of the original power load curve can be 15 minutes, 1 hour or 1 day. This embodiment takes the data of about 7 days with a frequency of 1 hour as an example.

步骤2、通过变分模态分解（VMD）将原始电力负荷曲线分解为个固有模态分量子序列：，其中为第个固有模态分量子序列。本实施例中分解为8个分量。 Step 2: Decompose the original power load curve into A subsequence of intrinsic mode components: ,in For the In this embodiment, the eigenmode component subsequences are decomposed into 8 components.

VMD技术的优势在于其自适应性，可以根据实际信号的特点确定模态分解的个数，并在分解过程中自适应地匹配每种模态的最佳中心频率和有限带宽，可以有效的避免模态分解过程中出现模态混叠。The advantage of VMD technology lies in its adaptability. It can determine the number of modal decompositions according to the characteristics of the actual signal, and adaptively match the optimal center frequency and limited bandwidth of each mode during the decomposition process, which can effectively avoid modal aliasing during the modal decomposition process.

步骤3、对所有固有模态分量子序列分别通过拆分的方式进行空间重构，得到各固有模态分量子序列所分别对应的信息表，可记为。信息表中的各行依次对应固有模态分量子序列中按序分割的各曲线段，信息表中的每一行包含对应的曲线段中所有顺序排列的、归一化后的数据点。 Step 3: Reconstruct all intrinsic modal component subsequences by splitting them into two parts, and obtain the information table corresponding to each intrinsic modal component subsequence, which can be recorded as Each row in the information table corresponds to each curve segment divided in sequence in the natural mode component subsequence, and each row in the information table contains all the normalized data points in the corresponding curve segment in sequence.

设各信息表种均包含行，每一行均包含个数据点，表示信息表中的第行中的个数据点所构成的行向量。 Assume that each information table contains rows, each containing data points, Indicates the information table In the row The row vector of data points.

。 .

步骤4、对于各信息表，分别对其中的行向量进行聚类，从而得到所有固有模态分量子序列所各自对应的行向量聚类结果。Step 4: For each information table, cluster the row vectors therein respectively, so as to obtain the row vector clustering results corresponding to all the intrinsic mode component subsequences.

本实施例中，共有8个固有模态分量子序列，每个固有模态分量子序列中包含9个行向量，设第个固有模态分量子序列的行向量聚类结果为，则各固有模态分量子序列的行向量聚类结果可分别表示为： In this embodiment, there are 8 intrinsic modal component subsequences, each of which contains 9 row vectors. The row vector clustering result of the intrinsic mode component subsequences is , then the row vector clustering results of each intrinsic mode component subsequence can be expressed as:

； ;

..........

。 .

本实施例中，采用k-means进行聚类。In this embodiment, k-means is used for clustering.

步骤5、对所有固有模态分量子序列的行向量聚类结果进行交叉聚类，得到最终聚类结果。Step 5: Perform cross clustering on the row vector clustering results of all intrinsic mode component subsequences to obtain the final clustering results.

所述交叉聚类的过程为：将第一个固有模态分量子序列的行向量聚类结果和第二个固有模态分量子序列的行向量聚类结果相交得到的聚类结果作为第二个固有模态分量子序列所对应的相交后聚类结果，从第三个固有模态分量子序列开始，将当前固有模态分量子序列的聚类结果与前一个固有模态分量子序列的相交后聚类结果进行相交，得到当前固有模态分量子序列的相交后聚类结果，将最后一个固有模态分量子序列的相交后聚类结果作为所述最终聚类结果。The cross-clustering process is as follows: the clustering result obtained by intersecting the row vector clustering result of the first eigenmodal component subsequence and the row vector clustering result of the second eigenmodal component subsequence is used as the intersected clustering result corresponding to the second eigenmodal component subsequence; starting from the third eigenmodal component subsequence, the clustering result of the current eigenmodal component subsequence is intersected with the intersected clustering result of the previous eigenmodal component subsequence to obtain the intersected clustering result of the current eigenmodal component subsequence; and the intersected clustering result of the last eigenmodal component subsequence is used as the final clustering result.

其中，对于任意两个聚类结果，其相交过程为：遍历第一个聚类结果中的类簇，对于每个类簇：将其与第二个聚类结果中的所有类簇分别相交，再将各相交结果组合成为该类簇对应的相交后类簇集合；然后将第一个聚类结果中的原所有类簇各自所对应的相交后类簇集合求并、得到两个聚类结果的相交后聚类结果。Among them, for any two clustering results, the intersection process is: traverse the clusters in the first clustering result, for each cluster: intersect it with all the clusters in the second clustering result respectively, and then combine the intersection results into the intersection cluster set corresponding to the cluster; then merge the intersection cluster sets corresponding to all the original clusters in the first clustering result to obtain the intersection clustering result of the two clustering results.

例如，上述和的相交过程为：先将中的第一个类簇与中的3个类簇分别相交，得到的3个相交结果分别为、空集、空集，中的第一个类簇对应的相交后类簇集合为。同理，中的第二个类簇与中的3个类簇分别相交，得到的相交后类簇集合为，中的第二个类簇的相交后类簇集合为，和的相交后聚类结果为。再将与相交得到，将与相交得到，依次类推得到最终聚类结果。 For example, the above and The intersection process is: first The first cluster in and The three clusters in intersect with each other, and the three intersection results are , empty set, empty set, The set of clusters after intersection corresponding to the first cluster in is Similarly, The second cluster in and The three clusters in intersect with each other, and the obtained cluster set after intersection is , The set of clusters after the intersection of the second cluster in is , and The clustering result after intersection is . Then and Intersect ,Will and Intersect , and so on to get the final clustering result .

步骤6、按最终聚类结果中各类簇的元素数量将各类簇划分至集合和集合：如果最终聚类结果中某类簇的元素数量为1，则将该类簇作为集合的子类簇，否则将该类簇作为集合的子类簇。 Step 6: Divide each cluster into sets according to the number of elements in each cluster in the final clustering result and collection ： If the number of elements in a cluster in the final clustering result is 1, then the cluster is taken as a set , otherwise the cluster is treated as a set subclass cluster of .

本实施例中，设集合，集合。显然，集合中所有子类簇的并集即为。 In this embodiment, set ,gather Obviously, the set The union of all subclass clusters in is .

步骤7、对所有信息表的各行分别进行插值，如果某一行对应集合中的某子类簇的元素，则该行在相邻的数据点之间采用线性插值；如果某一行对应集合中的某子类簇的其中一个元素，则该行在相邻的数据点之间采用近邻插值。 Step 7: Interpolate each row of all information tables. If a row corresponds to a set If a row corresponds to a subclass cluster in the set, the row uses linear interpolation between adjacent data points; If a row is an element of a subclass cluster in , the nearest neighbor interpolation is used between adjacent data points.

（1）线性插值过程为：(1) The linear interpolation process is:

假设在相邻的数据点之间插入个数据点，前一个数据点是该行插值前第个数据点、值为，后一个数据点是该行插值前第个数据点、值为，则插入的第个新数据点的值为。 Assume that we interpolate between adjacent data points data points, the previous data point is the interpolation value before the row data points, with values , the next data point is the first data point before the interpolation of the line data points, with values , then the inserted The value of the new data point is .

（2）近邻插值过程为：(2) The nearest neighbor interpolation process is:

对于任一信息表，假设当前进行插值的行对应集合中的某子类簇中的一个元素，将该子类簇中所有元素对应的行作为信息表的操作行。例如子类簇，则将第2行和第6行作为操作行。 For any information table, assume that the row currently being interpolated corresponds to the set A subclass cluster in An element in the subclass cluster The rows corresponding to all elements in the table are used as operation rows of the information table. For example, the subclass cluster , then the 2nd and 6th rows are used as operation rows.

在对上述操作行中任意两个相邻数据点之间的区域进行插值时，设前一个数据点是该行插值前第个数据点，后一个数据点是该行插值前第个数据点，则设置搜索范围为本信息表内上述所有操作行中原有第个数据点及第个数据点的值，本实施例中，假设在第3个数据点和第4个数据点之间插值，则搜索范围为第2行和第6行中的第3 个数据点和第4个数据点的值，共4个值。沿从第个数据点至第个数据点的方向顺序插入插值点时，将搜索范围中与前一个原有数据点或插值点的值的欧式距离第二近的值作为当前插值点的值，直至完成所有操作行的近邻插值操作。例如，在第2行第3个数据点和第4个数据点之间插入第1个插值点时，将搜索范围中与该行第3个数据点（原有）的值的欧式距离第二近的值作为当前插值点的值，插入第2个插值点时，将搜索范围中与前述第1个插值点的值的欧式距离第二近的值作为当前插值点的值，依次类推。 When interpolating the area between any two adjacent data points in the above operation row, let the previous data point be the first data point before the interpolation of the row. data points, and the next data point is the first data point before the interpolation of the row data points, then set the search range to the original Data points and In this embodiment, it is assumed that the interpolation is performed between the third and fourth data points, and the search range is the values of the third and fourth data points in the second and sixth rows, for a total of four values. Data points to When inserting interpolation points in the direction of the data points, the value in the search range that is the second closest to the value of the previous original data point or interpolation point in the Euclidean distance is used as the value of the current interpolation point until the nearest neighbor interpolation operation of all operation rows is completed. For example, when inserting the first interpolation point between the third and fourth data points in the second row, the value in the search range that is the second closest to the value of the third data point (original) in the row is used as the value of the current interpolation point. When inserting the second interpolation point, the value in the search range that is the second closest to the value of the first interpolation point in the Euclidean distance is used as the value of the current interpolation point, and so on.

其中，反归一化过程为：The denormalization process is:

。 .

需要说明的是，对于本领域技术人员而言，显然本发明不限于上述示范性实施例的细节，而且在不背离本发明的精神或基本特征的情况下，能够以其他的具体形式实现本发明。It should be noted that, for those skilled in the art, it is obvious that the present invention is not limited to the details of the above exemplary embodiments, and the present invention can be implemented in other specific forms without departing from the spirit or essential features of the present invention.

Claims

1. A method of reconstructing an electrical load curve, comprising the steps of:

Step 1, preprocessing an original power load curve;

step 2, decomposing the original power load curve into components through variation modal decomposition A sub-sequence of intrinsic modal components;

step 3, carrying out space reconstruction on all the intrinsic mode component subsequences in a splitting mode to obtain information tables corresponding to the intrinsic mode component subsequences respectively; each row in the information table sequentially corresponds to each curve segment which is divided in sequence in the inherent modal component subsequence, and each row in the information table comprises all sequentially arranged normalized data points in the corresponding curve segment, namely, each row corresponds to a row vector consisting of the data points;

Step 4, clustering the row vectors in each information table to obtain Row vector clustering results corresponding to the intrinsic mode component sub-sequences respectively;

step 5, cross-clustering is carried out on row vector clustering results of all the intrinsic mode component subsequences to obtain a final clustering result;

step 6, dividing each cluster into sets according to the element number of each cluster in the final clustering result Sum set: If the element number of a cluster of a certain class in the final clustering result is 1, the cluster of the class is taken as a setIf not, the cluster is used as a setIs a sub-cluster of (2);

Step 7, respectively interpolating each row of all information tables, if a row corresponds to a set If the row is a sub-cluster element, then the row adopts linear interpolation between adjacent data points; if a certain line corresponds to a setIf one element of a sub-cluster in the row is a sub-cluster, the row adopts neighbor interpolation between adjacent data points;

And 8, respectively carrying out inverse normalization on each row in the information table after interpolation, then forming the data points of each row in the information table into curve segments corresponding to the row, sequentially forming the curve segments of each row in the information table into component reconstruction curves of the intrinsic mode component subsequence corresponding to the information table, and finally superposing and merging all the component reconstruction curves to obtain a final reconstruction curve.

2. The electrical load curve reconstruction method as set forth in claim 1, wherein: in step 1, preprocessing includes deleting abnormal data and filling the deleted data.

3. The electrical load curve reconstruction method as set forth in claim 1, wherein: in step 3, each information table is provided to includeRows, each row comprisingData points;

The normalization refers to normalization according to rows, and the first is set Line 1The original value of the data point isThe normalized value at this point is:

。

4. The electrical load curve reconstruction method as set forth in claim 1, wherein: in step 5, the cross clustering process is as follows: and taking a clustering result obtained by intersecting the row vector clustering result of the first natural mode component sub-sequence and the row vector clustering result of the second natural mode component sub-sequence as an intersecting clustering result corresponding to the second natural mode component sub-sequence, starting from the third natural mode component sub-sequence, intersecting the clustering result of the current natural mode component sub-sequence with the intersecting clustering result of the previous natural mode component sub-sequence to obtain an intersecting clustering result of the current natural mode component sub-sequence, and taking the intersecting clustering result of the last natural mode component sub-sequence as the final clustering result.

5. The electrical load curve reconstruction method as set forth in claim 4, wherein: in step 5, for any two clustering results, the intersecting process is as follows: traversing the class clusters in the first clustering result, and for each class cluster: intersecting the clustering result with all class clusters in the second clustering result respectively, and combining the intersecting results into intersected class clusters corresponding to the class clusters; and then merging the intersected cluster groups corresponding to all the original cluster groups in the first clustering result to obtain intersected clustering results of the two clustering results.

6. The method of reconstructing an electrical load curve according to claim 1, wherein in step 7, the linear interpolation process is: suppose that interpolation between adjacent data pointsData point, the previous data point is the first data point before interpolation of the lineThe data points and values areThe next data point is the first data point before the row interpolationThe data points and values areThen insert the firstThe value of each new data point is。

7. The method of reconstructing an electrical load curve according to claim 1, wherein in step 7, the linear interpolation process is:

For any information table, assume that the row currently being interpolated corresponds to a set Some sub-cluster of (a)One element of the subclass clusterThe row corresponding to all elements in the information table is used as an operation row of the information table;

in the case of interpolating the region between any two adjacent data points in the operation line, the previous data point is set to be the first data point before the line is interpolated Data point, the latter data point is the first data point before the line interpolationData points, the searching range is set as the original first operation line in the information tableData point and the firstValues of the individual data points;

Edge slave first Data points to the firstWhen the direction of the data points is sequentially inserted into the interpolation points, the value which is the second closest to the Euclidean distance of the value of the previous original data point or the interpolation point in the search range is taken as the value of the current interpolation point until the neighbor interpolation operation of all operation lines is completed.

8. The method for reconstructing an electrical load curve according to any one of claims 1 to 7, wherein in step 8, the inverse normalization process is:

Interpolation of a certain information table is set Line 1Data point value isThe value after the inverse normalization of the point is:

；

Wherein, AndThe maximum and minimum values of the data points before normalization of the line, respectively.