[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN116227738A - Method and system for predicting traffic interval of power grid customer service - Google Patents

Method and system for predicting traffic interval of power grid customer service Download PDF

Info

Publication number
CN116227738A
CN116227738A CN202310483798.9A CN202310483798A CN116227738A CN 116227738 A CN116227738 A CN 116227738A CN 202310483798 A CN202310483798 A CN 202310483798A CN 116227738 A CN116227738 A CN 116227738A
Authority
CN
China
Prior art keywords
data
telephone traffic
traffic
complete
influence factor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310483798.9A
Other languages
Chinese (zh)
Other versions
CN116227738B (en
Inventor
伍广斌
苏立伟
蒋崇颖
覃浩
康峰
林楷东
陈海燕
谭火超
王帅
张艳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Power Grid Co Ltd
Customer Service Center of Guangdong Power Grid Co Ltd
Original Assignee
Guangdong Power Grid Co Ltd
Customer Service Center of Guangdong Power Grid Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Power Grid Co Ltd, Customer Service Center of Guangdong Power Grid Co Ltd filed Critical Guangdong Power Grid Co Ltd
Priority to CN202310483798.9A priority Critical patent/CN116227738B/en
Publication of CN116227738A publication Critical patent/CN116227738A/en
Application granted granted Critical
Publication of CN116227738B publication Critical patent/CN116227738B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/01Customer relationship services
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y04INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
    • Y04SSYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
    • Y04S10/00Systems supporting electrical power generation, transmission or distribution
    • Y04S10/50Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Economics (AREA)
  • Data Mining & Analysis (AREA)
  • Strategic Management (AREA)
  • Mathematical Analysis (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Mathematical Optimization (AREA)
  • Human Resources & Organizations (AREA)
  • Development Economics (AREA)
  • Operations Research (AREA)
  • Probability & Statistics with Applications (AREA)
  • Game Theory and Decision Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Finance (AREA)
  • Tourism & Hospitality (AREA)
  • Evolutionary Biology (AREA)
  • Accounting & Taxation (AREA)
  • Algebra (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Quality & Reliability (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Telephonic Communication Services (AREA)
  • Supply And Distribution Of Alternating Current (AREA)

Abstract

The invention discloses a method and a system for predicting a traffic interval of power grid customer service, which are used for carrying out similar daily clustering on the traffic influence factor data after the missing value is supplemented by acquiring historical traffic data and traffic influence factor data. After feature extraction is carried out on the processed data and the clustering result by using the prediction model to obtain feature data, a quantile regression model is adopted to calculate the feature data to obtain a condition quantile, a non-parameter kernel density estimation method is adopted to calculate the condition quantile to obtain an interval prediction result, and the interval prediction result is sent to a power grid customer service scheduling system so that the power grid customer service scheduling system carries out telephone traffic scheduling according to the interval prediction result. According to the method, the telephone traffic probability of the power grid customer service is predicted by combining telephone traffic influencing factors, so that the effective quantification of the uncertainty of the prediction of a modern power supply service system can be realized, and the accuracy of the prediction of the telephone traffic interval of the power grid customer service is improved.

Description

Method and system for predicting traffic interval of power grid customer service
Technical Field
The invention relates to the field of traffic interval prediction, in particular to a method and a system for predicting a power grid customer service traffic interval.
Background
With the continuous development of society, the electric power customer service center is continuously enlarged in scale, plays an increasingly important role as an important bridge for enterprises to communicate with clients, and the operation management mode of the electric power customer service center is also required to be continuously updated along with the social and economic development. The current telephone traffic prediction management application of the electric power customer service center mainly depends on historical experience, and has the problems of low scheduling efficiency, poor fitting degree between a required value and a scheduling person, large human resource input, unattainable service level and the like. The traditional telephone traffic prediction technology cannot adapt to the complex service scene containing the value-added service in the modern power supply service at present, so that the utilization of human resources of the telephone traffic service is unreasonable. At present, main traffic prediction models comprise inertial prediction, kalman filtering, traffic OLAP analysis and the like, and all belong to point prediction.
The interval prediction can realize the effective quantification of the prediction uncertainty of a modern power supply service system, and can provide more comprehensive prediction information compared with the classical prediction uncertainty, thereby providing key data support for analysis and decision-making of an electric power customer service system.
Disclosure of Invention
The invention provides a method and a system for predicting a power grid customer service traffic interval, which can realize the effective quantification of the prediction uncertainty of a modern power supply service system and improve the accuracy of the power grid customer service traffic interval prediction.
In order to solve the above technical problems, an embodiment of the present invention provides a method for predicting a traffic interval of a customer service of a power grid, including:
acquiring historical telephone traffic data and telephone traffic influence factor data, and supplementing missing values of the historical telephone traffic data and the telephone traffic influence factor data to obtain complete historical telephone traffic data and complete telephone traffic influence factor data;
performing similar daily clustering on the complete telephone traffic influence factor data to obtain a clustering result of the complete telephone traffic influence factor data;
inputting the clustering results of the complete historical telephone traffic data and the complete telephone traffic influence factor data into a constructed prediction model, carrying out feature extraction on the clustering results of the complete historical telephone traffic data and the complete telephone traffic influence factor data by the prediction model to obtain feature data, calculating the feature data by adopting a quantile regression model to obtain the conditional quantiles of different quantile points, calculating the conditional quantiles by adopting a non-parameter kernel density estimation method to obtain an interval prediction result, and sending the interval prediction result to a grid customer service scheduling system so that the grid customer service scheduling system carries out telephone traffic scheduling according to the interval prediction result.
According to the embodiment, historical telephone traffic data and telephone traffic influence factor data are obtained, missing value supplementation is carried out on the historical telephone traffic data and telephone traffic influence factor data, complete historical telephone traffic data and complete telephone traffic influence factor data are obtained, similar daily clustering is carried out on the complete telephone traffic influence factor data to obtain a clustering result of the complete telephone traffic influence factor data, the clustering result of the complete historical telephone traffic data and the complete telephone traffic influence factor data is input into a built prediction model, after feature extraction is carried out on the clustering result of the complete historical telephone traffic data and the complete telephone traffic influence factor data by the prediction model to obtain feature data, a quantile regression model is adopted to calculate the feature data to obtain a conditional quantile, then a non-parameter kernel density estimation method is adopted to calculate the conditional quantile to obtain an interval prediction result, and the interval prediction result is sent to a grid customer service scheduling system, so that the grid customer service scheduling system carries out telephone traffic scheduling according to the interval prediction result. According to the method, the telephone traffic probability of the power grid customer service is predicted by combining telephone traffic influencing factors, so that the effective quantification of the uncertainty of the prediction of a modern power supply service system can be realized, and the accuracy of the prediction of the telephone traffic interval of the power grid customer service is improved.
As a preferred scheme, missing value supplementation is carried out on historical telephone traffic data and telephone traffic influencing factor data to obtain complete historical telephone traffic data and complete telephone traffic influencing factor data, which concretely comprises the following steps:
obtaining an approximation of historical traffic data and an approximation of traffic impact factor data by fitting a polynomial using adjacent normal data;
and supplementing the approximate value as a missing value to the historical telephone traffic data and the telephone traffic influencing factor data to obtain complete historical telephone traffic data and complete telephone traffic influencing factor data.
According to the embodiment, the approximation value of the historical telephone traffic data and the approximation value of the telephone traffic influence factor data are obtained by utilizing the adjacent normal data fitting polynomials, the approximation value is used as a missing value to be supplemented into the historical telephone traffic data and the telephone traffic influence factor data, the complete historical telephone traffic data and the complete telephone traffic influence factor data are obtained, the missing value supplementation is carried out on the data, the abnormal value of the data is cleaned, the integrity and the accuracy of the data are guaranteed, and the prediction precision is improved.
As a preferred scheme, the complete telephone traffic influence factor data is clustered on a similar day to obtain a clustering result of the complete telephone traffic influence factor data, specifically:
after normalizing the complete telephone traffic influence factor data, selecting a plurality of center points;
and distributing the data of each complete telephone traffic influencing factor to a central point smaller than a preset distance to obtain a plurality of clusters, and recalculating the central point of each cluster to serve as a clustering result of the data of the complete telephone traffic influencing factor.
As a preferred scheme, the quantile regression model is adopted to calculate the characteristic data to obtain the conditional quantiles of different quantile points, and the method specifically comprises the following steps:
solving the characteristic data by using a quantile regression function to obtain regression coefficient vectors of different quantile points, wherein the quantile regression function is as follows:
Figure SMS_1
wherein ,
Figure SMS_2
representing dependent variablesP p Is the first of (2)τThe number of digits of the individual condition is calculated,τthe range of (1, 0),β(τ) Is a vector of regression coefficients, +.>
Figure SMS_3
Represented as feature data;
solving regression coefficient vectors of different quantile points to obtain conditional quantile of the different quantile points, wherein a solving formula is as follows:
Figure SMS_4
wherein L is the neural network loss function value,
Figure SMS_5
represented as characteristic data>
Figure SMS_6
Expressed as an asymmetric function:
Figure SMS_7
where s is the argument of the function γ.
As a preferred scheme, a non-parameter kernel density estimation method is adopted to calculate the conditional quantile to obtain an interval prediction result, and the method specifically comprises the following steps:
calculating the conditional quantiles by using a non-parameter kernel density estimation function to obtain an interval prediction result, wherein the kernel density estimation function is as follows:
Figure SMS_8
wherein ,V(u a ) For the probability distribution of the traffic on the a-th day to be predicted,u a as an independent variable of the probability function,N a for a similar number of days for the a moments to be predicted,hfor the bandwidth, takeh=0.01,K() As a kernel function, taking the kernel function as a Gaussian function,G P in the case of a vector of samples,G P =[
Figure SMS_9
,
Figure SMS_10
,
Figure SMS_11
…,
Figure SMS_12
]。
preferably, the prediction model is obtained through training, specifically:
dividing the clustering result of the complete historical telephone traffic data and the complete telephone traffic influence factor data into a training set and a testing set;
constructing a prediction model, and inputting the clustering result of the complete historical telephone traffic data and the complete telephone traffic influence factor data into the prediction model for calculation to obtain an actual interval prediction result;
and comparing the actual interval prediction result with the expected interval prediction result according to the error function to obtain an error, and updating model parameters according to the error until the training times are reached to obtain a trained prediction model.
In order to solve the same technical problems, the embodiment of the invention also provides a system for predicting the traffic interval of the power grid customer service, which comprises a data acquisition module, a clustering module and a prediction interval calculation module,
the data acquisition module is used for acquiring historical telephone traffic data and telephone traffic influence factor data, and supplementing missing values of the historical telephone traffic data and the telephone traffic influence factor data to obtain complete historical telephone traffic data and complete telephone traffic influence factor data;
the clustering module is used for carrying out similar daily clustering on the complete telephone traffic influence factor data to obtain a clustering result of the complete telephone traffic influence factor data;
the prediction interval calculation module is used for inputting the clustering results of the complete historical telephone traffic data and the complete telephone traffic influence factor data into a constructed prediction model, so that after the prediction model performs feature extraction on the clustering results of the complete historical telephone traffic data and the complete telephone traffic influence factor data to obtain feature data, the feature data is calculated by adopting a quantile regression model to obtain the conditional quantiles of different quantiles, then the conditional quantiles are calculated by adopting a non-parameter kernel density estimation method to obtain an interval prediction result, and the interval prediction result is sent to the grid customer service scheduling system so that the grid customer service scheduling system performs telephone traffic scheduling according to the interval prediction result.
Preferably, the data acquisition module comprises an approximation calculation unit and a filling unit,
the approximate value calculating unit is used for obtaining the approximate value of the historical telephone traffic data and the approximate value of the telephone traffic influencing factor data by fitting a polynomial by using adjacent normal data;
the filling unit is used for supplementing the approximate value as a missing value to the historical telephone traffic data and the telephone traffic influencing factor data to obtain complete historical telephone traffic data and complete telephone traffic influencing factor data.
Preferably, the clustering module comprises a central point selecting unit and a clustering result generating unit,
the central point selecting unit is used for selecting a plurality of central points after normalizing the complete telephone traffic influencing factor data;
the clustering result generating unit is used for distributing the telephone traffic influence factor data of each complete telephone traffic influence factor data to a central point smaller than a preset distance to obtain a plurality of clusters and recalculate the central point of each cluster to be used as a clustering result of the complete telephone traffic influence factor data.
Drawings
Fig. 1: a flow diagram of one embodiment of the power grid customer service traffic interval prediction method provided by the invention;
fig. 2: the telephone traffic similar day clustering result schematic diagram of one embodiment of the telephone traffic interval prediction method of the power grid customer service provided by the invention;
fig. 3: the invention provides a system structure schematic diagram of another embodiment of a power grid customer service traffic interval prediction method.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Example 1
Referring to fig. 1, in order to provide a method for predicting a traffic interval of a customer service in a power grid according to an embodiment of the present invention, the method for predicting a traffic interval of a customer service in a power grid includes steps 101 to 104, where the steps are as follows:
step 101: and acquiring historical telephone traffic data and telephone traffic influence factor data, and supplementing missing values of the historical telephone traffic data and the telephone traffic influence factor data to obtain complete historical telephone traffic data and complete telephone traffic influence factor data.
Optionally, missing value supplementation is performed on historical traffic data and traffic volume influence factor data to obtain complete historical traffic data and complete traffic volume influence factor data, which specifically comprises:
obtaining an approximation of historical traffic data and an approximation of traffic impact factor data by fitting a polynomial using adjacent normal data;
and supplementing the approximate value as a missing value to the historical telephone traffic data and the telephone traffic influencing factor data to obtain complete historical telephone traffic data and complete telephone traffic influencing factor data.
In this embodiment, the historical traffic data is derived from the grid customer service system on an hour scale, two years, and one hour intervals. Then, the historical weather information, the historical power failure information and the holiday condition are led out from a weather bureau officer network, wherein the historical weather information specifically comprises rainfall (light rain, medium rain and heavy rain), snowfall (light snow, medium snow and heavy snow), wind power (1-9 grades and typhoons), temperature (high-temperature yellow early warning, high Wen Chengse early warning and high-temperature red early warning); the historical power outage information specifically comprises the power outage type (scheduled maintenance power outage and fault temporary power outage), the power outage duration and the number of users involved in the power outage range, and the holiday situation specifically comprises national legal holidays and weekends.
Then, a Lagrange interpolation method is adopted, and an approximation value is obtained by fitting a polynomial by using adjacent normal data to supplement the missing value, specifically:
missing value replenishment is performed by fitting polynomials with neighboring normal data to obtain approximations:
Figure SMS_13
wherein ,
Figure SMS_14
is the firstiDay missing data timing of the day, +.>
Figure SMS_15
Is the firstiData missing on day.
Interpolation polynomial
Figure SMS_16
The construction process is as follows: taking the missing data for two yearsnData points>
Figure SMS_17
As a result ofnPersonal (S)n-polynomial of degree 1->
Figure SMS_18
Figure SMS_19
wherein ,
Figure SMS_20
is the firstiData of day loss, ->
Figure SMS_21
Is the firstiData missing on day.
Finally obtaining interpolation functionyAccording to the interpolation function, the approximation value is obtained by calculation, and the expression of the interpolation function is as follows:
Figure SMS_22
wherein ,
Figure SMS_23
is the firstiData missing on day,/->
Figure SMS_24
Representingn-polynomial of degree 1.
Step 102: and carrying out similar daily clustering on the complete telephone traffic influence factor data to obtain a clustering result of the complete telephone traffic influence factor data.
Optionally, similar daily clustering is performed according to the traffic influencing factor data to obtain a traffic influencing factor data clustering result, which specifically comprises:
after the telephone traffic influencing factor data are standardized, a plurality of centers are randomly selected;
and carrying out iterative computation on the daily traffic volume influence factor data by using a preset loss function to obtain a plurality of clusters, and calculating the central point of each cluster.
In this embodiment, K-means is used to perform similar day clustering based on the data characteristics of traffic volume influencing factor variables.
First, normalization processing is performed on the original data:
Figure SMS_25
wherein ,
Figure SMS_26
for the original data +.>
Figure SMS_27
Is normalized data.
Randomly selectqCenter, and define a loss function, wherein,qthe centers are denoted as
Figure SMS_28
The defined loss function is:
Figure SMS_29
order the
Figure SMS_30
For the number of iterative steps, the following procedure is repeated until convergence: sample->
Figure SMS_31
It is assigned to the center nearest to it +.>
Figure SMS_32
Figure SMS_33
For the center of each class, the center point is recalculated, with the calculation formula:
Figure SMS_34
wherein ,
Figure SMS_35
representing sample data, ++>
Figure SMS_36
Representing a randomly selected center point.
Finally, obtaining a plurality of clusters and recalculating the central point of each cluster, as a clustering result of the complete traffic influencing factor data, wherein the clustering result is shown in figure 2, the approximation value of the historical traffic data and the approximation value of the traffic influencing factor data are obtained by utilizing the fitting polynomial of adjacent normal data, the approximation value is used as a missing value to be supplemented into the historical traffic data and the traffic influencing factor data, the complete historical traffic data and the complete traffic influencing factor data are obtained, the missing value supplementation is carried out on the data, and meanwhile, the abnormal value of the data is cleaned, so that the integrity and the accuracy of the data are guaranteed, and the prediction accuracy is improved.
Step 103: inputting the clustering results of the complete historical telephone traffic data and the complete telephone traffic influence factor data into a constructed prediction model, carrying out feature extraction on the clustering results of the complete historical telephone traffic data and the complete telephone traffic influence factor data by the prediction model to obtain feature data, calculating the feature data by adopting a quantile regression model to obtain the conditional quantiles of different quantile points, calculating the conditional quantiles by adopting a non-parameter kernel density estimation method to obtain an interval prediction result, and sending the interval prediction result to a grid customer service scheduling system so that the grid customer service scheduling system carries out telephone traffic scheduling according to the interval prediction result.
Optionally, the quantile regression model is adopted to calculate the feature data to obtain the conditional quantiles of different quantile points, which specifically comprises the following steps:
solving the characteristic data by using a quantile regression function to obtain regression coefficient vectors of different quantile points, wherein the quantile regression function is as follows:
Figure SMS_37
wherein ,
Figure SMS_38
representing dependent variablesP p Is the first of (2)τThe number of digits of the individual condition is calculated,τthe range of (1, 0),β(τ) Is a vector of regression coefficients, +.>
Figure SMS_39
Represented as feature data;
solving regression coefficient vectors of different quantile points to obtain conditional quantile of the different quantile points, wherein a solving formula is as follows:
Figure SMS_40
wherein L is the neural network loss function value,
Figure SMS_41
represented as characteristic data>
Figure SMS_42
Expressed as an asymmetric function:
Figure SMS_43
where s is the argument of the function γ.
In this embodiment, the historical traffic data and the traffic influencing factor data sequence are input into the prediction model, and it should be noted that, the prediction model may be preferably constructed based on a long-short memory neural network, and the long-short memory neural network is adopted to process the traffic data on similar days, where the processing steps are as follows:
time of daytThe input of the corresponding neuron has three parts: at this point in timepTraffic data of (a)z p Output at the previous timeo p-1 State value of previous timeS p-1 At the same time, time of daypWill also output the neurono p Sum state valueS p To the next neuron;
the long-term memory neural network introduces three control gates: input doord p Output doore p Forgetful doorf p . The three control gates are all between 0 and 1]The coefficients of the interval and the calculation formulas of the three control gates are as follows:
Figure SMS_44
Figure SMS_45
Figure SMS_46
wherein ,W dW eW f respectively a weight matrix of three control gates,b db eb f respectively, the corresponding offset amounts are set,
Figure SMS_47
is a sigmoid function and then based on the input at the current timez p And the output of the last timeo p-1 To calculate candidate state value of the current neuron +.>
Figure SMS_48
, wherein ,
Figure SMS_49
The expression of (2) is:
Figure SMS_50
wherein ,W sb s respectively representing a weight matrix and a bias of the candidate state;
the state value at the current moment is obtained from the state value at the previous moment and the current candidate state value and is obtained from a forgetting doorf p And an input doord p To determine the corresponding ratio, representing the multiplication by element:
Figure SMS_51
finally, calculating the output value at the current momento p
Figure SMS_52
wherein ,
Figure SMS_53
representing an output gate.
In the prediction model, firstly, feature extraction is carried out on clustering results of complete historical telephone traffic data and complete telephone traffic influence factor data to obtain feature data, and then a quantile regression function is utilized to solve the feature data to obtain regression coefficient vectors of different quantile points, wherein the quantile regression function is as follows:
Figure SMS_54
wherein ,
Figure SMS_55
representing dependent variablesP p Is the first of (2)τThe number of digits of the individual condition is calculated,τthe range of (1, 0),β(τ) Is a vector of regression coefficients, +.>
Figure SMS_56
Is shown as a characteristicData;
solving regression coefficient vectors of different quantile points to obtain conditional quantile of the different quantile points, wherein a solving formula is as follows:
Figure SMS_57
wherein L is the neural network loss function value,
Figure SMS_58
represented as characteristic data>
Figure SMS_59
Expressed as an asymmetric function:
Figure SMS_60
where s is the argument of the function γ.
Calculating the conditional quantiles by using a non-parameter kernel density estimation function to obtain an interval prediction result, wherein the kernel density estimation function is as follows:
Figure SMS_61
wherein ,V(u a ) For the probability distribution of the traffic on the a-th day to be predicted,u a as an independent variable of the probability function,N a for a similar number of days for the a moments to be predicted,hfor the bandwidth, takeh=0.01,K() As a kernel function, taking the kernel function as a Gaussian function,G P in the case of a vector of samples,G P =[
Figure SMS_62
,
Figure SMS_63
,
Figure SMS_64
…,
Figure SMS_65
]。/>
and sending the obtained interval prediction result to a power grid customer service scheduling system so that the power grid customer service scheduling system performs traffic scheduling according to the interval prediction result.
As an example of the embodiment, taking traffic data of the electric power customer service center 2020, 1 st 2021 nd 12 nd 31 st in a certain city in the south as an example data set, the sampling period is 1 hour; traffic data from 1 st 2020 to 12 nd 20 st 2021 is used as a training set, and traffic data from 21 st 2021 to 31 st 2021 is used as a test set; comparing three methods of similar daily cluster-kernel density estimation (method one), CNN-kernel density estimation (method two) and LSTM-Gaussian (method three) with the method provided by the invention, the CNN network parameters used in the calculation example are as follows: the two convolution layers, the two pooling layers and one full-connection layer are respectively provided with convolution kernels with the size of 2 multiplied by 2, the number of the convolution kernels is respectively 12 and 16, the pooling window of the pooling layers is respectively provided with the size of 2 multiplied by 2, the step length is 12, and the number of neurons of the two full-connection layers is respectively 100 and 120; LSTM parameters are: the number of network layers is 3, and the number of hidden layer nodes is 12.
The reliability and sharpness performance of the prediction result were evaluated by selecting the interval coverage (PICP) and the Prediction Interval Average Width (PIAW), and the results are shown in table 1:
table 1 comparison of the evaluation results of the methods at different confidence intervals
Figure SMS_66
Under the confidence levels of 95%, 90% and 80%, the interval prediction method provided by the invention has higher coverage rate of the prediction interval and higher sensitivity of the prediction interval.
The invention has the following beneficial effects:
obtaining historical telephone traffic data and telephone traffic influencing factor data, supplementing missing values of the historical telephone traffic data and telephone traffic influencing factor data to obtain complete historical telephone traffic data and complete telephone traffic influencing factor data, carrying out similar daily clustering on the complete telephone traffic influencing factor data to obtain a clustering result of the complete telephone traffic influencing factor data, inputting the clustering result of the complete historical telephone traffic data and the complete telephone traffic influencing factor data into a constructed prediction model, carrying out feature extraction on the clustering result of the complete historical telephone traffic data and the complete telephone traffic influencing factor data by the prediction model to obtain feature data, calculating the feature data by a quantile regression model to obtain a conditional quantile, calculating the conditional quantile by a non-parameter kernel density estimation method to obtain a section prediction result, and sending the section prediction result to a grid customer service scheduling system so that the grid customer service scheduling system carries out telephone traffic scheduling according to the section prediction result. According to the method, the telephone traffic probability of the power grid customer service is predicted by combining telephone traffic influencing factors, so that the effective quantification of the uncertainty of the prediction of a modern power supply service system can be realized, and the accuracy of the prediction of the telephone traffic interval of the power grid customer service is improved.
Example two
Correspondingly, referring to fig. 3, fig. 3 is a schematic structural diagram of a system for predicting a traffic interval of a customer service of a power grid, as shown in the drawing, the system for predicting the traffic interval of the customer service of the power grid includes a data acquisition module 301, a clustering module 302, and a prediction interval calculation module 303, wherein specific units of each module are as follows:
the data acquisition module 301 is configured to acquire historical traffic data and traffic influencing factor data, and perform missing value supplementation on the historical traffic data and the traffic influencing factor data to obtain complete historical traffic data and complete traffic influencing factor data;
the clustering module 302 is configured to perform similar daily clustering on complete traffic volume influence factor data to obtain a clustering result of the complete traffic volume influence factor data;
the prediction interval calculation module 303 is configured to input the clustering result of the complete historical traffic data and the complete traffic influencing factor data into a constructed prediction model, so that the prediction model performs feature extraction on the clustering result of the complete historical traffic data and the complete traffic influencing factor data to obtain feature data, then calculates the feature data by adopting a quantile regression model to obtain the conditional quantiles of different quantiles, calculates the conditional quantiles by adopting a non-parametric kernel density estimation method to obtain an interval prediction result, and sends the interval prediction result to the grid customer service scheduling system so that the grid customer service scheduling system performs traffic scheduling according to the interval prediction result.
Alternatively, the data acquisition module 301 includes an approximation calculation unit 3011 and a padding unit 3012,
the approximate value calculation unit 3011 is used for obtaining an approximate value of historical traffic data and an approximate value of traffic volume influence factor data by fitting a polynomial with adjacent normal data;
the filling unit 3012 is configured to supplement the approximate value as a missing value to the historical traffic data and the traffic influencing factor data, so as to obtain complete historical traffic data and complete traffic influencing factor data.
The clustering module 302 comprises a central point selection unit 3021 and a clustering result generation unit 3022,
the central point selecting unit 3021 is configured to normalize the complete traffic volume influencing factor data and then select a plurality of central points;
the clustering result generating unit 3022 is configured to distribute each complete traffic volume influencing factor data to a central point smaller than a preset distance, obtain a plurality of clusters, and recalculate the central point of each cluster as a clustering result of the complete traffic volume influencing factor data.
The above-mentioned system for predicting the traffic interval of the power grid customer service can implement the method for predicting the traffic interval of the power grid customer service in the embodiment of the method. The options in the method embodiments described above are also applicable to this embodiment and will not be described in detail here. The rest of the embodiments of the present application may refer to the content of the method embodiments described above, and in this embodiment, no further description is given.
Compared with the prior art, the method has the advantages that the historical telephone traffic data and the telephone traffic influence factor data are obtained, missing value supplementation is carried out on the historical telephone traffic data and the telephone traffic influence factor data, the complete historical telephone traffic data and the complete telephone traffic influence factor data are obtained, similar daily clustering is carried out on the complete telephone traffic influence factor data to obtain the clustering result of the complete telephone traffic influence factor data, the clustering result of the complete historical telephone traffic data and the clustering result of the complete telephone traffic influence factor data is input into a built prediction model, after feature extraction is carried out on the clustering result of the complete historical telephone traffic data and the complete telephone traffic influence factor data by the prediction model to obtain feature data, the feature data is calculated by adopting a quantile regression model to obtain a conditional quantile, then the conditional quantile is calculated by adopting a non-parameter kernel density estimation method to obtain an interval prediction result, and the interval prediction result is sent to a grid customer service scheduling system, so that the grid customer service scheduling system carries out telephone traffic scheduling according to the interval prediction result. According to the method, the telephone traffic probability of the power grid customer service is predicted by combining telephone traffic influencing factors, so that the effective quantification of the uncertainty of the prediction of a modern power supply service system can be realized, and the accuracy of the prediction of the telephone traffic interval of the power grid customer service is improved.
The foregoing embodiments have been provided for the purpose of illustrating the general principles of the present invention, and are not to be construed as limiting the scope of the invention. It should be noted that any modifications, equivalent substitutions, improvements, etc. made by those skilled in the art without departing from the spirit and principles of the present invention are intended to be included in the scope of the present invention.

Claims (9)

1. The utility model provides a power grid customer service traffic interval prediction method which is characterized by comprising the following steps:
acquiring historical telephone traffic data and telephone traffic influence factor data, and supplementing missing values of the historical telephone traffic data and the telephone traffic influence factor data to obtain complete historical telephone traffic data and complete telephone traffic influence factor data;
performing similar daily clustering on the complete telephone traffic influence factor data to obtain a clustering result of the complete telephone traffic influence factor data;
inputting the clustering results of the complete historical telephone traffic data and the complete telephone traffic influence factor data into a constructed prediction model, so that the prediction model performs feature extraction on the clustering results of the complete historical telephone traffic data and the complete telephone traffic influence factor data to obtain feature data, then calculates the feature data by adopting a quantile regression model to obtain the conditional quantile of different quantile points, calculates the conditional quantile by adopting a non-parameter kernel density estimation method to obtain an interval prediction result, and sends the interval prediction result to a power grid customer service scheduling system so that the power grid customer service scheduling system performs telephone traffic scheduling according to the interval prediction result.
2. The method for predicting a customer service traffic interval of a power grid according to claim 1, wherein the step of supplementing the historical traffic data and the traffic influencing factor data by missing values to obtain complete historical traffic data and complete traffic influencing factor data comprises the following steps:
obtaining an approximation of the historical traffic data and an approximation of the traffic impact factor data by fitting a polynomial with adjacent normal data;
and supplementing the approximate value as a missing value to the historical telephone traffic data and the telephone traffic influence factor data to obtain complete historical telephone traffic data and complete telephone traffic influence factor data.
3. The method for predicting the traffic interval of the customer service of the power grid according to claim 1, wherein the clustering result of the complete traffic influence factor data is obtained by performing similar daily clustering on the complete traffic influence factor data, specifically:
after normalizing the complete telephone traffic influence factor data, selecting a plurality of center points;
and distributing the telephone traffic influence factor data of the complete telephone traffic influence factor data to a central point smaller than a preset distance to obtain a plurality of clusters and recalculate the central point of each cluster to be used as a clustering result of the telephone traffic influence factor data.
4. The method for predicting the traffic interval of the customer service of the power grid according to claim 1, wherein the calculation of the feature data by using a quantile regression model is performed to obtain conditional quantiles of different quantile points, specifically:
solving the characteristic data by using a quantile regression function to obtain regression coefficient vectors of different quantile points, wherein the quantile regression function is as follows:
Figure QLYQS_1
wherein ,
Figure QLYQS_2
representing dependent variablesP p Is the first of (2)τThe number of digits of the individual condition is calculated,τthe range of (1, 0),β(τ) Is a vector of regression coefficients, +.>
Figure QLYQS_3
Represented as feature data;
solving the regression coefficient vector of the different quantile points to obtain the conditional quantile of the different quantile points, wherein the solving formula is as follows:
Figure QLYQS_4
wherein L is the neural network loss function value,
Figure QLYQS_5
represented as characteristic data>
Figure QLYQS_6
Expressed as an asymmetric function: />
Figure QLYQS_7
Where s is the argument of the function γ.
5. The method for predicting the traffic interval of the customer service of the power grid according to claim 1, wherein the calculating the conditional quantile by adopting the non-parametric kernel density estimation method is performed to obtain an interval prediction result, and specifically comprises the following steps:
calculating the conditional quantiles by using a non-parameter kernel density estimation function to obtain an interval prediction result, wherein the kernel density estimation function is as follows:
Figure QLYQS_8
wherein ,V(u a ) For the probability distribution of the traffic on the a-th day to be predicted,u a as an independent variable of the probability function,N a for a similar number of days for the a moments to be predicted,hfor the bandwidth, takeh=0.01,K() As a kernel function, taking the kernel function as a Gaussian function,G P in the case of a vector of samples,G P =[
Figure QLYQS_9
,
Figure QLYQS_10
,
Figure QLYQS_11
…,
Figure QLYQS_12
]。
6. the method for predicting a traffic volume of customer service in a power grid according to claim 1, wherein the following steps are performed
The prediction model is obtained through training, and specifically comprises the following steps:
dividing the clustering result of the complete historical telephone traffic data and the complete telephone traffic influence factor data into a training set and a testing set;
constructing a prediction model, and inputting the clustering result of the complete historical telephone traffic data and the complete telephone traffic influence factor data into the prediction model for calculation to obtain an actual interval prediction result;
and comparing the actual interval prediction result with the expected interval prediction result according to the error function to obtain an error, and updating model parameters according to the error until the training times are reached to obtain a trained prediction model.
7. A prediction system for a traffic interval of power grid customer service is characterized by comprising a data acquisition module, a clustering module and a prediction interval calculation module, wherein,
the data acquisition module is used for acquiring historical telephone traffic data and telephone traffic influence factor data, and supplementing missing values of the historical telephone traffic data and the telephone traffic influence factor data to obtain complete historical telephone traffic data and complete telephone traffic influence factor data;
the clustering module is used for carrying out similar daily clustering on the complete telephone traffic influence factor data to obtain a clustering result of the complete telephone traffic influence factor data;
the prediction interval calculation module is used for inputting the clustering results of the complete historical telephone traffic data and the complete telephone traffic influence factor data into a constructed prediction model, so that the prediction model performs feature extraction on the clustering results of the complete historical telephone traffic data and the complete telephone traffic influence factor data to obtain feature data, then, a quantile regression model is adopted to calculate the feature data to obtain the conditional quantiles of different quantiles, a non-parameter kernel density estimation method is adopted to calculate the conditional quantiles to obtain interval prediction results, and the interval prediction results are sent to a power grid customer service scheduling system so that the power grid customer service scheduling system performs telephone traffic scheduling according to the interval prediction results.
8. The grid customer service traffic interval prediction system according to claim 7, wherein the data acquisition module comprises an approximation calculation unit and a filling unit,
the approximation calculation unit is used for obtaining the approximation of the historical traffic data and the approximation of the traffic volume influence factor data by fitting a polynomial by using adjacent normal data;
and the filling unit is used for supplementing the approximate value as a missing value to the historical telephone traffic data and the telephone traffic influencing factor data to obtain complete historical telephone traffic data and complete telephone traffic influencing factor data.
9. The grid customer service traffic interval prediction system according to claim 7, wherein the clustering module comprises a central point selection unit and a clustering result generation unit,
the central point selecting unit is used for selecting a plurality of central points after normalizing the complete telephone traffic influencing factor data;
the clustering result generating unit is used for distributing the telephone traffic volume influence factor data of the complete telephone traffic volume influence factor data to a central point smaller than a preset distance to obtain a plurality of clusters and recalculate the central point of each cluster to be used as a clustering result of the complete telephone traffic volume influence factor data.
CN202310483798.9A 2023-05-04 2023-05-04 Method and system for predicting traffic interval of power grid customer service Active CN116227738B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310483798.9A CN116227738B (en) 2023-05-04 2023-05-04 Method and system for predicting traffic interval of power grid customer service

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310483798.9A CN116227738B (en) 2023-05-04 2023-05-04 Method and system for predicting traffic interval of power grid customer service

Publications (2)

Publication Number Publication Date
CN116227738A true CN116227738A (en) 2023-06-06
CN116227738B CN116227738B (en) 2023-12-08

Family

ID=86577214

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310483798.9A Active CN116227738B (en) 2023-05-04 2023-05-04 Method and system for predicting traffic interval of power grid customer service

Country Status (1)

Country Link
CN (1) CN116227738B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118014282A (en) * 2024-02-06 2024-05-10 广东联合电子服务股份有限公司 Operator scheduling method and system based on high-speed customer service traffic prediction

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101159976A (en) * 2007-08-24 2008-04-09 中国移动通信集团设计院有限公司 Method and device of predicting telephone traffic and channel configuration
CN103002165A (en) * 2012-11-21 2013-03-27 江苏省电力公司电力科学研究院 Method for predicting short-term telephone traffic of power supply service center
CN105847598A (en) * 2016-04-05 2016-08-10 浙江远传信息技术股份有限公司 Method and device for call center multifactorial telephone traffic prediction
CN106713677A (en) * 2016-05-24 2017-05-24 国家电网公司客户服务中心 Prediction method for incoming call traffic of power client service center
CN109063970A (en) * 2018-07-06 2018-12-21 郑州天迈科技股份有限公司 Two-way automatic scheduling system based on passenger flow simulation analysis
CN109978201A (en) * 2017-12-27 2019-07-05 深圳市景程信息科技有限公司 Probability load prediction system and method based on Gaussian process quantile estimate model
CN113973156A (en) * 2021-12-22 2022-01-25 杭州远传新业科技有限公司 Telephone traffic prediction method and system and telephone traffic prediction device
CN115271041A (en) * 2022-07-25 2022-11-01 国家电网有限公司客户服务中心 Method for predicting telephone traffic of power service

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101159976A (en) * 2007-08-24 2008-04-09 中国移动通信集团设计院有限公司 Method and device of predicting telephone traffic and channel configuration
CN103002165A (en) * 2012-11-21 2013-03-27 江苏省电力公司电力科学研究院 Method for predicting short-term telephone traffic of power supply service center
CN105847598A (en) * 2016-04-05 2016-08-10 浙江远传信息技术股份有限公司 Method and device for call center multifactorial telephone traffic prediction
CN106713677A (en) * 2016-05-24 2017-05-24 国家电网公司客户服务中心 Prediction method for incoming call traffic of power client service center
CN109978201A (en) * 2017-12-27 2019-07-05 深圳市景程信息科技有限公司 Probability load prediction system and method based on Gaussian process quantile estimate model
CN109063970A (en) * 2018-07-06 2018-12-21 郑州天迈科技股份有限公司 Two-way automatic scheduling system based on passenger flow simulation analysis
CN113973156A (en) * 2021-12-22 2022-01-25 杭州远传新业科技有限公司 Telephone traffic prediction method and system and telephone traffic prediction device
CN115271041A (en) * 2022-07-25 2022-11-01 国家电网有限公司客户服务中心 Method for predicting telephone traffic of power service

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
李彬等: "基于深度学习分位数回归模型的风电功率概率密度预测", 《电力自动化设备》, vol. 38, no. 9, pages 15 - 20 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118014282A (en) * 2024-02-06 2024-05-10 广东联合电子服务股份有限公司 Operator scheduling method and system based on high-speed customer service traffic prediction

Also Published As

Publication number Publication date
CN116227738B (en) 2023-12-08

Similar Documents

Publication Publication Date Title
CN109658695B (en) Multi-factor short-term traffic flow prediction method
CN110705743B (en) New energy consumption electric quantity prediction method based on long-term and short-term memory neural network
CN111260136A (en) Building short-term load prediction method based on ARIMA-LSTM combined model
CN113554466B (en) Short-term electricity consumption prediction model construction method, prediction method and device
CN111861013B (en) Power load prediction method and device
CN113837499A (en) Ultra-short-term wind power prediction method and system
CN111008726B (en) Class picture conversion method in power load prediction
CN114792156A (en) Photovoltaic output power prediction method and system based on curve characteristic index clustering
CN113112791A (en) Traffic flow prediction method based on sliding window long-and-short term memory network
CN114357670A (en) Power distribution network power consumption data abnormity early warning method based on BLS and self-encoder
CN115186923A (en) Photovoltaic power generation power prediction method and device and electronic equipment
CN112766600A (en) Urban area crowd flow prediction method and system
CN115204477A (en) Bicycle flow prediction method of context awareness graph recursive network
CN116227738B (en) Method and system for predicting traffic interval of power grid customer service
CN114330934A (en) Model parameter self-adaptive GRU new energy short-term power generation power prediction method
CN114721898A (en) Edge cloud server utilization rate prediction method and device based on boosting algorithm and storage medium
CN114757330A (en) Urban instantaneous water consumption prediction method based on LSTM
Połap et al. Energy consumption prediction model for smart homes via decentralized federated learning with LSTM
CN113868938A (en) Short-term load probability density prediction method, device and system based on quantile regression
CN113033110A (en) Important area personnel emergency evacuation system and method based on traffic flow model
CN110807508B (en) Bus peak load prediction method considering complex weather influence
CN116050595A (en) Attention mechanism and decomposition mechanism coupled runoff amount prediction method
CN115238854A (en) Short-term load prediction method based on TCN-LSTM-AM
CN114897264A (en) Photovoltaic output interval prediction method under small sample scene based on transfer learning
CN117408394B (en) Carbon emission factor prediction method and device for electric power system and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant