[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN114118580A - Yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis - Google Patents

Yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis Download PDF

Info

Publication number
CN114118580A
CN114118580A CN202111429817.7A CN202111429817A CN114118580A CN 114118580 A CN114118580 A CN 114118580A CN 202111429817 A CN202111429817 A CN 202111429817A CN 114118580 A CN114118580 A CN 114118580A
Authority
CN
China
Prior art keywords
data
enterprise
pollution
enterprises
monitoring
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202111429817.7A
Other languages
Chinese (zh)
Other versions
CN114118580B (en
Inventor
马传国
隋敬麒
马春玲
孙晨鑫
常露
孙宏君
武鹏飞
张华�
管朔
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dongying Power Industry Bureau Of State Grid Shandong Electric Power Co
Original Assignee
Dongying Power Industry Bureau Of State Grid Shandong Electric Power Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dongying Power Industry Bureau Of State Grid Shandong Electric Power Co filed Critical Dongying Power Industry Bureau Of State Grid Shandong Electric Power Co
Priority to CN202111429817.7A priority Critical patent/CN114118580B/en
Publication of CN114118580A publication Critical patent/CN114118580A/en
Application granted granted Critical
Publication of CN114118580B publication Critical patent/CN114118580B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q10/06393Score-carding, benchmarking or key performance indicator [KPI] analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Human Resources & Organizations (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Economics (AREA)
  • General Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • Health & Medical Sciences (AREA)
  • Marketing (AREA)
  • General Health & Medical Sciences (AREA)
  • General Business, Economics & Management (AREA)
  • Development Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Educational Administration (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Operations Research (AREA)
  • Primary Health Care (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Game Theory and Decision Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Public Health (AREA)
  • Water Supply & Treatment (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention provides a yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis, which is characterized in that data fitting of correlation degree between power consumption data and environmental protection index data is carried out to obtain correlation mapping between the power consumption data and the environmental protection index data of different enterprises; establishing enterprise images of enterprises in a sample set from two dimensions of environmental protection index data and power utilization characteristic data, and realizing enterprise sample pollution discharge monitoring data completion based on the enterprise images and the combination of actual sampling point data of the enterprises and actual power utilization load data of the enterprises; based on the power load prediction values of the enterprises in the short term of the enterprise, the short-term pollution discharge prediction data of the enterprises in the region is obtained by combining the power consumption data of the enterprises and the environmental protection index data correlation mapping and is used as the input of the drainage basin dynamic monitoring and early warning model, so that the short-term environmental indexes in the region are predicted, and the environmental protection early warning and monitoring are realized.

Description

Yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis
Technical Field
The invention relates to a pollution monitoring and early warning method, in particular to a method for monitoring and early warning a pollution source of a river basin to which a river belongs by fusing electric power and environment-friendly data.
Background
The problems that the real-time pollution discharge monitoring data of the current enterprise is lack and the early warning of regional environmental protection indexes is difficult to realize are solved.
Because the pollution control of enterprises faces the dilemma of multiple points, wide area, long line and the like, and the treatment procedures of industrial wastewater and waste gas are complicated and high in cost, part of heavily polluted enterprises only consider the benefits of the enterprises, and the phenomenon of non-obedient discharge exists.
Existing heavily polluted enterprises are often distributed in middle-east areas with developed economy and numerous industries, such as textile, printing and dyeing, wine making, medicines, building materials and the like, and are often aggregated locally; meanwhile, the monitoring data based on the environmental pollution emission monitoring station is statistical data obtained by mixing various factors, so that the tracing of the pollution source is difficult to effectively realize.
With the promotion of national electric power full coverage and ubiquitous electric power internet of things construction, electric power is essential energy for enterprise production activities, and can timely and accurately reflect the production conditions and equipment use conditions of enterprises, so that current relevant organizations realize enterprise pollution discharge monitoring by monitoring the power consumption of key enterprises, however, only part of the enterprises currently have real-time pollution discharge monitoring data, most of the enterprises (especially medium and small enterprises) only have partial available sampling point monitoring data, sample data is lack of data, and enterprise pollution discharge monitoring is difficult to realize, so that regional environment monitoring and early warning are difficult to realize.
Currently, related mechanisms realize enterprise pollution discharge monitoring by monitoring power consumption of key enterprises, for example, a document [ a monitoring method for illegal production of polluted enterprises based on power utilization law [ J ] electric power big data, 2019,22(08):35-39 ], is to set a power utilization threshold value based on the power utilization law of the polluted enterprises to study and judge illegal production, and the method can realize monitoring alarm of production stopping enterprises and shutdown alarm of environmental protection equipment, but because of production power utilization data interference in practical use, the misjudgment rate is higher; an environmental impact prediction model based on industrial power consumption is constructed by a regression analysis method in a document [ design of a special action scheme for pollution prevention and control of key enterprises based on electric power big data mining [ J ] for power supply and utilization, 2021,38(04):28-36 ], so that monitoring and early warning on atmospheric pollution prevention and control of key enterprises are realized, but follow-up impact of enterprise production on atmospheric pollution cannot be dynamically evaluated.
The method shows that the electricity utilization data can be used as an effective basis for monitoring the pollution discharge of enterprises, but depends on a linear data processing process, on one hand, the equipment, regulation and control constraint conditions of various enterprises and the obtained monitoring data indexes are different, and on the other hand, the linear training model has a cross-enterprise deviation problem in practical application, so that the pollution discharge monitoring and early warning effect is not ideal; on the other hand, the methods do not excavate the potential relevance of various factors in different time and space, short-term pollution discharge data prediction cannot be realized based on currently available data, and the delayed pollution monitoring result often leads to that the environmental management can only be used for sheep death and reinforcement.
Disclosure of Invention
A yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis comprises the following steps:
A. acquiring various types of acquired data, wherein the acquired data comprises: the method comprises the following steps of acquiring electric energy data, equipment operation data, pollution discharge monitoring data, environmental protection index data, meteorological data of an area where the enterprise is located, water area monitoring data, atmosphere monitoring data and holiday data of the enterprise with high energy consumption and high pollution;
summarizing and preprocessing the acquired data to obtain available enterprise sample data;
B. adopting multi-label classification to enterprise sample data to form an enterprise classification training sample set, wherein the labels comprise: industry labels, production scale labels, emission pollutants labels;
C. based on the same type of enterprise sample data, performing relevance analysis on the environmental protection index data and the electric energy acquisition data of the enterprise to obtain the relevance characteristics of the power utilization characteristics of the enterprise and the environmental protection index, and setting a threshold range of the environmental protection index;
D. selecting a plurality of enterprises with real-time pollution discharge monitoring data to set as typical high-pollution enterprises, acquiring environment-friendly index data of the typical high-pollution enterprises, and combining the environment-friendly index data of each typical high-pollution enterprise with respective power load curves to form load cluster data;
according to the load clustering data of each typical high-pollution enterprise, dividing the operation state of each typical high-pollution enterprise into: a full load operating state, an under load operating state and a shutdown state;
for an enterprise with real-time sewage discharge monitoring data, establishing a power utilization-environment protection index association atlas;
E. based on all enterprise classification training sample set data, constructing enterprise figures of all enterprises in a sample set from two dimensions of environment-friendly index data and enterprise electricity utilization characteristics;
F. for enterprises without pollution discharge real-time monitoring data, extracting sample characteristic data from enterprise figures of the same type of enterprises to serve as input data, calculating similarity by adopting a Pearson correlation coefficient, and taking existing enterprises with the largest similarity as similar enterprises of the enterprises without pollution discharge real-time monitoring data;
G. for an enterprise with pollution discharge real-time monitoring data, constructing a short-term power load forecasting model constructed by adopting an LSTM algorithm based on power load characteristic data of a period of history, and outputting a short-term power load forecasting value of the enterprise;
for enterprises without pollution discharge real-time monitoring data, extracting power load characteristic data of a period of history of similar enterprises, constructing a short-term power load prediction model constructed by adopting an LSTM algorithm, and outputting the short-term power load prediction value of the enterprises;
H. judging whether an enterprise has a power utilization-environment protection index correlation atlas, and if the enterprise has the power utilization-environment protection index correlation atlas, matching the power utilization load predicted value of the enterprise in a short period with the power utilization-environment protection index correlation atlas to obtain short-period pollution discharge prediction data; if the enterprise does not have the power utilization-environmental protection index correlation atlas, the power utilization-environmental protection index correlation atlas of a similar enterprise of the enterprise is combined based on the scaling factor, and then short-term pollution discharge prediction data is obtained;
comparing the short-term pollution discharge prediction data with an environmental protection index threshold range;
I. dividing river sections, peripheral atmosphere monitoring stations and high-pollution enterprises to which river watersheds belong according to distribution of an actual Geographic Information System (GIS), and creating a watershed dynamic monitoring area for each river of the watersheds;
J. the method comprises the steps of constructing a basin dynamic monitoring early warning model according to historical sewage discharge data of high-pollution enterprises in a basin dynamic monitoring area, and predicting an environmental protection index in the basin dynamic monitoring area based on acquired short-term sewage discharge prediction data, so that an environmental protection department is assisted to make preventive treatment measures in advance.
Preferably, the electric energy collection data includes: power consumer type, consumer area, consumer industry, power supply address, voltage class, transformer capacity, tariff class, voltage, current, active power, reactive power, power factor, harmonics, solar energy, maximum demand.
Preferably, the emission monitoring data includes: exhaust gas temperature of industrial exhaust gas, relative humidity, air flow rate, fresh air volume, total suspended particle boiler soot, industrial furnace soot, ringelmann's blackness, inhalable particulate matter, ammonium chromate, fluoride, hydrogen chloride, sulfuric acid mist, carbon disulfide, carbon monoxide, nitrogen dioxide, nitrogen oxides, ozone, sulfur dioxide, hydrogen sulfide, hydrogen cyanide, chlorine, phenolic compounds, catering oil fume, anilines, formaldehyde, benzene series, benzene, toluene, xylene, styrene, Total Volatile Organic Compounds (TVOC), contents of methanol, acetone, total hydrocarbons, acrylonitrile, acrolein, non-methane total hydrocarbons, acetaldehyde, vinyl chloride, nitrobenzene, methane;
water temperature, odor, conductivity, transparency, pH, total salt content, color, turbidity, suspended matter, acidity, alkalinity, hexavalent chromium, total mercury, copper, zinc, lead, cadmium, nickel, iron, manganese, beryllium, total chromium, potassium, sodium, calcium, magnesium, total hardness, total arsenic, selenium, barium, molybdenum, cobalt, dissolved oxygen, ammonia nitrogen, nitrite nitrogen, nitrate nitrogen, sulfate, total nitrogen, total phosphorus, chloride, fluoride, total cyanide, sulfide, permanganate index, biochemical oxygen demand, chemical oxygen demand, volatile phenol, petroleum, animal and vegetable oils, anionic surfactants, benzene, toluene, ethylbenzene, p-xylene, o-xylene, m-xylene, styrene content of industrial wastewater.
Preferably, the regional meteorological data includes: the daily value data of the elements of air pressure, air temperature, precipitation, evaporation capacity, relative humidity, wind direction and wind speed, sunshine hours and 0cm ground temperature.
Preferably, the water area monitoring data includes: odor, water temperature, turbidity, pH, conductivity, dissolved solids, suspended solids, total nitrogen, Total Organic Carbon (TOC) Dissolved Oxygen (DO), Biochemical Oxygen Demand (BOD), Chemical Oxygen Demand (COD), total bacteria count, coliform content.
Preferably, the summarizing and preprocessing the acquired data includes: the method comprises the steps of data cleaning, data conversion, data normalization standardization and abnormal value deletion, so that available sample data is obtained, and the accuracy of the available sample data is improved.
Preferably, the industry label is: fossil power, steel, cement, petrochemical, chemical industry, non-ferrous metal metallurgy and others, the production scale label is: super-huge, large-sized, medium-sized, small-sized and micro-sized.
More preferably, the enterprises with the same industry label, the same production scale label and the same pollutant emission label belong to the same type of enterprise.
Preferably, the relevance analysis is performed on the environmental protection index data and the electric energy collection data of the enterprise in the following way:
C1. selecting the same type of enterprise sample data as a type of sample data set A; the type data set comprises N power utilization characteristics and M environmental protection index characteristics; sDThe selected power utilization characteristic set is initially an empty set; sHAn environment-friendly index feature set is selected, and an empty set is initially selected; fDThe method comprises the steps that a power characteristic set to be selected initially comprises N characteristics, and the N characteristics of the power characteristic set to be selected are dynamic variables changing along with time; fHThe method comprises the steps that an environment-friendly feature set to be selected initially comprises M features;
C2. calculating a candidate power feature set FDAny one of feature items F ini DAnd candidate environment-friendly feature set FHAny one of the features of
Figure BDA0003379717270000052
Inter-information between
Figure BDA0003379717270000053
The calculation formula is as follows:
Figure BDA0003379717270000054
wherein, the characteristic item Fi DAnd
Figure BDA0003379717270000056
(i 1,2, …, N, j1, 2, …, M) has a probability density distribution of
Figure BDA0003379717270000057
And
Figure BDA0003379717270000058
in order to be a discrete joint probability density,
Figure BDA0003379717270000059
is Fi DThe value of the characteristic at the time instant t,
Figure BDA00033797172700000511
is Fi DThe differential value from time t-1 to time t,
Figure BDA00033797172700000513
is composed of
Figure BDA00033797172700000514
The value of the characteristic at the time instant t,
Figure BDA00033797172700000515
is composed of
Figure BDA00033797172700000516
The difference value from t-1 to t;
because of the feature term Fi DAnd
Figure BDA00033797172700000518
the correlation degree changes at the same time, so that the difference value of the two at the same time is selected for correlation analysis and mutual information
Figure BDA00033797172700000519
Representing the degree of association between the two;
C3. setting an environmental index threshold D if
Figure BDA00033797172700000520
Then the feature item F is selectedi DAnd
Figure BDA00033797172700000522
repeating the steps C2 and C3 for the associated features until the feature set F is traversedDAnd FHCharacteristic items of interior, consisting ofThe selected power utilization characteristic set S is obtainedDAnd selecting an environmental feature set SHAnd associated feature set
Figure BDA00033797172700000523
More preferably, when similar enterprises are selected for the enterprises without pollution discharge real-time monitoring data, the formula for calculating the maximum similarity is as follows:
Figure BDA0003379717270000061
wherein, A is an enterprise without a power-environment protection index correlation diagram, and B is an enterprise with a power-environment protection index correlation diagram set; i isA(FD,FH) The relevance of the associated features included in enterprise a,
Figure BDA0003379717270000062
correlation characteristic correlation degree mean value, I, included for enterprise AB(FD,FH) The relevance of the associated features included by enterprise B,
Figure BDA0003379717270000063
is the average of the relevance of the associated features contained in enterprise B,
Figure BDA0003379717270000064
feature item F for Enterprise Ai DAnd
Figure BDA0003379717270000066
the value of the degree of association of (c),
Figure BDA0003379717270000067
feature item F for Enterprise Bi DAnd
Figure BDA0003379717270000069
cov is covariance and σ is standard deviation.
More preferably, the short-term power load forecasting model and the short-term power load forecasting value of the enterprise are constructed and obtained in the following manner:
G1. the temperature data, the data of holidays and the characteristic data of the electrical loads of enterprises are taken as an input characteristic set x of LSTM model data, an input layer is 100 neurons,
the temperature data, the holiday data and the enterprise power load characteristic data at the time t are input values xtThe predicted output value of the electrical load of the last neural unit is ht+1The two inputs have corresponding weights, and a value of 0-1 is obtained under the action of sigmoid activation, namely three gate values, and the calculation formula is as follows:
it=σ(Wixt+Uiht-1)
ft=σ(Wfxt+Ufht-1)
ot=σ(Woxt+Uoht-1)
at time t, the input gate output value is itThe output value of the forgetting gate is ftThe output value of the output gate is ot,Wi、Ui、Wf、Uf、Wo、UoRespectively corresponding parameters of each door;
input value xtAnd the output h of the previous cellt+1The two values having a corresponding weight WcAnd UcAnd using the tanh activation function, the output value is equivalent to obtaining a new memory value obtained based on the input value at the time t
Figure BDA00033797172700000610
The calculation formula is as follows:
Figure BDA00033797172700000611
calculating and acquiring final memory value c of the neural unitt
Figure BDA0003379717270000071
ftOutput value of forgetting gate at time t, itThe output value of the input gate at time t, ct-1The final memory value of the last neuron at the time t-1;
final output value htIs composed of
Figure BDA0003379717270000072
The hidden layer adopts a 3-layer structure, the output layer is one-dimensional output, and the output data is the predicted value h of the short-term enterprise power loadt
G2. Based on historical actual power load data of the enterprise at the time t and the power load predicted value h of the short-term enterprisetPerforming cross entropy operation, performing reverse feedback as a loss function, adjusting each parameter value of the neuron, and fixing each parameter value when the loss function is smaller than a set threshold value to complete the construction of a short-term power load prediction model;
G3. taking temperature data, holiday data and power load characteristic data of an enterprise at the current moment as input, and obtaining a short-term power load predicted value of the enterprise;
more preferably, the scaling factor d is calculated as follows:
Figure BDA0003379717270000073
wherein,
Figure BDA0003379717270000074
f for Enterprise Bi DThe value of the characteristic at the time instant t,
Figure BDA0003379717270000076
for enterprise B
Figure BDA0003379717270000077
At time tThe value of the characteristic is set to be,
Figure BDA0003379717270000078
f for Enterprise Ai DThe value of the characteristic at the time instant t,
Figure BDA00033797172700000710
for enterprise A
Figure BDA00033797172700000711
The characteristic value at time t, ∑ is
Figure BDA00033797172700000712
The sum of all actual sampling points of the enterprise A;
and for the enterprises without pollution discharge real-time monitoring data, multiplying the scaling factor d by the short-term pollution discharge prediction data of the similar enterprises to obtain the short-term pollution discharge prediction data of the enterprises.
More preferably, the construction process of the watershed dynamic monitoring and early warning model comprises the following steps:
J1. acquiring historical pollution discharge fitting data of high-pollution enterprises in the dynamic drainage basin monitoring area based on historical power utilization data of the high-pollution enterprises in the dynamic drainage basin monitoring area and historical short-term pollution discharge prediction data of the enterprises;
J2. taking a high-pollution enterprise sample in a river basin dynamic monitoring area as a river basin dynamic monitoring early warning model training sample (x)kH '), K ∈ (1,2, L, K), h' is pollution prediction data, wherein x is an enterprise sample, the total number of enterprises is K, and each sample data comprises historical pollution discharge data/historical pollution discharge prediction data, historical meteorological data and historical holiday data;
J3. the model is defined by adopting an LSTM algorithm structure and various parameters as shown in the step G, single-layer hidden neurons are adopted, and short-term water pollution/atmospheric pollution prediction data of a drainage basin dynamic monitoring area are output;
J4. based on a gradient descent strategy, the parameters are adjusted in the direction of the negative gradient of the target by combining the diffusion weight from the pollution monitoring point to the pollution discharge enterprise,
Figure BDA0003379717270000081
Figure BDA0003379717270000082
in the formula,
Figure BDA0003379717270000083
is a predicted value of pollution, h ', output at time t'tThe actual pollution value at the time t, alpha is the diffusion weight from the pollution monitoring point to the pollution discharge enterprise, and the normalization definition is carried out based on the distance and the number of the enterprises;
J5. iteratively feeding back and optimizing neuron parameters until the neuron parameters are accumulated within a self-defined time period EtIf the epsilon is less than or equal to the expected accumulated error, the training of the basin dynamic monitoring early warning model is finished;
the acquired short-term pollution discharge prediction data is combined with the acquired meteorological data and holiday data to serve as input data of the basin dynamic monitoring early warning model, and environmental protection index prediction in a basin dynamic monitoring area is output, so that an environmental protection department is assisted to make preventive treatment measures in advance
More preferably, in the step J1, if there is a small portion of real sample detection data, the historical short-term pollution discharge prediction data is corrected, and the difference data is used as feedback to update the scaling factor d in the step H, so as to improve the accuracy of the historical pollution discharge fitting data of the highly polluted enterprise.
The technical scheme provided by the invention solves the following technical problems:
1. the problem of data fitting of the correlation degree between the power utilization data and the environmental protection index data is solved. Because the correlation degree exists between the power consumption data and the environmental protection index data, but the correlation degree is changed by various factors such as time, space, environment and the like, the dynamic correlation degree is difficult to describe by a linear relation, and the sample data of the high-pollution enterprise with real-time pollution discharge data monitoring is less, so that the power consumption-environmental protection index correlation atlas of the enterprise is constructed based on the real-time monitoring data of part of available typical high-pollution enterprises, and the correlation mapping between the power consumption data and the environmental protection index data of different enterprises is obtained;
2. the problem of blowdown enterprise real-time supervision sample volume be few, only have partial sampling point data is solved. Establishing enterprise figures of enterprises in a sample set from two dimensions of environmental protection index data and power utilization characteristic data, calculating a Pearson correlation coefficient between the enterprises based on the enterprise figures, acquiring an enterprise power utilization-environmental protection index correlation map set of similar enterprises for the enterprises without real-time monitoring data, combining actual sampling point data of the enterprises with actual power utilization load data of the enterprises, acquiring pollution discharge fitting data of all enterprise samples based on a scaling factor, and realizing the completion of the pollution discharge monitoring data of the enterprise samples;
3. the problem of hysteresis quality of environmental protection early warning is solved. Based on the power load prediction values of the enterprises in the short term, the power consumption data of the enterprises in the area and the environmental protection index data are combined to be mapped in an associated mode, the short-term pollution discharge prediction data of the enterprises in the area are obtained and used as the input of a drainage basin dynamic monitoring and early warning model, so that the short-term environmental indexes in the area are predicted, and the environmental protection early warning and monitoring are realized;
4. the comprehensive monitoring of the pollution discharge data of the high-pollution enterprise is realized. A drainage basin dynamic monitoring area is established based on each river, and environmental protection index prediction in the drainage basin dynamic monitoring area is achieved from a plurality of angles of atmosphere, land infiltration, water sources, upstream and downstream influences and the like through two aspects of atmospheric pollution monitoring and water pollution monitoring and early warning.
The method is based on high real-time electric power data, combines environmental protection monitoring index data, meteorological data and holiday data, monitors and pre-warns the enterprises with the important pollution sources in the yellow river basin, on one hand, potential relation between the pollution discharge and the electricity consumption data of the enterprises is mined, on the basis of multi-label classification, enterprise figures and Pearson correlation coefficient algorithm, migration application of correlation mapping between the electricity consumption and the environmental protection indexes of the enterprises among similar enterprises is realized, LSTM algorithm and electricity consumption-environmental protection index correlation atlas matching is adopted, short-term pollution discharge prediction data of each enterprise is realized, on the other hand, the dynamic pre-warning and traceability analysis of the environmental protection indexes of the yellow river basin are realized by combining enterprise dynamic pollution discharge data with geospatial data based on fine granularity, joint defense joint control of pollution prevention and control of the important enterprises is strengthened, and cooperative control of atmosphere and water source pollution of the yellow river basin is realized.
Drawings
FIG. 1 is a flow chart of the present invention
FIG. 2 is a diagram of a neuron construction process for constructing a short-term electrical load prediction model according to the present invention
FIG. 3 is a flow chart of the construction of the basin dynamic monitoring and early warning model of the present invention
Detailed Description
As shown in fig. 1, the method for monitoring and early warning the pollution source in the yellow river basin based on the fusion analysis of power-environmental protection data, provided by the invention, comprises the following steps:
A. the method comprises the following steps of extracting various electric energy acquisition data of high-energy-consumption and high-pollution enterprises, operating data of the enterprises in operation, pollution discharge monitoring data of the enterprises and corresponding archive data, acquiring meteorological data, water area monitoring data and holiday data of the areas where the enterprises are located, summarizing the data, and finishing acquisition of available enterprise sample data, wherein the specifically acquired data comprises the following contents:
performing data preprocessing on the acquired collected data, including data cleaning, data conversion, data normalization standardization and abnormal value deletion, so as to acquire available sample data;
the data collected for each type are summarized in the following table:
Figure BDA0003379717270000101
Figure BDA0003379717270000111
B. based on sample data of each enterprise, performing multi-label classification according to industry, production scale and pollutant emission in sequence to form an enterprise classification training sample set;
wherein, the industry is according to thermoelectricity, steel, cement, petrifaction, chemical industry, non-ferrous metal smelting and the like; the production scale is divided into super-large size, medium size, small size and micro size; labeling and classifying the discharged pollutants according to the discharged pollution monitoring indexes of each enterprise; the same industry and the same scale and the same label of the discharged pollutants are divided into the same category;
C. performing relevance analysis on the environmental protection index data and the enterprise power utilization data based on the same type of enterprise sample data to obtain relevant power utilization characteristics and environmental protection index data and obtain a relevant index threshold range;
C1. based on sample data of a certain same type of enterprise as a sample data set A of the type, the sample data set of the type comprises N power utilization characteristics and M environmental protection index characteristics, SDFor selecting the power consumption feature set, the power consumption feature set is initially an empty set SHFor selecting the environmental characteristic set, an empty set is initially selected, FDInitially containing N characteristics as dynamic variable changing with time, non-static class data, FHThe method comprises the steps that an environment-friendly feature set to be selected initially comprises M features;
C2. respectively calculating a set of candidate power characteristics FDAny one of feature items F ini DAnd candidate environment-friendly feature set FHAny one of the features of
Figure BDA0003379717270000113
Inter-information between
Figure BDA0003379717270000114
The calculation formula is as follows:
Figure BDA0003379717270000115
wherein, the characteristic item Fi DAnd
Figure BDA0003379717270000122
(i 1,2, …, N, j1, 2, …, M) has a probability density distribution of
Figure BDA0003379717270000123
And
Figure BDA0003379717270000124
in order to be a discrete joint probability density,
Figure BDA0003379717270000125
is Fi DThe value of the characteristic at the time instant t,
Figure BDA0003379717270000127
is Fi DThe differential value from time t-1 to time t,
Figure BDA0003379717270000129
is composed of
Figure BDA00033797172700001210
The value of the characteristic at the time instant t,
Figure BDA00033797172700001211
is composed of
Figure BDA00033797172700001212
The difference value from t-1 to t;
because of the feature term Fi DAnd
Figure BDA00033797172700001214
the correlation degree changes at the same time, so that the difference value of the two at the same time is selected for correlation analysis and mutual information
Figure BDA00033797172700001215
Representing the degree of association between the two;
C3. set a threshold value D, if
Figure BDA00033797172700001216
Then the feature item F is selectedi DAnd
Figure BDA00033797172700001218
repeating the steps C2 and C3 for the associated features until the feature set F is traversedDAnd FHCharacteristic items in the system, thereby obtaining the selected power consumptionFeature set SDAnd selecting an environmental feature set SHAnd associated feature set
Figure BDA00033797172700001219
D. The method is characterized in that the same enterprise operation state is divided into a full load working state, an under load working state and a shutdown state based on part of typical high-pollution enterprise environmental protection index data with real-time pollution discharge monitoring data and combined with the same enterprise power utilization load curve clustering data, wherein the under load state can be divided into a plurality of types based on the enterprise load clustering condition. According to the associated feature set
Figure BDA00033797172700001220
Drawing associated features Fi DAnd
Figure BDA00033797172700001222
the enterprise power utilization-environmental protection index association atlas at the same time;
E. c, training sample set data of all enterprise classifications, and acquiring a power utilization characteristic set S based on the step CDSelecting an environmental feature set SHAnd associated feature set
Figure BDA00033797172700001223
Establishing enterprise figures of enterprises in a sample set from two associated dimensions of environmental protection index data and electricity utilization characteristic data;
F. in the same multi-label classification sample set, aiming at an enterprise A without pollution discharge real-time monitoring data, taking sample characteristic data of an enterprise image as input, adopting a Pearson correlation coefficient to carry out similarity calculation, and taking an enterprise B of an existing enterprise electricity utilization-environmental protection index association atlas with the maximum similarity as a similar enterprise of the enterprise without pollution discharge real-time monitoring data;
defining an enterprise A and an enterprise B, wherein the enterprise A does not have an enterprise electricity-environment protection index correlation diagram, the enterprise B already has an electricity-environment protection index correlation diagram set, and the maximum similarity calculation formula of the enterprise A and the enterprise B is as follows:
Figure BDA0003379717270000131
wherein, IA(FD,FH) The relevance of the associated features included in enterprise a,
Figure BDA0003379717270000132
correlation characteristic correlation degree mean value, I, included for enterprise AB(FD,FH) The relevance of the associated features included by enterprise B,
Figure BDA0003379717270000133
is the average of the relevance of the associated features contained in enterprise B,
Figure BDA0003379717270000134
feature item F for Enterprise Ai DAnd
Figure BDA0003379717270000136
the value of the degree of association of (c),
Figure BDA0003379717270000137
feature item F for Enterprise Bi DAnd
Figure BDA0003379717270000139
cov is covariance, σ is standard deviation;
G. based on the obtained historical power load characteristic data of the enterprises, a short-term power load forecasting model is constructed by adopting an LSTM algorithm, and power load forecasting values of the enterprises in the short term are output;
g1 takes temperature data, holiday data and enterprise electricity load characteristic data as LSTM model data input feature set x, the input layer is 100 neurons, and the construction process of each neuron is shown in FIG. 2.
The temperature data, the holiday data and the enterprise power load characteristic data at the time t are input values xtThe predicted output value of the electrical load of the last neural unit is ht+1Both inputs having corresponding weightsAnd obtaining a value of 0-1 under the action of sigmoid activation, namely three gate values, wherein the calculation formula is as follows:
it=σ(Wixt+Uiht-1)
ft=σ(Wfxt+Ufht-1)
ot=σ(Woxt+Uoht-1)
at time t, the input gate output value is itThe output value of the forgetting gate is ftThe output value of the output gate is ot,Wi、Ui、Wf、Uf、Wo、UoRespectively corresponding parameters of each door;
input value xtAnd the output h of the previous cellt+1The two values having a corresponding weight WcAnd UcAnd using the tanh activation function, the output value is equivalent to obtaining a new memory value obtained based on the input value at the time t
Figure BDA00033797172700001310
The calculation formula is as follows:
Figure BDA00033797172700001311
calculating and acquiring final memory value c of the neural unitt
Figure BDA0003379717270000141
ftOutput value of forgetting gate at time t, itThe output value of the input gate at time t, ct-1The final memory value of the last neuron at the time t-1;
final output value htIs composed of
Figure BDA0003379717270000142
The hidden layer adopts a 3-layer structure, the output layer is one-dimensional output, and the output data is the predicted value h of the short-term enterprise power loadt
G2 is based on historical actual electric load data of the enterprise at the time t and a short-term enterprise electric load predicted value htPerforming cross entropy operation, performing reverse feedback as a loss function, adjusting each parameter value of the neuron, and fixing each parameter value when the loss function is smaller than a set threshold value to complete the construction of a short-term power load prediction model;
g3, taking temperature data, holiday data and enterprise power load characteristic data of the enterprise at the current moment as input, and obtaining a short-term power load predicted value of the enterprise;
H. if the enterprise has the power utilization-environmental protection index correlation atlas, matching is carried out based on the short-term power utilization load predicted value of the enterprise to obtain short-term pollution discharge predicted data; if the enterprise does not have the power utilization-environmental protection index correlation atlas, the short-term pollution discharge prediction data of the enterprise is obtained by multiplying the scaling factor d by the short-term pollution discharge prediction data of the similar enterprise based on the scaling factor and the power utilization-environmental protection index correlation atlas of the similar enterprise obtained in the step F, and the calculation formula of the scaling factor d is as follows:
Figure BDA0003379717270000143
wherein,
Figure BDA0003379717270000144
f for Enterprise Bi DThe value of the characteristic at the time instant t,
Figure BDA0003379717270000146
for enterprise B
Figure BDA0003379717270000147
The value of the characteristic at the time instant t,
Figure BDA0003379717270000148
f for Enterprise Ai DThe value of the characteristic at the time instant t,
Figure BDA00033797172700001410
for enterprise A
Figure BDA00033797172700001411
The characteristic value at time t, ∑ is
Figure BDA00033797172700001412
The sum of all the actual sampling points of enterprise a within.
And comparing the short-term pollution discharge prediction data with the environmental protection index threshold range.
I. Dividing river sections, peripheral atmosphere monitoring stations and high-pollution enterprises to which river basins belong according to actual GIS distribution, and creating a basin dynamic monitoring area for each river of the basin;
J. aiming at historical pollution discharge data of high-pollution enterprises in a dynamic drainage basin monitoring area, a dynamic drainage basin monitoring and early warning model is respectively constructed aiming at two aspects of atmospheric pollution and water pollution, and the model construction process is shown in figure 3 and comprises the following steps:
j1 obtaining historical short-term pollution discharge prediction data of the enterprise according to the step H based on historical power utilization data of the high-pollution enterprise in the dynamic drainage basin monitoring area, correcting the historical short-term pollution discharge prediction data if a small part of sample real detection data exists, and updating the scaling factor d of the step H by using the difference data as feedback, thereby obtaining historical pollution discharge fitting data of the high-pollution enterprise in the dynamic drainage basin monitoring area;
j2 takes high-pollution enterprise samples in the river basin dynamic monitoring area as river basin dynamic monitoring early warning model training samples (x)kH '), K ∈ (1,2, L, K), h' is pollution prediction data, wherein x is an enterprise sample, the total number of enterprises is K, and each sample data comprises historical pollution discharge data/historical pollution discharge prediction data, historical meteorological data and historical holiday data;
j3, as shown in the step G, the model is defined by adopting an LSTM algorithm structure and various parameters, single-layer hidden neurons are adopted, and short-term water pollution/atmospheric pollution prediction data of the drainage basin dynamic monitoring area are output;
j4 based on gradient descent strategy, combined with diffusion weight from pollution monitoring point to pollution discharge enterprise, adjusting parameters in negative gradient direction of target,
Figure BDA0003379717270000151
Figure BDA0003379717270000152
in the formula,
Figure BDA0003379717270000153
is a predicted value of pollution, h ', output at time t'tThe actual pollution value at the time t, alpha is the diffusion weight from the pollution monitoring point to the pollution discharge enterprise, and the normalization definition is carried out based on the distance and the number of the enterprises;
j5 iterative feedback optimization of neuron parameters until E is accumulated within a self-defined time periodtIf the epsilon is less than or equal to the expected accumulated error, the training of the basin dynamic monitoring early warning model is finished;
and (D) outputting environmental protection index prediction in the watershed dynamic monitoring area by taking the short-term pollution discharge prediction data acquired based on the step H and combining the acquired meteorological data and holiday data as input data of the watershed dynamic monitoring early warning model, thereby assisting an environmental protection department to make preventive treatment measures in advance.

Claims (13)

1. A yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis is characterized by comprising the following steps:
A. acquiring various types of acquired data, wherein the acquired data comprises: the method comprises the following steps of acquiring electric energy data, equipment operation data, pollution discharge monitoring data, environmental protection index data, meteorological data of an area where the enterprise is located, water area monitoring data, atmosphere monitoring data and holiday data of the enterprise with high energy consumption and high pollution;
summarizing and preprocessing the acquired data to obtain available enterprise sample data;
B. adopting multi-label classification to enterprise sample data to form an enterprise classification training sample set, wherein the labels comprise: industry labels, production scale labels, emission pollutants labels;
C. based on the same type of enterprise sample data, performing relevance analysis on the environmental protection index data and the electric energy acquisition data of the enterprise to obtain the relevance characteristics of the power utilization characteristics of the enterprise and the environmental protection index, and setting a threshold range of the environmental protection index;
D. selecting a plurality of enterprises with real-time pollution discharge monitoring data to set as typical high-pollution enterprises, acquiring environment-friendly index data of the typical high-pollution enterprises, and combining the environment-friendly index data of each typical high-pollution enterprise with respective power load curves to form load cluster data;
according to the load clustering data of each typical high-pollution enterprise, dividing the operation state of each typical high-pollution enterprise into: a full load operating state, an under load operating state and a shutdown state;
for an enterprise with real-time sewage discharge monitoring data, establishing a power utilization-environment protection index association atlas;
E. based on all enterprise classification training sample set data, constructing enterprise figures of all enterprises in a sample set from two dimensions of environment-friendly index data and enterprise electricity utilization characteristics;
F. for enterprises without pollution discharge real-time monitoring data, extracting sample characteristic data from enterprise figures of the same type of enterprises to serve as input data, calculating similarity by adopting a Pearson correlation coefficient, and taking existing enterprises with the largest similarity as similar enterprises of the enterprises without pollution discharge real-time monitoring data;
G. for an enterprise with pollution discharge real-time monitoring data, constructing a short-term power load forecasting model constructed by adopting an LSTM algorithm based on power load characteristic data of a period of history, and outputting a short-term power load forecasting value of the enterprise;
for enterprises without pollution discharge real-time monitoring data, extracting power load characteristic data of a period of history of similar enterprises, constructing a short-term power load prediction model constructed by adopting an LSTM algorithm, and outputting the short-term power load prediction value of the enterprises;
H. judging whether an enterprise has a power utilization-environment protection index correlation atlas, and if the enterprise has the power utilization-environment protection index correlation atlas, matching the power utilization load predicted value of the enterprise in a short period with the power utilization-environment protection index correlation atlas to obtain short-period pollution discharge prediction data; if the enterprise does not have the power utilization-environmental protection index correlation atlas, the power utilization-environmental protection index correlation atlas of a similar enterprise of the enterprise is combined based on the scaling factor, and then short-term pollution discharge prediction data is obtained;
comparing the short-term pollution discharge prediction data with an environmental protection index threshold range;
I. dividing river sections, peripheral atmosphere monitoring stations and high-pollution enterprises to which river watersheds belong according to distribution of an actual Geographic Information System (GIS), and creating a watershed dynamic monitoring area for each river of the watersheds;
J. the method comprises the steps of constructing a basin dynamic monitoring early warning model according to historical sewage discharge data of high-pollution enterprises in a basin dynamic monitoring area, and predicting an environmental protection index in the basin dynamic monitoring area based on acquired short-term sewage discharge prediction data, so that an environmental protection department is assisted to make preventive treatment measures in advance.
2. The yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis of claim 1, wherein the electric energy collection data comprises: power consumer type, consumer area, consumer industry, power supply address, voltage class, transformer capacity, tariff class, voltage, current, active power, reactive power, power factor, harmonics, solar energy, maximum demand.
3. The yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis according to claim 1, wherein the pollution discharge monitoring data comprises: exhaust gas temperature of industrial exhaust gas, relative humidity, air flow rate, fresh air volume, total suspended particle boiler soot, industrial furnace soot, ringelmann's blackness, inhalable particulate matter, ammonium chromate, fluoride, hydrogen chloride, sulfuric acid mist, carbon disulfide, carbon monoxide, nitrogen dioxide, nitrogen oxides, ozone, sulfur dioxide, hydrogen sulfide, hydrogen cyanide, chlorine, phenolic compounds, catering oil fume, anilines, formaldehyde, benzene series, benzene, toluene, xylene, styrene, Total Volatile Organic Compounds (TVOC), contents of methanol, acetone, total hydrocarbons, acrylonitrile, acrolein, non-methane total hydrocarbons, acetaldehyde, vinyl chloride, nitrobenzene, methane;
water temperature, odor, conductivity, transparency, pH, total salt content, color, turbidity, suspended matter, acidity, alkalinity, hexavalent chromium, total mercury, copper, zinc, lead, cadmium, nickel, iron, manganese, beryllium, total chromium, potassium, sodium, calcium, magnesium, total hardness, total arsenic, selenium, barium, molybdenum, cobalt, dissolved oxygen, ammonia nitrogen, nitrite nitrogen, nitrate nitrogen, sulfate, total nitrogen, total phosphorus, chloride, fluoride, total cyanide, sulfide, permanganate index, biochemical oxygen demand, chemical oxygen demand, volatile phenol, petroleum, animal and vegetable oils, anionic surfactants, benzene, toluene, ethylbenzene, p-xylene, o-xylene, m-xylene, styrene content of industrial wastewater.
4. The yellow river basin pollution source monitoring and early warning method based on power-environmental protection data fusion analysis of claim 1, wherein the regional meteorological data comprises: the daily value data of the elements of air pressure, air temperature, precipitation, evaporation capacity, relative humidity, wind direction and wind speed, sunshine hours and 0cm ground temperature.
5. The method for monitoring and early warning pollution sources in the yellow river basin based on electric power-environmental protection data fusion analysis of claim 1, wherein the water area monitoring data comprises: odor, water temperature, turbidity, pH, conductivity, dissolved solids, suspended solids, total nitrogen, Total Organic Carbon (TOC) Dissolved Oxygen (DO), Biochemical Oxygen Demand (BOD), Chemical Oxygen Demand (COD), total bacteria count, coliform content.
6. The yellow river basin pollution source monitoring and early warning method based on power-environmental protection data fusion analysis according to claim 1, wherein the summarizing and preprocessing of the collected data comprises: the method comprises the steps of data cleaning, data conversion, data normalization standardization and abnormal value deletion, so that available sample data is obtained, and the accuracy of the available sample data is improved.
7. The yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis according to claim 1, wherein the industry label is as follows: fossil power, steel, cement, petrochemical, chemical industry, non-ferrous metal metallurgy and others, the production scale label is: super-huge, large-sized, medium-sized, small-sized and micro-sized.
8. The yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis according to claim 7, wherein enterprises with the same industry label, the same production scale label and the same pollutant emission label belong to the same type of enterprise.
9. The yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis according to any one of claims 1 to 8, characterized in that relevance analysis is performed on environmental protection index data and electric energy collection data of enterprises in the following way:
C1. selecting the same type of enterprise sample data as a type of sample data set A; the type data set comprises N power utilization characteristics and M environmental protection index characteristics; sDThe selected power utilization characteristic set is initially an empty set; sHAn environment-friendly index feature set is selected, and an empty set is initially selected; fDThe method comprises the steps that a power characteristic set to be selected initially comprises N characteristics, and the N characteristics of the power characteristic set to be selected are dynamic variables changing along with time; fHThe method comprises the steps that an environment-friendly feature set to be selected initially comprises M features;
C2. calculating a candidate power feature set FDAny one of feature items F ini DAnd candidate environment-friendly feature set FHIn (1)A characteristic item
Figure FDA00033797172600000510
Inter-information between
Figure FDA00033797172600000511
The calculation formula is as follows:
Figure FDA0003379717260000051
wherein, the characteristic item Fi DAnd
Figure FDA00033797172600000512
has a probability density distribution of
Figure FDA0003379717260000052
And
Figure FDA0003379717260000053
Figure FDA0003379717260000054
in order to be a discrete joint probability density,
Figure FDA0003379717260000055
is Fi DThe value of the characteristic at the time instant t,
Figure FDA0003379717260000056
is Fi DThe differential value from time t-1 to time t,
Figure FDA0003379717260000057
is composed of
Figure FDA00033797172600000513
The value of the characteristic at the time instant t,
Figure FDA0003379717260000058
is composed of
Figure FDA00033797172600000514
The difference value from t-1 to t;
because of the feature term Fi DAnd
Figure FDA00033797172600000515
the correlation degree changes at the same time, so that the difference value of the two at the same time is selected for correlation analysis and mutual information
Figure FDA0003379717260000059
Representing the degree of association between the two;
C3. setting an environmental index threshold D if
Figure FDA00033797172600000516
Then the feature item F is selectedi DAnd
Figure FDA00033797172600000517
repeating the steps C2 and C3 for the associated features until the feature set F is traversedDAnd FHThe feature items in the system are obtained, and a selected electricity utilization feature set S is obtainedDAnd selecting an environmental feature set SHAnd associated feature set
Figure FDA00033797172600000518
10. The method for monitoring and warning the pollution sources in the yellow river basin based on the power-environment-friendly data fusion analysis as claimed in claim 9, wherein when similar enterprises are selected for enterprises without pollution discharge real-time monitoring data, the formula for calculating the maximum similarity is as follows:
Figure FDA0003379717260000061
wherein, A is an enterprise without a power-environment protection index correlation diagram, and B is an enterprise with a power-environment protection index correlation diagram set; i isA(FD,FH) The relevance of the associated features included in enterprise a,
Figure FDA0003379717260000062
correlation characteristic correlation degree mean value, I, included for enterprise AB(FD,FH) The relevance of the associated features included by enterprise B,
Figure FDA0003379717260000063
is the average of the relevance of the associated features contained in enterprise B,
Figure FDA0003379717260000064
feature item F for Enterprise Ai DAnd
Figure FDA0003379717260000067
the value of the degree of association of (c),
Figure FDA0003379717260000066
feature item F for Enterprise Bi DAnd
Figure FDA0003379717260000065
cov is covariance and σ is standard deviation.
11. The yellow river basin pollution source monitoring and early warning method based on power-environmental protection data fusion analysis according to claim 10, wherein the short-term power load forecasting model and the short-term power load forecasting value of an enterprise are constructed and obtained in the following way:
G1. the temperature data, the data of holidays and the characteristic data of the electrical loads of enterprises are taken as an input characteristic set x of LSTM model data, an input layer is 100 neurons,
air temperature data, holiday data, enterprise at time tThe characteristic data of the industrial electrical load is an input value xtThe predicted output value of the electrical load of the last neural unit is ht+1The two inputs have corresponding weights, and a value of 0-1 is obtained under the action of sigmoid activation, namely three gate values, and the calculation formula is as follows:
it=σ(Wixt+Uiht-1)
ft=σ(Wfxt+Ufht-1)
ot=σ(Woxt+Uoht-1)
at time t, the input gate output value is itThe output value of the forgetting gate is ftThe output value of the output gate is ot,Wi、Ui、Wf、Uf、Wo、UoRespectively corresponding parameters of each door;
input value xtAnd the output h of the previous cellt-1The two values having a corresponding weight WcAnd UcAnd using the tanh activation function, the output value is equivalent to obtaining a new memory value obtained based on the input value at the time t
Figure FDA0003379717260000071
The calculation formula is as follows:
Figure FDA0003379717260000072
calculating and acquiring final memory value c of the neural unitt
Figure FDA0003379717260000073
ftOutput value of forgetting gate at time t, itThe output value of the input gate at time t, ct-1The final memory value of the last neuron at the time t-1;
final output value htIs composed of
Figure FDA0003379717260000074
The hidden layer adopts a 3-layer structure, the output layer is one-dimensional output, and the output data is the predicted value h of the short-term enterprise power loadt
G2. Based on historical actual power load data of the enterprise at the time t and the power load predicted value h of the short-term enterprisetPerforming cross entropy operation, performing reverse feedback as a loss function, adjusting each parameter value of the neuron, and fixing each parameter value when the loss function is smaller than a set threshold value to complete the construction of a short-term power load prediction model;
G3. taking temperature data, holiday data and power load characteristic data of an enterprise at the current moment as input, and obtaining a short-term power load predicted value of the enterprise;
more preferably, the scaling factor d is calculated as follows:
Figure FDA0003379717260000075
wherein,
Figure FDA0003379717260000081
f for Enterprise Bi DThe value of the characteristic at the time instant t,
Figure FDA0003379717260000082
for enterprise B
Figure FDA0003379717260000088
The value of the characteristic at the time instant t,
Figure FDA0003379717260000083
f for Enterprise Ai DThe value of the characteristic at the time instant t,
Figure FDA0003379717260000084
for enterprise A
Figure FDA0003379717260000089
The characteristic value at time t, ∑ is
Figure FDA0003379717260000085
The sum of all actual sampling points of the enterprise A;
and for the enterprises without pollution discharge real-time monitoring data, multiplying the scaling factor d by the short-term pollution discharge prediction data of the similar enterprises to obtain the short-term pollution discharge prediction data of the enterprises.
12. The yellow river basin pollution source monitoring and early warning method of electric power-environmental protection data fusion analysis of claim 11, wherein the basin dynamic monitoring and early warning model building process is as follows:
J1. acquiring historical pollution discharge fitting data of high-pollution enterprises in the dynamic drainage basin monitoring area based on historical power utilization data of the high-pollution enterprises in the dynamic drainage basin monitoring area and historical short-term pollution discharge prediction data of the enterprises;
J2. taking a high-pollution enterprise sample in a river basin dynamic monitoring area as a river basin dynamic monitoring early warning model training sample (x)kH '), K ∈ (1,2, L, K), h' is pollution prediction data, wherein x is an enterprise sample, the total number of enterprises is K, and each sample data comprises historical pollution discharge data/historical pollution discharge prediction data, historical meteorological data and historical holiday data;
J3. the model is defined by adopting an LSTM algorithm structure and various parameters as shown in the step G, single-layer hidden neurons are adopted, and short-term water pollution/atmospheric pollution prediction data of a drainage basin dynamic monitoring area are output;
J4. based on a gradient descent strategy, the parameters are adjusted in the direction of the negative gradient of the target by combining the diffusion weight from the pollution monitoring point to the pollution discharge enterprise,
Figure FDA0003379717260000086
Figure FDA0003379717260000087
in the formula,
Figure FDA0003379717260000091
is a predicted value of pollution, h ', output at time t'tThe actual pollution value at the time t, alpha is the diffusion weight from the pollution monitoring point to the pollution discharge enterprise, and the normalization definition is carried out based on the distance and the number of the enterprises;
J5. iteratively feeding back and optimizing neuron parameters until the neuron parameters are accumulated within a self-defined time period EtIf the epsilon is less than or equal to the expected accumulated error, the training of the basin dynamic monitoring early warning model is finished;
the acquired short-term pollution discharge prediction data is combined with the acquired meteorological data and holiday data to serve as input data of the basin dynamic monitoring early warning model, and environmental protection index prediction in a basin dynamic monitoring area is output, so that an environmental protection department is assisted to make preventive treatment measures in advance
13. The yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis according to claim 12, wherein in the step J1, if a small portion of sample real detection data exists, historical short-term pollution discharge prediction data is corrected, and the difference data is used as feedback to update the scaling factor d of the step H, so that the accuracy of historical pollution discharge fitting data of highly polluted enterprises is improved.
CN202111429817.7A 2021-11-29 2021-11-29 Yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis Active CN114118580B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111429817.7A CN114118580B (en) 2021-11-29 2021-11-29 Yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111429817.7A CN114118580B (en) 2021-11-29 2021-11-29 Yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis

Publications (2)

Publication Number Publication Date
CN114118580A true CN114118580A (en) 2022-03-01
CN114118580B CN114118580B (en) 2023-04-07

Family

ID=80371114

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111429817.7A Active CN114118580B (en) 2021-11-29 2021-11-29 Yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis

Country Status (1)

Country Link
CN (1) CN114118580B (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114819773A (en) * 2022-06-29 2022-07-29 天津大学 Enterprise green production management and control method and system based on big data
CN114997310A (en) * 2022-06-02 2022-09-02 河南省开封生态环境监测中心 Environment-friendly monitoring data processing method and system
CN115471145A (en) * 2022-11-15 2022-12-13 碳管家智能云平台有限公司 Enterprise energy consumption double-control management method, device and medium
CN115689396A (en) * 2022-12-30 2023-02-03 天津友美环保科技有限公司 Pollutant discharge control method, device, equipment and medium
CN115860590A (en) * 2023-03-02 2023-03-28 广东慧航天唯科技有限公司 Intelligent analysis early warning method and system for enterprise emission pollution data
CN115908082A (en) * 2023-01-06 2023-04-04 佰聆数据股份有限公司 Enterprise pollution discharge monitoring method and device based on electricity utilization characteristic indexes
CN116434857A (en) * 2023-03-24 2023-07-14 青岛思普润水处理股份有限公司 Intelligent dosing setting method of sewage treatment system
CN116976733A (en) * 2023-07-31 2023-10-31 青岛熙正数字科技有限公司 Atmospheric pollution source on-line monitoring data evaluation system based on big data
CN117335416A (en) * 2023-11-24 2024-01-02 国网浙江省电力有限公司 Method, device, equipment and storage medium for optimizing power load
CN117455124A (en) * 2023-12-25 2024-01-26 杭州烛微智能科技有限责任公司 Environment-friendly equipment monitoring method, system, medium and electronic equipment for enterprises
CN117829614A (en) * 2024-03-06 2024-04-05 四川国蓝中天环境科技集团有限公司 Industrial enterprise pollution discharge risk classification calculation method based on multi-source data fusion
CN118396193A (en) * 2024-06-27 2024-07-26 国网山东省电力公司营销服务中心(计量中心) Transformer reverse heavy/overload early warning method and system based on neural network
CN118569890A (en) * 2024-07-29 2024-08-30 张家港保税区达安进口汽车检验有限公司 Whole car operating mode tail gas emission test system
CN118710327A (en) * 2024-08-29 2024-09-27 北京市生态环境监测中心 Enterprise gas-related pollution discharge link production behavior prediction method based on equipment power consumption

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106651200A (en) * 2016-12-29 2017-05-10 中国西电电气股份有限公司 Electrical load management method and system for industrial enterprise aggregate user
CN109242169A (en) * 2018-08-28 2019-01-18 贵州电网有限责任公司 A kind of short-term load forecasting method towards large power customers
CN110991700A (en) * 2019-11-08 2020-04-10 北京博望华科科技有限公司 Weather and electricity utilization correlation prediction method and device based on deep learning improvement
AU2020104000A4 (en) * 2020-12-10 2021-02-18 Guangxi University Short-term Load Forecasting Method Based on TCN and IPSO-LSSVM Combined Model
CN113298422A (en) * 2021-06-16 2021-08-24 福建亿力优能电力科技有限公司 Pollution source enterprise illegal production monitoring method based on electricity consumption data

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106651200A (en) * 2016-12-29 2017-05-10 中国西电电气股份有限公司 Electrical load management method and system for industrial enterprise aggregate user
CN109242169A (en) * 2018-08-28 2019-01-18 贵州电网有限责任公司 A kind of short-term load forecasting method towards large power customers
CN110991700A (en) * 2019-11-08 2020-04-10 北京博望华科科技有限公司 Weather and electricity utilization correlation prediction method and device based on deep learning improvement
AU2020104000A4 (en) * 2020-12-10 2021-02-18 Guangxi University Short-term Load Forecasting Method Based on TCN and IPSO-LSSVM Combined Model
CN113298422A (en) * 2021-06-16 2021-08-24 福建亿力优能电力科技有限公司 Pollution source enterprise illegal production monitoring method based on electricity consumption data

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
林琳等: "基于空气质量指数与k近邻理论的居民用电负荷超短期预测研究", 《电子测量技术》 *
郑一新等: "基于GIS的工业水污染调查研究――以滇池流域为例", 《环境保护科学》 *

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114997310A (en) * 2022-06-02 2022-09-02 河南省开封生态环境监测中心 Environment-friendly monitoring data processing method and system
CN114997310B (en) * 2022-06-02 2024-11-05 河南省开封生态环境监测中心 Environment-friendly monitoring data processing method and system
CN114819773A (en) * 2022-06-29 2022-07-29 天津大学 Enterprise green production management and control method and system based on big data
CN115471145A (en) * 2022-11-15 2022-12-13 碳管家智能云平台有限公司 Enterprise energy consumption double-control management method, device and medium
CN115471145B (en) * 2022-11-15 2024-06-04 碳管家智能云平台有限公司 Dual-control management method, device and medium for enterprise energy consumption
CN115689396A (en) * 2022-12-30 2023-02-03 天津友美环保科技有限公司 Pollutant discharge control method, device, equipment and medium
CN115689396B (en) * 2022-12-30 2023-08-08 天津友美环保科技有限公司 Pollutant emission control method, device, equipment and medium
CN115908082A (en) * 2023-01-06 2023-04-04 佰聆数据股份有限公司 Enterprise pollution discharge monitoring method and device based on electricity utilization characteristic indexes
CN115860590A (en) * 2023-03-02 2023-03-28 广东慧航天唯科技有限公司 Intelligent analysis early warning method and system for enterprise emission pollution data
CN115860590B (en) * 2023-03-02 2023-04-28 广东慧航天唯科技有限公司 Intelligent analysis and early warning method and system for enterprise emission pollution data
CN116434857B (en) * 2023-03-24 2024-03-15 青岛思普润水处理股份有限公司 Intelligent dosing setting method of sewage treatment system
CN116434857A (en) * 2023-03-24 2023-07-14 青岛思普润水处理股份有限公司 Intelligent dosing setting method of sewage treatment system
CN116976733B (en) * 2023-07-31 2024-04-02 青岛熙正数字科技有限公司 Atmospheric pollution source on-line monitoring data evaluation system based on big data
CN116976733A (en) * 2023-07-31 2023-10-31 青岛熙正数字科技有限公司 Atmospheric pollution source on-line monitoring data evaluation system based on big data
CN117335416B (en) * 2023-11-24 2024-03-01 国网浙江省电力有限公司 Method, device, equipment and storage medium for optimizing power load
CN117335416A (en) * 2023-11-24 2024-01-02 国网浙江省电力有限公司 Method, device, equipment and storage medium for optimizing power load
CN117455124B (en) * 2023-12-25 2024-03-08 杭州烛微智能科技有限责任公司 Environment-friendly equipment monitoring method, system, medium and electronic equipment for enterprises
CN117455124A (en) * 2023-12-25 2024-01-26 杭州烛微智能科技有限责任公司 Environment-friendly equipment monitoring method, system, medium and electronic equipment for enterprises
CN117829614A (en) * 2024-03-06 2024-04-05 四川国蓝中天环境科技集团有限公司 Industrial enterprise pollution discharge risk classification calculation method based on multi-source data fusion
CN117829614B (en) * 2024-03-06 2024-05-07 四川国蓝中天环境科技集团有限公司 Industrial enterprise pollution discharge risk classification calculation method based on multi-source data fusion
CN118396193A (en) * 2024-06-27 2024-07-26 国网山东省电力公司营销服务中心(计量中心) Transformer reverse heavy/overload early warning method and system based on neural network
CN118569890A (en) * 2024-07-29 2024-08-30 张家港保税区达安进口汽车检验有限公司 Whole car operating mode tail gas emission test system
CN118710327A (en) * 2024-08-29 2024-09-27 北京市生态环境监测中心 Enterprise gas-related pollution discharge link production behavior prediction method based on equipment power consumption

Also Published As

Publication number Publication date
CN114118580B (en) 2023-04-07

Similar Documents

Publication Publication Date Title
CN114118580B (en) Yellow river basin pollution source monitoring and early warning method based on electric power-environmental protection data fusion analysis
Ye et al. Tackling environmental challenges in pollution controls using artificial intelligence: A review
Bagherzadeh et al. Prediction of energy consumption and evaluation of affecting factors in a full-scale WWTP using a machine learning approach
Singh et al. Artificial intelligence and machine learning-based monitoring and design of biological wastewater treatment systems
Ihsanullah et al. Recent advances in applications of artificial intelligence in solid waste management: A review
Farhi et al. Prediction of wastewater treatment quality using LSTM neural network
Cheng et al. Forecasting of wastewater treatment plant key features using deep learning-based models: A case study
Farzin et al. Introducing a framework for modeling of drug electrochemical removal from wastewater based on data mining algorithms, scatter interpolation method, and multi criteria decision analysis (DID)
Haiyan Assessment and prediction of overall environmental quality of Zhuzhou City, Hunan Province, China
Abbasi et al. Monthly and seasonal modeling of municipal waste generation using radial basis function neural network
CN112465206B (en) Water resource optimization configuration and optimization result evaluation feedback method
CN113313399A (en) Underground water pollution tracing method based on characteristic pollutant source analysis
Yang et al. Cloud-model-based feature engineering to analyze the energy–water nexus of a full-scale wastewater treatment plant
Yang et al. Cloud model driven assessment of interregional water ecological carrying capacity and analysis of its spatial-temporal collaborative relation
Aldaghi et al. The evaluation of wastewater treatment plant performance: A data mining approach
Sheik et al. Machine learning-based design and monitoring of algae blooms: Recent trends and future perspectives–A short review
Narayanan et al. Artificial intelligence driven advances in wastewater treatment: Evaluating techniques for sustainability and efficacy in global facilities
Hou et al. Integrated machine learning methods with oversampling technique for regional suitability prediction of waste-to-energy incineration projects
Nejsum et al. Population dynamics of Ascaris suum in trickle-infected pigs
Valente et al. Fuzzy modelling of acid mine drainage environments using geochemical, ecological and mineralogical indicators
Olatunji et al. Evolutionary optimization of biogas production from food, fruit, and vegetable (FFV) waste
Abba et al. Bio-communal wastewater treatment plant real-time modeling using an intelligent meta-heuristic approach: A sustainable and green ecosystem
Liu et al. An air quality grade forecasting approach based on ensemble learning
Lu et al. Feature fusion improves performance and interpretability of machine learning models in identifying soil pollution of potentially contaminated sites
Wang et al. Artificial intelligence algorithm application in wastewater treatment plants: Case study for COD load prediction

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant