[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN109739646A - A kind of data processing method and device - Google Patents

A kind of data processing method and device Download PDF

Info

Publication number
CN109739646A
CN109739646A CN201811621057.8A CN201811621057A CN109739646A CN 109739646 A CN109739646 A CN 109739646A CN 201811621057 A CN201811621057 A CN 201811621057A CN 109739646 A CN109739646 A CN 109739646A
Authority
CN
China
Prior art keywords
data
data cell
cell
period
access equipment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811621057.8A
Other languages
Chinese (zh)
Inventor
徐越
潘登
郑钧元
龙春竹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NSFOCUS Information Technology Co Ltd
Beijing NSFocus Information Security Technology Co Ltd
Original Assignee
NSFOCUS Information Technology Co Ltd
Beijing NSFocus Information Security Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NSFOCUS Information Technology Co Ltd, Beijing NSFocus Information Security Technology Co Ltd filed Critical NSFOCUS Information Technology Co Ltd
Priority to CN201811621057.8A priority Critical patent/CN109739646A/en
Publication of CN109739646A publication Critical patent/CN109739646A/en
Pending legal-status Critical Current

Links

Landscapes

  • Memory System Of A Hierarchy Structure (AREA)

Abstract

The invention discloses a kind of data processing method and devices.This method comprises: data access equipment obtains at least one data cell that the central processor CPU of the data access equipment is handled within the period 1;The data access equipment determines the cold and hot data target value of each data cell at least one described data cell;The data access equipment is by the cold and hot maximum T data cell of data target value at least one described data cell as first set;For the first data cell at least one described data cell, the data access equipment is if it is determined that first data cell is respectively positioned on the first set in continuous N number of period before the period 1, and be not stored in the cpu cache, then by first data cell storage into the cpu cache;First data cell is any data unit at least one described data cell.

Description

A kind of data processing method and device
Technical field
The present invention relates to data processing field more particularly to a kind of data processing method and devices.
Background technique
In data storage, searching system, there are two types of common datas, wherein dsc data is in following a period of time, it will The data cell frequently handled by central processing unit (CPU, Central Processing Unit);Cold data is in future one In the section time, the data cell that will not be frequently handled by CPU, general data amount is larger, such as historical data.Dsc data is preferential It is put into caching, the swapping in and out that can undoubtedly reduce data in caching operates, and then improves the performance and efficiency of CPU processing, needs Identify cold and hot data.
Therefore, how determining cold and hot data and improving data processing accuracy rate is a urgent problem to be solved.
Summary of the invention
The embodiment of the present application provides a kind of data processing method and device, solve how to determine in the prior art it is cold and hot The problem of data and raising data processing accuracy rate.
The embodiment of the present invention provides a kind of data processing method, this method comprises:
The central processor CPU that data access equipment obtains the data access equipment is handled at least within the period 1 One data cell;
The data access equipment determines the cold and hot data target of each data cell at least one described data cell Value;The attribute information value of the data cell determines according to the cold and hot data target value of each data cell;The attribute letter Breath includes following one or more: the access of the predicted value, data cell of the auto regressive moving average arma modeling of data cell Spatial cache size needed for frequency, the storage access price of data cell and data cell;
The data access equipment cold and hot maximum T data sheet of data target value at least one data cell by described in Member is used as first set;
The data access equipment deposits the data cell for meeting the first Rule of judgment at least one described data cell In the caching for storing up the CPU;First Rule of judgment is continuous N number of period of the data cell before the period 1 It is inside respectively positioned on the first set, and is not stored in the caching of the CPU, the N is the integer greater than 0.
Optionally, the data access equipment determines the cold and hot number of each data cell at least one described data cell According to index value, comprising:
For each data cell at least one described data cell, the data access equipment determines the data cell Attribute information value each single item and the corresponding weighted value of this;
The data access equipment by each single item of the attribute information value of the data cell and the corresponding weighted value of this, Substitute into the operation result of preset formula, the cold and hot data target value as the data cell.
Optionally, the data access equipment determines each single item of the attribute information value of the data cell, comprising:
If the attribute information value of the data cell includes the storage access price of data cell, the data access equipment According to the data cell within the period 1, accessed number and duration accessed every time determine the data cell Storage access price.
Optionally, the data access equipment determines each single item of the attribute information value of the data cell, comprising:
If the attribute information value of the data cell includes the visiting frequency of the data cell, the data access equipment root The number and the duration of the period 1 being accessed within the period 1 according to the data cell, determine the data cell The visiting frequency of the data cell.
Optionally, the data access equipment will meet the data of the second Rule of judgment at least one described data cell Unit release;Second Rule of judgment is not located in the first set for data cell, and before the period 1 Continuous N number of period in be not located in the first set.
The embodiment of the present invention provides a kind of data processing equipment, which includes:
Obtain module, at least one data cell handled within the period 1 for obtaining central processor CPU;
Processing module, for determining the cold and hot data target value of each data cell at least one described data cell; The attribute information value of the data cell determines according to the cold and hot data target value of each data cell;The attribute information packet It includes following one or more: the predicted value of the auto regressive moving average arma modeling of data cell, the data access equipment Spatial cache needed for the visiting frequency of central processing unit data cell, the storage access price of data cell and data cell is big It is small;The data cell for meeting the first Rule of judgment at least one described data cell is stored into the caching of the CPU;Institute Stating the first Rule of judgment is that data cell is respectively positioned on the first set in continuous N number of period before the period 1, And be not stored in the caching of the CPU, the N is the integer greater than 0.
Optionally, the processing module, is specifically used for:
For each data cell at least one described data cell, the every of the attribute information value of the data cell is determined One and the corresponding weighted value of this;And by each single item of the attribute information value of the data cell and the corresponding power of this Weight values substitute into the operation result of preset formula, the cold and hot data target value as the data cell.
Optionally, the processing module, is specifically used for:
If the attribute information value of the data cell includes the storage access price of data cell, existed according to the data cell In the period 1, accessed number and duration accessed every time determine the storage access price of the data cell.
Optionally, the processing module, is specifically used for:
If the attribute information value of the data cell includes the visiting frequency of the data cell, according to the data cell in institute The number being accessed in the period 1 and the duration of the period 1 are stated, determines the visit of the data cell of the data cell Ask frequency.
Optionally, the processing module is also used to that the second Rule of judgment will be met at least one described data cell Data cell release;Second Rule of judgment is not located in the first set for data cell, and in the period 1 It is not located in the first set in continuous N number of period before.
The embodiment of the present invention according to the attribute information value of each data cell at least one of, whether determine the data cell , by CPU processing but not in the data cell of cpu cache, to ensure that accuracy in N number of period;If so, data access equipment will The data cell is stored to cpu cache, so that the cold and hot biggish data cell of data target value is preferentially processed, is improved slow Hit rate and hot spot data access efficiency are deposited, disposed of in its entirety performance is improved.
Detailed description of the invention
Fig. 1 is a kind of architecture diagram for data processing method respective modules that the embodiment of the present invention proposes;
Fig. 2 is the knot of the cold and hot data identifier internal module in a kind of data processing method that the embodiment of the present invention proposes Composition;
Fig. 3 is a kind of step flow chart of data processing method provided in an embodiment of the present invention;
Fig. 4 is a kind of execution flow chart of data processing method provided in an embodiment of the present invention;
Fig. 5 is a kind of specific execution flow chart of data processing method provided in an embodiment of the present invention;
Fig. 6 is a kind of structural schematic diagram of data processing equipment provided in an embodiment of the present invention.
Specific embodiment
In order to better understand the above technical scheme, below in conjunction with Figure of description and specific embodiment to above-mentioned Technical solution is described in detail, it should be understood that the specific features in the embodiment of the present application and embodiment are to the application skill The detailed description of art scheme, rather than the restriction to technical scheme, in the absence of conflict, the embodiment of the present application And the technical characteristic in embodiment can be combined with each other.
In data-storage system, there are two types of common datas, wherein dsc data is in following a period of time, it will by The data cell that central processor (CPU, Central Processing Unit) is frequently handled;Cold data is when one section following In, the data cell that will not be frequently handled by CPU, general data amount is larger, such as historical data.
If cooperating caching mechanism on the basis of to cold and hot data, the process performance of data-storage system can be optimized.Tool Body caching mechanism is, using memory cache dsc data, dsc data to be preferentially put into caching, can undoubtedly reduce data changing in caching Enter the operation that swaps out, and then improves the performance and efficiency of CPU processing.Accordingly, it is determined that cold and hot data are for meaning in distributed system It is extremely great, for example, the distributed cache system of computing engines (spark).
The embodiment of the present invention proposes a kind of data processing method.As shown in Figure 1, the one kind proposed for the embodiment of the present invention The architecture diagram of data processing method respective modules.For example, inquiry request is structured query language (structured Query language, SQL) inquiry request.It should be noted that the distribution that the corresponding distributed system of Fig. 1 is spark is slow Deposit system, but only illustrate the framework of each module of realization data processing method as example, it is not limited to other and has used the present invention A kind of framework of the data processing method proposed in embodiment.
Inquiry request resolver 101, for the query statement of middle rank, high-level programming language to be resolved to machine language.
Logic plan generator 102 is instructed for generating corresponding logical operation according to machine language.
Logic planning optimizer 103, for optimizing the logical operation generated instruction.
Physics plan generator 104, instruction morphing for the logical operation after optimizing is machine operational order, machine behaviour It include the physical address that CPU needs to handle data in instructing.
Cold and hot data identifier 105, CPU needs to handle the cold data and dsc data of data for identification, real for the present invention Apply the corresponding specific module of a method.
Cache manager 106 caches for CPU management, determines cold data, the swapping in and out of dsc data.
Operations actuator 107, for executing machine operational order to the data in caching.
Raw storage section 108, for storing all data of machine operational order needs.
Data cached memory block 109, for storing the data for currently needing to operate.
Below with reference to Fig. 2, the internal module of cold and hot data identifier is introduced, as shown in Fig. 2, proposing for the embodiment of the present invention A kind of data processing method in cold and hot data identifier internal module structure chart.In Fig. 2, history table 1051 is looked into Ask the internal module that processing module 1052, data identification module 1053 and caching record sheet 1054 are data identifier 105.
History table 1051, for recording processing condition data.
Query processing module 1052, for receiving the machine operational order of the transmission of physics plan generator 104, according to operation The operation note of actuator 107 identifies real data disposition, and more new engine operational order;It is also used to extraction machine behaviour Following information in instructing: data table name (including database name), column, querying condition (optional) are identified as data cell, and CPU handles the processing history of each data cell in record certain period of time.Processing history includes: data cell history processing Number is counted with configurable time granularity according to time series;The data cell processing time per treatment, to locate every time The reason returned data time subtracts the calculating of processing time started;The memory space of the buffering data units occupies.
Data identification module 1053 periodically carries out the data cell handled by CPU according to the processing history of record The determination of cold and hot data, and notify cache manager 106 to update caching, while saving caching record after update.
Caching record table 1054, for recording cpu cache information.
As shown in figure 3, being a kind of step flow chart of data processing method provided in an embodiment of the present invention.
Step 301: data access equipment obtains the central processor CPU of the data access equipment within the period 1 At least one data cell of reason.
Step 302: each data cell is cold and hot at least one determining described data cell of the data access equipment Data target value.
Step 303: the data access equipment cold and hot maximum T of data target value at least one data cell by described in A data cell is as first set.
Step 304: the data access equipment will meet the number of the first Rule of judgment at least one described data cell According to unit storage into the caching of the CPU.
First Rule of judgment is described in data cell is respectively positioned in continuous N number of period before the period 1 First set, and be not stored in the caching of the CPU, the N is the integer greater than 0.
In step 301, each data cell is the memory space for having default size.At least the one of data access equipment acquisition A data cell is all data cells that CPU is handled within the period 1, and the period 1 is a preset period, citing For, the period 1 is 1 minute.
In step 302, the attribute information value of the data cell is true according to the cold and hot data target value of each data cell Fixed;The attribute information includes following one or more: the autoregressive moving average (Autoregressive of data cell Moving average, ARMA) predicted value of model, the central processing unit data cell of the data access equipment access frequency Spatial cache size needed for degree, the storage access price of data cell and data cell.Cold and hot finger data scale value is for characterizing One data cell belongs to dsc data or cold data.
A kind of optional implementation of cold and hot finger data scale value for determining data cell is, at least one described data Each data cell in unit, the data access equipment determine the attribute information value of the data cell each single item and this Corresponding weighted value;
The data access equipment by each single item of the attribute information value of the data cell and the corresponding weighted value of this, Substitute into the operation result of preset formula, the cold and hot data target value as the data cell.
For example,
Wherein, Score (t) is the cold and hot data target value of the data cell,For auto regressive moving average ARMA The predicted value of model,For the visiting frequency of the central processing unit data cell of data access equipment,At CPU Duration cost is managed, O is the storage size of the cpu cache.t1、t2、t3, α, β, γ be configurable parameter, can be according to historical record The data set of formation carries out tune ginseng to configurable parameter by deep learning model, accurately identifies hot number using account of the history According to.t1Typically much deeper than t2、t3, α, β, γ representative give data processing cycle or time trend factor, recent processing frequency because The shared weight of element, processing cost factor in marking, at obvious data periodicity or time trend Reason then α need to configure it is larger, if there is obvious Recent data processing locality then β need to configure it is larger, if different numbers Huge according to processing cost difference, γ need to configure larger.
Optionally, data access equipment caches effect for different access scene optimization by providing different parameter configurations, Expand the scope of application of the data processing method.
The specific each single item of attribute information optionally determines that method is as follows:
(1) predicted value of the auto regressive moving average arma modeling of data cell.
It determines that method is to be fitted using ARIMA model row, obtains prediction model:
WhereinFor predicted value,For periodic portions,For trend part,For residual error portion Point,Reflect the processed history cycle of data cell and development trend.
(2) visiting frequency of the central processing unit data cell of data access equipment: for nearest a period of time range t2It is interior Average treatment frequency, determine method are as follows:
If the attribute information value of the data cell includes the visiting frequency of the data cell, the data access equipment According to number of processes and the duration of the period 1 of the data cell within the period 1, the data cell is determined The visiting frequency of the data cell.
For example,
WhereinFor (t-t2, t] and the sum of the number of processes counting of data cell in the time.
Reflect the recent processing frequency of data cell.
(3) the determination method of CPU handling duration cost are as follows:
If the attribute information value of the data cell includes the storage access price of data cell, the data access equipment According to the data cell within the period 1, the duration handled by the CPU number of processing and every time by the CPU, Determine the storage access price of the data cell.
For example,
WhereinFor (t-t3, t] and the sum of the processing time of data cell in the time,For (t- t3, t] and the sum of the number of processes counting of data cell in the time.
Ct3(t) time overhead that the processing of data cell needs is reflected.
(4) storage size of cpu cache is that the data cell stored in cpu cache takes up space corresponding value.Citing For, every 1 Mbytes of respective value is 1.
In step 303, the data access equipment is maximum by cold and hot data target value at least one described data cell T data cell as first set.
Optionally, the data access equipment will at least one described data cell according to each data cell described the The sequence of the cold and hot data target value from high to low in one period, by preceding T data cell, as first set;The T For the integer greater than 0;The K data cells of the data access equipment by the period 1 in the cpu cache, As second set;The K is the integer greater than 0;The data access equipment is according to the first set and second collection It closes, determines dsc data and cold data.
Optionally, the data access equipment by the first set subtract the second set as a result, as third collection It closes;
The data access equipment by the second set subtract the first set as a result, as the 4th set;
The data access equipment by within continuous X period 1 in P data sheet of the third set Member, as the dsc data;The P is natural number;X is positive integer;
The Q data sheet that the data access equipment will be gathered the described 4th within the continuous X period 1 Member, as the cold data;The Q is natural number.
Continuous N number of period before the period 1, adjacent with the period 1 in step 304, centre is without time interval.Separately Outside, each period is respectively positioned on first set expression, and the cold and hot data target value of the data cell in each period is at preceding T.
Optionally, the data access equipment obtains the second data cell in the cpu cache;Second data sheet Member is any data unit at least one described data cell;
The data access equipment is not if it is determined that second data cell is located in the first set, and described It is not located in the first set in continuous N number of period before one period, then discharges second data cell.
Another mode of the embodiment of the present invention replaces multiple cold datas with multiple dsc datas, specifically, if the P More than or equal to the Q, Q data cell replaces the Q before data access equipment sequence in the P data cell A data cell;Q data cell is the cold and hot number of the P data cell according to each data cell before the sequence The preceding Q data cell arranged in descending order according to index value;If the P is less than the Q, the data access equipment institute It states P data cell and replaces in the Q data cell P data cell before backward;P data cell is institute before the backward State the last P that Q data cell arranges in descending order according to the cold and hot data target value according to each data cell A data cell.
As shown in figure 4, being a kind of execution flow chart of data processing method provided in an embodiment of the present invention.
Step 401: data access equipment determines each data cell in all data cells that CPU in the period 1 is accessed Cold and hot data target value.
Step 402: data access equipment determines and refers to by cold and hot data according to the cold and hot data target value of each data cell The preceding T data cell of scale value sequence.
It should be noted that this T data cell is first set, the data sheet stored in cpu cache in the period 1 Member is second set.
Step 403: data access equipment determines third set and the 4th set according to first set and second set.
It is specific to determine method, it is described in detail in step 301~304, details are not described herein again.
Step 404: data access equipment determines in third set with the presence or absence of the continuous X period 1 in third set Element and the 4th set in the presence or absence of the continuous X period 1 the 4th gather element.
It should be noted that the element that third set, the 4th set all have the condition that meets just executes step 405, otherwise Execute step 406.
Step 405: data access equipment is according to the element for meeting condition in third set, the 4th set, to corresponding data Unit carries out swapping in and out operation, is specifically executed by cache manager 106.
Step 406: terminating process.
As shown in figure 5, being a kind of specific execution flow chart of data processing method provided in an embodiment of the present invention.
Step 501: data access equipment receives physics plan.The physics plan indicates data cell to be treated.
Step 502: data access equipment extracts data cell to be treated.
Step 503: data access equipment updates processing statistics.For example, the number of processes of data cell and every time place The time of reason.
Step 504: data access equipment determines whether alignment processing unit meets the condition of swapping in and out.Specifically judge item Part is described in detail in step 301~304, repeats no more.
Step 505: data access equipment executes caching swapping in and out operation.
Step 506: data access equipment executes processing operation to corresponding data unit.
Step 507: data access equipment updates the attribute information value of corresponding data unit according to this treatment process.
The embodiment of the present invention according to the attribute information value of each data cell at least one of, whether determine the data cell , by CPU processing but not in the data cell of cpu cache, to ensure that accuracy in N number of period;If so, data access equipment will The data cell is stored to cpu cache, so that the cold and hot biggish data cell of data target value is preferentially processed, is improved slow Hit rate and hot spot data access efficiency are deposited, disposed of in its entirety performance is improved.
As shown in fig. 6, being the structural schematic diagram of data processing equipment provided in an embodiment of the present invention.
The embodiment of the present invention provides a kind of data processing equipment, which includes:
Obtain module 601, at least one data cell handled within the period 1 for obtaining central processor CPU;
Processing module 602, for determining the cold and hot data target of each data cell at least one described data cell Value;The attribute information value of the data cell determines according to the cold and hot data target value of each data cell;The attribute letter Breath includes following one or more: the predicted value of the auto regressive moving average arma modeling of data cell, the data access are set Caching needed for the visiting frequency of standby central processing unit data cell, the storage access price of data cell and data cell is empty Between size;The data cell storage of the first Rule of judgment will be met at least one described data cell to the caching of the CPU In;First Rule of judgment is that data cell is respectively positioned on described first in continuous N number of period before the period 1 Set, and be not stored in the caching of the CPU, the N is the integer greater than 0.
Optionally, the processing module 602, is specifically used for:
For each data cell at least one described data cell, the every of the attribute information value of the data cell is determined One and the corresponding weighted value of this;And by each single item of the attribute information value of the data cell and the corresponding power of this Weight values substitute into the operation result of preset formula, the cold and hot data target value as the data cell.
Optionally, the processing module 602, is specifically used for:
If the attribute information value of the data cell includes the storage access price of data cell, existed according to the data cell In the period 1, accessed number and duration accessed every time determine the storage access price of the data cell.
Optionally, the processing module 602, is specifically used for:
If the attribute information value of the data cell includes the visiting frequency of the data cell, according to the data cell in institute The number being accessed in the period 1 and the duration of the period 1 are stated, determines the visit of the data cell of the data cell Ask frequency.
Optionally, the processing module 602 is also used to that the second Rule of judgment will be met at least one described data cell Data cell release;Second Rule of judgment was not located in the first set for data cell, and at described first week It is not located in the first set in continuous N number of period before phase.
It should be understood by those skilled in the art that, the embodiment of the present invention can provide as method, system or computer program Product.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the present invention Apply the form of example.Moreover, it wherein includes the computer of computer usable program code that the present invention, which can be used in one or more, The computer program implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) produces The form of product.
The present invention be referring to according to the method for the embodiment of the present invention, the process of device (system) and computer program product Figure and/or block diagram describe.It should be understood that every one stream in flowchart and/or the block diagram can be realized by computer program instructions The combination of process and/or box in journey and/or box and flowchart and/or the block diagram.It can provide these computer programs Instruct the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce A raw machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for real The device for the function of being specified in present one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates, Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one The step of function of being specified in a box or multiple boxes.
Although preferred embodiments of the present invention have been described, it is created once a person skilled in the art knows basic Property concept, then additional changes and modifications may be made to these embodiments.So it includes excellent that the following claims are intended to be interpreted as It selects embodiment and falls into all change and modification of the scope of the invention.
Obviously, various changes and modifications can be made to the invention without departing from essence of the invention by those skilled in the art Mind and range.In this way, if these modifications and changes of the present invention belongs to the range of the claims in the present invention and its equivalent technologies Within, then the present invention is also intended to include these modifications and variations.

Claims (10)

1. a kind of data processing method characterized by comprising
The central processor CPU that data access equipment obtains the data access equipment handled within the period 1 at least one Data cell;
The data access equipment determines the cold and hot data target value of each data cell at least one described data cell;Often The attribute information value of the data cell determines according to the cold and hot data target value of a data cell;The attribute information includes It is one or more below: the predicted value of auto regressive moving average arma modeling of data cell, the visiting frequency of data cell, number Spatial cache size needed for storage access price and data cell according to unit;
The data access equipment makees the maximum T data cell of data target value cold and hot at least one described data cell For first set;
The data access equipment arrives the data cell storage for meeting the first Rule of judgment at least one described data cell In the caching of the CPU;First Rule of judgment is that data cell is equal in continuous N number of period before the period 1 It positioned at the first set, and is not stored in the caching of the CPU, the N is the integer greater than 0.
2. the method as described in claim 1, which is characterized in that the data access equipment determines at least one described data sheet The cold and hot data target value of each data cell in member, comprising:
For each data cell at least one described data cell, the data access equipment determines the category of the data cell The each single item and the corresponding weighted value of this of the property value of information;
The data access equipment substitutes into each single item of the attribute information value of the data cell and the corresponding weighted value of this The operation result of preset formula, the cold and hot data target value as the data cell.
3. method according to claim 2, which is characterized in that the data access equipment determines the attribute letter of the data cell The each single item of breath value, comprising:
If the attribute information value of the data cell includes the storage access price of data cell, the data access equipment according to For the data cell within the period 1, accessed number and duration accessed every time determine depositing for the data cell Store up access price.
4. method according to claim 2, which is characterized in that the data access equipment determines the attribute letter of the data cell The each single item of breath value, comprising:
If the attribute information value of the data cell includes the visiting frequency of the data cell, the data access equipment is according to this The number and the duration of the period 1 that data cell is accessed within the period 1, determine the described of the data cell The visiting frequency of data cell.
5. the method as described in claim 1-4 is any, which is characterized in that further include:
The data access equipment discharges the data cell for meeting the second Rule of judgment at least one described data cell;Institute It states the second Rule of judgment not being located in the first set for data cell, and the continuous N before the period 1 It is not located in the first set in a period.
6. a kind of data processing equipment characterized by comprising
Obtain module, at least one data cell handled within the period 1 for obtaining central processor CPU;
Processing module, for determining the cold and hot data target value of each data cell at least one described data cell;Each The attribute information value of the data cell determines according to the cold and hot data target value of data cell;The attribute information include with The next item down is multinomial: the center of the predicted value of the auto regressive moving average arma modeling of data cell, the data access equipment Spatial cache size needed for the visiting frequency of processor data unit, the storage access price of data cell and data cell; The data cell for meeting the first Rule of judgment at least one described data cell is stored into the caching of the CPU;It is described First Rule of judgment is respectively positioned on the first set in continuous N number of period before the period 1 for data cell, and It is not stored in the caching of the CPU, the N is the integer greater than 0.
7. device as claimed in claim 6, which is characterized in that the processing module is specifically used for:
For each data cell at least one described data cell, each single item of the attribute information value of the data cell is determined And the corresponding weighted value of this;And by each single item of the attribute information value of the data cell and the corresponding weight of this Value, substitutes into the operation result of preset formula, the cold and hot data target value as the data cell.
8. device as claimed in claim 7, which is characterized in that the processing module is specifically used for:
If the attribute information value of the data cell includes the storage access price of data cell, according to the data cell described In period 1, accessed number and duration accessed every time determine the storage access price of the data cell.
9. device as claimed in claim 7, which is characterized in that the processing module is specifically used for:
If the attribute information value of the data cell includes the visiting frequency of the data cell, according to the data cell described The number and the duration of the period 1 being accessed in one period determine the access frequency of the data cell of the data cell Degree.
10. the device as described in claim 6-9 is any, which is characterized in that
The processing module is also used to release the data cell for meeting the second Rule of judgment at least one described data cell It puts;Second Rule of judgment be data cell be not located in the first set, and before the period 1 described in It is not located in the first set in continuous N number of period.
CN201811621057.8A 2018-12-28 2018-12-28 A kind of data processing method and device Pending CN109739646A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811621057.8A CN109739646A (en) 2018-12-28 2018-12-28 A kind of data processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811621057.8A CN109739646A (en) 2018-12-28 2018-12-28 A kind of data processing method and device

Publications (1)

Publication Number Publication Date
CN109739646A true CN109739646A (en) 2019-05-10

Family

ID=66361716

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811621057.8A Pending CN109739646A (en) 2018-12-28 2018-12-28 A kind of data processing method and device

Country Status (1)

Country Link
CN (1) CN109739646A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110399316A (en) * 2019-06-28 2019-11-01 苏州浪潮智能科技有限公司 A kind of method and apparatus for the optimization of operation system performance
CN112379842A (en) * 2020-11-18 2021-02-19 深圳安捷丽新技术有限公司 Method and device for predicting cold and hot properties of data
CN112699142A (en) * 2020-12-29 2021-04-23 平安普惠企业管理有限公司 Cold and hot data processing method and device, electronic equipment and storage medium
CN113064930A (en) * 2020-12-29 2021-07-02 中国移动通信集团贵州有限公司 Cold and hot data identification method and device of data warehouse and electronic equipment
CN113360553A (en) * 2020-03-03 2021-09-07 中国移动通信集团贵州有限公司 Data cold and hot degree evaluation method and server

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101369451A (en) * 2007-08-14 2009-02-18 三星电子株式会社 Solid state memory (ssm), computer system including an ssm, and method of operating an ssm
CN103688259A (en) * 2011-05-19 2014-03-26 甲骨文国际公司 Techniques for automatic data placement with compression and columnar storage
CN107844269A (en) * 2017-10-17 2018-03-27 华中科技大学 A kind of layering mixing storage system and method based on uniformity Hash
CN107870729A (en) * 2016-09-23 2018-04-03 伊姆西Ip控股有限责任公司 It is a kind of to be used for data cached method, apparatus and system
CN108874311A (en) * 2018-05-29 2018-11-23 北京盛和大地数据科技有限公司 Merge the data migration method and device in storage system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101369451A (en) * 2007-08-14 2009-02-18 三星电子株式会社 Solid state memory (ssm), computer system including an ssm, and method of operating an ssm
CN103688259A (en) * 2011-05-19 2014-03-26 甲骨文国际公司 Techniques for automatic data placement with compression and columnar storage
CN107870729A (en) * 2016-09-23 2018-04-03 伊姆西Ip控股有限责任公司 It is a kind of to be used for data cached method, apparatus and system
CN107844269A (en) * 2017-10-17 2018-03-27 华中科技大学 A kind of layering mixing storage system and method based on uniformity Hash
CN108874311A (en) * 2018-05-29 2018-11-23 北京盛和大地数据科技有限公司 Merge the data migration method and device in storage system

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110399316A (en) * 2019-06-28 2019-11-01 苏州浪潮智能科技有限公司 A kind of method and apparatus for the optimization of operation system performance
CN110399316B (en) * 2019-06-28 2021-10-01 苏州浪潮智能科技有限公司 Method and device for optimizing performance of service system
CN113360553A (en) * 2020-03-03 2021-09-07 中国移动通信集团贵州有限公司 Data cold and hot degree evaluation method and server
CN112379842A (en) * 2020-11-18 2021-02-19 深圳安捷丽新技术有限公司 Method and device for predicting cold and hot properties of data
CN112699142A (en) * 2020-12-29 2021-04-23 平安普惠企业管理有限公司 Cold and hot data processing method and device, electronic equipment and storage medium
CN113064930A (en) * 2020-12-29 2021-07-02 中国移动通信集团贵州有限公司 Cold and hot data identification method and device of data warehouse and electronic equipment
CN113064930B (en) * 2020-12-29 2023-04-28 中国移动通信集团贵州有限公司 Cold and hot data identification method and device of data warehouse and electronic equipment

Similar Documents

Publication Publication Date Title
CN109739646A (en) A kind of data processing method and device
Kraska et al. Sagedb: A learned database system
CN103345514B (en) Streaming data processing method under big data environment
US7716214B2 (en) Automated and dynamic management of query views for database workloads
CN107451393B (en) Nurse Scheduling method and apparatus based on random variable neighborhood search algorithm
CN104361113B (en) A kind of OLAP query optimization method under internal memory flash memory mixing memory module
CN110457397A (en) A kind of method and device that data are synchronous
JP2004518226A (en) Database system and query optimizer
JP6020448B2 (en) Database performance prediction apparatus and database prediction method
JP2017194778A (en) Tuning device and method for relational database
CN103164418A (en) Method and device for processing serial numbers
Hassan et al. Optimizing the performance of data warehouse by query cache mechanism
CN107533511A (en) The prediction of behaviour is cached using the real time high-speed of imaginary cache
Mukherjee Synthesis of non-replicated dynamic fragment allocation algorithm in distributed database systems
CN109783441A (en) Mass data inquiry method based on Bloom Filter
Wang et al. Skyline preference query based on massive and incomplete dataset
CN108509723A (en) LRU Cache based on artificial neural network prefetch mechanism performance income evaluation method
Zhang et al. Buffer pool aware query scheduling via deep reinforcement learning
CN106681830B (en) A kind of task buffer space monitoring method and apparatus
CN103365923A (en) Method and device for assessing partition schemes of database
CN112597076B (en) Spark-oriented cache replacement method and system based on data perception
Wang et al. An Improved Memory Cache Management Study Based on Spark.
Huang et al. A grid and density based fast spatial clustering algorithm
CN118210603A (en) Cloud resource scheduling method based on enhanced growth optimizer
CN117215785A (en) Micro-service resource scheduling method for quotation playback system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190510

RJ01 Rejection of invention patent application after publication