CN102521307A

CN102521307A - Parallel query processing method for share-nothing database cluster in cloud computing environment

Info

Publication number: CN102521307A
Application number: CN2011103926770A
Authority: CN
Inventors: 李睿峰; 王殿成; 冯玉; 李祥凯; 冷建全
Original assignee: Beijing Kingbase Information Technologies Co Ltd
Current assignee: Beijing Kingbase Information Technologies Co Ltd
Priority date: 2011-12-01
Filing date: 2011-12-01
Publication date: 2012-06-27

Abstract

The invention discloses a parallel query processing method for a share-nothing database cluster in a cloud computing environment. The method comprises the following steps of: firstly, segmenting a query plan to obtain sections which are executed by all data nodes; secondly, performing section processing on all data nodes to obtain Hash joins of all data nodes; and thirdly, performing Hash aggregation on control nodes by flowing of aggregated data between all data nodes to obtain a query result set. By the flowing operation of node data, the data can flow between all nodes in the query execution process, the parallel execution query process is finished, and an execution speed is greatly increased.

Description

Shared-nothing database cluster parallel query disposal route under a kind of cloud computing environment

Technical field

The present invention relates to a kind of database parallel query method, relate in particular to the shared-nothing database cluster parallel query disposal route under a kind of cloud computing environment, belong to the data base cluster system technical field.

Background technology

Along with deepening continuously of enterprise and e-government; The complicacy of database application strengthens day by day; Press for the problem that solves mass data processing, mass data storage and enhanced scalability, so that good data is stored, inquiry service for business support system provides.But the demand can't be fundamentally satisfied in traditional single-node data storehouse.

For this reason, people use for reference the technical concept of computer cluster (one group of loose integrated computer software and/or hardware are coupled together highly closely cooperation accomplish evaluation work), and a plurality of databases are coupled together the composition data base cluster system.Data base cluster system (DatabaseCluster System; Be called for short DBCS) Clustering is combined with Database Systems; It is one group of complete, autonomous calculation processing unit (node); Each node all has hardware resources such as CPU, internal memory and disk alone, moves independently operating system and autonomous Database Systems.Each node is worked in coordination with each other and is calculated through high-speed dedicated network or the interconnection of commercial universal network, and system provides parallel transaction services as the unified data storehouse.

At present, in database field two kinds of data-base clusters are arranged: a kind of is not have (ShareNothing) data-base cluster of sharing, and another kind is shared disk (Share Disk) data-base cluster.Along with the rise of development of database and cloud computing technology, the shared-nothing database cluster obtains widespread use with advantages such as its concurrency are good in the cloud computing field.

The data base querying service is one of application scenarios that running frequency is the highest in these data base cluster systems.In application number was 201010277129.9 Chinese invention patent application, a kind of parallel query method of distributed data base was disclosed.In this method, the inquiry proxy module is set at first, makes the inquiry proxy module directly receive the querying command of external module; Next is provided with a plurality of inquiry nucleus modules that are associated with the inquiry proxy module; The inquiry nucleus module provides the granularity inquiry to the data multidimensional through task resolution, improves whole response speed, and the inquiry nucleus module passes through multithreading; Many concurrent Query Databases; Give full play to the high-throughput of commercial data base, high concurrent characteristics, simultaneously multithreading is coordinated control, the querying flow management; The 3rd, mechanism is read in form and the employing of adopting batch data to return in advance, improves response speed.The thought of this method through query task " is divided and rule "; By the how concurrent initiation inquiry of inquiry nucleus module; Preferentially return the fast granularity data of inquiry velocity, thereby reached the effect that the entire system query performance significantly promotes, can satisfy the needs of most application system fast query.

Summary of the invention

Technical matters to be solved by this invention is to provide the disposal route of the shared-nothing database cluster parallel query under a kind of cloud computing environment.

For realizing above-mentioned goal of the invention, the present invention adopts following technical scheme:

Shared-nothing database cluster parallel query disposal route under a kind of cloud computing environment is characterized in that may further comprise the steps:

The first step: inquiry plan is cut apart, obtained the section of carrying out respectively by each back end;

Second step: each back end carries out slicing treatment respectively, and the Hash that obtains each back end connects;

The 3rd step: each back end flows through the polymerization combined data, carries out the Hash polymerization in Control Node, obtains query results.

Wherein more excellently, in said second step, the slicing treatment of each back end may further comprise the steps:

Data node scan of step 1) first table obtains first Hash table, scans second table then, carries out Hash with first Hash table and is connected, and obtains first data set;

Step 2) another back end scans the 3rd table, obtains the 3rd Hash table, scans the 4th table then, carries out Hash with the 3rd Hash table and is connected, and obtains second data set;

Said another back end of step 3) carries out Hash based on said first data set with said second data set and is connected.

Perhaps, the slicing treatment of each back end adopts following steps:

Data node scan of step 11) first table obtains first Hash table, scans second table then; Carrying out Hash with first Hash table is connected; Obtain first data set, and second table is perhaps redistributed the streams data operation through the mobile operation of broadcast data, send to other back end;

Another back end of step 12) scans the 3rd table, based on said second table that receives and the 3rd table of scanning, obtains the 3rd Hash table, scans the 4th table then, carries out Hash with the 3rd Hash table and is connected, and obtains second data set;

Said another back end of step 13) carries out Hash based on said first data set with said second data set and is connected.

Wherein more excellently, in said the 3rd step, after said Hash connection the carrying out Hash polymerization and ordering that each back end obtains in going on foot second, through said polymerization combined data flow operation, send to said Control Node again.

The present invention utilizes the node data flow operation, make the inquiry implementation in data can between each node, flow, thereby make each back end all obtain needed total data, accomplish concurrent execution query script.Because being carried out the parallel query support, handles query statement, so promoted execution speed significantly.

Description of drawings

Below in conjunction with accompanying drawing and embodiment the present invention is done further detailed description.

Fig. 1 is in the specific embodiment of the present invention, the synoptic diagram that node data flows;

Fig. 2 is in the specific embodiment of the present invention, the synoptic diagram of employed inquiry plan tree.

Embodiment

Under cloud computing environment, the data in the shared-nothing database cluster are burst storages, and each back end is some data only.In the implementation of some SQL statement, some back end need be operated the total data that comprises other node datas could accomplish inquiry plan.For this reason; Suitable local the insert node data flow operation of the present invention through setting at the inquiry plan of back end; Make the inquiry implementation in data can between each node, flow, thereby make each back end all obtain needed total data, accomplish concurrent execution query script.Bright specifically in the face of this expansion down.

In certain shared-nothing database cluster, comprise Control Node and one or more back end as the specific embodiment of the invention.Each back end is responsible for storing data, and the plan of accepting the interview is also carried out, and returns the plan result then.User's data for example adopts Hash (hash also claims hash) distribution mode or scope (range) distribution mode etc. to leave in respectively on each back end at first with more uniform mode.Control Node is accepted the query requests of client, analysis request and generated query plan, and the distribution inquiry plan is given back end, so that the concurrent execution inquiry of each back end.

In order to realize maximized parallel query; The present invention is divided into different sections with inquiry plan; Be used for each back end and carry out, and in section, add the appropriate data flow operation, carry out inquiry so that each back end can access required total data.Each section all is the part of inquiry plan, and can independently on each back end, work.The execution content that comprises in partially sliced has been Duoed single stepping than the traditional database query script: streams data.The streams data of section in being according to schedule cut apart to inquiry plan operated and done.The both sides that data flow operation relates to are arranged in different sections.

Before address; The inquiry plan of some back end must have the data of other back end could be accomplished; The suitable local node data flow operation of inserting of for this reason cutting into slices at the inquiry plan of this back end; Data required in the inquiry implementation are flowed between each node, and so each back end all can obtain needed total data, has also guaranteed all execution separately on each back end of query manipulation simultaneously.Need to prove that not all inquiry plan all needs streams data operation, for example inquiry system table information etc. does not just need.

In the present invention, streams data comprises three types:

A: the polymerization combined data flows, and is meant that the data after each back end is with Hash connection (join) are dealt on the single node, normally is dealt into Control Node and merges.

B: the redistribution streams data, promptly heavy distributing data flows, and is meant the difference of the cryptographic hash of the train value that utilizes Hash to connect (join), and the data after the screening are redistributed on other back end.

C: broadcast data flows, i.e. the data that the data node need send to a plurality of node broadcasts.The redistribution streams data is to send data to specific minority back end, and it is to send data to a plurality of nodes that broadcast data flows.

Why need broadcast data to flow and the redistribution streams data; Be because all only preserved a part of data on each back end; When occurring doing connection (join) operation between the different pieces of information node, need the data of oneself be sent to other back end.This two operations have been arranged, just can guarantee all execution separately on each back end of all connections (join) operation.It is after each back end has been carried out inquiry plan separately that the polymerization combined data flows, and data are mail on the Control Node, is integrated by Control Node then and returns to client.

The broadcast data energy of flow guarantees that enough the data integrity of inquiring about on the individual data node under all situations, following of situation about having need do the redistribution data manipulation and just can guarantee the data integrity of inquiring about on the individual data node.

Under cloud computing environment, the shared-nothing database cluster is the database of a distributed storage.For the user, need from the database of these distributed storage, obtain complete data at any time.The polymerization combined data flows in order to guarantee that it is complete returning to user's data.And broadcast data flows and the redistribution streams data is the integrality that guarantees data during for inquiry on the individual data node.

Based on the classification of above-mentioned streams data, the suitable local node data flow operation of setting at the inquiry plan of back end of inserting also is divided three classes: broadcast data operation, redistribution data manipulation and polymerization combined data are operated.

The mobile operation of broadcast data is the data that node need send to a plurality of node broadcasts.

The operation of redistribution streams data is a difference of utilizing the cryptographic hash of the train value (connecting key) that connects (join), and the data after the screening are redistributed on other back end.

Polymerization combined data flow operation is that the data that each node will be carried out after inquiring about are dealt on the single node, normally is dealt into Control Node and merges.

Below in conjunction with Fig. 1 and Fig. 2, inquiring about with the order in a commercial management field is example, and practical implementation step of the present invention is described further.

In this order query example, suppose following query statement:

The implication of above-mentioned query statement is in nearest 2 months, the consumption total value what is pressed, the statistics rank of from high to low the customer consumption total value being carried out.Relate to 4 tables altogether: client (customer) table, order (orders) table, commodity (lineitem) table and nationality (nation) table.

Under the environment of shared-nothing database cluster, suppose that the data of above 4 tables all relatively have been evenly distributed on n the data node, inquiry plan is divided into and data section the count consistent section of n, i.e. n section.Streams data between each back end is as shown in Figure 1, and corresponding inquiry plan tree is as shown in Figure 2.

The whole step of the parallel query that the present invention adopted is:

The concrete steps of the slicing treatment of each back end are as shown in Figure 1 in second step, comprising:

Connect to the Hash that step 3) has just obtained each back end through step 1), carried out for the 3rd step again to obtain result set.

If having certain data (supposing second table) of a data node is that a plurality of back end all need, so just need utilizes broadcast data to flow and operate.If having certain data (supposing second table) of a data node is another back end needs, so just need to utilize the operation of redistribution streams data.

Particularly, abovementioned steps 1) can change to step 3):

Another back end of step 13) carries out Hash based on said first data set with said second data set and is connected.

Below in conjunction with specific embodiment, be elaborated based on Fig. 2:

Steps A. the nationality of scanning separately simultaneously on each back end shows data, and the nationality on each back end is shown data to other node broadcasts, and promptly broadcast data flows, and makes each back end obtain whole nationalities and shows data.Because nationality's table record bar number is few, so this step execution is very fast.At this, supposing needs nationality's table is broadcasted according to search request.

Scan customer data separately simultaneously on each node of step B., and carry out Hash based on the Hash table that customer data and the nationality who receives show data and be connected, generate the RS-CN data set.

Each node of step C. scans order table data separately simultaneously, and filtering data generates the RS-O data set.

Each node of step D. scans commodity list data separately simultaneously, filters to generate the RS-L data set.

Step e. the RS-O Hash table that each node simultaneously will be separately carries out Hash with RS-L and is connected, and generation RS-OL data set notices that this process need not carry out the redistribution data manipulation, because the branch Boulez of order and commodity all is the key value of inquiring about (order key).This has just guaranteed that the object that need connect separately all is on machine separately, so n node just begins parallel the connection.

Step F. each node redistributes the streams data operation with the own RS-OL data set that generates in step e according to custkey (client's key value) between all nodes; Can be according to Hash (hash; Hash) distribution mode or scope (range) distribution mode redistributes data between node, and acquiescence adopts the Hash distribution mode in the present embodiment.Like this, each node all can have the RS-OL data set of oneself.

Each node of step G. will the own RS-CN data set that generates at step B, carries out Hash with the RS-OL data set through the redistribution streams data on the own node and is connected.

Last each back end of step H. carries out polymerization, ordering, mails to Control Node.

Shared-nothing database cluster parallel query disposal route provided by the present invention makes full use of the computing power of each clustered node, the request of concurrent processing data query under cloud computing environment.Can be on this theoretical method with the increase of node number the continuous handling capacity of elevator system, continue the satisfying magnanimity data query to performance demands.

More than shared-nothing database cluster parallel query disposal route provided by the present invention has been carried out detailed explanation.To those skilled in the art, any conspicuous change of under the prerequisite that does not deviate from connotation of the present invention, it being done all will constitute to infringement of patent right of the present invention, with corresponding legal responsibilities.

Claims

1. the shared-nothing database cluster parallel query disposal route under the cloud computing environment; Said data-base cluster comprises Control Node and a plurality of back end; Said Control Node is accepted the query requests of client; The generated query plan also is distributed to said back end, it is characterized in that may further comprise the steps:

The first step: said inquiry plan is cut apart, obtained the section of carrying out respectively by each back end;

The 3rd step: each back end carries out the Hash polymerization through polymerization combined data flow operation in Control Node, obtains query results.

2. shared-nothing database cluster parallel query disposal route as claimed in claim 1 is characterized in that:

In said second step, the slicing treatment of each back end may further comprise the steps:

3. shared-nothing database cluster parallel query disposal route as claimed in claim 1 is characterized in that:

4. shared-nothing database cluster parallel query disposal route as claimed in claim 3 is characterized in that:

The mobile operation of said broadcast data is the data that node need send to a plurality of node broadcasts.

5. shared-nothing database cluster parallel query disposal route as claimed in claim 3 is characterized in that:

Said redistribution streams data operation is a difference of utilizing the cryptographic hash of the train value that connects, and the data after the screening are redistributed on other back end.

6. shared-nothing database cluster parallel query disposal route as claimed in claim 1 is characterized in that:

In said the 3rd step, after said Hash connection the carrying out Hash polymerization and ordering that each back end obtains in going on foot second, through said polymerization combined data flow operation, send to said Control Node again.