CN102929789A

CN102929789A - Record organization method and record organization structure

Info

Publication number: CN102929789A
Application number: CN2012103572253A
Authority: CN
Inventors: 马照云; 杨浩; 马振杰; 苗艳超; 刘新春; 邵宗有
Original assignee: Dawning Information Industry Beijing Co Ltd
Current assignee: Dawning Information Industry Beijing Co Ltd; Dawning Information Industry Co Ltd
Priority date: 2012-09-21
Filing date: 2012-09-21
Publication date: 2013-02-13
Anticipated expiration: 2032-09-21
Also published as: CN102929789B

Abstract

The invention discloses a record organization method and a record organization structure. The method comprises: assigning a unique identifier to each type of record; creating, in corresponding disk files, a data file and a metadata file for each type of record, wherein the data file of each type of record corresponds to one disk file; and performing only continuous append writes on the data file. The technical solution improves the performance of the metadata server under heavy load and extends well.

Description

Record organization method and record organization structure

Technical field

The present invention relates to storage systems of data servers, and more specifically to a record organization method and a record organization structure.

Background art

In a parallel storage system, auxiliary records or indexes often need to be kept. For example, to recover the data on a disk quickly after the disk fails, the system must record which objects reside on that disk; this requires adding a record when a file is created and deleting the corresponding record when the file is deleted. In addition, because the data servers keep multiple replicas, inconsistent objects also need to be recorded, and the corresponding record is deleted after repair completes. To improve metadata performance, deletion is asynchronous: the metadata server acknowledges the client as soon as it has set the deletion flag, and the actual removal of the inode (index node) and the data is handled by a garbage collection thread. The quota function likewise needs to record which files have undergone operations such as a change of owner, and a background thread sends messages to the data servers to modify the relevant quota information. To prevent records from being lost on power failure, which would leave some operations unfinished, these records must be written to an underlying disk file. To further improve reliability, they must also be logged through the metadata logging mechanism (so that the records are also backed up between master and slave and are as reliable as the metadata).

To make full use of the existing metadata mechanisms, the existing implementation maps each type of record to a directory inode, and each record is managed as a dentry entry of that inode. Adding a record therefore requires very little extra work, and when a record becomes invalid it can be deleted directly through the dentry deletion interface.

However, this implementation has the following problems:

1) It is difficult to extend: the metadata of a record type is a file system inode, and the reserved fields of an inode are limited, so attributes are hard to add once the record metadata that needs to be added exceeds that limit (modifying the inode, the most important structure of file system metadata, for the sake of auxiliary information is undesirable);

2) It wastes space: because each record is stored as a dentry entry, it inherits all the attributes of a dentry entry, but not all of those attributes are necessary for the record;

3) Most seriously, this implementation affects log application performance, and under heavy metadata load it can become a system bottleneck that degrades overall performance. The goal of the metadata hash is to distribute dentry entries as evenly as possible across the hash buckets, and the number of these auxiliary records is very large (because each file in the ParaStor file system may use multiple disks). When the extended hash becomes large, applying the log causes the disk head to seek back and forth, which greatly reduces application efficiency. Once log pressure has accumulated to a certain level, metadata log submission requests are temporarily refused, which in turn affects the performance of the metadata service as a whole.

No effective solution to these problems in the related art has yet been proposed.

Summary of the invention

To address the problems in the related art, the present invention proposes a record organization method and a record organization structure that improve the performance of the metadata server and extend well.

According to one aspect of the present invention, a record organization method is provided, comprising: assigning a unique identifier to each type of record; creating, in corresponding disk files, a data file and a metadata file for each type of record, wherein the data file of each type of record corresponds to one disk file; and performing only continuous append writes on the data file.

Preferably, the metadata information in the metadata file is indexed by the unique identifier.

Preferably, when a record in the data file becomes invalid, an offsetting record corresponding to the invalid record is inserted at the end of the data file.

Preferably, when the records need to be used, the data file is reclaimed, so that each invalid record is cancelled against its offsetting record.

More preferably, the reclamation is performed using a pre-registered reclaim function.

More preferably, the reclamation is performed by sequentially scanning the data file.

More preferably, the reclamation comprises: adding all offsetting records to a hash linked list; sequentially scanning the data file and, for each valid record, searching the hash linked list for an offsetting record; if the corresponding offsetting record is found, removing it from the hash linked list; and if no corresponding offsetting record is found, writing the valid record to a new temporary file.

Preferably, after all valid records have been scanned, the data file is replaced with the new temporary file.

Preferably, only a logical log is recorded when a log is generated.

According to another aspect of the present invention, a record organization structure is provided, comprising a data file and a metadata file corresponding to each type of record, wherein each type of record is assigned a unique identifier, the data file of each type of record corresponds to one disk file, and only continuous append writes are performed on the data file.

Through this simple metadata organization, the present invention eliminates the need to index the data file and implements record deletion by appending offsetting records, so that access to the objfile file is reduced to append writes and sequential reads, which improves disk read/write performance. It also uses logical logs to solve the problem that some information cannot be determined at log generation time because of thread concurrency.

Brief description of the drawings

To illustrate the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings required by the embodiments are briefly introduced below. The drawings described below show only some embodiments of the present invention; those of ordinary skill in the art can obtain other drawings from them without creative effort.

Fig. 1 is a flowchart of the record organization method according to the present invention;

Fig. 2 is a schematic diagram of the data organization structure according to an embodiment of the invention; and

Fig. 3 is a schematic diagram illustrating the record organization and the result of offsetting according to an embodiment of the invention.

Detailed description of the embodiments

The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art on the basis of these embodiments fall within the scope of protection of the present invention.

To address the problems in the prior art, the inventors propose a simple way of organizing records, described in detail below with reference to Fig. 1 to Fig. 3.

Fig. 1 is a flowchart of the record organization method according to the present invention.

With reference to Fig. 1, the record organization method according to the present invention comprises: S102, assigning a unique identifier to each type of record; S104, creating, in corresponding disk files, a data file and a metadata file for each type of record, wherein the data file of each type of record corresponds to one disk file; and S106, performing only continuous append writes on the data file.

Fig. 2 is a schematic diagram of the data organization structure according to an embodiment of the invention.

With reference to Fig. 2, the data organization structure according to the present invention is as follows. Each type of record is assigned a unique identifier (referred to as the fid, starting from 1). Each type of record corresponds to one disk file that stores the actual records; this file is named by the fid and is called the objfile file (data file). A structure stores the associated metadata information (such as the current number of records). This metadata information is kept in a file named 0 (called the objfile metadata file), in which it is indexed by fid.
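The layout described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the helper names (`data_path`, `meta_path`, `read_count`, `write_count`, `append_record`), the fixed 8-byte metadata slot holding only a record count, and the use of one directory as the root are all assumptions made for the example.

```python
import os
import struct
import tempfile

META_FMT = "<Q"                       # assumed slot layout: one 8-byte record count
META_SIZE = struct.calcsize(META_FMT)

def data_path(root, fid):
    """The data file (objfile) of a record type is named by its fid."""
    if fid < 1:
        raise ValueError("fids start at 1")
    return os.path.join(root, str(fid))

def meta_path(root):
    """Metadata for every record type is kept in the file named '0'."""
    return os.path.join(root, "0")

def read_count(root, fid):
    """Read the metadata slot of `fid`; the fid itself is the index."""
    try:
        with open(meta_path(root), "rb") as f:
            f.seek((fid - 1) * META_SIZE)
            raw = f.read(META_SIZE)
    except FileNotFoundError:
        return 0
    return struct.unpack(META_FMT, raw)[0] if len(raw) == META_SIZE else 0

def write_count(root, fid, count):
    """Overwrite only the slot that belongs to `fid`."""
    mode = "r+b" if os.path.exists(meta_path(root)) else "w+b"
    with open(meta_path(root), mode) as f:
        f.seek((fid - 1) * META_SIZE)
        f.write(struct.pack(META_FMT, count))

def append_record(root, fid, payload):
    """Append one record to the objfile of `fid` and bump its record count."""
    with open(data_path(root, fid), "ab") as f:   # append-only write
        f.write(payload)
    write_count(root, fid, read_count(root, fid) + 1)
```

Because the metadata slot is located directly from the fid, no separate index over the metadata file is needed in this sketch.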

Fig. 2 shows only the data files for two record types, "asynchronous inode deletion: 1" and "inconsistent object: 2", but those skilled in the art will understand that other record types and their corresponding data files may exist.

To simplify processing, when a record A in the data file becomes invalid, it is not immediately deleted from the objfile file; instead, an offsetting record A' is inserted at the end of the file.
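As a rough illustration of this append-only invalidation, the sketch below stores each record as one JSON line with a `cancel` flag. The line format and the function names `add_record` and `invalidate_record` are assumptions made for the example; the source does not specify the on-disk record layout.

```python
import json
import os
import tempfile

def _append_line(path, entry):
    # The objfile is only ever opened in append mode.
    with open(path, "a", encoding="utf-8") as f:
        f.write(json.dumps(entry) + "\n")

def add_record(path, key):
    """Append a valid record A."""
    _append_line(path, {"key": key, "cancel": False})

def invalidate_record(path, key):
    """Append the offsetting record A'; the original A is left untouched."""
    _append_line(path, {"key": key, "cancel": True})
```

Invalidation here costs one sequential write and never seeks back into the file, which is the property the paragraph above relies on.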

The difficulty here is that the log system writes to disk files according to the file name and offset held in each log record, whereas objfile records may be produced by concurrent threads, and each producing thread may generate records for several objfile files at once. The offset of a record within its file is therefore uncertain at log generation time. (The objfile lock cannot be held for the whole lifetime of a log transaction, because this is the critical path and doing so would reduce metadata performance; as a result, a log generated first is not necessarily submitted first.) To solve this problem, the inventors propose recording only a logical log at log generation time and using a single, unique thread to expand logical logs (when the log system encounters a logical log, it calls the logical log callback function to expand it).
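The idea can be sketched as follows, under stated assumptions: a logical log entry is modeled as a `(fid, payload)` pair with no offset, a `queue.Queue` stands in for the log submission order, and `expand_logical_logs` plays the role of the logical log callback. None of these names come from the source.

```python
import queue
import threading
from collections import defaultdict

def expand_logical_logs(logical_logs, file_sizes=None):
    """Turn (fid, payload) logical entries into (fid, offset, payload) entries.

    Only one thread runs this, so offsets are assigned in submission order
    without holding any objfile lock while the logs are being generated.
    """
    next_offset = defaultdict(int, file_sizes or {})
    physical = []
    for fid, payload in logical_logs:
        physical.append((fid, next_offset[fid], payload))
        next_offset[fid] += len(payload)
    return physical

def run_demo(num_producers=4, per_producer=50):
    submitted = queue.Queue()

    def producer(n):
        for i in range(per_producer):
            # No offset is recorded here: it is unknown until the
            # submission order of the concurrent producers is fixed.
            submitted.put((1 + i % 2, f"p{n}-r{i};".encode()))

    threads = [threading.Thread(target=producer, args=(n,))
               for n in range(num_producers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()

    logs = []
    while not submitted.empty():
        logs.append(submitted.get())
    return expand_logical_logs(logs)   # the single expander
```

Whatever order the producers happen to submit in, the expanded entries for each fid have contiguous, non-overlapping offsets, because the offset is computed only at expansion time.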

When the records need to be used, the objfile file is first reclaimed: A and A' cancel each other out, leaving only the valid records.

To improve extensibility, objfile supports the registration of reclaim functions: each type of record can register a specific reclaim function in advance according to its needs, and the registered function is used during reclamation.
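One possible shape for such a registration mechanism is sketched below. The registry `RECLAIMERS`, the decorator `register_reclaimer`, the in-memory `(key, is_cancel)` record tuples, and the special policy chosen for fid 2 are all hypothetical; the source only states that per-type reclaim functions can be registered in advance.

```python
RECLAIMERS = {}   # hypothetical registry: fid -> reclaim function

def register_reclaimer(fid):
    """Register a reclaim function for one record type."""
    def decorator(func):
        RECLAIMERS[fid] = func
        return func
    return decorator

def default_reclaim(records):
    """Fallback: each offsetting record cancels exactly one valid record."""
    pending = {}
    for key, is_cancel in records:
        if is_cancel:
            pending[key] = pending.get(key, 0) + 1
    kept = []
    for key, is_cancel in records:
        if is_cancel:
            continue
        if pending.get(key, 0) > 0:
            pending[key] -= 1          # cancelled by an offsetting record
        else:
            kept.append((key, is_cancel))
    return kept

def reclaim(fid, records):
    """Use the registered function for `fid` if there is one."""
    return RECLAIMERS.get(fid, default_reclaim)(records)

@register_reclaimer(2)
def reclaim_inconsistent_objects(records):
    # Assumed policy for illustration only: one offsetting record
    # clears every entry recorded for that object.
    cancelled = {key for key, is_cancel in records if is_cancel}
    return [(k, c) for k, c in records if not c and k not in cancelled]
```

With this arrangement a new record type needs only a new fid and, if its semantics differ from the default, one registered function.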

Alternatively, the default reclamation method can be used. The default method sequentially scans the objfile file and adds the offsetting records to a hash linked list. After this scan completes, the file is sequentially scanned again; this second scan considers only valid records. For each valid record, the hash linked list is first searched for an offsetting record. If one is found, the offsetting record is removed from the hash linked list; otherwise, the record is written to a new temporary file. After this scan completes, the temporary file replaces the objfile file. The objfile record organization and the result of offsetting are shown in Fig. 3.
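The default two-pass reclamation can be sketched as below. It assumes, purely for illustration, that records are JSON lines carrying a `key` and a `cancel` flag, and it uses a `collections.Counter` in place of the hash linked list; `reclaim_objfile` is a hypothetical name.

```python
import json
import os
import tempfile
from collections import Counter

def reclaim_objfile(path):
    """Cancel valid records against offsetting records, keeping the rest."""
    # Pass 1: collect every offsetting record in a hash table.
    pending = Counter()
    with open(path, encoding="utf-8") as f:
        for line in f:
            entry = json.loads(line)
            if entry["cancel"]:
                pending[entry["key"]] += 1

    # Pass 2: look only at valid records; survivors go to a temporary file.
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w", encoding="utf-8") as out, \
            open(path, encoding="utf-8") as f:
        for line in f:
            entry = json.loads(line)
            if entry["cancel"]:
                continue
            if pending[entry["key"]] > 0:
                pending[entry["key"]] -= 1      # remove the matched offset
            else:
                out.write(json.dumps(entry) + "\n")

    # The temporary file replaces the objfile.
    os.replace(tmp, path)
```

Both passes read the file front to back and the output is written sequentially, so the sketch touches the disk only with sequential reads and append writes.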

The organization described above provides the following advantages:

1) Because the objfile file receives only continuous append writes and the record organization is simple, adding a new record never causes a hash bucket to expand and already-written records to be relocated. This greatly improves log application efficiency and improves the performance of the metadata server under heavy load.

2) No useless attributes are written to the disk file, which saves space compared with the original approach. The objfile metadata structure and record structure are designed specifically for this purpose, so they contain no redundant information.

3) It extends well: members can easily be added to or removed from the objfile-related structures, and when a new auxiliary record type arises it can readily be added using this framework, with any special requirements met by registering functions.

In summary, the technical solution of the present invention proposes a simple objfile metadata organization: each type of record is assigned a specific fid, which serves as the index of the associated metadata in the objfile metadata file. Using the fid eliminates the need to index objfile records, and record deletion is implemented by appending offsetting records, so access to the objfile file is reduced to append writes and sequential reads, which improves disk read/write performance. In addition, logical logs solve the problem that some information cannot be determined at log generation time because of thread concurrency.

The above are merely preferred embodiments of the present invention and are not intended to limit it. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims

1. A record organization method, characterized in that the method comprises:

assigning a unique identifier to each type of record;

creating, in corresponding disk files, a data file and a metadata file for each type of record, wherein the data file of each type of record corresponds to one disk file; and

performing only continuous append writes on the data file.

2. The method according to claim 1, characterized in that the metadata information in the metadata file is indexed by the unique identifier.

3. The method according to claim 1, characterized in that, when a record in the data file becomes invalid, an offsetting record corresponding to the invalid record is inserted at the end of the data file.

4. The method according to claim 3, characterized in that, when the records need to be used, the data file is reclaimed, so that the invalid record is cancelled against the offsetting record.

5. The method according to claim 4, characterized in that the reclamation is performed using a pre-registered reclaim function.

6. The method according to claim 4, characterized in that the reclamation is performed by sequentially scanning the data file.

7. The method according to claim 6, characterized in that the reclamation comprises:

adding all offsetting records to a hash linked list;

sequentially scanning the data file and, for each valid record, searching the hash linked list for an offsetting record;

if the corresponding offsetting record is found, removing that offsetting record from the hash linked list;

if no corresponding offsetting record is found, writing the valid record to a new temporary file.

8. The method according to claim 7, characterized in that, after all valid records have been scanned, the data file is replaced with the new temporary file.

9. The method according to any one of claims 1 to 8, characterized in that only a logical log is recorded when a log is generated.

10. A record organization structure, characterized in that the record organization structure comprises a data file and a metadata file corresponding to each type of record, wherein each type of record is assigned a unique identifier, the data file of each type of record corresponds to one disk file, and only continuous append writes are performed on the data file.