
CN109933478A - Fault handling method for a storage system, and storage system - Google Patents

Fault handling method for a storage system, and storage system

Info

Publication number
CN109933478A
CN109933478A CN201711377004.1A CN201711377004A CN109933478A CN 109933478 A CN109933478 A CN 109933478A CN 201711377004 A CN201711377004 A CN 201711377004A CN 109933478 A CN109933478 A CN 109933478A
Authority
CN
China
Prior art keywords
link
storage
storage controller
branch
connection chip
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711377004.1A
Other languages
Chinese (zh)
Other versions
CN109933478B (en
Inventor
刘玉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Cloud Computing Technologies Co Ltd
Original Assignee
Hangzhou Huawei Digital Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou Huawei Digital Technologies Co Ltd
Priority to CN201711377004.1A
Publication of CN109933478A
Application granted
Publication of CN109933478B
Status: Active
Anticipated expiration

Landscapes

  • Techniques For Improving Reliability Of Storages (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The present invention provides a fault handling method for a storage system, and a storage system. In existing storage systems there are multiple links between a storage controller and a storage disk. When an I/O times out on the active link, such systems only switch to a standby link for subsequent I/O operations, or only repair the storage disk. To address this problem, the application monitors for operation anomalies and, when they occur, repairs the connection chip on the active link. The active link is thereby restored, subsequent I/O requests can switch back to it, and I/O execution efficiency improves.

Description

Fault handling method for a storage system, and storage system
Technical field
The present invention relates to the field of storage technologies, and in particular to a fault handling method for a storage system and to a storage system.
Background
A hard disk array in a storage system uses a storage controller to centrally manage multiple hard disks connected through interfaces and protocols, providing a mature, reliable, large-capacity data storage service.
A typical hard disk array architecture is a dual-controller system in which two controllers are connected through a mirror channel and exchange data over it. The hard disks are usually dual-port disks: the back ends of both controllers can be connected to every hard disk, so each hard disk is visible to both controllers. The two controllers of a hard disk array can be connected to multiple hard disks through PCIe chips, so that each hard disk has two links, one to each controller.
In prior-art solutions, the controller manages all hard disks through hard disk management software and stores hard disk link information. By monitoring disk state, it detects anomalies promptly and diagnoses them, so as to repair as many anomalies as possible and ensure the reliability of the data stored in the hard disk array.
However, prior-art fault handling solutions repair only disk state and hard disk faults. They cannot repair faults in the link, which increases I/O processing latency.
Summary of the invention
Whereas the prior art repairs only disk state and hard disk faults, the present invention provides a method and a storage system for repairing faults of the connection chip in a link.
In a first aspect, the application provides a storage fault handling method applied to a storage system. The storage system includes at least one storage disk and at least two storage controllers; each storage controller includes a connection chip and is connected to each storage disk through its own connection chip; and the at least two storage controllers are interconnected.
The method includes:
A first storage controller receives a first data operation request and sends a first operation instruction, over a first branch of a first link, to the first storage disk corresponding to the first data operation request. The first link is the link that includes the connection chip of the first storage controller; the first branch of the first link is the connection of the first link whose target end is the first storage disk; and the first storage controller is any one of the at least two storage controllers.
The first storage controller detects that execution of the first operation instruction has timed out, and forwards the first operation instruction to the first storage disk over a first branch of a second link. The second link is the link that includes the connection chip of a second storage controller; the first branch of the second link is the connection of the second link whose target end is the first storage disk; and the second storage controller is any storage controller connected to the first storage controller.
The first storage controller receives an operation success response for the first operation instruction, transmitted over the first branch of the second link, and counts operation anomalies according to the operation success response. An operation anomaly indicates that an operation instruction received by the first storage controller timed out when executed over the first link but succeeded when executed over the second link.
The first storage controller determines that the number of operation anomalies counted within a predetermined time exceeds a predetermined threshold, and performs fault repair on the connection chip in the first link.
In the above method, after switching from the first link to the second link, the storage controller counts operation anomalies according to how operation instructions actually execute. Once the count reaches the predetermined threshold, it repairs the first link. Link faults are thereby identified and repaired, which improves the stability of the storage system and the execution efficiency of subsequent data operations.
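The steps of the first aspect can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `LinkAnomalyMonitor`, `handle_request`, `send_first`, `send_second`, and `repair` are hypothetical, and treating "within a predetermined time" as a sliding time window is an assumption.

```python
import time
from collections import deque


class LinkAnomalyMonitor:
    """Counts operations that timed out on the first link but succeeded on the second."""

    def __init__(self, threshold, window_seconds, clock=time.monotonic):
        self.threshold = threshold            # predetermined threshold
        self.window_seconds = window_seconds  # predetermined time (assumed sliding window)
        self.clock = clock
        self.events = deque()                 # timestamps of recent operation anomalies

    def record_anomaly(self):
        """Record one anomaly; return True when the count exceeds the threshold."""
        now = self.clock()
        self.events.append(now)
        # Drop anomalies that fall outside the predetermined time window.
        while self.events and now - self.events[0] > self.window_seconds:
            self.events.popleft()
        return len(self.events) > self.threshold


def handle_request(op, send_first, send_second, monitor, repair):
    """send_first/send_second (hypothetical) return True on success, False on timeout."""
    if send_first(op):
        return "ok-first-link"
    if send_second(op):
        # Timeout on the first link plus success on the second is an operation anomaly.
        if monitor.record_anomaly():
            repair()  # fault repair of the connection chip in the first link
        return "ok-second-link"
    return "failed"
```

In this sketch the repair callback fires on the first anomaly that pushes the windowed count above the threshold; a real controller would also need to avoid triggering a second repair while one is in progress.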
For the first aspect, one possible way of counting operation anomalies is as follows. According to a statistical rule and the operation success response, the first storage controller either increments the operation anomaly count by one or leaves the previously counted value unchanged. The statistical rule is that operation anomalies on each branch of the first link are counted only once. Correspondingly, the predetermined threshold is less than or equal to the number N of storage disks in the storage system. In this specific implementation, the predetermined threshold is set to the number N of storage disks, that is, to the number of branches of the first link. An operation anomaly on a branch of the first link is counted once; further anomalies on a branch that has already been counted are not counted again. Once every branch has experienced an operation anomaly, the count reaches N, that is, the predetermined threshold, and the first link can be judged to have a link fault.
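The count-once-per-branch rule maps naturally onto a set. The following Python sketch assumes that the link is judged faulty when the count reaches the threshold, as the paragraph above describes; the class name `BranchAnomalyCounter` is hypothetical.

```python
class BranchAnomalyCounter:
    """Counts operation anomalies on the first link, at most once per branch."""

    def __init__(self, num_disks, threshold=None):
        self.num_disks = num_disks
        # By default the threshold equals N, the number of branches of the first link.
        self.threshold = num_disks if threshold is None else threshold
        if not 1 <= self.threshold <= num_disks:
            raise ValueError("threshold must be between 1 and the number of storage disks")
        self.abnormal_branches = set()

    def record(self, branch):
        """Count an anomaly on `branch`; return True when the threshold is reached."""
        # Adding a branch that is already in the set leaves the count unchanged.
        self.abnormal_branches.add(branch)
        return len(self.abnormal_branches) >= self.threshold
```

With three storage disks, repeated anomalies on branch 1 never trigger a repair on their own; only anomalies on all three branches do, which points to the shared connection chip rather than a single disk.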
For the first aspect, in one possible implementation, the method also includes the following. After determining that an operation anomaly has occurred on the n-th branch of the first link, the first storage controller sets a fault tag on the n-th branch of the first link. The fault tag indicates that the n-th branch of the first link is unavailable or that its level has been lowered; n is a natural number with 1 ≤ n ≤ N. After receiving a subsequent data operation request for the n-th storage disk, the first controller, according to the fault tag of the n-th branch of the first link, sends the subsequent operation instruction to the n-th storage disk directly over the n-th branch of the second link.
Until the fault on the first link is repaired, this implementation prevents subsequent operation instructions from being delayed or failing.
Further, after fault repair is performed on the connection chip in the first link, the method also includes: deleting the fault tag of every branch of the first link, or setting a normal tag on every branch of the first link, where the normal tag indicates that the first link is available or that the level of the first link is normal. Then, after receiving a subsequent data operation request for the first storage disk, the first controller, according to the state of the first link's fault tag or the first link's normal tag, switches back to the first link to send the operation instruction to the storage disk targeted by the subsequent data operation request.
With this implementation, after the first storage controller receives subsequent data operation requests to be sent to a storage disk, it selects the first link directly for sending operation instructions, according to the state of the first link (the fault tag has been deleted, or the first link carries a normal tag). Because the path of the first link is shorter than that of the second link, subsequent operation instructions are processed sooner. This avoids the latency incurred by executing subsequent operation instructions over the second link and improves the processing efficiency of operation instructions.
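The fault-tag routing and the switch back after repair can be sketched as follows. This is an illustrative Python sketch only; `BranchRouter` and its method names are hypothetical, and it models the "delete the fault tag" variant rather than the "set a normal tag" variant.

```python
class BranchRouter:
    """Chooses a link for each request based on per-branch fault tags."""

    def __init__(self, num_disks):
        self.num_disks = num_disks
        self.faulty = set()  # branches of the first link that carry a fault tag

    def mark_faulty(self, n):
        """Set the fault tag on the n-th branch of the first link (1 <= n <= N)."""
        if not 1 <= n <= self.num_disks:
            raise ValueError("branch index out of range")
        self.faulty.add(n)

    def clear_after_repair(self):
        """Delete the fault tag of every branch once the connection chip is repaired."""
        self.faulty.clear()

    def choose_link(self, n):
        """Return which link should carry a request for the n-th storage disk."""
        return "second" if n in self.faulty else "first"
```

A request for a tagged branch skips the first link entirely, so it does not wait for another timeout; once the tags are cleared, requests return to the shorter first link.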
Optionally, after fault repair is performed on the connection chip of the first link, the method also includes: detecting whether the connection chip in the first link was repaired successfully; and, after detecting that the repair succeeded, deleting the fault tag of the first link or setting the normal tag of the first link.
After repairing the first link, the above method further checks the first link to obtain its true state, so that subsequent operations are executed according to the true link state.
Optionally, the method also includes: after detecting that the connection chip in the first link was not repaired successfully, issuing a fault maintenance notification for the connection chip in the first link.
Specifically, performing fault repair on the connection chip in the first link includes: restarting the connection chip of the first storage controller; or isolating the connection chip of the first storage controller; or repairing a queue on the connection chip of the first storage controller; or repairing a port on the connection chip of the first storage controller.
The above link repair focuses on repairing the connection chip on the link, so that the hardware problem is addressed at its root and repair efficiency is ensured.
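The repair, verification, and notification steps above can be combined into one routine. The Python sketch below is an assumption-laden illustration: the four enum members mirror the repair options listed in the text, while `repair_and_verify` and its callback parameters are hypothetical stand-ins for hardware-specific operations.

```python
from enum import Enum


class RepairAction(Enum):
    RESTART_CHIP = "restart the connection chip"
    ISOLATE_CHIP = "isolate the connection chip"
    REPAIR_QUEUE = "repair a queue on the connection chip"
    REPAIR_PORT = "repair a port on the connection chip"


def repair_and_verify(action, apply_action, chip_is_healthy,
                      clear_fault_tags, notify_maintenance):
    """Apply one repair action, then act on the verified state of the chip."""
    apply_action(action)
    if chip_is_healthy():
        # Repair succeeded: the first link may be used again.
        clear_fault_tags()
        return True
    # Repair failed: keep the fault tags and ask for maintenance.
    notify_maintenance()
    return False
```

Keeping the fault tags in place when verification fails means requests continue to use the second link until the chip is serviced.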
Optionally, after the first storage controller detects that execution of the first operation instruction has timed out, the method also includes: the first storage controller records a first label, which indicates that the first operation instruction timed out when executed over the first branch of the first link. Before operation anomalies are counted according to the operation success response, the method also includes: the first storage controller records a second label, which indicates that the first operation instruction succeeded when executed over the first branch of the second link; it determines whether the first operation instruction carries both the first label and the second label; and, if the first operation instruction carries both labels, it determines that an operation anomaly has occurred.
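The two-label check reduces to a logical AND over two flags kept per operation instruction. A minimal Python sketch, with the hypothetical names `OperationRecord` and `is_operation_anomaly`:

```python
from dataclasses import dataclass


@dataclass
class OperationRecord:
    """Per-instruction bookkeeping for the two labels described above."""
    timed_out_on_first_link: bool = False    # the "first label"
    succeeded_on_second_link: bool = False   # the "second label"

    def is_operation_anomaly(self):
        # An anomaly requires both labels on the same operation instruction.
        return self.timed_out_on_first_link and self.succeeded_on_second_link
```

An instruction that times out on both links carries only the first label, so it is not counted; that pattern is more consistent with a disk problem than with a fault on the first link.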
In a second aspect, the application provides a storage system including at least one storage disk and at least two storage controllers. Each storage controller includes a connection chip and is connected to each storage disk through its own connection chip, and the at least two storage controllers are interconnected.
The first storage controller is configured to receive a first data operation request; send a first operation instruction, over a first branch of a first link, to the first storage disk corresponding to the first data operation request; and, on detecting that execution of the first operation instruction has timed out, forward the first operation instruction to the first storage disk over a first branch of a second link. The first storage controller is any one of the at least two storage controllers, and the second storage controller is any storage controller connected to the first storage controller. The first link is the link that includes the connection chip of the first storage controller, and its first branch is the connection whose target end is the first storage disk. The second link is the link that includes the connection chip of the second storage controller, and its first branch is the connection whose target end is the first storage disk. The first storage controller is further configured to receive the operation success response for the first operation instruction transmitted over the first branch of the second link, count operation anomalies according to the operation success response, and, after the number of operation anomalies within a predetermined time exceeds a predetermined threshold, perform fault repair on the connection chip in the first link. An operation anomaly indicates that an operation instruction received by the first storage controller timed out when executed over the first link but succeeded when executed over the second link.
The first storage controller in the above storage system is also configured to perform the functions performed by the first storage controller in the fault handling method of the first aspect.
In a third aspect, the application provides a storage controller, including:
A storage processing module, configured to receive a first data operation request and send a first operation instruction, over a first branch of a first link, to the first storage disk corresponding to the first data operation request. The first link is the link that includes the connection chip of the first storage controller, and the first branch of the first link is the connection of the first link whose target end is the first storage disk.
A link fault processing module, configured to: after detecting that execution of the first operation instruction has timed out, forward the first operation instruction to the first storage disk over a first branch of a second link, where the second link is the link that includes the connection chip of the second storage controller and the first branch of the second link is the connection of the second link whose target end is the first storage disk; receive the operation success response for the first operation instruction transmitted over the first branch of the second link; count operation anomalies according to the operation success response; and, after the number of operation anomalies counted within a predetermined time exceeds a predetermined threshold, perform fault repair on the connection chip in the first link. An operation anomaly indicates that the first operation instruction timed out when executed over the first link but succeeded when executed over the second link.
Optionally, the link fault processing module counts operation anomalies as follows: according to a statistical rule and the operation success response, it increments the operation anomaly count by one or leaves the previously counted value unchanged, where the statistical rule is that operation anomalies on each branch of the first link are counted only once. Correspondingly, the predetermined threshold is the number N of storage disks in the storage system.
Optionally, the link fault processing module is also configured to set a fault tag on the n-th branch of the first link after determining that an operation anomaly has occurred on that branch. The fault tag indicates that the n-th branch of the first link is unavailable or that its level has been lowered; n is a natural number with 1 ≤ n ≤ N.
The storage processing module is then also configured to, after receiving a subsequent data operation request for the n-th storage disk, send the subsequent operation instruction to the n-th storage disk directly over the n-th branch of the second link, according to the fault tag of the n-th branch of the first link.
Optionally, the link fault processing module is also configured to, after fault repair is performed on the connection chip in the first link, delete the fault tag of every branch of the first link or set a normal tag on every branch of the first link, where the normal tag indicates that the first link is available or that the level of the first link is normal. Then, after receiving a subsequent data operation request, the storage processing module, according to the state of the first link's fault tag or the first link's normal tag, switches back to the first link to send the operation instruction to the storage disk targeted by the subsequent data operation request.
In a fourth aspect, the application provides a storage controller, including:
a storage device, configured to store instructions; and
at least one processor, coupled to the storage device;
where, when the at least one processor executes the instructions, the instructions cause the processor to perform the method of the first aspect. The method, storage system, and storage controller provided by the aspects of this application genuinely solve the problem of operation instruction execution latency caused by link faults in a storage system. They avoid the prior-art problem in which handling a fault by switching paths lets the operation succeed, but inefficiently, and thus further improve the efficiency of the storage system.
Brief description of the drawings
To describe the technical solutions in the embodiments of the present invention or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the invention, and a person of ordinary skill in the art may derive other drawings from them without creative effort.
Fig. 1 is a schematic architecture diagram of the storage system provided by this application;
Fig. 2 is a schematic diagram of one composition of the storage controller provided by this application;
Fig. 3 is a schematic flowchart of the fault handling method provided by this application as it runs in the storage system;
Fig. 4 is a schematic diagram of another composition of the storage controller provided by this application.
Detailed description of embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. The described embodiments are some, not all, of the embodiments of the invention. All other embodiments obtained by a person of ordinary skill in the art based on these embodiments without creative effort fall within the scope of protection of the invention.
Fig. 1 shows the architecture of the storage system provided in an embodiment of the present invention. The storage system includes two storage controllers (10, 20) and three storage disks (30, 40, 50). Storage controller 10 includes connection chip 60, and storage controller 20 includes connection chip 70. Storage controller 10 is connected to storage disks 30, 40, and 50 through connection chip 60, and storage controller 20 is connected to storage disks 30, 40, and 50 through connection chip 70. There is also a link between storage controller 10 and storage controller 20, which can be a network connection or a direct bus connection. Storage controller 10 has three links, one to each storage disk, all through connection chip 60: link 01 to storage disk 30, link 02 to storage disk 40, and link 03 to storage disk 50. Storage controller 20 likewise has three links, all through connection chip 70: link 04 to storage disk 30, link 05 to storage disk 40, and link 06 to storage disk 50.
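For reference, the Fig. 1 topology can be written down as a small table. The Python sketch below uses the reference numerals from the text (controllers 10 and 20, chips 60 and 70, disks 30, 40, and 50, links 01 to 06); the function `build_links` and the tuple layout are hypothetical.

```python
# Reference numerals from Fig. 1: each controller maps to its connection chip.
CONTROLLER_CHIP = {10: 60, 20: 70}
DISKS = [30, 40, 50]


def build_links(controller_chip, disks):
    """Enumerate (controller, chip, disk) for every controller-to-disk link."""
    links = {}
    link_id = 1
    for controller, chip in controller_chip.items():
        for disk in disks:
            links[f"{link_id:02d}"] = (controller, chip, disk)
            link_id += 1
    return links


LINKS = build_links(CONTROLLER_CHIP, DISKS)
```

Each disk appears in exactly two entries, one per controller, which is what allows a request to be retried through the other controller's connection chip.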
The connection chips (60, 70) can be Peripheral Component Interconnect Express (PCIe) chips or Small Computer System Interface (SCSI) chips. In the embodiments of the present invention, a connection chip is the chip on a storage controller that connects to the storage disks.
The numbers of storage controllers and storage disks in the storage system of Fig. 1 are examples only; the storage system provided in the embodiments of the present invention includes at least two storage controllers and at least one storage disk. The hardware form of the storage controllers and storage disks is flexible. For example, it can be a disk-controller-separated form, in which the storage controllers are grouped together into a controller enclosure and the storage disks are grouped together into a disk enclosure ("disk" refers to the storage disks and "controller" to the storage controllers). It can also be a disk-controller-integrated form, in which the storage controllers and storage disks are housed together.
The storage disks in Fig. 1 can be traditional disks such as hard disk drives, solid-state disks (SSDs), or storage disks built from other storage media. In use, the storage disks in Fig. 1 can form a redundant array of independent disks (RAID); storing data in such an array provides a more reliable storage system.
The storage controller in Fig. 1 can be a physical hardware device. As shown in Fig. 2, the storage controller 200 can include a processing unit 201 and a communication interface 202. The processing unit 201 performs the data storage functions of the storage controller 200. The communication interface 202 communicates with other devices, which can be access hosts or other storage systems; for example, the processing unit 201 receives data read requests or data write requests sent by an access host through the communication interface 202. Specifically, the communication interface 202 can be an adapter. Optionally, the hardware-form storage controller 200 can also include an input/output interface 203, which is connected to input/output devices in order to receive input information and output operation results. The input/output devices can be a mouse, keyboard, display, optical drive, and so on. Optionally, the storage controller 200 can also include auxiliary storage 204, commonly called external storage. The storage medium of the auxiliary storage 204 can be a magnetic medium (for example, a floppy disk, hard disk, or tape), an optical medium (for example, an optical disc), or a semiconductor medium (for example, a solid-state disk).
The processing unit 201 performs the data storage functions of the storage controller 200 and can be implemented in several ways. For example, the processing unit 201 can include a processor 2011 and memory 2012, with the processor 2011 performing the relevant data storage processing according to program units stored in the memory 2012. The processor 2011 can be a central processing unit (CPU) or a graphics processing unit (GPU), and can be a single-core or multi-core processor. The processing unit 201 can also be implemented solely with a logic device that has built-in processing logic, such as a field-programmable gate array (FPGA) or a digital signal processor (DSP).
The storage controller in Fig. 1 can also be formed from the processing logic of a storage controller. The processing logic can be implemented as program code resident in memory, or as logic circuits. In the in-memory processing logic shown schematically in Fig. 2, the processing logic of the storage controller can include storage processing logic and storage management logic. The storage processing logic performs the relevant data storage operations after receiving a data read or data write request sent by an access host. The storage management logic manages the storage controller, the storage disks, and the data handling process while the storage processing logic executes, monitors device state and/or process faults, and performs the corresponding fault handling.
In the storage system shown in Fig. 1, each storage disk has two links, one to each of the two storage controllers. The storage management logic in a storage controller can be used to manage all storage disks; for example, it stores the link information of the storage disks and, by monitoring storage disk state, detects anomalies promptly and diagnoses them, so as to repair as many anomalies as possible and ensure the reliability of data storage. In the prior art, the storage management logic in the storage controller handles faults as follows. Storage controller 10 receives a storage operation request, determines that the request needs to read storage disk 30, and sends a read instruction to storage disk 30 through its own connection chip 60. After determining that processing of the read instruction has timed out, storage controller 10 concludes that an I/O timeout has occurred and, following its internal predetermined fault handling policy, sends the read instruction to storage controller 20, so that storage controller 20 resends the read instruction to storage disk 30 through connection chip 70. After processing succeeds, storage controller 20 returns a response message to storage controller 10. With this kind of fault handling, the storage operation request (I/O request) is eventually processed successfully over the replacement link, but the I/O path is long and the latency is high, which can appear externally as an interruption of the whole service. In the prior art, when a service experiences long latency or an interruption, for example when storage controller 10 determines that an I/O sent to storage disk 30 has timed out, storage disk 30 may also be repaired. However, if the I/O timed out because the link between storage controller 10 and storage disk 30 failed, repairing storage disk 30 is wasted effort.
To address the above technical problem, the present invention provides a fault handling method and apparatus for a storage system, which resolve the I/O latency or service interruption caused by link faults. The invention adds a link fault processing module to the storage controller. The module identifies I/O processing anomalies caused by link faults (connection chip faults) and repairs the link accordingly, so that subsequent I/O requests can switch back to the original link, which improves processing efficiency.
The link fault processing module provided in the storage controller of the embodiments of the present invention can be an enhancement of the storage management logic, or separate processing logic independent of the storage management logic. It identifies link faults and handles their repair. The specific fault handling process and details are explained in the embodiments that follow.
Before describing the specific embodiments, the links in the embodiments of the present invention are first given uniform names for convenience and clarity. In the storage system of the embodiments, there may be multiple links between each storage controller and each storage disk, and each storage disk has at least two links connecting it to different storage controllers. In the storage system shown in Figure 1, which has two storage controllers (10, 20), there are two links between each storage disk and each storage controller. For example, between storage disk 30 and storage controller 10, the first link is link 01, by which storage controller 10 connects to storage disk 30 through connection chip 60, and the second link is link 04, by which storage controller 10 connects to storage disk 30 through connection chip 70 of storage controller 20. Likewise, between storage disk 40 and storage controller 10, the first link is link 02, by which storage controller 10 connects to storage disk 40 through connection chip 60, and the second link is link 05, by which storage controller 10 connects to storage disk 40 through connection chip 70 of storage controller 20. In general, there are n links between each storage disk and each storage controller, where n is the number of storage controllers in the storage system. In this specification, the links between the storage disks and the storage controllers are distinguished as follows: a link that contains the connection chip of the first storage controller is called the first link (for example, links 01, 02, and 03 in Figure 1, which contain connection chip 60 of storage controller 10); a link that contains the connection chip of the second storage controller is called the second link (for example, links 04, 05, and 06 in Figure 1, which contain connection chip 70 of storage controller 20); and a link that contains the connection chip of the n-th storage controller is called the n-th link.
The branches of a link are distinguished by the target side to which the link connects. The connection of the first link whose target side is storage disk 30 is called the first branch of the first link (for example, link 01); the connection of the second link whose target side is storage disk 30 is called the first branch of the second link (for example, link 04); the connection of the first link whose target side is storage disk 40 is called the second branch of the first link (for example, link 02); and the connection of the second link whose target side is storage disk 40 is called the second branch of the second link (for example, link 05).
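The naming scheme above can be modelled as a pair of indices: the link index identifies the storage controller whose connection chip is used, and the branch index identifies the target storage disk. The following is a minimal sketch, not the patent's implementation; the names `Branch`, `enumerate_branches`, and `figure1_link_id` are illustrative, and the mapping onto two-digit link numbers assumes that Figure 1 has three storage disks (as links 01 to 06 suggest).

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Branch:
    link: int    # 1-based index of the controller whose connection chip the link uses
    branch: int  # 1-based index of the target storage disk


def enumerate_branches(num_controllers: int, num_disks: int) -> List[Branch]:
    """List every (link, branch) pair: one link per controller, one branch per disk."""
    return [
        Branch(link, disk)
        for link in range(1, num_controllers + 1)
        for disk in range(1, num_disks + 1)
    ]


def figure1_link_id(b: Branch, num_disks: int) -> str:
    """Hypothetical mapping onto the two-digit link numbers used in Figure 1."""
    return f"{(b.link - 1) * num_disks + b.branch:02d}"
```

With two controllers and three disks, the first branch of the second link maps to link 04 and the second branch of the first link maps to link 02, which agrees with the examples in the text.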
Figure 3 shows a specific implementation of the link fault handling method provided by an embodiment of the present invention. Note that the storage system shown in Figure 3 is a simplified version of the storage system shown in Figure 1: Figure 3 omits the connection chips on the storage controllers, the other storage disks, and the connections between the storage disks and the storage controllers, because it is mainly intended to illustrate the processing flow. In step 301, the access host sends a first data operation request to the first storage controller; the request is used to access data on the first storage disk. In step 302, the first storage controller sends a first operation instruction to the first storage disk through the first branch of the first link, requesting a read or write operation on the corresponding data; the first link contains the connection chip of the first storage controller. In step 303, the first storage controller detects that execution of the first operation instruction has timed out, switches links, and forwards the first operation instruction to the second storage controller over the connection between the two storage controllers. In step 304, the second storage controller sends the first operation instruction to the first storage disk through its own connection chip. Together, steps 303 and 304 have the first storage controller forward the first operation instruction to the first storage disk through the first branch of the second link, which contains the connection chip of the second storage controller. In step 305, after completing the operation, the first storage disk sends an operation success response for the first operation instruction to the second storage controller. In step 306, the second storage controller forwards the operation success response to the first storage controller. In step 307, the first storage controller sends an access response to the host.
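The failover in steps 302 to 306 can be sketched as a try-then-forward pattern. This is an illustrative sketch only: `LinkTimeout`, `execute_with_failover`, and the two callables standing in for the first and second links are assumptions made for the example and do not come from the patent.

```python
class LinkTimeout(Exception):
    """Hypothetical error raised when a command times out on a link."""


def execute_with_failover(command, send_first_link, send_second_link):
    """Try the first link; on timeout, forward the same command via the second link.

    Returns a tuple (response, link_used).
    """
    try:
        # Step 302: send through the controller's own connection chip.
        return send_first_link(command), "first_link"
    except LinkTimeout:
        # Steps 303-306: forward via the peer controller and its connection chip.
        return send_second_link(command), "second_link"
```

Returning the name of the link that served the command lets the caller decide afterwards whether the request counts as an operation exception.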
In step 308 (steps 307 and 308 need not be executed in a fixed order and may be interchanged), after receiving the operation success response for the first operation instruction through the first branch of the second link, the first storage controller counts operation exceptions. When it determines, according to a predetermined statistical rule, that the count must be increased, it increments the number of operation exceptions by one. An operation exception indicates that an operation instruction received by the first storage controller timed out on the first link but was executed successfully through the second link. Two statistical rules are possible. Under the first statistical rule, operation exceptions on each branch of a link are counted only once: the first operation exception on each branch of the first link increments the count by one, and later operation exceptions on the same branch are not counted. Under the second statistical rule, every operation exception is counted, regardless of whether it is the first one on its branch, so the count increases by one each time an operation exception occurs. In this embodiment, the first statistical rule applies: the first operation exception is determined to have occurred on the first branch of the first link, and the operation exception count is updated from zero to one.
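The difference between the two statistical rules can be shown with a small counter. This is a sketch under assumptions: the class name `ExceptionCounter` and the flag `count_once_per_branch` are invented for illustration, and branches are identified by plain integers.

```python
class ExceptionCounter:
    """Counts operation exceptions under either statistical rule."""

    def __init__(self, count_once_per_branch: bool):
        # True selects the first rule (one count per branch),
        # False selects the second rule (every exception counts).
        self.count_once_per_branch = count_once_per_branch
        self.count = 0
        self._branches_seen = set()

    def record(self, branch: int) -> int:
        """Record one operation exception on a branch and return the new count."""
        if self.count_once_per_branch:
            if branch not in self._branches_seen:
                self._branches_seen.add(branch)
                self.count += 1
        else:
            self.count += 1
        return self.count
```

Under the first rule, two exceptions on branch 1 followed by one on branch 2 give a count of 2; under the second rule the same sequence gives 3.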
To identify an operation exception, the first storage controller may record a first label after detecting in step 303 that execution of the first operation instruction timed out; the first label indicates that the first operation instruction timed out on the first branch of the first link. In step 306, after receiving the operation success response forwarded by the second storage controller, the first storage controller records a second label, which indicates that the first operation instruction was executed successfully through the first branch of the second link. The first storage controller then determines whether the first operation instruction has both the first label and the second label; if it does, an operation exception occurred during execution of the first operation instruction. After determining that an operation exception occurred, the first storage controller increments the operation exception count by one or leaves it unchanged, according to the first or second statistical rule. After determining in step 308 that an operation exception occurred on the first branch of the first link, the first storage controller may also set a fault tag for the first branch of the first link; the fault tag indicates that the first branch of the first link is unavailable or that its level has been lowered. Once the fault tag is set, when the access host sends a second data operation request to access the first storage disk and the first storage controller receives it, the first storage controller can switch links directly or offload traffic to another link according to the fault tag. For example, subsequent operation instructions are no longer forwarded through the first branch of the first link but directly through the first branch of the second link. In this embodiment, according to the fault tag of the first branch of the first link, the first storage controller forwards the second operation instruction directly through the second link, obtains the response to the second operation instruction through the second link, and sends the response to the second data operation request to the access host (steps 309 to 314). Until the fault on the first link is repaired, this direct switching or offloading avoids delayed or failed execution of subsequent operation instructions.
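The two labels and the per-branch fault tag can be sketched as three sets. The sketch below is illustrative only; `ExceptionTracker`, its method names, and the use of string command identifiers are assumptions, not details taken from the patent.

```python
class ExceptionTracker:
    """Tracks first/second labels per command and fault tags per branch."""

    def __init__(self):
        self.timed_out_first = set()   # first label: command timed out on the first link
        self.succeeded_second = set()  # second label: command succeeded via the second link
        self.fault_tags = set()        # branches of the first link tagged as faulty

    def record_timeout(self, cmd_id):
        self.timed_out_first.add(cmd_id)

    def record_success_via_second(self, cmd_id, branch):
        """Record the second label; return True if this is an operation exception."""
        self.succeeded_second.add(cmd_id)
        is_exception = (
            cmd_id in self.timed_out_first and cmd_id in self.succeeded_second
        )
        if is_exception:
            self.fault_tags.add(branch)
        return is_exception

    def choose_link(self, branch):
        """Route around a branch of the first link that carries a fault tag."""
        return "second_link" if branch in self.fault_tags else "first_link"
```

A command that is routed straight to the second link because of an existing fault tag never receives the first label, so it is not counted as a new operation exception.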
Next, in step 315, the access host sends a third data operation request to the first storage controller; the request is used to access data on the second storage disk. In step 316, the first storage controller sends a third operation instruction to the second storage disk through the second branch of the first link, requesting a read or write operation on the corresponding data. In step 317, the first storage controller detects that execution of the third operation instruction has timed out, switches links, and forwards the third operation instruction to the second storage controller over the connection between the two storage controllers. In step 318, the second storage controller sends the third operation instruction to the second storage disk through its own connection chip. Together, steps 317 and 318 have the first storage controller forward the third operation instruction to the second storage disk through the second branch of the second link, which contains the connection chip of the second storage controller. In step 319, after completing the operation, the second storage disk sends an operation success response for the third operation instruction to the second storage controller. In step 320, the second storage controller forwards the operation success response for the third operation instruction to the first storage controller. In step 321, the first storage controller sends the corresponding access response to the host.
In step 322, after receiving the operation success response for the third operation instruction through the second branch of the second link, the first storage controller counts operation exceptions again. When it determines, according to the predetermined statistical rule, that the count must be increased, it increments the number of operation exceptions by one; as before, an operation exception indicates that an operation instruction received by the first storage controller timed out on the first link but was executed successfully through the second link. In this embodiment, the first statistical rule applies: the first operation exception is determined to have occurred on the second branch of the first link, and the operation exception count is updated from one to two. In this step, the first storage controller may also set a fault tag for the second branch of the first link, indicating that the second branch of the first link is unavailable or that its level has been lowered. Once the fault tag is set, when the access host sends further data operation requests to access the second storage disk and the first storage controller receives them, the first storage controller can switch links directly or offload traffic to another link according to the fault tag of the second branch of the first link.
Further, the first storage controller may start a timer before step 301, or at any time during steps 301 to 322, to monitor the operation exception count over a period of time. If the count reaches a predetermined threshold within the predetermined time, the count indicates that the first link has failed, and the first storage controller can repair the first link. For example, in step 323 the timer expires and the first storage controller determines whether the operation exception count has reached the predetermined threshold; if it has, the first storage controller repairs the fault on the first link. Because the first statistical rule counts only the first operation exception on each branch of a link, the predetermined threshold is correspondingly set to the number of storage disks, that is, the number of branches of the link. When every branch of the link has experienced an operation exception (in other words, operation instructions sent to every storage disk have experienced an operation exception), the threshold is met and the link can be considered faulty; repairing the connection chip of the link then resolves the fault. Under the second statistical rule, every operation exception on the link is counted, regardless of whether it is the first on its branch. In that case the predetermined threshold is set to an empirical value, which also identifies a link fault with a certain probability; after the link fault is identified, repairing the connection chip of the link resolves the fault.
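The threshold check over a monitoring window can be sketched as a pure function over timestamps. This is an illustration under assumptions: the function name, the use of numeric timestamps, and the half-open window are choices made for the example. The comparison uses "reaches" (greater than or equal), following the description above; the claims speak of the count exceeding the threshold, so a strict comparison may be intended there.

```python
def link_has_failed(exception_times, window_start, window_length, threshold):
    """Return True if enough counted operation exceptions fall inside the window.

    exception_times holds the timestamps of the exceptions that the chosen
    statistical rule actually counted (an assumption of this sketch).
    """
    window_end = window_start + window_length
    in_window = [t for t in exception_times if window_start <= t < window_end]
    return len(in_window) >= threshold
```

With a threshold of 2 and a window of 10 time units starting at 0, exceptions at times 1.0 and 2.5 indicate a link fault, while exceptions at 1.0 and 12.0 do not.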
Under the first statistical rule, the predetermined threshold may also be less than the number of storage disks. For example, if the threshold is set to two thirds of the number of storage disks, then when the operation exception count reaches two thirds of the number of storage disks, that is, when operation exceptions have occurred on two thirds of the link's branches, the link can also be identified as faulty.
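As a small worked example, a fractional threshold has to be turned into a whole number of branches. The sketch below rounds up, which is an assumption of the example; the text does not say how a non-integer result should be rounded, and the function name is illustrative.

```python
import math
from fractions import Fraction


def predetermined_threshold(num_disks: int, fraction: Fraction = Fraction(2, 3)) -> int:
    """Number of faulty branches needed to declare a link fault (rounded up)."""
    return math.ceil(fraction * num_disks)
```

Using `Fraction` keeps the arithmetic exact, so three disks give a threshold of exactly 2 rather than a value distorted by floating-point error.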
In addition, the predetermined time may be measured by a timer or implemented in other ways. If the number of operation exceptions reaches the predetermined threshold before the predetermined time has elapsed, a link fault can also be determined.
Specifically, because the first link contains the connection chip of the first storage controller, the first storage controller can repair the fault on the first link by repairing that connection chip. More specifically, the first storage controller may restart its connection chip, isolate its connection chip, repair a queue on its connection chip, or repair a port on its connection chip.
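The four repair options can be represented as an enumeration that is dispatched to a chip driver. This is a hedged sketch: `RepairAction`, `RecordingChip`, and `repair_connection_chip` are hypothetical names, and the stand-in chip only records which operation was requested rather than touching hardware.

```python
from enum import Enum


class RepairAction(Enum):
    RESTART_CHIP = "restart"
    ISOLATE_CHIP = "isolate"
    REPAIR_QUEUE = "repair_queue"
    REPAIR_PORT = "repair_port"


class RecordingChip:
    """Hypothetical stand-in for a connection chip driver; it only records calls."""

    def __init__(self):
        self.calls = []

    def restart(self):
        self.calls.append("restart")

    def isolate(self):
        self.calls.append("isolate")

    def repair_queue(self):
        self.calls.append("repair_queue")

    def repair_port(self):
        self.calls.append("repair_port")


def repair_connection_chip(chip, action: RepairAction) -> None:
    """Dispatch one of the four repair options to the chip object."""
    getattr(chip, action.value)()
```

The text presents the four options as alternatives and does not rank them, so the choice of action is left to the caller here.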
After the above fault handling, the embodiment of the present invention may also proceed as follows. After repairing the fault on the first link, the first storage controller may delete the fault tag of the first link or set a normal tag for the first link; the normal tag indicates that the first link is available or that its level is normal. Then, when the first storage controller subsequently receives other data operation requests destined for the first storage disk, it selects the first link directly to send operation instructions, according to the state of the first link (the fault tag has been deleted, or the normal tag is set). Because the path of the first link is shorter than that of the second link, subsequent operation instructions are processed more quickly. This avoids the delay caused by executing subsequent operation instructions through the second link and improves the processing efficiency of operation instructions. The embodiment of the present invention thus actually resolves the execution delay caused by a link fault in the storage system; it avoids the prior-art situation in which switching paths makes the operation succeed but leaves it inefficient, and so further improves the efficiency of the storage system.
After repairing the first link, the embodiment of the present invention may further detect whether the repair succeeded, and delete the fault tag of the first link or set its normal tag only after detecting that the repair succeeded. For example, it may detect whether the connection chip in the first link has been repaired successfully, and delete the fault tag or set the normal tag once the state of the connection chip is normal. If the repair of the first link is detected to have failed, for example because the connection chip in the first link could not be repaired, the first storage controller may issue a fault maintenance notification for the first link; specifically, it may issue a maintenance or replacement notification for the connection chip in the first link, so that the hardware fault can be resolved completely.
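The post-repair handling can be sketched as a single decision: clear the fault tags and return to the first link if the chip is healthy, otherwise keep using the second link and raise a maintenance notification. The function name, the `notify` callback, and the message text are assumptions made for this illustration.

```python
def finish_repair(chip_is_healthy, fault_tags, notify):
    """Decide which link to use after a repair attempt.

    fault_tags is a mutable set of faulty branches of the first link;
    notify is a callable that delivers a maintenance notification.
    """
    if chip_is_healthy:
        # Repair succeeded: delete the fault tags so traffic returns to the first link.
        fault_tags.clear()
        return "first_link"
    # Repair failed: keep the fault tags and ask for maintenance or replacement.
    notify("connection chip in first link needs maintenance or replacement")
    return "second_link"
```

Keeping the fault tags in place when the repair fails means later requests continue to bypass the first link until the hardware is serviced.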
The functional modules of the storage controller provided by an embodiment of the present invention are described below. As shown in Figure 4, the storage controller includes a storage processing module 401 and a link fault handling module 402. The link fault handling module 402 may be a functional enhancement of the storage management logic or separate processing logic independent of the storage management logic; those skilled in the art can implement it flexibly based on the teaching of the embodiments of the present invention.
The storage processing module 401 is configured to receive a first data operation request and send a first operation instruction, through the first branch of the first link, to the first storage disk corresponding to the first data read/write operation. The first link is the link that contains the connection chip of the first storage controller, and the first branch of the first link is the connection of the first link whose target side is the first storage disk.
The link fault handling module 402 is configured to forward the first operation instruction to the first storage disk through the first branch of the second link after detecting that execution of the first operation instruction has timed out. The second link is the link that contains the connection chip of the second storage controller, and the first branch of the second link is the connection of the second link whose target side is the first storage disk. The module is further configured to receive the operation success response for the first operation instruction transmitted through the first branch of the second link, count operation exceptions according to the operation success response, and, after the operation exception count exceeds a predetermined threshold within a predetermined time, repair the connection chip in the first link. An operation exception indicates that the first operation instruction timed out on the first link but was executed successfully through the second link.
When the link fault handling module 402 counts operation exceptions, it specifically, according to the statistical rule and the operation success response, increments the operation exception count by one or leaves the previously counted number unchanged. The statistical rule includes: operation exceptions occurring on each branch of the first link are counted only once. Correspondingly, the predetermined threshold is the number N of storage disks in the storage system.
The link fault handling module 402 is further configured to set a fault tag for the n-th branch of the first link after determining that an operation exception has occurred on that branch. The fault tag indicates that the n-th branch of the first link is unavailable or that its level has been lowered; n is a natural-number variable greater than or equal to 1 and less than or equal to N. After receiving a subsequent data operation request for the n-th storage disk, the storage processing module 401 then sends the subsequent operation instruction to the n-th storage disk directly through the n-th branch of the second link, according to the fault tag of the n-th branch of the first link.
The link fault handling module 402 is further configured, after repairing the connection chip in the first link, to delete the fault tag of each branch of the first link or set a normal tag for each branch of the first link. The normal tag indicates that the first link is available or that its level is normal. After receiving a subsequent data operation request, the storage processing module then switches back to the first link, according to the state of the fault tag of the first link or the normal tag of the first link, to send operation instructions to the storage disk targeted by the subsequent data operation request.
The specific functions of each functional module are also described in the embodiment shown in Figure 3 and are not repeated here.
It will be clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the systems, devices, and units described above can be found in the corresponding processes of the foregoing method embodiments and are not repeated here.
In the several embodiments provided in this application, it should be understood that the disclosed systems, devices, and methods may be implemented in other ways. For example, the device embodiments described above are merely illustrative. The division into units is only a division by logical function, and other divisions are possible in an actual implementation; for instance, multiple units or components may be combined or integrated into another system, or some features may be ignored or not executed. In addition, the mutual couplings, direct couplings, or communication connections shown or discussed may be indirect couplings or communication connections through interfaces, devices, or units, and may be electrical, mechanical, or in other forms.

Claims (13)

1. A storage fault handling method, characterized in that the method is applied to a storage system, the storage system comprising at least one storage disk and at least two storage controllers, wherein each storage controller includes a connection chip, each storage controller is connected to each storage disk through its own connection chip, and the at least two storage controllers are interconnected;
the method comprising:
a first storage controller receiving a first data operation request and sending a first operation instruction, through a first branch of a first link, to a first storage disk corresponding to the first data read/write operation, wherein the first link is a link containing the connection chip of the first storage controller, the first branch of the first link is the connection of the first link whose target side is the first storage disk, and the first storage controller is any one of the at least two storage controllers;
the first storage controller detecting that execution of the first operation instruction has timed out, and forwarding the first operation instruction to the first storage disk through a first branch of a second link, wherein the second link is a link containing the connection chip of a second storage controller, the first branch of the second link is the connection of the second link whose target side is the first storage disk, and the second storage controller is any storage controller connected to the first storage controller;
the first storage controller receiving an operation success response for the first operation instruction transmitted through the first branch of the second link, and counting the number of operation exceptions according to the operation success response, wherein an operation exception indicates that an operation instruction received by the first storage controller timed out on the first link but was executed successfully through the second link;
the first storage controller determining that the counted number of operation exceptions within a predetermined time exceeds a predetermined threshold, and performing fault repair on the connection chip in the first link.
2. The method according to claim 1, characterized in that counting the number of operation exceptions according to the operation success response comprises:
the first storage controller, according to a statistical rule and the operation success response, incrementing the number of operation exceptions by one or leaving the previously counted number of operation exceptions unchanged, wherein the statistical rule includes: operation exceptions occurring on each branch of the first link are counted only once; correspondingly, the predetermined threshold is less than or equal to the number N of storage disks in the storage system.
3. The method according to claim 1 or 2, characterized in that the method further comprises:
the first storage controller, after determining that an operation exception has occurred on the n-th branch of the first link, setting a fault tag for the n-th branch of the first link, wherein the fault tag indicates that the n-th branch of the first link is unavailable or that the level of the n-th branch of the first link has been lowered, and n is a natural-number variable greater than or equal to 1 and less than or equal to N;
the first storage controller, after receiving a subsequent data operation request for the n-th storage disk, sending a subsequent operation instruction to the n-th storage disk directly through the n-th branch of the second link, according to the fault tag of the n-th branch of the first link.
4. The method according to claim 3, characterized in that, after fault repair is performed on the connection chip in the first link, the method further comprises:
deleting the fault tag of each branch of the first link, or setting a normal tag for each branch of the first link, wherein the normal tag indicates that the branch of the first link is available or that the level of the branch of the first link is normal;
the first storage controller then, after receiving a subsequent data operation request, switching back to the first link, according to the state of the fault tag of the first link or the normal tag of the first link, to send an operation instruction to the storage disk targeted by the subsequent data operation request.
5. The method according to any one of claims 1 to 4, characterized in that, after fault repair is performed on the connection chip of the first link, the method further comprises:
detecting whether the connection chip in the first link has been repaired successfully;
after detecting that the repair of the connection chip in the first link was unsuccessful, issuing a fault maintenance notification for the connection chip in the first link.
6. The method according to any one of claims 1 to 5, characterized in that performing fault repair on the connection chip in the first link comprises:
restarting the connection chip of the first storage controller; or isolating the connection chip of the first storage controller; or repairing a queue on the connection chip of the first storage controller; or repairing a port on the connection chip of the first storage controller.
7. The method according to any one of claims 1 to 6, characterized in that, after the first storage controller detects that execution of the first operation instruction has timed out, the method further comprises:
the first storage controller recording a first label, wherein the first label indicates that the first operation instruction timed out on the first branch of the first link;
and, before counting the number of operation exceptions according to the operation success response, the method further comprises:
the first storage controller recording a second label, wherein the second label indicates that the first operation instruction was executed successfully through the first branch of the second link;
determining whether the first operation instruction has both the first label and the second label;
if the first operation instruction has both the first label and the second label, determining that an operation exception has occurred.
8. A storage system, characterized by comprising:
at least one storage disk and at least two storage controllers;
wherein each storage controller includes a connection chip, and each storage controller is connected to each storage disk through its own connection chip;
the at least two storage controllers are interconnected;
a first storage controller is configured to receive a first data operation request, send a first operation instruction through a first branch of a first link to a first storage disk corresponding to the data read/write operation, and, upon detecting that execution of the first operation instruction has timed out, forward the first operation instruction to the first storage disk through a first branch of a second link, wherein the first storage controller is any one of the at least two storage controllers, a second storage controller is any storage controller connected to the first storage controller, the first link is a link containing the connection chip of the first storage controller, the first branch of the first link is the connection of the first link whose target side is the first storage disk, the second link is a link containing the connection chip of the second storage controller, and the first branch of the second link is the connection of the second link whose target side is the first storage disk; and
the first storage controller is further configured to receive an operation success response for the first operation instruction transmitted through the first branch of the second link, count the number of operation exceptions according to the operation success response, and, after the number of operation exceptions within a predetermined time exceeds a predetermined threshold, perform fault repair on the connection chip in the first link, wherein an operation exception indicates that an operation instruction received by the first storage controller timed out on the first link but was executed successfully through the second link.
9. The storage system according to claim 8, characterized in that the first storage controller counting the number of operation exceptions specifically comprises: according to a statistical rule and the operation success response, incrementing the number of operation exceptions by one or leaving the previously counted number of operation exceptions unchanged, wherein the statistical rule includes: operation exceptions occurring on each branch of the first link are counted only once; correspondingly, the predetermined threshold is less than or equal to the number N of storage disks in the storage system.
10. The storage system according to claim 8 or 9, characterized in that the first storage controller is further configured to, after determining that an operation exception has occurred on the n-th branch of the first link, set a fault tag for the n-th branch of the first link, wherein the fault tag indicates that the n-th branch of the first link is unavailable or that the level of the n-th branch of the first link has been lowered;
the first storage controller is further configured to, after receiving a subsequent data operation request for the n-th storage disk, send a subsequent operation instruction to the n-th storage disk directly through the n-th branch of the second link, according to the fault tag of the n-th branch of the first link, wherein n is a natural-number variable greater than or equal to 1 and less than or equal to N.
11. The storage system according to claim 10, characterized in that the first storage controller is further configured to, after performing fault repair on the connection chip in the first link, delete the fault tag of each branch of the first link or set a normal tag for each branch of the first link, wherein the normal tag indicates that the branch of the first link is available or that the level of the branch of the first link is normal;
the first storage controller is then further configured to, after receiving a subsequent data operation request, switch back to the first link, according to the state of the fault tag of the first link or the normal tag of the first link, to forward an operation instruction to the storage disk corresponding to the subsequent data operation request.
12. The storage system according to any one of claims 8 to 11, characterized in that the first storage controller is further configured to detect whether the connection chip in the first link has been repaired successfully and, after detecting that the repair of the connection chip in the first link was unsuccessful, issue a fault maintenance notification for the connection chip of the first storage controller.
13. The storage system according to any one of claims 8 to 12, characterized in that the first storage controller performing fault repair on the connection chip of the first storage controller specifically comprises: restarting the connection chip of the first storage controller; or isolating the connection chip of the first storage controller; or repairing a queue on the connection chip of the first storage controller; or repairing a port on the connection chip of the first storage controller.
CN201711377004.1A 2017-12-19 2017-12-19 Storage system and fault processing method thereof Active CN109933478B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711377004.1A CN109933478B (en) 2017-12-19 2017-12-19 Storage system and fault processing method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711377004.1A CN109933478B (en) 2017-12-19 2017-12-19 Storage system and fault processing method thereof

Publications (2)

Publication Number Publication Date
CN109933478A true CN109933478A (en) 2019-06-25
CN109933478B CN109933478B (en) 2021-02-26

Family

ID=66983970

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711377004.1A Active CN109933478B (en) 2017-12-19 2017-12-19 Storage system and fault processing method thereof

Country Status (1)

Country Link
CN (1) CN109933478B (en)

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090149133A1 (en) * 2007-11-02 2009-06-11 Kabushiki Kaisha Toshiba Communication apparatus and program for identifying faults and computer program storage medium
CN103001998A (en) * 2011-12-19 2013-03-27 深圳市安云信息科技有限公司 FC-SAN (fiber channel-storage area network) storage system and method for improving stability of fiber channel
CN103428333A (en) * 2012-05-15 2013-12-04 宇龙计算机通信科技(深圳)有限公司 Mobile terminal, server and error restoration method
CN104917624A (en) * 2014-03-10 2015-09-16 华耀(中国)科技有限公司 Health check system and method for link aggregation path
CN104407999A (en) * 2014-11-04 2015-03-11 浪潮(北京)电子信息产业有限公司 Information security access architecture, method and system
CN105389127A (en) * 2015-11-04 2016-03-09 华为技术有限公司 Method and apparatus for transmitting message in storage system, storage system and controller
US20170238197A1 (en) * 2016-02-17 2017-08-17 King Fahd University Of Petroleum And Minerals Route repair of ad hoc on-demand distance vector routing protocol in a wireless sensor network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
黎沛春: "面向大数据存储系统的故障检测技术研究", 《中国优秀硕士学位论文全文数据库》 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111782137A (en) * 2020-06-17 2020-10-16 杭州宏杉科技股份有限公司 Path fault processing method and device
CN111858122A (en) * 2020-07-29 2020-10-30 北京浪潮数据技术有限公司 Fault detection method, device, equipment and storage medium of storage link
CN112286743A (en) * 2020-10-23 2021-01-29 苏州浪潮智能科技有限公司 Storage equipment backboard link detection and diagnosis device and method
CN112286743B (en) * 2020-10-23 2023-01-06 苏州浪潮智能科技有限公司 Storage equipment backboard link detection and diagnosis device and method
CN114020661A (en) * 2021-10-27 2022-02-08 浪潮(北京)电子信息产业有限公司 Storage device and configuration method thereof
CN114020661B (en) * 2021-10-27 2023-07-25 浪潮(北京)电子信息产业有限公司 Storage device and configuration method thereof
CN113986142A (en) * 2021-11-09 2022-01-28 苏州浪潮智能科技有限公司 Disk fault monitoring method and device, computer equipment and storage medium
CN113986142B (en) * 2021-11-09 2023-08-08 苏州浪潮智能科技有限公司 Disk fault monitoring method, device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN109933478B (en) 2021-02-26

Similar Documents

Publication Publication Date Title
CN109933478A (en) A kind of fault handling method and storage system of storage system
CN101776983B (en) The synchronous method of information of double controllers in disk array and disc array system
US10007629B2 (en) Inter-processor bus link and switch chip failure recovery
CN103019889A (en) Distributed file system and failure processing method thereof
CN111858411B (en) System and method for realizing out-of-band interaction of server Expander cards
CN104216805B (en) A kind of high-end disk array rear end disk chassis link failure protection system and method
US8234447B2 (en) Storage control device for storage system provided with storage device coupled to switch network
CN103744743A (en) Heartbeat signal redundant configuration method based on RAC model of database
US20130315058A1 (en) Relay device, connection management method, and information communication system
US8055934B1 (en) Error routing in a multi-root communication fabric
CN212541329U (en) Dual-redundancy computer equipment based on domestic Loongson platform
CN101149666B (en) Selective crossing communication device, system and method between storage modules
WO2021012169A1 (en) Method of improving reliability of storage system, and related apparatus
CN100370756C (en) Reset processing method and device for system
CN109474694A (en) A kind of management-control method and device of the NAS cluster based on SAN storage array
CN117459482B (en) Restarting method and device of switch, storage medium and electronic device
JP6777848B2 (en) Control device and storage device
CN116909494B (en) Storage switching method and device of server and server system
CN115220937A (en) Method, electronic device and program product for storage management
CN104317678B (en) A kind of method and apparatus not interrupting data storage service and reparation RAID array
AU2021266891B2 (en) Fencing non-responding ports in a network fabric
CN113742142B (en) Method for managing SATA hard disk by storage system and storage system
CN107423167A (en) A kind of ISCSI target redundancy control methods and system based on dual control storage
CN108984602A (en) A kind of database control method and Database Systems
CN115168141A (en) Optical interface management system, method, device, programmable logic device and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20200426

Address after: 518129 Bantian HUAWEI headquarters office building, Longgang District, Guangdong, Shenzhen

Applicant after: HUAWEI TECHNOLOGIES Co.,Ltd.

Address before: 301, A building, room 3, building 301, foreshore Road, No. 310052, Binjiang District, Zhejiang, Hangzhou

Applicant before: Hangzhou Huawei Digital Technology Co.,Ltd.

GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20220223

Address after: 550025 Huawei cloud data center, jiaoxinggong Road, Qianzhong Avenue, Gui'an New District, Guiyang City, Guizhou Province

Patentee after: Huawei Cloud Computing Technologies Co.,Ltd.

Address before: 518129 Bantian HUAWEI headquarters office building, Longgang District, Guangdong, Shenzhen

Patentee before: HUAWEI TECHNOLOGIES Co.,Ltd.