[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

US7003687B2 - Fail-over storage system - Google Patents

Fail-over storage system Download PDF

Info

Publication number
US7003687B2
US7003687B2 US10/150,245 US15024502A US7003687B2 US 7003687 B2 US7003687 B2 US 7003687B2 US 15024502 A US15024502 A US 15024502A US 7003687 B2 US7003687 B2 US 7003687B2
Authority
US
United States
Prior art keywords
interface
chn
fail
over
controller
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related, expires
Application number
US10/150,245
Other versions
US20030135782A1 (en
Inventor
Naoto Matsunami
Kouji Sonoda
Manabu Kitamura
Takashi Oeda
Yutaka Takata
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hitachi Ltd
Original Assignee
Hitachi Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hitachi Ltd filed Critical Hitachi Ltd
Assigned to HITACHI, LTD. reassignment HITACHI, LTD. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: TAKATA, YUTAKA, KITAMURA, MANABU, MATSUNAMI, NAOTO, OEDA, TAKASHI, SONODA, KOUJI
Publication of US20030135782A1 publication Critical patent/US20030135782A1/en
Priority to US11/316,463 priority Critical patent/US7447933B2/en
Application granted granted Critical
Publication of US7003687B2 publication Critical patent/US7003687B2/en
Adjusted expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2053Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
    • G06F11/2089Redundant storage control functionality
    • G06F11/2092Techniques of failing over between control units
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0727Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in a storage system, e.g. in a DASD or network based storage system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2023Failover techniques
    • G06F11/2025Failover techniques using centralised failover control functionality
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2023Failover techniques
    • G06F11/2033Failover techniques switching over of hardware resources
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2043Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant where the redundant components share a common memory address space
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0766Error or fault reporting or storing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2053Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
    • G06F11/2056Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant by mirroring
    • G06F11/2069Management of state, configuration or failover

Definitions

  • the present invention relates to fail-over storage systems employed for computer systems. More particularly, the present invention relates to fail-over storage systems provided with a plurality of input/output interfaces.
  • I/F Interfaces used between storage systems and computers are roughly classified into two types.
  • the first type is the “block I/O interface.”
  • This interface enables data input/output (I/O) in blocks, a block being a unit of data management of storage units.
  • Multiple computers are often connected to multiple storage systems by such block I/O interfaces in systems.
  • the systems are referred to as a storage area network (SAN).
  • Fiber channels are usually used to interconnect a SAN.
  • the second type of interface is the “file I/O interface.”
  • This type of interface enables data I/O in files.
  • Interfaces that enable data I/O by using the Network File System, a protocol used to transfer files between file servers and client servers, are file I/O interfaces.
  • LAN local area network
  • NAS network attached storage
  • a conventional technique, disclosed in U.S. Pat. No. 5,696,895, referred to as the fail-over technique assures the resistance of file servers to failures.
  • the technique enables “heartbeat” signals to be exchanged between a first server that uses a first storage system and a second server that uses a second storage system. If a failure occurs in the first server, the “heartbeat” signal stops. The second server detects the absence of signal and accesses the first storage system used by the first server to take over the processing of the first server (fail-over processing).
  • the conventional NAS storage system is composed of a file server and a storage system with the file server attached to a storage system as a host computer.
  • the conventional fail-over technique considers a failure that might occur in the file server of the NAS storage system., but it does not consider any failure that might occur in the storage system of the NAS.
  • the conventional fail-over technique gives no consideration to any failure that might occur in the storage system that performs the fail-over processing (resistance to multiple failures).
  • the conventional technique does not provide for a storage system capable of connecting multiple network domains, nor to the fail-over processing executable in that configuration.
  • one feature of the present invention is to provide a storage system that can reduce system management cost by managing numerous system interfaces collectively.
  • the present invention provides a storage system resistant to multiple failures and capable of connecting many network domains.
  • the storage system of the preferred embodiment includes multiple slots used for various interface controllers such as a block I/O interface controller or a file I/O interface controller, and multiple disk controllers used to control various disk drives to be accessed from those interface controllers.
  • various interface controllers such as a block I/O interface controller or a file I/O interface controller
  • multiple disk controllers used to control various disk drives to be accessed from those interface controllers.
  • FIG. 1 is a block diagram of a storage system in an embodiment of the present invention
  • FIG. 2 is a schematic view of a storage system in the embodiment of the present invention.
  • FIG. 3 is an external view of a channel adapter
  • FIG. 4 is a block diagram of the channel adapter
  • FIG. 5 is an internal block diagram of a memory of the channel adapter shown in FIG. 4 ;
  • FIG. 6 is a concept chart for grouping channel adapters
  • FIG. 7 is an internal block diagram of shared memory
  • FIG. 8 is a channel adapter management table
  • FIG. 9 is a flowchart of processing executed in both a failed channel adapter and takeover channel adapter
  • FIG. 10 is a flowchart of processing executed in both failed channel adapter and takeover channel adapter
  • FIG. 11 is a flowchart of processing executed in both the recovered and takeover channel adapters.
  • FIGS. 12 through 14 are examples of fail-over operations.
  • each of the interface controllers is mounted as a board in the subject computer system and the shapes of all the controllers are the same so that they can be loaded in any of the slots.
  • the above configuration of the storage system of the present invention in another preferred embodiment, further includes a management table that manages fail-over interface controllers collectively, an information table that directs a fail-over procedure, and fail-over control means the taking-over of processing between interface controllers belonging to the same fail-over interface group according to the directed fail-over procedure.
  • FIG. 1 shows an embodiment of a storage system of the present invention.
  • Storage system 1 includes a disk controller 11 and multiple storage units 1700 .
  • NAS channel adapters (CHN) 1100 – 1105 are interface controllers connected to NAS clients 400 via a file I/O interface.
  • Fiber channel adapters (CHF) 1110 , 1111 are interface controllers connected to SAN clients 500 via a block I/O interface.
  • CHN and CHF will be referred to as channel adapters.
  • Each storage unit 1700 is connected to a disk adapter 120 .
  • Each disk adapter 120 controls a storage unit 1700 connected thereto.
  • Reference numeral 13 denotes a shared memory (SM); 14 denotes a cache memory (CM).
  • a shared memory controller (SMC) 15 is connected to NAS channel adapters 1100 – 1105 , fiber channel adapters 1110 , 1111 , disk adaptors 120 , and shared memory 13 .
  • Shared memory controller 15 controls data transfer between NAS channel adapters 1100 – 1105 and fiber channel adapters 1110 , 1111 , as well as between disk adapters 120 and shared memory 13 .
  • a cache memory controller (CMC) 16 is connected to NAS channel adapters 1100 – 1105 , fiber channel adapters 1110 , 1111 , disk adapters 120 , and cache memory 14 .
  • Cache memory controller 16 controls data transfer between NAS channel adapters 1100 – 1105 and fiber channel adapters 1110 , 1111 , as well as between disk adapters 120 and cache memory 14 .
  • the LANs 20 and 21 connect NAS channel adapters 1100 – 1105 to NAS clients 400 .
  • the IP network is used for the LANs.
  • Different domains are assigned to LANs 20 and 21 .
  • domain means a management range in a network.
  • DOM-LAN 0 domain names are given to LAN 20 and DOM-LAN 1 domain names are given to LAN 21 .
  • SAN 30 connects fiber channel adapters 1100 – 1105 to SAN clients 500 .
  • a DOM-FC 0 domain name is given to SAN 30 .
  • every channel adapter can access the cache memory 14 and every storage unit 1700 via cache memory controller 16 .
  • Storage system 1 is provided with both SAN and NAS interfaces. This embodiment enables multiple NAS channel adapters to be divided into groups and each of the groups to be connected to a LAN managed in a domain different from the others. Of course, storage system 1 may be provided with only SAN or NAS interfaces.
  • FIG. 2 shows an external view of storage system 1 .
  • Disk controller 11 houses NAS channel adapters 1100 – 1105 , fiber channel adapters 1110 , 1111 , disk adapters 120 , shared memory 13 and cache memory 14 .
  • Disk units (DKU) 180 and 181 house storage units 1700 , respectively.
  • Shared memory 13 is actually composed of multiple controller boards 130
  • cache memory 14 is composed of multiple cache boards 140 .
  • Boards 130 and 140 are loaded in slots 190 .
  • the user of storage system 1 increases/decreases the number of those boards to obtain a desired storage capacity.
  • FIG. 2 shows how boards 130 and 140 are loaded in the respective slots 190 by fours.
  • Adapter boards that include a built-in NAS channel adapters 1100 – 1105 are also loaded in the slots 190 .
  • the shape of slots 190 , the size of the adapter boards, and the shape of the connectors are fixed among all the interfaces to make them compatible. Consequently, disk controller 11 can house any adapter boards in any slots 190 regardless of their interface types.
  • the user of storage system 1 can choose a combination of a number of NAS channel adapters 1100 – 1105 and a number of fiber channel adapters 1110 , 1111 , and freely load them in slots 190 of storage system 1 .
  • FIG. 3 shows a configuration of an adapter board that includes a built-in NAS channel adapter 1100 .
  • a connector 11007 is connected to a connector of a disk controller.
  • NAS channel adapter 1100 and fiber channel adapter 1110 or 1111 have the same configuration of connectors.
  • An interface connector 2001 conforms to the IP network.
  • interface connector 2001 corresponds to a fiber channel.
  • FIG. 4 is an internal block diagram of NAS channel adapter 1100 .
  • Reference numeral 11001 denotes a center controller.
  • a LAN controller 11002 connects a LAN via interface connector 2001 .
  • Memory 1004 is connected to center controller 11001 .
  • Memory 11004 stores programs and control data to be executed by center controller 11001 .
  • a shared memory interface controller (SM I/F) 11005 controls access of NAS channel adapters 1100 – 1105 to shared memory 13 .
  • Cache memory interface controlling means 11006 controls access of the NAS channel adapters to cache memory 14 .
  • SM I/F shared memory interface controller
  • Center controller 11001 may be a single processor or a set of processors.
  • center controller 11001 may be composed of symmetrical multiple processors used for the horizontal load distribution of control processing.
  • the symmetrical multiple processors may be configured so that one processor employs the I/O interface protocol for processing and the other processor controls disk volumes.
  • the configuration of fiber channel adapters 1110 , 1111 is the same as that shown in FIG. 4 except that LAN controller 11002 is replaced with a fiber channel controller.
  • FIG. 5 is a block diagram of memory 11004 of NAS channel adapters 1100 – 1105 .
  • An operating system program 110040 is used to manage all the programs and control the data I/O in the subject system.
  • a LAN controller driver program 110041 is used to control LAN controller 11002 (shown in FIG. 4 ).
  • a TCP/IP program 110042 is used to control the TCP/IP that is a LAN communication protocol.
  • a file system program 110043 is used to manage files stored in the storage unit.
  • a network file system program 110044 is used to control the protocols of Network File System used to supply files stored in the storage unit to NAS clients 400 .
  • a disk volume control program 110045 is used to control access to disk volumes set in the storage units 1700 (shown in FIGS.
  • a cache control program 110046 is used to manage the data in cache memory 14 (shown in FIGS. 1 and 2 ) and to control hit/miss decisions, etc.
  • a fail-over program 110047 is used to control such processing as passing of processing from a NAS channel adapter that has failed to another normal NAS adapter. The fail-over program 110047 will be described more in detail later.
  • the channel adapters of storage system 1 are managed in layers to make it easier to manage storage system 1 . That is, the channel adapters are divided into four layers according to four indexes: physical interface, logical interface, domain, and fail-over group. The indexes are not limited to only those four, however.
  • FIG. 6 shows an example of the channel adapters division into the four layers.
  • a shaded area denotes storage system 1 .
  • the outermost track denotes the physical interface layer.
  • channel adapters are grouped according to the physical medium of the interface by which each channel adapter is connected to the host. Specifically, channel adapters are grouped according to the four physical media of the fiber channel, the UltraSCSI, the Mainframe Channel, and the IP network.
  • the second track denotes the physical interface layer.
  • channel adapters are grouped according to the logical protocol of the interface by which each channel adapter is connected to the host. Specifically, channel adapters are grouped by the fiber channel protocol (FCP), the SCSI, the Mainframe Channel, and the NAS (that is, a file I/O interface), and the iSCSI logical protocol.
  • FCP fiber channel protocol
  • SCSI Serial Bus interface
  • Mainframe Channel Mainframe Channel
  • NAS that is, a file I/O interface
  • NAS that is, a file I/O interface
  • the third track denotes the domain layer.
  • channel adapters are grouped according to the assigned domain (an IP network domain [sub-net] for the IP network, one SCSI bus for the SCSI, and the whole SAN composed in one group or single address space for the fiber channel).
  • the innermost track denotes the fail-over group layer.
  • channel adapters that are fail-over-enabled are grouped into one unit.
  • the group may be a single group, like the DOM-LAN 0 domain, or two or more, like the DOM-LAN 2 domain.
  • the number of channel adapters in a fail-over group may be two, like FOG-LN 1 , or three or more, like FOG-LN 0 .
  • Each innermost square denotes a channel adapter. In FIG. 6 , there are a total of 27 channel adapters.
  • FIG. 7 is a block diagram of shared memory 13 .
  • This shared memory 13 stores management information used to manage channel adapters.
  • a configuration management information storing area 131 stores management information that denotes the configuration of each item of the storage system, such as an interface.
  • Configuration information storing area 131 includes a channel adapter management table 1310 , fail-over management information 1311 , a heartbeat mark storing area 1312 , and a fail-over information storing area 1313 .
  • FIG. 8 shows the contents of channel adapter management table 1310 .
  • the table 1310 is used to manage channel adapter groups.
  • the tables shown in FIG. 8 are formed in accordance with the configuration of storage system 1 .
  • the channel adapter entry 13101 includes registered channel adapter identifiers.
  • the physical interface group entry 13102 holds the physical interface group to which each registered channel adapter belongs.
  • the logical interface group entry 13103 holds information about the logical interface group to which each registered channel adapter belongs.
  • a domain entry 13104 holds information about the domain to which each registered channel adapter belongs.
  • a fail-over group entry 13105 holds information about the fail-over group to which each registered channel adapter belongs.
  • a status entry 13106 holds the status of each registered channel adapter (normal, abnormal, channel adapter for which a fail-over operation is done, etc.).
  • An operating ratio entry 13107 holds information about the operating state of each registered channel adapter, particularly the operation ratio of each channel adapter.
  • FIG. 8 shows how the processing jobs of failed channel adapters CHN 1 and CHN 2 are taken over by a normal channel adapter CHN 3 in the same fail-over group (FOG-LN 0 ) of the same domain (DON-LAN 0 ).
  • CHN 3 executes its own processing, as well as the processing of the two CHNs that have failed, so that the operating ratio of CHN 3 becomes as high as 86%.
  • heartbeat mark storing area 1312 stores state information about each channel adapter.
  • the state information is sometimes referred to as a heartbeat mark.
  • the heartbeat mark includes such data as NAS channel adapter identifier, normal code, updating time.
  • Takeover (See FIG. 7 ) information storing area 1313 holds the takeover-related information for each channel adapter, so that the processing of a failed channel adapter can be taken over by another channel adapter.
  • the takeover information includes both MAC and IP addresses of LAN controller 11002 , device information for file system 110043 or mount point information, and export information for network file system 110044 .
  • Takeover information storing area 1313 stores information related to takeover processing between channel adapters, monitoring related information, specific channel adapter processing to be taken over (to be described later) and information about each channel adapter to be monitored, etc. Each of the above information items will be described more in detail with reference to FIGS. 12 through 14 .
  • the operation of storage system 1 in this embodiment is now described, starting with a description of how storage system 1 operates when a failure is detected in a channel adapter.
  • the “failure” mentioned here means an unrecoverable failure that occurs in a channel adapter whose processing must be taken over by a normal channel adapter.
  • the failed channel adapter is CH-A and the adapter that takes over the processing of CH-A is CH-B.
  • CH-A detects a failure by itself
  • the fail-over processing, the recovery processing, and the take-back processing are executed by the following procedure.
  • CH-A finds the failure and executes a block-off processing. As a result, heartbeat mark updating of CH-A stops.
  • the block-off processing means stopping a channel adapter operation.
  • Recovery processing is executed for CH-A.
  • recovery processing means CH-A board replacement, repair, or other service by a maintenance worker.
  • Storage system 1 executes recovery processing according to the reported failure content.
  • the report may be any of the messages displayed on the screen of the subject management terminal, a Simple Network Management Protocol (SNMP), an E-mail, a syslog, a pocket bell sound, an assisting notice (via a hot line to the center), etc.
  • SNMP Simple Network Management Protocol
  • CH-B confirms that heartbeat mark of CH-A has been updated.
  • CH-A cannot execute a block-off process by itself for a failure detected therein, CH-A executes the following procedure.
  • FIG. 9 is a flowchart of the operation in fail-over procedure step (1) of the center controller 11001 of the NAS channel adapter CHN 1101 .
  • CHN 1101 is equivalent to CH-A.
  • Center controller 11001 monitors failure occurrence in CHN 1101 by using a fail-over control program 110047 .
  • Center controller 11001 starts up the fail-over control program 110047 when the CHN 1101 is powered (step 4700 ).
  • Center controller 11001 decides whether or not a failure has occurred in CHN 1101 under the control of the fail-over control program 110047 (step 4701 ).
  • center controller 11001 controls processing so that the heartbeat mark is stored in heartbeat mark storing area 1312 of shared memory 13 (step 4702 ). After the storing (or updating) the heartbeat mark, fail-over control program 110047 stops for a fixed time (step 4703 ). After that, center controller 11001 repeats processing in steps 4701 to 4703 .
  • center controller 11001 executes the following processing. Note, however, that a hardware failure might be detected when a hardware interruption is issued to a given center controller 11001 , in a step other than step 4701 . Even in that case, the center controller 11001 executes the following processing.
  • Center controller 11001 when it is able to work, stops the updating of the heartbeat mark.
  • Center controller 11001 can also control heartbeat mark updating to enable the heartbeat mark to include information denoting that CHN 1101 has stopped due to a detected failure (step 4704 ).
  • Center controller 11001 then sets the detected failure (failed channel adapter) in the cell equivalent to CHN 1101 in the status entry 13106 column of channel adapter management table 1310 (step 4705 ). After that, center controller 11001 executes block-off processing (step 4706 ).
  • center controller 11001 When center controller 11001 is not able to work, the processing in steps 4704 to 4706 cannot be executed. If the operation of center controller 1101 is disabled, the heartbeat mark is not updated (equivalent to (1′)) even when heartbeat mark updating time is reached. In this case, another channel adapter monitors the communication status of the heartbeat mark to detect a failure occurrence in the failed channel adapter (equivalent to (2)). In addition, the monitoring channel adapter executes the processing in steps 4705 and 4706 , that is, the processing in (3′) in place of the failed channel adapter, and, thereby, the fail-over processing is continued.
  • FIG. 10 is a flowchart of how the processing of CH-A are taken over by CH-B. Specifically, the flowchart shows the operations in (2) and (3) of NAS channel adapter CHN 1102 .
  • center controller 11001 When CHN 1102 is powered, its center controller 11001 starts up fail-over control program 110047 (step 4800 ). Center controller 11001 monitors failure occurrence in the target channel adapter in the same fail-over group by checking the heartbeat mark of the target channel adapter (CHN 1101 in this case).
  • a “monitoring target channel adapter” means another channel adapter assigned to a first channel adapter to be monitored by that channel adapter. Such a monitoring target channel adapter is registered in fail-over management information 1311 stored in shared memory 13 . Each target channel adapter is set at the factory when the product is delivered or it is set freely by the user through a software program pre-installed in the product.
  • center controller 11001 decides that a failure has occurred in the target channel adapter (steps 4801 and 4802 ). When no failure is detected, center controller 11001 sleeps for a predetermined time (steps 4802 and 4803 ), then repeats processing in steps 4801 to 4803 .
  • center controller 11001 checks the state of the failed channel adapter, that is, the state of CHN 1101 (step 4804 ).
  • CHN 1102 executes post-failure processing in place of CHN 1101 .
  • Post-failure processing means that instead of center controller 11001 of the failed channel adapter, a normal channel adapter has detected a failure; sets the failure occurrence (failure state) in the status column of channel adapter management table 1310 , in the cell corresponding to the failed channel adapter; and forcibly blocks off the failed channel adapter. This processing is equivalent to the processing in (3′)(step 4810 ).
  • center controller 11001 identifies the subsidiary channel adapter whose processing is to be taken over.
  • Information about the subsidiary channel adapter is stored in fail-over management information 1131 .
  • a subsidiary channel adapter means a channel adapter assigned to another channel adapter so that the other channel adapter takes over the processing of the subsidiary channel adapter when a failure is detected in the subsidiary channel adapter.
  • CHN 1101 is assigned as a subsidiary channel adapter of CHN 1102
  • CHN 1102 takes over the processing of CHN 1101 when a failure is detected in CHN 1101 .
  • the subsidiary channel adapter is not only the channel adapter that has failed, but also another channel adapter whose processing had been taken over by the channel adapter that has failed.
  • a channel adapter when it takes over the processing of another channel adapter, is also required to take over the processing of every channel adapter.
  • center controller 11001 checks the presence of the channel adapter with reference to fail-over management information 1311 .
  • CHN 1101 is assigned as a subsidiary channel adapter of CHN 1102 . Consequently, center controller 11001 identifies CHN 1101 as a subsidiary channel adapter in this step. How such a subsidiary channel adapter is checked is described later (step 4805 ). Center controller 11001 updates the information included in fail-over management information 1311 . How the information the information is updated is described later (step 4806 ).
  • Center controller 11001 updates each monitoring target channel adapter. This is because updating the information in fail-over management information 1311 might cause assignment of another NAS channel adapter that must be monitored. How the information is updated is described later (step 4807 ).
  • Center controller 11001 of CHN 1102 which has detected a failure in CHN 1101 a monitored subsidiary channel adapter of CHN 1102 , takes over the processing of CHN 1101 in the following procedure.
  • Center controller 11001 obtains from fail-over information storing area 1313 of shared memory 13 , the fail-over information related to the failed CHN 1101 . Center controller 11001 then sets both the MAC and IP addresses of LAN controller 11002 of failed CHN 1101 in the LAN controller 11002 of CHN 1102 . As a result, CHN 1102 can respond to both the LAN access to CHN 1101 and the LAN access to CHN 1102 . Center controller 11001 then mounts a file system mounted in CHN 1101 in CHN 1102 according to the device information and the mount point information related to file system 110043 of CHN 1101 . Center controller 11001 replays the journal as a recovery processing of the file system.
  • center controller 11001 opens the recovered file system at a predetermined export point according to the export information of network file system 110044 .
  • Center controller 11001 takes over any unfinished processing that was requested of CHN 1101 by a NAS client, as needed (step 4808 ). This completes the fail-over processing (step 4809 ). After that, center controller 11001 restarts the monitoring in step 4800 .
  • FIG. 11 is a flowchart of recovery processing in a channel adapter that takes over (CHN 1102 in this case) the processing of a failed channel adapter, that is, operations ( 6 ) and ( 7 ).
  • center controller 11001 starts the recovery processing (step 4900 ).
  • Center controller 11001 checks the heartbeat mark of every monitoring target channel adapter (step 4901 ). This processing is the same as that in step 4801 .
  • center controller 11001 executes the processing in and after step 4904 (step 4902 ).
  • center controller 11001 sleeps for a predetermined time (step 4903 ), then repeats the processing in steps 4901 to 4903 .
  • Center controller 11001 then updates fail-over management information 1311 to eliminate CHN 1101 from the fail-over processing (step 4904 ). How the information 1311 is updated is described later. Center controller 11001 updates the target channel adapter of fail-over processing. That is, center controller 11001 updates the necessary information to eliminate the recovered channel adapter from fail-over processing.
  • center controller 11001 can eliminate the channel adapter from fail-over processing.
  • the process is as follows. First, CHN 1101 fails and CHN 1102 takes over the processing of CHN 1101 . Then, CHN 1102 fails and CHN 1103 takes over the processing of both CHN 1102 and CHN 1101 . If CHN 1102 is recovered after that, CHN 1103 can exit the processing of both CHN 1102 and CHN 1101 (step 4905 ). How the necessary information in such a case is updated is described in detail, later, with reference to FIGS. 12 through 14 .
  • Center controller 11001 updates the monitoring target channel adapter. This is because the monitoring target channel adapter might also be changed due to the updating of the fail-over management information, etc.(step 4906 ). Center controller 11001 then executes take-back processing. “Take-back processing” means processing that returns fail-over processing to the original NAS channel adapter. That is, fail-over information taken over in fail-over processing is returned to the recovered channel adapter (step 4907 ). This completes recovery processing (step 4908 ). If there is another NAS channel adapter whose processing is to be taken over by CHN 1102 , the above processing steps are repeated again.
  • FIGS. 12 to 14 show concrete examples of a series of fail-over processes.
  • there are four NAS channel adapters (CHN 0 , CHN 1 , CHN 2 , and CHN 3 ) in fail-over group FOG-LN 0 of domain DOM-LAN 0 and two of the channel adapters, CHN 1 and CHN 2 , have failed consecutively.
  • the right portion of FIG. 12( a ) shows that each CHN is operating normally.
  • Each CHN periodically updates its heartbeat mark (HBM) stored in heart beat mark storing area 1312 (an HBM being periodically updated is shown as ON).
  • HBM heartbeat mark
  • the contents of the fail-over management information are as shown in the left portion of FIG. 12( a ).
  • the information is stored in fail-over management information 1311 as a list as shown in the right portion of FIG. 12( a ).
  • the CHN located at the arrowhead monitors the CHN at the other (round) end of the arrow.
  • the CHN located at the arrowhead executes a fail-over operation (the dotted line arrow shown in the left portion of FIG. 12( a ) also denotes the same relationship).
  • the CHN 1 monitors the CHN 0 .
  • the CHN 0 is a target channel adapter to be monitored by CHN 1 .
  • the relationship denoted by this arrow is referred to as a “current” relationship.
  • FIG. 12( b ) shows that the CHN 1 has failed.
  • CHN 1 fails, updating of the heartbeat mark of CHN 1 stops (the HBM updating stopped state is shown as OFF).
  • CHN 2 detects the HBM updating has stopped. Fail-over management information 1311 shown in the right portion of FIG. 12( b ) is not updated at this time.
  • FIG. 12( c ) shows that the CHN 2 has taken over the processing of CHN 1 .
  • the channel adapter that has taken over the processing of CHN 1 (hereinafter, the takeover channel adapter) is set as CHN 1 .
  • CHN 2 which detected the failed CHN 1 identifies CHN 1 as a subsidiary channel adapter, then updates fail-over management information 1311 as shown in the right portion of FIG. 12(C) .
  • FIG. 12( c ) shows that CHN 1 becomes a subsidiary channel adapter of CHN 2 , thereby its processing is taken over by CHN 2 (as denoted by the solid, upward arrow in the figure).
  • the relationship denoted by this upward arrow is referred to as a “takeover” relationship.
  • Such a “current” relationship of is set between CHN 0 and CHN 2 .
  • CHN 1 is added to the target channel adapters to be monitored by CHN 2 .
  • the “current” relationship between CHN 0 and CHN 1 and the one between CHN 1 and CHN 2 are respectively updated to a default relationship (as shown by a dotted arrow in the figure).
  • the take-over relationship denoted by a solid line indicates an “active” relationship that both channel adapters are monitoring each other (or taking over).
  • the dotted line denotes an “inactive” relationship.
  • An inactive relationship indicates that none of the monitoring and taking-over is carried out between the subject channel adapters. Because of updated fail-over management information 1311 , CHN 2 comes to have two active relationships of “takeover” and “current.” As a result, CHN 2 monitors two channel adapters (CHN 1 and CHN 0 ), as shown in the left portion of FIG. 12( c ).
  • FIG. 13( a ) shows that CHN 2 has failed.
  • HBM heartbeat mark
  • CHN 3 detects that CHN 2 heartbeat mark updating has stopped. Fail-over management information 1311 shown in the right portion of FIG. 13( a ) is not updated at this time.
  • FIG. 13( b ) shows the state of CHN 3 , which has taken over the processing of CHN 2 .
  • CHN 3 is set as the takeover channel adapter of CHN 2 .
  • CHN 1103 which detected the failure of CHN 2 , identifies CHN 2 as a target channel adapter and updates fail-over information 1311 , as shown in the right portion of FIG. 13( b ).
  • CHN 2 becomes a subsidiary channel adapter of CHN 3
  • CHN 1 which is a subsidiary channel adapter of the CHN 2
  • CHN 1103 also becomes as a subsidiary channel adapter of CHN 1103 ; thus, the processing of both CHN 1 and CHN 2 are taken over by CHN 3 (as denoted by the solid, upward arrow in the figure).
  • a takeover relationship is set between CHN 1 and CHN 3 , as well as between CHN 2 and CHN 3 .
  • the takeover relationship between CHN 1 and CHN and the “current” relationship between CHN 0 and CHN 2 are reset.
  • a new “current” relationship is set between CHN 0 and CHN 3 .
  • the “current” relationship between CHN 2 and CHN 3 is updated to default.
  • the default relationship between CHN 0 and CHN 1 and the one between CHN 1 and CHN 2 are kept as they are.
  • CHN 3 Due to the updating of fail-over management information 1311 as described above, CHN 3 comes to have three active relationships (two takeover relationships and one “current” relationship). As a result, CHN 3 monitors three channel adapters (CHN 1 , CHN 2 , and CHN 0 ), as shown in the left portion of the figure.
  • FIG. 13( c ) shows the state of CHN 1 recovered from a failure.
  • HBM heartbeat mark
  • FIG. 14( a ) shows the state of CHN 1 after the processing is returned from the CHN 3 thereto.
  • CHN 3 was set as the takeover channel adapter of CHN 1 .
  • CHN 3 which detected the recovered CHN 1 , updates fail-over management information 1311 as shown in the right portion of FIG. 14( a )
  • the takeover relationship between CHN 1 and CHN 3 is reset from CHN 1 .
  • the default relationship between CHN 1 and CHN 3 is updated to a “current” relationship.
  • the relationship between CHN 0 and CHN 1 is updated from default to “current”. This means that processing is returned (taken back) from CHN 3 by CHN 1 .
  • the “current”[flow] relationship between CHN 0 and CHN 3 is reset.
  • the relationship between CHN 2 and CHN 3 is kept as is at this time.
  • CHN 3 Due to the updating of fail-over management information 1311 as described above, CHN 3 comes to have two active relationships (one takeover relationship and one “current” relationship). As a result, CHN 3 monitors two channel adapters (CHN 1 and CHN 2 ), as shown in the left portion of in FIG. 14( a ).
  • FIG. 14( b ) shows the state of CHN 2 recovered from a failure.
  • HBM heartbeat mark
  • FIG. 14( c ) shows the state of CHN 1102 after processing is returned from CHN 3 thereto.
  • CHN 1103 was set as the takeover channel adapter of CHN 2 .
  • CHN 3 which detected the recovered CHN 1102 , updates fail-over management information 1311 as shown in the right portion of FIG. 14( c ).
  • the take-over relationship between CHN 2 and CHN 3 is reset from CHN 2 .
  • the default relationship between CHN 2 and CHN 3 is updated to a “current” relationship.
  • the relationship between CHN 1 and CHN 2 is updated from default to current. This means that processing is returned (taken back) from CHN 3 to CHN 2 .
  • the “current” relationship between CHN 1 and CHN 3 is reset. Updating of the above relationships restores the state shown in FIG. 12( a ).
  • a channel adapter provided with various kinds of block I/O interfaces and a channel adapter provided with various kinds of file I/O interfaces together in one storage system; thus, the storage system can be connected to a plurality of network domains.
  • a channel adapter to be monitored is the same as that from which processing is to be taken over
  • the channel adapter to be monitored may be different from the channel adapter from which processing is to be taken over.
  • information must be exchanged between CHN 2 and CHN 3 in such a system configuration. This required processing will be described later.
  • the storage system 1 chooses a takeover channel adapter statically according to predetermined fail-over management information.
  • the usage ratio of the takeover channel adapter will become very high.
  • the present invention provides a variation of this embodiment. Specifically, the storage system itself collects and records the operating ratio of each channel adapter and selects the channel adapter whose operating ratio is the lowest in the same fail-over group as a takeover channel adapter. The storage system then enables the takeover channel adapter to take over the processing of each channel adapter that has failed.
  • both takeover and monitor relationships defined in the above embodiment shown in FIG. 12( b ) are modified. That is, an arrow line in FIG. 12B does not represent any takeover relationship, but represents only a relationship between the channel adapters so that one channel adapter monitors failure occurrence in the other channel adapter.
  • each channel adapter measures the operating ratio of its center controller 11001 and periodically stores the result in channel adapter management table 1310 .
  • an idle process is executed when the center controller 11001 has no work to execute.
  • the interval in which the idle process is executed is measured for a certain time, thereby calculating the operating ratio of center controller 11001 in a fixed period.
  • the fixed period may be any value, but it should preferably be a time interval to which the measurement overhead is added so as to become larger enough with respect to the processor clock, for example, about 1 second.
  • a takeover channel adapter is identified as follows.
  • a channel adapter monitors heart beat mark area 1312 , just as in the above embodiment, to detect a channel adapter that has failed, which is a target channel adapter to be monitored.
  • the channel adapter that has detected the failed channel adapter refers to channel adapter management table 1310 to identify the channel adapter whose operating ratio is the lowest at that time among the normal channel adapters in the same fail-over group. Then, the channel adapter that detected the failed channel adapter selects the channel adapter whose operating ratio is the lowest as the takeover channel adapter. After that, the channel adapter that detected the failed channel adapter updates fail-over management information 1311 .
  • a takeover relationship is thus set between the failed channel adapter and the NAS channel adapter selected as the takeover channel adapter.
  • the monitoring relationships of default and “current” are the same as those shown in FIG. 12 .
  • the monitoring channel adapter sends a signal to the channel adapter selected as the takeover channel adapter. Receiving the signal, the takeover channel adapter refers to fail-over management information 1311 to ascertain that it has become the takeover channel adapter of the failed channel adapter. After that, the takeover channel adapter executes fail-over processing as described above.
  • concentration of the load on the takeover channel adapter can be avoided.
  • a takeover channel adapter is chosen according to the operating ratio at a certain time as described above, such a takeover channel adapter may also be selected so that the load of the takeover channel adapter is dispersed over a long period according to the recorded variation of the operating ratio over time, etc. In this case, the effect of the load balance will become more significant for a system with a load that varies with time.
  • the present invention therefore, provides a storage system that can employ various kinds of interfaces conforming to the standards of both NAS and SAN.
  • the system configuration is more adaptable, and system configuration varied more freely to reduce management costs. It is also possible to provide a storage system with excellent resistance to multiple failures occurring in multiple interfaces conforming to the standards of both NAS and SAN.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Hardware Redundancy (AREA)

Abstract

A storage system 1 including multiple slots for loading a block I/O interface controller, a file I/O interface controller, and any other kinds of interface controllers that are combined freely. The storage system 1 includes a management table that manages fail-over-enabled devices by grouping those devices in accordance with the interface type and the domain to which each device belongs; an information table that directs a fail-over procedure; and fail-over controlling means that takes over the processing of a failed interface controller belonging to a fail-over-enabled group. The fail-over system offers several modalities for monitoring failures, selecting takeover controllers and restoring functionality. Storage system 1 solves conventional problems by providing a system that can mount a plurality of file systems, and that resists multiple failures detected in a fail-over server.

Description

CROSS-REFERENCES TO RELATED APPLICATIONS
Not Applicable
STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT
Not Applicable
REFERENCE TO A “SEQUENCE LISTING,” A TABLE, OR A COMPUTER PROGRAM LISTING APPENDIX SUBMITTED ON A COMPACT DISK.
Not Applicable
BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates to fail-over storage systems employed for computer systems. More particularly, the present invention relates to fail-over storage systems provided with a plurality of input/output interfaces.
2. Background of the Invention
Interfaces (I/F) used between storage systems and computers are roughly classified into two types. The first type is the “block I/O interface.” This interface enables data input/output (I/O) in blocks, a block being a unit of data management of storage units. The fiber channel, the SCSI (Small Computer Systems Interface), the Mainframe channel, etc., belong to this “block I/O interface” type. Multiple computers are often connected to multiple storage systems by such block I/O interfaces in systems. The systems are referred to as a storage area network (SAN). Fiber channels are usually used to interconnect a SAN.
The second type of interface is the “file I/O interface.” This type of interface enables data I/O in files. Interfaces that enable data I/O by using the Network File System, a protocol used to transfer files between file servers and client servers, are file I/O interfaces. A storage system provided with this type of file I/O interface and capable of connecting a network, including a local area network (LAN), is referred to as a network attached storage (NAS) system.
A conventional technique, disclosed in U.S. Pat. No. 5,696,895, referred to as the fail-over technique assures the resistance of file servers to failures. Specifically, the technique enables “heartbeat” signals to be exchanged between a first server that uses a first storage system and a second server that uses a second storage system. If a failure occurs in the first server, the “heartbeat” signal stops. The second server detects the absence of signal and accesses the first storage system used by the first server to take over the processing of the first server (fail-over processing).
BRIEF SUMMARY OF THE INVENTION
According to the above-described conventional technique, if someone wants a computer system which includes a SAN function and a NAS function, it is necessary to prepare the SAN storage system and the NAS storage system independently to make use of both the SAN and NAS functionalities. Consequently, each of those storage systems needs to be managed individually, increasing the system management cost.
Usually, the conventional NAS storage system is composed of a file server and a storage system with the file server attached to a storage system as a host computer. The conventional fail-over technique considers a failure that might occur in the file server of the NAS storage system., but it does not consider any failure that might occur in the storage system of the NAS. Furthermore, the conventional fail-over technique gives no consideration to any failure that might occur in the storage system that performs the fail-over processing (resistance to multiple failures). In addition, the conventional technique does not provide for a storage system capable of connecting multiple network domains, nor to the fail-over processing executable in that configuration. Under the circumstances, one feature of the present invention is to provide a storage system that can reduce system management cost by managing numerous system interfaces collectively. In addition, the present invention provides a storage system resistant to multiple failures and capable of connecting many network domains.
To provide these features, the storage system of the preferred embodiment includes multiple slots used for various interface controllers such as a block I/O interface controller or a file I/O interface controller, and multiple disk controllers used to control various disk drives to be accessed from those interface controllers. Other and further objects, features and advantages of the invention will appear more fully from the following description.
BRIEF DESCRIPTION OF THE DRAWINGS
A preferred form of the present invention is illustrated in the accompanying drawings in which:
FIG. 1 is a block diagram of a storage system in an embodiment of the present invention;
FIG. 2 is a schematic view of a storage system in the embodiment of the present invention;
FIG. 3 is an external view of a channel adapter;
FIG. 4 is a block diagram of the channel adapter;
FIG. 5 is an internal block diagram of a memory of the channel adapter shown in FIG. 4;
FIG. 6 is a concept chart for grouping channel adapters;
FIG. 7 is an internal block diagram of shared memory;
FIG. 8 is a channel adapter management table;
FIG. 9 is a flowchart of processing executed in both a failed channel adapter and takeover channel adapter;
FIG. 10 is a flowchart of processing executed in both failed channel adapter and takeover channel adapter;
FIG. 11 is a flowchart of processing executed in both the recovered and takeover channel adapters; and
FIGS. 12 through 14 are examples of fail-over operations.
DESCRIPTION OF THE PREFERRED EMBODIMENTS
In a preferred embodiment of the present invention, each of the interface controllers is mounted as a board in the subject computer system and the shapes of all the controllers are the same so that they can be loaded in any of the slots. Furthermore, the above configuration of the storage system of the present invention, in another preferred embodiment, further includes a management table that manages fail-over interface controllers collectively, an information table that directs a fail-over procedure, and fail-over control means the taking-over of processing between interface controllers belonging to the same fail-over interface group according to the directed fail-over procedure.
FIG. 1 shows an embodiment of a storage system of the present invention. (Herein, “x” denotes an integer.) Storage system 1 includes a disk controller 11 and multiple storage units 1700. In the disk controller 11, NAS channel adapters (CHN) 11001105, are interface controllers connected to NAS clients 400 via a file I/O interface. Fiber channel adapters (CHF) 1110, 1111 are interface controllers connected to SAN clients 500 via a block I/O interface. Herein, CHN and CHF will be referred to as channel adapters. Each storage unit 1700 is connected to a disk adapter 120. Each disk adapter 120 (DKA) controls a storage unit 1700 connected thereto. Reference numeral 13 denotes a shared memory (SM); 14 denotes a cache memory (CM). A shared memory controller (SMC) 15 is connected to NAS channel adapters 11001105, fiber channel adapters 1110, 1111, disk adaptors 120, and shared memory 13. Shared memory controller 15 controls data transfer between NAS channel adapters 11001105 and fiber channel adapters 1110, 1111, as well as between disk adapters 120 and shared memory 13. A cache memory controller (CMC) 16 is connected to NAS channel adapters 11001105, fiber channel adapters 1110, 1111, disk adapters 120, and cache memory 14. Cache memory controller 16 controls data transfer between NAS channel adapters 11001105 and fiber channel adapters 1110, 1111, as well as between disk adapters 120 and cache memory 14.
The LANs 20 and 21 connect NAS channel adapters 11001105 to NAS clients 400. Generally, the IP network is used for the LANs. Different domains are assigned to LANs 20 and 21. Here, “domain” means a management range in a network. In this embodiment, DOM-LAN0 domain names are given to LAN 20 and DOM-LAN1 domain names are given to LAN 21. SAN 30 connects fiber channel adapters 11001105 to SAN clients 500. In this embodiment, a DOM-FC0 domain name is given to SAN 30.
In storage system 1, every channel adapter can access the cache memory 14 and every storage unit 1700 via cache memory controller 16. Storage system 1 is provided with both SAN and NAS interfaces. This embodiment enables multiple NAS channel adapters to be divided into groups and each of the groups to be connected to a LAN managed in a domain different from the others. Of course, storage system 1 may be provided with only SAN or NAS interfaces.
FIG. 2 shows an external view of storage system 1. Disk controller 11 houses NAS channel adapters 11001105, fiber channel adapters 1110, 1111, disk adapters 120, shared memory 13 and cache memory 14. Disk units (DKU) 180 and 181 house storage units 1700, respectively. Shared memory 13 is actually composed of multiple controller boards 130, and cache memory 14 is composed of multiple cache boards 140. Boards 130 and 140 are loaded in slots 190. The user of storage system 1 increases/decreases the number of those boards to obtain a desired storage capacity. FIG. 2 shows how boards 130 and 140 are loaded in the respective slots 190 by fours.
Adapter boards that include a built-in NAS channel adapters 11001105 are also loaded in the slots 190. In this embodiment, the shape of slots 190, the size of the adapter boards, and the shape of the connectors are fixed among all the interfaces to make them compatible. Consequently, disk controller 11 can house any adapter boards in any slots 190 regardless of their interface types. The user of storage system 1 can choose a combination of a number of NAS channel adapters 11001105 and a number of fiber channel adapters 1110, 1111, and freely load them in slots 190 of storage system 1.
FIG. 3 shows a configuration of an adapter board that includes a built-in NAS channel adapter 1100. A connector 11007 is connected to a connector of a disk controller. In this embodiment, as described above, NAS channel adapter 1100 and fiber channel adapter 1110 or 1111 have the same configuration of connectors. An interface connector 2001 conforms to the IP network. When the adapter board is a fiber channel adapter 1110 or 1111, interface connector 2001 corresponds to a fiber channel.
FIG. 4 is an internal block diagram of NAS channel adapter 1100. Reference numeral 11001 denotes a center controller. A LAN controller 11002 connects a LAN via interface connector 2001. Memory 1004 is connected to center controller 11001. Memory 11004 stores programs and control data to be executed by center controller 11001. A shared memory interface controller (SM I/F) 11005 controls access of NAS channel adapters 11001105 to shared memory 13. Cache memory interface controlling means 11006 controls access of the NAS channel adapters to cache memory 14.
Center controller 11001 may be a single processor or a set of processors. For example, center controller 11001 may be composed of symmetrical multiple processors used for the horizontal load distribution of control processing. The symmetrical multiple processors may be configured so that one processor employs the I/O interface protocol for processing and the other processor controls disk volumes. The configuration of fiber channel adapters 1110, 1111 is the same as that shown in FIG. 4 except that LAN controller 11002 is replaced with a fiber channel controller.
FIG. 5 is a block diagram of memory 11004 of NAS channel adapters 11001105. An operating system program 110040 is used to manage all the programs and control the data I/O in the subject system. A LAN controller driver program 110041 is used to control LAN controller 11002 (shown in FIG. 4). A TCP/IP program 110042 is used to control the TCP/IP that is a LAN communication protocol. A file system program 110043 is used to manage files stored in the storage unit. A network file system program 110044 is used to control the protocols of Network File System used to supply files stored in the storage unit to NAS clients 400. A disk volume control program 110045 is used to control access to disk volumes set in the storage units 1700 (shown in FIGS. 1 and 2). A cache control program 110046 is used to manage the data in cache memory 14 (shown in FIGS. 1 and 2) and to control hit/miss decisions, etc. A fail-over program 110047 is used to control such processing as passing of processing from a NAS channel adapter that has failed to another normal NAS adapter. The fail-over program 110047 will be described more in detail later.
Next, the processing executed in storage system 1 is described. In this embodiment, the channel adapters of storage system 1 are managed in layers to make it easier to manage storage system 1. That is, the channel adapters are divided into four layers according to four indexes: physical interface, logical interface, domain, and fail-over group. The indexes are not limited to only those four, however.
FIG. 6 shows an example of the channel adapters division into the four layers. In the figure, a shaded area denotes storage system 1. The outermost track denotes the physical interface layer. In this layer, channel adapters are grouped according to the physical medium of the interface by which each channel adapter is connected to the host. Specifically, channel adapters are grouped according to the four physical media of the fiber channel, the UltraSCSI, the Mainframe Channel, and the IP network.
The second track denotes the physical interface layer. In this layer, channel adapters are grouped according to the logical protocol of the interface by which each channel adapter is connected to the host. Specifically, channel adapters are grouped by the fiber channel protocol (FCP), the SCSI, the Mainframe Channel, and the NAS (that is, a file I/O interface), and the iSCSI logical protocol.
The third track denotes the domain layer. In this layer, channel adapters are grouped according to the assigned domain (an IP network domain [sub-net] for the IP network, one SCSI bus for the SCSI, and the whole SAN composed in one group or single address space for the fiber channel). The innermost track denotes the fail-over group layer. In this layer, channel adapters that are fail-over-enabled are grouped into one unit.
To perform a fail-over operation between channel adapters, address information must be exchanged between the channel adapters. Consequently, one and the same domain must be assigned to the channel adapters in a fail-over group. When the fail-over group is in the same domain, the group may be a single group, like the DOM-LAN0 domain, or two or more, like the DOM-LAN2 domain. The number of channel adapters in a fail-over group may be two, like FOG-LN1, or three or more, like FOG-LN0. Each innermost square denotes a channel adapter. In FIG. 6, there are a total of 27 channel adapters.
FIG. 7 is a block diagram of shared memory 13. This shared memory 13 stores management information used to manage channel adapters. A configuration management information storing area 131 stores management information that denotes the configuration of each item of the storage system, such as an interface. Configuration information storing area 131 includes a channel adapter management table 1310, fail-over management information 1311, a heartbeat mark storing area 1312, and a fail-over information storing area 1313.
FIG. 8 shows the contents of channel adapter management table 1310. The table 1310 is used to manage channel adapter groups. The tables shown in FIG. 8 are formed in accordance with the configuration of storage system 1. The channel adapter entry 13101 includes registered channel adapter identifiers. The physical interface group entry 13102 holds the physical interface group to which each registered channel adapter belongs. The logical interface group entry 13103 holds information about the logical interface group to which each registered channel adapter belongs. A domain entry 13104 holds information about the domain to which each registered channel adapter belongs. A fail-over group entry 13105 holds information about the fail-over group to which each registered channel adapter belongs. A status entry 13106 holds the status of each registered channel adapter (normal, abnormal, channel adapter for which a fail-over operation is done, etc.). An operating ratio entry 13107 holds information about the operating state of each registered channel adapter, particularly the operation ratio of each channel adapter.
FIG. 8 shows how the processing jobs of failed channel adapters CHN 1 and CHN 2 are taken over by a normal channel adapter CHN 3 in the same fail-over group (FOG-LN0) of the same domain (DON-LAN0). In this case, CHN 3 executes its own processing, as well as the processing of the two CHNs that have failed, so that the operating ratio of CHN 3 becomes as high as 86%.
Returning to the description of FIG. 7, heartbeat mark storing area 1312 stores state information about each channel adapter. Herein, the state information is sometimes referred to as a heartbeat mark. The heartbeat mark includes such data as NAS channel adapter identifier, normal code, updating time.
Takeover (See FIG. 7) information storing area 1313 holds the takeover-related information for each channel adapter, so that the processing of a failed channel adapter can be taken over by another channel adapter. The takeover information includes both MAC and IP addresses of LAN controller 11002, device information for file system 110043 or mount point information, and export information for network file system 110044.
Takeover information storing area 1313 stores information related to takeover processing between channel adapters, monitoring related information, specific channel adapter processing to be taken over (to be described later) and information about each channel adapter to be monitored, etc. Each of the above information items will be described more in detail with reference to FIGS. 12 through 14.
The operation of storage system 1 in this embodiment is now described, starting with a description of how storage system 1 operates when a failure is detected in a channel adapter. The “failure” mentioned here means an unrecoverable failure that occurs in a channel adapter whose processing must be taken over by a normal channel adapter. Here, the failed channel adapter is CH-A and the adapter that takes over the processing of CH-A is CH-B.
Where CH-A detects a failure by itself, the fail-over processing, the recovery processing, and the take-back processing are executed by the following procedure.
(1) CH-A finds the failure and executes a block-off processing. As a result, heartbeat mark updating of CH-A stops. The block-off processing means stopping a channel adapter operation.
(2) CH-B confirms that heartbeat mark updating of CH-A has stopped.
(3) CH-B takes over the processing of CH-A (fail-over).
(4) Recovery processing is executed for CH-A. Specifically, recovery processing means CH-A board replacement, repair, or other service by a maintenance worker. Storage system 1 executes recovery processing according to the reported failure content. For example, the report may be any of the messages displayed on the screen of the subject management terminal, a Simple Network Management Protocol (SNMP), an E-mail, a syslog, a pocket bell sound, an assisting notice (via a hot line to the center), etc.
(5) CH-A is recovered and heartbeat mark updating of CH-A restarts.
(6) CH-B confirms that heartbeat mark of CH-A has been updated.
(7) CH-A takes back the processing failed over to CH-B (taking-back).
Where CH-A cannot execute a block-off process by itself for a failure detected therein, CH-A executes the following procedure.
(1′) Another failure occurs in CH-A (because the center controller does not function, heartbeat mark updating also stops at this time.)
(2′) CH-B confirms that heartbeat mark updating of CH-A has stopped.
(3′) CH-B forcibly blocks off the CH-A.
The procedure following (3′) is the same as that of steps (3–7 above), so that the description will be omitted here.
Next, the details of the processing in (1) and (1′) is described. Hereinafter, only the NAS channel adapter will be described, but the fiber channel adapter can be processed in the same way.
FIG. 9 is a flowchart of the operation in fail-over procedure step (1) of the center controller 11001 of the NAS channel adapter CHN 1101. In this case, CHN 1101 is equivalent to CH-A. Center controller 11001 monitors failure occurrence in CHN 1101 by using a fail-over control program 110047 . Center controller 11001 starts up the fail-over control program 110047 when the CHN 1101 is powered (step 4700). Center controller 11001 then decides whether or not a failure has occurred in CHN 1101 under the control of the fail-over control program 110047 (step 4701).
When no failure is detected, center controller 11001 controls processing so that the heartbeat mark is stored in heartbeat mark storing area 1312 of shared memory 13 (step 4702). After the storing (or updating) the heartbeat mark, fail-over control program 110047 stops for a fixed time (step 4703). After that, center controller 11001 repeats processing in steps 4701 to 4703.
When a failure is detected in step 4701, center controller 11001 executes the following processing. Note, however, that a hardware failure might be detected when a hardware interruption is issued to a given center controller 11001, in a step other than step 4701. Even in that case, the center controller 11001 executes the following processing.
Center controller 11001, when it is able to work, stops the updating of the heartbeat mark. Center controller 11001 can also control heartbeat mark updating to enable the heartbeat mark to include information denoting that CHN 1101 has stopped due to a detected failure (step 4704).
Center controller 11001 then sets the detected failure (failed channel adapter) in the cell equivalent to CHN 1101 in the status entry 13106 column of channel adapter management table 1310 (step 4705). After that, center controller 11001 executes block-off processing (step 4706).
When center controller 11001 is not able to work, the processing in steps 4704 to 4706 cannot be executed. If the operation of center controller 1101 is disabled, the heartbeat mark is not updated (equivalent to (1′)) even when heartbeat mark updating time is reached. In this case, another channel adapter monitors the communication status of the heartbeat mark to detect a failure occurrence in the failed channel adapter (equivalent to (2)). In addition, the monitoring channel adapter executes the processing in steps 4705 and 4706, that is, the processing in (3′) in place of the failed channel adapter, and, thereby, the fail-over processing is continued.
FIG. 10 is a flowchart of how the processing of CH-A are taken over by CH-B. Specifically, the flowchart shows the operations in (2) and (3) of NAS channel adapter CHN 1102.
When CHN 1102 is powered, its center controller 11001 starts up fail-over control program 110047 (step 4800). Center controller 11001 monitors failure occurrence in the target channel adapter in the same fail-over group by checking the heartbeat mark of the target channel adapter (CHN 1101 in this case). A “monitoring target channel adapter” means another channel adapter assigned to a first channel adapter to be monitored by that channel adapter. Such a monitoring target channel adapter is registered in fail-over management information 1311 stored in shared memory 13. Each target channel adapter is set at the factory when the product is delivered or it is set freely by the user through a software program pre-installed in the product.
Where the heartbeat mark of such a target channel adapter of monitoring is not updated, even at the predetermined updating time, or when it is confirmed that a failure occurrence code is described in the heartbeat mark, center controller 11001 decides that a failure has occurred in the target channel adapter (steps 4801 and 4802). When no failure is detected, center controller 11001 sleeps for a predetermined time (steps 4802 and 4803), then repeats processing in steps 4801 to 4803.
If a failure is detected, center controller 11001 checks the state of the failed channel adapter, that is, the state of CHN 1101 (step 4804). When no block-off processing is executed for CHN 1101, that is, when CHN 1101 is in the state of (1′), CHN 1102 executes post-failure processing in place of CHN 1101. Post-failure processing means that instead of center controller 11001 of the failed channel adapter, a normal channel adapter has detected a failure; sets the failure occurrence (failure state) in the status column of channel adapter management table 1310, in the cell corresponding to the failed channel adapter; and forcibly blocks off the failed channel adapter. This processing is equivalent to the processing in (3′)(step 4810).
After that, center controller 11001 identifies the subsidiary channel adapter whose processing is to be taken over. Information about the subsidiary channel adapter is stored in fail-over management information 1131.
A subsidiary channel adapter means a channel adapter assigned to another channel adapter so that the other channel adapter takes over the processing of the subsidiary channel adapter when a failure is detected in the subsidiary channel adapter. For example, when CHN 1101 is assigned as a subsidiary channel adapter of CHN 1102, CHN 1102 takes over the processing of CHN 1101 when a failure is detected in CHN 1101. The subsidiary channel adapter is not only the channel adapter that has failed, but also another channel adapter whose processing had been taken over by the channel adapter that has failed. In such a case, a channel adapter, when it takes over the processing of another channel adapter, is also required to take over the processing of every channel adapter. As a result, center controller 11001 checks the presence of the channel adapter with reference to fail-over management information 1311.
In this embodiment, it is assumed that CHN 1101 is assigned as a subsidiary channel adapter of CHN 1102. Consequently, center controller 11001 identifies CHN 1101 as a subsidiary channel adapter in this step. How such a subsidiary channel adapter is checked is described later (step 4805). Center controller 11001 updates the information included in fail-over management information 1311. How the information the information is updated is described later (step 4806).
Center controller 11001 updates each monitoring target channel adapter. This is because updating the information in fail-over management information 1311 might cause assignment of another NAS channel adapter that must be monitored. How the information is updated is described later (step 4807). Center controller 11001 of CHN 1102, which has detected a failure in CHN 1101 a monitored subsidiary channel adapter of CHN 1102, takes over the processing of CHN 1101 in the following procedure.
Center controller 11001 obtains from fail-over information storing area 1313 of shared memory 13, the fail-over information related to the failed CHN 1101. Center controller 11001 then sets both the MAC and IP addresses of LAN controller 11002 of failed CHN 1101 in the LAN controller 11002 of CHN 1102. As a result, CHN 1102 can respond to both the LAN access to CHN 1101 and the LAN access to CHN 1102. Center controller 11001 then mounts a file system mounted in CHN 1101 in CHN 1102 according to the device information and the mount point information related to file system 110043 of CHN 1101. Center controller 11001 replays the journal as a recovery processing of the file system. After that, center controller 11001 opens the recovered file system at a predetermined export point according to the export information of network file system 110044. Center controller 11001 takes over any unfinished processing that was requested of CHN 1101 by a NAS client, as needed (step 4808). This completes the fail-over processing (step 4809). After that, center controller 11001 restarts the monitoring in step 4800.
FIG. 11 is a flowchart of recovery processing in a channel adapter that takes over (CHN 1102 in this case) the processing of a failed channel adapter, that is, operations (6) and (7). At first, center controller 11001 starts the recovery processing (step 4900). Center controller 11001 checks the heartbeat mark of every monitoring target channel adapter (step 4901). This processing is the same as that in step 4801. Confirming the recovery of the failed channel adapter (CHN 1101 in this case), center controller 11001 executes the processing in and after step 4904 (step 4902). When not confirming a recovery, center controller 11001 sleeps for a predetermined time (step 4903), then repeats the processing in steps 4901 to 4903.
Center controller 11001 then updates fail-over management information 1311 to eliminate CHN 1101 from the fail-over processing (step 4904). How the information 1311 is updated is described later. Center controller 11001 updates the target channel adapter of fail-over processing. That is, center controller 11001 updates the necessary information to eliminate the recovered channel adapter from fail-over processing.
Where CHN 1102 takes over not only the processing of CHN 1101, but also the processing of another NAS channel adapter, which had been taken over by CHN 1101, center controller 11001 can eliminate the channel adapter from fail-over processing. In this case the process is as follows. First, CHN 1101 fails and CHN 1102 takes over the processing of CHN 1101. Then, CHN 1102 fails and CHN 1103 takes over the processing of both CHN 1102 and CHN 1101. If CHN 1102 is recovered after that, CHN 1103 can exit the processing of both CHN 1102 and CHN 1101 (step 4905). How the necessary information in such a case is updated is described in detail, later, with reference to FIGS. 12 through 14.
Center controller 11001 updates the monitoring target channel adapter. This is because the monitoring target channel adapter might also be changed due to the updating of the fail-over management information, etc.(step 4906). Center controller 11001 then executes take-back processing. “Take-back processing” means processing that returns fail-over processing to the original NAS channel adapter. That is, fail-over information taken over in fail-over processing is returned to the recovered channel adapter (step 4907). This completes recovery processing (step 4908). If there is another NAS channel adapter whose processing is to be taken over by CHN 1102, the above processing steps are repeated again.
FIGS. 12 to 14 show concrete examples of a series of fail-over processes. In the examples, there are four NAS channel adapters (CHN 0, CHN 1, CHN 2, and CHN 3) in fail-over group FOG-LN0 of domain DOM-LAN0, and two of the channel adapters, CHN 1 and CHN 2, have failed consecutively. The right portion of FIG. 12( a) shows that each CHN is operating normally. Each CHN periodically updates its heartbeat mark (HBM) stored in heart beat mark storing area 1312 (an HBM being periodically updated is shown as ON). In this case, the contents of the fail-over management information are as shown in the left portion of FIG. 12( a). Actually, however, the information is stored in fail-over management information 1311 as a list as shown in the right portion of FIG. 12( a).
The CHN located at the arrowhead monitors the CHN at the other (round) end of the arrow. When the CHN located at the round end of the arrow fails, the CHN located at the arrowhead executes a fail-over operation (the dotted line arrow shown in the left portion of FIG. 12( a) also denotes the same relationship). For example, the CHN 1 monitors the CHN 0. In other words, the CHN 0 is a target channel adapter to be monitored by CHN 1. The relationship denoted by this arrow is referred to as a “current” relationship.
FIG. 12( b) shows that the CHN 1 has failed. When CHN 1 fails, updating of the heartbeat mark of CHN 1 stops (the HBM updating stopped state is shown as OFF). CHN 2 then detects the HBM updating has stopped. Fail-over management information 1311 shown in the right portion of FIG. 12( b) is not updated at this time.
FIG. 12( c) shows that the CHN 2 has taken over the processing of CHN 1. In the fail-over management information, before the fail-over is completed, the channel adapter that has taken over the processing of CHN 1 (hereinafter, the takeover channel adapter) is set as CHN 1. As a result, CHN 2, which detected the failed CHN 1 identifies CHN 1 as a subsidiary channel adapter, then updates fail-over management information 1311 as shown in the right portion of FIG. 12(C).
The right portion of FIG. 12( c) shows that CHN 1 becomes a subsidiary channel adapter of CHN 2, thereby its processing is taken over by CHN 2 (as denoted by the solid, upward arrow in the figure). The relationship denoted by this upward arrow is referred to as a “takeover” relationship. Such a “current” relationship of is set between CHN 0 and CHN 2. This means that CHN 1 is added to the target channel adapters to be monitored by CHN 2. On the other hand, the “current” relationship between CHN 0 and CHN 1 and the one between CHN 1 and CHN 2 are respectively updated to a default relationship (as shown by a dotted arrow in the figure). The take-over relationship denoted by a solid line indicates an “active” relationship that both channel adapters are monitoring each other (or taking over). On the other hand, the dotted line denotes an “inactive” relationship. An inactive relationship indicates that none of the monitoring and taking-over is carried out between the subject channel adapters. Because of updated fail-over management information 1311, CHN 2 comes to have two active relationships of “takeover” and “current.” As a result, CHN 2 monitors two channel adapters (CHN 1 and CHN 0), as shown in the left portion of FIG. 12( c).
FIG. 13( a) shows that CHN 2 has failed. When CHN 2 fails in this way, updating of the heartbeat mark (HBM) of CHN 2 stops. CHN 3 then detects that CHN 2 heartbeat mark updating has stopped. Fail-over management information 1311 shown in the right portion of FIG. 13( a) is not updated at this time.
FIG. 13( b) shows the state of CHN 3, which has taken over the processing of CHN 2. In fail-over management information 1311, before the taking-over is completed, CHN 3 is set as the takeover channel adapter of CHN 2. As a result, CHN 1103, which detected the failure of CHN 2, identifies CHN 2 as a target channel adapter and updates fail-over information 1311, as shown in the right portion of FIG. 13( b). The right portion of the figure also shows that failed CHN 2 becomes a subsidiary channel adapter of CHN 3, and CHN 1, which is a subsidiary channel adapter of the CHN 2, also becomes as a subsidiary channel adapter of CHN 1103; thus, the processing of both CHN 1 and CHN 2 are taken over by CHN 3 (as denoted by the solid, upward arrow in the figure). In other words, a takeover relationship is set between CHN 1 and CHN 3, as well as between CHN 2 and CHN 3. In the meantime, the takeover relationship between CHN 1 and CHN and the “current” relationship between CHN 0 and CHN 2 are reset. Then, a new “current” relationship is set between CHN 0 and CHN 3. In addition, the “current” relationship between CHN 2 and CHN 3 is updated to default. The default relationship between CHN 0 and CHN 1 and the one between CHN 1 and CHN 2 are kept as they are.
Due to the updating of fail-over management information 1311 as described above, CHN 3 comes to have three active relationships (two takeover relationships and one “current” relationship). As a result, CHN 3 monitors three channel adapters (CHN 1, CHN 2, and CHN 0), as shown in the left portion of the figure.
FIG. 13( c) shows the state of CHN 1 recovered from a failure. When CHN 1 has been recovered, updating of the heartbeat mark (HBM) of CHN 1 restarts. CHN 3 then detects this restarted CHN 1 HBM updating. Fail-over management information 1311 shown in the right portion of FIG. 13( c) is not updated at this time.
FIG. 14( a) shows the state of CHN 1 after the processing is returned from the CHN 3 thereto. In the fail-over management information before CHN 1 was recovered, CHN 3 was set as the takeover channel adapter of CHN 1. As a result, CHN 3, which detected the recovered CHN 1, updates fail-over management information 1311 as shown in the right portion of FIG. 14( a) As shown in the figure, the takeover relationship between CHN 1 and CHN 3 is reset from CHN 1. The default relationship between CHN 1 and CHN 3 is updated to a “current” relationship. In addition, the relationship between CHN 0 and CHN 1 is updated from default to “current”. This means that processing is returned (taken back) from CHN 3 by CHN 1. Furthermore, the “current”[flow] relationship between CHN 0 and CHN 3 is reset. The relationship between CHN 2 and CHN 3 is kept as is at this time.
Due to the updating of fail-over management information 1311 as described above, CHN 3 comes to have two active relationships (one takeover relationship and one “current” relationship). As a result, CHN 3 monitors two channel adapters (CHN 1 and CHN 2), as shown in the left portion of in FIG. 14( a).
FIG. 14( b) shows the state of CHN 2 recovered from a failure. When CHN 2 is recovered, updating of the heartbeat mark (HBM) of CHN 2 restarts. CHN 3 then detects this restarted CHN 2 HBM updating. Fail-over management information 1311 shown in the right portion of the figure is not updated at this time.
FIG. 14( c) shows the state of CHN 1102 after processing is returned from CHN 3 thereto. In the fail-over management information before CHN 2 was recovered, CHN 1103 was set as the takeover channel adapter of CHN 2. As a result, CHN 3, which detected the recovered CHN 1102, updates fail-over management information 1311 as shown in the right portion of FIG. 14( c). As shown in the figure, the take-over relationship between CHN 2 and CHN 3 is reset from CHN 2. The default relationship between CHN 2 and CHN 3 is updated to a “current” relationship. In addition, the relationship between CHN 1 and CHN 2 is updated from default to current. This means that processing is returned (taken back) from CHN 3 to CHN 2. Furthermore, the “current” relationship between CHN 1 and CHN 3 is reset. Updating of the above relationships restores the state shown in FIG. 12( a).
According to this embodiment, it is possible to use a channel adapter provided with various kinds of block I/O interfaces and a channel adapter provided with various kinds of file I/O interfaces together in one storage system; thus, the storage system can be connected to a plurality of network domains. In addition, it is possible to compose a proper fail-over group in such a system configuration so that the processing by multiple channel adapters in the fail-over group can be taken over by a normal channel adapter even if consecutive failures occur in the group.
Although, in this embodiment, a channel adapter to be monitored is the same as that from which processing is to be taken over, the channel adapter to be monitored may be different from the channel adapter from which processing is to be taken over. For example, it is possible to configure the system so that CHN 2 monitors CHN 1, but CHN 3 takes over the processing of CHN 1. However, note that information must be exchanged between CHN 2 and CHN 3 in such a system configuration. This required processing will be described later.
In the embodiment as described above, the storage system 1 chooses a takeover channel adapter statically according to predetermined fail-over management information. However, when one channel adapter takes over the processing of multiple channel adapters (fail-over), the usage ratio of the takeover channel adapter will become very high.
To avoid the problem, the present invention provides a variation of this embodiment. Specifically, the storage system itself collects and records the operating ratio of each channel adapter and selects the channel adapter whose operating ratio is the lowest in the same fail-over group as a takeover channel adapter. The storage system then enables the takeover channel adapter to take over the processing of each channel adapter that has failed.
Moreover, both takeover and monitor relationships defined in the above embodiment shown in FIG. 12( b) are modified. That is, an arrow line in FIG. 12B does not represent any takeover relationship, but represents only a relationship between the channel adapters so that one channel adapter monitors failure occurrence in the other channel adapter.
Furthermore, each channel adapter measures the operating ratio of its center controller 11001 and periodically stores the result in channel adapter management table 1310. Specifically, an idle process is executed when the center controller 11001 has no work to execute. The interval in which the idle process is executed is measured for a certain time, thereby calculating the operating ratio of center controller 11001 in a fixed period. The fixed period may be any value, but it should preferably be a time interval to which the measurement overhead is added so as to become larger enough with respect to the processor clock, for example, about 1 second.
A takeover channel adapter is identified as follows. A channel adapter monitors heart beat mark area 1312, just as in the above embodiment, to detect a channel adapter that has failed, which is a target channel adapter to be monitored. The channel adapter that has detected the failed channel adapter refers to channel adapter management table 1310 to identify the channel adapter whose operating ratio is the lowest at that time among the normal channel adapters in the same fail-over group. Then, the channel adapter that detected the failed channel adapter selects the channel adapter whose operating ratio is the lowest as the takeover channel adapter. After that, the channel adapter that detected the failed channel adapter updates fail-over management information 1311. A takeover relationship is thus set between the failed channel adapter and the NAS channel adapter selected as the takeover channel adapter. The monitoring relationships of default and “current” are the same as those shown in FIG. 12.
The monitoring channel adapter sends a signal to the channel adapter selected as the takeover channel adapter. Receiving the signal, the takeover channel adapter refers to fail-over management information 1311 to ascertain that it has become the takeover channel adapter of the failed channel adapter. After that, the takeover channel adapter executes fail-over processing as described above.
According to this embodiment, concentration of the load on the takeover channel adapter can be avoided.
Although a takeover channel adapter is chosen according to the operating ratio at a certain time as described above, such a takeover channel adapter may also be selected so that the load of the takeover channel adapter is dispersed over a long period according to the recorded variation of the operating ratio over time, etc. In this case, the effect of the load balance will become more significant for a system with a load that varies with time.
There are also other methods that employ the operating ratio to select a takeover channel adapter. For example, there is a fail-over method to average the number of clients connected per channel adapter, a fail-over method to average the number of disks to be accessed per channel adapter, etc.
The present invention, therefore, provides a storage system that can employ various kinds of interfaces conforming to the standards of both NAS and SAN. As a result, the system configuration is more adaptable, and system configuration varied more freely to reduce management costs. It is also possible to provide a storage system with excellent resistance to multiple failures occurring in multiple interfaces conforming to the standards of both NAS and SAN.

Claims (3)

1. The storage system comprising:
a plurality of slots usable for each of various kinds of interface controllers, including at least an interface controller that controls a block I/O interface and an interface controller that controls a file I/O interface, said slots having the same shape;
a disk controller comprising:
a plurality of first interface controllers and a plurality of second interface controllers, wherein some of said first interface controllers are connected to a network managed in the same domain and others of said first interface controllers are connected to a network managed in another domain;
a shared memory connected to said first and second interface controllers;
a disk adapter connected to said shared memory;
a cache memory connected to said first and second interface controllers, to said shared memory, and to said disk adapter;
first fail-over means for transferring processing of said failed interface controller to a different interface controller included in some of said first interface controllers, when an interface controller included among said first interface controllers connected to a network managed in said same domain fails; and
second fail-over means for transferring processing of said different interface controller to a normal interface controller included among said first interface controllers, when said different interface controller fails; and
a storage unit connected to said disk controller,
wherein said shared memory stores a procedure for transferring processing of a failed interface controller, to a different interface controller; and
wherein said first and second fail-over means are executed in said procedure.
2. A storage system comprising:
a plurality of slots usable for each of various kinds of interface controllers, including at least an interface controller that controls a block I/O interface and an interface controller that controls a file I/O interface, said slots having the same shape; and
a disk controller comprising:
a plurality of first interface controllers and a plurality of second interface controllers;
a shared memory connected to said first and second interface controllers;
a disk adapter connected to said shared memory; and
a cache memory connected to said first and second interface controllers, to said shared memory, and to said disk adapter,
wherein each of said first and second interface controllers includes:
means that stores a heartbeat mark in a predetermined area in a heartbeat mark storing area of said shared memory at fixed time intervals; and
means that enables said interface controllers to monitor their states, each another by using said heart beat mark stored in said heart beat mark storing area.
3. A storage system comprising:
a plurality of slots usable for each of various kinds of interface controllers, including at least an interface controller that controls a block I/O interface and an interface controller that controls a file I/O interface, said slots having the same shape;
a disk controller comprising:
a plurality of first interface controllers and a plurality of second interface controllers, wherein some of said first interface controllers are connected to a network managed in the same domain and others of said first interface controllers are connected to a network managed in another domain;
a shared memory connected to said first and second interface controllers;
a disk adapter connected to said shared memory;
a cache memory connected to said first and second interface controllers, to said shared memory, and to said disk adapter;
first fail-over means for transferring processing of said failed interface controller to a different interface controller included in some of said first interface controllers, when an interface controller included among said first interface controllers connected to a network managed in said same domain fails; and
second fail-over means for transferring processing of said different interface controller to a normal interface controller included among said first interface controllers, when said different interface controller fails; and
a storage unit connected to said disk controller,
wherein said first fail-over means selects an interface controller whose operating ratio is the lowest among said interface controllers included said first interface controllers and transfers processing of said failed interface controller to said selected interface controller.
US10/150,245 2002-01-16 2002-05-15 Fail-over storage system Expired - Fee Related US7003687B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US11/316,463 US7447933B2 (en) 2002-01-16 2005-12-21 Fail-over storage system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2002-006873 2002-01-16
JP2002006873A JP3964212B2 (en) 2002-01-16 2002-01-16 Storage system

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US11/316,463 Continuation US7447933B2 (en) 2002-01-16 2005-12-21 Fail-over storage system

Publications (2)

Publication Number Publication Date
US20030135782A1 US20030135782A1 (en) 2003-07-17
US7003687B2 true US7003687B2 (en) 2006-02-21

Family

ID=19191285

Family Applications (2)

Application Number Title Priority Date Filing Date
US10/150,245 Expired - Fee Related US7003687B2 (en) 2002-01-16 2002-05-15 Fail-over storage system
US11/316,463 Expired - Fee Related US7447933B2 (en) 2002-01-16 2005-12-21 Fail-over storage system

Family Applications After (1)

Application Number Title Priority Date Filing Date
US11/316,463 Expired - Fee Related US7447933B2 (en) 2002-01-16 2005-12-21 Fail-over storage system

Country Status (2)

Country Link
US (2) US7003687B2 (en)
JP (1) JP3964212B2 (en)

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040148542A1 (en) * 2003-01-23 2004-07-29 Dell Products L.P. Method and apparatus for recovering from a failed I/O controller in an information handling system
US20050188239A1 (en) * 2004-01-30 2005-08-25 Dell Products L.P. Method, software and system for multi-path fail-over recovery in sequential storage systems
US20050228943A1 (en) * 2004-04-02 2005-10-13 Decenzo David P Multipath redundant storage system architecture and method
US20050259632A1 (en) * 2004-03-31 2005-11-24 Intel Corporation Load balancing and failover
US20060075416A1 (en) * 2004-10-04 2006-04-06 Fujitsu Limited Disk array device
US20060129784A1 (en) * 2003-01-20 2006-06-15 Hitachi, Ltd. Method of controlling storage device controlling apparatus, and storage device controlling apparatus
US20070079016A1 (en) * 2003-01-20 2007-04-05 Hitachi, Ltd. Storage device controlling apparatus and a circuit board for the same
US20070083638A1 (en) * 2005-08-31 2007-04-12 Microsoft Corporation Offloaded neighbor cache entry synchronization
US7234073B1 (en) * 2003-09-30 2007-06-19 Emc Corporation System and methods for failover management of manageable entity agents
US7236987B1 (en) 2003-02-28 2007-06-26 Sun Microsystems Inc. Systems and methods for providing a storage virtualization environment
US20070192459A1 (en) * 2006-02-13 2007-08-16 Kazuhide Horimoto Control method of computer, program, and virtual computer system
US7290168B1 (en) * 2003-02-28 2007-10-30 Sun Microsystems, Inc. Systems and methods for providing a multi-path network switch system
US7383381B1 (en) 2003-02-28 2008-06-03 Sun Microsystems, Inc. Systems and methods for configuring a storage virtualization environment
US20080133942A1 (en) * 2003-01-20 2008-06-05 Hitachi Ltd. Method of installing software on storage device controlling apparatus, method of controlling storage device controlling apparatus, and storage device controlling apparatus
US7406617B1 (en) * 2004-11-22 2008-07-29 Unisys Corporation Universal multi-path driver for storage systems including an external boot device with failover and failback capabilities
US20080195831A1 (en) * 2007-02-13 2008-08-14 Fujitsu Limited Data transfer apparatus and data transfer method
US20080222661A1 (en) * 2004-03-19 2008-09-11 Alexander Belyakov Failover and Load Balancing
US7430568B1 (en) 2003-02-28 2008-09-30 Sun Microsystems, Inc. Systems and methods for providing snapshot capabilities in a storage virtualization environment
US20090249114A1 (en) * 2008-03-31 2009-10-01 Fujitsu Limited Computer system
US20100199131A1 (en) * 2009-01-30 2010-08-05 Fujitsu Limited Storage system and a control method for a storage system
US8122120B1 (en) * 2002-12-16 2012-02-21 Unisys Corporation Failover and failback using a universal multi-path driver for storage devices
US8621262B2 (en) 2007-10-01 2013-12-31 Renesas Electronics Corporation Semiconductor integrated circuit and method for controlling semiconductor integrated circuit
US10203890B1 (en) * 2016-09-20 2019-02-12 Tintri Inc. Multi-tier mechanism to achieve high availability in a multi-controller system
US10817220B2 (en) 2019-01-31 2020-10-27 EMC IP Holding Company LLC Sharing processor cores in a multi-threading block i/o request processing data storage system

Families Citing this family (136)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8205009B2 (en) * 2002-04-25 2012-06-19 Emc Israel Development Center, Ltd. Apparatus for continuous compression of large volumes of data
US7702786B2 (en) * 2002-08-09 2010-04-20 International Business Machines Corporation Taking a resource offline in a storage network
US7475124B2 (en) * 2002-09-25 2009-01-06 Emc Corporation Network block services for client access of network-attached data storage in an IP network
JP4439798B2 (en) 2002-10-17 2010-03-24 株式会社日立製作所 Disk array device control method and disk array device
US7415565B2 (en) * 2002-10-31 2008-08-19 Ring Technology Enterprises, Llc Methods and systems for a storage system with a program-controlled switch for routing data
US7197662B2 (en) * 2002-10-31 2007-03-27 Ring Technology Enterprises, Llc Methods and systems for a storage system
US7707351B2 (en) * 2002-10-31 2010-04-27 Ring Technology Enterprises Of Texas, Llc Methods and systems for an identifier-based memory section
US6879526B2 (en) * 2002-10-31 2005-04-12 Ring Technology Enterprises Llc Methods and apparatus for improved memory access
JP2004220216A (en) * 2003-01-14 2004-08-05 Hitachi Ltd San/nas integrated storage device
JP2004227097A (en) * 2003-01-20 2004-08-12 Hitachi Ltd Control method of storage device controller, and storage device controller
JP4342804B2 (en) * 2003-01-31 2009-10-14 株式会社日立製作所 Storage system control method, storage system, and program
JP2004234555A (en) * 2003-01-31 2004-08-19 Hitachi Ltd Control method for storage system, storage system, and program
JP2004234558A (en) * 2003-01-31 2004-08-19 Hitachi Ltd Storage device controller and program
JP3778171B2 (en) * 2003-02-20 2006-05-24 日本電気株式会社 Disk array device
US7904599B1 (en) 2003-03-28 2011-03-08 Cisco Technology, Inc. Synchronization and auditing of zone configuration data in storage-area networks
US7433300B1 (en) * 2003-03-28 2008-10-07 Cisco Technology, Inc. Synchronization of configuration data in storage-area networks
JP2005071196A (en) * 2003-08-27 2005-03-17 Hitachi Ltd Disk array apparatus and control method of its fault information
US7321985B2 (en) * 2004-02-26 2008-01-22 International Business Machines Corporation Method for achieving higher availability of computer PCI adapters
JP2005267111A (en) * 2004-03-17 2005-09-29 Hitachi Ltd Storage control system and method for controlling storage control system
JP3909062B2 (en) * 2004-03-25 2007-04-25 株式会社日立製作所 NAS control device, backup method, and program
US7392261B2 (en) * 2004-05-20 2008-06-24 International Business Machines Corporation Method, system, and program for maintaining a namespace of filesets accessible to clients over a network
JP4870915B2 (en) * 2004-07-15 2012-02-08 株式会社日立製作所 Storage device
JP4605637B2 (en) * 2004-07-29 2011-01-05 株式会社日立製作所 Storage device system and signal transmission method in storage device system
US7487385B2 (en) * 2004-11-01 2009-02-03 Netapp, Inc. Apparatus and method for recovering destroyed data volumes
US7535832B2 (en) * 2004-11-22 2009-05-19 International Business Machines Corporation Apparatus and method to set the signaling rate of a switch domain disposed within an information storage and retrieval system
JP4563794B2 (en) * 2004-12-28 2010-10-13 株式会社日立製作所 Storage system and storage management method
JP2006227856A (en) * 2005-02-17 2006-08-31 Hitachi Ltd Access controller and interface mounted on the same
JP2006244123A (en) * 2005-03-03 2006-09-14 Fujitsu Ltd Data storage system and data storage control device
JP4969791B2 (en) * 2005-03-30 2012-07-04 株式会社日立製作所 Disk array device and control method thereof
JP4871546B2 (en) 2005-08-22 2012-02-08 株式会社日立製作所 Storage system
US7774565B2 (en) * 2005-12-21 2010-08-10 Emc Israel Development Center, Ltd. Methods and apparatus for point in time data access and recovery
US8060713B1 (en) 2005-12-21 2011-11-15 Emc (Benelux) B.V., S.A.R.L. Consolidating snapshots in a continuous data protection system using journaling
US7849361B2 (en) * 2005-12-22 2010-12-07 Emc Corporation Methods and apparatus for multiple point in time data access
EP1969454A2 (en) * 2006-01-03 2008-09-17 EMC Corporation Methods and apparatus for reconfiguring a storage system
JP4414399B2 (en) * 2006-01-30 2010-02-10 富士通株式会社 Disk controller
US7577867B2 (en) * 2006-02-17 2009-08-18 Emc Corporation Cross tagging to data for consistent recovery
US7574630B1 (en) * 2006-08-14 2009-08-11 Network Appliance, Inc. Method and system for reliable access of expander state information in highly available storage devices
US7627687B2 (en) * 2006-09-28 2009-12-01 Emc Israel Development Center, Ltd. Methods and apparatus for managing data flow in a continuous data replication system having journaling
US7627612B2 (en) * 2006-09-28 2009-12-01 Emc Israel Development Center, Ltd. Methods and apparatus for optimal journaling for continuous data replication
JP5090098B2 (en) 2007-07-27 2012-12-05 株式会社日立製作所 Method for reducing NAS power consumption and computer system using the method
US7805632B1 (en) * 2007-09-24 2010-09-28 Net App, Inc. Storage system and method for rapidly recovering from a system failure
US7840536B1 (en) 2007-12-26 2010-11-23 Emc (Benelux) B.V., S.A.R.L. Methods and apparatus for dynamic journal expansion
US8041940B1 (en) 2007-12-26 2011-10-18 Emc Corporation Offloading encryption processing in a storage area network
US7860836B1 (en) 2007-12-26 2010-12-28 Emc (Benelux) B.V., S.A.R.L. Method and apparatus to recover data in a continuous data protection environment using a journal
US7958372B1 (en) 2007-12-26 2011-06-07 Emc (Benelux) B.V., S.A.R.L. Method and apparatus to convert a logical unit from a first encryption state to a second encryption state using a journal in a continuous data protection environment
JP4483947B2 (en) * 2008-01-17 2010-06-16 日本電気株式会社 I / O controller
US9128868B2 (en) * 2008-01-31 2015-09-08 International Business Machines Corporation System for error decoding with retries and associated methods
US8181094B2 (en) * 2008-01-31 2012-05-15 International Business Machines Corporation System to improve error correction using variable latency and associated methods
US8171377B2 (en) 2008-01-31 2012-05-01 International Business Machines Corporation System to improve memory reliability and associated methods
US8185801B2 (en) 2008-01-31 2012-05-22 International Business Machines Corporation System to improve error code decoding using historical information and associated methods
US8176391B2 (en) * 2008-01-31 2012-05-08 International Business Machines Corporation System to improve miscorrection rates in error control code through buffering and associated methods
US8185800B2 (en) * 2008-01-31 2012-05-22 International Business Machines Corporation System for error control coding for memories of different types and associated methods
US8352806B2 (en) * 2008-01-31 2013-01-08 International Business Machines Corporation System to improve memory failure management and associated methods
US9501542B1 (en) 2008-03-11 2016-11-22 Emc Corporation Methods and apparatus for volume synchronization
JP4571203B2 (en) * 2008-05-09 2010-10-27 株式会社日立製作所 Management server and cluster management method in information processing system
US8108634B1 (en) 2008-06-27 2012-01-31 Emc B.V., S.A.R.L. Replicating a thin logical unit
US7719443B1 (en) 2008-06-27 2010-05-18 Emc Corporation Compressing data in a continuous data protection environment
US8060714B1 (en) 2008-09-26 2011-11-15 Emc (Benelux) B.V., S.A.R.L. Initializing volumes in a replication system
US7882286B1 (en) 2008-09-26 2011-02-01 EMC (Benelux)B.V., S.A.R.L. Synchronizing volumes for replication
JP4648447B2 (en) 2008-11-26 2011-03-09 株式会社日立製作所 Failure recovery method, program, and management server
US8327186B2 (en) * 2009-03-10 2012-12-04 Netapp, Inc. Takeover of a failed node of a cluster storage system on a per aggregate basis
US8145838B1 (en) 2009-03-10 2012-03-27 Netapp, Inc. Processing and distributing write logs of nodes of a cluster storage system
US8069366B1 (en) 2009-04-29 2011-11-29 Netapp, Inc. Global write-log device for managing write logs of nodes of a cluster storage system
US8392680B1 (en) 2010-03-30 2013-03-05 Emc International Company Accessing a volume in a distributed environment
US8332687B1 (en) 2010-06-23 2012-12-11 Emc Corporation Splitter used in a continuous data protection environment
US20130144838A1 (en) * 2010-08-25 2013-06-06 Hewlett-Packard Development Company, L.P. Transferring files
US8433869B1 (en) 2010-09-27 2013-04-30 Emc International Company Virtualized consistency group using an enhanced splitter
US8478955B1 (en) 2010-09-27 2013-07-02 Emc International Company Virtualized consistency group using more than one data protection appliance
US8694700B1 (en) 2010-09-29 2014-04-08 Emc Corporation Using I/O track information for continuous push with splitter for storage device
US8335771B1 (en) 2010-09-29 2012-12-18 Emc Corporation Storage array snapshots for logged access replication in a continuous data protection system
US8930620B2 (en) * 2010-11-12 2015-01-06 Symantec Corporation Host discovery and handling of ALUA preferences and state transitions
US8335761B1 (en) 2010-12-02 2012-12-18 Emc International Company Replicating in a multi-copy environment
JP5481658B2 (en) * 2011-06-17 2014-04-23 株式会社日立製作所 Optical communication system, interface board, and control method
US9256605B1 (en) 2011-08-03 2016-02-09 Emc Corporation Reading and writing to an unexposed device
US8898112B1 (en) 2011-09-07 2014-11-25 Emc Corporation Write signature command
US9507524B1 (en) 2012-06-15 2016-11-29 Qlogic, Corporation In-band management using an intelligent adapter and methods thereof
US9223659B1 (en) 2012-06-28 2015-12-29 Emc International Company Generating and accessing a virtual volume snapshot in a continuous data protection system
JP6040612B2 (en) * 2012-07-24 2016-12-07 富士通株式会社 Storage device, information processing device, information processing system, access control method, and access control program
US10235145B1 (en) 2012-09-13 2019-03-19 Emc International Company Distributed scale-out replication
US9336094B1 (en) 2012-09-13 2016-05-10 Emc International Company Scaleout replication of an application
JP5874933B2 (en) * 2013-01-29 2016-03-02 日本電気株式会社 Path control device, path control method, and path control program
US9383937B1 (en) 2013-03-14 2016-07-05 Emc Corporation Journal tiering in a continuous data protection system using deduplication-based storage
US9110914B1 (en) 2013-03-14 2015-08-18 Emc Corporation Continuous data protection using deduplication-based storage
US8996460B1 (en) 2013-03-14 2015-03-31 Emc Corporation Accessing an image in a continuous data protection using deduplication-based storage
US9696939B1 (en) 2013-03-14 2017-07-04 EMC IP Holding Company LLC Replicating data using deduplication-based arrays using network-based replication
US9152339B1 (en) 2013-03-15 2015-10-06 Emc Corporation Synchronization of asymmetric active-active, asynchronously-protected storage
US9244997B1 (en) 2013-03-15 2016-01-26 Emc Corporation Asymmetric active-active access of asynchronously-protected data storage
US9081842B1 (en) 2013-03-15 2015-07-14 Emc Corporation Synchronous and asymmetric asynchronous active-active-active data access
US9069709B1 (en) 2013-06-24 2015-06-30 Emc International Company Dynamic granularity in data replication
US9087112B1 (en) 2013-06-24 2015-07-21 Emc International Company Consistency across snapshot shipping and continuous replication
US9146878B1 (en) 2013-06-25 2015-09-29 Emc Corporation Storage recovery from total cache loss using journal-based replication
US9369525B2 (en) 2013-06-26 2016-06-14 International Business Machines Corporation Highly resilient protocol servicing in network-attached storage
US9304861B2 (en) 2013-06-27 2016-04-05 International Business Machines Corporation Unobtrusive failover in clustered network-attached storage
US9367260B1 (en) 2013-12-13 2016-06-14 Emc Corporation Dynamic replication system
US9405765B1 (en) 2013-12-17 2016-08-02 Emc Corporation Replication of virtual machines
US9158630B1 (en) 2013-12-19 2015-10-13 Emc Corporation Testing integrity of replicated storage
US9454305B1 (en) 2014-01-27 2016-09-27 Qlogic, Corporation Method and system for managing storage reservation
US9189339B1 (en) 2014-03-28 2015-11-17 Emc Corporation Replication of a virtual distributed volume with virtual machine granualarity
US9423980B1 (en) 2014-06-12 2016-08-23 Qlogic, Corporation Methods and systems for automatically adding intelligent storage adapters to a cluster
US10082980B1 (en) 2014-06-20 2018-09-25 EMC IP Holding Company LLC Migration of snapshot in replication system using a log
US9274718B1 (en) 2014-06-20 2016-03-01 Emc Corporation Migration in replication system
US9436654B1 (en) 2014-06-23 2016-09-06 Qlogic, Corporation Methods and systems for processing task management functions in a cluster having an intelligent storage adapter
US9619543B1 (en) 2014-06-23 2017-04-11 EMC IP Holding Company LLC Replicating in virtual desktop infrastructure
US9477424B1 (en) 2014-07-23 2016-10-25 Qlogic, Corporation Methods and systems for using an intelligent storage adapter for replication in a clustered environment
US10324798B1 (en) 2014-09-25 2019-06-18 EMC IP Holding Company LLC Restoring active areas of a logical unit
US10101943B1 (en) 2014-09-25 2018-10-16 EMC IP Holding Company LLC Realigning data in replication system
US10437783B1 (en) 2014-09-25 2019-10-08 EMC IP Holding Company LLC Recover storage array using remote deduplication device
US9460017B1 (en) 2014-09-26 2016-10-04 Qlogic, Corporation Methods and systems for efficient cache mirroring
US9910621B1 (en) 2014-09-29 2018-03-06 EMC IP Holding Company LLC Backlogging I/O metadata utilizing counters to monitor write acknowledgements and no acknowledgements
US9529885B1 (en) 2014-09-29 2016-12-27 EMC IP Holding Company LLC Maintaining consistent point-in-time in asynchronous replication during virtual machine relocation
US9600377B1 (en) 2014-12-03 2017-03-21 EMC IP Holding Company LLC Providing data protection using point-in-time images from multiple types of storage devices
US10496487B1 (en) 2014-12-03 2019-12-03 EMC IP Holding Company LLC Storing snapshot changes with snapshots
US9405481B1 (en) 2014-12-17 2016-08-02 Emc Corporation Replicating using volume multiplexing with consistency group file
US9483207B1 (en) 2015-01-09 2016-11-01 Qlogic, Corporation Methods and systems for efficient caching using an intelligent storage adapter
US9632881B1 (en) 2015-03-24 2017-04-25 EMC IP Holding Company LLC Replication of a virtual distributed volume
US10296419B1 (en) 2015-03-27 2019-05-21 EMC IP Holding Company LLC Accessing a virtual device using a kernel
US9411535B1 (en) 2015-03-27 2016-08-09 Emc Corporation Accessing multiple virtual devices
US9678680B1 (en) 2015-03-30 2017-06-13 EMC IP Holding Company LLC Forming a protection domain in a storage architecture
US10853181B1 (en) 2015-06-29 2020-12-01 EMC IP Holding Company LLC Backing up volumes using fragment files
WO2017052548A1 (en) * 2015-09-24 2017-03-30 Hewlett Packard Enterprise Development Lp Failure indication in shared memory
US9684576B1 (en) 2015-12-21 2017-06-20 EMC IP Holding Company LLC Replication using a virtual distributed volume
US10133874B1 (en) 2015-12-28 2018-11-20 EMC IP Holding Company LLC Performing snapshot replication on a storage system not configured to support snapshot replication
US10235196B1 (en) 2015-12-28 2019-03-19 EMC IP Holding Company LLC Virtual machine joining or separating
US10067837B1 (en) 2015-12-28 2018-09-04 EMC IP Holding Company LLC Continuous data protection with cloud resources
CN106936616B (en) 2015-12-31 2020-01-03 伊姆西公司 Backup communication method and device
CN105739930B (en) * 2016-02-02 2019-01-08 华为技术有限公司 A kind of storage architecture and its initial method and date storage method and managing device
US10579282B1 (en) 2016-03-30 2020-03-03 EMC IP Holding Company LLC Distributed copy in multi-copy replication where offset and size of I/O requests to replication site is half offset and size of I/O request to production volume
US10235087B1 (en) 2016-03-30 2019-03-19 EMC IP Holding Company LLC Distributing journal data over multiple journals
US10152267B1 (en) 2016-03-30 2018-12-11 Emc Corporation Replication data pull
US10235060B1 (en) 2016-04-14 2019-03-19 EMC IP Holding Company, LLC Multilevel snapshot replication for hot and cold regions of a storage system
US10235091B1 (en) 2016-09-23 2019-03-19 EMC IP Holding Company LLC Full sweep disk synchronization in a storage system
US10210073B1 (en) 2016-09-23 2019-02-19 EMC IP Holding Company, LLC Real time debugging of production replicated data with data obfuscation in a storage system
US10235090B1 (en) 2016-09-23 2019-03-19 EMC IP Holding Company LLC Validating replication copy consistency using a hash function in a storage system
US10146961B1 (en) 2016-09-23 2018-12-04 EMC IP Holding Company LLC Encrypting replication journals in a storage system
US10019194B1 (en) 2016-09-23 2018-07-10 EMC IP Holding Company LLC Eventually consistent synchronous data replication in a storage system
JP7331027B2 (en) 2021-02-19 2023-08-22 株式会社日立製作所 Scale-out storage system and storage control method

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5696895A (en) 1995-05-19 1997-12-09 Compaq Computer Corporation Fault tolerant multiple network servers
US5774640A (en) * 1991-10-21 1998-06-30 Tandem Computers Incorporated Method and apparatus for providing a fault tolerant network interface controller
US20020178143A1 (en) * 2001-05-25 2002-11-28 Kazuhisa Fujimoto Storage system, a method of file data backup and method of copying of file data
US20030023784A1 (en) * 2001-07-27 2003-01-30 Hitachi, Ltd. Storage system having a plurality of controllers
US6553408B1 (en) * 1999-03-25 2003-04-22 Dell Products L.P. Virtual device architecture having memory for storing lists of driver modules
US6725106B1 (en) * 2000-02-28 2004-04-20 Autogas Systems, Inc. System and method for backing up distributed controllers in a data network
US20040139168A1 (en) * 2003-01-14 2004-07-15 Hitachi, Ltd. SAN/NAS integrated storage system
US20040153740A1 (en) * 2003-01-31 2004-08-05 Hitachi, Ltd. Methods for controlling storage devices controlling apparatuses
US6779063B2 (en) * 2001-04-09 2004-08-17 Hitachi, Ltd. Direct access storage system having plural interfaces which permit receipt of block and file I/O requests
US6792507B2 (en) * 2000-12-14 2004-09-14 Maxxan Systems, Inc. Caching system and method for a network storage system
US6810462B2 (en) * 2002-04-26 2004-10-26 Hitachi, Ltd. Storage system and method using interface control devices of different types
US20040230720A1 (en) * 2003-01-20 2004-11-18 Hitachi, Ltd. Storage device controlling apparatus and method of controlling the same
US20040233910A1 (en) * 2001-02-23 2004-11-25 Wen-Shyen Chen Storage area network using a data communication protocol

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH06282385A (en) * 1993-03-25 1994-10-07 Hitachi Ltd Storage controller and information processing system provided with this controller
US5848241A (en) 1996-01-11 1998-12-08 Openframe Corporation Ltd. Resource sharing facility functions as a controller for secondary storage device and is accessible to all computers via inter system links
JPH1139103A (en) * 1997-07-24 1999-02-12 Nec Yonezawa Ltd Magnetic tape processor and processing method therefor
JP3741345B2 (en) * 1999-03-24 2006-02-01 株式会社日立製作所 Network connection disk unit
JP2001256003A (en) 2000-03-10 2001-09-21 Hitachi Ltd Disk array controller, its disk array control unit and its expanding method
JP2001325207A (en) * 2000-05-17 2001-11-22 Hitachi Ltd Switch with built-in cache, computer system and switch control method for switch with built-in cache

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5774640A (en) * 1991-10-21 1998-06-30 Tandem Computers Incorporated Method and apparatus for providing a fault tolerant network interface controller
US5696895A (en) 1995-05-19 1997-12-09 Compaq Computer Corporation Fault tolerant multiple network servers
US6553408B1 (en) * 1999-03-25 2003-04-22 Dell Products L.P. Virtual device architecture having memory for storing lists of driver modules
US6725106B1 (en) * 2000-02-28 2004-04-20 Autogas Systems, Inc. System and method for backing up distributed controllers in a data network
US6792507B2 (en) * 2000-12-14 2004-09-14 Maxxan Systems, Inc. Caching system and method for a network storage system
US20040233910A1 (en) * 2001-02-23 2004-11-25 Wen-Shyen Chen Storage area network using a data communication protocol
US6779063B2 (en) * 2001-04-09 2004-08-17 Hitachi, Ltd. Direct access storage system having plural interfaces which permit receipt of block and file I/O requests
US20020178143A1 (en) * 2001-05-25 2002-11-28 Kazuhisa Fujimoto Storage system, a method of file data backup and method of copying of file data
US20030023784A1 (en) * 2001-07-27 2003-01-30 Hitachi, Ltd. Storage system having a plurality of controllers
US6810462B2 (en) * 2002-04-26 2004-10-26 Hitachi, Ltd. Storage system and method using interface control devices of different types
US20040139168A1 (en) * 2003-01-14 2004-07-15 Hitachi, Ltd. SAN/NAS integrated storage system
US20040230720A1 (en) * 2003-01-20 2004-11-18 Hitachi, Ltd. Storage device controlling apparatus and method of controlling the same
US20040153740A1 (en) * 2003-01-31 2004-08-05 Hitachi, Ltd. Methods for controlling storage devices controlling apparatuses

Cited By (46)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8122120B1 (en) * 2002-12-16 2012-02-21 Unisys Corporation Failover and failback using a universal multi-path driver for storage devices
US20070277007A1 (en) * 2003-01-20 2007-11-29 Hitachi, Ltd. Method of Controlling Storage Device Controlling Apparatus, and Storage Device Controlling Apparatus
US7908513B2 (en) * 2003-01-20 2011-03-15 Hitachi, Ltd. Method for controlling failover processing for a first channel controller and a second channel controller
US20060129784A1 (en) * 2003-01-20 2006-06-15 Hitachi, Ltd. Method of controlling storage device controlling apparatus, and storage device controlling apparatus
US20070079016A1 (en) * 2003-01-20 2007-04-05 Hitachi, Ltd. Storage device controlling apparatus and a circuit board for the same
US20080133942A1 (en) * 2003-01-20 2008-06-05 Hitachi Ltd. Method of installing software on storage device controlling apparatus, method of controlling storage device controlling apparatus, and storage device controlling apparatus
US7380057B2 (en) * 2003-01-20 2008-05-27 Hitachi, Ltd. Storage device controlling apparatus and a circuit board for the same
US7263584B2 (en) 2003-01-20 2007-08-28 Hitachi, Ltd. Method of controlling storage device controlling apparatus, and storage device controlling apparatus
US7600157B2 (en) 2003-01-23 2009-10-06 Dell Products L.P. Recovering from a failed I/O controller in an information handling system
US20090037776A1 (en) * 2003-01-23 2009-02-05 Dell Products L.P. Recovering From A Failed I/O Controller In An Information Handling System
US7480831B2 (en) * 2003-01-23 2009-01-20 Dell Products L.P. Method and apparatus for recovering from a failed I/O controller in an information handling system
US20040148542A1 (en) * 2003-01-23 2004-07-29 Dell Products L.P. Method and apparatus for recovering from a failed I/O controller in an information handling system
US7447939B1 (en) 2003-02-28 2008-11-04 Sun Microsystems, Inc. Systems and methods for performing quiescence in a storage virtualization environment
US8166128B1 (en) 2003-02-28 2012-04-24 Oracle America, Inc. Systems and methods for dynamically updating a virtual volume in a storage virtualization environment
US7290168B1 (en) * 2003-02-28 2007-10-30 Sun Microsystems, Inc. Systems and methods for providing a multi-path network switch system
US7430568B1 (en) 2003-02-28 2008-09-30 Sun Microsystems, Inc. Systems and methods for providing snapshot capabilities in a storage virtualization environment
US7383381B1 (en) 2003-02-28 2008-06-03 Sun Microsystems, Inc. Systems and methods for configuring a storage virtualization environment
US7236987B1 (en) 2003-02-28 2007-06-26 Sun Microsystems Inc. Systems and methods for providing a storage virtualization environment
US7234073B1 (en) * 2003-09-30 2007-06-19 Emc Corporation System and methods for failover management of manageable entity agents
US20050188239A1 (en) * 2004-01-30 2005-08-25 Dell Products L.P. Method, software and system for multi-path fail-over recovery in sequential storage systems
US7281169B2 (en) * 2004-01-30 2007-10-09 Dell Products L.P. Method, software and system for multi-path fail-over recovery in sequential storage systems
US20080222661A1 (en) * 2004-03-19 2008-09-11 Alexander Belyakov Failover and Load Balancing
US8429452B2 (en) 2004-03-19 2013-04-23 Intel Corporation Failover and load balancing
US7992039B2 (en) 2004-03-19 2011-08-02 Intel Corporation Failover and load balancing
US20100185794A1 (en) * 2004-03-19 2010-07-22 Alexander Belyakov Failover and load balancing
US7721150B2 (en) * 2004-03-19 2010-05-18 Intel Corporation Failover and load balancing
US20050259632A1 (en) * 2004-03-31 2005-11-24 Intel Corporation Load balancing and failover
US7760626B2 (en) 2004-03-31 2010-07-20 Intel Corporation Load balancing and failover
US20050228943A1 (en) * 2004-04-02 2005-10-13 Decenzo David P Multipath redundant storage system architecture and method
US8024602B2 (en) 2004-04-02 2011-09-20 Seagate Technology Llc Multipath redundant storage system architecture and method
US20080276033A1 (en) * 2004-04-02 2008-11-06 Seagate Technology Llc Multipath redundant storage system architecture and method
US20060075416A1 (en) * 2004-10-04 2006-04-06 Fujitsu Limited Disk array device
US7509527B2 (en) * 2004-10-04 2009-03-24 Fujitsu Limited Collection of operation information when trouble occurs in a disk array device
US7406617B1 (en) * 2004-11-22 2008-07-29 Unisys Corporation Universal multi-path driver for storage systems including an external boot device with failover and failback capabilities
US20070083638A1 (en) * 2005-08-31 2007-04-12 Microsoft Corporation Offloaded neighbor cache entry synchronization
US7577864B2 (en) * 2006-02-13 2009-08-18 Hitachi, Ltd. Control method of computer, program, and virtual computer system
US20070192459A1 (en) * 2006-02-13 2007-08-16 Kazuhide Horimoto Control method of computer, program, and virtual computer system
US20080195831A1 (en) * 2007-02-13 2008-08-14 Fujitsu Limited Data transfer apparatus and data transfer method
US7895375B2 (en) * 2007-02-13 2011-02-22 Fujitsu Limited Data transfer apparatus and data transfer method
US8621262B2 (en) 2007-10-01 2013-12-31 Renesas Electronics Corporation Semiconductor integrated circuit and method for controlling semiconductor integrated circuit
US8370682B2 (en) * 2008-03-31 2013-02-05 Fujitsu Limited Virtual tape system take-over-controlled by standby server computer
US20090249114A1 (en) * 2008-03-31 2009-10-01 Fujitsu Limited Computer system
US8145952B2 (en) * 2009-01-30 2012-03-27 Fujitsu Limited Storage system and a control method for a storage system
US20100199131A1 (en) * 2009-01-30 2010-08-05 Fujitsu Limited Storage system and a control method for a storage system
US10203890B1 (en) * 2016-09-20 2019-02-12 Tintri Inc. Multi-tier mechanism to achieve high availability in a multi-controller system
US10817220B2 (en) 2019-01-31 2020-10-27 EMC IP Holding Company LLC Sharing processor cores in a multi-threading block i/o request processing data storage system

Also Published As

Publication number Publication date
JP2003208362A (en) 2003-07-25
US20060117211A1 (en) 2006-06-01
US7447933B2 (en) 2008-11-04
JP3964212B2 (en) 2007-08-22
US20030135782A1 (en) 2003-07-17

Similar Documents

Publication Publication Date Title
US7003687B2 (en) Fail-over storage system
US7111084B2 (en) Data storage network with host transparent failover controlled by host bus adapter
US6609213B1 (en) Cluster-based system and method of recovery from server failures
US7069468B1 (en) System and method for re-allocating storage area network resources
US6880101B2 (en) System and method for providing automatic data restoration after a storage device failure
EP1760591B1 (en) System and method of managing access path
US6883065B1 (en) System and method for a redundant communication channel via storage area network back-end
US7127633B1 (en) System and method to failover storage area network targets from one interface to another
US8028193B2 (en) Failover of blade servers in a data center
EP1370945B1 (en) Failover processing in a storage system
US7043663B1 (en) System and method to monitor and isolate faults in a storage area network
US7003688B1 (en) System and method for a reserved memory area shared by all redundant storage controllers
JP3714613B2 (en) Storage device, information processing device including the storage device, and information storage system recovery method
JP4400913B2 (en) Disk array device
US6996741B1 (en) System and method for redundant communication between redundant controllers
JP4856864B2 (en) Logical unit security for clustered storage area networks
JP5523468B2 (en) Active-active failover for direct attached storage systems
US7945773B2 (en) Failover of blade servers in a data center
US20030120772A1 (en) Data fail-over for a multi-computer system
US7689759B2 (en) Method and apparatus for providing continuous access to shared tape drives from multiple virtual tape servers within a data storage system
US7966449B2 (en) Distributed storage system with global replication
US7702757B2 (en) Method, apparatus and program storage device for providing control to a networked storage architecture
US7231503B2 (en) Reconfiguring logical settings in a storage system
CN113535472A (en) Cluster server
CN118069376B (en) Multi-tenant high-availability system based on SAN storage

Legal Events

Date Code Title Description
AS Assignment

Owner name: HITACHI, LTD., JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:MATSUNAMI, NAOTO;SONODA, KOUJI;KITAMURA, MANABU;AND OTHERS;REEL/FRAME:012921/0630;SIGNING DATES FROM 20020304 TO 20020313

FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY Fee payment

Year of fee payment: 4

FEPP Fee payment procedure

Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY Fee payment

Year of fee payment: 8

FEPP Fee payment procedure

Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.)

LAPS Lapse for failure to pay maintenance fees

Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.)

STCH Information on status: patent discontinuation

Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362

FP Lapsed due to failure to pay maintenance fee

Effective date: 20180221