JP2012138841A

JP2012138841A - Information processing device, operation control method, and program

Info

Publication number: JP2012138841A
Application number: JP2010291093A
Authority: JP
Inventors: Akihiro Nonoyama; 明広野々山
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2010-12-27
Filing date: 2010-12-27
Publication date: 2012-07-19

Abstract

PROBLEM TO BE SOLVED: To provide an information processing device that implements an operation function for a redundant configuration, allowing the information processing devices that form the redundant configuration to be added, removed, and so on as needed.

SOLUTION: According to an embodiment, an information processing device that forms a redundant configuration together with another information processing device connected via a network has: life message processing means for circulating a life message among the information processing devices forming the redundant configuration to detect abnormalities; master processing means for managing, as a master, the circulation route of the life message and, when a request to participate in the redundant configuration is received from another information processing device, adding that device to the circulation route as a slave; and slave processing means for transmitting a request to participate in the redundant configuration to another information processing device that manages the circulation route as a master, so as to join the circulation route as a slave.

Description

Embodiments of the present invention relate to an operation control technique suitable for an information processing apparatus that forms a redundant configuration with other information processing apparatuses connected via a network.

To improve fault tolerance or to distribute load, it is common practice to build a redundant configuration by loosely coupling a plurality of information processing apparatuses. Server functions that provide various services over the Internet are now generally built in a redundant configuration.

Various mechanisms have been proposed for such redundant configurations, for example, a mechanism for quickly identifying the location where an abnormality has occurred.

Cloud technology has emerged in recent years as a technology for public networks. For the cloud, redundancy techniques have been proposed that cover not only redundant server configurations but also the application layer, such as the databases that make up the services provided by a server system. Redundancy techniques implemented in the cloud include "consistent hashing" and the "distributed hash table (DHT)".

Consistent hashing is a technique for handling data placement and the addition (or replacement) of servers in order to maintain a database composed of a plurality of servers. An ID space is defined using a hash function that generates integer-valued IDs. Depending on the hash function used, the ID space extends up to 2^128 − 1 (MD5) or 2^160 − 1 (SHA-1). A hash value is computed from the key of the data to be stored using the hash function, and the data is sent to the server responsible for the range containing that hash value. When a server is added, the ranges of hash values assigned to the servers are split or reassigned.
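The placement rule described above can be illustrated with a minimal sketch. It assumes MD5 as the hash function and assumes that each server is positioned in the ID space by hashing its name; the class `HashRing` and the function `key_to_id` are hypothetical names introduced only for this illustration and do not come from any product mentioned in this document.

```python
import bisect
import hashlib


def key_to_id(key: str) -> int:
    """Map a key to an integer ID in the MD5 ID space (0 .. 2**128 - 1)."""
    return int.from_bytes(hashlib.md5(key.encode("utf-8")).digest(), "big")


class HashRing:
    """Each server owns the IDs after its predecessor's ID up to and including its own ID."""

    def __init__(self, servers=()):
        self._ids = []    # sorted server IDs
        self._names = {}  # server ID -> server name
        for name in servers:
            self.add_server(name)

    def add_server(self, name: str) -> None:
        """Insert a server; only the range just below its ID changes owner."""
        server_id = key_to_id(name)
        if server_id not in self._names:
            bisect.insort(self._ids, server_id)
        self._names[server_id] = name

    def server_for(self, key: str) -> str:
        """Return the server responsible for the range containing the key's hash."""
        if not self._ids:
            raise LookupError("no servers registered")
        index = bisect.bisect_left(self._ids, key_to_id(key))
        # Wrap around to the first server when the hash exceeds every server ID.
        return self._names[self._ids[index % len(self._ids)]]
```

In this sketch, adding a server splits exactly one existing range, so a key either stays on its previous server or moves to the newly added one.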

As an implementation using a distributed hash table, "Windows (registered trademark) AzureTable" has been commercialized. "Windows (registered trademark) AzureTable" uses the consistent hashing algorithm described above to compute a hash value from the key of the input data and sends the data to the service instance (software on a server) that manages that data.

A service instance stores the data it receives and, at that time, sends copies of the data to its adjacent service instances. As a result, the data is held by three service instances. If the service instance that manages the data as the primary disappears (stops because of a server failure, maintenance, or the like), an adjacent service instance operates as an alternative server.

Japanese Unexamined Patent Application Publication No. H8-95931 (JP-A-8-95931)

When a redundant configuration of server functions is built with personal computers (PCs) or other machines that are not dedicated server models, 24-hour operation is often not a strict requirement, unlike with management software for conventional server models. In addition, out of consideration for the environment, it is necessary to cope with states in which PC functions are stopped or restricted, such as power-off, hibernation, and sleep, depending on the PC's power settings.

When the configuration consists of multiple PCs, mutual monitoring with a conventional server monitoring scheme can make the number of monitored pairs enormous (nC2 combinations). When management is instead concentrated in a specific management server, a problem in that management server requires switching to an alternative server, but it has been difficult to designate this alternative server flexibly.
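To make the scale of the nC2 figure concrete, the following sketch compares the number of monitored pairs under full mutual monitoring with the number of hops in a single circulating route. The ring count of n hops is an assumption made for illustration (one hop per node for one full circulation); the function names are hypothetical.

```python
from math import comb


def mutual_monitoring_pairs(n: int) -> int:
    """Number of node pairs when every node monitors every other node (nC2)."""
    return comb(n, 2)


def ring_hops(n: int) -> int:
    """Assumed number of hops for one message to circulate once through n nodes."""
    return n if n >= 2 else 0


for n in (4, 10, 50):
    print(n, mutual_monitoring_pairs(n), ring_hops(n))
```

For 50 nodes, for example, full mutual monitoring involves 1225 pairs, whereas a single circulation under the stated assumption needs 50 hops.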

The present invention has been made in view of these circumstances, and its object is to provide an information processing apparatus, an operation control method, and a program that implement an operation function for a redundant configuration, making it possible, among other things, to add and remove the information processing apparatuses that form the redundant configuration as needed.

According to an embodiment, an information processing apparatus that forms a redundant configuration with other information processing apparatuses connected via a network comprises: life message processing means for circulating a life message among the information processing apparatuses forming the redundant configuration in order to detect abnormalities; master processing means for managing the circulation route of the life message as a master and, upon receiving a request to participate in the redundant configuration from another information processing apparatus, adding that apparatus to the circulation route of the life message as a slave; and slave processing means for transmitting a request to participate in the redundant configuration to another information processing apparatus that manages the circulation route of the life message as a master, thereby joining the circulation route of the life message as a slave.

A diagram showing the schematic hardware configuration of the information processing apparatus according to the embodiment.
A diagram showing the software configuration related to the operation function for the redundant configuration of the information processing apparatus according to the embodiment.
A conceptual diagram explaining the operating principle of each node when a management loop is created in the redundant configuration that the information processing apparatus according to the embodiment forms with other information processing apparatuses.
A diagram showing an example of the message format of the unique ID request used in the redundant configuration.
A diagram showing an example of the message format of the unique ID response / unique ID update used in the redundant configuration.
A diagram showing an example of the message format of the life message used in the redundant configuration.
A conceptual diagram explaining the operating principle of each node when a node is added to the management loop in the redundant configuration.
A conceptual diagram explaining the operating principle of each node when a node withdraws from the management loop in the redundant configuration.
A diagram showing an example of the message format of the withdrawal message used in the redundant configuration.
A conceptual diagram explaining the operating principle of each node when a participating node in the management loop stops responding.
A first conceptual diagram explaining the operating principle of each node when the management node in the management loop stops responding.
A second conceptual diagram explaining the operating principle of each node when the management node in the management loop stops responding.
A timing chart showing the start processing procedure of the management node in the redundant configuration.
A timing chart showing the start processing procedure of a participating node in the redundant configuration.
A timing chart showing the processing procedure for adding a participating node to the management loop in the redundant configuration.
A timing chart showing the processing procedure of the alternative management node in the redundant configuration.

Embodiments are described below with reference to the drawings.

FIG. 1 is a diagram showing the schematic hardware configuration of the information processing apparatus according to the embodiment.

As shown in FIG. 1, the information processing apparatus includes a processor 1, a main memory 2, an external storage device 3, an input device 4, a display device 5, and a communication device 6.

The processor 1 is a processing device that controls the operation of each unit in the information processing apparatus by executing various programs loaded from the external storage device 3 into the main memory 2. The main memory 2 is a storage device that temporarily stores the programs executed by the processor 1 and the data input and output by those programs. The external storage device 3 is a storage device that serves as an auxiliary to the main memory 2 and permanently stores large amounts of programs and data.

The input device 4 handles the input side of the user interface provided by the information processing apparatus. The display device 5 handles the output side of that user interface. The communication device 6 controls data communication performed with other information processing apparatuses via the network.

The information processing apparatus with the hardware configuration described above can operate as a server that provides various services to other information processing apparatuses (clients) via a network. When building a redundant configuration of server functions with other information processing apparatuses connected via the network, the apparatus implements an operation function for the redundant configuration that makes it possible, among other things, to add and remove the apparatuses forming the configuration as needed. This point is described in detail below.

FIG. 2 is a diagram showing the software configuration related to the operation function for the redundant configuration of the information processing apparatus.

The operation function for the redundant configuration is implemented by the management loop operation program 10. The management loop operation program 10 implements operation functions such as adding and removing the information processing apparatuses that form the redundant configuration; load distribution control among the apparatuses and the like is assumed to be handled separately by known methods and is not considered here. The network protocol 20 is a program that allows various software, including the management loop operation program 10, to use the communication device 6 (hardware) to perform data communication over the network with software on other information processing apparatuses. The OS (operating system)/driver 30 is a program for resource management and hardware control of the information processing apparatus. The management loop operation program 10, the network protocol 20, and the OS/driver 30 all function by being loaded from the external storage device 3 into the main memory 2 and executed by the processor 1.

As shown in FIG. 2, the management loop operation program 10 has, as processing units, a node management function module 11, a participating node function module 12, a management node function module 13, a life message management function module 14, a unique ID message management function module 15, a message transmission/reception function module 16, and a user interface function module 17, and, as a data unit, a collection node list table 18.

In the redundant configuration that the information processing apparatus forms with other information processing apparatuses, a life message is circulated among the apparatuses forming the configuration. The life message is circulated so that, when an abnormality occurs in one of the apparatuses forming the redundant configuration, the other apparatuses (those other than the one in which the abnormality occurred) can detect it. The circulation route of the life message is referred to here as a management loop. An information processing apparatus that forms the redundant configuration and sends and receives the life message on the management loop is referred to here as a node. Nodes are broadly divided into one management node, which takes the role of the master that manages the management loop, and the other participating nodes, which join the management loop as slaves under the management node (master). The management loop operation program 10 is software that can make the information processing apparatus operate as either a management node or a participating node. In the following, an information processing apparatus operating as a management node may be referred to simply as a management node, and an information processing apparatus operating as a participating node simply as a participating node.

The participating node function module 12 is a module for making the information processing apparatus operate as a participating node. The management node function module 13 is a module for making the information processing apparatus operate as a management node. The node management function module 11 is a module that decides which of the participating node function module 12 and the management node function module 13 to run, that is, whether the information processing apparatus operates as a management node or as a participating node.

As its initial operation when the management loop operation program 10 starts, the node management function module 11 performs the settings needed to run the participating node function module 12 so that the information processing apparatus operates as a participating node. If, during this initial operation, an administrator or the like inputs an instruction to operate as a management node, the node management function module 11 instead performs the settings needed to run the management node function module 13 so that the information processing apparatus operates as a management node. The user interface function module 17 is a module that provides a mechanism for exchanging information with the administrator or the like.

Furthermore, if another information processing apparatus operating as the management node stops after the information processing apparatus has started operating as a participating node, the node management function module 11 decides to shift to being the alternative management node and performs the settings needed to move from the state in which the participating node function module 12 is running to the state in which the management node function module 13 is running.
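The role decisions described for the node management function module 11 can be sketched as a small state holder. This is a minimal illustration under stated assumptions: the names `Role`, `NodeManager`, and `promote_to_alternative_manager` are hypothetical, and the sketch models only which role is active, not how the modules are actually configured.

```python
from enum import Enum


class Role(Enum):
    PARTICIPATING = "participating"
    MANAGEMENT = "management"


class NodeManager:
    """Hypothetical stand-in for the role decision of the node management function module."""

    def __init__(self, admin_requested_management: bool = False):
        # Default at start-up is the participating role; an administrator's
        # instruction during the initial operation selects the management role.
        if admin_requested_management:
            self.role = Role.MANAGEMENT
        else:
            self.role = Role.PARTICIPATING

    def promote_to_alternative_manager(self) -> None:
        """Switch a participating node to the management role after the manager stops."""
        if self.role is Role.PARTICIPATING:
            self.role = Role.MANAGEMENT


node = NodeManager()
node.promote_to_alternative_manager()
print(node.role)
```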

The management node function module 13 creates and manages the management loop. When the information processing apparatus is operating as the management node, the management node function module 13 uses a unique ID request sent from another information processing apparatus to recognize that apparatus as one that is to join the management loop as a participating node. The unique ID is identification information that each node uses to identify the management loop in which it participates.

The management node function module 13 returns a unique ID response to the unique ID request and adds the other information processing apparatus to the management loop as a participating node. The unique ID message management function module 15 is a module that creates the unique ID request, the unique ID response, the withdrawal message from the management loop described later, and similar messages. The message transmission/reception function module 16 is a module that sends and receives these messages between the management node and the participating nodes.

For example, when the information processing apparatus has started operating as the management node and receives its first unique ID request from another information processing apparatus, the management node function module 13 issues a unique ID for creating a management loop with that apparatus and returns the unique ID in a unique ID response. If a unique ID request is received from yet another information processing apparatus after the management loop has been created, the management node function module 13 determines that participation in the existing management loop has been requested. When adding this participating node, the management node function module 13 issues a new unique ID for the management loop to replace the old one and sends it to all participating nodes in a unique ID response.

When a participating node is added, the management node function module 13 also updates the collection node list table 18. The collection node list table 18 is a table that holds information about the participating nodes on the management loop. The information held in the collection node list table 18 is used to generate the life message. The life message management function module 14 is a module that creates and sends the life message. When the information processing apparatus is operating as the management node, the management node function module 13 instructs the life message management function module 14, at predetermined intervals, to create and send a life message. The management node function module 13 refers to the life message that has circulated through the participating nodes and been collected by the life message management function module 14 to check the operating status of the participating nodes.
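The join handling and life message creation described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the message formats of the embodiment: the dictionary fields (`type`, `unique_id`, `circulation_order`, `received_by`) are hypothetical, the unique ID is assumed to be any freshly generated value, and a plain list stands in for the collection node list table 18.

```python
import uuid


class ManagementNode:
    """Hypothetical model of the management node's join handling."""

    def __init__(self, address: str):
        self.address = address
        self.collected_nodes = []  # stands in for the collection node list table 18
        self.unique_id = None      # identifies the current management loop

    def handle_unique_id_request(self, requester: str) -> dict:
        """Add the requester to the loop and reissue the loop's unique ID.

        Returns the unique ID responses to send, keyed by destination address.
        """
        if requester not in self.collected_nodes:
            self.collected_nodes.append(requester)
        # Assumption: any fresh value can serve as the new unique ID.
        self.unique_id = uuid.uuid4().hex
        response = {"type": "unique_id_response", "unique_id": self.unique_id}
        return {node: dict(response) for node in self.collected_nodes}

    def build_life_message(self) -> dict:
        """Create a life message from the collected node list."""
        return {
            "type": "life",
            "unique_id": self.unique_id,
            "circulation_order": list(self.collected_nodes),
            "received_by": [],  # filled in by the participating nodes as they relay
        }
```

In this sketch the first request yields a single response to the requester, while later requests yield responses to every participating node, which mirrors the reissue of the unique ID on each addition.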

If the life message is not collected, the management node function module 13 determines that one of the participating nodes has stopped and re-creates the management loop with the remaining participating nodes (those other than the stopped node). To do so, the management node function module 13 waits for unique ID requests from the remaining participating nodes. On receiving a unique ID request, the management node function module 13 issues a new unique ID for the re-created management loop and returns it in a unique ID response. At this time, the management node function module 13 also updates the collection node list table 18. In this way, the management loop is re-created and the life message circulates in the re-created loop.
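The re-creation step can be sketched as below. The text does not say how the management node decides that the life message was "not collected", so this sketch assumes a simple timeout measured from the moment the message was sent; `LoopSupervisor`, `rebuild_members`, and the timeout value are all hypothetical.

```python
class LoopSupervisor:
    """Hypothetical timeout tracking on the management node (timeout is an assumption)."""

    def __init__(self, collect_timeout: float):
        self.collect_timeout = collect_timeout
        self.sent_at = None
        self.rebuilding = False

    def life_message_sent(self, now: float) -> None:
        self.sent_at = now

    def life_message_collected(self) -> None:
        self.sent_at = None

    def check(self, now: float) -> bool:
        """Return True once the loop must be re-created."""
        if self.sent_at is not None and now - self.sent_at > self.collect_timeout:
            self.rebuilding = True
            self.sent_at = None
        return self.rebuilding


def rebuild_members(requesters):
    """Keep only nodes that sent a new unique ID request, in arrival order."""
    members = []
    for address in requesters:
        if address not in members:
            members.append(address)
    return members


supervisor = LoopSupervisor(collect_timeout=5.0)
supervisor.life_message_sent(now=100.0)
if supervisor.check(now=106.0):
    print(rebuild_members(["n1", "n3", "n1"]))
```

A stopped node never sends a new unique ID request, so it simply does not appear in the rebuilt member list.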

The participating node function module 12 performs the processing for joining the management loop. When the management loop operation program 10 starts and the information processing apparatus begins operating as a participating node, the participating node function module 12 broadcasts the unique ID request described above in order to join the management loop as a participating node. The participating node function module 12 determines whether it has been able to join the management loop from whether the management node returns the unique ID response described above. The participating node function module 12 also confirms, from the status of life message transmission and reception by the life message management function module 14, that its own node has joined the management loop correctly.

In addition to the unique ID, the life message carries the circulation order recorded by the management node. After joining the management loop as a participating node, the participating node function module 12 sends and receives the life message through the life message management function module 14 according to this circulation order. In doing so, the participating node function module 12 records, in the life message sent and received by the life message management function module 14, the status of the life message's reception by its own node.
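The relay behaviour of a participating node can be sketched as follows. The field names (`circulation_order`, `received_by`) are hypothetical, and the sketch assumes that the last node in the circulation order hands the life message back to the management node, which is one plausible way for the message to be collected.

```python
def next_hop(message: dict, self_address: str, manager_address: str) -> str:
    """Return the next destination according to the recorded circulation order."""
    order = message["circulation_order"]
    index = order.index(self_address)
    if index + 1 < len(order):
        return order[index + 1]
    # Assumption: the last participating node returns the message to the manager.
    return manager_address


def relay_life_message(message: dict, self_address: str, manager_address: str):
    """Record this node's reception and return (destination, updated message)."""
    updated = dict(message)
    updated["received_by"] = message["received_by"] + [self_address]
    return next_hop(message, self_address, manager_address), updated


life = {
    "type": "life",
    "unique_id": "example-id",
    "circulation_order": ["n1", "n2"],
    "received_by": [],
}
destination, life = relay_life_message(life, "n1", "mgr")
print(destination, life["received_by"])
```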

After the node has joined the management loop as a participating node, if the life message is not received within a predetermined period, or if sending the life message fails, the participating node function module 12 determines that one of the participating nodes has stopped and sends the management node a unique ID request in order to join the management loop that the management node is expected to re-create. When a new unique ID is returned in a unique ID response to this request, the node resumes operation as a participating node following the procedure described above.

If no unique ID response is returned, the participating node function module 12 determines that the management node has stopped. In this case, the participating node function module 12 selects, from among the participating nodes in the management loop, the node that should become the alternative management node, based on the circulation order recorded in the last life message that was sent or received.

If the selected node is not the own node, the participating node function module 12 sends a unique ID request to the other node that is to become the alternative management node, and then resumes operation as a participating node. If the selected node is the own node, the participating node function module 12 notifies the node management function module 11 of an instruction to shift to being the alternative management node. On receiving this notification, the node management function module 11 performs the settings needed to move the information processing apparatus from the state in which the participating node function module 12 is running to the state in which the management node function module 13 is running. In this case, unique ID requests arrive from the other nodes, so the apparatus takes up operation as the management node: issuing the unique ID, returning unique ID responses, updating the collection node list table 18, creating and sending the life message, and so on.
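The failover decision described above can be sketched as follows. The text states only that the alternative management node is chosen from the circulation order recorded in the last life message; the concrete rule used here (the first node in that order takes over) is an assumption for illustration, and the function names and action labels are hypothetical.

```python
def select_alternative_manager(last_life_message: dict) -> str:
    """Assumed rule: the first node in the recorded circulation order takes over."""
    return last_life_message["circulation_order"][0]


def on_no_unique_id_response(self_address: str, last_life_message: dict):
    """Decide what a participating node does once the manager is presumed stopped.

    Returns a pair (action, target): either the node becomes the management
    node itself, or it sends a unique ID request to the selected successor.
    """
    successor = select_alternative_manager(last_life_message)
    if successor == self_address:
        return ("become_management_node", None)
    return ("send_unique_id_request", successor)


last_message = {"circulation_order": ["n1", "n2", "n3"]}
print(on_no_unique_id_response("n1", last_message))
print(on_no_unique_id_response("n3", last_message))
```

Because every participating node evaluates the same recorded order, all nodes arrive at the same successor without exchanging further messages, provided they hold the same last life message.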

また、参加ノード機能モジュール１２は、ユニークＩＤ要求を受信すると、新たなノードが管理ループへの参加を要求していると判断し、当該新たなノードを加えた管理ループを再作成した管理ノードが送ってくるであろうユニークＩＤ応答を待機して受信する。このユニークＩＤ応答を受信すると、参加ノード機能モジュール１２は、参加ノードとしての動作を再開する。 In addition, upon receiving a unique ID request, the participating node function module 12 determines that a new node is requesting participation in the management loop, and waits for and receives the unique ID response expected from the management node, which re-creates the management loop with the new node added. Upon receiving this unique ID response, the participating node function module 12 resumes operation as a participating node.

管理ノードからの脱退は、脱退メーセージをブロードキャスト送信することによって実施する。管理ノードとして参加ノードから脱退メーセージを受けた場合、管理ノード機能モジュール１３は、この参加ノードに関する情報を削除すべく収集ノード一覧テーブル１８を更新し、新たなユニークＩＤを発番してその他の参加ノードに対してユニークＩＤ更新として送信する。これにより、脱退メッセージを送信してきた参加ノードを除いた管理ループが再作成される。 Withdrawal from the management node is performed by broadcasting a withdrawal message. When, acting as the management node, it receives a withdrawal message from a participating node, the management node function module 13 updates the collection node list table 18 to delete the information on that participating node, issues a new unique ID, and transmits it to the other participating nodes as a unique ID update. As a result, the management loop is re-created without the participating node that transmitted the withdrawal message.

また、参加ノードとして脱退メーセージを受けた場合、参加ノード機能モジュール１２は、それが管理ノードからのものか否かを調べ、管理ノードからのものでなければ、管理ノードから送信されてくるであろうユニークＩＤ更新を待機する。一方、管理ノードからのものであった場合には、管理ループに参加する参加ノードの中で代替管理ノードに移行すべきノードを最後に送受信されたライフメッセージに記録された巡回順位に基づいて選定する。代替管理ノードに移行すべきノードが自ノードでなければ、代替管理ノードから送信されてくるであろうユニークＩＤ更新を待機する。 Also, when it receives a withdrawal message as a participating node, the participating node function module 12 checks whether the message came from the management node. If it did not, the module waits for the unique ID update expected from the management node. If it did come from the management node, the module selects, from among the participating nodes in the management loop, the node that should shift to the alternative management node, based on the circulation order recorded in the last transmitted or received life message. If the node that should shift to the alternative management node is not the own node, the module waits for the unique ID update expected from the alternative management node.

そして、代替管理ノードに移行すべきノードが自ノードであった場合は、代替管理ノードへの移行指示をノード管理機能モジュール１１に通知する。これに伴って稼働された管理ノード機能モジュール１３は、最後に送受信されたライフメッセージから収集ノード一覧テーブル１８を作成し、新たなユニークＩＤを発番してその他の参加ノードに対してユニークＩＤ更新として送信する。これにより、脱退メッセージを送信してきた管理ノードが除かれ、かつ、自ノードが代替管理ノードとなった管理ループが再作成される。 If the node that should shift to the alternative management node is the own node, the module notifies the node management function module 11 of an instruction to shift to the alternative management node. The management node function module 13 activated as a result creates the collection node list table 18 from the last transmitted or received life message, issues a new unique ID, and transmits it to the other participating nodes as a unique ID update. As a result, the management loop is re-created with the management node that transmitted the withdrawal message removed and with the own node as the alternative management node.

なお、前述したように、ライフメッセージの巡回が途絶えると、いずれかのノードが停止したものと各ノードが判断し、管理ループを再作成すべく各々動作するので、脱退メーセージを用いることなく、脱退させたいノードを停止させることもできる。 As described above, when the circulation of the life message stops, each node determines that one of the nodes has stopped and operates to re-create the management loop. A node can therefore also be withdrawn simply by stopping it, without using a withdrawal message.

つまり、以上のような構成を持つ本情報処理装置が（同等の構成を持つ）他の情報処理装置と構築する冗長構成は、当該冗長構成を構築する複数の情報処理装置それぞれが適応的に動作することで、全体として、適切な運用機能が実現される。 In other words, in the redundant configuration that the information processing apparatus having the above configuration constructs with other information processing apparatuses (having an equivalent configuration), each of the information processing apparatuses in the configuration operates adaptively, so that an appropriate operation function is realized as a whole.

次に、図３乃至図１２を参照して、本情報処理装置が他の情報処理装置と構築する冗長構成で実現される運用機能の基本原理の概要について説明する。 Next, with reference to FIG. 3 to FIG. 12, an outline of the basic principle of the operation function realized by the redundant configuration constructed by the information processing apparatus with another information processing apparatus will be described.

まず、図３乃至図６を参照して、管理ループ作成時の各ノードの動作原理について説明する。 First, the operation principle of each node when a management loop is created will be described with reference to FIGS. 3 to 6.

ここでは、図３に示すように、ノードＡが管理ノード、ノードＢが参加ノードとして動作を開始したものとする。 Here, as shown in FIG. 3, it is assumed that the operation starts with node A as the management node and node B as the participating node.

参加ノードとして動作を開始したノードＢは、ネットワーク上に存在すると思われる管理ノードに向けてユニークＩＤ要求ａ１を送信する（図３の（Ａ））。このユニークＩＤ要求は、ブロードキャストまたはマルチキャストのネットワークプロトコル（例えばＴＣＰ／ＩＰ、ＮｅｔＢＩＯＳ、ＭＡＣなど)によって送信される。図４は、ユニークＩＤ要求のメッセージフォーマットの一例を示す図である。 The node B that has started to operate as a participating node transmits a unique ID request a1 to a management node that seems to exist on the network ((A) in FIG. 3). This unique ID request is transmitted by a broadcast or multicast network protocol (for example, TCP / IP, NetBIOS, MAC, etc.). FIG. 4 is a diagram illustrating an example of the message format of the unique ID request.

一方、管理ノードとして動作を開始したノードＡは、このブロードキャストを受信すると、ライフメッセージの送信リスト、つまり収集ノード一覧テーブル１８にノードＢを追加し、ユニークＩＤ応答ａ２をブロードキャストまたはマルチキャストのネットワークプロトコルで返信する(図３の（Ｂ））。図５は、ユニークＩＤ応答／ユニークＩＤ更新のメッセージフォーマットの一例を示す図である。 On the other hand, when node A, which has started operating as the management node, receives this broadcast, it adds node B to the life message transmission list, that is, the collection node list table 18, and returns a unique ID response a2 using a broadcast or multicast network protocol ((B) in FIG. 3). FIG. 5 is a diagram illustrating an example of the message format of the unique ID response / unique ID update.

参加ノードは、ユニークＩＤ応答を受信後、ライフメッセージの受信待ちに入る。管理ノードが同一ネットワークに存在する場合、参加ノードは、最初に受信したユニークＩＤ応答に含まれるユニークＩＤを識別子として、対応する管理ノードから直接送信されるか参加ノード間で転送されるライフメッセージを受信待ちする。図６は、ライフメッセージのメッセージフォーマットの一例を示す図である。 After receiving the unique ID response, the participating node waits to receive a life message. When a management node exists on the same network, the participating node uses the unique ID contained in the first unique ID response it received as an identifier, and waits for a life message that is either sent directly from the corresponding management node or forwarded between participating nodes. FIG. 6 is a diagram illustrating an example of the message format of the life message.

そして、参加ノードは、（ライフメッセージに含まれる）送信リストへの登録順位に従って、このライフメッセージの転送（ａ３）を実行する（図３の（Ｃ））。ライフメッセージを転送する際、参加ノードは、自ノード情報のシーケンス番号の更新し、前回メッセージ受信時刻からの差分時間を更新する。なお、自ノードが参加ノードとして接続直後の場合は、前回メッセージ受信時刻からの差分時間は０とする。 Then, the participating node executes the transfer (a3) of this life message according to the registration order in the transmission list (included in the life message) ((C) in FIG. 3). When transferring the life message, the participating node updates the sequence number of its own node information and updates the difference time from the previous message reception time. Note that if the local node has just joined as a participating node, the difference time from the previous message reception time is zero.
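The forwarding step described above can be sketched as follows. This is a minimal illustration, not the format of FIG. 6: the names `NodeEntry`, `LifeMessage`, and `forward_life_message` are hypothetical, and the sketch assumes that the transmission list carried in the life message includes the management node as its first entry, so that the last participating node hands the message back to it.

```python
from dataclasses import dataclass


@dataclass
class NodeEntry:
    # Hypothetical per-node record carried inside the life message.
    name: str
    sequence: int = 0
    delta_since_last: float = 0.0  # seconds since this node's previous reception


@dataclass
class LifeMessage:
    unique_id: int
    send_list: list  # NodeEntry objects in registration (circulation) order


def forward_life_message(msg, self_name, now, last_received_at=None):
    """Update this node's entry and return the name of the next node in the loop."""
    names = [entry.name for entry in msg.send_list]
    idx = names.index(self_name)
    entry = msg.send_list[idx]
    entry.sequence += 1
    # A node that has only just joined reports a difference time of 0.
    entry.delta_since_last = 0.0 if last_received_at is None else now - last_received_at
    # Wrap around so the last node returns the message to the first entry.
    return names[(idx + 1) % len(names)]
```

Actual transmission to the returned node is left out; only the bookkeeping on the message is shown.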

一方、管理ノードは、ライフメッセージが参加ノードを一周して最後の参加ノードから返信された時間を元にライフメッセージに記載するタイムアウト値を更新し、各ノードにライフメッセージにより通知する。 On the other hand, the management node updates the timeout value described in the life message based on the time when the life message goes around the participating nodes and is returned from the last participating node, and notifies each node by the life message.
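The text does not state how the timeout value is derived from the round-trip time, so the following is only one plausible sketch. The class name `LoopTimer`, the safety factor `margin`, and the lower bound `floor` are all assumptions introduced for illustration.

```python
class LoopTimer:
    """Derives a timeout value from the measured circulation time of a life message."""

    def __init__(self, margin=2.0, floor=1.0):
        self.margin = margin  # assumed multiplier applied to the round-trip time
        self.floor = floor    # assumed minimum timeout in seconds
        self._sent_at = None

    def start_round(self, now):
        # Called when the management node sends the life message into the loop.
        self._sent_at = now

    def finish_round(self, now):
        # Called when the last participating node returns the life message.
        if self._sent_at is None:
            raise RuntimeError("finish_round called before start_round")
        round_trip = now - self._sent_at
        self._sent_at = None
        return max(self.floor, round_trip * self.margin)
```

The returned value would then be written into the next life message so that every node works with the same timeout.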

図７は、管理ループへのノード追加時の各ノードの動作原理について説明するための概念図である。 FIG. 7 is a conceptual diagram for explaining the operation principle of each node when adding a node to the management loop.

ここでは、ノードＡ（管理ノード）およびノードＢ（参加ノード）による既存の管理ループが存在し、既にライフメッセージを交換している状態で、新規の参加要求ノード（ノードＣ）が現れた場合を想定する。 Here, it is assumed that a management loop consisting of node A (management node) and node B (participating node) already exists and is exchanging life messages when a new participation request node (node C) appears.

参加要求ノードであるノードＣは、前述のように、ユニークＩＤ要求ｂ２をブロードキャスト（またはマルチキャスト）する（図７の（Ａ））。このブロードキャストを受け取った、管理ループに参加するノードは、ライフメッセージの交換（ｂ１）を停止して、管理ノードからのユニークＩＤ応答を待つ（図７の（Ｂ））。また、管理ノードは、このノードＣを追加すべく送信リストを更新し、かつ、ユニークＩＤを更新して、すべての参加ノードにユニークＩＤ応答ｂ３を送信する。 Node C, which is the participation request node, broadcasts (or multicasts) the unique ID request b2 as described above ((A) in FIG. 7). Upon receiving this broadcast, the node participating in the management loop stops life message exchange (b1) and waits for a unique ID response from the management node ((B) of FIG. 7). Further, the management node updates the transmission list to add this node C, updates the unique ID, and transmits the unique ID response b3 to all the participating nodes.

管理ノードより更新後のユニークＩＤが含まれるユニークＩＤ応答が送信されると、各ノードは、保持していたユニークＩＤを更新して、ライフメッセージ受信待ち状態に移行する。これにより、管理ノードを基点とした、新たなユニークＩＤに基づくライフメッセージ送信ｂ４が再開される（図７の（Ｃ））。 When the unique ID response including the updated unique ID is transmitted from the management node, each node updates the held unique ID and shifts to a life message reception waiting state. Thereby, the life message transmission b4 based on the new unique ID with the management node as the base point is resumed ((C) in FIG. 7).
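The management node's side of the node-addition sequence of FIG. 7 can be sketched as below. The class `ManagementNode`, the use of a simple counter for issuing unique IDs, and the dictionary layout of the response are assumptions for illustration; the actual response format is the one shown in FIG. 5.

```python
import itertools


class ManagementNode:
    def __init__(self, name):
        self.name = name
        self.send_list = [name]          # circulation order, management node first
        self._ids = itertools.count(1)   # assumed ID scheme: a monotonic counter
        self.unique_id = next(self._ids)

    def handle_unique_id_request(self, requester):
        """Add the requester to the transmission list and issue a fresh unique ID."""
        if requester not in self.send_list:
            self.send_list.append(requester)
        self.unique_id = next(self._ids)
        # The response goes to every participating node, not only the requester,
        # so that all of them replace the unique ID they were holding.
        return {
            "type": "unique_id_response",
            "unique_id": self.unique_id,
            "recipients": self.send_list[1:],
        }
```

After sending the response, the management node would restart life message transmission under the new unique ID.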

次に、図８および図９を参照して、管理ループからのノード脱退時の各ノードの動作原理について説明する。 Next, with reference to FIG. 8 and FIG. 9, the operation principle of each node when the node leaves the management loop will be described.

ここでは、ノードＡ（管理ノード）とノードＢおよびノードＣ（参加ノード）とによる既存の管理ループが存在し、既にライフメッセージを交換している状態で、１つの参加ノード（ノードＣ）が管理ループから脱退する場合を想定する。 Here, it is assumed that a management loop consisting of node A (management node) and nodes B and C (participating nodes) already exists and is exchanging life messages when one participating node (node C) withdraws from the management loop.

管理ループから脱退するノードＣは、脱退メッセージｃ２をブロードキャスト(またはマルチキャスト)する（図８の（Ａ））。図９は、ユニークＩＤ要求のメッセージフォーマットの一例を示す図である。 The node C leaving the management loop broadcasts (or multicasts) the withdrawal message c2 ((A) in FIG. 8). FIG. 9 is a diagram illustrating an example of a message format of the unique ID request.

このブロードキャストを受け取った、管理ループに参加するノードは、ライフメッセージの交換（ｃ１）を停止して、管理ノードからのユニークＩＤ更新ｃ３を待つ。また、管理ノードは、このノードＣを削除すべく送信リストを更新し、かつ、ユニークＩＤを更新して、残りのすべての参加ノードにユニークＩＤ更新ｃ３を送信する（図８の（Ｂ））。 The node that receives this broadcast and participates in the management loop stops life message exchange (c1) and waits for the unique ID update c3 from the management node. Further, the management node updates the transmission list to delete this node C, updates the unique ID, and transmits the unique ID update c3 to all the remaining participating nodes ((B) in FIG. 8). .

管理ノードより更新後のユニークＩＤが含まれるユニークＩＤ応答が送信されると、各ノードは、保持していたユニークＩＤを更新して、ライフメッセージ受信待ち状態に移行する。これにより、管理ノードを基点とした、新たなユニークＩＤに基づくライフメッセージ送信ｃ４が再開される（図８の（Ｃ））。 When the unique ID response including the updated unique ID is transmitted from the management node, each node updates the held unique ID and shifts to a life message reception waiting state. Thereby, the life message transmission c4 based on the new unique ID with the management node as the base point is resumed ((C) in FIG. 8).

図１０は、管理ループ内の参加ノードの応答が停止した時の各ノードの動作原理について説明するための概念図である。 FIG. 10 is a conceptual diagram for explaining the operation principle of each node when the response of the participating node in the management loop stops.

ここでは、ノードＡ（管理ノード）とノードＢおよびノードＣ（参加ノード）とによる既存の管理ループが存在し、既にライフメッセージを交換している状態で、１つの参加ノード（ノードＣ）からのライフメッセージ送信が停止された場合を想定する。 Here, it is assumed that a management loop consisting of node A (management node) and nodes B and C (participating nodes) already exists and is exchanging life messages when life message transmission from one participating node (node C) stops.

管理ノードを含むすべてのノードは一定時間のタイムアウト値を共有しており、図６に示したライフメッセージフォーマットの「前回メッセージ受信時刻からの差分時間」から起算したタイムアウト処理の管理を各々が実施する。従って、ノードＣ自体の故障発生やネットワークの切断などによりライフメッセージｄ１の交換が止まった状態となると（図１０の（Ａ））、その他のすべてのノードでタイムアウトが検出されることになる。 All nodes, including the management node, share a fixed timeout value, and each node manages its own timeout processing, counted from the "difference time from previous message reception time" in the life message format shown in FIG. 6. Therefore, when the exchange of the life message d1 stops because of a failure of node C itself, a network disconnection, or the like ((A) in FIG. 10), a timeout is detected at all of the other nodes.
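A per-node timeout check of the kind described above might look like the following sketch. The class name `LifeMessageWatchdog` is hypothetical, and for simplicity the sketch measures elapsed time from the local reception time of the last life message rather than reproducing the exact calculation based on the FIG. 6 field.

```python
class LifeMessageWatchdog:
    """Tracks when the last life message arrived and reports a timeout."""

    def __init__(self, timeout, started_at):
        self.timeout = timeout              # shared timeout value in seconds
        self.last_received_at = started_at

    def on_life_message(self, now, timeout=None):
        # Each received life message resets the timer; it may also carry
        # an updated timeout value from the management node.
        self.last_received_at = now
        if timeout is not None:
            self.timeout = timeout

    def expired(self, now):
        return now - self.last_received_at > self.timeout
```

When `expired` becomes true, the node would stop waiting for life messages and send a unique ID request, as described in the following paragraphs.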

そうすると、すべての参加ノードが、管理ノードに対してユニークＩＤ要求ｄ２を送信する（図１０の（Ｂ））。管理ノードは、ユニークＩＤ要求を送信してきたノードからなる送信リストを再作成し、かつ、ユニークＩＤを更新して、当該ユニークＩＤ要求を送信してきたノードにユニークＩＤ更新ｄ３を送信する（図１０の（Ｃ））。 Then, all the participating nodes transmit a unique ID request d2 to the management node ((B) in FIG. 10). The management node re-creates a transmission list including nodes that have transmitted the unique ID request, updates the unique ID, and transmits a unique ID update d3 to the node that has transmitted the unique ID request (FIG. 10). (C)).

管理ノードより更新後のユニークＩＤが含まれるユニークＩＤ更新が送信されると、各ノードは、保持していたユニークＩＤを更新して、ライフメッセージ受信待ち状態に移行する。これにより、管理ノードを基点とした、新たなユニークＩＤに基づくライフメッセージ送信ｄ４が再開される（図１０の（Ｄ））。 When the unique ID update including the updated unique ID is transmitted from the management node, each node updates the held unique ID and shifts to a life message reception waiting state. Thereby, the life message transmission d4 based on the new unique ID with the management node as the base point is resumed ((D) in FIG. 10).
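The re-creation step of FIG. 10 amounts to rebuilding the transmission list from whichever nodes sent a unique ID request, so a stopped node drops out without being named explicitly. A minimal sketch, in which the function name `rebuild_loop` and the increment-by-one ID scheme are assumptions:

```python
def rebuild_loop(manager, requesters, previous_id):
    """Return (send_list, new_unique_id, updates) for the re-created loop.

    `requesters` are the nodes whose unique ID requests arrived after the
    timeout; nodes that stayed silent are simply left out.
    """
    send_list = [manager]
    for node in requesters:
        if node not in send_list:  # ignore duplicate requests
            send_list.append(node)
    new_id = previous_id + 1       # assumed ID scheme
    # One unique ID update per surviving participating node.
    updates = {node: new_id for node in send_list[1:]}
    return send_list, new_id, updates
```

For example, if only node B answers after node C has stopped, the new loop consists of nodes A and B.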

また、図１１および図１２は、管理ループ内の管理ノードの応答が停止した時の各ノードの動作原理について説明するための概念図である。 FIGS. 11 and 12 are conceptual diagrams for explaining the operation principle of each node when the response of the management node in the management loop stops.

ここでは、ノードＡ（管理ノード）とノードＢおよびノードＣ（参加ノード）とによる既存の管理ループが存在し、既にライフメッセージを交換している状態で、管理ノードであるノードＡからのライフメッセージ送信が停止された場合を想定する。 Here, it is assumed that a management loop consisting of node A (management node) and nodes B and C (participating nodes) already exists and is exchanging life messages when life message transmission from node A, the management node, stops.

前述したように、管理ノードを含むすべてのノードは一定時間のタイムアウト値を共有しており、図６に示したライフメッセージフォーマットの「前回メッセージ受信時刻からの差分時間」から起算したタイムアウト処理の管理を各々が実施する。従って、ノードＡ自体の故障発生やネットワークの切断などによりライフメッセージｅ１の交換が止まった状態となると（図１１の（Ａ））、その他のすべてのノードでタイムアウトが検出されることになる。 As described above, all nodes, including the management node, share a fixed timeout value, and each node manages its own timeout processing, counted from the "difference time from previous message reception time" in the life message format shown in FIG. 6. Accordingly, when the exchange of the life message e1 stops because of a failure of node A itself, a network disconnection, or the like ((A) in FIG. 11), a timeout is detected at all of the other nodes.

そうすると、すべての参加ノードが、管理ノードに対してユニークＩＤ要求ｅ２を送信する（図１０の（Ｂ））。しかしながら、管理ノードが存在しない状態のため、今度はユニークＩＤ要求に対するユニークＩＤ応答待ち処理がタイムアウトとなる。そこで、各参加ノードは、管理ノードが停止したものと判断する。 Then, all the participating nodes transmit a unique ID request e2 to the management node ((B) in FIG. 10). However, because there is no management node, the unique ID response wait process for the unique ID request times out this time. Therefore, each participating node determines that the management node has stopped.

この場合、図６に示すライフメッセージの順位番号で上位の参加ノードが代替管理ノードに昇格する。代替管理ノードに昇格した参加ノード以外の参加ノードは、昇格した代替管理ノードに対してユニークＩＤ要求ｅ３を改めて送信する（図１２の（Ａ））。ここでは、ノードＣよりも先に管理ループに参加したノードＢが代替管理ノードに昇格している例を示しているが、ライフメッセージの順位番号は、管理ループへの参加順に限定されるものではない。管理ループを作成し直す都度、管理ノードが、ライフメッセージの効率的な巡回ルートを探索し、その結果に応じて決定するようにしてもよい。 In this case, the participating node ranked highest by the rank number in the life message shown in FIG. 6 is promoted to the alternative management node. The participating nodes other than the promoted one transmit a unique ID request e3 again, this time to the promoted alternative management node ((A) in FIG. 12). The example here shows node B, which joined the management loop before node C, being promoted to the alternative management node, but the rank numbers in the life message are not limited to the order of participation in the management loop. Each time the management loop is re-created, the management node may search for an efficient circulation route for the life message and determine the rank numbers according to the result.
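Because every participating node holds the same last life message, each one can run the same selection locally and reach the same result without further communication. The sketch below assumes that a smaller rank number means a higher rank; the function names and the returned action tuples are hypothetical.

```python
def select_alternative_manager(rank_by_node, failed_manager):
    """Pick the highest-ranked surviving node from the last life message."""
    candidates = {n: r for n, r in rank_by_node.items() if n != failed_manager}
    if not candidates:
        return None
    # Assumption: the smallest rank number is the highest rank.
    return min(candidates, key=candidates.get)


def next_action(self_name, rank_by_node, failed_manager):
    """Decide what this node does once the management node is judged stopped."""
    chosen = select_alternative_manager(rank_by_node, failed_manager)
    if chosen == self_name:
        return ("promote_self", None)
    return ("send_unique_id_request", chosen)
```

With ranks `{"A": 0, "B": 1, "C": 2}` and node A stopped, node B promotes itself and node C sends its unique ID request to node B, matching the example of FIG. 12.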

昇格した代替管理ノードは、ユニークＩＤ要求を送信してきたノードからなる送信リストを作成し、かつ、ユニークＩＤを更新して、当該ユニークＩＤ要求を送信してきたノードにユニークＩＤ更新ｅ４を送信する（図１２の（Ｂ））。 The promoted alternative management node creates a transmission list consisting of the nodes that transmitted a unique ID request, updates the unique ID, and transmits a unique ID update e4 to the nodes that transmitted the unique ID request ((B) in FIG. 12).

昇格した代替管理ノードより更新後のユニークＩＤが含まれるユニークＩＤ更新が送信されると、各ノードは、保持していたユニークＩＤを更新して、ライフメッセージ受信待ち状態に移行する。これにより、昇格した代替管理ノードを基点とした、新たなユニークＩＤに基づくライフメッセージ送信ｅ５が再開される（図１２の（Ｃ））。 When a unique ID update including the updated unique ID is transmitted from the promoted alternative management node, each node updates the held unique ID and shifts to a life message reception waiting state. As a result, the life message transmission e5 based on the new unique ID is resumed with the promoted alternative management node as a base point ((C) of FIG. 12).

また、図１３乃至図１６は、本情報処理装置の冗長構成の運用機能に関わる動作手順を示すタイミングチャートである。 FIG. 13 to FIG. 16 are timing charts showing the operation procedure related to the operation function of the redundant configuration of the information processing apparatus.

まず、図１３を参照して、管理ノードの開始処理の動作手順について説明する。 First, the operation procedure of the management node start process will be described with reference to FIG.

管理ノードの開始処理は、ノード管理機能モジュール１１によって管理ノード初期設定読込処理が実行された場合に起動する（図１３のｆ１）。この場合、ノード管理機能モジュール１１は、管理ノードを構成すべく、管理ノード機能モジュール１３に対し、初期化処理の実行を指示する（図１３のｆ２）。これにより、管理ノード機能モジュール１３の制御下で、ユニークＩＤメッセージ管理機能モジュール１５、ライフメッセージ管理機能モジュール１４、メッセージ送受信機能モジュール１６の初期化処理が実行されることになる（図１３のｆ３，ｆ４，ｆ５）。 The management node start process is started when the management node initial setting read process is executed by the node management function module 11 (f1 in FIG. 13). In this case, the node management function module 11 instructs the management node function module 13 to execute an initialization process to configure a management node (f2 in FIG. 13). As a result, the initialization process of the unique ID message management function module 15, the life message management function module 14, and the message transmission / reception function module 16 is executed under the control of the management node function module 13 (f3 in FIG. 13). f4, f5).

この初期化処理が終了すると、管理ノード機能モジュール１３が、管理ループの構成処理を開始し（図１３のｆ６）、ユニークＩＤメッセージ管理機能モジュール１５に対し、参加ノード情報の収集処理を指示する（図１３のｆ７）。ユニークＩＤメッセージ管理機能モジュール１５は、ユニークＩＤの収集を開始し（図１３のｆ７１）、メッセージ送受信機能モジュール１６に対し、ユニークＩＤメッセージの受信要求を通知する（図１３のｆ７１１）。これにより、メッセージ送受信機能モジュール１６は、メッセージ受信待ち処理に入る（図１３のｆ７１１１）。 When this initialization process ends, the management node function module 13 starts the management loop configuration process (f6 in FIG. 13) and instructs the unique ID message management function module 15 to collect participating node information (f7 in FIG. 13). The unique ID message management function module 15 starts collecting unique IDs (f71 in FIG. 13) and notifies the message transmission / reception function module 16 of a unique ID message reception request (f711 in FIG. 13). As a result, the message transmission / reception function module 16 enters a message reception waiting process (f7111 in FIG. 13).

メッセージ送受信機能モジュール１６は、ユニークＩＤメッセージを受信すると、その旨をユニークＩＤメッセージ管理機能モジュール１５に通知する（図１３のｆ８）。ユニークＩＤメッセージ管理機能モジュール１５は、管理ノード機能モジュール１３に対し、参加ノードの追加を要求する（図１３のｆ８１）。管理ノード機能モジュール１３は、この要求に対する返答を行い、収集ノード一覧の更新を実行する（図１３のｆ９）。一方、管理ノード機能モジュール１３からの返答を受けたユニークＩＤメッセージ管理機能モジュール１５は、メッセージ送受信機能モジュール１６に対し、ユニークＩＤ応答メッセージの送信要求を通知する（図１３のｆ８２）。これにより、メッセージ送受信機能モジュール１６によるメッセージ送信処理が実行される（図１３のｆ８２１）。 When receiving the unique ID message, the message transmission / reception function module 16 notifies the unique ID message management function module 15 to that effect (f8 in FIG. 13). The unique ID message management function module 15 requests the management node function module 13 to add a participating node (f81 in FIG. 13). The management node function module 13 replies to this request and updates the collection node list (f9 in FIG. 13). On the other hand, the unique ID message management function module 15 that has received a response from the management node function module 13 notifies the message transmission / reception function module 16 of a transmission request for a unique ID response message (f82 in FIG. 13). Thereby, the message transmission process by the message transmission / reception function module 16 is executed (f821 in FIG. 13).

また、管理ノード機能モジュール１３は、ライフメッセージ処理を起動する（図１３のｆ１０）。このライフメッセージ処理において、管理ノード機能モジュール１３は、収集ノード一覧に基づくライフメッセージ情報の作成処理を行い（図１３のｆ１０１）、ライフメッセージ管理機能モジュール１４に対し、ライフメッセージ処理の開始を指示する（図１３のｆ１０２）。ライフメッセージ管理機能モジュール１４は、ライフメッセージ処理を起動し（図１３のｆ１０２１）、メッセージ送受信機能モジュール１６に対し、参加ノードへのライフメッセージの送信を指示する（図１３のｆ１０２１１）。これにより、メッセージ送受信機能モジュール１６による参加ノードへのライフメッセージの送信が実行される（図１３のｆ１０２１１１）。 Further, the management node function module 13 starts the life message process (f10 in FIG. 13). In this life message processing, the management node function module 13 creates life message information based on the collection node list (f101 in FIG. 13), and instructs the life message management function module 14 to start life message processing. (F102 in FIG. 13). The life message management function module 14 starts the life message processing (f1021 in FIG. 13), and instructs the message transmission / reception function module 16 to transmit a life message to the participating nodes (f10211 in FIG. 13). As a result, the message transmission / reception function module 16 transmits the life message to the participating node (f102111 in FIG. 13).

次に、ライフメッセージ管理機能モジュール１４は、メッセージ送受信機能モジュール１６に対し、ライフメッセージの受信要求を通知する（図１３のｆ１１）。これにより、メッセージ送受信機能モジュール１６は、メッセージ受信待ち処理に入る（図１３のｆ１１１）。メッセージ送受信機能モジュール１６は、ライフメッセージを受信すると、その旨をライフメッセージ管理機能モジュール１４に通知する（図１３のｆ１１２）。ライフメッセージ管理機能モジュール１４は、ライフメッセージの確認処理を行い（図１３のｆ１１２１）、問題がなければ、ライフメッセージの送信をメッセージ送受信機能モジュール１６に指示する（図１３のｆ１１２２）。以降、このライフメッセージの送信が繰り返される（図１３のｆ１１２２１，…）。 Next, the life message management function module 14 notifies the message transmission / reception function module 16 of a life message reception request (f11 in FIG. 13). As a result, the message transmission / reception function module 16 enters a message reception waiting process (f111 in FIG. 13). When the message transmission / reception function module 16 receives the life message, the message transmission / reception function module 16 notifies the life message management function module 14 (f112 in FIG. 13). The life message management function module 14 performs a life message confirmation process (f1121 in FIG. 13), and if there is no problem, instructs the message transmission / reception function module 16 to transmit the life message (f1122 in FIG. 13). Thereafter, the transmission of this life message is repeated (f11221,... In FIG. 13).

図１４は、参加ノードの開始処理手順を示すタイミングチャートである。 FIG. 14 is a timing chart showing the start processing procedure of the participating nodes.

参加ノードの開始処理では、まず、ノード管理機能モジュール１１が、参加ノードを構成すべく、参加ノード機能モジュール１２に対し、初期化処理の実行を指示する（図１４のｇ１）。これにより、参加ノード機能モジュール１２の制御下で、ユニークＩＤメッセージ管理機能モジュール１５、ライフメッセージ管理機能モジュール１４、メッセージ送受信機能モジュール１６の初期化処理が実行されることになる（図１４のｇ２，ｇ３，ｇ４）。 In the start process of the participating node, the node management function module 11 first instructs the participating node function module 12 to execute an initialization process in order to configure the participating node (g1 in FIG. 14). As a result, the initialization processes of the unique ID message management function module 15, the life message management function module 14, and the message transmission / reception function module 16 are executed under the control of the participating node function module 12 (g2, g3, g4 in FIG. 14).

この初期化処理が終了すると、参加ノード機能モジュール１２が、管理ループへの参加処理を開始し（図１４のｇ５）、ユニークＩＤメッセージ管理機能モジュール１５に対し、管理ループへの参加処理を指示する（図１４のｇ５１）。ユニークＩＤメッセージ管理機能モジュール１５は、メッセージ送受信機能モジュール１６に対し、ユニークＩＤ要求メッセージの送信要求を通知する（図１４のｇ５１１）。これにより、メッセージ送受信機能モジュール１６によるメッセージ送信処理が実行される（図１４のｇ５１１１）。 When this initialization process is completed, the participating node function module 12 starts the participation process in the management loop (g5 in FIG. 14), and instructs the unique ID message management function module 15 to participate in the management loop. (G51 in FIG. 14). The unique ID message management function module 15 notifies the message transmission / reception function module 16 of a transmission request for a unique ID request message (g511 in FIG. 14). Thereby, the message transmission processing by the message transmission / reception function module 16 is executed (g5111 in FIG. 14).

メッセージ送受信機能モジュール１６は、ユニークＩＤ応答メッセージを受信すると、その旨をユニークＩＤメッセージ管理機能モジュール１５に通知する（図１４のｇ６）。ユニークＩＤメッセージ管理機能モジュール１５は、参加ノード機能モジュール１２に対し、管理ループへの参加確認を指示する（図１４のｇ６１）。 When receiving the unique ID response message, the message transmission / reception function module 16 notifies the unique ID message management function module 15 (g6 in FIG. 14). The unique ID message management function module 15 instructs the participation node function module 12 to confirm participation in the management loop (g61 in FIG. 14).

また、参加ノード機能モジュール１２は、ライフメッセージ管理機能モジュール１４に対し、ライフメッセージ処理の開始を指示する（図１４のｇ７）。ライフメッセージ管理機能モジュール１４は、ライフメッセージ処理を起動し（図１４のｇ７１）、メッセージ送受信機能モジュール１６に対し、ライフメッセージの受信要求を通知する（図１４のｇ７１１）。これにより、メッセージ送受信機能モジュール１６は、メッセージ受信待ち処理に入る（図１４のｇ７１１１）。 Further, the participating node function module 12 instructs the life message management function module 14 to start life message processing (g7 in FIG. 14). The life message management function module 14 starts life message processing (g71 in FIG. 14), and notifies the message transmission / reception function module 16 of a life message reception request (g711 in FIG. 14). As a result, the message transmission / reception function module 16 enters a message reception waiting process (g7111 in FIG. 14).

メッセージ送受信機能モジュール１６は、ライフメッセージを受信すると、その旨をライフメッセージ管理機能モジュール１４に通知する（図１４のｇ８）。ライフメッセージ管理機能モジュール１４は、受信されたライフメッセージの更新処理を行い（図１４のｇ８１）、メッセージ送受信機能モジュール１６に対し、更新後のライフメッセージの送信を指示する（図１４のｇ８１１）。これにより、メッセージ送受信機能モジュール１６によるライフメッセージの送信が実行される（図１４のｇ８１１１）。 When the message transmission / reception function module 16 receives the life message, the message transmission / reception function module 16 notifies the life message management function module 14 (g8 in FIG. 14). The life message management function module 14 updates the received life message (g81 in FIG. 14), and instructs the message transmission / reception function module 16 to transmit the updated life message (g811 in FIG. 14). Accordingly, the life message is transmitted by the message transmission / reception function module 16 (g8111 in FIG. 14).

以降、ライフメッセージ管理機能モジュール１４は、このライフメッセージの送受信を繰り返す（図１４のｇ８２，…）。 Thereafter, the life message management function module 14 repeats transmission / reception of this life message (g82,... In FIG. 14).

また、図１５は、管理ループへの参加ノードの追加処理手順を示すタイミングチャートである。 FIG. 15 is a timing chart showing a procedure for adding a node participating in the management loop.

管理ループへの参加ノードの追加処理は、管理ループに参加するノード間でライフメッセージの送受信が行われている状態において（図１５のｈ１，…，ｈ１２）、新たなノード（参加要求ノード）の参加ノード機能モジュール１２が管理ループへの参加処理を開始させたことによって起動する（図１５のｈ２）。参加要求ノードの参加ノード機能モジュール１２は、ユニークＩＤメッセージ管理機能モジュール１５に対し、管理ループへの参加処理を指示する（図１５のｈ３）。これにより、参加要求ノードからユニークＩＤ要求メッセージがブロードキャストされる（図１５のｈ３１，ｈ３２）。 The process of adding a participating node to the management loop is triggered when, while life messages are being transmitted and received between the nodes participating in the management loop (h1, ..., h12 in FIG. 15), the participating node function module 12 of a new node (participation request node) starts the process of joining the management loop (h2 in FIG. 15). The participating node function module 12 of the participation request node instructs the unique ID message management function module 15 to perform the process of joining the management loop (h3 in FIG. 15). As a result, a unique ID request message is broadcast from the participation request node (h31, h32 in FIG. 15).

管理ノードでは、このユニークＩＤ要求メッセージを受信したユニークＩＤメッセージ管理機能モジュール１５が、管理ノード機能モジュール１３に対し、参加ノード追加要求を通知する（図１５のｈ３１１）。これに応答して、管理ノード機能モジュール１３は、ライフメッセージ管理機能モジュール１４に対し、ライフメッセージ処理の停止を指示する（図１５のｈ３１１１）。また、ユニークＩＤ要求メッセージを受信したユニークＩＤメッセージ管理機能モジュール１５は、参加要求ノードに対し、ユニークＩＤ応答メッセージを送信する（図１５のｈ３１２）。 In the management node, the unique ID message management function module 15 that has received this unique ID request message notifies the management node function module 13 of a participation node addition request (h311 in FIG. 15). In response to this, the management node function module 13 instructs the life message management function module 14 to stop the life message processing (h3111 in FIG. 15). Further, the unique ID message management function module 15 that has received the unique ID request message transmits a unique ID response message to the participation request node (h312 in FIG. 15).

一方、参加ノードでは、ユニークＩＤ要求メッセージを受信したユニークＩＤメッセージ管理機能モジュール１５が、参加ノード機能モジュール１２に対し、（自ノードが実行するライフメッセージ送受信のための）管理ループ再構成処理を指示する（図１５のｈ３２１）。これに応答して、参加ノード機能モジュール１２は、ライフメッセージ管理機能モジュール１４に対し、ライフメッセージ処理の再初期化を指示する（図１５のｈ３２１１）。この初期化後、ライフメッセージ管理機能モジュール１４は、ライフメッセージの受信待ち処理に入る（図１５のｈ３２１１１）。 On the other hand, in the participating node, the unique ID message management function module 15 that has received the unique ID request message instructs the participating node function module 12 to perform management loop reconfiguration processing (for life message transmission / reception executed by the own node). (H321 in FIG. 15). In response to this, the participating node function module 12 instructs the life message management function module 14 to reinitialize the life message process (h3211 in FIG. 15). After this initialization, the life message management function module 14 enters a life message reception waiting process (h32111 in FIG. 15).

参加要求ノードのユニークＩＤメッセージ管理機能モジュール１５は、管理ノードからのユニークＩＤ応答メッセージを受けると、参加ノード機能モジュール１２に対し、管理ループ参加確認を指示する（図１５のｈ３１２１）。これに応答して、参加ノード機能モジュール１２は、ライフメッセージ管理機能モジュール１４に対し、ライフメッセージ処理の開始を指示する（図１５のｈ３１２１１）。これにより、ライフメッセージ管理機能モジュール１４は、ライフメッセージ受信待ち処理に入る（図１５のｈ３１２１１１）。 When the unique ID message management function module 15 of the participation request node receives the unique ID response message from the management node, it instructs the participating node function module 12 to confirm participation in the management loop (h3121 in FIG. 15). In response, the participating node function module 12 instructs the life message management function module 14 to start life message processing (h31211 in FIG. 15). As a result, the life message management function module 14 enters a life message reception waiting process (h312111 in FIG. 15).

また、管理ノードでは、管理ノード機能モジュール１３が、収集ノード一覧の更新処理を実行して（図１５のｈ４）、ライフメッセージ処理を再開する（図１５のｈ５）。管理ノード機能モジュール１３は、更新後の収集ノード一覧に基づき、ライフメッセージ情報の作成処理を実行し（図１５のｈ５１）、ライフメッセージ管理機能モジュール１４に対し、ライフメッセージ処理開始を要求する（図１５のｈ５２）。これにより、ライフメッセージ管理機能モジュール１４によるライフメッセージの送信が実行され（図１５のｈ５２１）、以降、参加要求ノードを追加した管理ループでのライフメッセージの送受信が実行されることになる（図１５のｈ５２１１，…）。 In the management node, the management node function module 13 executes the collection node list update process (h4 in FIG. 15) and resumes life message processing (h5 in FIG. 15). Based on the updated collection node list, the management node function module 13 executes the life message information creation process (h51 in FIG. 15) and requests the life message management function module 14 to start life message processing (h52 in FIG. 15). As a result, the life message management function module 14 transmits the life message (h521 in FIG. 15), and thereafter life messages are transmitted and received in the management loop to which the participation request node has been added (h5211, ... in FIG. 15).

図１６は、代替管理ノードの処理手順を示すタイミングチャートである。 FIG. 16 is a timing chart showing the processing procedure of the alternative management node.

代替管理ノードの処理は、管理ループに参加するノード間でライフメッセージの送受信が行われている状態において（図１６のｊ１，…，ｊ１２１２２２１）、管理ノードによるライフメッセージの送信が停止したことによって起動する（図１６のｊ２）。各参加ノードのライフメッセージ管理機能モジュール１４は、ライフメッセージ受信タイムアウト処理を開始し（図１６のｊ３，ｊ４）、参加ノード機能モジュール１２に対し、管理ノードへの送信エラー通知処理を指示する（図１６のｊ３１，ｊ４１）。これに応答して、参加ノード機能モジュール１２は、ユニークＩＤメッセージ管理機能モジュール１５に対し、ユニークＩＤ再取得処理を指示する（図１６のｊ３１１，ｊ４１１）。 The process of the alternative management node is triggered when, while life messages are being transmitted and received between the nodes participating in the management loop (j1, ..., j1212221 in FIG. 16), the management node stops transmitting life messages (j2 in FIG. 16). The life message management function module 14 of each participating node starts a life message reception timeout process (j3, j4 in FIG. 16) and instructs the participating node function module 12 to perform a process of notifying a transmission error to the management node (j31, j41 in FIG. 16). In response, the participating node function module 12 instructs the unique ID message management function module 15 to perform a unique ID reacquisition process (j311, j411 in FIG. 16).

However, because the management node has stopped, the unique ID message management function module 15 of each participating node starts unique ID request transmission timeout processing (j3111, j4111 in FIG. 16). The unique ID message management function module 15 notifies the participating node function module 12 of a unique ID acquisition error (j3112, j4112 in FIG. 16). On receiving this notification, the participating node function module 12 executes alternative management node transition determination processing (j31121, j41121 in FIG. 16). As a result, exactly one of the participating nodes is promoted to alternative management node. The management node function module 13 of the promoted alternative management node instructs the unique ID message management function module 15 to perform unique ID reception processing (j311211 in FIG. 16). The participating node function module 12 of each of the other participating nodes again issues an instruction for unique ID reacquisition processing (j411211 in FIG. 16).
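
The transition determination can be sketched as a pure function. The claims state that a slave takes over when its life message relay order is the highest among the participating slaves; the sketch below assumes that this means the first slave in the relay order and that every node holds the same copy of that order. The function name and the returned action strings are hypothetical.

```python
def decide_next_step(own_address, slaves_in_relay_order):
    """Return the action a participating node takes after a unique ID acquisition error.

    slaves_in_relay_order: participating nodes in life message relay order,
    excluding the stopped management node (assumed to be known to every node).
    """
    if not slaves_in_relay_order:
        raise ValueError("no participating nodes")
    if slaves_in_relay_order[0] == own_address:
        # Assumed rule: the first slave in relay order becomes the alternative manager.
        return "promote_and_receive_unique_id_requests"
    # Every other node retries unique ID reacquisition against the new manager.
    return "retry_unique_id_request"
```

Because every node evaluates the same rule on the same relay order, exactly one node chooses promotion, which is the property the paragraph above relies on.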

A unique ID request is then exchanged between the unique ID message management function module 15 of a participating node that was again instructed to perform unique ID reacquisition processing and the unique ID message management function module 15 of the alternative management node that was instructed to perform unique ID reception processing (j5 in FIG. 16). The unique ID message management function module 15 of the alternative management node instructs the management node function module 13 to add the node to the management loop (j51 in FIG. 16).

In response, the management node function module 13 of the alternative management node updates the collection node list (j511 in FIG. 16) and reports completion of the addition to the management loop to the unique ID message management function module 15 (j512 in FIG. 16). The unique ID message management function module 15 then transmits a unique ID response to the unique ID message management function module 15 of the participating node (j5121 in FIG. 16).

On receiving the unique ID response, the unique ID message management function module 15 of the participating node notifies the participating node function module 12 that participation in the management loop is complete (j51211 in FIG. 16). In response, the participating node function module 12 instructs the life message management function module 14 to start life message processing (j512111 in FIG. 16).

Meanwhile, the management node function module 13 of the alternative management node also instructs the life message management function module 14 to start life message processing (j6 in FIG. 16). Transmission and reception of life messages in the new management loop thus begin.
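
The re-formation of the management loop under the alternative management node (j5 through j6 in FIG. 16) can be traced with a short simulation. This is an illustrative sketch only: the function `rejoin`, the trace strings, and the route representation are hypothetical, and the network exchange is collapsed into ordinary function steps.

```python
def rejoin(alt_manager, participants):
    """Simulate the remaining nodes rejoining under an alternative manager.

    Returns (route, trace): the new life message route and a log of steps.
    """
    collected = []   # collection node list of the new management loop
    trace = []
    for node in participants:
        trace.append(f"{node} -> {alt_manager}: unique ID request")    # j5
        collected.append(node)                                         # j511
        trace.append(f"{alt_manager} -> {node}: unique ID response")   # j5121
        trace.append(f"{node}: start life message processing")         # j512111
    trace.append(f"{alt_manager}: start life message processing")      # j6
    route = [alt_manager, *collected, alt_manager]
    return route, trace
```

Calling `rejoin("node-a", ["node-b", "node-c"])` produces a route that starts and ends at `node-a`, with the stopped management node no longer present.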

As described above, the information processing apparatus provides an operation function for a redundant configuration that allows the information processing apparatuses building the redundant configuration to be added or removed as needed.

The operation control processing of this embodiment can be implemented in software (a program). The same effects as those of this embodiment can therefore be obtained easily by installing the software on an ordinary computer from a computer-readable storage medium that stores it, and executing it.

Although several embodiments of the present invention have been described, these embodiments are presented by way of example and are not intended to limit the scope of the invention. These novel embodiments can be implemented in various other forms, and various omissions, replacements, and changes can be made without departing from the gist of the invention. These embodiments and their modifications are included in the scope and gist of the invention, and are included in the invention described in the claims and its equivalents.

DESCRIPTION OF SYMBOLS: 1 ... processor, 2 ... main memory, 3 ... external storage device, 4 ... input device, 5 ... display device, 6 ... communication device, 10 ... management loop operation program, 11 ... node management function module, 12 ... participating node function module, 13 ... management node function module, 14 ... life message management function module, 15 ... unique ID message management function module, 16 ... message transmission/reception function module, 17 ... user interface function module, 18 ... collection node list table, 20 ... network protocol, 30 ... OS (operating system)/driver.

Claims

An information processing apparatus that builds a redundant configuration with another information processing apparatus connected via a network, the information processing apparatus comprising:
life message processing means for detecting an abnormality by circulating a life message among the information processing apparatuses that build the redundant configuration;
master processing means for managing a circulation route of the life message as a master and, upon receiving a request to participate in the redundant configuration from another information processing apparatus, adding that other information processing apparatus as a slave to the circulation route of the life message; and
slave processing means for transmitting a request to participate in the redundant configuration to another information processing apparatus that manages the circulation route of the life message as a master, and participating as a slave in the circulation route of the life message.

The information processing apparatus according to claim 1, wherein
the life message processing means transmits and receives the life message based on identification information shared among the information processing apparatuses that build the redundant configuration,
the slave processing means requests the identification information as the request to participate in the redundant configuration, and
the master processing means, upon receiving the request to participate in the redundant configuration, generates new identification information and transmits it to the other information processing apparatuses participating as slaves in the circulation route of the life message, including the source of the request to participate in the redundant configuration.

The information processing apparatus according to claim 2, wherein
the slave processing means and the master processing means each have means for transmitting a request to withdraw from the redundant configuration to the other information processing apparatuses participating in the circulation route of the life message, and withdrawing from the circulation route of the life message,
the master processing means, upon receiving a request to withdraw from the redundant configuration from another information processing apparatus participating in the circulation route of the life message, generates new identification information and transmits it to the other information processing apparatuses participating in the circulation route of the life message, excluding the source of the request to withdraw from the redundant configuration, and
the slave processing means, upon receiving a request to withdraw from the redundant configuration from another information processing apparatus participating in the circulation route of the life message, waits to receive the newly generated identification information.

The information processing apparatus according to claim 3, wherein
the slave processing means has means for operating the master processing means so as to manage the circulation route of the life message as an alternative master when the newly generated identification information has not been received after a predetermined period has elapsed since the request to withdraw from the redundant configuration was received, and
the master processing means, when it operates to manage the circulation route of the life message as an alternative master from a state in which the own apparatus was participating as a slave in the circulation route of the life message, generates new identification information and transmits it to the other information processing apparatuses participating as slaves in the circulation route of the life message.

The information processing apparatus according to claim 4, wherein the slave processing means executes the processing for operating the master processing means when the own apparatus ranks highest in the relay order of the life message among the information processing apparatuses participating as slaves in the circulation route of the life message.

The information processing apparatus according to claim 2, wherein the slave processing means requests new identification information from the other information processing apparatus that manages the circulation route of the life message as a master when the life message has not been received for longer than a predetermined period.

The information processing apparatus according to claim 6, wherein
the slave processing means has means for operating the master processing means so as to manage the circulation route of the life message as an alternative master when, after new identification information has been requested from the other information processing apparatus that manages the circulation route of the life message as a master because the life message was not received for longer than a predetermined period, no new identification information is returned from that other information processing apparatus, and
the master processing means, when it operates to manage the circulation route of the life message as an alternative master from a state in which the own apparatus was participating as a slave in the circulation route of the life message, generates new identification information and transmits it to the other information processing apparatuses participating as slaves in the circulation route of the life message.

The information processing apparatus according to claim 7, wherein the slave processing means executes the processing for operating the master processing means when the own apparatus ranks highest in the relay order of the life message among the information processing apparatuses participating as slaves in the circulation route of the life message.

An operation control method for an information processing apparatus that builds a redundant configuration with another information processing apparatus connected via a network, the method comprising:
detecting an abnormality by circulating a life message among the information processing apparatuses that build the redundant configuration;
managing a circulation route of the life message as a master and, upon receiving a request to participate in the redundant configuration from another information processing apparatus, adding that other information processing apparatus as a slave to the circulation route of the life message; and
transmitting a request to participate in the redundant configuration to another information processing apparatus that manages the circulation route of the life message as a master, and participating as a slave in the circulation route of the life message.

The operation control method according to claim 9, wherein
detecting the abnormality includes transmitting and receiving the life message based on identification information shared among the information processing apparatuses that build the redundant configuration,
participating as a slave in the circulation route of the life message includes requesting the identification information as the request to participate in the redundant configuration, and
managing the circulation route of the life message as a master includes, upon receiving the request to participate in the redundant configuration, generating new identification information and transmitting it to the other information processing apparatuses participating as slaves in the circulation route of the life message, including the source of the request to participate in the redundant configuration.

The operation control method according to claim 10, further comprising transmitting a request to withdraw from the redundant configuration to the other information processing apparatuses participating in the circulation route of the life message, and withdrawing from the circulation route of the life message, wherein
managing the circulation route of the life message as a master includes, upon receiving a request to withdraw from the redundant configuration from another information processing apparatus participating in the circulation route of the life message, generating new identification information and transmitting it to the information processing apparatuses participating in the circulation route of the life message, excluding the source of the request to withdraw from the redundant configuration, and
participating as a slave in the circulation route of the life message includes, upon receiving a request to withdraw from the redundant configuration from another information processing apparatus participating in the circulation route of the life message, waiting to receive the newly generated identification information.

The operation control method according to claim 11, further comprising managing the circulation route of the life message as an alternative master when, while participating as a slave in the circulation route of the life message, the newly generated identification information has not been received after a predetermined period has elapsed since the request to withdraw from the redundant configuration was received, wherein
managing the circulation route of the life message as an alternative master includes, when operating, generating new identification information and transmitting it to the other information processing apparatuses participating as slaves in the circulation route of the life message.

13. Managing the circulation route of the life message as an alternative master is performed when the own apparatus ranks highest in the relay order of the life message among the information processing apparatuses participating as slaves in the circulation route of the life message. Information processing device.

The operation control method according to claim 10, wherein participating as a slave in the circulation route of the life message includes requesting new identification information from the other information processing apparatus that manages the circulation route of the life message as a master when the life message has not been received for longer than a predetermined period.

The operation control method according to claim 14, further comprising managing the circulation route of the life message as an alternative master when, while participating as a slave in the circulation route of the life message, new identification information has been requested from the other information processing apparatus that manages the circulation route of the life message as a master because the life message was not received for longer than a predetermined period, and no new identification information is returned from that other information processing apparatus, wherein
managing the circulation route of the life message as an alternative master includes, when operating, generating new identification information and transmitting it to the other information processing apparatuses participating as slaves in the circulation route of the life message.

16. Managing the circulation route of the life message as an alternative master operates when the own apparatus ranks highest in the relay order of the life message among the information processing apparatuses participating as slaves in the circulation route of the life message. Operation control method.

A program executed by a computer that builds a redundant configuration with another computer connected via a network, the program causing the computer to operate as:
life message processing means for detecting an abnormality by circulating a life message among the information processing apparatuses that build the redundant configuration;
master processing means for managing a circulation route of the life message as a master and, upon receiving a request to participate in the redundant configuration from another information processing apparatus, adding that other information processing apparatus as a slave to the circulation route of the life message; and
slave processing means for transmitting a request to participate in the redundant configuration to another information processing apparatus that manages the circulation route of the life message as a master, and participating as a slave in the circulation route of the life message.

The program according to claim 15, wherein
the life message processing means transmits and receives the life message based on identification information shared among the information processing apparatuses that build the redundant configuration,
the slave processing means requests the identification information as the request to participate in the redundant configuration, and
the master processing means, upon receiving the request to participate in the redundant configuration, generates new identification information and transmits it to the other information processing apparatuses participating as slaves in the circulation route of the life message, including the source of the request to participate in the redundant configuration.

The program according to claim 18, wherein
the slave processing means and the master processing means each have means for transmitting a request to withdraw from the redundant configuration to the other information processing apparatuses participating in the circulation route of the life message, and withdrawing from the circulation route of the life message,
the master processing means, upon receiving a request to withdraw from the redundant configuration from another information processing apparatus participating in the circulation route of the life message, generates new identification information and transmits it to the other information processing apparatuses participating in the circulation route of the life message, excluding the source of the request to withdraw from the redundant configuration, and
the slave processing means, upon receiving a request to withdraw from the redundant configuration from another information processing apparatus participating in the circulation route of the life message, waits to receive the newly generated identification information.

The program according to claim 19, wherein
the slave processing means has means for operating the master processing means so as to manage the circulation route of the life message as an alternative master when the newly generated identification information has not been received after a predetermined period has elapsed since the request to withdraw from the redundant configuration was received, and
the master processing means, when it operates to manage the circulation route of the life message as an alternative master from a state in which the own apparatus was participating as a slave in the circulation route of the life message, generates new identification information and transmits it to the other information processing apparatuses participating as slaves in the circulation route of the life message.

The program according to claim 20, wherein the slave processing means executes the processing for operating the master processing means when the own apparatus ranks highest in the relay order of the life message among the information processing apparatuses participating as slaves in the circulation route of the life message.

The program according to claim 18, wherein the slave processing means requests new identification information from the other information processing apparatus that manages the circulation route of the life message as a master when the life message has not been received for longer than a predetermined period.

The program according to claim 22, wherein
the slave processing means has means for operating the master processing means so as to manage the circulation route of the life message as an alternative master when, after new identification information has been requested from the other information processing apparatus that manages the circulation route of the life message as a master because the life message was not received for longer than a predetermined period, no new identification information is returned from that other information processing apparatus, and
the master processing means, when it operates to manage the circulation route of the life message as an alternative master from a state in which the own apparatus was participating as a slave in the circulation route of the life message, generates new identification information and transmits it to the other information processing apparatuses participating as slaves in the circulation route of the life message.

The program according to claim 23, wherein the slave processing means executes the processing for operating the master processing means when the own apparatus ranks highest in the relay order of the life message among the information processing apparatuses participating as slaves in the circulation route of the life message.