JP6969261B2

JP6969261B2 - Node management device, node management method and program

Info

Publication number: JP6969261B2
Application number: JP2017191164A
Authority: JP
Inventors: 悟金田
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2017-09-29
Filing date: 2017-09-29
Publication date: 2021-11-24
Anticipated expiration: 2037-09-29
Also published as: JP2019067087A

Description

本発明は、互いに連携して要求を処理する複数のノードを管理する、ノード管理装置、ノード管理方法及びプログラムに関する。 The present invention relates to a node management device, a node management method, and a program for managing a plurality of nodes that process requests in cooperation with each other.

互いに連携して要求を処理する複数のノードからなるシステムの一例として、分散処理システムが知られている。この分散処理システムでは、各ノードの負荷の偏りを防ぎ、全体的な処理の効率性を保つため、負荷分散が行われる。負荷分散には、静的負荷分散と動的負荷分散がある。前者は、事前に定めた割り振りで処理をノードに割り当てる方法である。また、後者は、ノードの負荷を監視して、負荷の軽いノードに処理を割り当てる方法である。一般に、動的負荷分散の方が、静的負荷分散よりも実状に応じた負荷分散が期待される。 A distributed processing system is known as an example of a system consisting of a plurality of nodes that process requests in cooperation with each other. In this distributed processing system, load balancing is performed in order to prevent the load bias of each node and maintain the overall processing efficiency. Load balancing includes static load balancing and dynamic load balancing. The former is a method of allocating a process to a node by a predetermined allocation. The latter is a method of monitoring the load of a node and allocating processing to a node having a light load. In general, dynamic load balancing is expected to be more load-balanced according to the actual situation than static load balancing.

分散処理システムの一例として、特許文献１に、生産ラインの全行程を管理する生産管理システムが記載されている。この生産管理システムでは、所要に対し、在庫又は仕掛完成予定を製品在庫から順次、源流工程に向けて工程毎に引当て、各工程の在庫又は引当分、余剰分、不足分を確定する。不足分について仕込換算を行って、要仕込分の要仕込日及び要仕込数を求める。そして、引当分と要仕込分とを源流工程から下流工程に向けて負荷山積し、各工程の仕込予定日と完成予定日を求める。 As an example of a distributed processing system, Patent Document 1 describes a production control system that manages the entire process of a production line. In this production control system, inventories or work-in-process completion schedules are sequentially allocated from product inventory for each process toward the source process, and inventory or allocation, surplus, and shortage of each process are determined. The shortage is converted into charges, and the required preparation date and the required number of preparations are obtained. Then, the reserved portion and the required preparation portion are piled up from the headwater process to the downstream process, and the scheduled preparation date and the scheduled completion date of each process are obtained.

特開平１１−４８１０２号公報Japanese Unexamined Patent Publication No. 11-48102

上述した分散処理システムでは、予め定めた割り振りで処理をノードに割り当てるが、実際に要求を処理した場合の各ノードの処理負荷を予測することはできない。このため、ノードに対する対処が間に合わず、過負荷を生じる場合がある。また、必要以上に負荷を軽減してしまう場合もある。
特許文献１に記載の生産管理システムにおいても、処理負荷の予測はできないため、上記の分散処理システムと同様の問題を生じる。 In the distributed processing system described above, processing is assigned to nodes by a predetermined allocation, but it is not possible to predict the processing load of each node when actually processing a request. For this reason, it may not be possible to deal with the node in time, resulting in an overload. In addition, the load may be reduced more than necessary.
Even in the production control system described in Patent Document 1, since the processing load cannot be predicted, the same problem as the above-mentioned distributed processing system arises.

本発明の目的は、上記問題を解決することができる、ノード管理装置、ノード管理方法及びプログラムを提供することである。 An object of the present invention is to provide a node management device, a node management method and a program capable of solving the above problems.

上記目的を達成するため、本発明の第１のノード管理装置は、互いに連携して要求を処理する複数のノードについて、各ノードが前記要求を処理した際の処理負荷と前記要求のノード間の遷移を分析する分析部と、前記処理負荷および遷移の分析結果に基づいて、所望の要求数に対する各ノードの処理負荷を予測する予測部と、を有する。 In order to achieve the above object, the first node management device of the present invention has a plurality of nodes that process requests in cooperation with each other, between the processing load when each node processes the request and the node of the request. It has an analysis unit that analyzes the transition, and a prediction unit that predicts the processing load of each node for a desired number of requests based on the processing load and the analysis result of the transition.

本発明の第１のノード管理方法は、互いに連携して要求を処理する複数のノードについて、各ノードが前記要求を処理した際の処理負荷と前記要求のノード間の遷移を分析し、
前記処理負荷および遷移の分析結果に基づいて、所望の要求数に対する各ノードの処理負荷を予測することを含む。 The first node management method of the present invention analyzes the processing load when each node processes the request and the transition between the nodes of the request for a plurality of nodes that process the request in cooperation with each other.
It includes predicting the processing load of each node for a desired number of requests based on the processing load and transition analysis results.

本発明の第１のプログラムは、互いに連携して要求を処理する複数のノードについて、各ノードが前記要求を処理した際の処理負荷と前記要求のノード間の遷移を分析する処理と、前記処理負荷および遷移の分析結果に基づいて、所望の要求数に対する各ノードの処理負荷を予測する処理と、をコンピュータに実行させる。 The first program of the present invention is a process of analyzing the processing load when each node processes the request and the transition between the nodes of the request for a plurality of nodes that process the request in cooperation with each other, and the process. A computer is made to perform a process of predicting the processing load of each node for a desired number of requests based on the load and transition analysis results.

過負荷や低負荷のノードを推定し、負荷分散や輻輳制御などの適切な対策を早期に行うことができる。 It is possible to estimate overloaded and underloaded nodes and take appropriate measures such as load balancing and congestion control at an early stage.

本発明の第１の実施形態によるノード管理装置の構成を示すブロック図である。It is a block diagram which shows the structure of the node management apparatus by 1st Embodiment of this invention. 本発明の第２の実施形態によるノード管理装置の構成を示すブロック図である。It is a block diagram which shows the structure of the node management apparatus by the 2nd Embodiment of this invention. 図２に示すノード管理装置にて行われるノード管理処理の一手順を示すフローチャートである。It is a flowchart which shows one procedure of the node management processing performed by the node management apparatus shown in FIG. 処理負荷と遷移先ノード情報の集計結果の一例を説明するための図である。It is a figure for demonstrating an example of the aggregation result of the processing load and the transition destination node information. 各ノード間を遷移した要求数のカウント値の一例を説明するための図である。It is a figure for demonstrating an example of the count value of the number of requests which transitioned between each node. 要求がノード間を遷移する遷移確率の算出処理の一例を説明するための図である。It is a figure for demonstrating an example of the calculation process of the transition probability that a request transitions between nodes. 遷移確率行列を用いた予測の一例を説明するための図である。It is a figure for demonstrating an example of prediction using a transition probability matrix. 遷移確率調整用テーブルの一例を説明するための図である。It is a figure for demonstrating an example of the transition probability adjustment table.

次に、本発明の実施形態について図面を参照して説明する。
（第１の実施形態）
図１は、本発明の第１の実施形態によるノード管理装置の構成を示すブロック図である。なお、図１において、一方向性の矢印は、ある信号（データ又は情報）の流れの方向を端的に示したもので、双方向性を排除するものではない。
図１を参照すると、ノード管理装置１１は、分析部１２及び予測部１３を有する。分析部１２は、互いに連携して要求を処理する複数のノード１５₁〜１５_nについて、各ノードが要求を処理した際の処理負荷と要求のノード間の遷移を分析する。予測部１３は、分析部１２からの処理負荷及び遷移の分析結果に基づいて、所望の要求数ｗに対する各ノードの処理負荷を予測する。 Next, an embodiment of the present invention will be described with reference to the drawings.
(First Embodiment)
FIG. 1 is a block diagram showing a configuration of a node management device according to the first embodiment of the present invention. In addition, in FIG. 1, the unidirectional arrow simply indicates the direction of the flow of a certain signal (data or information), and does not exclude bidirectionality.
Referring to FIG. 1, the node management device 11 has an analysis unit 12 and a prediction unit 13. The analysis unit 12 analyzes the processing load when each node processes a request and the transition between the request _{nodes for a plurality of nodes 15 1} to 15 _n that process the request in cooperation with each other. The prediction unit 13 predicts the processing load of each node for the desired number of requests w based on the analysis result of the processing load and the transition from the analysis unit 12.

上述したノード管理装置１１によれば、所望の要求数ｗに対する各ノードの処理負荷を定量的に予測することができるので、過負荷のノードや低負荷のノードを推定し、負荷分散や輻輳制御などの適切な対策を早期に行うことができる。 According to the node management device 11 described above, since the processing load of each node can be quantitatively predicted for a desired number of requests w, an overloaded node or a low-loaded node can be estimated, and load balancing and congestion control can be performed. Appropriate measures such as can be taken at an early stage.

なお、ノード管理装置１１において、分析部１２及び予測部１３のそれぞれの機能を有する１つの制御部が設けられても良い。また、その制御部は、ＣＰＵ（Central Processing Unit）等を具備したコンピュータで実現されても良い。この場合、コンピュータがプログラムを実行することで、分析部１２及び予測部１３のそれぞれの機能を実現しても良い。 The node management device 11 may be provided with one control unit having the functions of the analysis unit 12 and the prediction unit 13. Further, the control unit may be realized by a computer equipped with a CPU (Central Processing Unit) or the like. In this case, the functions of the analysis unit 12 and the prediction unit 13 may be realized by the computer executing the program.

プログラムは、分析部１２及び予測部１３のそれぞれの機能に対応する処理をコンピュータが実行するためのものであっても良い。例えば、プログラムは、互いに連携して要求を処理する複数のノード１５₁〜１５_nについて、ノードが要求を処理した際の処理負荷と、要求のノード間の遷移とを分析する処理と、処理負荷及び遷移の分析結果に基づいて、所望の要求数に対する各ノードの処理負荷を予測する処理とを、コンピュータが実行するためのものであっても良い。プログラムは、コンピュータ使用可能媒体またはコンピュータ可読媒体で提供されてもよく、また、インターネット等のネットワークを介して提供されてもよい。 The program may be for the computer to execute the processing corresponding to each function of the analysis unit 12 and the prediction unit 13. For example, the program analyzes the processing load when a node processes a request and the transition between the request _{nodes for a plurality of nodes 15 1} to 15 _n that process the request in cooperation with each other, and the processing load. And the process of predicting the processing load of each node for a desired number of requests based on the analysis result of the transition may be performed by the computer. The program may be provided on a computer-enabled or computer-readable medium, or may be provided via a network such as the Internet.

コンピュータ使用可能媒体またはコンピュータ可読媒体は、磁気、光、電子、電磁気、赤外線などを用いて情報の記録または読み取りが可能な媒体を含む。そのような媒体として、例えば、半導体メモリ、半導体または固体の記憶装置、磁気テープ、取外し可能なコンピュータディスケット、ランダムアクセスメモリ（ＲＡＭ）、読出し専用メモリ（ＲＯＭ）、磁気ディスク、光ディスク、光磁気ディスクなどがある。 Computer-enabled or computer-readable media include media on which information can be recorded or read using magnetism, light, electrons, electromagnetics, infrared rays, and the like. Such media include, for example, semiconductor memories, semiconductor or solid storage devices, magnetic tapes, removable computer diskettes, random access memory (RAM), read-only memory (ROM), magnetic disks, optical disks, optomagnetic disks, etc. There is.

また、ノード管理装置１１において、分析部１２は、ノード１５₁〜１５_nについて、ノードが要求を処理した際の処理負荷と、該ノードにて処理された要求の遷移先を示す遷移先ノード情報とを集計し、その集計結果に基づいて、処理負荷および遷移先の傾向を分析しても良い。
さらに、分析部１２が、ノード間の前記要求の遷移確率を算出し、予測部１３が、要求の遷移確率に基づいて、各ノードの処理負荷を予測しても良い。
さらに、分析部１２が、ノード１５₁〜１５_nについて、各ノード間を遷移した要求数を単位処理負荷毎にカウントし、該カウント値に基づいて遷移確率行列を算出し、予測部１３が、遷移確率行列に基づいて、各ノードの処理負荷を予測しても良い。 Further, in the node management device 11, the analysis unit 12 indicates, for the nodes 15 ₁ to 15 _n , the processing load when the node processes the request and the transition destination node information indicating the transition destination of the request processed by the node. And may be aggregated, and the processing load and the tendency of the transition destination may be analyzed based on the aggregated result.
Further, the analysis unit 12 may calculate the transition probability of the request between the nodes, and the prediction unit 13 may predict the processing load of each node based on the transition probability of the request.
Further, the analysis unit 12 _{counts the number of requests transitioned between each node for the nodes 15 1} to 15 _n for each unit processing load, calculates a transition probability matrix based on the count value, and the prediction unit 13 determines. The processing load of each node may be predicted based on the transition probability matrix.

また、ノード管理装置１１において、ノード毎に、予測部１３で予測した処理負荷の予測値と許容値とを比較し、予測値が許容値を超えている場合に、所定のメッセージを出力する対策部を、さらに有しても良い。所定のメッセージは、例えば警告を示すメッセージであっても良い。
さらに、対策部が、予測値が許容値を超えたノードの処理負荷を低減するための対策メッセージを出力しても良い。この場合、対策は、処理負荷の改善、許容値の緩和、要求数の緩和および遷移確率の低減のいずれか一つ、または、２つ以上の組み合わせであっても良い。 Further, in the node management device 11, for each node, the predicted value of the processing load predicted by the prediction unit 13 and the permissible value are compared, and when the predicted value exceeds the permissible value, a predetermined message is output. The unit may be further provided. The predetermined message may be, for example, a message indicating a warning.
Further, the countermeasure unit may output a countermeasure message for reducing the processing load of the node whose predicted value exceeds the allowable value. In this case, the countermeasure may be any one of improvement of the processing load, relaxation of the allowable value, relaxation of the number of requests, and reduction of the transition probability, or a combination of two or more.

（第２の実施形態）
図２は、本発明の第２の実施形態によるノード管理装置の構成を示すブロック図である。なお、図２において、一方向性の矢印は、ある信号（データ又は情報）の流れの方向を端的に示したもので、双方向性を排除するものではない。
図２を参照すると、ノード管理装置２１は、分散処理システム１０の処理負荷の予測及び監視を行う装置であって、遷移検出部２２、遷移集計分析部２３、処理負荷予測部２４、要求数取得部２５、対策算出部２６及び許容負荷取得部２７を有する。遷移集計分析部２３、または、遷移検出部２２と遷移集計分析部２３とからなる部分を、分析部と呼ぶことができる。処理負荷予測部２４及び要求数取得部２５を、予測部と呼ぶことができる。対策算出部２６及び許容負荷取得部２７を、対策部と呼ぶことができる。 (Second embodiment)
FIG. 2 is a block diagram showing a configuration of a node management device according to a second embodiment of the present invention. In FIG. 2, the unidirectional arrow simply indicates the direction of the flow of a certain signal (data or information), and does not exclude bidirectionality.
Referring to FIG. 2, the node management device 21 is a device that predicts and monitors the processing load of the distributed processing system 10, and is a transition detection unit 22, a transition aggregation analysis unit 23, a processing load prediction unit 24, and a request number acquisition. It has a unit 25, a countermeasure calculation unit 26, and an allowable load acquisition unit 27. The transition aggregation analysis unit 23, or the portion including the transition detection unit 22 and the transition aggregation analysis unit 23 can be referred to as an analysis unit. The processing load prediction unit 24 and the request number acquisition unit 25 can be referred to as a prediction unit. The countermeasure calculation unit 26 and the allowable load acquisition unit 27 can be referred to as a countermeasure unit.

分散処理システム１０は、互いに連携して要求（要求数＝ｗ）を処理する複数のノード１〜５を有する。ここでは、ノード１〜５のそれぞれに対して、特定の処理が割り当てられている。要求は、一連の処理から成り、処理に応じたノードで処理される。要求は、あるノードでの処理が終わると、次に行う処理が確定し、対応する次のノードに遷移する。要求は、処理と遷移を繰り返して、有限時間内に全ての処理が完了する。要求を処理する際のノードの処理負荷は、計測または推定できるものとする。 The distributed processing system 10 has a plurality of nodes 1 to 5 that process requests (number of requests = w) in cooperation with each other. Here, specific processing is assigned to each of the nodes 1 to 5. The request consists of a series of processes and is processed by the node corresponding to the process. When the processing at a certain node is completed, the request determines the processing to be performed next and transitions to the corresponding next node. The request repeats processing and transition, and all processing is completed within a finite time. The processing load of the node when processing the request shall be measurable or estimateable.

遷移検出部２２は、要求のノード間における遷移を検出する。遷移検出結果は、遷移検出部２２から遷移集計分析部２３に供給される。
遷移集計分析部２３は、ノード１〜５について、ノードが要求を処理した際の処理負荷と、該ノードにて処理された要求の遷移先を示す遷移先ノード情報とを集計する。例えば、遷移集計分析部２３は、処理負荷の単位量ごとに離散的に、各要求がどのノードで処理されていたかを示す情報を集計しても良い。遷移集計分析部２３は、処理負荷と遷移先ノード情報の集計結果に基づいて、処理負荷および遷移先の傾向を分析する。遷移集計分析部２３は、分析結果を処理負荷予測部２４に供給する。 The transition detection unit 22 detects the transition between the requesting nodes. The transition detection result is supplied from the transition detection unit 22 to the transition aggregation analysis unit 23.
The transition aggregation analysis unit 23 aggregates the processing load when the node processes the request and the transition destination node information indicating the transition destination of the request processed by the node for the nodes 1 to 5. For example, the transition aggregation analysis unit 23 may aggregate information indicating on which node each request was processed discretely for each unit amount of the processing load. The transition aggregation analysis unit 23 analyzes the processing load and the tendency of the transition destination based on the aggregation result of the processing load and the transition destination node information. The transition aggregation analysis unit 23 supplies the analysis result to the processing load prediction unit 24.

なお、遷移検出部２２は、各ノードに対して、それぞれの処理負荷と遷移先ノード情報を問合せても良い。あるいは、各ノードが自発的に、自ノードの処理負荷と遷移先ノード情報とを遷移検出部２２に報告しても良い。あるいは、遷移検出部２２が、各ノードの通信を監視して、要求の処理開始のタイミングや、要求の受け渡しを検出することで、処理負荷及び遷移先ノード情報の集計を行っても良い。 The transition detection unit 22 may inquire each node about the processing load and the transition destination node information. Alternatively, each node may voluntarily report the processing load of its own node and the transition destination node information to the transition detection unit 22. Alternatively, the transition detection unit 22 may monitor the communication of each node and detect the timing of starting the processing of the request and the passing of the request, thereby totaling the processing load and the transition destination node information.

要求数取得部２５は、ノード１〜５で処理させる要求数を取得する。例えば、要求数取得部２５は、各ノードに想定される要求数の入力を受け付けても良い。要求数取得部２５は、取得した要求数を処理負荷予測部２４に供給する。
処理負荷予測部２４は、遷移集計分析部２３からの分析結果に基づいて、所望の要求数に対する各ノードの処理負荷を予測する。所望の要求数は、要求数取得部２５から供給される。処理負荷予測部２４は、各ノードの処理負荷の予測値を対策算出部２６に供給する。 The request number acquisition unit 25 acquires the number of requests to be processed by the nodes 1 to 5. For example, the request number acquisition unit 25 may accept input of the expected number of requests to each node. The request number acquisition unit 25 supplies the acquired request number to the processing load prediction unit 24.
The processing load prediction unit 24 predicts the processing load of each node for a desired number of requests based on the analysis result from the transition aggregation analysis unit 23. The desired number of requests is supplied from the request number acquisition unit 25. The processing load prediction unit 24 supplies the predicted value of the processing load of each node to the countermeasure calculation unit 26.

許容負荷取得部２７は、ノード１〜５の処理負荷の許容値を取得する。例えば、許容負荷取得部２７は、処理負荷の許容値の入力を受け付けても良い。許容負荷取得部２７は、取得した許容値を対策算出部２６に供給する。
対策算出部２６は、ノード毎に、処理負荷の予測値と許容値とを比較する。予測値が許容値を超えた場合、対策算出部２６は、警告メッセージを出力するとともに、ノードに対する対策を算出し、その対策を示すメッセージを出力する。対策算出部２６は、通知部と呼ぶことができる。 The allowable load acquisition unit 27 acquires the allowable value of the processing load of the nodes 1 to 5. For example, the allowable load acquisition unit 27 may accept the input of the allowable value of the processing load. The allowable load acquisition unit 27 supplies the acquired allowable value to the countermeasure calculation unit 26.
The countermeasure calculation unit 26 compares the predicted value and the allowable value of the processing load for each node. When the predicted value exceeds the permissible value, the countermeasure calculation unit 26 outputs a warning message, calculates a countermeasure for the node, and outputs a message indicating the countermeasure. The countermeasure calculation unit 26 can be called a notification unit.

次に、本実施形態のノード管理装置２１の動作について具体的に説明する。
図３は、ノード管理装置２１にて行われるノード管理処理の一手順を示すフローチャートである。 Next, the operation of the node management device 21 of the present embodiment will be specifically described.
FIG. 3 is a flowchart showing one procedure of the node management process performed by the node management device 21.

まず、遷移検出部２２が要求のノード間の遷移を検出する。そして、遷移集計分析部２３が、各ノードの処理負荷と遷移先ノード情報を集計する（ステップＳ１０）。
次に、遷移集計分析部２３が、ステップＳ１０で得られた集計結果に基づいて、各ノードの処理負荷及び遷移先の傾向を分析する（ステップＳ１１）。次に、処理負荷予測部２４が、ステップＳ１１で得られた処理負荷及び遷移先の傾向分析結果に基づいて、要求数取得部２５が取得した要求数に対する各ノードの処理負荷を予測する（ステップＳ１２）。 First, the transition detection unit 22 detects the transition between the requested nodes. Then, the transition aggregation analysis unit 23 aggregates the processing load of each node and the transition destination node information (step S10).
Next, the transition aggregation analysis unit 23 analyzes the processing load of each node and the tendency of the transition destination based on the aggregation result obtained in step S10 (step S11). Next, the processing load prediction unit 24 predicts the processing load of each node with respect to the number of requests acquired by the request number acquisition unit 25 based on the processing load obtained in step S11 and the tendency analysis result of the transition destination (step). S12).

次に、対策算出部２６が、各ノードについて、ステップＳ１２で得られた処理負荷の予測値が、許容負荷取得部２７が取得した許容値より大きいか否かを判定する（ステップＳ１３）。予測値が許容値より大きい場合に、対策算出部２６が、警告メッセージを出力するとともに、ノードに対する対策を算出し、その対策を示すメッセージを出力する。 Next, the countermeasure calculation unit 26 determines whether or not the predicted value of the processing load obtained in step S12 is larger than the allowable value acquired by the allowable load acquisition unit 27 for each node (step S13). When the predicted value is larger than the permissible value, the countermeasure calculation unit 26 outputs a warning message, calculates a countermeasure for the node, and outputs a message indicating the countermeasure.

以下に、処理負荷と遷移先ノード情報の集計処理（ステップＳ１０）、処理負荷及び遷移先の傾向分析処理（ステップＳ１１）、処理負荷の予測処理（ステップＳ１２）及び警告／対策算出処理（ステップＳ１４）を詳細に説明する。
以下では、ノードの処理負荷の尺度として、処理時間を用いる。すなわち、ノード間を遷移する要求を分析する際に、単位処理負荷として単位処理時間を用いる。これとは異なるケースとして、処理負荷を例えば通信量で評価する場合は、単位処理負荷として単位通信量を用いれば良い。 Below, the processing load and transition destination node information aggregation processing (step S10), processing load and transition destination trend analysis processing (step S11), processing load prediction processing (step S12), and warning / countermeasure calculation processing (step S14). ) Will be explained in detail.
In the following, the processing time is used as a measure of the processing load of the node. That is, the unit processing time is used as the unit processing load when analyzing the request transitioning between the nodes. As a case different from this, when the processing load is evaluated by, for example, the communication amount, the unit communication amount may be used as the unit processing load.

処理負荷と遷移先ノード情報の集計処理（ステップＳ１０）
図４に、処理負荷と遷移先ノード情報の集計結果の一例を示す。図４において、「ａ」、「ｂ」、「ｃ」はそれぞれ要求を示す。図４の集計結果は、要求ａ〜ｃが、ノード１〜５のいずれで処理されていたかを示す情報を処理順に単位処理時間毎に集計したものである。この例では、処理負荷を処理時間で評価するものとして、遷移集計分析部２３は、要求の遷移についての情報を各ノードにおける単位処理時間毎に集計する。 Aggregation processing of processing load and transition destination node information (step S10)
FIG. 4 shows an example of the aggregation result of the processing load and the transition destination node information. In FIG. 4, "a", "b", and "c" indicate requirements, respectively. The aggregation result of FIG. 4 is the aggregation of the information indicating which of the nodes 1 to 5 the requests a to c were processed for each unit processing time in the processing order. In this example, assuming that the processing load is evaluated by the processing time, the transition aggregation analysis unit 23 aggregates the information about the request transition for each unit processing time in each node.

ノード１は、累計処理時間(以下、時刻)０から要求ａを処理し、時刻３で要求ｂを処理し、時刻４及び５で要求ｃを処理する。ノード２は、時刻１で要求ａを処理し、時刻６で要求ｃを処理する。ノード３は、時刻２で要求ａを処理し、時刻４で要求ｂを処理し、時刻７及び９で要求ｃを処理する。ノード４は、時刻３で要求ａを処理し、時刻８及び１０で要求ｃを処理する。ノード５は、時刻４で要求ａを処理し、時刻５で要求ｂを処理し、時刻１１で要求ｃを処理する。 Node 1 processes the request a from the cumulative processing time (hereinafter, time) 0, processes the request b at the time 3, and processes the request c at the times 4 and 5. The node 2 processes the request a at time 1 and the request c at time 6. The node 3 processes the request a at time 2, the request b at time 4, and the request c at times 7 and 9. Node 4 processes the request a at time 3 and the request c at times 8 and 10. The node 5 processes the request a at time 4, processes the request b at time 5, and processes the request c at time 11.

時刻がtからt+1になった時、あるノードから別のノードに移った要求は、あるノードでの処理がその単位処理時間内で終わり、次のノードへ遷移したことを示す。例えば、時刻０でノード１が要求ａを処理し、時刻１でノード２が要求ａを処理している。この場合、要求ａは、単位処理時間を跨いでノードを移った要求である。
また、時刻がtからt+1になっても、あるノードに留まった要求は、あるノードでの処理が単位処理時間内に終わらず、ノードを遷移しなかったことを示す。例えば、ノード１は、時刻４及び５で要求ｃを処理している。この場合、要求ｃは、単位処理時間を跨いで同じノードに留まった要求である。 When the time changes from t to t + 1, a request that moves from one node to another indicates that the processing at one node has finished within that unit processing time and has transitioned to the next node. For example, at time 0, node 1 processes request a, and at time 1, node 2 processes request a. In this case, the request a is a request in which the nodes are moved over the unit processing time.
Also, even if the time changes from t to t + 1, a request that stays at a certain node indicates that the processing at a certain node did not end within the unit processing time and the node did not transition. For example, node 1 is processing request c at times 4 and 5. In this case, the request c is a request that stays at the same node over a unit processing time.

上記の処理負荷と遷移先ノード情報の集計処理（ステップＳ１０）は、以下で説明する処理負荷と遷移先の傾向を分析する処理（ステップＳ１１）までにすれば良い。つまり、単位処理時間毎に定期的に行なっても良く、遷移の度に逐次的に行なっても良く、あるいは、ステップ１１の直前に一括して行なっても良い。 The above processing load and transition destination node information aggregation processing (step S10) may be performed up to the processing for analyzing the processing load and transition destination tendency described below (step S11). That is, it may be performed periodically for each unit processing time, sequentially for each transition, or collectively immediately before step 11.

処理負荷及び遷移先の傾向分析処理（ステップＳ１１）
遷移集計分析部２３は、分析結果として遷移確率行列を算出する。すなわち、遷移集計分析部２３は、あるノードで要求が単位処理負荷の処理を受けた後、次のノードへ遷移する遷移確率行列を算出する。
例えば、遷移集計分析部２３は、各ノード間を遷移した要求数を、単位処理負荷ごとにカウントし、該カウント値に基づいて遷移確率行列を算出する。以下の例で、この遷移確率行列の算出処理を具体的に説明する。
以下の例では、前述の例で示した、処理負荷(処理時間)と遷移先ノード情報の集計結果を用いる。したがって、処理負荷を処理時間で評価するものとする。 Processing load and transition destination tendency analysis processing (step S11)
The transition aggregation analysis unit 23 calculates a transition probability matrix as an analysis result. That is, the transition aggregation analysis unit 23 calculates a transition probability matrix in which a request is processed by a unit processing load at a certain node and then transitions to the next node.
For example, the transition aggregation analysis unit 23 counts the number of requests that have transitioned between each node for each unit processing load, and calculates a transition probability matrix based on the count value. The calculation process of this transition probability matrix will be specifically described with the following example.
In the following example, the aggregated result of the processing load (processing time) and the transition destination node information shown in the above example is used. Therefore, the processing load shall be evaluated by the processing time.

［要求がノードｉからノードｊへ遷移した回数の算出］
図５に、各ノード間を遷移した要求数のカウント値の一例を示す。単位処理時間毎に要求がノードｉからノードｊへ遷移した回数を累積する遷移カウンタＣｉｊを用いて、図４に示した集計結果に基づく要求数をカウントした結果が示されている。ここで、ｉ及びｊは、分散処理システム１０を構成するノードの数の範囲内で与えられる。 [Calculation of the number of times a request has transitioned from node i to node j]
FIG. 5 shows an example of the count value of the number of requests transitioned between each node. The result of counting the number of requests based on the aggregation result shown in FIG. 4 is shown by using the transition counter Cij which accumulates the number of times the request has transitioned from the node i to the node j for each unit processing time. Here, i and j are given within the range of the number of nodes constituting the distributed processing system 10.

図５において、遷移カウンタＣiiは、要求がノードiからノードiに遷移した回数、すなわち、要求がノードiに留まった回数をカウントする。図４の集計結果では、要求ｃが累計処理時間(以下、時刻)４及び５でノード１に留まっているため、時刻４で、遷移カウンタＣ１１のカウント値が「１」になる。これ以降、時刻１１まで、遷移カウンタＣ１１はカウント値「１」を維持する。 In FIG. 5, the transition counter Cii counts the number of times the request has transitioned from node i to node i, that is, the number of times the request has stayed at node i. In the aggregation result of FIG. 4, since the request c stays at the node 1 at the cumulative processing times (hereinafter, time) 4 and 5, the count value of the transition counter C11 becomes “1” at the time 4. After that, the transition counter C11 maintains the count value "1" until time 11.

遷移カウンタＣijは、要求がノードiからノードjに遷移した回数をカウントする。図４の集計結果では、時刻０から１にかけて、要求ａがノード１からノード２に遷移しているため、時刻０での遷移カウンタＣ12のカウント値を「１」としている。これ以降、時刻４まで、遷移カウンタＣ12はカウント値「１」を維持する。また、時刻５から６にかけて、要求ｃがノード１からノード２に遷移しているため、時刻５での遷移カウンタＣ12のカウント値を「２」にする。これ以降、時刻１１まで、遷移カウンタＣ12はカウント値「２」を維持する。 The transition counter Cij counts the number of times a request has transitioned from node i to node j. In the aggregation result of FIG. 4, since the request a transitions from the node 1 to the node 2 from the time 0 to 1, the count value of the transition counter C12 at the time 0 is set to “1”. After that, the transition counter C12 maintains the count value "1" until time 4. Further, since the request c is transitioning from the node 1 to the node 2 from the time 5 to 6, the count value of the transition counter C12 at the time 5 is set to “2”. After that, the transition counter C12 maintains the count value "2" until time 11.

図５のその他の遷移カウンタＣ１３〜Ｃ１５、Ｃ２１〜Ｃ２５、Ｃ３１〜Ｃ３５、Ｃ４１〜Ｃ４５、Ｃ５１〜Ｃ５５も、遷移カウンタＣ１１又は遷移カウンタＣ１２と同様の原理で、図４の集計結果に基づいて要求のノード間の遷移をカウントする。 The other transition counters C13 to C15, C21 to C25, C31 to C35, C41 to C45, and C51 to C55 of FIG. 5 are also requested based on the aggregation result of FIG. 4 by the same principle as the transition counter C11 or the transition counter C12. Count the transitions between the nodes of.

［要求がノードｉからノードｊへ遷移する遷移確率の算出］
図６に、要求がノード間を遷移する遷移確率の算出処理の一例を示す。この例では、図５に示したカウント結果に基づいて、要求がノードｉからノードｊへ遷移する遷移確率Ｐｉｊを算出した結果が示されている。 [Calculation of transition probability that a request transitions from node i to node j]
FIG. 6 shows an example of a transition probability calculation process in which a request transitions between nodes. In this example, the result of calculating the transition probability Pij in which the request transitions from the node i to the node j is shown based on the count result shown in FIG.

要求が遷移元ノードから遷移先ノードへ遷移する遷移確率Pijは、遷移元ノードiから遷移先ノードjへ遷移した要求数を、その遷移元ノードiから遷移した全要求数で割った値（確率値）として求められる。
図６において、Ｐ１１は、要求がノード１からノード１へ遷移する確率、すなわち、要求がノード１に留まる確率を示す。遷移確率Ｐ１１は、ノード１に留まった要求数を、ノード１からから遷移した全要求数で割った値で与えられる。すなわち、遷移確率Ｐ１１は、遷移カウンタＣ１１のカウント結果「１」を遷移カウンタＣ１１〜Ｃ１５のカウント結果の合計値「４」（＝Ｃ１１＋Ｃ１２＋Ｃ１３＋Ｃ１４＋Ｃ１５＝１＋２＋１＋０＋０）で割った値（＝１／４）である。 The transition probability Pij in which a request transitions from the transition source node to the transition destination node is the value obtained by dividing the number of requests transitioned from the transition source node i to the transition destination node j by the total number of requests transitioned from the transition source node i (probability). Value).
In FIG. 6, P11 indicates the probability that the request transitions from node 1 to node 1, that is, the probability that the request stays at node 1. The transition probability P11 is given by dividing the number of requests staying at node 1 by the total number of requests transitioned from node 1. That is, the transition probability P11 is a value (= 1/4) obtained by dividing the count result “1” of the transition counter C11 by the total value “4” (= C11 + C12 + C13 + C14 + C15 = 1 + 2 + 1 + 0 + 0) of the count results of the transition counters C11 to C15.

Ｐ１２は、要求がノード１からノード２へ遷移する確率を示す。遷移確率Ｐ１２は、ノード１からノード２へ遷移した要求数を、ノード１からから遷移した全要求数で割った値で与えられる。すなわち、遷移確率Ｐ１２は、遷移カウンタＣ１２のカウント結果「２」を遷移カウンタＣ１１〜Ｃ１５のカウント結果の合計値（＝Ｃ１１＋Ｃ１２＋Ｃ１３＋Ｃ１４＋Ｃ１５＝１＋２＋１＋０＋０）で割った値（＝１／２）である。
図６のその他の遷移確率Ｐ１３〜Ｐ１５、Ｐ２１〜Ｐ２５、Ｐ３１〜Ｐ３５、Ｐ４１〜Ｐ４５、Ｐ５１〜Ｐ５５についても、遷移確率Ｐ１１又は遷移確率Ｐ１２と同様の原理で、図５のカウント結果に基づいて算出される。 P12 indicates the probability that the request will transition from node 1 to node 2. The transition probability P12 is given by dividing the number of requests transitioned from node 1 to node 2 by the total number of requests transitioned from node 1. That is, the transition probability P12 is a value (= 1/2) obtained by dividing the count result “2” of the transition counter C12 by the total value (= C11 + C12 + C13 + C14 + C15 = 1 + 2 + 1 + 0 + 0) of the count results of the transition counters C11 to C15.
The other transition probabilities P13 to P15, P21 to P25, P31 to P35, P41 to P45, and P51 to P55 in FIG. 6 are also based on the same principle as the transition probabilities P11 or the transition probabilities P12, based on the count results in FIG. It is calculated.

［遷移確率行列］
上述の遷移確率Ｐ１１〜Ｐ１５、Ｐ２１〜Ｐ２５、Ｐ３１〜Ｐ３５、Ｐ４１〜Ｐ４５、Ｐ５１〜Ｐ５５に基づき、以下の遷移確率行列

を得る。 [Transition probability matrix]
Based on the above-mentioned transition probabilities P11 to P15, P21 to P25, P31 to P35, P41 to P45, and P51 to P55, the following transition probability matrix

To get.

この遷移確率行列

では、遷移元ノードが同じ遷移確率どうしを同じ列に配置し、遷移先ノードが同じ遷移確率どうしを同じ行に配置している。

This transition probability matrix

Then, the transition source nodes arrange the same transition probabilities in the same column, and the transition destination nodes arrange the same transition probabilities in the same row.

処理負荷の予測処理（ステップＳ１２）
処理負荷予測部２４は、遷移集計分析部２３が算出した遷移確率行列

を用いて、所望の要求数に対する各ノードの処理負荷を予測する。例えば、処理負荷予測部２４は、ノードの処理負荷、すなわち各ノードに遷移する要求の総数を、遷移確率行列

を用いて期待値として予測する。以下の例では、前述の例で求めた遷移確率行列を用いて、この予測を行なう処理を具体的に説明する。 Processing load prediction processing (step S12)
The processing load prediction unit 24 is a transition probability matrix calculated by the transition aggregation analysis unit 23.

Is used to predict the processing load of each node for the desired number of requests. For example, the processing load prediction unit 24 sets the processing load of the node, that is, the total number of requests transitioning to each node, as a transition probability matrix.

Is used to predict the expected value. In the following example, the process of making this prediction will be specifically described using the transition probability matrix obtained in the above example.

図７に、遷移確率行列

を用いた予測の一例を示す。この例では、ノード１〜５にそれぞれ、

個の要求が投入されると要求数取得部25によって想定されたとする。各ノードが、それぞれの要求を単位処理負荷（前述の例と同様に、以下、単位処理時間）分だけ処理すると、要求の一部は他のノードに遷移する。ノードiからノードjへ遷移する要求の数は、遷移確率Ｐｉｊを用いると、期待値（Ｐｉｊ×ｗｉ）と予測される。よって、最初の単位処理時間後（１回目）に、各ノードに遷移した要求数は、遷移確率行列

と各ノードに投入された要求数

の行列積

となると期待される。
行列積

の値は、以下の式で算出される。 Figure 7 shows the transition probability matrix.

An example of prediction using is shown. In this example, nodes 1 to 5 respectively

It is assumed that the request number acquisition unit 25 assumes that individual requests are input. When each node processes each request by the unit processing load (hereinafter, unit processing time as in the above example), a part of the request transitions to another node. The number of requests transitioning from node i to node j is predicted to be an expected value (Pij × wi) using the transition probability Pij. Therefore, the number of requests that have transitioned to each node after the first unit processing time (first time) is a transition probability matrix.

And the number of requests submitted to each node

Matrix product of

Is expected to be.
Matrix product

The value of is calculated by the following formula.

同様に、次の単位処理時間後（２回目）に、各ノードに遷移した要求数は、行列積

となると期待される。また、３回目，．．．，１３回目についても、それぞれ行列積

となると期待される。このような処理を繰り返す結果、各ノードの要求の延べ数、すなわち、各ノードの処理負荷は、

となると期待される。 Similarly, after the next unit processing time (second time), the number of requests transitioned to each node is a matrix product.

Is expected to be. Also, the third time ,. .. .. ， For the 13th time, matrix multiplication

Is expected to be. As a result of repeating such processing, the total number of requests of each node, that is, the processing load of each node is

Is expected to be.

［処理時間の予測（解１）］
処理時間の累計値Ｔは、要求数（要求の処理に要する単位処理時間）の総和を

とすると、

で与えられる。図７の例において、ノード１〜５それぞれの処理時間の累計値

は、

で求めることができる。この結果、ノード１の処理時間の累計値Ｔは１．３、ノード２の処理時間の累計値Ｔは０．７、ノード３の処理時間の累計値Ｔは１．３、ノード４、５の処理時間の累計値Ｔはそれぞれ１．０である。(下式では、小数点2桁以下を四捨五入)

[Prediction of processing time (Solution 1)]
The cumulative value T of the processing time is the sum of the number of requests (unit processing time required to process the request).

Then

Given in. In the example of FIG. 7, the cumulative value of the processing time of each of the nodes 1 to 5

teeth,

Can be obtained at. As a result, the cumulative value T of the processing time of the node 1 is 1.3, the cumulative value T of the processing time of the node 2 is 0.7, the cumulative value T of the processing time of the node 3 is 1.3, and the cumulative value T of the

nodes

4 and 5 is 1. The cumulative value T of the processing time is 1.0, respectively. (In the formula below, round off to two decimal places)

［処理時間の予測（解２）］
また、要求は有限時間内に全ての処理が完了する、という前提条件があることから、

となる。このため、各ノードの処理負荷は、

で求めることができる。ここで、

は単位行列を示す。 [Prediction of processing time (Solution 2)]
Also, since there is a precondition that all processing of the request is completed within a finite time,

Will be. Therefore, the processing load of each node is

Can be obtained at. here,

Indicates the identity matrix.

具体的に説明すると、

である。両辺に

をかけると、

となる。式１の

から式２の

を引くと、

となる。ゆえに、

を得る。 Specifically,

Is. On both sides

When you apply

Will be. Equation 1

From formula 2

When you pull,

Will be. therefore,

To get.

図７の例では、

の値は以下のように求められる。 In the example of FIG. 7,

The value of is calculated as follows.

よって、

Therefore,

を得る。これは、上記の解１の結果と同じである。

To get. This is the same as the result of Solution 1 above.

警告／対策算出処理（ステップＳ１４）
ここでは、処理時間の許容値をＴ_limit＝１．０と仮定する。上述したように、処理時間

と予測され、ノード１、3の処理時間が許容値を越えている。つまり、これは、ノード１、3に遷移してくる要求が過多であることを示す。 Warning / Countermeasure calculation process (step S14)
Here, it is assumed that the allowable value of the processing time is T _limit = 1.0. As mentioned above, processing time

It is predicted that the processing time of

nodes

1 and 3 exceeds the permissible value. That is, this indicates that there are too many requests to transition to the

nodes

1 and 3.

対策算出部２６は、処理時間の予測値と許容値とを比較する。ノード１とノード３の処理時間の予測値はともに「１．３」であり、許容値の「１．０」よりも大きい。この場合、対策算出部２６は、対策算出部２６は、警告メッセージ（例えば、「予測される処理時間が、許容される処理時間を超過しています」といったメッセージ）を出力するとともに、以下のような対策案を提示することができる。 The countermeasure calculation unit 26 compares the predicted value of the processing time with the permissible value. The predicted value of the processing time of the node 1 and the node 3 is both "1.3", which is larger than the allowable value "1.0". In this case, the countermeasure calculation unit 26 outputs a warning message (for example, a message such as "the predicted processing time exceeds the allowable processing time"), and the countermeasure calculation unit 26 is as follows. Can present various countermeasures.

［処理負荷の改善］
対策算出部２６は、予測値が許容値を超えたノード１とノード３について、「１．０に高速化するか、並列化でますか？」といった処理負荷の改善を促すメッセージを出力する。 [Improvement of processing load]
The countermeasure calculation unit 26 outputs a message prompting the improvement of the processing load, such as "Is it faster to 1.0 or parallelization?" For the node 1 and the node 3 whose predicted values exceed the permissible values.

［許容値の緩和］
対策算出部２６は、「Ｔ_limitを１．０から１．３に緩和できますか？」といった許容値の緩和を促すメッセージを出力する。 [Relaxation of tolerance]
The countermeasure calculation unit 26 outputs a message prompting the relaxation of the allowable value, such as "Can the _{T limit be relaxed from 1.0 to 1.3?".}

［要求の緩和］
対策算出部２６は、「

の要求量を緩和できますか？」
といった要求の緩和を促すメッセージを出力する。 [Relaxation of demand]
The countermeasure calculation unit 26 is described as "

Can you relax your demands? "
A message prompting the relaxation of such a request is output.

［遷移確率の調整］
図８に、遷移確率調整用テーブルを示す。遷移確率調整用テーブルの列方向に遷移先ノードが示され、行方向に遷移元モードが示されている。テーブル中、太字で示したノード及び数値は、処理時間が超過しているノードへ要求を遷移させている遷移元のノードとその遷移確率を示す。
対策算出部２６は、図８に示した遷移確率調整用テーブルを出力するとともに、処理時間が超過しているノードへ要求を遷移させている遷移元のノードについて、遷移確率を下げることを促すメッセージを出力する。例えば、対策算出部２６は、ノード３に遷移する確率、すなわち、「Ｐ１３＝１/４」と「Ｐ２３＝１/１」と「Ｐ４３＝１/３」の遷移確率を下げることを促すメッセージを出力する。 [Adjustment of transition probability]
FIG. 8 shows a transition probability adjustment table. The transition destination node is shown in the column direction of the transition probability adjustment table, and the transition source mode is shown in the row direction. In the table, the nodes and numerical values shown in bold indicate the transition source node that is transitioning the request to the node whose processing time has been exceeded and the transition probability thereof.
The countermeasure calculation unit 26 outputs the transition probability adjustment table shown in FIG. 8, and prompts the node of the transition source that transitions the request to the node whose processing time has exceeded to lower the transition probability. Is output. For example, the countermeasure calculation unit 26 sends a message urging the probability of transition to the node 3, that is, the transition probability of "P13 = 1/4", "P23 = 1/1", and "P43 = 1/3". Output.

以上説明した本実施形態のノード管理装置２１によれば、要求を有限時間内に処理する任意の分散処理システムについて、実際の処理結果に基づいて、任意の要求数に対する各ノードの処理負荷を定量的に予測できる。この予測では、過負荷のノードやその負荷だけでなく、過負荷の要因や低負荷のノードを推定できるため、負荷分散や輻輳制御などの対策をより早く、より詳細に立てることができる。 According to the node management device 21 of the present embodiment described above, the processing load of each node for an arbitrary number of requests is quantified based on the actual processing result for any distributed processing system that processes requests within a finite time. Can be predicted. In this prediction, not only the overloaded node and its load but also the cause of the overload and the low load node can be estimated, so that measures such as load balancing and congestion control can be set up faster and in more detail.

ノード管理装置２１において、遷移検出部２２、遷移集計分析部２３、処理負荷予測部２４、要求数取得部２５、対策算出部２６及び許容負荷取得部２７のそれぞれの機能を有する１つの制御部が設けられても良い。また、その制御部は、ＣＰＵ等を具備したコンピュータで実現されても良い。この場合、コンピュータがプログラムを実行することで、遷移検出部２２、遷移集計分析部２３、処理負荷予測部２４、要求数取得部２５、対策算出部２６及び許容負荷取得部２７のそれぞれの機能を実現しても良い。 In the node management device 21, one control unit having the functions of the transition detection unit 22, the transition aggregation analysis unit 23, the processing load prediction unit 24, the request number acquisition unit 25, the countermeasure calculation unit 26, and the allowable load acquisition unit 27 is It may be provided. Further, the control unit may be realized by a computer equipped with a CPU or the like. In this case, by executing the program by the computer, each function of the transition detection unit 22, the transition aggregation analysis unit 23, the processing load prediction unit 24, the request number acquisition unit 25, the countermeasure calculation unit 26, and the allowable load acquisition unit 27 can be performed. It may be realized.

プログラムは、遷移検出部２２、遷移集計分析部２３、処理負荷予測部２４、要求数取得部２５、対策算出部２６及び許容負荷取得部２７のそれぞれの機能に対応する処理をコンピュータが実行するためのものであって良い。プログラムは、上述したコンピュータ使用可能媒体またはコンピュータ可読媒体で提供されてもよく、また、インターネット等のネットワークを介して提供されてもよい。 In the program, the computer executes the processes corresponding to the functions of the transition detection unit 22, the transition aggregation analysis unit 23, the processing load prediction unit 24, the request number acquisition unit 25, the countermeasure calculation unit 26, and the allowable load acquisition unit 27. It may be one. The program may be provided on the computer-enabled or computer-readable medium described above, or may be provided via a network such as the Internet.

また、ノード管理装置２１において、以下のような変形を適用することができる。
遷移検出部２２は、ノードが要求を処理した際に発生した処理負荷と、要求の遷移先ノードの情報を集計する。ここで、各ノードが、処理負荷と遷移先ノードの情報を自発的に報告しても良いが、遷移検出部２２が、処理状態や遷移結果をノードに問い合わせても良い。また、遷移検出部２２が、ノードの通信を監視して、要求の処理開始のタイミングや、要求の受け渡しを検出しても良い。 Further, in the node management device 21, the following modifications can be applied.
The transition detection unit 22 aggregates the processing load generated when the node processes the request and the information of the request transition destination node. Here, each node may voluntarily report the processing load and the information of the transition destination node, but the transition detection unit 22 may inquire the node about the processing state and the transition result. Further, the transition detection unit 22 may monitor the communication of the node to detect the timing of starting the processing of the request and the passing of the request.

また、ノード内に要求が処理されるまでの待ち行列がある場合は、ノード自身、処理の開始・終了タイミングなどから処理負荷を求めても良い。一方、ノード内に待ち行列が無い場合、すなわち、ノード内では各要求が一定負荷の処理を受けるような場合には、遷移時刻などを知り得た、ノード以外の第３者が処理負荷を類推しても良い。
例えば、分散処理システムが生産ラインシステムである場合、各ノードに待ち行列がある場合がある。この場合、ノード(検査工程）で要求（仕掛品）を処理（検査）した際の負荷（検査を開始・終了した時刻）と、遷移先ノード（検査の結果、判定された次の工程）を、ノード自身が求めても良い。遷移集計分析部２３は、ノードからの報告に基づいて情報を集計する。 Further, if there is a queue in the node until the request is processed, the processing load may be obtained from the node itself, the start / end timing of processing, and the like. On the other hand, when there is no queue in the node, that is, when each request receives a constant load of processing in the node, a third party other than the node who can know the transition time etc. infers the processing load. You may.
For example, if the distributed processing system is a production line system, each node may have a queue. In this case, the load (time when the inspection was started / ended) when the request (work in process) was processed (inspected) by the node (inspection process) and the transition destination node (next process determined as a result of the inspection) are determined. , The node itself may ask for it. The transition aggregation analysis unit 23 aggregates information based on the report from the node.

また、ノードは、移動体通信の基地局であっても良い。この場合、遷移集計分析部２３は、ノードが送信するハンドオーバーの制御情報の通信を監視し、要求（ユーザ）がノード（基地局）間を遷移（ハンドオーバー）した時刻と遷移先のノード（ターゲットの基地局）を検出しても良い。また、各ユーザは一定の処理負荷をノードに与えると仮定して、遷移集計分析部２３は、遷移（ハンドオーバー）した時刻から処理負荷を推定しても良い。
処理負荷と遷移先ノードの情報の集計の網羅性は、全ての遷移に関するものでも良く、あるいは、無作為に抽出されたサンプルでも良い。ここで、サンプルは、無作為に抽出された要求の全ての遷移、または、無作為に抽出された要求の無作為に抽出された遷移であっても良い。 Further, the node may be a base station for mobile communication. In this case, the transition aggregation analysis unit 23 monitors the communication of the control information of the handover transmitted by the node, and the time when the request (user) transitions (handover) between the nodes (base stations) and the transition destination node (handover). The target base station) may be detected. Further, assuming that each user applies a constant processing load to the node, the transition aggregation analysis unit 23 may estimate the processing load from the transition (handover) time.
The completeness of the aggregation of the processing load and the information of the transition destination node may be related to all transitions, or may be a sample randomly selected. Here, the sample may be all transitions of randomly sampled requests, or randomly sampled transitions of randomly sampled requests.

処理負荷予測部２４は、予測の前提となる要求数として、分散処理システム１０に入力される実際の要求数を用いても良い。ここで、要求数は、過去の要求数だけでなく、現在入力されている要求数でも良い。後者の場合でも、実際の処理をする直前に処理負荷を予測できる。あるいは、処理負荷予測部２４は、何らかのシミュレーションに基づいて、将来発生する要求数を予測しても良い。または、実際の要求数と部分的なシミュレーションの要求数を加算したものを、分散処理システム１０に入力される要求数としても良い。 The processing load prediction unit 24 may use the actual number of requests input to the distributed processing system 10 as the number of requests that are the premise of prediction. Here, the number of requests may be not only the number of requests in the past but also the number of requests currently input. Even in the latter case, the processing load can be predicted immediately before the actual processing. Alternatively, the processing load prediction unit 24 may predict the number of requests that will occur in the future based on some simulation. Alternatively, the number of requests input to the distributed processing system 10 may be the sum of the actual number of requests and the number of partial simulation requests.

処理負荷の許容値は、例えば、ユーザインターフェースから対話的に入力されても良い(例えば、要求数wが、単発的に発生する場合)。あるいは、上記の予測の前提とした要求数の発生頻度を、処理負荷の許容値としても良い（例えば、要求数

が、連続的に発生する場合、その発生周期を処理時間の許容値とする）。
処理負荷が許容値を超えた場合の通知先は、例えば、分散処理システムの管理者または状態監視装置、ノードの負荷分散の担当者または負荷分散装置、要求の受付システム、または輻輳制御装置である。ここで、負荷分散装置は、処理負荷の予測値が許容値を超えていないノードの処理能力の一部を、処理負荷が許容値を超えているノードの処理に割り当てても良い。 The allowable value of the processing load may be input interactively from the user interface, for example (for example, when the number of requests w occurs sporadically). Alternatively, the frequency of occurrence of the number of requests based on the above prediction may be used as the allowable value of the processing load (for example, the number of requests).

However, if it occurs continuously, the generation cycle is taken as the allowable value of the processing time).
When the processing load exceeds the allowable value, the notification destination is, for example, the administrator or status monitoring device of the distributed processing system, the person in charge of load balancing or the load balancing device of the node, the request receiving system, or the congestion control device. .. Here, the load balancer may allocate a part of the processing capacity of the node whose predicted processing load does not exceed the allowable value to the processing of the node whose processing load exceeds the allowable value.

予測値が許容値を超えているノードと処理負荷を通報する際に、処理負荷予測部２４は、処理負荷の予測値が許容値以下のノードとその処理負荷についても併せて通知しても良い。この通知により、分散処理（例えば、予測値が許容値未満のノードから、予測値が許容値以上のノードに処理のリソースを配置換えする処理）、あるいは、輻輳制御（例えば、負荷の高いノードではなく負荷の低いノードに、要求の一部を遷移させる処理）ができる。 When reporting a node whose predicted value exceeds the allowable value and a processing load, the processing load prediction unit 24 may also notify a node whose predicted value of the processing load is equal to or less than the allowable value and its processing load. .. By this notification, distributed processing (for example, processing that relocates processing resources from a node whose predicted value is less than the allowable value to a node whose predicted value is more than the allowable value) or congestion control (for example, in a node with a high load). It is possible to perform a process of transitioning a part of the request to a node with a low load.

以上説明した各実施形態のノード管理装置は、本発明の一例であり、その構成や動作については、発明の趣旨を逸脱しない範囲で、当業者が理解し得る変更を適用することができる。 The node management device of each embodiment described above is an example of the present invention, and changes that can be understood by those skilled in the art can be applied to the configuration and operation thereof without departing from the spirit of the invention.

本発明は、互いに連携して要求を処理する複数のノードからなるシステムに適用することができる。例えば、本発明は、分散処理システムの負荷分散、移動体通信のトラフィック制御・輻輳制御、生産ラインの生産管理や進捗管理に適用することができる。 The present invention can be applied to a system consisting of a plurality of nodes that process requests in cooperation with each other. For example, the present invention can be applied to load balancing of a distributed processing system, traffic control / congestion control of mobile communication, production management and progress management of a production line.

１１ノード管理装置
１２分析部
１３予測部
１５₁〜１５_n ノード 11 Node management device 12 Analysis department 13 Forecasting department 15 ₁ to 15 _n nodes

Claims

For a plurality of nodes that process requests in cooperation with each other, an analysis unit that analyzes the processing load when each node processes the request and the transition between the nodes of the request,
A node management device including a prediction unit that predicts the processing load of each node for a desired number of requests based on the analysis result of the processing load and the transition.

The analyzing unit, on the basis of the result of node aggregates the processing load at the time of processing the request, and a transition destination node information indicating a transition destination of requests processed by said node, and aggregated, the processing The node management device according to claim 1, which analyzes the tendency of the load and the transition destination.

The analysis unit calculates the transition probability for each unit processing load of the request between the nodes.
The node management device according to claim 1 or 2, wherein the prediction unit predicts the processing load of each node based on the transition probability of the request.

The analyzing unit, for said plurality of nodes, the number of requests that transitions between the nodes counted for each unit processing load to calculate the transition probability matrix based on the value obtained by counting,
The node management device according to claim 3, wherein the prediction unit predicts the processing load of each node based on the transition probability matrix.

A claim that further has a countermeasure unit for each node, which compares the predicted value of the processing load predicted by the prediction unit with the permissible value and outputs a predetermined message when the predicted value exceeds the permissible value. The node management device according to any one of 1 to 4.

The node management device according to claim 5, wherein the countermeasure unit outputs a message indicating measures for reducing the processing load of the node whose predicted value exceeds the permissible value.

For a plurality of nodes that process requests in cooperation with each other, the processing load when each node processes the request and the transition between the nodes of the request are analyzed.
A node management method comprising predicting the processing load of each node for a desired number of requests based on the processing load and transition analysis results.

For a plurality of nodes that process requests in cooperation with each other, a process of analyzing the processing load when each node processes the request and a transition between the nodes of the request, and a process of analyzing the transition between the nodes of the request.
A program for causing a computer to execute a process of predicting the processing load of each node for a desired number of requests based on the analysis result of the processing load and the transition.