JP5246872B2 - Storage system and storage management method

Info

Publication number: JP5246872B2
Application number: JP2009080148A
Authority: JP
Inventors: 洋俊赤池; 和久藤本
Original assignee: Tohoku University NUC; Hitachi Ltd
Current assignee: Tohoku University NUC; Hitachi Ltd
Priority date: 2009-03-27
Filing date: 2009-03-27
Publication date: 2013-07-24
Anticipated expiration: 2029-03-27
Also published as: JP2010231636A

Description

The present invention relates to a storage device system (hereinafter also referred to as a "storage system") that stores computer data, and more particularly to a hierarchical storage system in which a plurality of storage devices, each composed of a plurality of disk devices, are arranged in tiers.

In recent years, the rapid increase in energy consumption that accompanies the growth of data centers has become a problem. As data volumes grow rapidly, the share of that energy consumed by storage systems is rising. In particular, high-performance, large-capacity storage systems used for online transaction systems and HPC (High Performance Computing) systems account for a growing share. Storage systems now consume roughly 20% to 30% of the energy used in a data center, and it has been reported that this share will continue to rise as data volumes surge. Reducing the power consumption of storage systems is therefore considered one of the important challenges ahead.

As means for solving this problem, Patent Documents 1 and 2 disclose techniques for controlling the power on/off of hard disks mounted in a storage system. The method of Patent Document 1 performs power control on individual hard disks within a hard disk group that constitutes a RAID (Redundant Arrays of Inexpensive Disks). A product using this method additionally provides a plurality of hard disks that are always running, so that access requests can be answered without delay (Non-Patent Document 1).

The method of Patent Document 2 powers off, or places in a power-saving state, the hard disks of a RAID hard disk group that is not being accessed.

As another means for solving the problem, Non-Patent Document 2 discloses a technique that uses a hierarchical storage system. In a hierarchical storage system that tiers a high-speed storage device with a low-power, large-capacity storage device, newly generated data is first stored in the high-speed storage device, and data whose access frequency has dropped is migrated from the high-speed storage device to the low-power, large-capacity storage device. This limits capacity growth in the power-hungry high-speed storage device and thereby suppresses the increase in energy consumption of the system as a whole.

As yet another means for solving the problem, Patent Document 3 discloses a technique, for storage systems used for databases, that controls the rotational speed of hard disks using information held by the database management system. Hard disks are normally spun down to reduce power consumption. The query plan (a sequence of access operations to the storage device) created by the database management system is used to learn in advance which hard disks will be accessed, and those hard disks are spun up and down accordingly.

Patent Document 1: US Patent Application Publication No. 2004/0054939
Patent Document 2: JP 2000-293314 A
Patent Document 3: JP 2007-293479 A

Non-Patent Document 1: [online], [retrieved March 16, 2009], Internet <URL: http://www.copansys.com/pdfs/Revolution200TDataSheet.pdf>
Non-Patent Document 2: ILM and Tiered Storage. Storage Networking Industry Association, 2006.

As a power-saving method for storage systems, the prior art disclosed in Patent Document 1 and Non-Patent Document 1 provides a plurality of always-running hard disks so that access requests can be answered without delay. However, when the requested data is not stored on those hard disks, a hard disk in the power-saving state must be returned to the operating state before it can be accessed. The resulting response penalty is large, so this approach cannot be applied to applications that require high performance.

The prior art disclosed in Patent Document 2 is passive power control that powers off a hard disk when there is no access. As with the method above, it is difficult to respond to access requests without delay, so this approach also cannot be applied to applications that require high performance.

In the prior art disclosed in Non-Patent Document 2, migrating data frequently to the low-power, large-capacity storage device in order to limit the capacity of the power-hungry high-speed storage device increases the proportion of requested data that resides on the slower low-power, large-capacity storage, and the performance of the system as a whole degrades. Performance and power saving are thus in a trade-off relationship, and it is difficult to achieve both.

In the prior art disclosed in Patent Document 3, the database management system itself, that is, the application, can determine from the query plan (a sequence of access operations to the storage device) when and which hard disk it will access. However, this approach cannot be applied to batch-processing applications in which the application cannot determine the execution start time of its own processing, such as scientific computation executed on an HPC system.

Accordingly, an object of the present invention is to achieve both high performance and low power consumption even when the application itself cannot determine the execution start time of the processing executed on a computer.

To solve the problems described above, one embodiment of the present invention has the following configuration. Specifically, it is a storage system comprising: a first storage device (corresponding to reference numeral 11) that is connected to a plurality of computers to which a first management device (corresponding to reference numeral 18) is connected, and that has one or more first volumes (corresponding to reference numeral 51) composed of one or more first hard disk devices (corresponding to reference numeral 42); a second storage device (corresponding to reference numeral 12) that is connected to the first storage device and has one or more second volumes (corresponding to reference numeral 52) composed of one or more second hard disk devices (corresponding to reference numeral 43); and a second management device (corresponding to reference numeral 19) connected to the first storage device, the second storage device, and the first management device.

The first management device holds information on the jobs executed sequentially on the computers (job information) and information on the queue of jobs that are running or waiting to run (job queue information). The second management device has means for collecting the job information and means for collecting the job queue information (corresponding to reference numeral 24), and analysis means (corresponding to reference numeral 25) for analyzing the collected job information and job queue information. The analysis means has means for identifying, from the job information, the second volume that a job accesses, and means for calculating, from the job queue information, the average waiting time until each waiting job starts executing.

The second storage device has means for controlling the power supply of the second hard disk devices. All data is stored in the second volumes, and the second hard disk devices constituting any second volume that is not being accessed are powered off.
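The power-off rule in the preceding paragraph can be illustrated with a minimal sketch. The volume and disk names and the function `disks_to_power_off` are hypothetical and do not appear in the patent; the sketch only assumes that each second volume is built on a known set of second hard disk devices.

```python
from typing import Dict, List, Set


def disks_to_power_off(volume_disks: Dict[str, List[str]],
                       accessed_volumes: Set[str]) -> Set[str]:
    """Return the second hard disks that may be powered off.

    A disk stays on if any currently accessed volume is built on it.
    """
    keep_on: Set[str] = set()
    for volume in accessed_volumes:
        keep_on.update(volume_disks.get(volume, []))
    all_disks = {disk for disks in volume_disks.values() for disk in disks}
    return all_disks - keep_on


# Hypothetical layout: two second volumes, each on its own pair of disks.
layout = {
    "vol_usr1": ["hdd0", "hdd1"],
    "vol_usr2": ["hdd2", "hdd3"],
}
idle = disks_to_power_off(layout, accessed_volumes={"vol_usr1"})
```

Here only the disks behind `vol_usr2` are candidates for power-off, because `vol_usr1` is in use.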

The second management device has: means for calculating a threshold time, which is the time required to power on the second hard disk devices constituting the second volume identified by the analysis means, bring them to the operating state, and copy that second volume to a first volume; means for comparing the average waiting time with the threshold time; and means for issuing instructions to control the power supply of the second hard disk devices and instructions to copy data between the first storage device and the second storage device. When a job is submitted and its average waiting time is shorter than its threshold time, the second management device issues to the first management device an instruction to delay execution of the job by at least the threshold time from the time of submission.
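As a rough illustration of the comparison described above, the following sketch computes a threshold time and the delay to request at submission. It assumes, purely for illustration, that the threshold time is the spin-up time plus the volume size divided by a sustained copy rate; the patent text here does not specify how the threshold time is estimated, and the names `VolumeProfile`, `threshold_time_s`, and `required_delay_s` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class VolumeProfile:
    size_gb: float         # amount of data to copy from the second volume
    spin_up_s: float       # time to bring the second hard disks to the operating state
    copy_rate_gb_s: float  # assumed sustained copy rate between the two tiers


def threshold_time_s(volume: VolumeProfile) -> float:
    """Assumed model: power-on time plus copy time."""
    return volume.spin_up_s + volume.size_gb / volume.copy_rate_gb_s


def required_delay_s(avg_wait_s: float, threshold_s: float) -> float:
    """Delay to request from the first management device at submission.

    If the average waiting time is shorter than the threshold time, the job
    is delayed by (at least) the threshold time. Otherwise no delay is
    requested. Equality is treated as "not shorter", which is an assumption.
    """
    return threshold_s if avg_wait_s < threshold_s else 0.0


profile = VolumeProfile(size_gb=200.0, spin_up_s=30.0, copy_rate_gb_s=0.5)
threshold = threshold_time_s(profile)            # 30 s + 400 s
short_queue = required_delay_s(120.0, threshold)
long_queue = required_delay_s(900.0, threshold)
```

With these hypothetical numbers the threshold is 430 seconds, so a job expected to wait 120 seconds is held back by 430 seconds, while a job expected to wait 900 seconds needs no delay.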

The second management device also powers on the second hard disk devices constituting the second volume that the job accesses, bringing them to the operating state, and issues to the second storage device and the first storage device an instruction to copy that second volume to the first volume. After the copy of the second volume to the first volume has finished, it issues to the second storage device an instruction to power off the second hard disk devices constituting that second volume.
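The order of instructions in this paragraph can be sketched as follows. `CommandLog` is a hypothetical stand-in for whatever channel the second management device uses to reach the storage devices, and the action names are invented for illustration. The calls are synchronous, so the power-off instruction is only recorded after the copy instructions, mirroring the "after the copy has finished" condition.

```python
from typing import List, Tuple

Command = Tuple[str, str, str]  # (target device, action, volume)


class CommandLog:
    """Hypothetical instruction channel that simply records what was issued."""

    def __init__(self) -> None:
        self.sent: List[Command] = []

    def issue(self, target: str, action: str, volume: str) -> None:
        self.sent.append((target, action, volume))


def stage_volume(log: CommandLog, second_volume: str) -> None:
    # 1. Power on the disks behind the second volume.
    log.issue("second_storage", "power_on", second_volume)
    # 2. Instruct both storage devices to copy the volume to the first tier.
    log.issue("second_storage", "copy_to_first", second_volume)
    log.issue("first_storage", "copy_to_first", second_volume)
    # 3. Once the copy is done, power the second-tier disks off again.
    log.issue("second_storage", "power_off", second_volume)


log = CommandLog()
stage_volume(log, "vol_usr1")
actions = [action for _, action, _ in log.sent]
```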

When a job is submitted and its average waiting time is longer than its threshold time, the second management device powers on the second hard disk devices constituting the second volume that the job accesses, bringing them to the operating state, and issues to the second storage device and the first storage device an instruction to copy that second volume to a first volume. It does so no later than immediately before the job's average waiting time reaches its threshold time.

Other problems disclosed in this application, and their solutions, will become apparent from the description of the embodiments and the drawings.
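For the case in which the average waiting time exceeds the threshold time, one possible reading is that staging must begin while at least one threshold time of expected waiting remains. The sketch below encodes that reading; it is an interpretation offered for illustration, not a formula stated in the patent, and the function name `latest_staging_start_s` is hypothetical.

```python
def latest_staging_start_s(submit_time_s: float,
                           avg_wait_s: float,
                           threshold_s: float) -> float:
    """Latest time at which staging can start and still finish in time.

    Assumes the job is expected to start at submit_time_s + avg_wait_s and
    that staging takes threshold_s.
    """
    if avg_wait_s < threshold_s:
        raise ValueError("wait is shorter than the threshold: delay the job instead")
    return submit_time_s + (avg_wait_s - threshold_s)


# Hypothetical numbers: submitted at t = 1000 s, 900 s expected wait,
# 430 s needed to power on the disks and copy the volume.
start_by = latest_staging_start_s(1000.0, 900.0, 430.0)
```

Starting earlier than `start_by` is always allowed under this reading; starting later would risk the job beginning before its data reaches the first volume.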

According to the present invention, both high performance and low power consumption can be achieved even when the application itself cannot determine the execution start time of the processing executed on a computer.

A diagram showing a configuration example of a computer system including a storage system.
A diagram showing an example of the functional configuration of the computer management server and the storage management server.
A diagram showing an example of the configuration of the first tier storage apparatus.
A diagram showing an example of the configuration of the second tier storage apparatus.
A diagram showing an example of the configuration of the file server.
A diagram showing an example of the state of the JOB queue.
A diagram showing an example of the JOB queue information table 1 managed by the computer management server.
A diagram showing an example of the JOB queue information table 2 managed by the storage management server.
A diagram showing an example of the correspondence among the file storage directory, the file storage virtual volume, and the file storage first and second volumes when no volume is staged.
A diagram showing an example of the volume management table.
A diagram showing an example of the volume usage status management table.
A diagram showing an example of determining the volume staging timing among the file server and the first and second tier storage apparatuses, and of the staging/destaging procedure.
A diagram showing an example of determining the volume staging timing among the file server and the first and second tier storage apparatuses, and of the staging/destaging procedure.
A diagram showing an example of determining the volume staging timing among the file server and the first and second tier storage apparatuses, and of the staging/destaging procedure.
A diagram showing an example of a computer execution script.
A diagram showing an example of the correspondence among the file storage directory, the file storage virtual volume, and the file storage first and second volumes when a volume is staged.
A diagram showing another example of determining the volume staging timing among the file server and the first and second tier storage apparatuses, and of the staging/destaging procedure.
A diagram showing another example of determining the volume staging timing among the file server and the first and second tier storage apparatuses, and of the staging/destaging procedure.
A diagram showing another example of determining the volume staging timing among the file server and the first and second tier storage apparatuses, and of the staging/destaging procedure.

Hereinafter, modes for carrying out the present invention (referred to as "embodiments") will be described with reference to the drawings.

≪First embodiment≫
FIG. 1 is a diagram showing a configuration example of a computer system including the storage system of the first embodiment. The computer system 1 has a storage system 2, an IP switch 16, computers 14, and a computer management server 18. The storage system 2 has a file server 13, a first tier storage apparatus (first storage device) 11, a second tier storage apparatus (second storage device) 12, a Fibre Channel (FC) switch 17, and a storage management server 19.

As shown in FIG. 1, the storage system 2 and the computers 14 are connected by connecting the file server 13 and the computers 14 via the IP switch 16. The computer management server 18 and the storage management server 19 are connected to each other via a LAN (Local Area Network) 15. The storage management server 19, the file server 13, the first tier storage apparatus 11, and the second tier storage apparatus 12 are likewise connected to one another via the LAN 15.

The first tier storage apparatus 11 is directly connected to the file server 13. The connection interface is typically one that uses a block-data transfer protocol, such as Fibre Channel or iSCSI (Internet Small Computer System Interface). The first tier storage apparatus 11 may also be connected to the file server 13 via a switch.

The second tier storage apparatus 12 is connected to the first tier storage apparatus 11 via the FC switch 17. Besides Fibre Channel, an interface using another block-data transfer protocol, such as iSCSI, may be used as the connection interface.

The first tier storage apparatus 11 has a file storage first volume (first volume) 51 for storing files on which the file server 13 performs input/output processing. The second tier storage apparatus 12 has a file storage second volume (second volume) 52 for storing files on which the file server 13 performs input/output processing. The first tier storage apparatus 11 also has a virtualization function that virtually presents a volume of the second tier storage apparatus 12 as a volume that the first tier storage apparatus 11 itself provides to the computers 14, that is, as a file storage virtual volume 61.

The labels "usr1", "usr2", ... shown on the file storage virtual volume 61 and the file storage second volume 52 in FIG. 1 denote the users who use those volumes for file input/output processing by the computers 14. In other words, each user is assigned volumes that the user can use in the first tier storage apparatus 11 and the second tier storage apparatus 12. The assignment can be changed according to, for example, how the computer system 1 is operated.
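The relationship between a per-user virtual volume and its backing volumes might be modeled as below. This is only a sketch: the class `FileStorageVirtualVolume`, its fields, and the volume identifiers are hypothetical, and the rule that a staged first-tier copy takes precedence is an assumption drawn from the staging idea rather than from this paragraph.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class FileStorageVirtualVolume:
    """Illustrative model of one file storage virtual volume (61)."""
    user: str                           # e.g. "usr1"
    second_volume: str                  # backing volume in the second tier
    first_volume: Optional[str] = None  # set only while the data is staged

    def backing_volume(self) -> str:
        # Prefer the staged copy in the first tier when one exists.
        return self.first_volume if self.first_volume else self.second_volume


vv = FileStorageVirtualVolume(user="usr1", second_volume="tier2/usr1")
before = vv.backing_volume()   # not staged: resolves to the second tier
vv.first_volume = "tier1/usr1"
after = vv.backing_volume()    # staged: resolves to the first tier
```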

FIG. 3 shows an example of the configuration of the first tier storage apparatus 11. The controller 31 has: a channel IF (interface) unit 32 that controls data write/read access from host devices such as the file server 13 and the computers 14; a disk IF (interface) unit 33 that is connected to a plurality of high-speed hard disks (first hard disk devices) 42 and controls data write/read access to them; a cache memory 34 that temporarily stores data written to or read from the high-speed hard disks 42; a control memory 38 that stores control data; and a coupling unit 35 that connects the channel IF unit 32, the disk IF unit 33, and the cache memory 34. The coupling unit 35 is typically composed of one or more switches, but may instead be composed of one or more common buses.

When the channel IF unit 32 receives a data write/read access from a host device, it controls data transfer to and from the cache memory 34. The disk IF unit 33 controls data transfer to and from the cache memory 34 when data is written to or read from the high-speed hard disks 42. Through this exchange of data between the channel IF unit 32 and the disk IF unit 33 via the cache memory 34, data is written from the host device to the high-speed hard disks 42 and read back from them. To perform this control, the channel IF unit 32 and the disk IF unit 33 have one or more processors (not shown). An internal LAN 37 is connected to these processors, and the storage management server 19, which is external to the first tier storage apparatus 11, is connected to the internal LAN 37 via the LAN 15.

The configuration of the controller 31 described above is merely an example and is not limiting. It suffices for the controller 31 to have a function of writing data to and reading data from the high-speed hard disks 42 in response to data write/read requests from the computers 14.

The controller 31 may further have a power control unit 36 that controls the power on/off of the high-speed hard disks 42. In that case, the power control unit 36 is connected to the internal LAN 37.

The hard disk mounting unit 41A (41) has a hard disk power supply 46A (46) that supplies power to each of the high-speed hard disks 42. The high-speed hard disks 42 are grouped into RAID groups (RAID Gr. 1: 44), each composed of a plurality of high-speed hard disks 42.

The high-speed hard disks 42 are typically hard disks with a rotational speed of 10,000 rpm (revolutions per minute) or 15,000 rpm and an FC or SAS (Serial Attached SCSI) interface. Solid-state drives (SSDs), which have recently begun to be installed in storage apparatuses, may also be used. Doing so makes the first tier storage apparatus 11 even faster and lower in power consumption than when high-speed hard disks are used.

About one or two (two for a redundant configuration) hard disk power supplies 46A may be provided for each individual high-speed hard disk 42 or for each RAID Gr. 1: 44.

The power control unit 36 in the controller 31 is connected to the hard disk power supply 46A and controls its power on/off.

The power control unit 36 may be located in the hard disk mounting unit 41A rather than in the controller 31. It may also be connected directly to the storage management server 19.

The file storage first volume 51 mentioned in the description of FIG. 1 is formed on the area of a RAID Gr. 1: 44 composed of a plurality of high-speed hard disks 42.

FIG. 4 shows an example of the configuration of the second tier storage apparatus 12. The controller 71 has: a computer connection port 76 for connecting host devices such as the first tier storage apparatus 11; a disk connection port 78 for connecting a plurality of large-capacity hard disks (second hard disk devices) 43; a shared memory 73 that temporarily stores data written to or read from the large-capacity hard disks 43; and a processor 72. The computer connection port 76, the disk connection port 78, the processor 72, and the shared memory 73 are connected via a coupling unit 74. The coupling unit 74 is typically composed of a switch, but may instead be composed of a common bus.

When the processor 72 receives a data write/read access from a host device, it controls data transfer between the computer connection port 76 and the shared memory 73. When data is written to or read from the large-capacity hard disks 43, it also controls data transfer between the large-capacity hard disks 43 and the shared memory 73. Through this exchange of data between the computer connection port 76 and the large-capacity hard disks 43 via the shared memory 73, data is written from the host device to the large-capacity hard disks 43 and read back from them.

An internal LAN 77 is connected to the processor 72. The storage management server 19, which is external to the second tier storage apparatus 12, is connected to the internal LAN 77 via the LAN 15.

The configuration of the controller 71 described above is merely an example and is not limiting. It suffices for the controller 71 to have a function of writing data to and reading data from the large-capacity hard disks 43 in response to data write/read requests from the computers 14.

The controller 71 may further have a power control unit 75 that controls the power on/off of the large-capacity hard disks 43. In that case, the power control unit 75 is connected to the internal LAN 77.

The hard disk mounting unit 41B (41) and the hard disk power supply 46B (46) are the same as the corresponding components (41A, 46A) of the first tier storage apparatus 11 shown in FIG. 3, so their description is omitted.

The large-capacity hard disks 43 are typically large-capacity, low-power hard disks with a rotational speed of 7,200 rpm (revolutions per minute) or less and a SATA (Serial Advanced Technology Attachment) interface, whose power consumption per unit of capacity is lower than that of the high-speed hard disks 42. Hard disks with a power-saving function, such as lowering the rotational speed to reduce power consumption when no access arrives, may also be used.

The file storage second volume 52 mentioned in the description of FIG. 1 is formed on the area of a RAID Gr. 2: 45 composed of a plurality of large-capacity hard disks 43.

Although general configurations of the first tier storage apparatus 11 and the second tier storage apparatus 12 have been described with reference to FIG. 3 and FIG. 4, the apparatuses are not limited to those configurations. As for the I/O (Input/Output) processing performance required of the first tier storage apparatus 11, any apparatus whose I/O processing performance exceeds that of the second tier storage apparatus 12 will do. As for the specification required of the second tier storage apparatus 12, any apparatus that can provide the capacity needed by the computers 14 with fewer hard disks than the first tier storage apparatus 11 will do. In other words, any apparatus with low power consumption per unit of capacity will do.
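The two requirements in this paragraph can be checked with simple arithmetic. The sketch below uses invented capacities and performance figures and ignores RAID parity overhead; `disks_needed` and `satisfies_tier_roles` are hypothetical helper names, not terms from the patent.

```python
def disks_needed(required_capacity_gb: int, disk_capacity_gb: int) -> int:
    """Smallest number of disks whose raw capacity covers the requirement."""
    if required_capacity_gb <= 0 or disk_capacity_gb <= 0:
        raise ValueError("capacities must be positive")
    return -(-required_capacity_gb // disk_capacity_gb)  # ceiling division


def satisfies_tier_roles(first_io_perf: float, second_io_perf: float,
                         first_tier_disks: int, second_tier_disks: int) -> bool:
    """First tier must be faster; second tier must need fewer disks."""
    return first_io_perf > second_io_perf and second_tier_disks < first_tier_disks


# Hypothetical figures: 100 TB required, 300 GB high-speed disks,
# 1 TB large-capacity disks, and relative I/O performance of 2.0 vs 1.0.
required_gb = 100_000
first_tier_disks = disks_needed(required_gb, 300)
second_tier_disks = disks_needed(required_gb, 1_000)
ok = satisfies_tier_roles(2.0, 1.0, first_tier_disks, second_tier_disks)
```

Under these assumed numbers the second tier covers the same capacity with 100 disks instead of 334, which is the sense in which it consumes less power per unit of capacity.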

Here, the first tier storage apparatus 11 and the second tier storage apparatus 12 may be implemented as a single storage apparatus. For example, in the first tier storage apparatus 11, high-speed hard disks 42 and large-capacity hard disks 43 may be mounted together in the hard disk mounting unit 41A, RAID Gr. 1: 44 and RAID Gr. 2: 45 may be configured from the respective types of hard disk, and the file storage first volume 51 and the file storage second volume 52 may be formed on the areas of RAID Gr. 1: 44 and RAID Gr. 2: 45, respectively. This saves the power that the controller 71 of the second tier storage apparatus would otherwise consume.

FIG. 5 shows an example of the configuration of the file server 13. The file server 13 comprises an input/output controller 251, an input/output controller 252, a processor 250, and a memory 253. The input/output controller 251 is connected to the IP switch 16 and performs input/output processing of file data. The input/output controller 252 is connected to the first tier storage apparatus 11 and writes block data to and reads block data from the first tier storage apparatus 11. The memory 253 buffers and caches data passing between the input/output controller 251 and the input/output controller 252. The processor 250 runs LINUX (registered trademark) as its OS (Operating System), with NFS (Network File System) as the file system. This file system converts file data accessed from the host server into block data addresses. Management information needed for the conversion between file data and block data, such as a conversion table, is stored in the memory 253 by the processor 250. The OS is not limited to LINUX, and the file system is not limited to NFS. Any combination is acceptable as long as it can receive file data from the host server, convert it into block data, and access the first tier storage apparatus 11.

FIG. 2 shows the functional configuration of the computer management server 18 and the storage management server 19.
The computer management server 18 has a JOB management unit 21 that manages the JOBs (jobs) executed by the computer 14, a user management unit 22 that manages the users who request the computer 14 to execute JOBs, and an information providing unit 23 that serves as the interface for providing information on the JOBs executed by the computer 14 to the storage management server 19. Here, a job means a job that is executed sequentially in a batch-processing application.

Each functional unit used in describing the components of this embodiment, such as the JOB management unit 21 and the information analysis unit 25, may be configured logically by software (a program), may be configured in hardware by a dedicated LSI (Large Scale Integration) circuit or the like, or may be realized by a combination of software and hardware. When configured logically, each functional unit of the storage management server 19 is stored in the memory 94 (storage unit), and its function is realized when the processor 95 (control unit) executes the processing. Likewise, each functional unit of the computer management server 18 is stored in the memory 99, and its function is realized when the processor 98 (control unit for the computer management server) executes the processing.

The JOB management unit 21 has a submitted JOB management unit 201, a JOB scheduler 202, and an end JOB management unit 206. The JOB scheduler 202 has a waiting queue 203 and an execution queue 205.

To execute a calculation JOB (JOB) on the computer 14, the user creates the calculation execution script 234 shown in FIG. 13 and inputs it to the computer management server 18. The input is made through, for example, a GUI (Graphical User Interface) or CLI (Command Line Interface) provided by the computer management server 18, either directly on the computer management server 18 or from a client terminal (not shown) connected to it.

The input calculation execution script 234 is managed by the submitted JOB management unit 201 and is assigned to one of queue 1: 211, queue 2: 212, queue 3: 213, or queue 4: 214, which are provided in the waiting queue 203 in descending order of priority. The assignment, that is, the priority, may be determined from, for example, the number of CPUs (Central Processing Units) to be used (Number of CPUs) 301, the length of the maximum calculation time (MAX CPU TIME) 302, and the amount of main memory to be used (Memory Size) 303, all of which are described in the calculation execution script 234. Alternatively, the user may explicitly specify the priority in the calculation execution script 234. Within each queue, JOBs are executed in the order in which they were queued. Across queues 1 to 4 (211 to 214), JOBs are executed starting from queue 1: 211, which has the highest priority. After a JOB from a higher-priority queue has started, if the computer 14 still has free CPU resources and a JOB of the next priority can be executed, that JOB is also executed in parallel on the free CPUs. The same applies to JOBs of the priority after that. Running JOBs are managed in the execution queue 205, and when a JOB finishes, its management is transferred to the end JOB management unit 206.
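The queue assignment and execution order described above can be sketched as follows. This is only an illustrative sketch: the `Job` class, the function names, and in particular the thresholds in `choose_queue` are assumptions introduced here and do not appear in the specification, which leaves the concrete assignment rule open.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class Job:
    job_id: int
    num_cpus: int        # "Number of CPUs" (301) from the script
    max_cpu_time_s: int  # "MAX CPU TIME" (302) from the script, in seconds


def choose_queue(job: Job) -> int:
    """Return a queue index 0..3, where 0 stands for queue 1 (highest priority).

    The thresholds are invented for illustration only.
    """
    if job.max_cpu_time_s <= 600 and job.num_cpus <= 8:
        return 0
    if job.max_cpu_time_s <= 3600:
        return 1
    if job.max_cpu_time_s <= 86400:
        return 2
    return 3


def dispatch(queues: list, free_cpus: int) -> list:
    """Start JOBs from the highest-priority queue first, FIFO within a queue.

    A lower-priority queue is served only with the CPUs left over by the
    higher-priority queues.
    """
    started = []
    for queue in queues:  # queue 1 first, queue 4 last
        while queue and queue[0].num_cpus <= free_cpus:
            job = queue.popleft()
            free_cpus -= job.num_cpus
            started.append(job)
    return started


queues = [deque() for _ in range(4)]
for job in (Job(1, 4, 300), Job(2, 16, 7200), Job(3, 2, 1800)):
    queues[choose_queue(job)].append(job)
running = dispatch(queues, free_cpus=8)
```

With 8 free CPUs in this example, JOB 1 and JOB 3 start in parallel, while JOB 2 stays in its queue because it needs 16 CPUs.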

The user management unit 22 manages the users who use the computer 14 from the computer management server 18 or from a client terminal connected to the computer management server 18. Specifically, it handles user authentication and manages the user directories that store the files users use for their calculations. For this management, a protocol such as NIS (Network Information Service) or LDAP (Lightweight Directory Access Protocol) is used, for example.

The information providing unit 23 sends the storage management server 19 the calculation execution script 234 of each JOB, information indicating the execution order of the JOBs, and information on the users who use the computer 14 and the user directories those users use.

The storage management server 19 has the following units: an information collection unit 24 that receives, from the information providing unit 23 of the computer management server 18, the JOB information and JOB queue information for the JOBs executed by the computer 14; an information analysis unit 25 that analyzes the JOB information and JOB queue information so that it can be used for the storage apparatuses (11, 12); a volume management unit 26 that, based on the analyzed information, manages the mounting and unmounting of the volumes of the first tier storage apparatus 11 and the second tier storage apparatus 12 on the user directories managed by the file server 13, and manages the staging and destaging of files or volumes between the first tier storage apparatus 11 and the second tier storage apparatus 12; a user area management unit 27 that manages the user directories handled by the file server 13; and a storage management unit 28 that instructs the file server 13, the first tier storage apparatus 11, and the second tier storage apparatus 12 to allocate volumes and to mount or unmount volumes, and instructs the power control units 36 and 75 in the first tier storage apparatus 11 and the second tier storage apparatus 12 to control the power of the hard disks.

FIG. 6 shows the state of one queue (queue 4: 214) when one JOB has been submitted. Here, λ is the average rate at which JOBs are submitted to the queue, and μ is the average rate at which JOBs submitted to the queue are executed. The reciprocal of μ is the average JOB execution time Te, and the reciprocal of λ is the average JOB submission interval Ti. Tw is the average waiting time until a JOB is executed (in this figure, the average waiting time until the submitted JOB is executed). The method of calculating the average waiting time is described below.

FIG. 7 shows an example of the JOB queue information table 1: 70 stored in the JOB management unit 21 of the computer management server 18. The JOB queue information table 1: 70 stores values indicating the current state of each JOB. "JOB ID (Identifier)" 701 is the identification information that identifies a JOB. JOB IDs are assigned in the order in which JOBs are submitted. "JOB status" 702 indicates the current state of each JOB. "Running" indicates that the JOB is being executed. "Waiting" indicates that the JOB is waiting to be executed. A waiting JOB is given a priority that indicates its position in the execution order. Normally, a JOB submitted earlier is given a higher priority. However, the user may specify the priority, and the administrator of the computer management server 18 may change the priority based on the calculation conditions and the usage status of the computer 14. The format of the JOB queue information table 1: 70 in FIG. 7 is only an example and is not limited to the one shown. It is sufficient that the table contains at least the information described above.

FIG. 8 shows an example of the JOB queue information table 2: 80 stored in the memory 94 of the storage management server 19. "JOB ID" 801 is the identification information that identifies a JOB. "JOB status" 802 indicates the current state (queue state) of each JOB. "Running" indicates that the JOB is being executed. "Waiting" indicates that the JOB is waiting to be executed. A waiting JOB is given a priority that indicates its position in the execution order. "Tw" 803 indicates the average waiting time of each JOB. For a JOB that is "Running", such as JOB 1, the waiting time is 0, so no Tw is stored. "User ID" 804 is the identification information of the user who executes each JOB. "Target dir" 805 is the identification information of the directory used by each JOB. "Tth" 806 indicates the time required to power on the storage devices that constitute the logical volume used for executing each JOB, bring them into an operating state, and stage (copy) that logical volume, or the files to be used that are stored in it, to the file storage first volume 51 in the first tier storage apparatus 11. This time is called the threshold time. The format of the JOB queue information table 2: 80 in FIG. 8 is only an example and is not limited to the one shown. It is sufficient that the table contains at least the information described above.

FIG. 9 shows an example of the relationship between user directories and volumes before a user starts a calculation. Based on an instruction from the storage management unit 28, the file server 13 mounts the file storage virtual volume 61 as the file storage directory 81, which is a user directory.

Based on an instruction from the storage management unit 28, the controller 31 in the first tier storage apparatus 11 virtualizes the file storage second volume 52 in the second tier storage apparatus 12 as the file storage virtual volume 61 in the first tier storage apparatus 11, and the first tier storage apparatus 11 manages it. In this way, the first tier storage apparatus 11 can manage the volumes of the second tier storage apparatus 12 together with its own, which simplifies volume management.
Alternatively, the file storage second volume 52 may be mounted directly as the file storage directory 81.

The directory tree 101 shows an example of the relationship among the user directories, the file storage virtual volume 61, and the file storage second volume 52. Here, a separate file storage second volume 52 is allocated to each of the directories dir0 and dir1 under usr0 and to each of the directories dir0, dir1, and dir2 under usr1.

The correspondence (mapping) between the user directories (file storage directory 81) and the file storage second volumes 52 is managed by the user area management unit 27 in the storage management server 19. This correspondence is created or changed when a user issues a request from a client terminal connected to the storage management server 19.

All files handled by the user (including files input to and output from the computer 14) are stored in the file storage directory 81, that is, in the file storage second volume 52 in the second tier storage apparatus 12. Normally, while the file storage second volume 52 is not being accessed by the computer 14 or the user, and while no file or volume staging or destaging is in progress, the volume is unmounted and the large-capacity hard disks 43 that constitute it are powered off. This reduces the power consumption of the storage system 2 as a whole.

Instead of powering off the large-capacity hard disks 43, they may be spun down (the disk of the hard disk device is rotated at or below a predetermined rotation speed, which is stored in memory), spun off (the rotation of the disk is stopped), or put into a power-saving mode. This saves less power, but it shortens the time needed to start up the large-capacity hard disks 43 (the time until they are ready for input/output) before an access arrives.

The directory tree 101 is managed per user by, for example, the volume management table 100 shown in FIG. 10. The volume management table 100 is stored in the memory 94 of the storage management server 19. "User ID" 1001 is the identification information of each user. "dir ID" 1002 is the identification information of each directory of a user. "LU ID" 1003 is the identification information of the logical unit (logical volume) corresponding to each directory. "RAID ID" 1004 is the identification information of the RAID group that constitutes each LU. A RAID group is composed of a plurality of storage devices. The format of the volume management table in FIG. 10 is only an example and is not limited to the one shown. It is sufficient that the table contains at least the information described above.
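A minimal sketch of how the four columns of the volume management table could be used to resolve a user directory to its logical unit and RAID group is given below. The row type, the function name, and all sample values are assumptions made for illustration, since FIG. 10 is not reproduced in this text.

```python
from typing import NamedTuple, Optional


class VolumeRow(NamedTuple):
    user_id: str  # "User ID" 1001
    dir_id: str   # "dir ID" 1002
    lu_id: str    # "LU ID" 1003
    raid_id: str  # "RAID ID" 1004


# Sample rows following the usr0/usr1 layout of the directory tree 101.
# The LU and RAID identifiers are invented.
VOLUME_TABLE = [
    VolumeRow("usr0", "dir0", "LU00", "RG0"),
    VolumeRow("usr0", "dir1", "LU01", "RG0"),
    VolumeRow("usr1", "dir0", "LU10", "RG1"),
    VolumeRow("usr1", "dir1", "LU11", "RG1"),
    VolumeRow("usr1", "dir2", "LU12", "RG2"),
]


def find_volume(table, user_id: str, dir_id: str) -> Optional[VolumeRow]:
    """Return the row for (user, directory), or None if it is not mapped."""
    for row in table:
        if row.user_id == user_id and row.dir_id == dir_id:
            return row
    return None


row = find_volume(VOLUME_TABLE, "usr1", "dir2")
```

The RAID ID found this way identifies the group of hard disks that would have to be powered on before the directory can be staged.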

FIG. 11 shows the volume usage status management table 110. The volume usage status management table 110 indicates the usage status of each logical volume of the first tier storage apparatus 11. The volume usage status management table 110 is stored in the memory 94 of the storage management server 19. "LU ID" 1101 is the identification information of each logical volume of the first tier storage apparatus 11. "Usage status" 1102 indicates whether each logical volume of the first tier storage apparatus 11 is in use by a user (computer). "Size" 1103 indicates the capacity of each logical volume. The format of the volume usage status management table 110 in FIG. 11 is only an example and is not limited to the one shown. It is sufficient that the table contains at least the information described above.

FIG. 12 (a collective name for FIGS. 12A to 12C) shows the procedure for staging and destaging files in the storage system of this embodiment. The procedure is carried out in parallel for each queue in the waiting queue 203 (queue 1: 211 to queue 4: 214). The processor 95 acts as the processing entity that realizes the functions of the functional units (24, 25, and so on), and the procedure is repeated periodically.

First, in step 401, the information collection unit 24 of the storage management server 19 periodically acquires, from the information providing unit 23 of the computer management server 18, information indicating the execution order of all JOBs in the waiting queue 203 and the finished JOB information held by the end JOB management unit 206 (together, the JOB queue information). To acquire it, for example, a command for monitoring the JOB scheduler 202 of the computer management server 18 is sent to the computer management server 18, and the information is received as the response of the computer management server 18 to that command.

Then, in step 402, the previously received JOB queue information recorded in the JOB queue information table 2: 80 is compared, for each JOB ID, with the current state of each JOB recorded in the JOB queue information table 1: 70 received in step 401, to check whether the state of the relevant queue in the waiting queue 203 (one of queue 1: 211 to queue 4: 214) has changed. Here, a change in the queue state means at least one of the following: a new JOB was submitted, a JOB in the waiting queue started executing, a JOB in the waiting queue was cancelled, or the execution of a JOB finished. Specifically, it means that the value of JOB status 702 and the value of JOB status 802 differ for the record in question.
The JOB ID of a running JOB (JOB 1 in FIG. 7 and FIG. 8) is managed in the execution queue 205. The JOB ID of a JOB that has finished executing is managed by the end JOB management unit 206.
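The comparison in step 402 can be sketched as a difference between two snapshots that map JOB ID to JOB status. The function name, the status strings "running" and "waiting", and the `finished_ids` argument (standing in for the IDs held by the end JOB management unit 206) are assumptions for this sketch, not part of the specification.

```python
def diff_queue_state(previous: dict, current: dict, finished_ids: set) -> dict:
    """Classify the differences between two {job_id: status} snapshots.

    previous     -- statuses stored from the last poll (table 2: 80)
    current      -- statuses just received (table 1: 70)
    finished_ids -- IDs reported as finished by the scheduler
    """
    changes = {"submitted": [], "started": [], "finished": [], "cancelled": []}
    for job_id, status in current.items():
        if job_id not in previous:
            changes["submitted"].append(job_id)
        elif previous[job_id] != status and status == "running":
            changes["started"].append(job_id)
    for job_id in previous:
        if job_id not in current:
            # A JOB that vanished either finished or was cancelled.
            key = "finished" if job_id in finished_ids else "cancelled"
            changes[key].append(job_id)
    return changes


def queue_changed(changes: dict) -> bool:
    """True if any kind of change was detected (step 402: Yes)."""
    return any(changes.values())


previous = {1: "running", 2: "waiting", 3: "waiting"}
current = {2: "running", 3: "waiting", 4: "waiting"}
changes = diff_queue_state(previous, current, finished_ids={1})
```

In this example JOB 1 has finished, JOB 2 has started, and JOB 4 is newly submitted, so the queue state counts as changed.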

If the queue state has not changed (No in step 402), the procedure returns to step 401 and waits until the next acquisition of the queue state. If the queue state has changed (Yes in step 402), the JOB ID and JOB status columns (801, 802) of the JOB queue information table 2: 80 are replaced with the acquired contents, and the procedure moves to step 403.

In step 403, it is checked whether the change in the queue state is the end (completion) of JOB execution. If it is (Yes in step 403), the procedure moves to step 418. If it is not (No in step 403), the procedure moves to step 404. That the change in the queue state is the end of JOB execution can be confirmed from the fact that the JOB in question has moved from the execution queue 205 to the end JOB management unit 206.

Next, in step 404, if the change in the queue state is a JOB submission, the calculation execution script 234 of the JOB in the relevant queue of the waiting queue 203 is acquired and analyzed. Before the analysis is described, FIG. 13 shows an example of the calculation execution script 234, which describes information on the job (or calculation) to be executed by the computer 14. Because multiple users submit calculation execution scripts 234, FIG. 13 depicts several of them. The calculation execution script 234 includes at least the directory information 300, which specifies the directory that stores the calculation parameters and the calculation results. This information is needed to schedule the mounting and unmounting of the volumes of the first tier storage apparatus 11 and the second tier storage apparatus 12 on the user directories (file storage directory 81) managed by the file server 13, to schedule the staging and destaging of files between those user directories, and to manage the user directories handled by the file server 13.
The script further includes at least the number of CPUs to be used (Number of CPUs) 301, the maximum calculation time (MAX CPU TIME) 302, and the main memory capacity to be used (Memory Size) 303. JOBs are prioritized based on this information and are distributed by priority among the multiple queues.
The format of the calculation execution script in FIG. 13 is only an example and is not limited to the one shown. It is sufficient that the script contains at least the information described above.

The information analysis unit 25 extracts the directory name of the input/output files (the directory information 300) from the calculation execution script 234 of each JOB and enters it in the target dir (directory) column (805) of the JOB queue information table 2: 80, in the row of the corresponding JOB ID.

Furthermore, in all cases, not only when the change in the queue state is a JOB submission, the mean and variance of the JOB submission interval and the mean and variance of the JOB execution time are calculated from the JOB queue information of the relevant queue. These can be obtained by collecting, as statistical information each time the state of the relevant queue is acquired, the time interval from the submission of one JOB to the submission of the next, and the time interval from the start of one JOB to the start of the next, and then computing the mean and variance from the collected values.
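As a small illustration of this statistics collection, the mean and variance of the intervals between consecutive timestamps can be computed as follows. The function name and the sample timestamps (in seconds) are assumptions, and the population variance is used here as one possible choice, which the specification does not prescribe.

```python
from statistics import mean, pvariance


def interval_stats(timestamps):
    """Return (mean, variance) of the gaps between consecutive timestamps."""
    ordered = sorted(timestamps)
    gaps = [later - earlier for earlier, later in zip(ordered, ordered[1:])]
    if not gaps:
        raise ValueError("at least two timestamps are required")
    return mean(gaps), pvariance(gaps)


# Submission times of four JOBs, in seconds (invented values).
submit_times = [0, 60, 180, 240]
ti_mean, ti_var = interval_stats(submit_times)
```

Applying the same function to the times at which JOBs start executing gives the corresponding statistics for the execution time, following the description above.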

Next, in step 405, the average waiting time Tw until each JOB is executed is calculated from λ and μ, shown in FIG. 6, using a formula derived from queueing theory. The value is then entered in the Tw column (803) of the JOB queue information table 2: 80, in the row of the corresponding JOB ID. Using the variances associated with λ and μ in addition to their means allows the average waiting time to be calculated more accurately than with the means alone.
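The specification does not state which queueing-theory formula is used. As one possible illustration only, Kingman's approximation for a single-server queue estimates the mean wait from the means and variances of the submission interval Ti and the execution time Te. The choice of this formula and the function name are assumptions of this sketch, and the result is the mean wait of a newly arriving JOB rather than a per-JOB value.

```python
def kingman_wait(ti_mean: float, ti_var: float,
                 te_mean: float, te_var: float) -> float:
    """Approximate mean queueing delay Tw for a single-server queue.

    Tw ~= rho / (1 - rho) * (ca2 + cs2) / 2 * Te
    where rho = lambda / mu = Te / Ti, and ca2, cs2 are the squared
    coefficients of variation of the submission interval and execution time.
    """
    rho = te_mean / ti_mean           # utilization, lambda / mu
    if rho >= 1.0:
        return float("inf")           # the queue grows without bound
    ca2 = ti_var / ti_mean ** 2       # variability of submissions
    cs2 = te_var / te_mean ** 2       # variability of execution times
    return rho / (1.0 - rho) * (ca2 + cs2) / 2.0 * te_mean


# Exponential intervals (variance = mean squared) reduce to the M/M/1 result.
tw = kingman_wait(ti_mean=100.0, ti_var=10000.0, te_mean=50.0, te_var=2500.0)
```

For this example the estimate is 50 seconds, which matches the M/M/1 value ρ/(μ − λ) for λ = 0.01 and μ = 0.02 per second.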

Furthermore, when a JOB is submitted, the volume management unit 26 starts from the directory of the files the JOB uses for its calculation, extracted in step 405, and follows the directory tree 101 shown in FIG. 9 to identify the file storage second volume 52 in which those files are stored.
The threshold time Tth is then calculated. As described above, the threshold time is the time required to power on the large-capacity hard disks 43 of the RAID Gr. 2: 45 that constitutes the identified volume, bring them into an operating state, and stage (copy) the volume, or the files to be used that are stored in it, to the file storage first volume 51 in the first tier storage apparatus 11. Tth can be obtained from the size of the file or volume to be staged and the data transfer rate between the first tier storage apparatus 11 and the second tier storage apparatus 12. After Tth is calculated, the value is entered in the Tth column (806) of the JOB queue information table 2: 80, in the row of the corresponding JOB ID.
The RAID Gr. 2: 45 that constitutes the identified volume is identified from the RAID ID (1004) of the volume management table 100.
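A minimal sketch of the Tth calculation follows. The specification names the staged size and the inter-tier transfer rate as inputs. The separate `spin_up_s` term for powering on the disks is an assumption added here to reflect the definition of the threshold time, and the function name and sample figures are likewise illustrative.

```python
def threshold_time(size_bytes: int, transfer_bytes_per_s: float,
                   spin_up_s: float = 0.0) -> float:
    """Estimate Tth in seconds: disk start-up time plus copy time."""
    if transfer_bytes_per_s <= 0:
        raise ValueError("transfer rate must be positive")
    return spin_up_s + size_bytes / transfer_bytes_per_s


# 200 GB staged at 400 MB/s, with 30 s assumed for powering on the disks.
tth = threshold_time(200 * 10**9, 400 * 10**6, spin_up_s=30.0)
```

In this example the copy takes 500 seconds, so Tth is estimated at 530 seconds.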

In step 406, it is checked whether the change in the queue state is a JOB submission. If it is not (No in step 406), the procedure moves to step 409. If it is (Yes in step 406), the procedure moves to step 407. That the change in the queue state is a JOB submission can be confirmed from the presence of a new JOB (JOB ID) at the tail of the relevant queue.

In step 407, the average waiting time (Tw) until the submitted JOB is executed is compared with the threshold time (Tth) using the JOB queue information table 2: 80 shown in FIG. 8. If Tw is longer than Tth (No in step 407), the procedure returns to step 401. If Tw is equal to or less than Tth (Yes in step 407), the procedure moves to step 408.

In step 408, the information analysis unit 25 of the storage management server 19 notifies the JOB management unit 21 of the computer management server 18 to hold execution of the submitted JOB for at least the threshold time Tth. This is because, when Tw is equal to or less than Tth, staging of the files/directories that the JOB accesses during execution (in other words, the file storage second volume 52 to be accessed) to the first tier storage apparatus 11 cannot be completed before the JOB starts. In that case, the files/directories to be accessed are not present in the first tier storage apparatus 11, so input/output from the computer 14 to the first tier storage apparatus 11 results in an error. Alternatively, input/output is performed directly against the file storage second volume 52 of the second tier storage apparatus 12, which degrades input/output performance. To prevent this, execution of the JOB is delayed by at least the threshold time Tth from the time the JOB was submitted.

Next, in step 409, for every JOB in the relevant queue, the average waiting time (Tw) until JOB execution is compared with the threshold time (Tth) plus a fixed time (α). The comparison uses, for example, the JOB queue information table 2: 80. If Tw is longer than Tth plus α (No in step 409), the process returns to step 401. If Tw is equal to or less than Tth plus α (Yes in step 409), the process proceeds to step 410. When a JOB satisfies this condition, it means that execution of that JOB may begin after a time of at least Tth plus α. The staging of the file or volume therefore has to be performed from step 410 onward.

Here, the margin α is added because Tth may fluctuate with the operating status of the system. If the margin is too large, the files or volumes used by many JOBs are staged from the second tier storage apparatus 12 to the first tier storage apparatus 11, which increases the capacity required of the first tier storage apparatus 11. As a result, the number of high-speed hard disks, which consume more power, increases, and the power saving effect is reduced. For this reason, α is set to, for example, about 10% of Tth or less.
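
The branching of steps 406 to 409 can be sketched as a small decision function. This is a minimal illustration, not the patented implementation: the names `Decision`, `decide`, and `alpha_ratio` are hypothetical, and α is modeled as a fixed fraction of Tth (10% by default, following the example value in the text).

```python
from dataclasses import dataclass


@dataclass
class Decision:
    delay_s: float        # hold requested for the JOB (0.0 = no hold)
    start_staging: bool   # True = continue to step 410


def decide(is_submission: bool, tw: float, tth: float,
           alpha_ratio: float = 0.1) -> Decision:
    """Return what to do after a queue state change (steps 406 to 409)."""
    alpha = alpha_ratio * tth              # margin added to Tth
    delay_s = 0.0
    if is_submission:                      # step 406: JOB submission?
        if tw > tth:                       # step 407: No -> back to step 401
            return Decision(0.0, False)
        delay_s = tth                      # step 408: hold JOB for at least Tth
    # step 409: stage only when Tw <= Tth + alpha
    return Decision(delay_s, tw <= tth + alpha)
```

With Tth = 100 s, a newly submitted JOB whose Tw is 50 s is held for 100 s and staging starts at once, whereas an already queued JOB whose Tw is 120 s is left alone until Tw falls to 110 s or less.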

With this processing, even when the average waiting time Tw is larger than the threshold time Tth, the processing from step 410 onward is deferred until Tw becomes approximately equal to Tth. This avoids the waste of power that would result from staging far earlier than the JOB is executed and needlessly occupying capacity of the first tier storage apparatus 11 for a long time.

Next, in step 410, the storage management unit 28 instructs the second tier storage apparatus 12 to power on (ON) the large-capacity hard disks 43 of the target RAID Gr. 2: 45 constituting the file storage second volume 52 accessed from the computer 14. The target RAID group can be determined using the volume management table: 100 shown in FIG. 10.

In step 411, the process waits until power-on of the large-capacity hard disks 43 is complete and they are in the operating state.

Once they are in the operating state (Yes in step 411), in step 412 the storage management unit 28 instructs the file server 13 to mount the file storage second volume 52, which has been powered on and brought into operation, on the file storage directory 81. The storage management unit 28 then instructs the second tier storage apparatus 12 and the first tier storage apparatus 11 to stage (copy) the file storage second volume 52 to be accessed, or the files to be accessed that are stored in that volume, to the file storage first volume 51 of the first tier storage apparatus 11.

Here, the file storage first volume 51 comprises a plurality of volumes LU0 to LUn, and the copy destination must be chosen from among them. The destination is selected from the volumes that are unused and whose size is equal to or larger than that of the volume or file to be copied. The usage status and size of each volume of the file storage first volume 51 are managed in the volume usage status management table: 110 shown in FIG. 11, and the storage management unit 28 selects from this table 110 an unused volume that satisfies the size condition.
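
The selection rule above (unused, and at least as large as the data to be copied) can be sketched as follows. The `VolumeEntry` fields and the function name are hypothetical stand-ins for the columns of the volume usage status management table, and picking the smallest qualifying volume is an added assumption; the text only requires that the chosen volume be unused and large enough.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class VolumeEntry:
    lu: str        # volume name, e.g. "LU0"
    size_gb: int   # volume size
    in_use: bool   # usage status


def select_target_volume(table: Iterable[VolumeEntry],
                         required_gb: int) -> Optional[VolumeEntry]:
    """Pick an unused first-tier volume that can hold the staged data."""
    candidates = [v for v in table
                  if not v.in_use and v.size_gb >= required_gb]
    # Assumption: prefer the smallest fit to keep larger volumes free.
    return min(candidates, key=lambda v: v.size_gb, default=None)
```

If no volume qualifies, the function returns `None`, and the caller would have to defer staging or free a volume first.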

In step 413, the process waits until staging is complete.
When staging is complete (Yes in step 413), in step 414 the storage management unit 28 instructs the file server 13 to unmount the file storage second volume 52 (the accessed volume) that was mounted on the file storage directory 81, and to remount (switch) the directory onto the file storage first volume 51 for which staging has completed.

FIG. 14 shows the state of the directories and volumes after staging has finished and the mount has been switched. The figure shows the case in which the file storage second volume 52 accessed from the JOB is LU00. The files stored in LU00 are staged (copied/moved) 110 to LU0, which is unused and large enough to hold them. After the mount switch, the mapping between VLU00 (LU00) and the directory usr0/dir0 temporarily disappears.

In step 415, the process waits until the mount switch is complete.
When the mount switch is complete (Yes in step 415), in step 416 the storage management unit 28 of the storage management server 19 notifies the JOB management unit 21 of the computer management server 18 that staging has finished, that is, that data preparation is complete (staging completion information). Based on the staging completion information sent from the storage management server 19, the computer management server 18 checks, before starting each JOB, whether staging of the files needed by that JOB has completed. If staging has completed, the JOB is executed; if not, the JOB is executed after staging completes. In other words, execution of the JOB is delayed until the notification is received. This prevents a JOB from starting before staging completes, which would cause an input/output error or force input/output against the lower-performance second tier storage apparatus 12.
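
The gating of JOB execution on the staging completion notification can be illustrated with a small bookkeeping class. The class and method names are hypothetical; the text does not specify how the JOB management unit records the notification.

```python
class JobGate:
    """Tracks which JOBs have received a staging completion notice."""

    def __init__(self) -> None:
        self._staged = set()   # JOB IDs whose files are staged

    def notify_staging_complete(self, job_id: str) -> None:
        # Called when the storage management server reports completion
        # (step 416).
        self._staged.add(job_id)

    def may_start(self, job_id: str) -> bool:
        # A JOB may start only after its notification has arrived.
        return job_id in self._staged
```

A scheduler built on this sketch would call `may_start` before dispatching each JOB and keep the JOB queued while it returns `False`.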

Next, in step 417, the storage management unit 28 instructs the second tier storage apparatus 12 to power off (OFF) the large-capacity hard disks 43 of the RAID Gr. 2: 45 constituting the unmounted file storage second volume 52. After issuing the instruction, the process returns to step 401.
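
Steps 410 to 417 form a fixed sequence, which the following sketch writes out as an ordered plan. It only produces a list of step numbers and descriptions; the function name and the string wording are illustrative assumptions, and no real storage or file server API is invoked.

```python
def staging_plan(second_lu: str, first_lu: str,
                 directory: str, raid_group: str) -> list:
    """Return the staging sequence as (step number, action) pairs."""
    return [
        (410, f"power on {raid_group}"),
        (411, f"wait until {raid_group} is operational"),
        (412, f"mount {second_lu} on {directory}; "
              f"copy {second_lu} -> {first_lu}"),
        (413, "wait for staging to complete"),
        (414, f"unmount {second_lu}; mount {first_lu} on {directory}"),
        (415, "wait for mount switch to complete"),
        (416, "notify JOB management unit: staging complete"),
        (417, f"power off {raid_group}"),
    ]
```

The plan makes the power-saving property visible: the second-tier disks are powered on only between steps 410 and 417, and the JOB is released (step 416) only after the directory points at the first-tier volume.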

Next, the processing from step 418 onward is described; this processing is performed when JOB execution ends (Yes in step 403). In step 418, the storage management unit 28 instructs the second tier storage apparatus 12 to power on (ON) the large-capacity hard disks 43 of the target RAID Gr. 2: 45 constituting the file storage second volume 52 in which the files/directories accessed by the finished JOB were originally stored/mounted.

In step 419, the process waits until power-on of the large-capacity hard disks 43 is complete and they are in the operating state.

Once they are in the operating state (Yes in step 419), in step 420 the storage management unit 28 instructs the first tier storage apparatus 11 and the second tier storage apparatus 12 to destage (copy) the file storage first volume 51 of the first tier storage apparatus 11 that was accessed by the JOB, or the accessed files stored in that volume, to the original file storage second volume 52. After destaging is complete, the files stored in the file storage first volume 51 may be deleted at any time.

In step 421, the process waits until destaging is complete.
When destaging is complete (Yes in step 421), in step 422 the storage management unit 28 instructs the file server 13 to unmount the file storage first volume 51 (the volume that was accessed) that was mounted on the file storage directory 81, and to remount (switch) the directory onto the file storage second volume 52 for which destaging has completed.

In step 423, the process waits until the mount switch is complete.
When the mount switch is complete (Yes in step 423), in step 424 the storage management unit 28 instructs the file server 13 to unmount the destaged file storage second volume 52 from the file storage directory 81.

Next, in step 425, the storage management unit 28 instructs the second tier storage apparatus 12 to power off (OFF) the large-capacity hard disks 43 constituting the destaged file storage second volume 52. After issuing the instruction, the process returns to step 401.

According to the present embodiment, the necessary files can be staged to the first tier storage apparatus 11 at the time a JOB executed on the computer 14 needs them, so file input/output for the computer 14 can be accelerated by exploiting the high performance of the first tier storage apparatus 11. In addition, the large-capacity hard disks 43 of the second tier storage apparatus 12 can be kept powered off except when they are accessed. Furthermore, the capacity of the first tier storage apparatus 11 can be kept to a minimum. The power consumption of the storage system 2 can therefore be reduced. As a result, a high-speed, large-capacity tiered storage system that minimizes performance degradation while achieving low power consumption can be provided for batch-processing applications that require high performance.

≪Second Embodiment≫
Next, a second embodiment of the present invention will be described.
FIG. 15 (a collective name for FIGS. 15A to 15C) shows the staging/destaging procedure of the second embodiment. The procedure shown in FIG. 15 is the same as the procedure shown in FIG. 12 except for the point described below (steps 501 to 525 in FIG. 15 are the same as steps 401 to 425 in FIG. 12, respectively, except for that point).
The difference is that the number of JOBs waiting in the queue, rather than the average waiting time, is used to decide whether to delay JOB execution and when to start staging. This difference is described below.

Accordingly, in step 504, the sum (corresponding to k in FIG. 6) of the number of JOBs waiting in the relevant queue (waiting JOB count) and the number of JOBs from that queue currently being executed (running JOB count) is extracted. In addition, the reciprocal of the average JOB execution frequency μ (see FIG. 6) is taken to calculate the average JOB execution time Te (the mean of the JOB execution times).

In step 505, the threshold number of k (kth), which is used to decide whether to delay JOB execution and when to start staging, is calculated by queuing theory. kth is calculated with a formula derived from queuing theory, using the average JOB execution time obtained in step 504, the threshold time Tth, and the probability (failure probability) that, when the threshold number is kth, staging of the accessed file/volume cannot be finished before JOB execution starts. The upper limit of the failure probability is specified in advance (stored in the memory 94). The threshold number of k (kth) is determined so that the failure probability is equal to or less than that upper limit.
The delay of JOB execution and the start timing of staging are then decided in steps 507 and 509, respectively.
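
This passage does not give the formula used for kth, so the following is only one possible sketch under loudly stated assumptions: JOBs in the queue run one at a time, their execution times are independent and exponentially distributed with mean Te = 1/μ, and new arrivals are ignored. Under those assumptions the time for k JOBs to finish follows an Erlang distribution, and the failure probability is the probability that this time is shorter than Tth. The function names and the `k_limit` guard are hypothetical.

```python
import math


def failure_probability(k: int, te: float, tth: float) -> float:
    """P(k exponential JOBs with mean te finish in less than tth)."""
    x = tth / te
    term = math.exp(-x)        # n = 0 term of the Erlang survival sum
    survival = term
    for n in range(1, k):
        term *= x / n          # e^{-x} x^n / n!
        survival += term
    return 1.0 - survival


def threshold_count(te: float, tth: float, p_max: float,
                    k_limit: int = 10_000) -> int:
    """Smallest k whose failure probability does not exceed p_max."""
    for k in range(1, k_limit + 1):
        if failure_probability(k, te, tth) <= p_max:
            return k
    raise ValueError("no k within k_limit satisfies p_max")
```

With Te equal to Tth, this model yields kth = 3 for an upper limit of 10% and kth = 4 for 5%. A model that also accounts for the average submission interval would give different values.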

Specifically, in step 507, using the JOB queue information table 2: 80 shown in FIG. 8, the sum k of the number of JOBs waiting in the relevant queue and the number of JOBs from that queue currently being executed is compared with the threshold number (kth) (kth can be calculated from Te, Ti, and Tth by queuing theory). If k is larger than kth (No in step 507), the process returns to step 501. If k is equal to or less than kth (Yes in step 507), the process proceeds to step 508.

In step 509, for every JOB in the relevant queue, the sum k of the number of JOBs waiting in the queue and the number of JOBs from that queue currently being executed is compared with the threshold number (kth). The comparison uses, for example, the JOB queue information table 2: 80. If k is larger than kth (No in step 509), the process returns to step 501. If k is equal to or less than kth (Yes in step 509), the process proceeds to step 510. When a JOB satisfies this condition, it means that execution of that JOB may start at least kth-th in the order of execution.

With this processing, even when the sum k of the number of JOBs waiting in the relevant queue and the number of JOBs from that queue currently being executed is larger than the threshold number kth, the processing from step 510 onward is deferred until the sum k becomes approximately equal to kth. This avoids the waste of power that would result from staging far earlier than the JOB is executed and needlessly occupying capacity of the first tier storage apparatus 11 for a long time.

In this embodiment, the probability (failure probability) that staging cannot be finished before JOB execution starts can be kept at or below a value specified in advance by the administrator.

≪Others≫
Each of the embodiments described above is a preferred mode for carrying out the present invention, but the invention is not limited to them, and various modifications can be made without departing from the gist of the present invention.

For example, the system may be configured so that the storage management server 19 also provides the functions of the computer management server 18. Specifically, the storage management server 19 may include functional units such as the JOB management unit 21, the user management unit 22, and the information providing unit 23, and the processor 95 may execute processing to realize the functions of those units.

In addition, the specific configurations of the hardware, the software, the flowcharts, and the like can be changed as appropriate without departing from the spirit of the present invention.

1 Computer system
2 Storage system
11 First tier storage apparatus (first storage device)
12 Second tier storage apparatus (second storage device)
13 File server
14 Computer
15 LAN
16 IP switch
17 FC switch
18 Computer management server
19 Storage management server
21 JOB management unit
22 User management unit
23 Information providing unit
24 Information collection unit
25 Information analysis unit
26 Volume management unit
27 User area management unit
28 Storage management unit
42 High-speed hard disk (first hard disk device)
43 Large-capacity hard disk (second hard disk device)
51 File storage first volume (first volume)
52 File storage second volume (second volume)
61 File storage virtual volume
94 Memory (storage unit)
95 Processor (control unit)
98 Processor (control unit for computer management server)
99 Memory

Claims

A storage system in which
a first storage device including one or more first hard disk devices constituting one or more first volumes,
a second storage device including one or more second hard disk devices constituting one or more second volumes, and
a storage management server that manages the first volume and the second volume, in which files accessed from a computer when jobs sequentially executed on the computer are executed are stored,
are communicably connected, wherein
the storage management server comprises:
a storage unit that stores, for each job, at least the queue state of the job and identification information of the second volume in which a file accessed from the computer when the job is executed is stored; and
a control unit that executes:
control for calculating the average waiting time of the job based on the average submission interval at which jobs are submitted to the queue and the average execution frequency at which jobs submitted to the queue are executed;
control for calculating, as a threshold time, the time required to bring the second hard disk device into an operating state to the extent that input/output processing of the file is possible and to move the file stored in the second volume to the first volume;
control for comparing the average waiting time with the threshold time and, when the average waiting time is equal to or less than the threshold time, delaying execution of the job by at least the threshold time from the time the job was submitted; and
control for, before the job is executed, bringing the second hard disk device constituting the second volume in which the file is stored into an operating state to the extent that input/output processing of the file is possible, moving the file to the first volume, and causing the computer to execute the job with the file stored in the first volume.

The storage system according to claim 1, wherein
the control unit executes control for putting the second hard disk device into a non-operating state after the movement is completed.

The storage system according to claim 1, wherein the control unit executes:
control for, when the average waiting time is larger than the threshold time, bringing the second hard disk device constituting the second volume in which the file is stored into an operating state to the extent that input/output processing of the file is possible and moving the file to the first volume, at the latest by the time the average waiting time becomes approximately equal to the threshold time; and
control for putting the second hard disk device into a non-operating state after the movement is completed.

The storage system according to any one of claims 1 to 3, wherein the control unit executes:
control for, when it is confirmed by referring to the queue state that execution of the job has finished, bringing the second hard disk device constituting the second volume in which the file was originally stored into an operating state to the extent that input/output processing of the file is possible, and moving the file from the first volume to the second volume; and
control for putting the second hard disk device into a non-operating state after the movement is completed.

The storage system according to any one of claims 1 to 3, wherein
a computer management server that manages the jobs sequentially executed on the computer is communicably connected, and
the computer management server comprises a computer management server control unit that executes control for delaying execution of the job until a notification that movement of the file is complete is received from the storage management server.

The storage system according to any one of claims 1 to 4, wherein
all files that are not being accessed from the computer are stored in the second volume, and
the non-operating state of the second hard disk device constituting the second volume is a state in which the power supply of the second hard disk device is turned off, or a state in which the disk of the second hard disk device is stopped or is rotating at a rotation speed equal to or lower than a predetermined rotation speed.

The storage system according to any one of claims 1 to 4, wherein
the computer and the first storage device are communicably connected via a file server,
the first volume and the second volume are mounted on a directory managed by the file server, and
the storage unit stores the directory for each job.

A storage system in which
a first storage device including one or more first hard disk devices constituting one or more first volumes,
a second storage device including one or more second hard disk devices constituting one or more second volumes, and
a storage management server that manages the first volume and the second volume, in which files accessed from a computer when jobs sequentially executed on the computer are executed are stored,
are communicably connected, wherein
the storage management server comprises:
a storage unit that stores, for each job, at least the queue state of the job and identification information of the second volume in which a file accessed from the computer when the job is executed is stored; and
a control unit that executes:
control for referring to the storage unit, calculating the sum of the number of jobs whose queue state is waiting and the number of jobs being executed, and calculating the average execution time of the jobs based on the average execution frequency at which jobs submitted to the queue are executed;
control for calculating, as a threshold time, the time required to bring the second hard disk device into an operating state to the extent that input/output processing of the file is possible and to move the file stored in the second volume to the first volume;
control for determining a threshold number such that a failure probability, which is the probability that the movement cannot be finished and which is calculated based on the average execution time, the average submission interval, and the threshold time, is equal to or less than a value stored in the storage unit;
control for comparing the sum with the threshold number and, when the sum is equal to or less than the threshold number, delaying execution of the job by at least the threshold time from the time the job was submitted; and
control for, before the job is executed, bringing the second hard disk device constituting the second volume in which the file is stored into an operating state to the extent that input/output processing of the file is possible, moving the file to the first volume, and causing the computer to execute the job with the file stored in the first volume.

The storage system according to claim 8, wherein
the control unit executes control for putting the second hard disk device into a non-operating state after the movement is completed.

The storage system according to claim 8, wherein the control unit executes:
control for, when the sum is larger than the threshold number, bringing the second hard disk device constituting the second volume in which the file is stored into an operating state to the extent that input/output processing of the file is possible and moving the file to the first volume, at the latest by the time the sum becomes approximately equal to the threshold number; and
control for putting the second hard disk device into a non-operating state after the movement is completed.

A storage system in which
a first tier storage apparatus including one or more high-speed hard disks constituting one or more file storage first volumes,
a second tier storage apparatus including one or more large-capacity hard disks constituting one or more file storage second volumes, and
a storage management server that manages the file storage first volume and the file storage second volume, in which files accessed from a computer when jobs sequentially executed on the computer are executed are stored,
are communicably connected, wherein
the storage management server comprises:
a memory that stores, for each job, at least the JOB state of the job and identification information of the file storage second volume in which a file accessed from the computer when the job is executed is stored; and
a processor that executes:
control for calculating the average waiting time of the job based on the average submission interval at which jobs are submitted to the queue and the average execution frequency at which jobs submitted to the queue are executed;
control for calculating, as a threshold time, the time required to bring the large-capacity hard disk into an operating state to the extent that input/output processing of the file is possible and to stage the file stored in the file storage second volume to the file storage first volume; and
control for comparing the average waiting time with the threshold time and, based on the comparison result, adjusting the execution timing of the job and the timing at which the large-capacity hard disk constituting the file storage second volume in which the file is stored is put into an operating state or a non-operating state, and causing the computer to execute the job with the file stored in the file storage first volume, and wherein
the processor executes:
control for, when the average waiting time is equal to or less than the threshold time, delaying execution of the job by at least the threshold time from the time the job was submitted;
control for bringing the large-capacity hard disk constituting the file storage second volume in which the file is stored into an operating state to the extent that input/output processing of the file is possible, and staging the file to the file storage first volume;
control for putting the large-capacity hard disk into a non-operating state after the staging is completed;
control for, when the average waiting time is larger than the threshold time, bringing the large-capacity hard disk constituting the file storage second volume in which the file is stored into an operating state to the extent that input/output processing of the file is possible and staging the file to the file storage first volume, at the latest by the time the average waiting time becomes approximately equal to the threshold time;
control for putting the large-capacity hard disk into a non-operating state after the staging is completed;
control for, when it is confirmed by referring to the JOB state that execution of the job has finished, bringing the large-capacity hard disk constituting the file storage second volume in which the file was originally stored into an operating state to the extent that input/output processing of the file is possible, and destaging the file from the file storage first volume to the file storage second volume; and
control for putting the large-capacity hard disk into a non-operating state after the destaging is completed.

A storage management method in a storage system in which the following are communicably connected:
A first storage device including one or more first hard disk devices constituting one or more first volumes;
A second storage device including one or more second hard disk devices constituting one or more second volumes; and
A storage management server that manages the first volume and the second volume, which store files accessed from a computer when the computer executes jobs in sequence,
Wherein a storage unit of the storage management server stores, for each job, at least the queue status of the job and identification information of the second volume storing the file that the computer accesses when the job is executed, and
A control unit of the storage management server executes:
A process of calculating an average waiting time based on the average submission interval at which jobs are submitted to the queue and the average execution frequency at which jobs submitted to the queue are executed;
A process of calculating, as a threshold time, the time required to put the second hard disk device into an operating state in which input/output processing of the file can be performed and to move the file stored in the second volume to the first volume;
A process of comparing the average waiting time with the threshold time and, when the average waiting time is equal to or less than the threshold time, delaying execution of the job by at least the threshold time from the time the job is submitted; and
A process of, before the job is executed, putting the second hard disk device constituting the second volume in which the file is stored into an operating state in which input/output processing of the file can be performed and moving the file to the first volume, so that the job is executed on the computer with the file stored in the first volume.
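
The comparison of the average waiting time with the threshold time can be illustrated with a minimal sketch. The claim names the inputs (average submission interval, average execution frequency, time to bring the disk into operation and move the file) but gives no formulas, so the sketch makes two assumptions that are not taken from the claim: the average waiting time follows an M/M/1 queueing model, and the threshold time is a spin-up time plus file size divided by a transfer rate. All function and parameter names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class StagingPlan:
    delay_job_seconds: float  # how long to hold the job after it is submitted
    stage_immediately: bool   # True if the move to the first volume must start at submission


def mm1_average_wait(avg_submission_interval: float,
                     avg_execution_frequency: float) -> float:
    """Mean time a job waits in the queue, ASSUMING an M/M/1 model.

    The M/M/1 model is an assumption of this sketch; the claim only says the
    waiting time is derived from these two quantities.
    """
    arrival_rate = 1.0 / avg_submission_interval   # jobs submitted per second
    service_rate = avg_execution_frequency         # jobs executed per second
    if arrival_rate >= service_rate:
        return float("inf")  # queue grows without bound under this model
    return arrival_rate / (service_rate * (service_rate - arrival_rate))


def threshold_time(spin_up_seconds: float, file_bytes: int,
                   transfer_bytes_per_second: float) -> float:
    """Time to make the second disk usable and move the file (assumed model)."""
    return spin_up_seconds + file_bytes / transfer_bytes_per_second


def plan_staging(average_wait: float, threshold: float) -> StagingPlan:
    """Apply the comparison described in the claim."""
    if average_wait <= threshold:
        # Queue wait is too short to hide the move: hold the job for at least
        # the threshold time and start moving the file right away.
        return StagingPlan(delay_job_seconds=threshold, stage_immediately=True)
    # Queue wait already covers the move: no extra delay, and the move may
    # start later, as long as it finishes before the job runs.
    return StagingPlan(delay_job_seconds=0.0, stage_immediately=False)
```

For example, with a job submitted every 10 seconds on average and 0.2 jobs executed per second, the assumed model gives an average wait of about 5 seconds; if the move takes 40 seconds, `plan_staging` holds the job for 40 seconds so that the file is on the first volume before execution starts.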

The storage management method according to claim 12, wherein the control unit executes a process of putting the second hard disk device into a non-operating state after the movement is completed.

The storage management method according to claim 12, wherein the control unit executes:
When the average waiting time is larger than the threshold time, a process of putting the second hard disk device constituting the second volume in which the file is stored into an operating state in which input/output processing of the file can be performed, at the latest by the time the average waiting time becomes substantially equal to the threshold time, and moving the file to the first volume; and
After the movement is completed, a process of putting the second hard disk device into a non-operating state.

The storage management method according to any one of claims 12 to 14, wherein the control unit executes:
Upon confirming, by referring to the queue status, that execution of the job has been completed, a process of putting the second hard disk device constituting the second volume in which the file was originally stored into an operating state in which input/output processing of the file can be performed, and moving the file from the first volume to the second volume; and
After the movement is completed, a process of putting the second hard disk device into a non-operating state.

The storage management method according to any one of claims 12 to 14, wherein a computer management server that manages the jobs executed in sequence on the computer is further communicably connected, and
A control unit of the computer management server executes a process of delaying execution of the job until a notification that the movement of the file has been completed is received from the storage management server.

The storage management method according to any one of claims 12 to 15, wherein all files that are not accessed from the computer are stored in the second volume, and
The non-operating state of the second hard disk device constituting the second volume is a state in which the power supply of the second hard disk device is turned off, a state in which the disk of the second hard disk device has stopped rotating, or a state in which the disk is rotating at a rotation speed equal to or lower than a predetermined rotation speed.

A storage management method in a storage system in which the following are communicably connected:
A first storage device including one or more first hard disk devices constituting one or more first volumes;
A second storage device including one or more second hard disk devices constituting one or more second volumes; and
A storage management server that manages the first volume and the second volume, which store files accessed from a computer when the computer executes jobs in sequence,
Wherein a storage unit of the storage management server stores, for each job, at least the queue status of the job and identification information of the second volume storing the file that the computer accesses when the job is executed, and
A control unit of the storage management server executes:
A process of referring to the storage unit to calculate the sum of the number of jobs whose queue status is waiting and the number of jobs being executed, and calculating an average execution time of a job based on the average execution frequency at which jobs submitted to the queue are executed;
A process of calculating, as a threshold time, the time required to put the second hard disk device into an operating state in which input/output processing of the file can be performed and to move the file stored in the second volume to the first volume;
A process of determining a threshold number such that a failure probability, which is the probability that the movement cannot be finished and which is calculated based on the average execution time, the average submission interval, and the threshold time, is equal to or less than a value stored in the storage unit;
A process of comparing the sum with the threshold number and, when the sum is equal to or less than the threshold number, delaying execution of the job by at least the threshold time from the time the job is submitted; and
A process of, before the job is executed, putting the second hard disk device constituting the second volume in which the file is stored into an operating state in which input/output processing of the file can be performed and moving the file to the first volume, so that the job is executed on the computer with the file stored in the first volume.
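
The job-count comparison in this claim can be sketched as follows. The sketch takes the threshold number as an input, because the claim does not give the formula for the failure probability from which it is determined. The status strings `"waiting"` and `"running"` and all function names are hypothetical, chosen only for illustration.

```python
from typing import Iterable


def count_pending_jobs(queue_statuses: Iterable[str]) -> int:
    """Sum of jobs waiting in the queue and jobs currently being executed.

    The status labels are assumptions of this sketch, not values from the claim.
    """
    return sum(1 for status in queue_statuses if status in ("waiting", "running"))


def should_delay_job(queue_statuses: Iterable[str], threshold_number: int) -> bool:
    """True when the new job must be held for at least the threshold time.

    A small backlog means the job could start before the file has been moved
    to the first volume, so its execution is delayed.
    """
    return count_pending_jobs(queue_statuses) <= threshold_number
```

With a queue holding two waiting jobs, one running job and one finished job, the sum is 3; a threshold number of 4 would delay the new job, while a threshold number of 2 would not.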

The storage management method according to claim 18, wherein the control unit executes a process of putting the second hard disk device into a non-operating state after the movement is completed.

The storage management method according to claim 18, wherein the control unit executes:
When the sum is larger than the threshold number, a process of putting the second hard disk device constituting the second volume in which the file is stored into an operating state in which input/output processing of the file can be performed, at the latest by the time the sum becomes substantially equal to the threshold number, and moving the file to the first volume; and
After the movement is completed, a process of putting the second hard disk device into a non-operating state.