CN107632907B - BMC chip hosting system and control method thereof - Google Patents
- Publication number
- CN107632907B (application CN201710667575.2A)
- Authority
- CN
- China
- Prior art keywords
- bmc
- chip
- hosting
- fan
- power supply
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Landscapes
- Control Of Temperature (AREA)
- Debugging And Monitoring (AREA)
Abstract
The invention provides a BMC (baseboard management controller) chip hosting system and a control method thereof. The system comprises a BMC hosting chip and a server mainboard; the BMC hosting chip is connected with a BMC chip, a temperature sensor, a fan and a power supply, and the temperature sensor, the fan and the power supply are arranged on the server mainboard. The BMC hosting chip is connected with the BMC chip through a control bus, with the power supply through a power supply control line, and with the fan through a fan control signal line. The BMC hosting chip is used for determining whether the BMC chip has failed; when the BMC chip fails, the BMC hosting chip generates a fan control parameter and a power supply control parameter according to the temperature detection information received from the temperature sensor, sends the fan control parameter to the fan to control the rotating speed of the fan, and sends the power supply control parameter to the power supply to control shutdown of the power supply.
Description
Technical Field
The invention belongs to the technical field of chips special for servers, and particularly relates to a BMC (baseboard management controller) chip hosting system and a control method thereof.
Background
BMC is an abbreviation of Baseboard Management Controller.
With the rapid development of server performance, the chips used in servers run at ever higher frequencies, while clients demand ever higher server stability. In the current server structure, a BMC chip is mostly used to manage the server mainboard; the BMC chip can monitor the temperature and control the rotating speed of the fan, so that the mainboard is better maintained. In practical applications, the BMC chip may be subjected to various disturbances, such as electromagnetic radiation, which may cause the BMC chip to fail. When the BMC chip fails, the fan goes out of control while the many special-purpose chips integrated in the server generate a large amount of heat; if the heat is not dissipated in time, the whole server is at risk of working abnormally. If the state before the BMC chip failed is not stored, the opportunity to analyze the cause of the failure is lost. Moreover, once the BMC chip has failed, alarm information cannot be sent to a control room.
In current prior-art server fan management designs, the more advanced approach is to control the fan in a fixed mode once the BMC chip has failed. Inside the server, the BMC chip serves as the fan speed controller and, in its normal operating state, outputs a periodic pulse signal as a watchdog feed signal. A single-chip microcomputer is connected with the BMC chip and serves as the monitoring unit for the operating state of the BMC chip: by detecting loss of the watchdog feed signal, it monitors whether the BMC chip is working normally, and after the BMC chip fails it regulates the fan speed to guarantee normal heat dissipation of the system. In this scheme, a two-to-one selector switches between the post-failure fan control signal and the normal control signal; the selector is connected to a high-level signal that serves as the fan control signal after the BMC chip fails, so that the fan rotates at full speed. The fixed mode cannot control the power consumption of the fan well: running the fan at full speed wastes resources, the working state of the server still cannot be detected even with the fan at full speed, and the server cannot be shut down if the power consumption of the system rises sharply. In addition, the remote management system cannot be informed that the BMC chip has a problem so that system maintenance can be carried out.
Therefore, it is very necessary to provide a BMC hosting chip and a control method thereof to overcome the above-mentioned drawbacks of the prior art.
Disclosure of Invention
In view of the defects that the fan goes out of control after a BMC chip fails, that the cause of the BMC chip failure cannot be analyzed, and that a service alarm cannot be raised, the invention aims to provide a BMC chip hosting system and a control method thereof to solve these technical problems.
In order to achieve the purpose, the invention provides the following technical scheme:
a BMC chip hosting system comprises a BMC hosting chip and a server mainboard, wherein the BMC hosting chip is connected with a BMC chip, a temperature sensor, a fan and a power supply; the temperature sensor, the fan and the power supply are arranged on the server mainboard;
the BMC hosting chip is connected with the BMC chip through a control bus, the BMC hosting chip is connected with a power supply through a power supply control line, and the BMC hosting chip is connected with the fan through a fan control signal line;
the BMC hosting chip is used for determining whether the BMC chip has failed; when the BMC chip fails, the BMC hosting chip generates a fan control parameter and a power supply control parameter according to the temperature detection information received from the temperature sensor, sends the fan control parameter to the fan to control the rotating speed of the fan, and sends the power supply control parameter to the power supply to control shutdown of the power supply.
Further, the BMC hosting chip is connected with the BMC chip through a network cable, and the BMC hosting chip is also connected with a remote management unit;
when the BMC chip fails, the BMC hosting chip is further in network communication with the remote management unit;
when the BMC chip works normally, the BMC hosting chip forwards the fan control parameter sent by the BMC chip to the fan, and forwards the power control parameter sent by the BMC chip to the power supply; the BMC hosting chip forwards the temperature detection information sent by the temperature sensor to the BMC chip; the BMC hosting chip also forwards the first network communication data sent by the BMC chip to the remote management unit, or forwards the second network communication data sent by the remote management unit to the BMC chip.
Further, the BMC hosting chip comprises a logic control module, and the logic control module is connected with a BMC failure detection module, a BMC running information storage module, a first network interface and a second network interface;
the logic control module is also connected with a power supply, a fan and a temperature sensor;
the BMC failure detection module and the BMC running information storage module are also connected with the BMC chip through a control bus;
the BMC hosting chip is in communication connection with the BMC chip through the first network interface and is in communication connection with the remote management unit through the second network interface;
when the BMC chip fails, the BMC hosting chip sends third network communication data to the remote management unit through the second network interface, or receives fourth network communication data from the remote management unit through the second network interface; the third network communication data comprises temperature detection information sent by the temperature sensor and received by the BMC hosting chip, a fan control parameter sent by the BMC hosting chip to the fan and a power supply control parameter sent by the BMC hosting chip to the power supply; the fourth network communication data comprises fan control parameters and power supply control parameters which are sent to the BMC hosting chip by the remote management unit;
when the BMC chip works normally, the BMC hosting chip receives first network communication data from the BMC chip through a first network interface and sends the first network communication data to the remote management unit through a second network interface, or receives second network communication data from the remote management unit through the second network interface and sends the second network communication data to the BMC chip through the first network interface; the first network communication data comprises temperature detection information which is received by the BMC chip and sent by the temperature sensor, fan control parameters which are sent to the fan by the BMC chip and power supply control parameters which are sent to the power supply by the BMC chip; the second network communication data comprises a fan control parameter and a power supply control parameter which are sent to the BMC chip by the remote management unit;
the BMC running information storage module is used for storing control parameters sent by the BMC chip to the power supply and the fan and state information of the BMC chip;
the BMC failure detection module is used for detecting whether the BMC chip is in a failure state;
the logic control module is used for judging whether the BMC hosting chip is in a state of forwarding BMC chip data to the fan and the power supply or in a state of taking over the function of the fan and the power supply of the BMC chip according to whether the BMC chip fails or not.
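The state selection performed by the logic control module can be illustrated with a minimal sketch. The names `Mode`, `ControlParams`, `select_mode` and `output_params`, and the encoding of the control parameters as a duty percentage and an on/off flag, are assumptions made for illustration only; the source does not specify them.

```python
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    FORWARD = "forward"    # BMC chip healthy: pass its parameters through
    TAKEOVER = "takeover"  # BMC chip failed: hosting chip drives fan and power


@dataclass
class ControlParams:
    fan_duty_percent: int  # hypothetical encoding of the fan control parameter
    power_on: bool         # hypothetical encoding of the power control parameter


def select_mode(bmc_failed: bool) -> Mode:
    """Choose the hosting chip's state from the failure detector's verdict."""
    return Mode.TAKEOVER if bmc_failed else Mode.FORWARD


def output_params(bmc_failed: bool,
                  from_bmc: ControlParams,
                  from_hosting: ControlParams) -> ControlParams:
    """Return the parameters that actually reach the fan and the power supply."""
    if select_mode(bmc_failed) is Mode.FORWARD:
        return from_bmc      # forwarding state: BMC chip data is relayed as-is
    return from_hosting      # takeover state: hosting chip's own parameters


if __name__ == "__main__":
    bmc = ControlParams(fan_duty_percent=40, power_on=True)
    hosting = ControlParams(fan_duty_percent=70, power_on=True)
    print(output_params(False, bmc, hosting))
    print(output_params(True, bmc, hosting))
```

In this sketch the failure verdict is the only input to the selection, which mirrors the description above: the hosting chip either relays the BMC chip's data or substitutes its own.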
Furthermore, the control bus comprises a BMC interface state line, a power control bus, a temperature detection bus and a fan control bus.
Furthermore, the fan control signal line adopts a PWM fan control signal line.
The invention also provides a control method based on the BMC hosting chip system, which comprises the following steps:
judging whether the BMC chip fails or not;
when the BMC chip is determined to have failed, the BMC hosting chip receives temperature detection information sent by the temperature sensor; generating a fan control parameter and a power supply control parameter according to the temperature detection information, and sending the fan control parameter to the fan; and sending the power supply control parameter to the power supply.
Further, the method also comprises:
when the BMC chip is judged to be invalid, the BMC hosting chip is further in network communication with the remote management unit, and the BMC hosting chip sends third network communication data to the remote management unit or receives fourth network communication data from the remote management unit; the third network communication data comprises temperature detection information sent by the temperature sensor and received by the BMC hosting chip, a fan control parameter sent by the BMC hosting chip to the fan and a power supply control parameter sent by the BMC hosting chip to the power supply; the fourth network communication data comprises fan control parameters and power control parameters which are sent to the BMC hosting chip by the remote management unit.
Further, the method also comprises:
when the BMC chip works normally, the BMC hosting chip forwards the fan control parameter sent by the BMC chip to the fan, and forwards the power control parameter sent by the BMC chip to the power supply; the BMC hosting chip forwards the temperature detection information sent by the temperature sensor to the BMC chip; the BMC hosting chip also forwards the first network communication data sent by the BMC chip to the remote management unit, or forwards the second network communication data sent by the remote management unit to the BMC chip; the first network communication data comprises temperature detection information which is received by the BMC chip and sent by the temperature sensor, fan control parameters which are sent to the fan by the BMC chip and power supply control parameters which are sent to the power supply by the BMC chip; the second network communication data comprises a fan control parameter and a power control parameter which are sent to the BMC chip by the remote management unit.
Further, according to the temperature detection information, generating a fan control parameter and a power supply control parameter, comprising:
when the temperature corresponding to the temperature detection information belongs to a preset temperature range, the BMC hosting chip keeps the fan control parameter and the power supply control parameter that were in effect before the BMC chip failed unchanged;
when the temperature corresponding to the temperature detection information does not belong to a preset temperature range, calculating a temperature change rate;
when the temperature change rate is larger than a first threshold value, generating a fan control parameter for increasing the rotating speed of the fan;
and when the temperature change rate is greater than a second threshold value, generating a power control parameter for closing the power supply of the server, and simultaneously sending an alarm to the remote management unit by the BMC hosting chip.
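The parameter generation steps above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the numeric range and thresholds, the `fan_step` increment, the `Decision` type, and the assumption that the second threshold is larger than the first are all hypothetical. The source is silent on the case where the temperature is out of range but the change rate is below the first threshold; this sketch keeps the previous parameters in that case.

```python
from dataclasses import dataclass


@dataclass
class Decision:
    fan_duty_percent: int  # hypothetical fan control parameter
    power_on: bool         # hypothetical power supply control parameter
    alarm: bool            # whether an alarm goes to the remote management unit


def temperature_change_rate(prev_c: float, curr_c: float, dt_s: float) -> float:
    """Rate of change in degrees Celsius per second between two samples."""
    return (curr_c - prev_c) / dt_s


def decide(prev_c: float, curr_c: float, dt_s: float, last_fan_duty: int, *,
           temp_range=(10.0, 70.0),   # assumed preset temperature range
           first_threshold=0.5,       # assumed, degrees C per second
           second_threshold=2.0,      # assumed, larger than first_threshold
           fan_step=20) -> Decision:  # assumed duty increment
    low, high = temp_range
    if low <= curr_c <= high:
        # In range: keep the parameters in effect before the BMC chip failed.
        return Decision(last_fan_duty, True, False)
    rate = temperature_change_rate(prev_c, curr_c, dt_s)
    faster = min(100, last_fan_duty + fan_step)
    if rate > second_threshold:
        # Also exceeds the first threshold, so the fan is sped up as well.
        return Decision(faster, False, True)
    if rate > first_threshold:
        return Decision(faster, True, False)
    # Not covered by the source: assumed to keep the previous parameters.
    return Decision(last_fan_duty, True, False)


if __name__ == "__main__":
    print(decide(50.0, 55.0, 1.0, 40))
    print(decide(70.0, 71.0, 1.0, 40))
    print(decide(70.0, 75.0, 1.0, 40))
```

Checking the second threshold before the first matters here because any rate above the second threshold also exceeds the first.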
Further, when the BMC chip is judged to work normally, the BMC hosting chip also records the state information of each interface of the BMC chip sent by the BMC chip; each interface state information of the BMC comprises processor internal register information of the BMC and each peripheral state information;
when the BMC chip is judged to be invalid, the BMC hosting chip also sends the stored BMC chip state information to the remote management unit through the second network interface; the failure reason of the BMC chip can be analyzed by recording and sending the state information before the failure of the BMC chip.
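The record-then-send behaviour can be sketched with a small bounded store. The class name `RunInfoStore`, the fixed capacity, and the dictionary layout of a snapshot are assumptions for illustration; the source only states that register information and peripheral state information are recorded while the BMC chip is normal and sent after it fails.

```python
from collections import deque


class RunInfoStore:
    """Hypothetical model of the BMC running information storage module."""

    def __init__(self, capacity: int = 4):
        # A bounded deque drops the oldest snapshot once capacity is reached.
        self._snapshots = deque(maxlen=capacity)

    def record(self, registers: dict, peripherals: dict) -> None:
        """Called while the BMC chip works normally."""
        self._snapshots.append(
            {"registers": dict(registers), "peripherals": dict(peripherals)}
        )

    def dump_for_remote(self) -> list:
        """Called after failure: snapshots to send, oldest first."""
        return list(self._snapshots)


if __name__ == "__main__":
    store = RunInfoStore(capacity=2)
    store.record({"r0": 1}, {"fan": "ok"})
    store.record({"r0": 2}, {"fan": "ok"})
    store.record({"r0": 3}, {"fan": "stalled"})
    print(store.dump_for_remote())
```

Keeping only the most recent snapshots is one possible design choice; it bounds the storage needed while still preserving the state closest to the moment of failure.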
Further, the determining whether the BMC chip fails includes:
receiving a check pulse sent by a BMC chip, wherein the check pulse comprises a plurality of pulse signals, and the widths of the plurality of pulse signals are different;
when the width of a pulse signal differs from that of the corresponding standard signal, determining that pulse signal to be an abnormal pulse signal, and counting the number of abnormal pulse signals;
and when the number of abnormal pulse signals is greater than a threshold value, determining that the BMC chip has failed.
Furthermore, the check pulses are divided into a plurality of groups, each group comprising a specified number of pulse signals, and when the number of abnormal pulse signals detected in any one group is greater than the threshold value, the BMC chip is determined to have failed.
Further, the alarm comprises an SNMP alarm.
The invention has the following beneficial effects: the BMC hosting chip helps keep the server running during operation. If a BMC chip fails for an unknown reason, it loses control of the cooling fan, the server temperature becomes too high and the alarm function fails, so that an administrator cannot clear the fault in time and safety accidents may follow. To prevent this, a BMC hosting chip is introduced. When the BMC chip is normal, the BMC hosting chip plays a forwarding role; when the BMC chip fails, the BMC hosting chip plays a hosting role, and at the same time saves and sends the state before the BMC chip failed, preserving the opportunity to analyze the cause of the failure. The fan and the power supply continue to operate in the state before the BMC chip failed, which avoids both the safety accidents caused by an uncontrolled fan and the resource waste caused by running the fan at full speed.
In addition, the invention has reliable design principle, simple structure and very wide application prospect.
Therefore, compared with the prior art, the invention has prominent substantive features and remarkable progress, and the beneficial effects of the implementation are also obvious.
Drawings
FIG. 1 is a block diagram of the system of the present invention;
FIG. 2 is a block diagram of a BMC hosting chip of the present invention;
FIG. 3 is a flow chart of a method of the present invention;
wherein: 1-BMC hosting chip; 1.1-logic control module; 1.2-BMC failure detection module; 1.3-BMC running information storage module; 1.4-first network interface; 1.5-second network interface; 2-BMC chip; 3-power supply; 4-fan; 5-temperature sensor; 6-remote management unit.
Detailed Description
in order to make the objects, features and advantages of the present invention more obvious and understandable, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention.
As shown in fig. 1, the present invention provides a BMC chip hosting system, which includes a BMC hosting chip 1 and a server mainboard, wherein the BMC hosting chip 1 is connected to a BMC chip 2, a temperature sensor 5, a fan 4 and a power supply 3; the temperature sensor 5, the fan 4 and the power supply 3 are arranged on the server mainboard;
the BMC hosting chip 1 is connected with the BMC chip 2 through a control bus, the BMC hosting chip 1 is connected with the power supply 3 through a power supply control line, and the BMC hosting chip 1 is connected with the fan 4 through a fan control signal line; the control bus comprises a BMC interface state line, a power supply control bus, a temperature detection bus and a fan control bus; the fan control signal line adopts a PWM fan control signal line;
the BMC hosting chip 1 is used for determining whether the BMC chip 2 has failed; when the BMC chip 2 fails, the BMC hosting chip 1 generates a fan control parameter and a power supply control parameter according to the temperature detection information received from the temperature sensor 5, sends the fan control parameter to the fan 4 to control the rotating speed of the fan, and sends the power supply control parameter to the power supply 3 to control shutdown of the power supply;
the BMC hosting chip 1 is connected with the BMC chip 2 through a network cable, and the BMC hosting chip 1 is connected with a remote management unit 6;
when the BMC chip 2 fails, the BMC hosting chip 1 is further in network communication with the remote management unit 6;
when the BMC chip 2 works normally, the BMC hosting chip 1 forwards the fan control parameter sent by the BMC chip 2 to the fan 4, and forwards the power control parameter sent by the BMC chip 2 to the power supply 3; the BMC hosting chip 1 forwards the temperature detection information sent by the temperature sensor 5 to the BMC chip 2; the BMC hosting chip 1 also forwards the first network communication data sent by the BMC chip 2 to the remote management unit 6, or forwards the second network communication data sent by the remote management unit 6 to the BMC chip 2.
As shown in fig. 2, the BMC hosting chip 1 includes a logic control module 1.1, and the logic control module 1.1 is connected to a BMC failure detection module 1.2, a BMC operation information storage module 1.3, a first network interface 1.4, and a second network interface 1.5;
the logic control module 1.1 is also connected with a power supply 3, a fan 4 and a temperature sensor 5;
the BMC failure detection module 1.2 and the BMC running information storage module 1.3 are also connected with the BMC chip 2 through a control bus;
the BMC hosting chip 1 is in communication connection with the BMC chip 2 through the first network interface 1.4, and is in communication connection with the remote management unit 6 through the second network interface 1.5;
when the BMC chip 2 fails, the BMC hosting chip 1 is configured to send third network communication data to the remote management unit 6, or receive fourth network communication data from the remote management unit 6; the third network communication data comprises temperature detection information received by the BMC hosting chip 1 and sent by the temperature sensor 5, fan control parameters sent by the BMC hosting chip 1 to the fan 4 and power supply control parameters sent by the BMC hosting chip 1 to the power supply 3; the fourth network communication data comprises fan control parameters and power supply control parameters which are sent to the BMC hosting chip 1 by the remote management unit 6;
when the BMC chip 2 normally works, the BMC hosting chip 1 is configured to receive first network communication data from the BMC chip 2 and send the first network communication data to the remote management unit 6, or receive second network communication data from the remote management unit 6 and send the second network communication data to the BMC chip 2; the first network communication data comprises temperature detection information which is received by the BMC chip 2 and sent by the temperature sensor 5, fan control parameters which are sent to the fan 4 by the BMC chip 2 and power supply control parameters which are sent to the power supply 3 by the BMC chip 2; the second network communication data comprises a fan control parameter and a power supply control parameter which are sent to the BMC chip 2 by the remote management unit 6;
the BMC running information storage module 1.3 is used for storing control parameters sent by the BMC chip 2 to the power supply 3 and the fan 4 and state information of the BMC chip 2;
the BMC failure detection module 1.2 is used for detecting whether the BMC chip 2 is in a failure state;
the logic control module 1.1 is used for judging whether the BMC hosting chip 1 is in a state of forwarding the BMC chip 2 data to a fan and a power supply or in a state of taking over the fan and power supply functions of the BMC chip 2 according to whether the BMC chip 2 fails.
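The routing of network communication data between the two network interfaces can be sketched as below. The constants `FIRST_IF`, `SECOND_IF` and `LOCAL`, the function names, and the dictionary keys are hypothetical. The source does not say what happens to data arriving on the first network interface after the BMC chip has failed; this sketch assumes it is discarded.

```python
from typing import Optional

FIRST_IF = "first"    # first network interface 1.4, link to the BMC chip
SECOND_IF = "second"  # second network interface 1.5, link to the remote unit
LOCAL = "local"       # data consumed by the hosting chip itself


def route(bmc_failed: bool, arrived_on: str) -> Optional[str]:
    """Return where data that arrived on one interface should go next."""
    if arrived_on not in (FIRST_IF, SECOND_IF):
        raise ValueError(f"unknown interface: {arrived_on!r}")
    if not bmc_failed:
        # Normal operation: relay first/second network communication data.
        return SECOND_IF if arrived_on == FIRST_IF else FIRST_IF
    if arrived_on == SECOND_IF:
        # Fourth network communication data is handled by the hosting chip.
        return LOCAL
    return None  # assumption: traffic from a failed BMC chip is dropped


def build_third_data(temperature_c: float, fan_param: int,
                     power_param: bool) -> dict:
    """Assemble the third network communication data sent after failure."""
    return {
        "temperature": temperature_c,
        "fan_control": fan_param,
        "power_control": power_param,
    }


if __name__ == "__main__":
    print(route(False, FIRST_IF))
    print(route(True, SECOND_IF))
    print(build_third_data(65.0, 80, True))
```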
As shown in fig. 3, the present invention further provides a method for controlling a BMC chip hosting system, including the following steps:
judging whether the BMC chip fails or not;
when the BMC chip is judged to be invalid, the BMC hosting chip receives temperature detection information sent by the temperature sensor; generating a fan control parameter and a power supply control parameter according to the temperature detection information, sending the fan control parameter to a fan, and sending the power supply control parameter to a power supply; the BMC hosting chip is also in network communication with the remote management unit, and sends third network communication data to the remote management unit or receives fourth network communication data from the remote management unit; the third network communication data comprises temperature detection information sent by the temperature sensor and received by the BMC hosting chip, a fan control parameter sent by the BMC hosting chip to the fan and a power supply control parameter sent by the BMC hosting chip to the power supply; the fourth network communication data comprises fan control parameters and power supply control parameters which are sent to the BMC hosting chip by the remote management unit; the BMC hosting chip also sends the stored BMC chip state information to the remote management unit through a second network interface; the failure reason of the BMC chip can be analyzed by recording and sending the state information before the failure of the BMC chip;
when the BMC chip works normally, the BMC hosting chip forwards the fan control parameter sent by the BMC chip to the fan, and forwards the power control parameter sent by the BMC chip to the power supply; the BMC hosting chip forwards the temperature detection information sent by the temperature sensor to the BMC chip; the BMC hosting chip also forwards the first network communication data sent by the BMC chip to the remote management unit, or forwards the second network communication data sent by the remote management unit to the BMC chip; the first network communication data comprises temperature detection information which is received by the BMC chip and sent by the temperature sensor, fan control parameters which are sent to the fan by the BMC chip and power supply control parameters which are sent to the power supply by the BMC chip; the second network communication data comprises a fan control parameter and a power supply control parameter which are sent to the BMC chip by the remote management unit; the BMC hosting chip also records the state information of each interface of the BMC chip sent by the BMC chip; each interface state information of the BMC comprises processor internal register information of the BMC and each peripheral state information;
the generating of the fan control parameter and the power supply control parameter according to the temperature detection information includes:
when the temperature corresponding to the temperature detection information belongs to a preset temperature range, the BMC hosting chip keeps the fan control parameter and the power supply control parameter that were in effect before the BMC chip failed unchanged;
when the temperature corresponding to the temperature detection information does not belong to a preset temperature range, calculating a temperature change rate;
when the temperature change rate is larger than a first threshold value, generating a fan control parameter for increasing the rotating speed of the fan;
when the temperature change rate is larger than a second threshold value, generating a power supply control parameter for closing the power supply of the server, and simultaneously sending an alarm to a remote management unit by the BMC hosting chip;
the step of determining whether the BMC chip has failed comprises the following steps: receiving a check pulse sent by the BMC chip, wherein the check pulse comprises a plurality of pulse signals whose widths differ from one another; when the width of a pulse signal differs from that of the corresponding standard signal, determining that pulse signal to be an abnormal pulse signal, and counting the number of abnormal pulse signals; when the number of abnormal pulse signals is greater than a threshold value, determining that the BMC chip has failed;
the check pulses are divided into a plurality of groups, each group comprising a predetermined number of pulse signals, and when the number of abnormal pulse signals detected in any one group is greater than the threshold value, the BMC chip is determined to have failed;
taking as an example a specified number of 5 pulse signals per group and a count threshold of 2, the 5 pulse signals are designated as a first, second, third, fourth and fifth pulse signal, with widths of 1, 2, 3, 4 and 5 clock cycles respectively; the five pulse signals are sent in order of their sequence numbers, and if abnormal pulse signals occur more than 2 times within a group, for example 3 times, the BMC chip is determined to have failed.
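The example above can be sketched as a short check over measured pulse widths. The expected widths (1 to 5 clock cycles) and the threshold of 2 come from the example; the function names and the treatment of missing pulses as abnormal are assumptions made for illustration.

```python
EXPECTED_WIDTHS = (1, 2, 3, 4, 5)  # widths in clock cycles, from the example
ABNORMAL_THRESHOLD = 2             # count threshold, from the example


def count_abnormal(measured, expected=EXPECTED_WIDTHS) -> int:
    """Count pulses in one group whose width differs from the standard."""
    mismatches = sum(1 for m, e in zip(measured, expected) if m != e)
    # Assumption: pulses that never arrived also count as abnormal.
    missing = max(0, len(expected) - len(measured))
    return mismatches + missing


def bmc_failed(groups, expected=EXPECTED_WIDTHS,
               threshold=ABNORMAL_THRESHOLD) -> bool:
    """Failure is declared when any one group exceeds the threshold."""
    return any(count_abnormal(g, expected) > threshold for g in groups)


if __name__ == "__main__":
    healthy = [(1, 2, 3, 4, 5), (1, 2, 3, 4, 5)]
    borderline = [(1, 2, 9, 9, 5)]             # 2 abnormal: not above threshold
    faulty = [(1, 2, 3, 4, 5), (1, 1, 1, 1, 5)]  # second group has 3 abnormal
    print(bmc_failed(healthy), bmc_failed(borderline), bmc_failed(faulty))
```

Note that the comparison is strictly greater than the threshold, so a group with exactly 2 abnormal pulses does not trigger a failure verdict, while 3 does.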
The embodiments of the present invention are illustrative rather than restrictive, and the above-mentioned embodiments are only provided to help understanding of the present invention, so that the present invention is not limited to the embodiments described in the detailed description, and other embodiments derived from the technical solutions of the present invention by those skilled in the art also belong to the protection scope of the present invention.
Claims (8)
1. A BMC chip hosting system is characterized by comprising a BMC hosting chip and a server mainboard, wherein the BMC hosting chip is connected with a BMC chip, a temperature sensor, a fan and a power supply;
the temperature sensor, the fan and the power supply are arranged on the server mainboard;
the BMC hosting chip is connected with the BMC chip through a control bus, the BMC hosting chip is connected with a power supply through a power supply control line, and the BMC hosting chip is connected with the fan through a fan control signal line;
the BMC hosting chip is used for determining whether the BMC chip has failed; when the BMC chip fails, the BMC hosting chip generates a fan control parameter and a power supply control parameter according to the temperature detection information received from the temperature sensor, sends the fan control parameter to the fan to control the rotating speed of the fan, and sends the power supply control parameter to the power supply to control shutdown of the power supply;
the BMC hosting chip is also connected with a remote management unit;
when the BMC chip fails, the BMC hosting chip is further in network communication with the remote management unit;
when the BMC chip works normally, the BMC hosting chip forwards the fan control parameter sent by the BMC chip to the fan, and forwards the power control parameter sent by the BMC chip to the power supply; the BMC hosting chip forwards the temperature detection information sent by the temperature sensor to the BMC chip; the BMC hosting chip also forwards the first network communication data sent by the BMC chip to the remote management unit, or forwards the second network communication data sent by the remote management unit to the BMC chip.
2. The BMC chip hosting system of claim 1, wherein the BMC chip hosting system comprises a logic control module, and the logic control module is connected to a BMC failure detection module, a BMC operation information storage module, a first network interface, and a second network interface;
the logic control module is also connected with a power supply, a fan and a temperature sensor;
the BMC failure detection module and the BMC running information storage module are also connected with the BMC chip through a control bus;
the BMC hosting chip is in communication connection with the BMC chip through the first network interface and is in communication connection with the remote management unit through the second network interface;
when the BMC chip fails, the BMC hosting chip sends third network communication data to the remote management unit through the second network interface, or receives fourth network communication data from the remote management unit through the second network interface; the third network communication data comprises temperature detection information sent by the temperature sensor and received by the BMC hosting chip, a fan control parameter sent by the BMC hosting chip to the fan and a power supply control parameter sent by the BMC hosting chip to the power supply; the fourth network communication data comprises fan control parameters and power supply control parameters which are sent to the BMC hosting chip by the remote management unit;
when the BMC chip works normally, the BMC hosting chip receives first network communication data from the BMC chip through a first network interface and sends the first network communication data to the remote management unit through a second network interface, or receives second network communication data from the remote management unit through a second network interface and sends the second network communication data to the BMC chip through the first network interface; the first network communication data comprises temperature detection information which is received by the BMC chip and sent by the temperature sensor, fan control parameters which are sent to the fan by the BMC chip and power supply control parameters which are sent to the power supply by the BMC chip; the second network communication data comprises a fan control parameter and a power control parameter which are sent to the BMC chip by the remote management unit.
3. A method for controlling a BMC chip hosting system according to claim 1 or 2, comprising the steps of:
judging whether the BMC chip has failed;
when the BMC chip is judged to have failed, the BMC hosting chip receives temperature detection information sent by the temperature sensor, generates a fan control parameter and a power supply control parameter according to the temperature detection information, sends the fan control parameter to the fan, and sends the power supply control parameter to the power supply;
when the BMC chip is judged to have failed, the BMC hosting chip further communicates with the remote management unit over the network, sending third network communication data to the remote management unit or receiving fourth network communication data from the remote management unit; the third network communication data comprises the temperature detection information sent by the temperature sensor and received by the BMC hosting chip, the fan control parameter sent by the BMC hosting chip to the fan, and the power supply control parameter sent by the BMC hosting chip to the power supply; the fourth network communication data comprises fan control parameters and power supply control parameters sent to the BMC hosting chip by the remote management unit.
4. The method of claim 3, further comprising:
when the BMC chip works normally, the BMC hosting chip forwards the fan control parameter sent by the BMC chip to the fan, and forwards the power control parameter sent by the BMC chip to the power supply; the BMC hosting chip forwards the temperature detection information sent by the temperature sensor to the BMC chip; the BMC hosting chip also forwards the first network communication data sent by the BMC chip to the remote management unit, or forwards the second network communication data sent by the remote management unit to the BMC chip; the first network communication data comprises temperature detection information which is received by the BMC chip and sent by the temperature sensor, fan control parameters which are sent to the fan by the BMC chip and power supply control parameters which are sent to the power supply by the BMC chip; the second network communication data comprises a fan control parameter and a power control parameter which are sent to the BMC chip by the remote management unit.
5. The method of claim 3, wherein the generating the fan control parameter and the power control parameter according to the temperature detection information comprises:
when the temperature corresponding to the temperature detection information falls outside a preset temperature range, calculating a temperature change rate;
when the temperature change rate is greater than a first threshold value, generating a fan control parameter for increasing the rotating speed of the fan;
and when the temperature change rate is greater than a second threshold value, generating a power supply control parameter for turning off the power supply of the server, the BMC hosting chip simultaneously sending an alarm to the remote management unit.
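As a non-limiting illustration, the decision logic of claim 5 might be sketched as follows. The function name, the default temperature range, and both threshold values are hypothetical. The sketch also assumes that the second threshold is larger than the first and that the change rate is computed from two consecutive readings taken `dt_s` seconds apart, neither of which the claim states.

```python
from typing import Tuple


def control_from_temperature(
    prev_c: float,
    curr_c: float,
    dt_s: float,
    temp_range: Tuple[float, float] = (10.0, 75.0),  # assumed preset range, in C
    first_threshold: float = 0.5,   # assumed, in C per second
    second_threshold: float = 2.0,  # assumed, in C per second
) -> Tuple[bool, bool, bool]:
    """Return (raise_fan_speed, power_off, send_alarm) for one pair of readings."""
    low, high = temp_range
    if low <= curr_c <= high:
        # Temperature is inside the preset range: no change rate is calculated.
        return False, False, False
    rate = (curr_c - prev_c) / dt_s  # temperature change rate
    raise_fan_speed = rate > first_threshold
    power_off = rate > second_threshold
    # The alarm to the remote management unit accompanies the power-off decision.
    return raise_fan_speed, power_off, power_off
```

For example, with the assumed defaults, a rise from 79 °C to 80 °C within one second gives a rate of 1.0 and only raises the fan speed, while a rise from 76 °C to 80 °C within one second gives a rate of 4.0 and also triggers the power-off and the alarm.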
6. The method of claim 3, further comprising:
when the BMC chip is judged to be working normally, the BMC hosting chip also records the state information of each interface of the BMC chip, as sent by the BMC chip;
when the BMC chip is judged to have failed, the BMC hosting chip also sends the stored BMC chip state information to the remote management unit through the second network interface.
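A non-limiting sketch of the record-then-report behaviour of claim 6 is given below. The class `StateRecorder`, the bounded buffer with its `capacity`, and the representation of interface states as a dictionary of strings are assumptions made for illustration; the claim does not say how much state is retained or in what form.

```python
from collections import deque
from typing import Deque, Dict, List


class StateRecorder:
    """Hypothetical store for BMC interface state snapshots."""

    def __init__(self, capacity: int = 8) -> None:
        # Bounded buffer (an assumption): the oldest snapshot is dropped when full.
        self.records: Deque[Dict[str, str]] = deque(maxlen=capacity)

    def record(self, interface_states: Dict[str, str]) -> None:
        # Called while the BMC chip works normally; store a copy of the snapshot.
        self.records.append(dict(interface_states))

    def report_on_failure(self) -> List[Dict[str, str]]:
        # Called once the BMC chip is judged to have failed; the returned list
        # is what would be sent to the remote management unit.
        return list(self.records)
```

Keeping the last few snapshots in the hosting chip means the remote management unit can still see what the BMC chip's interfaces looked like just before the failure, even though the BMC chip itself can no longer report.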
7. The method of claim 3, wherein the determining whether the BMC chip fails comprises:
receiving a check pulse sent by the BMC chip, wherein the check pulse comprises a plurality of pulse signals of different widths;
when the width of a pulse signal differs from that of the corresponding standard signal, determining the pulse signal to be an abnormal pulse signal, and counting the number of abnormal pulse signals;
and when the number of abnormal pulse signals is greater than a threshold value, judging that the BMC chip has failed.
8. The method as claimed in claim 7, wherein the check pulses are divided into a plurality of groups, each group including a predetermined number of pulse signals, and when the number of abnormal pulse signals detected in any one group is greater than a threshold value, it is determined that the BMC chip has failed.
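The failure check of claims 7 and 8 can be illustrated, without limitation, by the sketch below. The function names are hypothetical, pulse widths are assumed to be integers in an arbitrary time unit, and the optional `tolerance` parameter is an addition of this sketch: the claims only speak of a width that is "different" from the standard, which corresponds to the default tolerance of zero.

```python
from typing import Sequence


def count_abnormal(
    widths: Sequence[int], standard: Sequence[int], tolerance: int = 0
) -> int:
    """Count pulses whose width differs from the corresponding standard width."""
    return sum(1 for w, s in zip(widths, standard) if abs(w - s) > tolerance)


def bmc_failed(
    widths: Sequence[int], standard: Sequence[int], threshold: int, tolerance: int = 0
) -> bool:
    """Claim 7: fail when the abnormal count over all pulses exceeds the threshold."""
    return count_abnormal(widths, standard, tolerance) > threshold


def bmc_failed_grouped(
    widths: Sequence[int],
    standard: Sequence[int],
    group_size: int,
    threshold: int,
    tolerance: int = 0,
) -> bool:
    """Claim 8: fail when any one group's abnormal count exceeds the threshold."""
    for start in range(0, len(widths), group_size):
        end = start + group_size
        if count_abnormal(widths[start:end], standard[start:end], tolerance) > threshold:
            return True
    return False
```

With a standard sequence of `[100, 200, 300, 400, 100, 200, 300, 400]` and a received sequence in which the second and seventh pulses are wrong, the overall count is two, so a threshold of one reports a failure under claim 7. Grouped by four with the same threshold, each group contains only one abnormal pulse and no failure is reported, which shows how grouping localises the check to bursts of errors.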
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710667575.2A CN107632907B (en) | 2017-08-07 | 2017-08-07 | BMC chip hosting system and control method thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710667575.2A CN107632907B (en) | 2017-08-07 | 2017-08-07 | BMC chip hosting system and control method thereof |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107632907A (en) | 2018-01-26 |
CN107632907B (en) | 2020-10-23 |
Family
ID=61099539
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710667575.2A Active CN107632907B (en) | 2017-08-07 | 2017-08-07 | BMC chip hosting system and control method thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107632907B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108845651A (en) * | 2018-06-27 | 2018-11-20 | 郑州云海信息技术有限公司 | A kind of autonomous cool-down method of planless server and system |
US10817217B2 (en) * | 2018-06-28 | 2020-10-27 | Seagate Technology Llc | Data storage system with improved time-to-ready |
CN109634879A (en) * | 2018-12-17 | 2019-04-16 | 郑州云海信息技术有限公司 | A kind of PCIE pinboard and monitoring system server |
CN110362176A (en) * | 2019-06-29 | 2019-10-22 | 苏州浪潮智能科技有限公司 | Cooling control system and method when a kind of server BMC fails |
CN110362130B (en) * | 2019-08-21 | 2022-02-11 | 昂纳信息技术(深圳)有限公司 | Drive control method of temperature control system |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN203133821U (en) * | 2013-02-19 | 2013-08-14 | 漯河医学高等专科学校 | Computer fault alarm device |
CN104141623A (en) * | 2013-05-10 | 2014-11-12 | 鸿富锦精密工业(深圳)有限公司 | Fan speed control system |
CN105134451A (en) * | 2015-09-25 | 2015-12-09 | 天津斯巴克瑞汽车电子股份有限公司 | Pulse monitoring device for durability of automobile ignition coil and realizing method of pulse monitoring device |
CN106528374A (en) * | 2016-11-04 | 2017-03-22 | 郑州云海信息技术有限公司 | Rack-level storage device and cooling control system thereof |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080126597A1 (en) * | 2006-08-15 | 2008-05-29 | Tyan Computer Corporation | Alternative Local Card, Central Management Module and System Management Architecture For Multi-Mainboard System |
CN100565463C (en) * | 2007-12-14 | 2009-12-02 | 中国人民解放军国防科学技术大学 | A kind of fault-tolerant backup method of concurrent computational system Management Controller and system |
CN103186452A (en) * | 2011-12-27 | 2013-07-03 | 鸿富锦精密工业(深圳)有限公司 | Server system |
CN104317677A (en) * | 2014-09-29 | 2015-01-28 | 浪潮电子信息产业股份有限公司 | Brickland-based redundant monitoring management system and method |
CN104598329A (en) * | 2015-02-12 | 2015-05-06 | 浪潮电子信息产业股份有限公司 | Automatic BMC (baseboard management controller) fault solution method based on RMC (rack server management center) management |
Also Published As
Publication number | Publication date |
---|---|
CN107632907A (en) | 2018-01-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107632907B (en) | BMC chip hosting system and control method thereof | |
US8656003B2 (en) | Method for controlling rack system using RMC to determine type of node based on FRU's message when status of chassis is changed | |
US20150127814A1 (en) | Monitoring Server Method | |
US20180157572A1 (en) | Rogue Hardware Detection Through Power Monitoring | |
CN107482590B (en) | A kind of driving of IGBT module and fault secure circuit | |
US10303574B1 (en) | Self-generated thermal stress evaluation | |
CN103354503A (en) | Cloud storage system capable of automatically detecting and replacing failure nodes and method thereof | |
WO2016107270A1 (en) | Device managing method, device and device managing controller | |
CN103135732B (en) | Server cabinet system | |
US11733762B2 (en) | Method to allow for higher usable power capacity in a redundant power configuration | |
CN104049709A (en) | Power management method of server system | |
US20240053812A1 (en) | Power supply control method and apparatus, and server and non-volatile storage medium | |
CN114168401A (en) | Log recording method and device and electronic equipment | |
US8271816B2 (en) | System and method for statistics recording of power devices | |
CN103425545A (en) | System fault tolerance method for multiprocessor server | |
CN116820820A (en) | Server fault monitoring method and system | |
CN112882901A (en) | Intelligent health state monitor of distributed processing system | |
CN104834584A (en) | Method and system for monitoring host computer hardware loads | |
CN114676019B (en) | Method, device, equipment and storage medium for monitoring state of central processing unit | |
US10284134B2 (en) | Method for controlling a fan module of a server rack and controller unit for implementing the same | |
US11126255B1 (en) | Power throttling based on input voltage monitoring | |
US20090204270A1 (en) | Method and system for redundant management of fans within a shared enclosure | |
WO2021190093A1 (en) | Server system, and frequency control device for processor therein | |
CN104680087A (en) | Blade server hot plugging control circuit and device | |
CN114528163A (en) | Automatic positioning system, method and device for server fault hard disk |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | Effective date of registration: 20200922; Address after: 215100 No. 1 Guanpu Road, Guoxiang Street, Wuzhong Economic Development Zone, Suzhou City, Jiangsu Province; Applicant after: SUZHOU LANGCHAO INTELLIGENT TECHNOLOGY Co.,Ltd.; Address before: 450000 Henan province Zheng Dong New District of Zhengzhou City Xinyi Road No. 278 16 floor room 1601; Applicant before: ZHENGZHOU YUNHAI INFORMATION TECHNOLOGY Co.,Ltd. |
GR01 | Patent grant | ||