[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN107632907B - BMC chip hosting system and control method thereof - Google Patents

BMC chip hosting system and control method thereof Download PDF

Info

Publication number
CN107632907B
CN107632907B CN201710667575.2A CN201710667575A CN107632907B CN 107632907 B CN107632907 B CN 107632907B CN 201710667575 A CN201710667575 A CN 201710667575A CN 107632907 B CN107632907 B CN 107632907B
Authority
CN
China
Prior art keywords
bmc
chip
hosting
fan
power supply
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710667575.2A
Other languages
Chinese (zh)
Other versions
CN107632907A (en
Inventor
刘同强
王朝辉
童元满
赵元
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN201710667575.2A priority Critical patent/CN107632907B/en
Publication of CN107632907A publication Critical patent/CN107632907A/en
Application granted granted Critical
Publication of CN107632907B publication Critical patent/CN107632907B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Control Of Temperature (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention provides a BMC (baseboard management controller) chip hosting system and a control method thereof, wherein the system comprises a BMC hosting chip and a server mainboard, and the BMC hosting chip is connected with a BMC chip, a temperature sensor, a fan and a power supply; the temperature sensor, the fan and the power supply are arranged on the server mainboard; the BMC hosting chip is connected with the BMC chip through a control bus, the BMC hosting chip is connected with a power supply through a power supply control line, and the BMC hosting chip is connected with the fan through a fan control signal line; the BMC hosting chip is used for judging that the BMC chip is invalid; when the BMC chip fails, generating a fan control parameter and a power supply control parameter according to received temperature detection information of the temperature sensor, sending the fan control parameter to a fan to control the rotating speed of the fan, and sending the power supply control parameter to a power supply to control the power supply to be turned off.

Description

BMC chip hosting system and control method thereof
Technical Field
The invention belongs to the technical field of chips special for servers, and particularly relates to a BMC (baseboard management controller) chip hosting system and a control method thereof.
Background
BMC is an abbreviation of Baseboard Management Controller, a board Management Controller.
With the rapid development of server performance, the frequency of chips used by the server is faster and faster, and meanwhile, the requirement of clients on the stability of the server is higher and higher. In the current server structure, a BMC chip is mostly adopted for managing the management of a server mainboard, and the BMC chip can monitor and control the temperature and the rotating speed of a fan, so that the mainboard is better maintained. The client demands the server more and more in terms of performance, and the stability demands the same more and more. In practical applications, the BMC chip may be subjected to various disturbances, such as electromagnetic radiation, which may cause the BMC chip to fail. When the BMC chip fails, the fan is out of control, and a large amount of special chips integrated in the server generate a large amount of heat, if the heat is dissipated in time, the whole server has a risk of abnormal work. If the state before the BMC chip fails is not stored, the opportunity of analyzing the failure reason of the BMC chip is lost. Meanwhile, the BMC chip is invalid, and alarm information cannot be sent to a control room.
In the current server fan management design among the prior art, more advanced adoption is in case after the BMC chip became invalid, use the mode of fixed mode control fan, inside the server, adopt the BMC chip as the controller of fan speed, and the BMC chip outputs periodic pulse signal as feeding the dog signal under normal operating condition, connect the singlechip with the BMC chip, this singlechip is as the monitoring unit of BMC chip operating condition, feed the dog signal of losing through monitoring BMC, realize the monitoring whether normal work to BMC, after the BMC chip became invalid, regulate and control fan speed, guarantee the normal heat dissipation of system. In the above steps, an alternative gating device is adopted as the switching selection of the fan control signal after failure and the normal control signal; meanwhile, the gate is connected with a high level signal and used as a fan control signal after the BMC chip is invalid, so that the fan is in a full-speed rotation state; the power consumption of the fan cannot be well controlled by adopting a fixed mode, because the fan runs at full speed, the resource waste is caused, the working state of the server cannot be detected even if the fan runs at full speed, and if the power consumption of the system is increased sharply, the server cannot be closed; meanwhile, the remote management system BMC chip cannot be informed of the occurrence of a problem, and system maintenance is carried out.
Therefore, it is very necessary to provide a BMC hosting chip and a control method thereof to overcome the above-mentioned drawbacks of the prior art.
Disclosure of Invention
The invention aims to provide a BMC chip hosting system and a control method thereof to solve the technical problems, aiming at the defects that a fan is out of control after a BMC chip fails, the reason of the failure of the BMC chip cannot be analyzed, and service alarm cannot be performed.
In order to achieve the purpose, the invention provides the following technical scheme:
a BMC chip hosting system comprises a BMC hosting chip and a server mainboard, wherein the BMC hosting chip is connected with a BMC chip, a temperature sensor, a fan and a power supply; the temperature sensor, the fan and the power supply are arranged on the server mainboard;
the BMC hosting chip is connected with the BMC chip through a control bus, the BMC hosting chip is connected with a power supply through a power supply control line, and the BMC hosting chip is connected with the fan through a fan control signal line;
the BMC hosting chip is used for judging that the BMC chip is invalid; when the BMC chip fails, generating a fan control parameter and a power supply control parameter according to received temperature detection information of the temperature sensor, sending the fan control parameter to a fan to control the rotating speed of the fan, and sending the power supply control parameter to a power supply to control the power supply to be turned off.
Further, the BMC hosting chip is connected with the BMC chip through a network cable, and the BMC hosting chip is also connected with a remote management unit;
when the BMC chip fails, the BMC hosting chip is further in network communication with the remote management unit;
when the BMC chip works normally, the BMC hosting chip forwards the fan control parameter sent by the BMC chip to the fan, and forwards the power control parameter sent by the BMC chip to the power supply; the BMC hosting chip forwards the temperature detection information sent by the temperature sensor to the BMC chip; the BMC hosting chip also forwards the first network communication data sent by the BMC chip to the remote management unit, or forwards the second network communication data sent by the remote management unit to the BMC chip.
Further, the BMC hosting chip comprises a logic control module, and the logic control module is connected with a BMC failure detection module, a BMC running information storage module, a first network interface and a second network interface;
the logic control module is also connected with a power supply, a fan and a temperature sensor;
the BMC failure detection module and the BMC running information storage module are also connected with the BMC chip through a control bus;
the BMC hosting chip is in communication connection with the BMC chip through the first network interface and is in communication connection with the remote management unit through the second network interface;
when the BMC chip fails, the BMC hosting chip sends third network communication data to the remote management unit through the second network interface, or receives fourth network communication data from the remote management unit through the second network interface; the third network communication data comprises temperature detection information sent by the temperature sensor and received by the BMC hosting chip, a fan control parameter sent by the BMC hosting chip to the fan and a power supply control parameter sent by the BMC hosting chip to the power supply; the fourth network communication data comprises fan control parameters and power supply control parameters which are sent to the BMC hosting chip by the remote management unit;
when the BMC chip works normally, the BMC hosting chip receives first network communication data from the BMC chip through a first network interface and sends the first network communication data to the remote management unit through a second network interface, or receives second network communication data from the remote management unit through the second network interface and sends the second network communication data to the BMC chip through the first network interface; the first network communication data comprises temperature detection information which is received by the BMC chip and sent by the temperature sensor, fan control parameters which are sent to the fan by the BMC chip and power supply control parameters which are sent to the power supply by the BMC chip; the second network communication data comprises a fan control parameter and a power supply control parameter which are sent to the BMC chip by the remote management unit;
the BMC running information storage module is used for storing control parameters sent by the BMC chip to the power supply and the fan and state information of the BMC chip;
the BMC failure detection module is used for detecting whether the BMC chip is in a failure state;
the logic control module is used for judging whether the BMC hosting chip is in a state of forwarding BMC chip data to the fan and the power supply or in a state of taking over the function of the fan and the power supply of the BMC chip according to whether the BMC chip fails or not.
Furthermore, the control bus comprises a BMC interface state line, a power control bus, a temperature detection bus and a fan control bus.
Furthermore, the fan control signal line adopts a PWM fan control signal line.
The invention also provides a control method based on the BMC hosting chip system, which comprises the following steps:
judging whether the BMC chip fails or not;
when the BMC chip is judged to be invalid, the BMC hosting chip receives temperature detection information sent by the temperature sensor; generating a fan control parameter and a power supply control parameter according to the temperature detection information, and sending the fan control parameter to a fan; and providing the power supply control parameter to the power supply.
Further, still include:
when the BMC chip is judged to be invalid, the BMC hosting chip is further in network communication with the remote management unit, and the BMC hosting chip sends third network communication data to the remote management unit or receives fourth network communication data from the remote management unit; the third network communication data comprises temperature detection information sent by the temperature sensor and received by the BMC hosting chip, a fan control parameter sent by the BMC hosting chip to the fan and a power supply control parameter sent by the BMC hosting chip to the power supply; the fourth network communication data comprises fan control parameters and power control parameters which are sent to the BMC hosting chip by the remote management unit.
Further, still include:
when the BMC chip works normally, the BMC hosting chip forwards the fan control parameter sent by the BMC chip to the fan, and forwards the power control parameter sent by the BMC chip to the power supply; the BMC hosting chip forwards the temperature detection information sent by the temperature sensor to the BMC chip; the BMC hosting chip also forwards the first network communication data sent by the BMC chip to the remote management unit, or forwards the second network communication data sent by the remote management unit to the BMC chip; the first network communication data comprises temperature detection information which is received by the BMC chip and sent by the temperature sensor, fan control parameters which are sent to the fan by the BMC chip and power supply control parameters which are sent to the power supply by the BMC chip; the second network communication data comprises a fan control parameter and a power control parameter which are sent to the BMC chip by the remote management unit.
Further, according to the temperature detection information, generating a fan control parameter and a power supply control parameter, comprising:
when the temperature corresponding to the temperature detection information belongs to a preset temperature range, the BMC hosting chip maintains the parameters of the control fan and the power supply control parameters before the BMC chip fails to be changed;
when the temperature corresponding to the temperature detection information does not belong to a preset temperature range, calculating a temperature change rate;
when the temperature change rate is larger than a first threshold value, generating a fan control parameter for increasing the rotating speed of the fan;
and when the temperature change rate is greater than a second threshold value, generating a power control parameter for closing the power supply of the server, and simultaneously sending an alarm to the remote management unit by the BMC hosting chip.
Further, when the BMC chip is judged to work normally, the BMC hosting chip also records the state information of each interface of the BMC chip sent by the BMC chip; each interface state information of the BMC comprises processor internal register information of the BMC and each peripheral state information;
when the BMC chip is judged to be invalid, the BMC hosting chip also sends the stored BMC chip state information to the remote management unit through the second network interface; the failure reason of the BMC chip can be analyzed by recording and sending the state information before the failure of the BMC chip.
Further, the determining whether the BMC chip fails includes:
receiving a check pulse sent by a BMC chip, wherein the check pulse comprises a plurality of pulse signals, and the widths of the plurality of pulse signals are different;
when the width of the pulse signal is different from that of the corresponding standard signal, determining the pulse signal as an abnormal pulse signal, and counting the times of the abnormal pulse signal;
and when the frequency of the abnormal pulse signals is greater than a threshold value, judging that the BMC chip is invalid.
Furthermore, the check pulses are divided into a plurality of groups, each group comprises pulse signals with an appointed number, and when the frequency of detecting that any one group of abnormal pulse signals is greater than a threshold value, the BMC chip is judged to be invalid.
Further, the alarm comprises an SNMP alarm.
The invention has the beneficial effects that: the BMC hosting chip plays a role in maintaining the operation of the server in the operation process of the server. In order to prevent a BMC chip from losing control of a cooling fan due to failure caused by unknown reasons, the server is over-high in temperature, the alarm function is failed, and further an administrator cannot timely get rid of faults and possibly has a series of phenomena such as safety accidents, a BMC hosting chip is introduced, when the BMC chip is normal, the BMC hosting chip plays a forwarding role, when the BMC chip fails, the BMC chip plays a hosting role, and simultaneously saves and sends the state before the BMC chip fails, and the opportunity of the failure reason of the BMC chip is analyzed; the fan and the power supply are operated in a state before the BMC chip fails, so that the safety accident caused by not controlling the fan or the resource waste caused by operating the fan at full speed is avoided.
In addition, the invention has reliable design principle, simple structure and very wide application prospect.
Therefore, compared with the prior art, the invention has prominent substantive features and remarkable progress, and the beneficial effects of the implementation are also obvious.
Drawings
FIG. 1 is a block diagram of the system of the present invention;
FIG. 2 is a block diagram of a BMC hosting chip of the present invention;
FIG. 3 is a flow chart of a method of the present invention;
wherein, 1-BMC trustees the chip; 1.1-a logic control module; 1.2-BMC failure detection module; 1.3-BMC operation information storage module; 1.4-a first network interface; 1.5-a second network interface; 2-BMC chip; 3-a power supply; 4-a fan; 5-a temperature sensor; 6-remote management unit.
The specific implementation mode is as follows:
in order to make the objects, features and advantages of the present invention more obvious and understandable, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention.
As shown in fig. 1, the present invention provides a BMC chip hosting system, which includes a BMC hosting chip 1 and a server motherboard, wherein the BMC hosting chip 1 is connected to a BMC chip 2, a temperature sensor 5, a fan 4 and a power supply 3; the temperature sensor, the fan 4 and the power supply 3 are arranged on the server mainboard;
the BMC hosting chip 1 is connected with the BMC chip 2 through a control bus, the BMC hosting chip 1 is connected with the power supply 3 through a power supply control line, and the BMC hosting chip 1 is connected with the fan 4 through a fan control signal line; the control bus comprises a BMC interface state line, a power supply control bus, a temperature detection bus and a fan control bus; the fan control signal line adopts a PWM fan control signal line;
the BMC hosting chip 1 is used for judging that the BMC chip 2 is invalid; when the BMC chip 2 fails, generating a fan control parameter and a power supply control parameter according to the received temperature detection information of the temperature sensor 5, sending the fan control parameter to the fan 4 to control the rotating speed of the fan, and sending the power supply control parameter to the power supply 3 to control the power supply to be turned off;
the BMC hosting chip 1 is connected with the BMC chip 2 through a network cable, and the BMC hosting chip 1 is connected with a remote management unit 6;
when the BMC chip 2 fails, the BMC hosting chip 1 is further in network communication with the remote management unit 6;
when the BMC chip 2 works normally, the BMC hosting chip 1 forwards the fan control parameter sent by the BMC chip 2 to the fan 4, and forwards the power control parameter sent by the BMC chip 2 to the power supply 3; the BMC hosting chip 1 forwards the temperature detection information sent by the temperature sensor 5 to the BMC chip 2; the BMC hosting chip 1 also forwards the first network communication data sent by the BMC chip 2 to the remote management unit 6, or forwards the second network communication data sent by the remote management unit 6 to the BMC chip 2.
As shown in fig. 2, the BMC hosting chip 1 includes a logic control module 1.1, and the logic control module 1.1 is connected to a BMC failure detection module 1.2, a BMC operation information storage module 1.3, a first network interface 1.4, and a second network interface 1.5;
the logic control module 1.1 is also connected with a power supply 3, a fan 4 and a temperature sensor 5;
the BMC failure detection module 1.2 and the BMC running information storage module 1.3 are also connected with the BMC chip 2 through a control bus;
the BMC hosting chip 1 is in communication connection with the BMC chip 2 through the first network interface 1.4, and is in communication connection with the remote management unit 6 through the second network interface 1.5;
when the BMC chip 2 fails, the BMC hosting chip 1 is configured to send third network communication data to the remote management unit 6, or receive fourth network communication data from the remote management unit 6; the third network communication data comprises temperature detection information received by the BMC hosting chip 1 and sent by the temperature sensor 5, fan control parameters sent by the BMC hosting chip 1 to the fan 4 and power supply control parameters sent by the BMC hosting chip 1 to the power supply 3; the fourth network communication data comprises fan control parameters and power supply control parameters which are sent to the BMC hosting chip 1 by the remote management unit 6;
when the BMC chip 2 normally works, the BMC hosting chip 1 is configured to receive first network communication data from the BMC chip 2 and send the first network communication data to the remote management unit 6, or receive second network communication data from the remote management unit 6 and send the second network communication data to the BMC chip 2; the first network communication data comprises temperature detection information which is received by the BMC chip 2 and sent by the temperature sensor 5, fan control parameters which are sent to the fan 4 by the BMC chip 2 and power supply control parameters which are sent to the power supply 3 by the BMC chip 2; the second network communication data comprises a fan control parameter and a power supply control parameter which are sent to the BMC chip 2 by the remote management unit 6;
the BMC running information storage module 1.3 is used for storing control parameters sent by the BMC chip 2 to the power supply 3 and the fan 4 and state information of the BMC chip 2;
the BMC failure detection module 1.2 is used for detecting whether the BMC chip 2 is in a failure state;
the logic control module 1.1 is used for judging whether the BMC hosting chip 1 is in a state of forwarding the BMC chip 2 data to a fan and a power supply or in a state of taking over the fan and power supply functions of the BMC chip 2 according to whether the BMC chip 2 fails.
As shown in fig. 3, the present invention further provides a method for controlling a BMC chip hosting system, including the following steps:
judging whether the BMC chip fails or not;
when the BMC chip is judged to be invalid, the BMC hosting chip receives temperature detection information sent by the temperature sensor; generating a fan control parameter and a power supply control parameter according to the temperature detection information, sending the fan control parameter to a fan, and sending the power supply control parameter to a power supply; the BMC hosting chip is also in network communication with the remote management unit, and sends third network communication data to the remote management unit or receives fourth network communication data from the remote management unit; the third network communication data comprises temperature detection information sent by the temperature sensor and received by the BMC hosting chip, a fan control parameter sent by the BMC hosting chip to the fan and a power supply control parameter sent by the BMC hosting chip to the power supply; the fourth network communication data comprises fan control parameters and power supply control parameters which are sent to the BMC hosting chip by the remote management unit; the BMC hosting chip also sends the stored BMC chip state information to the remote management unit through a second network interface; the failure reason of the BMC chip can be analyzed by recording and sending the state information before the failure of the BMC chip;
when the BMC chip works normally, the BMC hosting chip forwards the fan control parameter sent by the BMC chip to the fan, and forwards the power control parameter sent by the BMC chip to the power supply; the BMC hosting chip forwards the temperature detection information sent by the temperature sensor to the BMC chip; the BMC hosting chip also forwards the first network communication data sent by the BMC chip to the remote management unit, or forwards the second network communication data sent by the remote management unit to the BMC chip; the first network communication data comprises temperature detection information which is received by the BMC chip and sent by the temperature sensor, fan control parameters which are sent to the fan by the BMC chip and power supply control parameters which are sent to the power supply by the BMC chip; the second network communication data comprises a fan control parameter and a power supply control parameter which are sent to the BMC chip by the remote management unit; the BMC hosting chip also records the state information of each interface of the BMC chip sent by the BMC chip; each interface state information of the BMC comprises processor internal register information of the BMC and each peripheral state information;
the generating of the fan control parameter and the power supply control parameter according to the temperature detection information includes:
when the temperature corresponding to the temperature detection information belongs to a preset temperature range, the BMC hosting chip maintains the parameters of the control fan and the power supply control parameters before the BMC chip fails to be changed;
when the temperature corresponding to the temperature detection information does not belong to a preset temperature range, calculating a temperature change rate;
when the temperature change rate is larger than a first threshold value, generating a fan control parameter for increasing the rotating speed of the fan;
when the temperature change rate is larger than a second threshold value, generating a power supply control parameter for closing the power supply of the server, and simultaneously sending an alarm to a remote management unit by the BMC hosting chip;
the step of judging whether the BMC chip fails comprises the following steps: receiving a check pulse sent by a BMC chip, wherein the check pulse comprises a plurality of pulse signals, and the widths of the plurality of pulse signals are different; when the width of the pulse signal is different from that of the corresponding standard signal, determining the pulse signal as an abnormal pulse signal, and counting the times of the abnormal pulse signal; when the frequency of the abnormal pulse signals is greater than a threshold value, judging that the BMC chip is invalid;
the check pulses are divided into a plurality of groups, each group comprises pulse signals with a predetermined number, and when the frequency of detecting that any one group of abnormal pulse signals is greater than a threshold value, the BMC chip is judged to be invalid;
taking the number of pulse signals appointed in each group as 5, the time threshold value is 2 as an example, the 5 pulse signals are respectively appointed as a first pulse signal, a second pulse signal, a third pulse signal, a fourth pulse signal and a fifth pulse signal, the width of the first pulse signal is 1 clock cycle, the width of the second pulse signal is 2 clock cycles, the width of the third pulse signal is 3 clock cycles, the width of the fourth pulse signal is 4 clock cycles, the width of the fifth pulse signal is 5 clock cycles, the first pulse signal, the second pulse signal, the third pulse signal, the fourth pulse signal and the fifth pulse signal are sequentially sent according to the sequence number, and if more than 2 times, for example, 3 times of abnormal pulse signals occur, the BMC chip is determined to be out of service.
The embodiments of the present invention are illustrative rather than restrictive, and the above-mentioned embodiments are only provided to help understanding of the present invention, so that the present invention is not limited to the embodiments described in the detailed description, and other embodiments derived from the technical solutions of the present invention by those skilled in the art also belong to the protection scope of the present invention.

Claims (8)

1. A BMC chip hosting system is characterized by comprising a BMC hosting chip and a server mainboard, wherein the BMC hosting chip is connected with a BMC chip, a temperature sensor, a fan and a power supply;
the temperature sensor, the fan and the power supply are arranged on the server mainboard;
the BMC hosting chip is connected with the BMC chip through a control bus, the BMC hosting chip is connected with a power supply through a power supply control line, and the BMC hosting chip is connected with the fan through a fan control signal line;
the BMC hosting chip is used for judging that the BMC chip is invalid; when the BMC chip fails, generating a fan control parameter and a power supply control parameter according to received temperature detection information of a temperature sensor, sending the fan control parameter to a fan to control the rotating speed of the fan, and sending the power supply control parameter to a power supply to control the power supply to be turned off;
the BMC hosting chip is also connected with a remote management unit;
when the BMC chip fails, the BMC hosting chip is further in network communication with the remote management unit;
when the BMC chip works normally, the BMC hosting chip forwards the fan control parameter sent by the BMC chip to the fan, and forwards the power control parameter sent by the BMC chip to the power supply; the BMC hosting chip forwards the temperature detection information sent by the temperature sensor to the BMC chip; the BMC hosting chip also forwards the first network communication data sent by the BMC chip to the remote management unit, or forwards the second network communication data sent by the remote management unit to the BMC chip.
2. The BMC chip hosting system of claim 1, wherein the BMC chip hosting system comprises a logic control module, and the logic control module is connected to a BMC failure detection module, a BMC operation information storage module, a first network interface, and a second network interface;
the logic control module is also connected with a power supply, a fan and a temperature sensor;
the BMC failure detection module and the BMC running information storage module are also connected with the BMC chip through a control bus;
the BMC hosting chip is in communication connection with the BMC chip through the first network interface and is in communication connection with the remote management unit through the second network interface;
when the BMC chip fails, the BMC hosting chip sends third network communication data to the remote management unit through the second network interface, or receives fourth network communication data from the remote management unit through the second network interface; the third network communication data comprises temperature detection information sent by the temperature sensor and received by the BMC hosting chip, a fan control parameter sent by the BMC hosting chip to the fan and a power supply control parameter sent by the BMC hosting chip to the power supply; the fourth network communication data comprises fan control parameters and power supply control parameters which are sent to the BMC hosting chip by the remote management unit;
when the BMC chip works normally, the BMC hosting chip receives first network communication data from the BMC chip through a first network interface and sends the first network communication data to the remote management unit through a second network interface, or receives second network communication data from the remote management unit through a second network interface and sends the second network communication data to the BMC chip through the first network interface; the first network communication data comprises temperature detection information which is received by the BMC chip and sent by the temperature sensor, fan control parameters which are sent to the fan by the BMC chip and power supply control parameters which are sent to the power supply by the BMC chip; the second network communication data comprises a fan control parameter and a power control parameter which are sent to the BMC chip by the remote management unit.
3. A method for controlling a BMC chip hosting system according to claim 1 or 2, comprising the steps of:
judging whether the BMC chip fails or not;
when the BMC chip is judged to be invalid, the BMC hosting chip receives temperature detection information sent by the temperature sensor; generating a fan control parameter and a power supply control parameter according to the temperature detection information, sending the fan control parameter to a fan, and sending the power supply control parameter to a power supply;
when the BMC chip is judged to be invalid, the BMC hosting chip is further in network communication with the remote management unit, and the BMC hosting chip sends third network communication data to the remote management unit or receives fourth network communication data from the remote management unit; the third network communication data comprises temperature detection information sent by the temperature sensor and received by the BMC hosting chip, a fan control parameter sent by the BMC hosting chip to the fan and a power supply control parameter sent by the BMC hosting chip to the power supply; the fourth network communication data comprises fan control parameters and power control parameters which are sent to the BMC hosting chip by the remote management unit.
4. The method of claim 3, further comprising:
when the BMC chip works normally, the BMC hosting chip forwards the fan control parameter sent by the BMC chip to the fan, and forwards the power control parameter sent by the BMC chip to the power supply; the BMC hosting chip forwards the temperature detection information sent by the temperature sensor to the BMC chip; the BMC hosting chip also forwards the first network communication data sent by the BMC chip to the remote management unit, or forwards the second network communication data sent by the remote management unit to the BMC chip; the first network communication data comprises temperature detection information which is received by the BMC chip and sent by the temperature sensor, fan control parameters which are sent to the fan by the BMC chip and power supply control parameters which are sent to the power supply by the BMC chip; the second network communication data comprises a fan control parameter and a power control parameter which are sent to the BMC chip by the remote management unit.
5. The method of claim 3, wherein the generating the fan control parameter and the power control parameter according to the temperature detection information comprises:
when the temperature corresponding to the temperature detection information does not belong to a preset temperature range, calculating a temperature change rate;
when the temperature change rate is larger than a first threshold value, generating a fan control parameter for increasing the rotating speed of the fan;
and when the temperature change rate is greater than a second threshold value, generating a power control parameter for closing the power supply of the server, and simultaneously sending an alarm to the remote management unit by the BMC hosting chip.
6. The method of claim 3, further comprising:
when the BMC chip is judged to work normally, the BMC hosting chip also records the state information of each interface of the BMC chip sent by the BMC chip;
when the BMC chip is judged to be invalid, the BMC hosting chip also sends the stored BMC chip state information to the remote management unit through the second network interface.
7. The method of claim 3, wherein the determining whether the BMC chip fails comprises:
receiving a check pulse sent by a BMC chip, wherein the check pulse comprises a plurality of pulse signals, and the widths of the plurality of pulse signals are different;
when the width of the pulse signal is different from that of the corresponding standard signal, determining the pulse signal as an abnormal pulse signal, and counting the times of the abnormal pulse signal;
and when the frequency of the abnormal pulse signals is greater than a threshold value, judging that the BMC chip is invalid.
8. The method as claimed in claim 7, wherein the check pulses are divided into a plurality of groups, each group includes a predetermined number of pulse signals, and when the number of times that any one group of abnormal pulse signals is detected is greater than a threshold value, it is determined that the BMC chip is disabled.
CN201710667575.2A 2017-08-07 2017-08-07 BMC chip hosting system and control method thereof Active CN107632907B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710667575.2A CN107632907B (en) 2017-08-07 2017-08-07 BMC chip hosting system and control method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710667575.2A CN107632907B (en) 2017-08-07 2017-08-07 BMC chip hosting system and control method thereof

Publications (2)

Publication Number Publication Date
CN107632907A CN107632907A (en) 2018-01-26
CN107632907B true CN107632907B (en) 2020-10-23

Family

ID=61099539

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710667575.2A Active CN107632907B (en) 2017-08-07 2017-08-07 BMC chip hosting system and control method thereof

Country Status (1)

Country Link
CN (1) CN107632907B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108845651A (en) * 2018-06-27 2018-11-20 郑州云海信息技术有限公司 A kind of autonomous cool-down method of planless server and system
US10817217B2 (en) * 2018-06-28 2020-10-27 Seagate Technology Llc Data storage system with improved time-to-ready
CN109634879A (en) * 2018-12-17 2019-04-16 郑州云海信息技术有限公司 A kind of PCIE pinboard and monitoring system server
CN110362176A (en) * 2019-06-29 2019-10-22 苏州浪潮智能科技有限公司 Cooling control system and method when a kind of server B MC fails
CN110362130B (en) * 2019-08-21 2022-02-11 昂纳信息技术(深圳)有限公司 Drive control method of temperature control system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN203133821U (en) * 2013-02-19 2013-08-14 漯河医学高等专科学校 Computer fault alarm device
CN104141623A (en) * 2013-05-10 2014-11-12 鸿富锦精密工业(深圳)有限公司 Fan speed control system
CN105134451A (en) * 2015-09-25 2015-12-09 天津斯巴克瑞汽车电子股份有限公司 Pulse monitoring device for durability of automobile ignition coil and realizing method of pulse monitoring device
CN106528374A (en) * 2016-11-04 2017-03-22 郑州云海信息技术有限公司 Rack-level storage device and cooling control system thereof

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080126597A1 (en) * 2006-08-15 2008-05-29 Tyan Computer Corporation Alternative Local Card, Central Management Module and System Management Architecture For Multi-Mainboard System
CN100565463C (en) * 2007-12-14 2009-12-02 中国人民解放军国防科学技术大学 A kind of fault-tolerant backup method of concurrent computational system Management Controller and system
CN103186452A (en) * 2011-12-27 2013-07-03 鸿富锦精密工业(深圳)有限公司 Server system
CN104317677A (en) * 2014-09-29 2015-01-28 浪潮电子信息产业股份有限公司 Brickland-based redundant monitoring management system and method
CN104598329A (en) * 2015-02-12 2015-05-06 浪潮电子信息产业股份有限公司 Automatic BMC (baseboard management controller) fault solution method based on RMC (rack server management center) management

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN203133821U (en) * 2013-02-19 2013-08-14 漯河医学高等专科学校 Computer fault alarm device
CN104141623A (en) * 2013-05-10 2014-11-12 鸿富锦精密工业(深圳)有限公司 Fan speed control system
CN105134451A (en) * 2015-09-25 2015-12-09 天津斯巴克瑞汽车电子股份有限公司 Pulse monitoring device for durability of automobile ignition coil and realizing method of pulse monitoring device
CN106528374A (en) * 2016-11-04 2017-03-22 郑州云海信息技术有限公司 Rack-level storage device and cooling control system thereof

Also Published As

Publication number Publication date
CN107632907A (en) 2018-01-26

Similar Documents

Publication Publication Date Title
CN107632907B (en) BMC chip hosting system and control method thereof
US8656003B2 (en) Method for controlling rack system using RMC to determine type of node based on FRU's message when status of chassis is changed
US20150127814A1 (en) Monitoring Server Method
US20180157572A1 (en) Rogue Hardware Detection Through Power Monitoring
CN107482590B (en) A kind of driving of IGBT module and fault secure circuit
US10303574B1 (en) Self-generated thermal stress evaluation
CN103354503A (en) Cloud storage system capable of automatically detecting and replacing failure nodes and method thereof
WO2016107270A1 (en) Device managing method, device and device managing controller
CN103135732B (en) Server cabinet system
US11733762B2 (en) Method to allow for higher usable power capacity in a redundant power configuration
CN104049709A (en) Power management method of server system
US20240053812A1 (en) Power supply control method and apparatus, and server and non-volatile storage medium
CN114168401A (en) Log recording method and device and electronic equipment
US8271816B2 (en) System and method for statistics recording of power devices
CN103425545A (en) System fault tolerance method for multiprocessor server
CN116820820A (en) Server fault monitoring method and system
CN112882901A (en) Intelligent health state monitor of distributed processing system
CN104834584A (en) Method and system for monitoring host computer hardware loads
CN114676019B (en) Method, device, equipment and storage medium for monitoring state of central processing unit
US10284134B2 (en) Method for controlling a fan module of a server rack and controller unit for implementing the same
US11126255B1 (en) Power throttling based on input voltage monitoring
US20090204270A1 (en) Method and system for redundant management of fans within a shared enclosure
WO2021190093A1 (en) Server system, and frequency control device for processor therein
CN104680087A (en) Blade server hot plugging control circuit and device
CN114528163A (en) Automatic positioning system, method and device for server fault hard disk

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20200922

Address after: 215100 No. 1 Guanpu Road, Guoxiang Street, Wuzhong Economic Development Zone, Suzhou City, Jiangsu Province

Applicant after: SUZHOU LANGCHAO INTELLIGENT TECHNOLOGY Co.,Ltd.

Address before: 450000 Henan province Zheng Dong New District of Zhengzhou City Xinyi Road No. 278 16 floor room 1601

Applicant before: ZHENGZHOU YUNHAI INFORMATION TECHNOLOGY Co.,Ltd.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant