[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

WO2019218483A1 - Regression tree model-based blood analysis method and apparatus, terminal device and readable storage medium - Google Patents

Regression tree model-based blood analysis method and apparatus, terminal device and readable storage medium Download PDF

Info

Publication number
WO2019218483A1
WO2019218483A1 PCT/CN2018/097562 CN2018097562W WO2019218483A1 WO 2019218483 A1 WO2019218483 A1 WO 2019218483A1 CN 2018097562 W CN2018097562 W CN 2018097562W WO 2019218483 A1 WO2019218483 A1 WO 2019218483A1
Authority
WO
WIPO (PCT)
Prior art keywords
blood
value
regression tree
detection
output
Prior art date
Application number
PCT/CN2018/097562
Other languages
French (fr)
Chinese (zh)
Inventor
卢少烽
洪博然
徐亮
阮晓雯
肖京
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2019218483A1 publication Critical patent/WO2019218483A1/en

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/30ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for calculating health indices; for individual health risk assessment
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/18Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis

Definitions

  • the present application belongs to the field of data processing technologies, and in particular, to a blood analysis method, apparatus, terminal device and computer readable storage medium based on a regression tree model.
  • Blood is an important part of the human body. With the continuous advancement of science and technology, the analysis of blood is also progressing towards refinement and diversification. Generally speaking, in the blood analysis process, first, it is necessary to obtain numerical information of each component in the blood. The specific process is to place the blood to be tested on the test instrument, and then the analyst obtains the numerical information of the blood according to the test result displayed by the test instrument. Then, based on the statistical theory, the mathematical theory, and the preset threshold, the computer judges whether the numerical information exceeds the corresponding threshold, and obtains the blood analysis result according to the judgment result.
  • the value of the threshold is generally set to a range larger than the normal value. If the problem of blood in a certain problem is that some numerical information appears to be higher or lower, but the numerical information of each component does not exceed the corresponding threshold, then the analysis result obtained by analyzing according to the existing analytical method is obviously wrong.
  • the existing blood analysis method can only analyze the numerical information of each component in the blood alone, and cannot comprehensively judge a plurality of numerical information, that is, the reliability and accuracy of analyzing the blood are low.
  • the embodiments of the present application provide a blood analysis method, device, terminal device, and computer readable storage medium based on a regression tree model, so as to solve the analysis of multiple components in the prior art that cannot integrate blood, and analyze blood.
  • a first aspect of the embodiments of the present application provides a blood analysis method based on a regression tree model, including:
  • each of the blood information samples including a blood value and a symptom characteristic value
  • the first alarm prompt is output.
  • a second aspect of an embodiment of the present application provides a blood analysis method apparatus based on a regression tree model, which may include means for implementing the steps of the above-described regression tree model based blood analysis method.
  • a third aspect of the embodiments of the present application provides a terminal device, including a memory and a processor, where the computer stores computer readable instructions executable on the processor, the processor executing the computer The steps of the above-described regression analysis based blood analysis method are implemented when the instruction is read.
  • a fourth aspect of the embodiments of the present application provides a computer readable storage medium storing computer readable instructions, the computer readable instructions being executed by a processor to implement the above-described regression tree model based The steps of the blood analysis method.
  • a plurality of blood information samples are obtained, each blood information sample includes a blood value and a symptom feature value, and a plurality of blood information samples are fitted to a regression tree model, and the fitted regression tree model is used as a detection.
  • the model then the blood to be analyzed, first obtain the blood value of the blood to be tested, and input the blood value to the detection model, obtain the output of the detection model as the detection value, and finally, if it is determined that the detection value is greater than the detection threshold, the output is An alarm prompts that the embodiment of the present application trains the regression tree model based on a plurality of blood information samples, realizes comprehensive analysis of blood values in the blood to be measured, and improves the reliability and accuracy of the blood analysis.
  • FIG. 1 is a flowchart showing an implementation of a blood analysis method based on a regression tree model in the first embodiment of the present application
  • FIG. 2 is a flowchart showing an implementation of a blood analysis method based on a regression tree model in Embodiment 2 of the present application;
  • FIG. 3 is a flowchart showing an implementation of a blood analysis method based on a regression tree model in Embodiment 3 of the present application;
  • FIG. 4 is a flowchart showing an implementation of a blood analysis method based on a regression tree model in Embodiment 4 of the present application;
  • FIG. 5 is a flowchart showing an implementation of a blood analysis method based on a regression tree model in Embodiment 5 of the present application;
  • FIG. 6 is a structural block diagram of a blood analysis apparatus based on a regression tree model in Embodiment 6 of the present application;
  • FIG. 7 is a schematic diagram of a terminal device in Embodiment 7 of the present application.
  • FIG. 1 is a flowchart of implementing a blood analysis method based on a regression tree model according to an embodiment of the present application. As shown in Figure 1, the blood analysis method comprises the following steps:
  • S101 Acquire a plurality of blood information samples, each of the blood information samples including a blood value and a symptom characteristic value.
  • each blood information sample includes a blood value and a symptom characteristic value.
  • the blood value is the relevant value of one or several components in the existing blood of the blood information sample, and can be determined by the test result obtained by the existing blood test instrument, and the existing blood refers to the analyzed instrument and the file is established. Blood.
  • the blood value can be the red blood cell content of the existing blood, or a collection of red blood cell content, white blood cell content, and hemoglobin content of the existing blood.
  • the symptom characteristic value is the affected condition of the existing blood, and is specifically set for a certain symptom, and the symptom characteristic value is determined by judging whether the existing blood is affected by the symptom, and generally, the existing blood affected by the symptom is determined.
  • the symptom characteristic value under the blood information sample is set to 1
  • the symptom characteristic value under the blood information sample of the existing blood that is not affected by the symptom is set to 0 to facilitate calculation, and the purpose of the embodiment of the present application is to judge Whether the blood to be tested is affected by the symptoms.
  • the blood information sample is obtained according to a preset order of magnitude, that is, the obtained blood information sample quantity is at a preset order of magnitude, and the order of magnitude can be freely set according to the accuracy requirement.
  • the order of magnitude can be one thousand.
  • a plurality of blood information samples are obtained from a health file or a hospital database. Since a plurality of blood information samples of different populations are stored in the health file or the hospital database, the number of blood information samples is large, and the symptom characteristic values under the blood information samples are highly accurate, so the need can be directly obtained from the health file or the hospital database.
  • Multiple blood information samples are acquired according to specific conditions. In the actual application scenario, the acquisition condition is set according to the scene requirement to acquire multiple blood information samples, for example, on the premise that the blood to be tested of the human body in a certain town needs to be analyzed, specific conditions are set, only relevant to the town.
  • the health file or hospital database obtains multiple blood information samples; for example, if the blood to be tested is aged for people over 60 years old, the corresponding specific conditions are set, and the age is 60 years old from the health file or hospital database.
  • the plurality of blood information samples corresponding to the above population may also set specific conditions to obtain a plurality of blood information samples of a certain order of magnitude, etc., thereby improving applicability to different application scenarios.
  • S102 Fit the plurality of blood information samples with a regression tree model, and use the regression tree model that is fitted to be a detection model.
  • the blood value under the blood information sample may contain a plurality of related values of the plurality of components in the existing blood
  • the plurality of related values of the plurality of components are related to whether the existing blood is affected by the symptoms.
  • the blood value of the blood information sample includes the white blood cell content, the hemoglobin content, and the platelet content. If the blood of the blood information sample belongs to the chronic obstructive pulmonary disease, it is affected by the chronic obstructive pulmonary disease.
  • the blood value of normal blood the blood value of the blood information sample may appear to increase white blood cell content, increase hemoglobin content and decrease platelet content.
  • it is not known that a plurality of related values of a plurality of components in the blood value are affected by the symptoms.
  • the regression tree model applies the classification and regression tree (CART) algorithm, and the CART algorithm takes multiple correlation values in blood values as multiple features, and establishes "yes" and "no" for each relevant value.
  • CART classification and regression tree
  • Two forks usually the left branch is a branch with a value of "yes”, and the right branch is a branch with a value of "no".
  • the specific division method is based on the symptom characteristic values corresponding to multiple related values.
  • S103 Acquire a blood value of the blood to be tested, and input the blood value of the blood to be tested into the detection model to obtain a detection value.
  • the detection model is generated from the plurality of blood information samples, the blood to be measured is analyzed, the blood value of the blood to be tested is obtained, and the blood value of the blood to be tested is input to the detection model, and the output result of the detection model is obtained, which is the detection value.
  • the plurality of components corresponding to the blood value of the blood to be tested are the same as the plurality of components corresponding to the blood value of the blood information sample, or the blood component corresponding to the blood value of the blood to be tested is the blood value of the blood information sample.
  • the output of the detection model can be regarded as an effective detection value.
  • the detection value is calculated from the blood value of the blood to be tested, and is used to judge whether the blood to be tested is affected by the symptoms. If the symptom characteristic value is 1 indicating that the corresponding blood has been affected by the symptoms, and the symptom characteristic value is 0 indicating that the corresponding blood is not affected by the symptoms, the closer the detection value is to 1, the more likely the blood to be tested is affected by the symptoms. In order to improve the accuracy of the judgment, the detection threshold is set.
  • the detection threshold may be determined according to a plurality of blood information samples and a plurality of symptom characteristic values of the plurality of blood information samples, or a plurality of blood information samples may be input to the detection model, and the plurality of blood information samples are output according to the detection model. The output is determined.
  • a plurality of existing blood information samples are obtained, wherein the blood information samples include blood values and symptom feature values, and the plurality of blood information samples and the regression tree model are obtained.
  • the fitting is performed, and the fitted regression tree model is output as a detection model, and finally, the blood to be measured is analyzed, the blood value of the blood to be tested is obtained, and the blood value of the blood to be tested is input to the detection model, and the model is detected.
  • the output result is used as a detection value, and the detection value is compared with the detection threshold. If the detection value is greater than the detection threshold, it is proved that the blood to be tested is affected by the symptom, and the first alarm prompt is output.
  • the embodiment of the present application can perform blood values for different components.
  • the analysis of the blood to be tested improves the applicability of the blood analysis in different scenarios, and improves the accuracy of the blood analysis by training the regression tree model.
  • FIG. 2 is a flowchart of an implementation of a blood analysis method based on a regression tree model according to Embodiment 2 of the present application.
  • this embodiment refines S102 to obtain S201-S202, which are as follows:
  • S201 input the plurality of blood information samples to the regression tree model to train the regression tree model, wherein a blood value in the blood information sample is used as an input parameter of the regression tree model, The symptom feature value in the blood information sample is used as the tag parameter of the regression tree model.
  • a blood sample set is constructed based on a plurality of blood values and a plurality of symptom characteristic values in the plurality of blood information samples, (Blood value1 , Symptom value1 ), (Blood value2 , Symptom value2 ), ( Blood value1 , Symptom value1 )...(Blood valuen , Symptom valuen ), where Blood valuei indicates the blood value of the i-th blood information sample, and Symptom valuei indicates the symptom characteristic value of the i-th blood information sample.
  • the blood sample set is input to the regression tree model, the blood value is used as an input parameter of the regression tree model, and the symptom feature value is used as a label parameter of the regression tree model, and is fitted to the regression tree model, in the embodiment of the present application.
  • the calculation formula for the regression tree model is:
  • f() indicates a function that exists in the function space.
  • the function space refers to a set of functions of a given kind from one set to another, that is, the f() function is initially in an unknown state, and K is in the regression tree model. There are K of the above f() functions.
  • the regression tree model is trained based on the blood sample set, so that the K f() functions in the final regression tree model maximize the data in the blood sample set.
  • the f() function is learned by a sequential learning method to reduce errors in the learning process. For example, on the basis of the input parameter is a Blood valuei , the predicted value of the t-round is predicted. And when predicting the prediction value of the t-th round, the prediction result of the prediction value of the t-1th round is retained, that is, each regression tree (each f() function) in the regression tree model is sequentially trained, as follows:
  • the prediction value after the prediction of the t-th round is given on the basis that the input parameter is the Blood valuei .
  • the objective function of the regression tree model is constructed, as follows:
  • Symptom valuei concentration is a blood sample Blood valuei input parameter corresponding to the tag parameter, i.e. the value of the blood sample and the information value corresponding to the symptoms characteristic of the blood.
  • D is a constant term, wherein the regular term controls the training degree of the objective function Obj (t) to prevent over-fitting of the blood sample set and the regression tree model, and the regular term is specifically described later; the constant term is one Constant, the constant term is set in the embodiment of the present application mainly for controlling the numerical range of Obj (t) .
  • the objective function Obj (t) is optimized, that is, the appropriate f() function is found. The value is minimized.
  • the expanded target function is:
  • the output value obtained by the training function depends on the values of g i and h i , which improves the simplicity of the training. Therefore, in the embodiment of the present application, the regression tree model can be trained by the finally generated training function.
  • the value of f t (Blood valuei) of the regression prediction value is the tree of blood Blood valuei value.
  • the q(Blood valuei ) in the f() function is used to map, specifically mapping a blood value Blood valuei to a value from 1 to K, that is, dividing the Blood valuei into a node of the regression tree.
  • H is the same as D in the objective function and is a constant term
  • ⁇ and ⁇ are training coefficients, which can be customized by the user for the actual application scenario to adjust the structure of the generated regression tree (the size of the detection model), ⁇ and Any increase in ⁇ will result in a streamlined structure of the regression tree.
  • the training effect of the regression tree model can be significantly improved.
  • the trained regression tree model is output as the detection model.
  • the blood test values Blood valuex blood detection model input the calculation formula can be optimized by detecting a Model After calculation, the detected value is obtained.
  • the existing plurality of blood information samples are input to a regression tree model, wherein the blood value in the blood information sample is used as an input parameter of the regression tree model, The symptom feature value in the blood information sample is used as the tag parameter of the regression tree model, the regression tree model is trained, and the trained regression tree model is obtained as the detection model.
  • the embodiment of the present application embodies the specific training process of the regression tree model, and Training accuracy and training effects are improved by setting the objective function.
  • FIG. 3 is a flowchart of an implementation of a blood analysis method based on a regression tree model according to Embodiment 3 of the present application.
  • the embodiment refines the process before S104 to obtain S301-S303, which are as follows:
  • S301 Input blood values of the plurality of blood information samples to the detection model, and acquire a plurality of output values output by the detection model.
  • the blood value of the blood to be tested can be input to the detection model to obtain a detection value, and according to the detection value, it is judged whether the blood to be tested has been affected by the symptom.
  • the blood value of the plurality of blood information samples is input as an input parameter to the detection model, and the blood sample of the detection model and the plurality of blood information samples are acquired. Corresponding multiple output values.
  • S302 Sort the plurality of output values to generate a sequence of values.
  • the detection model is generated based on the blood value and the symptom feature value in the plurality of blood information samples, but when the blood value in the plurality of blood information samples is re-entered into the detection model, the operation of the training model is not performed, so the output value generated by the detection model is There is a difference from the original symptom characteristic value.
  • multiple output values are sorted according to size to generate a sequence of values, and the first column of the sequence of values is the output value with the largest value. It is worth mentioning that if the first output value of the plurality of output values is the same as the second output value, the sequence of values is generated according to the input order of the blood information samples corresponding to the first output value and the second output value. Specifically, the order writing mechanism is set.
  • an output value When an output value needs to be written into the numerical sequence, it is determined whether there is an existing output value in the numerical sequence that is the same as the output value, if there is no existing output that is the same as the output value.
  • the value is used to find the smallest output value in the numerical sequence that is greater than the output value according to the value of the output value, and write the output value at the position after the existing output value; if there is the same value as the output value
  • the existing output value is written to the output value after the existing output value. If there are multiple existing output values that are the same as the output value, the existing output at the end of the existing output value is searched. The value is written to the output value after the existing output value.
  • S303 Use an output value of the numerical sequence located at a preset position as the detection threshold.
  • the output value at the preset position in the numerical sequence is used as the detection threshold. For example, if 300 output values have been written in the numerical sequence and the preset position is the 50th position, the output value of the 50th digit of the numerical sequence is extracted as the detection threshold.
  • the preset position is determined according to a preset ratio.
  • the preset ratio can be determined according to the big data analysis method. For example, a plurality of national blood samples can be counted nationwide, and each national blood sample includes blood value and symptom characteristic value, and the symptom characteristic value of 1 is determined in a plurality of national blood samples. The proportion of blood samples nationwide, and the ratio is used as a preset ratio.
  • the preset position is the 30th position, and the output value of the 30th digit of the extracted numerical sequence is used as the detection threshold, and the detection threshold is set by the big data analysis method.
  • the preset ratio or preset position can also be pre-set by humans.
  • the blood value of the plurality of blood information samples is input as an input parameter to the detection model, and the plurality of output values output by the detection model are acquired, and the plurality of output values are obtained. Sorting according to the size, generating a numerical sequence, extracting the output value at the preset position in the numerical sequence, using the output value as the detection threshold, and setting the detection threshold according to the detection model, thereby improving the applicability of the detection threshold.
  • FIG. 4 is a flowchart of an implementation of a blood analysis method based on a regression tree model according to Embodiment 4 of the present application.
  • the process before S104 is refined to obtain S401 to S402, which are as follows:
  • S401 Determine, according to the plurality of blood information samples, the first sample number of the symptom characteristic value as the first feature value and the second sample number of the symptom feature value as the second feature value.
  • the symptom characteristic values in the blood information sample have two values, namely 1 or 0, but it should be understood that other characteristic symptom characteristic values can also be applied in the embodiments of the present application.
  • the symptom characteristic value of 1 is taken as the first characteristic value
  • the symptom characteristic value of 0 is taken as the second characteristic value
  • the first characteristic value represents that the existing blood to which the blood information sample belongs has been affected by the symptom
  • the second eigenvalue represents that the existing blood to which the blood information sample belongs is not affected by the symptoms.
  • the number of blood information samples in which the symptom characteristic value is the first characteristic value in the plurality of blood information samples is taken as the first sample number, and the symptom characteristic value in the plurality of blood information samples is the second value.
  • the number of blood information samples of the feature value is taken as the number of second samples.
  • S402 Determine a first proportion ratio and a second proportion ratio according to the first sample number of people and the second sample number, and according to the first proportion ratio, the second proportion ratio, the The first feature value and the second feature value determine the detection threshold.
  • the detection threshold is determined according to the first ratio, the second ratio, the first characteristic value, and the second characteristic value. For example, if the first eigenvalue is greater than the second eigenvalue, one method of determining the detection threshold is to obtain a difference between the first eigenvalue and the second eigenvalue, and the first proportion ratio and the difference A multiplication operation is performed to obtain a boundary value, and a result obtained by subtracting the boundary value from the first feature value is used as a detection threshold.
  • the symptom characteristic value in the plurality of blood information samples is determined as the first characteristic value by using the symptom characteristic value as the first characteristic value or the second characteristic value.
  • the first sample number and the symptom characteristic value are the second sample number of the second feature value, and the first proportion ratio and the second proportion ratio are determined according to the first sample number of people and the second sample number, and finally according to The first proportion ratio, the second proportion ratio, the first eigenvalue, and the second eigenvalue are obtained by the detection threshold. Since the plurality of blood information samples may be collected in a certain area, the embodiment of the present application improves the detection threshold for a certain Geographical accuracy.
  • FIG. 5 is a flowchart of implementing a blood analysis method based on a regression tree model according to Embodiment 5 of the present application.
  • the embodiment further includes the following steps with respect to the embodiment corresponding to FIG. 1 :
  • S501 Acquire a preset plurality of blood threshold templates, wherein the blood threshold template is determined by a blood value of the alarm blood.
  • the blood threshold template is the same as the blood value of the alarm blood that has been affected by a certain symptom, wherein the blood threshold template may Contains the relevant values of one or several blood components in the alarm blood.
  • the types of related values can be selected according to the actual application scenario.
  • obtaining a plurality of alarm bloods that have been affected by a symptom, and performing average evaluation of the plurality of blood values of the plurality of alarm blood, and using the generated average value as a blood threshold template which can reduce the generation of the blood threshold template. The effect of unexpected factors increases the accuracy of the blood threshold template.
  • a plurality of alarm bloods that are affected by the symptoms for a preset time such as obtaining a plurality of alarm bloods that have been affected by the symptoms for six months, and by the same average number evaluation method, multiple alarm bloods are provided.
  • the blood values generate a blood threshold template to further improve the accuracy in the time dimension based on the blood threshold template.
  • S502 Perform a one-to-one comparison of the blood value of the blood to be tested and the plurality of blood threshold templates.
  • the alignment direction is set for multiple related values of the blood threshold template for the symptoms.
  • the blood threshold template corresponding to chronic obstructive pulmonary disease contains leukocyte content and hemoglobin content.
  • the white blood cell content is set to the upward alignment, and the hemoglobin content is also set to the opposite direction, and the platelet content is set to the opposite direction.
  • the upward alignment means that if the white blood cell content in the blood value of the blood to be tested exceeds or equals the white blood cell content in the blood threshold template, the comparison is determined to be successful, otherwise the comparison fails; the downward comparison refers to the blood to be tested. If the platelet content in the blood value is less than the white blood cell content in the blood threshold template, the comparison is deemed successful, otherwise the comparison is deemed to have failed.
  • a setting mechanism may be created for different symptoms, and the setting of the matching direction in the plurality of blood threshold templates is automatically completed by the setting mechanism.
  • the blood value of the blood to be tested is compared with a plurality of blood threshold templates, and if the blood value of the blood to be tested is successfully compared with a blood threshold template, the output is The second alarm prompt, wherein the second alarm prompt may include a symptom corresponding to the blood threshold template, which is convenient for the user to quickly view; if the blood blood value of the blood to be tested and the plurality of blood threshold templates fail to match, the normal prompt is output.
  • the blood threshold template is taken from the blood value of the alarm blood affected by the symptom, and is to be tested.
  • the blood blood value is compared with the plurality of blood threshold templates one by one. If the blood blood value of the blood to be tested is successfully compared with one of the plurality of blood threshold templates, the second alarm prompt is output, and the alarm is generated, and the alarm is set.
  • a blood threshold template enables multi-dimensional analysis of the blood to be measured and improves the reliability of the blood analysis to be tested.
  • FIG. 6 is a structural block diagram of a regression tree model-based blood analysis apparatus provided by an embodiment of the present application.
  • the device includes:
  • a first obtaining unit 61 configured to acquire a plurality of blood information samples, each of the blood information samples including a blood value and a symptom characteristic value;
  • a fitting unit 62 configured to fit the plurality of blood information samples with a regression tree model, and use the regression tree model that is completed as a detection model;
  • a second obtaining unit 63 configured to acquire a blood value of the blood to be tested, and input the blood value of the blood to be tested into the detection model to obtain a detection value;
  • the output unit 64 is configured to output a first alarm prompt if the detected value is greater than the detection threshold.
  • the fitting unit 62 includes:
  • An input unit configured to input the plurality of blood information samples to the regression tree model to train the regression tree model, wherein a blood value in the blood information sample is used as an input vector of the regression tree model And using the symptom feature value in the blood information sample as a label vector of the regression tree model;
  • a model output unit configured to output the trained regression tree model as the detection model.
  • the output unit 64 further includes:
  • a sample input unit configured to input blood values of the plurality of blood information samples to the detection model, and acquire a plurality of output values output by the detection model
  • a sorting unit configured to sort the plurality of output values to generate a sequence of values
  • a threshold output unit configured to use an output value of the numerical sequence at a preset position as the detection threshold.
  • the symptom characteristic value is a first feature value or a second feature value
  • the output unit 64 further includes:
  • a first determining unit configured to determine, according to the plurality of blood information samples, that the symptom feature value is the first sample number of the first feature value and the symptom feature value is the second feature value Two sample numbers;
  • a second determining unit configured to determine, according to the first sample number of people and the second sample number of people, a first proportion ratio and a second proportion ratio, and according to the first proportion ratio, the second proportion The ratio, the first eigenvalue, and the second eigenvalue determine the detection threshold.
  • the blood analysis device further includes:
  • a template acquiring unit configured to acquire a preset plurality of blood threshold templates, wherein the blood threshold template is determined by a blood value of the alarm blood;
  • An aligning unit configured to compare the blood value of the blood to be tested and the plurality of blood threshold templates one by one;
  • the alarm output unit is configured to output a second alarm prompt if the blood value of the blood to be tested is successfully matched with one of the plurality of blood threshold templates.
  • FIG. 7 is a schematic diagram of a terminal device according to an embodiment of the present application.
  • the terminal device 7 of this embodiment includes a processor 70 and a memory 71 in which computer readable instructions 72 executable on the processor 70 are stored, for example, based on a regression tree model. Blood analysis program.
  • the processor 70 executes the computer readable instructions 72 to implement the steps in the various embodiments of the regression tree model based blood analysis method described above, such as steps S101 through S104 shown in FIG.
  • the processor 70 when executing the computer readable instructions 72, implements the functions of the various units in the apparatus embodiments described above, such as the functions of the units 61 through 64 shown in FIG.
  • the computer readable instructions 72 may be partitioned into one or more modules/units that are stored in the memory 71 and executed by the processor 70, To complete this application.
  • the one or more modules/units may be a series of computer readable instruction segments capable of performing a particular function for describing the execution of the computer readable instructions 72 in the terminal device 7.
  • the computer readable instructions 72 may be segmented into a first acquisition unit, a fitting unit, a second acquisition unit, and an output unit, each unit having a specific function as described above.
  • the terminal device may include, but is not limited to, a processor 70 and a memory 71. It will be understood by those skilled in the art that FIG. 7 is only an example of the terminal device 7, and does not constitute a limitation of the terminal device 7, and may include more or less components than those illustrated, or combine some components or different components.
  • the terminal device may further include an input/output device, a network access device, a bus, and the like.
  • the processor 70 may be a central processing unit (CPU), or may be other general-purpose processors, a digital signal processor (DSP), an application specific integrated circuit (ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic device, discrete hardware components, etc.
  • the general purpose processor may be a microprocessor or the processor or any conventional processor or the like.
  • the memory 71 may be an internal storage unit of the terminal device 7, such as a hard disk or a memory of the terminal device 7.
  • the memory 71 may also be an external storage device of the terminal device 7, for example, a plug-in hard disk provided on the terminal device 7, a smart memory card (SMC), and a secure digital (SD). Card, flash card, etc. Further, the memory 71 may also include both an internal storage unit of the terminal device 7 and an external storage device.
  • the memory 71 is for storing the computer readable instructions and other programs and data required by the terminal device.
  • the memory 71 can also be used to temporarily store data that has been output or is about to be output.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
  • the integrated unit if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium.
  • a computer readable storage medium A number of instructions are included to cause a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present application.
  • the foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like, which can store program codes. .

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Medical Informatics (AREA)
  • Health & Medical Sciences (AREA)
  • Public Health (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Optimization (AREA)
  • Mathematical Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Mathematics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Evolutionary Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Primary Health Care (AREA)
  • General Health & Medical Sciences (AREA)
  • Operations Research (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Epidemiology (AREA)
  • Algebra (AREA)
  • Pathology (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

The present solution is applicable to the field of data processing technology, and provides a regression tree model-based blood analysis method, a terminal device and a computer readable storage medium, the method comprising: acquiring a plurality of blood information samples, each of the blood information samples comprising a blood numerical value and a symptom feature value; fitting a regression tree model to the plurality of blood information samples, and using the fitted regression tree model as a test model; acquiring the blood numerical value of blood to be tested, and inputting the blood numerical value of said blood into the test model to obtain a test value; and if the test value is greater than a test threshold, outputting a first alarm prompt. By training the regression tree model, the present solution implements a comprehensive analysis of the blood numerical value of blood to be tested, improving the reliability and accuracy of blood analysis.

Description

基于回归树模型的血液分析方法、装置、终端设备及可读存储介质Blood analysis method, device, terminal device and readable storage medium based on regression tree model
本申请要求于2018年05月14日提交中国专利局、申请号为201810457009.3、发明名称为“基于回归树模型的血液分析方法及终端设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims priority to Chinese Patent Application No. 201810457009.3, entitled "Regression Tree Model Based Blood Analysis Method and Terminal Equipment", submitted to the Chinese Patent Office on May 14, 2018, the entire contents of which are incorporated by reference. In this application.
技术领域Technical field
本申请属于数据处理技术领域,尤其涉及一种基于回归树模型的血液分析方法、装置、终端设备及计算机可读存储介质。The present application belongs to the field of data processing technologies, and in particular, to a blood analysis method, apparatus, terminal device and computer readable storage medium based on a regression tree model.
背景技术Background technique
血液是人体的重要组成部分,随着科学技术的不断进步,对血液的分析也在朝着精细化和多样化发展。通常来说,在血液分析过程中,首先需要获取血液中各个成分的数值信息,具体过程为将待测血液放置于测试仪器,再由分析人员根据测试仪器显示的测试结果得到血液的数值信息。然后,计算机基于统计学理论、数学理论和预设的阈值,判断数值信息是否超过对应的阈值,并根据判断的结果得到血液的分析结果。Blood is an important part of the human body. With the continuous advancement of science and technology, the analysis of blood is also progressing towards refinement and diversification. Generally speaking, in the blood analysis process, first, it is necessary to obtain numerical information of each component in the blood. The specific process is to place the blood to be tested on the test instrument, and then the analyst obtains the numerical information of the blood according to the test result displayed by the test instrument. Then, based on the statistical theory, the mathematical theory, and the preset threshold, the computer judges whether the numerical information exceeds the corresponding threshold, and obtains the blood analysis result according to the judgment result.
为了防止错误判断,阈值的数值一般设置为超出正常值较大的范围。如果某种问题血液的问题体现在某几个数值信息出现偏高或偏低,但是各个成分的数值信息都未超出对应阈值,那么按照现有的分析方法对其进行分析得到的分析结果显然是错误的。综上,现有的血液分析方法只能单独对血液中各个成分的数值信息进行分析,无法对多个数值信息进行综合判断,即对血液进行分析的可靠性和准确性低。In order to prevent erroneous judgment, the value of the threshold is generally set to a range larger than the normal value. If the problem of blood in a certain problem is that some numerical information appears to be higher or lower, but the numerical information of each component does not exceed the corresponding threshold, then the analysis result obtained by analyzing according to the existing analytical method is obviously wrong. In summary, the existing blood analysis method can only analyze the numerical information of each component in the blood alone, and cannot comprehensively judge a plurality of numerical information, that is, the reliability and accuracy of analyzing the blood are low.
技术问题technical problem
有鉴于此,本申请实施例提供了基于回归树模型的血液分析方法、装置、终端设备及计算机可读存储介质,以解决现有技术中无法综合血液的多个成分进行分析,对血液进行分析的可靠性和准确性低的问题。In view of this, the embodiments of the present application provide a blood analysis method, device, terminal device, and computer readable storage medium based on a regression tree model, so as to solve the analysis of multiple components in the prior art that cannot integrate blood, and analyze blood. The problem of low reliability and accuracy.
技术解决方案Technical solution
本申请实施例的第一方面提供了一种基于回归树模型的血液分析方法,包括:A first aspect of the embodiments of the present application provides a blood analysis method based on a regression tree model, including:
获取多个血液信息样本,每个所述血液信息样本包括血液数值和症状特征值;Obtaining a plurality of blood information samples, each of the blood information samples including a blood value and a symptom characteristic value;
将所述多个血液信息样本与回归树模型进行拟合,并将拟合完成的所述回归树模型作为检测模型;Comparing the plurality of blood information samples with a regression tree model, and fitting the completed regression tree model as a detection model;
获取待测血液的血液数值,并将所述待测血液的血液数值输入至所述检测模型,得到检测值;Obtaining a blood value of the blood to be tested, and inputting the blood value of the blood to be tested into the detection model to obtain a detection value;
若所述检测值大于检测阈值,则输出第一告警提示。If the detected value is greater than the detection threshold, the first alarm prompt is output.
本申请实施例的第二方面提供了一种基于回归树模型的血液分析方法装置,可以包括用于实现上述基于回归树模型的血液分析方法的步骤的单元。A second aspect of an embodiment of the present application provides a blood analysis method apparatus based on a regression tree model, which may include means for implementing the steps of the above-described regression tree model based blood analysis method.
本申请实施例的第三方面提供了一种终端设备,包括存储器以及处理器,所述存储器中存储有可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现上述基于回归树模型的血液分析方法的步骤。A third aspect of the embodiments of the present application provides a terminal device, including a memory and a processor, where the computer stores computer readable instructions executable on the processor, the processor executing the computer The steps of the above-described regression analysis based blood analysis method are implemented when the instruction is read.
本申请实施例的第四方面提供了一种计算机可读存储介质,所述计算机可读存储介质存储有计算机可读指令,所述计算机可读指令被处理器执行时实现上述基于回归树模型的血液分析方法的步骤。A fourth aspect of the embodiments of the present application provides a computer readable storage medium storing computer readable instructions, the computer readable instructions being executed by a processor to implement the above-described regression tree model based The steps of the blood analysis method.
有益效果Beneficial effect
本申请实施例通过获取多个血液信息样本,每个血液信息样本包括血液数值和症状特征值,将多个血液信息样本与回归树模型进行拟合,并将拟合完毕的回归树模型作为检测模型,然后对待测血液进行分析,首先获取待测血液的血液数值,并将血液数值输入至检测模型,获取检测模型的输出结果作为检测值,最后若判断出检测值大于检测阈值,则输出第一告警提示,本申请实施例基于多个血液信息样本对回归树模型进行训练,实现了对待测血液中血液数值的综合分析,提升了血液分析的可靠性和准确性。In the embodiment of the present application, a plurality of blood information samples are obtained, each blood information sample includes a blood value and a symptom feature value, and a plurality of blood information samples are fitted to a regression tree model, and the fitted regression tree model is used as a detection. The model, then the blood to be analyzed, first obtain the blood value of the blood to be tested, and input the blood value to the detection model, obtain the output of the detection model as the detection value, and finally, if it is determined that the detection value is greater than the detection threshold, the output is An alarm prompts that the embodiment of the present application trains the regression tree model based on a plurality of blood information samples, realizes comprehensive analysis of blood values in the blood to be measured, and improves the reliability and accuracy of the blood analysis.
附图说明DRAWINGS
图1是本申请实施例一中基于回归树模型的血液分析方法的实现流程图;1 is a flowchart showing an implementation of a blood analysis method based on a regression tree model in the first embodiment of the present application;
图2是本申请实施例二中基于回归树模型的血液分析方法的实现流程图;2 is a flowchart showing an implementation of a blood analysis method based on a regression tree model in Embodiment 2 of the present application;
图3是本申请实施例三中基于回归树模型的血液分析方法的实现流程图;3 is a flowchart showing an implementation of a blood analysis method based on a regression tree model in Embodiment 3 of the present application;
图4是本申请实施例四中基于回归树模型的血液分析方法的实现流程图;4 is a flowchart showing an implementation of a blood analysis method based on a regression tree model in Embodiment 4 of the present application;
图5是本申请实施例五中基于回归树模型的血液分析方法的实现流程图;5 is a flowchart showing an implementation of a blood analysis method based on a regression tree model in Embodiment 5 of the present application;
图6是本申请实施例六中基于回归树模型的血液分析装置的结构框图;6 is a structural block diagram of a blood analysis apparatus based on a regression tree model in Embodiment 6 of the present application;
图7是本申请实施例七中终端设备的示意图。FIG. 7 is a schematic diagram of a terminal device in Embodiment 7 of the present application.
本发明的实施方式Embodiments of the invention
为了对本申请的技术特征、目的和效果有更加清楚的理解,现对照附图详细说明本申请的具体实施方式。In order to more clearly understand the technical features, objects and effects of the present application, the specific embodiments of the present application will be described in detail with reference to the accompanying drawings.
请参阅图1,图1是本申请实施例提供的一种基于回归树模型的血液分析方法的实现流程图。如图1所示,该血液分析方法包括以下步骤:Please refer to FIG. 1. FIG. 1 is a flowchart of implementing a blood analysis method based on a regression tree model according to an embodiment of the present application. As shown in Figure 1, the blood analysis method comprises the following steps:
S101:获取多个血液信息样本,每个所述血液信息样本包括血液数值和症状特征值。S101: Acquire a plurality of blood information samples, each of the blood information samples including a blood value and a symptom characteristic value.
在本申请实施例中,在对待测血液进行分析之前,首先获取多个血液信息样本,每个血液信息样本包括血液数值和症状特征值。其中,血液数值是血液信息样本所属已有血液中某个或某几个成分的相关数值,可由已有血液经测试仪器分析得到的测试结果确定,已有血液是指已经测试仪器分析并建立档案的血液。举例来说,血液数值可以为已有血液的红细胞含量,也可为已有血液的红细胞含量、白细胞含量和血红蛋白含量的集合。症状特征值为已有血液的受影响情况,具体针对某个症状进行设置,通过判断已有血液是否受到该症状的影响而确定症状特征值,一般来说,将受到该症状影响的已有血液的血液信息样本下的症状特征值设置为1,将未受到该症状影响的已有血液的血液信息样本下的症状特征值设置为0,以方便进行计算,本申请实施例的目的即是判断待测血液是否受到该症状的影响。为了提升对待测血液分析的准确度,按照预设的数量级获取血液信息样本,即获取的血液信息样本数量处于预设的数量级,数量级可根据准确度要求自由设置。比如数量级可以为一千个。In the embodiment of the present application, before analyzing the blood to be tested, a plurality of blood information samples are first acquired, and each blood information sample includes a blood value and a symptom characteristic value. The blood value is the relevant value of one or several components in the existing blood of the blood information sample, and can be determined by the test result obtained by the existing blood test instrument, and the existing blood refers to the analyzed instrument and the file is established. Blood. For example, the blood value can be the red blood cell content of the existing blood, or a collection of red blood cell content, white blood cell content, and hemoglobin content of the existing blood. The symptom characteristic value is the affected condition of the existing blood, and is specifically set for a certain symptom, and the symptom characteristic value is determined by judging whether the existing blood is affected by the symptom, and generally, the existing blood affected by the symptom is determined. The symptom characteristic value under the blood information sample is set to 1, and the symptom characteristic value under the blood information sample of the existing blood that is not affected by the symptom is set to 0 to facilitate calculation, and the purpose of the embodiment of the present application is to judge Whether the blood to be tested is affected by the symptoms. In order to improve the accuracy of the blood analysis to be measured, the blood information sample is obtained according to a preset order of magnitude, that is, the obtained blood information sample quantity is at a preset order of magnitude, and the order of magnitude can be freely set according to the accuracy requirement. For example, the order of magnitude can be one thousand.
优选地,从健康档案或医院数据库获取多个血液信息样本。由于健康档案或医院数据库中存放有不同人群的多个血液信息样本,血液信息样本的数量多,并且血液信息样本下的症状特征值准确度高,故可直接从健康档案或医院数据库中获取需要的多个血液信息样本。可选地,根据特定条件获取多个血液信息样本。在实际应用场景中,根据场景要求设置获取条件进行多个血液信息样本的获取,比如在需要对某个镇的人体的待测血液进行分析的前提下,设置特定条件,只从与该镇相关的健康档案或医院数据库获取多个血液信息样本;又比如需要对年龄在六十岁以上人体的待测血液进行分析,则设置对应的特定条件,从健康档案或医院数据库获取年龄在六十岁以上的人群对应的多个血液信息样本,还可设置特定条件以获取特定数量级的多个血液信息样本等,提升了对不同应用场景的适用性。Preferably, a plurality of blood information samples are obtained from a health file or a hospital database. Since a plurality of blood information samples of different populations are stored in the health file or the hospital database, the number of blood information samples is large, and the symptom characteristic values under the blood information samples are highly accurate, so the need can be directly obtained from the health file or the hospital database. Multiple blood information samples. Optionally, multiple blood information samples are acquired according to specific conditions. In the actual application scenario, the acquisition condition is set according to the scene requirement to acquire multiple blood information samples, for example, on the premise that the blood to be tested of the human body in a certain town needs to be analyzed, specific conditions are set, only relevant to the town. The health file or hospital database obtains multiple blood information samples; for example, if the blood to be tested is aged for people over 60 years old, the corresponding specific conditions are set, and the age is 60 years old from the health file or hospital database. The plurality of blood information samples corresponding to the above population may also set specific conditions to obtain a plurality of blood information samples of a certain order of magnitude, etc., thereby improving applicability to different application scenarios.
S102:将所述多个血液信息样本与回归树模型进行拟合,并将拟合完成的所述回归树模型作为检测模型。S102: Fit the plurality of blood information samples with a regression tree model, and use the regression tree model that is fitted to be a detection model.
由于血液信息样本下的血液数值可能包含已有血液中多个成分的多个相关数值,而多个成分的多个相关数值与已有血液是否受到症状影响有关。举例来说,血液信息样本的血液数值包含白细胞含量、血红蛋白含量和血小板含量,若该血液信息样本所属的已有血液受到慢阻肺症状的影响,则相较于未受到慢阻肺症状影响的正常血液的血液数值,该血液信息样本的血液数值中可能会出现白细胞含量增大,血红蛋白含量增大和血小板含量减小的情况。但是,在普遍情况下,无法获知血液数值内多个成分的多个相关数值受症状影响的变化规律,故在本申请实施例中,将症状特征值已确定的多个血液信息样本与回归树模型进行拟合,在拟合过程中不断训练回归树模型,最后将拟合完成的回归树模型输出为检测模型。其中,回归树模型应用分类与回归树(Classification and regression tree,CART)算法,CART算法将 血液数值中的多个相关数值作为多个特征,对每个相关数值建立“是”和“否”的两条分叉,通常左分支是取值为“是”的分支,右分支是取值为“否”的分支,具体划分方法依据多个相关数值对应的症状特征值。通过CART算法递归地二分每个相关数值,将多个相关数值构成的输入空间划分为有限的多个单元,并通过多个相关数值对应的症状特征值确定每个单元的单元取值,最终生成二叉树。将多个血液信息样本输入回归树模型,实质上是基于多个血液信息样本建立和修正二叉树,最终生成的二叉树即为检测模型,具体计算过程在后文进行阐述。Since the blood value under the blood information sample may contain a plurality of related values of the plurality of components in the existing blood, the plurality of related values of the plurality of components are related to whether the existing blood is affected by the symptoms. For example, the blood value of the blood information sample includes the white blood cell content, the hemoglobin content, and the platelet content. If the blood of the blood information sample belongs to the chronic obstructive pulmonary disease, it is affected by the chronic obstructive pulmonary disease. The blood value of normal blood, the blood value of the blood information sample may appear to increase white blood cell content, increase hemoglobin content and decrease platelet content. However, in a general case, it is not known that a plurality of related values of a plurality of components in the blood value are affected by the symptoms. Therefore, in the embodiment of the present application, a plurality of blood information samples having the symptom characteristic values have been determined and the regression tree. The model is fitted, and the regression tree model is continuously trained during the fitting process. Finally, the fitted regression tree model is output as the detection model. Among them, the regression tree model applies the classification and regression tree (CART) algorithm, and the CART algorithm takes multiple correlation values in blood values as multiple features, and establishes "yes" and "no" for each relevant value. Two forks, usually the left branch is a branch with a value of "yes", and the right branch is a branch with a value of "no". The specific division method is based on the symptom characteristic values corresponding to multiple related values. Recursively divide each relevant value by the CART algorithm, divide the input space composed of multiple related values into a limited number of units, and determine the unit value of each unit by the symptom feature values corresponding to the multiple related values, and finally generate Binary tree. The input of multiple blood information samples into the regression tree model essentially establishes and corrects the binary tree based on multiple blood information samples, and the resulting binary tree is the detection model. The specific calculation process is described later.
S103:获取待测血液的血液数值,并将所述待测血液的血液数值输入至所述检测模型,得到检测值。S103: Acquire a blood value of the blood to be tested, and input the blood value of the blood to be tested into the detection model to obtain a detection value.
由多个血液信息样本生成检测模型后,对待测血液进行分析,获取待测血液的血液数值,并将待测血液的血液数值输入至检测模型,获取检测模型的输出结果,即为检测值。值得一提的是,待测血液的血液数值对应的多个成分与血液信息样本的血液数值对应的多个成分相同,或者待测血液的血液数值对应的多个成分为血液信息样本的血液数值对应的多个成分的一部分,在上述两种情况下,将待测血液的血液数值输入至检测模型后,检测模型的输出结果才能作为有效的检测值。After the detection model is generated from the plurality of blood information samples, the blood to be measured is analyzed, the blood value of the blood to be tested is obtained, and the blood value of the blood to be tested is input to the detection model, and the output result of the detection model is obtained, which is the detection value. It is worth mentioning that the plurality of components corresponding to the blood value of the blood to be tested are the same as the plurality of components corresponding to the blood value of the blood information sample, or the blood component corresponding to the blood value of the blood to be tested is the blood value of the blood information sample. A part of the corresponding plurality of components, in the above two cases, after the blood value of the blood to be tested is input to the detection model, the output of the detection model can be regarded as an effective detection value.
S104:若所述检测值大于检测阈值,则输出第一告警提示。S104: If the detected value is greater than the detection threshold, output a first alarm prompt.
检测值由待测血液的血液数值计算得出,用于判断待测血液是否受症状影响。如果症状特征值为1代表对应的血液已受到症状影响,并且症状特征值为0代表对应的血液未受到症状影响,则检测值越接近1,则待测血液受到症状影响的可能性越大。为了提升判断的准确性,设置检测阈值,若检测值大于检测阈值,则确定待测血液受到症状影响,输出第一告警提示,方便用户进行查看;若检测值小于或等于检测阈值,则确定待测血液未受到症状影响,输出正常提示。检测阈值可依据多个血液信息样本及多个血液信息样本的多个症状特征值进行确定,也可将多个血液信息样本输入至检测模型,并根据检测模型根据多个血液信息样本输出的多个输出结果进行确定。The detection value is calculated from the blood value of the blood to be tested, and is used to judge whether the blood to be tested is affected by the symptoms. If the symptom characteristic value is 1 indicating that the corresponding blood has been affected by the symptoms, and the symptom characteristic value is 0 indicating that the corresponding blood is not affected by the symptoms, the closer the detection value is to 1, the more likely the blood to be tested is affected by the symptoms. In order to improve the accuracy of the judgment, the detection threshold is set. If the detection value is greater than the detection threshold, it is determined that the blood to be tested is affected by the symptom, and the first alarm prompt is output, which is convenient for the user to view; if the detection value is less than or equal to the detection threshold, it is determined to be The blood is not affected by the symptoms, and the output is normal. The detection threshold may be determined according to a plurality of blood information samples and a plurality of symptom characteristic values of the plurality of blood information samples, or a plurality of blood information samples may be input to the detection model, and the plurality of blood information samples are output according to the detection model. The output is determined.
通过图1所示实施例可知,在本申请实施例中,通过获取现有的多个血液信息样本,其中血液信息样本包括血液数值和症状特征值,并将多个血液信息样本与回归树模型进行拟合,将拟合完毕的回归树模型输出为检测模型,最后进行对待测血液的分析,获取待测血液的血液数值,并将待测血液的血液数值输入至检测模型,将检测模型的输出结果作为检测值,将检测值与检测阈值进行比对,若检测值大于检测阈值,则证明待测血液受到症状影响,输出第一告警提示,本申请实施例可以针对不同成分的血液数值进行待测血液的分析,提升了血液分析在不同场景的适用性,并且通过训练回归树模型,提升了血液分析的准确性。It can be seen from the embodiment shown in FIG. 1 that, in the embodiment of the present application, a plurality of existing blood information samples are obtained, wherein the blood information samples include blood values and symptom feature values, and the plurality of blood information samples and the regression tree model are obtained. The fitting is performed, and the fitted regression tree model is output as a detection model, and finally, the blood to be measured is analyzed, the blood value of the blood to be tested is obtained, and the blood value of the blood to be tested is input to the detection model, and the model is detected. The output result is used as a detection value, and the detection value is compared with the detection threshold. If the detection value is greater than the detection threshold, it is proved that the blood to be tested is affected by the symptom, and the first alarm prompt is output. The embodiment of the present application can perform blood values for different components. The analysis of the blood to be tested improves the applicability of the blood analysis in different scenarios, and improves the accuracy of the blood analysis by training the regression tree model.
请参阅图2,图2是本申请实施例二提供的一种基于回归树模型的血液分析方法的实现流程图。相对于图1对应的实施例,本实施例对S102进行细化后得到S201~S202,详述如下:Please refer to FIG. 2. FIG. 2 is a flowchart of an implementation of a blood analysis method based on a regression tree model according to Embodiment 2 of the present application. With respect to the embodiment corresponding to FIG. 1, this embodiment refines S102 to obtain S201-S202, which are as follows:
S201:将所述多个血液信息样本输入至所述回归树模型,以训练所述回归树模型,其中,将所述血液信息样本中的血液数值作为所述回归树模型的输入参数,将所述血液信息样本中的症状特征值作为所述回归树模型的标签参数。S201: input the plurality of blood information samples to the regression tree model to train the regression tree model, wherein a blood value in the blood information sample is used as an input parameter of the regression tree model, The symptom feature value in the blood information sample is used as the tag parameter of the regression tree model.
由于存在多个血液信息样本,故基于多个血液信息样本中的多个血液数值与多个症状特征值构建血液样本集,为(Blood value1,Symptom value1),(Blood value2,Symptom value2),(Blood value1,Symptom value1)……(Blood valuen,Symptom valuen),其中,Blood valuei指示第i个血液信息样本的血液数值,Symptom valuei指示第i个血液信息样本的症状特征值。构建完成后,将血液样本集输入至回归树模型,将血液数值作为回归树模型的输入参数,将症状特征值作为回归树模型的标签参数,与回归树模型进行拟合,在本申请实施例中,回归树模型的计算公式为: Since there are a plurality of blood information samples, a blood sample set is constructed based on a plurality of blood values and a plurality of symptom characteristic values in the plurality of blood information samples, (Blood value1 , Symptom value1 ), (Blood value2 , Symptom value2 ), ( Blood value1 , Symptom value1 )...(Blood valuen , Symptom valuen ), where Blood valuei indicates the blood value of the i-th blood information sample, and Symptom valuei indicates the symptom characteristic value of the i-th blood information sample. After the construction is completed, the blood sample set is input to the regression tree model, the blood value is used as an input parameter of the regression tree model, and the symptom feature value is used as a label parameter of the regression tree model, and is fitted to the regression tree model, in the embodiment of the present application. The calculation formula for the regression tree model is:
Figure PCTCN2018097562-appb-000001
Figure PCTCN2018097562-appb-000001
在上述公式中,
Figure PCTCN2018097562-appb-000002
表示对输入参数为Blood valuei的预测值,即是将Blood valuei作为输入参数输入至回归树模型后,回归树模型计算后的输出结果。f()指示一个存在于函数空间的函数,函数空间指的是从一个集合到另一个集合的给定种类的函数的集合,即f()函数最初处于未知状态,K则表示回归树模型中存在K个上述的f()函数。
In the above formula,
Figure PCTCN2018097562-appb-000002
Indicates the predicted value of the input parameter for the Blood valuei , that is, the output result after the regression tree model is input by inputting the Blood valuei as an input parameter to the regression tree model. f() indicates a function that exists in the function space. The function space refers to a set of functions of a given kind from one set to another, that is, the f() function is initially in an unknown state, and K is in the regression tree model. There are K of the above f() functions.
在确定回归树模型的计算公式后,基于血液样本集对回归树模型进行训练,以使最终的回归树模型中的K个f()函数最大限度地符合血液样本集中的数据。在本申请实施例中,采用依次学习方法对f()函数进行学习,以减小学习过程中的误差,举例来说,在输入参数为Blood valuei的基础上,进行t轮的预测值预测,并在进行第t轮的预测值预测时,保留第t-1轮的预测值预测结果,即依次训练回归树模型中的每一棵回归树(每一个f()函数),具体见下: After determining the calculation formula of the regression tree model, the regression tree model is trained based on the blood sample set, so that the K f() functions in the final regression tree model maximize the data in the blood sample set. In the embodiment of the present application, the f() function is learned by a sequential learning method to reduce errors in the learning process. For example, on the basis of the input parameter is a Blood valuei , the predicted value of the t-round is predicted. And when predicting the prediction value of the t-th round, the prediction result of the prediction value of the t-1th round is retained, that is, each regression tree (each f() function) in the regression tree model is sequentially trained, as follows:
Figure PCTCN2018097562-appb-000003
Figure PCTCN2018097562-appb-000003
Figure PCTCN2018097562-appb-000004
Figure PCTCN2018097562-appb-000004
上述公式中的
Figure PCTCN2018097562-appb-000005
是在给出输入参数为Blood valuei的基础上,进行第t轮的预测后的预测值。为了判断在依次学习过程中所需求的f()函数,即确定符合血液样本集的f()函数,故构建回归树模型的目标函数,具体见下:
In the above formula
Figure PCTCN2018097562-appb-000005
The prediction value after the prediction of the t-th round is given on the basis that the input parameter is the Blood valuei . In order to judge the f() function required in the sequential learning process, that is, to determine the f() function that conforms to the blood sample set, the objective function of the regression tree model is constructed, as follows:
Figure PCTCN2018097562-appb-000006
Figure PCTCN2018097562-appb-000006
在上述公式中,Symptom valuei是血液样本集中与输入参数Blood valuei对应的标签参数,即是血液信息样本中与血液数值对应的症状特征值。上述公式中的
Figure PCTCN2018097562-appb-000007
为正则项,D为常数项,其中,正则项控制目标函数Obj (t)的训练程度,防止血液样本集和回归树模型过拟合,在后文对正则项进行具体说明;常数项为一个常量,在本申请实施例中设置常数项主要为了控制Obj (t)的数值范围。另外,
Figure PCTCN2018097562-appb-000008
为误差函数,对目标函数Obj (t)进行优化,即是查找到合适的f()函数使
Figure PCTCN2018097562-appb-000009
的值尽量减小。
In the above formula, Symptom valuei concentration is a blood sample Blood valuei input parameter corresponding to the tag parameter, i.e. the value of the blood sample and the information value corresponding to the symptoms characteristic of the blood. In the above formula
Figure PCTCN2018097562-appb-000007
For the regular term, D is a constant term, wherein the regular term controls the training degree of the objective function Obj (t) to prevent over-fitting of the blood sample set and the regression tree model, and the regular term is specifically described later; the constant term is one Constant, the constant term is set in the embodiment of the present application mainly for controlling the numerical range of Obj (t) . In addition,
Figure PCTCN2018097562-appb-000008
For the error function, the objective function Obj (t) is optimized, that is, the appropriate f() function is found.
Figure PCTCN2018097562-appb-000009
The value is minimized.
在本申请实施例中,为了方便进行对目标函数的优化,对
Figure PCTCN2018097562-appb-000010
进行展开,并定义:
In the embodiment of the present application, in order to facilitate optimization of the objective function,
Figure PCTCN2018097562-appb-000010
Expand and define:
Figure PCTCN2018097562-appb-000011
Figure PCTCN2018097562-appb-000011
展开后的目标函数为:The expanded target function is:
Figure PCTCN2018097562-appb-000012
Figure PCTCN2018097562-appb-000012
提取出展开后的目标函数中所有的常数项,可生成训练函数,具体如下:Extract all the constant items in the expanded target function to generate a training function, as follows:
Figure PCTCN2018097562-appb-000013
Figure PCTCN2018097562-appb-000013
在训练函数中,训练函数得到的输出值依赖于g i和h i的值,提升了训练的简便性,故在本申请实施例中,可通过最终生成的训练函数训练回归树模型。 In the training function, the output value obtained by the training function depends on the values of g i and h i , which improves the simplicity of the training. Therefore, in the embodiment of the present application, the regression tree model can be trained by the finally generated training function.
可选地,在定义f()函数为
Figure PCTCN2018097562-appb-000014
ω∈R K,q:R d→{1,2,…,K}的基础上,定义正则项
Figure PCTCN2018097562-appb-000015
将f t(Blood valuei)看作一个回归树,f t(Blood valuei)的值即为该回归树对血液数值Blood valuei的预测值。设置该回归树存在K个叶子节点,上述的K个叶子节点的值组成K维向量ω。f()函数中的q(Blood valuei)用于进行映射,具体将某个血液数值Blood valuei映射为1到K的某个值,也即是将Blood valuei分至该回归树的某个节点。在前述基础上,定义正则项
Figure PCTCN2018097562-appb-000016
其中,H与目标函数中的D相同,为常数项;γ和λ为训练系数,可由用户针对实际的应用场景进行自定义,以调整生成的回归树的结构(检测模型的规模),γ和λ任一项增大都会导致回归树的结构精简。通过定义的正则项,可显著提升回归树模型的训练效果。
Optionally, after defining the f() function as
Figure PCTCN2018097562-appb-000014
ω∈R K , q:R d →{1,2,...,K}, based on the definition of the regular term
Figure PCTCN2018097562-appb-000015
The f t (Blood valuei) as a regression tree, the value of f t (Blood valuei) of the regression prediction value is the tree of blood Blood valuei value. There are K leaf nodes in the regression tree, and the values of the above K leaf nodes constitute a K-dimensional vector ω. The q(Blood valuei ) in the f() function is used to map, specifically mapping a blood value Blood valuei to a value from 1 to K, that is, dividing the Blood valuei into a node of the regression tree. On the basis of the foregoing, define the regular term
Figure PCTCN2018097562-appb-000016
Where H is the same as D in the objective function and is a constant term; γ and λ are training coefficients, which can be customized by the user for the actual application scenario to adjust the structure of the generated regression tree (the size of the detection model), γ and Any increase in λ will result in a streamlined structure of the regression tree. Through the defined regular items, the training effect of the regression tree model can be significantly improved.
S202:将训练完成的所述回归树模型输出为所述检测模型。S202: Output the trained regression tree model as the detection model.
当血液样本集中所有的血液数值和症状特征值都输入回归树模型,并且回归树模型训练完成后,将训练完成的回归树模型作为检测模型进行输出。当需要对待测血液进行分析时,将待测血液的血液数值Blood valuex输入检测模型,即可通过检测模型中优化后的计算公式
Figure PCTCN2018097562-appb-000017
经过计算得到检测值
Figure PCTCN2018097562-appb-000018
When all the blood values and symptom characteristic values in the blood sample set are input into the regression tree model, and the regression tree model training is completed, the trained regression tree model is output as the detection model. When the test needs to be taken for analysis of blood, the blood test values Blood valuex blood detection model input, the calculation formula can be optimized by detecting a Model
Figure PCTCN2018097562-appb-000017
After calculation, the detected value is obtained.
Figure PCTCN2018097562-appb-000018
通过图2所示实施例可知,在本申请实施例中,将现有的多个血液信息样本输入至回归树模型,其中,将血液信息样本中的血液数值作为回归树模型的输入参数,将血液信息样本中的症状特征值作为回归树模型的标签参数,对回归树模型进行训练,并获取训练完成的回归树模型作为检测模型,本申请实施例体现了回归树模型的具体训练过程,并通过设置目标函数提升了训练精度和训练效果。As shown in the embodiment shown in FIG. 2, in the embodiment of the present application, the existing plurality of blood information samples are input to a regression tree model, wherein the blood value in the blood information sample is used as an input parameter of the regression tree model, The symptom feature value in the blood information sample is used as the tag parameter of the regression tree model, the regression tree model is trained, and the trained regression tree model is obtained as the detection model. The embodiment of the present application embodies the specific training process of the regression tree model, and Training accuracy and training effects are improved by setting the objective function.
请参阅图3,图3是本申请实施例三提供的一种基于回归树模型的血液分析方法的实现 流程图。相对于图1对应的实施例,本实施例对S104之前的过程进行细化后得到S301~S303,详述如下:Referring to FIG. 3, FIG. 3 is a flowchart of an implementation of a blood analysis method based on a regression tree model according to Embodiment 3 of the present application. With respect to the embodiment corresponding to FIG. 1, the embodiment refines the process before S104 to obtain S301-S303, which are as follows:
S301:将所述多个血液信息样本的血液数值输入至所述检测模型,并获取所述检测模型输出的多个输出数值。S301: Input blood values of the plurality of blood information samples to the detection model, and acquire a plurality of output values output by the detection model.
在检测模型生成后,可将待测血液的血液数值输入至检测模型得到检测值,并根据检测值判断待测血液是否已受到症状影响。为了方便进行上述的判断,在本申请实施例中,在获取到检测模型后,将多个血液信息样本的血液数值作为输入参数输入至检测模型,并获取检测模型输出的与多个血液信息样本对应的多个输出数值。After the detection model is generated, the blood value of the blood to be tested can be input to the detection model to obtain a detection value, and according to the detection value, it is judged whether the blood to be tested has been affected by the symptom. In order to facilitate the above determination, in the embodiment of the present application, after the detection model is acquired, the blood value of the plurality of blood information samples is input as an input parameter to the detection model, and the blood sample of the detection model and the plurality of blood information samples are acquired. Corresponding multiple output values.
S302:对所述多个输出数值进行排序,生成数值序列。S302: Sort the plurality of output values to generate a sequence of values.
检测模型基于多个血液信息样本中的血液数值和症状特征值生成,但在多个血液信息样本中的血液数值重新输入检测模型时,不执行训练模型的操作,故经检测模型生成的输出数值与原有的症状特征值存在差异。获取到多个输出数值后,对多个输出数值按照大小进行排序,生成数值序列,数值序列的列首即为数值最大的输出数值。值得一提的是,若多个输出数值中的第一输出数值与第二输出数值相同,则按照第一输出数值和第二输出数值对应的血液信息样本的输入次序生成数值序列。具体地,设置次序写入机制,当某一个输出数值需要写入数值序列时,判断数值序列中是否存在与该输出数值相同的已有输出数值,若不存在与该输出数值相同的已有输出数值,则按照该输出数值的数值大小查找出数值序列中最小的大于该输出数值的已有输出数值,并在该已有输出数值后的位置写入该输出数值;若存在与该输出数值相同的已有输出数值,则在该已有输出数值后的位置写入该输出数值,若存在多个与该输出数值相同的已有输出数值,则查找多个已有输出数值末尾的已有输出数值,并在该已有输出数值后的位置写入该输出数值。The detection model is generated based on the blood value and the symptom feature value in the plurality of blood information samples, but when the blood value in the plurality of blood information samples is re-entered into the detection model, the operation of the training model is not performed, so the output value generated by the detection model is There is a difference from the original symptom characteristic value. After obtaining multiple output values, multiple output values are sorted according to size to generate a sequence of values, and the first column of the sequence of values is the output value with the largest value. It is worth mentioning that if the first output value of the plurality of output values is the same as the second output value, the sequence of values is generated according to the input order of the blood information samples corresponding to the first output value and the second output value. Specifically, the order writing mechanism is set. When an output value needs to be written into the numerical sequence, it is determined whether there is an existing output value in the numerical sequence that is the same as the output value, if there is no existing output that is the same as the output value. The value is used to find the smallest output value in the numerical sequence that is greater than the output value according to the value of the output value, and write the output value at the position after the existing output value; if there is the same value as the output value The existing output value is written to the output value after the existing output value. If there are multiple existing output values that are the same as the output value, the existing output at the end of the existing output value is searched. The value is written to the output value after the existing output value.
S303:将所述数值序列中位于预设位置的输出数值作为所述检测阈值。S303: Use an output value of the numerical sequence located at a preset position as the detection threshold.
数值序列生成后,将数值序列中位于预设位置的输出数值作为检测阈值。举例来说,数值序列中已写入300个输出数值,而预设位置为第50位,则提取数值序列位于第50位的输出数值作为检测阈值。可选地,根据预设比例确定预设位置。预设比例可根据大数据分析方法进行确定,例如可统计全国范围的多个全国血液样本,每个全国血液样本包括血液数值和症状特征值,确定多个全国血液样本中症状特征值为1的全国血液样本的比例,并将该比例作为预设比例。比如比例为10%,数值序列中已有300个输出数值,则预设位置为第30位,提取数值序列位于第30位的输出数值作为检测阈值,通过大数据分析方法使得设置的检测阈值的普遍适用性较高。当然,预设比例或预设位置也可人为预先设定。After the numerical sequence is generated, the output value at the preset position in the numerical sequence is used as the detection threshold. For example, if 300 output values have been written in the numerical sequence and the preset position is the 50th position, the output value of the 50th digit of the numerical sequence is extracted as the detection threshold. Optionally, the preset position is determined according to a preset ratio. The preset ratio can be determined according to the big data analysis method. For example, a plurality of national blood samples can be counted nationwide, and each national blood sample includes blood value and symptom characteristic value, and the symptom characteristic value of 1 is determined in a plurality of national blood samples. The proportion of blood samples nationwide, and the ratio is used as a preset ratio. For example, if the ratio is 10% and there are 300 output values in the numerical sequence, the preset position is the 30th position, and the output value of the 30th digit of the extracted numerical sequence is used as the detection threshold, and the detection threshold is set by the big data analysis method. Generally universal applicability. Of course, the preset ratio or preset position can also be pre-set by humans.
通过图3所示实施例可知,在本申请实施例中,通过将多个血液信息样本的血液数值作 为输入参数输入至检测模型,并获取检测模型输出的多个输出数值,对多个输出数值按照大小进行排序,生成数值序列,提取数值序列中处于预设位置的输出数值,将该输出数值作为检测阈值,根据检测模型进行检测阈值的设置,提升了检测阈值的适用程度。As shown in the embodiment shown in FIG. 3, in the embodiment of the present application, the blood value of the plurality of blood information samples is input as an input parameter to the detection model, and the plurality of output values output by the detection model are acquired, and the plurality of output values are obtained. Sorting according to the size, generating a numerical sequence, extracting the output value at the preset position in the numerical sequence, using the output value as the detection threshold, and setting the detection threshold according to the detection model, thereby improving the applicability of the detection threshold.
请参阅图4,图4是本申请实施例四提供的一种基于回归树模型的血液分析方法的实现流程图。相对于图1对应的实施例,本实施例在症状特征值为第一特征值或第二特征值的基础上,对S104之前的过程进行细化后得到S401~S402,详述如下:Please refer to FIG. 4. FIG. 4 is a flowchart of an implementation of a blood analysis method based on a regression tree model according to Embodiment 4 of the present application. With respect to the embodiment corresponding to FIG. 1, in the embodiment, after the symptom feature value is the first feature value or the second feature value, the process before S104 is refined to obtain S401 to S402, which are as follows:
S401:根据所述多个血液信息样本确定所述症状特征值为所述第一特征值的第一样本人数以及所述症状特征值为所述第二特征值的第二样本人数。S401: Determine, according to the plurality of blood information samples, the first sample number of the symptom characteristic value as the first feature value and the second sample number of the symptom feature value as the second feature value.
通常来说,血液信息样本中的症状特征值有两种取值,即1或0,但应获知的是,其他取值的症状特征值也可应用在本申请实施例中。为了便于说明,将取1的症状特征值作为第一特征值,将取0的症状特征值作为第二特征值,且第一特征值代表血液信息样本所属的已有血液已受到症状影响,第二特征值代表血液信息样本所属的已有血液未受到症状影响。在本申请实施例中,将多个血液信息样本中症状特征值为第一特征值的血液信息样本的个数作为第一样本个数,将多个血液信息样本中症状特征值为第二特征值的血液信息样本的个数作为第二样本个数。In general, the symptom characteristic values in the blood information sample have two values, namely 1 or 0, but it should be understood that other characteristic symptom characteristic values can also be applied in the embodiments of the present application. For convenience of explanation, the symptom characteristic value of 1 is taken as the first characteristic value, and the symptom characteristic value of 0 is taken as the second characteristic value, and the first characteristic value represents that the existing blood to which the blood information sample belongs has been affected by the symptom, The second eigenvalue represents that the existing blood to which the blood information sample belongs is not affected by the symptoms. In the embodiment of the present application, the number of blood information samples in which the symptom characteristic value is the first characteristic value in the plurality of blood information samples is taken as the first sample number, and the symptom characteristic value in the plurality of blood information samples is the second value. The number of blood information samples of the feature value is taken as the number of second samples.
S402:根据所述第一样本人数和所述第二样本人数确定第一占比比例和第二占比比例,并根据所述第一占比比例、所述第二占比比例、所述第一特征值以及所述第二特征值确定所述检测阈值。S402: Determine a first proportion ratio and a second proportion ratio according to the first sample number of people and the second sample number, and according to the first proportion ratio, the second proportion ratio, the The first feature value and the second feature value determine the detection threshold.
第一样本个数和第二样本个数确定后,确定第一样本个数在多个血液信息样本的总个数中占有的第一占比比例,以及第二样本个数在多个血液信息样本的总个数中占有的第二占比比例,并根据第一占比比例、第二占比比例、第一特征值以及第二特征值确定检测阈值。比如说,若第一特征值大于第二特征值,则确定检测阈值的一种方法是,获取第一特征值与第二特征值之间的差值,将第一占比比例与该差值进行乘法运算得到边界值,并将第一特征值减去该边界值得到的结果作为检测阈值。比如第一特征值为2,第二特征值为1,第一占比比例为30%,则检测阈值为2-(2-1)×30%=1.7。当然,以上仅为确定检测阈值的方法的个例,并不构成对本申请实施例的限定,根据实际应用场景的不同,还可应用更多的确定检测阈值的方法。After the first sample number and the second sample number are determined, determining a first proportion of the first sample number in a total number of the plurality of blood information samples, and the second sample number is plural a second proportion of the total number of blood information samples, and the detection threshold is determined according to the first ratio, the second ratio, the first characteristic value, and the second characteristic value. For example, if the first eigenvalue is greater than the second eigenvalue, one method of determining the detection threshold is to obtain a difference between the first eigenvalue and the second eigenvalue, and the first proportion ratio and the difference A multiplication operation is performed to obtain a boundary value, and a result obtained by subtracting the boundary value from the first feature value is used as a detection threshold. For example, if the first feature value is 2, the second feature value is 1, and the first ratio is 30%, the detection threshold is 2-(2-1)×30%=1.7. Certainly, the above is only a specific example of the method for determining the detection threshold, and does not constitute a limitation on the embodiment of the present application. According to different actual application scenarios, more methods for determining the detection threshold may be applied.
通过图4所示实施例可知,在本申请实施例中,通过在症状特征值为第一特征值或第二特征值的情况下,确定多个血液信息样本中症状特征值为第一特征值的第一样本个数以及症状特征值为第二特征值的第二样本个数,并根据第一样本人数和第二样本人数确定第一占比比例和第二占比比例,最后根据第一占比比例、第二占比比例、第一特征值以及第二特征值 得到检测阈值,由于多个血液信息样本可能在某个地域集中采集,故本申请实施例提升了检测阈值针对某个地域的精确性。It can be seen from the embodiment shown in FIG. 4 that, in the embodiment of the present application, the symptom characteristic value in the plurality of blood information samples is determined as the first characteristic value by using the symptom characteristic value as the first characteristic value or the second characteristic value. The first sample number and the symptom characteristic value are the second sample number of the second feature value, and the first proportion ratio and the second proportion ratio are determined according to the first sample number of people and the second sample number, and finally according to The first proportion ratio, the second proportion ratio, the first eigenvalue, and the second eigenvalue are obtained by the detection threshold. Since the plurality of blood information samples may be collected in a certain area, the embodiment of the present application improves the detection threshold for a certain Geographical accuracy.
请参阅图5,图5是本申请实施例五提供的一种基于回归树模型的血液分析方法的实现流程图。相对于图1对应的实施例,本实施例还包括如下步骤:Please refer to FIG. 5. FIG. 5 is a flowchart of implementing a blood analysis method based on a regression tree model according to Embodiment 5 of the present application. The embodiment further includes the following steps with respect to the embodiment corresponding to FIG. 1 :
S501:获取预设的多个血液阈值模板,所述血液阈值模板由告警血液的血液数值确定。S501: Acquire a preset plurality of blood threshold templates, wherein the blood threshold template is determined by a blood value of the alarm blood.
由于不同症状造成的对血液数值的影响不同,故预先获取多个症状对应的多种血液阈值模板,血液阈值模板与已受某个症状影响的告警血液的血液数值相同,其中,血液阈值模板可能包含告警血液中某个或某几个血液成分的相关数值,相关数值的种类可依据实际应用场景进行选择。可选地,获取已受某症状影响的多个告警血液,并对多个告警血液的多个血液数值进行平均数求值,将生成的平均数值作为血液阈值模板,可降低血液阈值模板生成的意外因素影响,提升血液阈值模板的准确性。此外,还可获取受症状影响达到预设时间的多个告警血液,比如获取受症状影响已达六个月的多个告警血液,并通过同样的平均数求值方法由多个告警血液的多个血液数值生成血液阈值模板,进一步提升基于血液阈值模板进行判断的在时间维度上的准确性。Because different symptoms have different effects on blood values, multiple blood threshold templates corresponding to multiple symptoms are acquired in advance, and the blood threshold template is the same as the blood value of the alarm blood that has been affected by a certain symptom, wherein the blood threshold template may Contains the relevant values of one or several blood components in the alarm blood. The types of related values can be selected according to the actual application scenario. Optionally, obtaining a plurality of alarm bloods that have been affected by a symptom, and performing average evaluation of the plurality of blood values of the plurality of alarm blood, and using the generated average value as a blood threshold template, which can reduce the generation of the blood threshold template. The effect of unexpected factors increases the accuracy of the blood threshold template. In addition, it is also possible to obtain a plurality of alarm bloods that are affected by the symptoms for a preset time, such as obtaining a plurality of alarm bloods that have been affected by the symptoms for six months, and by the same average number evaluation method, multiple alarm bloods are provided. The blood values generate a blood threshold template to further improve the accuracy in the time dimension based on the blood threshold template.
S502:将所述待测血液的血液数值与所述多个血液阈值模板进行一一比对。S502: Perform a one-to-one comparison of the blood value of the blood to be tested and the plurality of blood threshold templates.
由于多个血液阈值模板针对不同症状,并且血液阈值模板可能包含多个相关数值,故针对症状对血液阈值模板的多个相关数值设置比对方向。以症状为慢阻肺进行举例,由于受慢阻肺影响的血液会出现白细胞含量增大,血红蛋白含量增大和血小板含量减小,故在与慢阻肺对应的血液阈值模板包含白细胞含量、血红蛋白含量和血小板含量三个相关数值时,对白细胞含量设置比对方向为向上比对,对血红蛋白含量同样设置比对方向为向上比对,对血小板含量设置比对方向为向下比对。向上比对是指若待测血液的血液数值中的白细胞含量超过或等于血液阈值模板中的白细胞含量,则认定比对成功,否则认定比对失败;向下比对是指若待测血液的血液数值中的血小板含量小于血液阈值模板中的白细胞含量,则认定比对成功,否则认定比对失败。在本申请实施例中,可针对不同症状创建设置机制,并通过设置机制自动完成多个血液阈值模板中比对方向的设置。Since multiple blood threshold templates are for different symptoms, and the blood threshold template may contain multiple related values, the alignment direction is set for multiple related values of the blood threshold template for the symptoms. Taking the symptoms of chronic obstructive pulmonary disease as an example, blood levels affected by chronic obstructive pulmonary disease increase, hemoglobin content increases and platelet content decreases, so the blood threshold template corresponding to chronic obstructive pulmonary disease contains leukocyte content and hemoglobin content. When the three values are related to the platelet content, the white blood cell content is set to the upward alignment, and the hemoglobin content is also set to the opposite direction, and the platelet content is set to the opposite direction. The upward alignment means that if the white blood cell content in the blood value of the blood to be tested exceeds or equals the white blood cell content in the blood threshold template, the comparison is determined to be successful, otherwise the comparison fails; the downward comparison refers to the blood to be tested. If the platelet content in the blood value is less than the white blood cell content in the blood threshold template, the comparison is deemed successful, otherwise the comparison is deemed to have failed. In the embodiment of the present application, a setting mechanism may be created for different symptoms, and the setting of the matching direction in the plurality of blood threshold templates is automatically completed by the setting mechanism.
S503:若所述待测血液的血液数值与所述多个血液阈值模板的某一个比对成功,则输出第二告警提示。S503: If the blood value of the blood to be tested is successfully aligned with one of the plurality of blood threshold templates, outputting a second alarm prompt.
多个血液阈值模板的比对方向设置完成后,将待测血液的血液数值与多个血液阈值模板进行比对,若待测血液的血液数值与某一个血液阈值模板比对成功,则输出第二告警提示,其中,第二告警提示可包括与该血液阈值模板对应的症状,方便用户快速查看;若待测血液的血液数值与多个血液阈值模板都比对失败,则输出正常提示。After the setting of the alignment direction of the plurality of blood threshold templates is completed, the blood value of the blood to be tested is compared with a plurality of blood threshold templates, and if the blood value of the blood to be tested is successfully compared with a blood threshold template, the output is The second alarm prompt, wherein the second alarm prompt may include a symptom corresponding to the blood threshold template, which is convenient for the user to quickly view; if the blood blood value of the blood to be tested and the plurality of blood threshold templates fail to match, the normal prompt is output.
通过图5所示实施例可知,在本申请实施例中,通过获取预设的针对多个症状的多个血液阈值模板,血液阈值模板取自受症状影响的告警血液的血液数值,将待测血液的血液数值与多个血液阈值模板进行一一比对,若待测血液的血液数值与多个血液阈值模板中的某一个比对成功,则输出第二告警提示,进行告警,通过设置多个血液阈值模板实现了对待测血液的多维度分析,提升了待测血液分析的可靠性。According to the embodiment shown in FIG. 5, in the embodiment of the present application, by acquiring a plurality of preset blood threshold templates for a plurality of symptoms, the blood threshold template is taken from the blood value of the alarm blood affected by the symptom, and is to be tested. The blood blood value is compared with the plurality of blood threshold templates one by one. If the blood blood value of the blood to be tested is successfully compared with one of the plurality of blood threshold templates, the second alarm prompt is output, and the alarm is generated, and the alarm is set. A blood threshold template enables multi-dimensional analysis of the blood to be measured and improves the reliability of the blood analysis to be tested.
对应于上文实施例所述的一种基于回归树模型的血液分析方法,图6示出了本申请实施例提供的一种基于回归树模型的血液分析装置的一个结构框图,参照图6,该装置包括:Corresponding to a regression tree model-based blood analysis method described in the above embodiments, FIG. 6 is a structural block diagram of a regression tree model-based blood analysis apparatus provided by an embodiment of the present application. Referring to FIG. 6, The device includes:
第一获取单元61,用于获取多个血液信息样本,每个所述血液信息样本包括血液数值和症状特征值;a first obtaining unit 61, configured to acquire a plurality of blood information samples, each of the blood information samples including a blood value and a symptom characteristic value;
拟合单元62,用于将所述多个血液信息样本与回归树模型进行拟合,并将拟合完成的所述回归树模型作为检测模型;a fitting unit 62, configured to fit the plurality of blood information samples with a regression tree model, and use the regression tree model that is completed as a detection model;
第二获取单元63,用于获取待测血液的血液数值,并将所述待测血液的血液数值输入至所述检测模型,得到检测值;a second obtaining unit 63, configured to acquire a blood value of the blood to be tested, and input the blood value of the blood to be tested into the detection model to obtain a detection value;
输出单元64,用于若所述检测值大于检测阈值,则输出第一告警提示。The output unit 64 is configured to output a first alarm prompt if the detected value is greater than the detection threshold.
可选地,所述拟合单元62包括:Optionally, the fitting unit 62 includes:
输入单元,用于将所述多个血液信息样本输入至所述回归树模型,以训练所述回归树模型,其中,将所述血液信息样本中的血液数值作为所述回归树模型的输入向量,将所述血液信息样本中的症状特征值作为所述回归树模型的标签向量;An input unit, configured to input the plurality of blood information samples to the regression tree model to train the regression tree model, wherein a blood value in the blood information sample is used as an input vector of the regression tree model And using the symptom feature value in the blood information sample as a label vector of the regression tree model;
模型输出单元,用于将训练完成的所述回归树模型输出为所述检测模型。And a model output unit, configured to output the trained regression tree model as the detection model.
可选地,所述输出单元64还包括:Optionally, the output unit 64 further includes:
样本输入单元,用于将所述多个血液信息样本的血液数值输入至所述检测模型,并获取所述检测模型输出的多个输出数值;a sample input unit, configured to input blood values of the plurality of blood information samples to the detection model, and acquire a plurality of output values output by the detection model;
排序单元,用于对所述多个输出数值进行排序,生成数值序列;a sorting unit, configured to sort the plurality of output values to generate a sequence of values;
阈值输出单元,用于将所述数值序列中位于预设位置的输出数值作为所述检测阈值。And a threshold output unit configured to use an output value of the numerical sequence at a preset position as the detection threshold.
可选地,所述症状特征值为第一特征值或第二特征值,所述输出单元64还包括:Optionally, the symptom characteristic value is a first feature value or a second feature value, and the output unit 64 further includes:
第一确定单元,用于根据所述多个血液信息样本确定所述症状特征值为所述第一特征值的第一样本个数以及所述症状特征值为所述第二特征值的第二样本个数;a first determining unit, configured to determine, according to the plurality of blood information samples, that the symptom feature value is the first sample number of the first feature value and the symptom feature value is the second feature value Two sample numbers;
第二确定单元,用于根据所述第一样本人数和所述第二样本人数确定第一占比比例和第二占比比例,并根据所述第一占比比例、所述第二占比比例、所述第一特征值以及所述第二特征值确定所述检测阈值。a second determining unit, configured to determine, according to the first sample number of people and the second sample number of people, a first proportion ratio and a second proportion ratio, and according to the first proportion ratio, the second proportion The ratio, the first eigenvalue, and the second eigenvalue determine the detection threshold.
可选地,所述血液分析装置还包括:Optionally, the blood analysis device further includes:
模板获取单元,用于获取预设的多个血液阈值模板,所述血液阈值模板由告警血液的血液数值确定;a template acquiring unit, configured to acquire a preset plurality of blood threshold templates, wherein the blood threshold template is determined by a blood value of the alarm blood;
比对单元,用于将所述待测血液的血液数值与所述多个血液阈值模板进行一一比对;An aligning unit, configured to compare the blood value of the blood to be tested and the plurality of blood threshold templates one by one;
告警输出单元,用于若所述待测血液的血液数值与所述多个血液阈值模板的某一个比对成功,则输出第二告警提示。The alarm output unit is configured to output a second alarm prompt if the blood value of the blood to be tested is successfully matched with one of the plurality of blood threshold templates.
图7是本申请实施例提供的终端设备的示意图。如图7所示,该实施例的终端设备7包括:处理器70以及存储器71,所述存储器71中存储有可在所述处理器70上运行的计算机可读指令72,例如基于回归树模型的血液分析程序。所述处理器70执行所述计算机可读指令72时实现上述各个基于回归树模型的血液分析方法实施例中的步骤,例如图1所示的步骤S101至S104。或者,所述处理器70执行所述计算机可读指令72时实现上述装置实施例中各单元的功能,例如图6所示单元61至64的功能。FIG. 7 is a schematic diagram of a terminal device according to an embodiment of the present application. As shown in FIG. 7, the terminal device 7 of this embodiment includes a processor 70 and a memory 71 in which computer readable instructions 72 executable on the processor 70 are stored, for example, based on a regression tree model. Blood analysis program. The processor 70 executes the computer readable instructions 72 to implement the steps in the various embodiments of the regression tree model based blood analysis method described above, such as steps S101 through S104 shown in FIG. Alternatively, the processor 70, when executing the computer readable instructions 72, implements the functions of the various units in the apparatus embodiments described above, such as the functions of the units 61 through 64 shown in FIG.
示例性的,所述计算机可读指令72可以被分割成一个或多个模块/单元,所述一个或者多个模块/单元被存储在所述存储器71中,并由所述处理器70执行,以完成本申请。所述一个或多个模块/单元可以是能够完成特定功能的一系列计算机可读指令段,该指令段用于描述所述计算机可读指令72在所述终端设备7中的执行过程。例如,所述计算机可读指令72可以被分割成第一获取单元、拟合单元、第二获取单元及输出单元,各单元具体功能如上所述。Illustratively, the computer readable instructions 72 may be partitioned into one or more modules/units that are stored in the memory 71 and executed by the processor 70, To complete this application. The one or more modules/units may be a series of computer readable instruction segments capable of performing a particular function for describing the execution of the computer readable instructions 72 in the terminal device 7. For example, the computer readable instructions 72 may be segmented into a first acquisition unit, a fitting unit, a second acquisition unit, and an output unit, each unit having a specific function as described above.
所述终端设备可包括,但不仅限于,处理器70、存储器71。本领域技术人员可以理解,图7仅仅是终端设备7的示例,并不构成对终端设备7的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件,例如所述终端设备还可以包括输入输出设备、网络接入设备、总线等。The terminal device may include, but is not limited to, a processor 70 and a memory 71. It will be understood by those skilled in the art that FIG. 7 is only an example of the terminal device 7, and does not constitute a limitation of the terminal device 7, and may include more or less components than those illustrated, or combine some components or different components. For example, the terminal device may further include an input/output device, a network access device, a bus, and the like.
所称处理器70可以是中央处理单元(Central Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。The processor 70 may be a central processing unit (CPU), or may be other general-purpose processors, a digital signal processor (DSP), an application specific integrated circuit (ASIC), Field-Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic device, discrete hardware components, etc. The general purpose processor may be a microprocessor or the processor or any conventional processor or the like.
所述存储器71可以是所述终端设备7的内部存储单元,例如终端设备7的硬盘或内存。所述存储器71也可以是所述终端设备7的外部存储设备,例如所述终端设备7上配备的插接式硬盘,智能存储卡(Smart Media Card,SMC),安全数字(Secure Digital,SD)卡,闪存卡(Flash Card)等。进一步地,所述存储器71还可以既包括所述终端设备7的内部存储单元也包括外部存储设备。所述存储器71用于存储所述计算机可读指令以及所述终端设 备所需的其他程序和数据。所述存储器71还可以用于暂时地存储已经输出或者将要输出的数据。The memory 71 may be an internal storage unit of the terminal device 7, such as a hard disk or a memory of the terminal device 7. The memory 71 may also be an external storage device of the terminal device 7, for example, a plug-in hard disk provided on the terminal device 7, a smart memory card (SMC), and a secure digital (SD). Card, flash card, etc. Further, the memory 71 may also include both an internal storage unit of the terminal device 7 and an external storage device. The memory 71 is for storing the computer readable instructions and other programs and data required by the terminal device. The memory 71 can also be used to temporarily store data that has been output or is about to be output.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above integrated unit can be implemented in the form of hardware or in the form of a software functional unit.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。The integrated unit, if implemented in the form of a software functional unit and sold or used as a standalone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present application, in essence or the contribution to the prior art, or all or part of the technical solution may be embodied in the form of a software product stored in a storage medium. A number of instructions are included to cause a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in various embodiments of the present application. The foregoing storage medium includes: a U disk, a mobile hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disk, and the like, which can store program codes. .
以上所述,以上实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围。The above embodiments are only used to explain the technical solutions of the present application, and are not limited thereto; although the present application has been described in detail with reference to the foregoing embodiments, those skilled in the art should understand that they can still The technical solutions described in the embodiments are modified, or the equivalents of the technical features are replaced by the equivalents. The modifications and substitutions of the embodiments do not depart from the spirit and scope of the technical solutions of the embodiments of the present application.

Claims (20)

  1. 一种基于回归树模型的血液分析方法,其特征在于,包括:A blood analysis method based on a regression tree model, comprising:
    获取多个血液信息样本,每个所述血液信息样本包括血液数值和症状特征值;Obtaining a plurality of blood information samples, each of the blood information samples including a blood value and a symptom characteristic value;
    将所述多个血液信息样本与回归树模型进行拟合,并将拟合完成的所述回归树模型作为检测模型;Comparing the plurality of blood information samples with a regression tree model, and fitting the completed regression tree model as a detection model;
    获取待测血液的血液数值,并将所述待测血液的血液数值输入至所述检测模型,得到检测值;Obtaining a blood value of the blood to be tested, and inputting the blood value of the blood to be tested into the detection model to obtain a detection value;
    若所述检测值大于检测阈值,则输出第一告警提示。If the detected value is greater than the detection threshold, the first alarm prompt is output.
  2. 如权利要求1所述的血液分析方法,其特征在于,所述将所述多个血液信息样本与回归树模型进行拟合,并将拟合完成的所述回归树模型作为检测模型,包括:The blood analysis method according to claim 1, wherein the fitting the plurality of blood information samples to a regression tree model, and fitting the completed regression tree model as a detection model comprises:
    将所述多个血液信息样本输入至所述回归树模型,以训练所述回归树模型,其中,将所述血液信息样本中的血液数值作为所述回归树模型的输入参数,将所述血液信息样本中的症状特征值作为所述回归树模型的标签参数;Inputting the plurality of blood information samples to the regression tree model to train the regression tree model, wherein the blood value in the blood information sample is used as an input parameter of the regression tree model, the blood is The symptom feature value in the information sample is used as a tag parameter of the regression tree model;
    将训练完成的所述回归树模型输出为所述检测模型。The trained regression tree model is output as the detection model.
  3. 如权利要求1所述的血液分析方法,其特征在于,所述若所述检测值大于检测阈值,则输出第一告警提示之前,还包括:The blood analysis method according to claim 1, wherein if the detection value is greater than the detection threshold, before the outputting the first alarm prompt, the method further includes:
    将所述多个血液信息样本的血液数值输入至所述检测模型,并获取所述检测模型输出的多个输出数值;And inputting blood values of the plurality of blood information samples to the detection model, and acquiring a plurality of output values output by the detection model;
    对所述多个输出数值进行排序,生成数值序列;Sorting the plurality of output values to generate a sequence of values;
    将所述数值序列中位于预设位置的输出数值作为所述检测阈值。An output value at the preset position in the numerical sequence is used as the detection threshold.
  4. 如权利要求1所述的血液分析方法,其特征在于,所述症状特征值为第一特征值或第二特征值,所述若所述检测值大于检测阈值,则输出第一告警提示之前,还包括:The blood analysis method according to claim 1, wherein the symptom characteristic value is a first characteristic value or a second characteristic value, and if the detection value is greater than a detection threshold, before the first alarm prompt is output, Also includes:
    根据所述多个血液信息样本确定所述症状特征值为所述第一特征值的第一样本个数以及所述症状特征值为所述第二特征值的第二样本个数;Determining, according to the plurality of blood information samples, the first sample number of the symptom characteristic value of the first feature value and the second sample number of the symptom feature value of the second feature value;
    根据所述第一样本人数和所述第二样本人数确定第一占比比例和第二占比比例,并根据所述第一占比比例、所述第二占比比例、所述第一特征值以及所述第二特征值确定所述检测阈值。Determining, according to the first sample number of people and the second sample number of people, a first proportion ratio and a second proportion ratio, and according to the first ratio ratio, the second proportion ratio, the first The feature value and the second feature value determine the detection threshold.
  5. 如权利要求1所述的血液分析方法,其特征在于,还包括:The blood analysis method according to claim 1, further comprising:
    获取预设的多个血液阈值模板,所述血液阈值模板由告警血液的血液数值确定;Obtaining a preset plurality of blood threshold templates, wherein the blood threshold template is determined by a blood value of the alarm blood;
    将所述待测血液的血液数值与所述多个血液阈值模板进行一一比对;Comparing the blood value of the blood to be tested and the plurality of blood threshold templates one by one;
    若所述待测血液的血液数值与所述多个血液阈值模板的某一个比对成功,则输出第二告警提示。And if the blood value of the blood to be tested is successfully aligned with one of the plurality of blood threshold templates, the second alarm prompt is output.
  6. 一种基于回归树模型的血液分析装置,其特征在于,包括:A blood analysis device based on a regression tree model, comprising:
    第一获取单元,用于获取多个血液信息样本,每个所述血液信息样本包括血液数值和症状特征值;a first acquiring unit, configured to acquire a plurality of blood information samples, each of the blood information samples including a blood value and a symptom characteristic value;
    拟合单元,用于将所述多个血液信息样本与回归树模型进行拟合,并将拟合完成的所述回归树模型作为检测模型;a fitting unit, configured to fit the plurality of blood information samples with a regression tree model, and use the regression tree model that is completed as a detection model;
    第二获取单元,用于获取待测血液的血液数值,并将所述待测血液的血液数值输入至所述检测模型,得到检测值;a second acquiring unit, configured to acquire a blood value of the blood to be tested, and input the blood value of the blood to be tested into the detection model to obtain a detection value;
    输出单元,用于若所述检测值大于检测阈值,则输出第一告警提示。And an output unit, configured to output a first alarm prompt if the detected value is greater than a detection threshold.
  7. 如权利要求6所述的血液分析装置,其特征在于,所述拟合单元,包括:The blood analysis device according to claim 6, wherein the fitting unit comprises:
    输入单元,用于将所述多个血液信息样本输入至所述回归树模型,以训练所述回归树模型,其中,将所述血液信息样本中的血液数值作为所述回归树模型的输入向量,将所述血液信息样本中的症状特征值作为所述回归树模型的标签向量;An input unit, configured to input the plurality of blood information samples to the regression tree model to train the regression tree model, wherein a blood value in the blood information sample is used as an input vector of the regression tree model And using the symptom feature value in the blood information sample as a label vector of the regression tree model;
    模型输出单元,用于将训练完成的所述回归树模型输出为所述检测模型。And a model output unit, configured to output the trained regression tree model as the detection model.
  8. 如权利要求6所述的血液分析装置,其特征在于,所述输出单元,还包括:The blood analysis device according to claim 6, wherein the output unit further comprises:
    样本输入单元,用于将所述多个血液信息样本的血液数值输入至所述检测模型,并获取所述检测模型输出的多个输出数值;a sample input unit, configured to input blood values of the plurality of blood information samples to the detection model, and acquire a plurality of output values output by the detection model;
    排序单元,用于对所述多个输出数值进行排序,生成数值序列;a sorting unit, configured to sort the plurality of output values to generate a sequence of values;
    阈值输出单元,用于将所述数值序列中位于预设位置的输出数值作为所述检测阈值。And a threshold output unit configured to use an output value of the numerical sequence at a preset position as the detection threshold.
  9. 如权利要求6所述的血液分析装置,其特征在于,所述症状特征值为第一特征值或第二特征值,所述输出单元,还包括:The blood analysis device according to claim 6, wherein the symptom characteristic value is a first feature value or a second feature value, and the output unit further comprises:
    第一确定单元,用于根据所述多个血液信息样本确定所述症状特征值为所述第一特征值的第一样本个数以及所述症状特征值为所述第二特征值的第二样本个数;a first determining unit, configured to determine, according to the plurality of blood information samples, that the symptom feature value is the first sample number of the first feature value and the symptom feature value is the second feature value Two sample numbers;
    第二确定单元,用于根据所述第一样本人数和所述第二样本人数确定第一占比比例和第二占比比例,并根据所述第一占比比例、所述第二占比比例、所述第一特征值以及所述第二特征值确定所述检测阈值。a second determining unit, configured to determine, according to the first sample number of people and the second sample number of people, a first proportion ratio and a second proportion ratio, and according to the first proportion ratio, the second proportion The ratio, the first eigenvalue, and the second eigenvalue determine the detection threshold.
  10. 如权利要求6所述的血液分析装置,其特征在于,还包括:The blood analysis device according to claim 6, further comprising:
    模板获取单元,用于获取预设的多个血液阈值模板,所述血液阈值模板由告警血液的血液数值确定;a template acquiring unit, configured to acquire a preset plurality of blood threshold templates, wherein the blood threshold template is determined by a blood value of the alarm blood;
    比对单元,用于将所述待测血液的血液数值与所述多个血液阈值模板进行一一比对;An aligning unit, configured to compare the blood value of the blood to be tested and the plurality of blood threshold templates one by one;
    告警输出单元,用于若所述待测血液的血液数值与所述多个血液阈值模板的某一个比对成功,则输出第二告警提示。The alarm output unit is configured to output a second alarm prompt if the blood value of the blood to be tested is successfully matched with one of the plurality of blood threshold templates.
  11. 一种终端设备,其特征在于,包括存储器以及处理器,所述存储器中存储有可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现如下步骤:A terminal device, comprising: a memory and a processor, wherein the memory stores computer readable instructions executable on the processor, and the processor implements the following steps when the computer readable instructions are executed :
    获取多个血液信息样本,每个所述血液信息样本包括血液数值和症状特征值;Obtaining a plurality of blood information samples, each of the blood information samples including a blood value and a symptom characteristic value;
    将所述多个血液信息样本与回归树模型进行拟合,并将拟合完成的所述回归树模型作为检测模型;Comparing the plurality of blood information samples with a regression tree model, and fitting the completed regression tree model as a detection model;
    获取待测血液的血液数值,并将所述待测血液的血液数值输入至所述检测模型,得到检测值;Obtaining a blood value of the blood to be tested, and inputting the blood value of the blood to be tested into the detection model to obtain a detection value;
    若所述检测值大于检测阈值,则输出第一告警提示。If the detected value is greater than the detection threshold, the first alarm prompt is output.
  12. 根据权利要求11所述的终端设备,其特征在于,所述将所述多个血液信息样本与回归树模型进行拟合,并将拟合完成的所述回归树模型作为检测模型,包括:The terminal device according to claim 11, wherein the fitting the plurality of blood information samples with a regression tree model, and fitting the completed regression tree model as a detection model comprises:
    将所述多个血液信息样本输入至所述回归树模型,以训练所述回归树模型,其中,将所述血液信息样本中的血液数值作为所述回归树模型的输入参数,将所述血液信息样本中的症状特征值作为所述回归树模型的标签参数;Inputting the plurality of blood information samples to the regression tree model to train the regression tree model, wherein the blood value in the blood information sample is used as an input parameter of the regression tree model, the blood is The symptom feature value in the information sample is used as a tag parameter of the regression tree model;
    将训练完成的所述回归树模型输出为所述检测模型。The trained regression tree model is output as the detection model.
  13. 根据权利要求11所述的终端设备,其特征在于,所述若所述检测值大于检测阈值,则输出第一告警提示之前,还包括:The terminal device according to claim 11, wherein if the detection value is greater than the detection threshold, before the outputting the first alarm prompt, the method further includes:
    将所述多个血液信息样本的血液数值输入至所述检测模型,并获取所述检测模型输出的多个输出数值;And inputting blood values of the plurality of blood information samples to the detection model, and acquiring a plurality of output values output by the detection model;
    对所述多个输出数值进行排序,生成数值序列;Sorting the plurality of output values to generate a sequence of values;
    将所述数值序列中位于预设位置的输出数值作为所述检测阈值。An output value at the preset position in the numerical sequence is used as the detection threshold.
  14. 根据权利要求11所述的终端设备,其特征在于,所述症状特征值为第一特征值或第二特征值,所述若所述检测值大于检测阈值,则输出第一告警提示之前,还包括:The terminal device according to claim 11, wherein the symptom characteristic value is a first feature value or a second feature value, and if the detection value is greater than a detection threshold, before the first alarm prompt is output, include:
    根据所述多个血液信息样本确定所述症状特征值为所述第一特征值的第一样本个数以及所述症状特征值为所述第二特征值的第二样本个数;Determining, according to the plurality of blood information samples, the first sample number of the symptom characteristic value of the first feature value and the second sample number of the symptom feature value of the second feature value;
    根据所述第一样本人数和所述第二样本人数确定第一占比比例和第二占比比例,并根据所述第一占比比例、所述第二占比比例、所述第一特征值以及所述第二特征值确定所述检测阈值。Determining, according to the first sample number of people and the second sample number of people, a first proportion ratio and a second proportion ratio, and according to the first ratio ratio, the second proportion ratio, the first The feature value and the second feature value determine the detection threshold.
  15. 根据权利要求11所述的终端设备,其特征在于,还包括:The terminal device according to claim 11, further comprising:
    获取预设的多个血液阈值模板,所述血液阈值模板由告警血液的血液数值确定;Obtaining a preset plurality of blood threshold templates, wherein the blood threshold template is determined by a blood value of the alarm blood;
    将所述待测血液的血液数值与所述多个血液阈值模板进行一一比对;Comparing the blood value of the blood to be tested and the plurality of blood threshold templates one by one;
    若所述待测血液的血液数值与所述多个血液阈值模板的某一个比对成功,则输出第二告警提示。And if the blood value of the blood to be tested is successfully aligned with one of the plurality of blood threshold templates, the second alarm prompt is output.
  16. 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机可读指令,其特征在于,所述计算机可读指令被至少一个处理器执行时实现如下步骤:A computer readable storage medium storing computer readable instructions, wherein the computer readable instructions, when executed by at least one processor, implement the following steps:
    获取多个血液信息样本,每个所述血液信息样本包括血液数值和症状特征值;Obtaining a plurality of blood information samples, each of the blood information samples including a blood value and a symptom characteristic value;
    将所述多个血液信息样本与回归树模型进行拟合,并将拟合完成的所述回归树模型作为检测模型;Comparing the plurality of blood information samples with a regression tree model, and fitting the completed regression tree model as a detection model;
    获取待测血液的血液数值,并将所述待测血液的血液数值输入至所述检测模型,得到检测值;Obtaining a blood value of the blood to be tested, and inputting the blood value of the blood to be tested into the detection model to obtain a detection value;
    若所述检测值大于检测阈值,则输出第一告警提示。If the detected value is greater than the detection threshold, the first alarm prompt is output.
  17. 根据权利要求16所述的计算机可读存储介质,其特征在于,所述计算机可读指令被至少一个处理器执行时实现如下步骤:The computer readable storage medium of claim 16, wherein the computer readable instructions are executed by at least one processor to implement the following steps:
    将所述多个血液信息样本输入至所述回归树模型,以训练所述回归树模型,其中,将所述血液信息样本中的血液数值作为所述回归树模型的输入参数,将所述血液信息样本中的症状特征值作为所述回归树模型的标签参数;Inputting the plurality of blood information samples to the regression tree model to train the regression tree model, wherein the blood value in the blood information sample is used as an input parameter of the regression tree model, the blood is The symptom feature value in the information sample is used as a tag parameter of the regression tree model;
    将训练完成的所述回归树模型输出为所述检测模型。The trained regression tree model is output as the detection model.
  18. 根据权利要求16所述的计算机可读存储介质,其特征在于,所述计算机可读指令被至少一个处理器执行时实现如下步骤:The computer readable storage medium of claim 16, wherein the computer readable instructions are executed by at least one processor to implement the following steps:
    将所述多个血液信息样本的血液数值输入至所述检测模型,并获取所述检测模型输出的多个输出数值;And inputting blood values of the plurality of blood information samples to the detection model, and acquiring a plurality of output values output by the detection model;
    对所述多个输出数值进行排序,生成数值序列;Sorting the plurality of output values to generate a sequence of values;
    将所述数值序列中位于预设位置的输出数值作为所述检测阈值。An output value at the preset position in the numerical sequence is used as the detection threshold.
  19. 根据权利要求16所述的计算机可读存储介质,其特征在于,所述症状特征值为第一特征值或第二特征值,所述计算机可读指令被至少一个处理器执行时还实现如下步骤:The computer readable storage medium according to claim 16, wherein the symptom characteristic value is a first feature value or a second feature value, and the computer readable instructions are further executed as follows when executed by at least one processor :
    根据所述多个血液信息样本确定所述症状特征值为所述第一特征值的第一样本个数以及所述症状特征值为所述第二特征值的第二样本个数;Determining, according to the plurality of blood information samples, the first sample number of the symptom characteristic value of the first feature value and the second sample number of the symptom feature value of the second feature value;
    根据所述第一样本人数和所述第二样本人数确定第一占比比例和第二占比比例,并根据所述第一占比比例、所述第二占比比例、所述第一特征值以及所述第二特征值确定所述检测阈值。Determining, according to the first sample number of people and the second sample number of people, a first proportion ratio and a second proportion ratio, and according to the first ratio ratio, the second proportion ratio, the first The feature value and the second feature value determine the detection threshold.
  20. 根据权利要求16所述的计算机可读存储介质,其特征在于,所述计算机可读指令被至少一个处理器执行时还实现如下步骤:The computer readable storage medium of claim 16, wherein the computer readable instructions are further executed by the at least one processor to:
    获取预设的多个血液阈值模板,所述血液阈值模板由告警血液的血液数值确定;Obtaining a preset plurality of blood threshold templates, wherein the blood threshold template is determined by a blood value of the alarm blood;
    将所述待测血液的血液数值与所述多个血液阈值模板进行一一比对;Comparing the blood value of the blood to be tested and the plurality of blood threshold templates one by one;
    若所述待测血液的血液数值与所述多个血液阈值模板的某一个比对成功,则输出第二告警提示。And if the blood value of the blood to be tested is successfully aligned with one of the plurality of blood threshold templates, the second alarm prompt is output.
PCT/CN2018/097562 2018-05-14 2018-07-27 Regression tree model-based blood analysis method and apparatus, terminal device and readable storage medium WO2019218483A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810457009.3A CN108735295B (en) 2018-05-14 2018-05-14 Blood analysis method and terminal equipment based on regression tree model
CN201810457009.3 2018-05-14

Publications (1)

Publication Number Publication Date
WO2019218483A1 true WO2019218483A1 (en) 2019-11-21

Family

ID=63937452

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/097562 WO2019218483A1 (en) 2018-05-14 2018-07-27 Regression tree model-based blood analysis method and apparatus, terminal device and readable storage medium

Country Status (2)

Country Link
CN (1) CN108735295B (en)
WO (1) WO2019218483A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101923648A (en) * 2009-06-15 2010-12-22 深圳迈瑞生物医疗电子股份有限公司 Clustering method and device for support vector machine
CN104584017A (en) * 2012-08-16 2015-04-29 金格输入输出有限公司 Method for modeling behavior and health changes
CN104715129A (en) * 2013-12-13 2015-06-17 中国科学院心理研究所 Mental health state assessment system and method based on mobile equipment using behavior
CN106934223A (en) * 2017-02-28 2017-07-07 思派(北京)网络科技有限公司 A kind of blood disease intelligent classification system and method based on big data

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9700219B2 (en) * 2013-10-17 2017-07-11 Siemens Healthcare Gmbh Method and system for machine learning based assessment of fractional flow reserve
CN105550507A (en) * 2015-12-09 2016-05-04 上海斐讯数据通信技术有限公司 Intelligent terminal for measuring blood type and measuring method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101923648A (en) * 2009-06-15 2010-12-22 深圳迈瑞生物医疗电子股份有限公司 Clustering method and device for support vector machine
CN104584017A (en) * 2012-08-16 2015-04-29 金格输入输出有限公司 Method for modeling behavior and health changes
CN104715129A (en) * 2013-12-13 2015-06-17 中国科学院心理研究所 Mental health state assessment system and method based on mobile equipment using behavior
CN106934223A (en) * 2017-02-28 2017-07-07 思派(北京)网络科技有限公司 A kind of blood disease intelligent classification system and method based on big data

Also Published As

Publication number Publication date
CN108735295A (en) 2018-11-02
CN108735295B (en) 2024-01-26

Similar Documents

Publication Publication Date Title
CN108091372B (en) Medical field mapping verification method and device
WO2021008328A1 (en) Image processing method and device, terminal, and storage medium
CN107633265B (en) Data processing method and device for optimizing credit evaluation model
US20230015054A1 (en) Text classification method, electronic device and computer-readable storage medium
CN108062556B (en) Drug-disease relationship identification method, system and device
WO2022121163A1 (en) User behavior tendency identification method, apparatus, and device, and storage medium
CN110890137A (en) Modeling method, device and application of compound toxicity prediction model
CN111177356B (en) Acid-base index medical big data analysis method and system
Golugula et al. Evaluating feature selection strategies for high dimensional, small sample size datasets
Shrestha et al. Supervised machine learning for early predicting the sepsis patient: modified mean imputation and modified chi-square feature selection
US11049409B1 (en) Systems and methods for treatment of aberrant responses
CN113674862A (en) Acute renal function injury onset prediction method based on machine learning
CN103473416A (en) Protein-protein interaction model building method and device
CN116864011A (en) Colorectal cancer molecular marker identification method and system based on multiple sets of chemical data
WO2023148733A1 (en) Method and system for predicting drug-drug interactions
WO2019218482A1 (en) Big data-based population screening method and apparatus, terminal device and readable storage medium
CN111161884A (en) Disease prediction method, device, equipment and medium for unbalanced data
US20200279148A1 (en) Material structure analysis method and material structure analyzer
WO2019218483A1 (en) Regression tree model-based blood analysis method and apparatus, terminal device and readable storage medium
Sameki et al. ICORD: Intelligent Collection of Redundant Data-A Dynamic System for Crowdsourcing Cell Segmentations Accurately and Efficiently.
CN113392086B (en) Medical database construction method, device and equipment based on Internet of things
Muller et al. Outcome prediction with serial neuron-specific enolase and machine learning in anoxic-ischaemic disorders of consciousness
CN110826616A (en) Information processing method and device, electronic equipment and storage medium
CN113724779B (en) SNAREs protein identification method, system, storage medium and equipment based on machine learning technology
CN112651397B (en) Inspection sheet classification method, apparatus, computer device, and storage medium

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205N DATED 20/01/2021)

122 Ep: pct application non-entry in european phase

Ref document number: 18918566

Country of ref document: EP

Kind code of ref document: A1